Skip to content
View m-a-r-i-o's full-sized avatar
💭
most of my code is now on gitLAB: https://gitlab.com/mariomario
💭
most of my code is now on gitLAB: https://gitlab.com/mariomario

Block or report m-a-r-i-o

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
m-a-r-i-o/README.md

About me

I am a physicist by training, with a PhD in astrophysics. My main research interest for the past several years has been the application of machine learning techniques to science, particularly astronomy.

Interpretable machine learning for scientific research

In 2014 -back when I was at Yonsei University, Korea- I started taking part in Kaggle competitions and I was hooked. I realized that machine learning would forever change the way we do science, as much and more than the arrival of the ability to do computer simulations had already done compared to the days of pen-and-paper calculations. But would it do so for the better? Was it going to be a positive or negative development?

After securing two Marie Curie grants to apply machine learning to astronomy and publishing about twenty papers on the subject, I understood that the kind of tools we need in scientific research do not fully overlap with those that are being developed by and for industry. Cynthia Rudin famously argued that we should stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. But why stop at high stakes decisions? While many decisions we take in science are low stakes in that they do not directly affect people in the way the now infamous COMPAS software did, we still need transparent models for different reasons. The point of research is to understand what is going on, so a black-box prediction simply won't cut it. Explaining it a posteriori with XAI tools is like looking at one of those intuitive explanations of a theorem, where at the end you feel cheated of the confidence that comes from rigorously established results.

But how are you going to set up a fully interpretable classifier for a problem where the decision boundary is a fractal, like in some gravitational dynamics situations? That's the kind of questions I am pondering these days.

Causality and the social sciences

Thanks to my wife I had the opportunity to learn about many topics in statistics that are often overlooked by physicists. As Cosma Shalizi puts it statistical physicists do not learn any statistics. While things have definitely changed from 2006, we certainly still do not get to learn a bunch of beautifully clever and subtly controversial methods for teasing out causal information from observational data, such as regression discontinuity, instrumental variables, or matching. But economists do.

Of course experiments are better than quasi experiments if you can afford to run one, but we are not going to experimentally set off supernovas in the Collinder 135 star cluster anytime soon, so behold the first ever application of regression discontinuity to astronomy in Fig. 9. of this paper of mine.

Popular repositories Loading

  1. latentzehao latentzehao Public

    Analysis of GANomaly latent space for ZeHao

    Jupyter Notebook 1

  2. equipartition equipartition Public

    Forked from hwkim1226/equipartition

    equipartition project with NBODY6 (Yonsei, 2017)

    R

  3. potential potential Public

    Calculates gravitational potential and other quantities on a simulation snapshot

    R

  4. aRtistic aRtistic Public

    R

  5. rk4laneemden rk4laneemden Public

    Solver for the Lane-Emden equation for Yonsei computational astronomy course

    Python

  6. LaboratoryOfComputationalPhysics LaboratoryOfComputationalPhysics Public

    Forked from mzanetti79/LaboratoryOfComputationalPhysics

    Jupyter Notebook