
A recommendation from Machine learning safety measures:

Build software to support good practice. Many of the problems I’m talking about are quite easy to catch, or at least warn about, during the training and evaluation process. Unscaled features, class imbalance, correlated features, non-IID records, and so on. Education is essential, but software can help us notice and act on them.
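
For instance, such a check might be nothing more than a function that inspects X and y before training and emits a warning. A minimal sketch for two of the problems mentioned above (unscaled features and class imbalance), assuming NumPy arrays; the function names and thresholds are placeholders of mine, not from any existing library:

```python
# Minimal sketch of pre-training checks; names and thresholds are placeholders.
import warnings
import numpy as np

def check_feature_scaling(X, tol=10.0):
    """Warn if feature scales differ wildly, suggesting unscaled inputs."""
    stds = np.nanstd(np.asarray(X, dtype=float), axis=0)
    stds = stds[stds > 0]
    if stds.size > 1 and stds.max() / stds.min() > tol:
        warnings.warn(f"Feature scales differ by more than {tol}x; consider standardizing.")

def check_class_balance(y, threshold=0.1):
    """Warn if the rarest class makes up less than `threshold` of the records."""
    _, counts = np.unique(y, return_counts=True)
    if len(counts) > 1 and counts.min() / counts.sum() < threshold:
        warnings.warn("Severe class imbalance; consider resampling or class weights.")
```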

Things that can go wrong, from Functional but unsafe machine learning:

  • Allowing information leakage across features or across records, resulting in erroneously high accuracy claims. For example, splitting related (e.g. nearby) records into the training and validation sets (see the group-aware split sketched after this list).
  • Not accounting for under-represented classes, so that predictions are biased towards over-represented ones. This kind of error was commonly seen in work on the McMurray Formation of Alberta, which is 80% pay.
  • Forgetting to standardize or normalize numerical inputs to a model in production, producing erroneous predictions. For example, training on gamma-ray Z-scores of roughly –3 to +3, then asking for a prediction for a value of 75.
  • Using cost functions that do not reflect the opinions of humans with expertise about ‘good’ vs ‘bad’ predictions.
  • Racial or gender bias in a human resource model, such as might be used for hiring or career mapping.
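
One way to guard against the leakage described in the first bullet is to split by group (e.g. by well or by location) rather than by record, so that related records never straddle the train/validation boundary. A sketch using scikit-learn's GroupShuffleSplit; the well-ID grouping and the synthetic data are hypothetical:

```python
# Sketch: a group-aware split so that records from the same group (e.g. the
# same well) never end up on both sides of the train/validation boundary.
import numpy as np
from sklearn.model_selection import GroupShuffleSplit

X = np.random.rand(1000, 5)             # hypothetical feature matrix
y = np.random.randint(0, 2, 1000)       # hypothetical labels
groups = np.repeat(np.arange(50), 20)   # e.g. 50 wells, 20 records per well

splitter = GroupShuffleSplit(n_splits=1, test_size=0.2, random_state=42)
train_idx, val_idx = next(splitter.split(X, y, groups=groups))

# No well appears in both sets, so nearby records cannot leak across the split.
assert set(groups[train_idx]).isdisjoint(groups[val_idx])
```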

From my initial planning notebook (a few of these checks are sketched in code after the list):

  • Class imbalance
  • Redundant column (all same value)
  • Monotonic column
  • Test for unimodal vs multimodal (find peaks in KDE of 1 feature)
  • Highly correlated features
  • Self-correlated features
  • Looks like noise (uncorrelated with all other columns, no structure, white)
  • Standardized features (not sure how to check this robustly; the mean and stdev might not be exactly 0 and 1)
  • Non-numerical feature (not allowed into sklearn)
  • Apparently categorical feature (or check type if dataframe)
  • Multioutput (>1 col in y) - constrains model types
  • Missing values
  • Significant outliers
  • Spikes / weird values or clumps of values
  • Clipped features (spike(s) at end(s) of histogram)
  • Otherwise weirdly shaped histogram
  • Non-normal distributions, esp power or exponential
  • Negative values in mostly positive data
  • Out-of-bounds values in mostly 0-1 or 0-100 data <-- Could be interesting
  • Confounding variable? (Can we tell if one is present? Not sure)
  • How much of the distribution is sampled for this dimensionality?
  • Check for record independence?
    • Check autocorrelation is ~0 for non-zero lags?
    • Check spectrum is ~white?
    • Compare train/test on slice vs shuffle (on any prediction... slice should not be dramatically different)
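
Several of these are easy to prototype. A rough sketch of the redundant-column, monotonic-column, clipping, and record-independence checks, assuming 1-D NumPy arrays; all thresholds are arbitrary placeholders:

```python
# Rough sketches of a few of the checks listed above; thresholds are placeholders.
import numpy as np

def is_constant(x):
    """Redundant column: every (non-NaN) value is the same."""
    return np.nanmin(x) == np.nanmax(x)

def is_monotonic(x):
    """Monotonic column: values only ever increase, or only ever decrease."""
    d = np.diff(x)
    return bool(np.all(d >= 0) or np.all(d <= 0))

def is_clipped(x, factor=3.0):
    """Clipped feature: a disproportionate spike in the first or last histogram bin."""
    counts, _ = np.histogram(x[~np.isnan(x)], bins=100)
    return bool(counts[0] > factor * counts[1] or counts[-1] > factor * counts[-2])

def looks_independent(x, max_lag=10, tol=0.1):
    """Record independence: autocorrelation should be ~0 at non-zero lags."""
    x = (x - np.mean(x)) / np.std(x)
    acf = np.correlate(x, x, mode='full') / len(x)
    centre = len(acf) // 2
    return bool(np.all(np.abs(acf[centre + 1:centre + 1 + max_lag]) < tol))
```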

If we get a train/test/val flag too, then (a distribution comparison is sketched after this list):

  • Non-stratified features wrt train/val/test
  • Different distributions of features wrt train/val/test
  • Different distributions of target / labels wrt train/val/test
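
The distribution comparisons could lean on a two-sample test. A sketch using SciPy's Kolmogorov-Smirnov test, applied feature by feature (the same idea works for the target); the significance level and the synthetic data are placeholders:

```python
# Sketch: flag features whose train and test distributions look different.
import numpy as np
from scipy.stats import ks_2samp

def same_distribution(train_values, test_values, alpha=0.05):
    """Two-sample KS test; False suggests the split is not representative."""
    statistic, p_value = ks_2samp(train_values, test_values)
    return p_value >= alpha

# Hypothetical example data standing in for real train/test feature matrices.
rng = np.random.default_rng(0)
X_train, X_test = rng.normal(size=(800, 3)), rng.normal(size=(200, 3))
suspect = [i for i in range(X_train.shape[1])
           if not same_distribution(X_train[:, i], X_test[:, i])]
```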
