Rethink the measure of generalization error #16

Open
psychelzh opened this issue Sep 25, 2024 · 1 comment
Labels: help wanted ❤️ we'd love your help!

psychelzh (Owner) commented on Sep 25, 2024

Currently, the measure of generalization error used in summary() is the correlation between the pooled predictions and the real values. But sklearn warns against doing so:

Note on inappropriate usage of cross_val_predict

The result of cross_val_predict may be different from those obtained using cross_val_score as the elements are grouped in different ways. The function cross_val_score takes an average over cross-validation folds, whereas cross_val_predict simply returns the labels (or probabilities) from several distinct models undistinguished. Thus, cross_val_predict is not an appropriate measure of generalization error.

A sounder method is to calculate the generalization error separately for each fold and then average the fold-wise values. Pearson correlations, however, might need special treatment, since they should not be averaged directly; see the sketch below.
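As a rough illustration (not this package's actual implementation), here is a minimal Python sketch of the fold-wise approach using sklearn. The data and model are hypothetical stand-ins, and the Fisher z-transform is one common way to average Pearson correlations, offered here as an assumption rather than something the original paper prescribes:

```python
import numpy as np
from scipy.stats import pearsonr
from sklearn.datasets import make_regression
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import KFold

# Hypothetical stand-ins for whatever data and model summary() evaluates.
X, y = make_regression(n_samples=200, n_features=5, noise=10.0, random_state=0)
model = LinearRegression()

# Score each fold separately instead of pooling predictions across folds,
# which is what the sklearn note recommends against.
fold_rs = []
for train_idx, test_idx in KFold(n_splits=5, shuffle=True, random_state=0).split(X):
    model.fit(X[train_idx], y[train_idx])
    pred = model.predict(X[test_idx])
    fold_rs.append(pearsonr(y[test_idx], pred)[0])

# Pearson r values are not simply additive, so averaging them directly is
# biased; a common remedy is averaging in Fisher z-space and back-transforming.
r_avg = np.tanh(np.mean(np.arctanh(fold_rs)))
print(f"per-fold r: {np.round(fold_rs, 3)}; Fisher-z averaged r: {r_avg:.3f}")
```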

psychelzh added the help wanted ❤️ we'd love your help! label on Sep 25, 2024
psychelzh (Owner, Author) commented on Sep 25, 2024

Of course, the current method just follows the original paper, so we will leave this issue open for now; it might not be appropriate to implement another measure yet.
