[BUG] Fix residual scale estimation in DoubleResidual #503

meh2135 · 2024-12-06T20:09:15Z

Reference Issues/PRs

Fixes #492

What does this implement/fix? Explain your changes.

Changes to ResidualDouble:

Fixes the method of moments estimation of scale parameters for all supported distributions with finite moments where residual_trafo is absolute or squared
Adds warnings when residual_trafo is an arbitrary transform
Adds warnings for cauchy and t with df <=2 (no first moment, and no second moment respectively)
Adds sample weight support

Does your contribution introduce a new dependency? If yes, which one?

no

What should a reviewer concentrate their feedback on?

My added test
Added warnings
anything around sample_weights

Did you add any tests for the change?

Added tests to confirm uniform quantiles for held out data when the model is correctly specified.

…mates.

…Double

…dd a warning for t dist df<3 trafo=squared, where we observe poor performance.

…n base and residual.

…sidualdouble

fkiraly

Nice!

Would you like a review right now, or is this still a work-in-progress draft?

Some comments:

we should not change the core interface in the same pull request as changes to estimators. Your idea of adding a sample_weight arg is great, but we should deal with this separately, to avoid interaction between the changes!
changes to the residual estimator look good and sensible. I will have to sit down and work out the math to check, but superficially fine.

fkiraly · 2024-12-10T14:53:25Z

skpro/regression/base/_base.py

@@ -72,7 +72,7 @@ def __rmul__(self, other):
        else:
            return NotImplemented

-    def fit(self, X, y, C=None):
+    def fit(self, X, y, C=None, sample_weight=None):


as said, good idea, but should be in a separate PR. Could you also open an issue to track adding sample weights?

What we should do (please copy this recommendation in an issue):

add a tag capability:sample_weight that tells the user whether the weights are used non-trivially

add tests in TestAllRegressors that actually passes sample weights, to see that nothing breaks. In particular, nothing should break for any estimator when passed, the current boilerplate would lead to most estimators breaking, as the argument is passed on to _fit!

in particular, the boilerplate should check the tag, and pass sample_weight on to _fit only if the tag has value True.

in the tests, we should check that _fit does have the arg sample_weight if the tag is set True.

meh2135 added 14 commits November 13, 2024 14:35

Implemented a simple sqrt for the scale of residual in residualdouble.

3acc3d6

method of moments correctiosn for ResidualDouble scale parameter esti…

b1c2a42

…mates.

Fixed a bug in scale normalization for laplace in ResidualDouble.

8d581d2

added sample weight to fit method for BaseProbaRegressor and Residual…

f4f73bf

…Double

in ResidualDouble, copy dist_params to avoid mutating argument, and a…

2badc43

…dd a warning for t dist df<3 trafo=squared, where we observe poor performance.

Added validity test for ResidualDouble. Fixed formatting of regressio…

818ed5c

…n base and residual.

added sample weights for ResidualDouble predictcv

97893fc

Added Mike Hankin to contributors

c83719e

fixed sample weight docstring comment.

42f4e86

reincorporated cauchy and arbitrary transforms into ResidualDouble.

260735d

updated contributors

996c407

More relevant warnings around t distribution degrees of freedom in re…

e5655e3

…sidualdouble

moved parameter copy to inside predict_proba to fix breaking tests.

282bde9

Merge remote-tracking branch 'upstream/main' into fix-resid-double-sq

db7df6f

fkiraly reviewed Dec 10, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] Fix residual scale estimation in DoubleResidual #503

[BUG] Fix residual scale estimation in DoubleResidual #503

meh2135 commented Dec 6, 2024 •

edited

Loading

fkiraly left a comment

fkiraly Dec 10, 2024 •

edited

Loading

[BUG] Fix residual scale estimation in DoubleResidual #503

Are you sure you want to change the base?

[BUG] Fix residual scale estimation in DoubleResidual #503

Conversation

meh2135 commented Dec 6, 2024 • edited Loading

Reference Issues/PRs

What does this implement/fix? Explain your changes.

Does your contribution introduce a new dependency? If yes, which one?

What should a reviewer concentrate their feedback on?

Did you add any tests for the change?

fkiraly left a comment

Choose a reason for hiding this comment

fkiraly Dec 10, 2024 • edited Loading

Choose a reason for hiding this comment

meh2135 commented Dec 6, 2024 •

edited

Loading

fkiraly Dec 10, 2024 •

edited

Loading