New configuration to specify qp.Ensemble parameterization in `CatEstimator` stages #28

drewoldag · 2023-07-27T19:02:20Z

Currently, almost all subclasses of rail_base.estimator.CatEstimator store their resulting qp.Ensembles using a qp.interp gridded representation. We should add a new configuration parameter that lets users select which qp representation they prefer (e.g., qp.hist, qp.spline, qp.packed_interp).

The work here is similar to issue #11 in that the work in this repository (rail_base) is relatively small, but the work to respect the new configuration parameter in all of the subclasses of CatEstimator will be substantial.

Note also that several Jupyter notebooks will likely need updates, although we do not currently have an exhaustive list of which notebooks will be affected.


eacharles · 2024-05-16T01:41:06Z

So, a lot of the estimators have native representations of ensembles. How would you propose to handle this in those cases?

aimalz · 2024-05-20T22:21:49Z

In those cases, the default value of the configuration parameter for that stage would just be the (known, for that stage) native parameterization, no?

eacharles · 2024-06-04T16:34:23Z

A couple of thoughts.

I think we should do this in a way that touches only the base class code, not any of the subclasses, as changing those would be rather disruptive. This is going to be kind of tricky because we don't just write the ensemble at the end; rather, we allocate the memory at the beginning of run() and then fill it in from the parallel processes. I.e., we will have to modify the _run() and _do_chunk_output() methods to do this.
I think a better solution than requiring parameters for the output representation would be to use parameters that default to None but that allow you to force the qp representation to a particular type.

The function
qp.factory.convert(in_dist, class_name, **kwds)
used as
new_ensemble = qp.factory.convert(orig_ensemble, self.config.qp_output_classname, **self.config.qp_output_class_pars)
or

qp.Ensemble.convert_to(self, to_class, **kwargs)
used as
new_ensemble = orig_ensemble.convert_to(qp.factory.stats[self.config.qp_output_classname], **self.config.qp_output_class_pars)

Would allow you to convert from one representation to another.

So, this could be something like:

if self.config.qp_output_classname is not None:
    new_ensemble = orig_ensemble.convert_to(qp.factory.stats[self.config.qp_output_classname], **self.config.qp_output_class_pars)
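
To make the "default to None, convert only when asked" idea concrete, here is a minimal sketch of the guard logic on its own. The helper name `maybe_convert` and the stand-in `fake_convert` are hypothetical and exist only for illustration. The converter is passed in as an argument so that the sketch does not depend on qp being installed. In a real stage it would presumably be `qp.factory.convert`, with the `(in_dist, class_name, **kwds)` signature quoted above.

```python
from typing import Any, Callable, Mapping, Optional


def maybe_convert(
    ensemble: Any,
    classname: Optional[str],
    class_pars: Optional[Mapping[str, Any]],
    converter: Callable[..., Any],
) -> Any:
    """Convert only when a target class name was configured (hypothetical helper)."""
    if classname is None:
        # Default: keep whatever representation the estimator produced natively.
        return ensemble
    # Treat missing parameters as "no extra keyword arguments".
    return converter(ensemble, classname, **dict(class_pars or {}))


def fake_convert(in_dist: Any, class_name: str, **kwds: Any) -> dict:
    """Stand-in converter for this demo: records the request, builds nothing."""
    return {"source": in_dist, "class_name": class_name, "pars": kwds}


native = "native-ensemble"  # placeholder for a real ensemble object

# No class name configured: the input comes back untouched.
unchanged = maybe_convert(native, None, None, fake_convert)

# Class name configured: the converter is called with the extra parameters.
converted = maybe_convert(native, "hist", {"bins": [0.0, 1.0, 2.0]}, fake_convert)
```

Keeping the guard in one small function fits the goal of touching only the base class: subclasses never see the new parameters, and a stage whose configuration leaves the class name as None behaves exactly as before. The sketch does not address the harder part noted above, namely that the output is allocated up front and filled in chunk by chunk.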

aimalz added the enhancement label (New feature or request) Jul 27, 2023

aimalz assigned drewoldag Aug 4, 2023

drewoldag removed their assignment Apr 26, 2024
