Samples parameterization #33

aimalz · 2021-05-20T22:29:49Z

Currently we can make PDFs from samples via qp.spline_from_samples but, unless I'm missing something, there isn't a parameterization whose parameters are the sample values themselves rather than the spline parameters derived from a KDE thereof. This would be very helpful for things like the PIT metric used in RAIL, which is a 1D probability distribution defined by samples.

eacharles · 2021-07-13T16:50:57Z

I'm not actually sure what it means to have a PDF parametrization whose parameters are the sample values. How do you compute the pdf, cdf, or ppf from sample values? It seems to me that, given some sample values, you need to convert to some other representation, e.g., with qp.spline_from_samples.

Perhaps what you would like is a class that allows users to store sample values, and provides an easy interface to the conversion routines.

aimalz · 2021-07-20T19:35:13Z

Functionally, I agree that the methods would have to be implemented using the KDE as a default intermediary, so a class that initializes an ensemble from samples, outputs to samples, and connects to the conversion functions would indeed be useful.

eacharles · 2021-07-20T19:55:49Z

So, the KDE representation was really inefficient for large samples b/c it evaluated the PDF by doing an operation that involved all the samples, and also b/c there wasn't really a smart way to implement _cdf or _ppf. So what I did was to convert it to a spline: Spline_Gen.create_from_samples will create a PDF from samples. And of course you can generate samples from any ensemble using ens.rvs(). We could put in an explicit KDE that computes things using the samples, but we are gonna want to tell people not to use it for more than a few samples or a few PDFs, because it is really not performant.
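To make the cost argument concrete, here is a minimal stdlib-only sketch, not qp's implementation. It assumes a Gaussian kernel with a fixed bandwidth, and it swaps in linear interpolation on a grid where qp uses a spline. The function names (`kde_pdf`, `tabulate`, `interp_pdf`) are hypothetical and exist only for illustration.

```python
import bisect
import math


def kde_pdf(x, samples, bandwidth):
    """Direct Gaussian KDE: every evaluation loops over all the samples."""
    norm = 1.0 / (len(samples) * bandwidth * math.sqrt(2.0 * math.pi))
    return norm * sum(
        math.exp(-0.5 * ((x - s) / bandwidth) ** 2) for s in samples
    )


def tabulate(pdf, lo, hi, n_grid):
    """Pay the per-sample cost once by evaluating pdf on a fixed grid."""
    xs = [lo + (hi - lo) * i / (n_grid - 1) for i in range(n_grid)]
    ys = [pdf(x) for x in xs]
    return xs, ys


def interp_pdf(x, xs, ys):
    """Linear interpolation on the grid; cost no longer depends on N samples."""
    if x < xs[0] or x > xs[-1]:
        return 0.0
    # Index of the right-hand grid point bracketing x, clamped to the grid.
    i = min(bisect.bisect_right(xs, x), len(xs) - 1)
    x0, x1 = xs[i - 1], xs[i]
    t = (x - x0) / (x1 - x0)
    return ys[i - 1] + t * (ys[i] - ys[i - 1])


# Illustrative values only: three samples, bandwidth 0.1, 101 grid points.
samples = [0.4, 0.5, 0.6]
xs, ys = tabulate(lambda x: kde_pdf(x, samples, 0.1), 0.0, 1.0, 101)
approx = interp_pdf(0.503, xs, ys)
exact = kde_pdf(0.503, samples, 0.1)
```

Each `kde_pdf` call costs O(N) in the number of samples, while `interp_pdf` costs O(log G) in the number of grid points once the table is built. That is the trade the spline conversion makes.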

eacharles · 2021-07-20T20:00:46Z

It would probably be a better long term solution just to make a NB that shows how to invoke Spline_Gen.create_from_samples and maybe a function that does ensemble.write_samples() for any PDF.

eacharles · 2021-07-20T20:33:58Z

If for whatever reason you want something ensemblish that ties together the reading and writing of samples, I would actually consider using the newly minted ancillary data to do that. I.e., a spline_pdf that carries around the samples used to generate it.

aimalz · 2021-07-28T22:03:26Z

Re: KDE, I think the dominant use case would do it for lots of samples and lots of PDFs, so your concern about computation is a fair one. Perhaps the most natural thing to do is actually quantiles, where the N sample values {z} naturally define regular quantiles separated by 1/N, which could then be binned down upon conversion.
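A rough sketch of the quantile idea, in plain Python with no qp calls. It makes two assumptions that are not settled in this thread: the k-th sorted sample is assigned the level (k + 0.5)/N (one of several common conventions), and "binning down" is done by linear interpolation onto a coarser set of evenly spaced levels. The function names are hypothetical.

```python
import bisect


def empirical_quantiles(samples):
    """Sort N samples and assign regular quantile levels spaced by 1/N."""
    locs = sorted(samples)
    n = len(locs)
    # Assumed convention: the k-th sorted sample sits at level (k + 0.5) / N.
    levels = [(k + 0.5) / n for k in range(n)]
    return levels, locs


def downsample_quantiles(levels, locs, n_quants):
    """Reduce N (level, location) pairs to n_quants evenly spaced levels."""
    targets = [(j + 0.5) / n_quants for j in range(n_quants)]
    out = []
    for q in targets:
        i = bisect.bisect_left(levels, q)
        if i == 0:
            out.append(locs[0])  # below the lowest level: clamp
        elif i == len(levels):
            out.append(locs[-1])  # above the highest level: clamp
        else:
            # Linear interpolation between the two bracketing samples.
            q0, q1 = levels[i - 1], levels[i]
            t = (q - q0) / (q1 - q0)
            out.append(locs[i - 1] + t * (locs[i] - locs[i - 1]))
    return targets, out


# Illustrative input: 100 evenly spaced "samples", given in reverse order.
samples = [i / 100 for i in reversed(range(100))]
levels, locs = empirical_quantiles(samples)
coarse_levels, coarse_locs = downsample_quantiles(levels, locs, 10)
```

The full-resolution pair (`levels`, `locs`) stores exactly the sample values, so nothing is lost until the downsampling step, and that step costs O(n_quants log N) per PDF.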

aimalz · 2023-08-02T01:36:35Z

#170 is a duplicate of this but the fresher conversation makes it the more reasonable issue to keep open.

aimalz mentioned this issue May 25, 2021

Issue/4/evaluation rearrangement LSSTDESC/rail_attic#61

Merged

aimalz assigned eacharles May 25, 2021

eacharles added the question Further information is requested label Jul 13, 2021

aimalz added parameterization new/upgraded PDF parameterization need and removed question Further information is requested labels Dec 6, 2022

aimalz added the enhancement New feature or request label Jul 18, 2023

aimalz mentioned this issue Aug 2, 2023

Samples as a parameterization #170

Open

aimalz closed this as completed Aug 2, 2023
