Discussion regarding `errorVals`-`absoluteError`-`relativeError` #741

prisae · 2024-06-25T09:46:10Z

Data error computation

This is partly an issue (problematic default behaviour), and partly a discussion (error calculation).

Issue: Problematic default behaviour

In pygimli.frameworks.Inversion().run() and pygimli.frameworks.MarquardtInversion().run() the parameter errorVals is subtly deprecated in favour of absoluteError and relativeError.

If errorVals is not given, it is calculated via:

if errorVals is None:  # use absoluteError and/or relativeError instead
    absErr = kwargs.pop("absoluteError", 0)
    relErr = kwargs.pop("relativeError",
                        0.01 if np.allclose(absErr, 0) else 0)
    errorVals = pg.abs(absErr / np.asarray(dataVals)) + relErr

In some methods, such as marine CSEM, your data can have very small values, in the range of 1e-10 and less. This will result in np.allclose(absErr, 0) = True and hence give it a relErr = 0.01, which is useless (because the default atol of np.allclose is 1e-08).

What about either:

    relErr = kwargs.pop("relativeError",
                        0.01 if np.allclose(absErr, 0, atol=0) else 0)

or (my preference), not defaulting at all, but raise an error,

    relErr = kwargs.pop("relativeError", 0)
    if np.any(np.isclose(absErr + relErr, 0, atol=0)):
        raise Error that at least one of abs/rel error must be provided

Discussion: Error calculation

If the user provides absolute and/or relative errors, the error values are computed as follows

errorVals = pg.abs(absErr / np.asarray(dataVals)) + relErr

However, given error propagation, I think it should be

errorVals = np.sqrt(pg.abs(absErr / np.asarray(dataVals))**2 + relErr**2)

This will mostly not have a big impact at all, only in the zone where the data values approach the noise level.

The text was updated successfully, but these errors were encountered:

prisae · 2024-06-25T09:49:40Z

I thought I'd discuss both here before making a PR.

halbmy · 2024-07-01T06:45:46Z

Very good point that did not pop up before. Marine csem might indeed scale badly. Throwing an error is the better choice.

However, I disagree with the error propagation. The relative and absolute errors are not independent error sources, but simple models to estimate errors, e.g. from reciprocal analysis etc. which is common practise and simple of relative or absolute errors are considered are the main source (and small values to catch near zero data). A more rigorous error class could help supporting and analysing data and error statistics.

prisae · 2024-07-01T07:27:10Z

I'm OK to agree on disagreeing regarding noise. In this case, an error class might be indeed good, or at least not deprecating errorVals, so the user can provided directly the error values instead of absError and relError.

halbmy · 2024-07-08T18:30:28Z

Well, the reason for deprecating errorVals was because it is not clear whether it is an absolute (e.g. typical for traveltime tomography) or relative (e.g. typical for ERT as it is the same for u, r and rhoa), and to make way for a bit more rigorous handling (right now internally relative errors are used but this can lead to problems for near-zero data).

halbmy · 2024-09-24T06:31:01Z

I totally agree with your suggestions and implemented them (171e47e)

In order to keep things running, I accepted your first choice (atol=0) for the current (classic) inversion
for the future inversions based on InversionBase, I took your second (preferred) choice throwing an error if no error is given

prisae · 2024-09-27T10:48:29Z

Great!

halbmy self-assigned this Jul 1, 2024

halbmy pushed a commit that referenced this issue Sep 24, 2024

ENH: error estimation defaults for small numbers (#741)

171e47e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Discussion regarding `errorVals`-`absoluteError`-`relativeError` #741

Discussion regarding `errorVals`-`absoluteError`-`relativeError` #741

prisae commented Jun 25, 2024

prisae commented Jun 25, 2024

halbmy commented Jul 1, 2024

prisae commented Jul 1, 2024

halbmy commented Jul 8, 2024

halbmy commented Sep 24, 2024

prisae commented Sep 27, 2024

Discussion regarding errorVals-absoluteError-relativeError #741

Discussion regarding errorVals-absoluteError-relativeError #741

Comments

prisae commented Jun 25, 2024

Data error computation

Issue: Problematic default behaviour

Discussion: Error calculation

prisae commented Jun 25, 2024

halbmy commented Jul 1, 2024

prisae commented Jul 1, 2024

halbmy commented Jul 8, 2024

halbmy commented Sep 24, 2024

prisae commented Sep 27, 2024

Discussion regarding `errorVals`-`absoluteError`-`relativeError` #741

Discussion regarding `errorVals`-`absoluteError`-`relativeError` #741