Understanding large prediction intervals #510
-
Setup
DescriptionIn some cases, the prediction intervals produced can be very large or even infinite. Based on my experience, this issue often arises from insufficient data for calibration and fitting of the residual estimator – which may lead to a poorly fitted residual estimator, unrepresentative samples for calibration, or both. QuestionGiven that there is sufficient data and the estimator is well-fitted to the training data, what other factors could cause unexpectedly large prediction intervals? For example, I guess that out-of-distribution data is one of the reasons. Are there other potential issues to be aware of that might indicate when the prediction intervals might not be reliable? |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments
-
Indeed, out-of-distribution data can be the cause of large prediction intervals. The model used to estimate residuals may also be poorly fitted (overfitted/underfitted), resulting in overestimating or underestimating the residual variance. Maybe our conformal predictions experts @vincentblot28 or @thibaultcordier have other reasons in mind? |
Beta Was this translation helpful? Give feedback.
-
Hi @jesuinovieira, I agree with the Valentin's answer. Maybe to better understand why you have such large prediction intervals you could do some sanity checks such as:
|
Beta Was this translation helpful? Give feedback.
Hi @jesuinovieira, I agree with the Valentin's answer. Maybe to better understand why you have such large prediction intervals you could do some sanity checks such as: