Skip to content

kraemer-lab/Mpox_Incubation

Repository files navigation

Mpox Incubation Estimation (Bayesian Workflow)

The present repository reproduces and expands the analyses carried on by Miura and colleagues (2022) [see : https://github.com/fmiura/MpxInc_2022/tree/main]. We expand the analyses by providing fully parametrised models obtaining very close results to Miura et al. (2022). we also provide an example of prior updating where we fit models to data from Italy by adjusting priors based on the Netherlands models posteriors.

Models

We implement the three models used in previous analyses: Lognormal, Gamma and Weibull models, and we add a NegativeBinomial model. All models use the same priors for parameters, with the exception of the sampling distribution (likelihood):

r ~ Beta(1, 1)

a ~ Gamma(1, 1)

e = r(E - S)

m = a + e

s ~ Gamma(1, 1)

y ~ LogNormal(μ=m, σ=s)

Where, E is a vector containing the end of the exposure period, S is a vector containing the beginning of exposure period, r is the exposure period rate, and the observed data for y corresponds to the onset of symptoms vector O - S. The model above is repeated three more times but replacing y with y ~ Gamma(μ=m, σ=s), parametrised via mean μ and standard deviation σ; y ~ Weibull(α=m, β=s) with shape α and scale β, and y ~ NegativeBinomial(μ=m, α=s) with mean μ and shape σ.

Prior Predictive Checks

The defined generic and lowly informative priors may be problematic for some models, as they may not provide good coverage for the observed data ranges or maybe they will cover unrealistic values. Prior predictives for the Netherlands data indicate that LogNormal and Weibull distributions provide very extreme values and Gamma and NegativeBinomial distributions provide more reasonable coverage, but with shifted locations (towards 0). It is probable that posterior distributions will adapt to more realistic values, as prior distributions themselves show very reasonable values within the expected day-range (i.e. up to around 10 days).

Prior predictives for the Italy data show a very similar pattern when no prior update is performed (see prior_predictive directory). However, when posterior information from the Netherlands models is included, prior predictive distributions improve substantially.

Combining data from Netherlands and Italy can serve to assess how much prior update helps to refine inference. Note that default priors are still less accurate respect to updated priors above.

Sampling

Models sampled with NUTS HMC with 1000 tuning steps and 1000 samples over 4 chains. Models sampled well with all R_hats = 1, and ESS > 1000, and BFMIs > 0.9. Rankplots below confirm good convergence from the Netherlands models, models for Italy show similar outputs (see convergence_checks directory).

Posterior Assessment

The estimated mean incubation periods from the Netherlands data reproduce previous results closely, as Table 1 shows. Posterior distributions CDF plots indicate similarity of results, with some additional variation expected from a fully parametrised model. There is a small difference in the LogNormal estimated mean (9.88), which may be explained by the inclusion of the exposure period in the computation of the incubation period mean.

When using the default Beta and Gamma priors, the models applied to Italy's data show similar results as the previous analysis, with mean incubation periods estimated to be between 9 and 10 days.

After this, we used the posterior mean and standard from r, a and s parameters from the Netherlands model and used it to parameterise the respective priors in the Italy model. (See posteriors directory for posteriors.csv files containing posterior summaries and convergence statistics). Results showed slightly improved precision.

Models applied to combined data (Netherlands + Italy) output results that are close to the model with updated priors. Note how the estimates from Italy data with updated priors (above) match the empirical CDFs of combined data closely.

Posterior Predictive Checks

Posterior predictive distributions indicate good performance of models, with a good measurement of uncertainty around the predicted mean and observed data, in particular from Gamma and NegativeBinomial models.

Model Comparison

As expected from small data models, updating priors changes the predictive capacity of models, though it is relevant to emphasise that models show high overlap in their expected log-predictive densities (ELPDs). Italy models indicate that Gamma and NegativeBinomial distributions outperform the other two distributions, in particular when using more informative priors. Note that in this case, the PSIS-LOO comparison indicates that Gamma and NegativeBinomial models have very similar weights of 47% and 53% respectively (see model_comaprison directory), while model with combined data indicates weights of 60% for Gamma, 28% for NegativeBinomial, and 12% for LogNormal. In both cases, Gamma and NegativeBinomial show very close ELPDs, and Gamma has slightly more in-sample predictive capacity.

Conclusion

Results indicate that prior updating on models applied to new data (excluding previous data) can aid the estimation process. However, more testing is needed to asses what are the concrete advantages. Present analyses attempted to reproduce and expand previous results (Miura et al. 2022), but important parts of the analysis process need to be included. A prior sensitivity analysis may be required, as models are very sensitive to prior variation. Also, more technical and theoretical considerations could be included, such as the more appropriate use of Gamma distributions for measuring elapsed times or the NegativeBinomial for assessing count data (i.e. number of days between one event and the next). This could also imply that model averaging between the models using the most appropriate sampling distributions (not necessarily the best ones at prediction) should also be considered.

References

Miura Fuminari, van Ewijk Catharina E, Backer Jantien A, Xiridou Maria, Franz Eelco, Op de Coul Eline, Brandwagt Diederik, van Cleef Brigitte, van Rijckevorsel Gini, Swaan Corien, van den Hof Susan, Wallinga Jacco. Estimated incubation period for monkeypox cases confirmed in the Netherlands, May 2022. Euro Surveill. 2022;27(24):pii=2200448. https://doi.org/10.2807/1560-7917.ES.2022.27.24.2200448

License

MIT License

Copyright (c) 2023 Kraemer Lab, University of Oxford

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

About

Estimation of Mpox incubation times

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages