There are ongoing outbreaks of mpox globally. The Democratic Republic of Congo (DRC) is so far the worst hit with a total of 7,851 cases and 384 deaths reported between January 1 and May 26, 2024
1
. Before 2022, there were few reports of sustained mpox transmission globally.
Autori Henry Laurenson-Schafer, Nikola Sklenovská, Ana Hoxha, Steven M Kerr, Patricia Ndumbi, Julia Fitzner, Maria Almiron, Luis Alves de Sousa, Sylvie Briand, Orlando Cenciarelli, Soledad Colombe, Meg Doherty, Ibrahima Soce Fall, Christian García-Calavaro, Joana M Haussig, Masaya Kato, Abdi Rahman Mahamud, Oliver W Morgan, Pierre Nabeth, Jeremias Domingos Naiene, Wildo Araujo Navegantes, Opeayo Ogundiran, Charles Okot, Richard Pebody, Tamano Matsui, Hugo López-Gatell Ramírez, Catherine Smallwood, Raúl Francisco Pérez Tasigchana, Aisling M Vaughan, George Sie Williams, Peter Omondi Mala, Rosamund F Lewis, Boris I Pavlin, Olivier le Polain de Waroux, Basma Abdelgawad, Amarnath Babu, Evans Buliva, Finlay Campbell, Daniel Cardoso Portela Câmara, Zainab Eleiba, Blanche Johanna Greene-Cramer, Esther Hamblion, Mahmoud Hassan, Kaja Kaasik-Aaslav, Basant Mohamed, Victoria Ndarukwa, James Richard Otieno, Jeffrey Pires, Jukka Pukkila, Felix Sanni, Craig Schultz, Tika Sedai, Laila Skrowny, Manilay Phengxay, Ariuntuya Ochirpurev, Jozica Skufca, Laura Goddard, Viema Biaukula
AbstractUnderstanding and accurately estimating epidemiological delay distributions is important for public health policy. These estimates directly influence epidemic situational awareness, control strategies, and resource allocation. In this study, we explore challenges in estimating these distributions, including truncation, interval censoring, and dynamical biases. Despite their importance, these issues are frequently overlooked in the current literature, often resulting in biased conclusions. This study aims to shed light on these challenges, providing valuable insights for epidemiologists and infectious disease modellers.Our work motivates comprehensive approaches for accounting for these issues based on the underlying theoretical concepts. We also discuss simpler methods that are widely used, which do not fully account for known biases. We evaluate the statistical performance of these methods using simulated exponential growth and epidemic scenarios informed by data from the 2014-2016 Sierra Leone Ebola virus disease epidemic.Our findings highlight that using simpler methods can lead to biased estimates of vital epidemiological parameters. An approximate-latent-variable method emerges as the best overall performer, while an efficient, widely implemented interval-reduced-censoring-and-truncation method was only slightly worse. Other methods, such as a joint-primary-incidence-and-delay method and a dynamic-correction method, demonstrated good performance under certain conditions, although they have inherent limitations and may not be the best choice for more complex problems.Despite presenting a range of methods that performed well in the contexts we evaluated, residual biases persisted, predominantly due to the simplifying assumption that the distribution of event time within the censoring interval follows a uniform distribution; instead, this distribution should depend on epidemic dynamics. However, in realistic scenarios with daily censoring, these biases appeared minimal. This study underscores the need for caution when estimating epidemiological delay distributions in real-time, provides an overview of the theory that practitioners need to keep in mind when doing so with useful tools to avoid common methodological errors, and points towards areas for future research.SummaryWhat was known prior to this paperImportance of accurate estimates:Estimating epidemiological delay distributions accurately is critical for model development, epidemic forecasts, and analytic decision support.Right truncation:Right truncation describes the incomplete observation of delays, for which the primary event already occurred but the secondary event has not been observed (e.g. infections that have not yet become symptomatic and therefore not been observed). Failing to account for the right truncation can lead to underestimation of the mean delay during real-time data analysis.Interval censoring:Interval censoring arises when epidemiological events occurring in continuous time are binned into time intervals (e.g., days or weeks). Double censoring of both primary and secondary events needs to be considered when estimating delay distributions from epidemiological data. Accounting for censoring in only one event can lead to additional biases.Dynamical bias:Dynamical biases describe the effects of an epidemic’s current growth or decay rate on the observed delay distributions. Consider an analogy from demography: a growing population will contain an excess of young people, while a shrinking population will contain an excess of older people, compared to what would be expected from mortality profiles alone. Dynamical biases have been identified as significant issues in real-time epidemiological studies.Existing methods:Methods and software to adjust for censoring, truncation, and dynamic biases exist. However, many of these methods have not been systematically compared, validated, or tested outside the context in which they were originally developed. Furthermore, some of these methods do not adjust for the full range of biases.What this paper addsTheory overview:An overview of the theory required to estimate distributions is provided, helping practitioners understand the underlying principles of the methods and the connections between right truncation, dynamical bias, and interval censoring.Review of methods:This paper presents a review of methods accounting for truncation, interval censoring, and dynamical biases in estimating epidemiological delay distributions in the context of the underlying theory.Evaluation of methods:Methods were evaluated using simulations as well as data from the 2014-2016 Sierra Leone Ebola virus disease epidemic.Cautionary guidance:This work underscores the need for caution when estimating epidemiological delay distributions, provides clear signposting for which methods to use when, and points out areas for future research.Practical guidance:Guidance is also provided for those making use of delay distributions in routine practice.Key findingsImpact of neglecting biases:Neglecting truncation and censoring biases can lead to flawed estimates of important epidemiological parameters, especially in real-time epidemic settings.Equivalence of dynamical bias and right truncation:In the context of a growing epidemic, right truncation has an essentially equivalent effect as dynamical bias. Typically, we recommend correcting for one or the other, but not both.Bias in common censoring adjustment:Taking the common approach to censoring adjustment of naively discretising observed delay into daily intervals and fitting continuous-time distributions can result in biased estimates.Performance of methods:We identified an approximate-latent-variable method as the best overall performer, while an interval-reduced-censoring-andtruncation method was resource-efficient, widely implemented, and performed only slightly worse.Inherent limitations of some methods:Other methods, such as jointly estimating primary incidence and the forward delay, and dynamic bias correction, demonstrated good performance under certain conditions, but they also had inherent limitations depending on the setting.Persistence of residual biases:Residual biases persisted across all methods we investigated, largely due to the simplifying assumption that the distribution of event time within the primary censoring interval follows a uniform distribution rather than one influenced by the growth rate. These are minimal if the censoring interval is small compared to other relevant time scales, as is the case for daily censoring with most human diseases.Key limitationsDifferences between right censoring and truncation:We primarily focus on right truncation, which is most relevant when the secondary events are easier to observe than primary events (e.g., symptom onset vs. infection)—in this case, we can’t observe the delay until the secondary event has occurred. In other cases, we can directly observe the primary event and wait for the secondary event to occur (e.g., eventual recovery or death of a hospitalized individual)—in this case, it would be more appropriate to use right censoring to model the unresolved delays. For simplicity, we did not cover the right censoring in this paper.Daily censoring process:Our work considered only a daily interval censoring process for primary and secondary events. To mitigate this, we investigated scenarios with short delays and high growth rates, mimicking longer censoring intervals with extended delays and slower growth rates.Deviation from uniform distribution assumption:We show that the empirical distribution of event times within the primary censoring interval deviated from the common assumption of a uniform distribution due to epidemic dynamics. This discrepancy introduced a small absolute bias based on the length of the primary censoring window to all methods and was a particular issue when delay distributions were short relative to the censoring window’s length. In practice, other biological factors, such as circadian rhythms, are likely to have a stronger effect than the growth rate at a daily resolution. Nonetheless, our work lays out a theoretical ground for linking epidemic dynamics to a censoring process. Further work is needed to develop robust methods for wider censoring intervals.Temporal changes in delay distributions:The Ebola case study showcased considerable variation in reporting delays across the epidemic timeline, far greater than any bias due to censoring or truncation. Further work is needed to extend our methods to address such issues.Lack of other bias consideration:The idealized simulated scenarios we used did not account for observation error for either primary or secondary events, possibly favouring methods that do not account for real-world sources of biases.Limited distributions and methods considered:We only considered lognormal distributions in this study, though our findings are generalizable to other distributions. Mixture distributions and non-parametric or hazard-based methods were not included in our assessment.Exclusion of fitting discrete-time distributions:We focused on fitting continuous-time distributions throughout the paper. However, fitting discretetime distributions can be a viable option in practice, especially at a daily resolution. More work is needed to compare inferences based on discrete-time distributions vs continuous-time distributions with daily censoring.Exclusion of transmission interval distributions:Our work primarily focused on inferring distributions of non-transmission intervals, leaving out potential complications related to dependent events. Additional considerations such as shared source cases, identifying intermediate hosts, and the possibility of multiple source cases for a single infectee were not factored into our analysis.
Estimation of the effective reproductive numberRtis important for detecting changes in disease transmission over time. During the Coronavirus Disease 2019 (COVID-19) pandemic, policy makers and public health officials are usingRtto assess the effectiveness of interventions and to inform policy. However, estimation ofRtfrom available data presents several challenges, with critical implications for the interpretation of the course of the pandemic. The purpose of this document is to summarize these challenges, illustrate them with examples from synthetic data, and, where possible, make recommendations. For near real-time estimation ofRt, we recommend the approach of Cori and colleagues, which uses data from before timetand empirical estimates of the distribution of time between infections. Methods that require data from after timet, such as Wallinga and Teunis, are conceptually and methodologically less suited for near real-time estimation, but may be appropriate for retrospective analyses of how individuals infected at different time points contributed to the spread. We advise caution when using methods derived from the approach of Bettencourt and Ribeiro, as the resultingRtestimates may be biased if the underlying structural assumptions are not met. Two key challenges common to all approaches are accurate specification of the generation interval and reconstruction of the time series of new infections from observations occurring long after the moment of transmission. Naive approaches for dealing with observation delays, such as subtracting delays sampled from a distribution, can introduce bias. We provide suggestions for how to mitigate this and other technical challenges and highlight open problems inRtestimation.
Pubblicato in The American Journal of Tropical Medicine and Hygiene
Autori Kelly Charniga, Andrea M. McCollum, Christine M. Hughes, Benjamin Monroe, Joelle Kabamba, Robert Shongo Lushima, Toutou Likafi, Beatrice Nguete, Elisabeth Pukuta, Elisabeth Muyamuna, Jean-Jacques Muyembe Tamfum, Stomy Karhemere, Didine Kaba, Yoshinori Nakazawa
ABSTRACT.
Incidence of human monkeypox (mpox) has been increasing in West and Central Africa, including in the Democratic Republic of Congo (DRC), where monkeypox virus (MPXV) is endemic. Most estimates of the pathogen’s transmissibility in the DRC are based on data from the 1980s. Amid the global 2022 mpox outbreak, new estimates are needed to characterize the virus’ epidemic potential and inform outbreak control strategies. We used the R package vimes to identify clusters of laboratory-confirmed mpox cases in Tshuapa Province, DRC. Cases with both temporal and spatial data were assigned to clusters based on the disease’s serial interval and spatial kernel. We used the size of the clusters to infer the effective reproduction number, Rt, and the rate of zoonotic spillover of MPXV into the human population. Out of 1,463 confirmed mpox cases reported in Tshuapa Province between 2013 and 2017, 878 had both date of symptom onset and a location with geographic coordinates. Results include an estimated Rt of 0.82 (95% CI: 0.79–0.85) and a rate of 132 (95% CI: 122–143) spillovers per year assuming a reporting rate of 25%. This estimate of Rt is larger than most previous estimates. One potential explanation for this result is that Rt could have increased in the DRC over time owing to declining population-level immunity conferred by smallpox vaccination, which was discontinued around 1982. Rt could be overestimated if our assumption of one spillover event per cluster does not hold. Our results are consistent with increased transmissibility of MPXV in Tshuapa Province.
Methodology (stat.ME)FOS: Computer and information sciences
Autori Kelly Charniga, Sang Woo Park, Andrei R Akhmetzhanov, Anne Cori, Jonathan Dushoff, Sebastian Funk, Katelyn M Gostic, Natalie M Linton, Adrian Lison, Christopher E Overton, Juliet R C Pulliam, Thomas Ward, Simon Cauchemez, Sam Abbott
Epidemiological delays, such as incubation periods, serial intervals, and hospital lengths of stay, are among key quantities in infectious disease epidemiology that inform public health policy and clinical practice. This information is used to inform mathematical and statistical models, which in turn can inform control strategies. There are three main challenges that make delay distributions difficult to estimate. First, the data are commonly censored (e.g., symptom onset may only be reported by date instead of the exact time of day). Second, delays are often right truncated when being estimated in real time (not all events that have occurred have been observed yet). Third, during a rapidly growing or declining outbreak, overrepresentation or underrepresentation, respectively, of recently infected cases in the data can lead to bias in estimates. Studies that estimate delays rarely address all these factors and sometimes report several estimates using different combinations of adjustments, which can lead to conflicting answers and confusion about which estimates are most accurate. In this work, we formulate a checklist of best practices for estimating and reporting epidemiological delays with a focus on the incubation period and serial interval. We also propose strategies for handling common biases and identify areas where more work is needed. Our recommendations can help improve the robustness and utility of reported estimates and provide guidance for the evaluation of estimates for downstream use in transmission models or other analyses.