To guide the collection of data under emergent epidemic conditions, we reviewed compartmental models of historical Ebola outbreaks to determine their implications and limitations. We identified future modeling directions and propose that the minimal epidemiologic dataset for Ebola model construction comprises duration of incubation period and symptomatic period, distribution of secondary cases by infection setting, and compliance with intervention recommendations.
Mathematical models are used to generate epidemic projections under different scenarios, provide indicators of epidemic potential, and highlight essential needs for data. To aid the interventions in the 2014 Ebola epidemic in West Africa, in September 2014 we reviewed models of historical Ebola virus (EBOV) outbreaks (
| Feature | Model | ||
|---|---|---|---|
| Chowell et al. ( | Lekone and Finkenstädt ( | Legrand et al. ( | |
| Outbreak* | DRC 1995, Uganda 2000† | DRC 1995‡ | DRC 1995, Uganda 2000§ |
| Assumed | |||
| Homogeneous random mixing | Yes | Yes | Yes |
| All human-to-human contact | Yes | Yes | Yes |
| Considered | |||
| Nosocomial transmission | No | No | Yes |
| Burial transmission | No | No | Yes |
| No. transmission parameters | 2 (preintervention decays to postintervention) | 1 (decay to 0) | 3 (community, nosocomial, burial) |
| Distribution | Exponential | Geometric | Exponential |
| Underreporting accounted for | No | No | No |
*The DRC outbreak was caused by the Zaire strain; the Uganda outbreak was caused by the Sudan strain. DRC, Democratic Republic of Congo.
†Data sources: DRC 1995 (
| Reference | Outbreak | Model | R0 estimate | Incubation period, d (SD)† | Infectious period, d (SD) |
|---|---|---|---|---|---|
| Chowell et al. ( | DRC 1995 | SEIR‡ | 1.83 (SD 0.06) | 5.3 (0.23) | 5.61 (0.19) |
| Uganda 2000 | SEIR‡ | 1.34 (SD 0.03) | 3.35 (0.49) | 3.5 (0.67) | |
| Lekone and Finkenstädt ( | DRC 1995 | SEIR, MCMC (vague prior) | 1.383 (SD 0.127) | 9.431 (0.620) | 5.712 (0.548) |
| DRC 1995 | SEIR, MCMC (informative prior) | 1.359 (SD 0.128) | 10.11 (0.713) | 6.523 (0.564) | |
| Legrand et al. ( | DRC 1995 | Stochastic compartmental model (SEIHFR) | 2.7 (95% CI 1.9–2.8) | ||
| Uganda 2000 | Stochastic compartmental model (SEIHFR) | 2.7 (95% CI 2.5–4.1) | |||
| Eichner et al. ( | DRC 1995 | Incubation period estimate based on parameterized lognormal distribution function | 12.7 (4.31) | ||
| Ferrari et al. ( | DRC 1995 | MLE | 3.65 (95% CI 3.05–4.33) | ||
| DRC 1995 | Regression | 3.07§ | |||
| Uganda 2000 | MLE | 1.79 (95% CI 1.52–2.30) | |||
| Uganda 2000 | Regression | 2.13§ | |||
| White and Pagano ( | DRC 1995 | MLE | 1.93 (95% CI 1.74–2.78) |
*DRC, Democratic Republic of Congo; MCMC: Markov chain Monte Carlo; MLE, maximum-likelihood estimation; SEIR, susceptible-exposed-infectious-removed; SEIHFR, susceptible-exposed-infectious-hospitalized-funeral-removed. Blank cells indicate that no information was provided from the original study. †The incubation period for Ebola virus is believed to be the same as its latent period, i.e., infected persons become infectious only when symptomatic. ‡Combination differential equation model and Markov chain model. §Neither CIs nor SDs were provided in the study.
Chowell et al. (
Conceptual diagrams illustrating Ebola SEIR and SEIHFR models of historical Ebola virus outbreaks. SEIR, susceptible-exposed-infectious-removed; SEIHFR, susceptible-exposed-infectious-hospitalized-funeral-removed.
Lekone and Finkenstädt (
Legrand et al. (
These models (
Three additional studies estimated incubation period or R0 by using statistical models. Eichner et al. (
Collectively, these studies underscore that practical decisions in modeling dictate trade-offs between fitting to limited data and explicit representation of reality, including interventions. A model with a single transmission rate might fit well to data but might not be useful for decision making that evaluates intervention effects in different transmission contexts. A model with 3 transmission rates might represent transmission in community, nosocomial, and funeral contexts (e.g., [
Other features are important for understanding the probable paths of small outbreaks. These include nonexponential incubation and infectious periods (
Another issue that has not been studied is the role of spatial scale. All extensive EBOV outbreaks involved multiple scales of transmission. At the smallest scale, persons most at risk for infection are those caring for an Ebola patient. Understanding these household contacts helps estimate outbreak size. Human settlements constitute a “household of households.” Transmission occurs among households in communities, at hospitals, or at funerals. Understanding these between-household contacts is needed to determine the outbreak’s extent. Finally, understanding connections between settlements by human movements is needed to determine the paths and speed of large-scale spatial spread and therefore the total infected area and domain for surveillance and monitoring. Although the assumption of population homogeneity can be justified for models of historical EBOV outbreaks, given the limited geographic extent of those outbreaks, models for the 2014 outbreak might need to address heterogeneity in population density and human movements because of the extensive geography involved.
Two issues new to the 2014 EBOV epidemic are underreporting and compliance. To assess underreporting, perhaps comprehensive contact tracing can be performed in a small number of locales and extrapolated. If cases can be identified through 2 independent routes, then case matching can be used to identify the total number of cases (
Model fitting is craft as well as science. Modeling demands decisions, including what mathematical representations to use, the type and magnitude of variation to be considered, and the values that can be taken as given versus the values still to be estimated. In the face of data scarcity, we suggest that construction of models of the 2014 outbreak would have benefited from a minimal dataset that included 1) the mean and variance of the incubation period and symptomatic period, respectively; 2) the probability distribution of secondary cases by infection setting; and 3) compliance with recommendations. For secondary cases, in addition to the average, the commonness of outliers (super-spreaders), the frequency of zeros, and the variance in the distribution need to be known.
Methods for study of the transmission models of historical Ebola outbreaks.
These authors contributed equally to this article.
We thank Matthew Ferrari, Martin I. Meltzer, Pej Rohani, and Zhisheng Shuai for helpful discussion and critical comments on previous versions of this manuscript.
J.M.D. is supported by the National Institute of General Medical Sciences of the National Institutes of Health under award no. U01GM110744. M.G. is supported by the National Health and Medical Research Council of Australia.
Dr. Drake is an associate professor in the Odum School of Ecology, University of Georgia, Athens, Georgia, USA. His research focuses on the theory and modeling of epidemic dynamics.