Conceived and designed the experiments: MAJ. Performed the experiments: MAJ NAV. Analyzed the data: MAJ NAV BJB. Contributed reagents/materials/analysis tools: MAJ NAV NG NM. Wrote the paper: MAJ NAV BJB JES NG NM.
The global spread of infectious diseases is facilitated by the ability of infected humans to travel thousands of miles in short time spans, rapidly transporting pathogens to distant locations. Mathematical models of the actual and potential spread of specific pathogens can assist public health planning in the case of such an event. Models should generally be parsimonious, but must consider all potentially important components of the system to the greatest extent possible. We demonstrate and discuss important assumptions relative to the parameterization and structural treatment of airline travel in mathematical models. Among other findings, we show that the most common structural treatment of travelers leads to underestimation of the speed of spread and that connecting travel is critical to a realistic spread pattern. Models involving travelers can be improved significantly by relatively simple structural changes but also may require further attention to details of parameterization.
The global airline network has brought the entire world closer together than ever before, creating an environment in which pathogens can readily spread to distant locations. Influenza viruses are perhaps the most widely recognized example of this phenomena
These models are generally framed as metapopulation models, with spatially discrete populations connected by transportation networks. The discrete populations themselves are subdivided in terms of infection status with compartments for susceptible, incubating, infectious, and recovered individuals. Development of these models presents many challenges related to the characterization of the populations, the disease, and relevant travel patterns. We focus here on oft-neglected assumptions related to the characterization of travel which may affect the speed and pattern of epidemic growth and spread.
One fundamental assumption in many models is that travelers are, in essence, migrants; people who move permanently or semi-permanently from one geographic population to another
Two different model structures are shown using a simplified, two-city, Susceptible (S)-Exposed (E)-Infected (I)-Recovered (R) model. In the migration model, travelers represent migrants, mixing the populations of the two cities, City A and B. In the traveler model, some residents from City A (blue) and City B (green) travel temporarily to the other city, but eventually return to their original city. In each model, individuals may progress through the infection stages (red arrows), the rates of which may be city-dependent.
It is also generally assumed that exposure occurs via mass action, i.e., that all susceptible individuals in a given location are exposed with the same probability to the infectious agent. Even without considering travel, however, there is significant stratification within populations and people do not all interact equally
Infection with a pathogen may also have a direct impact on travel. Most models assume that infectious travelers are ill and thus are unlikely to travel
Characterization of a travel network itself is another challenge. Many models quantify travel volumes or frequencies in terms of the number of seats available on direct flights between each location and average occupancy on those flights
We used mathematical models of two generalized pathogens, one directly transmitted (modeled on severe acute respiratory syndrome-associated coronavirus (SARS-CoV)) and one vector-borne (modeled on dengue viruses (DENV)) to evaluate the effects of different structural model designs and characterizations of airline travel on the speed and pattern of epidemic spread. In addition to evaluating directly transmitted pathogens, we also include a vector-borne pathogen because the dependence on a vector increases the geographical heterogeneity in transmission dynamics and may alter the effects of travel-related assumptions for a pathogen.
Two base models were constructed, one for the directly transmitted pathogen and one for the vector-borne pathogen. Described in detail in
To introduce straightforward geographic variability in the models of the vector-borne pathogen, we randomly selected 10 cities to have
To test the effect of treating travelers as migrants rather than as temporary travelers, we compared the base, “traveler” models with “migration” models in which we eliminated the traveler compartments and assumed that all travelers immediately mixed with local populations upon arrival in a new city.
To test the role of heterogeneous travel patterns, we divided travelers into two compartments based on approximations of the duration of stay for those residing in hotels versus private homes. In the base models, travelers stay an average of 18 days in their destination (exponentially distributed)
To test the importance of infectious travelers, we compared the base models, which allow individuals to travel regardless of infection status, with models in which infectious individuals do not travel.
We also assessed three different travel network parameterizations: seat-based direct-travel only, connecting-travel-inclusive, and a skewed version of the connecting-travel. Connecting travel was estimated using a regression model of sampled itinerary data and covariates based on travel network characteristics (described in detail in
To analyze the effect of different assumptions, we ran simulations of each model under the different sets of assumptions. In each simulation, it was assumed that the global population was 100% susceptible and 10 infectious individuals were introduced into a single city. The origin city was selected from the subset of cities containing vector populations and was used for simulated epidemics of both the directly transmitted and the vector-borne pathogens. The same city was used as the origin city for the directly transmitted model.
Different models were compared by performing 100 simulations of each model and comparing quantitative outcomes via regression analysis. The timing of the first autochthonous transmission event in each city and the final epidemic size in each city were compared using a linear Gaussian regression with city as a covariate to control for intra-city correlation across simulations. Intra-simulation correlation was not found, i.e., earlier or later introduction was related to the city and the model, but not to the simulation. Statistical analyses were performed in R version 2.11.1
We contrasted two different model designs, one treating all travelers as migrants (migration models) and one treating them as temporary travelers (traveler models, the base models). For the directly transmitted pathogen, the pathogen introduction caused a pandemic.
Each line is the empirical cumulative probability (for 100 simulations) of the first autochthonous transmission event in a single city in a single model. The dotted lines are for a city closely connected to the origin city for the traveler (black) and migration (red) models. The solid lines are for a more distal city. A. is the directly transmitted pathogen and B. is the vector-borne pathogen.
The final epidemic size was slightly decreased in the migration model (0.079%, 95% CI: 0.088–0.069%) compared to the traveler model for the directly transmitted pathogen. However, long-term dynamics were generally consistent with the pandemic ending in global extinction of the pathogen within 1.5 years. In contrast, there were long-term differences in the models of the vector-borne pathogen. In the traveler model, initial epidemics were followed by global extinction of the pathogen within two years (
A. The mean proportion of the population newly infected per day for the traveler (black) and migration (red) models. The pathogen only persists in the migration model and causes a second outbreak approximately two years after the first. B. The mean proportion of the population susceptible (black), incubating and infectious (red), and recovered (blue) in the migration model over time. Approximately 20% of the population is replaced by incoming susceptibles between the two epidemics.
We designed additional traveler models (heterogeneous-traveler models), which had different periods of stay for travelers staying in hotels compared to travelers staying in homes. We investigated the effects of the traveler heterogeneity in the model with the directly transmitted pathogen by blocking the interaction of travelers staying in hotels with the rest of the population (i.e. residents or travelers staying in homes). This delayed spread to other cities by an average of 9.2 days (95% CI: 8.9–9.5 days) and reduced ultimate epidemic size by 0.36% (95% CI: 0.34–0.37%). Blocking transmission associated only with travelers staying in homes delayed spread by 5.2 days (95% CI: 4.9–5.5 days) and decreased final epidemic size by 0.12% (95% CI: 0.11–0.13%).
For the heterogeneous-traveler model with the vector-borne pathogen, blocking the interaction of travelers staying in hotels with mosquitoes reduced the speed of spread by an average of 11 days (95% CI: 9–13 days) and decreased the ultimate size of the epidemics by 0.40% (95% CI: 0.39–0.41%). In contrast, blocking traveler-associated transmission in homes slowed spread by 5.5 days (95% CI: 3.8–7.3 days) and decreased final epidemic size by 0.36% (95% CI: 0.35–0.37%).
In the base model, infectious individuals travel with the same frequency as any other individual. We constructed additional models (no-infectious-travel models), in which infectious individuals did not travel, i.e., infectious individuals neither leave their city of residence nor do they return home from another city until they have recovered. Eliminating infectious travel resulted in spread to other locations occurring approximately 3.8 days (95% CI: 2.6–5.0 days) and 3.9 days (95% CI: 2.2–5.5 days) later in the models with directly transmitted and vector-borne pathogens, respectively. In the no-infectious-travel models, the final epidemic size was slightly decreased (mean difference: −0.018%, 95% CI: −0.028–−0.007%) for the directly transmitted pathogen, and was unchanged (mean difference: 0.0007%, 95% CI: −0.0006–0.0020%) for the vector-borne pathogen.
We first assessed the role of different assumptions regarding the structure of the airline travel network using the base models (with connecting travel) compared to models with travel restricted to single-leg, direct flights (direct-travel-only models). Overall, for the directly transmitted pathogen, the onset of autochthonous transmission in secondary cities in the direct-travel model occurred on average 10.5 days (95% CI: 10.1–11.0 days) later (
Each line is the empirical cumulative probability (for 100 simulations) of the first autochthonous transmission for the connecting-travel model (black) and direct-travel model (red) for 3 cities: A. directly connected to the origin city; B. non-directly connected, intermediate distance city; and C. a distant city.
Models of the vector-borne pathogen had significant geographical heterogeneity and showed more drastic effects when travel characteristics were modified (
Each line is the empirical cumulative probability (for 100 simulations) of the first autochthonous transmission for the connecting-travel model (black) and direct-travel model (red) for 3 cities: A. directly connected to the origin city; B. non-directly connected, intermediate distance city; and C. a distant city.
The connecting-travel network was then modified to skew traffic such that more of the travelers would be residents of non-tropical areas visiting tropical areas than vice versa. Spread occurred approximately 1.6 days (95% CI: 1.3–1.9 days) earlier in the skewed-travel model for the directly transmitted pathogen. In non-tropical cities, epidemic size did not change (95% CI: −0.006–0.015%), but in tropical cities, average epidemic size increased by 0.57% (95% CI: 0.52–0.62%). The skewed-travel model of the vector-borne pathogen behaved similarly with a non-significant decrease in the speed of spread (95% CI: −0.3–3.0 days). Epidemic size decreased in the non-tropical cities by 0.040% (95% CI: 0.028–0.052%) and increased in the tropical cities by 0.26% (0.23–0.29%).
The global connectivity of air travel presents a significant public health issue as infected travelers may move pathogens long distances in short time periods. We explored various assumptions and considerations related to the incorporation of travel in models, including the structural treatment of travelers, the role of infectious travelers, the interaction of travelers with local communities, and the characterization of the travel network. Analyzed outcomes included both local epidemic characteristics (epidemic size) and global characteristics (the speed of spread).
Most of the assumptions we tested resulted in relatively small modifications to local transmission, affecting the size of epidemics by less than one percent. For example, introducing heterogeneity among travelers reduced the average epidemic size in all models by a maximum of approximately 0.40%. These reductions occur because a proportion of travelers are never exposed, thus reducing the effective susceptible population size. A greater change in local epidemic size was seen in the direct-travel-only model for the vector-borne pathogen. The direct-travel-only model assumes that every seat is potentially used by a direct traveler as opposed to a seat potentially used by a connecting traveler. This overestimation of direct travel increases the overall volume of travel and thus leads to greater local replenishment of susceptibles, a phenomenon which takes on added importance when transmission is spatially heterogeneous. Though the mechanism is different, the effect of susceptible replenishment is also seen in the migration model. In this case, the replenishment of approximately 18% of the population within a single year led to recurrent epidemics, something completely absent from the traveler model.
Another critical outcome of interest from these models is the speed of spread across the network. Treating travelers as migrants reduced the speed of spread to other areas by approximately 3.3 and 6.7 days in models of the directly transmitted and vector-borne pathogens, respectively. This difference is likely due to proportional changes in the travelling population. Once an epidemic begins in a given city, travelers leaving that city will contain a mix of previously exposed (i.e. immune) individuals proportional to the general population. In the traveler model, this proportion is reduced by half because only half of the travelers are outgoing residents; the other half are visitors returning to their home. Because there are more susceptible travelers leaving and returning to cities where the pandemic has yet to arrive, the likelihood of a newly exposed individual arriving is slightly increased in the traveler model.
Preventing infectious individuals from travelling also led to delayed introduction by an average of 3.8 and 3.9 days for the models of the directly transmitted and vector-borne pathogens, respectively. Adding heterogeneity to exposure among travelers eliminates some potential infections among travelers and thus has the same effect.
Using different travel-network designs had an even greater effect on spread. Assuming that no travelers travel beyond the cities directly connected to their residential city delayed spread of the directly transmitted pathogen by an average of 10.5 days and the vector-borne pathogen by an average of 188 days. For both, that effect was spatially heterogeneous. The cities with high connectivity to the origin city experienced early introduction and distant cities, delayed introduction. For the vector-borne pathogen, the effects are more evident because cities with vector populations are not necessarily directly connected to each other. Of the ten randomly selected cities with vectors, a single city was directly connected to the origin city and four cities were not directly connected to any other city with a vector population. For these isolated cities, the pathogen could only arrive via an individual who is exposed in one city, returns to his/her vector-free home city and later travels to another city before recovering. With the large, intervention-free epidemics produced in our model, spread to these isolated cities did reliably occur except for in the city where an epidemic only occurred in 40% of the simulations (
Skewing the flow of travelers to increase the proportion of travelers visiting tropical areas from non-tropical areas resulted in a slightly increased speed of spread in models of the directly transmitted pathogen (approximately 1.6 days). This likely resulted from the fact that the origin city was a non-tropical city, and thus infected residents were more likely to leave. For the model of the vector-borne pathogen, skewed travel did not significantly impact the speed of spread.
The models described here present various scenarios based on different assumption sets for two pathogens with particular transmission parameterizations. For example, the ratio of the incubation period to the amount of time spent in other areas is an important determinant of spread dynamics
Though our results are particular to the systems we have analyzed, they demonstrate important generalities. The assumptions analyzed are related to both model structure and parameterization. The traveler model, for example, is essentially a structural upgrade relative to the migration model as it requires a single new parameter: the duration of stay. Though we used a universal exponential distribution to describe the duration of stay in the base model, it could also be parameterized as location-specific or time-dependent. The delayed introduction and the long-term persistence associated exclusively with the unrealistic, yet mostly commonly used, migration-style model suggest that it is not ideally suited to these types of studies. Developing a traveler model rather than a migration model requires a single additional parameter and a multiplicative increase in structural complexity as each location-compartment needs to be replicated in each other location. This increase in structural complexity, however, is not a formidable barrier given the power of modern day computers.
Travel for infectious individuals is both a structural and parameterization issue. For pathogens causing particularly severe disease, it may be completely realistic for no infectious individuals to travel, but for pathogens associated with mild disease or asymptomatic infection, infectious individuals may travel at the same rate as any other individual. However, data on differential travel practices for infectious individuals is rare. In the absence of solid data, we favor having infectious individuals travel at the same rate as others, thus avoiding the delays in spread induced by an unsubstantiated assumption.
Segregating exposure in the heterogeneous-traveler model for travelers staying in homes versus hotels also reduced the speed of spread. If adequate data exist to resolve different groups and different risk factors, it is advisable to include them in the model. The necessary structural changes are relatively straightforward; the challenge lies in the parameterization.
The final assumption we assessed was the parameterization of the travel network. It had the greatest effect on the results. Assuming that travelers only travel between cities connected by direct flights biased the speed of spread in both models and resulted in truncation of some of the vector-borne pandemics. As discussed above, truncation may result from spatial heterogeneity in pathogen transmission. Spatial heterogeneity may be somewhat overstated in our vector-borne model, but some level of spatiotemporal heterogeneity is generally present even for directly transmitted pathogens
The greatest hurdle for developing more realistic travel networks for models is the difficulty of obtaining adequate data. As discussed in
The effects discussed here may be more or less pronounced for other pathogens. Notably, the importance of travel network assumptions is particularly important in the presence of geographic heterogeneity. Based on our findings, both the structural treatment of travelers and the consideration of connecting travel may be more important than currently recognized.
In future modeling efforts, we suggest at minimum assessing the sensitivity to the structural changes considered here, in particular those related to infectious travelers and treating travelers as temporary visitors rather than migrants. Improving some of the other deficiencies discussed here is also important but will require more extensive efforts due to the complexity of the problems and the paucity of relevant data.
(DOCX)
Click here for additional data file.
We thank Oscar Padro for his assistance in developing the code behind the models and Anna Stapleton for her work in the early stages of model development.