Model of COVID‑19 transmission in the U.S.A. by state

Updated July 13, 2021

The epidemiology of COVID-19 is poorly understood. To better understand the potential range of epidemic outcomes in the U.S.A, we developed a model based on the accumulation of knowledge from various locations and fit it to state-level observations of the number of reported cases and deaths.

Data

Incident Case Data & Death Reports

We fit the model to incident case and death reports for each state in the USA, as collated by The COVID Tracking Project. We show a subset of the state-level data here for Georgia, New York, Washington, and Wyoming. We did not smooth the reported data but we did exclude all negative case and death reports.

Social distancing / mobility data

To date, the primary intervention in all states has been the adoption of social distancing behaviors and improved hygiene. We measure social distancing using aggregated, anonymized locations recorded by location-based mobile phone apps, collected by Unacast. We summarize the effectiveness of social distancing behaviors as the average deviation from baseline, a statistic ranging from 0% (complete cessation of movements) to 100% (no difference from baseline).

The trajectories of human mobility in all 50 states is shown below.

Scenarios

We explore a range of scenarios for transmission. Simulations in each state start on the data on which at least 1 case was first reported. The model is fit to data on reported cases and deaths to July 13, 2021 and propagated six weeks into the future.

Possible futures scenarios

We consider three possible futures scenarios. Our scenarios are all driven by changes in human mobility: increasing human mobility increases transmission rate, while decreasing mobility decreases transmission rate. Our model includes a latent trend in transmission to account for all other factors that make transmission rate vary over time. When projecting into the future, we extract the fitted latent trend effect over the last 30 days and then take the mean over the first 20 days of that 30 day period. We ignore the last 10 days to avoid undue influence of the latent trend as it adapts to fit the most recent data.

  1. Increasing social distancing. Social distancing increases, reducing movements from the current observed level to 30% of normal, which is the reduction observed in New York City that enabled transmission to decline there. Social distancing has the impact of reducing transmission rate. Thus, it is possible that other measures – such as wide-spread, consistent wearing of face masks – could achieve the same or similar level of transmission reduction that we model as an affect of social distancing.

  2. Maintain social distancing. Social distancing continues at the last observed level of human movement patterns and the epidemic trajectory continues to follow it’s most recent path, which is also influenced by the latent trend. Our model captures current increases in case/death reports by allowing transmission rate to increase either due to mobility or the fitted latent trend. Thus, even if mobility remains constant, the most recent increase in transmission causes the trajectory to increase in many cases. As during the beginning of the epidemic, such exponential growth can, and likely will be, averted by increase vigilance through social distancing and other practices that reduce transmission.

  3. Return to normal. Social distancing returns to a level of human movement equivalent to baseline.

Model details

Key features of this model include:

  • Stochastic transmission process with a realistic level of random variation.
  • Realistic distributions of presymptomatic and symptomatic periods.
  • Different transmission rates for asymptomatic, presymptomatic, and symptomatic individuals.1
  • Time varying rates of case detection, isolation, and case notification.
  • Time varying mortality.
  • Accounting for the effect on transmission of human mobility (i.e., social distancing), as measured by data.
  • A “latent” process that captures the effect of environmental factors and other behaviors that can reduce transmission but are difficult to quantify with data (e.g., hand washing or face masks).
  • Realistic intervention scenarios (e.g. improving case detection, faster testing).

The model comprises susceptible, presymptomatic, asymptomatic, symptomatic, diagnosed, hospitalized, deceased, and recovered persons. The following compartments are included:

  • \(\boldsymbol{S}\) - Uninfected and susceptible individuals. Susceptible individuals can become infected by individuals in the \(L\), \(I_a\), \(I_{su}\), \(I_{sd}\), \(C\), and \(H\) stages. Rates of transmission from these stages can be adjusted individually.
  • \(\boldsymbol{L}\) - Individuals with latent infections who do not yet show symptoms. Those individuals can be infectious. At the end of the \(L\) stage, a fraction moves into the \(I_a\) stage, another fraction moves into the \(I_{su}\) stage, and the remainder into the \(I_{sd}\) stage.
  • \(\boldsymbol{I_a}\) - Individuals who are infected and asymptomatic. Those individuals are likely infectious, but the model allows to adjust this.
  • \(\boldsymbol{I_{su}}\) - Individuals who are infected and symptomatic, but are undetected. Those individuals are likely infectious. Individuals in this compartment never get diagnosed, and are assumed to recover.
  • \(\boldsymbol{I_{sd}}\) - Individuals who are infected and symptomatic, and are detected. Those individuals are likely infectious. Individuals in this compartment will be diagnosed and move to \(C\).
  • \(\boldsymbol{C}\) - Individuals who have been diagnosed as cases. Those individuals are likely isolated and not infectious, but the model allows to adjust this. A fraction of individuals in the \(C\) stage will naturally recover, without the need for hospitalization. The remainder moves into the \(H\) stage.
  • \(\boldsymbol{H}\) - Individuals who have been hospitalized. Those individuals are likely isolated and not infectious, but the model allows to adjust this. A fraction of individuals in the \(H\) stage will recover, the remainder will die.
  • \(\boldsymbol{R}\) - Recovered/removed individuals. Those individuals have recovered and are immune.
  • \(\boldsymbol{D}\) - Individuals who died from the infection.

To allow more realistic distributions of movement through compartments, several of these compartments are internally split into multiple stages using the linear chain trick.2

  • \(\boldsymbol{L}\) - 4 compartments
  • \(\boldsymbol{I_a}\) - 4 compartments
  • \(\boldsymbol{I_{su}}\) - 4 compartments
  • \(\boldsymbol{I_{sd}}\) - 4 compartments
  • \(\boldsymbol{C}\) - 4 compartments
  • \(\boldsymbol{H}\) - 4 compartments

The flow diagram for this model shown below.

Interventions

The following interventions are implemented:

  • Social distancing is assumed to reduce all transmission rates by some factor. This is provided as a covariate based on mobility data.
  • Decreasing time to diagnosis, which speeds up the transition from the detected class (I_sd) to the diagnosed cases class (C).
  • Improving detection, which is assumed to increase the fraction of symptomatic individuals that move into the \(I_{sd}\) compartment and will eventually be diagnosed. It is assumed to increase sigmoidally up to a maximum value.

Parameterization

This model was initially parameterized using clinical outcome reports from the epidemic in Hubei province, China and further calibrated with information about COVID-19 elsewhere in China and the United States. Transmissibility of the virus is assumed to be proportional to the level of human movement.

Key parameters estimated using maximum likelihood by iterated filtering (MIF)3 include baseline transmissibility (\(\beta_0\)), maximum ascertainment (i.e. maximum fraction of cases detected), fraction of known cases that are hospitalized, fatality rate among hospitalized cases, and the initial size of the latent class. Auxiliary parameters estimated using MIF include the intensity of the parameter random walk and dispersion parameters for observables.

Disclaimer: The COVID-19 epidemic is changing rapidly, and information that was used in the construction of this model may be incomplete or contain errors. Accordingly, these results are preliminary, provisional, and subject to change. These results have not been peer-reviewed, but have been prepared to a professional standard with the intention of providing useful interpretation of a rapidly developing event.


  1. Rong et al. 2020, Du et al. 2020

  2. Hurtado & Kirosingh 2019

  3. Ionides et al. 2006