Volume 12, Number 1—January 2006
Real-time Forecast of Multiphase Outbreak
We used a single equation with discrete phases to fit the daily cumulative case data from the 2003 severe acute respiratory syndrome outbreak in Toronto. This model enabled us to estimate turning points and case numbers during the 2 phases of this outbreak. The 3 estimated turning points are March 25, April 27, and May 24. The estimated case number during the first phase of the outbreak between February 23 and April 26 is 140.53 (95% confidence interval [CI] 115.88–165.17) if we use the data from February 23 to April 4; and 249 (95% CI: 246.67–251.25) at the end of the second phase on June 12 if we use the data from April 28 to June 4. The second phase can be detected by using case data just 3 days past the beginning of the phase, while the first and third turning points can be identified only ≈10 days afterwards. Our modeling procedure provides insights into ongoing outbreaks that may facilitate real-time public health responses.
Mathematical models have been used to predict the course of epidemics, albeit with mixed results (1). Whether and how infectious diseases are likely to spread (2–4) are affected by stochastic events (5). Once outbreaks have begun, knowing their potential severity helps public health authorities respond immediately and effectively. Much relevant information is contained in the answers to 2 questions: 1) Is the current outbreak getting better or worse? 2) How many people will be infected before the outbreak ends? Attempts to answer these questions in the early stages of an epidemic can be futile and at times misleading (6); nonetheless, we can address them with an appropriate mathematical model once sufficient time has elapsed (7). Moreover, answers can be accurate if no stochastic event occurs that could substantially alter the course of outbreaks.
We use a variation of the single-equation Richards model (8) to answer these key questions. Unlike models with several compartments commonly used to predict the spread of disease, the Richards model considers only the cumulative infective population size with saturation in growth as the outbreak progresses, caused by decreases in recruitment because of attempts to avoid contacts (e.g., wearing facemask) and implementation of control measures.
The basic premise of the Richards model is that the daily incidence curve consists of a single peak of high incidence, resulting in an S-shaped epidemic curve and a single turning point of the outbreak. These turning points, defined as times at which the rate of accumulation changes from increasing to decreasing or vice versa, can be easily located by finding the inflection point of the epidemic curve, the moment at which the trajectory begins to decline. This quantity has obvious epidemiologic importance, indicating either the beginning (i.e., moment of acceleration after deceleration) or end (i.e., moment of deceleration after acceleration) of a phase. The Richards model fits the single-phase severe acute respiratory syndrome (SARS) outbreaks in Hong Kong and Taiwan (7,9) well. However, in the case of the Toronto outbreak, the second wave of nosocomial infections in May caused the epidemic curve to deviate from the standard S shape. We propose an improvised version of the Richards model that fits the epidemic in Toronto and, subsequently, provide a simple procedure for real-time forecasts of outbreaks with secondary and tertiary waves.
The Richards model is logistic and is described by a single differential equation. The equation is given below, where I(t) is the cumulative number of infected cases at time t in days:
The model parameters are as follows: K is the carrying capacity or total case number, r is the per capita growth rate of the infected population, and a is the exponent of deviation from the standard logistic curve. Because the Richards model typically exhibits a single S-shaped curve, it is not suitable for the SARS epidemic in Canada illustrated in Figure 1.
To rectify this situation, we proposed a multistage Richards model, 1 stage for each of the S-shaped segments resulting from multiple waves of infection during this outbreak. Stages are distinguished by turning points (or inflection points), denoting acceleration after deceleration at the end of each S-shaped segment, the local minima of the corresponding incidence curves. For an n-phase epidemic outbreak, n – 1 local minima separate the n phases. For illustration, the incidence curve for Toronto given in Figure 2 contains 2 peaks (local maximum or turning point of the first type) and 1 valley (local minimum or turning point of second type). The multistage Richards model procedure requires 5 steps. First, fit the Richards model to cumulative cases on successive days by using a standard least-square routine. For single-phase outbreaks, parameter estimates (a, r, ti, K) will converge as the trajectory approaches carrying capacity K, as demonstrated in the Taiwan and Hong Kong SARS outbreaks (7,9). Second, if estimated parameters remain convergent until no more new cases are detected, the outbreak has only 1 phase. However, if the estimates begin to diverge from heretofore fixed values, one knows that a turning point denoting the start of a second phase has occurred. Third, locate the turning point, tmin, separating 2 S-shaped phases of the epidemic as the local minimum of the incidence curve (Figure 2). This is the curve for S´(t) given in the equation (1). Fourth, fit the Richards model to the cumulative case curve again, but starting from tmin + 1, the day after the start of second phase. The estimated parameters (a, r, ti, K) will again converge as the curve approaches the carrying capacity K for the second phase. Finally, repeat steps 2–4 in the event more phases occur until the outbreak ends.
By considering successive S-shaped segments of the epidemic curve separately, one can estimate the maximum case number, K, and locate the turning points, thus providing an estimate for the cumulative number of cases during each phase.
For the phase starting February 23, we estimate parameters from data ending on various dates in Table 1. We could obtain estimates for every consecutive day after recognizing the outbreak, but we only give results for every 10 days for brevity, with the first ending on March 25. The best fitting Richards model, ending on April 26 and 28, yields the parameter values given in bold letters. The estimated value for the turning point ti during this phase is computed from the estimates for r, a, and tm by using equation (2). As the initial time t = 0 is February 23 and symptom onset occurs ≈5 days after infection (10), ti = 30.43 gives the first inflection point around March 25 or first turning point (from acceleration to deceleration) for disease transmission in the Toronto area ≈5 days before March 20.
The number of cases during the phase ending on April 26 is 144, well approximated by our carrying capacity, K = 144.14 (95% confidence interval [CI] 142.19–146.09). Moreover, the results in Table 1 show that, using data from February 23 to April 4, or 10 days after the turning point of this phase, model fitting gives an estimate of K = 140.53 (95% CI 115.88–165.17). That is, given case data at the time of the outbreak, we could estimate the cumulative case number in the first phase accurately (Figure 3) 10 days after the turning point on March 25 and 22 days before the end of the first phase. This estimate also is the cumulative case number assuming no subsequent waves of infection.
Unfortunately, this was only the first wave in this outbreak, as indicated by estimates starting to diverge again after April 30. The last 2 rows of Table 1 suggest that the second turning point, the start of a second phase of this outbreak, occurred by April 30. Consequently, we go to step 3 in our procedure.
Here we use the incidence data starting on April 18 and continuing past April 30 to obtain a least-squares estimate of the minimum point tmin of the incidence curve. This choice of period ensures the minimum is contained in the time interval. Given that t = 0 is April 18, the least-squared estimate of the local minimum converges after May 18 and is tmin = 9.11 (95% CI 8.95–9.27) as shown in Table 2, along with previous estimates given every other day. This finding pinpoints the second turning point of the Toronto outbreak at April 27. Hence, April 27 separates the 2 S-shaped curves spanning the respective time periods February 23 to April 26 and April 28 to June 12, the end of the outbreak.
Again, as the data used in this article are given by onset date, which occurred after ≈5 days of incubation (10), April 22 is the actual second turning point that foretold the second wave of infections in Toronto. The index patient for the second phase had onset of respiratory symptoms, fever, and diarrhea on April 19 (11), 3 days before the turning point pinpointed by this procedure. Our result also corroborates the assessment of Health Canada, which pinpointed April 21 as the start of second phase of the outbreak in Toronto (Figure 1 in ).
Starting with the second phase of the outbreak on April 28, we again fit the cumulative case data from April 28 to the Richards model. As the case number on April 28 is 144, we use a transformation of S(t) = Sreal(t)–143, where Sreal(t) is the actual data at time t, so the initial data on April 28 used here is S(0) = 1. We again fit the model to the cumulative data ending on various dates past May 25; the results are given in Table 3 and Figure 4. The estimates start to converge after June 4, in the last 2 rows of Table 3 in bold, yielding an estimate for K of 248.96 (95% CI 246.67–251.25). Once again, the actual case number of 249 for the Toronto area outbreak (and 250 for Canada) is well approximated by our estimate of K. The estimated turning point ti = 26.36 pinpoints May 24, or a turning point for SARS infections 5 days earlier on May 19. This finding further corroborates Health Canada's assertion that, among the 79 cases that resulted from exposure at the hospital where the index patient of the second phase stayed, 78 had exposures that occurred before May 23 (11). Note also that this estimate is obtained by using data that end just 11 days after the turning point on May 24, giving an accurate prediction of the actual cumulative case number (Figure 4).
We show that the first turning point on March 25 could be detected 10 days after it occurred on April 4 (row 2 in Table 1). The second turning point on April 27, indicating that the epidemic escalated again, could be detected 5 days after it occurred by May 2 (last row in Table 1 shows the estimate for ti diverging). And the third turning point on May 24 could be detected 7 days after it occurred on May 31 (row 4 in Table 3).
Our procedure fits the data well (Figure 5), allowing us to study retrospectively the significance of various events occurring at different times. Through this procedure, we can pinpoint retrospectively the 3 key turning points for the spread of disease during the 2-phase outbreak in Toronto area. The first turning point for the spread of SARS occurred on March 20 when the first wave of infections leveled off. April 22 was the second turning point, at which time persons infected by the undetected index patient for the second wave began to experience symptoms. Our findings also concur with the World Health Organization action that lifted a travel advisory issued on April 22 that limited travel to Toronto. In retrospect, the Toronto outbreak would have ended with the first wave, if not for the single undetected case and subsequent infections that occurred before April 22. Furthermore, our results also corroborate the assessment of Health Canada, which pinpointed April 21 as the start of the second phase of the outbreak in Toronto area. The third and final turning point for the infections occurred on May 19, when the spread of disease finally leveled off.
Given incidence by onset date during the outbreak, one can use our procedure to forecast the eventual severity of current phases of the outbreak by estimating the carrying capacity, K. However, accuracy depends on having the incidence data for some time past the inflection point (7) and no new waves of infection in the future. Both points can be aptly illustrated by the Toronto outbreak. By using data from 2/23–4/14, we can predict the 95% CI of cases in the first phase of this outbreak at 137.34–148.22, 10 days before the phase ended. Incidence data 20 days after the inflection point of the first phase (March 25) would have enabled us to project the severity of the epidemic, had there not been a second wave of infection. By performing daily fits with updated case data, one could determine if parameters were converging to reliable values for the current phase of the outbreak. Similarly, for phase 2 of the Toronto outbreak, 11 days after the final inflection point (May 24), the data from April 28 to June 4 give a good estimated 95% CI of the cumulative cases of 246.67–251.25, 8 days before onset of the last case.
These results can also be used to compute the basic reproduction number, R0, for the Toronto outbreak. From Table 1, r = 0.146 for the first phase. To compare with results (9), we also assume the duration of infectiousness T to be 8.4 days, as estimated from the time from onset of symptoms in the index patient to onset of symptoms in a secondary case-patient in Singapore (12) and obtain R0 = exp[rT] = 3.41. The estimated r = 0.136 for Taiwan outbreak in (7) yields R0 = 3.08. Note that, because of the shift in the cumulative number used for the model fit of the second phase, the resulting value for r cannot be used in this simple calculation. A list of basic reproduction numbers for SARS in affected areas computed in literature by using Richards model and T = 8.4 is given in Table 4 for comparison. The larger basic reproduction numbers for Toronto (phase 1) and Taiwan, as compared with Hong Kong and Singapore, may be attributable to the relatively high percentage of nosocomial infections (13,14).
The easily implemented procedure described can be extended to analysis of turning points and severity of multiphase epidemics while ongoing. During an outbreak such as SARS, to which available data were limited and uncertain, a simple model that requires only the most basic and perhaps only easily obtainable data under these circumstances offers our best chance to a practical solution to the understanding, prediction, and timely control of the outbreak. However, one must understand that mathematical models do not provide accurate numerical predictions and can be used to forecast only in fairly gross terms (15). The accuracy of predictions depends heavily also on the assumption that no stochastic events occur in the remaining days that could significantly alter the course of the current phase of an outbreak.
Detecting the occurrence of a second turning point or start of a second phase, as outlined in Step 2 of our procedure, is especially useful as it allows us to recognize early that an epidemic is worsening, in our case on April 30 only 3 days after the turning point on April 27 (Table 1). Though predicated on the availability and accuracy of case onset data, this procedure could be a valuable tool to public health policymakers for responding to future disease outbreaks with multiple turning points.
Dr Hsieh is a professor of applied mathematics at National Chung Hsing University. His primary research interests are focused on mathematical and statistical modeling of infectious diseases epidemiology.
Mr Cheng received his master's degree in June of 2005 from National Chung Hsing University (Department of Applied Mathematics). His research interests are in the area of mathematic epidemiology.
We thank John Glasser for constructive comments and suggestions.
Y.H.H. is supported by SARS research grant (NSC 93-2751-B005-001-Y) from the National Science Council of Taiwan and thanks MITACS (Canada) for their generous financial support to attend the MITACS SARS meetings at Banff, Canada.
- May RM. Uses and abuses of mathematics in biology. Science. 2004;303:790–3.
- Ranta J, Makela PH, Arjas E. Predicting meningococcal disease outbreaks in structured populations. Stat Med. 2004;23:927–45.
- Bauch CT, Earn DJ. Vaccination and the theory of games. Proc Natl Acad Sci U S A. 2004;101:13391–4.
- Porco TC, Small PM, Blower SM. Amplification dynamics: predicting the effect of HIV on tuberculosis outbreaks. J Acquir Immune Defic Syndr. 2001;28:437–44.
- Klempner MS, Shapiro DS. Crossing the species barrier—one small step to man, one giant leap to mankind. N Engl J Med. 2004;350:1171–2.
- Razum O, Becher H, Kapaun A, Junghanss T. SARS, lay epidemiology, and fear. Lancet. 2003;361:1739–40.
- Hsieh YH, Lee JY, Chang HL. SARS epidemiology modeling. Emerg Infect Dis. 2004;10:1165–7.
- Richards FJ. A flexible growth function for empirical use. J Exp Bot. 1959;10:290–300.
- Zhou G, Yan G. Severe acute respiratory syndrome epidemic in Asia. Emerg Infect Dis. 2003;9:1608–10.
- World Health Organization. Consensus document on the epidemiology of severe acute respiratory syndrome (SARS). 17 Oct 2003. Geneva: The Organization; 2003. Available at http://www.who.int/csr/sars/en/WHOconsensus.pdf
- Wallington T, Berger L, Henry B, Shahin R, Yaffe B, Mederski B, Update: Severe acute respiratory syndrome—Toronto, 2003. Can Commun Dis Rep. 2003;29:113–7 Available at http://www.phac-aspc.gc.ca/publicat/ccdr-rmtc/03vol29/dr2913ea.html.
- Lipsitch M, Cohen T, Cooper B, Robins JM, Ma S, James L, Transmission dynamics and control of severe acute respiratory syndrome. Science. 2003;300:1966–70.
- Immunization and Respiratory Infections Division, Centre for Infectious Disease Prevention and Control. The war against an unknown pathogen: rising to the SARS challenge. Can Commun Dis Rep. 2003;29:77–82 Available at http://www.phac-aspc.gc.ca/publicat/ccdr-rmtc/03vol29/dr2909ea.html.
- Ho MS, Su IJ. Preparing to prevent severe acute respiratory syndrome and other respiratory infections. Lancet Infect Dis. 2004;4:684–9.
- McKenzie FE. Smallpox models as policy tools. Emerg Infect Dis. 2004;10:2044–7.
Suggested citation for this article: Hsieh Y-H, Cheng Y-S. Real-time forecast of multiphase epidemic outbreak. Emerg Infect Dis [serial on the Internet]. 2006 Jan [date cited]. http://dx.doi.org/10.3201/eid1201.050396