Volume 16, Number 10—October 2010
Internet Search Limitations and Pandemic Influenza, Singapore
To the Editor: In the past few years, several publications have reported that Internet search queries may usefully supplement other, traditional surveillance programs for infectious diseases (1–3). The philanthropic arm of Google offers Flu Trends, a site that provides up-to-date estimates of influenza activity in 20 countries of the Pacific Rim and Europe (4) by using data mining techniques to find good predictors of historic influenza indicators (1).
This service has yet to be extended to other countries and other diseases because access to official surveillance data is required, among other reasons. However, another Google service, Insights for Search, enables users to find and download time-series data of relative counts of arbitrary searches for a large number of countries (5). Pelat et al. have shown that a few, well-chosen searches on Google Insights provide data that closely correlate with French surveillance data for seasonal influenza, chickenpox, and gastroenteritis (3). Although Internet searches appear to be a promising tool for public health surveillance, our experience from using Google Insights in the context of pandemic (H1N1) 2009 in Singapore suggests it has important limitations.
In Singapore, the recent pandemic caused an outbreak that peaked at the start of August 2009; the first confirmed importation was at the end of May and first confirmed unlinked case was at the end of June. However, the number Google searches for “influenza,” “H1N1,” “swine flu,” and similar terms (in English and Chinese), as well as symptoms associated with the disease, peaked much earlier than did the number of cases (Figure). The number of searches surged after newsworthy events but was low during the epidemic itself and had declined to about 20% of maximum search volume by the time of the actual peak, as shown by traditional surveillance. Furthermore, no discernible local maxima were observed that corresponded to the peak in case data. In contrast, alternative traditional measures of influenza incidence—prevalence of the novel strain among viral samples and general practice surveillance (6,7)—provide a consistent description of the outbreak.
This finding echoes a major point raised by Carneiro and Mylonakis (2), namely, that without adjusting for spikes driven by disease publicity rather than the disease itself, Internet searches may lose much of their value in supplementing traditional surveillance measures. Our experience is that using Google Insights to survey a disease may not work well for diseases with considerable media exposure, in particular, emerging diseases such as pandemic (H1N1) 2009 or severe acute respiratory syndrome. Such outbreaks may require the more sophisticated approach used by Flu Trends, should it be extended to other diseases and more corners of the globe. We agree with Pelat et al. (3) that Google Insights may work well for less-publicized infectious diseases. The dividing line between well-publicized and unpublicized diseases may, however, remain ambiguous. Thus, to ensure that web search data reflect disease incidence requires validation against traditional surveillance, although in that situation, the availability of corroborating traditional methods of surveillance limits the value of web-query data.
A.R.C. received research funding from the National University of Singapore.
- Ginsberg J, Mohebbi MH, Patel RS, Brammer L, Smolinski MS, Brilliant L. Detecting influenza epidemics using search engine query data. Nature. 2009;457:1012–4.
- Carneiro HA, Mylonakis E. Google Trends: a web-based tool for real-time surveillance of disease outbreaks. Clin Infect Dis. 2009;49:1557–64.
- Pelat C, Turbelin C, Bar-Hen A, Flahaut A, Valleron A-J. More diseases tracked by using Google Trends. Emerg Infect Dis. 2009;15:1327–8.
- Flu trends [cited 2010 May 25]. http://www.google.org/flutrends/
- Insights for search beta [cited 2010 May 25]. http://www.google.com/insights/search/#
- Cutter JL, Ang LW, Lai FY, Subramony H, Ma S, James L. Outbreak of pandemic influenza A (H1N1–2009) in Singapore, May to September 2009. Ann Acad Med Singapore. 2010;39:273–82.
- Ong JB, Chen MI-C, Cook AR, Lee HC, Lee VJ, Lin RT, Real-time epidemic monitoring and forecasting of H1N1–2009 using influenza-like illness from general practice and family doctor clinics in Singapore. PLoS ONE. 2010;5:e10036.
- Figure. Number of Google searches conducted for “influenza” (black lines) and “H1N1” (gray lines) compared with number of acute respiratory infections (ARI, gray bars) reported in government clinics, Singapore, 2009. During...
Suggested citation for this article: Cook AR, Chen MI-C, Lin RTP. Internet search limitations and pandemic influenza, Singapore [letter]. Emerg Infect Dis [serial on the Internet]. 2010 Oct [date cited]. http://dx.doi.org/10.3201/eid1610.100840
Please use the form below to submit correspondence to the authors or contact them at the following address:
Alex R. Cook, Department of Statistics and Applied Probability, National University of Singapore, 6 Science Dr 2, Singapore 117546
Comment submitted successfully, thank you for your feedback.
The conclusions, findings, and opinions expressed by authors contributing to this journal do not necessarily reflect the official position of the U.S. Department of Health and Human Services, the Public Health Service, the Centers for Disease Control and Prevention, or the authors' affiliated institutions. Use of trade names is for identification only and does not imply endorsement by any of the groups named above.
- Page created: September 08, 2011
- Page last updated: September 08, 2011
- Page last reviewed: September 08, 2011
- Centers for Disease Control and Prevention,
National Center for Emerging and Zoonotic Infectious Diseases (NCEZID)
Office of the Director (OD)