Skip directly to site content Skip directly to page options Skip directly to A-Z link Skip directly to A-Z link Skip directly to A-Z link
Volume 16, Number 10—October 2010
Letter

Internet Search Limitations and Pandemic Influenza, Singapore

Cite This Article

To the Editor: In the past few years, several publications have reported that Internet search queries may usefully supplement other, traditional surveillance programs for infectious diseases (13). The philanthropic arm of Google offers Flu Trends, a site that provides up-to-date estimates of influenza activity in 20 countries of the Pacific Rim and Europe (4) by using data mining techniques to find good predictors of historic influenza indicators (1).

This service has yet to be extended to other countries and other diseases because access to official surveillance data is required, among other reasons. However, another Google service, Insights for Search, enables users to find and download time-series data of relative counts of arbitrary searches for a large number of countries (5). Pelat et al. have shown that a few, well-chosen searches on Google Insights provide data that closely correlate with French surveillance data for seasonal influenza, chickenpox, and gastroenteritis (3). Although Internet searches appear to be a promising tool for public health surveillance, our experience from using Google Insights in the context of pandemic (H1N1) 2009 in Singapore suggests it has important limitations.

Figure

Thumbnail of Number of Google searches conducted for “influenza” (black lines) and “H1N1” (gray lines) compared with number of acute respiratory infections (ARI, gray bars) reported in government clinics, Singapore, 2009. During the outbreak of pandemic (H1N1) 2009, Google search activity surged in response to newsworthy events (the World Health Organization [WHO] alert, first importation and unlinked local case, release of vaccine) but dropped substantially by the time most infections occurred

Figure. Number of Google searches conducted for “influenza” (black lines) and “H1N1” (gray lines) compared with number of acute respiratory infections (ARI, gray bars) reported in government clinics, Singapore, 2009. During the...

In Singapore, the recent pandemic caused an outbreak that peaked at the start of August 2009; the first confirmed importation was at the end of May and first confirmed unlinked case was at the end of June. However, the number Google searches for “influenza,” “H1N1,” “swine flu,” and similar terms (in English and Chinese), as well as symptoms associated with the disease, peaked much earlier than did the number of cases (Figure). The number of searches surged after newsworthy events but was low during the epidemic itself and had declined to about 20% of maximum search volume by the time of the actual peak, as shown by traditional surveillance. Furthermore, no discernible local maxima were observed that corresponded to the peak in case data. In contrast, alternative traditional measures of influenza incidence—prevalence of the novel strain among viral samples and general practice surveillance (6,7)—provide a consistent description of the outbreak.

This finding echoes a major point raised by Carneiro and Mylonakis (2), namely, that without adjusting for spikes driven by disease publicity rather than the disease itself, Internet searches may lose much of their value in supplementing traditional surveillance measures. Our experience is that using Google Insights to survey a disease may not work well for diseases with considerable media exposure, in particular, emerging diseases such as pandemic (H1N1) 2009 or severe acute respiratory syndrome. Such outbreaks may require the more sophisticated approach used by Flu Trends, should it be extended to other diseases and more corners of the globe. We agree with Pelat et al. (3) that Google Insights may work well for less-publicized infectious diseases. The dividing line between well-publicized and unpublicized diseases may, however, remain ambiguous. Thus, to ensure that web search data reflect disease incidence requires validation against traditional surveillance, although in that situation, the availability of corroborating traditional methods of surveillance limits the value of web-query data.

Top

Acknowledgment

A.R.C. received research funding from the National University of Singapore.

Top

Alex R. CookComments to Author , Mark I.C. Chen, and Raymond Tzer Pin Lin

Author affiliations: Author affiliations: National University of Singapore, Singapore (A.R. Cook, M.I.C. Chen, R.T.P. Lin); Tan Tock Seng Hospital, Singapore (M.I.C. Chen); Duke-NUS Graduate Medical School, Singapore (M.I.C. Chen); Ministry of Health, Singapore (R.T.P. Lin)

Top

References

  1. Ginsberg  J, Mohebbi  MH, Patel  RS, Brammer  L, Smolinski  MS, Brilliant  L. Detecting influenza epidemics using search engine query data. Nature. 2009;457:10124. DOIPubMed
  2. Carneiro  HA, Mylonakis  E. Google Trends: a web-based tool for real-time surveillance of disease outbreaks. Clin Infect Dis. 2009;49:155764. DOIPubMed
  3. Pelat  C, Turbelin  C, Bar-Hen  A, Flahaut  A, Valleron  A-J. More diseases tracked by using Google Trends. Emerg Infect Dis. 2009;15:13278. DOIPubMed
  4. Flu trends [cited 2010 May 25]. http://www.google.org/flutrends/
  5. Insights for search beta [cited 2010 May 25]. http://www.google.com/insights/search/#
  6. Cutter  JL, Ang  LW, Lai  FY, Subramony  H, Ma  S, James  L. Outbreak of pandemic influenza A (H1N1–2009) in Singapore, May to September 2009. Ann Acad Med Singapore. 2010;39:27382.PubMed
  7. Ong  JB, Chen  MI-C, Cook  AR, Lee  HC, Lee  VJ, Lin  RT, Real-time epidemic monitoring and forecasting of H1N1–2009 using influenza-like illness from general practice and family doctor clinics in Singapore. PLoS ONE. 2010;5:e10036. DOIPubMed

Top

Figure

Top

Cite This Article

DOI: 10.3201/eid1610.100840

Related Links

Top

Table of Contents – Volume 16, Number 10—October 2010

Comments

Please use the form below to submit correspondence to the authors or contact them at the following address:

Alex R. Cook, Department of Statistics and Applied Probability, National University of Singapore, 6 Science Dr 2, Singapore 117546

Send To

character(s) remaining.

Comment submitted successfully, thank you for your feedback.

Top

Page created: September 08, 2011
Page updated: September 08, 2011
Page reviewed: September 08, 2011
The conclusions, findings, and opinions expressed by authors contributing to this journal do not necessarily reflect the official position of the U.S. Department of Health and Human Services, the Public Health Service, the Centers for Disease Control and Prevention, or the authors' affiliated institutions. Use of trade names is for identification only and does not imply endorsement by any of the groups named above.
file_external