Skip directly to site content Skip directly to page options Skip directly to A-Z link Skip directly to A-Z link Skip directly to A-Z link

Disclaimer: Early release articles are not considered as final versions. Any changes will be reflected in the online version in the month the article is officially released.

Volume 27, Number 5—May 2021

Detecting Rapid Spread of SARS-CoV-2 Variants, France, January 26–February 16, 2021

Stéphanie Haim-Boukobza, Bénédicte Roquebert, Sabine Trombert-Paolantoni, Emmanuel Lecorche, Laura Verdurme, Vincent Foulongne, Christian Selinger, Yannis Michalakis, Mircea T. Sofonea1, and Samuel Alizon1Comments to Author 
Author affiliations: Cerba Laboratory, Saint Ouen L’Aumône, France (S. Haim-Boukobza, B. Roquebert, S. Trombert-Paolantoni, E. Lecorche, L. Verdurme); CHU de Montpellier, Montpellier, France (V. Foulongne); IRD, Montpellier (C. Selinger); CNRS, Montpellier (Y. Michalakis, S. Alizon); Université de Montpellier, Montpellier (M.T. Sofonea)

Suggested citation for this article


Variants of severe acute respiratory syndrome coronavirus 2 raise concerns regarding the control of coronavirus disease epidemics. We analyzed 40,000 specific reverse transcription PCR tests performed on positive samples during January 26–February 16, 2021 in France. We found high transmission advantage of variants and more advanced spread than anticipated.

Since the end of 2020, at least 3 strains, or “variants,” of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) bearing a high number of mutations have been associated with rapid epidemic spread in the United Kingdom (lineage B.1.1.7) (1), South Africa (lineage B.1.351) (2), and Brazil (lineage P.1) (3). Because of their increased transmissibility (4; E. Volz et al., unpub. data,; A.S. Walker et al., unpub. data, and potential ability to evade host immunity (5; S. Cele et al., unpub. data,, monitoring these variants is crucial in the context of mass vaccination.

In France, beginning February 5, 2021, every sample that tested SARS-CoV-2–positive by reverse transcription PCR (RT-PCR) underwent an additional variant-specific RT-PCR with probes targeting the Δ69–70 deletion and the N501Y mutation, both in the spike glycoprotein. Both targets are present in lineage B.1.1.7. For lineages B.1.351 and P.1, only the N501Y mutation is present. If only the Δ69–70 deletion is detected, the infection might be caused by another variant or by a wild-type strain with a deletion. Finally, if neither target is detected, the infection is considered to be caused by a wild-type strain. These tests are cheaper and easier to implement than full-genome sequencing, which enables their rapid deployment on a wide scale. We report the results of this testing program.

The Study

RT-PCR testing for SARS-CoV-2 strains was conducted using 2 assays, VirSNiP SARS-CoV-2 Spike del+501 (TIB Molbiol, and ID SARS-CoV-2/UK/SA Variant Triplex (ID Solutions,, which target the Δ69–70 deletion and N501Y mutation. We performed tests on 42,229 positive samples collected during January 26–February 16, 2021, from 40,777 persons from 12 regions in France. Most samples came from the general population, and 3,323 (7.9%) samples came from hospitals. For the 1,397 patients for whom multiple tests were performed, only the first test was considered. We only included data from persons 5–80 years of age to minimize the weight of preschool children and persons living in aged-care facilities in our analysis. Finally, we removed persons whose age or testing region was unknown. This study was approved by the Institutional Review Board of the CHU of Montpellier and is registered at (identifier NCT04738331).

Overall, we analyzed 35,208 SARS-CoV-2–positive samples from the same number of persons (Appendix 1 Table 1). Results of 6,702 (19%) variant-specific RT-PCR tests were uninterpretable, mainly because of an insufficient amplification of the control, which targets the SARS-CoV-2 N gene. These results were treated as missing in the analyses. Given that most of the variants were B.1.1.7 (Appendix 2 Figure 1), we grouped all samples bearing the N501Y mutation into a broader class of variant-positive.

We used a generalized linear model (GLM) to analyze the binary strain variable (with values wild-type or variant). The covariates were the patient’s age, the RT-PCR kit used for variant detection, the sampling date, and the geographic region from which the sample originated (Appendix 2). By using a type-II analysis of variance, we found that all covariates except the type of RT-PCR kit to be significant (Table 1). In particular, the proportion of variants increased with date and decreased with age (Appendix 2 Figure 2) and hospital origin.

Figure 1

Estimated variants frequency kinetics in study of rapid spread of severe acute respiratory syndrome coronavirus 2 spread, France, January 26–February 16, 2021. Triangles indicate the general linear model–fitted values, line indicates output of the logistic growth model estimation, and shading indicates 95% CIs. Overall estimated transmission advantage of the variants (with respect to the wild-type reproduction number) is 50 (95% CI 38%–64%) (Appendix 2). VOC, variant of concern.

Figure 1. Estimated variants frequency kinetics in study of rapid spread of severe acute respiratory syndrome coronavirus 2 spread, France, January 26–February 16, 2021. Triangles indicate the general linear model–fitted values, line...

To investigate the temporal trends, we fitted a logistic growth model to the fitted values of an analogous GLM only on the data from general population samples (Appendix 2). Assuming that variations in frequencies are driven by transmission advantages, we found that variants have a 50% (95% CI 37%–64%) transmission advantage over wild-type strains (Figure 1).

Figure 2

Regions of France in study of rapid spread of SARS-CoV-2 variants, January 26–February 16, 2021. For each region, we show the log10 of the number of tests analyzed (A), the estimated total variant frequency by February 16, 2021 (B), and the estimated percentage transmission advantage (of the variant strains relative to the wild-type strain (C). Additional details in Appendix 2 Figure 5.

Figure 2. Regions of France in study of rapid spread of SARS-CoV-2 variants, January 26–February 16, 2021. For each region, we show the log10of the number of tests analyzed (A),...

The analysis of variance already showed that variant frequency varied across regions (Table 1). We performed the logistic growth fit at the local level for regions for which adequate data was available (Figure 2). The growth advantage of the variant was more pronounced in some regions. In Ile-de-France, more than half of infections already appeared to be caused by the variants by February 16, whereas in other regions, such as Burgundy, this proportion would not be reached until March 2021. However, some regions were less well represented in this analysis, which could affect local estimates.

Finally, we investigated the correlation between the increase in variant frequency among positive tests in a region and the temporal reproduction number, denoted Rt, in that same region. Rt was estimated from coronavirus disease intensive care unit admission data by using the EpiEstim method (6) with a serial interval from Nishiura et al. (7), as described in Reyné et al. (unpub. data, (Appendix 2). We used the Spearman rank correlation test and found a positive but nonsignificant trend (ρ = 0.50; p = 0.09) (Appendix 2 Figure 3).


We used 2 variant-specific RT-PCR tests to detect the fraction of infections caused by SARS-CoV-2 lineages B.1.1.7, B.351, and P.1 in regions in France during January 25–February 16, 2021. We did not find any significant difference between the 2 specific RT-PCR kits used, suggesting that similar data collected in France could be pooled. Our results have several practical implications.

In general, we found that many infections screened were caused by variants, especially B.1.1.7, and the trend increased over time. On the basis of our estimates, by February 16, 2021, more than half of SARS-CoV-2 infections in France could have been caused by variants, although with pronounced spatial heterogeneity. In a conservative scenario, where all uninterpretable tests were assumed to be caused by the wild type, most infections would have been caused by variants by the end of week 7 of 2021, and the estimated variants transmission advantage was 36% (95% CI 26%–48%) (Appendix 2 Figure 4).

Variant-positive samples originated from significantly younger patients, which is consistent with an earlier report (E. Volz et al., unpub. data) but contrasts with Davies et al. (4). Our analysis did not enable us to discriminate between epidemiologic effects (e.g., if variants’ transmission chains were seeded in different populations than the wild types), sampling biases, or biologic effects. Additional data from RT-PCR amplification cycles could provide useful insights. Finally, earlier reports have found variant proportion to be associated with higher basic reproduction number (4; E. Volz et al., unpub. data). We found such a trend among regions in France, but it is not statistically significant.

A limitation of this study is that, in spite of its intensity, the sampling was performed retrospectively. This could generate biases if, for instance, transmission chains associated with variants were increasingly sampled. However, we found that samples that originated in hospitals were associated with a lower variant detection. Since testing in the general population is usually performed a week after infection and hospital admissions occur ≈2 weeks after infection (M.T. Sofonea et al., unpub. data,, we expect hospital data would reflect an older state of the epidemic than screening data. RT-PCR does not have the resolution of full-genome sequencing, and other variants of concern could be underestimated or missed with this approach. However, the time scale considered and the relatively slow evolutionary rate of SARS-CoV-2 make this approach appropriate to monitor variant spread. Furthermore, next-generation sequencing performed on 48 samples showed a strong consistency with the specific RT-PCR tests (Cohen κ of 1 for the TIB Molbiol test and 0.87 or 0.88 for the ID Solutions test depending on the variant; data not shown).

These results illustrate that variant-specific RT-PCRs are an option for SARS-CoV-2 epidemic monitoring because of their affordability and rapid deployment. They also demonstrate that SARS-CoV-2 variants spread in France was faster than anticipated (L.D. Domenico et al., unpub. data,, which stresses the importance of swift public health responses.

Dr. Haim-Boukobza is a virologist and head of the Department of Infectious Disease at Cerba Laboratory, Saint Ouen L’Aumône, France. Her primary research interests are viral hepatitis, human papillomaviruses, and HIV, with additional interests in gut and vaginal microbiomes.





  1. Rambaut  A, Loman  N, Pybus  O, Barclay  W, Barrett  J, Carabelli  A, et al. Preliminary genomic characterisation of an emergent SARS-CoV-2 lineage in the UK defined by a novel set of spike mutations. Virological. 2021 [cited 2020 Dec 9].
  2. Tegally  H, Wilkinson  E, Giovanetti  M, Iranzadeh  A, Fonseca  V, Giandhari  J, et al. Emergence of a SARS-CoV-2 variant of concern with mutations in spike glycoprotein. Nature. 2021; Epub ahead of print. DOIPubMed
  3. Faria  NR, Claro  IM, Candido  D, Moyses Franco  L, Andrade  PS, Coletti  TM, et al. Genomic characterisation of an emergent SARS-CoV-2 lineage in Manaus: preliminary findings. Virological. 2021 [cited 2021 Jan 12].
  4. Davies  NG, Abbott  S, Barnard  RC, Jarvis  CI, Kucharski  AJ, Munday  J, et al. Estimated transmissibility and severity of novel SARS-CoV-2 variant of concern 202012/01 in England. Science. 2021; [Epub ahead of print]. DOI
  5. Thomson  EC, Rosen  LE, Shepherd  JG, Spreafico  R, da Silva Filipe  A, Wojcechowskyj  JA, et al.; ISARIC4C Investigators; COVID-19 Genomics UK (COG-UK) Consortium. Circulating SARS-CoV-2 spike N439K variants maintain fitness while evading antibody-mediated immunity. Cell. 2021;184:11711187.e20. DOIPubMed
  6. Cori  A, Ferguson  NM, Fraser  C, Cauchemez  S. A new framework and software to estimate time-varying reproduction numbers during epidemics. Am J Epidemiol. 2013;178:150512. DOIPubMed
  7. Nishiura  H, Linton  NM, Akhmetzhanov  AR. Serial interval of novel coronavirus (COVID-19) infections. Int J Infect Dis. 2020;93:2846. DOIPubMed




Suggested citation for this article: Haim-Boukobza S, Roquebert B, Trombert-Paolantoni S, Lecorche E, Verdurme L, Foulongne V, et al. Detecting rapid spread of SARS-CoV-2 variants, France, January 26–February 16, 2021. Emerg Infect Dis. 2021 May [date cited].

DOI: 10.3201/eid2705.210397

Original Publication Date: March 26, 2021

1These authors contributed equally to this article.

Table of Contents – Volume 27, Number 5—May 2021


Please use the form below to submit correspondence to the authors or contact them at the following address:

Samuel Alizon, MIVEGEC, CNRS, IRD, Université de Montpellier, 911 av. Agropolis, 34394 Montpellier CEDEX 5, France

Send To

10000 character(s) remaining.


Page created: March 22, 2021
Page updated: March 26, 2021
Page reviewed: March 26, 2021
The conclusions, findings, and opinions expressed by authors contributing to this journal do not necessarily reflect the official position of the U.S. Department of Health and Human Services, the Public Health Service, the Centers for Disease Control and Prevention, or the authors' affiliated institutions. Use of trade names is for identification only and does not imply endorsement by any of the groups named above.