Skip directly to search Skip directly to A to Z list Skip directly to page options Skip directly to site content

Volume 7, Number 3—June 2001

Synopsis

Spoligotype Database of Mycobacterium tuberculosis: Biogeographic Distribution of Shared Types and Epidemiologic and Phylogenetic Perspectives

Christophe Sola*Comments to Author , Ingrid Filliol*, Maria Cristina Gutierrez†, Igor Mokrousov*, Véronique Vincent†, and Nalin Rastogi*Comments to Author 
Author affiliations: *Institut Pasteur de Guadeloupe, Pointe à Pitre, Guadeloupe; †Centre National de Référence des Mycobactéries, Institut Pasteur, Paris, France

Main Article

Table 3

Total number of matches found in matching analysis of the shared types (n=69) found at two geographic locations*

Americas Asia*
Matches* Africa N. America Central America and Caribbean S. America Europe Middle East Far East
Africa 3a 3b 2c 1d 5e 0 0
North America NA+ 6f 4g 8h 0 1
Central America 2i 4j 5k 0 0
South America 3l 4m 0 0
Europe 17n 1 0
Asia (Middle East 0 0
Asia (Far East) 0

*Indices a to n refer to the designation of the matching types. For full description of the matching shared type, see Figure 1. Spoligotyping data for isolates from Asia are scarce; hence, only two matches involving the Middle East and Far East were found (shared types 127 and 249, respectively). †NA, not applicable (matches were searched only for shared types existing between two countries or regions; as no data were available for Canada, comparison of isolates within North America was not feasible).

Main Article

1For this purpose, the independent sampling sizes for Europe and the USA were taken as n1 and n2, the number of individuals within a given shared-type "x" was k1 and k2, and in this case, the representativeness of the two samples was p1=k1/n1 and P2=k2/n2, respectively. To assess if the divergence observed between p1 and p2 was due to sampling bias or the existence of two distinct populations, the percentage of individuals (p0) harboring shared-type "x" in the population studied was estimated by the equation p0= k1+k2/n1+n2=n1p1+n2p2/n1+n2. The distribution of the percentage of shared-type "x" in the sample sizes n1 and n2 follows a normal distribution with a mean p0 and a standard deviation of formula imageand formula imagerespectively, and the difference d=p1-p2 follows a normal distribution of mean p0-p0=0 and of variance σd2p12p22 = p0q0/n1+p0q0/n2 or σd2=p0q0 (1/n1+1/n2). The two samples being independent, the two variances were additive; the standard deviation σd= was calculated, and the homogeneity of the samples tested was assessed using the quotient d/σd=p1-p2/formula image. If the absolute value of the quotient d/σd<2, the two samples were considered to belong to a same population (CI 95%) and the variation observed in the distribution of isolates for given shared types could be due to a sampling bias. Inversely, if d/σd>2, then the differences observed in the distribution of isolates for given shared types were statistically significant and not due to potential sample bias.

TOP