A Systematic Approach for Discovering Novel, Clinically Relevant Bacteria
Figure 2. Sequence quality and number of ambiguous bases for 673 unidentified bacterial isolates. The median sequence length was 480 bases, with 84% of sequences in the range of 461 to 500 bases (A). The median phred sequence quality score was 45 (B). Most sequences had no ambiguous positions (n = 416, 61.8%). Up to 18 ambiguous positions were seen in isolates with multiple, nonidentical copies of the 16S rRNA gene (C). The x-axes indicate relative frequency. Numbers above columns represent isolate counts.