Rapid assignment of bacterial pathogens into predefined populations is an important first step for epidemiological tracking. For clonal species, a single allele can theoretically define a population. For non-clonal species such as Burkholderia pseudomallei, however, shared allelic states between distantly related isolates make it more difficult to identify population defining characteristics. Two distinct B. pseudomallei populations have been previously identified using multilocus sequence typing (MLST). These populations correlate with the major foci of endemicity (Australia and Southeast Asia). Here, we use multiple Bayesian approaches to evaluate the compositional robustness of these populations, and provide assignment results for MLST sequence types (STs). Our goal was to provide a reference for assigning STs to an established population without the need for further computational analyses. We also provide allele frequency results for each population to enable estimation of population assignment even when novel STs are discovered. The ability for humans and potentially contaminated goods to move rapidly across the globe complicates the task of identifying the source of an infection or outbreak. Population genetic dynamics of B. pseudomallei are particularly complicated relative to other bacterial pathogens, but the work here provides the ability for broad scale population assignment. As there is currently no independent empirical measure of successful population assignment, we provide comprehensive analytical details of our comparisons to enable the reader to evaluate the robustness of population designations and assignments as they pertain to individual research questions. Finer scale subdivision and verification of current population compositions will likely be possible with genotyping data that more comprehensively samples the genome. The approach used here may be valuable for other non-clonal pathogens that lack simple group-defining genetic characteristics and provides a rapid reference for epidemiologists wishing to track the origin of infection without the need to compile population data and learn population assignment algorithms.
References
[1]
Currie BJ, Dance DAB, Cheng AC (2008) The global distribution of Burkholderia pseudomallei and melioidosis: an update. Trans R Soc Trop Med Hyg 102: S1–S4. doi: 10.1016/S0035-9203(08)70002-6
[2]
Aldhous P (2005) Tropical medicine: melioidosis? Never heard of it. Nature 434: 692–693. doi: 10.1038/434692a
[3]
Tuanyok A, Auerbach RK, Brettin TS, Bruce DC, Munk AC, et al. (2007) A horizontal gene transfer event defines two distinct groups within Burkholderia pseudomallei that have dissimilar geographic distributions. J Bacteriol 189: 9044–9049. doi: 10.1128/JB.01264-07
[4]
White NJ (2003) Melioidosis. Lancet 361: 1715–1722. doi: 10.1016/S0140-6736(03)13374-0
Maiden MC (2006) Multilocus sequence typing of bacteria. Annu Rev Microbiol 60: 561–588. doi: 10.1146/annurev.micro.59.030804.121325
[8]
Godoy D, Randle G, Simpson AJ, Aanensen DM, Pitt TL, et al. (2003) Multilocus sequence typing and evolutionary relationships among the causative agents of melioidosis and glanders, Burkholderia pseudomallei and Burkholderia mallei. J Clin Microbiol 41: 2068–2079. doi: 10.1128/JCM.41.5.2068-2079.2003
[9]
Vesaratchavest M, Tumapa S, Day NP, Wuthiekanun V, Chierakul W, et al. (2006) Nonrandom distribution of Burkholderia pseudomallei clones in relation to geographical location and virulence. J Clin Microbiol 44: 2553–2557. doi: 10.1128/JCM.00629-06
[10]
Currie BJ, Thomas AD, Godoy D, Dance DA, Cheng AC, et al. (2007) Australian and Thai isolates of Burkholderia pseudomallei are distinct by multilocus sequence typing: revision of a case of mistaken identity. J Clin Microbiol 45: 3828–3829. doi: 10.1128/JCM.01590-07
[11]
Pearson T, Giffard P, Beckstrom-Sternberg S, Auerbach R, Hornstra H, et al. (2009) Phylogeographic reconstruction of a bacterial species with high levels of lateral gene transfer. BMC Biol 7: 78. doi: 10.1186/1741-7007-7-78
[12]
Turner SL, Young JP (2000) The glutamine synthetases of rhizobia: phylogenetics and evolutionary implications. Mol Biol Evol 17: 309–319. doi: 10.1093/oxfordjournals.molbev.a026311
[13]
Pritchard JK, Stephens M, Donnelly P (2000) Inference of population structure using multilocus genotype data. Genetics 155: 945–959.
[14]
Falush D, Wirth T, Linz B, Pritchard JK, Stephens M, et al. (2003) Traces of human migrations in Helicobacter pylori populations. Science 299: 1582–1585. doi: 10.1126/science.1080857
[15]
Falush D, Stephens M, Pritchard JK (2003) Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. Genetics 164: 1567–1587.
[16]
Duan J, Mu J, Thera MA, Joy D, Kosakovsky Pond SL, et al. (2008) Population structure of the genes encoding the polymorphic Plasmodium falciparum apical membrane antigen 1: implications for vaccine design. Proc Natl Acad Sci U S A 105: 7857–7862. doi: 10.1073/pnas.0802328105
[17]
Evanno G, Regnaut S, Goudet J (2005) Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol Ecol 14: 2611–2620. doi: 10.1111/j.1365-294X.2005.02553.x
[18]
Latch E, Dharmarajan G, Glaubitz J, Rhodes OJ (2006) Relative performance of Bayesian clustering software for inferring population substructure and individual assignment at low levels of population differentiation. Conservation Genetics 7: 295–302. doi: 10.1007/s10592-005-9098-1
[19]
Corander J, Gyllenberg M, Koski T (2007) Random partition models and exchangeability for Bayesian identification of population structure. Bull Math Biol 69: 797–815. doi: 10.1007/s11538-006-9161-1
[20]
Corander J, Marttinen P (2006) Bayesian identification of admixture events using multilocus molecular markers. Mol Ecol 15: 2833–2843. doi: 10.1111/j.1365-294X.2006.02994.x
[21]
Corander J, Marttinen P, Siren J, Tang J (2008) Enhanced Bayesian modelling in BAPS software for learning genetic structures of populations. BMC Bioinformatics 9: 539. doi: 10.1186/1471-2105-9-539
[22]
Corrander J, Waldmann P, Marttinen P, Sillanpaa MJ (2004) BAPS 2: enhanced possibilities for the analysis of genetic population structure. Bioinformatics 20: doi: 10.1093/bioinformatics/bth250
[23]
Peakall RaSPE (2006) GENALEX 6: genetic analysis in Excel. Population genetic software for teaching and research. Molecular Ecology Notes 288–295. doi: 10.1111/j.1471-8286.2005.01155.x
[24]
Smouse PE, Peakall R (1999) Spatial autocorrelation analysis of individual multiallele and multilocus genetic structure. Heredity 82(Pt 5): 561–573. doi: 10.1038/sj.hdy.6885180
[25]
Price EP, Thiruvenkataswamy V, Mickan L, Unicomb L, Rios RE, et al. (2006) Genotyping of Campylobacter jejuni using seven single-nucleotide polymorphisms in combination with flaA short variable region sequencing. J Med Microbiol 55: 1061–1070. doi: 10.1099/jmm.0.46460-0
[26]
Robertson GA, Thiruvenkataswamy V, Shilling H, Price EP, Huygens F, et al. (2004) Identification and interrogation of highly informative single nucleotide polymorphism sets defined by bacterial multilocus sequence typing databases. J Med Microbiol 53: 35–45. doi: 10.1099/jmm.0.05365-0
[27]
Stephens AJ, Huygens F, Inman-Bamber J, Price EP, Nimmo GR, et al. (2006) Methicillin-resistant Staphylococcus aureus genotyping using a small set of polymorphisms. J Med Microbiol 55: 43–51. doi: 10.1099/jmm.0.46157-0
[28]
Price EP, Inman-Bamber J, Thiruvenkataswamy V, Huygens F, Giffard PM (2007) Computer-aided identification of polymorphism sets diagnostic for groups of bacterial and viral genetic variants. BMC Bioinformatics 8: 278. doi: 10.1186/1471-2105-8-278
[29]
Hanage WP, Fraser C, Spratt BG (2006) The impact of homologous recombination on the generation of diversity in bacteria. J Theor Biol 239: 210–219. doi: 10.1016/j.jtbi.2005.08.035