Methods Specimens and species The MLST database [13] contained 378 sequences from clinical specimens or bacterial isolates (July 2009), of which 199 were from Sweden and the remaining 179 from Europe, Africa, North America and Australia. The strains included in the analysis are listed in additional file 1: appendix 1. The last 121 bp of the hctB gene are excluded from the MLST analysis. Consequently, additional sequencing was performed as previously described Fer-1 datasheet [11] but with the reverse primer hctB_R1 (5′-ATTTCGACTCAGCCAATAAATACA-3′). Sequences covering the hctB gene were aligned with ClustalW with default values in the BioEdit 7.0 sequence
alignment editor (Ibis Therapeutics, Carlsbad, CA). The repetitive elements were aligned based on homology according to neighbour-joining TPCA-1 in vivo phylogenetic analysis of the different types of repeat element. Obtained sequence variants were submitted to GenBank and the accession numbers are listed in additional file 2: appendix 2. Accession numbers for Hc2 in other Chlamydiales and Hc2-like proteins in other genera are listed in additional file 3: appendix 3. Sequence analysis Repetitive amino acid elements were found with Dottup plots using a word size of 20 and Pepinfo
was used to create plots that show the charge distribution. Both Dottup and Pepinfo are part of EMBOSS (The European Biology Open Software Suite, EMBnet, http://www.emboss.org). Phylogenetic analyses Firstly, the phylogenetic relationship of the different types of repetitive element was estimated with a neighbour-joining analysis [26] based on the absolute number of base differences between the repeat element sequences (since this number is small, correction for multiple substitutions is not necessary). The resulting tree (Figure 3C) was used as the guide tree for manually adjusting the alignment of the repetitive elements (Figure 3B) in the alignment of the MLST sequences that include hctB. Secondly, the phylogeny of the 41 variants
of MLST targets was inferred using a Bayesian approach [e.g.', Edoxaban [27]]. The analysis was done with MrBayes 3.1.2, running under MPI [28]. A Bayesian analysis needs an explicit substitution model, and this was selected based on a hierarchical likelihood ratio test (ηLRT) approach [29] using Modeltest [30] together with PAUP* 4.0b10 [31]. MrBayes uses a Metropolis-coupled Markov chain Monte Carlo method to compute the posterior probabilities for the clades. This algorithm has no defined stop condition, but runs for a number of generations and must be monitored for convergence, and thus completion, of the algorithm. The convergence was assessed by monitoring the continuous-valued parameters using the software Tracer 1.4 [32], resulting in the Bayesian analysis being run for a total of 107 generations; the first 2.