RESUMO
BACKGROUND: Whole microbiome RNASeq (metatranscriptomics) has emerged as a powerful technology to functionally interrogate microbial communities. A key challenge is how best to process, analyze, and interpret these complex datasets. In a typical application, a single metatranscriptomic dataset may comprise from tens to hundreds of millions of sequence reads. These reads must first be processed and filtered for low quality and potential contaminants, before being annotated with taxonomic and functional labels and subsequently collated to generate global bacterial gene expression profiles. RESULTS: Here, we present MetaPro, a flexible, massively scalable metatranscriptomic data analysis pipeline that is cross-platform compatible through its implementation within a Docker framework. MetaPro starts with raw sequence read input (single-end or paired-end reads) and processes them through a tiered series of filtering, assembly, and annotation steps. In addition to yielding a final list of bacterial genes and their relative expression, MetaPro delivers a taxonomic breakdown based on the consensus of complementary prediction algorithms, together with a focused breakdown of enzymes, readily visualized through the Cytoscape network visualization tool. We benchmark the performance of MetaPro against two current state-of-the-art pipelines and demonstrate improved performance and functionality. CONCLUSIONS: MetaPro represents an effective integrated solution for the processing and analysis of metatranscriptomic datasets. Its modular architecture allows new algorithms to be deployed as they are developed, ensuring its longevity. To aid user uptake of the pipeline, MetaPro, together with an established tutorial that has been developed for educational purposes, is made freely available at https://github.com/ParkinsonLab/MetaPro . The software is freely available under the GNU general public license v3. Video Abstract.
Assuntos
Microbiota , Microbiota/genética , Software , Algoritmos , Bactérias/genética , Genes BacterianosRESUMO
Understanding the metabolic activity of a microbial community, at both the level of the individual microbe and the whole microbiome, provides fundamental biological, biochemical, and clinical insights into the nature of the microbial community and interactions with their hosts in health and disease. Here, we discuss a method to examine the expression of metabolic pathways in microbial communities using data from metatranscriptomic next-generation sequencing data. The methodology described here encompasses enzyme function annotation, differential enzyme expression and pathway enrichment analyses, and visualization of metabolic networks with differential enzyme expression levels.
Assuntos
Sequenciamento de Nucleotídeos em Larga Escala/métodos , Redes e Vias Metabólicas , Microbiota , Transcriptoma , Humanos , MetagenômicaRESUMO
In Tables 9, 10 and 11 of this paper, a few minor errors were noticed in the information for the type strains of the following new name combinations proposed in our paper.
RESUMO
The genus Mycoplasma, including species earlier classified in the genera Eperythrozoon and Haemobartonella, contains ~ 120 species and constitutes an extensively polyphyletic assemblage of bacteria within the phylum Tenericutes. Due to their small genome sizes and lack of unique characteristics, the relationships among the mycoplasmas/Tenericutes are not reliably discerned. Using genome sequences for 140 Tenericutes, their evolutionary relationships were examined using multiple independent approaches. Phylogenomic trees were constructed for 63 conserved proteins, 45 ribosomal proteins, three main subunits of RNA polymerase and 16S rRNA gene sequences. In all of these trees, Tenericutes species reliably grouped into four main clades designated as the "Acholeplasma", "Spiroplasma", "Pneumoniae" and "Hominis" clusters. These clades are also distinguished based on a similarity matrix constructed based on 16S rRNA gene sequences. Mycoplasma species were dispersed across 3 of these 4 clades highlighting their extensive polyphyly. In parallel, our comparative genomic analyses have identified > 100 conserved signature indels (CSIs) and 14 conserved signature proteins (CSPs), which are uniquely shared by the members of four identified clades, strongly supporting their monophyly and identifying them in molecular terms. Mycoplasma mycoides, the type species of the genus Mycoplasma, and a small number of other Mycoplasma species, formed a strongly supported clade within the "Spiroplasma" cluster. Nine CSIs and 14 CSPs reliably distinguish this clade from all other Mycoplasmatales species. The remainder of the Mycoplasmatales species are part of the "Pneumoniae" and "Hominis" clusters, which group together in phylogenetic trees. Here we are proposing that the order Mycoplasmatales should be emended to encompass only the Mycoplasma species within the "Spiroplasma" cluster and that a new order, Mycoplasmoidales ord. nov., should be created to encompass the other Mycoplasma species. The "Pneumoniae" and the "Hominis" clusters are proposed as two new families, Mycoplasmoidaceae fam. nov., which includes the genera Eperythrozoon, Ureaplasma, and the newly proposed genera Malacoplasma and Mycoplasmoides, and Metamycoplasmataceae fam. nov. to contain the newly proposed genera Metamycoplasma, Mycoplasmopsis, and Mesomycoplasma. The results presented here allow reliable discernment, both in phylogenetic and molecular terms, of the members of the two proposed families as well as different described genera within these families including members of the genus Eperythrozoon, which is comprised of uncultivable organisms. The taxonomic reclassifications proposed here, which more accurately portray the genetic diversity among the Tenericutes/Mycoplasma species, provide a new framework for understanding the biological and clinical aspects of these important microbes.
Assuntos
Genoma Bacteriano , Filogenia , Tenericutes/classificação , Tenericutes/genética , Sequência de Aminoácidos , Proteínas de Bactérias , Sequência de Bases , Sequência Conservada , DNA Bacteriano/genética , Variação Genética , Mutação INDEL , Mycoplasma/classificação , Mycoplasma/genética , RNA Ribossômico 16S/genética , Proteínas Ribossômicas , Análise de Sequência de DNA , Especificidade da EspécieRESUMO
The evolution and diversification of different types of photosynthetic reaction centers (RCs) remains an important unresolved problem. We report here novel sequence features of the core proteins from Type I RCs (RC-I) and Type II RCs (RC-II) whose analyses provide important insights into the evolution of the RCs. The sequence alignments of the RC-I core proteins contain two conserved inserts or deletions (indels), a 3 amino acid (aa) indel that is uniquely found in all RC-I homologs from Cyanobacteria (both PsaA and PsaB) and a 1 aa indel that is specifically shared by the Chlorobi and Acidobacteria homologs. Ancestral sequence reconstruction provides evidence that the RC-I core protein from Heliobacteriaceae (PshA), lacking these indels, is most closely related to the ancestral RC-I protein. Thus, the identified 3 aa and 1 aa indels in the RC-I protein sequences must have been deletions, which occurred, respectively, in an ancestor of the modern Cyanobacteria containing a homodimeric form of RC-I and in a common ancestor of the RC-I core protein from Chlorobi and Acidobacteria. We also report a conserved 1 aa indel in the RC-II protein sequences that is commonly shared by all homologs from Cyanobacteria but not found in the homologs from Chloroflexi, Proteobacteria and Gemmatimonadetes. Ancestral sequence reconstruction provides evidence that the RC-II subunits lacking this indel are more similar to the ancestral RC-II protein. The results of flexible structural alignments of the indel-containing region of the RC-II protein with the homologous region in the RC-I core protein, which shares structural similarity with the RC-II homologs, support the view that the 1 aa indel present in the RC-II homologs from Cyanobacteria is a deletion, which was not present in the ancestral form of the RC-II protein. Our analyses of the conserved indels found in the RC-I and RC-II proteins, thus, support the view that the earliest photosynthetic lineages with living descendants likely contained only a single RC (RC-I or RC-II), and the presence of both RC-I and RC-II in a linked state, as found in the modern Cyanobacteria, is a derivation from these earlier phototrophs.
Assuntos
Mutação INDEL , Fotossíntese , Complexo de Proteínas do Centro de Reação Fotossintética/genética , Sequência de Aminoácidos , Filogenia , Homologia de Sequência de AminoácidosRESUMO
Understanding of the phylogeny and interrelationships of the genera within the order 'Enterobacteriales' has proven difficult using the 16S rRNA gene and other single-gene or limited multi-gene approaches. In this work, we have completed comprehensive comparative genomic analyses of the members of the order 'Enterobacteriales' which includes phylogenetic reconstructions based on 1548 core proteins, 53 ribosomal proteins and four multilocus sequence analysis proteins, as well as examining the overall genome similarity amongst the members of this order. The results of these analyses all support the existence of seven distinct monophyletic groups of genera within the order 'Enterobacteriales'. In parallel, our analyses of protein sequences from the 'Enterobacteriales' genomes have identified numerous molecular characteristics in the forms of conserved signature insertions/deletions, which are specifically shared by the members of the identified clades and independently support their monophyly and distinctness. Many of these groupings, either in part or in whole, have been recognized in previous evolutionary studies, but have not been consistently resolved as monophyletic entities in 16S rRNA gene trees. The work presented here represents the first comprehensive, genome-scale taxonomic analysis of the entirety of the order 'Enterobacteriales'. On the basis of phylogenetic analyses and the numerous identified conserved molecular characteristics, which clearly distinguish members of the order 'Enterobacteriales' and the seven reported clades within this order, a proposal is made here for the order Enterobacterales ord. nov. which consists of seven families: Enterobacteriaceae, Erwiniaceae fam. nov., Pectobacteriaceae fam. nov., Yersiniaceae fam. nov., Hafniaceae fam. nov., Morganellaceae fam. nov., and Budviciaceae fam. nov.
Assuntos
Gammaproteobacteria/classificação , Filogenia , Sequência de Aminoácidos , Técnicas de Tipagem Bacteriana , DNA Bacteriano/genética , Mutação INDEL , Tipagem de Sequências Multilocus , RNA Ribossômico 16S/genética , Análise de Sequência de DNARESUMO
The phylum "Deinococcus-Thermus" contains two heavily researched groups of extremophilic bacteria: the highly radioresistant order Deinococcales and the thermophilic order Thermales. Very few characteristics are known that are uniquely shared by members of the phylum "Deinococcus-Thermus". Comprehensive phylogenetic and comparative analyses of >65 "Deinococcus-Thermus" genomes reported here have identified numerous molecular signatures in the forms of conserved signature insertions/deletions (CSIs) and conserved signature proteins (CSPs), which provide distinguishing characteristics of the phylum "Deinococcus-Thermus" and its main groups. We have identified 58 unique CSIs and 155 unique CSPs that delineate different phylogenetic groups within the phylum. Of these identified traits, 24 CSIs and 29 CSPs are characteristic of the phylum "Deinococcus-Thermus" and they provide novel and reliable means to circumscribe/describe this phylum. An additional 3 CSIs and 3 CSPs are characteristic of the order Deinococcales, and 6 CSIs and 51 CSPs are characteristic of the order Thermales. The remaining 25 CSIs and 72 CSPs identified in this study are distinctive traits of genus level groups within the phylum "Deinococcus-Thermus". The molecular characteristics identified in this work provide novel and independent support for the common ancestry of the members of the phylum "Deinococcus-Thermus" and provide a new means to distinguish the main constituent clades of the phylum. Additionally, the CSIs and CSPs identified in this work may play a role in the unique extremophilic adaptations of the members of this phylum and further functional analyses of these characteristics could provide novel biochemical insights into the unique adaptations found within the phylum "Deinococcus-Thermus".
Assuntos
Proteínas de Bactérias/genética , Deinococcus/classificação , Deinococcus/genética , Genoma Bacteriano/genética , Tolerância a Radiação/genética , Termotolerância/genética , Thermus/classificação , Thermus/genética , Técnicas de Tipagem Bacteriana , DNA Bacteriano/genética , Mutação INDEL/genética , Filogenia , RNA Ribossômico 16S/genética , Valina-tRNA Ligase/ultraestruturaRESUMO
The order Bifidobacteriales comprises a diverse variety of species found in the gastrointestinal tract of humans and other animals, some of which are opportunistic pathogens, whereas a number of others exhibit health-promoting effects. However, currently very few biochemical or molecular characteristics are known which are specific for the order Bifidobacteriales, or specific clades within this order, which distinguish them from other bacteria. This study reports the results of detailed comparative genomic and phylogenetic studies on 62 genome-sequenced species/strains from the order Bifidobacteriales. In a robust phylogenetic tree for the Bifidobacteriales constructed based on 614 core proteins, a number of well-resolved clades were observed including a clade separating the Scarodvia-related genera (Scardovia clade) from the genera Bifidobacterium and Gardnerella, as well as a number of previously reported clusters of Bifidobacterium spp. In parallel, our comparative analyses of protein sequences from the Bifidobacteriales genomes have identified numerous molecular markers that are specific for this group of bacteria. Of these markers, 32 conserved signature indels (CSIs) in widely distributed proteins and 10 signature proteins are distinctive characteristics of all sequenced Bifidobacteriales species and provide novel and highly specific means for distinguishing these bacteria. In addition, multiple other molecular signatures are specific for the following clades of Bifidobacteriales: (i) 5 CSIs specific for a clade comprising of the Scardovia-related genera; (ii) 3 CSIs and 2 CSPs specific for a clade consisting of the Bifidobacterium and Gardnerella spp.; (iii) multiple other signatures demarcating a number of clusters of the B. asteroides-and B. longum- related species. The described molecular markers provide novel and reliable means for distinguishing the Bifidobacteriales and a number of their clades in molecular terms and for the classification of these bacteria. The Bifidobacteriales-specific CSIs, found in important proteins, are predicted to play important roles in modifying the cellular functions of the affected proteins. Hence, biochemical studies on the cellular functions of these CSIs could lead to discovery of novel characteristics of either all Bifidobacteriales, or specific groups of bacteria within this order. Some of the functions affected/modified by these genetic changes could also be important for the probiotic/pathogenic activities of the bifidobacteria.
RESUMO
The evolutionary interrelationships between the archaeal organisms which comprise the class Halobacteria have proven difficult to elucidate using traditional phylogenetic tools. The class currently contains three orders. However, little is known about the family level relationships within these orders. In this work, we have completed a comprehensive comparative analysis of 129 sequenced genomes from members of the class Halobacteria in order to identify shared molecular characteristics, in the forms of conserved signature insertions/deletions (CSIs) and conserved signature proteins (CSPs), which can provide reliable evidence, independent of phylogenetic trees, that the species from the groups in which they are found are specifically related to each other due to common ancestry. Here we present 20 CSIs and 31 CSPs which are unique characteristics of infra-order level groups of genera within the class Halobacteria. We also present 40 CSIs and 234 CSPs which are characteristic of Haloarcula, Halococcus, Haloferax, or Halorubrum. Importantly, the CSIs and CSPs identified here provide evidence that the order Haloferacales contains two main groups, one consisting of Haloferax and related genera supported by four CSIs and five CSPs and the other consisting of Halorubrum and related genera supported by four CSPs. We have also identified molecular characteristics that suggest that the polyphyletic order Halobacteriales contains at least two large monophyletic clusters of organisms in addition to the polyphyletic members of the order, one cluster consisting of Haloarcula and related genera supported by ten CSIs and nineteen CSPs and the other group consisting of the members of the genus Halococcus supported by nine CSIs and 23 CSPs. We have also produced a highly robust phylogenetic tree based on the concatenated sequences of 766 proteins which provide additional support for the relationships identified by the CSIs and CSPs. On the basis of the phylogenetic analyses and the identified conserved molecular characteristics presented here, we propose a division of the order Haloferacales into two families, an emended family Haloferacaceae and Halorubraceae fam. nov. and a division of the order Halobacteriales into three families, an emended family Halobacteriaceae, Haloarculaceae fam. nov., and Halococcaceae fam. nov.
Assuntos
Halobacteriaceae/classificação , Halobacteriales/classificação , Sequência de Aminoácidos , Proteínas Arqueais/genética , Sequência de Bases , Sequência Conservada , DNA Arqueal/análise , DNA Arqueal/genética , Genoma Arqueal , Halobacteriaceae/genética , Halobacteriales/genética , Mutação INDEL , Filogenia , RNA Ribossômico 16S/genética , Alinhamento de Sequência , Análise de Sequência de DNARESUMO
The phylum Chlamydiae contains nine ecologically and genetically diverse families all placed within a single order. In this work, we have completed a comprehensive comparative analysis of 36 sequenced Chlamydiae genomes in order to identify shared molecular characteristics, namely conserved signature insertions/deletions (CSIs) and conserved signature proteins (CSPs), which can serve as distinguishing characteristics of supra-familial clusters within the phylum Chlamydiae. Our analysis has led to the identification of 32 CSIs which are specific to clusters within the phylum Chlamydiae at various phylogenetic depths. Importantly, 17 CSIs and 98 CSPs were found to be specific for the family Chlamydiaceae while another 3 CSI variants and 15 CSPs were specific for a grouping of the families Criblamydiaceae, Parachlamydiaceae, Simkaniaceae and Waddliaceae. These two clusters were also found to be distinguishable in 16S rRNA based phylogenetic trees, concatenated protein based phylogenetic trees, character compatibility based phylogenetic analyses, and on the basis of 16S rRNA gene sequence identity and average amino acid identity values. On the basis of the identified molecular characteristics, branching in phylogenetic trees, and the genetic distance between the two clusters within the phylum Chlamydiae we propose a division of the class Chlamydiia into two orders: an emended order Chlamydiales, containing the family Chlamydiaceae and the closely related Candidatus family Clavichlamydiaceae, and the novel order Parachlamydiales ord. nov. containing the families Parachlamydiaceae, Simkaniaceae and Waddliaceae and the Candidatus families Criblamydiaceae, Parilichlamydiaceae, Piscichlamydiaceae, and Rhabdochlamydiaceae. We also include a brief discussion of the reunification of the genera Chlamydia and Chlamydophila.
Assuntos
Chlamydiales/classificação , Chlamydiales/genética , Variação Genética , Proteínas de Bactérias/genética , Análise por Conglomerados , Biologia Computacional , Sequência Conservada , DNA Bacteriano/química , DNA Bacteriano/genética , DNA Ribossômico/química , DNA Ribossômico/genética , Genoma Bacteriano , Filogenia , RNA Ribossômico 16S/genética , Análise de Sequência de DNARESUMO
The class Negativicutes is currently divided into one order and two families on the basis of 16S rRNA gene sequence phylogenies. We report here comprehensive comparative genomic analyses of the sequenced members of the class Negativicutes to demarcate its different evolutionary groups in molecular terms, independently of phylogenetic trees. Our comparative genomic analyses have identified 14 conserved signature indels (CSIs) and 48 conserved signature proteins (CSPs) that either are specific for the entire class or differentiate four main groups within the class. Two CSIs and nine CSPs are shared uniquely by all or most members of the class Negativicutes, distinguishing this class from all other sequenced members of the phylum Firmicutes. Four other CSIs and six CSPs were specific characteristics of the family Acidaminococcaceae, two CSIs and four CSPs were uniquely present in the family Veillonellaceae, six CSIs and eight CSPs were found only in Selenomonas and related genera, and 17 CSPs were identified uniquely in Sporomusa and related genera. Four additional CSPs support a pairing of the groups containing the genera Selenomonas and Sporomusa. We also report detailed phylogenetic analyses for the Negativicutes based on core protein sequences and 16S rRNA gene sequences, which strongly support the four main groups identified by CSIs and by CSPs. Based on the results from different lines of investigation, we propose a division of the class Negativicutes into an emended order Selenomonadales containing the new families Selenomonadaceae fam. nov. and Sporomusaceae fam. nov. and two new orders, Acidaminococcales ord. nov. and Veillonellales ord. nov., respectively containing the families Acidaminococcaceae and Veillonellaceae.
Assuntos
Veillonellaceae , Sequência de Aminoácidos , Aprepitanto , Proteínas de Bactérias/genética , DNA Bacteriano/genética , Ácidos Graxos/genética , Genoma Bacteriano , Bactérias Gram-Positivas/genética , Mutação INDEL , Dados de Sequência Molecular , Morfolinas , Filogenia , RNA Ribossômico 16S/genética , Análise de Sequência de DNA , Veillonellaceae/genéticaRESUMO
The genera Actinobacillus, Haemophilus, and Pasteurella exhibit extensive polyphyletic branching in phylogenetic trees and do not represent coherent clusters of species. In this study, we have utilized molecular signatures identified through comparative genomic analyses in conjunction with genome based and multilocus sequence based phylogenetic analyses to clarify the phylogenetic and taxonomic boundary of these genera. We have identified large clusters of Actinobacillus, Haemophilus, and Pasteurella species which represent the "sensu stricto" members of these genera. We have identified 3, 7, and 6 conserved signature indels (CSIs), which are specifically shared by sensu stricto members of Actinobacillus, Haemophilus, and Pasteurella, respectively. We have also identified two different sets of CSIs that are unique characteristics of the pathogen containing genera Aggregatibacter and Mannheimia, respectively. It is now possible to demarcate the genera Actinobacillus sensu stricto, Haemophilus sensu stricto, and Pasteurella sensu stricto on the basis of discrete molecular signatures. The other members of the genera Actinobacillus, Haemophilus, and Pasteurella that do not fall within the "sensu stricto" clades and do not contain these molecular signatures should be reclassified as other genera. The CSIs identified here also provide useful diagnostic targets for the identification of current and novel members of the indicated genera.
RESUMO
The current taxonomy of the order Xanthomonadales is highly problematic and no comprehensive phylogenomic studies have been completed that include the most divergent members within the order. In this work, we have completed a phylogenomic analysis of a wide range of genomes, five of which were sequenced for the first time for this work, representing the vast majority of the diversity within the order Xanthomonadales. Using comparative genomic techniques, we have identified a large number of conserved signature inserts/deletions (CSIs) that are specifically found in different groups of related organisms, at different taxonomic levels, within the order. Our phylogenetic analyses do not support a monophyletic grouping of the members of the order Xanthomonadales and no CSIs were identified which are uniquely shared by all sequenced species within this order. However, our work has identified 10 CSIs which are specific to all members of the family Xanthomonadaceae and an additional 10 and 11 CSIs that are specific to one of two phylogenetically well-defined clades within the family Xanthomonadaceae. On the basis of the identified CSIs and the results of phylogenomic analyses, we propose a new taxonomic framework for the order Xanthomonadales. In this proposal, the families Algiphilaceae and Solimonadaceae (Nevskiaceae), which do not branch with the other members of the order Xanthomonadales, are transferred into the order Nevskiales ord. nov. The remaining members of the order Xanthomonadales are divided into two families: the family Xanthomonadaceae, containing the genus Xanthomonas and its closest relatives, and a new family, Rhodanobacteraceae fam. nov., containing the genus Rhodanobacter and its closest relatives. Additionally, we have also emended descriptions of the order Lysobacterales, the family Lysobacteraceae, and the family Nevskiaceae to indicate that they are earlier synonyms of the order Xanthomonadales, the family Xanthomonadaceae, and the family Solimonadaceae, respectively.
Assuntos
Gammaproteobacteria/classificação , Gammaproteobacteria/genética , Variação Genética , Filogenia , Análise por Conglomerados , DNA Bacteriano/química , DNA Bacteriano/genética , Mutação INDEL , Dados de Sequência Molecular , Análise de Sequência de DNARESUMO
The genus Borrelia contains two groups of organisms: the causative agents of Lyme disease and their relatives and the causative agents of relapsing fever and their relatives. These two groups are morphologically indistinguishable and are difficult to distinguish biochemically. In this work, we have carried out detailed comparative genomic analyses on protein sequences from 38 Borrelia genomes to identify molecular markers in the forms of conserved signature inserts/deletions (CSIs) that are specifically found in the Borrelia homologues, and conserved signature proteins (CSPs) which are uniquely present in Borrelia species. Our analyses have identified 31 CSIs and 82 CSPs that are uniquely shared by all sequenced Borrelia species, providing molecular markers for this group of organisms. In addition, our work has identified 7 CSIs and 21 CSPs which are uniquely found in the Lyme disease Borrelia species and eight CSIs and four CSPs that are specific for members of the relapsing fever Borrelia group. Additionally, 38 other CSIs, in proteins which are uniquely found in Borrelia species, also distinguish these two groups of Borrelia. The identified CSIs and CSPs provide novel and highly specific molecular markers for identification and distinguishing between the Lyme disease Borrelia and the relapsing fever Borrelia species. We also report the results of average nucleotide identity (ANI) analysis on Borrelia genomes and phylogenetic analysis for these species based upon 16S rRNA sequences and concatenated sequences for 25 conserved proteins. These analyses also support the distinctness of the two Borrelia clades. On the basis of the identified molecular markers, the results from ANI and phylogenetic studies, and the distinct pathogenicity profiles and arthropod vectors used by different Borrelia spp. for their transmission, we are proposing a division of the genus Borrelia into two separate genera: an emended genus Borrelia, containing the causative agents of relapsing fever and a novel genus, Borreliella gen. nov., containing the causative agents of Lyme disease.
Assuntos
Borrelia/classificação , Borrelia/genética , Marcadores Genéticos , Doença de Lyme/microbiologia , Filogenia , Febre Recorrente/microbiologia , Proteínas de Bactérias/genética , Borrelia/isolamento & purificação , Análise por Conglomerados , DNA Bacteriano/química , DNA Bacteriano/genética , DNA Ribossômico/química , DNA Ribossômico/genética , Humanos , Dados de Sequência Molecular , RNA Ribossômico 16S/genética , Análise de Sequência de DNARESUMO
The genus Burkholderia contains large number of diverse species which include many clinically important organisms, phytopathogens, as well as environmental species. However, currently, there is a paucity of biochemical or molecular characteristics which can reliably distinguish different groups of Burkholderia species. We report here the results of detailed phylogenetic and comparative genomic analyses of 45 sequenced species of the genus Burkholderia. In phylogenetic trees based upon concatenated sequences for 21 conserved proteins as well as 16S rRNA gene sequence based trees, members of the genus Burkholderia grouped into two major clades. Within these main clades a number of smaller clades including those corresponding to the clinically important Burkholderia cepacia complex (BCC) and the Burkholderia pseudomallei groups were also clearly distinguished. Our comparative analysis of protein sequences from Burkholderia spp. has identified 42 highly specific molecular markers in the form of conserved sequence indels (CSIs) that are uniquely found in a number of well-defined groups of Burkholderia spp. Six of these CSIs are specific for a group of Burkholderia spp. (referred to as Clade I in this work) which contains all clinically relevant members of the genus (viz. the BCC and the B. pseudomallei group) as well as the phytopathogenic Burkholderia spp. The second main clade (Clade II), which is composed of environmental Burkholderia species, is also distinguished by 2 identified CSIs that are specific for this group. Additionally, our work has also identified multiple CSIs that serve to clearly demarcate a number of smaller groups of Burkholderia spp. including 3 CSIs that are specific for the B. cepacia complex, 4 CSIs that are uniquely found in the B. pseudomallei group, 5 CSIs that are specific for the phytopathogenic Burkholderia spp. and 22 other CSI that distinguish two groups within Clade II. The described molecular markers provide highly specific means for the demarcation of different groups of Burkholderia spp. and they also offer novel and useful targets for the development of diagnostic assays for the clinically important members of the BCC or the pseudomallei groups. Based upon the results of phylogenetic analyses, the identified CSIs and the pathogenicity profile of Burkholderia species, we are proposing a division of the genus Burkholderia into two genera. In this new proposal, the emended genus Burkholderia will correspond to the Clade I and it will contain only the clinically relevant and phytopathogenic Burkholderia species. All other Burkholderia spp., which are primarily environmental, will be transferred to a new genus Paraburkholderia gen. nov.
RESUMO
[This corrects the article on p. 217 in vol. 4, PMID: 23908650.].
RESUMO
The Spirochaetes species cause many important diseases including syphilis and Lyme disease. Except for their containing a distinctive endoflagella, no other molecular or biochemical characteristics are presently known that are specific for either all Spirochaetes or its different families. We report detailed comparative and phylogenomic analyses of protein sequences from Spirochaetes genomes to understand their evolutionary relationships and to identify molecular signatures for this group. These studies have identified 38 conserved signature indels (CSIs) that are specific for either all members of the phylum Spirochaetes or its different main clades. Of these CSIs, a 3 aa insert in the FlgC protein is uniquely shared by all sequenced Spirochaetes providing a molecular marker for this phylum. Seven, six, and five CSIs in different proteins are specific for members of the families Spirochaetaceae, Brachyspiraceae, and Leptospiraceae, respectively. Of the 19 other identified CSIs, 3 are uniquely shared by members of the genera Sphaerochaeta, Spirochaeta, and Treponema, whereas 16 others are specific for the genus Borrelia. A monophyletic grouping of the genera Sphaerochaeta, Spirochaeta, and Treponema distinct from the genus Borrelia is also strongly supported by phylogenetic trees based upon concatenated sequences of 22 conserved proteins. The molecular markers described here provide novel and more definitive means for identification and demarcation of different main groups of Spirochaetes. To accommodate the extensive genetic diversity of the Spirochaetes as revealed by different CSIs and phylogenetic analyses, it is proposed that the four families of this phylum should be elevated to the order level taxonomic ranks (viz. Spirochaetales, Brevinematales ord. nov., Brachyspiriales ord. nov., and Leptospiriales ord. nov.). It is further proposed that the genera Borrelia and Cristispira be transferred to a new family Borreliaceae fam. nov. within the order Spirochaetales.
RESUMO
The species from the order Neisseriales are currently distinguished from other bacteria on the basis of branching in 16S rRNA gene trees. For this order containing a single family, Neisseriaceae, no distinctive molecular, biochemical, or phenotypic characters are presently known. We report here detailed phylogenetic and comparative analyses on the 27 genome sequenced species of the order Neisseriales. Our comparative genomic analyses have identified 54 conserved signature indels (CSIs) in widely distributed proteins that are specific for either all of the sequenced Neisseriales species or a number of clades within this order that are also supported by phylogenetic analyses. Of these CSIs, 11 are specifically present in all of the sequenced species from this order, but are not found in homologous proteins from any other bacteria. These CSIs provide novel molecular markers specific for, and delimiting, this order. Twenty-one CSIs in diverse proteins are specific for a group comprised of the genera Neisseria, Eikenella, Kingella, and Simonsiella (Clade I), which are obligate host-associated organisms, lacking flagella and exhibiting varied morphology. The species from these genera also formed a strongly supported clade in phylogenetic trees based upon concatenated protein sequences; a monophyletic grouping of these genera and other genera displaying similar morphological characteristics was also observed in the 16S rRNA gene tree. A second clade (Clade II), supported by seven of the identified CSIs and phylogenetic trees based upon concatenated protein sequences, grouped together species from the genera Chromobacterium, Laribacter, and Pseudogulbenkiania that are rod-shaped bacteria, which display flagella-based motility and are capable of free living. The remainder of the CSIs were uniquely shared by smaller groups within these two main clades. Our analyses also provide novel insights into the evolutionary history of the Neisseriales and suggest that the CSIs that are specific for the Clade I species may play an important role in the evolution of obligate host-association within this order. On the basis of phylogenetic analysis, the identified CSIs, and conserved phenotypic characteristics of different Neisseriales genera, we propose a division of this order into two families: an emended family Neisseriaceae (corresponding to Clade I) containing the genera Alysiella, Bergeriella, Conchiformibius, Eikenella, Kingella, Neisseria, Simonsiella, Stenoxybacter, Uruburuella and Vitreoscilla and a new family, Chromobacteriaceae fam. nov., harboring the remainder of the genera from this order (viz. Andreprevotia, Aquaspirillum, Aquitalea, Chitinibacter, Chitinilyticum, Chitiniphilus, Chromobacterium, Deefgea, Formivibrio, Gulbenkiania, Iodobacter, Jeongeupia, Laribacter, Leeia, Microvirgula, Paludibacterium, Pseudogulbenkiania, Silvimonas, and Vogesella).