Search | Biblioteca Virtual em Saúde (Virtual Health Library)

1.

Defining the core Arabidopsis thaliana root microbiome.

Lundberg, Derek S; Lebeis, Sarah L; Paredes, Sur Herrera; Yourstone, Scott; Gehring, Jase; Malfatti, Stephanie; Tremblay, Julien; Engelbrektson, Anna; Kunin, Victor; Del Rio, Tijana Glavina; Edgar, Robert C; Eickhorst, Thilo; Ley, Ruth E; Hugenholtz, Philip; Tringe, Susannah Green; Dangl, Jeffery L.

Nature ; 488(7409): 86-90, 2012 Aug 02.

Article in English | MEDLINE | ID: mdl-22859206

ABSTRACT

Land plants associate with a root microbiota distinct from the complex microbial community present in surrounding soil. The microbiota colonizing the rhizosphere (immediately surrounding the root) and the endophytic compartment (within the root) contribute to plant growth, productivity, carbon sequestration and phytoremediation. Colonization of the root occurs despite a sophisticated plant immune system, suggesting finely tuned discrimination of mutualists and commensals from pathogens. Genetic principles governing the derivation of host-specific endophyte communities from soil communities are poorly understood. Here we report the pyrosequencing of the bacterial 16S ribosomal RNA gene of more than 600 Arabidopsis thaliana plants to test the hypotheses that the root rhizosphere and endophytic compartment microbiota of plants grown under controlled conditions in natural soils are sufficiently dependent on the host to remain consistent across different soil types and developmental stages, and sufficiently dependent on host genotype to vary between inbred Arabidopsis accessions. We describe different bacterial communities in two geochemically distinct bulk soils and in rhizosphere and endophytic compartments prepared from roots grown in these soils. The communities in each compartment are strongly influenced by soil type. Endophytic compartments from both soils feature overlapping, low-complexity communities that are markedly enriched in Actinobacteria and specific families from other phyla, notably Proteobacteria. Some bacteria vary quantitatively between plants of different developmental stage and genotype. Our rigorous definition of an endophytic compartment microbiome should facilitate controlled dissection of plant-microbe interactions derived from complex soil communities.

Subjects

Arabidopsis/microbiology; Endophytes/classification; Endophytes/isolation & purification; Metagenome; Plant Roots/microbiology; Soil Microbiology; Actinobacteria/genetics; Actinobacteria/isolation & purification; Arabidopsis/classification; Arabidopsis/growth & development; Endophytes/genetics; Genotype; In Situ Hybridization, Fluorescence; Plant Roots/classification; Plant Roots/growth & development; Proteobacteria/genetics; Proteobacteria/isolation & purification; RNA, Ribosomal, 16S/genetics; RNA, Ribosomal, 16S/isolation & purification; Rhizosphere; Ribotyping; Sequence Analysis, DNA; Symbiosis

2.

A phylogeny-driven genomic encyclopaedia of Bacteria and Archaea.

Wu, Dongying; Hugenholtz, Philip; Mavromatis, Konstantinos; Pukall, Rüdiger; Dalin, Eileen; Ivanova, Natalia N; Kunin, Victor; Goodwin, Lynne; Wu, Martin; Tindall, Brian J; Hooper, Sean D; Pati, Amrita; Lykidis, Athanasios; Spring, Stefan; Anderson, Iain J; D'haeseleer, Patrik; Zemla, Adam; Singer, Mitchell; Lapidus, Alla; Nolan, Matt; Copeland, Alex; Han, Cliff; Chen, Feng; Cheng, Jan-Fang; Lucas, Susan; Kerfeld, Cheryl; Lang, Elke; Gronow, Sabine; Chain, Patrick; Bruce, David; Rubin, Edward M; Kyrpides, Nikos C; Klenk, Hans-Peter; Eisen, Jonathan A.

Nature ; 462(7276): 1056-60, 2009 Dec 24.

Article in English | MEDLINE | ID: mdl-20033048

ABSTRACT

Sequencing of bacterial and archaeal genomes has revolutionized our understanding of the many roles played by microorganisms. There are now nearly 1,000 completed bacterial and archaeal genomes available, most of which were chosen for sequencing on the basis of their physiology. As a result, the perspective provided by the currently available genomes is limited by a highly biased phylogenetic distribution. To explore the value added by choosing microbial genomes for sequencing on the basis of their evolutionary relationships, we have sequenced and analysed the genomes of 56 culturable species of Bacteria and Archaea selected to maximize phylogenetic coverage. Analysis of these genomes demonstrated pronounced benefits (compared to an equivalent set of genomes randomly selected from the existing database) in diverse areas including the reconstruction of phylogenetic history, the discovery of new protein families and biological properties, and the prediction of functions for known genes from other organisms. Our results strongly support the need for systematic 'phylogenomic' efforts to compile a phylogeny-driven 'Genomic Encyclopedia of Bacteria and Archaea' in order to derive maximum knowledge from existing microbial genome data as well as from genome sequences to come.

Subjects

Archaea/classification; Archaea/genetics; Bacteria/classification; Bacteria/genetics; Genome, Archaeal/genetics; Genome, Bacterial/genetics; Phylogeny; Actins/chemistry; Amino Acid Sequence; Bacterial Proteins/chemistry; Biodiversity; Databases, Genetic; Genes, rRNA/genetics; Models, Molecular; Molecular Sequence Data; Protein Structure, Tertiary; Sequence Alignment

3.

Effects of OTU clustering and PCR artifacts on microbial diversity estimates.

Patin, Nastassia V; Kunin, Victor; Lidström, Ulrika; Ashby, Matthew N.

Microb Ecol ; 65(3): 709-19, 2013 Apr.

Article in English | MEDLINE | ID: mdl-23233090

ABSTRACT

Next-generation sequencing has increased the coverage of microbial diversity surveys by orders of magnitude, but differentiating artifacts from rare environmental sequences remains a challenge. Clustering 16S rRNA sequences into operational taxonomic units (OTUs) organizes sequence data into groups of 97 % identity, helping to reduce data volumes and avoid analyzing sequencing artifacts by grouping them with real sequences. Here, we analyze sequence abundance distributions across environmental samples and show that 16S rRNA sequences of >99 % identity can represent functionally distinct microorganisms, rendering OTU clustering problematic when the goal is an accurate analysis of organism distribution. Strict postsequencing quality control (QC) filters eliminated the most prevalent artifacts without clustering. Further experiments proved that DNA polymerase errors in polymerase chain reaction (PCR) generate a significant number of substitution errors, most of which pass QC filters. Based on our findings, we recommend minimizing the number of PCR cycles in DNA library preparation and applying strict postsequencing QC filters to reduce the most prevalent artifacts while maintaining a high level of accuracy in diversity estimates. We further recommend correlating rare and abundant sequences across environmental samples, rather than clustering into OTUs, to identify remaining sequence artifacts without losing the resolution afforded by high-throughput sequencing.

Subjects

Actinomycetales/genetics; Biodiversity; Polymerase Chain Reaction/standards; Actinomycetales/classification; Actinomycetales/isolation & purification; DNA Primers/genetics; DNA, Bacterial/genetics; High-Throughput Nucleotide Sequencing; Polymerase Chain Reaction/methods; RNA, Ribosomal, 16S/genetics

4.

Metagenomic and functional analysis of hindgut microbiota of a wood-feeding higher termite.

Warnecke, Falk; Luginbühl, Peter; Ivanova, Natalia; Ghassemian, Majid; Richardson, Toby H; Stege, Justin T; Cayouette, Michelle; McHardy, Alice C; Djordjevic, Gordana; Aboushadi, Nahla; Sorek, Rotem; Tringe, Susannah G; Podar, Mircea; Martin, Hector Garcia; Kunin, Victor; Dalevi, Daniel; Madejska, Julita; Kirton, Edward; Platt, Darren; Szeto, Ernest; Salamov, Asaf; Barry, Kerrie; Mikhailova, Natalia; Kyrpides, Nikos C; Matson, Eric G; Ottesen, Elizabeth A; Zhang, Xinning; Hernández, Myriam; Murillo, Catalina; Acosta, Luis G; Rigoutsos, Isidore; Tamayo, Giselle; Green, Brian D; Chang, Cathy; Rubin, Edward M; Mathur, Eric J; Robertson, Dan E; Hugenholtz, Philip; Leadbetter, Jared R.

Nature ; 450(7169): 560-5, 2007 Nov 22.

Article in English | MEDLINE | ID: mdl-18033299

ABSTRACT

From the standpoints of both basic research and biotechnology, there is considerable interest in reaching a clearer understanding of the diversity of biological mechanisms employed during lignocellulose degradation. Globally, termites are an extremely successful group of wood-degrading organisms and are therefore important both for their roles in carbon turnover in the environment and as potential sources of biochemical catalysts for efforts aimed at converting wood into biofuels. Only recently have data supported any direct role for the symbiotic bacteria in the gut of the termite in cellulose and xylan hydrolysis. Here we use a metagenomic analysis of the bacterial community resident in the hindgut paunch of a wood-feeding 'higher' Nasutitermes species (which do not contain cellulose-fermenting protozoa) to show the presence of a large, diverse set of bacterial genes for cellulose and xylan hydrolysis. Many of these genes were expressed in vivo or had cellulase activity in vitro, and further analyses implicate spirochete and fibrobacter species in gut lignocellulose degradation. New insights into other important symbiotic functions including H2 metabolism, CO2-reductive acetogenesis and N2 fixation are also provided by this first system-wide gene analysis of a microbial community specialized towards plant lignocellulose degradation. Our results underscore how complex even a 1-μl environment can be.

Subjects

Bacteria/metabolism; Genome, Bacterial/genetics; Genomics; Intestines/microbiology; Isoptera/metabolism; Isoptera/microbiology; Wood/metabolism; Animals; Bacteria/enzymology; Bacteria/genetics; Bacteria/isolation & purification; Bioelectric Energy Sources; Carbon/metabolism; Catalytic Domain; Cellulose/metabolism; Costa Rica; Genes, Bacterial/genetics; Glycoside Hydrolases/chemistry; Glycoside Hydrolases/genetics; Glycoside Hydrolases/metabolism; Hydrolysis; Lignin/metabolism; Models, Biological; Molecular Sequence Data; Polymerase Chain Reaction; Symbiosis; Wood/chemistry; Xylans/metabolism

5.

A korarchaeal genome reveals insights into the evolution of the Archaea.

Elkins, James G; Podar, Mircea; Graham, David E; Makarova, Kira S; Wolf, Yuri; Randau, Lennart; Hedlund, Brian P; Brochier-Armanet, Céline; Kunin, Victor; Anderson, Iain; Lapidus, Alla; Goltsman, Eugene; Barry, Kerrie; Koonin, Eugene V; Hugenholtz, Phil; Kyrpides, Nikos; Wanner, Gerhard; Richardson, Paul; Keller, Martin; Stetter, Karl O.

Proc Natl Acad Sci U S A ; 105(23): 8102-7, 2008 Jun 10.

Article in English | MEDLINE | ID: mdl-18535141

ABSTRACT

The candidate division Korarchaeota comprises a group of uncultivated microorganisms that, by their small subunit rRNA phylogeny, may have diverged early from the major archaeal phyla Crenarchaeota and Euryarchaeota. Here, we report the initial characterization of a member of the Korarchaeota with the proposed name, "Candidatus Korarchaeum cryptofilum," which exhibits an ultrathin filamentous morphology. To investigate possible ancestral relationships between deep-branching Korarchaeota and other phyla, we used whole-genome shotgun sequencing to construct a complete composite korarchaeal genome from enriched cells. The genome was assembled into a single contig 1.59 Mb in length with a G + C content of 49%. Of the 1,617 predicted protein-coding genes, 1,382 (85%) could be assigned to a revised set of archaeal Clusters of Orthologous Groups (COGs). The predicted gene functions suggest that the organism relies on a simple mode of peptide fermentation for carbon and energy and lacks the ability to synthesize de novo purines, CoA, and several other cofactors. Phylogenetic analyses based on conserved single genes and concatenated protein sequences positioned the korarchaeote as a deep archaeal lineage with an apparent affinity to the Crenarchaeota. However, the predicted gene content revealed that several conserved cellular systems, such as cell division, DNA replication, and tRNA maturation, resemble the counterparts in the Euryarchaeota. In light of the known composition of archaeal genomes, the Korarchaeota might have retained a set of cellular features that represents the ancestral archaeal form.

Subjects

Biological Evolution; Genome, Archaeal/genetics; Korarchaeota/genetics; Cell Cycle; DNA Replication; Energy Metabolism; Evolution, Molecular; Korarchaeota/cytology; Korarchaeota/ultrastructure; Phylogeny; Protein Biosynthesis; Sequence Analysis, DNA; Transcription, Genetic

6.

Wrinkles in the rare biosphere: pyrosequencing errors can lead to artificial inflation of diversity estimates.

Kunin, Victor; Engelbrektson, Anna; Ochman, Howard; Hugenholtz, Philip.

Environ Microbiol ; 12(1): 118-23, 2010 Jan.

Article in English | MEDLINE | ID: mdl-19725865

ABSTRACT

Massively parallel pyrosequencing of the small subunit (16S) ribosomal RNA gene has revealed that the extent of rare microbial populations in several environments, the 'rare biosphere', is orders of magnitude higher than previously thought. One important caveat with this method is that sequencing error could artificially inflate diversity estimates. Although the per-base error of 16S rDNA amplicon pyrosequencing has been shown to be as good as or lower than Sanger sequencing, no direct assessments of pyrosequencing errors on diversity estimates have been reported. Using only Escherichia coli MG1655 as a reference template, we find that 16S rDNA diversity is grossly overestimated unless relatively stringent read quality filtering and low clustering thresholds are applied. In particular, the common practice of removing reads with unresolved bases and anomalous read lengths is insufficient to ensure accurate estimates of microbial diversity. Furthermore, common and reproducible homopolymer length errors can result in relatively abundant spurious phylotypes further confounding data interpretation. We suggest that stringent quality-based trimming of 16S pyrotags and clustering thresholds no greater than 97% identity should be used to avoid overestimates of the rare biosphere.

Subjects

Biodiversity; RNA, Ribosomal, 16S/genetics; Sequence Analysis, DNA/methods; Cluster Analysis; DNA, Bacterial/genetics; Escherichia coli/genetics; Genes, Bacterial; Genetic Variation; Sequence Alignment
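
As a rough illustration of the clustering-threshold effect described in the abstract of entry 6, the following Python sketch groups reads by pairwise identity. It is a toy under stated assumptions, not the authors' pipeline: reads are taken to be pre-aligned and of equal length, identity is the plain fraction of matching positions, and the names `identity` and `greedy_cluster` are hypothetical.

```python
def identity(a: str, b: str) -> float:
    """Fraction of matching positions between two equal-length reads."""
    if len(a) != len(b):
        # Assumption of this sketch: reads are already aligned and trimmed.
        raise ValueError("reads must be pre-aligned and of equal length")
    return sum(x == y for x, y in zip(a, b)) / len(a)


def greedy_cluster(reads, threshold):
    """Greedy centroid clustering.

    Each read joins the first cluster whose centroid (the read that
    founded the cluster) it matches at >= threshold identity; otherwise
    it founds a new cluster.
    """
    centroids = []
    clusters = []
    for read in reads:
        for i, centroid in enumerate(centroids):
            if identity(read, centroid) >= threshold:
                clusters[i].append(read)
                break
        else:
            centroids.append(read)
            clusters.append([read])
    return clusters


# A single 100-base template plus two error-containing copies of it.
template = "ACGT" * 25
one_error = "C" + template[1:]                         # 1 substitution
two_errors = "C" + template[1:50] + "T" + template[51:]  # 2 substitutions
reads = [template, one_error, two_errors]

clusters_97 = greedy_cluster(reads, 0.97)
clusters_100 = greedy_cluster(reads, 1.0)
```

In this toy data set all three reads derive from one template, so a 97% threshold groups them into a single cluster, while requiring 100% identity would report three separate clusters.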

7.

Metatranscriptomic array analysis of 'Candidatus Accumulibacter phosphatis'-enriched enhanced biological phosphorus removal sludge.

He, Shaomei; Kunin, Victor; Haynes, Matthew; Martin, Hector Garcia; Ivanova, Natalia; Rohwer, Forest; Hugenholtz, Philip; McMahon, Katherine D.

Environ Microbiol ; 12(5): 1205-17, 2010 May.

Article in English | MEDLINE | ID: mdl-20148930

ABSTRACT

Here we report the first metatranscriptomic analysis of gene expression and regulation of 'Candidatus Accumulibacter'-enriched lab-scale sludge during enhanced biological phosphorus removal (EBPR). Medium density oligonucleotide microarrays were generated with probes targeting most predicted genes hypothesized to be important for the EBPR phenotype. RNA samples were collected at the early stage of anaerobic and aerobic phases (15 min after acetate addition and switching to aeration respectively). We detected the expression of a number of genes involved in the carbon and phosphate metabolisms, as proposed by EBPR models (e.g. polyhydroxyalkanoate synthesis, a split TCA cycle through methylmalonyl-CoA pathway, and polyphosphate formation), as well as novel genes discovered through metagenomic analysis. The comparison between the early stage anaerobic and aerobic gene expression profiles showed that expression levels of most genes were not significantly different between the two stages. The majority of upregulated genes in the aerobic sample are predicted to encode functions such as transcription, translation and protein translocation, reflecting the rapid growth phase of Accumulibacter shortly after being switched to aerobic conditions. Components of the TCA cycle and machinery involved in ATP synthesis were also upregulated during the early aerobic phase. These findings support the predictions of EBPR metabolic models that the oxidation of intracellularly stored carbon polymers through the TCA cycle provides ATP for cell growth when oxygen becomes available. Nitrous oxide reductase was among the very few Accumulibacter genes upregulated in the anaerobic sample, suggesting that its expression is likely induced by the deprivation of oxygen.

Subjects

Bacterial Proteins/metabolism; Betaproteobacteria/metabolism; Gene Expression Profiling; Oligonucleotide Array Sequence Analysis/methods; Phosphorus/metabolism; Sewage/microbiology; Aerobiosis; Anaerobiosis; Bacterial Proteins/genetics; Betaproteobacteria/genetics; Betaproteobacteria/growth & development; Biodegradation, Environmental; Gene Expression Regulation; Metagenomics; RNA, Bacterial/analysis; RNA, Bacterial/genetics; RNA, Bacterial/isolation & purification

8.

Millimeter-scale genetic gradients and community-level molecular convergence in a hypersaline microbial mat.

Kunin, Victor; Raes, Jeroen; Harris, J Kirk; Spear, John R; Walker, Jeffrey J; Ivanova, Natalia; von Mering, Christian; Bebout, Brad M; Pace, Norman R; Bork, Peer; Hugenholtz, Philip.

Mol Syst Biol ; 4: 198, 2008.

Article in English | MEDLINE | ID: mdl-18523433

ABSTRACT

To investigate the extent of genetic stratification in structured microbial communities, we compared the metagenomes of 10 successive layers of a phylogenetically complex hypersaline mat from Guerrero Negro, Mexico. We found pronounced millimeter-scale genetic gradients that were consistent with the physicochemical profile of the mat. Despite these gradients, all layers displayed near-identical and acid-shifted isoelectric point profiles due to a molecular convergence of amino-acid usage, indicating that hypersalinity enforces an overriding selective pressure on the mat community.

Subjects

Genetics, Microbial; Salinity; Selection, Genetic; Amino Acids/metabolism; Mexico

9.

Metagenomic analysis of two enhanced biological phosphorus removal (EBPR) sludge communities.

García Martín, Héctor; Ivanova, Natalia; Kunin, Victor; Warnecke, Falk; Barry, Kerrie W; McHardy, Alice C; Yeates, Christine; He, Shaomei; Salamov, Asaf A; Szeto, Ernest; Dalin, Eileen; Putnam, Nik H; Shapiro, Harris J; Pangilinan, Jasmyn L; Rigoutsos, Isidore; Kyrpides, Nikos C; Blackall, Linda Louise; McMahon, Katherine D; Hugenholtz, Philip.

Nat Biotechnol ; 24(10): 1263-9, 2006 Oct.

Article in English | MEDLINE | ID: mdl-16998472

ABSTRACT

Enhanced biological phosphorus removal (EBPR) is one of the best-studied microbially mediated industrial processes because of its ecological and economic relevance. Despite this, it is not well understood at the metabolic level. Here we present a metagenomic analysis of two lab-scale EBPR sludges dominated by the uncultured bacterium, "Candidatus Accumulibacter phosphatis." The analysis sheds light on several controversies in EBPR metabolic models and provides hypotheses explaining the dominance of A. phosphatis in this habitat, its lifestyle outside EBPR and probable cultivation requirements. Comparison of the same species from different EBPR sludges highlights recent evolutionary dynamics in the A. phosphatis genome that could be linked to mechanisms for environmental adaptation. In spite of an apparent lack of phylogenetic overlap in the flanking communities of the two sludges studied, common functional themes were found, at least one of them complementary to the inferred metabolism of the dominant organism. The present study provides a much needed blueprint for a systems-level understanding of EBPR and illustrates that metagenomics enables detailed, often novel, insights into even well-studied biological systems.

Subjects

Betaproteobacteria/genetics; Betaproteobacteria/metabolism; Genome, Bacterial; Phosphorus/metabolism; Sewage/microbiology; Adaptation, Biological; Phosphorus/isolation & purification; Waste Disposal, Fluid

10.

TreeQ-VISTA: an interactive tree visualization tool with functional annotation query capabilities.

Gu, Shengyin; Anderson, Iain; Kunin, Victor; Cipriano, Michael; Minovitsky, Simon; Weber, Gunther; Amenta, Nina; Hamann, Bernd; Dubchak, Inna.

Bioinformatics ; 23(6): 764-6, 2007 Mar 15.

Article in English | MEDLINE | ID: mdl-17234642

ABSTRACT

We describe a general multiplatform exploratory tool called TreeQ-Vista, designed for presenting functional annotations in a phylogenetic context. Traits, such as phenotypic and genomic properties, are interactively queried from a user-provided relational database with a user-friendly interface which provides a set of tools for users with or without SQL knowledge. The query results are projected onto a phylogenetic tree and can be displayed in multiple color groups. A rich set of browsing, grouping and query tools is provided to facilitate trait exploration, comparison and analysis. AVAILABILITY: The program, detailed tutorial and examples are available online (http://genome.lbl.gov/vista/TreeQVista).

Subjects

Chromosome Mapping/methods; Databases, Genetic; Evolution, Molecular; Information Storage and Retrieval/methods; Models, Genetic; Software; User-Computer Interface; Computer Graphics; Computer Simulation; Database Management Systems; Phylogeny

11.

Denoising inferred functional association networks obtained by gene fusion analysis.

Kamburov, Atanas; Goldovsky, Leon; Freilich, Shiri; Kapazoglou, Aliki; Kunin, Victor; Enright, Anton J; Tsaftaris, Athanasios; Ouzounis, Christos A.

BMC Genomics ; 8: 460, 2007 Dec 14.

Article in English | MEDLINE | ID: mdl-18081932

ABSTRACT

BACKGROUND: Gene fusion detection - also known as the 'Rosetta Stone' method - involves the identification of fused composite genes in a set of reference genomes, which indicates potential interactions between its un-fused counterpart genes in query genomes. The precision of this method typically improves with an ever-increasing number of reference genomes. RESULTS: In order to explore the usefulness and scope of this approach for protein interaction prediction and generate a high-quality, non-redundant set of interacting pairs of proteins across a wide taxonomic range, we have exhaustively performed gene fusion analysis for 184 genomes using an efficient variant of a previously developed protocol. By analyzing interaction graphs and applying a threshold that limits the maximum number of possible interactions within the largest graph components, we show that we can reduce the number of implausible interactions due to the detection of promiscuous domains. With this generally applicable approach, we generate a robust set of over 2 million distinct and testable interactions encompassing 696,894 proteins in 184 species or strains, most of which have never been the subject of high-throughput experimental proteomics. We investigate the cumulative effect of increasing numbers of genomes on the fidelity and quantity of predictions, and show that, for large numbers of genomes, predictions do not become saturated but continue to grow linearly, for the majority of the species. We also examine the percentage of component (and composite) proteins with relation to the number of genes and further validate the functional categories that are highly represented in this robust set of detected genome-wide interactions. CONCLUSION: We illustrate the phylogenetic and functional diversity of gene fusion events across genomes, and their usefulness for accurate prediction of protein interaction and function.

Subjects

Gene Fusion; Gene Regulatory Networks; Arabidopsis/genetics; Bacterial Proteins/metabolism; Chlamydia/genetics; Genetic Variation; Genome; Phylogeny; Plant Proteins/metabolism; Protein Binding; Reproducibility of Results
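
The 'Rosetta Stone' logic summarized in the abstract of entry 11 can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the protocol used in the paper: each protein is reduced to hypothetical family labels instead of being compared by sequence similarity, the function name `predict_interactions` and all protein and family names are invented, and the filtering of promiscuous domains is omitted.

```python
from itertools import combinations


def predict_interactions(query, reference):
    """Predict interacting pairs of query proteins from fused reference genes.

    query:     dict mapping a query protein name to its single family label
    reference: dict mapping a reference protein name to the family labels
               found along its sequence (two or more labels = composite)

    Two distinct query proteins are predicted to interact when their
    families occur together in at least one composite reference protein.
    """
    # Index query proteins by family for quick lookup.
    by_family = {}
    for protein, family in query.items():
        by_family.setdefault(family, set()).add(protein)

    predicted = set()
    for families in reference.values():
        # Every pair of distinct families fused in one reference protein.
        for fam_a, fam_b in combinations(sorted(set(families)), 2):
            for a in by_family.get(fam_a, ()):
                for b in by_family.get(fam_b, ()):
                    predicted.add(frozenset((a, b)))
    return predicted


# Hypothetical toy data: famX and famY are separate genes in the query
# genome but fused into one composite gene in a reference genome.
query_genome = {"q1": "famX", "q2": "famY", "q3": "famZ"}
reference_genome = {"r_fused": ("famX", "famY"), "r_single": ("famZ",)}

pairs = predict_interactions(query_genome, reference_genome)
```

Here only `q1` and `q2` are paired, because their families co-occur in the composite `r_fused`; `q3` has no fused counterpart and receives no prediction.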

12.

An experimental metagenome data management and analysis system.

Markowitz, Victor M; Ivanova, Natalia; Palaniappan, Krishna; Szeto, Ernest; Korzeniewski, Frank; Lykidis, Athanasios; Anderson, Iain; Mavromatis, Konstantinos; Mavrommatis, Konstantinos; Kunin, Victor; Garcia Martin, Hector; Dubchak, Inna; Hugenholtz, Phil; Kyrpides, Nikos C.

Bioinformatics ; 22(14): e359-67, 2006 Jul 15.

Article in English | MEDLINE | ID: mdl-16873494

ABSTRACT

The application of shotgun sequencing to environmental samples has revealed a new universe of microbial community genomes (metagenomes) involving previously uncultured organisms. Metagenome analysis, which is expected to provide a comprehensive picture of the gene functions and metabolic capacity for microbial communities, needs to be conducted in the context of a comprehensive data management and analysis system. We present in this paper IMG/M, an experimental metagenome data management and analysis system that is based on the Integrated Microbial Genomes (IMG) system. IMG/M provides tools and viewers for analyzing both metagenomes and isolate genomes individually or in a comparative context. IMG/M is available at http://img.jgi.doe.gov/m.

Subjects

Bacterial Physiological Phenomena; Bacterial Proteins/physiology; Database Management Systems; Databases, Genetic; Genome, Bacterial/genetics; Models, Biological; Proteome/metabolism; Information Storage and Retrieval/methods; Signal Transduction/physiology; User-Computer Interface

13.

Measuring genome conservation across taxa: divided strains and united kingdoms.

Kunin, Victor; Ahren, Dag; Goldovsky, Leon; Janssen, Paul; Ouzounis, Christos A.

Nucleic Acids Res ; 33(2): 616-21, 2005.

Article in English | MEDLINE | ID: mdl-15681613

ABSTRACT

Species evolutionary relationships have traditionally been defined by sequence similarities of phylogenetic marker molecules, recently followed by whole-genome phylogenies based on gene order, average ortholog similarity or gene content. Here, we introduce genome conservation, a novel metric of evolutionary distances between species that simultaneously takes into account both gene content and sequence similarity at the whole-genome level. Genome conservation represents a robust distance measure, as demonstrated by accurate phylogenetic reconstructions. The genome conservation matrix for all presently sequenced organisms exhibits a remarkable ability to define evolutionary relationships across all taxonomic ranges. An assessment of taxonomic ranks with genome conservation shows that certain ranks are inadequately described and raises the possibility for a more precise and quantitative taxonomy in the future. All phylogenetic reconstructions are available at the genome phylogeny server: .

Subjects

Computational Biology/methods; Genomics/methods; Phylogeny; Bacteria/classification; Bacteria/genetics; Evolution, Molecular; Genome, Bacterial; Proteobacteria/classification; Proteobacteria/genetics

14.

Expansion of the BioCyc collection of pathway/genome databases to 160 genomes.

Karp, Peter D; Ouzounis, Christos A; Moore-Kochlacs, Caroline; Goldovsky, Leon; Kaipa, Pallavi; Ahrén, Dag; Tsoka, Sophia; Darzentas, Nikos; Kunin, Victor; López-Bigas, Núria.

Nucleic Acids Res ; 33(19): 6083-9, 2005.

Article in English | MEDLINE | ID: mdl-16246909

ABSTRACT

The BioCyc database collection is a set of 160 pathway/genome databases (PGDBs) for most eukaryotic and prokaryotic species whose genomes have been completely sequenced to date. Each PGDB in the BioCyc collection describes the genome and predicted metabolic network of a single organism, inferred from the MetaCyc database, which is a reference source on metabolic pathways from multiple organisms. In addition, each bacterial PGDB includes predicted operons for the corresponding species. The BioCyc collection provides a unique resource for computational systems biology, namely global and comparative analyses of genomes and metabolic networks, and a supplement to the BioCyc resource of curated PGDBs. The Omics viewer available through the BioCyc website allows scientists to visualize combinations of gene expression, proteomics and metabolomics data on the metabolic maps of these organisms. This paper discusses the computational methodology by which the BioCyc collection has been expanded, and presents an aggregate analysis of the collection that includes the range of number of pathways present in these organisms, and the most frequently observed pathways. We seek scientists to adopt and curate individual PGDBs within the BioCyc collection. Only by harnessing the expertise of many scientists can we hope to produce biological databases that accurately reflect the depth and breadth of knowledge that the biomedical research community is producing.

Subjects

Databases, Genetic; Genome; Animals; Computational Biology; Genome, Archaeal; Genome, Bacterial; Genomics; Humans; Metabolism/genetics

15.

A minimal estimate for the gene content of the last universal common ancestor--exobiology from a terrestrial perspective.

Ouzounis, Christos A; Kunin, Victor; Darzentas, Nikos; Goldovsky, Leon.

Res Microbiol ; 157(1): 57-68, 2006.

Article in English | MEDLINE | ID: mdl-16431085

ABSTRACT

Using an algorithm for ancestral state inference of gene content, given a large number of extant genome sequences and a phylogenetic tree, we aim to reconstruct the gene content of the last universal common ancestor (LUCA), a hypothetical life form that presumably was the progenitor of the three domains of life. The method allows for gene loss, previously found to be a major factor in shaping gene content, and thus the estimate of LUCA's gene content appears to be substantially higher than that proposed previously, with a typical number of over 1000 gene families, of which more than 90% are also functionally characterized. More precisely, when only prokaryotes are considered, the number varies between 1006 and 1189 gene families while when eukaryotes are also included, this number increases to between 1344 and 1529 families depending on the underlying phylogenetic tree. Therefore, the common belief that the hypothetical genome of LUCA should resemble those of the smallest extant genomes of obligate parasites is not supported by recent advances in computational genomics. Instead, a fairly complex genome similar to those of free-living prokaryotes, with a variety of functional capabilities including metabolic transformation, information processing, membrane/transport proteins and complex regulation, shared between the three domains of life, emerges as the most likely progenitor of life on Earth, with profound repercussions for planetary exploration and exobiology.

Subjects

Earth, Planet; Evolution, Molecular; Exobiology; Genome; Phylogeny; Algorithms; Gene Transfer, Horizontal

16.

Protein families and TRIBES in genome sequence space.

Enright, Anton J; Kunin, Victor; Ouzounis, Christos A.

Nucleic Acids Res ; 31(15): 4632-8, 2003 Aug 01.

Article in English | MEDLINE | ID: mdl-12888524

ABSTRACT

Accurate detection of protein families allows assignment of protein function and the analysis of functional diversity in complete genomes. Recently, we presented a novel algorithm called TribeMCL for the detection of protein families that is both accurate and efficient. This method allows family analysis to be carried out on a very large scale. Using TribeMCL, we have generated a resource called TRIBES that contains protein family information, comprising annotations, protein sequence alignments and phylogenetic distributions describing 311 257 proteins from 83 completely sequenced genomes. The analysis of at least 60 934 detected protein families reveals that, with the essential families excluded, paralogy levels are similar between prokaryotes, irrespective of genome size. The number of essential families is estimated to be between 366 and 426. We also show that the currently known space of protein families is scale free and discuss the implications of this distribution. In addition, we show that smaller families are often formed by shorter proteins and discuss the reasons for this intriguing pattern. Finally, we analyse the functional diversity of protein families in entire genome sequences. The TRIBES protein family resource is accessible at http://www.ebi.ac.uk/research/cgg/tribes/.

Subjects

Genome , Proteins/classification , Protein Sequence Analysis/methods , Algorithms , Amino Acid Sequence , Cluster Analysis , Protein Databases , Phylogeny , Proteins/chemistry , Proteins/genetics , Sequence Alignment

17.

Clustering the annotation space of proteins.

Kunin, Victor; Ouzounis, Christos A.

BMC Bioinformatics ; 6: 24, 2005 Feb 09.

Article in English | MEDLINE | ID: mdl-15703069

ABSTRACT

BACKGROUND: Current protein clustering methods rely on either sequence or functional similarities between proteins, thereby limiting inferences to one of these areas. RESULTS: Here we report a new approach, named CLAN, which clusters proteins according to both annotation and sequence similarity. This approach is extremely fast, clustering the complete SwissProt database within minutes. It is also accurate, recovering consistent protein families that agree, on average, more than 97% with sequence-based protein families from Pfam. Discrepancies between sequence- and annotation-based clusters were scrutinized and the reasons reported. We demonstrate examples for each of these cases, and thoroughly discuss an example of a propagated error in SwissProt: a vacuolar ATPase subunit M9.2 erroneously annotated as vacuolar ATP synthase subunit H. The CLAN algorithm is available from the authors and the CLAN database is accessible at http://maine.ebi.ac.uk:8000/cgi-bin/clan/ClanSearch.pl. CONCLUSIONS: CLAN creates refined function-and-sequence-specific protein families that can be used for identification and annotation of unknown family members. It also allows easy identification of erroneous annotations by spotting inconsistencies between similarities at the annotation and sequence levels.

Subjects

Computational Biology/methods , Proteins/chemistry , Adenosine Triphosphatases/chemistry , Adenosine Triphosphate/chemistry , Algorithms , Cluster Analysis , Computer Graphics , Factual Databases , Genetic Databases , Protein Databases , False Negative Reactions , Genome , Humans , Information Storage and Retrieval , Internet , Statistical Models , Programming Languages , Protein Folding , Reproducibility of Results , Sequence Alignment , Protein Sequence Analysis , Software , Protein Structural Homology , User-Computer Interface , Vacuolar Proton-Translocating ATPases/chemistry

18.

Experimental factors affecting PCR-based estimates of microbial species richness and evenness.

Engelbrektson, Anna; Kunin, Victor; Wrighton, Kelly C; Zvenigorodsky, Natasha; Chen, Feng; Ochman, Howard; Hugenholtz, Philip.

ISME J ; 4(5): 642-7, 2010 May.

Article in English | MEDLINE | ID: mdl-20090784

ABSTRACT

Pyrosequencing of 16S rRNA gene amplicons for microbial community profiling can, for equivalent costs, yield more than two orders of magnitude more sensitivity than traditional PCR cloning and Sanger sequencing. With this increased sensitivity and the ability to analyze multiple samples in parallel, it has become possible to evaluate several technical aspects of PCR-based community structure profiling methods. We tested the effect of amplicon length and primer pair on estimates of species richness (number of species) and evenness (relative abundance of species) by assessing the potentially tractable microbial community residing in the termite hindgut. Two regions of the 16S rRNA gene were sequenced from one of two common priming sites, spanning the V1-V2 or V8 regions, using amplicons ranging in length from 352 to 1443 bp. Our results show that both amplicon length and primer pair markedly influence estimates of richness and evenness. However, estimates of species evenness are consistent among different primer pairs targeting the same region. These results highlight the importance of experimental methodology when comparing diversity estimates across communities.

Subjects

Bacteria/classification , Bacterial Typing Techniques/methods , Isoptera/microbiology , Polymerase Chain Reaction , DNA Sequence Analysis/methods , Animals , Bacteria/genetics , Bacterial Typing Techniques/economics , DNA Primers , Bacterial DNA/genetics , 16S Ribosomal RNA/genetics , DNA Sequence Analysis/economics

19.

Genome analysis of the anaerobic thermohalophilic bacterium Halothermothrix orenii.

Mavromatis, Konstantinos; Ivanova, Natalia; Anderson, Iain; Lykidis, Athanasios; Hooper, Sean D; Sun, Hui; Kunin, Victor; Lapidus, Alla; Hugenholtz, Philip; Patel, Bharat; Kyrpides, Nikos C.

PLoS One ; 4(1): e4192, 2009.

Article in English | MEDLINE | ID: mdl-19145256

ABSTRACT

Halothermothrix orenii is a strictly anaerobic thermohalophilic bacterium isolated from sediment of a Tunisian salt lake. It belongs to the order Halanaerobiales in the phylum Firmicutes. The complete sequence revealed that the genome consists of one circular chromosome of 2,578,146 bp encoding 2451 predicted genes. This is the first genome sequence of an organism belonging to the Halanaerobiales. Features of both Gram-positive and Gram-negative bacteria were identified, the most prominent being a sporulation mechanism typical of Firmicutes and a characteristic Gram-negative lipopolysaccharide. Protein sequence analyses and metabolic reconstruction reveal a unique combination of strategies for thermophilic and halophilic adaptation. H. orenii can serve as a model organism for studying the evolution of the Gram-negative phenotype and adaptation to thermohalophilic conditions, and for developing biotechnological applications that require high temperatures and high salt concentrations.

Subjects

Anaerobic Bacteria/genetics , Bacterial Genome , Halobacteriales/genetics , Circular DNA/genetics , Gram-Negative Bacteria/genetics , Halogens , Hot Temperature , Lipopolysaccharides , Water Microbiology

20.

CRISPR--a widespread system that provides acquired resistance against phages in bacteria and archaea.

Sorek, Rotem; Kunin, Victor; Hugenholtz, Philip.

Nat Rev Microbiol ; 6(3): 181-6, 2008 Mar.

Article in English | MEDLINE | ID: mdl-18157154

ABSTRACT

Arrays of clustered, regularly interspaced short palindromic repeats (CRISPRs) are widespread in the genomes of many bacteria and almost all archaea. These arrays are composed of direct repeats that are separated by similarly sized non-repetitive spacers. CRISPR arrays, together with a group of associated proteins, confer resistance to phages, possibly by an RNA-interference-like mechanism. This Progress discusses the structure and function of this newly recognized antiviral mechanism.

Subjects

Archaea/genetics , Bacteria/genetics , Interspersed Repetitive Sequences/physiology , Archaea/virology , Bacteria/virology , Bacterial Proteins/genetics , Bacteriophages/physiology , Intergenic DNA , Gene Silencing , Archaeal Genome , Bacterial Genome , Multigene Family/genetics , Viral Interference
