Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 36
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Microb Ecol ; 65(3): 709-19, 2013 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-23233090

RESUMO

Next-generation sequencing has increased the coverage of microbial diversity surveys by orders of magnitude, but differentiating artifacts from rare environmental sequences remains a challenge. Clustering 16S rRNA sequences into operational taxonomic units (OTUs) organizes sequence data into groups of 97 % identity, helping to reduce data volumes and avoid analyzing sequencing artifacts by grouping them with real sequences. Here, we analyze sequence abundance distributions across environmental samples and show that 16S rRNA sequences of >99 % identity can represent functionally distinct microorganisms, rendering OTU clustering problematic when the goal is an accurate analysis of organism distribution. Strict postsequencing quality control (QC) filters eliminated the most prevalent artifacts without clustering. Further experiments proved that DNA polymerase errors in polymerase chain reaction (PCR) generate a significant number of substitution errors, most of which pass QC filters. Based on our findings, we recommend minimizing the number of PCR cycles in DNA library preparation and applying strict postsequencing QC filters to reduce the most prevalent artifacts while maintaining a high level of accuracy in diversity estimates. We further recommend correlating rare and abundant sequences across environmental samples, rather than clustering into OTUs, to identify remaining sequence artifacts without losing the resolution afforded by high-throughput sequencing.


Assuntos
Actinomycetales/genética , Biodiversidade , Reação em Cadeia da Polimerase/normas , Actinomycetales/classificação , Actinomycetales/isolamento & purificação , Primers do DNA/genética , DNA Bacteriano/genética , Sequenciamento de Nucleotídeos em Larga Escala , Reação em Cadeia da Polimerase/métodos , RNA Ribossômico 16S/genética
2.
Nature ; 488(7409): 86-90, 2012 Aug 02.
Artigo em Inglês | MEDLINE | ID: mdl-22859206

RESUMO

Land plants associate with a root microbiota distinct from the complex microbial community present in surrounding soil. The microbiota colonizing the rhizosphere (immediately surrounding the root) and the endophytic compartment (within the root) contribute to plant growth, productivity, carbon sequestration and phytoremediation. Colonization of the root occurs despite a sophisticated plant immune system, suggesting finely tuned discrimination of mutualists and commensals from pathogens. Genetic principles governing the derivation of host-specific endophyte communities from soil communities are poorly understood. Here we report the pyrosequencing of the bacterial 16S ribosomal RNA gene of more than 600 Arabidopsis thaliana plants to test the hypotheses that the root rhizosphere and endophytic compartment microbiota of plants grown under controlled conditions in natural soils are sufficiently dependent on the host to remain consistent across different soil types and developmental stages, and sufficiently dependent on host genotype to vary between inbred Arabidopsis accessions. We describe different bacterial communities in two geochemically distinct bulk soils and in rhizosphere and endophytic compartments prepared from roots grown in these soils. The communities in each compartment are strongly influenced by soil type. Endophytic compartments from both soils feature overlapping, low-complexity communities that are markedly enriched in Actinobacteria and specific families from other phyla, notably Proteobacteria. Some bacteria vary quantitatively between plants of different developmental stage and genotype. Our rigorous definition of an endophytic compartment microbiome should facilitate controlled dissection of plant-microbe interactions derived from complex soil communities.


Assuntos
Arabidopsis/microbiologia , Endófitos/classificação , Endófitos/isolamento & purificação , Metagenoma , Raízes de Plantas/microbiologia , Microbiologia do Solo , Actinobacteria/genética , Actinobacteria/isolamento & purificação , Arabidopsis/classificação , Arabidopsis/crescimento & desenvolvimento , Endófitos/genética , Genótipo , Hibridização in Situ Fluorescente , Raízes de Plantas/classificação , Raízes de Plantas/crescimento & desenvolvimento , Proteobactérias/genética , Proteobactérias/isolamento & purificação , RNA Ribossômico 16S/genética , RNA Ribossômico 16S/isolamento & purificação , Rizosfera , Ribotipagem , Análise de Sequência de DNA , Simbiose
3.
Environ Microbiol ; 12(5): 1205-17, 2010 May.
Artigo em Inglês | MEDLINE | ID: mdl-20148930

RESUMO

Here we report the first metatranscriptomic analysis of gene expression and regulation of 'Candidatus Accumulibacter'-enriched lab-scale sludge during enhanced biological phosphorus removal (EBPR). Medium density oligonucleotide microarrays were generated with probes targeting most predicted genes hypothesized to be important for the EBPR phenotype. RNA samples were collected at the early stage of anaerobic and aerobic phases (15 min after acetate addition and switching to aeration respectively). We detected the expression of a number of genes involved in the carbon and phosphate metabolisms, as proposed by EBPR models (e.g. polyhydroxyalkanoate synthesis, a split TCA cycle through methylmalonyl-CoA pathway, and polyphosphate formation), as well as novel genes discovered through metagenomic analysis. The comparison between the early stage anaerobic and aerobic gene expression profiles showed that expression levels of most genes were not significantly different between the two stages. The majority of upregulated genes in the aerobic sample are predicted to encode functions such as transcription, translation and protein translocation, reflecting the rapid growth phase of Accumulibacter shortly after being switched to aerobic conditions. Components of the TCA cycle and machinery involved in ATP synthesis were also upregulated during the early aerobic phase. These findings support the predictions of EBPR metabolic models that the oxidation of intracellularly stored carbon polymers through the TCA cycle provides ATP for cell growth when oxygen becomes available. Nitrous oxide reductase was among the very few Accumulibacter genes upregulated in the anaerobic sample, suggesting that its expression is likely induced by the deprivation of oxygen.


Assuntos
Proteínas de Bactérias/metabolismo , Betaproteobacteria/metabolismo , Perfilação da Expressão Gênica , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Fósforo/metabolismo , Esgotos/microbiologia , Aerobiose , Anaerobiose , Proteínas de Bactérias/genética , Betaproteobacteria/genética , Betaproteobacteria/crescimento & desenvolvimento , Biodegradação Ambiental , Regulação da Expressão Gênica , Metagenômica , RNA Bacteriano/análise , RNA Bacteriano/genética , RNA Bacteriano/isolamento & purificação
4.
ISME J ; 4(5): 642-7, 2010 May.
Artigo em Inglês | MEDLINE | ID: mdl-20090784

RESUMO

Pyrosequencing of 16S rRNA gene amplicons for microbial community profiling can, for equivalent costs, yield more than two orders of magnitude more sensitivity than traditional PCR cloning and Sanger sequencing. With this increased sensitivity and the ability to analyze multiple samples in parallel, it has become possible to evaluate several technical aspects of PCR-based community structure profiling methods. We tested the effect of amplicon length and primer pair on estimates of species richness (number of species) and evenness (relative abundance of species) by assessing the potentially tractable microbial community residing in the termite hindgut. Two regions of the 16S rRNA gene were sequenced from one of two common priming sites, spanning the V1-V2 or V8 regions, using amplicons ranging in length from 352 to 1443 bp. Our results show that both amplicon length and primer pair markedly influence estimates of richness and evenness. However, estimates of species evenness are consistent among different primer pairs targeting the same region. These results highlight the importance of experimental methodology when comparing diversity estimates across communities.


Assuntos
Bactérias/classificação , Técnicas de Tipagem Bacteriana/métodos , Isópteros/microbiologia , Reação em Cadeia da Polimerase , Análise de Sequência de DNA/métodos , Animais , Bactérias/genética , Técnicas de Tipagem Bacteriana/economia , Primers do DNA , DNA Bacteriano/genética , RNA Ribossômico 16S/genética , Análise de Sequência de DNA/economia
5.
Environ Microbiol ; 12(1): 118-23, 2010 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-19725865

RESUMO

Massively parallel pyrosequencing of the small subunit (16S) ribosomal RNA gene has revealed that the extent of rare microbial populations in several environments, the 'rare biosphere', is orders of magnitude higher than previously thought. One important caveat with this method is that sequencing error could artificially inflate diversity estimates. Although the per-base error of 16S rDNA amplicon pyrosequencing has been shown to be as good as or lower than Sanger sequencing, no direct assessments of pyrosequencing errors on diversity estimates have been reported. Using only Escherichia coli MG1655 as a reference template, we find that 16S rDNA diversity is grossly overestimated unless relatively stringent read quality filtering and low clustering thresholds are applied. In particular, the common practice of removing reads with unresolved bases and anomalous read lengths is insufficient to ensure accurate estimates of microbial diversity. Furthermore, common and reproducible homopolymer length errors can result in relatively abundant spurious phylotypes further confounding data interpretation. We suggest that stringent quality-based trimming of 16S pyrotags and clustering thresholds no greater than 97% identity should be used to avoid overestimates of the rare biosphere.


Assuntos
Biodiversidade , RNA Ribossômico 16S/genética , Análise de Sequência de DNA/métodos , Análise por Conglomerados , DNA Bacteriano/genética , Escherichia coli/genética , Genes Bacterianos , Variação Genética , Alinhamento de Sequência
6.
Nature ; 462(7276): 1056-60, 2009 Dec 24.
Artigo em Inglês | MEDLINE | ID: mdl-20033048

RESUMO

Sequencing of bacterial and archaeal genomes has revolutionized our understanding of the many roles played by microorganisms. There are now nearly 1,000 completed bacterial and archaeal genomes available, most of which were chosen for sequencing on the basis of their physiology. As a result, the perspective provided by the currently available genomes is limited by a highly biased phylogenetic distribution. To explore the value added by choosing microbial genomes for sequencing on the basis of their evolutionary relationships, we have sequenced and analysed the genomes of 56 culturable species of Bacteria and Archaea selected to maximize phylogenetic coverage. Analysis of these genomes demonstrated pronounced benefits (compared to an equivalent set of genomes randomly selected from the existing database) in diverse areas including the reconstruction of phylogenetic history, the discovery of new protein families and biological properties, and the prediction of functions for known genes from other organisms. Our results strongly support the need for systematic 'phylogenomic' efforts to compile a phylogeny-driven 'Genomic Encyclopedia of Bacteria and Archaea' in order to derive maximum knowledge from existing microbial genome data as well as from genome sequences to come.


Assuntos
Archaea/classificação , Archaea/genética , Bactérias/classificação , Bactérias/genética , Genoma Arqueal/genética , Genoma Bacteriano/genética , Filogenia , Actinas/química , Sequência de Aminoácidos , Proteínas de Bactérias/química , Biodiversidade , Bases de Dados Genéticas , Genes de RNAr/genética , Modelos Moleculares , Dados de Sequência Molecular , Estrutura Terciária de Proteína , Alinhamento de Sequência
7.
PLoS One ; 4(1): e4192, 2009.
Artigo em Inglês | MEDLINE | ID: mdl-19145256

RESUMO

Halothermothirx orenii is a strictly anaerobic thermohalophilic bacterium isolated from sediment of a Tunisian salt lake. It belongs to the order Halanaerobiales in the phylum Firmicutes. The complete sequence revealed that the genome consists of one circular chromosome of 2578146 bps encoding 2451 predicted genes. This is the first genome sequence of an organism belonging to the Haloanaerobiales. Features of both Gram positive and Gram negative bacteria were identified with the presence of both a sporulating mechanism typical of Firmicutes and a characteristic Gram negative lipopolysaccharide being the most prominent. Protein sequence analyses and metabolic reconstruction reveal a unique combination of strategies for thermophilic and halophilic adaptation. H. orenii can serve as a model organism for the study of the evolution of the Gram negative phenotype as well as the adaptation under thermohalophilic conditions and the development of biotechnological applications under conditions that require high temperatures and high salt concentrations.


Assuntos
Bactérias Anaeróbias/genética , Genoma Bacteriano , Halobacteriales/genética , DNA Circular/genética , Bactérias Gram-Negativas/genética , Halogênios , Temperatura Alta , Lipopolissacarídeos , Microbiologia da Água
8.
Microbiol Mol Biol Rev ; 72(4): 557-78, Table of Contents, 2008 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-19052320

RESUMO

As random shotgun metagenomic projects proliferate and become the dominant source of publicly available sequence data, procedures for the best practices in their execution and analysis become increasingly important. Based on our experience at the Joint Genome Institute, we describe the chain of decisions accompanying a metagenomic project from the viewpoint of the bioinformatic analysis step by step. We guide the reader through a standard workflow for a metagenomic project beginning with presequencing considerations such as community composition and sequence data type that will greatly influence downstream analyses. We proceed with recommendations for sampling and data generation including sample and metadata collection, community profiling, construction of shotgun libraries, and sequencing strategies. We then discuss the application of generic sequence processing steps (read preprocessing, assembly, and gene prediction and annotation) to metagenomic data sets in contrast to genome projects. Different types of data analyses particular to metagenomes are then presented, including binning, dominant population analysis, and gene-centric analysis. Finally, data management issues are presented and discussed. We hope that this review will assist bioinformaticians and biologists in making better-informed decisions on their journey during a metagenomic project.


Assuntos
Sistemas de Gerenciamento de Base de Dados , Genoma/genética , Genômica/métodos , Armazenamento e Recuperação da Informação/métodos , Bases de Dados Genéticas , Microbiologia Ambiental , Genômica/normas , Análise de Sequência de DNA
9.
Mol Syst Biol ; 4: 198, 2008.
Artigo em Inglês | MEDLINE | ID: mdl-18523433

RESUMO

To investigate the extent of genetic stratification in structured microbial communities, we compared the metagenomes of 10 successive layers of a phylogenetically complex hypersaline mat from Guerrero Negro, Mexico. We found pronounced millimeter-scale genetic gradients that were consistent with the physicochemical profile of the mat. Despite these gradients, all layers displayed near-identical and acid-shifted isoelectric point profiles due to a molecular convergence of amino-acid usage, indicating that hypersalinity enforces an overriding selective pressure on the mat community.


Assuntos
Genética Microbiana , Salinidade , Seleção Genética , Aminoácidos/metabolismo , México
10.
Proc Natl Acad Sci U S A ; 105(23): 8102-7, 2008 Jun 10.
Artigo em Inglês | MEDLINE | ID: mdl-18535141

RESUMO

The candidate division Korarchaeota comprises a group of uncultivated microorganisms that, by their small subunit rRNA phylogeny, may have diverged early from the major archaeal phyla Crenarchaeota and Euryarchaeota. Here, we report the initial characterization of a member of the Korarchaeota with the proposed name, "Candidatus Korarchaeum cryptofilum," which exhibits an ultrathin filamentous morphology. To investigate possible ancestral relationships between deep-branching Korarchaeota and other phyla, we used whole-genome shotgun sequencing to construct a complete composite korarchaeal genome from enriched cells. The genome was assembled into a single contig 1.59 Mb in length with a G + C content of 49%. Of the 1,617 predicted protein-coding genes, 1,382 (85%) could be assigned to a revised set of archaeal Clusters of Orthologous Groups (COGs). The predicted gene functions suggest that the organism relies on a simple mode of peptide fermentation for carbon and energy and lacks the ability to synthesize de novo purines, CoA, and several other cofactors. Phylogenetic analyses based on conserved single genes and concatenated protein sequences positioned the korarchaeote as a deep archaeal lineage with an apparent affinity to the Crenarchaeota. However, the predicted gene content revealed that several conserved cellular systems, such as cell division, DNA replication, and tRNA maturation, resemble the counterparts in the Euryarchaeota. In light of the known composition of archaeal genomes, the Korarchaeota might have retained a set of cellular features that represents the ancestral archaeal form.


Assuntos
Evolução Biológica , Genoma Arqueal/genética , Korarchaeota/genética , Ciclo Celular , Replicação do DNA , Metabolismo Energético , Evolução Molecular , Korarchaeota/citologia , Korarchaeota/ultraestrutura , Filogenia , Biossíntese de Proteínas , Análise de Sequência de DNA , Transcrição Gênica
11.
Genome Res ; 18(2): 293-7, 2008 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-18077539

RESUMO

Using a combination of bacterial and phage-targeted metagenomics, we analyzed two geographically remote sludge bioreactors enriched in a single bacterial species Candidatus Accumulibacter phosphatis (CAP). We inferred unrestricted global movement of this species and identified aquatic ecosystems as the primary environmental reservoirs facilitating dispersal. Highly related and geographically remote CAP strains differed principally in genomic regions encoding phage defense mechanisms. We found that CAP populations were high density, clonal, and nonrecombining, providing natural targets for "kill-the-winner" phage predation. Community expression analysis demonstrated that phages were consistently active in the bioreactor community. Genomic signatures linking CAP to past phage exposures were observed mostly between local phage and host. We conclude that CAP strains disperse globally but must adapt to phage predation pressure locally.


Assuntos
Adaptação Biológica/genética , Bactérias/genética , Bactérias/virologia , Bacteriófagos/genética , Genética Populacional , Filogenia , Esgotos/microbiologia , Sequência de Bases , Biologia Computacional/métodos , Demografia , Componentes do Gene , Perfilação da Expressão Gênica , Análise em Microsséries , Dados de Sequência Molecular , Dinâmica Populacional , Queensland , Análise de Sequência de DNA , Wisconsin
12.
Nat Rev Microbiol ; 6(3): 181-6, 2008 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-18157154

RESUMO

Arrays of clustered, regularly interspaced short palindromic repeats (CRISPRs) are widespread in the genomes of many bacteria and almost all archaea. These arrays are composed of direct repeats that are separated by similarly sized non-repetitive spacers. CRISPR arrays, together with a group of associated proteins, confer resistance to phages, possibly by an RNA-interference-like mechanism. This Progress discusses the structure and function of this newly recognized antiviral mechanism.


Assuntos
Archaea/genética , Bactérias/genética , Sequências Repetitivas Dispersas/fisiologia , Archaea/virologia , Bactérias/virologia , Proteínas de Bactérias/genética , Bacteriófagos/fisiologia , DNA Intergênico , Inativação Gênica , Genoma Arqueal , Genoma Bacteriano , Família Multigênica/genética , Interferência Viral
13.
BMC Genomics ; 8: 460, 2007 Dec 14.
Artigo em Inglês | MEDLINE | ID: mdl-18081932

RESUMO

BACKGROUND: Gene fusion detection - also known as the 'Rosetta Stone' method - involves the identification of fused composite genes in a set of reference genomes, which indicates potential interactions between its un-fused counterpart genes in query genomes. The precision of this method typically improves with an ever-increasing number of reference genomes. RESULTS: In order to explore the usefulness and scope of this approach for protein interaction prediction and generate a high-quality, non-redundant set of interacting pairs of proteins across a wide taxonomic range, we have exhaustively performed gene fusion analysis for 184 genomes using an efficient variant of a previously developed protocol. By analyzing interaction graphs and applying a threshold that limits the maximum number of possible interactions within the largest graph components, we show that we can reduce the number of implausible interactions due to the detection of promiscuous domains. With this generally applicable approach, we generate a robust set of over 2 million distinct and testable interactions encompassing 696,894 proteins in 184 species or strains, most of which have never been the subject of high-throughput experimental proteomics. We investigate the cumulative effect of increasing numbers of genomes on the fidelity and quantity of predictions, and show that, for large numbers of genomes, predictions do not become saturated but continue to grow linearly, for the majority of the species. We also examine the percentage of component (and composite) proteins with relation to the number of genes and further validate the functional categories that are highly represented in this robust set of detected genome-wide interactions. CONCLUSION: We illustrate the phylogenetic and functional diversity of gene fusion events across genomes, and their usefulness for accurate prediction of protein interaction and function.


Assuntos
Fusão Gênica , Redes Reguladoras de Genes , Arabidopsis/genética , Proteínas de Bactérias/metabolismo , Chlamydia/genética , Variação Genética , Genoma , Filogenia , Proteínas de Plantas/metabolismo , Ligação Proteica , Reprodutibilidade dos Testes
14.
Nature ; 450(7169): 560-5, 2007 Nov 22.
Artigo em Inglês | MEDLINE | ID: mdl-18033299

RESUMO

From the standpoints of both basic research and biotechnology, there is considerable interest in reaching a clearer understanding of the diversity of biological mechanisms employed during lignocellulose degradation. Globally, termites are an extremely successful group of wood-degrading organisms and are therefore important both for their roles in carbon turnover in the environment and as potential sources of biochemical catalysts for efforts aimed at converting wood into biofuels. Only recently have data supported any direct role for the symbiotic bacteria in the gut of the termite in cellulose and xylan hydrolysis. Here we use a metagenomic analysis of the bacterial community resident in the hindgut paunch of a wood-feeding 'higher' Nasutitermes species (which do not contain cellulose-fermenting protozoa) to show the presence of a large, diverse set of bacterial genes for cellulose and xylan hydrolysis. Many of these genes were expressed in vivo or had cellulase activity in vitro, and further analyses implicate spirochete and fibrobacter species in gut lignocellulose degradation. New insights into other important symbiotic functions including H2 metabolism, CO2-reductive acetogenesis and N2 fixation are also provided by this first system-wide gene analysis of a microbial community specialized towards plant lignocellulose degradation. Our results underscore how complex even a 1-microl environment can be.


Assuntos
Bactérias/metabolismo , Genoma Bacteriano/genética , Genômica , Intestinos/microbiologia , Isópteros/metabolismo , Isópteros/microbiologia , Madeira/metabolismo , Animais , Bactérias/enzimologia , Bactérias/genética , Bactérias/isolamento & purificação , Fontes de Energia Bioelétrica , Carbono/metabolismo , Domínio Catalítico , Celulose/metabolismo , Costa Rica , Genes Bacterianos/genética , Glicosídeo Hidrolases/química , Glicosídeo Hidrolases/genética , Glicosídeo Hidrolases/metabolismo , Hidrólise , Lignina/metabolismo , Modelos Biológicos , Dados de Sequência Molecular , Reação em Cadeia da Polimerase , Simbiose , Madeira/química , Xilanos/metabolismo
15.
Genome Biol ; 8(4): R61, 2007.
Artigo em Inglês | MEDLINE | ID: mdl-17442114

RESUMO

BACKGROUND: Clustered regularly interspaced short palindromic repeats (CRISPRs) are a novel class of direct repeats, separated by unique spacer sequences of similar length, that are present in approximately 40% of bacterial and most archaeal genomes analyzed to date. More than 40 gene families, called CRISPR-associated sequences (CASs), appear in conjunction with these repeats and are thought to be involved in the propagation and functioning of CRISPRs. It has been recently shown that CRISPR provides acquired resistance against viruses in prokaryotes. RESULTS: Here we analyze CRISPR repeats identified in 195 microbial genomes and show that they can be organized into multiple clusters based on sequence similarity. Some of the clusters present stable, highly conserved RNA secondary structures, while others lack detectable structures. Stable secondary structures exhibit multiple compensatory base changes in the stem region, indicating evolutionary and functional conservation. CONCLUSION: We show that the repeat-based classification corresponds to, and expands upon, a previously reported CAS gene-based classification, including specific relationships between CRISPR and CAS subtypes.


Assuntos
Archaea/genética , Bactérias/genética , Evolução Molecular , RNA Arqueal/química , RNA Bacteriano/química , Sequências Repetitivas de Ácido Nucleico , Sequência de Bases , Análise por Conglomerados , Sequência Conservada , Genoma Bacteriano , Família Multigênica , Conformação de Ácido Nucleico , Análise de Sequência de RNA , Software
16.
Bioinformatics ; 23(6): 764-6, 2007 Mar 15.
Artigo em Inglês | MEDLINE | ID: mdl-17234642

RESUMO

UNLABELLED: We describe a general multiplatform exploratory tool called TreeQ-Vista, designed for presenting functional annotations in a phylogenetic context. Traits, such as phenotypic and genomic properties, are interactively queried from a user-provided relational database with a user-friendly interface which provides a set of tools for users with or without SQL knowledge. The query results are projected onto a phylogenetic tree and can be displayed in multiple color groups. A rich set of browsing, grouping and query tools are provided to facilitate trait exploration, comparison and analysis. AVAILABILITY: The program, detailed tutorial and examples are available online (http:/genome.lbl.gov/vista/TreeQVista).


Assuntos
Mapeamento Cromossômico/métodos , Bases de Dados Genéticas , Evolução Molecular , Armazenamento e Recuperação da Informação/métodos , Modelos Genéticos , Software , Interface Usuário-Computador , Gráficos por Computador , Simulação por Computador , Sistemas de Gerenciamento de Base de Dados , Filogenia
17.
Nat Biotechnol ; 24(10): 1263-9, 2006 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-16998472

RESUMO

Enhanced biological phosphorus removal (EBPR) is one of the best-studied microbially mediated industrial processes because of its ecological and economic relevance. Despite this, it is not well understood at the metabolic level. Here we present a metagenomic analysis of two lab-scale EBPR sludges dominated by the uncultured bacterium, "Candidatus Accumulibacter phosphatis." The analysis sheds light on several controversies in EBPR metabolic models and provides hypotheses explaining the dominance of A. phosphatis in this habitat, its lifestyle outside EBPR and probable cultivation requirements. Comparison of the same species from different EBPR sludges highlights recent evolutionary dynamics in the A. phosphatis genome that could be linked to mechanisms for environmental adaptation. In spite of an apparent lack of phylogenetic overlap in the flanking communities of the two sludges studied, common functional themes were found, at least one of them complementary to the inferred metabolism of the dominant organism. The present study provides a much needed blueprint for a systems-level understanding of EBPR and illustrates that metagenomics enables detailed, often novel, insights into even well-studied biological systems.


Assuntos
Betaproteobacteria/genética , Betaproteobacteria/metabolismo , Genoma Bacteriano , Fósforo/metabolismo , Esgotos/microbiologia , Adaptação Biológica , Fósforo/isolamento & purificação , Eliminação de Resíduos Líquidos
18.
Bioinformatics ; 22(14): e359-67, 2006 Jul 15.
Artigo em Inglês | MEDLINE | ID: mdl-16873494

RESUMO

The application of shotgun sequencing to environmental samples has revealed a new universe of microbial community genomes (metagenomes) involving previously uncultured organisms. Metagenome analysis, which is expected to provide a comprehensive picture of the gene functions and metabolic capacity for microbial communities, needs to be conducted in the context of a comprehensive data management and analysis system. We present in this paper IMG/M, an experimental metagenome data management and analysis system that is based on the Integrated Microbial Genomes (IMG) system. IMG/M provides tools and viewers for analyzing both metagenomes and isolate genomes individually or in a comparative context. IMG/M is available at http://img.jgi.doe.gov/m.


Assuntos
Fenômenos Fisiológicos Bacterianos , Proteínas de Bactérias/fisiologia , Sistemas de Gerenciamento de Base de Dados , Bases de Dados Genéticas , Genoma Bacteriano/genética , Modelos Biológicos , Proteoma/metabolismo , Armazenamento e Recuperação da Informação/métodos , Transdução de Sinais/fisiologia , Interface Usuário-Computador
19.
Res Microbiol ; 157(1): 57-68, 2006.
Artigo em Inglês | MEDLINE | ID: mdl-16431085

RESUMO

Using an algorithm for ancestral state inference of gene content, given a large number of extant genome sequences and a phylogenetic tree, we aim to reconstruct the gene content of the last universal common ancestor (LUCA), a hypothetical life form that presumably was the progenitor of the three domains of life. The method allows for gene loss, previously found to be a major factor in shaping gene content, and thus the estimate of LUCA's gene content appears to be substantially higher than that proposed previously, with a typical number of over 1000 gene families, of which more than 90% are also functionally characterized. More precisely, when only prokaryotes are considered, the number varies between 1006 and 1189 gene families while when eukaryotes are also included, this number increases to between 1344 and 1529 families depending on the underlying phylogenetic tree. Therefore, the common belief that the hypothetical genome of LUCA should resemble those of the smallest extant genomes of obligate parasites is not supported by recent advances in computational genomics. Instead, a fairly complex genome similar to those of free-living prokaryotes, with a variety of functional capabilities including metabolic transformation, information processing, membrane/transport proteins and complex regulation, shared between the three domains of life, emerges as the most likely progenitor of life on Earth, with profound repercussions for planetary exploration and exobiology.


Assuntos
Planeta Terra , Evolução Molecular , Exobiologia , Genoma , Filogenia , Algoritmos , Transferência Genética Horizontal
20.
Nucleic Acids Res ; 33(19): 6083-9, 2005.
Artigo em Inglês | MEDLINE | ID: mdl-16246909

RESUMO

The BioCyc database collection is a set of 160 pathway/genome databases (PGDBs) for most eukaryotic and prokaryotic species whose genomes have been completely sequenced to date. Each PGDB in the BioCyc collection describes the genome and predicted metabolic network of a single organism, inferred from the MetaCyc database, which is a reference source on metabolic pathways from multiple organisms. In addition, each bacterial PGDB includes predicted operons for the corresponding species. The BioCyc collection provides a unique resource for computational systems biology, namely global and comparative analyses of genomes and metabolic networks, and a supplement to the BioCyc resource of curated PGDBs. The Omics viewer available through the BioCyc website allows scientists to visualize combinations of gene expression, proteomics and metabolomics data on the metabolic maps of these organisms. This paper discusses the computational methodology by which the BioCyc collection has been expanded, and presents an aggregate analysis of the collection that includes the range of number of pathways present in these organisms, and the most frequently observed pathways. We seek scientists to adopt and curate individual PGDBs within the BioCyc collection. Only by harnessing the expertise of many scientists we can hope to produce biological databases, which accurately reflect the depth and breadth of knowledge that the biomedical research community is producing.


Assuntos
Bases de Dados Genéticas , Genoma , Animais , Biologia Computacional , Genoma Arqueal , Genoma Bacteriano , Genômica , Humanos , Metabolismo/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...