Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 58
Filtrar
1.
Bioinformatics ; 35(21): 4402-4404, 2019 11 01.
Artigo em Inglês | MEDLINE | ID: mdl-31086982

RESUMO

SUMMARY: To address the need for improved phage annotation tools that scale, we created an automated throughput annotation pipeline: multiple-genome Phage Annotation Toolkit and Evaluator (multiPhATE). multiPhATE is a throughput pipeline driver that invokes an annotation pipeline (PhATE) across a user-specified set of phage genomes. This tool incorporates a de novo phage gene calling algorithm and assigns putative functions to gene calls using protein-, virus- and phage-centric databases. multiPhATE's modular construction allows the user to implement all or any portion of the analyses by acquiring local instances of the desired databases and specifying the desired analyses in a configuration file. We demonstrate multiPhATE by annotating two newly sequenced Yersinia pestis phage genomes. Within multiPhATE, the PhATE processing pipeline can be readily implemented across multiple processors, making it adaptable for throughput sequencing projects. Software documentation assists the user in configuring the system. AVAILABILITY AND IMPLEMENTATION: multiPhATE was implemented in Python 3.7, and runs as a command-line code under Linux or Unix. multiPhATE is freely available under an open-source BSD3 license from https://github.com/carolzhou/multiPhATE. Instructions for acquiring the databases and third-party codes used by multiPhATE are included in the distribution README file. Users may report bugs by submitting to the github issues page associated with the multiPhATE distribution. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Bacteriófagos , Biologia Computacional , Algoritmos , Genoma , Software
2.
Nature ; 499(7459): 431-7, 2013 Jul 25.
Artigo em Inglês | MEDLINE | ID: mdl-23851394

RESUMO

Genome sequencing enhances our understanding of the biological world by providing blueprints for the evolutionary and functional diversity that shapes the biosphere. However, microbial genomes that are currently available are of limited phylogenetic breadth, owing to our historical inability to cultivate most microorganisms in the laboratory. We apply single-cell genomics to target and sequence 201 uncultivated archaeal and bacterial cells from nine diverse habitats belonging to 29 major mostly uncharted branches of the tree of life, so-called 'microbial dark matter'. With this additional genomic information, we are able to resolve many intra- and inter-phylum-level relationships and to propose two new superphyla. We uncover unexpected metabolic features that extend our understanding of biology and challenge established boundaries between the three domains of life. These include a novel amino acid use for the opal stop codon, an archaeal-type purine synthesis in Bacteria and complete sigma factors in Archaea similar to those in Bacteria. The single-cell genomes also served to phylogenetically anchor up to 20% of metagenomic reads in some habitats, facilitating organism-level interpretation of ecosystem function. This study greatly expands the genomic representation of the tree of life and provides a systematic step towards a better understanding of biological evolution on our planet.


Assuntos
Archaea/classificação , Archaea/genética , Bactérias/classificação , Bactérias/genética , Metagenômica , Filogenia , Archaea/isolamento & purificação , Archaea/metabolismo , Bactérias/isolamento & purificação , Bactérias/metabolismo , Ecossistema , Genoma Arqueal/genética , Genoma Bacteriano/genética , Metagenoma/genética , Dados de Sequência Molecular , Análise de Sequência de DNA , Análise de Célula Única
3.
Nature ; 488(7409): 86-90, 2012 Aug 02.
Artigo em Inglês | MEDLINE | ID: mdl-22859206

RESUMO

Land plants associate with a root microbiota distinct from the complex microbial community present in surrounding soil. The microbiota colonizing the rhizosphere (immediately surrounding the root) and the endophytic compartment (within the root) contribute to plant growth, productivity, carbon sequestration and phytoremediation. Colonization of the root occurs despite a sophisticated plant immune system, suggesting finely tuned discrimination of mutualists and commensals from pathogens. Genetic principles governing the derivation of host-specific endophyte communities from soil communities are poorly understood. Here we report the pyrosequencing of the bacterial 16S ribosomal RNA gene of more than 600 Arabidopsis thaliana plants to test the hypotheses that the root rhizosphere and endophytic compartment microbiota of plants grown under controlled conditions in natural soils are sufficiently dependent on the host to remain consistent across different soil types and developmental stages, and sufficiently dependent on host genotype to vary between inbred Arabidopsis accessions. We describe different bacterial communities in two geochemically distinct bulk soils and in rhizosphere and endophytic compartments prepared from roots grown in these soils. The communities in each compartment are strongly influenced by soil type. Endophytic compartments from both soils feature overlapping, low-complexity communities that are markedly enriched in Actinobacteria and specific families from other phyla, notably Proteobacteria. Some bacteria vary quantitatively between plants of different developmental stage and genotype. Our rigorous definition of an endophytic compartment microbiome should facilitate controlled dissection of plant-microbe interactions derived from complex soil communities.


Assuntos
Arabidopsis/microbiologia , Endófitos/classificação , Endófitos/isolamento & purificação , Metagenoma , Raízes de Plantas/microbiologia , Microbiologia do Solo , Actinobacteria/genética , Actinobacteria/isolamento & purificação , Arabidopsis/classificação , Arabidopsis/crescimento & desenvolvimento , Endófitos/genética , Genótipo , Hibridização in Situ Fluorescente , Raízes de Plantas/classificação , Raízes de Plantas/crescimento & desenvolvimento , Proteobactérias/genética , Proteobactérias/isolamento & purificação , RNA Ribossômico 16S/genética , RNA Ribossômico 16S/isolamento & purificação , Rizosfera , Ribotipagem , Análise de Sequência de DNA , Simbiose
4.
BMC Genomics ; 18(1): 334, 2017 04 28.
Artigo em Inglês | MEDLINE | ID: mdl-28454561

RESUMO

BACKGROUND: Examination of complex biological systems has long been achieved through methodical investigation of the system's individual components. While informative, this strategy often leads to inappropriate conclusions about the system as a whole. With the advent of high-throughput "omic" technologies, however, researchers can now simultaneously analyze an entire system at the level of molecule (DNA, RNA, protein, metabolite) and process (transcription, translation, enzyme catalysis). This strategy reduces the likelihood of improper conclusions, provides a framework for elucidation of genotype-phenotype relationships, and brings finer resolution to comparative genomic experiments. Here, we apply a multi-omic approach to analyze the gene expression profiles of two closely related Pseudomonas aeruginosa strains grown in n-alkanes or glycerol. RESULTS: The environmental P. aeruginosa isolate ATCC 33988 consumed medium-length (C10-C16) n-alkanes more rapidly than the laboratory strain PAO1, despite high genome sequence identity (average nucleotide identity >99%). Our data shows that ATCC 33988 induces a characteristic set of genes at the transcriptional, translational and post-translational levels during growth on alkanes, many of which differ from those expressed by PAO1. Of particular interest was the lack of expression from the rhl operon of the quorum sensing (QS) system, resulting in no measurable rhamnolipid production by ATCC 33988. Further examination showed that ATCC 33988 lacked the entire lasI/lasR arm of the QS response. Instead of promoting expression of QS genes, ATCC 33988 up-regulates a small subset of its genome, including operons responsible for specific alkaline proteases and sphingosine metabolism. CONCLUSION: This work represents the first time results from RNA-seq, microarray, ribosome footprinting, proteomics, and small molecule LC-MS experiments have been integrated to compare gene expression in bacteria. Together, these data provide insights as to why strain ATCC 33988 is better adapted for growth and survival on n-alkanes.


Assuntos
Alcanos/farmacologia , Biologia Computacional/métodos , Pseudomonas aeruginosa/efeitos dos fármacos , Perfilação da Expressão Gênica , Glicolipídeos/metabolismo , Pseudomonas aeruginosa/citologia , Pseudomonas aeruginosa/genética , Pseudomonas aeruginosa/metabolismo , Percepção de Quorum/efeitos dos fármacos
5.
Proc Natl Acad Sci U S A ; 111(13): 4904-9, 2014 Apr 01.
Artigo em Inglês | MEDLINE | ID: mdl-24632729

RESUMO

The large volumes of sequencing data required to sample deeply the microbial communities of complex environments pose new challenges to sequence analysis. De novo metagenomic assembly effectively reduces the total amount of data to be analyzed but requires substantial computational resources. We combine two preassembly filtering approaches--digital normalization and partitioning--to generate previously intractable large metagenome assemblies. Using a human-gut mock community dataset, we demonstrate that these methods result in assemblies nearly identical to assemblies from unprocessed data. We then assemble two large soil metagenomes totaling 398 billion bp (equivalent to 88,000 Escherichia coli genomes) from matched Iowa corn and native prairie soils. The resulting assembled contigs could be used to identify molecular interactions and reaction networks of known metabolic pathways using the Kyoto Encyclopedia of Genes and Genomes Orthology database. Nonetheless, more than 60% of predicted proteins in assemblies could not be annotated against known databases. Many of these unknown proteins were abundant in both corn and prairie soils, highlighting the benefits of assembly for the discovery and characterization of novelty in soil biodiversity. Moreover, 80% of the sequencing data could not be assembled because of low coverage, suggesting that considerably more sequencing data are needed to characterize the functional content of soil.


Assuntos
Biodiversidade , Metagenoma/genética , Microbiologia do Solo , Solo , Trato Gastrointestinal/microbiologia , Humanos , Iowa , Especificidade da Espécie , Zea mays/genética
6.
Proc Natl Acad Sci U S A ; 111(47): E5096-104, 2014 Nov 25.
Artigo em Inglês | MEDLINE | ID: mdl-25385629

RESUMO

Bacteria play many important roles in animal digestive systems, including the provision of enzymes critical to digestion. Typically, complex communities of bacteria reside in the gut lumen in direct contact with the ingested materials they help to digest. Here, we demonstrate a previously undescribed digestive strategy in the wood-eating marine bivalve Bankia setacea, wherein digestive bacteria are housed in a location remote from the gut. These bivalves, commonly known as shipworms, lack a resident microbiota in the gut compartment where wood is digested but harbor endosymbiotic bacteria within specialized cells in their gills. We show that this comparatively simple bacterial community produces wood-degrading enzymes that are selectively translocated from gill to gut. These enzymes, which include just a small subset of the predicted wood-degrading enzymes encoded in the endosymbiont genomes, accumulate in the gut to the near exclusion of other endosymbiont-made proteins. This strategy of remote enzyme production provides the shipworm with a mechanism to capture liberated sugars from wood without competition from an endogenous gut microbiota. Because only those proteins required for wood digestion are translocated to the gut, this newly described system reveals which of many possible enzymes and enzyme combinations are minimally required for wood degradation. Thus, although it has historically had negative impacts on human welfare, the shipworm digestive process now has the potential to have a positive impact on industries that convert wood and other plant biomass to renewable fuels, fine chemicals, food, feeds, textiles, and paper products.


Assuntos
Bactérias/classificação , Digestão , Comportamento Alimentar , Brânquias/microbiologia , Moluscos/metabolismo , Madeira , Animais , Metagenoma , Dados de Sequência Molecular , Filogenia
7.
Proc Natl Acad Sci U S A ; 110(46): 18590-5, 2013 Nov 12.
Artigo em Inglês | MEDLINE | ID: mdl-24167248

RESUMO

We present the complete genomic sequence of the essential symbiont Polynucleobacter necessarius (Betaproteobacteria), which is a valuable case study for several reasons. First, it is hosted by a ciliated protist, Euplotes; bacterial symbionts of ciliates are still poorly known because of a lack of extensive molecular data. Second, the single species P. necessarius contains both symbiotic and free-living strains, allowing for a comparison between closely related organisms with different ecologies. Third, free-living P. necessarius strains are exceptional by themselves because of their small genome size, reduced metabolic flexibility, and high worldwide abundance in freshwater systems. We provide a comparative analysis of P. necessarius metabolism and explore the peculiar features of a genome reduction that occurred on an already streamlined genome. We compare this unusual system with current hypotheses for genome erosion in symbionts and free-living bacteria, propose modifications to the presently accepted model, and discuss the potential consequences of translesion DNA polymerase loss.


Assuntos
Burkholderiaceae/genética , Euplotes/microbiologia , Evolução Molecular , Tamanho do Genoma/genética , Genoma Bacteriano/genética , Simbiose/genética , Sequência de Aminoácidos , Sequência de Bases , Biologia Computacional , Anotação de Sequência Molecular , Dados de Sequência Molecular , Alinhamento de Sequência , Análise de Sequência de DNA
8.
PLoS Genet ; 8(10): e1003012, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-23133394

RESUMO

Terrestrial arthropods are commonly infected with maternally inherited bacterial symbionts that cause cytoplasmic incompatibility (CI). In CI, the outcome of crosses between symbiont-infected males and uninfected females is reproductive failure, increasing the relative fitness of infected females and leading to spread of the symbiont in the host population. CI symbionts have profound impacts on host genetic structure and ecology and may lead to speciation and the rapid evolution of sex determination systems. Cardinium hertigii, a member of the Bacteroidetes and symbiont of the parasitic wasp Encarsia pergandiella, is the only known bacterium other than the Alphaproteobacteria Wolbachia to cause CI. Here we report the genome sequence of Cardinium hertigii cEper1. Comparison with the genomes of CI-inducing Wolbachia pipientis strains wMel, wRi, and wPip provides a unique opportunity to pinpoint shared proteins mediating host cell interaction, including some candidate proteins for CI that have not previously been investigated. The genome of Cardinium lacks all major biosynthetic pathways but harbors a complete biotin biosynthesis pathway, suggesting a potential role for Cardinium in host nutrition. Cardinium lacks known protein secretion systems but encodes a putative phage-derived secretion system distantly related to the antifeeding prophage of the entomopathogen Serratia entomophila. Lastly, while Cardinium and Wolbachia genomes show only a functional overlap of proteins, they show no evidence of laterally transferred elements that would suggest common ancestry of CI in both lineages. Instead, comparative genomics suggests an independent evolution of CI in Cardinium and Wolbachia and provides a novel context for understanding the mechanistic basis of CI.


Assuntos
Bacteroidetes/genética , Genoma Bacteriano , Genômica , Animais , Artrópodes/microbiologia , Proteínas de Bactérias/genética , Proteínas de Bactérias/metabolismo , Bacteroidetes/metabolismo , Evolução Biológica , Vias Biossintéticas , Citoplasma/metabolismo , Ordem dos Genes , Interações Hospedeiro-Patógeno , Simbiose/genética
9.
Nat Genet ; 37(2): 153-9, 2005 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-15640799

RESUMO

Francisella tularensis is one of the most infectious human pathogens known. In the past, both the former Soviet Union and the US had programs to develop weapons containing the bacterium. We report the complete genome sequence of a highly virulent isolate of F. tularensis (1,892,819 bp). The sequence uncovers previously uncharacterized genes encoding type IV pili, a surface polysaccharide and iron-acquisition systems. Several virulence-associated genes were located in a putative pathogenicity island, which was duplicated in the genome. More than 10% of the putative coding sequences contained insertion-deletion or substitution mutations and seemed to be deteriorating. The genome is rich in IS elements, including IS630 Tc-1 mariner family transposons, which are not expected in a prokaryote. We used a computational method for predicting metabolic pathways and found an unexpectedly high proportion of disrupted pathways, explaining the fastidious nutritional requirements of the bacterium. The loss of biosynthetic pathways indicates that F. tularensis is an obligate host-dependent bacterium in its natural life cycle. Our results have implications for our understanding of how highly virulent human pathogens evolve and will expedite strategies to combat them.


Assuntos
Francisella tularensis/genética , Genoma Bacteriano , Sequência de Bases , Elementos de DNA Transponíveis , Francisella tularensis/crescimento & desenvolvimento , Ilhas Genômicas , Ferro/metabolismo , Dados de Sequência Molecular , Mutação , Análise de Sequência de DNA , Virulência/genética
10.
Appl Environ Microbiol ; 79(12): 3770-8, 2013 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-23584789

RESUMO

Plants represent a large reservoir of organic carbon comprised primarily of recalcitrant polymers that most metazoans are unable to deconstruct. Many herbivores gain access to nutrients in this material indirectly by associating with microbial symbionts, and leaf-cutter ants are a paradigmatic example. These ants use fresh foliar biomass as manure to cultivate gardens composed primarily of Leucoagaricus gongylophorus, a basidiomycetous fungus that produces specialized hyphal swellings that serve as a food source for the host ant colony. Although leaf-cutter ants are conspicuous herbivores that contribute substantially to carbon turnover in Neotropical ecosystems, the process through which plant biomass is degraded in their fungus gardens is not well understood. Here we present the first draft genome of L. gongylophorus, and, using genomic and metaproteomic tools, we investigate its role in lignocellulose degradation in the gardens of both Atta cephalotes and Acromyrmex echinatior leaf-cutter ants. We show that L. gongylophorus produces a diversity of lignocellulases in ant gardens and is likely the primary driver of plant biomass degradation in these ecosystems. We also show that this fungus produces distinct sets of lignocellulases throughout the different stages of biomass degradation, including numerous cellulases and laccases that likely play an important role in lignocellulose degradation. Our study provides a detailed analysis of plant biomass degradation in leaf-cutter ant fungus gardens and insight into the enzymes underlying the symbiosis between these dominant herbivores and their obligate fungal cultivar.


Assuntos
Agaricales/enzimologia , Formigas/fisiologia , Celulases/genética , Genoma Fúngico/genética , Simbiose/fisiologia , Agaricales/genética , Sequência de Aminoácidos , Animais , Sequência de Bases , Celulases/metabolismo , Análise por Conglomerados , Herbivoria/fisiologia , Lignina/metabolismo , Dados de Sequência Molecular , Panamá , Filogenia , Plantas/metabolismo , Proteômica , Análise de Sequência de DNA , Homologia de Sequência , Especificidade da Espécie
11.
J Bacteriol ; 194(12): 3266, 2012 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-22628499

RESUMO

Francisella philomiragia is a saprophytic gammaproteobacterium found only occasionally in immunocompromised individuals and is the nearest neighbor to the causative agent of tularemia and category A select agent Francisella tularensis. To shed insight into the key genetic differences and the evolution of these two distinct lineages, we sequenced the first complete genome of F. philomiragia strain ATCC 25017, which was isolated as a free-living microorganism from water in Bear River Refuge, Utah.


Assuntos
DNA Bacteriano/química , DNA Bacteriano/genética , Francisella/genética , Genoma Bacteriano , Francisella/isolamento & purificação , Água Doce/microbiologia , Dados de Sequência Molecular , Análise de Sequência de DNA , Utah
12.
Environ Microbiol ; 14(9): 2405-16, 2012 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-22616650

RESUMO

The Deepwater Horizon oil spill resulted in a massive influx of hydrocarbons into the Gulf of Mexico (the Gulf). To better understand the fate of the oil, we enriched and isolated indigenous hydrocarbon-degrading bacteria from deep, uncontaminated waters from the Gulf with oil (Macondo MC252) and dispersant used during the spill (COREXIT 9500). During 20 days of incubation at 5°C, CO(2) evolution, hydrocarbon concentrations and the microbial community composition were determined. Approximately 60% to 25% of the dissolved oil with or without COREXIT, respectively, was degraded, in addition to some hydrocarbons in the COREXIT. FeCl(2) addition initially increased respiration rates, but not the total amount of hydrocarbons degraded. 16S rRNA gene sequencing revealed a succession in the microbial community over time, with an increase in abundance of Colwellia and Oceanospirillales during the incubations. Flocs formed during incubations with oil and/or COREXIT in the absence of FeCl(2) . Synchrotron radiation-based Fourier transform infrared (SR-FTIR) spectromicroscopy revealed that the flocs were comprised of oil, carbohydrates and biomass. Colwellia were the dominant bacteria in the flocs. Colwellia sp. strain RC25 was isolated from one of the enrichments and confirmed to rapidly degrade high amounts (approximately 75%) of the MC252 oil at 5°C. Together these data highlight several features that provide Colwellia with the capacity to degrade oil in cold, deep marine habitats, including aggregation together with oil droplets into flocs and hydrocarbon degradation ability.


Assuntos
Lipídeos , Poluição por Petróleo , Petróleo/metabolismo , Microbiologia da Água , Alteromonadaceae/genética , Alteromonadaceae/isolamento & purificação , Alteromonadaceae/metabolismo , Alteromonadaceae/ultraestrutura , Bactérias/classificação , Bactérias/genética , Bactérias/crescimento & desenvolvimento , Bactérias/metabolismo , Carga Bacteriana , Gammaproteobacteria/genética , Gammaproteobacteria/metabolismo , Guerra do Golfo , RNA Ribossômico 16S/genética
13.
J Bacteriol ; 193(16): 4274-5, 2011 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-21685287

RESUMO

Ochrobactrum anthropi is a common soil alphaproteobacterium that colonizes a wide spectrum of organisms and is being increasingly recognized as an opportunistic human pathogen. Potentially life-threatening infections, such as endocarditis, are included in the list of reported O. anthropi infections. These reports, together with the scant number of studies and the organism's phylogenetic proximity to the highly pathogenic brucellae, make O. anthropi an attractive model of bacterial pathogenicity. Here we report the genome sequence of the type strain O. anthropi ATCC 49188, which revealed the presence of two chromosomes and four plasmids.


Assuntos
Genoma Bacteriano , Infecções por Bactérias Gram-Negativas/microbiologia , Ochrobactrum anthropi/classificação , Ochrobactrum anthropi/genética , Simbiose , Animais , Humanos , Dados de Sequência Molecular
14.
Proc Natl Acad Sci U S A ; 105(47): 18543-8, 2008 Nov 25.
Artigo em Inglês | MEDLINE | ID: mdl-19020098

RESUMO

The bacterial genus Rhodopseudomonas is comprised of photosynthetic bacteria found widely distributed in aquatic sediments. Members of the genus catalyze hydrogen gas production, carbon dioxide sequestration, and biomass turnover. The genome sequence of Rhodopseudomonas palustris CGA009 revealed a surprising richness of metabolic versatility that would seem to explain its ability to live in a heterogeneous environment like sediment. However, there is considerable genotypic diversity among Rhodopseudomonas isolates. Here we report the complete genome sequences of four additional members of the genus isolated from a restricted geographical area. The sequences confirm that the isolates belong to a coherent taxonomic unit, but they also have significant differences. Whole genome alignments show that the circular chromosomes of the isolates consist of a collinear backbone with a moderate number of genomic rearrangements that impact local gene order and orientation. There are 3,319 genes, 70% of the genes in each genome, shared by four or more strains. Between 10% and 18% of the genes in each genome are strain specific. Some of these genes suggest specialized physiological traits, which we verified experimentally, that include expanded light harvesting, oxygen respiration, and nitrogen fixation capabilities, as well as anaerobic fermentation. Strain-specific adaptations include traits that may be useful in bioenergy applications. This work suggests that against a backdrop of metabolic versatility that is a defining characteristic of Rhodopseudomonas, different ecotypes have evolved to take advantage of physical and chemical conditions in sediment microenvironments that are too small for human observation.


Assuntos
Genoma Bacteriano , Sedimentos Geológicos/microbiologia , Rodopseudomonas/genética , Adaptação Fisiológica , Água Doce , Rearranjo Gênico , Dados de Sequência Molecular , Fixação de Nitrogênio , Fotossíntese , Filogenia , Rodopseudomonas/classificação , Rodopseudomonas/fisiologia , Microbiologia da Água
15.
G3 (Bethesda) ; 11(5)2021 05 07.
Artigo em Inglês | MEDLINE | ID: mdl-33734357

RESUMO

To address a need for improved tools for annotation and comparative genomics of bacteriophage genomes, we developed multiPhATE2. As an extension of multiPhATE, a functional annotation code released previously, multiPhATE2 performs gene finding using multiple algorithms, compares the results of the algorithms, performs functional annotation of coding sequences, and incorporates additional search algorithms and databases to extend the search space of the original code. MultiPhATE2 performs gene matching among sets of closely related bacteriophage genomes, and uses multiprocessing to speed computations. MultiPhATE2 can be re-started at multiple points within the workflow to allow the user to examine intermediate results and adjust the subsequent computations accordingly. In addition, multiPhATE2 accommodates custom gene calls and sequence databases, again adding flexibility. MultiPhATE2 was implemented in Python 3.7 and runs as a command-line code under Linux or MAC operating systems. Full documentation is provided as a README file and a Wiki website.


Assuntos
Bacteriófagos , Algoritmos , Bacteriófagos/genética , Genoma , Genômica , Anotação de Sequência Molecular , Software
16.
Microorganisms ; 9(1)2021 Jan 08.
Artigo em Inglês | MEDLINE | ID: mdl-33429904

RESUMO

One of the main steps in gene-finding in prokaryotes is determining which open reading frames encode for a protein, and which occur by chance alone. There are many different methods to differentiate the two; the most prevalent approach is using shared homology with a database of known genes. This method presents many pitfalls, most notably the catch that you only find genes that you have seen before. The four most popular prokaryotic gene-prediction programs (GeneMark, Glimmer, Prodigal, Phanotate) all use a protein-coding training model to predict protein-coding genes, with the latter three allowing for the training model to be created ab initio from the input genome. Different methods are available for creating the training model, and to increase the accuracy of such tools, we present here GOODORFS, a method for identifying protein-coding genes within a set of all possible open reading frames (ORFS). Our workflow begins with taking the amino acid frequencies of each ORF, calculating an entropy density profile (EDP), using KMeans to cluster the EDPs, and then selecting the cluster with the lowest variation as the coding ORFs. To test the efficacy of our method, we ran GOODORFS on 14,179 annotated phage genomes, and compared our results to the initial training-set creation step of four other similar methods (Glimmer, MED2, PHANOTATE, Prodigal). We found that GOODORFS was the most accurate (0.94) and had the best F1-score (0.85), while Glimmer had the highest precision (0.92) and PHANOTATE had the highest recall (0.96).

17.
Nature ; 424(6952): 1042-7, 2003 Aug 28.
Artigo em Inglês | MEDLINE | ID: mdl-12917642

RESUMO

The marine unicellular cyanobacterium Prochlorococcus is the smallest-known oxygen-evolving autotroph. It numerically dominates the phytoplankton in the tropical and subtropical oceans, and is responsible for a significant fraction of global photosynthesis. Here we compare the genomes of two Prochlorococcus strains that span the largest evolutionary distance within the Prochlorococcus lineage and that have different minimum, maximum and optimal light intensities for growth. The high-light-adapted ecotype has the smallest genome (1,657,990 base pairs, 1,716 genes) of any known oxygenic phototroph, whereas the genome of its low-light-adapted counterpart is significantly larger, at 2,410,873 base pairs (2,275 genes). The comparative architectures of these two strains reveal dynamic genomes that are constantly changing in response to myriad selection pressures. Although the two strains have 1,350 genes in common, a significant number are not shared, and these have been differentially retained from the common ancestor, or acquired through duplication or lateral transfer. Some of these genes have obvious roles in determining the relative fitness of the ecotypes in response to key environmental variables, and hence in regulating their distribution and abundance in the oceans.


Assuntos
Evolução Biológica , Cianobactérias/classificação , Cianobactérias/genética , Meio Ambiente , Genoma Bacteriano , Adaptação Fisiológica/efeitos da radiação , Cianobactérias/efeitos da radiação , Genes Bacterianos/genética , Luz , Dados de Sequência Molecular , Oceanos e Mares , Filogenia
18.
Appl Microbiol Biotechnol ; 88(1): 389-99, 2010 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-20676626

RESUMO

The carboxylate platform utilizes a mixed microbial community to convert lignocellulosic biomass into chemicals and fuels. While much of the platform is well understood, little is known about its microbiology. Mesophilic (40 degrees C) and thermophilic (55 degrees C) fermentations employing a sorghum feedstock and marine sediment inoculum were profiled using 16S rRNA tag-pyrosequencing over the course of a 30-day incubation. The contrasting fermentation temperatures converted similar amounts of biomass, but the mesophilic community was significantly more productive, and the two temperatures differed significantly with respect to propionic and butyric acid production. Pyrotag sequencing revealed the presence of dynamic communities that responded rapidly to temperature and changed substantially over time. Both temperatures were dominated by bacteria resembling Clostridia, but they shared few taxa in common. The species-rich mesophilic community harbored a variety of Bacteroidetes, Actinobacteria, and gamma-Proteobacteria, whereas the thermophilic community was composed mainly of Clostridia and Bacilli. Despite differences in composition and productivity, similar patterns of functional class dynamics were observed. Over time, organisms resembling known cellulose degraders decreased in abundance, while organisms resembling known xylose degraders increased. Improved understanding of the carboxylate platform's microbiology will help refine platform performance and contribute to our growing knowledge regarding biomass conversion and biofuel production processes.


Assuntos
Bactérias/classificação , Bactérias/metabolismo , Biodiversidade , Biocombustíveis , Reatores Biológicos/microbiologia , Ácidos Carboxílicos/metabolismo , Bactérias/genética , DNA Bacteriano/química , DNA Bacteriano/genética , DNA Ribossômico/química , DNA Ribossômico/genética , Lignina/metabolismo , RNA Ribossômico 16S/genética , Análise de Sequência de DNA , Sorghum/metabolismo , Temperatura
19.
PLoS Biol ; 4(12): e383, 2006 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-17105352

RESUMO

Presented here is the complete genome sequence of Thiomicrospira crunogena XCL-2, representative of ubiquitous chemolithoautotrophic sulfur-oxidizing bacteria isolated from deep-sea hydrothermal vents. This gammaproteobacterium has a single chromosome (2,427,734 base pairs), and its genome illustrates many of the adaptations that have enabled it to thrive at vents globally. It has 14 methyl-accepting chemotaxis protein genes, including four that may assist in positioning it in the redoxcline. A relative abundance of coding sequences (CDSs) encoding regulatory proteins likely control the expression of genes encoding carboxysomes, multiple dissolved inorganic nitrogen and phosphate transporters, as well as a phosphonate operon, which provide this species with a variety of options for acquiring these substrates from the environment. Thiom. crunogena XCL-2 is unusual among obligate sulfur-oxidizing bacteria in relying on the Sox system for the oxidation of reduced sulfur compounds. The genome has characteristics consistent with an obligately chemolithoautotrophic lifestyle, including few transporters predicted to have organic allocrits, and Calvin-Benson-Bassham cycle CDSs scattered throughout the genome.


Assuntos
Genoma Bacteriano , Piscirickettsiaceae/genética , Aderência Bacteriana/genética , Dióxido de Carbono/metabolismo , Quimiotaxia/genética , Dados de Sequência Molecular , Fosfatos/metabolismo , Piscirickettsiaceae/metabolismo , Prófagos/genética , Alinhamento de Sequência , Transdução de Sinais
20.
Appl Environ Microbiol ; 74(4): 1145-56, 2008 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-18065616

RESUMO

Sulfur-oxidizing epsilonproteobacteria are common in a variety of sulfidogenic environments. These autotrophic and mixotrophic sulfur-oxidizing bacteria are believed to contribute substantially to the oxidative portion of the global sulfur cycle. In order to better understand the ecology and roles of sulfur-oxidizing epsilonproteobacteria, in particular those of the widespread genus Sulfurimonas, in biogeochemical cycles, the genome of Sulfurimonas denitrificans DSM1251 was sequenced. This genome has many features, including a larger size (2.2 Mbp), that suggest a greater degree of metabolic versatility or responsiveness to the environment than seen for most of the other sequenced epsilonproteobacteria. A branched electron transport chain is apparent, with genes encoding complexes for the oxidation of hydrogen, reduced sulfur compounds, and formate and the reduction of nitrate and oxygen. Genes are present for a complete, autotrophic reductive citric acid cycle. Many genes are present that could facilitate growth in the spatially and temporally heterogeneous sediment habitat from where Sulfurimonas denitrificans was originally isolated. Many resistance-nodulation-development family transporter genes (10 total) are present; of these, several are predicted to encode heavy metal efflux transporters. An elaborate arsenal of sensory and regulatory protein-encoding genes is in place, as are genes necessary to prevent and respond to oxidative stress.


Assuntos
Epsilonproteobacteria/genética , Genoma Bacteriano , Filogenia , Transdução de Sinais/genética , Sequência de Bases , Carbono/metabolismo , Carboxiliases/genética , Crescimento Quimioautotrófico , Quimiotaxia/genética , Mapeamento Cromossômico , Biologia Computacional , Transporte de Elétrons/fisiologia , Modelos Biológicos , Dados de Sequência Molecular , Nitrogênio/metabolismo , Estresse Oxidativo/genética , Análise de Sequência de DNA
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA