Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 36
Filtrar
Más filtros

Banco de datos
País/Región como asunto
Tipo del documento
Intervalo de año de publicación
1.
Nucleic Acids Res ; 41(Database issue): D648-54, 2013 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-23203984

RESUMEN

The metaMicrobesOnline database (freely available at http://meta.MicrobesOnline.org) offers phylogenetic analysis of genes from microbial genomes and metagenomes. Gene trees are constructed for canonical gene families such as COG and Pfam. Such gene trees allow for rapid homologue analysis and subfamily comparison of genes from multiple metagenomes and comparisons with genes from microbial isolates. Additionally, the genome browser permits genome context comparisons, which may be used to determine the closest sequenced genome or suggest functionally associated genes. Lastly, the domain browser permits rapid comparison of protein domain organization within genes of interest from metagenomes and complete microbial genomes.


Asunto(s)
Bases de Datos Genéticas , Metagenoma , Metagenómica , Filogenia , Genoma , Genoma Arqueal , Genoma Bacteriano , Genoma Fúngico , Genómica , Internet , Estructura Terciaria de Proteína , Programas Informáticos , Sintenía
2.
Nucleic Acids Res ; 40(Web Server issue): W604-8, 2012 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-22700702

RESUMEN

Web services application programming interface (API) was developed to provide a programmatic access to the regulatory interactions accumulated in the RegPrecise database (http://regprecise.lbl.gov), a core resource on transcriptional regulation for the microbial domain of the Department of Energy (DOE) Systems Biology Knowledgebase. RegPrecise captures and visualize regulogs, sets of genes controlled by orthologous regulators in several closely related bacterial genomes, that were reconstructed by comparative genomics. The current release of RegPrecise 2.0 includes >1400 regulogs controlled either by protein transcription factors or by conserved ribonucleic acid regulatory motifs in >250 genomes from 24 taxonomic groups of bacteria. The reference regulons accumulated in RegPrecise can serve as a basis for automatic annotation of regulatory interactions in newly sequenced genomes. The developed API provides an efficient access to the RegPrecise data by a comprehensive set of 14 web service resources. The RegPrecise web services API is freely accessible at http://regprecise.lbl.gov/RegPrecise/services.jsp with no login requirements.


Asunto(s)
Regulación Bacteriana de la Expresión Génica , Regulón , Programas Informáticos , Transcripción Genética , Redes Reguladoras de Genes , Genoma Bacteriano , Genómica/métodos , Internet , Motivos de Nucleótidos , Secuencias Reguladoras de Ácido Ribonucleico , Factores de Transcripción/metabolismo , Interfaz Usuario-Computador
3.
PLoS Genet ; 7(6): e1002070, 2011 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-21695235

RESUMEN

The plant-pathogenic fungus Mycosphaerella graminicola (asexual stage: Septoria tritici) causes septoria tritici blotch, a disease that greatly reduces the yield and quality of wheat. This disease is economically important in most wheat-growing areas worldwide and threatens global food production. Control of the disease has been hampered by a limited understanding of the genetic and biochemical bases of pathogenicity, including mechanisms of infection and of resistance in the host. Unlike most other plant pathogens, M. graminicola has a long latent period during which it evades host defenses. Although this type of stealth pathogenicity occurs commonly in Mycosphaerella and other Dothideomycetes, the largest class of plant-pathogenic fungi, its genetic basis is not known. To address this problem, the genome of M. graminicola was sequenced completely. The finished genome contains 21 chromosomes, eight of which could be lost with no visible effect on the fungus and thus are dispensable. This eight-chromosome dispensome is dynamic in field and progeny isolates, is different from the core genome in gene and repeat content, and appears to have originated by ancient horizontal transfer from an unknown donor. Synteny plots of the M. graminicola chromosomes versus those of the only other sequenced Dothideomycete, Stagonospora nodorum, revealed conservation of gene content but not order or orientation, suggesting a high rate of intra-chromosomal rearrangement in one or both species. This observed "mesosynteny" is very different from synteny seen between other organisms. A surprising feature of the M. graminicola genome compared to other sequenced plant pathogens was that it contained very few genes for enzymes that break down plant cell walls, which was more similar to endophytes than to pathogens. The stealth pathogenesis of M. graminicola probably involves degradation of proteins rather than carbohydrates to evade host defenses during the biotrophic stage of infection and may have evolved from endophytic ancestors.


Asunto(s)
Ascomicetos/genética , Cromosomas Fúngicos/genética , Genoma Fúngico/genética , Ascomicetos/metabolismo , Ascomicetos/patogenicidad , Reordenamiento Génico , Enfermedades de las Plantas/microbiología , Sintenía , Triticum/microbiología
4.
mSystems ; 9(1): e0002623, 2024 Jan 23.
Artículo en Inglés | MEDLINE | ID: mdl-38078749

RESUMEN

Microbial communities have evolved to colonize all ecosystems of the planet, from the deep sea to the human gut. Microbes survive by sensing, responding, and adapting to immediate environmental cues. This process is driven by signal transduction proteins such as histidine kinases, which use their sensing domains to bind or otherwise detect environmental cues and "transduce" signals to adjust internal processes. We hypothesized that an ecosystem's unique stimuli leave a sensor "fingerprint," able to identify and shed insight on ecosystem conditions. To test this, we collected 20,712 publicly available metagenomes from Host-associated, Environmental, and Engineered ecosystems across the globe. We extracted and clustered the collection's nearly 18M unique sensory domains into 113,712 similar groupings with MMseqs2. We built gradient-boosted decision tree machine learning models and found we could classify the ecosystem type (accuracy: 87%) and predict the levels of different physical parameters (R2 score: 83%) using the sensor cluster abundance as features. Feature importance enables identification of the most predictive sensors to differentiate between ecosystems which can lead to mechanistic interpretations if the sensor domains are well annotated. To demonstrate this, a machine learning model was trained to predict patient's disease state and used to identify domains related to oxygen sensing present in a healthy gut but missing in patients with abnormal conditions. Moreover, since 98.7% of identified sensor domains are uncharacterized, importance ranking can be used to prioritize sensors to determine what ecosystem function they may be sensing. Furthermore, these new predictive sensors can function as targets for novel sensor engineering with applications in biotechnology, ecosystem maintenance, and medicine.IMPORTANCEMicrobes infect, colonize, and proliferate due to their ability to sense and respond quickly to their surroundings. In this research, we extract the sensory proteins from a diverse range of environmental, engineered, and host-associated metagenomes. We trained machine learning classifiers using sensors as features such that it is possible to predict the ecosystem for a metagenome from its sensor profile. We use the optimized model's feature importance to identify the most impactful and predictive sensors in different environments. We next use the sensor profile from human gut metagenomes to classify their disease states and explore which sensors can explain differences between diseases. The sensors most predictive of environmental labels here, most of which correspond to uncharacterized proteins, are a useful starting point for the discovery of important environment signals and the development of possible diagnostic interventions.


Asunto(s)
Metagenómica , Microbiota , Humanos , Metagenoma , Aprendizaje Automático , Planeta Tierra
5.
Nat Protoc ; 18(1): 208-238, 2023 01.
Artículo en Inglés | MEDLINE | ID: mdl-36376589

RESUMEN

Uncultivated Bacteria and Archaea account for the vast majority of species on Earth, but obtaining their genomes directly from the environment, using shotgun sequencing, has only become possible recently. To realize the hope of capturing Earth's microbial genetic complement and to facilitate the investigation of the functional roles of specific lineages in a given ecosystem, technologies that accelerate the recovery of high-quality genomes are necessary. We present a series of analysis steps and data products for the extraction of high-quality metagenome-assembled genomes (MAGs) from microbiomes using the U.S. Department of Energy Systems Biology Knowledgebase (KBase) platform ( http://www.kbase.us/ ). Overall, these steps take about a day to obtain extracted genomes when starting from smaller environmental shotgun read libraries, or up to about a week from larger libraries. In KBase, the process is end-to-end, allowing a user to go from the initial sequencing reads all the way through to MAGs, which can then be analyzed with other KBase capabilities such as phylogenetic placement, functional assignment, metabolic modeling, pangenome functional profiling, RNA-Seq and others. While portions of such capabilities are available individually from other resources, the combination of the intuitive usability, data interoperability and integration of tools in a freely available computational resource makes KBase a powerful platform for obtaining MAGs from microbiomes. While this workflow offers tools for each of the key steps in the genome extraction process, it also provides a scaffold that can be easily extended with additional MAG recovery and analysis tools, via the KBase software development kit (SDK).


Asunto(s)
Metagenoma , Microbiota , Filogenia , Genoma Bacteriano , Microbiota/genética , Bacterias/genética , Metagenómica
6.
Nucleic Acids Res ; 38(14): e146, 2010 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-20494978

RESUMEN

Systems-level analyses of non-model microorganisms are limited by the existence of numerous uncharacterized genes and a corresponding over-reliance on automated computational annotations. One solution to this challenge is to disrupt gene function using DNA tag technology, which has been highly successful in parallelizing reverse genetics in Saccharomyces cerevisiae and has led to discoveries in gene function, genetic interactions and drug mechanism of action. To extend the yeast DNA tag methodology to a wide variety of microorganisms and applications, we have created a universal, sequence-verified TagModule collection. A hallmark of the 4280 TagModules is that they are cloned into a Gateway entry vector, thus facilitating rapid transfer to any compatible genetic system. Here, we describe the application of the TagModules to rapidly generate tagged mutants by transposon mutagenesis in the metal-reducing bacterium Shewanella oneidensis MR-1 and the pathogenic yeast Candida albicans. Our results demonstrate the optimal hybridization properties of the TagModule collection, the flexibility in applying the strategy to diverse microorganisms and the biological insights that can be gained from fitness profiling tagged mutant collections. The publicly available TagModule collection is a platform-independent resource for the functional genomics of a wide range of microbial systems in the post-genome era.


Asunto(s)
Candida albicans/genética , Mutagénesis Insercional/métodos , Shewanella/genética , Elementos Transponibles de ADN , Análisis de Secuencia por Matrices de Oligonucleótidos , Lugares Marcados de Secuencia
7.
Nucleic Acids Res ; 38(Database issue): D396-400, 2010 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-19906701

RESUMEN

Since 2003, MicrobesOnline (http://www.microbesonline.org) has been providing a community resource for comparative and functional genome analysis. The portal includes over 1000 complete genomes of bacteria, archaea and fungi and thousands of expression microarrays from diverse organisms ranging from model organisms such as Escherichia coli and Saccharomyces cerevisiae to environmental microbes such as Desulfovibrio vulgaris and Shewanella oneidensis. To assist in annotating genes and in reconstructing their evolutionary history, MicrobesOnline includes a comparative genome browser based on phylogenetic trees for every gene family as well as a species tree. To identify co-regulated genes, MicrobesOnline can search for genes based on their expression profile, and provides tools for identifying regulatory motifs and seeing if they are conserved. MicrobesOnline also includes fast phylogenetic profile searches, comparative views of metabolic pathways, operon predictions, a workbench for sequence analysis and integration with RegTransBase and other microbial genome resources. The next update of MicrobesOnline will contain significant new functionality, including comparative analysis of metagenomic sequence data. Programmatic access to the database, along with source code and documentation, is available at http://microbesonline.org/programmers.html.


Asunto(s)
Bacterias/genética , Biología Computacional/métodos , Bases de Datos Genéticas , Bases de Datos de Ácidos Nucleicos , Algoritmos , Biología Computacional/tendencias , Bases de Datos de Proteínas , Perfilación de la Expresión Génica , Genoma Bacteriano , Almacenamiento y Recuperación de la Información/métodos , Internet , Análisis de Secuencia por Matrices de Oligonucleótidos , Estructura Terciaria de Proteína , Programas Informáticos
8.
ISME Commun ; 1(1): 77, 2021 Dec 14.
Artículo en Inglés | MEDLINE | ID: mdl-36765102

RESUMEN

Microbes drive myriad ecosystem processes, but under strong influence from viruses. Because studying viruses in complex systems requires different tools than those for microbes, they remain underexplored. To combat this, we previously aggregated double-stranded DNA (dsDNA) virus analysis capabilities and resources into 'iVirus' on the CyVerse collaborative cyberinfrastructure. Here we substantially expand iVirus's functionality and accessibility, to iVirus 2.0, as follows. First, core iVirus apps were integrated into the Department of Energy's Systems Biology KnowledgeBase (KBase) to provide an additional analytical platform. Second, at CyVerse, 20 software tools (apps) were upgraded or added as new tools and capabilities. Third, nearly 20-fold more sequence reads were aggregated to capture new data and environments. Finally, documentation, as "live" protocols, was updated to maximize user interaction with and contribution to infrastructure development. Together, iVirus 2.0 serves as a uniquely central and accessible analytical platform for studying how viruses, particularly dsDNA viruses, impact diverse microbial ecosystems.

9.
Nat Biotechnol ; 39(4): 499-509, 2021 04.
Artículo en Inglés | MEDLINE | ID: mdl-33169036

RESUMEN

The reconstruction of bacterial and archaeal genomes from shotgun metagenomes has enabled insights into the ecology and evolution of environmental and host-associated microbiomes. Here we applied this approach to >10,000 metagenomes collected from diverse habitats covering all of Earth's continents and oceans, including metagenomes from human and animal hosts, engineered environments, and natural and agricultural soils, to capture extant microbial, metabolic and functional potential. This comprehensive catalog includes 52,515 metagenome-assembled genomes representing 12,556 novel candidate species-level operational taxonomic units spanning 135 phyla. The catalog expands the known phylogenetic diversity of bacteria and archaea by 44% and is broadly available for streamlined comparative analyses, interactive exploration, metabolic modeling and bulk download. We demonstrate the utility of this collection for understanding secondary-metabolite biosynthetic potential and for resolving thousands of new host linkages to uncultivated viruses. This resource underscores the value of genome-centric approaches for revealing genomic properties of uncultivated microorganisms that affect ecosystem processes.


Asunto(s)
Archaea/genética , Bacterias/genética , Metabolómica/métodos , Metagenoma , Metagenómica/métodos , Virus/genética , Microbiología del Aire , Animales , Archaea/clasificación , Archaea/aislamiento & purificación , Bacterias/clasificación , Bacterias/aislamiento & purificación , Catálogos como Asunto , Ecosistema , Humanos , Filogenia , Microbiología del Suelo , Virus/aislamiento & purificación , Microbiología del Agua
10.
Mol Biol Evol ; 26(7): 1641-50, 2009 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-19377059

RESUMEN

Gene families are growing rapidly, but standard methods for inferring phylogenies do not scale to alignments with over 10,000 sequences. We present FastTree, a method for constructing large phylogenies and for estimating their reliability. Instead of storing a distance matrix, FastTree stores sequence profiles of internal nodes in the tree. FastTree uses these profiles to implement Neighbor-Joining and uses heuristics to quickly identify candidate joins. FastTree then uses nearest neighbor interchanges to reduce the length of the tree. For an alignment with N sequences, L sites, and a different characters, a distance matrix requires O(N(2)) space and O(N(2)L) time, but FastTree requires just O(NLa + N ) memory and O(N log (N)La) time. To estimate the tree's reliability, FastTree uses local bootstrapping, which gives another 100-fold speedup over a distance matrix. For example, FastTree computed a tree and support values for 158,022 distinct 16S ribosomal RNAs in 17 h and 2.4 GB of memory. Just computing pairwise Jukes-Cantor distances and storing them, without inferring a tree or bootstrapping, would require 17 h and 50 GB of memory. In simulations, FastTree was slightly more accurate than Neighbor-Joining, BIONJ, or FastME; on genuine alignments, FastTree's topologies had higher likelihoods. FastTree is available at http://microbesonline.org/fasttree.


Asunto(s)
Algoritmos , Proteínas/genética , Alineación de Secuencia/métodos , Evolución Molecular , Modelos Genéticos , Filogenia
11.
Nature ; 428(6982): 529-35, 2004 Apr 01.
Artículo en Inglés | MEDLINE | ID: mdl-15057824

RESUMEN

Chromosome 19 has the highest gene density of all human chromosomes, more than double the genome-wide average. The large clustered gene families, corresponding high G + C content, CpG islands and density of repetitive DNA indicate a chromosome rich in biological and evolutionary significance. Here we describe 55.8 million base pairs of highly accurate finished sequence representing 99.9% of the euchromatin portion of the chromosome. Manual curation of gene loci reveals 1,461 protein-coding genes and 321 pseudogenes. Among these are genes directly implicated in mendelian disorders, including familial hypercholesterolaemia and insulin-resistant diabetes. Nearly one-quarter of these genes belong to tandemly arranged families, encompassing more than 25% of the chromosome. Comparative analyses show a fascinating picture of conservation and divergence, revealing large blocks of gene orthology with rodents, scattered regions with more recent gene family expansions and deletions, and segments of coding and non-coding conservation with the distant fish species Takifugu.


Asunto(s)
Cromosomas Humanos Par 19/genética , Genes/genética , Mapeo Físico de Cromosoma , Empalme Alternativo/genética , Animales , Composición de Base , Secuencia Conservada/genética , Islas de CpG/genética , Evolución Molecular , Duplicación de Gen , Genética Médica , Humanos , Ratones , Datos de Secuencia Molecular , Familia de Multigenes/genética , Seudogenes/genética , Análisis de Secuencia de ADN
12.
Nat Biotechnol ; 25(3): 319-26, 2007 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-17334359

RESUMEN

Xylose is a major constituent of plant lignocellulose, and its fermentation is important for the bioconversion of plant biomass to fuels and chemicals. Pichia stipitis is a well-studied, native xylose-fermenting yeast. The mechanism and regulation of xylose metabolism in P. stipitis have been characterized and genes from P. stipitis have been used to engineer xylose metabolism in Saccharomyces cerevisiae. We have sequenced and assembled the complete genome of P. stipitis. The sequence data have revealed unusual aspects of genome organization, numerous genes for bioconversion, a preliminary insight into regulation of central metabolic pathways and several examples of colocalized genes with related functions. The genome sequence provides insight into how P. stipitis regulates its redox balance while very efficiently fermenting xylose under microaerobic conditions.


Asunto(s)
Vías Biosintéticas/genética , Celulosa/metabolismo , Genoma Bacteriano/genética , Lignina/metabolismo , Pichia/genética , Xilosa/metabolismo , Biomasa , ADN de Hongos/análisis , Fermentación , Biblioteca de Genes , Datos de Secuencia Molecular , Filogenia , Pichia/enzimología , Alineación de Secuencia , Análisis de Secuencia de ADN
13.
BMC Genomics ; 10: 131, 2009 Mar 25.
Artículo en Inglés | MEDLINE | ID: mdl-19321007

RESUMEN

BACKGROUND: Iron homeostasis of Shewanella oneidensis, a gamma-proteobacterium possessing high iron content, is regulated by a global transcription factor Fur. However, knowledge is incomplete about other biological pathways that respond to changes in iron concentration, as well as details of the responses. In this work, we integrate physiological, transcriptomics and genetic approaches to delineate the iron response of S. oneidensis. RESULTS: We show that the iron response in S. oneidensis is a rapid process. Temporal gene expression profiles were examined for iron depletion and repletion, and a gene co-expression network was reconstructed. Modules of iron acquisition systems, anaerobic energy metabolism and protein degradation were the most noteworthy in the gene network. Bioinformatics analyses suggested that genes in each of the modules might be regulated by DNA-binding proteins Fur, CRP and RpoH, respectively. Closer inspection of these modules revealed a transcriptional regulator (SO2426) involved in iron acquisition and ten transcriptional factors involved in anaerobic energy metabolism. Selected genes in the network were analyzed by genetic studies. Disruption of genes encoding a putative alcaligin biosynthesis protein (SO3032) and a gene previously implicated in protein degradation (SO2017) led to severe growth deficiency under iron depletion conditions. Disruption of a novel transcriptional factor (SO1415) caused deficiency in both anaerobic iron reduction and growth with thiosulfate or TMAO as an electronic acceptor, suggesting that SO1415 is required for specific branches of anaerobic energy metabolism pathways. CONCLUSION: Using a reconstructed gene network, we identified major biological pathways that were differentially expressed during iron depletion and repletion. Genetic studies not only demonstrated the importance of iron acquisition and protein degradation for iron depletion, but also characterized a novel transcriptional factor (SO1415) with a role in anaerobic energy metabolism.


Asunto(s)
Redes Reguladoras de Genes , Hierro/metabolismo , Shewanella/genética , Shewanella/metabolismo , Proteínas Bacterianas/genética , Proteínas Bacterianas/metabolismo , Proteínas de Unión al ADN/metabolismo , Perfilación de la Expresión Génica , Genes Reguladores , Análisis de Secuencia por Matrices de Oligonucleótidos , ARN Bacteriano/genética , Factores de Transcripción/metabolismo
14.
Environ Microbiol ; 11(9): 2244-52, 2009 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-19737303

RESUMEN

The genome of Desulfovibrio vulgaris strain DePue, a sulfate-reducing Deltaproteobacterium isolated from heavy metal-impacted lake sediment, was completely sequenced and compared with the type strain D. vulgaris Hildenborough. The two genomes share a high degree of relatedness and synteny, but harbour distinct prophage and signatures of past phage encounters. In addition to a highly variable phage contribution, the genome of strain DePue contains a cluster of open-reading frames not found in strain Hildenborough coding for the production and export of a capsule exopolysaccharide, possibly of relevance to heavy metal resistance. Comparative whole-genome microarray analysis on four additional D. vulgaris strains established greater interstrain variation within regions associated with phage insertion and exopolysaccharide biosynthesis.


Asunto(s)
Desulfovibrio vulgaris/genética , Genoma Bacteriano , Secuencias Repetitivas Esparcidas , Bacteriófagos/genética , ADN Bacteriano/análisis , Desulfovibrio vulgaris/clasificación , Islas Genómicas , Análisis por Micromatrices , Polisacáridos Bacterianos/biosíntesis , Polisacáridos Bacterianos/genética
15.
Biotechnol Bioeng ; 102(4): 1161-9, 2009 Mar 01.
Artículo en Inglés | MEDLINE | ID: mdl-19031428

RESUMEN

Shewanella spp. are a group of facultative anaerobic bacteria widely distributed in marine and freshwater environments. In this study, we profiled the central metabolic fluxes of eight recently sequenced Shewanella species grown under the same condition in minimal medium with [3-13C] lactate. Although the tested Shewanella species had slightly different growth rates (0.23-0.29 h(-1)) and produced different amounts of acetate and pyruvate during early exponential growth (pseudo-steady state), the relative intracellular metabolic flux distributions were remarkably similar. This result indicates that Shewanella species share similar regulation in regard to central carbon metabolic fluxes under steady growth conditions: the maintenance of metabolic robustness is not only evident in a single species under genetic perturbations (Fischer and Sauer, 2005; Nat Genet 37(6):636-640), but also observed through evolutionary related microbial species. This remarkable conservation of relative flux profiles through phylogenetic differences prompts us to introduce the concept of metabotype as an alternative scheme to classify microbial fluxomics. On the other hand, Shewanella spp. display flexibility in the relative flux profiles when switching their metabolism from consuming lactate to consuming pyruvate and acetate.


Asunto(s)
Carbono/metabolismo , Shewanella/metabolismo , Ácido Acético/metabolismo , Isótopos de Carbono/metabolismo , Medios de Cultivo/química , Ácido Láctico/metabolismo , Ácido Pirúvico/metabolismo
16.
PLoS Biol ; 3(10): e314, 2005 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-16128622

RESUMEN

The hypothesis that the relatively large and complex vertebrate genome was created by two ancient, whole genome duplications has been hotly debated, but remains unresolved. We reconstructed the evolutionary relationships of all gene families from the complete gene sets of a tunicate, fish, mouse, and human, and then determined when each gene duplicated relative to the evolutionary tree of the organisms. We confirmed the results of earlier studies that there remains little signal of these events in numbers of duplicated genes, gene tree topology, or the number of genes per multigene family. However, when we plotted the genomic map positions of only the subset of paralogous genes that were duplicated prior to the fish-tetrapod split, their global physical organization provides unmistakable evidence of two distinct genome duplication events early in vertebrate evolution indicated by clear patterns of four-way paralogous regions covering a large part of the human genome. Our results highlight the potential for these large-scale genomic events to have driven the evolutionary success of the vertebrate lineage.


Asunto(s)
Evolución Molecular , Genes Duplicados , Vertebrados/genética , Animales , Ciona intestinalis/genética , Biología Computacional , Drosophila/genética , Genoma , Humanos , Ratones , Familia de Multigenes , Filogenia , Takifugu/genética
17.
PLoS Comput Biol ; 3(9): 1739-50, 2007 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-17845071

RESUMEN

Transcription factors (TFs) form large paralogous gene families and have complex evolutionary histories. Here, we ask whether putative orthologs of TFs, from bidirectional best BLAST hits (BBHs), are evolutionary orthologs with conserved functions. We show that BBHs of TFs from distantly related bacteria are usually not evolutionary orthologs. Furthermore, the false orthologs usually respond to different signals and regulate distinct pathways, while the few BBHs that are evolutionary orthologs do have conserved functions. To test the conservation of regulatory interactions, we analyze expression patterns. We find that regulatory relationships between TFs and their regulated genes are usually not conserved for BBHs in Escherichia coli K12 and Bacillus subtilis. Even in the much more closely related bacteria Vibrio cholerae and Shewanella oneidensis MR-1, predicting regulation from E. coli BBHs has high error rates. Using gene-regulon correlations, we identify genes whose expression pattern differs between E. coli and S. oneidensis. Using literature searches and sequence analysis, we show that these changes in expression patterns reflect changes in gene regulation, even for evolutionary orthologs. We conclude that the evolution of bacterial regulation should be analyzed with phylogenetic trees, rather than BBHs, and that bacterial regulatory networks evolve more rapidly than previously thought.


Asunto(s)
Proteínas Bacterianas/genética , Secuencia Conservada/genética , Análisis Mutacional de ADN/métodos , Evolución Molecular , Regulación Bacteriana de la Expresión Génica/genética , Secuencias Reguladoras de Ácidos Nucleicos/genética , Factores de Transcripción/genética , Variación Genética/genética , Relación Estructura-Actividad
18.
Nucleic Acids Res ; 34(Database issue): D572-80, 2006 Jan 01.
Artículo en Inglés | MEDLINE | ID: mdl-16381935

RESUMEN

TreeFam is a database of phylogenetic trees of gene families found in animals. It aims to develop a curated resource that presents the accurate evolutionary history of all animal gene families, as well as reliable ortholog and paralog assignments. Curated families are being added progressively, based on seed alignments and trees in a similar fashion to Pfam. Release 1.1 of TreeFam contains curated trees for 690 families and automatically generated trees for another 11 646 families. These represent over 128 000 genes from nine fully sequenced animal genomes and over 45 000 other animal proteins from UniProt; approximately 40-85% of proteins encoded in the fully sequenced animal genomes are included in TreeFam. TreeFam is freely available at http://www.treefam.org and http://treefam.genomics.org.cn.


Asunto(s)
Bases de Datos Genéticas , Genes , Filogenia , Proteínas/clasificación , Animales , Evolución Molecular , Genómica , Humanos , Internet , Ratones , Proteínas/genética , Ratas , Programas Informáticos , Interfaz Usuario-Computador
20.
Stand Genomic Sci ; 12: 23, 2017.
Artículo en Inglés | MEDLINE | ID: mdl-28194258

RESUMEN

Hexavalent Chromium [Cr(VI)] is a widespread contaminant found in soil, sediment, and ground water in several DOE sites, including Hanford 100 H area. In order to stimulate microbially mediated reduction of Cr(VI) at this site, a poly-lactate hydrogen release compound was injected into the chromium contaminated aquifer. Targeted enrichment of dominant nitrate-reducing bacteria post injection resulted in the isolation of Pseudomonas stutzeri strain RCH2. P. stutzeri strain RCH2 was isolated using acetate as the electron donor and is a complete denitrifier. Experiments with anaerobic washed cell suspension of strain RCH2 revealed it could reduce Cr(VI) and Fe(III). The genome of strain RCH2 was sequenced using a combination of Illumina and 454 sequencing technologies and contained a circular chromosome of 4.6 Mb and three plasmids. Global genome comparisons of strain RCH2 with six other fully sequenced P. stutzeri strains revealed most genomic regions are conserved, however strain RCH2 has an additional 244 genes, some of which are involved in chemotaxis, Flp pilus biogenesis and pyruvate/2-oxogluturate complex formation.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA