Results 1 - 20 of 38
1.
Nucleic Acids Res ; 51(D1): D678-D689, 2023 01 06.
Article in English | MEDLINE | ID: mdl-36350631

ABSTRACT

The National Institute of Allergy and Infectious Diseases (NIAID) established the Bioinformatics Resource Center (BRC) program to assist researchers with analyzing the growing body of genome sequence and other omics-related data. In this report, we describe the merger of the PAThosystems Resource Integration Center (PATRIC), the Influenza Research Database (IRD) and the Virus Pathogen Database and Analysis Resource (ViPR) BRCs to form the Bacterial and Viral Bioinformatics Resource Center (BV-BRC) https://www.bv-brc.org/. The combined BV-BRC leverages the functionality of the bacterial and viral resources to provide a unified data model, enhanced web-based visualization and analysis tools, bioinformatics services, and a powerful suite of command line tools that benefit the bacterial and viral research communities.


Subjects
Genomics; Software; Viruses; Humans; Bacteria/genetics; Computational Biology; Databases, Genetic; Influenza, Human; Viruses/genetics
2.
Nucleic Acids Res ; 48(D1): D606-D612, 2020 01 08.
Article in English | MEDLINE | ID: mdl-31667520

ABSTRACT

The PathoSystems Resource Integration Center (PATRIC) is the bacterial Bioinformatics Resource Center funded by the National Institute of Allergy and Infectious Diseases (https://www.patricbrc.org). PATRIC supports bioinformatic analyses of all bacteria with a special emphasis on pathogens, offering a rich comparative analysis environment that provides users with access to over 250 000 uniformly annotated and publicly available genomes with curated metadata. PATRIC offers web-based visualization and comparative analysis tools, a private workspace in which users can analyze their own data in the context of the public collections, services that streamline complex bioinformatic workflows and command-line tools for bulk data analysis. Over the past several years, as genomic and other omics-related experiments have become more cost-effective and widespread, we have observed considerable growth in the usage of and demand for easy-to-use, publicly available bioinformatic tools and services. Here we report the recent updates to the PATRIC resource, including new web-based comparative analysis tools, eight new services and the release of a command-line interface to access, query and analyze data.


Subjects
Bacteria/genetics; Computational Biology/methods; Databases, Genetic; Algorithms; Animals; Caenorhabditis elegans/genetics; Chickens/genetics; Drosophila melanogaster/genetics; Host-Pathogen Interactions/genetics; Humans; Internet; Macaca mulatta/genetics; Metagenomics; Mice; National Institute of Allergy and Infectious Diseases (U.S.); Phenotype; Phylogeny; Rats; Swine/genetics; United States; Zebrafish/genetics
3.
Brief Bioinform ; 20(4): 1094-1102, 2019 07 19.
Article in English | MEDLINE | ID: mdl-28968762

ABSTRACT

The Pathosystems Resource Integration Center (PATRIC, www.patricbrc.org) is designed to provide researchers with the tools and services that they need to perform genomic and other 'omic' data analyses. In response to mounting concern over antimicrobial resistance (AMR), the PATRIC team has been developing new tools that help researchers understand AMR and its genetic determinants. To support comparative analyses, we have added AMR phenotype data to over 15 000 genomes in the PATRIC database, often assembling genomes from reads in public archives and collecting their associated AMR panel data from the literature to augment the collection. We have also been using this collection of AMR metadata to build machine learning-based classifiers that can predict the AMR phenotypes and the genomic regions associated with resistance for genomes being submitted to the annotation service. Likewise, we have undertaken a large AMR protein annotation effort by manually curating data from the literature and public repositories. This collection of 7370 AMR reference proteins, which contains many protein annotations (functional roles) that are unique to PATRIC and RAST, has been manually curated so that it projects stably across genomes. The collection currently projects to 1 610 744 proteins in the PATRIC database. Finally, the PATRIC Web site has been expanded to enable AMR-based custom page views so that researchers can easily explore AMR data and design experiments based on whole genomes or individual genes.


Subjects
Computational Biology/methods; Databases, Genetic; Drug Resistance, Microbial/genetics; Systems Integration; Computational Biology/trends; Databases, Genetic/statistics & numerical data; Genome, Microbial; Humans; Internet; Molecular Sequence Annotation
4.
Nat Commun ; 9(1): 4908, 2018 11 21.
Article in English | MEDLINE | ID: mdl-30464174

ABSTRACT

Sulfolobus islandicus is a model microorganism in the TACK superphylum of the Archaea, a key lineage in the evolutionary history of cells. Here we report a genome-wide identification of the repertoire of genes essential to S. islandicus growth in culture. We confirm previous targeted gene knockouts, uncover the non-essentiality of functions assumed to be essential to the Sulfolobus cell, including the proteinaceous S-layer, and highlight essential genes whose functions are yet to be determined. Phyletic distributions illustrate the potential transitions that may have occurred during the evolution of this archaeal microorganism, and highlight sets of genes that may have been associated with each transition. We use this comparative context as a lens to focus future research on archaea-specific uncharacterized essential genes that may provide valuable insights into the evolutionary history of cells.


Subjects
Genes, Essential; Genome, Archaeal; Sulfolobus/genetics; Biological Evolution; DNA Topoisomerases, Type I/genetics; Genetic Complementation Test; Membrane Glycoproteins/genetics; Sulfolobus/ultrastructure
5.
Nucleic Acids Res ; 45(D1): D535-D542, 2017 01 04.
Article in English | MEDLINE | ID: mdl-27899627

ABSTRACT

The Pathosystems Resource Integration Center (PATRIC) is the bacterial Bioinformatics Resource Center (https://www.patricbrc.org). Recent changes to PATRIC include a redesign of the web interface and some new services that provide users with a platform that takes them from raw reads to an integrated analysis experience. The redesigned interface allows researchers direct access to tools and data, and the emphasis has changed to user-created genome-groups, with detailed summaries and views of the data that researchers have selected. Perhaps the biggest change has been the enhanced capability for researchers to analyze their private data and compare it to the available public data. Researchers can assemble their raw sequence reads and annotate the contigs using RASTtk. PATRIC also provides services for RNA-Seq, variation, model reconstruction and differential expression analysis, all delivered through an updated private workspace. Private data can be compared by 'virtual integration' to any of PATRIC's public data. The number of genomes available for comparison in PATRIC has expanded to over 80 000, with a special emphasis on genomes with antimicrobial resistance data. PATRIC uses this data to improve both subsystem annotation and k-mer classification, and tags new genomes as having signatures that indicate susceptibility or resistance to specific antibiotics.


Subjects
Bacteria/genetics; Computational Biology/methods; Databases, Genetic; Genome, Bacterial; Genomics/methods; Anti-Bacterial Agents/pharmacology; Bacteria/drug effects; Bacteria/metabolism; Bacterial Proteins/genetics; Bacterial Proteins/metabolism; Drug Resistance, Bacterial; Molecular Sequence Annotation; Proteome; Proteomics/methods; Software; Web Browser
6.
Alzheimers Dement ; 13(5): 510-519, 2017 May.
Article in English | MEDLINE | ID: mdl-27793643

ABSTRACT

INTRODUCTION: We have comprehensively described the expression profiles of mitochondrial DNA and nuclear DNA genes that encode subunits of the respiratory oxidative phosphorylation (OXPHOS) complexes (I-V) in the hippocampus from young controls, age matched, mild cognitively impaired (MCI), and Alzheimer's disease (AD) subjects. METHODS: Hippocampal tissues from 44 non-AD controls (NC), 10 amnestic MCI, and 18 AD cases were analyzed on Affymetrix Hg-U133 plus 2.0 arrays. RESULTS: The microarray data revealed significant down regulation in OXPHOS genes in AD, particularly those encoded in the nucleus. In contrast, there was up regulation of the same gene(s) in MCI subjects compared to AD and ND cases. No significant differences were observed in mtDNA genes identified in the array between AD, ND, and MCI subjects except one mt-ND6. DISCUSSION: Our findings suggest that restoration of the expression of nuclear-encoded OXPHOS genes in aging could be a viable strategy for blunting AD progression.


Subjects
Aging/genetics; Alzheimer Disease/genetics; Cognition Disorders/genetics; Mitochondria/genetics; Oxidative Phosphorylation; Adult; Aged, 80 and over; Autopsy; Female; Hippocampus; Humans; Male; Oligonucleotide Array Sequence Analysis
7.
Front Microbiol ; 7: 118, 2016.
Article in English | MEDLINE | ID: mdl-26903996

ABSTRACT

The ability to build accurate protein families is a fundamental operation in bioinformatics that influences comparative analyses, genome annotation, and metabolic modeling. For several years we have been maintaining protein families for all microbial genomes in the PATRIC database (Pathosystems Resource Integration Center, patricbrc.org) in order to drive many of the comparative analysis tools that are available through the PATRIC website. However, due to the burgeoning number of genomes, traditional approaches for generating protein families are becoming prohibitive. In this report, we describe a new approach for generating protein families, which we call PATtyFams. This method uses the k-mer-based function assignments available through RAST (Rapid Annotation using Subsystem Technology) to rapidly guide family formation, and then differentiates the function-based groups into families using a Markov Cluster algorithm (MCL). This new approach for generating protein families is rapid, scalable and has properties that are consistent with alignment-based methods.

8.
Genome Biol Evol ; 7(12): 3337-57, 2015 Nov 19.
Article in English | MEDLINE | ID: mdl-26590210

ABSTRACT

The large repABC plasmids of the order Rhizobiales with Class I quorum-regulated conjugative transfer systems often define the nature of the bacterium that harbors them. These otherwise diverse plasmids contain a core of highly conserved genes for replication and conjugation raising the question of their evolutionary relationships. In an analysis of 18 such plasmids these elements fall into two organizational classes, Group I and Group II, based on the sites at which cargo DNA is located. Cladograms constructed from proteins of the transfer and quorum-sensing components indicated that those of the Group I plasmids, while coevolving, have diverged from those coevolving proteins of the Group II plasmids. Moreover, within these groups the phylogenies of the proteins usually occupy similar, if not identical, tree topologies. Remarkably, such relationships were not seen among proteins of the replication system; although RepA and RepB coevolve, RepC does not. Nor do the replication proteins coevolve with the proteins of the transfer and quorum-sensing systems. Functional analysis was mostly consistent with phylogenies. TraR activated promoters from plasmids within its group, but not between groups and dimerized with TraR proteins from within but not between groups. However, oriT sequences, which are highly conserved, were processed by the transfer system of plasmids regardless of group. We conclude that these plasmids diverged into two classes based on the locations at which cargo DNA is inserted, that the quorum-sensing and transfer functions are coevolving within but not between the two groups, and that this divergent evolution extends to function.


Subjects
Bacterial Proteins/genetics; DNA Helicases/genetics; Evolution, Molecular; Gene Transfer, Horizontal; Quorum Sensing/genetics; Rhizobiaceae/genetics; Trans-Activators/genetics; Plasmids/genetics
9.
PLoS One ; 10(6): e0126883, 2015.
Article in English | MEDLINE | ID: mdl-26039056

ABSTRACT

The Salmonella enterica serovars Enteritidis, Dublin, and Gallinarum are closely related but differ in virulence and host range. To identify the genetic elements responsible for these differences and to better understand how these serovars are evolving, we sequenced the genomes of Enteritidis strain LK5 and Dublin strain SARB12 and compared these genomes to the publicly available Enteritidis P125109, Dublin CT 02021853 and Dublin SD3246 genome sequences. We also compared the publicly available Gallinarum genome sequences from biotype Gallinarum 287/91 and Pullorum RKS5078. Using bioinformatic approaches, we identified single nucleotide polymorphisms, insertions, deletions, and differences in prophage and pseudogene content between strains belonging to the same serovar. Through our analysis we also identified several prophage cargo genes and pseudogenes that affect virulence and may contribute to a host-specific, systemic lifestyle. These results strongly argue that the Enteritidis, Dublin and Gallinarum serovars of Salmonella enterica evolve by acquiring new genes through horizontal gene transfer, followed by the formation of pseudogenes. The loss of genes necessary for a gastrointestinal lifestyle ultimately leads to a systemic lifestyle and niche exclusion in the host-specific serovars.


Subjects
Genome, Bacterial; Mutation; Polymorphism, Single Nucleotide; Salmonella enteritidis/genetics; Salmonella enteritidis/pathogenicity; Serogroup
10.
Sci Rep ; 5: 8365, 2015 Feb 10.
Article in English | MEDLINE | ID: mdl-25666585

ABSTRACT

The RAST (Rapid Annotation using Subsystem Technology) annotation engine was built in 2008 to annotate bacterial and archaeal genomes. It works by offering a standard software pipeline for identifying genomic features (i.e., protein-encoding genes and RNA) and annotating their functions. Recently, in order to make RAST a more useful research tool and to keep pace with advancements in bioinformatics, it has become desirable to build a version of RAST that is both customizable and extensible. In this paper, we describe the RAST tool kit (RASTtk), a modular version of RAST that enables researchers to build custom annotation pipelines. RASTtk offers a choice of software for identifying and annotating genomic features as well as the ability to add custom features to an annotation job. RASTtk also accommodates the batch submission of genomes and the ability to customize annotation protocols for batch submissions. This is the first major software restructuring of RAST since its inception.


Subjects
Molecular Sequence Annotation/methods; Software
11.
J Bacteriol ; 196(5): 1031-44, 2014 Mar.
Article in English | MEDLINE | ID: mdl-24363349

ABSTRACT

The Ti plasmid in Agrobacterium tumefaciens strain 15955 carries two alleles of traR that regulate conjugative transfer. The first is a functional allele, called traR, that is transcriptionally induced by the opine octopine. The second, trlR, is a nonfunctional, dominant-negative mutant located in an operon that is inducible by the opine mannopine (MOP). Based on these findings, we predicted that there exist wild-type agrobacterial strains harboring plasmids in which MOP induces a functional traR and, hence, conjugation. We analyzed 11 MOP-utilizing field isolates and found five where MOP induced transfer of the MOP-catabolic element and increased production of the acyl-homoserine lactone (acyl-HSL) quormone. The transmissible elements in these five strains represent a set of highly related plasmids. Sequence analysis of one such plasmid, pAoF64/95, revealed that the 176-kb element is not a Ti plasmid but carries genes for catabolism of MOP, mannopinic acid (MOA), agropinic acid (AGA), and the agrocinopines. The plasmid additionally carries all of the genes required for conjugative transfer, including the regulatory genes traR, traI, and traM. The traR gene, however, is not located in the MOP catabolism region. The gene, instead, is monocistronic and located within the tra-trb-rep gene cluster. A traR mutant failed to transfer the plasmid and produced little to no quormone even when grown with MOP, indicating that TraRpAoF64/95 is the activator of the tra regulon. A traM mutant was constitutive for transfer and acyl-HSL production, indicating that the anti-activator function of TraM is conserved.


Subjects
Agrobacterium tumefaciens/metabolism; Conjugation, Genetic/physiology; Mannitol/analogs & derivatives; Plasmids/metabolism; Quorum Sensing; Acyl-Butyrolactones/metabolism; Agrobacterium tumefaciens/genetics; Bacterial Proteins/genetics; Chromosome Mapping; Chromosomes, Bacterial/genetics; Mannitol/pharmacology; Molecular Sequence Data; Plasmids/genetics; Transcription Factors/genetics
12.
Nucleic Acids Res ; 42(Database issue): D206-14, 2014 Jan.
Article in English | MEDLINE | ID: mdl-24293654

ABSTRACT

In 2004, the SEED (http://pubseed.theseed.org/) was created to provide consistent and accurate genome annotations across thousands of genomes and as a platform for discovering and developing de novo annotations. The SEED is a constantly updated integration of genomic data with a genome database, web front end, API and server scripts. It is used by many scientists for predicting gene functions and discovering new pathways. In addition to being a powerful database for bioinformatics research, the SEED also houses subsystems (collections of functionally related protein families) and their derived FIGfams (protein families), which represent the core of the RAST annotation engine (http://rast.nmpdr.org/). When a new genome is submitted to RAST, genes are called and their annotations are made by comparison to the FIGfam collection. If the genome is made public, it is then housed within the SEED and its proteins populate the FIGfam collection. This annotation cycle has proven to be a robust and scalable solution to the problem of annotating the exponentially increasing number of genomes. To date, >12 000 users worldwide have annotated >60 000 distinct genomes using RAST. Here we describe the interconnectedness of the SEED database and RAST, the RAST annotation pipeline and updates to both resources.


Subjects
Databases, Genetic; Genome, Archaeal; Genome, Bacterial; Molecular Sequence Annotation; Bacterial Proteins/chemistry; Bacterial Proteins/genetics; Bacterial Proteins/physiology; Genomics; Internet; Software
13.
3 Biotech ; 4(3): 331-335, 2014 Jun.
Article in English | MEDLINE | ID: mdl-28324432

ABSTRACT

Maintaining consistency in genome annotations is important for supporting many computational tasks, particularly metabolic modeling. The SEED project has implemented a process that improves annotation consistencies across microbial genomes for proteins with conserved sequences and genomic context. In this research report, we describe this process and show how this effort has resulted in improvements to microbial genome annotations in the SEED. We also compare SEED annotation consistencies with other commonly used resources such as IMG (the Joint Genome Institute's Integrated Microbial Genomes system), RefSeq (the National Center for Biotechnology Information's Reference Sequence Database), Swiss-Prot (the annotated protein sequence database of the Swiss Institute of Bioinformatics, European Molecular Biology Laboratory and the European Bioinformatics Institute) and TrEMBL (Translated European Molecular Biology Laboratory nucleotide sequence data Library). Our analysis indicates that manual and computational efforts are paying off for the databases where consistency is a major goal.

14.
J Int Neuropsychol Soc ; 19(8): 863-72, 2013 Sep.
Article in English | MEDLINE | ID: mdl-23829951

ABSTRACT

To study the natural recovery from sports concussion, 12 concussed high school football athletes and 12 matched uninjured teammates were evaluated with symptom rating scales, tests of postural balance and cognition, and an event-related fMRI study during performance of a load-dependent working memory task at 13 h and 7 weeks following injury. Injured athletes showed the expected postconcussive symptoms and cognitive decline with decreased reaction time (RT) and increased RT variability on a working memory task during the acute period and an apparent full recovery 7 weeks later. Brain activation patterns showed decreased activation of right hemisphere attentional networks in injured athletes relative to controls during the acute period with a reversed pattern of activation (injured > controls) in the same networks at 7 weeks following injury. These changes coincided with a decrease in self-reported postconcussive symptoms and improved cognitive test performance in the injured athletes. Results from this exploratory study suggest that decreased activation of right hemisphere attentional networks mediate the cognitive changes and postconcussion symptoms observed during the acute period following concussion. Conversely, improvement in cognitive functioning and postconcussive symptoms during the subacute period may be mediated by compensatory increases in activation of this same attentional network.


Subjects
Athletic Injuries/complications; Brain Mapping; Brain/pathology; Post-Concussion Syndrome/etiology; Post-Concussion Syndrome/pathology; Recovery of Function/physiology; Adolescent; Brain/blood supply; Case-Control Studies; Humans; Image Processing, Computer-Assisted; Magnetic Resonance Imaging; Male; Neuropsychological Tests; Oxygen/blood; Retrospective Studies; Severity of Illness Index
15.
Int J Syst Evol Microbiol ; 63(Pt 7): 2727-2741, 2013 Jul.
Article in English | MEDLINE | ID: mdl-23606477

ABSTRACT

The tree of life is paramount for achieving an integrated understanding of microbial evolution and the relationships between physiology, genealogy and genomics. It provides the framework for interpreting environmental sequence data, whether applied to microbial ecology or to human health. However, there remain many instances where there is ambiguity in our understanding of the phylogeny of major lineages, and/or confounding nomenclature. Here we apply recent genomic sequence data to examine the evolutionary history of members of the classes Mollicutes (phylum Tenericutes) and Erysipelotrichia (phylum Firmicutes). Consistent with previous analyses, we find evidence of a specific relationship between them in molecular phylogenies and signatures of the 16S rRNA, 23S rRNA, ribosomal proteins and aminoacyl-tRNA synthetase proteins. Furthermore, by mapping functions over the phylogenetic tree we find that the erysipelotrichia lineages are involved in various stages of genomic reduction, having lost (often repeatedly) a variety of metabolic functions and the ability to form endospores. Although molecular phylogeny has driven numerous taxonomic revisions, we find it puzzling that the most recent taxonomic revision of the phyla Firmicutes and Tenericutes has further separated them into distinct phyla, rather than reflecting their common roots.


Subjects
Genome, Bacterial; Phylogeny; Tenericutes/classification; Amino Acyl-tRNA Synthetases/genetics; Bacterial Proteins/genetics; DNA, Bacterial/genetics; Nucleic Acid Conformation; RNA, Ribosomal, 16S/genetics; RNA, Ribosomal, 23S/genetics; Ribosomal Proteins/genetics; Sequence Alignment; Tenericutes/genetics
16.
PLoS One ; 7(10): e48053, 2012.
Article in English | MEDLINE | ID: mdl-23110173

ABSTRACT

The remarkable advance in sequencing technology and the rising interest in medical and environmental microbiology, biotechnology, and synthetic biology resulted in a deluge of published microbial genomes. Yet, genome annotation, comparison, and modeling remain a major bottleneck to the translation of sequence information into biological knowledge, hence computational analysis tools are continuously being developed for rapid genome annotation and interpretation. Among the earliest, most comprehensive resources for prokaryotic genome analysis, the SEED project, initiated in 2003 as an integration of genomic data and analysis tools, now contains >5,000 complete genomes, a constantly updated set of curated annotations embodied in a large and growing collection of encoded subsystems, a derived set of protein families, and hundreds of genome-scale metabolic models. Until recently, however, maintaining current copies of the SEED code and data at remote locations has been a pressing issue. To allow high-performance remote access to the SEED database, we developed the SEED Servers (http://www.theseed.org/servers): four network-based servers intended to expose the data in the underlying relational database, support basic annotation services, offer programmatic access to the capabilities of the RAST annotation server, and provide access to a growing collection of metabolic models that support flux balance analysis. The SEED servers offer open access to regularly updated data, the ability to annotate prokaryotic genomes, the ability to create metabolic reconstructions and detailed models of metabolism, and access to hundreds of existing metabolic models. This work offers and supports a framework upon which other groups can build independent research efforts. 
Large integrations of genomic data represent one of the major intellectual resources driving research in biology, and programmatic access to the SEED data will provide significant utility to a broad collection of potential users.


Subjects
Computational Biology/methods; Databases, Factual/statistics & numerical data; Information Storage and Retrieval/methods; Software; Escherichia coli/genetics; Escherichia coli/metabolism; Genomics/methods; Genomics/statistics & numerical data; Internet; Metabolomics/methods; Metabolomics/statistics & numerical data; Molecular Sequence Annotation/methods; Molecular Sequence Annotation/statistics & numerical data; Reproducibility of Results
17.
Proc Natl Acad Sci U S A ; 108(50): 20154-9, 2011 Dec 13.
Article in English | MEDLINE | ID: mdl-22128332

ABSTRACT

Most bacterial and archaeal genomes contain many genes with little or no similarity to other genes, a property that impedes identification of gene origins. By comparing the codon usage of genes shared among strains (primarily vertically inherited genes) and genes unique to one strain (primarily recently horizontally acquired genes), we found that the plurality of unique genes in Escherichia coli and Salmonella enterica are much more similar to each other than are their vertically inherited genes. We conclude that E. coli and S. enterica derive these unique genes from a common source, a supraspecies phylogenetic group that includes the organisms themselves. The phylogenetic range of the sharing appears to include other (but not all) members of the Enterobacteriaceae. We found evidence of similar gene sharing in other bacterial and archaeal taxa. Thus, we conclude that frequent gene exchange, particularly that of genetic novelties, extends well beyond accepted species boundaries.


Subjects
Escherichia coli/genetics; Gene Transfer, Horizontal/genetics; Genes, Bacterial/genetics; Salmonella enterica/genetics; Sequence Homology, Nucleic Acid; Codon/genetics; Phylogeny; Species Specificity
18.
Mol Biol Evol ; 28(1): 211-21, 2011 Jan.
Article in English | MEDLINE | ID: mdl-20679093

ABSTRACT

Codon usage can provide insights into the nature of the genes in a genome. Genes that are "native" to a genome (have not been recently acquired by horizontal transfer) range in codon usage from a low-bias "typical" usage to a more biased "high-expression" usage characteristic of genes encoding abundant proteins. Genes that differ from these native codon usages are candidates for foreign genes that have been recently acquired by horizontal gene transfer. In this study, we present a method for characterizing the codon usages of native genes--both typical and highly expressed--within a genome. Each gene is evaluated relative to a half line (or axis) in a 59D space of codon usage. The axis begins at the modal codon usage, the usage that matches the largest number of genes in the genome, and it passes through a point representing the codon usage of a set of genes with expression-related bias. A gene whose codon usage matches (does not significantly differ from) a point on this axis is a candidate native gene, and the location of its projection onto the axis provides a general estimate of its expression level. A gene that differs significantly from all points on the axis is a candidate foreign gene. This automated approach offers significant improvements over existing methods. We illustrate this by analyzing the genomes of Pseudomonas aeruginosa PAO1 and Bacillus anthracis A0248, which can be difficult to analyze with commonly used methods due to their biased base compositions. Finally, we use this approach to measure the proportion of candidate foreign genes in 923 bacterial and archaeal genomes. The organisms with the most homogeneous genomes (containing the fewest candidate foreign genes) are mostly endosymbionts and parasites, though with exceptions that include Pelagibacter ubique and Beutenbergia cavernae. 
The organisms with the most heterogeneous genomes (containing the most candidate foreign genes) include members of the genera Bacteroides, Corynebacterium, Desulfotalea, Neisseria, Xylella, and Thermobaculum.


Subjects
Codon; Genes, Bacterial; Genome, Bacterial; Algorithms; Bacillus anthracis/genetics; Base Composition/genetics; Escherichia coli/genetics; Gene Expression Regulation, Bacterial; Gene Transfer, Horizontal; Genes, Archaeal; Pseudomonas aeruginosa/genetics
19.
Am J Primatol ; 73(2): 119-26, 2011 Feb.
Article in English | MEDLINE | ID: mdl-20853395

ABSTRACT

Humans and baboons (Papio spp.) share considerable anatomical and physiological similarities in their reproductive tracts. Given the similarities, it is reasonable to expect that the normal vaginal microbial composition (microbiota) of baboons would be similar to that of humans. We have used a 16S rRNA phylogenetic approach to assess the composition of the baboon vaginal microbiota in a set of nine animals from a captive facility and six from the wild. Results show that although Gram-positive bacteria dominate in baboons as they do in humans, there are major differences between the vaginal microbiota of baboons and that of humans. In contrast to humans, the species of Gram-positive bacteria (Firmicutes) were taxa other than Lactobacillus species. In addition, some groups of Gram-negative bacteria that are not normally abundant in humans were found in the baboon samples. A further level of difference was also seen even within the same bacterial phylogenetic group, as baboon strains tended to be more phylogenetically distinct from human strains than human strains were from each other. Finally, results of our analysis suggest that co-evolution of microbes and their hosts cannot account for the major differences between the microbiota of baboons and that of humans because divergences between the major bacterial genera were too ancient to have occurred since primates evolved. Instead, the primate vaginal tracts appear to have acquired discrete subsets of bacteria from the vast diversity of bacteria available in the environment and established a community responsive to and compatible with host species physiology.


Subjects
Gram-Negative Bacteria/classification; Gram-Positive Bacteria/classification; Metagenome; Papio hamadryas/microbiology; Vagina/microbiology; Animals; Biological Evolution; DNA, Bacterial/genetics; Female; Gram-Negative Bacteria/genetics; Gram-Negative Bacteria/physiology; Gram-Positive Bacteria/genetics; Gram-Positive Bacteria/physiology; Humans; Kenya; Papio hamadryas/physiology; Phylogeny; RNA, Ribosomal, 16S/genetics; Texas
20.
PLoS One ; 5(6): e10866, 2010 Jun 02.
Article in English | MEDLINE | ID: mdl-20532250

ABSTRACT

BACKGROUND: The replication of DNA in Archaea and eukaryotes requires several ancillary complexes, including proliferating cell nuclear antigen (PCNA), replication factor C (RFC), and the minichromosome maintenance (MCM) complex. Bacterial DNA replication utilizes comparable proteins, but these are distantly related phylogenetically to their archaeal and eukaryotic counterparts at best. METHODOLOGY/PRINCIPAL FINDINGS: While the structures of each of the complexes do not differ significantly between the archaeal and eukaryotic versions thereof, the evolutionary dynamic in the two cases does. The number of subunits in each complex is constant across all taxa. However, they vary subtly with regard to composition. In some taxa the subunits are all identical in sequence, while in others some are homologous rather than identical. In the case of eukaryotes, there is no phylogenetic variation in the makeup of each complex; all appear to derive from a common eukaryotic ancestor. This is not the case in Archaea, where the relationship between the subunits within each complex varies taxon-to-taxon. We have performed a detailed phylogenetic analysis of these relationships in order to better understand the gene duplications and divergences that gave rise to the homologous subunits in Archaea. CONCLUSION/SIGNIFICANCE: This domain-level difference in evolution suggests that different forces have driven the evolution of DNA replication proteins in each of these two domains. In addition, the phylogenies of all three gene families support the distinctiveness of the proposed archaeal phylum Thaumarchaeota.


Subjects
Archaea/genetics; DNA Replication; DNA, Archaeal/genetics; Evolution, Molecular; Eukaryotic Cells; Phylogeny; Proliferating Cell Nuclear Antigen/genetics; Replication Protein C/genetics