Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 77
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
Intervalo de ano de publicação
2.
Nucleic Acids Res ; 48(D1): D579-D589, 2020 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-31647104

RESUMO

Large-scale genome sequencing and the increasingly massive use of high-throughput approaches produce a vast amount of new information that completely transforms our understanding of thousands of microbial species. However, despite the development of powerful bioinformatics approaches, full interpretation of the content of these genomes remains a difficult task. Launched in 2005, the MicroScope platform (https://www.genoscope.cns.fr/agc/microscope) has been under continuous development and provides analysis for prokaryotic genome projects together with metabolic network reconstruction and post-genomic experiments allowing users to improve the understanding of gene functions. Here we present new improvements of the MicroScope user interface for genome selection, navigation and expert gene annotation. Automatic functional annotation procedures of the platform have also been updated and we added several new tools for the functional annotation of genes and genomic regions. We finally focus on new tools and pipeline developed to perform comparative analyses on hundreds of genomes based on pangenome graphs. To date, MicroScope contains data for >11 800 microbial genomes, part of which are manually curated and maintained by microbiologists (>4500 personal accounts in September 2019). The platform enables collaborative work in a rich comparative genomic context and improves community-based curation efforts.


Assuntos
Genes Arqueais , Genes Bacterianos , Genômica/métodos , Anotação de Sequência Molecular/métodos , Software , Bases de Dados Genéticas , Redes e Vias Metabólicas
3.
Brief Bioinform ; 20(4): 1071-1084, 2019 07 19.
Artigo em Inglês | MEDLINE | ID: mdl-28968784

RESUMO

The overwhelming list of new bacterial genomes becoming available on a daily basis makes accurate genome annotation an essential step that ultimately determines the relevance of thousands of genomes stored in public databanks. The MicroScope platform (http://www.genoscope.cns.fr/agc/microscope) is an integrative resource that supports systematic and efficient revision of microbial genome annotation, data management and comparative analysis. Starting from the results of our syntactic, functional and relational annotation pipelines, MicroScope provides an integrated environment for the expert annotation and comparative analysis of prokaryotic genomes. It combines tools and graphical interfaces to analyze genomes and to perform the manual curation of gene function in a comparative genomics and metabolic context. In this article, we describe the free-of-charge MicroScope services for the annotation and analysis of microbial (meta)genomes, transcriptomic and re-sequencing data. Then, the functionalities of the platform are presented in a way providing practical guidance and help to the nonspecialists in bioinformatics. Newly integrated analysis tools (i.e. prediction of virulence and resistance genes in bacterial genomes) and original method recently developed (the pan-genome graph representation) are also described. Integrated environments such as MicroScope clearly contribute, through the user community, to help maintaining accurate resources.


Assuntos
Genoma Microbiano , Genômica/métodos , Anotação de Sequência Molecular/métodos , Software , Biologia Computacional , Gráficos por Computador , Sistemas de Gerenciamento de Base de Dados , Bases de Dados de Compostos Químicos , Genômica/estatística & dados numéricos , Internet , Redes e Vias Metabólicas/genética , Fenômenos Microbiológicos , Anotação de Sequência Molecular/estatística & dados numéricos , Interface Usuário-Computador
4.
Bioinformatics ; 36(Suppl_2): i651-i658, 2020 12 30.
Artigo em Inglês | MEDLINE | ID: mdl-33381850

RESUMO

MOTIVATION: Horizontal gene transfer (HGT) is a major source of variability in prokaryotic genomes. Regions of genome plasticity (RGPs) are clusters of genes located in highly variable genomic regions. Most of them arise from HGT and correspond to genomic islands (GIs). The study of those regions at the species level has become increasingly difficult with the data deluge of genomes. To date, no methods are available to identify GIs using hundreds of genomes to explore their diversity. RESULTS: We present here the panRGP method that predicts RGPs using pangenome graphs made of all available genomes for a given species. It allows the study of thousands of genomes in order to access the diversity of RGPs and to predict spots of insertions. It gave the best predictions when benchmarked along other GI detection tools against a reference dataset. In addition, we illustrated its use on metagenome assembled genomes by redefining the borders of the leuX tRNA hotspot, a well-studied spot of insertion in Escherichia coli. panRPG is a scalable and reliable tool to predict GIs and spots making it an ideal approach for large comparative studies. AVAILABILITY AND IMPLEMENTATION: The methods presented in the current work are available through the following software: https://github.com/labgem/PPanGGOLiN. Detailed results and scripts to compute the benchmark metrics are available at https://github.com/axbazin/panrgp_supdata.


Assuntos
Ilhas Genômicas , Software , Transferência Genética Horizontal , Ilhas Genômicas/genética , Genômica , Metagenoma
5.
PLoS Comput Biol ; 16(3): e1007732, 2020 03.
Artigo em Inglês | MEDLINE | ID: mdl-32191703

RESUMO

The use of comparative genomics for functional, evolutionary, and epidemiological studies requires methods to classify gene families in terms of occurrence in a given species. These methods usually lack multivariate statistical models to infer the partitions and the optimal number of classes and don't account for genome organization. We introduce a graph structure to model pangenomes in which nodes represent gene families and edges represent genomic neighborhood. Our method, named PPanGGOLiN, partitions nodes using an Expectation-Maximization algorithm based on multivariate Bernoulli Mixture Model coupled with a Markov Random Field. This approach takes into account the topology of the graph and the presence/absence of genes in pangenomes to classify gene families into persistent, cloud, and one or several shell partitions. By analyzing the partitioned pangenome graphs of isolate genomes from 439 species and metagenome-assembled genomes from 78 species, we demonstrate that our method is effective in estimating the persistent genome. Interestingly, it shows that the shell genome is a key element to understand genome dynamics, presumably because it reflects how genes present at intermediate frequencies drive adaptation of species, and its proportion in genomes is independent of genome size. The graph-based approach proposed by PPanGGOLiN is useful to depict the overall genomic diversity of thousands of strains in a compact structure and provides an effective basis for very large scale comparative genomics. The software is freely available at https://github.com/labgem/PPanGGOLiN.


Assuntos
Genoma Bacteriano/genética , Genômica/métodos , Software , Algoritmos , Bactérias/classificação , Bactérias/genética , Análise Multivariada
6.
Appl Environ Microbiol ; 86(20)2020 10 01.
Artigo em Inglês | MEDLINE | ID: mdl-32769182

RESUMO

We sought to identify and study the antibiofilm protein secreted by the marine bacterium Pseudoalteromonas sp. strain 3J6. The latter is active against marine and terrestrial bacteria, including Pseudomonas aeruginosa clinical strains forming different biofilm types. Several amino acid sequences were obtained from the partially purified antibiofilm protein, named alterocin. The Pseudoalteromonas sp. 3J6 genome was sequenced, and a candidate alt gene was identified by comparing the genome-encoded proteins to the sequences from purified alterocin. Expressing the alt gene in another nonactive Pseudoalteromonas sp. strain, 3J3, demonstrated that it is responsible for the antibiofilm activity. Alterocin is a 139-residue protein that includes a predicted 20-residue signal sequence, which would be cleaved off upon export by the general secretion system. No sequence homology was found between alterocin and proteins of known functions. The alt gene is not part of an operon and adjacent genes do not seem related to alterocin production, immunity, or regulation, suggesting that these functions are not fulfilled by devoted proteins. During growth in liquid medium, the alt mRNA level peaked during the stationary phase. A single promoter was experimentally identified, and several inverted repeats could be binding sites for regulators. alt genes were found in about 30% of the Pseudoalteromonas genomes and in only a few instances of other marine bacteria of the Hahella and Paraglaciecola genera. Comparative genomics yielded the hypothesis that alt gene losses occurred within the Pseudoalteromonas genus. Overall, alterocin is a novel kind of antibiofilm protein of ecological and biotechnological interest.IMPORTANCE Biofilms are microbial communities that develop on solid surfaces or interfaces and are detrimental in a number of fields, including for example food industry, aquaculture, and medicine. In the latter, antibiotics are insufficient to clear biofilm infections, leading to chronic infections such as in the case of infection by Pseudomonas aeruginosa of the lungs of cystic fibrosis patients. Antibiofilm molecules are thus urgently needed to be used in conjunction with conventional antibiotics, as well as in other fields of application, especially if they are environmentally friendly molecules. Here, we describe alterocin, a novel antibiofilm protein secreted by a marine bacterium belonging to the Pseudoalteromonas genus, and its gene. Alterocin homologs were found in about 30% of Pseudoalteromonas strains, indicating that this new family of antibiofilm proteins likely plays an important albeit nonessential function in the biology of these bacteria. This study opens up the possibility of a variety of applications.


Assuntos
Antibacterianos/farmacologia , Proteínas de Bactérias/genética , Biofilmes/efeitos dos fármacos , Pseudoalteromonas/genética , Proteínas de Bactérias/biossíntese
8.
Nat Chem Biol ; 13(8): 858-866, 2017 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-28581482

RESUMO

Experimental validation of enzyme function is crucial for genome interpretation, but it remains challenging because it cannot be scaled up to accommodate the constant accumulation of genome sequences. We tackled this issue for the MetA and MetX enzyme families, phylogenetically unrelated families of acyl-L-homoserine transferases involved in L-methionine biosynthesis. Members of these families are prone to incorrect annotation because MetX and MetA enzymes are assumed to always use acetyl-CoA and succinyl-CoA, respectively. We determined the enzymatic activities of 100 enzymes from diverse species, and interpreted the results by structural classification of active sites based on protein structure modeling. We predict that >60% of the 10,000 sequences from these families currently present in databases are incorrectly annotated, and suggest that acetyl-CoA was originally the sole substrate of these isofunctional enzymes, which evolved to use exclusively succinyl-CoA in the most recent bacteria. We also uncovered a divergent subgroup of MetX enzymes in fungi that participate only in L-cysteine biosynthesis as O-succinyl-L-serine transferases.


Assuntos
Acetiltransferases/metabolismo , Evolução Molecular , Metionina/biossíntese , Acinetobacter/enzimologia , Escherichia coli/enzimologia
9.
Nucleic Acids Res ; 45(D1): D517-D528, 2017 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-27899624

RESUMO

The annotation of genomes from NGS platforms needs to be automated and fully integrated. However, maintaining consistency and accuracy in genome annotation is a challenging problem because millions of protein database entries are not assigned reliable functions. This shortcoming limits the knowledge that can be extracted from genomes and metabolic models. Launched in 2005, the MicroScope platform (http://www.genoscope.cns.fr/agc/microscope) is an integrative resource that supports systematic and efficient revision of microbial genome annotation, data management and comparative analysis. Effective comparative analysis requires a consistent and complete view of biological data, and therefore, support for reviewing the quality of functional annotation is critical. MicroScope allows users to analyze microbial (meta)genomes together with post-genomic experiment results if any (i.e. transcriptomics, re-sequencing of evolved strains, mutant collections, phenotype data). It combines tools and graphical interfaces to analyze genomes and to perform the expert curation of gene functions in a comparative context. Starting with a short overview of the MicroScope system, this paper focuses on some major improvements of the Web interface, mainly for the submission of genomic data and on original tools and pipelines that have been developed and integrated in the platform: computation of pan-genomes and prediction of biosynthetic gene clusters. Today the resource contains data for more than 6000 microbial genomes, and among the 2700 personal accounts (65% of which are now from foreign countries), 14% of the users are performing expert annotations, on at least a weekly basis, contributing to improve the quality of microbial genome annotations.


Assuntos
Bases de Dados Genéticas , Metagenoma , Metagenômica/métodos , Microbiota/genética , Biologia Computacional/métodos , Evolução Molecular , Metaboloma , Metabolômica/métodos , Família Multigênica , Polimorfismo de Nucleotídeo Único , Software
10.
BMC Bioinformatics ; 19(1): 132, 2018 04 11.
Artigo em Inglês | MEDLINE | ID: mdl-29642842

RESUMO

BACKGROUND: High quality functional annotation is essential for understanding the phenotypic consequences encoded in a genome. Despite improvements in bioinformatics methods, millions of sequences in databanks are not assigned reliable functions. The curation of protein functions in the context of biological processes is a way to evaluate and improve their annotation. RESULTS: We developed an expert system using paraconsistent logic, named GROOLS (Genomic Rule Object-Oriented Logic System), that evaluates the completeness and the consistency of predicted functions through biological processes like metabolic pathways. Using a generic and hierarchical representation of knowledge, biological processes are modeled in a graph from which observations (i.e. predictions and expectations) are propagated by rules. At the end of the reasoning, conclusions are assigned to biological process components and highlight uncertainties and inconsistencies. Results on 14 microbial organisms are presented. CONCLUSIONS: GROOLS software is designed to evaluate the overall accuracy of functional unit and pathway predictions according to organism experimental data like growth phenotypes. It assists biocurators in the functional annotation of proteins by focusing on missing or contradictory observations.


Assuntos
Algoritmos , Fenômenos Biológicos , Biologia Computacional/métodos , Genoma , Anotação de Sequência Molecular , Software , Acinetobacter/genética , Vias Biossintéticas/genética , Cisteína/biossíntese , Bases de Dados Factuais
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA