Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 5 de 5
Filtrar
Más filtros










Base de datos
Intervalo de año de publicación
2.
Bioinformatics ; 36(Suppl_2): i651-i658, 2020 12 30.
Artículo en Inglés | MEDLINE | ID: mdl-33381850

RESUMEN

MOTIVATION: Horizontal gene transfer (HGT) is a major source of variability in prokaryotic genomes. Regions of genome plasticity (RGPs) are clusters of genes located in highly variable genomic regions. Most of them arise from HGT and correspond to genomic islands (GIs). The study of those regions at the species level has become increasingly difficult with the data deluge of genomes. To date, no methods are available to identify GIs using hundreds of genomes to explore their diversity. RESULTS: We present here the panRGP method that predicts RGPs using pangenome graphs made of all available genomes for a given species. It allows the study of thousands of genomes in order to access the diversity of RGPs and to predict spots of insertions. It gave the best predictions when benchmarked along other GI detection tools against a reference dataset. In addition, we illustrated its use on metagenome assembled genomes by redefining the borders of the leuX tRNA hotspot, a well-studied spot of insertion in Escherichia coli. panRPG is a scalable and reliable tool to predict GIs and spots making it an ideal approach for large comparative studies. AVAILABILITY AND IMPLEMENTATION: The methods presented in the current work are available through the following software: https://github.com/labgem/PPanGGOLiN. Detailed results and scripts to compute the benchmark metrics are available at https://github.com/axbazin/panrgp_supdata.


Asunto(s)
Islas Genómicas , Programas Informáticos , Transferencia de Gen Horizontal , Islas Genómicas/genética , Genómica , Metagenoma
3.
Nat Commun ; 11(1): 5541, 2020 11 02.
Artículo en Inglés | MEDLINE | ID: mdl-33139723

RESUMEN

The bacterium Neisseria meningitidis causes life-threatening meningitis and sepsis. Here, we construct a complete collection of defined mutants in protein-coding genes of this organism, identifying all genes that are essential under laboratory conditions. The collection, named NeMeSys 2.0, consists of individual mutants in 1584 non-essential genes. We identify 391 essential genes, which are associated with basic functions such as expression and preservation of genome information, cell membrane structure and function, and metabolism. We use this collection to shed light on the functions of diverse genes, including a gene encoding a member of a previously unrecognised class of histidinol-phosphatases; a set of 20 genes required for type IV pili function; and several conditionally essential genes encoding antitoxins and/or immunity proteins. We expect that NeMeSys 2.0 will facilitate the phenotypic profiling of a major human bacterial pathogen.


Asunto(s)
Genes Bacterianos/genética , Genes Esenciales/genética , Mutación , Neisseria meningitidis/genética , Neisseria meningitidis/metabolismo , Fenotipo , Proteínas Bacterianas/metabolismo , Biología Computacional , Proteínas Fimbrias/genética , Proteínas Fimbrias/metabolismo , Fimbrias Bacterianas/genética , Fimbrias Bacterianas/metabolismo , Perfilación de la Expresión Génica , Regulación Bacteriana de la Expresión Génica , Genoma Bacteriano , Humanos , Neisseria meningitidis/patogenicidad
4.
PLoS Comput Biol ; 16(3): e1007732, 2020 03.
Artículo en Inglés | MEDLINE | ID: mdl-32191703

RESUMEN

The use of comparative genomics for functional, evolutionary, and epidemiological studies requires methods to classify gene families in terms of occurrence in a given species. These methods usually lack multivariate statistical models to infer the partitions and the optimal number of classes and don't account for genome organization. We introduce a graph structure to model pangenomes in which nodes represent gene families and edges represent genomic neighborhood. Our method, named PPanGGOLiN, partitions nodes using an Expectation-Maximization algorithm based on multivariate Bernoulli Mixture Model coupled with a Markov Random Field. This approach takes into account the topology of the graph and the presence/absence of genes in pangenomes to classify gene families into persistent, cloud, and one or several shell partitions. By analyzing the partitioned pangenome graphs of isolate genomes from 439 species and metagenome-assembled genomes from 78 species, we demonstrate that our method is effective in estimating the persistent genome. Interestingly, it shows that the shell genome is a key element to understand genome dynamics, presumably because it reflects how genes present at intermediate frequencies drive adaptation of species, and its proportion in genomes is independent of genome size. The graph-based approach proposed by PPanGGOLiN is useful to depict the overall genomic diversity of thousands of strains in a compact structure and provides an effective basis for very large scale comparative genomics. The software is freely available at https://github.com/labgem/PPanGGOLiN.


Asunto(s)
Genoma Bacteriano/genética , Genómica/métodos , Programas Informáticos , Algoritmos , Bacterias/clasificación , Bacterias/genética , Análisis Multivariante
5.
Nucleic Acids Res ; 48(D1): D579-D589, 2020 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-31647104

RESUMEN

Large-scale genome sequencing and the increasingly massive use of high-throughput approaches produce a vast amount of new information that completely transforms our understanding of thousands of microbial species. However, despite the development of powerful bioinformatics approaches, full interpretation of the content of these genomes remains a difficult task. Launched in 2005, the MicroScope platform (https://www.genoscope.cns.fr/agc/microscope) has been under continuous development and provides analysis for prokaryotic genome projects together with metabolic network reconstruction and post-genomic experiments allowing users to improve the understanding of gene functions. Here we present new improvements of the MicroScope user interface for genome selection, navigation and expert gene annotation. Automatic functional annotation procedures of the platform have also been updated and we added several new tools for the functional annotation of genes and genomic regions. We finally focus on new tools and pipeline developed to perform comparative analyses on hundreds of genomes based on pangenome graphs. To date, MicroScope contains data for >11 800 microbial genomes, part of which are manually curated and maintained by microbiologists (>4500 personal accounts in September 2019). The platform enables collaborative work in a rich comparative genomic context and improves community-based curation efforts.


Asunto(s)
Genes Arqueales , Genes Bacterianos , Genómica/métodos , Anotación de Secuencia Molecular/métodos , Programas Informáticos , Bases de Datos Genéticas , Redes y Vías Metabólicas
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA