Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 11 de 11
Filtrar
1.
Nucleic Acids Res ; 51(D1): D760-D766, 2023 01 06.
Artículo en Inglés | MEDLINE | ID: mdl-36408900

RESUMEN

The interpretation of genomic, transcriptomic and other microbial 'omics data is highly dependent on the availability of well-annotated genomes. As the number of publicly available microbial genomes continues to increase exponentially, the need for quality control and consistent annotation is becoming critical. We present proGenomes3, a database of 907 388 high-quality genomes containing 4 billion genes that passed stringent criteria and have been consistently annotated using multiple functional and taxonomic databases including mobile genetic elements and biosynthetic gene clusters. proGenomes3 encompasses 41 171 species-level clusters, defined based on universal single copy marker genes, for which pan-genomes and contextual habitat annotations are provided. The database is available at http://progenomes.embl.de/.


Asunto(s)
Genoma , Células Procariotas , Bases de Datos Genéticas , Genómica , Anotación de Secuencia Molecular , Bacterias/clasificación , Bacterias/genética
2.
Nucleic Acids Res ; 50(6): 3155-3168, 2022 04 08.
Artículo en Inglés | MEDLINE | ID: mdl-35323968

RESUMEN

Prokaryotic Mobile Genetic Elements (MGEs) such as transposons, integrons, phages and plasmids, play important roles in prokaryotic evolution and in the dispersal of cargo functions like antibiotic resistance. However, each of these MGE types is usually annotated and analysed individually, hampering a global understanding of phylogenetic and environmental patterns of MGE dispersal. We thus developed a computational framework that captures diverse MGE types, their cargos and MGE-mediated horizontal transfer events, using recombinases as ubiquitous MGE marker genes and pangenome information for MGE boundary estimation. Applied to ∼84k genomes with habitat annotation, we mapped 2.8 million MGE-specific recombinases to six operational MGE types, which together contain on average 13% of all the genes in a genome. Transposable elements (TEs) dominated across all taxa (∼1.7 million occurrences), outnumbering phages and phage-like elements (<0.4 million). We recorded numerous MGE-mediated horizontal transfer events across diverse phyla and habitats involving all MGE types, disentangled and quantified the extent of hitchhiking of TEs (17%) and integrons (63%) with other MGE categories, and established TEs as dominant carriers of antibiotic resistance genes. We integrated all these findings into a resource (proMGE.embl.de), which should facilitate future studies on the large mobile part of genomes and its horizontal dispersal.


Asunto(s)
Bacterias , Bacteriófagos , Bacterias/genética , Bacteriófagos/genética , Elementos Transponibles de ADN/genética , Farmacorresistencia Microbiana/genética , Transferencia de Gen Horizontal , Filogenia , Recombinasas/genética
3.
Nature ; 601(7892): 252-256, 2022 01.
Artículo en Inglés | MEDLINE | ID: mdl-34912116

RESUMEN

Microbial genes encode the majority of the functional repertoire of life on earth. However, despite increasing efforts in metagenomic sequencing of various habitats1-3, little is known about the distribution of genes across the global biosphere, with implications for human and planetary health. Here we constructed a non-redundant gene catalogue of 303 million species-level genes (clustered at 95% nucleotide identity) from 13,174 publicly available metagenomes across 14 major habitats and use it to show that most genes are specific to a single habitat. The small fraction of genes found in multiple habitats is enriched in antibiotic-resistance genes and markers for mobile genetic elements. By further clustering these species-level genes into 32 million protein families, we observed that a small fraction of these families contain the majority of the genes (0.6% of families account for 50% of the genes). The majority of species-level genes and protein families are rare. Furthermore, species-level genes, and in particular the rare ones, show low rates of positive (adaptive) selection, supporting a model in which most genetic variability observed within each protein family is neutral or nearly neutral.


Asunto(s)
Metagenoma , Metagenómica , Antibacterianos/farmacología , Farmacorresistencia Microbiana , Ecosistema , Humanos , Metagenoma/genética
4.
Genome Biol ; 22(1): 178, 2021 06 13.
Artículo en Inglés | MEDLINE | ID: mdl-34120611

RESUMEN

Genomes are critical units in microbiology, yet ascertaining quality in prokaryotic genome assemblies remains a formidable challenge. We present GUNC (the Genome UNClutterer), a tool that accurately detects and quantifies genome chimerism based on the lineage homogeneity of individual contigs using a genome's full complement of genes. GUNC complements existing approaches by targeting previously underdetected types of contamination: we conservatively estimate that 5.7% of genomes in GenBank, 5.2% in RefSeq, and 15-30% of pre-filtered "high-quality" metagenome-assembled genomes in recent studies are undetected chimeras. GUNC provides a fast and robust tool to substantially improve prokaryotic genome quality.


Asunto(s)
Quimerismo , Biología Computacional/métodos , Genoma Bacteriano , Metagenoma , Proteobacteria/genética , Programas Informáticos , Mapeo Contig , Metagenómica/métodos , Filogenia , Células Procariotas/citología , Células Procariotas/metabolismo
5.
Nucleic Acids Res ; 49(D1): D458-D460, 2021 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-33104802

RESUMEN

SMART (Simple Modular Architecture Research Tool) is a web resource (https://smart.embl.de) for the identification and annotation of protein domains and the analysis of protein domain architectures. SMART version 9 contains manually curated models for more than 1300 protein domains, with a topical set of 68 new models added since our last update article (1). All the new models are for diverse recombinase families and subfamilies and as a set they provide a comprehensive overview of mobile element recombinases namely transposase, integrase, relaxase, resolvase, cas1 casposase and Xer like cellular recombinase. Further updates include the synchronization of the underlying protein databases with UniProt (2), Ensembl (3) and STRING (4), greatly increasing the total number of annotated domains and other protein features available in architecture analysis mode. Furthermore, SMART's vector-based protein display engine has been extended and updated to use the latest web technologies and the domain architecture analysis components have been optimized to handle the increased number of protein features available.


Asunto(s)
Proteínas/química , Programas Informáticos , Bases de Datos de Proteínas , Humanos , Internet , Anotación de Secuencia Molecular , Filogenia , Unión Proteica , Estructura Terciaria de Proteína
6.
mSphere ; 2(6)2017.
Artículo en Inglés | MEDLINE | ID: mdl-29205228

RESUMEN

Horizontal gene transfer is a major driving force behind the genomic diversity seen in prokaryotes. The cryptic rac prophage in Escherichia coli K-12 carries the gene for a putative transcription factor RacR, whose deletion is lethal. We have shown that the essentiality of racR in E. coli K-12 is attributed to its role in transcriptionally repressing toxin gene(s) called ydaS and ydaT, which are adjacent to and coded divergently to racR. IMPORTANCE Transcription factors in the bacterium E. coli are rarely essential, and when they are essential, they are largely toxin-antitoxin systems. While studying transcription factors encoded in horizontally acquired regions in E. coli, we realized that the protein RacR, a putative transcription factor encoded by a gene on the rac prophage, is an essential protein. Here, using genetics, biochemistry, and bioinformatics, we show that its essentiality derives from its role as a transcriptional repressor of the ydaS and ydaT genes, whose products are toxic to the cell. Unlike type II toxin-antitoxin systems in which transcriptional regulation involves complexes of the toxin and antitoxin, repression by RacR is sufficient to keep ydaS transcriptionally silent.

7.
G3 (Bethesda) ; 6(6): 1597-606, 2016 06 01.
Artículo en Inglés | MEDLINE | ID: mdl-27172194

RESUMEN

Genomes evolve not only in base sequence but also in terms of their architecture, defined by gene organization and chromosome topology. Whereas genome sequence data inform us about the changes in base sequences for a large variety of organisms, the study of chromosome topology is restricted to a few model organisms studied using microscopy and chromosome conformation capture techniques. Here, we exploit whole genome sequence data to study the link between gene organization and chromosome topology in bacteria. Using comparative genomics across ∼250 pairs of closely related bacteria we show that: (a) many organisms show a high degree of interreplichore translocations throughout the chromosome and not limited to the inversion-prone terminus (ter) or the origin of replication (oriC); (b) translocation maps may reflect chromosome topologies; and (c) symmetric interreplichore translocations do not disrupt the distance of a gene from oriC or affect gene expression states or strand biases in gene densities. In summary, we suggest that translocation maps might be a first line in defining a gross chromosome topology given a pair of closely related genome sequences.


Asunto(s)
Bacterias/genética , Cromosomas Bacterianos , Replicación del ADN , Genoma Bacteriano , Genómica , Bacterias/clasificación , Mapeo Cromosómico , Evolución Molecular , Genes Bacterianos , Genómica/métodos , Recombinación Genética , Translocación Genética
8.
PLoS One ; 8(3): e60013, 2013.
Artículo en Inglés | MEDLINE | ID: mdl-23544121

RESUMEN

Staphylococcus aureus is a major human pathogen, first recognized as a leading cause of hospital-acquired infections. Community-associated S. aureus (CA-SA) pose a greater threat due to increase in severity of infection and disease among children and healthy adults. CA-SA strains in India are genetically diverse, among which is the sequence type (ST) 772, which has now spread to Australia, Europe and Japan. Towards understanding the genetic characteristics of ST772, we obtained draft genome sequences of five relevant clinical isolates and studied the properties of their PVL-carrying prophages, whose presence is a defining hallmark of CA-SA. We show that this is a novel prophage, which carries the structural genes of the hlb-carrying prophage and includes the sea enterotoxin. This architecture probably emerged early within the ST772 lineage, at least in India. The sea gene, unique to ST772 PVL, despite having promoter sequence characteristics typical of low expression, appears to be highly expressed during early phase of growth in laboratory conditions. We speculate that this might be a consequence of its novel sequence context. The crippled nature of the hlb-converting prophage in ST772 suggests that widespread mobility of the sea enterotoxin might be a selective force behind its 'transfer' to the PVL prophage. Wild type ST772 strains induced strong proliferative responses as well as high cytotoxic activity against neutrophils, likely mediated by superantigen SEA and the PVL toxin respectively. Both proliferation and cytotoxicity were markedly reduced in a cured ST772 strain indicating the impact of the phage on virulence. The presence of SEA alongside the genes for the immune system-modulating PVL toxin may contribute to the success and virulence of ST772.


Asunto(s)
Toxinas Bacterianas/metabolismo , Enterotoxinas/metabolismo , Exotoxinas/metabolismo , Genoma Bacteriano/genética , Leucocidinas/metabolismo , Profagos/metabolismo , Análisis de Secuencia de ADN , Staphylococcus aureus/genética , Staphylococcus aureus/virología , Toxinas Bacterianas/genética , Secuencia de Bases , Enterotoxinas/genética , Exotoxinas/genética , Proteínas Hemolisinas/genética , Proteínas Hemolisinas/metabolismo , Humanos , India , Leucocidinas/genética , Datos de Secuencia Molecular , Profagos/ultraestructura , ARN Mensajero/genética , ARN Mensajero/metabolismo , Esfingomielina Fosfodiesterasa/genética , Esfingomielina Fosfodiesterasa/metabolismo , Staphylococcus aureus/aislamiento & purificación , Staphylococcus aureus/ultraestructura
9.
J Bacteriol ; 194(24): 6946-7, 2012 Dec.
Artículo en Inglés | MEDLINE | ID: mdl-23209210

RESUMEN

We report the draft genome sequence of methicillin-resistant Staphylococcus aureus (MRSA) strain ST672, an emerging disease clone in India, from a septicemia patient. The genome size is about 2.82 Mb with 2,485 open reading frames (ORFs). The staphylococcal cassette chromosome mec (SCCmec) element (type V) and immune evasion cluster appear to be different from those of strain ST772 on preliminary examination.


Asunto(s)
Bacteriemia/microbiología , Genoma Bacteriano , Staphylococcus aureus Resistente a Meticilina/genética , Infecciones Estafilocócicas/microbiología , Proteínas Bacterianas/genética , Técnicas de Tipificación Bacteriana , Secuencia de Bases , ADN Bacteriano/genética , Humanos , Staphylococcus aureus Resistente a Meticilina/aislamiento & purificación , Datos de Secuencia Molecular , Sistemas de Lectura Abierta/genética , Proteínas de Unión a las Penicilinas , Análisis de Secuencia de ADN
10.
J Bacteriol ; 194(14): 3727-8, 2012 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-22740659

RESUMEN

We report the draft genome sequence of an ST772 Staphylococcus aureus disease isolate carrying staphylococcal cassette chromosome mec (SCCmec) type V from a pyomyositis patient. Our de novo short read assembly is ∼2.8 Mb and encodes a unique Panton-Valentine leukocidin (PVL) phage with structural genes similar to those of ϕ7247PVL and novel lysogenic genes at the N termini.


Asunto(s)
Genoma Bacteriano , Staphylococcus aureus/genética , Clonación Molecular , India/epidemiología , Datos de Secuencia Molecular , Piomiositis/epidemiología , Piomiositis/microbiología , Infecciones Estafilocócicas/epidemiología , Infecciones Estafilocócicas/microbiología , Staphylococcus aureus/efectos de los fármacos
11.
Nat Commun ; 3: 886, 2012 Jun 06.
Artículo en Inglés | MEDLINE | ID: mdl-22673913

RESUMEN

DNA cytosine methylation regulates gene expression in mammals. In bacteria, its role in gene expression and genome architecture is less understood. Here we perform high-throughput sequencing of bisulfite-treated genomic DNA from Escherichia coli K12 to describe, for the first time, the extent of cytosine methylation of bacterial DNA at single-base resolution. Whereas most target sites (C(m)CWGG) are fully methylated in stationary phase cells, many sites with an extended CC(m)CWGG motif are only partially methylated in exponentially growing cells. We speculate that these partially methylated sites may be selected, as these are slightly correlated with the risk of spontaneous, non-synonymous conversion of methylated cytosines to thymines. Microarray analysis in a cytosine methylation-deficient mutant of E. coli shows increased expression of the stress response sigma factor RpoS and many of its targets in stationary phase. Thus, DNA cytosine methylation is a regulator of stationary phase gene expression in E. coli.


Asunto(s)
Citosina/metabolismo , Escherichia coli/genética , Metilación de ADN/fisiología , Regulación Bacteriana de la Expresión Génica/genética , Transcripción Genética/genética
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA