Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 73
Filtrar
Más filtros

Banco de datos
Tipo del documento
Intervalo de año de publicación
1.
Proc Natl Acad Sci U S A ; 121(10): e2310852121, 2024 Mar 05.
Artículo en Inglés | MEDLINE | ID: mdl-38416678

RESUMEN

Enterococci are gut microbes of most land animals. Likely appearing first in the guts of arthropods as they moved onto land, they diversified over hundreds of millions of years adapting to evolving hosts and host diets. Over 60 enterococcal species are now known. Two species, Enterococcus faecalis and Enterococcus faecium, are common constituents of the human microbiome. They are also now leading causes of multidrug-resistant hospital-associated infection. The basis for host association of enterococcal species is unknown. To begin identifying traits that drive host association, we collected 886 enterococcal strains from widely diverse hosts, ecologies, and geographies. This identified 18 previously undescribed species expanding genus diversity by >25%. These species harbor diverse genes including toxins and systems for detoxification and resource acquisition. Enterococcus faecalis and E. faecium were isolated from diverse hosts highlighting their generalist properties. Most other species showed a more restricted distribution indicative of specialized host association. The expanded species diversity permitted the Enterococcus genus phylogeny to be viewed with unprecedented resolution, allowing features to be identified that distinguish its four deeply rooted clades, and the entry of genes associated with range expansion such as B-vitamin biosynthesis and flagellar motility to be mapped to the phylogeny. This work provides an unprecedentedly broad and deep view of the genus Enterococcus, including insights into its evolution, potential new threats to human health, and where substantial additional enterococcal diversity is likely to be found.


Asunto(s)
Enterococcus faecium , Infecciones por Bacterias Grampositivas , Animales , Humanos , Enterococcus/genética , Antibacterianos/farmacología , Enterococcus faecium/genética , Enterococcus faecalis/genética , Filogenia , Pruebas de Sensibilidad Microbiana , Farmacorresistencia Bacteriana
2.
Bioinformatics ; 40(6)2024 Jun 03.
Artículo en Inglés | MEDLINE | ID: mdl-38775729

RESUMEN

MOTIVATION: Today, we know the function of only a small fraction of the protein sequences predicted from genomic data. This problem is even more salient for bacteria, which represent some of the most phylogenetically and metabolically diverse taxa on Earth. This low rate of bacterial gene annotation is compounded by the fact that most function prediction algorithms have focused on eukaryotes, and conventional annotation approaches rely on the presence of similar sequences in existing databases. However, often there are no such sequences for novel bacterial proteins. Thus, we need improved gene function prediction methods tailored for bacteria. Recently, transformer-based language models-adopted from the natural language processing field-have been used to obtain new representations of proteins, to replace amino acid sequences. These representations, referred to as protein embeddings, have shown promise for improving annotation of eukaryotes, but there have been only limited applications on bacterial genomes. RESULTS: To predict gene functions in bacteria, we developed SAFPred, a novel synteny-aware gene function prediction tool based on protein embeddings from state-of-the-art protein language models. SAFpred also leverages the unique operon structure of bacteria through conserved synteny. SAFPred outperformed both conventional sequence-based annotation methods and state-of-the-art methods on multiple bacterial species, including for distant homolog detection, where the sequence similarity to the proteins in the training set was as low as 40%. Using SAFPred to identify gene functions across diverse enterococci, of which some species are major clinical threats, we identified 11 previously unrecognized putative novel toxins, with potential significance to human and animal health. AVAILABILITY AND IMPLEMENTATION: https://github.com/AbeelLab/safpred.


Asunto(s)
Algoritmos , Proteínas Bacterianas , Genoma Bacteriano , Proteínas Bacterianas/genética , Proteínas Bacterianas/metabolismo , Programas Informáticos , Bacterias/genética , Sintenía , Biología Computacional/métodos , Anotación de Secuencia Molecular/métodos
3.
Bioinformatics ; 39(10)2023 10 03.
Artículo en Inglés | MEDLINE | ID: mdl-37796811

RESUMEN

MOTIVATION: Plasmids are carriers for antimicrobial resistance (AMR) genes and can exchange genetic material with other structures, contributing to the spread of AMR. There is no reliable approach to identify the transfer of AMR genes across plasmids. This is mainly due to the absence of a method to assess the phylogenetic distance of plasmids, as they show large DNA sequence variability. Identifying and quantifying such transfer can provide novel insight into the role of small mobile elements and resistant plasmid regions in the spread of AMR. RESULTS: We developed SHIP, a novel method to quantify plasmid similarity based on the dynamics of plasmid evolution. This allowed us to find conserved fragments containing AMR genes in structurally different and phylogenetically distant plasmids, which is evidence for lateral transfer. Our results show that regions carrying AMR genes are highly mobilizable between plasmids through transposons, integrons, and recombination events, and contribute to the spread of AMR. Identified transferred fragments include a multi-resistant complex class 1 integron in Escherichia coli and Klebsiella pneumoniae, and a region encoding tetracycline resistance transferred through recombination in Enterococcus faecalis. AVAILABILITY AND IMPLEMENTATION: The code developed in this work is available at https://github.com/AbeelLab/plasmidHGT.


Asunto(s)
Antibacterianos , Farmacorresistencia Bacteriana , Antibacterianos/farmacología , Filogenia , Farmacorresistencia Bacteriana/genética , Plásmidos/genética , Escherichia coli/genética , Integrones/genética , Transferencia de Gen Horizontal
4.
J Immunol ; 209(8): 1555-1565, 2022 10 15.
Artículo en Inglés | MEDLINE | ID: mdl-36096642

RESUMEN

Tuberculosis (TB) remains one of the deadliest infectious diseases worldwide, posing great social and economic burden to affected countries. Novel vaccine approaches are needed to increase protective immunity against the causative agent Mycobacterium tuberculosis (Mtb) and to reduce the development of active TB disease in latently infected individuals. Donor-unrestricted T cell responses represent such novel potential vaccine targets. HLA-E-restricted T cell responses have been shown to play an important role in protection against TB and other infections, and recent studies have demonstrated that these cells can be primed in vitro. However, the identification of novel pathogen-derived HLA-E binding peptides presented by infected target cells has been limited by the lack of accurate prediction algorithms for HLA-E binding. In this study, we developed an improved HLA-E binding peptide prediction algorithm and implemented it to identify (to our knowledge) novel Mtb-derived peptides with capacity to induce CD8+ T cell activation and that were recognized by specific HLA-E-restricted T cells in Mycobacterium-exposed humans. Altogether, we present a novel algorithm for the identification of pathogen- or self-derived HLA-E-presented peptides.


Asunto(s)
Mycobacterium tuberculosis , Tuberculosis , Antígenos Bacterianos , Linfocitos T CD8-positivos , Epítopos de Linfocito T , Antígenos de Histocompatibilidad Clase I , Humanos , Péptidos , Antígenos HLA-E
5.
BMC Bioinformatics ; 24(1): 400, 2023 Oct 26.
Artículo en Inglés | MEDLINE | ID: mdl-37884897

RESUMEN

BACKGROUND: Pan-genome graphs are gaining importance in the field of bioinformatics as data structures to represent and jointly analyze multiple genomes. Compacted de Bruijn graphs are inherently suited for this purpose, as their graph topology naturally reveals similarity and divergence within the pan-genome. Most state-of-the-art pan-genome graphs are represented explicitly in terms of nodes and edges. Recently, an alternative, implicit graph representation was proposed that builds directly upon the unidirectional FM-index. As such, a memory-efficient graph data structure is obtained that inherits the FM-index' backward search functionality. However, this representation suffers from a number of shortcomings in terms of functionality and algorithmic performance. RESULTS: We present a data structure for a pan-genome, compacted de Bruijn graph that aims to address these shortcomings. It is built on the bidirectional FM-index, extending the ability of its unidirectional counterpart to navigate and search the graph in both directions. All basic graph navigation steps can be performed in constant time. Based on these features, we implement subgraph visualization as well as lossless approximate pattern matching to the graph using search schemes. We demonstrate that we can retrieve all occurrences corresponding to a read within a certain edit distance in a very efficient manner. Through a case study, we show the potential of exploiting the information embedded in the graph's topology through visualization and sequence alignment. CONCLUSIONS: We propose a memory-efficient representation of the pan-genome graph that supports subgraph visualization and lossless approximate pattern matching of reads against the graph using search schemes. The C++ source code of our software, called Nexus, is available at https://github.com/biointec/nexus under AGPL-3.0 license.


Asunto(s)
Algoritmos , Genoma , Análisis de Secuencia de ADN , Programas Informáticos , Biología Computacional
6.
BMC Genomics ; 24(1): 143, 2023 Mar 23.
Artículo en Inglés | MEDLINE | ID: mdl-36959546

RESUMEN

Genomes of four Streptomyces isolates, two putative new species (Streptomyces sp. JH14 and Streptomyces sp. JH34) and two non thaxtomin-producing pathogens (Streptomyces sp. JH002 and Streptomyces sp. JH010) isolated from potato fields in Colombia were selected to investigate their taxonomic classification, their pathogenicity, and the production of unique secondary metabolites of Streptomycetes inhabiting potato crops in this region. The average nucleotide identity (ANI) value calculated between Streptomyces sp. JH34 and its closest relatives (92.23%) classified this isolate as a new species. However, Streptomyces sp. JH14 could not be classified as a new species due to the lack of genomic data of closely related strains. Phylogenetic analysis based on 231 single-copy core genes, confirmed that the two pathogenic isolates (Streptomyces sp. JH010 and JH002) belong to Streptomyces pratensis and Streptomyces xiamenensis, respectively, are distant from the most well-known pathogenic species, and belong to two different lineages. We did not find orthogroups of protein-coding genes characteristic of scab-causing Streptomycetes shared by all known pathogenic species. Most genes involved in biosynthesis of known virulence factors are not present in the scab-causing isolates (Streptomyces sp. JH002 and Streptomyces sp. JH010). However, Tat-system substrates likely involved in pathogenicity in Streptomyces sp. JH002 and Streptomyces sp. JH010 were identified. Lastly, the presence of a putative mono-ADP-ribosyl transferase, homologous to the virulence factor scabin, was confirmed in Streptomyces sp. JH002. The described pathogenic isolates likely produce virulence factors uncommon in Streptomyces species, including a histidine phosphatase and a metalloprotease potentially produced by Streptomyces sp. JH002, and a pectinesterase, potentially produced by Streptomyces sp. JH010. Biosynthetic gene clusters (BGCs) showed the presence of clusters associated with the synthesis of medicinal compounds and BGCs potentially linked to pathogenicity in Streptomyces sp. JH010 and JH002. Interestingly, BGCs that have not been previously reported were also found. Our findings suggest that the four isolates produce novel secondary metabolites and metabolites with medicinal properties.


Asunto(s)
Solanum tuberosum , Streptomyces , Virulencia/genética , Filogenia , Factores de Virulencia/genética , Factores de Virulencia/metabolismo , Streptomyces/genética , Streptomyces/metabolismo , Genómica , Enfermedades de las Plantas
7.
Bioinformatics ; 38(24): 5352-5359, 2022 12 13.
Artículo en Inglés | MEDLINE | ID: mdl-36308461

RESUMEN

MOTIVATION: Haplotypes are the set of alleles co-occurring on a single chromosome and inherited together to the next generation. Because a monoploid reference genome loses this co-occurrence information, it has limited use in associating phenotypes with allelic combinations of genotypes. Therefore, methods to reconstruct the complete haplotypes from DNA sequencing data are crucial. Recently, several attempts have been made at haplotype reconstructions, but significant limitations remain. High-quality continuous haplotypes cannot be created reliably, particularly when there are few differences between the homologous chromosomes. RESULTS: Here, we introduce HAT, a haplotype assembly tool that exploits short and long reads along with a reference genome to reconstruct haplotypes. HAT tries to take advantage of the accuracy of short reads and the length of the long reads to reconstruct haplotypes. We tested HAT on the aneuploid yeast strain Saccharomyces pastorianus CBS1483 and multiple simulated polyploid datasets of the same strain, showing that it outperforms existing tools. AVAILABILITY AND IMPLEMENTATION: https://github.com/AbeelLab/hat/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Secuenciación de Nucleótidos de Alto Rendimiento , Comportamiento del Uso de la Herramienta , Haplotipos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Análisis de Secuencia de ADN/métodos , Alelos , Algoritmos
8.
Antonie Van Leeuwenhoek ; 116(7): 667-685, 2023 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-37156983

RESUMEN

The transformation of environmental microorganisms by extracellular DNA is an overlooked mechanism of horizontal gene transfer and evolution. It initiates the acquisition of exogenous genes and propagates antimicrobial resistance alongside vertical and conjugative transfers. We combined mixed-culture biotechnology and Hi-C sequencing to elucidate the transformation of wastewater microorganisms with a synthetic plasmid encoding GFP and kanamycin resistance genes, in the mixed culture of chemostats exposed to kanamycin at concentrations representing wastewater, gut and polluted environments (0.01-2.5-50-100 mg L-1). We found that the phylogenetically distant Gram-negative Runella (102 Hi-C links), Bosea (35), Gemmobacter (33) and Zoogloea (24) spp., and Gram-positive Microbacterium sp. (90) were transformed by the foreign plasmid, under high antibiotic exposure (50 mg L-1). In addition, the antibiotic pressure shifted the origin of aminoglycoside resistance genes from genomic DNA to mobile genetic elements on plasmids accumulating in microorganisms. These results reveal the power of Hi-C sequencing to catch and surveil the transfer of xenogenetic elements inside microbiomes.


Asunto(s)
Microbiota , Aguas Residuales , Antibacterianos/uso terapéutico , Plásmidos/genética , ADN , Transferencia de Gen Horizontal , Conjugación Genética
9.
PLoS Comput Biol ; 16(1): e1007314, 2020 01.
Artículo en Inglés | MEDLINE | ID: mdl-31971941

RESUMEN

The last decade has witnessed a remarkable increase in our ability to measure genetic information. Advancements of sequencing technologies are challenging the existing methods of data storage and analysis. While methods to cope with the data deluge are progressing, many biologists have lagged behind due to the fast pace of computational advancements and tools available to address their scientific questions. Future generations of biologists must be more computationally aware and capable. This means they should be trained to give them the computational skills to keep pace with technological developments. Here, we propose a model that bridges experimental and bioinformatics concepts using the Oxford Nanopore Technologies (ONT) sequencing platform. We provide both a guide to begin to empower the new generation of educators, scientists, and students in performing long-read assembly of bacterial and bacteriophage genomes and a standalone virtual machine containing all the required software and learning materials for the course.


Asunto(s)
Biología Computacional/educación , Secuenciación de Nanoporos , Humanos , Programas Informáticos
10.
Proc Natl Acad Sci U S A ; 115(17): 4429-4434, 2018 04 24.
Artículo en Inglés | MEDLINE | ID: mdl-29643074

RESUMEN

Many fungi are polykaryotic, containing multiple nuclei per cell. In the case of heterokaryons, there are different nuclear types within a single cell. It is unknown what the different nuclear types contribute in terms of mRNA expression levels in fungal heterokaryons. Each cell of the mushroom Agaricus bisporus contains two to 25 nuclei of two nuclear types originating from two parental strains. Using RNA-sequencing data, we assess the differential mRNA contribution of individual nuclear types and its functional impact. We studied differential expression between genes of the two nuclear types, P1 and P2, throughout mushroom development in various tissue types. P1 and P2 produced specific mRNA profiles that changed through mushroom development. Differential regulation occurred at the gene level, rather than at the locus, chromosomal, or nuclear level. P1 dominated mRNA production throughout development, and P2 showed more differentially up-regulated genes in important functional groups. In the vegetative mycelium, P2 up-regulated almost threefold more metabolism genes and carbohydrate active enzymes (cazymes) than P1, suggesting phenotypic differences in growth. We identified widespread transcriptomic variation between the nuclear types of A. bisporus Our method enables studying nucleus-specific expression, which likely influences the phenotype of a fungus in a polykaryotic stage. Our findings have a wider impact to better understand gene regulation in fungi in a heterokaryotic state. This work provides insight into the transcriptomic variation introduced by genomic nuclear separation.


Asunto(s)
Agaricus/metabolismo , Núcleo Celular/metabolismo , Regulación Fúngica de la Expresión Génica/fisiología , ARN de Hongos/biosíntesis , ARN Mensajero/biosíntesis , Regulación hacia Arriba/fisiología , Agaricus/genética , Núcleo Celular/genética , ARN de Hongos/genética , ARN Mensajero/genética , Transcriptoma/fisiología
11.
BMC Genomics ; 21(1): 80, 2020 Jan 28.
Artículo en Inglés | MEDLINE | ID: mdl-31992201

RESUMEN

BACKGROUND: Mixed infections of Mycobacterium tuberculosis and antibiotic heteroresistance continue to complicate tuberculosis (TB) diagnosis and treatment. Detection of mixed infections has been limited to molecular genotyping techniques, which lack the sensitivity and resolution to accurately estimate the multiplicity of TB infections. In contrast, whole genome sequencing offers sensitive views of the genetic differences between strains of M. tuberculosis within a sample. Although metagenomic tools exist to classify strains in a metagenomic sample, most tools have been developed for more divergent species, and therefore cannot provide the sensitivity required to disentangle strains within closely related bacterial species such as M. tuberculosis. Here we present QuantTB, a method to identify and quantify individual M. tuberculosis strains in whole genome sequencing data. QuantTB uses SNP markers to determine the combination of strains that best explain the allelic variation observed in a sample. QuantTB outputs a list of identified strains, their corresponding relative abundances, and a list of drugs for which resistance-conferring mutations (or heteroresistance) have been predicted within the sample. RESULTS: We show that QuantTB has a high degree of resolution and is capable of differentiating communities differing by less than 25 SNPs and identifying strains down to 1× coverage. Using simulated data, we found QuantTB outperformed other metagenomic strain identification tools at detecting strains and quantifying strain multiplicity. In a real-world scenario, using a dataset of 50 paired clinical isolates from a study of patients with either reinfections or relapses, we found that QuantTB could detect mixed infections and reinfections at rates concordant with a manually curated approach. CONCLUSION: QuantTB can determine infection multiplicity, identify hetero-resistance patterns, enable differentiation between relapse and re-infection, and clarify transmission events across seemingly unrelated patients - even in low-coverage (1×) samples. QuantTB outperforms existing tools and promises to serve as a valuable resource for both clinicians and researchers working with clinical TB samples.


Asunto(s)
Biología Computacional/métodos , Genoma Bacteriano , Genómica , Mycobacterium tuberculosis/genética , Tuberculosis/microbiología , Secuenciación Completa del Genoma , Algoritmos , Antituberculosos/farmacología , Bases de Datos Genéticas , Farmacorresistencia Bacteriana , Genómica/métodos , Mycobacterium tuberculosis/clasificación , Mycobacterium tuberculosis/efectos de los fármacos , Filogenia , Polimorfismo de Nucleótido Simple , Tuberculosis/tratamiento farmacológico
12.
BMC Genomics ; 20(1): 916, 2019 Dec 02.
Artículo en Inglés | MEDLINE | ID: mdl-31791228

RESUMEN

BACKGROUND: The lager brewing yeast, S. pastorianus, is a hybrid between S. cerevisiae and S. eubayanus with extensive chromosome aneuploidy. S. pastorianus is subdivided into Group 1 and Group 2 strains, where Group 2 strains have higher copy number and a larger degree of heterozygosity for S. cerevisiae chromosomes. As a result, Group 2 strains were hypothesized to have emerged from a hybridization event distinct from Group 1 strains. Current genome assemblies of S. pastorianus strains are incomplete and highly fragmented, limiting our ability to investigate their evolutionary history. RESULTS: To fill this gap, we generated a chromosome-level genome assembly of the S. pastorianus strain CBS 1483 from Oxford Nanopore MinION DNA sequencing data and analysed the newly assembled subtelomeric regions and chromosome heterozygosity. To analyse the evolutionary history of S. pastorianus strains, we developed Alpaca: a method to compute sequence similarity between genomes without assuming linear evolution. Alpaca revealed high similarities between the S. cerevisiae subgenomes of Group 1 and 2 strains, and marked differences from sequenced S. cerevisiae strains. CONCLUSIONS: Our findings suggest that Group 1 and Group 2 strains originated from a single hybridization involving a heterozygous S. cerevisiae strain, followed by different evolutionary trajectories. The clear differences between both groups may originate from a severe population bottleneck caused by the isolation of the first pure cultures. Alpaca provides a computationally inexpensive method to analyse evolutionary relationships while considering non-linear evolution such as horizontal gene transfer and sexual reproduction, providing a complementary viewpoint beyond traditional phylogenetic approaches.


Asunto(s)
Genoma Fúngico , Saccharomyces cerevisiae/genética , Saccharomyces/genética , Cerveza , Cromosomas Fúngicos , Haploidia , Secuenciación de Nucleótidos de Alto Rendimiento , Hibridación Genética , Secuenciación de Nanoporos
13.
Thorax ; 74(9): 882-889, 2019 09.
Artículo en Inglés | MEDLINE | ID: mdl-31048508

RESUMEN

BACKGROUND: While the international spread of multidrug-resistant (MDR) Mycobacterium tuberculosis strains is an acknowledged public health threat, a broad and more comprehensive examination of the global spread of MDR-tuberculosis (TB) using whole-genome sequencing has not yet been performed. METHODS: In a global dataset of 5310 M. tuberculosis whole-genome sequences isolated from five continents, we performed a phylogenetic analysis to identify and characterise clades of MDR-TB with respect to geographic dispersion. RESULTS: Extensive international dissemination of MDR-TB was observed, with identification of 32 migrant MDR-TB clades with descendants isolated in 17 unique countries. Relatively recent movement of strains from both Beijing and non-Beijing lineages indicated successful global spread of varied genetic backgrounds. Migrant MDR-TB clade members shared relatively recent common ancestry, with a median estimate of divergence of 13-27 years. Migrant extensively drug-resistant (XDR)-TB clades were not observed, although development of XDR-TB within migratory MDR-TB clades was common. CONCLUSIONS: Application of genomic techniques to investigate global MDR migration patterns revealed extensive global spread of MDR clades between countries of varying TB burden. Further expansion of genomic studies to incorporate isolates from diverse global settings into a single analysis, as well as data sharing platforms that facilitate genomic data sharing across country lines, may allow for future epidemiological analyses to monitor for international transmission of MDR-TB. In addition, efforts to perform routine whole-genome sequencing on all newly identified M. tuberculosis, like in England, will serve to better our understanding of the transmission dynamics of MDR-TB globally.


Asunto(s)
Salud Global , Mycobacterium tuberculosis/genética , Tuberculosis Resistente a Múltiples Medicamentos/genética , Tuberculosis Resistente a Múltiples Medicamentos/microbiología , Secuenciación Completa del Genoma , Humanos , Epidemiología Molecular , Filogenia
14.
Bioinformatics ; 34(17): i732-i742, 2018 09 01.
Artículo en Inglés | MEDLINE | ID: mdl-30423098

RESUMEN

Motivation: A long-standing limitation in comparative genomic studies is the dependency on a reference genome, which hinders the spectrum of genetic diversity that can be identified across a population of organisms. This is especially true in the microbial world where genome architectures can significantly vary. There is therefore a need for computational methods that can simultaneously analyze the architectures of multiple genomes without introducing bias from a reference. Results: In this article, we present Ptolemy: a novel method for studying the diversity of genome architectures-such as structural variation and pan-genomes-across a collection of microbial assemblies without the need of a reference. Ptolemy is a 'top-down' approach to compare whole genome assemblies. Genomes are represented as labeled multi-directed graphs-known as quivers-which are then merged into a single, canonical quiver by identifying 'gene anchors' via synteny analysis. The canonical quiver represents an approximate, structural alignment of all genomes in a given collection encoding structural variation across (sub-) populations within the collection. We highlight various applications of Ptolemy by analyzing structural variation and the pan-genomes of different datasets composing of Mycobacterium, Saccharomyces, Escherichia and Shigella species. Our results show that Ptolemy is flexible and can handle both conserved and highly dynamic genome architectures. Ptolemy is user-friendly-requires only FASTA-formatted assembly along with a corresponding GFF-formatted file-and resource-friendly-can align 24 genomes in ∼10 mins with four CPUs and <2 GB of RAM. Availability and implementation: Github: https://github.com/AbeelLab/ptolemy. Supplementary information: Supplementary data are available at Bioinformatics online.


Asunto(s)
Genoma Microbiano , Sintenía , Programas Informáticos
15.
Nature ; 499(7457): 178-83, 2013 Jul 11.
Artículo en Inglés | MEDLINE | ID: mdl-23823726

RESUMEN

We have taken the first steps towards a complete reconstruction of the Mycobacterium tuberculosis regulatory network based on ChIP-Seq and combined this reconstruction with system-wide profiling of messenger RNAs, proteins, metabolites and lipids during hypoxia and re-aeration. Adaptations to hypoxia are thought to have a prominent role in M. tuberculosis pathogenesis. Using ChIP-Seq combined with expression data from the induction of the same factors, we have reconstructed a draft regulatory network based on 50 transcription factors. This network model revealed a direct interconnection between the hypoxic response, lipid catabolism, lipid anabolism and the production of cell wall lipids. As a validation of this model, in response to oxygen availability we observe substantial alterations in lipid content and changes in gene expression and metabolites in corresponding metabolic pathways. The regulatory network reveals transcription factors underlying these changes, allows us to computationally predict expression changes, and indicates that Rv0081 is a regulatory hub.


Asunto(s)
Redes Reguladoras de Genes , Hipoxia/genética , Redes y Vías Metabólicas/genética , Mycobacterium tuberculosis/genética , Mycobacterium tuberculosis/metabolismo , Adaptación Fisiológica , Proteínas Bacterianas/genética , Proteínas Bacterianas/metabolismo , Sitios de Unión , Inmunoprecipitación de Cromatina , Perfilación de la Expresión Génica , Redes Reguladoras de Genes/genética , Genómica , Hipoxia/metabolismo , Metabolismo de los Lípidos/genética , Modelos Biológicos , Mycobacterium tuberculosis/efectos de los fármacos , Mycobacterium tuberculosis/fisiología , Oxígeno/farmacología , Proteolisis , ARN Mensajero/genética , ARN Mensajero/metabolismo , Reproducibilidad de los Resultados , Análisis de Secuencia de ADN , Factores de Transcripción/genética , Factores de Transcripción/metabolismo , Tuberculosis/metabolismo , Tuberculosis/microbiología
16.
Clin Infect Dis ; 64(11): 1494-1501, 2017 Jun 01.
Artículo en Inglés | MEDLINE | ID: mdl-28498943

RESUMEN

BACKGROUND.: India is home to 25% of all tuberculosis cases and the second highest number of multidrug resistant cases worldwide. However, little is known about the genetic diversity and resistance determinants of Indian Mycobacterium tuberculosis, particularly for the primary lineages found in India, lineages 1 and 3. METHODS.: We whole genome sequenced 223 randomly selected M. tuberculosis strains from 196 patients within the Tiruvallur and Madurai districts of Tamil Nadu in Southern India. Using comparative genomics, we examined genetic diversity, transmission patterns, and evolution of resistance. RESULTS.: Genomic analyses revealed (11) prevalence of strains from lineages 1 and 3, (11) recent transmission of strains among patients from the same treatment centers, (11) emergence of drug resistance within patients over time, (11) resistance gained in an order typical of strains from different lineages and geographies, (11) underperformance of known resistance-conferring mutations to explain phenotypic resistance in Indian strains relative to studies focused on other geographies, and (11) the possibility that resistance arose through mutations not previously implicated in resistance, or through infections with multiple strains that confound genotype-based prediction of resistance. CONCLUSIONS.: In addition to substantially expanding the genomic perspectives of lineages 1 and 3, sequencing and analysis of M. tuberculosis whole genomes from Southern India highlight challenges of infection control and rapid diagnosis of resistant tuberculosis using current technologies. Further studies are needed to fully explore the complement of diversity and resistance determinants within endemic M. tuberculosis populations.


Asunto(s)
Farmacorresistencia Bacteriana Múltiple/genética , Genoma Bacteriano , Mycobacterium tuberculosis/genética , Tuberculosis/diagnóstico , Tuberculosis/microbiología , Adulto , Antituberculosos/farmacología , Secuencia de Bases , Femenino , Variación Genética , Humanos , India/epidemiología , Masculino , Mutación , Mycobacterium tuberculosis/clasificación , Mycobacterium tuberculosis/efectos de los fármacos , Filogenia , Reacción en Cadena de la Polimerasa , Tuberculosis/epidemiología , Tuberculosis/transmisión
17.
Genome Res ; 24(10): 1686-97, 2014 Oct.
Artículo en Inglés | MEDLINE | ID: mdl-25024162

RESUMEN

The comprehension of protein and DNA binding in vivo is essential to understand gene regulation. Chromatin immunoprecipitation followed by sequencing (ChIP-seq) provides a global map of the regulatory binding network. Most ChIP-seq analysis tools focus on identifying binding regions from coverage enrichment. However, less work has been performed to infer the physical and regulatory details inside the enriched regions. This research extends a previous blind-deconvolution approach to develop a post-peak-calling algorithm that improves binding site resolution and predicts cooperative interactions. At the core of our new method is a physically motivated model that characterizes the binding signal as an extreme value distribution. This model suggests a mathematical framework to study physical properties of DNA shearing from the ChIP-seq coverage. The model explains the ChIP-seq coverage with two signals: The first considers DNA fragments with only a single binding event, whereas the second considers fragments with two binding events (a double-binding signal). The model incorporates motif discovery and is able to detect multiple sites in an enriched region with single-nucleotide resolution, high sensitivity, and high specificity. Our method improves peak caller sensitivity, from less than 45% up to 94%, at a false positive rate < 11% for a set of 47 experimentally validated prokaryotic sites. It also improves resolution of highly enriched regions of large-scale eukaryotic data sets. The double-binding signal provides a novel application in ChIP-seq analysis: the identification of cooperative interaction. Predictions of known cooperative binding sites show a 0.85 area under an ROC curve.


Asunto(s)
Algoritmos , Sitios de Unión , Biología Computacional/métodos , Inmunoprecipitación de Cromatina , Proteínas de Unión al ADN/metabolismo , Modelos Genéticos , Nucleótidos/metabolismo , Análisis de Secuencia de ADN
18.
J Clin Microbiol ; 55(2): 457-469, 2017 02.
Artículo en Inglés | MEDLINE | ID: mdl-27903602

RESUMEN

The emergence and spread of drug-resistant Mycobacterium tuberculosis (DR-TB) are critical global health issues. Eastern Europe has some of the highest incidences of DR-TB, particularly multidrug-resistant (MDR) and extensively drug-resistant (XDR) TB. To better understand the genetic composition and evolution of MDR- and XDR-TB in the region, we sequenced and analyzed the genomes of 138 M. tuberculosis isolates from 97 patients sampled between 2010 and 2013 in Minsk, Belarus. MDR and XDR-TB isolates were significantly more likely to belong to the Beijing lineage than to the Euro-American lineage, and known resistance-conferring loci accounted for the majority of phenotypic resistance to first- and second-line drugs in MDR and XDR-TB. Using a phylogenomic approach, we estimated that the majority of MDR-TB was due to the recent transmission of already-resistant M. tuberculosis strains rather than repeated de novo evolution of resistance within patients, while XDR-TB was acquired through both routes. Longitudinal sampling of M. tuberculosis from 34 patients with treatment failure showed that most strains persisted genetically unchanged during treatment or acquired resistance to fluoroquinolones. HIV+ patients were significantly more likely to have multiple infections over time than HIV- patients, highlighting a specific need for careful infection control in these patients. These data provide a better understanding of the genomic composition, transmission, and evolution of MDR- and XDR-TB in Belarus and will enable improved diagnostics, treatment protocols, and prognostic decision-making.


Asunto(s)
Evolución Molecular , Genoma Bacteriano , Mycobacterium tuberculosis/genética , Análisis de Secuencia de ADN , Tuberculosis Resistente a Múltiples Medicamentos/microbiología , Antituberculosos/farmacología , Transmisión de Enfermedad Infecciosa , Genotipo , Humanos , Estudios Longitudinales , Epidemiología Molecular , República de Belarús/epidemiología , Tuberculosis Resistente a Múltiples Medicamentos/epidemiología , Tuberculosis Resistente a Múltiples Medicamentos/transmisión
19.
Plant Cell ; 26(1): 210-29, 2014 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-24443518

RESUMEN

The transcriptional coactivator ANGUSTIFOLIA3 (AN3) stimulates cell proliferation during Arabidopsis thaliana leaf development, but the molecular mechanism is largely unknown. Here, we show that inducible nuclear localization of AN3 during initial leaf growth results in differential expression of important transcriptional regulators, including GROWTH REGULATING FACTORs (GRFs). Chromatin purification further revealed the presence of AN3 at the loci of GRF5, GRF6, CYTOKININ RESPONSE FACTOR2, CONSTANS-LIKE5 (COL5), HECATE1 (HEC1), and ARABIDOPSIS RESPONSE REGULATOR4 (ARR4). Tandem affinity purification of protein complexes using AN3 as bait identified plant SWITCH/SUCROSE NONFERMENTING (SWI/SNF) chromatin remodeling complexes formed around the ATPases BRAHMA (BRM) or SPLAYED. Moreover, SWI/SNF ASSOCIATED PROTEIN 73B (SWP73B) is recruited by AN3 to the promoters of GRF5, GRF3, COL5, and ARR4, and both SWP73B and BRM occupy the HEC1 promoter. Furthermore, we show that AN3 and BRM genetically interact. The data indicate that AN3 associates with chromatin remodelers to regulate transcription. In addition, modification of SWI3C expression levels increases leaf size, underlining the importance of chromatin dynamics for growth regulation. Our results place the SWI/SNF-AN3 module as a major player at the transition from cell proliferation to cell differentiation in a developing leaf.


Asunto(s)
Proteínas de Arabidopsis/fisiología , Arabidopsis/genética , Ensamble y Desensamble de Cromatina , Regulación de la Expresión Génica de las Plantas , Proteínas Represoras/fisiología , Adenosina Trifosfatasas/metabolismo , Arabidopsis/citología , Arabidopsis/crecimiento & desarrollo , Proteínas de Arabidopsis/genética , Proteínas de Arabidopsis/metabolismo , Sitios de Unión , Diferenciación Celular , Proliferación Celular , Proteínas Cromosómicas no Histona/metabolismo , Proteínas Cromosómicas no Histona/fisiología , Ciclina B/genética , Ciclina B/metabolismo , Genoma de Planta , Hojas de la Planta/citología , Hojas de la Planta/genética , Hojas de la Planta/crecimiento & desarrollo , Regiones Promotoras Genéticas , Proteínas Represoras/genética , Proteínas Represoras/metabolismo
20.
FEMS Yeast Res ; 17(7)2017 11 01.
Artículo en Inglés | MEDLINE | ID: mdl-28961779

RESUMEN

The haploid Saccharomyces cerevisiae strain CEN.PK113-7D is a popular model system for metabolic engineering and systems biology research. Current genome assemblies are based on short-read sequencing data scaffolded based on homology to strain S288C. However, these assemblies contain large sequence gaps, particularly in subtelomeric regions, and the assumption of perfect homology to S288C for scaffolding introduces bias. In this study, we obtained a near-complete genome assembly of CEN.PK113-7D using only Oxford Nanopore Technology's MinION sequencing platform. Fifteen of the 16 chromosomes, the mitochondrial genome and the 2-µm plasmid are assembled in single contigs and all but one chromosome starts or ends in a telomere repeat. This improved genome assembly contains 770 Kbp of added sequence containing 248 gene annotations in comparison to the previous assembly of CEN.PK113-7D. Many of these genes encode functions determining fitness in specific growth conditions and are therefore highly relevant for various industrial applications. Furthermore, we discovered a translocation between chromosomes III and VIII that caused misidentification of a MAL locus in the previous CEN.PK113-7D assembly. This study demonstrates the power of long-read sequencing by providing a high-quality reference assembly and annotation of CEN.PK113-7D and places a caveat on assumed genome stability of microorganisms.


Asunto(s)
Genoma Fúngico , Genómica , Secuenciación de Nucleótidos de Alto Rendimiento , Nanoporos , Saccharomyces cerevisiae/genética , Análisis de Secuencia de ADN , Cromosomas Fúngicos , Biología Computacional/métodos , Heterogeneidad Genética , Genómica/métodos , Translocación Genética
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA