Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 73
Filtrar
1.
Proc Natl Acad Sci U S A ; 121(10): e2310852121, 2024 Mar 05.
Artigo em Inglês | MEDLINE | ID: mdl-38416678

RESUMO

Enterococci are gut microbes of most land animals. Likely appearing first in the guts of arthropods as they moved onto land, they diversified over hundreds of millions of years adapting to evolving hosts and host diets. Over 60 enterococcal species are now known. Two species, Enterococcus faecalis and Enterococcus faecium, are common constituents of the human microbiome. They are also now leading causes of multidrug-resistant hospital-associated infection. The basis for host association of enterococcal species is unknown. To begin identifying traits that drive host association, we collected 886 enterococcal strains from widely diverse hosts, ecologies, and geographies. This identified 18 previously undescribed species expanding genus diversity by >25%. These species harbor diverse genes including toxins and systems for detoxification and resource acquisition. Enterococcus faecalis and E. faecium were isolated from diverse hosts highlighting their generalist properties. Most other species showed a more restricted distribution indicative of specialized host association. The expanded species diversity permitted the Enterococcus genus phylogeny to be viewed with unprecedented resolution, allowing features to be identified that distinguish its four deeply rooted clades, and the entry of genes associated with range expansion such as B-vitamin biosynthesis and flagellar motility to be mapped to the phylogeny. This work provides an unprecedentedly broad and deep view of the genus Enterococcus, including insights into its evolution, potential new threats to human health, and where substantial additional enterococcal diversity is likely to be found.


Assuntos
Enterococcus faecium , Infecções por Bactérias Gram-Positivas , Animais , Humanos , Enterococcus/genética , Antibacterianos/farmacologia , Enterococcus faecium/genética , Enterococcus faecalis/genética , Filogenia , Testes de Sensibilidade Microbiana , Farmacorresistência Bacteriana
2.
Bioinformatics ; 40(6)2024 Jun 03.
Artigo em Inglês | MEDLINE | ID: mdl-38775729

RESUMO

MOTIVATION: Today, we know the function of only a small fraction of the protein sequences predicted from genomic data. This problem is even more salient for bacteria, which represent some of the most phylogenetically and metabolically diverse taxa on Earth. This low rate of bacterial gene annotation is compounded by the fact that most function prediction algorithms have focused on eukaryotes, and conventional annotation approaches rely on the presence of similar sequences in existing databases. However, often there are no such sequences for novel bacterial proteins. Thus, we need improved gene function prediction methods tailored for bacteria. Recently, transformer-based language models-adopted from the natural language processing field-have been used to obtain new representations of proteins, to replace amino acid sequences. These representations, referred to as protein embeddings, have shown promise for improving annotation of eukaryotes, but there have been only limited applications on bacterial genomes. RESULTS: To predict gene functions in bacteria, we developed SAFPred, a novel synteny-aware gene function prediction tool based on protein embeddings from state-of-the-art protein language models. SAFpred also leverages the unique operon structure of bacteria through conserved synteny. SAFPred outperformed both conventional sequence-based annotation methods and state-of-the-art methods on multiple bacterial species, including for distant homolog detection, where the sequence similarity to the proteins in the training set was as low as 40%. Using SAFPred to identify gene functions across diverse enterococci, of which some species are major clinical threats, we identified 11 previously unrecognized putative novel toxins, with potential significance to human and animal health. AVAILABILITY AND IMPLEMENTATION: https://github.com/AbeelLab/safpred.


Assuntos
Algoritmos , Proteínas de Bactérias , Genoma Bacteriano , Proteínas de Bactérias/genética , Proteínas de Bactérias/metabolismo , Software , Bactérias/genética , Sintenia , Biologia Computacional/métodos , Anotação de Sequência Molecular/métodos
3.
Bioinformatics ; 39(10)2023 10 03.
Artigo em Inglês | MEDLINE | ID: mdl-37796811

RESUMO

MOTIVATION: Plasmids are carriers for antimicrobial resistance (AMR) genes and can exchange genetic material with other structures, contributing to the spread of AMR. There is no reliable approach to identify the transfer of AMR genes across plasmids. This is mainly due to the absence of a method to assess the phylogenetic distance of plasmids, as they show large DNA sequence variability. Identifying and quantifying such transfer can provide novel insight into the role of small mobile elements and resistant plasmid regions in the spread of AMR. RESULTS: We developed SHIP, a novel method to quantify plasmid similarity based on the dynamics of plasmid evolution. This allowed us to find conserved fragments containing AMR genes in structurally different and phylogenetically distant plasmids, which is evidence for lateral transfer. Our results show that regions carrying AMR genes are highly mobilizable between plasmids through transposons, integrons, and recombination events, and contribute to the spread of AMR. Identified transferred fragments include a multi-resistant complex class 1 integron in Escherichia coli and Klebsiella pneumoniae, and a region encoding tetracycline resistance transferred through recombination in Enterococcus faecalis. AVAILABILITY AND IMPLEMENTATION: The code developed in this work is available at https://github.com/AbeelLab/plasmidHGT.


Assuntos
Antibacterianos , Farmacorresistência Bacteriana , Antibacterianos/farmacologia , Filogenia , Farmacorresistência Bacteriana/genética , Plasmídeos/genética , Escherichia coli/genética , Integrons/genética , Transferência Genética Horizontal
4.
J Immunol ; 209(8): 1555-1565, 2022 10 15.
Artigo em Inglês | MEDLINE | ID: mdl-36096642

RESUMO

Tuberculosis (TB) remains one of the deadliest infectious diseases worldwide, posing great social and economic burden to affected countries. Novel vaccine approaches are needed to increase protective immunity against the causative agent Mycobacterium tuberculosis (Mtb) and to reduce the development of active TB disease in latently infected individuals. Donor-unrestricted T cell responses represent such novel potential vaccine targets. HLA-E-restricted T cell responses have been shown to play an important role in protection against TB and other infections, and recent studies have demonstrated that these cells can be primed in vitro. However, the identification of novel pathogen-derived HLA-E binding peptides presented by infected target cells has been limited by the lack of accurate prediction algorithms for HLA-E binding. In this study, we developed an improved HLA-E binding peptide prediction algorithm and implemented it to identify (to our knowledge) novel Mtb-derived peptides with capacity to induce CD8+ T cell activation and that were recognized by specific HLA-E-restricted T cells in Mycobacterium-exposed humans. Altogether, we present a novel algorithm for the identification of pathogen- or self-derived HLA-E-presented peptides.


Assuntos
Mycobacterium tuberculosis , Tuberculose , Antígenos de Bactérias , Linfócitos T CD8-Positivos , Epitopos de Linfócito T , Antígenos de Histocompatibilidade Classe I , Humanos , Peptídeos , Antígenos HLA-E
5.
BMC Bioinformatics ; 24(1): 400, 2023 Oct 26.
Artigo em Inglês | MEDLINE | ID: mdl-37884897

RESUMO

BACKGROUND: Pan-genome graphs are gaining importance in the field of bioinformatics as data structures to represent and jointly analyze multiple genomes. Compacted de Bruijn graphs are inherently suited for this purpose, as their graph topology naturally reveals similarity and divergence within the pan-genome. Most state-of-the-art pan-genome graphs are represented explicitly in terms of nodes and edges. Recently, an alternative, implicit graph representation was proposed that builds directly upon the unidirectional FM-index. As such, a memory-efficient graph data structure is obtained that inherits the FM-index' backward search functionality. However, this representation suffers from a number of shortcomings in terms of functionality and algorithmic performance. RESULTS: We present a data structure for a pan-genome, compacted de Bruijn graph that aims to address these shortcomings. It is built on the bidirectional FM-index, extending the ability of its unidirectional counterpart to navigate and search the graph in both directions. All basic graph navigation steps can be performed in constant time. Based on these features, we implement subgraph visualization as well as lossless approximate pattern matching to the graph using search schemes. We demonstrate that we can retrieve all occurrences corresponding to a read within a certain edit distance in a very efficient manner. Through a case study, we show the potential of exploiting the information embedded in the graph's topology through visualization and sequence alignment. CONCLUSIONS: We propose a memory-efficient representation of the pan-genome graph that supports subgraph visualization and lossless approximate pattern matching of reads against the graph using search schemes. The C++ source code of our software, called Nexus, is available at https://github.com/biointec/nexus under AGPL-3.0 license.


Assuntos
Algoritmos , Genoma , Análise de Sequência de DNA , Software , Biologia Computacional
6.
BMC Genomics ; 24(1): 143, 2023 Mar 23.
Artigo em Inglês | MEDLINE | ID: mdl-36959546

RESUMO

Genomes of four Streptomyces isolates, two putative new species (Streptomyces sp. JH14 and Streptomyces sp. JH34) and two non thaxtomin-producing pathogens (Streptomyces sp. JH002 and Streptomyces sp. JH010) isolated from potato fields in Colombia were selected to investigate their taxonomic classification, their pathogenicity, and the production of unique secondary metabolites of Streptomycetes inhabiting potato crops in this region. The average nucleotide identity (ANI) value calculated between Streptomyces sp. JH34 and its closest relatives (92.23%) classified this isolate as a new species. However, Streptomyces sp. JH14 could not be classified as a new species due to the lack of genomic data of closely related strains. Phylogenetic analysis based on 231 single-copy core genes, confirmed that the two pathogenic isolates (Streptomyces sp. JH010 and JH002) belong to Streptomyces pratensis and Streptomyces xiamenensis, respectively, are distant from the most well-known pathogenic species, and belong to two different lineages. We did not find orthogroups of protein-coding genes characteristic of scab-causing Streptomycetes shared by all known pathogenic species. Most genes involved in biosynthesis of known virulence factors are not present in the scab-causing isolates (Streptomyces sp. JH002 and Streptomyces sp. JH010). However, Tat-system substrates likely involved in pathogenicity in Streptomyces sp. JH002 and Streptomyces sp. JH010 were identified. Lastly, the presence of a putative mono-ADP-ribosyl transferase, homologous to the virulence factor scabin, was confirmed in Streptomyces sp. JH002. The described pathogenic isolates likely produce virulence factors uncommon in Streptomyces species, including a histidine phosphatase and a metalloprotease potentially produced by Streptomyces sp. JH002, and a pectinesterase, potentially produced by Streptomyces sp. JH010. Biosynthetic gene clusters (BGCs) showed the presence of clusters associated with the synthesis of medicinal compounds and BGCs potentially linked to pathogenicity in Streptomyces sp. JH010 and JH002. Interestingly, BGCs that have not been previously reported were also found. Our findings suggest that the four isolates produce novel secondary metabolites and metabolites with medicinal properties.


Assuntos
Solanum tuberosum , Streptomyces , Virulência/genética , Filogenia , Fatores de Virulência/genética , Fatores de Virulência/metabolismo , Streptomyces/genética , Streptomyces/metabolismo , Genômica , Doenças das Plantas
7.
Bioinformatics ; 38(24): 5352-5359, 2022 12 13.
Artigo em Inglês | MEDLINE | ID: mdl-36308461

RESUMO

MOTIVATION: Haplotypes are the set of alleles co-occurring on a single chromosome and inherited together to the next generation. Because a monoploid reference genome loses this co-occurrence information, it has limited use in associating phenotypes with allelic combinations of genotypes. Therefore, methods to reconstruct the complete haplotypes from DNA sequencing data are crucial. Recently, several attempts have been made at haplotype reconstructions, but significant limitations remain. High-quality continuous haplotypes cannot be created reliably, particularly when there are few differences between the homologous chromosomes. RESULTS: Here, we introduce HAT, a haplotype assembly tool that exploits short and long reads along with a reference genome to reconstruct haplotypes. HAT tries to take advantage of the accuracy of short reads and the length of the long reads to reconstruct haplotypes. We tested HAT on the aneuploid yeast strain Saccharomyces pastorianus CBS1483 and multiple simulated polyploid datasets of the same strain, showing that it outperforms existing tools. AVAILABILITY AND IMPLEMENTATION: https://github.com/AbeelLab/hat/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala , Comportamento de Utilização de Ferramentas , Haplótipos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de DNA/métodos , Alelos , Algoritmos
8.
Antonie Van Leeuwenhoek ; 116(7): 667-685, 2023 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-37156983

RESUMO

The transformation of environmental microorganisms by extracellular DNA is an overlooked mechanism of horizontal gene transfer and evolution. It initiates the acquisition of exogenous genes and propagates antimicrobial resistance alongside vertical and conjugative transfers. We combined mixed-culture biotechnology and Hi-C sequencing to elucidate the transformation of wastewater microorganisms with a synthetic plasmid encoding GFP and kanamycin resistance genes, in the mixed culture of chemostats exposed to kanamycin at concentrations representing wastewater, gut and polluted environments (0.01-2.5-50-100 mg L-1). We found that the phylogenetically distant Gram-negative Runella (102 Hi-C links), Bosea (35), Gemmobacter (33) and Zoogloea (24) spp., and Gram-positive Microbacterium sp. (90) were transformed by the foreign plasmid, under high antibiotic exposure (50 mg L-1). In addition, the antibiotic pressure shifted the origin of aminoglycoside resistance genes from genomic DNA to mobile genetic elements on plasmids accumulating in microorganisms. These results reveal the power of Hi-C sequencing to catch and surveil the transfer of xenogenetic elements inside microbiomes.


Assuntos
Microbiota , Águas Residuárias , Antibacterianos/uso terapêutico , Plasmídeos/genética , DNA , Transferência Genética Horizontal , Conjugação Genética
9.
PLoS Comput Biol ; 16(1): e1007314, 2020 01.
Artigo em Inglês | MEDLINE | ID: mdl-31971941

RESUMO

The last decade has witnessed a remarkable increase in our ability to measure genetic information. Advancements of sequencing technologies are challenging the existing methods of data storage and analysis. While methods to cope with the data deluge are progressing, many biologists have lagged behind due to the fast pace of computational advancements and tools available to address their scientific questions. Future generations of biologists must be more computationally aware and capable. This means they should be trained to give them the computational skills to keep pace with technological developments. Here, we propose a model that bridges experimental and bioinformatics concepts using the Oxford Nanopore Technologies (ONT) sequencing platform. We provide both a guide to begin to empower the new generation of educators, scientists, and students in performing long-read assembly of bacterial and bacteriophage genomes and a standalone virtual machine containing all the required software and learning materials for the course.


Assuntos
Biologia Computacional/educação , Sequenciamento por Nanoporos , Humanos , Software
10.
Proc Natl Acad Sci U S A ; 115(17): 4429-4434, 2018 04 24.
Artigo em Inglês | MEDLINE | ID: mdl-29643074

RESUMO

Many fungi are polykaryotic, containing multiple nuclei per cell. In the case of heterokaryons, there are different nuclear types within a single cell. It is unknown what the different nuclear types contribute in terms of mRNA expression levels in fungal heterokaryons. Each cell of the mushroom Agaricus bisporus contains two to 25 nuclei of two nuclear types originating from two parental strains. Using RNA-sequencing data, we assess the differential mRNA contribution of individual nuclear types and its functional impact. We studied differential expression between genes of the two nuclear types, P1 and P2, throughout mushroom development in various tissue types. P1 and P2 produced specific mRNA profiles that changed through mushroom development. Differential regulation occurred at the gene level, rather than at the locus, chromosomal, or nuclear level. P1 dominated mRNA production throughout development, and P2 showed more differentially up-regulated genes in important functional groups. In the vegetative mycelium, P2 up-regulated almost threefold more metabolism genes and carbohydrate active enzymes (cazymes) than P1, suggesting phenotypic differences in growth. We identified widespread transcriptomic variation between the nuclear types of A. bisporus Our method enables studying nucleus-specific expression, which likely influences the phenotype of a fungus in a polykaryotic stage. Our findings have a wider impact to better understand gene regulation in fungi in a heterokaryotic state. This work provides insight into the transcriptomic variation introduced by genomic nuclear separation.


Assuntos
Agaricus/metabolismo , Núcleo Celular/metabolismo , Regulação Fúngica da Expressão Gênica/fisiologia , RNA Fúngico/biossíntese , RNA Mensageiro/biossíntese , Regulação para Cima/fisiologia , Agaricus/genética , Núcleo Celular/genética , RNA Fúngico/genética , RNA Mensageiro/genética , Transcriptoma/fisiologia
11.
BMC Genomics ; 21(1): 80, 2020 Jan 28.
Artigo em Inglês | MEDLINE | ID: mdl-31992201

RESUMO

BACKGROUND: Mixed infections of Mycobacterium tuberculosis and antibiotic heteroresistance continue to complicate tuberculosis (TB) diagnosis and treatment. Detection of mixed infections has been limited to molecular genotyping techniques, which lack the sensitivity and resolution to accurately estimate the multiplicity of TB infections. In contrast, whole genome sequencing offers sensitive views of the genetic differences between strains of M. tuberculosis within a sample. Although metagenomic tools exist to classify strains in a metagenomic sample, most tools have been developed for more divergent species, and therefore cannot provide the sensitivity required to disentangle strains within closely related bacterial species such as M. tuberculosis. Here we present QuantTB, a method to identify and quantify individual M. tuberculosis strains in whole genome sequencing data. QuantTB uses SNP markers to determine the combination of strains that best explain the allelic variation observed in a sample. QuantTB outputs a list of identified strains, their corresponding relative abundances, and a list of drugs for which resistance-conferring mutations (or heteroresistance) have been predicted within the sample. RESULTS: We show that QuantTB has a high degree of resolution and is capable of differentiating communities differing by less than 25 SNPs and identifying strains down to 1× coverage. Using simulated data, we found QuantTB outperformed other metagenomic strain identification tools at detecting strains and quantifying strain multiplicity. In a real-world scenario, using a dataset of 50 paired clinical isolates from a study of patients with either reinfections or relapses, we found that QuantTB could detect mixed infections and reinfections at rates concordant with a manually curated approach. CONCLUSION: QuantTB can determine infection multiplicity, identify hetero-resistance patterns, enable differentiation between relapse and re-infection, and clarify transmission events across seemingly unrelated patients - even in low-coverage (1×) samples. QuantTB outperforms existing tools and promises to serve as a valuable resource for both clinicians and researchers working with clinical TB samples.


Assuntos
Biologia Computacional/métodos , Genoma Bacteriano , Genômica , Mycobacterium tuberculosis/genética , Tuberculose/microbiologia , Sequenciamento Completo do Genoma , Algoritmos , Antituberculosos/farmacologia , Bases de Dados Genéticas , Farmacorresistência Bacteriana , Genômica/métodos , Mycobacterium tuberculosis/classificação , Mycobacterium tuberculosis/efeitos dos fármacos , Filogenia , Polimorfismo de Nucleotídeo Único , Tuberculose/tratamento farmacológico
12.
BMC Genomics ; 20(1): 916, 2019 Dec 02.
Artigo em Inglês | MEDLINE | ID: mdl-31791228

RESUMO

BACKGROUND: The lager brewing yeast, S. pastorianus, is a hybrid between S. cerevisiae and S. eubayanus with extensive chromosome aneuploidy. S. pastorianus is subdivided into Group 1 and Group 2 strains, where Group 2 strains have higher copy number and a larger degree of heterozygosity for S. cerevisiae chromosomes. As a result, Group 2 strains were hypothesized to have emerged from a hybridization event distinct from Group 1 strains. Current genome assemblies of S. pastorianus strains are incomplete and highly fragmented, limiting our ability to investigate their evolutionary history. RESULTS: To fill this gap, we generated a chromosome-level genome assembly of the S. pastorianus strain CBS 1483 from Oxford Nanopore MinION DNA sequencing data and analysed the newly assembled subtelomeric regions and chromosome heterozygosity. To analyse the evolutionary history of S. pastorianus strains, we developed Alpaca: a method to compute sequence similarity between genomes without assuming linear evolution. Alpaca revealed high similarities between the S. cerevisiae subgenomes of Group 1 and 2 strains, and marked differences from sequenced S. cerevisiae strains. CONCLUSIONS: Our findings suggest that Group 1 and Group 2 strains originated from a single hybridization involving a heterozygous S. cerevisiae strain, followed by different evolutionary trajectories. The clear differences between both groups may originate from a severe population bottleneck caused by the isolation of the first pure cultures. Alpaca provides a computationally inexpensive method to analyse evolutionary relationships while considering non-linear evolution such as horizontal gene transfer and sexual reproduction, providing a complementary viewpoint beyond traditional phylogenetic approaches.


Assuntos
Genoma Fúngico , Saccharomyces cerevisiae/genética , Saccharomyces/genética , Cerveja , Cromossomos Fúngicos , Haploidia , Sequenciamento de Nucleotídeos em Larga Escala , Hibridização Genética , Sequenciamento por Nanoporos
13.
Thorax ; 74(9): 882-889, 2019 09.
Artigo em Inglês | MEDLINE | ID: mdl-31048508

RESUMO

BACKGROUND: While the international spread of multidrug-resistant (MDR) Mycobacterium tuberculosis strains is an acknowledged public health threat, a broad and more comprehensive examination of the global spread of MDR-tuberculosis (TB) using whole-genome sequencing has not yet been performed. METHODS: In a global dataset of 5310 M. tuberculosis whole-genome sequences isolated from five continents, we performed a phylogenetic analysis to identify and characterise clades of MDR-TB with respect to geographic dispersion. RESULTS: Extensive international dissemination of MDR-TB was observed, with identification of 32 migrant MDR-TB clades with descendants isolated in 17 unique countries. Relatively recent movement of strains from both Beijing and non-Beijing lineages indicated successful global spread of varied genetic backgrounds. Migrant MDR-TB clade members shared relatively recent common ancestry, with a median estimate of divergence of 13-27 years. Migrant extensively drug-resistant (XDR)-TB clades were not observed, although development of XDR-TB within migratory MDR-TB clades was common. CONCLUSIONS: Application of genomic techniques to investigate global MDR migration patterns revealed extensive global spread of MDR clades between countries of varying TB burden. Further expansion of genomic studies to incorporate isolates from diverse global settings into a single analysis, as well as data sharing platforms that facilitate genomic data sharing across country lines, may allow for future epidemiological analyses to monitor for international transmission of MDR-TB. In addition, efforts to perform routine whole-genome sequencing on all newly identified M. tuberculosis, like in England, will serve to better our understanding of the transmission dynamics of MDR-TB globally.


Assuntos
Saúde Global , Mycobacterium tuberculosis/genética , Tuberculose Resistente a Múltiplos Medicamentos/genética , Tuberculose Resistente a Múltiplos Medicamentos/microbiologia , Sequenciamento Completo do Genoma , Humanos , Epidemiologia Molecular , Filogenia
14.
Bioinformatics ; 34(17): i732-i742, 2018 09 01.
Artigo em Inglês | MEDLINE | ID: mdl-30423098

RESUMO

Motivation: A long-standing limitation in comparative genomic studies is the dependency on a reference genome, which hinders the spectrum of genetic diversity that can be identified across a population of organisms. This is especially true in the microbial world where genome architectures can significantly vary. There is therefore a need for computational methods that can simultaneously analyze the architectures of multiple genomes without introducing bias from a reference. Results: In this article, we present Ptolemy: a novel method for studying the diversity of genome architectures-such as structural variation and pan-genomes-across a collection of microbial assemblies without the need of a reference. Ptolemy is a 'top-down' approach to compare whole genome assemblies. Genomes are represented as labeled multi-directed graphs-known as quivers-which are then merged into a single, canonical quiver by identifying 'gene anchors' via synteny analysis. The canonical quiver represents an approximate, structural alignment of all genomes in a given collection encoding structural variation across (sub-) populations within the collection. We highlight various applications of Ptolemy by analyzing structural variation and the pan-genomes of different datasets composing of Mycobacterium, Saccharomyces, Escherichia and Shigella species. Our results show that Ptolemy is flexible and can handle both conserved and highly dynamic genome architectures. Ptolemy is user-friendly-requires only FASTA-formatted assembly along with a corresponding GFF-formatted file-and resource-friendly-can align 24 genomes in ∼10 mins with four CPUs and <2 GB of RAM. Availability and implementation: Github: https://github.com/AbeelLab/ptolemy. Supplementary information: Supplementary data are available at Bioinformatics online.


Assuntos
Genoma Microbiano , Sintenia , Software
15.
Nature ; 499(7457): 178-83, 2013 Jul 11.
Artigo em Inglês | MEDLINE | ID: mdl-23823726

RESUMO

We have taken the first steps towards a complete reconstruction of the Mycobacterium tuberculosis regulatory network based on ChIP-Seq and combined this reconstruction with system-wide profiling of messenger RNAs, proteins, metabolites and lipids during hypoxia and re-aeration. Adaptations to hypoxia are thought to have a prominent role in M. tuberculosis pathogenesis. Using ChIP-Seq combined with expression data from the induction of the same factors, we have reconstructed a draft regulatory network based on 50 transcription factors. This network model revealed a direct interconnection between the hypoxic response, lipid catabolism, lipid anabolism and the production of cell wall lipids. As a validation of this model, in response to oxygen availability we observe substantial alterations in lipid content and changes in gene expression and metabolites in corresponding metabolic pathways. The regulatory network reveals transcription factors underlying these changes, allows us to computationally predict expression changes, and indicates that Rv0081 is a regulatory hub.


Assuntos
Redes Reguladoras de Genes , Hipóxia/genética , Redes e Vias Metabólicas/genética , Mycobacterium tuberculosis/genética , Mycobacterium tuberculosis/metabolismo , Adaptação Fisiológica , Proteínas de Bactérias/genética , Proteínas de Bactérias/metabolismo , Sítios de Ligação , Imunoprecipitação da Cromatina , Perfilação da Expressão Gênica , Redes Reguladoras de Genes/genética , Genômica , Hipóxia/metabolismo , Metabolismo dos Lipídeos/genética , Modelos Biológicos , Mycobacterium tuberculosis/efeitos dos fármacos , Mycobacterium tuberculosis/fisiologia , Oxigênio/farmacologia , Proteólise , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Reprodutibilidade dos Testes , Análise de Sequência de DNA , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , Tuberculose/metabolismo , Tuberculose/microbiologia
16.
Clin Infect Dis ; 64(11): 1494-1501, 2017 Jun 01.
Artigo em Inglês | MEDLINE | ID: mdl-28498943

RESUMO

BACKGROUND.: India is home to 25% of all tuberculosis cases and the second highest number of multidrug resistant cases worldwide. However, little is known about the genetic diversity and resistance determinants of Indian Mycobacterium tuberculosis, particularly for the primary lineages found in India, lineages 1 and 3. METHODS.: We whole genome sequenced 223 randomly selected M. tuberculosis strains from 196 patients within the Tiruvallur and Madurai districts of Tamil Nadu in Southern India. Using comparative genomics, we examined genetic diversity, transmission patterns, and evolution of resistance. RESULTS.: Genomic analyses revealed (11) prevalence of strains from lineages 1 and 3, (11) recent transmission of strains among patients from the same treatment centers, (11) emergence of drug resistance within patients over time, (11) resistance gained in an order typical of strains from different lineages and geographies, (11) underperformance of known resistance-conferring mutations to explain phenotypic resistance in Indian strains relative to studies focused on other geographies, and (11) the possibility that resistance arose through mutations not previously implicated in resistance, or through infections with multiple strains that confound genotype-based prediction of resistance. CONCLUSIONS.: In addition to substantially expanding the genomic perspectives of lineages 1 and 3, sequencing and analysis of M. tuberculosis whole genomes from Southern India highlight challenges of infection control and rapid diagnosis of resistant tuberculosis using current technologies. Further studies are needed to fully explore the complement of diversity and resistance determinants within endemic M. tuberculosis populations.


Assuntos
Farmacorresistência Bacteriana Múltipla/genética , Genoma Bacteriano , Mycobacterium tuberculosis/genética , Tuberculose/diagnóstico , Tuberculose/microbiologia , Adulto , Antituberculosos/farmacologia , Sequência de Bases , Feminino , Variação Genética , Humanos , Índia/epidemiologia , Masculino , Mutação , Mycobacterium tuberculosis/classificação , Mycobacterium tuberculosis/efeitos dos fármacos , Filogenia , Reação em Cadeia da Polimerase , Tuberculose/epidemiologia , Tuberculose/transmissão
17.
Genome Res ; 24(10): 1686-97, 2014 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-25024162

RESUMO

The comprehension of protein and DNA binding in vivo is essential to understand gene regulation. Chromatin immunoprecipitation followed by sequencing (ChIP-seq) provides a global map of the regulatory binding network. Most ChIP-seq analysis tools focus on identifying binding regions from coverage enrichment. However, less work has been performed to infer the physical and regulatory details inside the enriched regions. This research extends a previous blind-deconvolution approach to develop a post-peak-calling algorithm that improves binding site resolution and predicts cooperative interactions. At the core of our new method is a physically motivated model that characterizes the binding signal as an extreme value distribution. This model suggests a mathematical framework to study physical properties of DNA shearing from the ChIP-seq coverage. The model explains the ChIP-seq coverage with two signals: The first considers DNA fragments with only a single binding event, whereas the second considers fragments with two binding events (a double-binding signal). The model incorporates motif discovery and is able to detect multiple sites in an enriched region with single-nucleotide resolution, high sensitivity, and high specificity. Our method improves peak caller sensitivity, from less than 45% up to 94%, at a false positive rate < 11% for a set of 47 experimentally validated prokaryotic sites. It also improves resolution of highly enriched regions of large-scale eukaryotic data sets. The double-binding signal provides a novel application in ChIP-seq analysis: the identification of cooperative interaction. Predictions of known cooperative binding sites show a 0.85 area under an ROC curve.


Assuntos
Algoritmos , Sítios de Ligação , Biologia Computacional/métodos , Imunoprecipitação da Cromatina , Proteínas de Ligação a DNA/metabolismo , Modelos Genéticos , Nucleotídeos/metabolismo , Análise de Sequência de DNA
18.
J Clin Microbiol ; 55(2): 457-469, 2017 02.
Artigo em Inglês | MEDLINE | ID: mdl-27903602

RESUMO

The emergence and spread of drug-resistant Mycobacterium tuberculosis (DR-TB) are critical global health issues. Eastern Europe has some of the highest incidences of DR-TB, particularly multidrug-resistant (MDR) and extensively drug-resistant (XDR) TB. To better understand the genetic composition and evolution of MDR- and XDR-TB in the region, we sequenced and analyzed the genomes of 138 M. tuberculosis isolates from 97 patients sampled between 2010 and 2013 in Minsk, Belarus. MDR and XDR-TB isolates were significantly more likely to belong to the Beijing lineage than to the Euro-American lineage, and known resistance-conferring loci accounted for the majority of phenotypic resistance to first- and second-line drugs in MDR and XDR-TB. Using a phylogenomic approach, we estimated that the majority of MDR-TB was due to the recent transmission of already-resistant M. tuberculosis strains rather than repeated de novo evolution of resistance within patients, while XDR-TB was acquired through both routes. Longitudinal sampling of M. tuberculosis from 34 patients with treatment failure showed that most strains persisted genetically unchanged during treatment or acquired resistance to fluoroquinolones. HIV+ patients were significantly more likely to have multiple infections over time than HIV- patients, highlighting a specific need for careful infection control in these patients. These data provide a better understanding of the genomic composition, transmission, and evolution of MDR- and XDR-TB in Belarus and will enable improved diagnostics, treatment protocols, and prognostic decision-making.


Assuntos
Evolução Molecular , Genoma Bacteriano , Mycobacterium tuberculosis/genética , Análise de Sequência de DNA , Tuberculose Resistente a Múltiplos Medicamentos/microbiologia , Antituberculosos/farmacologia , Transmissão de Doença Infecciosa , Genótipo , Humanos , Estudos Longitudinais , Epidemiologia Molecular , República de Belarus/epidemiologia , Tuberculose Resistente a Múltiplos Medicamentos/epidemiologia , Tuberculose Resistente a Múltiplos Medicamentos/transmissão
19.
Plant Cell ; 26(1): 210-29, 2014 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-24443518

RESUMO

The transcriptional coactivator ANGUSTIFOLIA3 (AN3) stimulates cell proliferation during Arabidopsis thaliana leaf development, but the molecular mechanism is largely unknown. Here, we show that inducible nuclear localization of AN3 during initial leaf growth results in differential expression of important transcriptional regulators, including GROWTH REGULATING FACTORs (GRFs). Chromatin purification further revealed the presence of AN3 at the loci of GRF5, GRF6, CYTOKININ RESPONSE FACTOR2, CONSTANS-LIKE5 (COL5), HECATE1 (HEC1), and ARABIDOPSIS RESPONSE REGULATOR4 (ARR4). Tandem affinity purification of protein complexes using AN3 as bait identified plant SWITCH/SUCROSE NONFERMENTING (SWI/SNF) chromatin remodeling complexes formed around the ATPases BRAHMA (BRM) or SPLAYED. Moreover, SWI/SNF ASSOCIATED PROTEIN 73B (SWP73B) is recruited by AN3 to the promoters of GRF5, GRF3, COL5, and ARR4, and both SWP73B and BRM occupy the HEC1 promoter. Furthermore, we show that AN3 and BRM genetically interact. The data indicate that AN3 associates with chromatin remodelers to regulate transcription. In addition, modification of SWI3C expression levels increases leaf size, underlining the importance of chromatin dynamics for growth regulation. Our results place the SWI/SNF-AN3 module as a major player at the transition from cell proliferation to cell differentiation in a developing leaf.


Assuntos
Proteínas de Arabidopsis/fisiologia , Arabidopsis/genética , Montagem e Desmontagem da Cromatina , Regulação da Expressão Gênica de Plantas , Proteínas Repressoras/fisiologia , Adenosina Trifosfatases/metabolismo , Arabidopsis/citologia , Arabidopsis/crescimento & desenvolvimento , Proteínas de Arabidopsis/genética , Proteínas de Arabidopsis/metabolismo , Sítios de Ligação , Diferenciação Celular , Proliferação de Células , Proteínas Cromossômicas não Histona/metabolismo , Proteínas Cromossômicas não Histona/fisiologia , Ciclina B/genética , Ciclina B/metabolismo , Genoma de Planta , Folhas de Planta/citologia , Folhas de Planta/genética , Folhas de Planta/crescimento & desenvolvimento , Regiões Promotoras Genéticas , Proteínas Repressoras/genética , Proteínas Repressoras/metabolismo
20.
FEMS Yeast Res ; 17(7)2017 11 01.
Artigo em Inglês | MEDLINE | ID: mdl-28961779

RESUMO

The haploid Saccharomyces cerevisiae strain CEN.PK113-7D is a popular model system for metabolic engineering and systems biology research. Current genome assemblies are based on short-read sequencing data scaffolded based on homology to strain S288C. However, these assemblies contain large sequence gaps, particularly in subtelomeric regions, and the assumption of perfect homology to S288C for scaffolding introduces bias. In this study, we obtained a near-complete genome assembly of CEN.PK113-7D using only Oxford Nanopore Technology's MinION sequencing platform. Fifteen of the 16 chromosomes, the mitochondrial genome and the 2-µm plasmid are assembled in single contigs and all but one chromosome starts or ends in a telomere repeat. This improved genome assembly contains 770 Kbp of added sequence containing 248 gene annotations in comparison to the previous assembly of CEN.PK113-7D. Many of these genes encode functions determining fitness in specific growth conditions and are therefore highly relevant for various industrial applications. Furthermore, we discovered a translocation between chromosomes III and VIII that caused misidentification of a MAL locus in the previous CEN.PK113-7D assembly. This study demonstrates the power of long-read sequencing by providing a high-quality reference assembly and annotation of CEN.PK113-7D and places a caveat on assumed genome stability of microorganisms.


Assuntos
Genoma Fúngico , Genômica , Sequenciamento de Nucleotídeos em Larga Escala , Nanoporos , Saccharomyces cerevisiae/genética , Análise de Sequência de DNA , Cromossomos Fúngicos , Biologia Computacional/métodos , Heterogeneidade Genética , Genômica/métodos , Translocação Genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA