Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 8 de 8
Filtrar
Más filtros

Banco de datos
Tipo de estudio
País/Región como asunto
Tipo del documento
Intervalo de año de publicación
1.
Proc Natl Acad Sci U S A ; 120(28): e2301394120, 2023 07 11.
Artículo en Inglés | MEDLINE | ID: mdl-37399390

RESUMEN

Phase variation induced by insertions and deletions (INDELs) in genomic homopolymeric tracts (HT) can silence and regulate genes in pathogenic bacteria, but this process is not characterized in MTBC (Mycobacterium tuberculosis complex) adaptation. We leverage 31,428 diverse clinical isolates to identify genomic regions including phase-variants under positive selection. Of 87,651 INDEL events that emerge repeatedly across the phylogeny, 12.4% are phase-variants within HTs (0.02% of the genome by length). We estimated the in-vitro frameshift rate in a neutral HT at 100× the neutral substitution rate at [Formula: see text] frameshifts/HT/year. Using neutral evolution simulations, we identified 4,098 substitutions and 45 phase-variants to be putatively adaptive to MTBC (P < 0.002). We experimentally confirm that a putatively adaptive phase-variant alters the expression of espA, a critical mediator of ESX-1-dependent virulence. Our evidence supports the hypothesis that phase variation in the ESX-1 system of MTBC can act as a toggle between antigenicity and survival in the host.


Asunto(s)
Mycobacterium tuberculosis , Mycobacterium tuberculosis/genética , Variación de la Fase , Genómica , Adaptación Fisiológica/genética , Virulencia/genética , Filogenia , Genoma Bacteriano
2.
Bioinformatics ; 2024 Jul 23.
Artículo en Inglés | MEDLINE | ID: mdl-39041615

RESUMEN

MOTIVATION: The gene content regulates the biology of an organism. It varies between species and between individuals of the same species. Although tools have been developed to identify gene content changes in bacterial genomes, none is applicable to collections of large eukaryotic genomes such as the human pangenome. RESULTS: We developed pangene, a computational tool to identify gene orientation, gene order and gene copy-number changes in a collection of genomes. Pangene aligns a set of input protein sequences to the genomes, resolves redundancies between protein sequences and constructs a gene graph with each genome represented as a walk in the graph. It additionally finds subgraphs, which we call bibubbles, that capture gene content changes. Applied to the human pangenome, pangene identifies known gene-level variations and reveals complex haplotypes that are not well studied before. Pangene also works with high-quality bacterial pangenome and reports similar numbers of core and accessory genes in comparison to existing tools. AVAILABILITY AND IMPLEMENTATION: Source code at https://github.com/lh3/pangene; pre-built pangene graphs can be downloaded from https://zenodo.org/records/8118576 and visualized at https://pangene.bioinweb.org.

3.
Clin Infect Dis ; 78(6): 1677-1679, 2024 Jun 14.
Artículo en Inglés | MEDLINE | ID: mdl-38636953

RESUMEN

Active case finding leveraging new molecular diagnostics and chest X-rays with automated interpretation algorithms is increasingly being developed for high-risk populations to drive down tuberculosis incidence. We consider why such an approach did not deliver a decline in tuberculosis prevalence in Brazilian prison populations and what to consider next.


Asunto(s)
Tamizaje Masivo , Tuberculosis , Humanos , Brasil/epidemiología , Tamizaje Masivo/métodos , Tuberculosis/diagnóstico , Tuberculosis/epidemiología , Prevalencia , Prisioneros , Incidencia , Prisiones
4.
Bioinformatics ; 38(7): 1781-1787, 2022 03 28.
Artículo en Inglés | MEDLINE | ID: mdl-35020793

RESUMEN

MOTIVATION: Short-read whole-genome sequencing (WGS) is a vital tool for clinical applications and basic research. Genetic divergence from the reference genome, repetitive sequences and sequencing bias reduces the performance of variant calling using short-read alignment, but the loss in recall and specificity has not been adequately characterized. To benchmark short-read variant calling, we used 36 diverse clinical Mycobacterium tuberculosis (Mtb) isolates dually sequenced with Illumina short-reads and PacBio long-reads. We systematically studied the short-read variant calling accuracy and the influence of sequence uniqueness, reference bias and GC content. RESULTS: Reference-based Illumina variant calling demonstrated a maximum recall of 89.0% and minimum precision of 98.5% across parameters evaluated. The approach that maximized variant recall while still maintaining high precision (<99%) was tuning the mapping quality filtering threshold, i.e. confidence of the read mapping (recall = 85.8%, precision = 99.1%, MQ ≥ 40). Additional masking of repetitive sequence content is an alternative conservative approach to variant calling that increases precision at cost to recall (recall = 70.2%, precision = 99.6%, MQ ≥ 40). Of the genomic positions typically excluded for Mtb, 68% are accurately called using Illumina WGS including 52/168 PE/PPE genes (34.5%). From these results, we present a refined list of low confidence regions across the Mtb genome, which we found to frequently overlap with regions with structural variation, low sequence uniqueness and low sequencing coverage. Our benchmarking results have broad implications for the use of WGS in the study of Mtb biology, inference of transmission in public health surveillance systems and more generally for WGS applications in other organisms. AVAILABILITY AND IMPLEMENTATION: All relevant code is available at https://github.com/farhat-lab/mtb-illumina-wgs-evaluation. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Mycobacterium tuberculosis , Tuberculosis , Humanos , Benchmarking , Mycobacterium tuberculosis/genética , Programas Informáticos , Análisis de Secuencia de ADN/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos
5.
ArXiv ; 2024 May 29.
Artículo en Inglés | MEDLINE | ID: mdl-38463499

RESUMEN

Motivation: The gene content regulates the biology of an organism. It varies between species and between individuals of the same species. Although tools have been developed to identify gene content changes in bacterial genomes, none is applicable to collections of large eukaryotic genomes such as the human pangenome. Results: We developed pangene, a computational tool to identify gene orientation, gene order and gene copy-number changes in a collection of genomes. Pangene aligns a set of input protein sequences to the genomes, resolves redundancies between protein sequences and constructs a gene graph with each genome represented as a walk in the graph. It additionally finds subgraphs, which we call bibubbles, that capture gene content changes. Applied to the human pangenome, pangene identifies known gene-level variations and reveals complex haplotypes that are not well studied before. Pangene also works with high-quality bacterial pangenome and reports similar numbers of core and accessory genes in comparison to existing tools. Availability and implementation: Source code at https://github.com/lh3/pangene; pre-built pangene graphs can be downloaded from https://zenodo.org/records/8118576 and visualized at https://pangene.bioinweb.org.

6.
PLOS Digit Health ; 3(8): e0000566, 2024 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-39178177

RESUMEN

Automated data transmission from diagnostic instrument networks to a central database at the Ministries of Health has the potential of providing real-time quality data not only on diagnostic instrument performance, but also continuous disease surveillance and patient care. We aimed at sharing how a locally developed novel diagnostic connectivity solution channels actionable data from diagnostic instruments to the national dashboards for disease control in Uganda between May 2022 and May 2023. The diagnostic connectivity solution was successfully configured on a selected network of multiplexing diagnostic instruments at 260 sites in Uganda, providing a layered access of data. Of these, 909,674 test results were automatically collected from 269 "GeneXpert" machines, 5597 test results from 28 "Truenat" and >12,000 were from 3 digital x-ray devices to different stakeholder levels to ensure optimal use of data for their intended purpose. The government and relevant stakeholders are empowered with usable and actionable data from the diagnostic instruments. The successful implementation of the diagnostic connectivity solution depended on some key operational strategies namely; sustained internet connectivity and short message services, stakeholder engagement, a strong in-country laboratory coordination network, human resource capacity building, establishing a network for the diagnostic instruments, and integration with existing health data collection tools. Poor bandwidth at some locations was a major hindrance for the successful implementation of the connectivity solution. Maintaining stakeholder engagement at the clinical level is key for sustaining diagnostic data connectivity. The locally developed diagnostic connectivity solution as a digital health technology offers the chance to collect high-quality data on a number of parameters for disease control, including error analysis, thereby strengthening the quality of data from the networked diagnostic sites to relevant stakeholders.

7.
Elife ; 102021 02 01.
Artículo en Inglés | MEDLINE | ID: mdl-33522489

RESUMEN

Tuberculosis (TB) is a leading cause of death globally. Understanding the population dynamics of TB's causative agent Mycobacterium tuberculosis complex (Mtbc) in-host is vital for understanding the efficacy of antibiotic treatment. We use longitudinally collected clinical Mtbc isolates that underwent Whole-Genome Sequencing from the sputa of 200 patients to investigate Mtbc diversity during the course of active TB disease after excluding 107 cases suspected of reinfection, mixed infection or contamination. Of the 178/200 patients with persistent clonal infection >2 months, 27 developed new resistance mutations between sampling with 20/27 occurring in patients with pre-existing resistance. Low abundance resistance variants at a purity of ≥19% in the first isolate predict fixation in the subsequent sample. We identify significant in-host variation in 27 genes, including antibiotic resistance genes, metabolic genes and genes known to modulate host innate immunity and confirm several to be under positive selection by assessing phylogenetic convergence across a genetically diverse sample of 20,352 isolates.


Asunto(s)
Inmunidad Innata/genética , Mycobacterium tuberculosis/genética , Tuberculosis/microbiología , Farmacorresistencia Bacteriana/genética , Genética de Población , Humanos , Mycobacterium tuberculosis/inmunología , Mycobacterium tuberculosis/metabolismo , Filogenia , Polimorfismo de Nucleótido Simple , Reinfección/microbiología , Esputo/microbiología , Insuficiencia del Tratamiento , Tuberculosis/tratamiento farmacológico , Secuenciación Completa del Genoma
8.
Nat Commun ; 12(1): 6099, 2021 10 20.
Artículo en Inglés | MEDLINE | ID: mdl-34671035

RESUMEN

Mycobacterium tuberculosis is a clonal pathogen proposed to have co-evolved with its human host for millennia, yet our understanding of its genomic diversity and biogeography remains incomplete. Here we use a combination of phylogenetics and dimensionality reduction to reevaluate the population structure of M. tuberculosis, providing an in-depth analysis of the ancient Indo-Oceanic Lineage 1 and the modern Central Asian Lineage 3, and expanding our understanding of Lineages 2 and 4. We assess sub-lineages using genomic sequences from 4939 pan-susceptible strains, and find 30 new genetically distinct clades that we validate in a dataset of 4645 independent isolates. We find a consistent geographically restricted or unrestricted pattern for 20 groups, including three groups of Lineage 1. The distribution of terminal branch lengths across the M. tuberculosis phylogeny supports the hypothesis of a higher transmissibility of Lineages 2 and 4, in comparison with Lineages 3 and 1, on a global scale. We define an expanded barcode of 95 single nucleotide substitutions that allows rapid identification of 69 M. tuberculosis sub-lineages and 26 additional internal groups. Our results paint a higher resolution picture of the M. tuberculosis phylogeny and biogeography.


Asunto(s)
Mycobacterium tuberculosis/clasificación , Filogenia , Tuberculosis/transmisión , Código de Barras del ADN Taxonómico , Evolución Molecular , Genoma Bacteriano/genética , Humanos , Mycobacterium tuberculosis/genética , Mycobacterium tuberculosis/aislamiento & purificación , Filogeografía , Polimorfismo de Nucleótido Simple , Programas Informáticos , Tuberculosis/microbiología
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA