RESUMO
Genetic differences between different Mycobacterium tuberculosis complex (MTBC) strains determine their ability to transmit within different host populations, their latency times, and their drug resistance profiles. Said differences usually emerge through de novo mutations and are maintained or discarded by the balance of evolutionary forces. Using a dataset of â¼5,000 strains representing global MTBC diversity, we determined the past and present selective forces that have shaped the current variability observed in the pathogen population. We identified regions that have evolved under changing types of selection since the time of the MTBC common ancestor. Our approach highlighted striking differences in the genome regions relevant for hostpathogen interaction and, in particular, suggested an adaptive role for the sensor protein of two-component systems. In addition, we applied our approach to successfully identify potential determinants of resistance to drugs administered as second-line tuberculosis treatments.
Assuntos
Mycobacterium tuberculosis , Evolução Molecular , Genoma Bacteriano , Mutação , Mycobacterium tuberculosis/genética , Filogenia , Seleção Genética , Análise de Sequência de DNARESUMO
Multidrug-resistant(MDR) tuberculosis in Southern Africa is of great concern, exacerbated by the spread of a clone harboring a mutation missed by Xpert Ultra. In Southern Mozambique, the presence of such mutation and rising cases of non-MDR isoniazid resistance highlights the need to ensure accurate detection of antimicrobial-resistance in the country.
Assuntos
Mycobacterium tuberculosis , Tuberculose Resistente a Múltiplos Medicamentos , Humanos , Rifampina/farmacologia , Rifampina/uso terapêutico , Mycobacterium tuberculosis/genética , Tuberculose Resistente a Múltiplos Medicamentos/diagnóstico , Tuberculose Resistente a Múltiplos Medicamentos/tratamento farmacológico , Farmacorresistência Bacteriana/genética , Moçambique , Mutação , Sensibilidade e EspecificidadeRESUMO
Culturing and genomic sequencing of Mycobacterium tuberculosis (MTB) from tuberculosis (TB) cases is the basis for many research and clinical applications. The alternative, culture-free sequencing from diagnostic samples, is promising but poses challenges to obtain and analyse the MTB genome. Paradoxically, culture is assumed to impose a diversity bottleneck, which, if true, would entail unexplored consequences. To unravel this paradox we generate high-quality genomes of sputum-culture pairs from two different settings after developing a workflow for sequencing from sputum and a tailored bioinformatics analysis. Careful downstream comparisons reveal sources of sputum-culture incongruences due to false positive/negative variation associated with factors like low input MTB DNA or variable genomic depths. After accounting for these factors, contrary to the bottleneck dogma, we identify a 97% variant agreement within sputum-culture pairs, with a high correlation also in the variants' frequency (0.98). The combined analysis from five different settings and more than 100 available samples shows that our results can be extrapolated to different TB epidemic scenarios, demonstrating that for the cases tested culture accurately mirrors clinical samples.
Assuntos
Variação Genética , Mycobacterium tuberculosis , Escarro , Mycobacterium tuberculosis/genética , Mycobacterium tuberculosis/isolamento & purificação , Escarro/microbiologia , Humanos , Tuberculose/microbiologia , Tuberculose/diagnóstico , Genoma Bacteriano , DNA Bacteriano/genética , Tuberculose Pulmonar/microbiologia , Tuberculose Pulmonar/diagnósticoRESUMO
In the management of infectious disease outbreaks, grouping cases into clusters and understanding their underlying epidemiology are fundamental tasks. In genomic epidemiology, clusters are typically identified either using pathogen sequences alone or with sequences in combination with epidemiological data such as location and time of collection. However, it may not be feasible to culture and sequence all pathogen isolates, so sequence data may not be available for all cases. This presents challenges for identifying clusters and understanding epidemiology, because these cases may be important for transmission. Demographic, clinical and location data are likely to be available for unsequenced cases, and comprise partial information about their clustering. Here, we use statistical modelling to assign unsequenced cases to clusters already identified by genomic methods, assuming that a more direct method of linking individuals, such as contact tracing, is not available. We build our model on pairwise similarity between cases to predict whether cases cluster together, in contrast to using individual case data to predict the cases' clusters. We then develop methods that allow us to determine whether a pair of unsequenced cases are likely to cluster together, to group them into their most probable clusters, to identify which are most likely to be members of a specific (known) cluster, and to estimate the true size of a known cluster given a set of unsequenced cases. We apply our method to tuberculosis data from Valencia, Spain. Among other applications, we find that clustering can be predicted successfully using spatial distance between cases and whether nationality is the same. We can identify the correct cluster for an unsequenced case, among 38 possible clusters, with an accuracy of approximately 35â%, higher than both direct multinomial regression (17â%) and random selection (< 5â%).
Assuntos
Surtos de Doenças , Genômica , Humanos , Análise por Conglomerados , Modelos LogísticosRESUMO
Outbreak strains of Mycobacterium tuberculosis are promising candidates as targets in the search for intrinsic determinants of transmissibility, as they are responsible for many cases with sustained transmission; however, the use of low-resolution typing methods and restricted geographical investigations represent flaws in assessing the success of long-lived outbreak strains. We can now address the nature of outbreak strains by combining large genomic data sets and phylodynamic approaches. We retrospectively sequenced the whole genome of representative samples assigned to an outbreak circulating in the Canary Islands (the GC strain) since 1993, which accounts for ~20% of local tuberculosis cases. We selected a panel of specific single nucleotide polymorphism (SNP) markers for an in-silico search for additional outbreak-related sequences within publicly available tuberculosis genomic data. Using this information, we inferred the origin, spread, and epidemiological parameters of the GC strain. Our approach allowed us to accurately trace the historical and more recent dispersion of the GC strain. We provide evidence of a highly successful nature within the Canarian archipelago but limited expansion abroad. Estimation of epidemiological parameters from genomic data disagree with a distinctive biology of the GC strain. With the increasing availability of genomic data allowing for the accurate inference of strain spread and critical epidemiological parameters, we can now revisit the link between Mycobacterium tuberculosis genotypes and transmission, as is routinely carried out for SARS-CoV-2 variants of concern. We demonstrate that social determinants rather than intrinsically higher bacterial transmissibility better explain the success of the GC strain. Importantly, our approach can be used to trace and characterize strains of interest worldwide. IMPORTANCE Infectious disease outbreaks represent a significant problem for public health. Tracing outbreak expansion and understanding the main factors behind emergence and persistence remain critical to effective disease control. Our study allows researchers and public health authorities to use Whole-Genome Sequencing-based methods to trace outbreaks, and shows how available epidemiological information helps to evaluate the factors underpinning outbreak persistence. Taking advantage of all the freely available information placed in public repositories, researchers can accurately establish the expansion of an outbreak beyond original boundaries, and determine the potential risk of a strain to inform health authorities which, in turn, can define target strategies to mitigate expansion and persistence. Finally, we show the need to evaluate strain transmissibility in different geographic contexts to unequivocally associate spread to local or pathogenic factors, an important lesson taken from genomic surveillance of SARS-CoV-2.
RESUMO
BACKGROUND: From the start of the SARS-CoV-2 outbreak, global sequencing efforts have generated an unprecedented amount of genomic data. Nonetheless, unequal sampling between high-income and low-income countries hinders the implementation of genomic surveillance systems at the global and local level. Filling the knowledge gaps of genomic information and understanding pandemic dynamics in low-income countries is essential for public health decision making and to prepare for future pandemics. In this context, we aimed to discover the timing and origin of SARS-CoV-2 variant introductions in Mozambique, taking advantage of pandemic-scale phylogenies. METHODS: We did a retrospective, observational study in southern Mozambique. Patients from Manhiça presenting with respiratory symptoms were recruited, and those enrolled in clinical trials were excluded. Data were included from three sources: (1) a prospective hospital-based surveillance study (MozCOVID), recruiting patients living in Manhiça, attending the Manhiça district hospital, and fulfilling the criteria of suspected COVID-19 case according to WHO; (2) symptomatic and asymptomatic individuals with SARS-CoV-2 infection recruited by the National Surveillance system; and (3) sequences from SARS-CoV-2-infected Mozambican cases deposited on the Global Initiative on Sharing Avian Influenza Data database. Positive samples amenable for sequencing were analysed. We used Ultrafast Sample placement on Existing tRees to understand the dynamics of beta and delta waves, using available genomic data. This tool can reconstruct a phylogeny with millions of sequences by efficient sample placement in a tree. We reconstructed a phylogeny (~7·6 million sequences) adding new and publicly available beta and delta sequences. FINDINGS: A total of 5793 patients were recruited between Nov 1, 2020, and Aug 31, 2021. During this time, 133â328 COVID-19 cases were reported in Mozambique. 280 good quality new SARS-CoV-2 sequences were obtained after the inclusion criteria were applied and an additional 652 beta (B.1.351) and delta (B.1.617.2) public sequences were included from Mozambique. We evaluated 373 beta and 559 delta sequences. We identified 187 beta introductions (including 295 sequences), divided in 42 transmission groups and 145 unique introductions, mostly from South Africa, between August, 2020 and July, 2021. For delta, we identified 220 introductions (including 494 sequences), with 49 transmission groups and 171 unique introductions, mostly from the UK, India, and South Africa, between April and November, 2021. INTERPRETATION: The timing and origin of introductions suggests that movement restrictions effectively avoided introductions from non-African countries, but not from surrounding countries. Our results raise questions about the imbalance between the consequences of restrictions and health benefits. This new understanding of pandemic dynamics in Mozambique can be used to inform public health interventions to control the spread of new variants. FUNDING: European and Developing Countries Clinical Trials, European Research Council, Bill & Melinda Gates Foundation, and Agència de Gestió d'Ajuts Universitaris i de Recerca.
Assuntos
COVID-19 , SARS-CoV-2 , Humanos , SARS-CoV-2/genética , COVID-19/epidemiologia , COVID-19/prevenção & controle , Pandemias/prevenção & controle , Filogenia , Moçambique/epidemiologia , Estudos Retrospectivos , Estudos ProspectivosRESUMO
Genomic studies of the Mycobacterium tuberculosis complex (MTBC) might shed light on the dynamics of its transmission, especially in high-burden settings, where recent outbreaks are embedded in the complex natural history of the disease. To this end, we conducted a 1 year prospective surveillance-based study in Mozambique. We applied whole-genome sequencing (WGS) to 295 positive cultures. We fully characterized MTBC isolates by phylogenetics and dating evaluation, and carried out a molecular epidemiology analysis to investigate further associations with pre-defined transmission risk factors. The majority of strains (49.5%, 136/275) belonged to lineage (L) 4; 57.8â% of them (159/275) were in genomic transmission clusters (cut-off 5 SNPs), and a strikingly high proportion (45.5%) shared an identical genotype (0 SNP pairwise distance). We found two 'likely endemic' clades, comprising 67 strains, belonging to L1.2, which dated back to the late 19th century and were associated with recent spread among people living with human immunodeficiency virus (PLHIV). We describe for the first time the population structure of MTBC in our region, a high tuberculosis (TB)/HIV burden area. Clustering analysis revealed an unforeseen pattern of spread and high rates of progression to active TB, suggesting weaknesses in TB control activities. The long-term presence of local strains in Mozambique, which were responsible for large transmission among HIV/TB-coinfected patients, calls into question the role of HIV in TB transmission.
Assuntos
Infecções por HIV , Mycobacterium tuberculosis , Tuberculose , Infecções por HIV/epidemiologia , Humanos , Moçambique/epidemiologia , Mycobacterium tuberculosis/genética , Estudos Prospectivos , Tuberculose/epidemiologiaRESUMO
Transmission is a driver of tuberculosis (TB) epidemics in high-burden regions, with assumed negligible impact in low-burden areas. However, we still lack a full characterization of transmission dynamics in settings with similar and different burdens. Genomic epidemiology can greatly help to quantify transmission, but the lack of whole genome sequencing population-based studies has hampered its application. Here, we generate a population-based dataset from Valencia region and compare it with available datasets from different TB-burden settings to reveal transmission dynamics heterogeneity and its public health implications. We sequenced the whole genome of 785 Mycobacterium tuberculosis strains and linked genomes to patient epidemiological data. We use a pairwise distance clustering approach and phylodynamic methods to characterize transmission events over the last 150 years, in different TB-burden regions. Our results underscore significant differences in transmission between low-burden TB settings, i.e., clustering in Valencia region is higher (47.4%) than in Oxfordshire (27%), and similar to a high-burden area as Malawi (49.8%). By modeling times of the transmission links, we observed that settings with high transmission rate are associated with decades of uninterrupted transmission, irrespective of burden. Together, our results reveal that burden and transmission are not necessarily linked due to the role of past epidemics in the ongoing TB incidence, and highlight the need for in-depth characterization of transmission dynamics and specifically tailored TB control strategies.
Assuntos
Epidemias , Mycobacterium tuberculosis , Tuberculose , Humanos , Mycobacterium tuberculosis/genética , Dinâmica Populacional , Tuberculose/epidemiologia , Sequenciamento Completo do GenomaRESUMO
It is expected that the next generation of biotech crops displaying enhanced quality traits with benefits to both farmers and consumers will have a better acceptance than first generation biotech crops and will improve public perception of genetic engineering. This will only be true if they are proven to be as safe as traditionally bred crops. In contrast with the first generation of biotech crops where only a single trait is modified, the next generation of biotech crops will add a new level of complexity inherent to the mechanisms underlying their output traits. In this study, a comprehensive evaluation of the comparative safety approach on a quality-improved biotech crop with metabolic modifications is presented. Three genetically engineered potato lines with silenced polyphenol oxidase (Ppo) transcripts and reduced tuber browning were characterized at both physiological and molecular levels and showed to be equivalent to wild-type (WT) plants when yield-associated traits and photosynthesis were evaluated. Analysis of the primary metabolism revealed several unintended metabolic modifications in the engineered tubers, providing evidence for potential compositional inequivalence between transgenic lines and WT controls. The silencing construct sequence was in silico analysed for potential allergenic cross-reactivity, and no similarities to known allergenic proteins were identified. Moreover, in vivo intake safety evaluation showed no adverse effects in physiological parameters. Taken together, these results provide the first evidence supporting that the safety of next generation biotech crops can be properly assessed following the current evaluation criterion, even if the transgenic and WT crops are not substantially equivalent.
Assuntos
Inocuidade dos Alimentos , Engenharia Genética , Solanum tuberosum/genética , Alérgenos/análise , Animais , Catecol Oxidase/genética , Biologia Computacional , Feminino , Inativação Gênica , Camundongos , Camundongos Endogâmicos BALB C , FotossínteseRESUMO
We have detected two mutations in the spike protein of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) at amino acid positions 1163 and 1167 that appeared independently in multiple transmission clusters and different genetic backgrounds. Furthermore, both mutations appeared together in a cluster of 1,627 sequences belonging to clade 20E. This cluster is characterized by 12 additional single nucleotide polymorphisms but no deletions. The available structural information on the S protein in the pre- and postfusion conformations predicts that both mutations confer rigidity, which could potentially decrease viral fitness. Accordingly, we observed reduced infectivity of this spike genotype relative to the ancestral 20E sequence in vitro, and the levels of viral RNA in nasopharyngeal swabs were not significantly higher. Furthermore, the mutations did not impact thermal stability or antibody neutralization by sera from vaccinated individuals but moderately reduce neutralization by convalescent-phase sera from the early stages of the pandemic. Despite multiple successful appearances of the two spike mutations during the first year of SARS-CoV-2 evolution, the genotype with both mutations was displaced upon the expansion of the 20I (Alpha) variant. The midterm fate of the genotype investigated was consistent with the lack of advantage observed in the clinical and experimental data. IMPORTANCE We observed repeated, independent emergence of mutations in the SARS-CoV-2 spike involving amino acids 1163 and 1167, within the HR2 functional motif. Conclusions derived from evolutionary and genomic diversity analysis suggest that the co-occurrence of both mutations might pose an advantage for the virus and therefore a threat to effective control of the epidemic. However, biological characterization, including in vitro experiments and analysis of clinical data, indicated no clear benefit in terms of stability or infectivity. In agreement with this, continuous epidemiological surveillance conducted months after the first observations revealed that both mutations did not successfully outcompete other variants and stopped circulating 9 months after their initial detection. Additionally, we evaluated the potential of both mutations to escape neutralizing antibodies, finding that the presence of these two mutations on their own is not likely to confer antibody escape. Our results provide an example of how newly emerged spike mutations can be assessed to better understand the risk posed by new variants and indicate that some spike mutations confer no clear advantage to the virus despite independently emerging multiple times and are eventually displaced by fitter variants.
Assuntos
Evolução Molecular , Mutação , Fenótipo , SARS-CoV-2/genética , Glicoproteína da Espícula de Coronavírus/genética , Anticorpos Neutralizantes/imunologia , COVID-19/virologia , Europa (Continente) , Variação Genética , Genoma Viral , Humanos , Testes de Neutralização , SARS-CoV-2/imunologiaRESUMO
The coronavirus disease 2019 (COVID-19) pandemic has affected the world radically since 2020. Spain was one of the European countries with the highest incidence during the first wave. As a part of a consortium to monitor and study the evolution of the epidemic, we sequenced 2,170 samples, diagnosed mostly before lockdown measures. Here, we identified at least 500 introductions from multiple international sources and documented the early rise of two dominant Spanish epidemic clades (SECs), probably amplified by superspreading events. Both SECs were related closely to the initial Asian variants of SARS-CoV-2 and spread widely across Spain. We inferred a substantial reduction in the effective reproductive number of both SECs due to public-health interventions (Re < 1), also reflected in the replacement of SECs by a new variant over the summer of 2020. In summary, we reveal a notable difference in the initial genetic makeup of SARS-CoV-2 in Spain compared with other European countries and show evidence to support the effectiveness of lockdown measures in controlling virus spread, even for the most successful genetic variants.
Assuntos
COVID-19/epidemiologia , COVID-19/transmissão , Controle de Doenças Transmissíveis/organização & administração , Modelos Estatísticos , SARS-CoV-2/genética , COVID-19/virologia , Controle de Doenças Transmissíveis/métodos , Humanos , Incidência , Filogenia , Distanciamento Físico , Quarentena/métodos , Quarentena/organização & administração , SARS-CoV-2/classificação , SARS-CoV-2/patogenicidade , Índice de Gravidade de Doença , Espanha/epidemiologiaRESUMO
Tuberculosis (TB) surveillance is scarce in most African countries, even though it is the continent with the greatest disease incidence according to the World Health Organization. Liberia is within the 30 countries with the highest TB burden, probably as a consequence of the long civil war and the recent Ebola outbreak, both crippling the health system and depreciating the TB prevention and control programmes. Due to difficulties working in the country, there is a lack of resistance surveys and bacillus characterization. Here, we use genome sequencing of Mycobacteriumtuberculosis clinical isolates to fill this gap. Our results highlight that the bacillus population structure is dominated by lineage 4 strains that harbour an outstanding genetic diversity, higher than in the rest of Africa as a whole. Coalescent analyses demonstrate that strains currently circulating in Liberia were introduced several times beginning in the early year 600 CE until very recently coinciding with migratory movements associated with the civil war and Ebola epidemics. A higher multidrug-resistant (MDR)-TB frequency (23.5 %) than current estimates was obtained together with non-catalogued drug-resistance mutations. Additionally, 39 % of strains were in genomic clusters revealing that ongoing transmission is a major contribution to the TB burden in the country. Our report emphasizes the importance of TB surveillance and control in African countries where bacillus diversity, MDR-TB prevalence and transmission are coalescing to jeopardize TB control programmes.
Assuntos
Mycobacterium tuberculosis/genética , Tuberculose Resistente a Múltiplos Medicamentos/epidemiologia , Farmacorresistência Bacteriana Múltipla/genética , Variação Genética , Humanos , Libéria/epidemiologia , Epidemiologia Molecular , Mutação , Tuberculose Resistente a Múltiplos Medicamentos/microbiologia , Tuberculose Resistente a Múltiplos Medicamentos/transmissãoRESUMO
Mycobacterium tuberculosis (Mtb) infection results in a spectrum of clinical and histopathologic manifestations. It has been proposed that the environmental and immune pressures associated with different contexts of infection have different consequences for the associated bacterial populations, affecting drug susceptibility and the emergence of resistance. However, there is little concrete evidence for this model. We prospectively collected sputum samples from 18 newly diagnosed and treatment-naïve patients with tuberculosis and sequenced 795 colony-derived Mtb isolates. Mutant accumulation rates varied considerably between different bacilli isolated from the same individual, and where high rates of mutation were observed, the mutational spectrum was consistent with reactive oxygen species-induced mutagenesis. Elevated bacterial mutation rates were identified in isolates from HIV-negative but not HIV-positive individuals, suggesting that they were immune-driven. These results support the model that mutagenesis of Mtb in vivo is modulated by the host environment, which could drive the emergence of variants associated with drug resistance in a host-dependent manner.
Assuntos
COVID-19/virologia , SARS-CoV-2/genética , Adolescente , Adulto , Idoso , Idoso de 80 Anos ou mais , Criança , Pré-Escolar , Feminino , Genômica , Humanos , Lactente , Masculino , Pessoa de Meia-Idade , Espanha/epidemiologia , Adulto JovemRESUMO
BACKGROUND: Senecio is the largest genus in the Asteraceae family growing in all environments around the world. It displays taxonomic and systematical difficulties. Cytogenetic knowledge of this genus is ancient, scarce and mainly restricted to chromosome number records. RESULTS: In this study we analyzed chromosome number, meiotic configuration, bivalent morphology, meiotic behavior and pollen grain stainability on 100 accessions of 27 different polyploid Senecio L. sect Senecio entities. Median, standard deviation and mode were calculated for number and position of chiasmata and meiotic recombination was statistically evaluated. Although high frequency of multivalents and associated meiotic irregularities are expected in high polyploids, bivalents predominance and, consequently, regular meiosis were observed, with normal sporogenesis and high pollen grain stainability. CONCLUSION: Depletion in the total chiasmata was significant only in some species but the terminal position was preferential in all the entities analyzed, indicating significant reduction in recombination. The regular meiosis observed suggest that intra and intergenomic reorganization process occur quickly and efficiently in this genus. Mechanisms of diploidization, common to all polyploids, are reinforced by the strong reduction in crossing-over rushing polyploids stabilization.
RESUMO
In the biological domain, clustering is based on the assumption that genes or metabolites involved in a common biological process are coexpressed/coaccumulated under the control of the same regulatory network. Thus, a detailed inspection of the grouped patterns to verify their memberships to well-known metabolic pathways could be very useful for the evaluation of clusters from a biological perspective. The aim of this work is to propose a novel approach for the comparison of clustering methods over metabolic data sets, including prior biological knowledge about the relation among elements that constitute the clusters. A way of measuring the biological significance of clustering solutions is proposed. This is addressed from the perspective of the usefulness of the clusters to identify those patterns that change in coordination and belong to common pathways of metabolic regulation. The measure summarizes in a compact way the objective analysis of clustering methods, which respects coherence and clusters distribution. It also evaluates the biological internal connections of such clusters considering common pathways. The proposed measure was tested in two biological databases using three clustering methods.