RESUMO
Novel transmission routes can allow infectious diseases to spread, often with devastating consequences. Ectoparasitic varroa mites vector a diversity of RNA viruses, having switched hosts from the eastern to western honey bees (Apis cerana to Apis mellifera). They provide an opportunity to explore how novel transmission routes shape disease epidemiology. As the principal driver of the spread of deformed wing viruses (mainly DWV-A and DWV-B), varroa infestation has also driven global honey bee health declines. The more virulent DWV-B strain has been replacing the original DWV-A strain in many regions over the past two decades. Yet, how these viruses originated and spread remains poorly understood. Here, we use a phylogeographic analysis based on whole-genome data to reconstruct the origins and demography of DWV spread. We found that, rather than reemerging in western honey bees after varroa switched hosts, as suggested by previous work, DWV-A most likely originated in East Asia and spread in the mid-20th century. It also showed a massive population size expansion following the varroa host switch. By contrast, DWV-B was most likely acquired more recently from a source outside East Asia and appears absent from the original varroa host. These results highlight the dynamic nature of viral adaptation, whereby a vector's host switch can give rise to competing and increasingly virulent disease pandemics. The evolutionary novelty and rapid global spread of these host-virus interactions, together with observed spillover into other species, illustrate how increasing globalization poses urgent threats to biodiversity and food security.
Assuntos
Vírus de RNA , Varroidae , Abelhas , Animais , Vírus de RNA/genética , Evolução Biológica , Interações entre Hospedeiro e Microrganismos , FilogeografiaRESUMO
Phylogenetic methods are widely used to reconstruct the evolutionary relationships among species and individuals. However, recombination can obscure ancestral relationships as individuals may inherit different regions of their genome from different ancestors. It is, therefore, often necessary to detect recombination events, locate recombination breakpoints, and select recombination-free alignments prior to reconstructing phylogenetic trees. While many earlier studies have examined the power of different methods to detect recombination, very few have examined the ability of these methods to accurately locate recombination breakpoints. In this study, we simulated genome sequences based on ancestral recombination graphs and explored the accuracy of three popular recombination detection methods: MaxChi, 3SEQ, and Genetic Algorithm Recombination Detection. The accuracy of inferred breakpoint locations was evaluated along with the key factors contributing to variation in accuracy across datasets. While many different genomic features contribute to the variation in performance across methods, the number of informative sites consistent with the pattern of inheritance between parent and recombinant child sequences always has the greatest contribution to accuracy. While partitioning sequence alignments based on identified recombination breakpoints can greatly decrease phylogenetic error, the quality of phylogenetic reconstructions depends very little on how breakpoints are chosen to partition the alignment. Our work sheds light on how different features of recombinant genomes affect the performance of recombination detection methods and suggests best practices for reconstructing phylogenies based on recombination-free alignments.
Assuntos
Algoritmos , Filogenia , Recombinação Genética , Pontos de Quebra do Cromossomo , Alinhamento de Sequência/métodos , Modelos GenéticosRESUMO
In the presence of recombination individuals may inherit different regions of their genome from different ancestors, resulting in a mosaic of phylogenetic histories across their genome. Ancestral recombination graphs (ARGs) can capture how phylogenetic relationships vary across the genome due to recombination, but reconstructing ARGs from genomic sequence data is notoriously difficult. Here, we present a method for reconciling discordant phylogenetic trees and reconstructing ARGs using maximum agreement forests (MAFs). Given two discordant trees, a MAF identifies the smallest possible set of topologically concordant subtrees present in both trees. We show how discordant trees can be reconciled through their MAF in a way that retains discordances strongly supported by sequence data while eliminating conflicts likely attributable to phylogenetic noise. We further show how MAFs and our reconciliation approach can be combined to select a path of local trees across the genome that maximizes the likelihood of the genomic sequence data, minimizes discordance between neighboring local trees, and identifies the recombination events necessary to explain remaining discordances to obtain a fully connected ARG. While heuristic, our ARG reconstruction approach is often as accurate as more exact methods while being much more computationally efficient. Moreover, important demographic parameters such as recombination rates can be accurately estimated from reconstructed ARGs. Finally, we apply our approach to plant infecting RNA viruses in the genus Potyvirus to demonstrate how true recombination events can be disentangled from phylogenetic noise using our ARG reconstruction methods.
Assuntos
Genoma , Recombinação Genética , Humanos , Filogenia , Modelos Genéticos , AlgoritmosRESUMO
Movement of individuals between populations or demes is often restricted, especially between geographically isolated populations. The structured coalescent provides an elegant theoretical framework for describing how movement between populations shapes the genealogical history of sampled individuals and thereby structures genetic variation within and between populations. However, in the presence of recombination an individual may inherit different regions of their genome from different parents, resulting in a mosaic of genealogical histories across the genome, which can be represented by an Ancestral Recombination Graph (ARG). In this case, different genomic regions may have different ancestral histories and so different histories of movement between populations. Recombination therefore poses an additional challenge to phylogeographic methods that aim to reconstruct the movement of individuals from genealogies, although also a potential benefit in that different loci may contain additional information about movement. Here, we introduce the Structured Coalescent with Ancestral Recombination (SCAR) model, which builds on recent approximations to the structured coalescent by incorporating recombination into the ancestry of sampled individuals. The SCAR model allows us to infer how the migration history of sampled individuals varies across the genome from ARGs, and improves estimation of key population genetic parameters such as population sizes, recombination rates and migration rates. Using the SCAR model, we explore the potential and limitations of phylogeographic inference using full ARGs. We then apply the SCAR to lineages of the recombining fungus Aspergillus flavus sampled across the United States to explore patterns of recombination and migration across the genome.
Assuntos
Genoma , Modelos Genéticos , Genética Populacional , Humanos , Filogeografia , Densidade Demográfica , Recombinação Genética/genéticaRESUMO
High-throughput sequencing (HTS) and sequence mining tools revolutionized virus detection and discovery in recent years, and implementing them with classical plant virology techniques results in a powerful approach to characterize viruses. An example of a virus discovered through HTS is Solanum nigrum ilarvirus 1 (SnIV1) (Bromoviridae), which was recently reported in various solanaceous plants from France, Slovenia, Greece, and South Africa. It was likewise detected in grapevines (Vitaceae) and several Fabaceae and Rosaceae plant species. Such a diverse set of source organisms is atypical for ilarviruses, thus warranting further investigation. In this study, modern and classical virological tools were combined to accelerate the characterization of SnIV1. Through HTS-based virome surveys, mining of sequence read archive datasets, and a literature search, SnIV1 was further identified from diverse plant and non-plant sources globally. SnIV1 isolates showed relatively low variability compared with other phylogenetically related ilarviruses. Phylogenetic analyses showed a distinct basal clade of isolates from Europe, whereas the rest formed clades of mixed geographic origin. Furthermore, systemic infection of SnIV1 in Solanum villosum and its mechanical and graft transmissibility to solanaceous species were demonstrated. Near-identical SnIV1 genomes from the inoculum (S. villosum) and inoculated Nicotiana benthamiana were sequenced, thus partially fulfilling Koch's postulates. SnIV1 was shown to be seed-transmitted and potentially pollen-borne, has spherical virions, and possibly induces histopathological changes in infected N. benthamiana leaf tissues. Overall, this study provides information to better understand the diversity, global presence, and pathobiology of SnIV1; however, its possible emergence as a destructive pathogen remains uncertain. [Formula: see text] Copyright © 2023 The Author(s). This is an open access article distributed under the CC BY 4.0 International license.
Assuntos
Ilarvirus , Solanum , Filogenia , Doenças das Plantas , NicotianaRESUMO
Phylogeography combines geographic information with phylogenetic and population genomic approaches to infer the evolutionary history of a species or population in a geographic context. This approach has been instrumental in understanding the emergence, spread, and evolution of a range of plant pathogens. In particular, phylogeography can address questions about where a pathogen originated, whether it is native or introduced, and when and how often introductions occurred. We review the theory, methods, and approaches underpinning phylogeographic inference and highlight applications providing novel insights into the emergence and spread of select pathogens. We hope that this review will be useful in assessing the power, pitfalls, and opportunities presented by various phylogeographic approaches.
Assuntos
Modelos Genéticos , Doenças das Plantas , Filogenia , FilogeografiaRESUMO
Modern phylodynamic methods interpret an inferred phylogenetic tree as a partial transmission chain providing information about the dynamic process of transmission and removal (where removal may be due to recovery, death, or behavior change). Birth-death and coalescent processes have been introduced to model the stochastic dynamics of epidemic spread under common epidemiological models such as the SIS and SIR models and are successfully used to infer phylogenetic trees together with transmission (birth) and removal (death) rates. These methods either integrate analytically over past incidence and prevalence to infer rate parameters, and thus cannot explicitly infer past incidence or prevalence, or allow such inference only in the coalescent limit of large population size. Here, we introduce a particle filtering framework to explicitly infer prevalence and incidence trajectories along with phylogenies and epidemiological model parameters from genomic sequences and case count data in a manner consistent with the underlying birth-death model. After demonstrating the accuracy of this method on simulated data, we use it to assess the prevalence through time of the early 2014 Ebola outbreak in Sierra Leone.
Assuntos
Genômica/métodos , Incidência , Epidemiologia Molecular/métodos , Prevalência , Teorema de Bayes , Doença pelo Vírus Ebola/epidemiologia , Humanos , Serra Leoa/epidemiologiaRESUMO
Phylogenetics and phylodynamics are central topics in modern evolutionary biology. Phylogenetic methods reconstruct the evolutionary relationships among organisms, whereas phylodynamic approaches reveal the underlying diversification processes that lead to the observed relationships. These two fields have many practical applications in disciplines as diverse as epidemiology, developmental biology, palaeontology, ecology, and linguistics. The combination of increasingly large genetic data sets and increases in computing power is facilitating the development of more sophisticated phylogenetic and phylodynamic methods. Big data sets allow us to answer complex questions. However, since the required analyses are highly specific to the particular data set and question, a black-box method is not sufficient anymore. Instead, biologists are required to be actively involved with modeling decisions during data analysis. The modular design of the Bayesian phylogenetic software package BEAST 2 enables, and in fact enforces, this involvement. At the same time, the modular design enables computational biology groups to develop new methods at a rapid rate. A thorough understanding of the models and algorithms used by inference software is a critical prerequisite for successful hypothesis formulation and assessment. In particular, there is a need for more readily available resources aimed at helping interested scientists equip themselves with the skills to confidently use cutting-edge phylogenetic analysis software. These resources will also benefit researchers who do not have access to similar courses or training at their home institutions. Here, we introduce the "Taming the Beast" (https://taming-the-beast.github.io/) resource, which was developed as part of a workshop series bearing the same name, to facilitate the usage of the Bayesian phylogenetic software package BEAST 2.
Assuntos
Biologia Computacional/educação , Biologia Computacional/métodos , Filogenia , Software , Materiais de Ensino , AlgoritmosRESUMO
Phylogeographic methods can help reveal the movement of genes between populations of organisms. This has been widely done to quantify pathogen movement between different host populations, the migration history of humans, and the geographic spread of languages or gene flow between species using the location or state of samples alongside sequence data. Phylogenies therefore offer insights into migration processes not available from classic epidemiological or occurrence data alone. Phylogeographic methods have however several known shortcomings. In particular, one of the most widely used methods treats migration the same as mutation, and therefore does not incorporate information about population demography. This may lead to severe biases in estimated migration rates for data sets where sampling is biased across populations. The structured coalescent on the other hand allows us to coherently model the migration and coalescent process, but current implementations struggle with complex data sets due to the need to infer ancestral migration histories. Thus, approximations to the structured coalescent, which integrate over all ancestral migration histories, have been developed. However, the validity and robustness of these approximations remain unclear. We present an exact numerical solution to the structured coalescent that does not require the inference of migration histories. Although this solution is computationally unfeasible for large data sets, it clarifies the assumptions of previously developed approximate methods and allows us to provide an improved approximation to the structured coalescent. We have implemented these methods in BEAST2, and we show how these methods compare under different scenarios.
Assuntos
Genética Populacional/métodos , Filogeografia/métodos , Simulação por Computador , Demografia/métodos , Fluxo Gênico/genética , Humanos , Modelos Genéticos , Filogenia , Filogeografia/estatística & dados numéricosRESUMO
Viral phylogenetic methods contribute to understanding how HIV spreads in populations, and thereby help guide the design of prevention interventions. So far, most analyses have been applied to well-sampled concentrated HIV-1 epidemics in wealthy countries. To direct the use of phylogenetic tools to where the impact of HIV-1 is greatest, the Phylogenetics And Networks for Generalized HIV Epidemics in Africa (PANGEA-HIV) consortium generates full-genome viral sequences from across sub-Saharan Africa. Analyzing these data presents new challenges, since epidemics are principally driven by heterosexual transmission and a smaller fraction of cases is sampled. Here, we show that viral phylogenetic tools can be adapted and used to estimate epidemiological quantities of central importance to HIV-1 prevention in sub-Saharan Africa. We used a community-wide methods comparison exercise on simulated data, where participants were blinded to the true dynamics they were inferring. Two distinct simulations captured generalized HIV-1 epidemics, before and after a large community-level intervention that reduced infection levels. Five research groups participated. Structured coalescent modeling approaches were most successful: phylogenetic estimates of HIV-1 incidence, incidence reductions, and the proportion of transmissions from individuals in their first 3 months of infection correlated with the true values (Pearson correlation > 90%), with small bias. However, on some simulations, true values were markedly outside reported confidence or credibility intervals. The blinded comparison revealed current limits and strengths in using HIV phylogenetics in challenging settings, provided benchmarks for future methods' development, and supports using the latest generation of phylogenetic tools to advance HIV surveillance and prevention.
Assuntos
Infecções por HIV/epidemiologia , Infecções por HIV/virologia , HIV-1/genética , África Subsaariana/epidemiologia , Simulação por Computador , Epidemias , Feminino , Infecções por HIV/prevenção & controle , Infecções por HIV/transmissão , Humanos , Incidência , Masculino , FilogeniaRESUMO
Phylodynamic models are widely used in infectious disease epidemiology to infer the dynamics and structure of pathogen populations. However, these models generally assume that individual hosts contact one another at random, ignoring the fact that many pathogens spread through highly structured contact networks. We present a new framework for phylodynamics on local contact networks based on pairwise epidemiological models that track the status of pairs of nodes in the network rather than just individuals. Shifting our focus from individuals to pairs leads naturally to coalescent models that describe how lineages move through networks and the rate at which lineages coalesce. These pairwise coalescent models not only consider how network structure directly shapes pathogen phylogenies, but also how the relationship between phylogenies and contact networks changes depending on epidemic dynamics and the fraction of infected hosts sampled. By considering pathogen phylogenies in a probabilistic framework, these coalescent models can also be used to estimate the statistical properties of contact networks directly from phylogenies using likelihood-based inference. We use this framework to explore how much information phylogenies retain about the underlying structure of contact networks and to infer the structure of a sexual contact network underlying a large HIV-1 sub-epidemic in Switzerland.
Assuntos
Surtos de Doenças/estatística & dados numéricos , Infecções por HIV/epidemiologia , Infecções por HIV/virologia , HIV-1/genética , Modelos Estatísticos , Parceiros Sexuais , Evolução Biológica , Simulação por Computador , Variação Genética/genética , Humanos , Filogenia , Prevalência , Suíça/epidemiologiaRESUMO
Coalescent methods are widely used to infer the demographic history of populations from gene genealogies. These approaches-often referred to as phylodynamic methods-have proven especially useful for reconstructing the dynamics of rapidly evolving viral pathogens. Yet, population dynamics inferred from viral genealogies often differ widely from those observed from other sources of epidemiological data, such as hospitalization records. We demonstrate how a modeling framework that allows for the direct fitting of mechanistic epidemiological models to genealogies can be used to test different hypotheses about what ecological factors cause phylodynamic inferences to differ from observed dynamics. We use this framework to test different hypotheses about why dengue serotype 1 (DENV-1) population dynamics in southern Vietnam inferred using existing phylodynamic methods differ from hospitalization data. Specifically, we consider how factors such as seasonality, vector dynamics, and spatial structure can affect inferences drawn from genealogies. The coalescent models we derive to take into account vector dynamics and spatial structure reveal that these ecological complexities can substantially affect coalescent rates among lineages. We show that incorporating these additional ecological complexities into coalescent models can also greatly improve estimates of historical population dynamics and lead to new insights into the factors shaping viral genealogies.
Assuntos
Vírus da Dengue/genética , Dengue/epidemiologia , Filogenia , Dinâmica Populacional , Vírus da Dengue/classificação , Humanos , Filogeografia , Vietnã/epidemiologiaRESUMO
Coalescent theory is routinely used to estimate past population dynamics and demographic parameters from genealogies. While early work in coalescent theory only considered simple demographic models, advances in theory have allowed for increasingly complex demographic scenarios to be considered. The success of this approach has lead to coalescent-based inference methods being applied to populations with rapidly changing population dynamics, including pathogens like RNA viruses. However, fitting epidemiological models to genealogies via coalescent models remains a challenging task, because pathogen populations often exhibit complex, nonlinear dynamics and are structured by multiple factors. Moreover, it often becomes necessary to consider stochastic variation in population dynamics when fitting such complex models to real data. Using recently developed structured coalescent models that accommodate complex population dynamics and population structure, we develop a statistical framework for fitting stochastic epidemiological models to genealogies. By combining particle filtering methods with Bayesian Markov chain Monte Carlo methods, we are able to fit a wide class of stochastic, nonlinear epidemiological models with different forms of population structure to genealogies. We demonstrate our framework using two structured epidemiological models: a model with disease progression between multiple stages of infection and a two-population model reflecting spatial structure. We apply the multi-stage model to HIV genealogies and show that the proposed method can be used to estimate the stage-specific transmission rates and prevalence of HIV. Finally, using the two-population model we explore how much information about population structure is contained in genealogies and what sample sizes are necessary to reliably infer parameters like migration rates.
Assuntos
Estudos Epidemiológicos , Modelos Estatísticos , Filogenia , Algoritmos , Teorema de Bayes , Funções Verossimilhança , Cadeias de Markov , Método de Monte Carlo , Processos EstocásticosRESUMO
Phylogenetic analyses of genes with demonstrated involvement in evolutionary transitions can be an important means of resolving conflicting hypotheses about evolutionary history or process. In sunflower, two genes have previously been shown to have experienced selective sweeps during its early domestication. In the present study, we identified a third candidate early domestication gene and conducted haplotype analyses of all three genes to address a recent, controversial hypothesis about the origin of cultivated sunflower. Although the scientific consensus had long been that sunflower was domesticated once in eastern North America, the discovery of pre-Columbian sunflower remains at archaeological sites in Mexico led to the proposal of a second domestication center in southern Mexico. Previous molecular studies with neutral markers were consistent with the former hypothesis. However, only two indigenous Mexican cultivars were included in these studies, and their provenance and genetic purity have been questioned. Therefore, we sequenced regions of the three candidate domestication genes containing SNPs diagnostic for domestication from large, newly collected samples of Mexican sunflower landraces and Mexican wild populations from a broad geographic range. The new germplasm also was genotyped for 12 microsatellite loci. Our evidence from multiple evolutionarily important loci and from neutral markers supports a single domestication event for extant cultivated sunflower in eastern North America.
Assuntos
Agricultura , Alelos , Helianthus/genética , Agricultura/história , Frequência do Gene/genética , Genes de Plantas/genética , Marcadores Genéticos , Variação Genética , Geografia , Haplótipos/genética , Helianthus/enzimologia , História Antiga , México , Oxigenases de Função Mista/genética , Dados de Sequência Molecular , Nucleotídeos/genética , Filogenia , Seleção Genética , Homologia de Sequência de AminoácidosRESUMO
Phylodynamics - the field aiming to quantitatively integrate the ecological and evolutionary dynamics of rapidly evolving populations like those of RNA viruses - increasingly relies upon coalescent approaches to infer past population dynamics from reconstructed genealogies. As sequence data have become more abundant, these approaches are beginning to be used on populations undergoing rapid and rather complex dynamics. In such cases, the simple demographic models that current phylodynamic methods employ can be limiting. First, these models are not ideal for yielding biological insight into the processes that drive the dynamics of the populations of interest. Second, these models differ in form from mechanistic and often stochastic population dynamic models that are currently widely used when fitting models to time series data. As such, their use does not allow for both genealogical data and time series data to be considered in tandem when conducting inference. Here, we present a flexible statistical framework for phylodynamic inference that goes beyond these current limitations. The framework we present employs a recently developed method known as particle MCMC to fit stochastic, nonlinear mechanistic models for complex population dynamics to gene genealogies and time series data in a Bayesian framework. We demonstrate our approach using a nonlinear Susceptible-Infected-Recovered (SIR) model for the transmission dynamics of an infectious disease and show through simulations that it provides accurate estimates of past disease dynamics and key epidemiological parameters from genealogies with or without accompanying time series data.
Assuntos
Algoritmos , Transmissão de Doença Infecciosa , Métodos Epidemiológicos , Modelos Biológicos , Dinâmica não Linear , Teorema de Bayes , Biologia Computacional/métodos , Epidemias , Método de Monte Carlo , Filogenia , Dinâmica Populacional , Prevalência , Processos EstocásticosRESUMO
Population genetic diversity of Plasmodium falciparum antigenic loci is high despite large bottlenecks in population size during the parasite life cycle. The prevalence of genetically distinct haplotypes at these loci, while well characterized in humans, has not been thoroughly compared between human and mosquito hosts. We assessed parasite haplotype prevalence, diversity, and evenness using human and mosquito P. falciparum infections collected from the same households during a 14-month longitudinal cohort study using amplicon deep sequencing of two antigenic gene fragments (ama1 and csp). To a prior set of infected humans (n = 1,175/2,813; 86.2% sequencing success) and mosquito abdomens (n = 199/1,448; 95.5% sequencing success), we added sequences from infected mosquito heads (n = 134/1,448; 98.5% sequencing success). The overall and sample-level parasite populations were more diverse in mosquitoes than in humans. Additionally, haplotype prevalences were more even in the P. falciparum human population than in the mosquito population, consistent with balancing selection occurring at these loci in humans. In contrast, we observed that infections in humans were more likely to harbor a dominant haplotype than infections in mosquitoes, potentially due to removal of unfit strains by the human immune system. Finally, within a given mosquito, there was little overlap in genetic composition of abdomen and head infections, suggesting that infections may be cleared from the abdomen during a mosquito's lifespan. Taken together, our observations provide evidence for the mosquito vector acting as a reservoir of sequence diversity in malaria parasite populations. IMPORTANCE Plasmodium falciparum is the deadliest human malaria parasite, and infections consisting of concurrent, multiple strains are common in regions of high endemicity. During transitions within and between the parasite's mosquito and human hosts, these strains are subject to population bottlenecks, and distinct parasite strains may have differential fitness in the various environments encountered. These bottlenecks and fitness differences may lead to differences in strain prevalence and diversity between hosts. We investigated differences in genetic diversity and evenness between P. falciparum parasites in human and mosquito hosts collected from the same households during a 14-month longitudinal study in Kenya. Compared to human parasite populations and infections, P. falciparum parasites observed in mosquito populations and infections were more diverse by multiple population genetic metrics. This suggests that the mosquito vector acts as a reservoir of sequence diversity in malaria parasite populations.
Assuntos
Culicidae , Variação Genética , Malária Falciparum , Plasmodium falciparum , Animais , Humanos , Culicidae/parasitologia , Estudos Longitudinais , Malária Falciparum/parasitologia , Plasmodium falciparum/genéticaRESUMO
The prevalence of extended-spectrum beta-lactamases (ESBLs) among clinical isolates of Escherichia coli has been increasing, with this spread driven by ESBL-encoding plasmids. However, the epidemiology of ESBL-disseminating plasmids remains understudied, obscuring the roles of individual plasmid lineages in ESBL spread. To address this, we performed an in-depth genomic investigation of 149 clinical ESBL-like E. coli isolates from a tertiary care hospital. We obtained high-quality assemblies for 446 plasmids, revealing an extensive map of plasmid sharing that crosses time, space, and bacterial sequence type boundaries. Through a sequence-based network, we identified specific plasmid lineages that are responsible for the dissemination of major ESBLs. Notably, we demonstrate that IncF plasmids separate into 2 distinct lineages that are enriched for different ESBLs and occupy distinct host ranges. Our work provides a detailed picture of plasmid-mediated spread of ESBLs, demonstrating the extensive sequence diversity within identified lineages, while highlighting the genetic elements that underlie the persistence of these plasmids within the clinical E. coli population. IMPORTANCE The increasing incidence of nosocomial infections with extended-spectrum beta-lactamase (ESBL)-producing Escherichia coli represents a significant threat to public health, given the limited treatment options available for such infections. The rapid ESBL spread is suggested to be driven by localization of the resistance genes on conjugative plasmids. Here, we identify the contributions of different plasmid lineages in the nosocomial spread of ESBLs. We provide further support for plasmid-mediated spread of ESBLs but demonstrate that some ESBL genes rely on dissemination through plasmids more than the others. We identify key plasmid lineages that are enriched in major ESBL genes and highlight the encoded genetic elements that facilitate the transmission and stable maintenance of these plasmid groups within the clinical E. coli population. Overall, our work provides valuable insight into the dissemination of ESBLs through plasmids, furthering our understating of factors underlying the increased prevalence of these genes in nosocomial settings.
Assuntos
Infecções por Escherichia coli , Escherichia coli , Humanos , Escherichia coli/genética , Infecções por Escherichia coli/epidemiologia , beta-Lactamases/genética , Plasmídeos/genética , HospitaisRESUMO
Antigenically variable RNA viruses are significant contributors to the burden of infectious disease worldwide. One reason for their ubiquity is their ability to escape herd immunity through rapid antigenic evolution and thereby to reinfect previously infected hosts. However, the ways in which these viruses evolve antigenically are highly diverse. Some have only limited diversity in the long-run, with every emergence of a new antigenic variant coupled with a replacement of the older variant. Other viruses rapidly accumulate antigenic diversity over time. Others still exhibit dynamics that can be considered evolutionary intermediates between these two extremes. Here, we present a theoretical framework that aims to understand these differences in evolutionary patterns by considering a virus's epidemiological dynamics in a given host population. Our framework, based on a dimensionless number, probabilistically anticipates patterns of viral antigenic diversification and thereby quantifies a virus's evolutionary potential. It is therefore similar in spirit to the basic reproduction number, the well-known dimensionless number which quantifies a pathogen's reproductive potential. We further outline how our theoretical framework can be applied to empirical viral systems, using influenza A/H3N2 as a case study. We end with predictions of our framework and work that remains to be done to further integrate viral evolutionary dynamics with disease ecology.
Assuntos
Variação Antigênica , Antígenos Virais/química , Evolução Molecular , Vírus de RNA/genética , Filogenia , Vírus de RNA/imunologia , Vírus de RNA/isolamento & purificaçãoRESUMO
The fitness of a pathogen is a composite phenotype determined by many different factors influencing growth rates both within and between hosts. Determining what factors shape fitness at the host population-level is especially challenging because both intrinsic factors like pathogen genetics and extrinsic factors such as host behavior influence between-host transmission potential. This challenge has been highlighted by controversy surrounding the population-level fitness effects of mutations in the SARS-CoV-2 genome and their relative importance when compared against non-genetic factors shaping transmission dynamics. Building upon phylodynamic birth-death models, we develop a new framework to learn how hundreds of genetic and non-genetic factors have shaped the fitness of SARS-CoV-2. We estimate the fitness effects of all amino acid variants and several structural variants that have circulated in the United States between February 2020 and March 2021 from viral phylogenies. We also estimate how much fitness variation among pathogen lineages is attributable to genetic versus non-genetic factors such as spatial heterogeneity in transmission rates. Before September 2020, most fitness variation between lineages can be explained by background spatial heterogeneity in transmission rates across geographic regions. Starting in late 2020, genetic variation in fitness increased dramatically with the emergence of several new lineages including B.1.1.7, B.1.427, B.1.429 and B.1.526. Our analysis also indicates that genetic variants in less well-explored genomic regions outside of Spike may be contributing significantly to overall fitness variation in the viral population.