RESUMEN
The standard neutral model of molecular evolution has traditionally been used as the null model for population genomics. We gathered a collection of 45 genome-wide site frequency spectra from a diverse set of species, most of which display an excess of low and high frequency variants compared to the expectation of the standard neutral model, resulting in U-shaped spectra. We show that multiple merger coalescent models often provide a better fit to these observations than the standard Kingman coalescent. Hence, in many circumstances these under-utilized models may serve as the more appropriate reference for genomic analyses. We further discuss the underlying evolutionary processes that may result in the widespread U-shape of frequency spectra.
Asunto(s)
Evolución Biológica , Evolución Molecular , Modelos GenéticosRESUMEN
Developing tools to accurately predict the clinical prevalence of drug-resistant mutations is a key step toward generating more effective therapeutics. Here we describe a high-throughput CRISPR-Cas9-based saturated mutagenesis approach to generate comprehensive libraries of point mutations at a defined genomic location and systematically study their effect on cell growth. As proof of concept, we mutagenized a selected region within the leukemic oncogene BCR-ABL1 Using bulk competitions with a deep-sequencing readout, we analyzed hundreds of mutations under multiple drug conditions and found that the effects of mutations on growth in the presence or absence of drug were critical for predicting clinically relevant resistant mutations, many of which were cancer adaptive in the absence of drug pressure. Using this approach, we identified all clinically isolated BCR-ABL1 mutations and achieved a prediction score that correlated highly with their clinical prevalence. The strategy described here can be broadly applied to a variety of oncogenes to predict patient mutations and evaluate resistance susceptibility in the development of new therapeutics.
Asunto(s)
Sistemas CRISPR-Cas/genética , Resistencia a Antineoplásicos/genética , Mutagénesis/genética , Animales , Antineoplásicos/farmacología , Sistemas CRISPR-Cas/efectos de los fármacos , Línea Celular Tumoral , Repeticiones Palindrómicas Cortas Agrupadas y Regularmente Espaciadas/efectos de los fármacos , Repeticiones Palindrómicas Cortas Agrupadas y Regularmente Espaciadas/genética , Resistencia a Antineoplásicos/efectos de los fármacos , Proteínas de Fusión bcr-abl/genética , Leucemia/tratamiento farmacológico , Leucemia/genética , Ratones , Mutagénesis/efectos de los fármacos , Oncogenes/genética , Mutación Puntual/efectos de los fármacos , Mutación Puntual/genéticaRESUMEN
The study of fitness landscapes, which aims at mapping genotypes to fitness, is receiving ever-increasing attention. Novel experimental approaches combined with next-generation sequencing (NGS) methods enable accurate and extensive studies of the fitness effects of mutations, allowing us to test theoretical predictions and improve our understanding of the shape of the true underlying fitness landscape and its implications for the predictability and repeatability of evolution. Here, we present a uniquely large multiallelic fitness landscape comprising 640 engineered mutants that represent all possible combinations of 13 amino acid-changing mutations at 6 sites in the heat-shock protein Hsp90 in Saccharomyces cerevisiae under elevated salinity. Despite a prevalent pattern of negative epistasis in the landscape, we find that the global fitness peak is reached via four positively epistatic mutations. Combining traditional and extending recently proposed theoretical and statistical approaches, we quantify features of the global multiallelic fitness landscape. Using subsets of the data, we demonstrate that extrapolation beyond a known part of the landscape is difficult owing to both local ruggedness and amino acid-specific epistatic hotspots and that inference is additionally confounded by the nonrandom choice of mutations for experimental fitness landscapes.
Asunto(s)
Adaptación Biológica , Evolución Biológica , Aptitud Genética , Proteínas HSP90 de Choque Térmico/genética , Proteínas de Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/genética , Epistasis Genética , Mutación Missense , SalinidadRESUMEN
Intrahost and interhost assessments of viral diversity are often treated as measures of separate and distinct evolutionary processes, with numerous investigations reporting seemingly incompatible results between the two. For example, in human cytomegalovirus, the nucleotide diversity estimates are 10-fold higher for interhost data, while the number of segregating (i.e., polymorphic) sites is 6-fold lower. These results have been interpreted as demonstrating that sampled intrahost variants are strongly deleterious. In reality, however, these observations are fully consistent with standard population genetic expectations. Here, we analyze published intra- and interhost data sets within this framework, utilizing statistical inference tools to quantify the fitness effects of segregating mutations. Further, we utilize population level simulations to clarify expectations under common evolutionary models. Contrary to common claims in the literature, these results suggest that most observed polymorphisms are likely nearly neutral with regard to fitness and that standard population genetic models in fact well predict observed levels of both intra- and interhost variability. IMPORTANCE With the increasing number of evolutionary virology studies examining both intrahost and interhost patterns of genomic variation, a number of seemingly incompatible results have emerged, revolving around the far greater level of observed intrahost than interhost variation. This has led many authors to suggest that the great majority of sampled within-host polymorphisms are strongly deleterious. Here, we demonstrate that there is in fact no incompatibility of these results and, indeed, that the vast majority of sampled within-host variation is likely neutral. These results thus represent a major shift in the current view of observed viral variation.
Asunto(s)
Infecciones por Citomegalovirus/virología , Citomegalovirus/genética , Alelos , Evolución Molecular , Frecuencia de los Genes , Genes Virales , Aptitud Genética , Humanos , Modelos Genéticos , Filogenia , Polimorfismo GenéticoRESUMEN
Fitness landscapes map the relationship between genotypes and fitness. However, most fitness landscape studies ignore the genetic architecture imposed by the codon table and thereby neglect the potential role of synonymous mutations. To quantify the fitness effects of synonymous mutations and their potential impact on adaptation on a fitness landscape, we use a new software based on Bayesian Monte Carlo Markov Chain methods and re-estimate selection coefficients of all possible codon mutations across 9 amino acid positions in Saccharomyces cerevisiae Hsp90 across 6 environments. We quantify the distribution of fitness effects of synonymous mutations and show that it is dominated by many mutations of small or no effect and few mutations of larger effect. We then compare the shape of the codon fitness landscape across amino acid positions and environments, and quantify how the consideration of synonymous fitness effects changes the evolutionary dynamics on these fitness landscapes. Together these results highlight a possible role of synonymous mutations in adaptation and indicate the potential mis-inference when they are neglected in fitness landscape studies.
Asunto(s)
Codón , Aptitud Genética , Proteínas HSP90 de Choque Térmico/genética , Proteínas de Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/fisiología , Adaptación Fisiológica/genética , Teorema de Bayes , Epistasis Genética , Evolución Molecular , Genes Fúngicos , Proteínas HSP90 de Choque Térmico/química , Cadenas de Markov , Mutación , Proteínas de Saccharomyces cerevisiae/químicaAsunto(s)
Macrodatos , Evolución Molecular , Aptitud Genética , Variación Genética , Humanos , Procesos EstocásticosRESUMEN
Nonequilibrium demography impacts coalescent genealogies leaving detectable, well-studied signatures of variation. However, similar genomic footprints are also expected under models of large reproductive skew, posing a serious problem when trying to make inference. Furthermore, current approaches consider only one of the two processes at a time, neglecting any genomic signal that could arise from their simultaneous effects, preventing the possibility of jointly inferring parameters relating to both offspring distribution and population history. Here, we develop an extended Moran model with exponential population growth, and demonstrate that the underlying ancestral process converges to a time-inhomogeneous psi-coalescent. However, by applying a nonlinear change of time scale-analogous to the Kingman coalescent-we find that the ancestral process can be rescaled to its time-homogeneous analog, allowing the process to be simulated quickly and efficiently. Furthermore, we derive analytical expressions for the expected site-frequency spectrum under the time-inhomogeneous psi-coalescent, and develop an approximate-likelihood framework for the joint estimation of the coalescent and growth parameters. By means of extensive simulation, we demonstrate that both can be estimated accurately from whole-genome data. In addition, not accounting for demography can lead to serious biases in the inferred coalescent model, with broad implications for genomic studies ranging from ecology to conservation biology. Finally, we use our method to analyze sequence data from Japanese sardine populations, and find evidence of high variation in individual reproductive success, but few signs of a recent demographic expansion.
Asunto(s)
Algoritmos , Modelos TeóricosRESUMEN
The extinction of RNA virus populations upon application of a mutagenic drug is frequently referred to as evidence for the existence of an error threshold, above which the population cannot sustain the mutational load. To explain the extinction process after reaching this threshold, models of lethal mutagenesis have been proposed, in which extinction is described as a deterministic (and thus population size-independent) process. As a separate body of literature, the population genetics community has developed models of mutational meltdown, which focus on the stochastic (and thus population-size dependent) processes governing extinction. However, recent extensions of both models have blurred these boundaries. Here, we first clarify definitions in terms of assumptions, expectations, and relevant parameter spaces, and then assess similarities and differences. As concepts from both fields converge, we argue for a unified theoretical framework that is focused on the evolutionary processes at play, rather than dispute over terminology.
RESUMEN
Influenza virus inflicts a heavy death toll annually and resistance to existing antiviral drugs has generated interest in the development of agents with novel mechanisms of action. Favipiravir is an antiviral drug that acts by increasing the genome-wide mutation rate of influenza A virus (IAV). Potential synergistic benefits of combining oseltamivir and favipiravir have been demonstrated in animal models of influenza, but the population-level effects of combining the drugs are unknown. In order to elucidate the underlying evolutionary processes at play, we performed genome-wide sequencing of IAV experimental populations subjected to serial passaging in vitro under a combined protocol of oseltamivir and favipiravir. We describe the interplay between mutation, selection, and genetic drift that ultimately culminates in population extinction. In particular, selective sweeps around oseltamivir resistance mutations reduce genome-wide variation while deleterious mutations hitchhike to fixation given the increased mutational load generated by favipiravir. This latter effect reduces viral fitness and accelerates extinction compared with IAV populations treated with favipiravir alone, but risks spreading both established and newly emerging mutations, including possible drug resistance mutations, if transmission occurs before the viral populations are eradicated.
Asunto(s)
Amidas/farmacología , Antivirales/farmacología , Evolución Biológica , Virus de la Influenza A/efectos de los fármacos , Infecciones por Orthomyxoviridae/tratamiento farmacológico , Oseltamivir/farmacología , Pirazinas/farmacología , Animales , Línea Celular , Perros , Genética de Población , Virus de la Influenza A/clasificación , Tasa de Mutación , Infecciones por Orthomyxoviridae/virologíaRESUMEN
During his well-known debate with Fisher regarding the phenotypic dataset of Panaxia dominula, Wright suggested fluctuating selection as a potential explanation for the observed change in allele frequencies. This model has since been invoked in a number of analyses, with the focus of discussion centering mainly on random or oscillatory fluctuations of selection intensities. Here, we present a novel method to consider nonrandom changes in selection intensities using Wright-Fisher approximate Bayesian (ABC)-based approaches, in order to detect and evaluate a change in selection strength from time-sampled data. This novel method jointly estimates the position of a change point as well as the strength of both corresponding selection coefficients (and dominance for diploid cases) from the allele trajectory. The simulation studies of this method reveal the combinations of parameter ranges and input values that optimize performance, thus indicating optimal experimental design strategies. We apply this approach to both the historical dataset of P. dominula in order to shed light on this historical debate, as well as to whole-genome time-serial data from influenza virus in order to identify sites with changing selection intensities in response to drug treatment.
Asunto(s)
Genética de Población , Modelos Genéticos , Polimorfismo Genético , Selección Genética , Algoritmos , Alelos , Teorema de Bayes , Evolución Biológica , Simulación por Computador , Interacciones Huésped-Patógeno/genética , Interacciones Huésped-Patógeno/inmunología , Mutación , Polimorfismo de Nucleótido Simple , Curva ROCRESUMEN
The characterization of the distribution of mutational effects is a key goal in evolutionary biology. Recently developed deep-sequencing approaches allow for accurate and simultaneous estimation of the fitness effects of hundreds of engineered mutations by monitoring their relative abundance across time points in a single bulk competition. Naturally, the achievable resolution of the estimated fitness effects depends on the specific experimental setup, the organism and type of mutations studied, and the sequencing technology utilized, among other factors. By means of analytical approximations and simulations, we provide guidelines for optimizing time-sampled deep-sequencing bulk competition experiments, focusing on the number of mutants, the sequencing depth, and the number of sampled time points. Our analytical results show that sampling more time points together with extending the duration of the experiment improves the achievable precision disproportionately compared with increasing the sequencing depth or reducing the number of competing mutants. Even if the duration of the experiment is fixed, sampling more time points and clustering these at the beginning and the end of the experiment increase experimental power and allow for efficient and precise assessment of the entire range of selection coefficients. Finally, we provide a formula for calculating the 95%-confidence interval for the measurement error estimate, which we implement as an interactive web tool. This allows for quantification of the maximum expected a priori precision of the experimental setup, as well as for a statistical threshold for determining deviations from neutrality for specific selection coefficient estimates.
Asunto(s)
Análisis Mutacional de ADN/métodos , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Modelos Estadísticos , Evolución Biológica , Biometría , Aptitud Genética , Genética de Población/métodos , Modelos Genéticos , Mutación , Selección GenéticaRESUMEN
The rapid evolution of drug resistance remains a critical public health concern. The treatment of influenza A virus (IAV) has proven particularly challenging, due to the ability of the virus to develop resistance against current antivirals and vaccines. Here, we evaluate a novel antiviral drug therapy, favipiravir, for which the mechanism of action in IAV involves an interaction with the viral RNA-dependent RNA polymerase resulting in an effective increase in the viral mutation rate. We used an experimental evolution framework, combined with novel population genetic method development for inference from time-sampled data, to evaluate the effectiveness of favipiravir against IAV. Evaluating whole genome polymorphism data across 15 time points under multiple drug concentrations and in controls, we present the first evidence for the ability of IAV populations to effectively adapt to low concentrations of favipiravir. In contrast, under high concentrations, we observe population extinction, indicative of mutational meltdown. We discuss the observed dynamics with respect to the evolutionary forces at play and emphasize the utility of evolutionary theory to inform drug development.
Asunto(s)
Amidas/farmacología , Antirretrovirales/farmacología , Farmacorresistencia Viral/genética , Subtipo H1N1 del Virus de la Influenza A/genética , Tasa de Mutación , Pirazinas/farmacología , Animales , Perros , Evolución Molecular , Subtipo H1N1 del Virus de la Influenza A/efectos de los fármacos , Células de Riñón Canino Madin Darby , Polimorfismo GenéticoRESUMEN
Adaptation lies at the heart of Darwinian evolution. Accordingly, numerous studies have tried to provide a formal framework for the description of the adaptive process. Of these, two complementary modeling approaches have emerged: While so-called adaptive-walk models consider adaptation from the successive fixation of de novo mutations only, quantitative genetic models assume that adaptation proceeds exclusively from preexisting standing genetic variation. The latter approach, however, has focused on short-term evolution of population means and variances rather than on the statistical properties of adaptive substitutions. Our aim is to combine these two approaches by describing the ecological and genetic factors that determine the genetic basis of adaptation from standing genetic variation in terms of the effect-size distribution of individual alleles. Specifically, we consider the evolution of a quantitative trait to a gradually changing environment. By means of analytical approximations, we derive the distribution of adaptive substitutions from standing genetic variation, that is, the distribution of the phenotypic effects of those alleles from the standing variation that become fixed during adaptation. Our results are checked against individual-based simulations. We find that, compared to adaptation from de novo mutations, (i) adaptation from standing variation proceeds by the fixation of more alleles of small effect and (ii) populations that adapt from standing genetic variation can traverse larger distances in phenotype space and, thus, have a higher potential for adaptation if the rate of environmental change is fast rather than slow.
Asunto(s)
Adaptación Fisiológica , Variación Genética , Modelos Genéticos , Fenotipo , Alelos , Ambiente , Evolución Molecular , Mutación , ProbabilidadRESUMEN
An increasing number of studies demonstrate phenotypic and genetic changes in natural populations that are subject to climate change, and there is hope that some of these changes will contribute to avoiding species extinctions ('evolutionary rescue'). Here, we review theoretical models of rapid evolution in quantitative traits that can shed light on the potential for adaptation to a changing climate. Our focus is on quantitative-genetic models with selection for a moving phenotypic optimum. We point out that there is no one-to-one relationship between the rate of adaptation and population survival, because the former depends on relative fitness and the latter on absolute fitness. Nevertheless, previous estimates that sustainable rates of genetically based change usually do not exceed 0.1 haldanes (i.e., phenotypic standard deviations per generation) are probably correct. Survival can be greatly facilitated by phenotypic plasticity, and heritable variation in plasticity can further speed up genetic evolution. Multivariate selection and genetic correlations are frequently assumed to constrain adaptation, but this is not necessarily the case and depends on the geometric relationship between the fitness landscape and the structure of genetic variation. Similar conclusions hold for adaptation to shifting spatial gradients. Recent models of adaptation in multispecies communities indicate that the potential for rapid evolution is strongly influenced by interspecific competition.
RESUMEN
Fisher's geometric model has been widely used to study the effects of pleiotropy and organismic complexity on phenotypic adaptation. Here, we study a version of Fisher's model in which a population adapts to a gradually moving optimum. Key parameters are the rate of environmental change, the dimensionality of phenotype space, and the patterns of mutational and selectional correlations. We focus on the distribution of adaptive substitutions, that is, the multivariate distribution of the phenotypic effects of fixed beneficial mutations. Our main results are based on an "adaptive-walk approximation," which is checked against individual-based simulations. We find that (1) the distribution of adaptive substitutions is strongly affected by the ecological dynamics and largely depends on a single composite parameter γ, which scales the rate of environmental change by the "adaptive potential" of the population; (2) the distribution of adaptive substitution reflects the shape of the fitness landscape if the environment changes slowly, whereas it mirrors the distribution of new mutations if the environment changes fast; (3) in contrast to classical models of adaptation assuming a constant optimum, with a moving optimum, more complex organisms evolve via larger adaptive steps.
Asunto(s)
Adaptación Fisiológica/genética , Evolución Biológica , Modelos Genéticos , Fenotipo , Selección Genética , Ambiente , Pleiotropía Genética , Genética de Población , Mutación/genética , Factores de TiempoRESUMEN
Condensing the information of a total of 1551 to 469 influenza A H1N1 isolates we investigated the frequency of host shifts among bird, human and swine. Phylogenies of hemagglutinin and neuraminidase as well as ancestral host reconstructions were simultaneously inferred in a Bayesian framework. The surface proteins had to be analyzed separately because of reassortment. Also the different tree topologies indicated the different evolutionary histories of these genes. The majority of interspecies transmissions involved isolates from swine confirming the role of pigs as "mixing vessel" for the influenza A virus. This was emphasized by the investigation of host specific amino acid positions. However, the simultaneous estimation of phylogeny and ancestral states resulted in considerable ambiguity in particular at deeper nodes and at the root cautioning against overstated conclusions. Our analysis highlights the urge of intensifying influenza surveillance programs for porcine hosts.