Pesquisa | Portal Regional da BVS

1.

diplo-locus: A lightweight toolkit for inference and simulation of time-series genetic data under general diploid selection.

Cheng, Xiaoheng; Steinrücken, Matthias.

bioRxiv ; 2023 Oct 17.

Artigo em Inglês | MEDLINE | ID: mdl-37905072

RESUMO

Summary: Whole-genome time-series allele frequency data are becoming more prevalent as ancient DNA (aDNA) sequences and data from evolve-and-resequence (E&R) experiments are generated at a rapid pace. Such data presents unprecedented opportunities to elucidate the dynamics of adaptative genetic variation. However, despite many methods to infer parameters of selection models from allele frequency trajectories available in the literature, few provide user-friendly implementations for large-scale empirical applications. Here, we present diplo-locus, an open-source Python package that provides functionality to simulate and perform inference from time-series under the Wright-Fisher diffusion with general diploid selection. The package includes Python modules as well as command-line tools. Availability: Python package and command-line tool avilable at: https://github.com/steinrue/diplo_locus or https://pypi.org/project/diplo-locus/.

2.

Reply to Barton et al: signatures of natural selection during the Black Death.

Vilgalys, Tauras P; Klunk, Jennifer; Demeure, Christian E; Cheng, Xiaoheng; Shiratori, Mari; Madej, Julien; Beau, Rémi; Elli, Derek; Patino, Maria I; Redfern, Rebecca; DeWitte, Sharon N; Gamble, Julia A; Boldsen, Jesper L; Carmichael, Ann; Varlik, Nükhet; Eaton, Katherine; Grenier, Jean-Christophe; Golding, G Brian; Devault, Alison; Rouillard, Jean-Marie; Yotova, Vania; Sindeaux, Renata; Ye, Chun Jimmie; Bikaran, Matin; Dumaine, Anne; Brinkworth, Jessica F; Missiakas, Dominique; Rouleau, Guy A; Steinrücken, Matthias; Pizarro-Cerdá, Javier; Poinar, Hendrik N; Barreiro, Luis B.

bioRxiv ; 2023 Apr 07.

Artigo em Inglês | MEDLINE | ID: mdl-37066254

RESUMO

Barton et al.1 raise several statistical concerns regarding our original analyses2 that highlight the challenge of inferring natural selection using ancient genomic data. We show here that these concerns have limited impact on our original conclusions. Specifically, we recover the same signature of enrichment for high FST values at the immune loci relative to putatively neutral sites after switching the allele frequency estimation method to a maximum likelihood approach, filtering to only consider known human variants, and down-sampling our data to the same mean coverage across sites. Furthermore, using permutations, we show that the rs2549794 variant near ERAP2 continues to emerge as the strongest candidate for selection (p = 1.2×10-5), falling below the Bonferroni-corrected significance threshold recommended by Barton et al. Importantly, the evidence for selection on ERAP2 is further supported by functional data demonstrating the impact of the ERAP2 genotype on the immune response to Y. pestis and by epidemiological data from an independent group showing that the putatively selected allele during the Black Death protects against severe respiratory infection in contemporary populations.

3.

Evolution of immune genes is associated with the Black Death.

Klunk, Jennifer; Vilgalys, Tauras P; Demeure, Christian E; Cheng, Xiaoheng; Shiratori, Mari; Madej, Julien; Beau, Rémi; Elli, Derek; Patino, Maria I; Redfern, Rebecca; DeWitte, Sharon N; Gamble, Julia A; Boldsen, Jesper L; Carmichael, Ann; Varlik, Nükhet; Eaton, Katherine; Grenier, Jean-Christophe; Golding, G Brian; Devault, Alison; Rouillard, Jean-Marie; Yotova, Vania; Sindeaux, Renata; Ye, Chun Jimmie; Bikaran, Matin; Dumaine, Anne; Brinkworth, Jessica F; Missiakas, Dominique; Rouleau, Guy A; Steinrücken, Matthias; Pizarro-Cerdá, Javier; Poinar, Hendrik N; Barreiro, Luis B.

Nature ; 611(7935): 312-319, 2022 Nov.

Artigo em Inglês | MEDLINE | ID: mdl-36261521

RESUMO

Infectious diseases are among the strongest selective pressures driving human evolution1,2. This includes the single greatest mortality event in recorded history, the first outbreak of the second pandemic of plague, commonly called the Black Death, which was caused by the bacterium Yersinia pestis3. This pandemic devastated Afro-Eurasia, killing up to 30-50% of the population4. To identify loci that may have been under selection during the Black Death, we characterized genetic variation around immune-related genes from 206 ancient DNA extracts, stemming from two different European populations before, during and after the Black Death. Immune loci are strongly enriched for highly differentiated sites relative to a set of non-immune loci, suggesting positive selection. We identify 245 variants that are highly differentiated within the London dataset, four of which were replicated in an independent cohort from Denmark, and represent the strongest candidates for positive selection. The selected allele for one of these variants, rs2549794, is associated with the production of a full-length (versus truncated) ERAP2 transcript, variation in cytokine response to Y. pestis and increased ability to control intracellular Y. pestis in macrophages. Finally, we show that protective variants overlap with alleles that are today associated with increased susceptibility to autoimmune diseases, providing empirical evidence for the role played by past pandemics in shaping present-day susceptibility to disease.

Assuntos

DNA Antigo , Predisposição Genética para Doença , Imunidade , Peste , Seleção Genética , Yersinia pestis , Humanos , Aminopeptidases/genética , Aminopeptidases/imunologia , Peste/genética , Peste/imunologia , Peste/microbiologia , Peste/mortalidade , Yersinia pestis/imunologia , Yersinia pestis/patogenicidade , Seleção Genética/imunologia , Europa (Continente)/epidemiologia , Europa (Continente)/etnologia , Imunidade/genética , Conjuntos de Dados como Assunto , Londres/epidemiologia , Dinamarca/epidemiologia

4.

Robust inference of population size histories from genomic sequencing data.

Upadhya, Gautam; Steinrücken, Matthias.

PLoS Comput Biol ; 18(9): e1010419, 2022 09.

Artigo em Inglês | MEDLINE | ID: mdl-36112715

RESUMO

Unraveling the complex demographic histories of natural populations is a central problem in population genetics. Understanding past demographic events is of general anthropological interest, but is also an important step in establishing accurate null models when identifying adaptive or disease-associated genetic variation. An important class of tools for inferring past population size changes from genomic sequence data are Coalescent Hidden Markov Models (CHMMs). These models make efficient use of the linkage information in population genomic datasets by using the local genealogies relating sampled individuals as latent states that evolve along the chromosome in an HMM framework. Extending these models to large sample sizes is challenging, since the number of possible latent states increases rapidly. Here, we present our method CHIMP (CHMM History-Inference Maximum-Likelihood Procedure), a novel CHMM method for inferring the size history of a population. It can be applied to large samples (hundreds of haplotypes) and only requires unphased genomes as input. The two implementations of CHIMP that we present here use either the height of the genealogical tree (TMRCA) or the total branch length, respectively, as the latent variable at each position in the genome. The requisite transition and emission probabilities are obtained by numerically solving certain systems of differential equations derived from the ancestral process with recombination. The parameters of the population size history are subsequently inferred using an Expectation-Maximization algorithm. In addition, we implement a composite likelihood scheme to allow the method to scale to large sample sizes. We demonstrate the efficiency and accuracy of our method in a variety of benchmark tests using simulated data and present comparisons to other state-of-the-art methods. Specifically, our implementation using TMRCA as the latent variable shows comparable performance and provides accurate estimates of effective population sizes in intermediate and ancient times. Our method is agnostic to the phasing of the data, which makes it a promising alternative in scenarios where high quality data is not available, and has potential applications for pseudo-haploid data.

Assuntos

Genética Populacional , Modelos Genéticos , Algoritmos , Simulação por Computador , Genômica , Humanos , Cadeias de Markov , Densidade Demográfica

5.

Polygenic score accuracy in ancient samples: Quantifying the effects of allelic turnover.

Carlson, Maryn O; Rice, Daniel P; Berg, Jeremy J; Steinrücken, Matthias.

PLoS Genet ; 18(5): e1010170, 2022 05.

Artigo em Inglês | MEDLINE | ID: mdl-35522704

RESUMO

Polygenic scores link the genotypes of ancient individuals to their phenotypes, which are often unobservable, offering a tantalizing opportunity to reconstruct complex trait evolution. In practice, however, interpretation of ancient polygenic scores is subject to numerous assumptions. For one, the genome-wide association (GWA) studies from which polygenic scores are derived, can only estimate effect sizes for loci segregating in contemporary populations. Therefore, a GWA study may not correctly identify all loci relevant to trait variation in the ancient population. In addition, the frequencies of trait-associated loci may have changed in the intervening years. Here, we devise a theoretical framework to quantify the effect of this allelic turnover on the statistical properties of polygenic scores as functions of population genetic dynamics, trait architecture, power to detect significant loci, and the age of the ancient sample. We model the allele frequencies of loci underlying trait variation using the Wright-Fisher diffusion, and employ the spectral representation of its transition density to find analytical expressions for several error metrics, including the expected sample correlation between the polygenic scores of ancient individuals and their true phenotypes, referred to as polygenic score accuracy. Our theory also applies to a two-population scenario and demonstrates that allelic turnover alone may explain a substantial percentage of the reduced accuracy observed in cross-population predictions, akin to those performed in human genetics. Finally, we use simulations to explore the effects of recent directional selection, a bias-inducing process, on the statistics of interest. We find that even in the presence of bias, weak selection induces minimal deviations from our neutral expectations for the decay of polygenic score accuracy. By quantifying the limitations of polygenic scores in an explicit evolutionary context, our work lays the foundation for the development of more sophisticated statistical procedures to analyze both temporally and geographically resolved polygenic scores.

Assuntos

Estudo de Associação Genômica Ampla , Herança Multifatorial , Alelos , Frequência do Gene/genética , Estudo de Associação Genômica Ampla/métodos , Modelos Genéticos , Herança Multifatorial/genética , Seleção Genética

6.

Recurrent inversion polymorphisms in humans associate with genetic instability and genomic disorders.

Porubsky, David; Höps, Wolfram; Ashraf, Hufsah; Hsieh, PingHsun; Rodriguez-Martin, Bernardo; Yilmaz, Feyza; Ebler, Jana; Hallast, Pille; Maria Maggiolini, Flavia Angela; Harvey, William T; Henning, Barbara; Audano, Peter A; Gordon, David S; Ebert, Peter; Hasenfeld, Patrick; Benito, Eva; Zhu, Qihui; Lee, Charles; Antonacci, Francesca; Steinrücken, Matthias; Beck, Christine R; Sanders, Ashley D; Marschall, Tobias; Eichler, Evan E; Korbel, Jan O.

Cell ; 185(11): 1986-2005.e26, 2022 05 26.

Artigo em Inglês | MEDLINE | ID: mdl-35525246

RESUMO

Unlike copy number variants (CNVs), inversions remain an underexplored genetic variation class. By integrating multiple genomic technologies, we discover 729 inversions in 41 human genomes. Approximately 85% of inversions <2 kbp form by twin-priming during L1 retrotransposition; 80% of the larger inversions are balanced and affect twice as many nucleotides as CNVs. Balanced inversions show an excess of common variants, and 72% are flanked by segmental duplications (SDs) or retrotransposons. Since flanking repeats promote non-allelic homologous recombination, we developed complementary approaches to identify recurrent inversion formation. We describe 40 recurrent inversions encompassing 0.6% of the genome, showing inversion rates up to 2.7 × 10-4 per locus per generation. Recurrent inversions exhibit a sex-chromosomal bias and co-localize with genomic disorder critical regions. We propose that inversion recurrence results in an elevated number of heterozygous carriers and structural SD diversity, which increases mutability in the population and predisposes specific haplotypes to disease-causing CNVs.

Assuntos

Inversão Cromossômica , Duplicações Segmentares Genômicas , Inversão Cromossômica/genética , Variações do Número de Cópias de DNA/genética , Genoma Humano , Genômica , Humanos

7.

Properties of 2-locus genealogies and linkage disequilibrium in temporally structured samples.

Biddanda, Arjun; Steinrücken, Matthias; Novembre, John.

Genetics ; 221(1)2022 05 05.

Artigo em Inglês | MEDLINE | ID: mdl-35294015

RESUMO

Archeogenetics has been revolutionary, revealing insights into demographic history and recent positive selection. However, most studies to date have ignored the nonrandom association of genetic variants at different loci (i.e. linkage disequilibrium). This may be in part because basic properties of linkage disequilibrium in samples from different times are still not well understood. Here, we derive several results for summary statistics of haplotypic variation under a model with time-stratified sampling: (1) The correlation between the number of pairwise differences observed between time-staggered samples (πΔt) in models with and without strict population continuity; (2) The product of the linkage disequilibrium coefficient, D, between ancient and modern samples, which is a measure of haplotypic similarity between modern and ancient samples; and (3) The expected switch rate in the Li and Stephens haplotype copying model. The latter has implications for genotype imputation and phasing in ancient samples with modern reference panels. Overall, these results provide a characterization of how haplotype patterns are affected by sample age, recombination rates, and population sizes. We expect these results will help guide the interpretation and analysis of haplotype data from ancient and modern samples.

Assuntos

Arqueologia/métodos , Genética Populacional/métodos , Genótipo , Haplótipos , Humanos , Desequilíbrio de Ligação , Densidade Demográfica

8.

A numerical framework for genetic hitchhiking in populations of variable size.

Friedlander, Eric; Steinrücken, Matthias.

Genetics ; 220(3)2022 03 03.

Artigo em Inglês | MEDLINE | ID: mdl-35143667

RESUMO

Natural selection on beneficial or deleterious alleles results in an increase or decrease, respectively, of their frequency within the population. Due to chromosomal linkage, the dynamics of the selected site affect the genetic variation at nearby neutral loci in a process commonly referred to as genetic hitchhiking. Changes in population size, however, can yield patterns in genomic data that mimic the effects of selection. Accurately modeling these dynamics is thus crucial to understanding how selection and past population size changes impact observed patterns of genetic variation. Here, we model the evolution of haplotype frequencies with the Wright-Fisher diffusion to study the impact of selection on linked neutral variation. Explicit solutions are not known for the dynamics of this diffusion when selection and recombination act simultaneously. Thus, we present a method for numerically evaluating the Wright-Fisher diffusion dynamics of 2 linked loci separated by a certain recombination distance when selection is acting. We can account for arbitrary population size histories explicitly using this approach. A key step in the method is to express the moments of the associated transition density, or sampling probabilities, as solutions to ordinary differential equations. Numerically solving these differential equations relies on a novel accurate and numerically efficient technique to estimate higher order moments from lower order moments. We demonstrate how this numerical framework can be used to quantify the reduction and recovery of genetic diversity around a selected locus over time and elucidate distortions in the site-frequency-spectra of neutral variation linked to loci under selection in various demographic settings. The method can be readily extended to more general modes of selection and applied in likelihood frameworks to detect loci under selection and infer the strength of the selective pressure.

Assuntos

Modelos Genéticos , Seleção Genética , Alelos , Ligação Genética , Variação Genética , Genética Populacional

9.

Parental relatedness through time revealed by runs of homozygosity in ancient DNA.

Ringbauer, Harald; Novembre, John; Steinrücken, Matthias.

Nat Commun ; 12(1): 5425, 2021 09 14.

Artigo em Inglês | MEDLINE | ID: mdl-34521843

RESUMO

Parental relatedness of present-day humans varies substantially across the globe, but little is known about the past. Here we analyze ancient DNA, leveraging that parental relatedness leaves genomic traces in the form of runs of homozygosity. We present an approach to identify such runs in low-coverage ancient DNA data aided by haplotype information from a modern phased reference panel. Simulation and experiments show that this method robustly detects runs of homozygosity longer than 4 centimorgan for ancient individuals with at least 0.3 × coverage. Analyzing genomic data from 1,785 ancient humans who lived in the last 45,000 years, we detect low rates of first cousin or closer unions across most ancient populations. Moreover, we find a marked decay in background parental relatedness co-occurring with or shortly after the advent of sedentary agriculture. We observe this signal, likely linked to increasing local population sizes, across several geographic transects worldwide.

Assuntos

DNA Antigo/análise , Genoma Humano , Haplótipos , Homozigoto , Padrões de Herança , Dinâmica Populacional/história , Agricultura/história , Feminino , História Antiga , Humanos , Masculino

10.

Increased rate of close-kin unions in the central Andes in the half millennium before European contact.

Ringbauer, Harald; Steinrücken, Matthias; Fehren-Schmitz, Lars; Reich, David.

Curr Biol ; 30(17): R980-R981, 2020 09 07.

Artigo em Inglês | MEDLINE | ID: mdl-32898493

RESUMO

Analyzing ancient DNA of the central Andes, Ringbauer and colleagues identify a markedly elevated rate of unions of closely related parents after ca. 1000 CE. This change of mating preferences sheds new light on a unique system of social organization based on ancestry ("ayllu") whereby within-group unions were preferred to facilitate sharing of resources.

Assuntos

DNA Antigo/análise , Endogamia/história , Endogamia/métodos , Reprodução , História Antiga , História Medieval , Humanos , América do Sul

11.

Inference of complex population histories using whole-genome sequences from multiple populations.

Steinrücken, Matthias; Kamm, Jack; Spence, Jeffrey P; Song, Yun S.

Proc Natl Acad Sci U S A ; 116(34): 17115-17120, 2019 08 20.

Artigo em Inglês | MEDLINE | ID: mdl-31387977

RESUMO

There has been much interest in analyzing genome-scale DNA sequence data to infer population histories, but inference methods developed hitherto are limited in model complexity and computational scalability. Here we present an efficient, flexible statistical method, diCal2, that can use whole-genome sequence data from multiple populations to infer complex demographic models involving population size changes, population splits, admixture, and migration. Applying our method to data from Australian, East Asian, European, and Papuan populations, we find that the population ancestral to Australians and Papuans started separating from East Asians and Europeans about 100,000 y ago, and that the separation of East Asians and Europeans started about 50,000 y ago, with pervasive gene flow between all pairs of populations.

Assuntos

Fluxo Gênico , Estudo de Associação Genômica Ampla , Migração Humana , Modelos Genéticos , Havaiano Nativo ou Outro Ilhéu do Pacífico/genética , Sequenciamento Completo do Genoma , Austrália , Genética Populacional , História Antiga , Humanos , Havaiano Nativo ou Outro Ilhéu do Pacífico/história

12.

Inference of population history using coalescent HMMs: review and outlook.

Spence, Jeffrey P; Steinrücken, Matthias; Terhorst, Jonathan; Song, Yun S.

Curr Opin Genet Dev ; 53: 70-76, 2018 12.

Artigo em Inglês | MEDLINE | ID: mdl-30056275

RESUMO

Studying how diverse human populations are related is of historical and anthropological interest, in addition to providing a realistic null model for testing for signatures of natural selection or disease associations. Furthermore, understanding the demographic histories of other species is playing an increasingly important role in conservation genetics. A number of statistical methods have been developed to infer population demographic histories using whole-genome sequence data, with recent advances focusing on allowing for more flexible modeling choices, scaling to larger data sets, and increasing statistical power. Here we review coalescent hidden Markov models, a powerful class of population genetic inference methods that can utilize linkage disequilibrium information effectively. We highlight recent advances, give advice for practitioners, point out potential pitfalls, and present possible future research directions.

Assuntos

Evolução Molecular , Genética Populacional , Seleção Genética/genética , Genoma Humano/genética , Humanos , Cadeias de Markov , Sequenciamento Completo do Genoma

13.

Model-based detection and analysis of introgressed Neanderthal ancestry in modern humans.

Steinrücken, Matthias; Spence, Jeffrey P; Kamm, John A; Wieczorek, Emilia; Song, Yun S.

Mol Ecol ; 27(19): 3873-3888, 2018 10.

Artigo em Inglês | MEDLINE | ID: mdl-29603507

RESUMO

Genetic evidence has revealed that the ancestors of modern human populations outside Africa and their hominin sister groups, notably Neanderthals, exchanged genetic material in the past. The distribution of these introgressed sequence tracts along modern-day human genomes provides insight into the selective forces acting on them and the role of introgression in the evolutionary history of hominins. Studying introgression patterns on the X-chromosome is of particular interest, as sex chromosomes are thought to play a special role in speciation. Recent studies have developed methods to localize introgressed ancestries, reporting long regions that are depleted of Neanderthal introgression and enriched in genes, suggesting negative selection against the Neanderthal variants. On the other hand, enriched Neanderthal ancestry in hair- and skin-related genes suggests that some introgressed variants facilitated adaptation to new environments. Here, we present a model-based introgression detection method called dical-admix. We demonstrate its efficiency and accuracy through extensive simulations and apply it to detect tracts of Neanderthal introgression in modern human individuals from the 1000 Genomes Project. Our findings are largely concordant with previous studies, consistent with weak selection against Neanderthal ancestry. We find evidence that selection against Neanderthal ancestry was due to higher genetic load in Neanderthals resulting from small effective population size, rather than widespread Dobzhansky-Müller incompatibilities (DMIs) that could contribute to reproductive isolation. Moreover, we confirm the previously reported low level of introgression on the X-chromosome, but find little evidence that DMIs contributed to this pattern.

Assuntos

Genética Populacional , Genoma Humano , Modelos Genéticos , Homem de Neandertal/genética , Animais , Cromossomos Humanos X/genética , Simulação por Computador , Carga Genética , Humanos , Hibridização Genética , Cadeias de Markov , Densidade Demográfica , Seleção Genética

14.

Terminal Pleistocene Alaskan genome reveals first founding population of Native Americans.

Moreno-Mayar, J Víctor; Potter, Ben A; Vinner, Lasse; Steinrücken, Matthias; Rasmussen, Simon; Terhorst, Jonathan; Kamm, John A; Albrechtsen, Anders; Malaspinas, Anna-Sapfo; Sikora, Martin; Reuther, Joshua D; Irish, Joel D; Malhi, Ripan S; Orlando, Ludovic; Song, Yun S; Nielsen, Rasmus; Meltzer, David J; Willerslev, Eske.

Nature ; 553(7687): 203-207, 2018 01 11.

Artigo em Inglês | MEDLINE | ID: mdl-29323294

RESUMO

Despite broad agreement that the Americas were initially populated via Beringia, the land bridge that connected far northeast Asia with northwestern North America during the Pleistocene epoch, when and how the peopling of the Americas occurred remains unresolved. Analyses of human remains from Late Pleistocene Alaska are important to resolving the timing and dispersal of these populations. The remains of two infants were recovered at Upward Sun River (USR), and have been dated to around 11.5 thousand years ago (ka). Here, by sequencing the USR1 genome to an average coverage of approximately 17 times, we show that USR1 is most closely related to Native Americans, but falls basal to all previously sequenced contemporary and ancient Native Americans. As such, USR1 represents a distinct Ancient Beringian population. Using demographic modelling, we infer that the Ancient Beringian population and ancestors of other Native Americans descended from a single founding population that initially split from East Asians around 36 ± 1.5 ka, with gene flow persisting until around 25 ± 1.1 ka. Gene flow from ancient north Eurasians into all Native Americans took place 25-20 ka, with Ancient Beringians branching off around 22-18.1 ka. Our findings support a long-term genetic structure in ancestral Native Americans, consistent with the Beringian 'standstill model'. We show that the basal northern and southern Native American branches, to which all other Native Americans belong, diverged around 17.5-14.6 ka, and that this probably occurred south of the North American ice sheets. We also show that after 11.5 ka, some of the northern Native American populations received gene flow from a Siberian population most closely related to Koryaks, but not Palaeo-Eskimos, Inuits or Kets, and that Native American gene flow into Inuits was through northern and not southern Native American groups. Our findings further suggest that the far-northern North American presence of northern Native Americans is from a back migration that replaced or absorbed the initial founding population of Ancient Beringians.

Assuntos

Efeito Fundador , Genoma Humano/genética , Indígenas Norte-Americanos/genética , Modelos Genéticos , Filogenia , Alaska , Ásia Oriental/etnologia , Fluxo Gênico , Genética Populacional , História Antiga , Migração Humana , Humanos , Lactente , Rios , Sibéria/etnologia , Fatores de Tempo

15.

Computing the joint distribution of the total tree length across loci in populations with variable size.

Miroshnikov, Alexey; Steinrücken, Matthias.

Theor Popul Biol ; 118: 1-19, 2017 12.

Artigo em Inglês | MEDLINE | ID: mdl-28943126

RESUMO

In recent years, a number of methods have been developed to infer complex demographic histories, especially historical population size changes, from genomic sequence data. Coalescent Hidden Markov Models have proven to be particularly useful for this type of inference. Due to the Markovian structure of these models, an essential building block is the joint distribution of local genealogical trees, or statistics of these genealogies, at two neighboring loci in populations of variable size. Here, we present a novel method to compute the marginal and the joint distribution of the total length of the genealogical trees at two loci separated by at most one recombination event for samples of arbitrary size. To our knowledge, no method to compute these distributions has been presented in the literature to date. We show that they can be obtained from the solution of certain hyperbolic systems of partial differential equations. We present a numerical algorithm, based on the method of characteristics, that can be used to efficiently and accurately solve these systems and compute the marginal and the joint distributions. We demonstrate its utility to study the properties of the joint distribution. Our flexible method can be straightforwardly extended to handle an arbitrary fixed number of recombination events, to include the distributions of other statistics of the genealogies as well, and can also be applied in structured populations.

Assuntos

Linhagem , Densidade Demográfica , Humanos , Cadeias de Markov , Recombinação Genética

16.

The Effects of Population Size Histories on Estimates of Selection Coefficients from Time-Series Genetic Data.

Jewett, Ethan M; Steinrücken, Matthias; Song, Yun S.

Mol Biol Evol ; 33(11): 3002-3027, 2016 11.

Artigo em Inglês | MEDLINE | ID: mdl-27550904

RESUMO

Many approaches have been developed for inferring selection coefficients from time series data while accounting for genetic drift. These approaches have been motivated by the intuition that properly accounting for the population size history can significantly improve estimates of selective strengths. However, the improvement in inference accuracy that can be attained by modeling drift has not been characterized. Here, by comparing maximum likelihood estimates of selection coefficients that account for the true population size history with estimates that ignore drift by assuming allele frequencies evolve deterministically in a population of infinite size, we address the following questions: how much can modeling the population size history improve estimates of selection coefficients? How much can mis-inferred population sizes hurt inferences of selection coefficients? We conduct our analysis under the discrete Wright-Fisher model by deriving the exact probability of an allele frequency trajectory in a population of time-varying size and we replicate our results under the diffusion model. For both models, we find that ignoring drift leads to estimates of selection coefficients that are nearly as accurate as estimates that account for the true population history, even when population sizes are small and drift is high. This result is of interest because inference methods that ignore drift are widely used in evolutionary studies and can be many orders of magnitude faster than methods that account for population sizes.

Assuntos

Genética Populacional/métodos , Modelos Genéticos , Seleção Genética , Evolução Biológica , Simulação por Computador , Frequência do Gene , Deriva Genética , Funções Verossimilhança , Densidade Demográfica

17.

SpectralTDF: transition densities of diffusion processes with time-varying selection parameters, mutation rates and effective population sizes.

Steinrücken, Matthias; Jewett, Ethan M; Song, Yun S.

Bioinformatics ; 32(5): 795-7, 2016 03 01.

Artigo em Inglês | MEDLINE | ID: mdl-26556388

RESUMO

MOTIVATION: In the Wright-Fisher diffusion, the transition density function describes the time evolution of the population-wide frequency of an allele. This function has several practical applications in population genetics and computing it for biologically realistic scenarios with selection and demography is an important problem. RESULTS: We develop an efficient method for finding a spectral representation of the transition density function for a general model where the effective population size, selection coefficients and mutation parameters vary over time in a piecewise constant manner. AVAILABILITY AND IMPLEMENTATION: The method, called SpectralTDF, is available at https://sourceforge.net/projects/spectraltdf/ CONTACT: yss@berkeley.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Assuntos

Taxa de Mutação , Genética Populacional , Modelos Genéticos , Mutação , Seleção Genética

18.

POPULATION GENETICS. Genomic evidence for the Pleistocene and recent population history of Native Americans.

Raghavan, Maanasa; Steinrücken, Matthias; Harris, Kelley; Schiffels, Stephan; Rasmussen, Simon; DeGiorgio, Michael; Albrechtsen, Anders; Valdiosera, Cristina; Ávila-Arcos, María C; Malaspinas, Anna-Sapfo; Eriksson, Anders; Moltke, Ida; Metspalu, Mait; Homburger, Julian R; Wall, Jeff; Cornejo, Omar E; Moreno-Mayar, J Víctor; Korneliussen, Thorfinn S; Pierre, Tracey; Rasmussen, Morten; Campos, Paula F; de Barros Damgaard, Peter; Allentoft, Morten E; Lindo, John; Metspalu, Ene; Rodríguez-Varela, Ricardo; Mansilla, Josefina; Henrickson, Celeste; Seguin-Orlando, Andaine; Malmström, Helena; Stafford, Thomas; Shringarpure, Suyash S; Moreno-Estrada, Andrés; Karmin, Monika; Tambets, Kristiina; Bergström, Anders; Xue, Yali; Warmuth, Vera; Friend, Andrew D; Singarayer, Joy; Valdes, Paul; Balloux, Francois; Leboreiro, Ilán; Vera, Jose Luis; Rangel-Villalobos, Hector; Pettener, Davide; Luiselli, Donata; Davis, Loren G; Heyer, Evelyne; Zollikofer, Christoph P E.

Science ; 349(6250): aab3884, 2015 Aug 21.

Artigo em Inglês | MEDLINE | ID: mdl-26198033

RESUMO

How and when the Americas were populated remains contentious. Using ancient and modern genome-wide data, we found that the ancestors of all present-day Native Americans, including Athabascans and Amerindians, entered the Americas as a single migration wave from Siberia no earlier than 23 thousand years ago (ka) and after no more than an 8000-year isolation period in Beringia. After their arrival to the Americas, ancestral Native Americans diversified into two basal genetic branches around 13 ka, one that is now dispersed across North and South America and the other restricted to North America. Subsequent gene flow resulted in some Native Americans sharing ancestry with present-day East Asians (including Siberians) and, more distantly, Australo-Melanesians. Putative "Paleoamerican" relict populations, including the historical Mexican Pericúes and South American Fuego-Patagonians, are not directly related to modern Australo-Melanesians as suggested by the Paleoamerican Model.

Assuntos

Migração Humana/história , Indígenas Norte-Americanos/história , América , Fluxo Gênico , Genômica , História Antiga , Humanos , Indígenas Norte-Americanos/genética , Modelos Genéticos , Sibéria

19.

Transition Densities and Sample Frequency Spectra of Diffusion Processes with Selection and Variable Population Size.

Zivkovic, Daniel; Steinrücken, Matthias; Song, Yun S; Stephan, Wolfgang.

Genetics ; 200(2): 601-17, 2015 Jun.

Artigo em Inglês | MEDLINE | ID: mdl-25873633

RESUMO

Advances in empirical population genetics have made apparent the need for models that simultaneously account for selection and demography. To address this need, we here study the Wright-Fisher diffusion under selection and variable effective population size. In the case of genic selection and piecewise-constant effective population sizes, we obtain the transition density by extending a recently developed method for computing an accurate spectral representation for a constant population size. Utilizing this extension, we show how to compute the sample frequency spectrum in the presence of genic selection and an arbitrary number of instantaneous changes in the effective population size. We also develop an alternate, efficient algorithm for computing the sample frequency spectrum using a moment-based approach. We apply these methods to answer the following questions: If neutrality is incorrectly assumed when there is selection, what effects does it have on demographic parameter estimation? Can the impact of negative selection be observed in populations that undergo strong exponential growth?

Assuntos

Genética Populacional , Modelos Genéticos , Seleção Genética , Algoritmos , Densidade Demográfica

20.

A NOVEL SPECTRAL METHOD FOR INFERRING GENERAL DIPLOID SELECTION FROM TIME SERIES GENETIC DATA.

Steinrücken, Matthias; Bhaskar, Anand; Song, Yun S.

Ann Appl Stat ; 8(4): 2203-2222, 2014 Dec.

Artigo em Inglês | MEDLINE | ID: mdl-25598858

RESUMO

The increased availability of time series genetic variation data from experimental evolution studies and ancient DNA samples has created new opportunities to identify genomic regions under selective pressure and to estimate their associated fitness parameters. However, it is a challenging problem to compute the likelihood of non-neutral models for the population allele frequency dynamics, given the observed temporal DNA data. Here, we develop a novel spectral algorithm to analytically and efficiently integrate over all possible frequency trajectories between consecutive time points. This advance circumvents the limitations of existing methods which require fine-tuning the discretization of the population allele frequency space when numerically approximating requisite integrals. Furthermore, our method is flexible enough to handle general diploid models of selection where the heterozygote and homozygote fitness parameters can take any values, while previous methods focused on only a few restricted models of selection. We demonstrate the utility of our method on simulated data and also apply it to analyze ancient DNA data from genetic loci associated with coat coloration in horses. In contrast to previous studies, our exploration of the full fitness parameter space reveals that a heterozygote-advantage form of balancing selection may have been acting on these loci.

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA