Búsqueda | Portal Regional de la BVS

1.

Human evolution: Neanderthal footprints in African genomes.

Ragsdale, Aaron P.

Curr Biol ; 33(22): R1197-R1200, 2023 11 20.

Artículo en Inglés | MEDLINE | ID: mdl-37989099

RESUMEN

Human and Neanderthal populations met and mixed on multiple occasions over evolutionary time, resulting in the exchange of genetic material. New genomic analyses of diverse African populations reveal a history of bidirectional gene flow and selection acting on introgressed alleles.

Asunto(s)

Evolución Molecular , Genoma Humano , Hombre de Neandertal , Animales , Humanos , Alelos , Flujo Génico , Genómica , Hombre de Neandertal/genética , Selección Genética , Pueblo Africano

2.

Mexican Biobank advances population and medical genomics of diverse ancestries.

Sohail, Mashaal; Palma-Martínez, María J; Chong, Amanda Y; Quinto-Cortés, Consuelo D; Barberena-Jonas, Carmina; Medina-Muñoz, Santiago G; Ragsdale, Aaron; Delgado-Sánchez, Guadalupe; Cruz-Hervert, Luis Pablo; Ferreyra-Reyes, Leticia; Ferreira-Guerrero, Elizabeth; Mongua-Rodríguez, Norma; Canizales-Quintero, Sergio; Jimenez-Kaufmann, Andrés; Moreno-Macías, Hortensia; Aguilar-Salinas, Carlos A; Auckland, Kathryn; Cortés, Adrián; Acuña-Alonzo, Víctor; Gignoux, Christopher R; Wojcik, Genevieve L; Ioannidis, Alexander G; Fernández-Valverde, Selene L; Hill, Adrian V S; Tusié-Luna, María Teresa; Mentzer, Alexander J; Novembre, John; García-García, Lourdes; Moreno-Estrada, Andrés.

Nature ; 622(7984): 775-783, 2023 Oct.

Artículo en Inglés | MEDLINE | ID: mdl-37821706

RESUMEN

Latin America continues to be severely underrepresented in genomics research, and fine-scale genetic histories and complex trait architectures remain hidden owing to insufficient data1. To fill this gap, the Mexican Biobank project genotyped 6,057 individuals from 898 rural and urban localities across all 32 states in Mexico at a resolution of 1.8 million genome-wide markers with linked complex trait and disease information creating a valuable nationwide genotype-phenotype database. Here, using ancestry deconvolution and inference of identity-by-descent segments, we inferred ancestral population sizes across Mesoamerican regions over time, unravelling Indigenous, colonial and postcolonial demographic dynamics2-6. We observed variation in runs of homozygosity among genomic regions with different ancestries reflecting distinct demographic histories and, in turn, different distributions of rare deleterious variants. We conducted genome-wide association studies (GWAS) for 22 complex traits and found that several traits are better predicted using the Mexican Biobank GWAS compared to the UK Biobank GWAS7,8. We identified genetic and environmental factors associating with trait variation, such as the length of the genome in runs of homozygosity as a predictor for body mass index, triglycerides, glucose and height. This study provides insights into the genetic histories of individuals in Mexico and dissects their complex trait architectures, both crucial for making precision and preventive medicine initiatives accessible worldwide.

Asunto(s)

Bancos de Muestras Biológicas , Genética Médica , Genoma Humano , Genómica , Hispánicos o Latinos , Humanos , Glucemia/genética , Glucemia/metabolismo , Estatura/genética , Índice de Masa Corporal , Interacción Gen-Ambiente , Marcadores Genéticos/genética , Estudio de Asociación del Genoma Completo , Hispánicos o Latinos/clasificación , Hispánicos o Latinos/genética , Homocigoto , México , Fenotipo , Triglicéridos/sangre , Triglicéridos/genética , Reino Unido , Genoma Humano/genética

3.

Demographic modeling of admixed Latin American populations from whole genomes.

Medina-Muñoz, Santiago G; Ortega-Del Vecchyo, Diego; Cruz-Hervert, Luis Pablo; Ferreyra-Reyes, Leticia; García-García, Lourdes; Moreno-Estrada, Andrés; Ragsdale, Aaron P.

Am J Hum Genet ; 110(10): 1804-1816, 2023 10 05.

Artículo en Inglés | MEDLINE | ID: mdl-37725976

RESUMEN

Demographic models of Latin American populations often fail to fully capture their complex evolutionary history, which has been shaped by both recent admixture and deeper-in-time demographic events. To address this gap, we used high-coverage whole-genome data from Indigenous American ancestries in present-day Mexico and existing genomes from across Latin America to infer multiple demographic models that capture the impact of different timescales on genetic diversity. Our approach, which combines analyses of allele frequencies and ancestry tract length distributions, represents a significant improvement over current models in predicting patterns of genetic variation in admixed Latin American populations. We jointly modeled the contribution of European, African, East Asian, and Indigenous American ancestries into present-day Latin American populations. We infer that the ancestors of Indigenous Americans and East Asians diverged â¼30 thousand years ago, and we characterize genetic contributions of recent migrations from East and Southeast Asia to Peru and Mexico. Our inferred demographic histories are consistent across different genomic regions and annotations, suggesting that our inferences are robust to the potential effects of linked selection. In conjunction with published distributions of fitness effects for new nonsynonymous mutations in humans, we show in large-scale simulations that our models recover important features of both neutral and deleterious variation. By providing a more realistic framework for understanding the evolutionary history of Latin American populations, our models can help address the historical under-representation of admixed groups in genomics research and can be a valuable resource for future studies of populations with complex admixture and demographic histories.

Asunto(s)

Genética de Población , Genoma Humano , Humanos , América Latina , Genoma Humano/genética , Demografía , Blanco

4.

The genomic footprint of whaling and isolation in fin whale populations.

Nigenda-Morales, Sergio F; Lin, Meixi; Nuñez-Valencia, Paulina G; Kyriazis, Christopher C; Beichman, Annabel C; Robinson, Jacqueline A; Ragsdale, Aaron P; Urbán R, Jorge; Archer, Frederick I; Viloria-Gómora, Lorena; Pérez-Álvarez, María José; Poulin, Elie; Lohmueller, Kirk E; Moreno-Estrada, Andrés; Wayne, Robert K.

Nat Commun ; 14(1): 5465, 2023 09 12.

Artículo en Inglés | MEDLINE | ID: mdl-37699896

RESUMEN

Twentieth century industrial whaling pushed several species to the brink of extinction, with fin whales being the most impacted. However, a small, resident population in the Gulf of California was not targeted by whaling. Here, we analyzed 50 whole-genomes from the Eastern North Pacific (ENP) and Gulf of California (GOC) fin whale populations to investigate their demographic history and the genomic effects of natural and human-induced bottlenecks. We show that the two populations diverged ~16,000 years ago, after which the ENP population expanded and then suffered a 99% reduction in effective size during the whaling period. In contrast, the GOC population remained small and isolated, receiving less than one migrant per generation. However, this low level of migration has been crucial for maintaining its viability. Our study exposes the severity of whaling, emphasizes the importance of migration, and demonstrates the use of genome-based analyses and simulations to inform conservation strategies.

Asunto(s)

Ballena de Aleta , Humanos , Animales , Genómica , Industrias

5.

Multiple Sources of Uncertainty Confound Inference of Historical Human Generation Times.

Ragsdale, Aaron P; Thornton, Kevin R.

Mol Biol Evol ; 40(8)2023 08 03.

Artículo en Inglés | MEDLINE | ID: mdl-37450583

RESUMEN

Wang et al. (2023) recently proposed an approach to infer the history of human generation intervals from changes in mutation profiles over time. As the relative proportions of different mutation types depend on the ages of parents, binning variants by the time they arose allows for the inference of changes in average paternal and maternal generation intervals. Applying this approach to published allele age estimates, Wang et al. (2023) inferred long-lasting sex differences in average generation times and surprisingly found that ancestral generation times of West African populations remained substantially higher than those of Eurasian populations extending tens of thousands of generations into the past. Here, we argue that the results and interpretations in Wang et al. (2023) are primarily driven by noise and biases in input data and a lack of validation using independent approaches for estimating allele ages. With the recent development of methods to reconstruct genome-wide gene genealogies, coalescence times, and allele ages, we caution that downstream analyses may be strongly influenced by uncharacterized biases in their output.

Asunto(s)

Incertidumbre , Humanos , Femenino , Masculino , Mutación , Alelos

6.

Publisher Correction: A weakly structured stem for human origins in Africa.

Ragsdale, Aaron P; Weaver, Timothy D; Atkinson, Elizabeth G; Hoal, Eileen G; Möller, Marlo; Henn, Brenna M; Gravel, Simon.

Nature ; 620(7972): E11, 2023 Aug.

Artículo en Inglés | MEDLINE | ID: mdl-37460744

7.

Expanding the stdpopsim species catalog, and lessons learned for realistic genome simulations.

Lauterbur, M Elise; Cavassim, Maria Izabel A; Gladstein, Ariella L; Gower, Graham; Pope, Nathaniel S; Tsambos, Georgia; Adrion, Jeffrey; Belsare, Saurabh; Biddanda, Arjun; Caudill, Victoria; Cury, Jean; Echevarria, Ignacio; Haller, Benjamin C; Hasan, Ahmed R; Huang, Xin; Iasi, Leonardo Nicola Martin; Noskova, Ekaterina; Obsteter, Jana; Pavinato, Vitor Antonio Correa; Pearson, Alice; Peede, David; Perez, Manolo F; Rodrigues, Murillo F; Smith, Chris C R; Spence, Jeffrey P; Teterina, Anastasia; Tittes, Silas; Unneberg, Per; Vazquez, Juan Manuel; Waples, Ryan K; Wohns, Anthony Wilder; Wong, Yan; Baumdicker, Franz; Cartwright, Reed A; Gorjanc, Gregor; Gutenkunst, Ryan N; Kelleher, Jerome; Kern, Andrew D; Ragsdale, Aaron P; Ralph, Peter L; Schrider, Daniel R; Gronau, Ilan.

Elife ; 122023 06 21.

Artículo en Inglés | MEDLINE | ID: mdl-37342968

RESUMEN

Simulation is a key tool in population genetics for both methods development and empirical research, but producing simulations that recapitulate the main features of genomic datasets remains a major obstacle. Today, more realistic simulations are possible thanks to large increases in the quantity and quality of available genetic data, and the sophistication of inference and simulation software. However, implementing these simulations still requires substantial time and specialized knowledge. These challenges are especially pronounced for simulating genomes for species that are not well-studied, since it is not always clear what information is required to produce simulations with a level of realism sufficient to confidently answer a given question. The community-developed framework stdpopsim seeks to lower this barrier by facilitating the simulation of complex population genetic models using up-to-date information. The initial version of stdpopsim focused on establishing this framework using six well-characterized model species (Adrion et al., 2020). Here, we report on major improvements made in the new release of stdpopsim (version 0.2), which includes a significant expansion of the species catalog and substantial additions to simulation capabilities. Features added to improve the realism of the simulated genomes include non-crossover recombination and provision of species-specific genomic annotations. Through community-driven efforts, we expanded the number of species in the catalog more than threefold and broadened coverage across the tree of life. During the process of expanding the catalog, we have identified common sticking points and developed the best practices for setting up genome-scale simulations. We describe the input data required for generating a realistic simulation, suggest good practices for obtaining the relevant information from the literature, and discuss common pitfalls and major considerations. These improvements to stdpopsim aim to further promote the use of realistic whole-genome population genetic simulations, especially in non-model organisms, making them available, transparent, and accessible to everyone.

Asunto(s)

Genoma , Programas Informáticos , Simulación por Computador , Genética de Población , Genómica

8.

A weakly structured stem for human origins in Africa.

Ragsdale, Aaron P; Weaver, Timothy D; Atkinson, Elizabeth G; Hoal, Eileen G; Möller, Marlo; Henn, Brenna M; Gravel, Simon.

Nature ; 617(7962): 755-763, 2023 05.

Artículo en Inglés | MEDLINE | ID: mdl-37198480

RESUMEN

Despite broad agreement that Homo sapiens originated in Africa, considerable uncertainty surrounds specific models of divergence and migration across the continent1. Progress is hampered by a shortage of fossil and genomic data, as well as variability in previous estimates of divergence times1. Here we seek to discriminate among such models by considering linkage disequilibrium and diversity-based statistics, optimized for rapid, complex demographic inference2. We infer detailed demographic models for populations across Africa, including eastern and western representatives, and newly sequenced whole genomes from 44 Nama (Khoe-San) individuals from southern Africa. We infer a reticulated African population history in which present-day population structure dates back to Marine Isotope Stage 5. The earliest population divergence among contemporary populations occurred 120,000 to 135,000 years ago and was preceded by links between two or more weakly differentiated ancestral Homo populations connected by gene flow over hundreds of thousands of years. Such weakly structured stem models explain patterns of polymorphism that had previously been attributed to contributions from archaic hominins in Africa2-7. In contrast to models with archaic introgression, we predict that fossil remains from coexisting ancestral populations should be genetically and morphologically similar, and that only an inferred 1-4% of genetic differentiation among contemporary human populations can be attributed to genetic drift between stem populations. We show that model misspecification explains the variation in previous estimates of divergence times, and argue that studying a range of models is key to making robust inferences about deep history.

Asunto(s)

Genética de Población , Migración Humana , Filogenia , Humanos , África/etnología , Fósiles , Flujo Génico , Flujo Genético , Introgresión Genética , Genoma Humano , Historia Antigua , Migración Humana/historia , Desequilibrio de Ligamiento/genética , Polimorfismo Genético , Factores de Tiempo

9.

Demes: a standard format for demographic models.

Gower, Graham; Ragsdale, Aaron P; Bisschop, Gertjan; Gutenkunst, Ryan N; Hartfield, Matthew; Noskova, Ekaterina; Schiffels, Stephan; Struck, Travis J; Kelleher, Jerome; Thornton, Kevin R.

Genetics ; 222(3)2022 11 01.

Artículo en Inglés | MEDLINE | ID: mdl-36173327

RESUMEN

Understanding the demographic history of populations is a key goal in population genetics, and with improving methods and data, ever more complex models are being proposed and tested. Demographic models of current interest typically consist of a set of discrete populations, their sizes and growth rates, and continuous and pulse migrations between those populations over a number of epochs, which can require dozens of parameters to fully describe. There is currently no standard format to define such models, significantly hampering progress in the field. In particular, the important task of translating the model descriptions in published work into input suitable for population genetic simulators is labor intensive and error prone. We propose the Demes data model and file format, built on widely used technologies, to alleviate these issues. Demes provide a well-defined and unambiguous model of populations and their properties that is straightforward to implement in software, and a text file format that is designed for simplicity and clarity. We provide thoroughly tested implementations of Demes parsers in multiple languages including Python and C, and showcase initial support in several simulators and inference methods. An introduction to the file format and a detailed specification are available at https://popsim-consortium.github.io/demes-spec-docs/.

Asunto(s)

Genética de Población , Programas Informáticos , Demografía

10.

Local fitness and epistatic effects lead to distinct patterns of linkage disequilibrium in protein-coding genes.

Ragsdale, Aaron P.

Genetics ; 221(4)2022 07 30.

Artículo en Inglés | MEDLINE | ID: mdl-35736370

RESUMEN

Selected mutations interfere and interact with evolutionary processes at nearby loci, distorting allele frequency trajectories and creating correlations between pairs of mutations. Recent studies have used patterns of linkage disequilibrium between selected variants to test for selective interference and epistatic interactions, with some disagreement over interpreting observations from data. Interpretation is hindered by a lack of analytic or even numerical expectations for patterns of variation between pairs of loci under the combined effects of selection, dominance, epistasis, and demography. Here, I develop a numerical approach to compute the expected two-locus sampling distribution under diploid selection with arbitrary epistasis and dominance, recombination, and variable population size. I use this to explore how epistasis and dominance affect expected signed linkage disequilibrium, including for nonsteady-state demography relevant to human populations. Using whole-genome sequencing data from humans, I explore genome-wide patterns of linkage disequilibrium within protein-coding genes. I show that positive linkage disequilibrium between missense mutations within genes is driven by strong positive allele-frequency correlations between mutations that fall within the same annotated conserved domain, pointing to compensatory mutations or antagonistic epistasis as the prevailing mode of interaction within conserved genic elements. Linkage disequilibrium between missense mutations is reduced outside of conserved domains, as expected under Hill-Robertson interference. This variation in both mutational fitness effects and selective interactions within protein-coding genes calls for more refined inferences of the joint distribution of fitness and interactive effects, and the methods presented here should prove useful in that pursuit.

Asunto(s)

Epistasis Genética , Modelos Genéticos , Evolución Biológica , Frecuencia de los Genes , Humanos , Desequilibrio de Ligamiento , Selección Genética

11.

Efficient ancestry and mutation simulation with msprime 1.0.

Baumdicker, Franz; Bisschop, Gertjan; Goldstein, Daniel; Gower, Graham; Ragsdale, Aaron P; Tsambos, Georgia; Zhu, Sha; Eldon, Bjarki; Ellerman, E Castedo; Galloway, Jared G; Gladstein, Ariella L; Gorjanc, Gregor; Guo, Bing; Jeffery, Ben; Kretzschumar, Warren W; Lohse, Konrad; Matschiner, Michael; Nelson, Dominic; Pope, Nathaniel S; Quinto-Cortés, Consuelo D; Rodrigues, Murillo F; Saunack, Kumar; Sellinger, Thibaut; Thornton, Kevin; van Kemenade, Hugo; Wohns, Anthony W; Wong, Yan; Gravel, Simon; Kern, Andrew D; Koskela, Jere; Ralph, Peter L; Kelleher, Jerome.

Genetics ; 220(3)2022 03 03.

Artículo en Inglés | MEDLINE | ID: mdl-34897427

RESUMEN

Stochastic simulation is a key tool in population genetics, since the models involved are often analytically intractable and simulation is usually the only way of obtaining ground-truth data to evaluate inferences. Because of this, a large number of specialized simulation programs have been developed, each filling a particular niche, but with largely overlapping functionality and a substantial duplication of effort. Here, we introduce msprime version 1.0, which efficiently implements ancestry and mutation simulations based on the succinct tree sequence data structure and the tskit library. We summarize msprime's many features, and show that its performance is excellent, often many times faster and more memory efficient than specialized alternatives. These high-performance features have been thoroughly tested and validated, and built using a collaborative, open source development model, which reduces duplication of effort and promotes software quality via community engagement.

Asunto(s)

Algoritmos , Modelos Genéticos , Simulación por Computador , Genética de Población , Mutación , Programas Informáticos

12.

Diversification, spread, and admixture of octoploid strawberry in the Western Hemisphere.

Bird, Kevin A; Hardigan, Michael A; Ragsdale, Aaron P; Knapp, Steven J; VanBuren, Robert; Edger, Patrick P.

Am J Bot ; 108(11): 2269-2281, 2021 11.

Artículo en Inglés | MEDLINE | ID: mdl-34636416

RESUMEN

PREMISE: Polyploid species often have complex evolutionary histories that have, until recently, been intractable due to limitations of genomic resources. While recent work has further uncovered the evolutionary history of the octoploid strawberry (Fragaria L.), there are still open questions. Much is unknown about the evolutionary relationship of the wild octoploid species, Fragaria virginiana and Fragaria chiloensis, and gene flow within and among species after the formation of the octoploid genome. METHODS: We leveraged a collection of wild octoploid ecotypes of strawberry representing the recognized subspecies and ranging from Alaska to southern Chile, and a high-density SNP array to investigate wild octoploid strawberry evolution. Evolutionary relationships were interrogated with phylogenetic analysis and genetic clustering algorithms. Additionally, admixture among and within species is assessed with model-based and tree-based approaches. RESULTS: Phylogenetic analysis revealed that the two octoploid strawberry species are monophyletic sister lineages. The genetic clustering results show substructure between North and South American F. chiloensis populations. Additionally, model-based and tree-based methods support gene flow within and among the two octoploid species, including newly identified admixture in the Hawaiian F. chiloensis subsp. sandwicensis population. CONCLUSIONS: F. virginiana and F. chiloensis are supported as monophyletic and sister lineages. All but one of the subspecies show extensive paraphyly. Furthermore, phylogenetic relationships among F. chiloensis populations supports a single population range expansion southward from North America. The inter- and intraspecific relationships of octoploid strawberry are complex and suggest substantial gene flow between sympatric populations among and within species.

Asunto(s)

Fragaria , Américas , Fragaria/genética , Genoma de Planta , Filogenia , Poliploidía

13.

Assumptions about frequency-dependent architectures of complex traits bias measures of functional enrichment.

Zabad, Shadi; Ragsdale, Aaron P; Sun, Rosie; Li, Yue; Gravel, Simon.

Genet Epidemiol ; 45(6): 621-632, 2021 09.

Artículo en Inglés | MEDLINE | ID: mdl-34157784

RESUMEN

Linkage-Disequilibrium Score Regression (LDSC) is a popular framework for analyzing Genome-wide Association Studies (GWAS) summary statistics that allows for estimating single nucleotide polymorphism heritability, confounding, and functional enrichment of genetic variants with different annotations. Recent work has highlighted the influence of implicit and explicit assumptions of the model on the biological interpretation of the results. In this study, we explored a formulation of LDSC that replaces the r2 measure of LD with a recently proposed unbiased estimator of the D2 statistic. In addition to modest statistical difference across estimators, this derivation highlighted implicit and unrealistic assumptions about the relationship between allele frequency, effect size, and annotation status. We carry out a systematic comparison of alternative LDSC formulations by applying them to summary statistics from 47 GWAS traits. Our results show that commonly used models likely underestimate functional enrichment. These results highlight the importance of calibrating the LDSC model to achieve a more robust understanding of polygenic traits.

Asunto(s)

Estudio de Asociación del Genoma Completo , Herencia Multifactorial , Humanos , Desequilibrio de Ligamiento , Modelos Genéticos , Polimorfismo de Nucleótido Simple

14.

Inferring Genome-Wide Correlations of Mutation Fitness Effects between Populations.

Huang, Xin; Fortier, Alyssa Lyn; Coffman, Alec J; Struck, Travis J; Irby, Megan N; James, Jennifer E; León-Burguete, José E; Ragsdale, Aaron P; Gutenkunst, Ryan N.

Mol Biol Evol ; 38(10): 4588-4602, 2021 09 27.

Artículo en Inglés | MEDLINE | ID: mdl-34043790

RESUMEN

The effect of a mutation on fitness may differ between populations depending on environmental and genetic context, but little is known about the factors that underlie such differences. To quantify genome-wide correlations in mutation fitness effects, we developed a novel concept called a joint distribution of fitness effects (DFE) between populations. We then proposed a new statistic w to measure the DFE correlation between populations. Using simulation, we showed that inferring the DFE correlation from the joint allele frequency spectrum is statistically precise and robust. Using population genomic data, we inferred DFE correlations of populations in humans, Drosophila melanogaster, and wild tomatoes. In these species, we found that the overall correlation of the joint DFE was inversely related to genetic differentiation. In humans and D. melanogaster, deleterious mutations had a lower DFE correlation than tolerated mutations, indicating a complex joint DFE. Altogether, the DFE correlation can be reliably inferred, and it offers extensive insight into the genetics of population divergence.

Asunto(s)

Drosophila melanogaster , Aptitud Genética , Animales , Drosophila melanogaster/genética , Frecuencia de los Genes , Genoma , Modelos Genéticos , Mutación

15.

Nonparametric coalescent inference of mutation spectrum history and demography.

DeWitt, William S; Harris, Kameron Decker; Ragsdale, Aaron P; Harris, Kelley.

Proc Natl Acad Sci U S A ; 118(21)2021 05 25.

Artículo en Inglés | MEDLINE | ID: mdl-34016747

RESUMEN

As populations boom and bust, the accumulation of genetic diversity is modulated, encoding histories of living populations in present-day variation. Many methods exist to decode these histories, and all must make strong model assumptions. It is typical to assume that mutations accumulate uniformly across the genome at a constant rate that does not vary between closely related populations. However, recent work shows that mutational processes in human and great ape populations vary across genomic regions and evolve over time. This perturbs the mutation spectrum (relative mutation rates in different local nucleotide contexts). Here, we develop theoretical tools in the framework of Kingman's coalescent to accommodate mutation spectrum dynamics. We present mutation spectrum history inference (mushi), a method to perform nonparametric inference of demographic and mutation spectrum histories from allele frequency data. We use mushi to reconstruct trajectories of effective population size and mutation spectrum divergence between human populations, identify mutation signatures and their dynamics in different human populations, and calibrate the timing of a previously reported mutational pulse in the ancestors of Europeans. We show that mutation spectrum histories can be placed in a well-studied theoretical setting and rigorously inferred from genomic variation data, like other features of evolutionary history.

Asunto(s)

Frecuencia de los Genes/genética , Genética de Población/estadística & datos numéricos , Modelos Genéticos , Mutación/genética , Animales , Variación Genética/genética , Genómica , Hominidae/genética , Humanos , Tasa de Mutación , Densidad de Población

16.

Brassica rapa Domestication: Untangling Wild and Feral Forms and Convergence of Crop Morphotypes.

McAlvay, Alex C; Ragsdale, Aaron P; Mabry, Makenzie E; Qi, Xinshuai; Bird, Kevin A; Velasco, Pablo; An, Hong; Pires, J Chris; Emshwiller, Eve.

Mol Biol Evol ; 38(8): 3358-3372, 2021 07 29.

Artículo en Inglés | MEDLINE | ID: mdl-33930151

RESUMEN

The study of domestication contributes to our knowledge of evolution and crop genetic resources. Human selection has shaped wild Brassica rapa into diverse turnip, leafy, and oilseed crops. Despite its worldwide economic importance and potential as a model for understanding diversification under domestication, insights into the number of domestication events and initial crop(s) domesticated in B. rapa have been limited due to a lack of clarity about the wild or feral status of conspecific noncrop relatives. To address this gap and reconstruct the domestication history of B. rapa, we analyzed 68,468 genotyping-by-sequencing-derived single nucleotide polymorphisms for 416 samples in the largest diversity panel of domesticated and weedy B. rapa to date. To further understand the center of origin, we modeled the potential range of wild B. rapa during the mid-Holocene. Our analyses of genetic diversity across B. rapa morphotypes suggest that noncrop samples from the Caucasus, Siberia, and Italy may be truly wild, whereas those occurring in the Americas and much of Europe are feral. Clustering, tree-based analyses, and parameterized demographic inference further indicate that turnips were likely the first crop type domesticated, from which leafy types in East Asia and Europe were selected from distinct lineages. These findings clarify the domestication history and nature of wild crop genetic resources for B. rapa, which provides the first step toward investigating cases of possible parallel selection, the domestication and feralization syndrome, and novel germplasm for Brassica crop improvement.

Asunto(s)

Brassica rapa/genética , Productos Agrícolas/genética , Domesticación , Modelos Genéticos , Malezas/genética , Introgresión Genética , Variación Genética , Técnicas de Genotipaje , Filogeografía , Selección Genética

17.

Lessons Learned from Bugs in Models of Human History.

Ragsdale, Aaron P; Nelson, Dominic; Gravel, Simon; Kelleher, Jerome.

Am J Hum Genet ; 107(4): 583-588, 2020 10 01.

Artículo en Inglés | MEDLINE | ID: mdl-33007197

RESUMEN

Simulation plays a central role in population genomics studies. Recent years have seen rapid improvements in software efficiency that make it possible to simulate large genomic regions for many individuals sampled from large numbers of populations. As the complexity of the demographic models we study grows, however, there is an ever-increasing opportunity to introduce bugs in their implementation. Here, we describe two errors made in defining population genetic models using the msprime coalescent simulator that have found their way into the published record. We discuss how these errors have affected downstream analyses and give recommendations for software developers and users to reduce the risk of such errors.

Asunto(s)

Genética de Población/tendencias , Genoma Humano , Modelos Genéticos , Programas Informáticos , Algoritmos , Simulación por Computador , Demografía , Variación Genética , Genética de Población/historia , Historia Antigua , Migración Humana/historia , Migración Humana/estadística & datos numéricos , Humanos

18.

A community-maintained standard library of population genetic models.

Adrion, Jeffrey R; Cole, Christopher B; Dukler, Noah; Galloway, Jared G; Gladstein, Ariella L; Gower, Graham; Kyriazis, Christopher C; Ragsdale, Aaron P; Tsambos, Georgia; Baumdicker, Franz; Carlson, Jedidiah; Cartwright, Reed A; Durvasula, Arun; Gronau, Ilan; Kim, Bernard Y; McKenzie, Patrick; Messer, Philipp W; Noskova, Ekaterina; Ortega-Del Vecchyo, Diego; Racimo, Fernando; Struck, Travis J; Gravel, Simon; Gutenkunst, Ryan N; Lohmueller, Kirk E; Ralph, Peter L; Schrider, Daniel R; Siepel, Adam; Kelleher, Jerome; Kern, Andrew D.

Elife ; 92020 06 23.

Artículo en Inglés | MEDLINE | ID: mdl-32573438

RESUMEN

The explosion in population genomic data demands ever more complex modes of analysis, and increasingly, these analyses depend on sophisticated simulations. Recent advances in population genetic simulation have made it possible to simulate large and complex models, but specifying such models for a particular simulation engine remains a difficult and error-prone task. Computational genetics researchers currently re-implement simulation models independently, leading to inconsistency and duplication of effort. This situation presents a major barrier to empirical researchers seeking to use simulations for power analyses of upcoming studies or sanity checks on existing genomic data. Population genetics, as a field, also lacks standard benchmarks by which new tools for inference might be measured. Here, we describe a new resource, stdpopsim, that attempts to rectify this situation. Stdpopsim is a community-driven open source project, which provides easy access to a growing catalog of published simulation models from a range of organisms and supports multiple simulation engine backends. This resource is available as a well-documented python library with a simple command-line interface. We share some examples demonstrating how stdpopsim can be used to systematically compare demographic inference methods, and we encourage a broader community of developers to contribute to this growing resource.

Asunto(s)

Genética de Población , Biblioteca Genómica , Modelos Genéticos , Animales , Arabidopsis/genética , Perros/genética , Drosophila melanogaster/genética , Escherichia coli/genética , Genética de Población/métodos , Genética de Población/organización & administración , Genoma/genética , Genoma Humano/genética , Humanos , Pongo abelii/genética

19.

Accounting for long-range correlations in genome-wide simulations of large cohorts.

Nelson, Dominic; Kelleher, Jerome; Ragsdale, Aaron P; Moreau, Claudia; McVean, Gil; Gravel, Simon.

PLoS Genet ; 16(5): e1008619, 2020 05.

Artículo en Inglés | MEDLINE | ID: mdl-32369493

RESUMEN

Coalescent simulations are widely used to examine the effects of evolution and demographic history on the genetic makeup of populations. Thanks to recent progress in algorithms and data structures, simulators such as the widely-used msprime now provide genome-wide simulations for millions of individuals. However, this software relies on classic coalescent theory and its assumptions that sample sizes are small and that the region being simulated is short. Here we show that coalescent simulations of long regions of the genome exhibit large biases in identity-by-descent (IBD), long-range linkage disequilibrium (LD), and ancestry patterns, particularly when the sample size is large. We present a Wright-Fisher extension to msprime, and show that it produces more realistic distributions of IBD, LD, and ancestry proportions, while also addressing more subtle biases of the coalescent. Further, these extensions are more computationally efficient than state-of-the-art coalescent simulations when simulating long regions, including whole-genome data. For shorter regions, efficiency can be maintained via a hybrid model which simulates the recent past under the Wright-Fisher model and uses coalescent simulations in the distant past.

Asunto(s)

Algoritmos , Secuencia de Bases/fisiología , Genética de Población/métodos , Estudio de Asociación del Genoma Completo/métodos , Modelos Genéticos , Estudios de Cohortes , Simulación por Computador , Evolución Molecular , Genoma/genética , Estudio de Asociación del Genoma Completo/estadística & datos numéricos , Humanos , Desequilibrio de Ligamiento , Recombinación Genética/fisiología , Tamaño de la Muestra

20.

Unbiased Estimation of Linkage Disequilibrium from Unphased Data.

Ragsdale, Aaron P; Gravel, Simon.

Mol Biol Evol ; 37(3): 923-932, 2020 03 01.

Artículo en Inglés | MEDLINE | ID: mdl-31697386

RESUMEN

Linkage disequilibrium (LD) is used to infer evolutionary history, to identify genomic regions under selection, and to dissect the relationship between genotype and phenotype. In each case, we require accurate estimates of LD statistics from sequencing data. Unphased data present a challenge because multilocus haplotypes cannot be inferred exactly. Widely used estimators for the common statistics r2 and D2 exhibit large and variable upward biases that complicate interpretation and comparison across cohorts. Here, we show how to find unbiased estimators for a wide range of two-locus statistics, including D2, for both single and multiple randomly mating populations. These unbiased statistics are particularly well suited to estimate effective population sizes from unlinked loci in small populations. We develop a simple inference pipeline and use it to refine estimates of recent effective population sizes of the threatened Channel Island Fox populations.

Asunto(s)

Biología Computacional/métodos , Zorros/genética , Animales , Frecuencia de los Genes , Genética de Población , Genotipo , Haplotipos , Desequilibrio de Ligamiento , Modelos Genéticos , Fenotipo , Polimorfismo de Nucleótido Simple , Densidad de Población , Selección Genética

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA