Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 153
Filtrar
1.
bioRxiv ; 2024 Jul 23.
Artículo en Inglés | MEDLINE | ID: mdl-39091879

RESUMEN

Circadian rhythms not only coordinate the timing of wake and sleep but also regulate homeostasis within the body, including glucose metabolism. However, the genetic variants that contribute to temporal control of glucose levels have not been previously examined. Using data from 420,000 individuals from the UK Biobank and replicating our findings in 100,000 individuals from the Estonian Biobank, we show that diurnal serum glucose is under genetic control. We discover a robust temporal association of glucose levels at the Melatonin receptor 1B (MTNR1B) (rs10830963, P = 1e-22) and a canonical circadian pacemaker gene Cryptochrome 2 (CRY2) loci (rs12419690, P = 1e-16). Furthermore, we show that sleep modulates serum glucose levels and the genetic variants have a separate mechanism of diurnal control. Finally, we show that these variants independently modulate risk of type 2 diabetes. Our findings, together with earlier genetic and epidemiological evidence, show a clear connection between sleep and metabolism and highlight variation at MTNR1B and CRY2 as temporal regulators for glucose levels.

2.
medRxiv ; 2024 Jul 31.
Artículo en Inglés | MEDLINE | ID: mdl-39132491

RESUMEN

The human leukocyte antigen (HLA) region plays an important role in human health through involvement in immune cell recognition and maturation. While genetic variation in the HLA region is associated with many diseases, the pleiotropic patterns of these associations have not been systematically investigated. Here, we developed a haplotype approach to investigate disease associations phenome-wide for 412,181 Finnish individuals and 2,459 traits. Across the 1,035 diseases with a GWAS association, we found a 17-fold average per-SNP enrichment of hits in the HLA region. Altogether, we identified 7,649 HLA associations across 647 traits, including 1,750 associations uncovered by haplotype analysis. We find some haplotypes show trade-offs between diseases, while others consistently increase risk across traits, indicating a complex pleiotropic landscape involving a range of diseases. This study highlights the extensive impact of HLA variation on disease risk, and underscores the importance of classical and non-classical genes, as well as non-coding variation.

3.
Cell Genom ; : 100629, 2024 Aug 02.
Artículo en Inglés | MEDLINE | ID: mdl-39111318

RESUMEN

With hundreds of copies of rDNA, it is unknown whether they possess sequence variations that form different types of ribosomes. Here, we developed an algorithm for long-read variant calling, termed RGA, which revealed that variations in human rDNA loci are predominantly insertion-deletion (indel) variants. We developed full-length rRNA sequencing (RIBO-RT) and in situ sequencing (SWITCH-seq), which showed that translating ribosomes possess variation in rRNA. Over 1,000 variants are lowly expressed. However, tens of variants are abundant and form distinct rRNA subtypes with different structures near indels as revealed by long-read rRNA structure probing coupled to dimethyl sulfate sequencing. rRNA subtypes show differential expression in endoderm/ectoderm-derived tissues, and in cancer, low-abundance rRNA variants can become highly expressed. Together, this study identifies the diversity of ribosomes at the level of rRNA variants, their chromosomal location, and unique structure as well as the association of ribosome variation with tissue-specific biology and cancer.

4.
medRxiv ; 2024 Aug 02.
Artículo en Inglés | MEDLINE | ID: mdl-39132496

RESUMEN

Background: Genetic factors play an important role in prostate cancer (PCa) development with polygenic risk scores (PRS) predicting disease risk across genetic ancestries. However, there are few convincing modifiable factors for PCa and little is known about their potential interaction with genetic risk. We analyzed incident PCa cases (n=6,155) and controls (n=98,257) of European and African ancestry from the UK Biobank (UKB) cohort to evaluate the role of neighborhood socioeconomic status (nSES)-and how it may interact with PRS-on PCa risk. Methods: We evaluated a multi-ancestry PCa PRS containing 269 genetic variants to understand the association of germline genetics with PCa in UKB. Using the English Indices of Deprivation, a set of validated metrics that quantify lack of resources within geographical areas, we performed logistic regression to investigate the main effects and interactions between nSES deprivation, PCa PRS, and PCa. Results: The PCa PRS was strongly associated with PCa (OR=2.04; 95%CI=2.00-2.09; P<0.001). Additionally, nSES deprivation indices were inversely associated with PCa: employment (OR=0.91; 95%CI=0.86-0.96; P<0.001), education (OR=0.94; 95%CI=0.83-0.98; P<0.001), health (OR=0.91; 95%CI=0.86-0.96; P<0.001), and income (OR=0.91; 95%CI=0.86-0.96; P<0.001). The PRS effects showed little heterogeneity across nSES deprivation indices, except for the Townsend Index (P=0.03). Conclusions: We reaffirmed genetics as a risk factor for PCa and identified nSES deprivation domains that influence PCa detection and are potentially correlated with environmental exposures that are a risk factor for PCa. These findings also suggest that nSES and genetic risk factors for PCa act independently.

5.
bioRxiv ; 2024 Jun 18.
Artículo en Inglés | MEDLINE | ID: mdl-38948774

RESUMEN

CRISPR screens are powerful tools to identify key genes that underlie biological processes. One important type of screen uses fluorescence activated cell sorting (FACS) to sort perturbed cells into bins based on the expression level of marker genes, followed by guide RNA (gRNA) sequencing. Analysis of these data presents several statistical challenges due to multiple factors including the discrete nature of the bins and typically small numbers of replicate experiments. To address these challenges, we developed a robust and powerful Bayesian random effects model and software package called Waterbear. Furthermore, we used Waterbear to explore how various experimental design parameters affect statistical power to establish principled guidelines for future screens. Finally, we experimentally validated our experimental design model findings that, when using Waterbear for analysis, high power is maintained even at low cell coverage and a high multiplicity of infection. We anticipate that Waterbear will be of broad utility for analyzing FACS-based CRISPR screens.

6.
bioRxiv ; 2024 Jun 17.
Artículo en Inglés | MEDLINE | ID: mdl-38948697

RESUMEN

Natural selection on complex traits is difficult to study in part due to the ascertainment inherent to genome-wide association studies (GWAS). The power to detect a trait-associated variant in GWAS is a function of frequency and effect size - but for traits under selection, the effect size of a variant determines the strength of selection against it, constraining its frequency. To account for GWAS ascertainment, we propose studying the joint distribution of allele frequencies across populations, conditional on the frequencies in the GWAS cohort. Before considering these conditional frequency spectra, we first characterized the impact of selection and non-equilibrium demography on allele frequency dynamics forwards and backwards in time. We then used these results to understand conditional frequency spectra under realistic human demography. Finally, we investigated empirical conditional frequency spectra for GWAS variants associated with 106 complex traits, finding compelling evidence for either stabilizing or purifying selection. Our results provide insight into polygenic score portability and other properties of variants ascertained with GWAS, highlighting the utility of conditional frequency spectra.

7.
bioRxiv ; 2024 Jul 05.
Artículo en Inglés | MEDLINE | ID: mdl-39005431

RESUMEN

Gene regulatory networks (GRNs) govern many core developmental and biological processes underlying human complex traits. Even with broad-scale efforts to characterize the effects of molecular perturbations and interpret gene coexpression, it remains challenging to infer the architecture of gene regulation in a precise and efficient manner. Key properties of GRNs, like hierarchical structure, modular organization, and sparsity, provide both challenges and opportunities for this objective. Here, we seek to better understand properties of GRNs using a new approach to simulate their structure and model their function. We produce realistic network structures with a novel generating algorithm based on insights from small-world network theory, and we model gene expression regulation using stochastic differential equations formulated to accommodate modeling molecular perturbations. With these tools, we systematically describe the effects of gene knockouts within and across GRNs, finding a subset of networks that recapitulate features of a recent genome-scale perturbation study. With deeper analysis of these exemplar networks, we consider future avenues to map the architecture of gene expression regulation using data from cells in perturbed and unperturbed states, finding that while perturbation data are critical to discover specific regulatory interactions, data from unperturbed cells may be sufficient to reveal regulatory programs.

8.
Nat Genet ; 56(8): 1632-1643, 2024 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-38977852

RESUMEN

Measures of selective constraint on genes have been used for many applications, including clinical interpretation of rare coding variants, disease gene discovery and studies of genome evolution. However, widely used metrics are severely underpowered at detecting constraints for the shortest ~25% of genes, potentially causing important pathogenic mutations to be overlooked. Here we developed a framework combining a population genetics model with machine learning on gene features to enable accurate inference of an interpretable constraint metric, shet. Our estimates outperform existing metrics for prioritizing genes important for cell essentiality, human disease and other phenotypes, especially for short genes. Our estimates of selective constraint should have wide utility for characterizing genes relevant to human disease. Finally, our inference framework, GeneBayes, provides a flexible platform that can improve the estimation of many gene-level properties, such as rare variant burden or gene expression differences.


Asunto(s)
Teorema de Bayes , Evolución Molecular , Genética de Población , Modelos Genéticos , Humanos , Genética de Población/métodos , Aprendizaje Automático , Selección Genética , Mutación , Fenotipo
9.
bioRxiv ; 2024 May 29.
Artículo en Inglés | MEDLINE | ID: mdl-38853998

RESUMEN

Deep learning approaches have made significant advances in predicting cell type-specific chromatin patterns from the identity and arrangement of transcription factor (TF) binding motifs. However, most models have been applied in unperturbed contexts, precluding a predictive understanding of how chromatin state responds to TF perturbation. Here, we used transfer learning to train and interpret deep learning models that use DNA sequence to predict, with accuracy approaching experimental reproducibility, how the concentration of two dosage-sensitive TFs (TWIST1, SOX9) affects regulatory element (RE) chromatin accessibility in facial progenitor cells. High-affinity motifs that allow for heterotypic TF co-binding and are concentrated at the center of REs buffer against quantitative changes in TF dosage and strongly predict unperturbed accessibility. In contrast, motifs with low-affinity or homotypic binding distributed throughout REs lead to sensitive responses with minimal contributions to unperturbed accessibility. Both buffering and sensitizing features show signatures of purifying selection. We validated these predictive sequence features using reporter assays and showed that a biophysical model of TF-nucleosome competition can explain the sensitizing effect of low-affinity motifs. Our approach of combining transfer learning and quantitative measurements of the chromatin response to TF dosage therefore represents a powerful method to reveal additional layers of the cis-regulatory code.

10.
bioRxiv ; 2024 Apr 10.
Artículo en Inglés | MEDLINE | ID: mdl-37292653

RESUMEN

Measures of selective constraint on genes have been used for many applications including clinical interpretation of rare coding variants, disease gene discovery, and studies of genome evolution. However, widely-used metrics are severely underpowered at detecting constraint for the shortest ~25% of genes, potentially causing important pathogenic mutations to be over-looked. We developed a framework combining a population genetics model with machine learning on gene features to enable accurate inference of an interpretable constraint metric, s het . Our estimates outperform existing metrics for prioritizing genes important for cell essentiality, human disease, and other phenotypes, especially for short genes. Our new estimates of selective constraint should have wide utility for characterizing genes relevant to human disease. Finally, our inference framework, GeneBayes, provides a flexible platform that can improve estimation of many gene-level properties, such as rare variant burden or gene expression differences.

11.
Nature ; 625(7996): 805-812, 2024 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-38093011

RESUMEN

CRISPR-enabled screening is a powerful tool for the discovery of genes that control T cell function and has nominated candidate targets for immunotherapies1-6. However, new approaches are required to probe specific nucleotide sequences within key genes. Systematic mutagenesis in primary human T cells could reveal alleles that tune specific phenotypes. DNA base editors are powerful tools for introducing targeted mutations with high efficiency7,8. Here we develop a large-scale base-editing mutagenesis platform with the goal of pinpointing nucleotides that encode amino acid residues that tune primary human T cell activation responses. We generated a library of around 117,000 single guide RNA molecules targeting base editors to protein-coding sites across 385 genes implicated in T cell function and systematically identified protein domains and specific amino acid residues that regulate T cell activation and cytokine production. We found a broad spectrum of alleles with variants encoding critical residues in proteins including PIK3CD, VAV1, LCP2, PLCG1 and DGKZ, including both gain-of-function and loss-of-function mutations. We validated the functional effects of many alleles and further demonstrated that base-editing hits could positively and negatively tune T cell cytotoxic function. Finally, higher-resolution screening using a base editor with relaxed protospacer-adjacent motif requirements9 (NG versus NGG) revealed specific structural domains and protein-protein interaction sites that can be targeted to tune T cell functions. Base-editing screens in primary immune cells thus provide biochemical insights with the potential to accelerate immunotherapy design.


Asunto(s)
Alelos , Edición Génica , Mutagénesis , Linfocitos T , Humanos , Aminoácidos/genética , Sistemas CRISPR-Cas/genética , Mutagénesis/genética , ARN Guía de Sistemas CRISPR-Cas/genética , Linfocitos T/inmunología , Linfocitos T/metabolismo , Activación de Linfocitos , Citocinas/biosíntesis , Citocinas/metabolismo , Mutación con Ganancia de Función , Mutación con Pérdida de Función
12.
Nat Genet ; 55(11): 1866-1875, 2023 Nov.
Artículo en Inglés | MEDLINE | ID: mdl-37857933

RESUMEN

Most signals in genome-wide association studies (GWAS) of complex traits implicate noncoding genetic variants with putative gene regulatory effects. However, currently identified regulatory variants, notably expression quantitative trait loci (eQTLs), explain only a small fraction of GWAS signals. Here, we show that GWAS and cis-eQTL hits are systematically different: eQTLs cluster strongly near transcription start sites, whereas GWAS hits do not. Genes near GWAS hits are enriched in key functional annotations, are under strong selective constraint and have complex regulatory landscapes across different tissue/cell types, whereas genes near eQTLs are depleted of most functional annotations, show relaxed constraint, and have simpler regulatory landscapes. We describe a model to understand these observations, including how natural selection on complex traits hinders discovery of functionally relevant eQTLs. Our results imply that GWAS and eQTL studies are systematically biased toward different types of variant, and support the use of complementary functional approaches alongside the next generation of eQTL studies.


Asunto(s)
Estudio de Asociación del Genoma Completo , Herencia Multifactorial , Regulación de la Expresión Génica/genética , Sitios de Carácter Cuantitativo/genética , Expresión Génica , Polimorfismo de Nucleótido Simple/genética
13.
Cell Genom ; 3(10): 100401, 2023 Oct 11.
Artículo en Inglés | MEDLINE | ID: mdl-37868038

RESUMEN

Each human genome has tens of thousands of rare genetic variants; however, identifying impactful rare variants remains a major challenge. We demonstrate how use of personal multi-omics can enable identification of impactful rare variants by using the Multi-Ethnic Study of Atherosclerosis, which included several hundred individuals, with whole-genome sequencing, transcriptomes, methylomes, and proteomes collected across two time points, 10 years apart. We evaluated each multi-omics phenotype's ability to separately and jointly inform functional rare variation. By combining expression and protein data, we observed rare stop variants 62 times and rare frameshift variants 216 times as frequently as controls, compared to 13-27 times as frequently for expression or protein effects alone. We extended a Bayesian hierarchical model, "Watershed," to prioritize specific rare variants underlying multi-omics signals across the regulatory cascade. With this approach, we identified rare variants that exhibited large effect sizes on multiple complex traits including height, schizophrenia, and Alzheimer's disease.

14.
bioRxiv ; 2023 Oct 24.
Artículo en Inglés | MEDLINE | ID: mdl-37745614

RESUMEN

The effects of genetic variation on complex traits act mainly through changes in gene regulation. Although many genetic variants have been linked to target genes in cis, the trans-regulatory cascade mediating their effects remains largely uncharacterized. Mapping trans-regulators based on natural genetic variation, including eQTL mapping, has been challenging due to small effects. Experimental perturbation approaches offer a complementary and powerful approach to mapping trans-regulators. We used CRISPR knockouts of 84 genes in primary CD4+ T cells to perturb an immune cell gene network, targeting both inborn error of immunity (IEI) disease transcription factors (TFs) and background TFs matched in constraint and expression level, but without a known immune disease association. We developed a novel Bayesian structure learning method called Linear Latent Causal Bayes (LLCB) to estimate the gene regulatory network from perturbation data and observed 211 directed edges among the genes which could not be detected in existing CD4+ trans-eQTL data. We used LLCB to characterize the differences between the IEI and background TFs, finding that the gene groups were highly interconnected, but that IEI TFs were much more likely to regulate immune cell specific pathways and immune GWAS genes. We further characterized nine coherent gene programs based on downstream effects of the TFs and linked these modules to regulation of GWAS genes, finding that canonical JAK-STAT family members are regulated by KMT2A, a global epigenetic regulator. These analyses reveal the trans-regulatory cascade from upstream epigenetic regulator to intermediate TFs to downstream effector cytokines and elucidate the logic linking immune GWAS genes to key signaling pathways.

15.
Genetics ; 225(3)2023 11 01.
Artículo en Inglés | MEDLINE | ID: mdl-37724741

RESUMEN

The discrete-time Wright-Fisher (DTWF) model and its diffusion limit are central to population genetics. These models can describe the forward-in-time evolution of allele frequencies in a population resulting from genetic drift, mutation, and selection. Computing likelihoods under the diffusion process is feasible, but the diffusion approximation breaks down for large samples or in the presence of strong selection. Existing methods for computing likelihoods under the DTWF model do not scale to current exome sequencing sample sizes in the hundreds of thousands. Here, we present a scalable algorithm that approximates the DTWF model with provably bounded error. Our approach relies on two key observations about the DTWF model. The first is that transition probabilities under the model are approximately sparse. The second is that transition distributions for similar starting allele frequencies are extremely close as distributions. Together, these observations enable approximate matrix-vector multiplication in linear (as opposed to the usual quadratic) time. We prove similar properties for Hypergeometric distributions, enabling fast computation of likelihoods for subsamples of the population. We show theoretically and in practice that this approximation is highly accurate and can scale to population sizes in the tens of millions, paving the way for rigorous biobank-scale inference. Finally, we use our results to estimate the impact of larger samples on estimating selection coefficients for loss-of-function variants. We find that increasing sample sizes beyond existing large exome sequencing cohorts will provide essentially no additional information except for genes with the most extreme fitness effects.


Asunto(s)
Bancos de Muestras Biológicas , Genética de Población , Frecuencia de los Genes , Flujo Genético , Probabilidad , Modelos Genéticos , Selección Genética
16.
Nat Ecol Evol ; 7(9): 1515-1524, 2023 09.
Artículo en Inglés | MEDLINE | ID: mdl-37592021

RESUMEN

The Iron Age was a dynamic period in central Mediterranean history, with the expansion of Greek and Phoenician colonies and the growth of Carthage into the dominant maritime power of the Mediterranean. These events were facilitated by the ease of long-distance travel following major advances in seafaring. We know from the archaeological record that trade goods and materials were moving across great distances in unprecedented quantities, but it is unclear how these patterns correlate with human mobility. Here, to investigate population mobility and interactions directly, we sequenced the genomes of 30 ancient individuals from coastal cities around the central Mediterranean, in Tunisia, Sardinia and central Italy. We observe a meaningful contribution of autochthonous populations, as well as highly heterogeneous ancestry including many individuals with non-local ancestries from other parts of the Mediterranean region. These results highlight both the role of local populations and the extreme interconnectedness of populations in the Iron Age Mediterranean. By studying these trans-Mediterranean neighbours together, we explore the complex interplay between local continuity and mobility that shaped the Iron Age societies of the central Mediterranean.


Asunto(s)
ADN Antiguo , Migración Humana , Región Mediterránea , Arqueología , Migración Humana/historia , Humanos , Análisis de Componente Principal , Genética Humana , ADN Antiguo/análisis , Análisis de Secuencia de ADN , Entierro , Antropología , Historia Antigua
17.
Science ; 381(6658): eade6289, 2023 08 11.
Artículo en Inglés | MEDLINE | ID: mdl-37561850

RESUMEN

Skin color, one of the most diverse human traits, is determined by the quantity, type, and distribution of melanin. In this study, we leveraged the light-scattering properties of melanin to conduct a genome-wide screen for regulators of melanogenesis. We identified 169 functionally diverse genes that converge on melanosome biogenesis, endosomal transport, and gene regulation, of which 135 represented previously unknown associations with pigmentation. In agreement with their melanin-promoting function, the majority of screen hits were up-regulated in melanocytes from darkly pigmented individuals. We further unraveled functions of KLF6 as a transcription factor that regulates melanosome maturation and pigmentation in vivo, and of the endosomal trafficking protein COMMD3 in modulating melanosomal pH. Our study reveals a plethora of melanin-promoting genes, with broad implications for human variation, cell biology, and medicine.


Asunto(s)
Proteínas Adaptadoras Transductoras de Señales , Factor 6 Similar a Kruppel , Melaninas , Melanocitos , Melanosomas , Pigmentación de la Piel , Humanos , Melaninas/biosíntesis , Melaninas/genética , Melanocitos/metabolismo , Melanosomas/metabolismo , Pigmentación de la Piel/genética , Estudio de Asociación del Genoma Completo , Proteínas Adaptadoras Transductoras de Señales/genética , Proteínas Adaptadoras Transductoras de Señales/metabolismo , Factor 6 Similar a Kruppel/genética , Factor 6 Similar a Kruppel/metabolismo , Endosomas/metabolismo , Animales , Ratones , Línea Celular Tumoral
18.
Nature ; 621(7977): 188-195, 2023 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-37648854

RESUMEN

γδ T cells are potent anticancer effectors with the potential to target tumours broadly, independent of patient-specific neoantigens or human leukocyte antigen background1-5. γδ T cells can sense conserved cell stress signals prevalent in transformed cells2,3, although the mechanisms behind the targeting of stressed target cells remain poorly characterized. Vγ9Vδ2 T cells-the most abundant subset of human γδ T cells4-recognize a protein complex containing butyrophilin 2A1 (BTN2A1) and BTN3A1 (refs. 6-8), a widely expressed cell surface protein that is activated by phosphoantigens abundantly produced by tumour cells. Here we combined genome-wide CRISPR screens in target cancer cells to identify pathways that regulate γδ T cell killing and BTN3A cell surface expression. The screens showed previously unappreciated multilayered regulation of BTN3A abundance on the cell surface and triggering of γδ T cells through transcription, post-translational modifications and membrane trafficking. In addition, diverse genetic perturbations and inhibitors disrupting metabolic pathways in the cancer cells, particularly ATP-producing processes, were found to alter BTN3A levels. This induction of both BTN3A and BTN2A1 during metabolic crises is dependent on AMP-activated protein kinase (AMPK). Finally, small-molecule activation of AMPK in a cell line model and in patient-derived tumour organoids led to increased expression of the BTN2A1-BTN3A complex and increased Vγ9Vδ2 T cell receptor-mediated killing. This AMPK-dependent mechanism of metabolic stress-induced ligand upregulation deepens our understanding of γδ T cell stress surveillance and suggests new avenues available to enhance γδ T cell anticancer activity.


Asunto(s)
Sistemas CRISPR-Cas , Edición Génica , Neoplasias , Receptores de Antígenos de Linfocitos T gamma-delta , Linfocitos T , Humanos , Proteínas Quinasas Activadas por AMP/genética , Proteínas Quinasas Activadas por AMP/metabolismo , Línea Celular , Membrana Celular/metabolismo , Neoplasias/genética , Neoplasias/inmunología , Neoplasias/metabolismo , Receptores de Antígenos de Linfocitos T gamma-delta/inmunología , Receptores de Antígenos de Linfocitos T gamma-delta/metabolismo , Linfocitos T/inmunología , Linfocitos T/metabolismo
19.
Res Sq ; 2023 Jun 13.
Artículo en Inglés | MEDLINE | ID: mdl-37398424

RESUMEN

Measures of selective constraint on genes have been used for many applications including clinical interpretation of rare coding variants, disease gene discovery, and studies of genome evolution. However, widely-used metrics are severely underpowered at detecting constraint for the shortest ~25% of genes, potentially causing important pathogenic mutations to be overlooked. We developed a framework combining a population genetics model with machine learning on gene features to enable accurate inference of an interpretable constraint metric, shet. Our estimates outperform existing metrics for prioritizing genes important for cell essentiality, human disease, and other phenotypes, especially for short genes. Our new estimates of selective constraint should have wide utility for characterizing genes relevant to human disease. Finally, our inference framework, GeneBayes, provides a flexible platform that can improve estimation of many gene-level properties, such as rare variant burden or gene expression differences.

20.
Genetics ; 224(3)2023 Jul 06.
Artículo en Inglés | MEDLINE | ID: mdl-37410594

RESUMEN

Members of genetically admixed populations possess ancestry from multiple source groups, and studies of human genetic admixture frequently estimate ancestry components corresponding to fractions of individual genomes that trace to specific ancestral populations. However, the same numerical ancestry fraction can represent a wide array of admixture scenarios within an individual's genealogy. Using a mechanistic model of admixture, we consider admixture genealogically: how many ancestors from the source populations does the admixture represent? We consider African-Americans, for whom continent-level estimates produce a 75-85% value for African ancestry on average and 15-25% for European ancestry. Genetic studies together with key features of African-American demographic history suggest ranges for parameters of a simple three-epoch model. Considering parameter sets compatible with estimates of current ancestry levels, we infer that if all genealogical lines of a random African-American born during 1960-1965 are traced back until they reach members of source populations, the mean over parameter sets of the expected number of genealogical lines terminating with African individuals is 314 (interquartile range 240-376), and the mean of the expected number terminating in Europeans is 51 (interquartile range 32-69). Across discrete generations, the peak number of African genealogical ancestors occurs in birth cohorts from the early 1700s, and the probability exceeds 50% that at least one European ancestor was born more recently than 1835. Our genealogical perspective can contribute to further understanding the admixture processes that underlie admixed populations. For African-Americans, the results provide insight both on how many of the ancestors of a typical African-American might have been forcibly displaced in the Transatlantic Slave Trade and on how many separate European admixture events might exist in a typical African-American genealogy.


Asunto(s)
Población Negra , Negro o Afroamericano , Humanos , Población Negra/genética , Negro o Afroamericano/genética , Genética de Población
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA