ABSTRACT
Approximately 15 genes have been directly associated with skin pigmentation variation in humans, leading to its characterization as a relatively simple trait. However, by assembling a global survey of quantitative skin pigmentation phenotypes, we demonstrate that pigmentation is more complex than previously assumed, with genetic architecture varying by latitude. We investigate polygenicity in the KhoeSan populations indigenous to southern Africa who have considerably lighter skin than equatorial Africans. We demonstrate that skin pigmentation is highly heritable, but known pigmentation loci explain only a small fraction of the variance. Rather, baseline skin pigmentation is a complex, polygenic trait in the KhoeSan. Despite this, we identify canonical and non-canonical skin pigmentation loci, including near SLC24A5, TYRP1, SMARCA2/VLDLR, and SNX13, using a genome-wide association approach complemented by targeted resequencing. By considering diverse, under-studied African populations, we show how the architecture of skin pigmentation can vary across humans subject to different local evolutionary pressures.
Subjects
Skin Pigmentation , Africa , Black People/genetics , Humans , Single Nucleotide Polymorphism
ABSTRACT
The earliest events during human tumour initiation, although poorly characterized, may hold clues to malignancy detection and prevention1. Here we model occult preneoplasia by biallelic inactivation of TP53, a common early event in gastric cancer, in human gastric organoids. Causal relationships between this initiating genetic lesion and resulting phenotypes were established using experimental evolution in multiple clonally derived cultures over 2 years. TP53 loss elicited progressive aneuploidy, including copy number alterations and structural variants prevalent in gastric cancers, with evident preferred orders. Longitudinal single-cell sequencing of TP53-deficient gastric organoids similarly indicates progression towards malignant transcriptional programmes. Moreover, high-throughput lineage tracing with expressed cellular barcodes demonstrates reproducible dynamics whereby initially rare subclones with shared transcriptional programmes repeatedly attain clonal dominance. This powerful platform for experimental evolution exposes stringent selection, clonal interference and a marked degree of phenotypic convergence in premalignant epithelial organoids. These data imply predictability in the earliest stages of tumorigenesis and show evolutionary constraints and barriers to malignant transformation, with implications for earlier detection and interception of aggressive, genome-instable tumours.
Subjects
Neoplastic Cell Transformation , Clonal Evolution , Precancerous Conditions , Genetic Selection , Stomach Neoplasms , Humans , Neoplastic Cell Transformation/genetics , Neoplastic Cell Transformation/pathology , Clonal Evolution/genetics , Genomic Instability , Mutation , Stomach Neoplasms/genetics , Stomach Neoplasms/pathology , Precancerous Conditions/genetics , Precancerous Conditions/pathology , Organoids/metabolism , Organoids/pathology , Aneuploidy , DNA Copy Number Variations , Single-Cell Analysis , Tumor Suppressor Protein p53/deficiency , Tumor Suppressor Protein p53/genetics , Disease Progression , Cell Lineage
ABSTRACT
Polynesia was settled in a series of extraordinary voyages across an ocean spanning one third of the Earth1, but the sequences of islands settled remain unknown and their timings disputed. Currently, several centuries separate the dates suggested by different archaeological surveys2-4. Here, using genome-wide data from merely 430 modern individuals from 21 key Pacific island populations and novel ancestry-specific computational analyses, we unravel the detailed genetic history of this vast, dispersed island network. Our reconstruction of the branching Polynesian migration sequence reveals a serial founder expansion, characterized by directional loss of variants, that originated in Samoa and spread first through the Cook Islands (Rarotonga), then to the Society (Totaiete ma) Islands (11th century), the western Austral (Tuha'a Pae) Islands and Tuamotu Archipelago (12th century), and finally to the widely separated, but genetically connected, megalithic statue-building cultures of the Marquesas (Te Henua 'Enana) Islands in the north, Raivavae in the south, and Easter Island (Rapa Nui), the easternmost of the Polynesian islands, settled in approximately AD 1200 via Mangareva.
Subjects
Human Genome/genetics , Genomics , Human Migration/history , Native Hawaiian or Other Pacific Islander/genetics , Female , Medieval History , Humans , Male , Polynesia
ABSTRACT
The possibility of voyaging contact between prehistoric Polynesian and Native American populations has long intrigued researchers. Proponents have pointed to the existence of New World crops, such as the sweet potato and bottle gourd, in the Polynesian archaeological record, but nowhere else outside the pre-Columbian Americas1-6, while critics have argued that these botanical dispersals need not have been human mediated7. The Norwegian explorer Thor Heyerdahl controversially suggested that prehistoric South American populations had an important role in the settlement of east Polynesia and particularly of Easter Island (Rapa Nui)2. Several limited molecular genetic studies have reached opposing conclusions, and the possibility continues to be as hotly contested today as it was when first suggested8-12. Here we analyse genome-wide variation in individuals from islands across Polynesia for signs of Native American admixture, analysing 807 individuals from 17 island populations and 15 Pacific coast Native American groups. We find conclusive evidence for prehistoric contact of Polynesian individuals with Native American individuals (around AD 1200) contemporaneous with the settlement of remote Oceania13-15. Our analyses suggest strongly that a single contact event occurred in eastern Polynesia, before the settlement of Rapa Nui, between Polynesian individuals and a Native American group most closely related to the indigenous inhabitants of present-day Colombia.
Subjects
Gene Flow/genetics , Human Genome/genetics , Human Migration/history , Central American Indians/genetics , South American Indians/genetics , Islands , Native Hawaiian or Other Pacific Islander/genetics , Central America/ethnology , Colombia/ethnology , Europe/ethnology , Population Genetics , Medieval History , Humans , Single Nucleotide Polymorphism/genetics , Polynesia , South America/ethnology , Time Factors
ABSTRACT
Preeclampsia is a multi-organ complication of pregnancy characterized by sudden hypertension and proteinuria that is among the leading causes of preterm delivery and maternal morbidity and mortality worldwide. The heterogeneity of preeclampsia poses a challenge for understanding its etiology and molecular basis. Intriguingly, risk for the condition increases in high-altitude regions such as the Peruvian Andes. To investigate the genetic basis of preeclampsia in a population living at high altitude, we characterized genome-wide variation in a cohort of preeclamptic and healthy Andean families (n = 883) from Puno, Peru, a city located above 3,800 meters of altitude. Our study collected genomic DNA and medical records from case-control trios and duos in local hospital settings. We generated genotype data for 439,314 SNPs, determined global ancestry patterns, and mapped associations between genetic variants and preeclampsia phenotypes. A transmission disequilibrium test (TDT) revealed variants near genes of biological importance for placental and blood vessel function. The top candidate region was found on chromosome 13 of the fetal genome and contains clotting factor genes PROZ, F7, and F10. These findings provide supporting evidence that common genetic variants within coagulation genes play an important role in preeclampsia. A selection scan revealed a potential adaptive signal around the ADAM12 locus on chromosome 10, implicated in pregnancy disorders. Our discovery of an association in a functional pathway relevant to pregnancy physiology in an understudied population of Native American origin demonstrates the increased power of family-based study design and underscores the importance of conducting genetic research in diverse populations.
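The transmission disequilibrium test named in this abstract can be illustrated with a minimal sketch. It assumes the classic McNemar-type form of the statistic for a biallelic marker, computed from transmissions by heterozygous parents only; the function name `tdt_statistic` and the counts in the usage line are hypothetical and are not taken from the study's pipeline or data.

```python
import math


def tdt_statistic(transmitted: int, untransmitted: int) -> tuple[float, float]:
    """Return (chi-square, p-value) for a McNemar-type TDT.

    transmitted   -- times heterozygous parents passed the test allele
                     to an affected child (hypothetical input)
    untransmitted -- times they passed the other allele instead
    """
    total = transmitted + untransmitted
    if total == 0:
        raise ValueError("no informative (heterozygous) transmissions")
    chi2 = (transmitted - untransmitted) ** 2 / total
    # Survival function of a chi-square variable with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


# Hypothetical counts, for illustration only.
chi2, p_value = tdt_statistic(60, 40)
print(f"chi2 = {chi2:.2f}, p = {p_value:.4f}")
```

Because only heterozygous parents contribute, a statistic of this kind is not inflated by population stratification in the way a plain case-control comparison can be, which is one reason family-based designs are attractive in admixed cohorts.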
Subjects
Pre-Eclampsia , Altitude , Blood Coagulation Factors , Blood Proteins/genetics , Case-Control Studies , Factor VII/genetics , Factor X/genetics , Female , Humans , Peru/epidemiology , Placenta , Pre-Eclampsia/epidemiology , Pre-Eclampsia/genetics , Pregnancy
ABSTRACT
Genome-wide association studies (GWAS) have laid the foundation for investigations into the biology of complex traits, drug development and clinical guidelines. However, the majority of discovery efforts are based on data from populations of European ancestry1-3. In light of the differential genetic architecture that is known to exist between populations, bias in representation can exacerbate existing disease and healthcare disparities. Critical variants may be missed if they have a low frequency or are completely absent in European populations, especially as the field shifts its attention towards rare variants, which are more likely to be population-specific4-10. Additionally, effect sizes estimated in one population, and the risk prediction scores derived from them, may not accurately extrapolate to other populations11,12. Here we demonstrate the value of diverse, multi-ethnic participants in large-scale genomic studies. The Population Architecture using Genomics and Epidemiology (PAGE) study conducted a GWAS of 26 clinical and behavioural phenotypes in 49,839 non-European individuals. Using strategies tailored for analysis of multi-ethnic and admixed populations, we describe a framework for analysing diverse populations, identify 27 novel loci and 38 secondary signals at known loci, as well as replicate 1,444 GWAS catalogue associations across these traits. Our data show evidence of effect-size heterogeneity across ancestries for published GWAS associations, substantial benefits for fine-mapping using diverse cohorts and insights into clinical implications. In the United States, where minority populations have a disproportionately higher burden of chronic conditions13, the lack of representation of diverse populations in genetic research will result in inequitable access to precision medicine for those with the highest burden of disease. We strongly advocate for continued, large genome-wide efforts in diverse populations to maximize genetic discovery and reduce health disparities.
Subjects
Asian People/genetics , Black People/genetics , Genome-Wide Association Study/methods , Hispanic or Latino/genetics , Minority Groups , Multifactorial Inheritance/genetics , Women's Health , Body Height/genetics , Cohort Studies , Female , Medical Genetics/methods , Health Equity/trends , Health Status Disparities , Humans , Male , United States
ABSTRACT
Over a decade ago, the Atacama humanoid skeleton (Ata) was discovered in the Atacama region of Chile. The Ata specimen carried a strange phenotype (6-in stature, fewer than expected ribs, elongated cranium, and accelerated bone age), leading to speculation that this was a preserved nonhuman primate, a human fetus harboring genetic mutations, or even an extraterrestrial. We previously reported that it was human by DNA analysis with an estimated bone age of about 6-8 yr at the time of demise. To determine the possible genetic drivers of the observed morphology, DNA from the specimen was subjected to whole-genome sequencing using the Illumina HiSeq platform with an average 11.5× coverage of 101-bp, paired-end reads. In total, 3,356,569 single nucleotide variations (SNVs), 518,365 insertions and deletions (indels), and 1047 structural variations (SVs) were detected relative to the human reference genome. Here, we present the detailed whole-genome analysis showing that Ata is a female of human origin, likely of Chilean descent, and its genome harbors mutations in genes (COL1A1, COL2A1, KMT2D, FLNB, ATR, TRIP11, PCNT) previously linked with diseases of small stature, rib anomalies, cranial malformations, premature joint fusion, and osteochondrodysplasia (also known as skeletal dysplasia). Together, these findings provide a molecular characterization of Ata's peculiar phenotype, which likely results from multiple known and novel putative gene mutations affecting bone development and ossification.
Subjects
Ancient DNA/analysis , Human Genome/genetics , Osteochondrodysplasias/genetics , Whole Genome Sequencing , Animals , Female , High-Throughput Nucleotide Sequencing , Humans , INDEL Mutation , Molecular Sequence Annotation , Mutation/genetics , Osteochondrodysplasias/physiopathology , Phenotype , Single Nucleotide Polymorphism/genetics
ABSTRACT
The extent to which prehistoric migrations of farmers influenced the genetic pool of western North Africans remains unclear. Archaeological evidence suggests that the Neolithization process may have happened through the adoption of innovations by local Epipaleolithic communities or by demic diffusion from the Eastern Mediterranean shores or Iberia. Here, we present an analysis of individuals' genome sequences from Early and Late Neolithic sites in Morocco and from Early Neolithic individuals from southern Iberia. We show that Early Neolithic Moroccans (~5,000 BCE) are similar to Later Stone Age individuals from the same region and possess an endemic element retained in present-day Maghrebi populations, confirming a long-term genetic continuity in the region. This scenario is consistent with Early Neolithic traditions in North Africa deriving from Epipaleolithic communities that adopted certain agricultural techniques from neighboring populations. Among Eurasian ancient populations, Early Neolithic Moroccans are distantly related to Levantine Natufian hunter-gatherers (~9,000 BCE) and Pre-Pottery Neolithic farmers (~6,500 BCE). Late Neolithic (~3,000 BCE) Moroccans, in contrast, share an Iberian component, supporting theories of trans-Gibraltar gene flow and indicating that Neolithization of North Africa involved both the movement of ideas and people. Lastly, the southern Iberian Early Neolithic samples share the same genetic composition as the Cardial Mediterranean Neolithic culture that reached Iberia ~5,500 BCE. The cultural and genetic similarities between Iberian and North African Neolithic traditions further reinforce the model of an Iberian migration into the Maghreb.
Subjects
Ethnicity/genetics , Human Genome , Human Migration/history , Northern Africa , Agriculture/history , Human Y Chromosomes/genetics , Mitochondrial DNA/genetics , Ethnicity/history , Europe , Gene Flow , Gene Library , Population Genetics , Ancient History , Humans , Middle East , Morocco , DNA Sequence Analysis , Spain/ethnology
ABSTRACT
Reduced representation sequencing methods such as genotyping-by-sequencing (GBS) enable low-cost measurement of genetic variation without the need for a reference genome assembly. These methods are widely used in genetic mapping and population genetics studies, especially with non-model organisms. Variant calling error rates, however, are higher in GBS than in standard sequencing, in particular due to restriction site polymorphisms, and few computational tools exist that specifically model and correct these errors. We developed a statistical method to remove errors caused by restriction site polymorphisms, implemented in the software package GBStools. We evaluated it in several simulated data sets, varying in number of samples, mean coverage and population mutation rate, and in two empirical human data sets (N = 8 and N = 63 samples). In our simulations, GBStools improved genotype accuracy more than commonly used filters such as Hardy-Weinberg equilibrium p-values. GBStools is most effective at removing genotype errors in data sets over 100 samples when coverage is 40X or higher, and the improvement is most pronounced in species with high genomic diversity. We also demonstrate the utility of GBS and GBStools for human population genetic inference in Argentine populations and reveal widely varying individual ancestry proportions and an excess of singletons, consistent with recent population growth.
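As a point of comparison for the baseline filter mentioned in this abstract, the following is a minimal sketch of a Hardy-Weinberg equilibrium chi-square test on genotype counts at one biallelic site. It is not the GBStools model; the function name `hwe_chi2` and the example counts are hypothetical. Allele dropout from a restriction site polymorphism tends to make heterozygotes look like homozygotes, which shows up here as a heterozygote deficit.

```python
import math


def hwe_chi2(n_aa: int, n_ab: int, n_bb: int) -> tuple[float, float]:
    """Return (chi-square, p-value) for departure from Hardy-Weinberg
    proportions, using a 1-degree-of-freedom chi-square approximation."""
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("no genotypes observed")
    p = (2 * n_aa + n_ab) / (2 * n)  # frequency of allele A
    q = 1.0 - p
    observed = (n_aa, n_ab, n_bb)
    expected = (p * p * n, 2 * p * q * n, q * q * n)
    # Skip classes with zero expectation (monomorphic sites).
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


# Hypothetical counts: a site with every heterozygote missing.
chi2, p_value = hwe_chi2(50, 0, 50)
print(f"chi2 = {chi2:.1f}, p = {p_value:.2e}")
```

A filter of this kind discards sites whose p-value falls below a chosen threshold. With few samples the chi-square approximation is rough, and an exact test would be the more careful choice.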
Subjects
Alleles , Genotyping Techniques , High-Throughput Nucleotide Sequencing/methods , Software , Statistics as Topic , Population Genetics , Humans , Single Nucleotide Polymorphism/genetics
ABSTRACT
BACKGROUND: As most ancient biological samples have low levels of endogenous DNA, it is advantageous to enrich for specific genomic regions prior to sequencing. One approach, in-solution capture-enrichment, retrieves sequences of interest and reduces the fraction of microbial DNA. In this work, we implement a capture-enrichment approach targeting informative regions of the Y chromosome in six human archaeological remains excavated in the Caribbean and dated between 200 and 3000 years BP. We compare the recovery rate of Y-chromosome capture (YCC) alone, whole-genome capture followed by YCC (WGC + YCC) versus non-enriched (pre-capture) libraries. RESULTS: The six samples show different levels of initial endogenous content, with very low (< 0.05%, 4 samples) or low (0.1-1.54%, 2 samples) percentages of sequenced reads mapping to the human genome. We recover 12-9549 times more targeted unique Y-chromosome sequences after capture, where 0.0-6.2% (WGC + YCC) and 0.0-23.5% (YCC) of the sequence reads were on-target, compared to 0.0-0.00003% pre-capture. In samples with endogenous DNA content greater than 0.1%, we found that WGC followed by YCC (WGC + YCC) yields lower enrichment due to the loss of complexity in consecutive capture experiments, whereas in samples with lower endogenous content, the libraries' initial low complexity leads to minor proportions of Y-chromosome reads. Finally, increasing recovery of informative sites enabled us to assign Y-chromosome haplogroups to some of the archeological remains and gain insights about their paternal lineages and origins. CONCLUSIONS: We present to our knowledge the first in-solution capture-enrichment method targeting the human Y-chromosome in aDNA sequencing libraries. YCC and WGC + YCC enrichments lead to an increase in the amount of Y-DNA sequences, as compared to libraries not enriched for the Y-chromosome. Our probe design effectively recovers regions of the Y-chromosome bearing phylogenetically informative sites, allowing us to identify paternal lineages with less sequencing than needed for pre-capture libraries. Finally, we recommend considering the endogenous content in the experimental design and avoiding consecutive rounds of capture, as clonality increases considerably with each round.
Subjects
Human Y Chromosomes , Ancient DNA/analysis , Ancient DNA/isolation & purification , Gene Library , DNA Sequence Analysis/methods , Whole Genome Sequencing/methods , Genomics , Ancient History , Humans
ABSTRACT
The mangrove rivulus, Kryptolebias marmoratus, is one of only two self-fertilizing hermaphroditic fish species and inhabits mangrove forests. While selfing can be advantageous, it reduces heterozygosity and decreases genetic diversity. Studies using microsatellites found that there are variable levels of selfing among populations of K. marmoratus, but overall, there is a low rate of outcrossing and, therefore, low heterozygosity. In this study, we used whole-genome data to assess the levels of heterozygosity in different lineages of the mangrove rivulus and infer the phylogenetic relationships among those lineages. We sequenced whole genomes from 15 lineages that were completely homozygous at microsatellite loci and used single nucleotide polymorphisms (SNPs) to determine heterozygosity levels. More variation was uncovered than in studies using microsatellite data because of the resolution of full genome sequencing data. Moreover, missense polymorphisms were found most often in genes associated with immune function and reproduction. Inferred phylogenetic relationships suggest that lineages largely group by their geographic distribution. The use of whole-genome data provided further insight into genetic diversity in this unique species. Although this study was limited by the number of lineages that were available, these data suggest that there is previously undescribed variation within lineages of K. marmoratus that could have functional consequences and (or) inform us about the limits to selfing (e.g., genetic load, accumulation of deleterious mutations) and selection that might favor the maintenance of heterozygosity. These results highlight the need to sequence additional individuals within and among lineages.
Subjects
Cyprinodontiformes/genetics , Hermaphroditic Organisms/genetics , Self-Fertilization , Whole Genome Sequencing/methods , Animals , Caribbean Region , Cyprinodontiformes/classification , Genome/genetics , Geography , Hermaphroditic Organisms/classification , Heterozygote , Homozygote , Microsatellite Repeats/genetics , Phylogeny , Single Nucleotide Polymorphism , Vertebrates/classification , Vertebrates/genetics
ABSTRACT
Variation in gene expression can provide insights into organismal responses to environmental stress and physiological mechanisms mediating adaptation to habitats with contrasting environmental conditions. We performed an RNA-sequencing experiment to quantify gene expression patterns in fish adapted to habitats with different combinations of environmental stressors, including the presence of toxic hydrogen sulphide (H2S) and the absence of light in caves. We specifically asked how gene expression varies among populations living in different habitats, whether population differences were consistent among organs, and whether there is evidence for shared expression responses in populations exposed to the same stressors. We analysed organ-specific transcriptome-wide data from four ecotypes of Poecilia mexicana (nonsulphidic surface, sulphidic surface, nonsulphidic cave and sulphidic cave). The majority of variation in gene expression was correlated with organ type, and the presence of specific environmental stressors elicited unique expression differences among organs. Shared patterns of gene expression between populations exposed to the same environmental stressors increased with levels of organismal organization (from transcript to gene to physiological pathway). In addition, shared patterns of gene expression were more common between populations from sulphidic than populations from cave habitats, potentially indicating that physiochemical stressors with clear biochemical consequences can constrain the diversity of adaptive solutions that mitigate their adverse effects. Overall, our analyses provided insights into transcriptional variation in a unique system, in which adaptation to H2S and darkness coincide. Functional annotations of differentially expressed genes provide a springboard for investigating physiological mechanisms putatively underlying adaptation to extreme environments.
Subjects
Physiological Adaptation/genetics , Caves , Ecosystem , Population Genetics , Hydrogen Sulfide , Poecilia/genetics , Animals , Extremophiles/genetics , Gene Expression
ABSTRACT
Organoids are powerful experimental models for studying the ontogeny and progression of various diseases including cancer. Organoids are conventionally cultured in bulk using an extracellular matrix mimic. However, bulk-cultured organoids physically overlap, making it impossible to track the growth of individual organoids over time in high throughput. Moreover, local spatial variations in bulk matrix properties make it difficult to assess whether observed phenotypic heterogeneity between organoids results from intrinsic cell differences or differences in the microenvironment. Here, we developed a microwell-based method that enables high-throughput quantification of image-based parameters for organoids grown from single cells, which can further be retrieved from their microwells for molecular profiling. Coupled with a deep learning image-processing pipeline, we characterized phenotypic traits including growth rates, cellular movement, and apical-basal polarity in two CRISPR-engineered human gastric organoid models, identifying genomic changes associated with increased growth rate and changes in accessibility and expression correlated with apical-basal polarity. A record of this paper's transparent peer review process is included in the supplemental information.
Subjects
Clustered Regularly Interspaced Short Palindromic Repeats , Extracellular Matrix , Humans , Cell Movement , Genomics , Organoids
ABSTRACT
Mutations in ARID1A rank among the most common molecular aberrations in human cancer. However, oncogenic consequences of ARID1A mutation in human cells remain poorly defined due to lack of forward genetic models. Here, CRISPR/Cas9-mediated ARID1A knockout (KO) in primary TP53-/- human gastric organoids induced morphologic dysplasia, tumorigenicity, and mucinous differentiation. Genetic WNT/β-catenin activation rescued mucinous differentiation, but not hyperproliferation, suggesting alternative pathways of ARID1A KO-mediated transformation. ARID1A mutation induced transcriptional regulatory modules characteristic of microsatellite instability and Epstein-Barr virus-associated subtype human gastric cancer, including FOXM1-associated mitotic genes and BIRC5/survivin. Convergently, high-throughput compound screening indicated selective vulnerability of ARID1A-deficient organoids to inhibition of BIRC5/survivin, functionally implicating this pathway as an essential mediator of ARID1A KO-dependent early-stage gastric tumorigenesis. Overall, we define distinct pathways downstream of oncogenic ARID1A mutation, with nonessential WNT-inhibited mucinous differentiation in parallel with essential transcriptional FOXM1/BIRC5-stimulated proliferation, illustrating the general utility of organoid-based forward genetic cancer analysis in human cells. SIGNIFICANCE: We establish the first human forward genetic modeling of a commonly mutated tumor suppressor gene, ARID1A. Our study integrates diverse modalities including CRISPR/Cas9 genome editing, organoid culture, systems biology, and small-molecule screening to derive novel insights into early transformation mechanisms of ARID1A-deficient gastric cancers. See related commentary by Zafra and Dow, p. 1327. This article is highlighted in the In This Issue feature, p. 1307.
Subjects
CRISPR-Cas Systems , Neoplastic Cell Transformation , DNA-Binding Proteins/genetics , Stomach Neoplasms/genetics , Transcription Factors/genetics , Humans , Biological Models , Mutation
ABSTRACT
Domestication has had a strong impact on the development of modern societies. We sequenced 200 genomes of the chocolate plant Theobroma cacao L. to show for the first time to our knowledge that a single population, the Criollo population, underwent strong domestication ~3600 years ago (95% CI: 2481-13,806 years ago). We also show that during the process of domestication, there was strong selection for genes involved in the metabolism of the colored protectants anthocyanins and the stimulant theobromine, as well as disease resistance genes. Our analyses show that domesticated populations of T. cacao (Criollo) maintain a higher proportion of high-frequency deleterious mutations. We also show for the first time the negative consequences of the increased accumulation of deleterious mutations during domestication on the fitness of individuals (significant reduction in kilograms of beans per hectare per year as Criollo ancestry increases, as estimated from a GLM, P = 0.000425).
ABSTRACT
We carried out an admixture mapping study of lipid traits in two samples from Mexico City. Native American locus ancestry was significantly associated with triglyceride levels in a broad region of chromosome 11 overlapping the BUD13, ZNF259 and APOA5 genes. In our fine-mapping analysis of this region using dense genome-wide data, rs964184 is the only marker included in the 99% credible set of SNPs, providing strong support for rs964184 as the causal variant within this region. The frequency of the allele associated with increased triglyceride concentrations (rs964184-G) is between 30-40% higher in Native American populations from Mexico than in European populations. The evidence currently available for this variant indicates that it may be exerting its effect through three potential mechanisms: 1) modification of enhancer activity, 2) regulation of the expression of several genes in cis and/or trans, or 3) modification of the methylation patterns of the promoter of the APOA5 gene.
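The idea of a 99% credible set used in this abstract can be sketched as follows. The sketch assumes that each SNP in the region already has a posterior probability (or an unnormalised weight such as a Bayes factor) of being the causal variant, and it collects the highest-ranked SNPs until their cumulative share reaches the requested level. The function name `credible_set`, the marker names other than rs964184, and all numerical values are hypothetical illustrations, not results from the study.

```python
def credible_set(weights: dict[str, float], level: float = 0.99) -> list[str]:
    """Return the smallest top-ranked set of SNPs whose normalised
    weights sum to at least `level`."""
    total = sum(weights.values())
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    ranked = sorted(weights.items(), key=lambda item: item[1], reverse=True)
    chosen: list[str] = []
    cumulative = 0.0
    for snp, weight in ranked:
        chosen.append(snp)
        cumulative += weight / total  # normalise so shares sum to 1
        if cumulative >= level:
            break
    return chosen


# Hypothetical posterior probabilities, for illustration only.
example = {"rs964184": 0.995, "snp_b": 0.003, "snp_c": 0.002}
print(credible_set(example))
```

When a single marker carries nearly all of the posterior mass, as in this made-up example, the credible set collapses to that one marker, which is the pattern the abstract reports for rs964184.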