RESUMEN
SignificanceCalifornia supports a high cultural and linguistic diversity of Indigenous peoples. In a partnership of researchers with the Muwekma Ohlone tribe, we studied genomes of eight present-day tribal members and 12 ancient individuals from two archaeological sites in the San Francisco Bay Area, spanning â¼2,000 y. We find that compared to genomes of Indigenous individuals from throughout the Americas, the 12 ancient individuals are most genetically similar to ancient individuals from Southern California, and that despite spanning a large time period, they share distinctive ancestry. This ancestry is also shared with present-day tribal members, providing evidence of genetic continuity between past and present Indigenous individuals in the region, in contrast to some popular reconstructions based on archaeological and linguistic information.
Asunto(s)
Genómica , Pueblos Indígenas , Arqueología , ADN Antiguo , Genética de Población , Historia Antigua , Humanos , Lingüística , San FranciscoRESUMEN
An increasing body of archaeological and genomic evidence has hinted at a complex settlement process of the Americas by humans. This is especially true for South America, where unexpected ancestral signals have raised perplexing scenarios for the early migrations into different regions of the continent. Here, we present ancient human genomes from the archaeologically rich Northeast Brazil and compare them to ancient and present-day genomic data. We find a distinct relationship between ancient genomes from Northeast Brazil, Lagoa Santa, Uruguay and Panama, representing evidence for ancient migration routes along South America's Atlantic coast. To further add to the existing complexity, we also detect greater Denisovan than Neanderthal ancestry in ancient Uruguay and Panama individuals. Moreover, we find a strong Australasian signal in an ancient genome from Panama. This work sheds light on the deep demographic history of eastern South America and presents a starting point for future fine-scale investigations on the regional level.
Asunto(s)
Migración Humana , Hombre de Neandertal , Humanos , Historia Antigua , Animales , Genómica , Genoma Humano , BrasilRESUMEN
We have previously hypothesized that relatively small and isolated rural communities may experience founder effects, defined as the genetic ramifications of small population sizes at the time of a community's establishment. To explore this, we used an Illumina Infinium Omni2.5Exome-8 chip to collect data from 157 individuals from four Illinois communities, three rural and one urban. Genetic diversity estimates of 999,259 autosomal markers suggested that the reduction in heterozygosity due to shared ancestry was approximately 0, indicating a randomly mating population. An eigenanalysis, which is similar to a principal component analysis but run on a genetic coancestry matrix, conducted in the SNPRelate R package revealed that most of these individuals formed one cluster, with a few putative outliers obscuring population variation. An additional eigenanalysis on the same markers in a combined data set including the 2,504 individuals in the 1000 Genomes database found that most of the 157 Illinois individuals clustered into one group in close proximity to individuals of European descent. A final eigenanalysis of the Illinois individuals with the 503 individuals of European descent (within the 1000 Genomes Project) revealed two clusters of individuals and likely two source populations; one British and one consisting of multiple European subpopulations. We therefore demonstrate the feasibility of examining genetic relatedness across Illinois populations and assessing the number of source populations using publicly available databases. When assessed, population structure information can contribute to the understanding of genetic history in rural populations.
Asunto(s)
Variación Genética/genética , Genética de Población/estadística & datos numéricos , Población Blanca/genética , Efecto Fundador , Estudio de Asociación del Genoma Completo , Humanos , Illinois/epidemiología , Análisis de Componente Principal , Población RuralRESUMEN
This study presents genetic data for nine Native American populations from northern North America. Analyses of genetic variation focus on the Pacific Northwest (PNW). Using mitochondrial, Y chromosomal, and autosomal DNA variants, we aimed to more closely address the relationships of geography and language with present genetic diversity among the regional PNW Native American populations. Patterns of genetic diversity exhibited by the three genetic systems were consistent with our hypotheses: genetic variation was more strongly explained by geographic proximity than by linguistic structure. Our findings were corroborated through a variety on analytic approaches, with the unrooted trees for the three genetic systems consistently separating inland from coastal PNW populations. Furthermore, analyses of molecular variance support the trends exhibited by the unrooted trees, with geographic partitioning of PNW populations (FCT = 19.43%, p = 0.010 ± 0.009) accounting for over twice as much of the observed genetic variation as linguistic partitioning of the same populations (FCT = 9.15%, p = 0.193 ± 0.013). These findings demonstrate a consensus with previous PNW population studies examining the relationships of genome-wide variation, mitochondrial haplogroup frequencies, and skeletal morphology with geography and language.
Asunto(s)
Genética de Población , Indígenas Norteamericanos/genética , Filogenia , Cromosomas Humanos Y , Análisis por Conglomerados , ADN Mitocondrial/genética , Emigración e Inmigración , Variación Genética , Geografía , Humanos , Lingüística , Noroeste de Estados Unidos , Análisis de Secuencia de ADNRESUMEN
For decades, the peopling of the Americas has been explored through the analysis of uniparentally inherited genetic systems in Native American populations and the comparison of these genetic data with current linguistic groupings. In northern North America, two language families predominate: Eskimo-Aleut and Na-Dene. Although the genetic evidence from nuclear and mtDNA loci suggest that speakers of these language families share a distinct biological origin, this model has not been examined using data from paternally inherited Y chromosomes. To test this hypothesis and elucidate the migration histories of Eskimoan- and Athapaskan-speaking populations, we analyzed Y-chromosomal data from Inuvialuit, Gwich'in, and Tlich populations living in the Northwest Territories of Canada. Over 100 biallelic markers and 19 chromosome short tandem repeats (STRs) were genotyped to produce a high-resolution dataset of Y chromosomes from these groups. Among these markers is an SNP discovered in the Inuvialuit that differentiates them from other Aboriginal and Native American populations. The data suggest that Canadian Eskimoan- and Athapaskan-speaking populations are genetically distinct from one another and that the formation of these groups was the result of two population expansions that occurred after the initial movement of people into the Americas. In addition, the population history of Athapaskan speakers is complex, with the Tlich being distinct from other Athapaskan groups. The high-resolution biallelic data also make clear that Y-chromosomal diversity among the first Native Americans was greater than previously recognized.
Asunto(s)
Cromosomas Humanos Y/genética , Variación Genética , Indígenas Norteamericanos/genética , Inuk/genética , Filogenia , Canadá , Cromosomas Humanos Par 19/genética , Emigración e Inmigración , Frecuencia de los Genes , Genética de Población/métodos , Genotipo , Geografía , Haplotipos/genética , Humanos , Masculino , Repeticiones de Microsatélite/genética , Mutación , Tasa de Mutación , Polimorfismo de Nucleótido SimpleRESUMEN
Puerto Rico and the surrounding islands rest on the eastern fringe of the Caribbean's Greater Antilles, located less than 100 miles northwest of the Lesser Antilles. Puerto Ricans are genetic descendants of pre-Columbian peoples, as well as peoples of European and African descent through 500 years of migration to the island. To infer these patterns of pre-Columbian and historic peopling of the Caribbean, we characterized genetic diversity in 326 individuals from the southeastern region of Puerto Rico and the island municipality of Vieques. We sequenced the mitochondrial DNA (mtDNA) control region of all of the samples and the complete mitogenomes of 12 of them to infer their putative place of origin. In addition, we genotyped 121 male samples for 25 Y-chromosome single nucleotide polymorphism and 17 STR loci. Approximately 60% of the participants had indigenous mtDNA haplotypes (mostly from haplogroups A2 and C1), while 25% had African and 15% European haplotypes. Three A2 sublineages were unique to the Greater Antilles, one of which was similar to Mesoamerican types, while C1b haplogroups showed links to South America, suggesting that people reached the island from the two distinct continental source areas. However, none of the male participants had indigenous Y-chromosomes, with 85% of them instead being European/Mediterranean and 15% sub-Saharan African in origin. West Eurasian Y-chromosome short tandem repeat haplotypes were quite diverse and showed similarities to those observed in southern Europe, North Africa and the Middle East. These results attest to the distinct, yet equally complex, pasts for the male and female ancestors of modern day Puerto Ricans.
Asunto(s)
Variación Genética/genética , Haplotipos/genética , Indígenas Sudamericanos/genética , Población Blanca/genética , Antropología Física , Cromosomas Humanos Y/genética , ADN Mitocondrial/genética , Femenino , Migración Humana , Humanos , Masculino , Puerto Rico , Indias OccidentalesRESUMEN
Most studies focusing on human high-altitude adaptation in the Andean highlands have thus far been focused on Peruvian populations. We present high-coverage whole genomes from Indigenous people living in the Ecuadorian highlands and perform multi-method scans to detect positive natural selection. We identified regions of the genome that show signals of strong selection to both cardiovascular and hypoxia pathways, which are distinct from those uncovered in Peruvian populations. However, the strongest signals of selection were related to regions of the genome that are involved in immune function related to tuberculosis. Given our estimated timing of this selection event, the Indigenous people of Ecuador may have adapted to Mycobacterium tuberculosis thousands of years before the arrival of Europeans. Furthermore, we detect a population collapse that coincides with the arrival of Europeans, which is more severe than other regions of the Andes, suggesting differing effects of contact across high-altitude populations.
RESUMEN
The linguistically distinctive Haida and Tlingit tribes of Southeast Alaska are known for their rich material culture, complex social organization, and elaborate ritual practices. However, much less is known about these tribes from a population genetic perspective. For this reason, we analyzed mtDNA and Y-chromosome variation in Haida and Tlingit populations to elucidate several key issues pertaining to the history of this region. These included the genetic relationships of Haida and Tlingit to other indigenous groups in Alaska and Canada; the relationship between linguistic and genetic data for populations assigned to the Na-Dene linguistic family, specifically, the inclusion of Haida with Athapaskan, Eyak, and Tlingit in the language family; the possible influence of matrilineal clan structure on patterns of genetic variation in Haida and Tlingit populations; and the impact of European entry into the region on the genetic diversity of these indigenous communities. Our analysis indicates that, while sharing a "northern" genetic profile, the Haida and the Tlingit are genetically distinctive from each other. In addition, Tlingit groups themselves differ across their geographic range, in part due to interactions of Tlingit tribes with Athapaskan and Eyak groups to the north. The data also reveal a strong influence of maternal clan identity on mtDNA variation in these groups, as well as the significant influence of non-native males on Y-chromosome diversity. These results yield new details about the histories of the Haida and Tlingit tribes in this region.
Asunto(s)
Emigración e Inmigración/historia , Indígenas Norteamericanos/genética , Indígenas Norteamericanos/historia , Lenguaje/historia , Alaska , Análisis de Varianza , Cromosomas Humanos Y , ADN Mitocondrial/genética , Femenino , Efecto Fundador , Haplotipos , Historia Antigua , Humanos , Masculino , Repeticiones de Microsatélite , Polimorfismo de Nucleótido SimpleRESUMEN
Discovered in the early 16th century by European colonists, Bermuda is an isolated set of islands located in the mid-Atlantic. Shortly after its discovery, Bermuda became the first English colony to forcibly import its labor by trafficking in enslaved Africans, white ethnic minorities, and indigenous Americans. Oral traditions circulating today among contemporary tribes from the northeastern United States recount these same events, while, in Bermuda, St. David's Islanders consider their histories to be linked to a complex Native American, European, and African past. To investigate the influence of historical events on biological ancestry and native cultural identity, we analyzed genetic variation in 111 members of Bermuda's self-proclaimed St. David's Island Native Community. Our results reveal that the majority of mitochondrial DNA (mtDNA) and Y-chromosome haplotypes are of African and West Eurasian origin. However, unlike other English-speaking New World colonies, most African mtDNA haplotypes appear to derive from central and southeast Africa, reflecting the extent of maritime activities in the region. In light of genealogical and oral historical data from the St. David's community, the low frequency of Native American mtDNA and NRY lineages may reflect the influence of genetic drift, the demographic impact of European colonization, and historical admixture with persons of non-native backgrounds, which began with the settlement of the islands. By comparing the genetic data with genealogical and historical information, we are able to reconstruct the complex history of this Bermudian community, which is unique among New World populations.
Asunto(s)
Población Negra/genética , Variación Genética , Genética de Población , Indígenas Norteamericanos/genética , Población Blanca/genética , Bermudas , Cromosomas Humanos Y , ADN Mitocondrial/genética , Flujo Genético , Haplotipos , Humanos , Masculino , Repeticiones de Microsatélite , FilogeniaRESUMEN
Native Mexican populations are crucial for understanding the genetic ancestry of Aztec descendants and coexisting ethnolinguistic groups in the Valley of Mexico and elucidating the population dynamics of the prehistoric colonization of the Americas. Mesoamerican societies were multicultural in nature and also experienced significant admixture during Spanish colonization of the region. Despite these facts, Native Mexican Y chromosome diversity has been greatly understudied. To further elucidate their genetic history, we conducted a high-resolution Y chromosome analysis with Chichimecas, Nahuas, Otomies, Popolocas, Tepehuas, and Totonacas using 19 Y-short tandem repeat and 21 single nucleotide polymorphism loci. We detected enormous paternal genetic diversity in these groups, with haplogroups Q-MEH2, Q-M3, Q-Z768, Q-L663, Q-Z780, and Q-PV3 being identified. These data affirmed the southward colonization of the Americas via Beringia and connected Native Mexicans with indigenous populations from South-Central Siberia and Canada. They also suggested that multiple population dispersals gave rise to Y chromosome diversity in these populations.
RESUMEN
Aligning sequences for phylogenetic analysis (multiple sequence alignment; MSA) is an important, but increasingly computationally expensive step with the recent surge in DNA sequence data. Much of this sequence data is publicly available, but can be extremely fragmentary (i.e., a combination of full genomes and genomic fragments), which can compound the computational issues related to MSA. Traditionally, alignments are produced with automated algorithms and then checked and/or corrected "by eye" prior to phylogenetic inference. However, this manual curation is inefficient at the data scales required of modern phylogenetics and results in alignments that are not reproducible. Recently, methods have been developed for fully automating alignments of large data sets, but it is unclear if these methods produce alignments that result in compatible phylogenies when compared to more traditional alignment approaches that combined automated and manual methods. Here we use approximately 33,000 publicly available sequences from the hepatitis B virus (HBV), a globally distributed and rapidly evolving virus, to compare different alignment approaches. Using one data set comprised exclusively of whole genomes and a second that also included sequence fragments, we compared three MSA methods: (1) a purely automated approach using traditional software, (2) an automated approach including by eye manual editing, and (3) more recent fully automated approaches. To understand how these methods affect phylogenetic results, we compared resulting tree topologies based on these different alignment methods using multiple metrics. We further determined if the monophyly of existing HBV genotypes was supported in phylogenies estimated from each alignment type and under different statistical support thresholds. Traditional and fully automated alignments produced similar HBV phylogenies. Although there was variability between branch support thresholds, allowing lower support thresholds tended to result in more differences among trees. Therefore, differences between the trees could be best explained by phylogenetic uncertainty unrelated to the MSA method used. Nevertheless, automated alignment approaches did not require human intervention and were therefore considerably less time-intensive than traditional approaches. Because of this, we conclude that fully automated algorithms for MSA are fully compatible with older methods even in extremely difficult to align data sets. Additionally, we found that most HBV diagnostic genotypes did not correspond to evolutionarily-sound groups, regardless of alignment type and support threshold. This suggests there may be errors in genotype classification in the database or that HBV genotypes may need a revision.
RESUMEN
The search for a method that utilizes biological information to predict humans' place of origin has occupied scientists for millennia. Over the past four decades, scientists have employed genetic data in an effort to achieve this goal but with limited success. While biogeographical algorithms using next-generation sequencing data have achieved an accuracy of 700 km in Europe, they were inaccurate elsewhere. Here we describe the Geographic Population Structure (GPS) algorithm and demonstrate its accuracy with three data sets using 40,000-130,000 SNPs. GPS placed 83% of worldwide individuals in their country of origin. Applied to over 200 Sardinians villagers, GPS placed a quarter of them in their villages and most of the rest within 50 km of their villages. GPS's accuracy and power to infer the biogeography of worldwide individuals down to their country or, in some cases, village, of origin, underscores the promise of admixture-based methods for biogeography and has ramifications for genetic ancestry testing.