RESUMO
MOTIVATION: Environmental monitoring of pathogens provides an accurate and timely source of information for public health authorities and policymakers. In the last 2 years, wastewater sequencing proved to be an effective way of detection and quantification of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) variants circulating in population. Wastewater sequencing produces substantial amounts of geographical and genomic data. Proper visualization of spatial and temporal patterns in these data is crucial for the assessment of the epidemiological situation and forecasting. Here, we present a web-based dashboard application for the visualization and analysis of data obtained from sequencing of environmental samples. The dashboard provides multi-layered visualization of geographical and genomic data. It allows to display frequencies of detected pathogen variants as well as individual mutation frequencies. The features of WAVES (Web-based tool for Analysis and Visualization of Environmental Samples) for early tracking and detection of novel variants in the wastewater are demonstrated in an example of BA.1 variant and the signature Spike mutation S: E484A. WAVES dashboard is easily customized through the editable configuration file and can be used for different types of pathogens and environmental samples. AVAILABILITY AND IMPLEMENTATION: WAVES source code is freely available at https://github.com/ptriska/WavesDash under MIT license. A demo version of this application can be accessed at: https://wavesdashboard.azurewebsites.net/.
Assuntos
COVID-19 , Águas Residuárias , Humanos , SARS-CoV-2/genética , Software , InternetRESUMO
Based on current studies, the incidence of Ewing sarcoma (ES) varies significantly by race and ethnicity, with the disease being most common in patients of European ancestry. However, race/ethnicity has generally been self-reported rather than formally evaluated at a population level using DNA evidence. Additionally, mitochondrial dysfunction is a hallmark of ES, yet there have been no reported studies of mitochondrial genetics in ES. Thus, we evaluated both the mitochondrial and nuclear ancestries of 420 pediatric ES patients in the United States using whole-genome sequencing. We found that the mitochondrial DNA (mtDNA) genomes of only six (1.4 %) patients belonged to African L haplogroups, while those of 90 % of the patients belonged to macrohaplogroup R, which includes haplogroup H, the most common maternal lineage in Europe. Compared to the general US population, European haplogroups were significantly enriched in ES patients (p < 2.2e-16) and the African haplogroups are significantly impoverished (p < 4.6e-16). Using the ancestry informative markers defined in a National Genographic study, the vast majority of patients exhibited significant nuclear ancestry originating from the Mediterranean, Northern Europe, and Southwest Asia, including all six patients with African L mtDNAs. Very few had primarily African nuclear ancestry. This is the first genomic epidemiology study to simultaneously interrogate the mitochondrial and nuclear ancestries of ES patients. While supporting previous findings of enriched European ancestry in ES patients, these results also suggest alternative hypotheses for the significant contribution of mitochondrial ancestry in ES patients, as well as the protective role of African ancestry.
Assuntos
DNA Mitocondrial , Sarcoma de Ewing , Humanos , Criança , DNA Mitocondrial/genética , Haplótipos , Sarcoma de Ewing/genética , População Negra , Mitocôndrias/genéticaRESUMO
BACKGROUND: Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has undergone different molecular changes, sprouting genetic variants of the original wildtype. Clinical comparisons between patients infected with alpha versus delta are scarce. METHODS: In this retrospective observational study, adult patients hospitalized with coronavirus disease 2019 (COVID-19) due to confirmed SARS-CoV2 alpha or delta infection were included. Patient characteristics, virologic and laboratory parameters, as well as the clinical course were compared in patients infected with alpha vs. delta variants. RESULTS: A total of 106 patients infected with alpha and 215 patients infected with delta were included. Patients infected with the delta variant were admitted to hospital earlier after symptom onset (6 vs. 7 days, pâ¯< 0.001). Blood levels of Creactive protein (43.3 vs. 62.9â¯mg/l, pâ¯= 0.02) and neutrophil count (3.81 vs. 4.53â¯G/l, pâ¯= 0.06) were lower in delta patients. Furthermore, at hospital admission cycle threshold (CT) values were significantly lower in patients infected with the delta variant (22.3 vs. 24.9, pâ¯< 0.001). Patients infected with the delta variant needed supplemental oxygen less often during disease course (50% vs. 64%, pâ¯= 0.02). Furthermore, there was a statistically non-significant trend towards a lower ICU admission rate among delta patients (16% vs. 24%, pâ¯= 0.08) CONCLUSION: Patients diagnosed with the delta variant were admitted to the hospital earlier, had a less severe course of disease and a higher viral replication on admission. This may provide a window of opportunity for antivirals in the hospital setting.
Assuntos
COVID-19 , SARS-CoV-2 , Adulto , Humanos , SARS-CoV-2/genética , COVID-19/diagnóstico , COVID-19/epidemiologia , Hospitalização , Estudos RetrospectivosRESUMO
The Sahel/Savannah belt harbors diverse populations with different demographic histories and different subsistence patterns. However, populations from this large African region are notably under-represented in genomic research. To investigate the population structure and adaptation history of populations from the Sahel/Savannah space, we generated dense genome-wide genotype data of 327 individuals-comprising 14 ethnolinguistic groups, including 10 previously unsampled populations. Our results highlight fine-scale population structure and complex patterns of admixture, particularly in Fulani groups and Arabic-speaking populations. Among all studied Sahelian populations, only the Rashaayda Arabic-speaking population from eastern Sudan shows a lack of gene flow from African groups, which is consistent with the short history of this population in the African continent. They are recent migrants from Saudi Arabia with evidence of strong genetic isolation during the last few generations and a strong demographic bottleneck. This population also presents a strong selection signal in a genomic region around the CNR1 gene associated with substance dependence and chronic stress. In Western Sahelian populations, signatures of selection were detected in several other genetic regions, including pathways associated with lactase persistence, immune response, and malaria resistance. Taken together, these findings refine our current knowledge of genetic diversity, population structure, migration, admixture and adaptation of human populations in the Sahel/Savannah belt and contribute to our understanding of human history and health.
Assuntos
Fluxo Gênico , Genética Populacional , Humanos , População Negra , Etnicidade/genética , Lactase/genética , HaplótiposRESUMO
SARS-CoV-2 surveillance by wastewater-based epidemiology is poised to provide a complementary approach to sequencing individual cases. However, robust quantification of variants and de novo detection of emerging variants remains challenging for existing strategies. We deep sequenced 3,413 wastewater samples representing 94 municipal catchments, covering >59% of the population of Austria, from December 2020 to February 2022. Our system of variant quantification in sewage pipeline designed for robustness (termed VaQuERo) enabled us to deduce the spatiotemporal abundance of predefined variants from complex wastewater samples. These results were validated against epidemiological records of >311,000 individual cases. Furthermore, we describe elevated viral genetic diversity during the Delta variant period, provide a framework to predict emerging variants and measure the reproductive advantage of variants of concern by calculating variant-specific reproduction numbers from wastewater. Together, this study demonstrates the power of national-scale WBE to support public health and promises particular value for countries without extensive individual monitoring.
Assuntos
COVID-19 , Vigilância Epidemiológica Baseada em Águas Residuárias , Humanos , Águas Residuárias , SARS-CoV-2/genética , COVID-19/epidemiologia , RNA ViralRESUMO
OBJECTIVES: Mode of subsistence is an important factor influencing dietary habits and the genetic structure of various populations through differential intensity of gene flow and selection pressures. Previous studies suggest that in Africa Taste 2 Receptor Member 16 (TAS2R16), which encodes the 7-transmembrane receptor protein for bitterness, might also be under positive selection pressure. METHODS: However, since sampling coverage of populations was limited, we created a new TAS2R16 population dataset from across the African Sahel/Savannah belt representing various local populations of differing subsistence modes, linguistic affiliations, and geographic provenience. We sequenced the TAS2R16 exon gene and analyzed 2250 haplotypes among 19 populations. RESULTS: We found no evidence for selection as a driving force of genetic variation at this locus; instead, we discovered a highly significant correlation between TAS2R16 genetic and geographical distances based on provenience of the sampled populations, strongly suggesting that genetic drift most likely prevailed over positive selection at this specific locus. We also found significant correlations with other independent loci, mainly in sedentary farmers. DISCUSSION: Our results do not support the notion that the genetic diversity of TAS2R16 in Sahelian populations was shaped by selective pressures. This could result from several alternative and not mutually exclusive mechanisms, of which the possibility that, due to the pleiotropic nature of TAS2R16, selective pressures on other traits could counterbalance those acting on bitter taste perception, or that the change of diet in the Neolithic generally relaxed selective pressure on this gene.
Assuntos
Papilas Gustativas , Paladar , Humanos , Paladar/genética , África , Estruturas Genéticas , DemografiaRESUMO
Fusion of the ZNF384 gene as the 3' partner to several different 5' partner genes occurs recurrently in B-cell precursor acute lymphoblastic and mixed phenotype B/myeloid leukemia. These canonical fusions (ZNF384r) contain the complete ZNF384 coding sequence and are associated with a specific gene expression signature. Cases with this signature, but without canonical ZNF384 fusions (ZNF384r-like cases), have been described previously. Although some have been shown to harbor ZNF362 fusions, the primary aberrations remain unknown in a major proportion. We studied 3 patients with the ZNF384r signature and unknown primary genetic background and identified a previously unknown class of genetic aberration affecting the last exon of ZNF384 and resulting in disruption of the C-terminal portion of the ZNF384 protein. Importantly, in 2 cases, the ZNF384 aberration, indel, was missed during the bioinformatic analysis but revealed by the manual, targeted reanalysis. Two cases with the novel aberrations had a mixed (B/myeloid) immunophenotype commonly associated with canonical ZNF384 fusions. In conclusion, we present leukemia cases with a novel class of ZNF384 aberrations that phenocopy leukemia with ZNF384r. Therefore, we show that part of the so-called ZNF384r-like cases represent the same genetic subtype as leukemia with canonical ZNF384 fusions.
Assuntos
Leucemia Mieloide Aguda , Transativadores , Humanos , Imunofenotipagem , Leucemia Mieloide Aguda/genética , Transativadores/genética , Fatores de Transcrição , TranscriptomaRESUMO
BACKGROUND: We previously established the landscape of mitochondrial DNA (mtDNA) mutations in 23 subtypes of pediatric malignancies, characterized mtDNA mutation profiles among these subtypes, and provided statistically significant evidence for a contributory role of mtDNA mutations to pediatric malignancies. METHODS: To further delineate the spectrum of mtDNA mutations in pediatric central nervous system (CNS) tumors, we analyzed 545 tumor-normal paired whole-genome sequencing datasets from the Children's Brain Tumor Tissue Consortium. RESULTS: Germline mtDNA variants were used to determine the haplogroup, and maternal ancestry, which was not significantly different among tumor types. Among 166 (30.5%) tumors we detected 220 somatic mtDNA mutations, primarily missense mutations (36.8%), as well as 22 loss-of-function mutations. Different pediatric CNS tumor subtypes had distinct mtDNA mutation profiles. The number of mtDNA mutations per tumor ranged from 0.20 (dysembryoplastic neuroepithelial tumor [DNET]) to 0.75 (meningiomas). The average heteroplasmy was 10.7%, ranging from 4.6% in atypical teratoid/rhabdoid tumor (AT/RT) to 26% in diffuse intrinsic pontine glioma. High-grade gliomas had a significant higher number of mtDNA mutations per sample than low-grade gliomas (0.6 vs 0.27) (P = .004), with almost twice as many missense mtDNA mutations per sample (0.24 vs 0.11), and higher average heteroplasmy levels (16% vs 10%). Recurrent mtDNA mutations may represent hotspots which may serve as biologic markers of disease. CONCLUSIONS: Our findings demonstrate varying contributions of mtDNA mutations in different subtypes of CNS tumors. Sequencing the mtDNA genome may ultimately be used to characterize CNS tumors at diagnosis and monitor disease progression.
RESUMO
The Sahel/Savannah belt of Africa is a contact zone between two subsistence systems (nomadic pastoralism and sedentary farming) and of two groups of populations, namely Eurasians penetrating from northern Africa southwards and sub-Saharan Africans migrating northwards. Because pastoralism is characterized by a high degree of mobility, it leaves few significant archaeological traces. Demographic history seen through the lens of population genetic studies complements our historical and archaeological knowledge in this African region. In this review, we highlight recent advances in our understanding of demographic history in the Sahel/Savannah belt as revealed by genetic studies. We show the impact of food-producing subsistence strategies on population structure and the somewhat different migration patterns in the western and eastern part of the region. Genomic studies show that the gene pool of various groups of Sahelians consists in a complex mosaic of several ancestries. We also touch upon various signals of genetic adaptations such as lactase persistence, taste sensitivity and malaria resistance, all of which have different distribution patterns among Sahelian populations. Overall, genetic studies contribute to gain a deeper understanding about the demographic and adaptive history of human populations in this specific African region and beyond.
Assuntos
População Negra/genética , Genética Populacional/métodos , África Austral/etnologia , Agricultura , DNA Mitocondrial/genética , Pool Gênico , Haplótipos , Migração Humana , HumanosRESUMO
OBJECTIVES: The Sahel belt is occupied by populations who use two types of subsistence strategy, nomadic pastoralism and sedentary farming, and who belong to three linguistic families, Niger-Congo, Nilo-Saharan, and Afro-Asiatic. Little is known, however, about the origins of these two populations and their mutual genetic relationships. MATERIALS AND METHODS: We have built a large dataset of mitochondrial DNA sequences and Y chromosomal STR haplotypes of pastoralists and farmers belonging to all three linguistic phyla in the western, central, and eastern parts of the Sahel. We calculated pairwise genetic, geographic, and linguistic distances between populations and analyzed the effects of geography, language, and subsistence on population genetic structure. RESULTS: We found that subsistence mode significantly contributed to the generally low population structure in the Sahel and that language affiliation plays a more important role for pastoralists than for farmers. We also demonstrated that geographic isolation significantly influenced the population structure of sedentary farmers but not of nomadic pastoralists. Finally, we found haplotypes shared between the Fulani and Arabic-speaking Baggara, supporting the theory of Baggarization, which explains the recent adaptation of Arabic-speaking nomads in the Sahel region through contact with autochthonous sub-Saharan populations. CONCLUSIONS: Based on various genetic and archaeological evidence pertaining to the Sahel, we suggest that the idea of a bidirectional Sahelian corridor is valid, but that pastoralists made a more important contribution to its population structure. It is also possible that agropastoralists diverged into farmers and pastoralists in the early stages of formation of the Sahelian gene pool.
Assuntos
Cromossomos Humanos Y , DNA Mitocondrial/análise , Variação Genética , Estilo de Vida , Repetições de Microssatélites , África Central , África Oriental , África Ocidental , Agricultura/classificação , Evolução Cultural , Migração Humana , Humanos , Estilo de Vida/etnologia , MasculinoRESUMO
The mitochondrial genome is small, 16.5 kb, and yet complex to study due to an abundance of mitochondria in any given cell or tissue. Mitochondrial DNA (mtDNA) mutations have been previously described in cancer, many of which were detected at low heteroplasmy. In this study we enriched the mitochondrial genome in primary pediatric tumors for detection of mtDNA variants. We completed mtDNA enrichment using REPLI-g, Agilent SureSelect, and long-range polymerase chain reaction (LRPCR) followed by next generation sequencing (NGS) on Illumina platforms. Primary tumor and germline genomic DNA from a variety of pediatric central nervous system (CNS) and extra-CNS solid tumors were analyzed by the three different methods. Although all three methods performed equally well for detecting variants at high heteroplasmy or homoplasmy, only LRPCR and SureSelect-based enrichment methods provided consistent results for variants that were present at less than five percent heteroplasmy. We then applied both LRPCR and SureSelect to three successive samples from a patient with multiply-recurrent gliofibroma and detected a low-level novel mutation as well as a change in heteroplasmy levels of a synonymous variant that was correlated with progression of disease. IMPLICATION: This study demonstrates that LRPCR and SureSelect enrichment, but not REPLI-g, followed by NGS are accurate methods for studying the mtDNA variations at low heteroplasmy, which may be applied to studying mtDNA mutations in cancer.
Assuntos
Neoplasias do Sistema Nervoso Central/genética , Variação Genética/genética , Genoma Mitocondrial/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Técnicas de Amplificação de Ácido Nucleico/métodos , Reação em Cadeia da Polimerase/métodos , Sistema Nervoso Central/patologia , DNA Mitocondrial/genética , Heteroplasmia/genética , Humanos , Mitocôndrias/genéticaRESUMO
Little is known about the spectrum of mitochondrial DNA (mtDNA) mutations across pediatric malignancies. In this study, we analyzed matched tumor and normal whole genome sequencing data from 616 pediatric patients with hematopoietic malignancies, solid tumors, and brain tumors. We identified 391 mtDNA mutations in 284 tumors including 45 loss-of-function mutations, which clustered at four statistically significant hotspots in MT-COX3, MT-ND4, and MT-ND5, and at a mutation hotspot in MT-tRNA-MET. A skewed ratio (4.83) of nonsynonymous versus synonymous (dN/dS) mtDNA mutations with high statistical significance was identified on the basis of Monte Carlo simulations in the tumors. In comparison, opposite ratios of 0.44 and 0.93 were observed in 616 matched normal tissues and in 249 blood samples from children without cancer, respectively. mtDNA mutations varied by cancer type and mtDNA haplogroup. Collectively, these results suggest that deleterious mtDNA mutations play a role in the development and progression of pediatric cancers. SIGNIFICANCE: This pan-cancer mtDNA study establishes the landscape of germline and tumor mtDNA mutations and identifies hotspots of tumor mtDNA mutations to pinpoint key mitochondrial functions in pediatric malignancies.
Assuntos
DNA Mitocondrial/genética , Mutação , Neoplasias/genética , Estudos de Casos e Controles , Criança , Feminino , Genoma Mitocondrial , Humanos , MasculinoRESUMO
Ethnic groups can display differential genetic susceptibility to infectious diseases. The arthropod-born viral dengue disease is one such disease, with empirical and limited genetic evidence showing that African ancestry may be protective against the haemorrhagic phenotype. Global ancestry analysis based on high-throughput genotyping in admixed populations can be used to test this hypothesis, while admixture mapping can map candidate protective genes. A Cuban dengue fever cohort was genotyped using a 2.5 million SNP chip. Global ancestry was ascertained through ADMIXTURE and used in a fine-matched corrected association study, while local ancestry was inferred by the RFMix algorithm. The expression of candidate genes was evaluated by RT-PCR in a Cuban dengue patient cohort and gene set enrichment analysis was performed in a Thai dengue transcriptome. OSBPL10 and RXRA candidate genes were identified, with most significant SNPs placed in inferred weak enhancers, promoters and lncRNAs. OSBPL10 had significantly lower expression in Africans than Europeans, while for RXRA several SNPs may differentially regulate its transcription between Africans and Europeans. Their expression was confirmed to change through dengue disease progression in Cuban patients and to vary with disease severity in a Thai transcriptome dataset. These genes interact in the LXR/RXR activation pathway that integrates lipid metabolism and immune functions, being a key player in dengue virus entrance into cells, its replication therein and in cytokine production. Knockdown of OSBPL10 expression in THP-1 cells by two shRNAs followed by DENV2 infection tests led to a significant reduction in DENV replication, being a direct functional proof that the lower OSBPL10 expression profile in Africans protects this ancestry against dengue disease.
Assuntos
Metabolismo dos Lipídeos/genética , Receptores de Esteroides/genética , Receptor X Retinoide alfa/genética , Dengue Grave/genética , População Negra/genética , Cuba/etnologia , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Genótipo , Humanos , Análise de Sequência com Séries de Oligonucleotídeos , Reação em Cadeia da Polimerase , Polimorfismo de Nucleotídeo Único , Dengue Grave/etnologiaRESUMO
BACKGROUND: The history of human populations occupying the plains and mountain ridges separating Europe from Asia has been eventful, as these natural obstacles were crossed westward by multiple waves of Turkic and Uralic-speaking migrants as well as eastward by Europeans. Unfortunately, the material records of history of this region are not dense enough to reconstruct details of population history. These considerations stimulate growing interest to obtain a genetic picture of the demographic history of migrations and admixture in Northern Eurasia. RESULTS: We genotyped and analyzed 1076 individuals from 30 populations with geographical coverage spanning from Baltic Sea to Baikal Lake. Our dense sampling allowed us to describe in detail the population structure, provide insight into genomic history of numerous European and Asian populations, and significantly increase quantity of genetic data available for modern populations in region of North Eurasia. Our study doubles the amount of genome-wide profiles available for this region. We detected unusually high amount of shared identical-by-descent (IBD) genomic segments between several Siberian populations, such as Khanty and Ket, providing evidence of genetic relatedness across vast geographic distances and between speakers of different language families. Additionally, we observed excessive IBD sharing between Khanty and Bashkir, a group of Turkic speakers from Southern Urals region. While adding some weight to the "Finno-Ugric" origin of Bashkir, our studies highlighted that the Bashkir genepool lacks the main "core", being a multi-layered amalgamation of Turkic, Ugric, Finnish and Indo-European contributions, which points at intricacy of genetic interface between Turkic and Uralic populations. Comparison of the genetic structure of Siberian ethnicities and the geography of the region they inhabit point at existence of the "Great Siberian Vortex" directing genetic exchanges in populations across the Siberian part of Asia. Slavic speakers of Eastern Europe are, in general, very similar in their genetic composition. Ukrainians, Belarusians and Russians have almost identical proportions of Caucasus and Northern European components and have virtually no Asian influence. We capitalized on wide geographic span of our sampling to address intriguing question about the place of origin of Russian Starovers, an enigmatic Eastern Orthodox Old Believers religious group relocated to Siberia in seventeenth century. A comparative reAdmix analysis, complemented by IBD sharing, placed their roots in the region of the Northern European Plain, occupied by North Russians and Finno-Ugric Komi and Karelian people. Russians from Novosibirsk and Russian Starover exhibit ancestral proportions close to that of European Eastern Slavs, however, they also include between five to 10 % of Central Siberian ancestry, not present at this level in their European counterparts. CONCLUSIONS: Our project has patched the hole in the genetic map of Eurasia: we demonstrated complexity of genetic structure of Northern Eurasians, existence of East-West and North-South genetic gradients, and assessed different inputs of ancient populations into modern populations.
Assuntos
Emigração e Imigração/história , Etnicidade/genética , Genética Populacional , Algoritmos , Ásia , DNA , Conjuntos de Dados como Assunto , Europa (Continente) , Feminino , Variação Genética , Técnicas de Genotipagem , História do Século XV , História do Século XVI , História do Século XVII , História do Século XVIII , História do Século XIX , História do Século XX , História do Século XXI , História Antiga , História Medieval , Humanos , Masculino , Federação RussaRESUMO
The Kets, an ethnic group in the Yenisei River basin, Russia, are considered the last nomadic hunter-gatherers of Siberia, and Ket language has no transparent affiliation with any language family. We investigated connections between the Kets and Siberian and North American populations, with emphasis on the Mal'ta and Paleo-Eskimo ancient genomes, using original data from 46 unrelated samples of Kets and 42 samples of their neighboring ethnic groups (Uralic-speaking Nganasans, Enets, and Selkups). We genotyped over 130,000 autosomal SNPs, identified mitochondrial and Y-chromosomal haplogroups, and performed high-coverage genome sequencing of two Ket individuals. We established that Nganasans, Kets, Selkups, and Yukaghirs form a cluster of populations most closely related to Paleo-Eskimos in Siberia (not considering indigenous populations of Chukotka and Kamchatka). Kets are closely related to modern Selkups and to some Bronze and Iron Age populations of the Altai region, with all these groups sharing a high degree of Mal'ta ancestry. Implications of these findings for the linguistic hypothesis uniting Ket and Na-Dene languages into a language macrofamily are discussed.
Assuntos
DNA Mitocondrial/genética , Etnicidade/genética , Genoma Humano , Inuíte/genética , Filogenia , Polimorfismo de Nucleotídeo Único , Cromossomos Humanos Y , Variação Genética , Haplótipos , Migração Humana , Humanos , Idioma , Filogeografia , SibériaRESUMO
Genome-wide studies of African populations have the potential to reveal powerful insights into the evolution of our species, as these diverse populations have been exposed to intense selective pressures imposed by infectious diseases, diet, and environmental factors. Within Africa, the Sahel Belt extensively overlaps the geographical center of several endemic infections such as malaria, trypanosomiasis, meningitis, and hemorrhagic fevers. We screened 2.5 million single nucleotide polymorphisms in 161 individuals from 13 Sahelian populations, which together with published data cover Western, Central, and Eastern Sahel, and include both nomadic and sedentary groups. We confirmed the role of this Belt as a main corridor for human migrations across the continent. Strong admixture was observed in both Central and Eastern Sahelian populations, with North Africans and Near Eastern/Arabians, respectively, but it was inexistent in Western Sahelian populations. Genome-wide local ancestry inference in admixed Sahelian populations revealed several candidate regions that were significantly enriched for non-autochthonous haplotypes, and many showed to be under positive selection. The DARC gene region in Arabs and Nubians was enriched for African ancestry, whereas the RAB3GAP1/LCT/MCM6 region in Oromo, the TAS2R gene family in Fulani, and the ALMS1/NAT8 in Turkana and Samburu were enriched for non-African ancestry. Signals of positive selection varied in terms of geographic amplitude. Some genomic regions were selected across the Belt, the most striking example being the malaria-related DARC gene. Others were Western-specific (oxytocin, calcium, and heart pathways), Eastern-specific (lipid pathways), or even population-restricted (TAS2R genes in Fulani, which may reflect sexual selection).
Assuntos
Genoma Humano , Migração Humana , Seleção Genética , Acetiltransferases/genética , África , Proteínas de Ciclo Celular , Sistema do Grupo Sanguíneo Duffy/genética , Haplótipos , Febres Hemorrágicas Virais/genética , Humanos , Malária/genética , Meningite/genética , Componente 6 do Complexo de Manutenção de Minicromossomo/genética , Polimorfismo de Nucleotídeo Único , Proteínas/genética , Receptores de Superfície Celular/genética , Receptores Acoplados a Proteínas G/genéticaRESUMO
The genetic structure of human populations is extraordinarily complex and of fundamental importance to studies of anthropology, evolution, and medicine. As increasingly many individuals are of mixed origin, there is an unmet need for tools that can infer multiple origins. Misclassification of such individuals can lead to incorrect and costly misinterpretations of genomic data, primarily in disease studies and drug trials. We present an advanced tool to infer ancestry that can identify the biogeographic origins of highly mixed individuals. reAdmix can incorporate individual's knowledge of ancestors (e.g. having some ancestors from Turkey or a Scottish grandmother). reAdmix is an online tool available at http://chcb.saban-chla.usc.edu/reAdmix/.
Assuntos
Evolução Biológica , Biologia Computacional , Etnicidade/genética , Genética Médica/métodos , Animais , Humanos , SoftwareRESUMO
Multiple questions relating to contributions of cultural and demographical factors in the process of human geographical dispersal remain largely unanswered. India, a land of early human settlement and the resulting diversity is a good place to look for some of the answers. In this study, we explored the genetic structure of India using a diverse panel of 78 males genotyped using the GenoChip. Their genome-wide single-nucleotide polymorphism (SNP) diversity was examined in the context of various covariates that influence Indian gene pool. Admixture analysis of genome-wide SNP data showed high proportion of the Southwest Asian component in all of the Indian samples. Hierarchical clustering based on admixture proportions revealed seven distinct clusters correlating to geographical and linguistic affiliations. Convex hull overlay of Y-chromosomal haplogroups on the genome-wide SNP principal component analysis brought out distinct non-overlapping polygons of F*-M89, H*-M69, L1-M27, O2a-M95 and O3a3c1-M117, suggesting a male-mediated migration and expansion of the Indian gene pool. Lack of similar correlation with mitochondrial DNA clades indicated a shared genetic ancestry of females. We suggest that ancient male-mediated migratory events and settlement in various regional niches led to the present day scenario and peopling of India.
Assuntos
Emigração e Imigração , Fluxo Gênico , Pool Gênico , Cromossomos Humanos Y/genética , Análise por Conglomerados , Emigração e Imigração/estatística & dados numéricos , Feminino , Genética Populacional , Genoma Humano , Estudo de Associação Genômica Ampla , Humanos , Índia , Masculino , Polimorfismo de Nucleotídeo ÚnicoRESUMO
At the crossroads between Africa and Eurasia, Arabia is necessarily a melting pot, its peoples enriched by successive gene flow over the generations. Estimating the timing and impact of these multiple migrations are important steps in reconstructing the key demographic events in the human history. However, current methods based on genome-wide information identify admixture events inefficiently, tending to estimate only the more recent ages, as here in the case of admixture events across the Red Sea (~8-37 generations for African input into Arabia, and 30-90 generations for "back-to-Africa" migrations). An mtDNA-based founder analysis, corroborated by detailed analysis of the whole-mtDNA genome, affords an alternative means by which to identify, date and quantify multiple migration events at greater time depths, across the full range of modern human history, albeit for the maternal line of descent only. In Arabia, this approach enables us to infer several major pulses of dispersal between the Near East and Arabia, most likely via the Gulf corridor. Although some relict lineages survive in Arabia from the time of the out-of-Africa dispersal, 60 ka, the major episodes in the peopling of the Peninsula took place from north to south in the Late Glacial and, to a lesser extent, the immediate post-glacial/Neolithic. Exchanges across the Red Sea were mainly due to the Arab slave trade and maritime dominance (from ~2.5 ka to very recent times), but had already begun by the early Holocene, fuelled by the establishment of maritime networks since ~8 ka. The main "back-to-Africa" migrations, again undetected by genome-wide dating analyses, occurred in the Late Glacial period for introductions into eastern Africa, whilst the Neolithic was more significant for migrations towards North Africa.
Assuntos
DNA Mitocondrial/genética , Demografia/história , Fluxo Gênico , Migração Humana/história , África , Arábia , Efeito Fundador , Genômica , Haplótipos , História Antiga , Humanos , Filogenia , Análise de Componente PrincipalRESUMO
Mitochondrial proteins are coded by nuclear (nDNA) and mitochondrial (mtDNA) genes, implying a complex cross-talk between the two genomes. Here we investigated the diversity displayed in 104 nuclear-coded mitochondrial proteins from 1,092 individuals from the 1000 Genomes dataset, in order to evaluate if these genes are under the effects of purifying selection and how that selection compares with their mitochondrial encoded counterparts. Only the very rare variants (frequency < 0.1%) in these nDNA genes are indistinguishable from a random set from all possible variants in terms of predicted pathogenicity score, but more frequent variants display distinct signs of purifying selection. Comparisons of selection strength indicate stronger selection in the mtDNA genes compared to this set of nDNA genes, accounted for by the high hydrophobicity of the proteins coded by the mtDNA. Most of the predicted pathogenic variants in the nDNA genes were restricted to a single continental population. The proportion of individuals having at least one potential pathogenic mutation in this gene set was significantly lower in Europeans than in Africans and Asians. This difference may reflect demographic asymmetries, since African and Asian populations experienced main expansions in middle Holocene, while in Europeans the main expansions occurred earlier in the post-glacial period.