Results 1 - 13 of 13
1.
Nucleic Acids Res ; 46(D1): D984-D993, 2018 01 04.
Article in English | MEDLINE | ID: mdl-29112749

ABSTRACT

There are a growing number of studies focusing on delineating genetic variations that are associated with complex human traits and diseases due to recent advances in next-generation sequencing technologies. However, identifying and prioritizing disease-associated causal variants relies on understanding the distribution of genetic variations within and among populations. The PGG.Population database documents 7122 genomes representing 356 global populations from 107 countries and provides essential information for researchers to understand human genomic diversity and genetic ancestry. These data and information can facilitate the design of research studies and the interpretation of results of both evolutionary and medical studies involving human populations. The database is carefully maintained and constantly updated when new data are available. We included miscellaneous functions and a user-friendly graphical interface for visualization of genomic diversity, population relationships (genetic affinity), ancestral makeup, footprints of natural selection, and population history etc. Moreover, PGG.Population provides a useful feature for users to analyze data and visualize results in a dynamic style via online illustration. The long-term ambition of the PGG.Population, together with the joint efforts from other researchers who contribute their data to our database, is to create a comprehensive depository of geographic and ethnic variation of human genome, as well as a platform bringing influence on future practitioners of medicine and clinical investigators. PGG.Population is available at https://www.pggpopulation.org.


Subjects
Databases, Genetic , Genetic Variation , Genetics, Population , Genome, Human , Ethnicity/genetics , Genomics , Genotype , High-Throughput Nucleotide Sequencing , Humans
2.
Am J Hum Genet ; 99(3): 580-594, 2016 09 01.
Article in English | MEDLINE | ID: mdl-27569548

ABSTRACT

The origin of Tibetans remains one of the most contentious puzzles in history, anthropology, and genetics. Analyses of deeply sequenced (30×-60×) genomes of 38 Tibetan highlanders and 39 Han Chinese lowlanders, together with available data on archaic and modern humans, allow us to comprehensively characterize the ancestral makeup of Tibetans and uncover their origins. Non-modern human sequences compose ∼6% of the Tibetan gene pool and form unique haplotypes in some genomic regions, where Denisovan-like, Neanderthal-like, ancient-Siberian-like, and unknown ancestries are entangled and elevated. The shared ancestry of Tibetan-enriched sequences dates back to ∼62,000-38,000 years ago, predating the Last Glacial Maximum (LGM) and representing early colonization of the plateau. Nonetheless, most of the Tibetan gene pool is of modern human origin and diverged from that of Han Chinese ∼15,000 to ∼9,000 years ago, which can be largely attributed to post-LGM arrivals. Analysis of ∼200 contemporary populations showed that Tibetans share ancestry with populations from East Asia (∼82%), Central Asia and Siberia (∼11%), South Asia (∼6%), and western Eurasia and Oceania (∼1%). Our results support that Tibetans arose from a mixture of multiple ancestral gene pools but that their origins are much more complicated and ancient than previously suspected. We provide compelling evidence of the co-existence of Paleolithic and Neolithic ancestries in the Tibetan gene pool, indicating a genetic continuity between pre-historical highland-foragers and present-day Tibetans. In particular, highly differentiated sequences harbored in highlanders' genomes were most likely inherited from pre-LGM settlers of multiple ancestral origins (SUNDer) and maintained in high frequency by natural selection.


Subjects
Asian People/genetics , Gene Flow/genetics , Genome, Human/genetics , Phylogeny , Altitude , Animals , China/ethnology , Ethnicity/genetics , Gene Pool , Genetics, Population , Haplotypes/genetics , High-Throughput Nucleotide Sequencing , Humans , Male , Models, Genetic , Neanderthals/genetics , Oceania/ethnology , Selection, Genetic , Tibet
3.
Mol Biol Evol ; 34(10): 2572-2582, 2017 10 01.
Article in English | MEDLINE | ID: mdl-28595347

ABSTRACT

The Uyghur people residing in Xinjiang, a territory located in the far west of China and crossed by the Silk Road, are a key ethnic group for understanding the history of human dispersion in Eurasia. Here we assessed the genetic structure and ancestry of 951 Xinjiang's Uyghurs (XJU) representing 14 geographical subpopulations. We observed a southwest and northeast differentiation within XJU, which was likely shaped jointly by the Tianshan Mountains, which traverses from east to west as a natural barrier, and gene flow from both east and west directions. In XJU, we identified four major ancestral components that were potentially derived from two earlier admixed groups: one from the West, harboring European (25-37%) and South Asian ancestries (12-20%), and the other from the East, with Siberian (15-17%) and East Asian (29-47%) ancestries. By using a newly developed method, MultiWaver, the complex admixture history of XJU was modeled as a two-wave admixture. An ancient wave was dated back to ∼3,750 years ago (ya), which is much earlier than that estimated by previous studies, but fits within the range of dating of mummies that exhibited European features that were discovered in the Tarim basin, which is situated in southern Xinjiang (4,000-2,000 ya); a more recent wave occurred around 750 ya, which is in agreement with the estimate from a recent study using other methods. We unveiled a more complex scenario of ancestral origins and admixture history in XJU than previously reported, which further suggests Bronze Age massive migrations in Eurasia and East-West contacts across the Silk Road.


Subjects
Asian People/genetics , Ethnicity/genetics , Genetics, Population/methods , China/ethnology , Gene Flow , Geography , Haplotypes/genetics , Humans , Phylogeography , Polymorphism, Single Nucleotide/genetics , White People/genetics
4.
Am J Hum Genet ; 97(1): 54-66, 2015 Jul 02.
Article in English | MEDLINE | ID: mdl-26073780

ABSTRACT

Tibetan high-altitude adaptation (HAA) has been studied extensively, and many candidate genes have been reported. Subsequent efforts targeting HAA functional variants, however, have not been that successful (e.g., no functional variant has been suggested for the top candidate HAA gene, EPAS1). With WinXPCNVer, a method developed in this study, we detected in microarray data a Tibetan-enriched deletion (TED) carried by 90% of Tibetans; 50% were homozygous for the deletion, whereas only 3% carried the TED and 0% carried the homozygous deletion in 2,792 worldwide samples (p < 10⁻¹⁵). We employed long PCR and Sanger sequencing technologies to determine the exact copy number and breakpoints of the TED in 70 additional Tibetan and 182 diverse samples. The TED had identical boundaries (chr2: 46,694,276-46,697,683; hg19) and was 80 kb downstream of EPAS1. Notably, the TED was in strong linkage disequilibrium (LD; r² = 0.8) with EPAS1 variants associated with reduced blood concentrations of hemoglobin. It was also in complete LD with the 5-SNP motif, which was suspected to be introgressed from Denisovans, but the deletion itself was absent from the Denisovan sequence. Correspondingly, we detected that footprints of positive selection for the TED occurred 12,803 (95% confidence interval = 12,075-14,725) years ago. We further whole-genome deep sequenced (>60×) seven Tibetans and verified the TED but failed to identify any other copy-number variations with comparable patterns, giving this TED top priority for further study. We speculate that the specific patterns of the TED resulted from its own functionality in HAA of Tibetans or LD with a functional variant of EPAS1.
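The linkage disequilibrium statistic quoted in this abstract (r² = 0.8) can be illustrated with the textbook two-locus definition. The sketch below assumes phased haplotype frequencies are available; the function name and the example frequencies are hypothetical and are not taken from the study, which may have estimated r² differently (for instance from unphased genotypes).

```python
def ld_r_squared(p_ab, p_a, p_b):
    """Return r^2 between two biallelic loci.

    p_ab: frequency of haplotypes carrying allele A and allele B together
    p_a:  frequency of allele A at the first locus
    p_b:  frequency of allele B at the second locus
    """
    # D measures how far the joint frequency departs from independence.
    d = p_ab - p_a * p_b
    denominator = p_a * (1.0 - p_a) * p_b * (1.0 - p_b)
    if denominator == 0.0:
        raise ValueError("both loci must be polymorphic")
    return d * d / denominator


if __name__ == "__main__":
    # Hypothetical frequencies chosen only to show the calculation.
    print(ld_r_squared(p_ab=0.45, p_a=0.5, p_b=0.5))
```

A value of 1 means the two alleles always occur together on the same haplotypes; a value of 0 means they are statistically independent.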


Subjects
Adaptation, Biological/genetics , Altitude , Basic Helix-Loop-Helix Transcription Factors/genetics , DNA Copy Number Variations/genetics , Ethnicity/genetics , Evolution, Molecular , Hominidae/genetics , Algorithms , Animals , Base Sequence , Genetics, Population , Hemoglobins/genetics , Hemoglobins/metabolism , Humans , Linkage Disequilibrium , Microarray Analysis/methods , Molecular Sequence Data , Polymerase Chain Reaction/methods , Sequence Analysis, DNA , Tibet
5.
Heredity (Edinb) ; 121(1): 52-63, 2018 07.
Article in English | MEDLINE | ID: mdl-29358727

ABSTRACT

The ancestral tracks in admixed genomes are valuable for population history inference. While a few methods have been developed to infer admixture history based on ancestral tracks, these methods suffer the same flaw: only population admixture history under some specific models can be inferred. In addition, the inference of history might be biased or even unreliable if the specific model deviates from the real situation. To address this problem, we firstly proposed a general discrete admixture model to describe the admixture history with multiple ancestral populations and multiple-wave admixtures. We next deduced the length distribution of ancestral tracks under the general discrete admixture model. We further developed a new method, MultiWaver, to explore multiple-wave admixture histories. Our method could automatically determine an optimal admixture model based on the length distribution of ancestral tracks, and estimate the corresponding parameters under this optimal model. Specifically, we used a likelihood ratio test (LRT) to determine the number of admixture waves, and implemented an expectation-maximization (EM) algorithm to estimate parameters. We used simulation studies to validate the reliability and effectiveness of our method. Finally, good performance was observed when our method was applied to real data sets of African Americans and Mexicans, and new insights were gained into the admixture history of Uyghurs and Hazaras.
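The two inference steps named in this abstract, an EM algorithm for parameter estimation and a likelihood ratio test for the number of waves, can be sketched on a toy problem. The sketch assumes that ancestral track lengths follow a mixture of exponentials with one component per wave; that simplification, the function names, and the fixed threshold of 5.99 (the 0.95 chi-square quantile with two degrees of freedom) are all assumptions made here for illustration. This is not the MultiWaver implementation.

```python
import math
import random


def mixture_log_likelihood(lengths, weights, rates):
    """Log-likelihood of track lengths under a mixture of exponentials."""
    total = 0.0
    for x in lengths:
        density = sum(w * r * math.exp(-r * x) for w, r in zip(weights, rates))
        total += math.log(max(density, 1e-300))
    return total


def fit_exponential_mixture(lengths, k, iterations=200):
    """Fit a k-component exponential mixture with EM.

    Returns (weights, rates, log_likelihood).
    """
    n = len(lengths)
    mean = sum(lengths) / n
    # Spread the starting rates around 1/mean so the components are distinct.
    rates = [2.0 ** (j - (k - 1) / 2.0) / mean for j in range(k)]
    weights = [1.0 / k] * k
    for _ in range(iterations):
        sum_resp = [0.0] * k
        sum_resp_x = [0.0] * k
        # E-step: accumulate each component's responsibility for each track.
        for x in lengths:
            dens = [w * r * math.exp(-r * x) for w, r in zip(weights, rates)]
            norm = sum(dens) or 1e-300
            for j in range(k):
                resp = dens[j] / norm
                sum_resp[j] += resp
                sum_resp_x[j] += resp * x
        # M-step: closed-form updates for mixing weights and rates.
        for j in range(k):
            weights[j] = sum_resp[j] / n
            if sum_resp_x[j] > 0.0:
                rates[j] = sum_resp[j] / sum_resp_x[j]
    return weights, rates, mixture_log_likelihood(lengths, weights, rates)


def select_wave_count(lengths, max_waves=3, threshold=5.99):
    """Add components while the likelihood ratio statistic exceeds threshold."""
    best = fit_exponential_mixture(lengths, 1)
    waves = 1
    for k in range(2, max_waves + 1):
        candidate = fit_exponential_mixture(lengths, k)
        statistic = 2.0 * (candidate[2] - best[2])
        if statistic <= threshold:
            break
        best, waves = candidate, k
    return waves, best


if __name__ == "__main__":
    rng = random.Random(0)
    # Simulated track lengths from two hypothetical waves.
    tracks = [rng.expovariate(1.0) for _ in range(300)]
    tracks += [rng.expovariate(20.0) for _ in range(300)]
    print(select_wave_count(tracks))
```

The chi-square threshold is only a rough guide here, because the usual asymptotic theory does not strictly hold when testing the number of mixture components.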


Subjects
Genetics, Population , Models, Genetic , Algorithms , Computer Simulation , Data Analysis , Evolution, Molecular , Genetics, Population/methods , Humans , Population Density , Reproducibility of Results
6.
Hum Genet ; 135(11): 1279-1286, 2016 Nov.
Article in English | MEDLINE | ID: mdl-27487801

ABSTRACT

Hair straightness/curliness is one of the most conspicuous features of human variation and is particularly diverse among populations. A recent genome-wide scan found common variants in the Trichohyalin (TCHH) gene that are associated with hair straightness in Europeans, but different genes might affect this phenotype in other populations. By sampling 2899 Han Chinese, we performed the first genome-wide scan of hair straightness in East Asians, and found EDAR (rs3827760) as the predominant gene (P = 4.67 × 10⁻¹⁶), accounting for 3.66% of the total variance. The candidate gene approach did not find further significant associations, suggesting that hair straightness may be affected by a large number of genes with subtle effects. Notably, genetic variants associated with hair straightness in Europeans are generally low in frequency in Han Chinese, and vice versa. To evaluate the relative contribution of these variants, we performed a second genome-wide scan in 709 samples from the Uyghur, an admixed population with both eastern and western Eurasian ancestries. In Uyghurs, both EDAR (rs3827760: P = 1.92 × 10⁻¹²) and TCHH (rs11803731: P = 1.46 × 10⁻³) are associated with hair straightness, but EDAR (OR 0.415) has a greater effect than TCHH (OR 0.575). We found no significant interaction between EDAR and TCHH (P = 0.645), suggesting that these two genes affect hair straightness through different mechanisms. Furthermore, haplotype analysis indicates that TCHH is not subject to selection. While EDAR is under strong selection in East Asia, it does not appear to be subject to selection after the admixture in Uyghurs. These results suggest that hair straightness is unlikely to be a trait under selection.


Subjects
Antigens/genetics , Edar Receptor/genetics , Genome-Wide Association Study , Hair , Intermediate Filament Proteins/genetics , Asian People/genetics , China , Female , Gene Frequency , Genetic Predisposition to Disease , Hair/growth & development , Hair/metabolism , Hair/ultrastructure , Haplotypes , Humans , Male , Phenotype , Polymorphism, Single Nucleotide , White People/genetics
7.
J Hum Genet ; 59(11): 615-22, 2014 Nov.
Article in English | MEDLINE | ID: mdl-25253659

ABSTRACT

The Middle East (ME) is an important crossroad where modern humans migrated 'out of Africa' and spread into Europe and Asia. After the initial peopling and long-term isolation leading to well-differentiated populations, the ME also had a crucial role in subsequent human migrations among Africa, Europe and Asia; thus, recent population admixture has been common in the ME. On the other hand, consanguinity, a well-known practice in the ME, often reduces genetic diversity and works in opposition to admixture. Here, we explored the degree to which admixture and consanguinity jointly affected genetic diversity in ME populations. Genome-wide single-nucleotide polymorphism data were generated in two representative ME populations (Arabian and Iranian), with comparisons made with populations worldwide. Our results revealed an overall higher genetic diversity in both ME populations relative to other non-African populations. We identified a much larger number of long runs of homozygosity in ME populations than in any other populations, which was most likely attributed to high levels of consanguineous marriages that significantly decreased both individual and population heterozygosity. Additionally, we were able to distinguish African, European and Asian ancestries in ME populations and quantify the impact of admixture and consanguinity with statistical approaches. Interestingly, genomic regions with significantly excessive ancestry from individual source populations are functionally enriched in olfactory pathways, which were suspected to be under natural selection. Our findings suggest that genetic admixture, consanguinity and natural selection have collectively shaped the genetic diversity of ME populations, which has important implications in both evolutionary studies and medical practices.
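The runs of homozygosity mentioned in this abstract can be illustrated with a deliberately simplified scan along one chromosome. The sketch assumes genotypes coded as alternate-allele counts (0, 1, 2) and ends a run at any heterozygous call; the function name and the default thresholds are hypothetical. Established tools typically tolerate a few heterozygous or missing calls per window, and the abstract does not state which procedure the authors used.

```python
def find_runs_of_homozygosity(positions, genotypes,
                              min_length_bp=1_000_000, min_snps=50):
    """Return (start_bp, end_bp, n_snps) for each qualifying homozygous run.

    positions: SNP coordinates in base pairs, sorted ascending
    genotypes: alternate-allele counts per SNP (0 or 2 = homozygous, 1 = het)
    """
    runs = []
    start = None   # position of the first SNP in the current run
    last = None    # position of the most recent homozygous SNP
    count = 0      # number of SNPs in the current run

    def close_run():
        # Keep the run only if it is long enough in both bp and SNP count.
        if start is not None and last - start >= min_length_bp and count >= min_snps:
            runs.append((start, last, count))

    for pos, genotype in zip(positions, genotypes):
        if genotype in (0, 2):
            if start is None:
                start, count = pos, 0
            count += 1
            last = pos
        else:
            close_run()
            start = None
    close_run()
    return runs


if __name__ == "__main__":
    # Tiny hypothetical example with thresholds scaled down to match.
    print(find_runs_of_homozygosity(
        [100, 200, 300, 400, 500, 600], [0, 2, 2, 1, 0, 0],
        min_length_bp=150, min_snps=3))
```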


Subjects
Consanguinity , Genetic Variation/genetics , Genetics, Population/statistics & numerical data , Polymorphism, Single Nucleotide , Algorithms , Asia/ethnology , Asian People/genetics , Europe/ethnology , Female , Gene Frequency , Genetics, Population/methods , Genome, Human/genetics , Genotype , Humans , Linear Models , Male , Middle East , Models, Genetic , White People/genetics
8.
EBioMedicine ; 105: 105195, 2024 Jul.
Article in English | MEDLINE | ID: mdl-38870545

ABSTRACT

BACKGROUND: Response to antipsychotic drugs (APD) varies greatly among individuals and is affected by genetic factors. This study aims to demonstrate genome-wide associations between copy number variants (CNVs) and response to APD in patients with schizophrenia. METHODS: A total of 3030 patients of Han Chinese ethnicity randomly received APD (aripiprazole, olanzapine, quetiapine, risperidone, ziprasidone, haloperidol and perphenazine) treatment for six weeks. This study is a secondary data analysis. Percentage change on the Positive and Negative Syndrome Scale (PANSS) reduction was used to assess APD efficacy, and more than 50% change was considered as APD response. Associations between CNV burden, gene set, CNV loci and CNV break-point and APD efficacy were analysed. FINDINGS: Higher CNV losses burden decreased the odds of 6-week APD response (OR = 0.66 [0.44, 0.98]). CNV losses in synaptic pathway involved in neurotransmitters were associated with 2-week PANSS reduction rate. CNV involved in sialylation (1p31.1 losses) and cellular metabolism (19q13.32 gains) associated with 6-week PANSS reduction rate at genome-wide significant level. Additional 36 CNVs associated with PANSS factors improvement. The OR of protective CNVs for 6-week APD response was 3.10 (95% CI: 1.33-7.19) and risk CNVs was 8.47 (95% CI: 1.92-37.43). CNV interacted with genetic risk score on APD efficacy (Beta = -1.53, SE = 0.66, P = 0.021). The area under curve to differ 6-week APD response attained 80.45% (95% CI: 78.07%-82.82%). INTERPRETATION: Copy number variants contributed to poor APD efficacy and synaptic pathway involved in neurotransmitter was highlighted. FUNDING: National Natural Science Foundation of China, National Key R&D Program of China, China Postdoctoral Science Foundation.
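The response definition in this abstract (more than 50% change in PANSS) can be made concrete with a small helper. The abstract does not give the exact formula, so the sketch below is an assumption: it computes the percentage reduction from baseline and exposes an optional `floor` parameter, because some studies subtract the minimum possible PANSS score of 30 from the baseline before dividing. The function names and the example scores are hypothetical.

```python
def panss_percent_reduction(baseline, endpoint, floor=0):
    """Percentage reduction in PANSS total score from baseline to endpoint.

    floor: value subtracted from the baseline in the denominator
           (0 for the raw formula, 30 for the floor-corrected variant).
    """
    if baseline <= floor:
        raise ValueError("baseline must exceed floor")
    return 100.0 * (baseline - endpoint) / (baseline - floor)


def is_responder(baseline, endpoint, threshold=50.0, floor=0):
    """True when the reduction is strictly greater than the threshold."""
    return panss_percent_reduction(baseline, endpoint, floor) > threshold


if __name__ == "__main__":
    # Hypothetical patient: baseline 90, week-6 score 40.
    print(panss_percent_reduction(90, 40), is_responder(90, 40))
```

The choice of `floor` changes who counts as a responder, so it should match whatever the original trial protocol specifies.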


Subjects
Antipsychotic Agents , DNA Copy Number Variations , Genome-Wide Association Study , Schizophrenia , Adult , Female , Humans , Male , Middle Aged , Antipsychotic Agents/therapeutic use , China , East Asian People , Schizophrenia/drug therapy , Schizophrenia/genetics , Treatment Outcome
9.
bioRxiv ; 2023 Aug 21.
Article in English | MEDLINE | ID: mdl-37662416

ABSTRACT

Blood lipid traits are treatable and heritable risk factors for heart disease, a leading cause of mortality worldwide. Although genome-wide association studies (GWAS) have discovered hundreds of variants associated with lipids in humans, most of the causal mechanisms of lipids remain unknown. To better understand the biological processes underlying lipid metabolism, we investigated the associations of plasma protein levels with total cholesterol (TC), triglycerides (TG), high-density lipoprotein cholesterol (HDL), and low-density lipoprotein cholesterol (LDL) in blood. We trained protein prediction models based on samples in the Multi-Ethnic Study of Atherosclerosis (MESA) and applied them to conduct proteome-wide association studies (PWAS) for lipids using the Global Lipids Genetics Consortium (GLGC) data. Of the 749 proteins tested, 42 were significantly associated with at least one lipid trait. Furthermore, we performed transcriptome-wide association studies (TWAS) for lipids using 9,714 gene expression prediction models trained on samples from peripheral blood mononuclear cells (PBMCs) in MESA and 49 tissues in the Genotype-Tissue Expression (GTEx) project. We found that although PWAS and TWAS can show different directions of associations in an individual gene, 40 out of 49 tissues showed a positive correlation between PWAS and TWAS signed p-values across all the genes, which suggests a high-level consistency between proteome-lipid associations and transcriptome-lipid associations.
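The comparison of PWAS and TWAS results through signed p-values can be sketched as follows. The abstract does not define "signed p-value", so this sketch assumes a common convention: minus log10 of the p-value, carrying the sign of the effect estimate. The function names and the per-gene numbers are hypothetical, and a plain Pearson correlation stands in for whatever correlation measure the authors used.

```python
import math


def signed_log_p(effect, p_value):
    """Return -log10(p) with the sign of the effect estimate."""
    return math.copysign(-math.log10(p_value), effect)


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


if __name__ == "__main__":
    # Hypothetical (effect, p-value) pairs for the same four genes.
    pwas = [(0.4, 1e-6), (-0.2, 1e-3), (0.1, 0.2), (-0.3, 1e-4)]
    twas = [(0.3, 1e-4), (-0.1, 0.05), (-0.05, 0.6), (-0.2, 1e-3)]
    pwas_scores = [signed_log_p(b, p) for b, p in pwas]
    twas_scores = [signed_log_p(b, p) for b, p in twas]
    print(pearson(pwas_scores, twas_scores))
```

A positive correlation across genes indicates that the two analyses tend to agree on direction and strength, even if individual genes disagree.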

10.
Eur J Hum Genet ; 27(1): 133-139, 2019 01.
Article in English | MEDLINE | ID: mdl-30206356

ABSTRACT

Our goal in developing the MultiWaver software series was to be able to infer population admixture history under various complex scenarios. The earlier version of MultiWaver considered only discrete admixture models. Here, we report a newly developed version, MultiWaver 2.0, that implements a more flexible framework and is capable of inferring multiple-wave admixture histories under both discrete and continuous admixture models. MultiWaver 2.0 can automatically select an optimal admixture model based on the length distribution of ancestral tracks of chromosomes, and the program can estimate the corresponding parameters under the selected model. Specifically, for discrete admixture models, we used a likelihood ratio test (LRT) to determine the optimal discrete model and an expectation-maximization algorithm to estimate the parameters. In addition, according to the principles of the Bayesian Information Criterion (BIC), we compared the optimal discrete model with several continuous admixture models. In MultiWaver 2.0, we also applied a bootstrapping technique to provide levels of support for the chosen model and the confidence interval (CI) of the estimations of admixture time. Simulation studies validated the reliability and effectiveness of our method. Finally, the program performed well when applied to real datasets of typical admixed populations, such as African Americans, Uyghurs, and Hazaras.
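The model comparison by Bayesian Information Criterion described in this abstract follows a standard formula, BIC = k ln(n) - 2 ln(L), where k is the number of free parameters, n the number of observations, and L the maximized likelihood; the model with the lowest BIC is preferred. The sketch below shows that comparison only. The model names, log-likelihoods, and parameter counts are hypothetical and do not come from MultiWaver 2.0.

```python
import math


def bic(log_likelihood, n_parameters, n_observations):
    """Bayesian Information Criterion; lower values are preferred."""
    return n_parameters * math.log(n_observations) - 2.0 * log_likelihood


def select_by_bic(models, n_observations):
    """Pick the model with the lowest BIC.

    models: dict mapping a model name to (log_likelihood, n_parameters)
    Returns (best_name, scores) where scores maps each name to its BIC.
    """
    scores = {
        name: bic(log_likelihood, n_parameters, n_observations)
        for name, (log_likelihood, n_parameters) in models.items()
    }
    return min(scores, key=scores.get), scores


if __name__ == "__main__":
    # Hypothetical fits of one discrete and one continuous admixture model.
    candidates = {
        "discrete_two_wave": (-1000.0, 3),
        "continuous": (-1001.0, 2),
    }
    print(select_by_bic(candidates, n_observations=1000))
```

In this hypothetical case the simpler model is chosen, because its slightly lower likelihood is outweighed by the penalty on the extra parameter.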


Subjects
Gene Flow , Genetics, Population/methods , Software , Humans
11.
Natl Sci Rev ; 6(6): 1201-1222, 2019 Nov.
Article in English | MEDLINE | ID: mdl-34691999

ABSTRACT

Human genetic adaptation to high altitudes (>2500 m) has been extensively studied over the last few years, but few functional adaptive genetic variants have been identified, largely owing to the lack of deep-genome sequencing data available to previous studies. Here, we build a list of putative adaptive variants, including 63 missense, 7 loss-of-function, 1,298 evolutionarily conserved variants and 509 expression quantitative trait loci. Notably, the top signal of selection is located in TMEM247, a transmembrane protein-coding gene. The Tibetan version of TMEM247 harbors one high-frequency (76.3%) missense variant, rs116983452 (c.248C>T; p.Ala83Val), with the T allele derived from archaic ancestry and carried by >94% of Tibetans but absent or in low frequencies (<3%) in non-Tibetan populations. The rs116983452-T is strongly and positively correlated with altitude and significantly associated with reduced hemoglobin concentration (p = 5.78 × 10⁻⁵), red blood cell count (p = 5.72 × 10⁻⁷) and hematocrit (p = 2.57 × 10⁻⁶). In particular, TMEM247-rs116983452 shows greater effect size and better predicts the phenotypic outcome than any EPAS1 variants in association with adaptive traits in Tibetans. Modeling the interaction between TMEM247-rs116983452 and EPAS1 variants indicates weak but statistically significant epistatic effects. Our results support that multiple variants may jointly deliver the fitness of the Tibetans on the plateau, where a complex model is needed to elucidate the adaptive evolution mechanism.

12.
Genomics Proteomics Bioinformatics ; 16(5): 382-385, 2018 10.
Article in English | MEDLINE | ID: mdl-30472416

ABSTRACT

Ancestry composition of populations and individuals has been extensively investigated in recent years due to advances in the genotyping and sequencing technologies. As the number of populations and individuals used for ancestry inference increases remarkably, say more than 100 populations or 1000 individuals, it is usually challenging to present the ancestry composition in a traditional way using a rectangular graph. To address this issue, we developed a program, AncestryPainter, which can illustrate the ancestry composition of populations and individuals with a rounded and nice-looking graph to save space. Individuals are depicted as length-fixed bars partitioned into colored segments representing different ancestries, and the population of interest can be highlighted as a pie chart in the center of the circle plot. In addition, AncestryPainter can also be applied to display personal ancestry in a way similar to that for displaying population ancestry. AncestryPainter is publicly available at http://www.picb.ac.cn/PGG/resource.php.


Subjects
Genetics, Population , Software , Computer Graphics , Humans
13.
Genome Biol ; 18(1): 115, 2017 06 15.
Article in English | MEDLINE | ID: mdl-28619099

ABSTRACT

BACKGROUND: The genetic relationships reported by recent studies between Sherpas and Tibetans are controversial. To gain insights into the population history and the genetic basis of high-altitude adaptation of the two groups, we analyzed genome-wide data in 111 Sherpas (Tibet and Nepal) and 177 Tibetans (Tibet and Qinghai), together with available data from present-day human populations. RESULTS: Sherpas and Tibetans show considerable genetic differences and can be distinguished as two distinct groups, even though the divergence between them (~3200-11,300 years ago) is much later than that between Han Chinese and either of the two groups (~6200-16,000 years ago). Sub-population structures exist in both Sherpas and Tibetans, corresponding to geographical or linguistic groups. Differentiation of genetic variants between Sherpas and Tibetans associated with adaptation to either high-altitude or ultraviolet radiation were identified and validated by genotyping additional Sherpa and Tibetan samples. CONCLUSIONS: Our analyses indicate that both Sherpas and Tibetans are admixed populations, but the findings do not support the previous hypothesis that Tibetans derive their ancestry from Sherpas and Han Chinese. Compared to Tibetans, Sherpas show higher levels of South Asian ancestry, while Tibetans show higher levels of East Asian and Central Asian/Siberian ancestry. We propose a new model to elucidate the differentiated demographic histories and local adaptations of Sherpas and Tibetans.


Subjects
Acclimatization/genetics , Adaptation, Physiological/genetics , Altitude Sickness/genetics , Genetic Variation , Altitude , Asian People/genetics , Ethnicity/genetics , Genetics, Population/history , Genotype , Haplotypes/genetics , History, Ancient , Humans , Tibet