Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 27
Filtrar
Mais filtros

Base de dados
País/Região como assunto
Tipo de documento
País de afiliação
Intervalo de ano de publicação
1.
Nucleic Acids Res ; 51(D1): D1109-D1116, 2023 01 06.
Artigo em Inglês | MEDLINE | ID: mdl-36243989

RESUMO

Structural variations (SVs) play important roles in human evolution and diseases, but there is a lack of data resources concerning representative samples, especially for East Asians. Taking advantage of both next-generation sequencing and third-generation sequencing data at the whole-genome level, we developed the database PGG.SV to provide a practical platform for both regionally and globally representative structural variants. In its current version, PGG.SV archives 584 277 SVs obtained from whole-genome sequencing data of 6048 samples, including 1030 long-read sequencing genomes representing 177 global populations. PGG.SV provides (i) high-quality SVs with fine-scale and precise genomic locations in both GRCh37 and GRCh38, covering underrepresented SVs in existing sequencing and microarray data; (ii) hierarchical estimation of SV prevalence in geographical populations; (iii) informative annotations of SV-related genes, potential functions and clinical effects; (iv) an analysis platform to facilitate SV-based case-control association studies and (v) various visualization tools for understanding the SV structures in the human genome. Taken together, PGG.SV provides a user-friendly online interface, easy-to-use analysis tools and a detailed presentation of results. PGG.SV is freely accessible via https://www.biosino.org/pggsv.


Assuntos
Genômica , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Genômica/métodos , Sequenciamento Completo do Genoma , Genoma Humano , Bases de Dados Genéticas , Variação Estrutural do Genoma , Análise de Sequência de DNA/métodos
2.
Hum Mol Genet ; 27(6): 1067-1077, 2018 03 15.
Artigo em Inglês | MEDLINE | ID: mdl-29346564

RESUMO

Transcriptomic diversity across human populations reflects differential regulatory mechanisms. Allelic-imbalanced gene expression is a genetic regulatory mechanism that contributes to human phenotypic variation. To systematically investigate genome-wide allele-specific expression (ASE), we analyzed RNA-Seq data from European and African populations provided by the Geuvadis project. We identified 11 sites in 8 genes showing ASE in both Europeans and Africans, and 9 sites in 9 genes showing population-specific ASE, including both novel and known ASE signals. Notably, the top signal of differentiated ASE between inter-continental populations was observed in DNAJC15, of which the derived allele of rs12015, a single nucleotide polymorphism (SNP), showed significantly higher expression than did the ancestral allele specifically in European individuals. We identified a unique haplotype of DNAJC15, where a few SNPs highly differentiated between European and African populations were strongly linked to sites with high ASE. Among these, SNP rs17553284 affected the binding of several transcription factors as well as the genotype-dependent expression of DNAJC15. Therefore, we speculated that rs17553284 could be a regulatory causal variant that mediates the ASE of rs12015. We found several variations in ASE between intercontinental populations. The highly differentiated ASE genes identified here may implicate in the phenotypic variations among populations that are both evolutionarily and medically important.


Assuntos
População Negra/genética , Frequência do Gene , População Branca/genética , Regulação da Expressão Gênica , Variação Genética , Estudo de Associação Genômica Ampla/métodos , Proteínas de Choque Térmico HSP40/genética , Haplótipos , Humanos , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas , Análise de Sequência de RNA/métodos , Transcriptoma
3.
BMC Genomics ; 20(1): 842, 2019 Nov 12.
Artigo em Inglês | MEDLINE | ID: mdl-31718558

RESUMO

BACKGROUND: Recent advances in genomic technologies have facilitated genome-wide investigation of human genetic variations. However, most efforts have focused on the major populations, yet trio genomes of indigenous populations from Southeast Asia have been under-investigated. RESULTS: We analyzed the whole-genome deep sequencing data (~ 30×) of five native trios from Peninsular Malaysia and North Borneo, and characterized the genomic variants, including single nucleotide variants (SNVs), small insertions and deletions (indels) and copy number variants (CNVs). We discovered approximately 6.9 million SNVs, 1.2 million indels, and 9000 CNVs in the 15 samples, of which 2.7% SNVs, 2.3% indels and 22% CNVs were novel, implying the insufficient coverage of population diversity in existing databases. We identified a higher proportion of novel variants in the Orang Asli (OA) samples, i.e., the indigenous people from Peninsular Malaysia, than that of the North Bornean (NB) samples, likely due to more complex demographic history and long-time isolation of the OA groups. We used the pedigree information to identify de novo variants and estimated the autosomal mutation rates to be 0.81 × 10- 8 - 1.33 × 10- 8, 1.0 × 10- 9 - 2.9 × 10- 9, and ~ 0.001 per site per generation for SNVs, indels, and CNVs, respectively. The trio-genomes also allowed for haplotype phasing with high accuracy, which serves as references to the future genomic studies of OA and NB populations. In addition, high-frequency inherited CNVs specific to OA or NB were identified. One example is a 50-kb duplication in DEFA1B detected only in the Negrito trios, implying plausible effects on host defense against the exposure of diverse microbial in tropical rainforest environment of these hunter-gatherers. The CNVs shared between OA and NB groups were much fewer than those specific to each group. Nevertheless, we identified a 142-kb duplication in AMY1A in all the 15 samples, and this gene is associated with the high-starch diet. Moreover, novel insertions shared with archaic hominids were identified in our samples. CONCLUSION: Our study presents a full catalogue of the genome variants of the native Malaysian populations, which is a complement of the genome diversity in Southeast Asians. It implies specific population history of the native inhabitants, and demonstrated the necessity of more genome sequencing efforts on the multi-ethnic native groups of Malaysia and Southeast Asia.


Assuntos
Variação Genética , Genoma Humano , Animais , Bornéu/etnologia , Variações do Número de Cópias de DNA , Sequenciamento de Nucleotídeos em Larga Escala , Hominidae/genética , Humanos , Mutação INDEL , Malásia/etnologia , Taxa de Mutação
4.
Am J Hum Genet ; 99(3): 580-594, 2016 09 01.
Artigo em Inglês | MEDLINE | ID: mdl-27569548

RESUMO

The origin of Tibetans remains one of the most contentious puzzles in history, anthropology, and genetics. Analyses of deeply sequenced (30×-60×) genomes of 38 Tibetan highlanders and 39 Han Chinese lowlanders, together with available data on archaic and modern humans, allow us to comprehensively characterize the ancestral makeup of Tibetans and uncover their origins. Non-modern human sequences compose ∼6% of the Tibetan gene pool and form unique haplotypes in some genomic regions, where Denisovan-like, Neanderthal-like, ancient-Siberian-like, and unknown ancestries are entangled and elevated. The shared ancestry of Tibetan-enriched sequences dates back to ∼62,000-38,000 years ago, predating the Last Glacial Maximum (LGM) and representing early colonization of the plateau. Nonetheless, most of the Tibetan gene pool is of modern human origin and diverged from that of Han Chinese ∼15,000 to ∼9,000 years ago, which can be largely attributed to post-LGM arrivals. Analysis of ∼200 contemporary populations showed that Tibetans share ancestry with populations from East Asia (∼82%), Central Asia and Siberia (∼11%), South Asia (∼6%), and western Eurasia and Oceania (∼1%). Our results support that Tibetans arose from a mixture of multiple ancestral gene pools but that their origins are much more complicated and ancient than previously suspected. We provide compelling evidence of the co-existence of Paleolithic and Neolithic ancestries in the Tibetan gene pool, indicating a genetic continuity between pre-historical highland-foragers and present-day Tibetans. In particular, highly differentiated sequences harbored in highlanders' genomes were most likely inherited from pre-LGM settlers of multiple ancestral origins (SUNDer) and maintained in high frequency by natural selection.


Assuntos
Povo Asiático/genética , Fluxo Gênico/genética , Genoma Humano/genética , Filogenia , Altitude , Animais , China/etnologia , Etnicidade/genética , Pool Gênico , Genética Populacional , Haplótipos/genética , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Masculino , Modelos Genéticos , Homem de Neandertal/genética , Oceania/etnologia , Seleção Genética , Tibet
6.
Mol Biol Evol ; 34(10): 2572-2582, 2017 10 01.
Artigo em Inglês | MEDLINE | ID: mdl-28595347

RESUMO

The Uyghur people residing in Xinjiang, a territory located in the far west of China and crossed by the Silk Road, are a key ethnic group for understanding the history of human dispersion in Eurasia. Here we assessed the genetic structure and ancestry of 951 Xinjiang's Uyghurs (XJU) representing 14 geographical subpopulations. We observed a southwest and northeast differentiation within XJU, which was likely shaped jointly by the Tianshan Mountains, which traverses from east to west as a natural barrier, and gene flow from both east and west directions. In XJU, we identified four major ancestral components that were potentially derived from two earlier admixed groups: one from the West, harboring European (25-37%) and South Asian ancestries (12-20%), and the other from the East, with Siberian (15-17%) and East Asian (29-47%) ancestries. By using a newly developed method, MultiWaver, the complex admixture history of XJU was modeled as a two-wave admixture. An ancient wave was dated back to ∼3,750 years ago (ya), which is much earlier than that estimated by previous studies, but fits within the range of dating of mummies that exhibited European features that were discovered in the Tarim basin, which is situated in southern Xinjiang (4,000-2,000 ya); a more recent wave occurred around 750 ya, which is in agreement with the estimate from a recent study using other methods. We unveiled a more complex scenario of ancestral origins and admixture history in XJU than previously reported, which further suggests Bronze Age massive migrations in Eurasia and East-West contacts across the Silk Road.


Assuntos
Povo Asiático/genética , Etnicidade/genética , Genética Populacional/métodos , China/etnologia , Fluxo Gênico , Geografia , Haplótipos/genética , Humanos , Filogeografia , Polimorfismo de Nucleotídeo Único/genética , População Branca/genética
7.
Am J Hum Genet ; 97(1): 54-66, 2015 Jul 02.
Artigo em Inglês | MEDLINE | ID: mdl-26073780

RESUMO

Tibetan high-altitude adaptation (HAA) has been studied extensively, and many candidate genes have been reported. Subsequent efforts targeting HAA functional variants, however, have not been that successful (e.g., no functional variant has been suggested for the top candidate HAA gene, EPAS1). With WinXPCNVer, a method developed in this study, we detected in microarray data a Tibetan-enriched deletion (TED) carried by 90% of Tibetans; 50% were homozygous for the deletion, whereas only 3% carried the TED and 0% carried the homozygous deletion in 2,792 worldwide samples (p < 10(-15)). We employed long PCR and Sanger sequencing technologies to determine the exact copy number and breakpoints of the TED in 70 additional Tibetan and 182 diverse samples. The TED had identical boundaries (chr2: 46,694,276-46,697,683; hg19) and was 80 kb downstream of EPAS1. Notably, the TED was in strong linkage disequilibrium (LD; r(2) = 0.8) with EPAS1 variants associated with reduced blood concentrations of hemoglobin. It was also in complete LD with the 5-SNP motif, which was suspected to be introgressed from Denisovans, but the deletion itself was absent from the Denisovan sequence. Correspondingly, we detected that footprints of positive selection for the TED occurred 12,803 (95% confidence interval = 12,075-14,725) years ago. We further whole-genome deep sequenced (>60×) seven Tibetans and verified the TED but failed to identify any other copy-number variations with comparable patterns, giving this TED top priority for further study. We speculate that the specific patterns of the TED resulted from its own functionality in HAA of Tibetans or LD with a functional variant of EPAS1.


Assuntos
Adaptação Biológica/genética , Altitude , Fatores de Transcrição Hélice-Alça-Hélice Básicos/genética , Variações do Número de Cópias de DNA/genética , Etnicidade/genética , Evolução Molecular , Hominidae/genética , Algoritmos , Animais , Sequência de Bases , Genética Populacional , Hemoglobinas/genética , Hemoglobinas/metabolismo , Humanos , Desequilíbrio de Ligação , Análise em Microsséries/métodos , Dados de Sequência Molecular , Reação em Cadeia da Polimerase/métodos , Análise de Sequência de DNA , Tibet
8.
Heredity (Edinb) ; 120(1): 83-89, 2018 01.
Artigo em Inglês | MEDLINE | ID: mdl-29234170

RESUMO

Disease-associated variants in the human genome are continually being identified using DNA sequencing technologies that are especially effective for Mendelian disorders. Here we sequenced whole genome to high coverage (>30×) of 6 members of a 7-generation family with dwarfism from a consanguineous tribe in Pakistan to determine the causal variant(s). We identified a missense variant rs111033552 (c.2011T>C [p.Ser671Pro]) located in COL10A1 (encodes the alpha chain of type X collagen) as the most likely contributor to the dwarfism. We further confirmed the variant in 22 family members using Sanger sequencing. All affected individuals are heterozygous for the missense mutation rs111033552 and no individual homozygous was observed. Moreover, the mutation was absent in 69,985 individuals representing >150 global populations. Taking advantage of whole-genome sequencing data, we also examined other variant forms, including copy number variation and insertion/deletion, but failed to identify such variants enriched in the affected individuals. Thus rs111033552 had priority for linkage with dwarfism.


Assuntos
Colágeno Tipo XI/genética , Nanismo/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Mutação de Sentido Incorreto , Mutação Puntual , Sequenciamento Completo do Genoma/métodos , Adolescente , Adulto , Sequência de Bases , Criança , Consanguinidade , Saúde da Família , Feminino , Predisposição Genética para Doença/genética , Heterozigoto , Humanos , Masculino , Pessoa de Meia-Idade , Paquistão , Linhagem , Polimorfismo de Nucleotídeo Único , Homologia de Sequência do Ácido Nucleico , Adulto Jovem
9.
J Med Genet ; 54(10): 685-692, 2017 10.
Artigo em Inglês | MEDLINE | ID: mdl-28705883

RESUMO

BACKGROUND: Copy number variation (CNV) is a valuable source of genetic diversity in the human genome and a well-recognised cause of various genetic diseases. However, CNVs have been considerably under-represented in population-based studies, particularly the Han Chinese which is the largest ethnic group in the world. OBJECTIVES: To build a representative CNV map for the Han Chinese population. METHODS: We conducted a genome-wide CNV study involving 451 male Han Chinese samples from 11 geographical regions encompassing 28 dialect groups, representing a less-biased panel compared with the currently available data. We detected CNVs by using 4.2M NimbleGen comparative genomic hybridisation array and whole-genome deep sequencing of 51 samples to optimise the filtering conditions in CNV discovery. RESULTS: A comprehensive Han Chinese CNV map was built based on a set of high-quality variants (positive predictive value >0.8, with sizes ranging from 369 bp to 4.16 Mb and a median of 5907 bp). The map consists of 4012 CNV regions (CNVRs), and more than half are novel to the 30 East Asian CNV Project and the 1000 Genomes Project Phase 3. We further identified 81 CNVRs specific to regional groups, which was indicative of the subpopulation structure within the Han Chinese population. CONCLUSIONS: Our data are complementary to public data sources, and the CNV map may facilitate in the identification of pathogenic CNVs and further biomedical research studies involving the Han Chinese population.


Assuntos
Povo Asiático/genética , Variações do Número de Cópias de DNA , Etnicidade/genética , Variação Genética , Genoma Humano , China , Humanos , Masculino
10.
Hum Genet ; 135(11): 1279-1286, 2016 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-27487801

RESUMO

Hair straightness/curliness is one of the most conspicuous features of human variation and is particularly diverse among populations. A recent genome-wide scan found common variants in the Trichohyalin (TCHH) gene that are associated with hair straightness in Europeans, but different genes might affect this phenotype in other populations. By sampling 2899 Han Chinese, we performed the first genome-wide scan of hair straightness in East Asians, and found EDAR (rs3827760) as the predominant gene (P = 4.67 × 10-16), accounting for 3.66 % of the total variance. The candidate gene approach did not find further significant associations, suggesting that hair straightness may be affected by a large number of genes with subtle effects. Notably, genetic variants associated with hair straightness in Europeans are generally low in frequency in Han Chinese, and vice versa. To evaluate the relative contribution of these variants, we performed a second genome-wide scan in 709 samples from the Uyghur, an admixed population with both eastern and western Eurasian ancestries. In Uyghurs, both EDAR (rs3827760: P = 1.92 × 10-12) and TCHH (rs11803731: P = 1.46 × 10-3) are associated with hair straightness, but EDAR (OR 0.415) has a greater effect than TCHH (OR 0.575). We found no significant interaction between EDAR and TCHH (P = 0.645), suggesting that these two genes affect hair straightness through different mechanisms. Furthermore, haplotype analysis indicates that TCHH is not subject to selection. While EDAR is under strong selection in East Asia, it does not appear to be subject to selection after the admixture in Uyghurs. These suggest that hair straightness is unlikely a trait under selection.


Assuntos
Antígenos/genética , Receptor Edar/genética , Estudo de Associação Genômica Ampla , Cabelo , Proteínas de Filamentos Intermediários/genética , Povo Asiático/genética , China , Feminino , Frequência do Gene , Predisposição Genética para Doença , Cabelo/crescimento & desenvolvimento , Cabelo/metabolismo , Cabelo/ultraestrutura , Haplótipos , Humanos , Masculino , Fenótipo , Polimorfismo de Nucleotídeo Único , População Branca/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA