Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 31
Filtrar
1.
J Hepatol ; 80(1): 41-52, 2024 01.
Artigo em Inglês | MEDLINE | ID: mdl-37858684

RESUMO

BACKGROUND & AIMS: HBsAg loss is only observed in a small proportion of patients with chronic hepatitis B (CHB) who undergo interferon treatment. Investigating the host factors crucial for functional cure of CHB can aid in identifying individuals who would benefit from peginterferon-α (Peg-IFNα) therapy. METHODS: We conducted a genome-wide association study (GWAS) by enrolling 48 patients with CHB who achieved HBsAg loss after Peg-IFNα treatment and 47 patients who didn't. In the validation stage, we included 224 patients, of whom 90 had achieved HBsAg loss, to validate the identified significant single nucleotide polymorphisms. To verify the functional involvement of the candidate genes identified, we performed a series of in vitro and in vivo experiments. RESULTS: GWAS results indicated a significant association between the rs7519753 C allele and serum HBsAg loss in patients with CHB after Peg-IFNα treatment (p = 4.85 × 10-8, odds ratio = 14.47). This association was also observed in two independent validation cohorts. Expression quantitative trait locus analysis revealed higher hepatic TP53BP2 expression in individuals carrying the rs7519753 C allele (p = 2.90 × 10-6). RNA-sequencing of liver biopsies from patients with CHB after Peg-IFNα treatment revealed that hepatic TP53BP2 levels were significantly higher in the HBsAg loss group compared to the HBsAg persistence group (p = 0.035). In vitro and in vivo experiments demonstrated that loss of TP53BP2 decreased interferon-stimulated gene levels and the anti-HBV effect of IFN-α. Mechanistically, TP53BP2 was found to downregulate SOCS2, thereby facilitating JAK/STAT signaling. CONCLUSION: The rs7519753 C allele is associated with elevated hepatic TP53BP2 expression and an increased probability of serum HBsAg loss post-Peg-IFNα treatment in patients with CHB. TP53BP2 enhances the response of the hepatocyte to IFN-α by suppressing SOCS2 expression. IMPACT AND IMPLICATIONS: Chronic hepatitis B (CHB) remains a global public health issue. Although current antiviral therapies are more effective in halting disease progression, only a few patients achieve functional cure for hepatitis B with HBsAg loss, highlighting the urgent need for a cure for CHB. This study revealed that the rs7519753 C allele, which is associated with high expression of hepatic TP53BP2, significantly increases the likelihood of serum HBsAg loss in patients with CHB undergoing Peg-IFNα treatment. This finding not only provides a promising predictor for HBsAg loss but identifies a potential therapeutic target for Peg-IFNα treatment. We believe our results are of great interest to a wide range of stakeholders based on their potential clinical implications.


Assuntos
Antivirais , Hepatite B Crônica , Humanos , Antivirais/uso terapêutico , Antígenos de Superfície da Hepatite B/genética , Hepatite B Crônica/tratamento farmacológico , Hepatite B Crônica/genética , Estudo de Associação Genômica Ampla , Quimioterapia Combinada , Interferon-alfa/farmacologia , Interferon-alfa/uso terapêutico , Polietilenoglicóis/uso terapêutico , Antígenos E da Hepatite B , Proteínas Recombinantes/uso terapêutico , Resultado do Tratamento , DNA Viral/genética , Proteínas Reguladoras de Apoptose
2.
Nature ; 548(7665): 87-91, 2017 08 03.
Artigo em Inglês | MEDLINE | ID: mdl-28746312

RESUMO

Hundreds of thousands of human genomes are now being sequenced to characterize genetic variation and use this information to augment association mapping studies of complex disorders and other phenotypic traits. Genetic variation is identified mainly by mapping short reads to the reference genome or by performing local assembly. However, these approaches are biased against discovery of structural variants and variation in the more complex parts of the genome. Hence, large-scale de novo assembly is needed. Here we show that it is possible to construct excellent de novo assemblies from high-coverage sequencing with mate-pair libraries extending up to 20 kilobases. We report de novo assemblies of 150 individuals (50 trios) from the GenomeDenmark project. The quality of these assemblies is similar to those obtained using the more expensive long-read technology. We use the assemblies to identify a rich set of structural variants including many novel insertions and demonstrate how this variant catalogue enables further deciphering of known association mapping signals. We leverage the assemblies to provide 100 completely resolved major histocompatibility complex haplotypes and to resolve major parts of the Y chromosome. Our study provides a regional reference genome that we expect will improve the power of future association mapping studies and hence pave the way for precision medicine initiatives, which now are being launched in many countries including Denmark.


Assuntos
Variação Genética/genética , Genética Populacional/normas , Genoma Humano/genética , Genômica/normas , Análise de Sequência de DNA/normas , Adulto , Alelos , Criança , Cromossomos Humanos Y/genética , Dinamarca , Feminino , Haplótipos/genética , Humanos , Complexo Principal de Histocompatibilidade/genética , Masculino , Idade Materna , Taxa de Mutação , Idade Paterna , Mutação Puntual/genética , Padrões de Referência
3.
Nature ; 496(7443): 87-90, 2013 Apr 04.
Artigo em Inglês | MEDLINE | ID: mdl-23535596

RESUMO

Bread wheat (Triticum aestivum, AABBDD) is one of the most widely cultivated and consumed food crops in the world. However, the complex polyploid nature of its genome makes genetic and functional analyses extremely challenging. The A genome, as a basic genome of bread wheat and other polyploid wheats, for example, T. turgidum (AABB), T. timopheevii (AAGG) and T. zhukovskyi (AAGGA(m)A(m)), is central to wheat evolution, domestication and genetic improvement. The progenitor species of the A genome is the diploid wild einkorn wheat T. urartu, which resembles cultivated wheat more extensively than do Aegilops speltoides (the ancestor of the B genome) and Ae. tauschii (the donor of the D genome), especially in the morphology and development of spike and seed. Here we present the generation, assembly and analysis of a whole-genome shotgun draft sequence of the T. urartu genome. We identified protein-coding gene models, performed genome structure analyses and assessed its utility for analysing agronomically important genes and for developing molecular markers. Our T. urartu genome assembly provides a diploid reference for analysis of polyploid wheat genomes and is a valuable resource for the genetic improvement of wheat.


Assuntos
Genoma de Planta/genética , Triticum/genética , Sequência de Bases , Brachypodium/genética , Produtos Agrícolas/classificação , Produtos Agrícolas/genética , Diploide , Marcadores Genéticos/genética , Dados de Sequência Molecular , Oryza/genética , Filogenia , Sorghum/genética , Sintenia/genética , Triticum/classificação , Zea mays/genética
4.
Genet Med ; 20(7): 697-707, 2018 07.
Artigo em Inglês | MEDLINE | ID: mdl-29095815

RESUMO

PURPOSE: Recent studies demonstrate that whole-genome sequencing enables detection of cryptic rearrangements in apparently balanced chromosomal rearrangements (also known as balanced chromosomal abnormalities, BCAs) previously identified by conventional cytogenetic methods. We aimed to assess our analytical tool for detecting BCAs in the 1000 Genomes Project without knowing which bands were affected. METHODS: The 1000 Genomes Project provides an unprecedented integrated map of structural variants in phenotypically normal subjects, but there is no information on potential inclusion of subjects with apparent BCAs akin to those traditionally detected in diagnostic cytogenetics laboratories. We applied our analytical tool to 1,166 genomes from the 1000 Genomes Project with sufficient physical coverage (8.25-fold). RESULTS: With this approach, we detected four reciprocal balanced translocations and four inversions, ranging in size from 57.9 kb to 13.3 Mb, all of which were confirmed by cytogenetic methods and polymerase chain reaction studies. One of these DNAs has a subtle translocation that is not readily identified by chromosome analysis because of the similarity of the banding patterns and size of exchanged segments, and another results in disruption of all transcripts of an OMIM gene. CONCLUSION: Our study demonstrates the extension of utilizing low-pass whole-genome sequencing for unbiased detection of BCAs including translocations and inversions previously unknown in the 1000 Genomes Project.


Assuntos
Transtornos Cromossômicos/diagnóstico , Análise Citogenética/métodos , Aberrações Cromossômicas , Inversão Cromossômica/genética , Cromossomos/genética , Rearranjo Gênico/genética , Genoma/genética , Projeto Genoma Humano , Humanos , Cariotipagem/métodos , Translocação Genética/genética , Sequenciamento Completo do Genoma/métodos
5.
Mol Biol Evol ; 33(5): 1177-87, 2016 05.
Artigo em Inglês | MEDLINE | ID: mdl-26744415

RESUMO

Skin lightening among Eurasians is thought to have been a convergence occurring independently in Europe and East Asia as an adaptation to high latitude environments. Among Europeans, several genes responsible for such lightening have been found, but the information available for East Asians is much more limited. Here, a genome-wide comparison between dark-skinned Africans and Austro-Asiatic speaking aborigines and light-skinned northern Han Chinese identified the pigmentation gene OCA2, showing unusually deep allelic divergence between these groups. An amino acid substitution (His615Arg) of OCA2 prevalent in most East Asian populations-but absent in Africans and Europeans-was significantly associated with skin lightening among northern Han Chinese. Further transgenic and targeted gene modification analyses of zebrafish and mouse both exhibited the phenotypic effect of the OCA2 variant manifesting decreased melanin production. These results indicate that OCA2 plays an important role in the convergent skin lightening of East Asians during recent human evolution.


Assuntos
Povo Asiático/genética , Proteínas de Membrana Transportadoras/genética , Pigmentação da Pele/genética , Adolescente , Alelos , Substituição de Aminoácidos , Evolução Biológica , População Negra/genética , Criança , Etnicidade/genética , Evolução Molecular , Feminino , Frequência do Gene , Estudos de Associação Genética/métodos , Variação Genética , Genética Populacional/métodos , Haplótipos , Humanos , Masculino , Proteínas de Membrana Transportadoras/sangue , Proteínas de Membrana Transportadoras/metabolismo , Polimorfismo de Nucleotídeo Único , Seleção Genética , Pigmentação da Pele/fisiologia , População Branca/genética , Adulto Jovem
6.
Nature ; 463(7279): 311-7, 2010 Jan 21.
Artigo em Inglês | MEDLINE | ID: mdl-20010809

RESUMO

Using next-generation sequencing technology alone, we have successfully generated and assembled a draft sequence of the giant panda genome. The assembled contigs (2.25 gigabases (Gb)) cover approximately 94% of the whole genome, and the remaining gaps (0.05 Gb) seem to contain carnivore-specific repeats and tandem repeats. Comparisons with the dog and human showed that the panda genome has a lower divergence rate. The assessment of panda genes potentially underlying some of its unique traits indicated that its bamboo diet might be more dependent on its gut microbiome than its own genetic composition. We also identified more than 2.7 million heterozygous single nucleotide polymorphisms in the diploid genome. Our data and analyses provide a foundation for promoting mammalian genetic research, and demonstrate the feasibility for using next-generation sequencing technologies for accurate, cost-effective and rapid de novo assembly of large eukaryotic genomes.


Assuntos
Genoma/genética , Genômica , Ursidae/genética , Algoritmos , Animais , China , Sequência Conservada/genética , Mapeamento de Sequências Contíguas , Dieta/veterinária , Cães , Evolução Molecular , Feminino , Fertilidade/genética , Fertilidade/fisiologia , Heterozigoto , Humanos , Família Multigênica/genética , Polimorfismo de Nucleotídeo Único/genética , Receptores Acoplados a Proteínas G/genética , Alinhamento de Sequência , Análise de Sequência de DNA , Sintenia/genética , Ursidae/classificação , Ursidae/fisiologia
7.
Nature ; 463(7282): 757-62, 2010 Feb 11.
Artigo em Inglês | MEDLINE | ID: mdl-20148029

RESUMO

We report here the genome sequence of an ancient human. Obtained from approximately 4,000-year-old permafrost-preserved hair, the genome represents a male individual from the first known culture to settle in Greenland. Sequenced to an average depth of 20x, we recover 79% of the diploid genome, an amount close to the practical limit of current sequencing technologies. We identify 353,151 high-confidence single-nucleotide polymorphisms (SNPs), of which 6.8% have not been reported previously. We estimate raw read contamination to be no higher than 0.8%. We use functional SNP assessment to assign possible phenotypic characteristics of the individual that belonged to a culture whose location has yielded only trace human remains. We compare the high-confidence SNPs to those of contemporary populations to find the populations most closely related to the individual. This provides evidence for a migration from Siberia into the New World some 5,500 years ago, independent of that giving rise to the modern Native Americans and Inuit.


Assuntos
Criopreservação , Extinção Biológica , Genoma Humano/genética , Inuíte/genética , Emigração e Imigração/história , Genética Populacional , Genômica , Genótipo , Groenlândia , Cabelo , História Antiga , Humanos , Masculino , Fenótipo , Filogenia , Polimorfismo de Nucleotídeo Único/genética , Análise de Sequência de DNA , Sibéria/etnologia
8.
Proc Natl Acad Sci U S A ; 110(35): 14492-7, 2013 Aug 27.
Artigo em Inglês | MEDLINE | ID: mdl-23940322

RESUMO

The growing world population and shrinkage of arable land demand yield improvement of rice, one of the most important staple crops. To elucidate the genetic basis of yield and uncover its associated loci in rice, we resequenced the core recombinant inbred lines of Liang-You-Pei-Jiu, the widely cultivated super hybrid rice, and constructed a high-resolution linkage map. We detected 43 yield-associated quantitative trait loci, of which 20 are unique. Based on the high-density physical map, the genome sequences of paternal variety 93-11 and maternal cultivar PA64s of Liang-You-Pei-Jiu were significantly improved. The large recombinant inbred line population combined with plentiful high-quality single nucleotide polymorphisms and insertions/deletions between parental genomes allowed us to fine-map two quantitative trait loci, qSN8 and qSPB1, and to identify days to heading8 and lax panicle1 as candidate genes, respectively. The quantitative trait locus qSN8 was further confirmed to be days to heading8 by a complementation test. Our study provided an ideal platform for molecular breeding by targeting and dissecting yield-associated loci in rice.


Assuntos
Genoma de Planta , Hibridização Genética , Oryza/genética , Recombinação Genética , Ligação Genética , Locos de Características Quantitativas
9.
BMC Genomics ; 15: 262, 2014 Apr 04.
Artigo em Inglês | MEDLINE | ID: mdl-24708091

RESUMO

BACKGROUND: Targeted capture of genomic regions reduces sequencing cost while generating higher coverage by allowing biomedical researchers to focus on specific loci of interest, such as exons. Targeted capture also has the potential to facilitate the generation of genomic data from DNA collected via saliva or buccal cells. DNA samples derived from these cell types tend to have a lower human DNA yield, may be degraded from age and/or have contamination from bacteria or other ambient oral microbiota. However, thousands of samples have been previously collected from these cell types, and saliva collection has the advantage that it is a non-invasive and appropriate for a wide variety of research. RESULTS: We demonstrate successful enrichment and sequencing of 15 South African KhoeSan exomes and 2 full genomes with samples initially derived from saliva. The expanded exome dataset enables us to characterize genetic diversity free from ascertainment bias for multiple KhoeSan populations, including new exome data from six HGDP Namibian San, revealing substantial population structure across the Kalahari Desert region. Additionally, we discover and independently verify thirty-one previously unknown KIR alleles using methods we developed to accurately map and call the highly polymorphic HLA and KIR loci from exome capture data. Finally, we show that exome capture of saliva-derived DNA yields sufficient non-human sequences to characterize oral microbial communities, including detection of bacteria linked to oral disease (e.g. Prevotella melaninogenica). For comparison, two samples were sequenced using standard full genome library preparation without exome capture and we found no systematic bias of metagenomic information between exome-captured and non-captured data. CONCLUSIONS: DNA from human saliva samples, collected and extracted using standard procedures, can be used to successfully sequence high quality human exomes, and metagenomic data can be derived from non-human reads. We find that individuals from the Kalahari carry a higher oral pathogenic microbial load than samples surveyed in the Human Microbiome Project. Additionally, rare variants present in the exomes suggest strong population structure across different KhoeSan populations.


Assuntos
Exoma , Genômica , Metagenômica , Saliva/química , Saliva/microbiologia , Genoma Humano , Genótipo , Antígenos HLA/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Microbiota , Dados de Sequência Molecular , Boca/microbiologia , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas , Receptores KIR/genética
10.
BMC Plant Biol ; 14: 83, 2014 Apr 01.
Artigo em Inglês | MEDLINE | ID: mdl-24684805

RESUMO

BACKGROUND: Drought stress is one of the major limiting factors for maize production. With the availability of maize B73 reference genome and whole-genome resequencing of 15 maize inbreds, common variants (CV) and clustering analyses were applied to identify non-synonymous SNPs (nsSNPs) and corresponding candidate genes for drought tolerance. RESULTS: A total of 524 nsSNPs that were associated with 271 candidate genes involved in plant hormone regulation, carbohydrate and sugar metabolism, signaling molecules regulation, redox reaction and acclimation of photosynthesis to environment were detected by CV and cluster analyses. Most of the nsSNPs identified were clustered in bin 1.07 region that harbored six previously reported QTL with relatively high phenotypic variation explained for drought tolerance. Genes Ontology (GO) analysis of candidate genes revealed that there were 35 GO terms related to biotic stimulus and membrane-bounded organelle, showing significant differences between the candidate genes and the reference B73 background. Changes of expression level in these candidate genes for drought tolerance were detected using RNA sequencing for fertilized ovary, basal leaf meristem tissue and roots collected under drought stressed and well-watered conditions. The results indicated that 70% of candidate genes showed significantly expression changes under two water treatments and our strategies for mining candidate genes are feasible and relatively efficient. CONCLUSIONS: Our results successfully revealed candidate nsSNPs and associated genes for drought tolerance by comparative sequence analysis of 16 maize inbred lines. Both methods we applied were proved to be efficient for identifying candidate genes for complex traits through the next-generation sequencing technologies (NGS). These selected genes will not only facilitate understanding of genetic basis of drought stress response, but also accelerate genetic improvement through marker-assisted selection in maize.


Assuntos
Adaptação Fisiológica/genética , Estudos de Associação Genética , Genoma de Planta/genética , Análise de Sequência de DNA/métodos , Zea mays/genética , Zea mays/fisiologia , Cromossomos de Plantas/genética , Análise por Conglomerados , Desidratação , Secas , Ontologia Genética , Genes de Plantas , Genótipo , Endogamia , Desnaturação de Ácido Nucleico/genética , Reação em Cadeia da Polimerase , Polimorfismo de Nucleotídeo Único/genética , Locos de Características Quantitativas/genética , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Reprodutibilidade dos Testes
11.
BMC Genomics ; 14: 579, 2013 Aug 28.
Artigo em Inglês | MEDLINE | ID: mdl-23984715

RESUMO

BACKGROUND: Artificial selection played an important role in the origin of modern Glycine max cultivars from the wild soybean Glycine soja. To elucidate the consequences of artificial selection accompanying the domestication and modern improvement of soybean, 25 new and 30 published whole-genome re-sequencing accessions, which represent wild, domesticated landrace, and Chinese elite soybean populations were analyzed. RESULTS: A total of 5,102,244 single nucleotide polymorphisms (SNPs) and 707,969 insertion/deletions were identified. Among the SNPs detected, 25.5% were not described previously. We found that artificial selection during domestication led to more pronounced reduction in the genetic diversity of soybean than the switch from landraces to elite cultivars. Only a small proportion (2.99%) of the whole genomic regions appear to be affected by artificial selection for preferred agricultural traits. The selection regions were not distributed randomly or uniformly throughout the genome. Instead, clusters of selection hotspots in certain genomic regions were observed. Moreover, a set of candidate genes (4.38% of the total annotated genes) significantly affected by selection underlying soybean domestication and genetic improvement were identified. CONCLUSIONS: Given the uniqueness of the soybean germplasm sequenced, this study drew a clear picture of human-mediated evolution of the soybean genomes. The genomic resources and information provided by this study would also facilitate the discovery of genes/loci underlying agronomically important traits.


Assuntos
Genoma de Planta , Glycine max/genética , Teorema de Bayes , Cruzamento , Evolução Molecular , Genética Populacional , Haplótipos , Humanos , Mutação INDEL , Anotação de Sequência Molecular , Filogenia , Polimorfismo de Nucleotídeo Único , Locos de Características Quantitativas , Seleção Genética , Análise de Sequência de DNA
12.
J Virol ; 85(21): 11291-9, 2011 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-21880770

RESUMO

Epstein-Barr virus (EBV)-encoded molecules have been detected in the tumor tissues of several cancers, including nasopharyngeal carcinoma (NPC), suggesting that EBV plays an important role in tumorigenesis. However, the nature of EBV with respect to genome width in vivo and whether EBV undergoes clonal expansion in the tumor tissues are still poorly understood. In this study, next-generation sequencing (NGS) was used to sequence DNA extracted directly from the tumor tissue of a patient with NPC. Apart from the human sequences, a clinically isolated EBV genome 164.7 kb in size was successfully assembled and named GD2 (GenBank accession number HQ020558). Sequence and phylogenetic analyses showed that GD2 was closely related to GD1, a previously assembled variant derived from a patient with NPC. GD2 contains the most prevalent EBV variants reported in Cantonese patients with NPC, suggesting that it might be the prevalent strain in this population. Furthermore, GD2 could be grouped into a single subtype according to common classification criteria and contains only 6 heterozygous point mutations, suggesting the monoclonal expansion of GD2 in NPC. This study represents the first genome-wide analysis of a clinical isolate of EBV directly extracted from NPC tissue. Our study reveals that NGS allows the characterization of genome-wide variations of EBV in clinical tumors and provides evidence of monoclonal expansion of EBV in vivo. The pipeline could also be applied to the study of other pathogen-related malignancies. With additional NGS studies of NPC, it might be possible to uncover the potential causative EBV variant involved in NPC.


Assuntos
DNA Viral/genética , Infecções por Vírus Epstein-Barr/complicações , Herpesvirus Humano 4/genética , Herpesvirus Humano 4/isolamento & purificação , Neoplasias Nasofaríngeas/virologia , Carcinoma , China , Análise por Conglomerados , DNA Viral/química , Herpesvirus Humano 4/classificação , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Dados de Sequência Molecular , Carcinoma Nasofaríngeo , Filogenia , Análise de Sequência de DNA , Homologia de Sequência
13.
Ecol Evol ; 11(1): 390-401, 2021 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-33437437

RESUMO

Ancient DNA research has developed rapidly over the past few decades due to improvements in PCR and next-generation sequencing (NGS) technologies, but challenges still exist. One major challenge in relation to ancient DNA research is to recover genuine endogenous ancient DNA sequences from raw sequencing data. This is often difficult due to degradation of ancient DNA and high levels of contamination, especially homologous contamination that has extremely similar genetic background with that of the real ancient DNA. In this study, we collected whole-genome sequencing (WGS) data from 6 ancient samples to compare different mapping algorithms. To further explore more effective methods to separate endogenous DNA from homologous contaminations, we attempted to recover reads based on ancient DNA specific characteristics of deamination, depurination, and DNA fragmentation with different parameters. We propose a quick and improved pipeline for separating endogenous ancient DNA while simultaneously decreasing homologous contaminations to very low proportions. Our goal in this research was to develop useful recommendations for ancient DNA mapping and for separation of endogenous DNA to facilitate future studies of ancient DNA.

14.
Genome Med ; 13(1): 18, 2021 02 05.
Artigo em Inglês | MEDLINE | ID: mdl-33546747

RESUMO

BACKGROUND: Noninvasive prenatal testing (NIPT) of recessive monogenic diseases depends heavily on knowing the correct parental haplotypes. However, the currently used family-based haplotyping method requires pedigrees, and molecular haplotyping is highly challenging due to its high cost, long turnaround time, and complexity. Here, we proposed a new two-step approach, population-based haplotyping-NIPT (PBH-NIPT), using α-thalassemia and ß-thalassemia as prototypes. METHODS: First, we deduced parental haplotypes with Beagle 4.0 with training on a large retrospective carrier screening dataset (4356 thalassemia carrier screening-positive cases). Second, we inferred fetal haplotypes using a parental haplotype-assisted hidden Markov model (HMM) and the Viterbi algorithm. RESULTS: With this approach, we enrolled 59 couples at risk of having a fetus with thalassemia and successfully inferred 94.1% (111/118) of fetal alleles. We confirmed these alleles by invasive prenatal diagnosis, with 99.1% (110/111) accuracy (95% CI, 95.1-100%). CONCLUSIONS: These results demonstrate that PBH-NIPT is a sensitive, fast, and inexpensive strategy for NIPT of thalassemia.


Assuntos
Haplótipos/genética , Teste Pré-Natal não Invasivo , Pais , Talassemia alfa/genética , Talassemia beta/genética , Genética Populacional , Humanos , Tamanho da Amostra
15.
Curr Protoc Hum Genet ; 96: 8.18.1-8.18.16, 2018 01 24.
Artigo em Inglês | MEDLINE | ID: mdl-29364520

RESUMO

Balanced chromosomal rearrangements (or balanced chromosome abnormalities, BCAs) are common chromosomal structural variants. Emerging studies have demonstrated the feasibility of using whole-genome sequencing (WGS) for detection of BCA-associated breakpoints, but the requirement for a priori knowledge of the rearranged regions from G-banded chromosome analysis limits its application. The protocols described here are based on low-pass WGS for detecting BCA events independent from chromosome analysis, and has been validated using genomic data from the 1000 Genomes Project. This approach adopts non-size-selected mate-pair library (3∼8 kb) with 2∼3 µg DNA as input, and requires only 30 million read-pairs (50 bp, equivalent to 1-fold base-coverage) for each sample. The complete procedure takes 13 days and the total cost is estimated to be less than $600 (USD) per sample. © 2018 by John Wiley & Sons, Inc.


Assuntos
Aberrações Cromossômicas , Transtornos Cromossômicos/genética , Genoma Humano/genética , Sequenciamento Completo do Genoma , Transtornos Cromossômicos/patologia , Mapeamento Cromossômico , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Translocação Genética
16.
Gigascience ; 7(4): 1-12, 2018 04 01.
Artigo em Inglês | MEDLINE | ID: mdl-29300887

RESUMO

Background: Characterization of genetic variations in maize has been challenging, mainly due to deterioration of collinearity between individual genomes in the species. An international consortium of maize research groups combined resources to develop the maize haplotype version 3 (HapMap 3), built from whole-genome sequencing data from 1218 maize lines, covering predomestication and domesticated Zea mays varieties across the world. Results: A new computational pipeline was set up to process more than 12 trillion bp of sequencing data, and a set of population genetics filters was applied to identify more than 83 million variant sites. Conclusions: We identified polymorphisms in regions where collinearity is largely preserved in the maize species. However, the fact that the B73 genome used as the reference only represents a fraction of all haplotypes is still an important limiting factor.


Assuntos
Genoma de Planta , Haplótipos , Zea mays/genética , Variação Genética
17.
Nat Genet ; 50(12): 1696-1704, 2018 12.
Artigo em Inglês | MEDLINE | ID: mdl-30397334

RESUMO

The genetic variation in Northern Asian populations is currently undersampled. To address this, we generated a new genetic variation reference panel by whole-genome sequencing of 175 ethnic Mongolians, representing six tribes. The cataloged variation in the panel shows strong population stratification among these tribes, which correlates with the diverse demographic histories in the region. Incorporating our results with the 1000 Genomes Project panel identifies derived alleles shared between Finns and Mongolians/Siberians, suggesting that substantial gene flow between northern Eurasian populations has occurred in the past. Furthermore, we highlight that North, East, and Southeast Asian populations are more aligned with each other than these groups are with South Asian and Oceanian populations.


Assuntos
Povo Asiático/etnologia , Povo Asiático/genética , Genética Populacional , América/epidemiologia , Ásia Setentrional/epidemiologia , Povo Asiático/estatística & dados numéricos , Europa (Continente)/epidemiologia , Ásia Oriental/epidemiologia , Feminino , Fluxo Gênico , Genoma Humano , Humanos , Masculino , Mongólia/etnologia , Filogenia , Sequenciamento Completo do Genoma
18.
Gigascience ; 6(9): 1-7, 2017 09 01.
Artigo em Inglês | MEDLINE | ID: mdl-28938720

RESUMO

Next-generation sequencing provides a high-resolution insight into human genetic information. However, the focus of previous studies has primarily been on low-coverage data due to the high cost of sequencing. Although the 1000 Genomes Project and the Haplotype Reference Consortium have both provided powerful reference panels for imputation, low-frequency and novel variants remain difficult to discover and call with accuracy on the basis of low-coverage data. Deep sequencing provides an optimal solution for the problem of these low-frequency and novel variants. Although whole-exome sequencing is also a viable choice for exome regions, it cannot account for noncoding regions, sometimes resulting in the absence of important, causal variants. For Han Chinese populations, the majority of variants have been discovered based upon low-coverage data from the 1000 Genomes Project. However, high-coverage, whole-genome sequencing data are limited for any population, and a large amount of low-frequency, population-specific variants remain uncharacterized. We have performed whole-genome sequencing at a high depth (∼×80) of 90 unrelated individuals of Chinese ancestry, collected from the 1000 Genomes Project samples, including 45 Northern Han Chinese and 45 Southern Han Chinese samples. Eighty-three of these 90 have been sequenced by the 1000 Genomes Project. We have identified 12 568 804 single nucleotide polymorphisms, 2 074 210 short InDels, and 26 142 structural variations from these 90 samples. Compared to the Han Chinese data from the 1000 Genomes Project, we have found 7 000 629 novel variants with low frequency (defined as minor allele frequency < 5%), including 5 813 503 single nucleotide polymorphisms, 1 169 199 InDels, and 17 927 structural variants. Using deep sequencing data, we have built a greatly expanded spectrum of genetic variation for the Han Chinese genome. Compared to the 1000 Genomes Project, these Han Chinese deep sequencing data enhance the characterization of a large number of low-frequency, novel variants. This will be a valuable resource for promoting Chinese genetics research and medical development. Additionally, it will provide a valuable supplement to the 1000 Genomes Project, as well as to other human genome projects.


Assuntos
Povo Asiático/genética , Genoma Humano , Frequência do Gene , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Mutação INDEL , Polimorfismo de Nucleotídeo Único , Sequenciamento Completo do Genoma/métodos
19.
Gigascience ; 6(9): 1-11, 2017 09 01.
Artigo em Inglês | MEDLINE | ID: mdl-28938719

RESUMO

Active retrotransposons play important roles during evolution and continue to shape our genomes today, especially in genetic polymorphisms underlying a diverse set of diseases. However, studies of human retrotransposon insertion polymorphisms (RIPs) based on whole-genome deep sequencing at the population level have not been sufficiently undertaken, despite the obvious need for a thorough characterization of RIPs in the general population. Herein, we present a novel and efficient computational tool called Specific Insertions Detector (SID) for the detection of non-reference RIPs. We demonstrate that SID is suitable for high-depth whole-genome sequencing data using paired-end reads obtained from simulated and real datasets. We construct a comprehensive RIP database using a large population of 90 Han Chinese individuals with a mean ×68 depth per individual. In total, we identify 9342 recent RIPs, and 8433 of these RIPs are novel compared with dbRIP, including 5826 Alu, 2169 long interspersed nuclear element 1 (L1), 383 SVA, and 55 long terminal repeats. Among the 9342 RIPs, 4828 were located in gene regions and 5 were located in protein-coding regions. We demonstrate that RIPs can, in principle, be an informative resource to perform population evolution and phylogenetic analyses. Taking the demographic effects into account, we identify a weak negative selection on SVA and L1 but an approximately neutral selection for Alu elements based on the frequency spectrum of RIPs. SID is a powerful open-source program for the detection of non-reference RIPs. We built a non-reference RIP dataset that greatly enhanced the diversity of RIPs detected in the general population, and it should be invaluable to researchers interested in many aspects of human evolution, genetics, and disease. As a proof of concept, we demonstrate that the RIPs can be used as biomarkers in a similar way as single nucleotide polymorphisms.


Assuntos
Genoma Humano , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Polimorfismo Genético , Retroelementos , Sequenciamento Completo do Genoma/métodos , Povo Asiático/genética , Humanos
20.
J Diabetes Res ; 2015: 613236, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-26290879

RESUMO

The large scale genome wide association studies (GWAS) have identified approximately 80 single nucleotide polymorphisms (SNPs) conferring susceptibility to type 2 diabetes (T2D). However, most of these loci have not been replicated in diverse populations and much genetic heterogeneity has been observed across ethnic groups. We tested 28 SNPs previously found to be associated with T2D by GWAS in a Mongolian sample of Northern China (497 diagnosed with T2D and 469 controls) for association with T2D and diabetes related quantitative traits. We replicated T2D association of 11 SNPs, namely, rs7578326 (IRS1), rs1531343 (HMGA2), rs8042680 (PRC1), rs7578597 (THADA), rs1333051 (CDKN2), rs6723108 (TMEM163), rs163182 and rs2237897 (KCNQ1), rs1387153 (MTNR1B), rs243021 (BCL11A), and rs10229583 (PAX4) in our sample. Further, we showed that risk allele of the strongest T2D associated SNP in our sample, rs757832 (IRS1), is associated with increased level of TG. We observed substantial difference of T2D risk allele frequency between the Mongolian sample and the 1000G Caucasian sample for a few SNPs, including rs6723108 (TMEM163) whose risk allele reaches near fixation in the Mongolian sample. Further study of genetic architecture of these variants in susceptibility of T2D is needed to understand the role of these variants in heterogeneous populations.


Assuntos
Diabetes Mellitus Tipo 2/etnologia , Diabetes Mellitus Tipo 2/genética , Variação Genética , Polimorfismo de Nucleotídeo Único , Adulto , Idoso , Alelos , Povo Asiático , Índice de Massa Corporal , China , Feminino , Frequência do Gene , Marcadores Genéticos , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Genótipo , Humanos , Modelos Logísticos , Masculino , Pessoa de Meia-Idade , Mongólia , Controle de Qualidade , Triglicerídeos/sangue
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA