Your browser doesn't support javascript.
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 79
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Nat Rev Nephrol ; 15(9): 590, 2019 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-31363178

RESUMO

An amendment to this paper has been published and can be accessed via a link at the top of the paper.

4.
Nature ; 570(7759): 71-76, 2019 06.
Artigo em Inglês | MEDLINE | ID: mdl-31118516

RESUMO

Protein-coding genetic variants that strongly affect disease risk can yield relevant clues to disease pathogenesis. Here we report exome-sequencing analyses of 20,791 individuals with type 2 diabetes (T2D) and 24,440 non-diabetic control participants from 5 ancestries. We identify gene-level associations of rare variants (with minor allele frequencies of less than 0.5%) in 4 genes at exome-wide significance, including a series of more than 30 SLC30A8 alleles that conveys protection against T2D, and in 12 gene sets, including those corresponding to T2D drug targets (P = 6.1 × 10-3) and candidate genes from knockout mice (P = 5.2 × 10-3). Within our study, the strongest T2D gene-level signals for rare variants explain at most 25% of the heritability of the strongest common single-variant signals, and the gene-level effect sizes of the rare variants that we observed in established T2D drug targets will require 75,000-185,000 sequenced cases to achieve exome-wide significance. We propose a method to interpret these modest rare-variant associations and to incorporate these associations into future target or gene prioritization efforts.


Assuntos
Diabetes Mellitus Tipo 2/genética , Exoma/genética , Sequenciamento Completo do Exoma , Animais , Estudos de Casos e Controles , Técnicas de Apoio para a Decisão , Feminino , Frequência do Gene , Estudo de Associação Genômica Ampla , Humanos , Masculino , Camundongos , Camundongos Knockout
5.
Nat Commun ; 10(1): 1847, 2019 04 23.
Artigo em Inglês | MEDLINE | ID: mdl-31015462

RESUMO

Chronic kidney disease (CKD) is a growing health burden currently affecting 10-15% of adults worldwide. Estimated glomerular filtration rate (eGFR) as a marker of kidney function is commonly used to diagnose CKD. We analyze eGFR data from the Nord-Trøndelag Health Study and Michigan Genomics Initiative and perform a GWAS meta-analysis with public summary statistics, more than doubling the sample size of previous meta-analyses. We identify 147 loci (53 novel) associated with eGFR, including genes involved in transcriptional regulation, kidney development, cellular signaling, metabolism, and solute transport. Additionally, sex-stratified analysis identifies one locus with more significant effects in women than men. Using genetic risk scores constructed from these eGFR meta-analysis results, we show that associated variants are generally predictive of CKD with only modest improvements in detection compared with other known clinical risk factors. Collectively, these results yield additional insight into the genetic factors underlying kidney function and progression to CKD.


Assuntos
Loci Gênicos , Estudo de Associação Genômica Ampla , Taxa de Filtração Glomerular/genética , Insuficiência Renal Crônica/genética , Feminino , Carga Global da Doença , Humanos , Rim/fisiopatologia , Masculino , Prognóstico , Insuficiência Renal Crônica/diagnóstico , Insuficiência Renal Crônica/epidemiologia , Insuficiência Renal Crônica/fisiopatologia , Medição de Risco/métodos , Fatores de Risco , Fatores Sexuais
6.
Hum Genet ; 138(4): 307-326, 2019 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-30820706

RESUMO

Genome-wide association studies have reported 56 independently associated colorectal cancer (CRC) risk variants, most of which are non-coding and believed to exert their effects by modulating gene expression. The computational method PrediXcan uses cis-regulatory variant predictors to impute expression and perform gene-level association tests in GWAS without directly measured transcriptomes. In this study, we used reference datasets from colon (n = 169) and whole blood (n = 922) transcriptomes to test CRC association with genetically determined expression levels in a genome-wide analysis of 12,186 cases and 14,718 controls. Three novel associations were discovered from colon transverse models at FDR ≤ 0.2 and further evaluated in an independent replication including 32,825 cases and 39,933 controls. After adjusting for multiple comparisons, we found statistically significant associations using colon transcriptome models with TRIM4 (discovery P = 2.2 × 10- 4, replication P = 0.01), and PYGL (discovery P = 2.3 × 10- 4, replication P = 6.7 × 10- 4). Interestingly, both genes encode proteins that influence redox homeostasis and are related to cellular metabolic reprogramming in tumors, implicating a novel CRC pathway linked to cell growth and proliferation. Defining CRC risk regions as one megabase up- and downstream of one of the 56 independent risk variants, we defined 44 non-overlapping CRC-risk regions. Among these risk regions, we identified genes associated with CRC (P < 0.05) in 34/44 CRC-risk regions. Importantly, CRC association was found for two genes in the previously reported 2q25 locus, CXCR1 and CXCR2, which are potential cancer therapeutic targets. These findings provide strong candidate genes to prioritize for subsequent laboratory follow-up of GWAS loci. This study is the first to implement PrediXcan in a large colorectal cancer study and findings highlight the utility of integrating transcriptome data in GWAS for discovery of, and biological insight into, risk loci.


Assuntos
Neoplasias Colorretais/diagnóstico , Neoplasias Colorretais/genética , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Polimorfismo de Nucleotídeo Único , Estudos de Casos e Controles , Neoplasias Colorretais/epidemiologia , Expressão Gênica , Regulação Neoplásica da Expressão Gênica , Frequência do Gene , Estudo de Associação Genômica Ampla/estatística & dados numéricos , Humanos , Valor Preditivo dos Testes , Prognóstico , Fatores de Risco
7.
Plant Physiol ; 179(4): 1444-1456, 2019 04.
Artigo em Inglês | MEDLINE | ID: mdl-30718350

RESUMO

Single-cell RNA sequencing (scRNA-seq) has been used extensively to study cell-specific gene expression in animals, but it has not been widely applied to plants. Here, we describe the use of a commercially available droplet-based microfluidics platform for high-throughput scRNA-seq to obtain single-cell transcriptomes from protoplasts of more than 10,000 Arabidopsis (Arabidopsis thaliana) root cells. We find that all major tissues and developmental stages are represented in this single-cell transcriptome population. Further, distinct subpopulations and rare cell types, including putative quiescent center cells, were identified. A focused analysis of root epidermal cell transcriptomes defined developmental trajectories for individual cells progressing from meristematic through mature stages of root-hair and nonhair cell differentiation. In addition, single-cell transcriptomes were obtained from root epidermis mutants, enabling a comparative analysis of gene expression at single-cell resolution and providing an unprecedented view of the impact of the mutated genes. Overall, this study demonstrates the feasibility and utility of scRNA-seq in plants and provides a first-generation gene expression map of the Arabidopsis root at single-cell resolution.


Assuntos
Arabidopsis/metabolismo , Raízes de Plantas/metabolismo , Análise de Célula Única , Transcriptoma , Arabidopsis/citologia , Estudos de Viabilidade , Epiderme Vegetal/metabolismo , Raízes de Plantas/citologia , Protoplastos/metabolismo , Análise de Sequência de RNA
8.
Nat Genet ; 2018 Dec 03.
Artigo em Inglês | MEDLINE | ID: mdl-30510241

RESUMO

To further dissect the genetic architecture of colorectal cancer (CRC), we performed whole-genome sequencing of 1,439 cases and 720 controls, imputed discovered sequence variants and Haplotype Reference Consortium panel variants into genome-wide association study data, and tested for association in 34,869 cases and 29,051 controls. Findings were followed up in an additional 23,262 cases and 38,296 controls. We discovered a strongly protective 0.3% frequency variant signal at CHD1. In a combined meta-analysis of 125,478 individuals, we identified 40 new independent signals at P < 5 × 10-8, bringing the number of known independent signals for CRC to ~100. New signals implicate lower-frequency variants, Krüppel-like factors, Hedgehog signaling, Hippo-YAP signaling, long noncoding RNAs and somatic drivers, and support a role for immune function. Heritability analyses suggest that CRC risk is highly polygenic, and larger, more comprehensive studies enabling rare variant analysis will improve understanding of biology underlying this risk and influence personalized screening strategies and drug development.

9.
Nat Commun ; 9(1): 4038, 2018 10 02.
Artigo em Inglês | MEDLINE | ID: mdl-30279509

RESUMO

Hundreds of thousands of human whole genome sequencing (WGS) datasets will be generated over the next few years. These data are more valuable in aggregate: joint analysis of genomes from many sources increases sample size and statistical power. A central challenge for joint analysis is that different WGS data processing pipelines cause substantial differences in variant calling in combined datasets, necessitating computationally expensive reprocessing. This approach is no longer tenable given the scale of current studies and data volumes. Here, we define WGS data processing standards that allow different groups to produce functionally equivalent (FE) results, yet still innovate on data processing pipelines. We present initial FE pipelines developed at five genome centers and show that they yield similar variant calling results and produce significantly less variability than sequencing replicates. This work alleviates a key technical bottleneck for genome aggregation and helps lay the foundation for community-wide human genetics studies.

10.
Bioinformatics ; 2018 Jul 02.
Artigo em Inglês | MEDLINE | ID: mdl-30204848

RESUMO

Summary: Estimating linkage disequilibrium (LD) is essential for a wide range of summary statistics-based association methods for genome-wide association studies. Large genetic datasets, e.g. the TOPMed WGS project and UK Biobank, enable more accurate and comprehensive LD estimates, but increase the computational burden of LD estimation. Here, we describe emeraLD (Efficient Methods for Estimation and Random Access of LD), a computational tool that leverages sparsity and haplotype structure to estimate LD up to 2 orders of magnitude faster than current tools. Availability and implementation: emeraLD is implemented in C++, and is open source under GPLv3. Source code and documentation are freely available at http://github.com/statgen/emeraLD. Supplementary information: Supplementary data are available at Bioinformatics online.

11.
Nat Commun ; 9(1): 3753, 2018 09 14.
Artigo em Inglês | MEDLINE | ID: mdl-30218074

RESUMO

A detailed understanding of the genome-wide variability of single-nucleotide germline mutation rates is essential to studying human genome evolution. Here, we use ~36 million singleton variants from 3560 whole-genome sequences to infer fine-scale patterns of mutation rate heterogeneity. Mutability is jointly affected by adjacent nucleotide context and diverse genomic features of the surrounding region, including histone modifications, replication timing, and recombination rate, sometimes suggesting specific mutagenic mechanisms. Remarkably, GC content, DNase hypersensitivity, CpG islands, and H3K36 trimethylation are associated with both increased and decreased mutation rates depending on nucleotide context. We validate these estimated effects in an independent dataset of ~46,000 de novo mutations, and confirm our estimates are more accurate than previously published results based on ancestrally older variants without considering genomic features. Our results thus provide the most refined portrait to date of the factors contributing to genome-wide variability of the human germline mutation rate.

12.
G3 (Bethesda) ; 8(10): 3255-3267, 2018 10 03.
Artigo em Inglês | MEDLINE | ID: mdl-30131328

RESUMO

The emergence of very large cohorts in genomic research has facilitated a focus on genotype-imputation strategies to power rare variant association. These strategies have benefited from improvements in imputation methods and association tests, however little attention has been paid to ways in which array design can increase rare variant association power. Therefore, we developed a novel framework to select tag SNPs using the reference panel of 26 populations from Phase 3 of the 1000 Genomes Project. We evaluate tag SNP performance via mean imputed r2 at untyped sites using leave-one-out internal validation and standard imputation methods, rather than pairwise linkage disequilibrium. Moving beyond pairwise metrics allows us to account for haplotype diversity across the genome for improve imputation accuracy and demonstrates population-specific biases from pairwise estimates. We also examine array design strategies that contrast multi-ethnic cohorts vs. single populations, and show a boost in performance for the former can be obtained by prioritizing tag SNPs that contribute information across multiple populations simultaneously. Using our framework, we demonstrate increased imputation accuracy for rare variants (frequency < 1%) by 0.5-3.1% for an array of one million sites and 0.7-7.1% for an array of 500,000 sites, depending on the population. Finally, we show how recent explosive growth in non-African populations means tag SNPs capture on average 30% fewer other variants than in African populations. The unified framework presented here will enable investigators to make informed decisions for the design of new arrays, and help empower the next phase of rare variant association for global health.

13.
Nat Genet ; 50(9): 1335-1341, 2018 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-30104761

RESUMO

In genome-wide association studies (GWAS) for thousands of phenotypes in large biobanks, most binary traits have substantially fewer cases than controls. Both of the widely used approaches, the linear mixed model and the recently proposed logistic mixed model, perform poorly; they produce large type I error rates when used to analyze unbalanced case-control phenotypes. Here we propose a scalable and accurate generalized mixed model association test that uses the saddlepoint approximation to calibrate the distribution of score test statistics. This method, SAIGE (Scalable and Accurate Implementation of GEneralized mixed model), provides accurate P values even when case-control ratios are extremely unbalanced. SAIGE uses state-of-art optimization strategies to reduce computational costs; hence, it is applicable to GWAS for thousands of phenotypes by large biobanks. Through the analysis of UK Biobank data of 408,961 samples from white British participants with European ancestry for > 1,400 binary phenotypes, we show that SAIGE can efficiently analyze large sample data, controlling for unbalanced case-control ratios and sample relatedness.

14.
Diabetes ; 67(9): 1892-1902, 2018 09.
Artigo em Inglês | MEDLINE | ID: mdl-29941447

RESUMO

We investigated ethnicity-specific exonic variants of type 2 diabetes (T2D) and its related clinical phenotypes in an East Asian population. We performed whole-exome sequencing in 917 T2D case and control subjects, and the findings were validated by exome array genotyping in 3,026 participants. In silico replication was conducted for seven nonsynonymous variants in an additional 13,122 participants. Single-variant and gene-based association tests for T2D were analyzed. A total of 728,838 variants were identified by whole-exome sequencing. Among nonsynonymous variants, PAX4 Arg192His increased risk of T2D and GLP1R Arg131Gln decreased risk of T2D in genome-wide significance (odds ratio [OR] 1.48, P = 4.47 × 10-16 and OR 0.84, P = 3.55 × 10-8, respectively). Another variant at PAX4 192 codon Arg192Ser was nominally associated with T2D (OR 1.62, P = 5.18 × 10-4). In T2D patients, PAX4 Arg192His was associated with earlier age at diagnosis, and GLP1R Arg131Gln was associated with decreased risk of cardiovascular disease. In control subjects without diabetes, the PAX4 Arg192His was associated with higher fasting glucose and GLP1R Arg131Gln was associated with lower fasting glucose and HbA1c level. Gene-based analysis revealed that SLC30A8 was most significantly associated with decreased risk of T2D (P = 1.0 × 10-4). In summary, we have identified nonsynonymous variants associated with risk of T2D and related phenotypes in Koreans.


Assuntos
Diabetes Mellitus Tipo 2/genética , Predisposição Genética para Doença , Variação Genética , Receptor do Peptídeo Semelhante ao Glucagon 1/genética , Proteínas de Homeodomínio/genética , Fatores de Transcrição Box Pareados/genética , Polimorfismo de Nucleotídeo Único , Idoso , Alelos , Substituição de Aminoácidos , Grupo com Ancestrais do Continente Asiático , Estudos de Casos e Controles , Estudos de Coortes , Biologia Computacional , Bases de Dados Genéticas , Diabetes Mellitus Tipo 2/metabolismo , Sistemas Especialistas , Feminino , Frequência do Gene , Estudos de Associação Genética , Estudo de Associação Genômica Ampla , Receptor do Peptídeo Semelhante ao Glucagon 1/química , Receptor do Peptídeo Semelhante ao Glucagon 1/metabolismo , Proteínas de Homeodomínio/química , Proteínas de Homeodomínio/metabolismo , Humanos , Masculino , Pessoa de Meia-Idade , Fatores de Transcrição Box Pareados/química , Fatores de Transcrição Box Pareados/metabolismo , República da Coreia , Sequenciamento Completo do Exoma
15.
Nat Biotechnol ; 36(1): 89-94, 2018 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-29227470

RESUMO

Droplet single-cell RNA-sequencing (dscRNA-seq) has enabled rapid, massively parallel profiling of transcriptomes. However, assessing differential expression across multiple individuals has been hampered by inefficient sample processing and technical batch effects. Here we describe a computational tool, demuxlet, that harnesses natural genetic variation to determine the sample identity of each droplet containing a single cell (singlet) and detect droplets containing two cells (doublets). These capabilities enable multiplexed dscRNA-seq experiments in which cells from unrelated individuals are pooled and captured at higher throughput than in standard workflows. Using simulated data, we show that 50 single-nucleotide polymorphisms (SNPs) per cell are sufficient to assign 97% of singlets and identify 92% of doublets in pools of up to 64 individuals. Given genotyping data for each of eight pooled samples, demuxlet correctly recovers the sample identity of >99% of singlets and identifies doublets at rates consistent with previous estimates. We apply demuxlet to assess cell-type-specific changes in gene expression in 8 pooled lupus patient samples treated with interferon (IFN)-ß and perform eQTL analysis on 23 pooled samples.

16.
Genet Epidemiol ; 41(8): 744-755, 2017 12.
Artigo em Inglês | MEDLINE | ID: mdl-28861891

RESUMO

The accuracy of genotype imputation depends upon two factors: the sample size of the reference panel and the genetic similarity between the reference panel and the target samples. When multiple reference panels are not consented to combine together, it is unclear how to combine the imputation results to optimize the power of genetic association studies. We compared the accuracy of 9,265 Norwegian genomes imputed from three reference panels-1000 Genomes phase 3 (1000G), Haplotype Reference Consortium (HRC), and a reference panel containing 2,201 Norwegian participants from the population-based Nord Trøndelag Health Study (HUNT) from low-pass genome sequencing. We observed that the population-matched reference panel allowed for imputation of more population-specific variants with lower frequency (minor allele frequency (MAF) between 0.05% and 0.5%). The overall imputation accuracy from the population-specific panel was substantially higher than 1000G and was comparable with HRC, despite HRC being 15-fold larger. These results recapitulate the value of population-specific reference panels for genotype imputation. We also evaluated different strategies to utilize multiple sets of imputed genotypes to increase the power of association studies. We observed that testing association for all variants imputed from any panel results in higher power to detect association than the alternative strategy of including only one version of each genetic variant, selected for having the highest imputation quality metric. This was particularly true for lower frequency variants (MAF < 1%), even after adjusting for the additional multiple testing burden.


Assuntos
Estudo de Associação Genômica Ampla , Grupo com Ancestrais do Continente Europeu/genética , Frequência do Gene , Variação Genética , Genótipo , Haplótipos , Humanos , Desequilíbrio de Ligação , Noruega , Polimorfismo de Nucleotídeo Único , Sequenciamento Completo do Genoma
17.
Nat Commun ; 8: 15481, 2017 05 25.
Artigo em Inglês | MEDLINE | ID: mdl-28541271

RESUMO

Bicuspid aortic valve (BAV) is a heritable congenital heart defect and an important risk factor for valvulopathy and aortopathy. Here we report a genome-wide association scan of 466 BAV cases and 4,660 age, sex and ethnicity-matched controls with replication in up to 1,326 cases and 8,103 controls. We identify association with a noncoding variant 151 kb from the gene encoding the cardiac-specific transcription factor, GATA4, and near-significance for p.Ser377Gly in GATA4. GATA4 was interrupted by CRISPR-Cas9 in induced pluripotent stem cells from healthy donors. The disruption of GATA4 significantly impaired the transition from endothelial cells into mesenchymal cells, a critical step in heart valve development.


Assuntos
Valva Aórtica/anormalidades , Fator de Transcrição GATA4/genética , Variação Genética , Doenças das Valvas Cardíacas/genética , Substituição de Aminoácidos , Valva Aórtica/embriologia , Valva Aórtica/metabolismo , Sistemas CRISPR-Cas , Estudos de Casos e Controles , Transdiferenciação Celular/genética , Feminino , Fator de Transcrição GATA4/deficiência , Fator de Transcrição GATA4/metabolismo , Redes Reguladoras de Genes , Estudo de Associação Genômica Ampla , Cardiopatias Congênitas/genética , Doenças das Valvas Cardíacas/embriologia , Doenças das Valvas Cardíacas/metabolismo , Humanos , Células-Tronco Pluripotentes Induzidas/metabolismo , Células-Tronco Pluripotentes Induzidas/patologia , Masculino , Mutação de Sentido Incorreto , Fenótipo , RNA não Traduzido/genética
18.
PeerJ ; 5: e3329, 2017.
Artigo em Inglês | MEDLINE | ID: mdl-28503383

RESUMO

The objective of this study was to evaluate the effects of social relationships on the feed intake, eating behavior, and growth, upon exposure to a novel diet, in Hanwoo (Bos taurus coreanae) heifers during pregnancy. Twenty-four pregnant Hanwoo heifers, averaging 438 ± 27.8 kg in weight, 21 months in age, and 194 ± 8.5 days in pregnancy, were involved in a two-month (eight weeks) experiment. The heifers were randomly assigned to either the single housing group (SG; one individual per pen, n = 12), or the paired housing group (PG; two individuals per pen, n = 12). All pens were of the same size (5 × 5 m) and provided with one feed bin, which automatically recorded the individual feed intake and eating behavior. As the experiment began, the diet of the heifers was switched from a total mixed ration (TMR; 250 g/kg ryegrass straw and 750 g/kg concentrate mix) to a forage-only diet (mixed hay cubes composed of 500 g/kg alfalfa, 250 g/kg timothy, and 250 g/kg blue grass hay). The heifers were fed ad libitum twice a day. The individual feed intake and eating behavior were recorded daily throughout the experiment, and body weights (BWs) were measured every four weeks before the morning feeding. PG animals visited the feed bin 22% less often than SG. PG, however, stayed 39% longer in the feed bin and consumed 40% more feed per visit, compared with SG. Consequently, PG heifers spent 23% more time in eating and had 16% more daily dry matter intake than SG during the experiment. Average daily gain during the experimental period tended to be greater in PG than in SG. When pregnant Hanwoo heifers encountered a novel diet, social relationships (i.e., presence of a pen-mate) enhanced their time spent eating and feed intake. Social interactions, even with an unfamiliar individual, may be helpful for pregnant Hanwoo heifers cope with a diet challenge compared to solitary situation.

19.
Diabetes ; 66(7): 2019-2032, 2017 07.
Artigo em Inglês | MEDLINE | ID: mdl-28341696

RESUMO

To identify novel coding association signals and facilitate characterization of mechanisms influencing glycemic traits and type 2 diabetes risk, we analyzed 109,215 variants derived from exome array genotyping together with an additional 390,225 variants from exome sequence in up to 39,339 normoglycemic individuals from five ancestry groups. We identified a novel association between the coding variant (p.Pro50Thr) in AKT2 and fasting plasma insulin (FI), a gene in which rare fully penetrant mutations are causal for monogenic glycemic disorders. The low-frequency allele is associated with a 12% increase in FI levels. This variant is present at 1.1% frequency in Finns but virtually absent in individuals from other ancestries. Carriers of the FI-increasing allele had increased 2-h insulin values, decreased insulin sensitivity, and increased risk of type 2 diabetes (odds ratio 1.05). In cellular studies, the AKT2-Thr50 protein exhibited a partial loss of function. We extend the allelic spectrum for coding variants in AKT2 associated with disorders of glucose homeostasis and demonstrate bidirectional effects of variants within the pleckstrin homology domain of AKT2.


Assuntos
Diabetes Mellitus Tipo 2/genética , Grupo com Ancestrais do Continente Europeu/genética , Jejum/metabolismo , Resistência à Insulina/genética , Insulina/metabolismo , Proteínas Proto-Oncogênicas c-akt/genética , Afro-Americanos/genética , Alelos , Grupo com Ancestrais do Continente Asiático/genética , Estudos de Casos e Controles , Diabetes Mellitus Tipo 2/metabolismo , Finlândia , Frequência do Gene , Predisposição Genética para Doença , Genótipo , Hispano-Americanos/genética , Humanos , Razão de Chances
20.
Am J Hum Genet ; 99(4): 791-801, 2016 Oct 06.
Artigo em Inglês | MEDLINE | ID: mdl-27666372

RESUMO

Massively parallel whole-genome sequencing (WGS) data have ushered in a new era in human genetics. These data are now being used to understand the role of rare variants in complex traits and to advance the goals of precision medicine. The technological and computing advances that have enabled us to generate WGS data on thousands of individuals have also outpaced our ability to perform analyses in scientifically and statistically rigorous and thoughtful ways. The past several years have witnessed the application of whole-exome sequencing (WES) to complex traits and diseases. From our analysis of NHLBI Exome Sequencing Project (ESP) data, not only have a number of important disease and complex trait association findings emerged, but our collective experience offers some valuable lessons for WGS initiatives. These include caveats associated with generating automated pipelines for quality control and analysis of rare variants; the importance of studying minority populations; sample size requirements and efficient study designs for identifying rare-variant associations; and the significance of incidental findings in population-based genetic research. With the ESP as an example, we offer guidance and a framework on how to conduct a large-scale association study in the era of WGS.


Assuntos
Exoma/genética , Estudos de Associação Genética/métodos , Estudos de Associação Genética/normas , National Heart, Lung, and Blood Institute (U.S.) , Feminino , Variação Genética , Genoma Humano/genética , Guias como Assunto , Humanos , Masculino , Controle de Qualidade , Reprodutibilidade dos Testes , Análise de Sequência de DNA , Estados Unidos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA