Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 48
Filtrar
1.
Alzheimers Dement ; 20(5): 3290-3304, 2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-38511601

RESUMO

INTRODUCTION: Genome-wide association studies (GWAS) have identified loci associated with Alzheimer's disease (AD) but did not identify specific causal genes or variants within those loci. Analysis of whole genome sequence (WGS) data, which interrogates the entire genome and captures rare variations, may identify causal variants within GWAS loci. METHODS: We performed single common variant association analysis and rare variant aggregate analyses in the pooled population (N cases = 2184, N controls = 2383) and targeted analyses in subpopulations using WGS data from the Alzheimer's Disease Sequencing Project (ADSP). The analyses were restricted to variants within 100 kb of 83 previously identified GWAS lead variants. RESULTS: Seventeen variants were significantly associated with AD within five genomic regions implicating the genes OARD1/NFYA/TREML1, JAZF1, FERMT2, and SLC24A4. KAT8 was implicated by both single variant and rare variant aggregate analyses. DISCUSSION: This study demonstrates the utility of leveraging WGS to gain insights into AD loci identified via GWAS.


Assuntos
Doença de Alzheimer , Estudo de Associação Genômica Ampla , Sequenciamento Completo do Genoma , Humanos , Doença de Alzheimer/genética , Feminino , Masculino , Predisposição Genética para Doença/genética , Idoso , Polimorfismo de Nucleotídeo Único/genética , Variação Genética/genética
2.
JAMA Cardiol ; 9(3): 263-271, 2024 Mar 01.
Artigo em Inglês | MEDLINE | ID: mdl-38294787

RESUMO

Importance: Familial hypercholesterolemia (FH) is a genetic disorder that often results in severely high low-density lipoprotein cholesterol (LDL-C) and high risk of premature coronary heart disease (CHD). However, the impact of FH variants on CHD risk among individuals with moderately elevated LDL-C is not well quantified. Objective: To assess CHD risk associated with FH variants among individuals with moderately (130-189 mg/dL) and severely (≥190 mg/dL) elevated LDL-C and to quantify excess CHD deaths attributable to FH variants in US adults. Design, Setting, and Participants: A total of 21 426 individuals without preexisting CHD from 6 US cohort studies (Atherosclerosis Risk in Communities study, Coronary Artery Risk Development in Young Adults study, Cardiovascular Health Study, Framingham Heart Study Offspring cohort, Jackson Heart Study, and Multi-Ethnic Study of Atherosclerosis) were included, 63 of whom had an FH variant. Data were collected from 1971 to 2018, and the median (IQR) follow-up was 18 (13-28) years. Data were analyzed from March to May 2023. Exposures: LDL-C, cumulative past LDL-C, FH variant status. Main Outcomes and Measures: Cox proportional hazards models estimated associations between FH variants and incident CHD. The Cardiovascular Disease Policy Model projected excess CHD deaths associated with FH variants in US adults. Results: Of the 21 426 individuals without preexisting CHD (mean [SD] age 52.1 [15.5] years; 12 041 [56.2%] female), an FH variant was found in 22 individuals with moderately elevated LDL-C (0.3%) and in 33 individuals with severely elevated LDL-C (2.5%). The adjusted hazard ratios for incident CHD comparing those with and without FH variants were 2.9 (95% CI, 1.4-6.0) and 2.6 (95% CI, 1.4-4.9) among individuals with moderately and severely elevated LDL-C, respectively. The association between FH variants and CHD was slightly attenuated when further adjusting for baseline LDL-C level, whereas the association was no longer statistically significant after adjusting for cumulative past LDL-C exposure. Among US adults 20 years and older with no history of CHD and LDL-C 130 mg/dL or higher, more than 417 000 carry an FH variant and were projected to experience more than 12 000 excess CHD deaths in those with moderately elevated LDL-C and 15 000 in those with severely elevated LDL-C compared with individuals without an FH variant. Conclusions and Relevance: In this pooled cohort study, the presence of FH variants was associated with a 2-fold higher CHD risk, even when LDL-C was only moderately elevated. The increased CHD risk appeared to be largely explained by the higher cumulative LDL-C exposure in individuals with an FH variant compared to those without. Further research is needed to assess the value of adding genetic testing to traditional phenotypic FH screening.


Assuntos
Aterosclerose , Doenças Cardiovasculares , Doença da Artéria Coronariana , Hipercolesterolemia , Hiperlipoproteinemia Tipo II , Adulto Jovem , Humanos , Feminino , Pessoa de Meia-Idade , Masculino , Hipercolesterolemia/complicações , LDL-Colesterol/genética , Doenças Cardiovasculares/prevenção & controle , Estudos de Coortes , Fatores de Risco , Hiperlipoproteinemia Tipo II/diagnóstico , Doença da Artéria Coronariana/complicações , Aterosclerose/complicações , Fatores de Risco de Doenças Cardíacas
3.
bioRxiv ; 2023 Nov 02.
Artigo em Inglês | MEDLINE | ID: mdl-37961350

RESUMO

Large-scale whole-genome sequencing (WGS) studies have improved our understanding of the contributions of coding and noncoding rare variants to complex human traits. Leveraging association effect sizes across multiple traits in WGS rare variant association analysis can improve statistical power over single-trait analysis, and also detect pleiotropic genes and regions. Existing multi-trait methods have limited ability to perform rare variant analysis of large-scale WGS data. We propose MultiSTAAR, a statistical framework and computationally-scalable analytical pipeline for functionally-informed multi-trait rare variant analysis in large-scale WGS studies. MultiSTAAR accounts for relatedness, population structure and correlation among phenotypes by jointly analyzing multiple traits, and further empowers rare variant association analysis by incorporating multiple functional annotations. We applied MultiSTAAR to jointly analyze three lipid traits (low-density lipoprotein cholesterol, high-density lipoprotein cholesterol and triglycerides) in 61,861 multi-ethnic samples from the Trans-Omics for Precision Medicine (TOPMed) Program. We discovered new associations with lipid traits missed by single-trait analysis, including rare variants within an enhancer of NIPSNAP3A and an intergenic region on chromosome 1.

4.
Clin Epigenetics ; 15(1): 173, 2023 10 27.
Artigo em Inglês | MEDLINE | ID: mdl-37891690

RESUMO

BACKGROUND: Insulin resistance (IR) is a major risk factor for Alzheimer's disease (AD) dementia. The mechanisms by which IR predisposes to AD are not well-understood. Epigenetic studies may help identify molecular signatures of IR associated with AD, thus improving our understanding of the biological and regulatory mechanisms linking IR and AD. METHODS: We conducted an epigenome-wide association study of IR, quantified using the homeostatic model assessment of IR (HOMA-IR) and adjusted for body mass index, in 3,167 participants from the Framingham Heart Study (FHS) without type 2 diabetes at the time of blood draw used for methylation measurement. We identified DNA methylation markers associated with IR at the genome-wide level accounting for multiple testing (P < 1.1 × 10-7) and evaluated their association with neurological traits in participants from the FHS (N = 3040) and the Religious Orders Study/Memory and Aging Project (ROSMAP, N = 707). DNA methylation profiles were measured in blood (FHS) or dorsolateral prefrontal cortex (ROSMAP) using the Illumina HumanMethylation450 BeadChip. Linear regressions (ROSMAP) or mixed-effects models accounting for familial relatedness (FHS) adjusted for age, sex, cohort, self-reported race, batch, and cell type proportions were used to assess associations between DNA methylation and neurological traits accounting for multiple testing. RESULTS: We confirmed the strong association of blood DNA methylation with IR at three loci (cg17901584-DHCR24, cg17058475-CPT1A, cg00574958-CPT1A, and cg06500161-ABCG1). In FHS, higher levels of blood DNA methylation at cg00574958 and cg17058475 were both associated with lower IR (P = 2.4 × 10-11 and P = 9.0 × 10-8), larger total brain volumes (P = 0.03 and P = 9.7 × 10-4), and smaller log lateral ventricular volumes (P = 0.07 and P = 0.03). In ROSMAP, higher levels of brain DNA methylation at the same two CPT1A markers were associated with greater risk of cognitive impairment (P = 0.005 and P = 0.02) and higher AD-related indices (CERAD score: P = 5 × 10-4 and 0.001; Braak stage: P = 0.004 and P = 0.01). CONCLUSIONS: Our results suggest potentially distinct epigenetic regulatory mechanisms between peripheral blood and dorsolateral prefrontal cortex tissues underlying IR and AD at CPT1A locus.


Assuntos
Doença de Alzheimer , Diabetes Mellitus Tipo 2 , Resistência à Insulina , Humanos , Doença de Alzheimer/genética , Diabetes Mellitus Tipo 2/genética , Metilação de DNA , Epigênese Genética , Marcadores Genéticos , Estudo de Associação Genômica Ampla/métodos , Resistência à Insulina/genética
5.
medRxiv ; 2023 Aug 29.
Artigo em Inglês | MEDLINE | ID: mdl-37693453

RESUMO

INTRODUCTION: Genome-wide association studies (GWAS) have identified loci associated with Alzheimer's disease (AD) but did not identify specific causal genes or variants within those loci. Analysis of whole genome sequence (WGS) data, which interrogates the entire genome and captures rare variations, may identify causal variants within GWAS loci. METHODS: We performed single common variant association analysis and rare variant aggregate analyses in the pooled population (N cases=2,184, N controls=2,383) and targeted analyses in sub-populations using WGS data from the Alzheimer's Disease Sequencing Project (ADSP). The analyses were restricted to variants within 100 kb of 83 previously identified GWAS lead variants. RESULTS: Seventeen variants were significantly associated with AD within five genomic regions implicating the genes OARD1/NFYA/TREML1, JAZF1, FERMT2, and SLC24A4. KAT8 was implicated by both single variant and rare variant aggregate analyses. DISCUSSION: This study demonstrates the utility of leveraging WGS to gain insights into AD loci identified via GWAS.

6.
medRxiv ; 2023 Sep 02.
Artigo em Inglês | MEDLINE | ID: mdl-37693521

RESUMO

Alzheimer's Disease (AD) is a common disorder of the elderly that is both highly heritable and genetically heterogeneous. Here, we investigated the association between AD and both common variants and aggregates of rare coding and noncoding variants in 13,371 individuals of diverse ancestry with whole genome sequence (WGS) data. Pooled-population analyses identified genetic variants in or near APOE, BIN1, and LINC00320 significantly associated with AD (p < 5×10-8). Population-specific analyses identified a haplotype on chromosome 14 including PSEN1 associated with AD in Hispanics, further supported by aggregate testing of rare coding and noncoding variants in this region. Finally, we observed suggestive associations (p < 5×10-5) of aggregates of rare coding rare variants in ABCA7 among non-Hispanic Whites (p=5.4×10-6), and rare noncoding variants in the promoter of TOMM40 distinct of APOE in pooled-population analyses (p=7.2×10-8). Complementary pooled-population and population-specific analyses offered unique insights into the genetic architecture of AD.

7.
Sci Rep ; 13(1): 12952, 2023 08 10.
Artigo em Inglês | MEDLINE | ID: mdl-37563237

RESUMO

Expression quantitative trait methylation (eQTM) analysis identifies DNA CpG sites at which methylation is associated with gene expression. The present study describes an eQTM resource of CpG-transcript pairs derived from whole blood DNA methylation and RNA sequencing gene expression data in 2115 Framingham Heart Study participants. We identified 70,047 significant cis CpG-transcript pairs at p < 1E-7 where the top most significant eGenes (i.e., gene transcripts associated with a CpG) were enriched in biological pathways related to cell signaling, and for 1208 clinical traits (enrichment false discovery rate [FDR] ≤ 0.05). We also identified 246,667 significant trans CpG-transcript pairs at p < 1E-14 where the top most significant eGenes were enriched in biological pathways related to activation of the immune response, and for 1191 clinical traits (enrichment FDR ≤ 0.05). Independent and external replication of the top 1000 significant cis and trans CpG-transcript pairs was completed in the Women's Health Initiative and Jackson Heart Study cohorts. Using significant cis CpG-transcript pairs, we identified significant mediation of the association between CpG sites and cardiometabolic traits through gene expression and identified shared genetic regulation between CpGs and transcripts associated with cardiometabolic traits. In conclusion, we developed a robust and powerful resource of whole blood eQTM CpG-transcript pairs that can help inform future functional studies that seek to understand the molecular basis of disease.


Assuntos
Doenças Cardiovasculares , Metilação de DNA , Humanos , Feminino , Locos de Características Quantitativas , Regulação da Expressão Gênica , Estudos Longitudinais , Doenças Cardiovasculares/genética , Ilhas de CpG/genética , Estudo de Associação Genômica Ampla
9.
Cell Rep Med ; 3(12): 100844, 2022 12 20.
Artigo em Inglês | MEDLINE | ID: mdl-36513073

RESUMO

We develop a closed-form Haseman-Elston estimator for genetic and environmental correlation coefficients between complex phenotypes, which we term HEc, that is as precise as GCTA yet ∼20× faster. We estimate genetic and environmental correlations between over 7,000 phenotype pairs in subgroups from the Trans-Omics in Precision Medicine (TOPMed) program. We demonstrate substantial differences in both heritabilities and genetic correlations for multiple phenotypes and phenotype pairs between individuals of self-reported Black, Hispanic/Latino, and White backgrounds. We similarly observe differences in many of the genetic and environmental correlations between genders. To estimate the contribution of genetics to the observed phenotypic correlation, we introduce "fractional genetic correlation" as the fraction of phenotypic correlation explained by genetics. Finally, we quantify the enrichment of correlations between phenotypic domains, each of which is comprised of multiple phenotypes. Altogether, we demonstrate that the observed correlations between complex human phenotypes depend on the genetic background of the individuals, their gender, and their environment.


Assuntos
Patrimônio Genético , Humanos , Masculino , Feminino , Fenótipo
10.
Sci Rep ; 12(1): 20167, 2022 11 23.
Artigo em Inglês | MEDLINE | ID: mdl-36424512

RESUMO

To create a scientific resource of expression quantitative trail loci (eQTL), we conducted a genome-wide association study (GWAS) using genotypes obtained from whole genome sequencing (WGS) of DNA and gene expression levels from RNA sequencing (RNA-seq) of whole blood in 2622 participants in Framingham Heart Study. We identified 6,778,286 cis-eQTL variant-gene transcript (eGene) pairs at p < 5 × 10-8 (2,855,111 unique cis-eQTL variants and 15,982 unique eGenes) and 1,469,754 trans-eQTL variant-eGene pairs at p < 1e-12 (526,056 unique trans-eQTL variants and 7233 unique eGenes). In addition, 442,379 cis-eQTL variants were associated with expression of 1518 long non-protein coding RNAs (lncRNAs). Gene Ontology (GO) analyses revealed that the top GO terms for cis-eGenes are enriched for immune functions (FDR < 0.05). The cis-eQTL variants are enriched for SNPs reported to be associated with 815 traits in prior GWAS, including cardiovascular disease risk factors. As proof of concept, we used this eQTL resource in conjunction with genetic variants from public GWAS databases in causal inference testing (e.g., COVID-19 severity). After Bonferroni correction, Mendelian randomization analyses identified putative causal associations of 60 eGenes with systolic blood pressure, 13 genes with coronary artery disease, and seven genes with COVID-19 severity. This study created a comprehensive eQTL resource via BioData Catalyst that will be made available to the scientific community. This will advance understanding of the genetic architecture of gene expression underlying a wide range of diseases.


Assuntos
Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Locos de Características Quantitativas , Humanos , DNA , Expressão Gênica , Locos de Características Quantitativas/genética , Análise de Sequência de RNA
11.
Sci Rep ; 12(1): 19564, 2022 11 15.
Artigo em Inglês | MEDLINE | ID: mdl-36380121

RESUMO

DNA methylation commonly occurs at cytosine-phosphate-guanine sites (CpGs) that can serve as biomarkers for many diseases. We analyzed whole genome sequencing data to identify DNA methylation quantitative trait loci (mQTLs) in 4126 Framingham Heart Study participants. Our mQTL mapping identified 94,362,817 cis-mQTLvariant-CpG pairs (for 210,156 unique autosomal CpGs) at P < 1e-7 and 33,572,145 trans-mQTL variant-CpG pairs (for 213,606 unique autosomal CpGs) at P < 1e-14. Using cis-mQTL variants for 1258 CpGs associated with seven cardiovascular disease (CVD) risk factors, we found 104 unique CpGs that colocalized with at least one CVD trait. For example, cg11554650 (PPP1R18) colocalized with type 2 diabetes, and was driven by a single nucleotide polymorphism (rs2516396). We performed Mendelian randomization (MR) analysis and demonstrated 58 putatively causal relations of CVD risk factor-associated CpGs to one or more risk factors (e.g., cg05337441 [APOB] with LDL; MR P = 1.2e-99, and 17 causal associations with coronary artery disease (e.g. cg08129017 [SREBF1] with coronary artery disease; MR P = 5e-13). We also showed that three CpGs, e.g., cg14893161 (PM20D1), are putatively causally associated with COVID-19 severity. To assist in future analyses of the role of DNA methylation in disease pathogenesis, we have posted a comprehensive summary data set in the National Heart, Lung, and Blood Institute's BioData Catalyst.


Assuntos
COVID-19 , Doença da Artéria Coronariana , Diabetes Mellitus Tipo 2 , Humanos , Metilação de DNA , Diabetes Mellitus Tipo 2/genética , Doença da Artéria Coronariana/genética , Locos de Características Quantitativas , Polimorfismo de Nucleotídeo Único , Citosina , Ilhas de CpG/genética , Estudo de Associação Genômica Ampla
12.
Res Sq ; 2022 May 31.
Artigo em Inglês | MEDLINE | ID: mdl-35664994

RESUMO

To create a scientific resource of expression quantitative trail loci (eQTL), we conducted a genome-wide association study (GWAS) using genotypes obtained from whole genome sequencing (WGS) of DNA and gene expression levels from RNA sequencing (RNA-seq) of whole blood in 2622 participants in Framingham Heart Study. We identified 6,778,286 cis -eQTL variant-gene transcript (eGene) pairs at p < 5x10 - 8 (2,855,111 unique cis -eQTL variants and 15,982 unique eGenes) and 1,469,754 trans -eQTL variant-eGene pairs at p < 1e-12 (526,056 unique trans -eQTL variants and 7,233 unique eGenes). In addition, 442,379 cis -eQTL variants were associated with expression of 1518 long non-protein coding RNAs (lncRNAs). Gene Ontology (GO) analyses revealed that the top GO terms for cis- eGenes are enriched for immune functions (FDR < 0.05). The cis -eQTL variants are enriched for SNPs reported to be associated with 815 traits in prior GWAS, including cardiovascular disease risk factors. As proof of concept, we used this eQTL resource in conjunction with genetic variants from public GWAS databases in causal inference testing (e.g., COVID-19 severity). After Bonferroni correction, Mendelian randomization analyses identified putative causal associations of 60 eGenes with systolic blood pressure, 13 genes with coronary artery disease, and seven genes with COVID-19 severity. This study created a comprehensive eQTL resource via BioData Catalyst that will be made available to the scientific community. This will advance understanding of the genetic architecture of gene expression underlying a wide range of diseases.

13.
Am J Hum Genet ; 109(6): 1077-1091, 2022 06 02.
Artigo em Inglês | MEDLINE | ID: mdl-35580588

RESUMO

Hearing loss is one of the top contributors to years lived with disability and is a risk factor for dementia. Molecular evidence on the cellular origins of hearing loss in humans is growing. Here, we performed a genome-wide association meta-analysis of clinically diagnosed and self-reported hearing impairment on 723,266 individuals and identified 48 significant loci, 10 of which are novel. A large proportion of associations comprised missense variants, half of which lie within known familial hearing loss loci. We used single-cell RNA-sequencing data from mouse cochlea and brain and mapped common-variant genomic results to spindle, root, and basal cells from the stria vascularis, a structure in the cochlea necessary for normal hearing. Our findings indicate the importance of the stria vascularis in the mechanism of hearing impairment, providing future paths for developing targets for therapeutic intervention in hearing loss.


Assuntos
Surdez , Perda Auditiva , Animais , Cóclea , Estudo de Associação Genômica Ampla , Perda Auditiva/genética , Humanos , Camundongos , Estria Vascular
14.
medRxiv ; 2022 May 03.
Artigo em Inglês | MEDLINE | ID: mdl-35547845

RESUMO

To create a scientific resource of expression quantitative trail loci (eQTL), we conducted a genome-wide association study (GWAS) using genotypes obtained from whole genome sequencing (WGS) of DNA and gene expression levels from RNA sequencing (RNA-seq) of whole blood in 2622 participants in Framingham Heart Study. We identified 6,778,286 cis -eQTL variant-gene transcript (eGene) pairs at p <5×10 -8 (2,855,111 unique cis -eQTL variants and 15,982 unique eGenes) and 1,469,754 trans -eQTL variant-eGene pairs at p <1e-12 (526,056 unique trans -eQTL variants and 7,233 unique eGenes). In addition, 442,379 cis -eQTL variants were associated with expression of 1518 long non-protein coding RNAs (lncRNAs). Gene Ontology (GO) analyses revealed that the top GO terms for cis- eGenes are enriched for immune functions (FDR <0.05). The cis -eQTL variants are enriched for SNPs reported to be associated with 815 traits in prior GWAS, including cardiovascular disease risk factors. As proof of concept, we used this eQTL resource in conjunction with genetic variants from public GWAS databases in causal inference testing (e.g., COVID-19 severity). After Bonferroni correction, Mendelian randomization analyses identified putative causal associations of 60 eGenes with systolic blood pressure, 13 genes with coronary artery disease, and seven genes with COVID-19 severity. This study created a comprehensive eQTL resource via BioData Catalyst that will be made available to the scientific community. This will advance understanding of the genetic architecture of gene expression underlying a wide range of diseases.

15.
Stroke ; 53(3): 875-885, 2022 03.
Artigo em Inglês | MEDLINE | ID: mdl-34727735

RESUMO

BACKGROUND AND PURPOSE: Stroke is the leading cause of death and long-term disability worldwide. Previous genome-wide association studies identified 51 loci associated with stroke (mostly ischemic) and its subtypes among predominantly European populations. Using whole-genome sequencing in ancestrally diverse populations from the Trans-Omics for Precision Medicine (TOPMed) Program, we aimed to identify novel variants, especially low-frequency or ancestry-specific variants, associated with all stroke, ischemic stroke and its subtypes (large artery, cardioembolic, and small vessel), and hemorrhagic stroke and its subtypes (intracerebral and subarachnoid). METHODS: Whole-genome sequencing data were available for 6833 stroke cases and 27 116 controls, including 22 315 European, 7877 Black, 2616 Hispanic/Latino, 850 Asian, 54 Native American, and 237 other ancestry participants. In TOPMed, we performed single variant association analysis examining 40 million common variants and aggregated association analysis focusing on rare variants. We also combined TOPMed European populations with over 28 000 additional European participants from the UK BioBank genome-wide array data through meta-analysis. RESULTS: In the single variant association analysis in TOPMed, we identified one novel locus 13q33 for large artery at whole-genome-wide significance (P<5.00×10-9) and 4 novel loci at genome-wide significance (P<5.00×10-8), all of which need confirmation in independent studies. Lead variants in all 5 loci are low-frequency but are more common in non-European populations. An aggregation of synonymous rare variants within the gene C6orf26 demonstrated suggestive evidence of association for hemorrhagic stroke (P<3.11×10-6). By meta-analyzing European ancestry samples in TOPMed and UK BioBank, we replicated several previously reported stroke loci including PITX2, HDAC9, ZFHX3, and LRCH1. CONCLUSIONS: We represent the first association analysis for stroke and its subtypes using whole-genome sequencing data from ancestrally diverse populations. While our findings suggest the potential benefits of combining whole-genome sequencing data with populations of diverse genetic backgrounds to identify possible low-frequency or ancestry-specific variants, they also highlight the need to increase genome coverage and sample sizes.


Assuntos
Loci Gênicos , Predisposição Genética para Doença , Polimorfismo de Nucleotídeo Único , Medicina de Precisão , Grupos Raciais/genética , Acidente Vascular Cerebral/genética , Idoso , Idoso de 80 Anos ou mais , Feminino , Estudo de Associação Genômica Ampla , Humanos , Masculino , Pessoa de Meia-Idade , Sequenciamento Completo do Genoma
16.
Am J Epidemiol ; 190(10): 1977-1992, 2021 10 01.
Artigo em Inglês | MEDLINE | ID: mdl-33861317

RESUMO

Genotype-phenotype association studies often combine phenotype data from multiple studies to increase statistical power. Harmonization of the data usually requires substantial effort due to heterogeneity in phenotype definitions, study design, data collection procedures, and data-set organization. Here we describe a centralized system for phenotype harmonization that includes input from phenotype domain and study experts, quality control, documentation, reproducible results, and data-sharing mechanisms. This system was developed for the National Heart, Lung, and Blood Institute's Trans-Omics for Precision Medicine (TOPMed) program, which is generating genomic and other -omics data for more than 80 studies with extensive phenotype data. To date, 63 phenotypes have been harmonized across thousands of participants (recruited in 1948-2012) from up to 17 studies per phenotype. Here we discuss challenges in this undertaking and how they were addressed. The harmonized phenotype data and associated documentation have been submitted to National Institutes of Health data repositories for controlled access by the scientific community. We also provide materials to facilitate future harmonization efforts by the community, which include 1) the software code used to generate the 63 harmonized phenotypes, enabling others to reproduce, modify, or extend these harmonizations to additional studies, and 2) the results of labeling thousands of phenotype variables with controlled vocabulary terms.


Assuntos
Estudos de Associação Genética/métodos , Fenômica/métodos , Medicina de Precisão/métodos , Agregação de Dados , Humanos , Disseminação de Informação , National Heart, Lung, and Blood Institute (U.S.) , Fenótipo , Avaliação de Programas e Projetos de Saúde , Estados Unidos
17.
Proc Natl Acad Sci U S A ; 117(5): 2560-2569, 2020 02 04.
Artigo em Inglês | MEDLINE | ID: mdl-31964835

RESUMO

De novo mutations (DNMs), or mutations that appear in an individual despite not being seen in their parents, are an important source of genetic variation whose impact is relevant to studies of human evolution, genetics, and disease. Utilizing high-coverage whole-genome sequencing data as part of the Trans-Omics for Precision Medicine (TOPMed) Program, we called 93,325 single-nucleotide DNMs across 1,465 trios from an array of diverse human populations, and used them to directly estimate and analyze DNM counts, rates, and spectra. We find a significant positive correlation between local recombination rate and local DNM rate, and that DNM rate explains a substantial portion (8.98 to 34.92%, depending on the model) of the genome-wide variation in population-level genetic variation from 41K unrelated TOPMed samples. Genome-wide heterozygosity does correlate with DNM rate, but only explains <1% of variation. While we are underpowered to see small differences, we do not find significant differences in DNM rate between individuals of European, African, and Latino ancestry, nor across ancestrally distinct segments within admixed individuals. However, we did find significantly fewer DNMs in Amish individuals, even when compared with other Europeans, and even after accounting for parental age and sequencing center. Specifically, we found significant reductions in the number of C→A and T→C mutations in the Amish, which seem to underpin their overall reduction in DNMs. Finally, we calculated near-zero estimates of narrow sense heritability (h2), which suggest that variation in DNM rate is significantly shaped by nonadditive genetic effects and the environment.


Assuntos
Amish/genética , Genoma Humano , Adulto , Estudos de Coortes , Análise Mutacional de DNA , Feminino , Genética Populacional , Heterozigoto , Humanos , Masculino , Mutação , Linhagem , Sequenciamento Completo do Genoma , Adulto Jovem
18.
Sci Rep ; 9(1): 15192, 2019 10 23.
Artigo em Inglês | MEDLINE | ID: mdl-31645637

RESUMO

Previous research has shown that genes play a substantial role in determining a person's susceptibility to age-related hearing impairment. The existing studies on this subject have different results, which may be caused by difficulties in determining the phenotype or the limited number of participants involved. Here, we have gathered the largest sample to date (discovery n = 9,675; replication n = 10,963; validation n = 356,141), and examined phenotypes that represented low/mid and high frequency hearing loss on the pure tone audiogram. We identified 7 loci that were either replicated and/or validated, of which 5 loci are novel in hearing. Especially the ILDR1 gene is a high profile candidate, as it contains our top SNP, is a known hearing loss gene, has been linked to age-related hearing impairment before, and in addition is preferentially expressed within hair cells of the inner ear. By verifying all previously published SNPs, we can present a paper that combines all new and existing findings to date, giving a complete overview of the genetic architecture of age-related hearing impairment. This is of importance as age-related hearing impairment is highly prevalent in our ageing society and represents a large socio-economic burden.


Assuntos
Envelhecimento/genética , Loci Gênicos , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Perda Auditiva/genética , Animais , Vias Auditivas/metabolismo , Feminino , Regulação da Expressão Gênica , Humanos , Masculino , Camundongos , Pessoa de Meia-Idade , Anotação de Sequência Molecular , Fenótipo , Reprodutibilidade dos Testes
19.
Nature ; 570(7759): 71-76, 2019 06.
Artigo em Inglês | MEDLINE | ID: mdl-31118516

RESUMO

Protein-coding genetic variants that strongly affect disease risk can yield relevant clues to disease pathogenesis. Here we report exome-sequencing analyses of 20,791 individuals with type 2 diabetes (T2D) and 24,440 non-diabetic control participants from 5 ancestries. We identify gene-level associations of rare variants (with minor allele frequencies of less than 0.5%) in 4 genes at exome-wide significance, including a series of more than 30 SLC30A8 alleles that conveys protection against T2D, and in 12 gene sets, including those corresponding to T2D drug targets (P = 6.1 × 10-3) and candidate genes from knockout mice (P = 5.2 × 10-3). Within our study, the strongest T2D gene-level signals for rare variants explain at most 25% of the heritability of the strongest common single-variant signals, and the gene-level effect sizes of the rare variants that we observed in established T2D drug targets will require 75,000-185,000 sequenced cases to achieve exome-wide significance. We propose a method to interpret these modest rare-variant associations and to incorporate these associations into future target or gene prioritization efforts.


Assuntos
Diabetes Mellitus Tipo 2/genética , Sequenciamento do Exoma , Exoma/genética , Animais , Estudos de Casos e Controles , Técnicas de Apoio para a Decisão , Feminino , Frequência do Gene , Estudo de Associação Genômica Ampla , Humanos , Masculino , Camundongos , Camundongos Knockout
20.
Nat Genet ; 51(3): 452-469, 2019 03.
Artigo em Inglês | MEDLINE | ID: mdl-30778226

RESUMO

Body-fat distribution is a risk factor for adverse cardiovascular health consequences. We analyzed the association of body-fat distribution, assessed by waist-to-hip ratio adjusted for body mass index, with 228,985 predicted coding and splice site variants available on exome arrays in up to 344,369 individuals from five major ancestries (discovery) and 132,177 European-ancestry individuals (validation). We identified 15 common (minor allele frequency, MAF ≥5%) and nine low-frequency or rare (MAF <5%) coding novel variants. Pathway/gene set enrichment analyses identified lipid particle, adiponectin, abnormal white adipose tissue physiology and bone development and morphology as important contributors to fat distribution, while cross-trait associations highlight cardiometabolic traits. In functional follow-up analyses, specifically in Drosophila RNAi-knockdowns, we observed a significant increase in the total body triglyceride levels for two genes (DNAH10 and PLXND1). We implicate novel genes in fat distribution, stressing the importance of interrogating low-frequency and protein-coding variants.


Assuntos
Predisposição Genética para Doença/genética , Variação Genética/genética , Homeostase/genética , Lipídeos/genética , Proteínas/genética , Animais , Distribuição da Gordura Corporal/métodos , Índice de Massa Corporal , Estudos de Casos e Controles , Drosophila/genética , Exoma/genética , Feminino , Frequência do Gene/genética , Estudo de Associação Genômica Ampla/métodos , Humanos , Masculino , Fatores de Risco , Relação Cintura-Quadril/métodos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA