Results 1 - 20 of 699
1.
Am J Hum Genet ; 111(3): 445-455, 2024 03 07.
Article in English | MEDLINE | ID: mdl-38320554

ABSTRACT

Regulation of transcription and translation are mechanisms through which genetic variants affect complex traits. Expression quantitative trait locus (eQTL) studies have been more successful at identifying cis-eQTL (within 1 Mb of the transcription start site) than trans-eQTL. Here, we tested the cis component of gene expression for association with observed plasma protein levels to identify cis- and trans-acting genes that regulate protein levels. We used transcriptome prediction models from 49 Genotype-Tissue Expression (GTEx) Project tissues to predict the cis component of gene expression and tested the predicted expression of every gene in every tissue for association with the observed abundance of 3,622 plasma proteins measured in 3,301 individuals from the INTERVAL study. We tested significant results for replication in 971 individuals from the Trans-omics for Precision Medicine (TOPMed) Multi-Ethnic Study of Atherosclerosis (MESA). We found 1,168 and 1,210 cis- and trans-acting associations that replicated in TOPMed (FDR < 0.05) with a median expected true positive rate (π₁) across tissues of 0.806 and 0.390, respectively. The target proteins of trans-acting genes were enriched for transcription factor binding sites and autoimmune diseases in the GWAS catalog. Furthermore, we found a higher correlation between predicted expression and protein levels of the same underlying gene (R = 0.17) than observed expression (R = 0.10, p = 7.50 × 10⁻¹¹). This indicates the cis-acting genetically regulated (heritable) component of gene expression is more consistent across tissues than total observed expression (genetics + environment) and is useful in uncovering the function of SNPs associated with complex traits.


Subjects
Proteome; Transcriptome; Humans; Transcriptome/genetics; Proteome/genetics; Multifactorial Inheritance; Quantitative Trait Loci/genetics; Genome-Wide Association Study; Polymorphism, Single Nucleotide/genetics
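
The abstract above reports associations at FDR < 0.05 but does not name the procedure used. As a minimal sketch, assuming the common Benjamini-Hochberg step-up procedure (an assumption, not confirmed by the abstract), FDR control over a list of p-values could look like this; the function name and example values are illustrative only.

```python
def benjamini_hochberg(p_values, alpha=0.05):
    """Return a list of booleans, True where the test is rejected at FDR level alpha.

    Assumes the Benjamini-Hochberg step-up procedure; the abstract does not
    state which FDR method the authors used.
    """
    m = len(p_values)
    # Indices of the p-values in ascending order.
    order = sorted(range(m), key=lambda i: p_values[i])
    # Largest 1-based rank k whose p-value satisfies p_(k) <= (k / m) * alpha.
    max_k = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * alpha:
            max_k = rank
    # Step-up rule: reject every hypothesis ranked at or below max_k.
    rejected = [False] * m
    for rank, idx in enumerate(order, start=1):
        if rank <= max_k:
            rejected[idx] = True
    return rejected


if __name__ == "__main__":
    example = [0.001, 0.008, 0.039, 0.041, 0.042, 0.06, 0.074, 0.205, 0.212, 0.216]
    print(benjamini_hochberg(example))
```

Because the rule is step-up, a p-value that misses its own rank threshold is still rejected when a larger p-value further down the sorted list meets its threshold.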
2.
Am J Hum Genet ; 111(1): 133-149, 2024 01 04.
Article in English | MEDLINE | ID: mdl-38181730

ABSTRACT

Bulk-tissue molecular quantitative trait loci (QTLs) have been the starting point for interpreting disease-associated variants, and context-specific QTLs show particular relevance for disease. Here, we present the results of mapping interaction QTLs (iQTLs) for cell type, age, and other phenotypic variables in multi-omic, longitudinal data from the blood of individuals of diverse ancestries. By modeling the interaction between genotype and estimated cell-type proportions, we demonstrate that cell-type iQTLs could be considered as proxies for cell-type-specific QTL effects, particularly for the most abundant cell type in the tissue. The interpretation of age iQTLs, however, warrants caution because the moderation effect of age on the genotype and molecular phenotype association could be mediated by changes in cell-type composition. Finally, we show that cell-type iQTLs contribute to cell-type-specific enrichment of diseases that, in combination with additional functional data, could guide future functional studies. Overall, this study highlights the use of iQTLs to gain insights into the context specificity of regulatory effects.


Subjects
Gene Expression Regulation; Quantitative Trait Loci; Humans; Quantitative Trait Loci/genetics; Genotype; Phenotype
3.
Am J Hum Genet ; 111(5): 990-995, 2024 05 02.
Article in English | MEDLINE | ID: mdl-38636510

ABSTRACT

Since genotype imputation was introduced, researchers have been relying on the estimated imputation quality from imputation software to perform post-imputation quality control (QC). However, this quality estimate (denoted as Rsq) performs less well for lower-frequency variants. We recently published MagicalRsq, a machine-learning-based imputation quality calibration, which leverages additional typed markers from the same cohort and outperforms Rsq as a QC metric. In this work, we extended the original MagicalRsq to allow cross-cohort model training and named the new model MagicalRsq-X. We removed the cohort-specific estimated minor allele frequency and included linkage disequilibrium scores and recombination rates as additional features. Leveraging whole-genome sequencing data from TOPMed, specifically participants in the BioMe, JHS, WHI, and MESA studies, we performed comprehensive cross-cohort evaluations for predominantly European and African ancestral individuals based on their inferred global ancestry with the 1000 Genomes and Human Genome Diversity Project data as reference. Our results suggest MagicalRsq-X outperforms Rsq in almost every setting, with 7.3%-14.4% improvement in squared Pearson correlation with true R², corresponding to 85-218K variant gains. We further developed a metric to quantify the genetic distances of a target cohort relative to a reference cohort and showed that such metric largely explained the performance of MagicalRsq-X models. Finally, we found MagicalRsq-X saved up to 53 known genome-wide significant variants in one of the largest blood cell trait GWASs that would be missed using the original Rsq for QC. In conclusion, MagicalRsq-X shows superiority for post-imputation QC and benefits genetic studies by distinguishing well and poorly imputed lower-frequency variants.


Subjects
Gene Frequency; Genotype; Polymorphism, Single Nucleotide; Software; Humans; Cohort Studies; Linkage Disequilibrium; Genome-Wide Association Study/methods; Genome, Human; Quality Control; Machine Learning; Whole Genome Sequencing/standards; Whole Genome Sequencing/methods
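
The abstract above evaluates quality metrics by their squared Pearson correlation with the true R². A minimal sketch of the underlying quantity follows, assuming that true R² for one variant means the squared Pearson correlation between sequenced genotypes and imputed dosages across individuals (a common definition, though the abstract does not spell it out). The function name and example values are hypothetical.

```python
def squared_pearson(x, y):
    """Squared Pearson correlation between two equal-length numeric sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        # A monomorphic variant carries no information; report 0 by convention here.
        return 0.0
    return sxy * sxy / (sxx * syy)


if __name__ == "__main__":
    true_genotypes = [0, 1, 2, 1, 0, 0]               # hypothetical sequenced genotypes
    imputed_dosages = [0.1, 0.9, 1.7, 1.2, 0.0, 0.3]  # hypothetical imputed dosages
    print(squared_pearson(true_genotypes, imputed_dosages))
```

Returning 0.0 for a variant with no variation is a design choice made for this sketch; a QC pipeline might instead flag such variants as undefined.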
4.
Hum Mol Genet ; 33(16): 1429-1441, 2024 Aug 06.
Article in English | MEDLINE | ID: mdl-38747556

ABSTRACT

Inflammation biomarkers can provide valuable insight into the role of inflammatory processes in many diseases and conditions. Sequencing-based analyses of such biomarkers can also serve as an exemplar of the genetic architecture of quantitative traits. To evaluate the biological insight that a multi-ancestry, whole-genome-based association study can provide, we performed a comprehensive analysis of 21 inflammation biomarkers from up to 38,465 individuals with whole-genome sequencing from the Trans-Omics for Precision Medicine (TOPMed) program (with varying sample size by trait, where the minimum sample size was n = 737 for MMP-1). We identified 22 distinct single-variant associations across 6 traits (E-selectin, intercellular adhesion molecule 1, interleukin-6, lipoprotein-associated phospholipase A2 activity and mass, and P-selectin) that remained significant after conditioning on previously identified associations for these inflammatory biomarkers. We further expanded upon known biomarker associations by pairing the single-variant analysis with a rare variant set-based analysis that further identified 19 significant rare variant set-based associations with 5 traits. These signals were distinct from both significant single variant association signals within TOPMed and genetic signals observed in prior studies, demonstrating the complementary value of performing both single and rare variant analyses when analyzing quantitative traits. We also confirm several previously reported signals from semi-quantitative proteomics platforms. Many of these signals demonstrate the extensive allelic heterogeneity and ancestry-differentiated variant-trait associations common for inflammation biomarkers, a characteristic we hypothesize will be increasingly observed with well-powered, large-scale analyses of complex traits.


Subjects
Biomarkers; Genome-Wide Association Study; Inflammation; Precision Medicine; Whole Genome Sequencing; Humans; Precision Medicine/methods; Inflammation/genetics; Genome-Wide Association Study/methods; Whole Genome Sequencing/methods; Polymorphism, Single Nucleotide; Quantitative Trait Loci; Genetic Predisposition to Disease; Female; Interleukin-6/genetics
5.
Blood ; 2024 Sep 03.
Article in English | MEDLINE | ID: mdl-39226462

ABSTRACT

Genetic studies have identified numerous regions associated with plasma fibrinogen levels in Europeans, yet missing heritability and limited inclusion of non-Europeans necessitate further studies with improved power and sensitivity. Compared with array-based genotyping, whole genome sequencing (WGS) data provides better coverage of the genome and better representation of non-European variants. To better understand the genetic landscape regulating plasma fibrinogen levels, we meta-analyzed WGS data from the NHLBI's Trans-Omics for Precision Medicine (TOPMed) program (n=32,572), with array-based genotype data from the Cohorts for Heart and Aging Research in Genomic Epidemiology (CHARGE) Consortium (n=131,340) imputed to the TOPMed or Haplotype Reference Consortium panel. We identified 18 loci that have not been identified in prior genetic studies of fibrinogen. Of these, four are driven by common variants of small effect with reported MAF at least 10 percentage points higher in African populations. Three signals (SERPINA1, ZFP36L2, and TLR10) contain predicted deleterious missense variants. Two loci, SOCS3 and HPN, each harbor two conditionally distinct, non-coding variants. The gene region encoding the fibrinogen protein chain subunits (FGG;FGB;FGA) contains 7 distinct signals, including one novel signal driven by rs28577061, a variant common in African ancestry populations but extremely rare in Europeans (MAF_AFR = 0.180; MAF_EUR = 0.008). Through phenome-wide association studies in the VA Million Veteran Program, we found associations between fibrinogen polygenic risk scores and thrombotic and inflammatory disease phenotypes, including an association with gout. Our findings demonstrate the utility of WGS to augment genetic discovery in diverse populations and offer new insights for putative mechanisms of fibrinogen regulation.

6.
Nature ; 581(7809): 444-451, 2020 05.
Article in English | MEDLINE | ID: mdl-32461652

ABSTRACT

Structural variants (SVs) rearrange large segments of DNA [1] and can have profound consequences in evolution and human disease [2,3]. As national biobanks, disease-association studies, and clinical genetic testing have grown increasingly reliant on genome sequencing, population references such as the Genome Aggregation Database (gnomAD) [4] have become integral in the interpretation of single-nucleotide variants (SNVs) [5]. However, there are no reference maps of SVs from high-coverage genome sequencing comparable to those for SNVs. Here we present a reference of sequence-resolved SVs constructed from 14,891 genomes across diverse global populations (54% non-European) in gnomAD. We discovered a rich and complex landscape of 433,371 SVs, from which we estimate that SVs are responsible for 25-29% of all rare protein-truncating events per genome. We found strong correlations between natural selection against damaging SNVs and rare SVs that disrupt or duplicate protein-coding sequence, which suggests that genes that are highly intolerant to loss-of-function are also sensitive to increased dosage [6]. We also uncovered modest selection against noncoding SVs in cis-regulatory elements, although selection against protein-truncating SVs was stronger than all noncoding effects. Finally, we identified very large (over one megabase), rare SVs in 3.9% of samples, and estimate that 0.13% of individuals may carry an SV that meets the existing criteria for clinically important incidental findings [7]. This SV resource is freely distributed via the gnomAD browser [8] and will have broad utility in population genetics, disease-association studies, and diagnostic screening.


Subjects
Disease/genetics; Genetic Variation; Genetics, Medical/standards; Genetics, Population/standards; Genome, Human/genetics; Female; Genetic Testing; Genotyping Techniques; Humans; Male; Middle Aged; Mutation; Polymorphism, Single Nucleotide/genetics; Racial Groups/genetics; Reference Standards; Selection, Genetic; Whole Genome Sequencing
7.
PLoS Genet ; 19(5): e1010517, 2023 05.
Article in English | MEDLINE | ID: mdl-37216410

ABSTRACT

Integrative approaches that simultaneously model multi-omics data have gained increasing popularity because they provide holistic system biology views of multiple or all components in a biological system of interest. Canonical correlation analysis (CCA) is a correlation-based integrative method designed to extract latent features shared between multiple assays by finding the linear combinations of features, referred to as canonical variables (CVs), within each assay that achieve maximal across-assay correlation. Although widely acknowledged as a powerful approach for multi-omics data, CCA has not been systematically applied to multi-omics data in large cohort studies, which have only recently become available. Here, we adapted sparse multiple CCA (SMCCA), a widely used derivative of CCA, to proteomics and methylomics data from the Multi-Ethnic Study of Atherosclerosis (MESA) and Jackson Heart Study (JHS). To tackle challenges encountered when applying SMCCA to MESA and JHS, our adaptations include the incorporation of the Gram-Schmidt (GS) algorithm with SMCCA to improve orthogonality among CVs, and the development of Sparse Supervised Multiple CCA (SSMCCA) to allow supervised integration analysis for more than two assays. Effective application of SMCCA to the two real datasets reveals important findings. Applying our SMCCA-GS to MESA and JHS, we identified strong associations between blood cell counts and protein abundance, suggesting that adjustment of blood cell composition should be considered in protein-based association studies. Importantly, CVs obtained from two independent cohorts also demonstrate transferability across the cohorts. For example, proteomic CVs learned from JHS, when transferred to MESA, explain similar amounts of blood cell count phenotypic variance in MESA, explaining 39.0%-50.0% variation in JHS and 38.9%-49.1% in MESA. Similar transferability was observed for other omics-CV-trait pairs. This suggests that biologically meaningful and cohort-agnostic variation is captured by CVs. We anticipate that applying our SMCCA-GS and SSMCCA on various cohorts would help identify cohort-agnostic biologically meaningful relationships between multi-omics data and phenotypic traits.


Subjects
Canonical Correlation Analysis; Proteomics; Humans; Proteomics/methods; Multiomics; Cohort Studies
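
The abstract above mentions using the Gram-Schmidt algorithm to improve orthogonality among canonical variables, without describing how it is wired into SMCCA. The sketch below shows only the generic orthogonalization step on plain vectors (the modified Gram-Schmidt variant, chosen here for numerical stability); it is not the authors' SMCCA-GS implementation, and the function names are illustrative.

```python
def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def gram_schmidt(vectors, tol=1e-12):
    """Return an orthonormal basis spanning the input vectors.

    Uses modified Gram-Schmidt: each vector has its projections onto the
    already accepted basis vectors removed one at a time. Vectors that are
    (numerically) linear combinations of earlier ones are dropped.
    """
    basis = []
    for v in vectors:
        w = list(v)
        for q in basis:
            coeff = dot(w, q)
            w = [wi - coeff * qi for wi, qi in zip(w, q)]
        norm = dot(w, w) ** 0.5
        if norm > tol:
            basis.append([wi / norm for wi in w])
    return basis


if __name__ == "__main__":
    for q in gram_schmidt([[1.0, 1.0, 0.0], [1.0, 0.0, 1.0], [0.0, 1.0, 1.0]]):
        print(q)
```

In a CCA setting the vectors would be the per-sample scores of successive canonical variables, so that each new CV is uncorrelated with those already extracted; that mapping is an assumption of this note rather than a detail given in the abstract.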
8.
Hum Mol Genet ; 32(6): 1048-1060, 2023 03 06.
Article in English | MEDLINE | ID: mdl-36444934

ABSTRACT

Diabetic kidney disease (DKD) is recognized as an important public health challenge. However, its genomic mechanisms are poorly understood. To identify rare variants for DKD, we conducted a whole-exome sequencing (WES) study leveraging large cohorts well-phenotyped for chronic kidney disease and diabetes. Our two-stage WES study included 4,372 European and African ancestry participants from the Chronic Renal Insufficiency Cohort and Atherosclerosis Risk in Communities studies (stage 1) and 11,487 multi-ancestry Trans-Omics for Precision Medicine participants (stage 2). Generalized linear mixed models, which accounted for genetic relatedness and adjusted for age, sex and ancestry, were used to test associations between single variants and DKD. Gene-based aggregate rare variant analyses were conducted using an optimized sequence kernel association test implemented within our mixed model framework. We identified four novel exome-wide significant DKD-related loci through initiating diabetes. In single-variant analyses, participants carrying a rare, in-frame insertion in the DIS3L2 gene (rs141560952) exhibited a 193-fold increased odds [95% confidence interval (CI): 33.6, 1105] of DKD compared with noncarriers (P = 3.59 × 10⁻⁹). Likewise, each copy of a low-frequency KRT6B splice-site variant (rs425827) conferred a 5.31-fold higher odds (95% CI: 3.06, 9.21) of DKD (P = 2.72 × 10⁻⁹). Aggregate gene-based analyses further identified ERAP2 (P = 4.03 × 10⁻⁸) and NPEPPS (P = 1.51 × 10⁻⁷), which are both expressed in the kidney and implicated in renin-angiotensin-aldosterone system modulated immune response. In the largest WES study of DKD, we identified novel rare variant loci attaining exome-wide significance. These findings provide new insights into the molecular mechanisms underlying DKD.


Subjects
Diabetes Mellitus; Diabetic Nephropathies; Renal Insufficiency, Chronic; Humans; Aminopeptidases; Diabetic Nephropathies/genetics; Exome Sequencing; Kidney; Renal Insufficiency, Chronic/genetics
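
The abstract above reports odds ratios with 95% confidence intervals and P values. As a rough consistency check, and assuming a Wald-type interval that is symmetric on the log-odds scale (an assumption; the abstract says only that generalized linear mixed models were used), the standard error and a two-sided P value can be backed out from the reported numbers. The function name is illustrative.

```python
import math


def wald_p_from_ci(odds_ratio, ci_low, ci_high, z_crit=1.959964):
    """Recover (standard error, z statistic, two-sided p) from an OR and its 95% CI.

    Assumes the interval is exp(log(OR) +/- z_crit * SE), i.e. a Wald interval.
    """
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    z = math.log(odds_ratio) / se
    # Two-sided tail probability of a standard normal: P(|Z| > |z|).
    p = math.erfc(abs(z) / math.sqrt(2))
    return se, z, p


if __name__ == "__main__":
    # Values taken from the abstract: DIS3L2 insertion and KRT6B splice-site variant.
    for label, args in [("DIS3L2", (193, 33.6, 1105)), ("KRT6B", (5.31, 3.06, 9.21))]:
        se, z, p = wald_p_from_ci(*args)
        print(f"{label}: SE={se:.3f} z={z:.2f} p={p:.2e}")
```

Under this assumption both recovered P values come out on the order of 10⁻⁹, in line with the reported ones; small differences are expected from rounding of the published interval and from whatever test statistic the authors actually used.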
9.
Am J Hum Genet ; 109(6): 1175-1181, 2022 06 02.
Article in English | MEDLINE | ID: mdl-35504290

ABSTRACT

Current publicly available tools that allow rapid exploration of linkage disequilibrium (LD) between markers (e.g., HaploReg and LDlink) are based on whole-genome sequence (WGS) data from 2,504 individuals in the 1000 Genomes Project. Here, we present TOP-LD, an online tool to explore LD inferred with high-coverage (∼30×) WGS data from 15,578 individuals in the NHLBI Trans-Omics for Precision Medicine (TOPMed) program. TOP-LD provides a significant upgrade compared to current LD tools, as the TOPMed WGS data provide a more comprehensive representation of genetic variation than the 1000 Genomes data, particularly for rare variants and in the specific populations that we analyzed. For example, TOP-LD encompasses LD information for 150.3, 62.2, and 36.7 million variants for European, African, and East Asian ancestral samples, respectively, offering 2.6- to 9.1-fold increase in variant coverage compared to HaploReg 4.0 or LDlink. In addition, TOP-LD includes tens of thousands of structural variants (SVs). We demonstrate the value of TOP-LD in fine-mapping at the GGT1 locus associated with gamma glutamyltransferase in the African ancestry participants in UK Biobank. Beyond fine-mapping, TOP-LD can facilitate a wide range of applications that are based on summary statistics and estimates of LD. TOP-LD is freely available online.


Subjects
Genome-Wide Association Study; Precision Medicine; Asian People; Humans; Linkage Disequilibrium/genetics; Polymorphism, Single Nucleotide/genetics; Whole Genome Sequencing
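
The abstract above concerns pairwise linkage disequilibrium between variants. As a minimal sketch of the standard r² statistic for two biallelic variants, assuming phased haplotypes coded 0/1 (the abstract does not describe how TOP-LD computes LD, so this is the textbook definition rather than that tool's method), one could write:

```python
def ld_r2(hap_a, hap_b):
    """r^2 between two biallelic variants observed on the same phased haplotypes.

    hap_a[i] and hap_b[i] are the 0/1 alleles carried by haplotype i.
    """
    n = len(hap_a)
    if n == 0 or n != len(hap_b):
        raise ValueError("need two non-empty sequences of equal length")
    p_a = sum(hap_a) / n                      # frequency of allele 1 at variant A
    p_b = sum(hap_b) / n                      # frequency of allele 1 at variant B
    p_ab = sum(1 for a, b in zip(hap_a, hap_b) if a == 1 and b == 1) / n
    d = p_ab - p_a * p_b                      # disequilibrium coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        # At least one variant is monomorphic in this sample; r^2 is undefined.
        return 0.0
    return d * d / denom


if __name__ == "__main__":
    print(ld_r2([0, 1, 1, 0, 1, 0], [0, 1, 1, 0, 0, 0]))
```

Returning 0.0 for monomorphic variants is a convenience for this sketch; real tools typically omit such pairs.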
10.
Am J Hum Genet ; 109(7): 1286-1297, 2022 07 07.
Article in English | MEDLINE | ID: mdl-35716666

ABSTRACT

Despite the growing number of genome-wide association studies (GWASs), it remains unclear to what extent gene-by-gene and gene-by-environment interactions influence complex traits in humans. The magnitude of genetic interactions in complex traits has been difficult to quantify because GWASs are generally underpowered to detect individual interactions of small effect. Here, we develop a method to test for genetic interactions that aggregates information across all trait-associated loci. Specifically, we test whether SNPs in regions of European ancestry shared between European American and admixed African American individuals have the same causal effect sizes. We hypothesize that in African Americans, the presence of genetic interactions will drive the causal effect sizes of SNPs in regions of European ancestry to be more similar to those of SNPs in regions of African ancestry. We apply our method to two traits: gene expression in 296 African Americans and 482 European Americans in the Multi-Ethnic Study of Atherosclerosis (MESA) and low-density lipoprotein cholesterol (LDL-C) in 74K African Americans and 296K European Americans in the Million Veteran Program (MVP). We find significant evidence for genetic interactions in our analysis of gene expression; for LDL-C, we observe a similar point estimate, although this is not significant, most likely due to lower statistical power. These results suggest that gene-by-gene or gene-by-environment interactions modify the effect sizes of causal variants in human complex traits.


Subjects
Genome-Wide Association Study; Multifactorial Inheritance; Cholesterol, LDL; Gene Expression; Humans; Multifactorial Inheritance/genetics; Polymorphism, Single Nucleotide/genetics; White People/genetics
11.
Am J Hum Genet ; 109(5): 857-870, 2022 05 05.
Article in English | MEDLINE | ID: mdl-35385699

ABSTRACT

While polygenic risk scores (PRSs) enable early identification of genetic risk for chronic obstructive pulmonary disease (COPD), predictive performance is limited when the discovery and target populations are not well matched. Hypothesizing that the biological mechanisms of disease are shared across ancestry groups, we introduce a PrediXcan-derived polygenic transcriptome risk score (PTRS) to improve cross-ethnic portability of risk prediction. We constructed the PTRS using summary statistics from application of PrediXcan on large-scale GWASs of lung function (forced expiratory volume in 1 s [FEV1] and its ratio to forced vital capacity [FEV1/FVC]) in the UK Biobank. We examined prediction performance and cross-ethnic portability of PTRS through smoking-stratified analyses both on 29,381 multi-ethnic participants from TOPMed population/family-based cohorts and on 11,771 multi-ethnic participants from TOPMed COPD-enriched studies. Analyses were carried out for two dichotomous COPD traits (moderate-to-severe and severe COPD) and two quantitative lung function traits (FEV1 and FEV1/FVC). While the proposed PTRS showed weaker associations with disease than PRS for European ancestry, the PTRS showed stronger association with COPD than PRS for African Americans (e.g., odds ratio [OR] = 1.24 [95% confidence interval [CI]: 1.08-1.43] for PTRS versus 1.10 [0.96-1.26] for PRS among heavy smokers with ≥ 40 pack-years of smoking) for moderate-to-severe COPD. Cross-ethnic portability of the PTRS was significantly higher than the PRS (paired t test p < 2.2 × 10⁻¹⁶ with portability gains ranging from 5% to 28%) for both dichotomous COPD traits and across all smoking strata. Our study demonstrates the value of PTRS for improved cross-ethnic portability compared to PRS in predicting COPD risk.


Subjects
Pulmonary Disease, Chronic Obstructive; Transcriptome; Humans; Lung; National Heart, Lung, and Blood Institute (U.S.); Pulmonary Disease, Chronic Obstructive/genetics; Risk Factors; United States/epidemiology
12.
Hepatology ; 2024 May 22.
Article in English | MEDLINE | ID: mdl-38776184

ABSTRACT

BACKGROUND AND AIMS: The common genetic variant rs641738 C>T is a risk factor for metabolic dysfunction-associated steatotic liver disease and metabolic dysfunction-associated steatohepatitis (MASH), including liver fibrosis, and is associated with decreased expression of the phospholipid-remodeling enzyme MBOAT7 (LPIAT1). However, whether restoring MBOAT7 expression in established metabolic dysfunction-associated steatotic liver disease dampens the progression to liver fibrosis and, importantly, the mechanism through which decreased MBOAT7 expression exacerbates MASH fibrosis remain unclear. APPROACH AND RESULTS: We first showed that hepatocyte MBOAT7 restoration in mice with diet-induced steatohepatitis slows the progression to liver fibrosis. Conversely, when hepatocyte-MBOAT7 was silenced in mice with established hepatosteatosis, liver fibrosis but not hepatosteatosis was exacerbated. Mechanistic studies revealed that hepatocyte-MBOAT7 restoration in MASH mice lowered hepatocyte-TAZ (WWTR1), which is known to promote MASH fibrosis. Conversely, hepatocyte-MBOAT7 silencing enhanced TAZ upregulation in MASH. Finally, we discovered that changes in hepatocyte phospholipids due to MBOAT7 loss-of-function promote a cholesterol trafficking pathway that upregulates TAZ and the TAZ-induced profibrotic factor Indian hedgehog (IHH). As evidence for relevance in humans, we found that the livers of individuals with MASH carrying the rs641738-T allele had higher hepatocyte nuclear TAZ, indicating higher TAZ activity and increased IHH mRNA. CONCLUSIONS: This study provides evidence for a novel mechanism linking MBOAT7-LoF to MASH fibrosis, adds new insight into an established genetic locus for MASH, and, given the druggability of hepatocyte TAZ for MASH fibrosis, suggests a personalized medicine approach for subjects at increased risk for MASH fibrosis due to inheritance of variants that lower MBOAT7.

13.
PLoS Genet ; 18(9): e1010294, 2022 09.
Article in English | MEDLINE | ID: mdl-36048760

ABSTRACT

For Alzheimer's disease, a leading cause of dementia and global morbidity, improved identification of presymptomatic high-risk individuals and identification of new circulating biomarkers are key public health needs. Here, we tested the hypothesis that a polygenic predictor of risk for Alzheimer's disease would identify a subset of the population with increased risk of clinically diagnosed dementia, subclinical neurocognitive dysfunction, and a differing circulating proteomic profile. Using summary association statistics from a recent genome-wide association study, we first developed a polygenic predictor of Alzheimer's disease comprised of 7.1 million common DNA variants. We noted a 7.3-fold (95% CI 4.8 to 11.0; p < 0.001) gradient in risk across deciles of the score among 288,289 middle-aged participants of the UK Biobank study. In cross-sectional analyses stratified by age, minimal differences in risk of Alzheimer's disease and performance on a digit recall test were present according to polygenic score decile at age 50 years, but significant gradients emerged by age 65. Similarly, among 30,541 participants of the Mass General Brigham Biobank, we again noted no significant differences in Alzheimer's disease diagnosis at younger ages across deciles of the score, but for those over 65 years we noted an odds ratio of 2.0 (95% CI 1.3 to 3.2; p = 0.002) in the top versus bottom decile of the polygenic score. To understand the proteomic signature of inherited risk, we performed aptamer-based profiling in 636 blood donors (mean age 43 years) with very high or low polygenic scores. In addition to the well-known apolipoprotein E biomarker, this analysis identified 27 additional proteins, several of which have known roles related to disease pathogenesis. Differences in protein concentrations were consistent even among the youngest subset of blood donors (mean age 33 years). Of these 28 proteins, 7 of the 8 proteins with concentrations available were similarly associated with the polygenic score in participants of the Multi-Ethnic Study of Atherosclerosis. These data highlight the potential for a DNA-based score to identify high-risk individuals during the prolonged presymptomatic phase of Alzheimer's disease and to enable biomarker discovery based on profiling of young individuals in the extremes of the score distribution.


Subjects
Alzheimer Disease; Adult; Aged; Alzheimer Disease/pathology; Biomarkers; Cross-Sectional Studies; Genome-Wide Association Study; Humans; Middle Aged; Proteomics
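
The abstract above compares risk across deciles of a polygenic score. A minimal sketch of the two basic operations follows, assuming the score is a weighted sum of allele dosages and that deciles are assigned by rank within the sample. The weighting method, variant identifiers, and values below are hypothetical and are not taken from the study.

```python
def polygenic_score(dosages, weights):
    """Weighted sum of allele dosages; variants without a weight are ignored.

    dosages: dict mapping variant ID -> allele dosage (0..2) for one person.
    weights: dict mapping variant ID -> per-allele effect weight.
    """
    return sum(weights[v] * d for v, d in dosages.items() if v in weights)


def decile_labels(scores):
    """Assign each score a decile from 1 (lowest) to 10 (highest) by rank.

    Ties are broken by input order, which is a simplification.
    """
    n = len(scores)
    order = sorted(range(n), key=lambda i: scores[i])
    labels = [0] * n
    for rank, idx in enumerate(order):
        labels[idx] = rank * 10 // n + 1
    return labels


if __name__ == "__main__":
    weights = {"varA": 0.2, "varB": -0.1}  # hypothetical effect weights
    people = [
        {"varA": 2, "varB": 1},
        {"varA": 0, "varB": 2},
        {"varA": 1, "varB": 0, "varC": 2},  # varC has no weight and is skipped
    ]
    scores = [polygenic_score(p, weights) for p in people]
    print(scores, decile_labels(scores))
```

Comparing the top and bottom decile, as the abstract does, then amounts to contrasting outcomes for people labelled 10 with those labelled 1.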
14.
PLoS Genet ; 18(9): e1010356, 2022 09.
Article in English | MEDLINE | ID: mdl-36137075

ABSTRACT

Rare variants in ten genes have been reported to cause Mendelian sleep conditions characterised by extreme sleep duration or timing. These include familial natural short sleep (ADRB1, DEC2/BHLHE41, GRM1 and NPSR1), advanced sleep phase (PER2, PER3, CRY2, CSNK1D and TIMELESS) and delayed sleep phase (CRY1). The association of variants in these genes with extreme sleep conditions was usually based on clinically ascertained families, and their effects when identified in the population are unknown. We aimed to determine the effects of these variants on sleep traits in large population-based cohorts. We performed genetic association analysis of variants previously reported to be causal for Mendelian sleep and circadian conditions. Analyses were performed using 191,929 individuals with data on sleep and whole-exome or genome-sequence data from 4 population-based studies: UK Biobank, FINRISK, Health-2000-2001, and the Multi-Ethnic Study of Atherosclerosis (MESA). We identified sleep disorders from self-report, hospital and primary care data. We estimated sleep duration and timing measures from self-report and accelerometry data. We identified carriers for 10 out of 12 previously reported pathogenic variants for 8 of the 10 genes. They ranged in frequency from 1 individual with the variant in CSNK1D to 1,574 individuals with a reported variant in the PER3 gene in the UK Biobank. No carriers for variants reported in NPSR1 or PER2 were identified. We found no association between variants analyzed and extreme sleep or circadian phenotypes. Using sleep timing as a proxy measure for sleep phase, only PER3 and CRY1 variants demonstrated association with earlier and later sleep timing, respectively; however, the magnitude of effect was smaller than previously reported (sleep midpoint ~7 mins earlier and ~5 mins later, respectively). We also performed burden tests of protein truncating (PTVs) or rare missense variants for the 10 genes. Only PTVs in PER2 and PER3 were associated with a relevant trait (for example, 64 individuals with a PTV in PER2 had an odds ratio of 4.4 for being "definitely a morning person", P = 4 × 10⁻⁸; and had a 57-minute earlier midpoint sleep, P = 5 × 10⁻⁷). Our results indicate that previously reported variants for Mendelian sleep and circadian conditions are often not highly penetrant when ascertained incidentally from the general population.


Subjects
Circadian Rhythm; Sleep Wake Disorders; Circadian Rhythm/genetics; Humans; Phenotype; Receptors, G-Protein-Coupled/genetics; Sleep/genetics; Sleep Wake Disorders/genetics
15.
PLoS Genet ; 18(12): e1010557, 2022 12.
Artigo em Inglês | MEDLINE | ID: mdl-36574455

RESUMO

Genetic association studies of many heritable traits resulting from physiological testing often have modest sample sizes due to the cost and burden of the required phenotyping. This reduces statistical power and limits discovery of multiple genetic associations. We present a strategy to leverage pleiotropy between traits to both discover new loci and to provide mechanistic hypotheses of the underlying pathophysiology. Specifically, we combine a colocalization test with a locus-level test of pleiotropy. In simulations, we show that this approach is highly selective for identifying true pleiotropy driven by the same causative variant, thereby improving the chance of replicating the associations in underpowered validation cohorts and leading to higher interpretability. Here, as an exemplar, we use Obstructive Sleep Apnea (OSA), a common disorder diagnosed using overnight multi-channel physiological testing. We leverage pleiotropy with relevant cellular and cardio-metabolic phenotypes and gene expression traits to map new risk loci in an underpowered OSA GWAS. We identify several pleiotropic loci harboring suggestive associations to OSA and genome-wide significant associations to other traits, and show that their OSA association replicates in independent cohorts of diverse ancestries. By investigating pleiotropic loci, our strategy allows us to propose new hypotheses about OSA pathobiology across many physiological layers. For example, we identify and replicate the pleiotropy across plateletcrit, OSA and an eQTL of DNA primase subunit 1 (PRIM1) in immune cells. We find suggestive links between OSA, a measure of lung function (FEV1/FVC), and an eQTL of matrix metallopeptidase 15 (MMP15) in lung tissue. We also link a previously known genome-wide significant peak for OSA in the hexokinase 1 (HK1) locus to hematocrit and other red blood cell related traits. Thus, the analysis of pleiotropic associations has the potential to assemble diverse phenotypes into a chain of mechanistic hypotheses that provide insight into the pathogenesis of complex human diseases.


Subjects
Genome-Wide Association Study , Sleep Apnea, Obstructive , Humans , Genome-Wide Association Study/methods , Phenotype , Genetic Association Studies , Sleep , Genetic Pleiotropy , Polymorphism, Single Nucleotide , DNA Primase
16.
Diabetologia ; 2024 Sep 30.
Article in English | MEDLINE | ID: mdl-39349773

ABSTRACT

AIMS/HYPOTHESIS: Several studies have reported associations between specific proteins and type 2 diabetes risk in European populations. To better understand the role played by proteins in type 2 diabetes aetiology across diverse populations, we conducted a large proteome-wide association study using genetic instruments across four racial and ethnic groups: African; Asian; Hispanic/Latino; and European. METHODS: Genome and plasma proteome data from the Multi-Ethnic Study of Atherosclerosis (MESA) involving 182 African, 69 Asian, 284 Hispanic/Latino and 409 European individuals residing in the USA were used to establish protein prediction models by using potentially associated cis- and trans-SNPs. The models were applied to genome-wide association study summary statistics of 250,127 type 2 diabetes cases and 1,222,941 controls from different racial and ethnic populations. RESULTS: We identified three, 44 and one protein associated with type 2 diabetes risk in Asian, European and Hispanic/Latino populations, respectively. Meta-analysis identified 40 proteins associated with type 2 diabetes risk across the populations, including well-established as well as novel proteins not yet implicated in type 2 diabetes development. CONCLUSIONS/INTERPRETATION: Our study improves our understanding of the aetiology of type 2 diabetes in diverse populations. DATA AVAILABILITY: The summary statistics of multi-ethnic type 2 diabetes GWAS of MVP, DIAMANTE, Biobank Japan and other studies are available from the database of Genotypes and Phenotypes (dbGaP) under accession number phs001672.v3.p1. MESA genetic, proteome and covariate data can be accessed through dbGaP under phs000209.v13.p3. All code is available on GitHub (https://github.com/Arthur1021/MESA-1K-PWAS).

17.
PLoS Med ; 21(9): e1004464, 2024 Sep.
Article in English | MEDLINE | ID: mdl-39316596

ABSTRACT

BACKGROUND: Biological age may be estimated by proteomic aging clocks (PACs). Previously published PACs were constructed either in smaller studies or mainly in white individuals, and they used proteomic measures from only one time point. In this study, we created de novo PACs and compared their performance to published PACs at 2 different time points in the Atherosclerosis Risk in Communities (ARIC) study of white and black participants (around 75% white and 25% black). METHODS AND FINDINGS: A total of 4,712 plasma proteins were measured using SomaScan in blood samples collected in 1990 to 1992 from 11,761 midlife participants (aged 46 to 70 years) and in 2011 to 2013 from 5,183 late-life participants (aged 66 to 90 years). The de novo ARIC PACs were constructed by training them against chronological age using elastic net regression in two-thirds of healthy participants in midlife and late life and validated in the remaining one-third of healthy participants at the corresponding time point. We also computed 3 published PACs. We estimated age acceleration for each PAC as residuals after regressing each PAC on chronological age. We also calculated the change in age acceleration from midlife to late life. We examined the associations of age acceleration and change in age acceleration with mortality through 2019 from all-cause, cardiovascular disease (CVD), cancer, and lower respiratory disease (LRD) using Cox proportional hazards regression in participants (irrespective of health) after excluding the training set. The model was adjusted for chronological age, smoking, body mass index (BMI), and other confounders. We externally validated the midlife PAC using the Multi-Ethnic Study of Atherosclerosis (MESA) Exam 1 data. The ARIC PACs had a slightly stronger correlation with chronological age than published PACs in healthy participants at each time point. Associations with mortality were similar for the ARIC PACs and published PACs. For late-life and midlife age acceleration for the ARIC PACs, respectively, hazard ratios (HRs) per 1 standard deviation were 1.65 and 1.38 (both p < 0.001) for all-cause mortality, 1.37 and 1.20 (both p < 0.001) for CVD mortality, 1.21 (p = 0.028) and 1.04 (p = 0.280) for cancer mortality, and 1.68 and 1.36 (both p < 0.001) for LRD mortality. For the change in age acceleration, HRs for all-cause, CVD, and LRD mortality were comparable to the HRs for late-life age acceleration. The association between the change in age acceleration and cancer mortality was not significant. The external validation of the midlife PAC in MESA showed significant associations with mortality, as observed for midlife participants in ARIC. The main limitation is that our PACs were constructed in midlife and late-life participants. It is unknown whether these PACs could be applied to young individuals. CONCLUSIONS: In this longitudinal study, we found that the ARIC PACs and published PACs were similarly associated with an increased risk of mortality. These findings suggest that PACs show promise as biomarkers of biological age. PACs may serve as tools to predict mortality and evaluate the effect of anti-aging lifestyle and therapeutic interventions.
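
The abstract defines age acceleration as the residual left after regressing each PAC on chronological age. The minimal Python sketch below illustrates that one step. It assumes a simple ordinary least-squares fit with a single predictor; the function name `age_acceleration` and the numbers are hypothetical and are not taken from the ARIC data or the authors' code.

```python
from statistics import mean


def age_acceleration(pac_ages, chrono_ages):
    """Return residuals of an OLS fit of PAC age on chronological age.

    A positive residual means the proteomic clock reads older than
    expected for that person's chronological age.
    """
    x_bar, y_bar = mean(chrono_ages), mean(pac_ages)
    sxx = sum((x - x_bar) ** 2 for x in chrono_ages)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(chrono_ages, pac_ages))
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return [y - (intercept + slope * x) for x, y in zip(chrono_ages, pac_ages)]


# Made-up values for illustration only (not study data).
chrono = [50.0, 55.0, 60.0, 65.0, 70.0]
pac = [52.0, 54.0, 63.0, 64.0, 72.0]
accel = age_acceleration(pac, chrono)
```

Because the residuals come from a fit that includes an intercept, they sum to zero across the sample, so age acceleration is by construction uncorrelated with chronological age in the data used for the fit.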


Subjects
Aging , Proteomics , Humans , Middle Aged , Aged , Proteomics/methods , Female , Male , Aged, 80 and over , Cohort Studies , Cardiovascular Diseases/mortality , Atherosclerosis/blood , Atherosclerosis/epidemiology , Risk Factors
18.
Hum Mol Genet ; 31(22): 3873-3885, 2022 11 10.
Article in English | MEDLINE | ID: mdl-35766891

ABSTRACT

RATIONALE: Genetic variation contributes substantially to chronic obstructive pulmonary disease (COPD) and lung function measurements. Heritability estimates using genome-wide genotyping data can be biased if analyses do not appropriately account for the nonuniform distribution of genetic effects across the allele frequency and linkage disequilibrium (LD) spectrum. In addition, the contribution of rare variants has been unclear. OBJECTIVES: We sought to assess the heritability of COPD and lung function using whole-genome sequence data from the Trans-Omics for Precision Medicine program. METHODS: Using the genome-based restricted maximum likelihood method, we partitioned the genome into bins based on minor allele frequency and LD scores and estimated heritability of COPD, FEV1% predicted and FEV1/FVC ratio in 11 051 European ancestry and 5853 African-American participants. MEASUREMENTS AND MAIN RESULTS: In European ancestry participants, the estimated heritability of COPD, FEV1% predicted and FEV1/FVC ratio were 35.5%, 55.6% and 32.5%, of which 18.8%, 19.7% and 17.8% were from common variants, and 16.6%, 35.8% and 14.6% were from rare variants. These estimates had wide confidence intervals, with common variants and some sets of rare variants showing a statistically significant contribution (P-value < 0.05). In African-Americans, common variant heritability was similar to that in European ancestry participants, but the smaller sample size precluded calculation of rare variant heritability. CONCLUSIONS: Our study provides updated and unbiased estimates of heritability for COPD and lung function, and suggests an important contribution of rare variants. Larger studies of more diverse ancestry will improve accuracy of these estimates.


Subjects
Genetic Predisposition to Disease , Pulmonary Disease, Chronic Obstructive , Humans , Polymorphism, Single Nucleotide/genetics , Pulmonary Disease, Chronic Obstructive/genetics , Genome-Wide Association Study , Phenotype
19.
Hum Mol Genet ; 31(20): 3566-3579, 2022 10 10.
Article in English | MEDLINE | ID: mdl-35234888

ABSTRACT

Progressive dilation of the infrarenal aortic diameter is a consequence of the ageing process and is considered the main determinant of abdominal aortic aneurysm (AAA). We aimed to investigate the genetic and clinical determinants of abdominal aortic diameter (AAD). We conducted a meta-analysis of genome-wide association studies in 10 cohorts (n = 13 542) imputed to the 1000 Genomes Project reference panel including 12 815 subjects in the discovery phase and 727 subjects [Partners Biobank cohort 1 (PBIO1)] as replication. Maximum anterior-posterior diameter of the infrarenal aorta was used as AAD. We also included exome array data (n = 14 480) from seven epidemiologic studies. Single-variant and gene-based association analyses were performed using the SeqMeta package. A Mendelian randomization analysis was applied to investigate the causal effect of a number of clinical risk factors on AAD. In the genome-wide association study (GWAS) on AAD, rs74448815 in the intronic region of LDLRAD4 reached genome-wide significance (beta = -0.02, SE = 0.004, P-value = 2.10 × 10⁻⁸). The association replicated in the PBIO1 cohort (P-value = 8.19 × 10⁻⁴). In exome-array single-variant analysis (P-value threshold = 9 × 10⁻⁷), the lowest P-value was found for rs239259 located in SLC22A20 (beta = 0.007, P-value = 1.2 × 10⁻⁵). In the gene-based analysis (P-value threshold = 1.85 × 10⁻⁶), PCSK5 showed an association with AAD (P-value = 8.03 × 10⁻⁷). Furthermore, in Mendelian randomization analyses, we found evidence for genetic association of pulse pressure (beta = -0.003, P-value = 0.02), triglycerides (beta = -0.16, P-value = 0.008) and height (beta = 0.03, P-value < 0.0001), known risk factors for AAA, consistent with a causal association with AAD. Our findings point to new biology as well as highlighting gene regions in mechanisms that have previously been implicated in the genetics of other vascular diseases.


Subjects
Genome-Wide Association Study , Mendelian Randomization Analysis , Exome/genetics , Humans , Polymorphism, Single Nucleotide/genetics , Triglycerides
20.
Am J Hum Genet ; 108(5): 874-893, 2021 05 06.
Article in English | MEDLINE | ID: mdl-33887194

ABSTRACT

Whole-genome sequencing (WGS), a powerful tool for detecting novel coding and non-coding disease-causing variants, has largely been applied to clinical diagnosis of inherited disorders. Here we leveraged WGS data in up to 62,653 ethnically diverse participants from the NHLBI Trans-Omics for Precision Medicine (TOPMed) program and assessed statistical association of variants with seven red blood cell (RBC) quantitative traits. We discovered 14 single variant-RBC trait associations at 12 genomic loci, which have not been reported previously. Several of the RBC trait-variant associations (RPN1, ELL2, MIDN, HBB, HBA1, PIEZO1, and G6PD) were replicated in independent GWAS datasets imputed to the TOPMed reference panel. Most of these discovered variants are rare/low frequency, and several are observed disproportionately among non-European ancestry (African, Hispanic/Latino, or East Asian) populations. We identified a 3 bp indel p.Lys2169del (g.88717175_88717177TCT[4]) (common only in the Ashkenazi Jewish population) of PIEZO1, a gene responsible for the Mendelian red cell disorder hereditary xerocytosis (MIM: 194380), associated with higher mean corpuscular hemoglobin concentration (MCHC). In stepwise conditional analysis and in gene-based rare variant aggregated association analysis, we identified several of the variants in HBB, HBA1, TMPRSS6, and G6PD that represent the carrier state for known coding, promoter, or splice site loss-of-function variants that cause inherited RBC disorders. Finally, we applied base and nuclease editing to demonstrate that the sentinel variant rs112097551 (nearest gene RPN1) acts through a cis-regulatory element that exerts long-range control of the gene RUVBL1, which is essential for hematopoiesis. Together, these results demonstrate the utility of WGS in ethnically diverse population-based samples and gene editing for expanding knowledge of the genetic architecture of quantitative hematologic traits and suggest a continuum between complex trait and Mendelian red cell disorders.


Subjects
Erythrocytes/metabolism , Erythrocytes/pathology , Genome-Wide Association Study , National Heart, Lung, and Blood Institute (U.S.)/organization & administration , Phenotype , Adult , Aged , Chromosomes, Human, Pair 16/genetics , Datasets as Topic , Female , Gene Editing , Genetic Variation/genetics , HEK293 Cells , Humans , Male , Middle Aged , Quality Control , Reproducibility of Results , United States