Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 348
Filtrar
Más filtros

Tipo del documento
Intervalo de año de publicación
1.
Cell ; 179(4): 984-1002.e36, 2019 10 31.
Artículo en Inglés | MEDLINE | ID: mdl-31675503

RESUMEN

Genomic studies in African populations provide unique opportunities to understand disease etiology, human diversity, and population history. In the largest study of its kind, comprising genome-wide data from 6,400 individuals and whole-genome sequences from 1,978 individuals from rural Uganda, we find evidence of geographically correlated fine-scale population substructure. Historically, the ancestry of modern Ugandans was best represented by a mixture of ancient East African pastoralists. We demonstrate the value of the largest sequence panel from Africa to date as an imputation resource. Examining 34 cardiometabolic traits, we show systematic differences in trait heritability between European and African populations, probably reflecting the differential impact of genes and environment. In a multi-trait pan-African GWAS of up to 14,126 individuals, we identify novel loci associated with anthropometric, hematological, lipid, and glycemic traits. We find that several functionally important signals are driven by Africa-specific variants, highlighting the value of studying diverse populations across the region.


Asunto(s)
Población Negra/genética , Predisposición Genética a la Enfermedad , Genoma Humano/genética , Genómica , Femenino , Frecuencia de los Genes/genética , Estudio de Asociación del Genoma Completo , Humanos , Masculino , Polimorfismo de Nucleótido Simple/genética , Uganda/epidemiología , Secuenciación Completa del Genoma
2.
Am J Hum Genet ; 111(5): 990-995, 2024 May 02.
Artículo en Inglés | MEDLINE | ID: mdl-38636510

RESUMEN

Since genotype imputation was introduced, researchers have been relying on the estimated imputation quality from imputation software to perform post-imputation quality control (QC). However, this quality estimate (denoted as Rsq) performs less well for lower-frequency variants. We recently published MagicalRsq, a machine-learning-based imputation quality calibration, which leverages additional typed markers from the same cohort and outperforms Rsq as a QC metric. In this work, we extended the original MagicalRsq to allow cross-cohort model training and named the new model MagicalRsq-X. We removed the cohort-specific estimated minor allele frequency and included linkage disequilibrium scores and recombination rates as additional features. Leveraging whole-genome sequencing data from TOPMed, specifically participants in the BioMe, JHS, WHI, and MESA studies, we performed comprehensive cross-cohort evaluations for predominantly European and African ancestral individuals based on their inferred global ancestry with the 1000 Genomes and Human Genome Diversity Project data as reference. Our results suggest MagicalRsq-X outperforms Rsq in almost every setting, with 7.3%-14.4% improvement in squared Pearson correlation with true R2, corresponding to 85-218 K variant gains. We further developed a metric to quantify the genetic distances of a target cohort relative to a reference cohort and showed that such metric largely explained the performance of MagicalRsq-X models. Finally, we found MagicalRsq-X saved up to 53 known genome-wide significant variants in one of the largest blood cell trait GWASs that would be missed using the original Rsq for QC. In conclusion, MagicalRsq-X shows superiority for post-imputation QC and benefits genetic studies by distinguishing well and poorly imputed lower-frequency variants.


Asunto(s)
Frecuencia de los Genes , Genotipo , Polimorfismo de Nucleótido Simple , Programas Informáticos , Humanos , Estudios de Cohortes , Desequilibrio de Ligamiento , Estudio de Asociación del Genoma Completo/métodos , Genoma Humano , Control de Calidad , Aprendizaje Automático , Secuenciación Completa del Genoma/normas , Secuenciación Completa del Genoma/métodos
3.
Nature ; 591(7849): 211-219, 2021 03.
Artículo en Inglés | MEDLINE | ID: mdl-33692554

RESUMEN

Polygenic risk scores (PRSs), which often aggregate results from genome-wide association studies, can bridge the gap between initial discovery efforts and clinical applications for the estimation of disease risk using genetics. However, there is notable heterogeneity in the application and reporting of these risk scores, which hinders the translation of PRSs into clinical care. Here, in a collaboration between the Clinical Genome Resource (ClinGen) Complex Disease Working Group and the Polygenic Score (PGS) Catalog, we present the Polygenic Risk Score Reporting Standards (PRS-RS), in which we update the Genetic Risk Prediction Studies (GRIPS) Statement to reflect the present state of the field. Drawing on the input of experts in epidemiology, statistics, disease-specific applications, implementation and policy, this comprehensive reporting framework defines the minimal information that is needed to interpret and evaluate PRSs, especially with respect to downstream clinical applications. Items span detailed descriptions of study populations, statistical methods for the development and validation of PRSs and considerations for the potential limitations of these scores. In addition, we emphasize the need for data availability and transparency, and we encourage researchers to deposit and share PRSs through the PGS Catalog to facilitate reproducibility and comparative benchmarking. By providing these criteria in a structured format that builds on existing standards and ontologies, the use of this framework in publishing PRSs will facilitate translation into clinical care and progress towards defining best practice.


Asunto(s)
Predisposición Genética a la Enfermedad , Genética Médica/normas , Herencia Multifactorial/genética , Humanos , Reproducibilidad de los Resultados , Medición de Riesgo/normas
4.
Hum Mol Genet ; 2024 Jun 16.
Artículo en Inglés | MEDLINE | ID: mdl-38879759

RESUMEN

Venous thromboembolism (VTE) is a significant contributor to morbidity and mortality, with large disparities in incidence rates between Black and White Americans. Polygenic risk scores (PRSs) limited to variants discovered in genome-wide association studies in European-ancestry samples can identify European-ancestry individuals at high risk of VTE. However, there is limited evidence on whether high-dimensional PRS constructed using more sophisticated methods and more diverse training data can enhance the predictive ability and their utility across diverse populations. We developed PRSs for VTE using summary statistics from the International Network against Venous Thrombosis (INVENT) consortium genome-wide association studies meta-analyses of European- (71 771 cases and 1 059 740 controls) and African-ancestry samples (7482 cases and 129 975 controls). We used LDpred2 and PRS-CSx to construct ancestry-specific and multi-ancestry PRSs and evaluated their performance in an independent European- (6781 cases and 103 016 controls) and African-ancestry sample (1385 cases and 12 569 controls). Multi-ancestry PRSs with weights tuned in European-ancestry samples slightly outperformed ancestry-specific PRSs in European-ancestry test samples (e.g. the area under the receiver operating curve [AUC] was 0.609 for PRS-CSx_combinedEUR and 0.608 for PRS-CSxEUR [P = 0.00029]). Multi-ancestry PRSs with weights tuned in African-ancestry samples also outperformed ancestry-specific PRSs in African-ancestry test samples (PRS-CSxAFR: AUC = 0.58, PRS-CSx_combined AFR: AUC = 0.59), although this difference was not statistically significant (P = 0.34). The highest fifth percentile of the best-performing PRS was associated with 1.9-fold and 1.68-fold increased risk for VTE among European- and African-ancestry subjects, respectively, relative to those in the middle stratum. These findings suggest that the multi-ancestry PRS might be used to improve performance across diverse populations to identify individuals at highest risk for VTE.

5.
Hum Mol Genet ; 2024 May 15.
Artículo en Inglés | MEDLINE | ID: mdl-38747556

RESUMEN

Inflammation biomarkers can provide valuable insight into the role of inflammatory processes in many diseases and conditions. Sequencing based analyses of such biomarkers can also serve as an exemplar of the genetic architecture of quantitative traits. To evaluate the biological insight, which can be provided by a multi-ancestry, whole-genome based association study, we performed a comprehensive analysis of 21 inflammation biomarkers from up to 38 465 individuals with whole-genome sequencing from the Trans-Omics for Precision Medicine (TOPMed) program (with varying sample size by trait, where the minimum sample size was n = 737 for MMP-1). We identified 22 distinct single-variant associations across 6 traits-E-selectin, intercellular adhesion molecule 1, interleukin-6, lipoprotein-associated phospholipase A2 activity and mass, and P-selectin-that remained significant after conditioning on previously identified associations for these inflammatory biomarkers. We further expanded upon known biomarker associations by pairing the single-variant analysis with a rare variant set-based analysis that further identified 19 significant rare variant set-based associations with 5 traits. These signals were distinct from both significant single variant association signals within TOPMed and genetic signals observed in prior studies, demonstrating the complementary value of performing both single and rare variant analyses when analyzing quantitative traits. We also confirm several previously reported signals from semi-quantitative proteomics platforms. Many of these signals demonstrate the extensive allelic heterogeneity and ancestry-differentiated variant-trait associations common for inflammation biomarkers, a characteristic we hypothesize will be increasingly observed with well-powered, large-scale analyses of complex traits.

6.
Am J Hum Genet ; 110(11): 1853-1862, 2023 11 02.
Artículo en Inglés | MEDLINE | ID: mdl-37875120

RESUMEN

The heritability explained by local ancestry markers in an admixed population (hγ2) provides crucial insight into the genetic architecture of a complex disease or trait. Estimation of hγ2 can be susceptible to biases due to population structure in ancestral populations. Here, we present heritability estimation from admixture mapping summary statistics (HAMSTA), an approach that uses summary statistics from admixture mapping to infer heritability explained by local ancestry while adjusting for biases due to ancestral stratification. Through extensive simulations, we demonstrate that HAMSTA hγ2 estimates are approximately unbiased and are robust to ancestral stratification compared to existing approaches. In the presence of ancestral stratification, we show a HAMSTA-derived sampling scheme provides a calibrated family-wise error rate (FWER) of ∼5% for admixture mapping, unlike existing FWER estimation approaches. We apply HAMSTA to 20 quantitative phenotypes of up to 15,988 self-reported African American individuals in the Population Architecture using Genomics and Epidemiology (PAGE) study. We observe hˆγ2 in the 20 phenotypes range from 0.0025 to 0.033 (mean hˆγ2 = 0.012 ± 9.2 × 10-4), which translates to hˆ2 ranging from 0.062 to 0.85 (mean hˆ2 = 0.30 ± 0.023). Across these phenotypes we find little evidence of inflation due to ancestral population stratification in current admixture mapping studies (mean inflation factor of 0.99 ± 0.001). Overall, HAMSTA provides a fast and powerful approach to estimate genome-wide heritability and evaluate biases in test statistics of admixture mapping studies.


Asunto(s)
Negro o Afroamericano , Genética de Población , Humanos , Mapeo Cromosómico , Fenotipo , Polimorfismo de Nucleótido Simple/genética
7.
Am J Hum Genet ; 110(10): 1704-1717, 2023 10 05.
Artículo en Inglés | MEDLINE | ID: mdl-37802043

RESUMEN

Long non-coding RNAs (lncRNAs) are known to perform important regulatory functions in lipid metabolism. Large-scale whole-genome sequencing (WGS) studies and new statistical methods for variant set tests now provide an opportunity to assess more associations between rare variants in lncRNA genes and complex traits across the genome. In this study, we used high-coverage WGS from 66,329 participants of diverse ancestries with measurement of blood lipids and lipoproteins (LDL-C, HDL-C, TC, and TG) in the National Heart, Lung, and Blood Institute (NHLBI) Trans-Omics for Precision Medicine (TOPMed) program to investigate the role of lncRNAs in lipid variability. We aggregated rare variants for 165,375 lncRNA genes based on their genomic locations and conducted rare-variant aggregate association tests using the STAAR (variant-set test for association using annotation information) framework. We performed STAAR conditional analysis adjusting for common variants in known lipid GWAS loci and rare-coding variants in nearby protein-coding genes. Our analyses revealed 83 rare lncRNA variant sets significantly associated with blood lipid levels, all of which were located in known lipid GWAS loci (in a ±500-kb window of a Global Lipids Genetics Consortium index variant). Notably, 61 out of 83 signals (73%) were conditionally independent of common regulatory variation and rare protein-coding variation at the same loci. We replicated 34 out of 61 (56%) conditionally independent associations using the independent UK Biobank WGS data. Our results expand the genetic architecture of blood lipids to rare variants in lncRNAs.


Asunto(s)
ARN Largo no Codificante , Humanos , ARN Largo no Codificante/genética , Estudio de Asociación del Genoma Completo , Medicina de Precisión , Secuenciación Completa del Genoma/métodos , Lípidos/genética , Polimorfismo de Nucleótido Simple/genética
8.
Genet Epidemiol ; 2024 Jun 28.
Artículo en Inglés | MEDLINE | ID: mdl-38940271

RESUMEN

In most Proteome-Wide Association Studies (PWAS), variants near the protein-coding gene (±1 Mb), also known as cis single nucleotide polymorphisms (SNPs), are used to predict protein levels, which are then tested for association with phenotypes. However, proteins can be regulated through variants outside of the cis region. An intermediate GWAS step to identify protein quantitative trait loci (pQTL) allows for the inclusion of trans SNPs outside the cis region in protein-level prediction models. Here, we assess the prediction of 540 proteins in 1002 individuals from the Women's Health Initiative (WHI), split equally into a GWAS set, an elastic net training set, and a testing set. We compared the testing r2 between measured and predicted protein levels using this proposed approach, to the testing r2 using only cis SNPs. The two methods usually resulted in similar testing r2, but some proteins showed a significant increase in testing r2 with our method. For example, for cartilage acidic protein 1, the testing r2 increased from 0.101 to 0.351. We also demonstrate reproducible findings for predicted protein association with lipid and blood cell traits in WHI participants without proteomics data and in UK Biobank utilizing our PWAS weights.

9.
Hum Mol Genet ; 32(6): 1048-1060, 2023 03 06.
Artículo en Inglés | MEDLINE | ID: mdl-36444934

RESUMEN

Diabetic kidney disease (DKD) is recognized as an important public health challenge. However, its genomic mechanisms are poorly understood. To identify rare variants for DKD, we conducted a whole-exome sequencing (WES) study leveraging large cohorts well-phenotyped for chronic kidney disease and diabetes. Our two-stage WES study included 4372 European and African ancestry participants from the Chronic Renal Insufficiency Cohort and Atherosclerosis Risk in Communities studies (stage 1) and 11 487 multi-ancestry Trans-Omics for Precision Medicine participants (stage 2). Generalized linear mixed models, which accounted for genetic relatedness and adjusted for age, sex and ancestry, were used to test associations between single variants and DKD. Gene-based aggregate rare variant analyses were conducted using an optimized sequence kernel association test implemented within our mixed model framework. We identified four novel exome-wide significant DKD-related loci through initiating diabetes. In single-variant analyses, participants carrying a rare, in-frame insertion in the DIS3L2 gene (rs141560952) exhibited a 193-fold increased odds [95% confidence interval (CI): 33.6, 1105] of DKD compared with noncarriers (P = 3.59 × 10-9). Likewise, each copy of a low-frequency KRT6B splice-site variant (rs425827) conferred a 5.31-fold higher odds (95% CI: 3.06, 9.21) of DKD (P = 2.72 × 10-9). Aggregate gene-based analyses further identified ERAP2 (P = 4.03 × 10-8) and NPEPPS (P = 1.51 × 10-7), which are both expressed in the kidney and implicated in renin-angiotensin-aldosterone system modulated immune response. In the largest WES study of DKD, we identified novel rare variant loci attaining exome-wide significance. These findings provide new insights into the molecular mechanisms underlying DKD.


Asunto(s)
Diabetes Mellitus , Nefropatías Diabéticas , Insuficiencia Renal Crónica , Humanos , Aminopeptidasas , Nefropatías Diabéticas/genética , Secuenciación del Exoma , Riñón , Insuficiencia Renal Crónica/genética
10.
Am J Hum Genet ; 109(6): 1055-1064, 2022 06 02.
Artículo en Inglés | MEDLINE | ID: mdl-35588732

RESUMEN

Polygenic risk scores (PRSs) quantify the contribution of multiple genetic loci to an individual's likelihood of a complex trait or disease. However, existing PRSs estimate this likelihood with common genetic variants, excluding the impact of rare variants. Here, we report on a method to identify rare variants associated with outlier gene expression and integrate their impact into PRS predictions for body mass index (BMI), obesity, and bariatric surgery. Between the top and bottom 10%, we observed a 20.8% increase in risk for obesity (p = 3 × 10-14), 62.3% increase in risk for severe obesity (p = 1 × 10-6), and median 5.29 years earlier onset for bariatric surgery (p = 0.008), as a function of expression outlier-associated rare variant burden when controlling for common variant PRS. We show that these predictions were more significant than integrating the effects of rare protein-truncating variants (PTVs), observing a mean 19% increase in phenotypic variance explained with expression outlier-associated rare variants when compared with PTVs (p = 2 × 10-15). We replicated these findings by using data from the Million Veteran Program and demonstrated that PRSs across multiple traits and diseases can benefit from the inclusion of expression outlier-associated rare variants identified through population-scale transcriptome sequencing.


Asunto(s)
Herencia Multifactorial , Obesidad , Índice de Masa Corporal , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo , Humanos , Herencia Multifactorial/genética , Obesidad/genética , Fenotipo , Factores de Riesgo
11.
Am J Hum Genet ; 109(6): 1175-1181, 2022 06 02.
Artículo en Inglés | MEDLINE | ID: mdl-35504290

RESUMEN

Current publicly available tools that allow rapid exploration of linkage disequilibrium (LD) between markers (e.g., HaploReg and LDlink) are based on whole-genome sequence (WGS) data from 2,504 individuals in the 1000 Genomes Project. Here, we present TOP-LD, an online tool to explore LD inferred with high-coverage (∼30×) WGS data from 15,578 individuals in the NHLBI Trans-Omics for Precision Medicine (TOPMed) program. TOP-LD provides a significant upgrade compared to current LD tools, as the TOPMed WGS data provide a more comprehensive representation of genetic variation than the 1000 Genomes data, particularly for rare variants and in the specific populations that we analyzed. For example, TOP-LD encompasses LD information for 150.3, 62.2, and 36.7 million variants for European, African, and East Asian ancestral samples, respectively, offering 2.6- to 9.1-fold increase in variant coverage compared to HaploReg 4.0 or LDlink. In addition, TOP-LD includes tens of thousands of structural variants (SVs). We demonstrate the value of TOP-LD in fine-mapping at the GGT1 locus associated with gamma glutamyltransferase in the African ancestry participants in UK Biobank. Beyond fine-mapping, TOP-LD can facilitate a wide range of applications that are based on summary statistics and estimates of LD. TOP-LD is freely available online.


Asunto(s)
Estudio de Asociación del Genoma Completo , Medicina de Precisión , Pueblo Asiatico , Humanos , Desequilibrio de Ligamiento/genética , Polimorfismo de Nucleótido Simple/genética , Secuenciación Completa del Genoma
12.
Am J Hum Genet ; 109(4): 669-679, 2022 04 07.
Artículo en Inglés | MEDLINE | ID: mdl-35263625

RESUMEN

One mechanism by which genetic factors influence complex traits and diseases is altering gene expression. Direct measurement of gene expression in relevant tissues is rarely tenable; however, genetically regulated gene expression (GReX) can be estimated using prediction models derived from large multi-omic datasets. These approaches have led to the discovery of many gene-trait associations, but whether models derived from predominantly European ancestry (EA) reference panels can map novel associations in ancestrally diverse populations remains unclear. We applied PrediXcan to impute GReX in 51,520 ancestrally diverse Population Architecture using Genomics and Epidemiology (PAGE) participants (35% African American, 45% Hispanic/Latino, 10% Asian, and 7% Hawaiian) across 25 key cardiometabolic traits and relevant tissues to identify 102 novel associations. We then compared associations in PAGE to those in a random subset of 50,000 White British participants from UK Biobank (UKBB50k) for height and body mass index (BMI). We identified 517 associations across 47 tissues in PAGE but not UKBB50k, demonstrating the importance of diverse samples in identifying trait-associated GReX. We observed that variants used in PrediXcan models were either more or less differentiated across continental-level populations than matched-control variants depending on the specific population reflecting sampling bias. Additionally, variants from identified genes specific to either PAGE or UKBB50k analyses were more ancestrally differentiated than those in genes detected in both analyses, underlining the value of population-specific discoveries. This suggests that while EA-derived transcriptome imputation models can identify new associations in non-EA populations, models derived from closely matched reference panels may yield further insights. Our findings call for more diversity in reference datasets of tissue-specific gene expression.


Asunto(s)
Enfermedades Cardiovasculares , Estudio de Asociación del Genoma Completo , Predisposición Genética a la Enfermedad , Humanos , Estilo de Vida , Polimorfismo de Nucleótido Simple , Transcriptoma
13.
Nat Methods ; 19(12): 1599-1611, 2022 12.
Artículo en Inglés | MEDLINE | ID: mdl-36303018

RESUMEN

Large-scale whole-genome sequencing studies have enabled analysis of noncoding rare-variant (RV) associations with complex human diseases and traits. Variant-set analysis is a powerful approach to study RV association. However, existing methods have limited ability in analyzing the noncoding genome. We propose a computationally efficient and robust noncoding RV association detection framework, STAARpipeline, to automatically annotate a whole-genome sequencing study and perform flexible noncoding RV association analysis, including gene-centric analysis and fixed window-based and dynamic window-based non-gene-centric analysis by incorporating variant functional annotations. In gene-centric analysis, STAARpipeline uses STAAR to group noncoding variants based on functional categories of genes and incorporate multiple functional annotations. In non-gene-centric analysis, STAARpipeline uses SCANG-STAAR to incorporate dynamic window sizes and multiple functional annotations. We apply STAARpipeline to identify noncoding RV sets associated with four lipid traits in 21,015 discovery samples from the Trans-Omics for Precision Medicine (TOPMed) program and replicate several of them in an additional 9,123 TOPMed samples. We also analyze five non-lipid TOPMed traits.


Asunto(s)
Estudio de Asociación del Genoma Completo , Genoma , Humanos , Estudio de Asociación del Genoma Completo/métodos , Secuenciación Completa del Genoma/métodos , Fenotipo , Variación Genética
14.
Circ Res ; 133(5): 376-386, 2023 08 18.
Artículo en Inglés | MEDLINE | ID: mdl-37489536

RESUMEN

BACKGROUND: Premature menopause is a risk factor for accelerated cardiovascular aging, but underlying mechanisms remain incompletely understood. This study investigated the role of leukocyte telomere length (LTL), a marker of cellular aging and genomic instability, in the association of premature menopause with cardiovascular disease. METHODS: Participants from the UK Biobank and Women's Health Initiative with complete reproductive history and LTL measurements were included. Primary analyses tested the association between age at menopause and LTL using multivariable-adjusted linear regression. Secondary analyses stratified women by history of gynecologic surgery. Mendelian randomization was used to infer causal relationships between LTL and age at natural menopause. Multivariable-adjusted Cox regression and mediation analyses tested the joint associations of premature menopause and LTL with incident coronary artery disease. RESULTS: This study included 130 254 postmenopausal women (UK Biobank: n=122 224; Women's Health Initiative: n=8030), of whom 4809 (3.7%) had experienced menopause before age 40. Earlier menopause was associated with shorter LTL (meta-analyzed ß=-0.02 SD/5 years of earlier menopause [95% CI, -0.02 to -0.01]; P=7.2×10-12). This association was stronger and significant in both cohorts for women with natural/spontaneous menopause (meta-analyzed ß=-0.04 SD/5 years of earlier menopause [95% CI, -0.04 to -0.03]; P<2.2×10-16) and was independent of hormone therapy use. Mendelian randomization supported a causal association of shorter genetically predicted LTL with earlier age at natural menopause. LTL and age at menopause were independently associated with incident coronary artery disease, and mediation analyses indicated small but significant mediation effects of LTL in the association of menopausal age with coronary artery disease. CONCLUSIONS: Earlier age at menopause is associated with shorter LTL, especially among women with natural menopause. Accelerated telomere shortening may contribute to the heightened cardiovascular risk associated with premature menopause.


Asunto(s)
Enfermedad de la Arteria Coronaria , Menopausia Prematura , Adulto , Femenino , Humanos , Enfermedad de la Arteria Coronaria/epidemiología , Enfermedad de la Arteria Coronaria/genética , Leucocitos , Menopausia/genética , Posmenopausia/genética , Telómero/genética
15.
Genet Epidemiol ; 47(1): 45-60, 2023 02.
Artículo en Inglés | MEDLINE | ID: mdl-36116031

RESUMEN

Populations of non-European ancestry are substantially underrepresented in genome-wide association studies (GWAS). As genetic effects can differ between ancestries due to possibly different causal variants or linkage disequilibrium patterns, a meta-analysis that includes GWAS of all populations yields biased estimation in each of the populations and the bias disproportionately impacts non-European ancestry populations. This is because meta-analysis combines study-specific estimates with inverse variance as the weights, which causes biases towards studies with the largest sample size, typical of the European ancestry population. In this paper, we propose two empirical Bayes (EB) estimators to borrow the strength of information across populations although accounting for between-population heterogeneity. Extensive simulation studies show that the proposed EB estimators are largely unbiased and improve efficiency compared to the population-specific estimator. In contrast, even though the meta-analysis estimator has a much smaller variance, it yields significant bias when the genetic effect is heterogeneous across populations. We apply the proposed EB estimators to a large-scale trans-ancestry GWAS of stroke and demonstrate that the EB estimators reduce the variance of the population-specific estimator substantially, with the effect estimates close to the population-specific estimates.


Asunto(s)
Estudio de Asociación del Genoma Completo , Modelos Genéticos , Humanos , Teorema de Bayes , Simulación por Computador , Desequilibrio de Ligamiento
16.
Hum Mol Genet ; 31(3): 347-361, 2022 02 03.
Artículo en Inglés | MEDLINE | ID: mdl-34553764

RESUMEN

Platelets play a key role in thrombosis and hemostasis. Platelet count (PLT) and mean platelet volume (MPV) are highly heritable quantitative traits, with hundreds of genetic signals previously identified, mostly in European ancestry populations. We here utilize whole genome sequencing (WGS) from NHLBI's Trans-Omics for Precision Medicine initiative (TOPMed) in a large multi-ethnic sample to further explore common and rare variation contributing to PLT (n = 61 200) and MPV (n = 23 485). We identified and replicated secondary signals at MPL (rs532784633) and PECAM1 (rs73345162), both more common in African ancestry populations. We also observed rare variation in Mendelian platelet-related disorder genes influencing variation in platelet traits in TOPMed cohorts (not enriched for blood disorders). For example, association of GP9 with lower PLT and higher MPV was partly driven by a pathogenic Bernard-Soulier syndrome variant (rs5030764, p.Asn61Ser), and the signals at TUBB1 and CD36 were partly driven by loss of function variants not annotated as pathogenic in ClinVar (rs199948010 and rs571975065). However, residual signal remained for these gene-based signals after adjusting for lead variants, suggesting that additional variants in Mendelian genes with impacts in general population cohorts remain to be identified. Gene-based signals were also identified at several genome-wide association study identified loci for genes not annotated for Mendelian platelet disorders (PTPRH, TET2, CHEK2), with somatic variation driving the result at TET2. These results highlight the value of WGS in populations of diverse genetic ancestry to identify novel regulatory and coding signals, even for well-studied traits like platelet traits.


Asunto(s)
Estudio de Asociación del Genoma Completo , Medicina de Precisión , Plaquetas , Humanos , National Heart, Lung, and Blood Institute (U.S.) , Fenotipo , Polimorfismo de Nucleótido Simple , Medicina de Precisión/métodos , Estados Unidos
17.
N Engl J Med ; 384(5): 440-451, 2021 02 04.
Artículo en Inglés | MEDLINE | ID: mdl-33471974

RESUMEN

BACKGROUND: Population-based estimates of the risk of breast cancer associated with germline pathogenic variants in cancer-predisposition genes are critically needed for risk assessment and management in women with inherited pathogenic variants. METHODS: In a population-based case-control study, we performed sequencing using a custom multigene amplicon-based panel to identify germline pathogenic variants in 28 cancer-predisposition genes among 32,247 women with breast cancer (case patients) and 32,544 unaffected women (controls) from population-based studies in the Cancer Risk Estimates Related to Susceptibility (CARRIERS) consortium. Associations between pathogenic variants in each gene and the risk of breast cancer were assessed. RESULTS: Pathogenic variants in 12 established breast cancer-predisposition genes were detected in 5.03% of case patients and in 1.63% of controls. Pathogenic variants in BRCA1 and BRCA2 were associated with a high risk of breast cancer, with odds ratios of 7.62 (95% confidence interval [CI], 5.33 to 11.27) and 5.23 (95% CI, 4.09 to 6.77), respectively. Pathogenic variants in PALB2 were associated with a moderate risk (odds ratio, 3.83; 95% CI, 2.68 to 5.63). Pathogenic variants in BARD1, RAD51C, and RAD51D were associated with increased risks of estrogen receptor-negative breast cancer and triple-negative breast cancer, whereas pathogenic variants in ATM, CDH1, and CHEK2 were associated with an increased risk of estrogen receptor-positive breast cancer. Pathogenic variants in 16 candidate breast cancer-predisposition genes, including the c.657_661del5 founder pathogenic variant in NBN, were not associated with an increased risk of breast cancer. CONCLUSIONS: This study provides estimates of the prevalence and risk of breast cancer associated with pathogenic variants in known breast cancer-predisposition genes in the U.S. population. These estimates can inform cancer testing and screening and improve clinical management strategies for women in the general population with inherited pathogenic variants in these genes. (Funded by the National Institutes of Health and the Breast Cancer Research Foundation.).


Asunto(s)
Neoplasias de la Mama/genética , Predisposición Genética a la Enfermedad/genética , Variación Genética , Adulto , Anciano , Anciano de 80 o más Años , Estudios de Casos y Controles , Femenino , Humanos , Persona de Mediana Edad , Mutación , Oportunidad Relativa , Riesgo , Análisis de Secuencia de ADN , Adulto Joven
18.
JAMA ; 331(20): 1748-1760, 2024 05 28.
Artículo en Inglés | MEDLINE | ID: mdl-38691368

RESUMEN

Importance: Approximately 55 million people in the US and approximately 1.1 billion people worldwide are postmenopausal women. To inform clinical practice about the health effects of menopausal hormone therapy, calcium plus vitamin D supplementation, and a low-fat dietary pattern, the Women's Health Initiative (WHI) enrolled 161 808 postmenopausal US women (N = 68 132 in the clinical trials) aged 50 to 79 years at baseline from 1993 to 1998, and followed them up for up to 20 years. Observations: The WHI clinical trial results do not support hormone therapy with oral conjugated equine estrogens plus medroxyprogesterone acetate for postmenopausal women or conjugated equine estrogens alone for those with prior hysterectomy to prevent cardiovascular disease, dementia, or other chronic diseases. However, hormone therapy is effective for treating moderate to severe vasomotor and other menopausal symptoms. These benefits of hormone therapy in early menopause, combined with lower rates of adverse effects of hormone therapy in early compared with later menopause, support initiation of hormone therapy before age 60 years for women without contraindications to hormone therapy who have bothersome menopausal symptoms. The WHI results do not support routinely recommending calcium plus vitamin D supplementation for fracture prevention in all postmenopausal women. However, calcium and vitamin D are appropriate for women who do not meet national guidelines for recommended intakes of these nutrients through diet. A low-fat dietary pattern with increased fruit, vegetable, and grain consumption did not prevent the primary outcomes of breast or colorectal cancer but was associated with lower rates of the secondary outcome of breast cancer mortality during long-term follow-up. Conclusions and Relevance: For postmenopausal women, the WHI randomized clinical trials do not support menopausal hormone therapy to prevent cardiovascular disease or other chronic diseases. Menopausal hormone therapy is appropriate to treat bothersome vasomotor symptoms among women in early menopause, without contraindications, who are interested in taking hormone therapy. The WHI evidence does not support routine supplementation with calcium plus vitamin D for menopausal women to prevent fractures or a low-fat diet with increased fruits, vegetables, and grains to prevent breast or colorectal cancer. A potential role of a low-fat dietary pattern in reducing breast cancer mortality, a secondary outcome, warrants further study.


Asunto(s)
Neoplasias de la Mama , Enfermedades Cardiovasculares , Suplementos Dietéticos , Terapia de Reemplazo de Estrógeno , Salud de la Mujer , Anciano , Femenino , Humanos , Persona de Mediana Edad , Neoplasias de la Mama/prevención & control , Calcio/uso terapéutico , Calcio/administración & dosificación , Calcio de la Dieta/administración & dosificación , Enfermedades Cardiovasculares/prevención & control , Dieta con Restricción de Grasas , Terapia de Reemplazo de Estrógeno/efectos adversos , Estrógenos Conjugados (USP)/uso terapéutico , Estrógenos Conjugados (USP)/administración & dosificación , Estrógenos Conjugados (USP)/efectos adversos , Sofocos/tratamiento farmacológico , Acetato de Medroxiprogesterona/administración & dosificación , Acetato de Medroxiprogesterona/uso terapéutico , Acetato de Medroxiprogesterona/efectos adversos , Osteoporosis Posmenopáusica/prevención & control , Osteoporosis Posmenopáusica/tratamiento farmacológico , Posmenopausia , Ensayos Clínicos Controlados Aleatorios como Asunto , Vitamina D/uso terapéutico , Vitamina D/administración & dosificación , Estados Unidos
19.
Diabetologia ; 66(7): 1273-1288, 2023 07.
Artículo en Inglés | MEDLINE | ID: mdl-37148359

RESUMEN

AIMS/HYPOTHESIS: The Latino population has been systematically underrepresented in large-scale genetic analyses, and previous studies have relied on the imputation of ungenotyped variants based on the 1000 Genomes (1000G) imputation panel, which results in suboptimal capture of low-frequency or Latino-enriched variants. The National Heart, Lung, and Blood Institute (NHLBI) Trans-Omics for Precision Medicine (TOPMed) released the largest multi-ancestry genotype reference panel representing a unique opportunity to analyse rare genetic variations in the Latino population. We hypothesise that a more comprehensive analysis of low/rare variation using the TOPMed panel would improve our knowledge of the genetics of type 2 diabetes in the Latino population. METHODS: We evaluated the TOPMed imputation performance using genotyping array and whole-exome sequence data in six Latino cohorts. To evaluate the ability of TOPMed imputation to increase the number of identified loci, we performed a Latino type 2 diabetes genome-wide association study (GWAS) meta-analysis in 8150 individuals with type 2 diabetes and 10,735 control individuals and replicated the results in six additional cohorts including whole-genome sequence data from the All of Us cohort. RESULTS: Compared with imputation with 1000G, the TOPMed panel improved the identification of rare and low-frequency variants. We identified 26 genome-wide significant signals including a novel variant (minor allele frequency 1.7%; OR 1.37, p=3.4 × 10-9). A Latino-tailored polygenic score constructed from our data and GWAS data from East Asian and European populations improved the prediction accuracy in a Latino target dataset, explaining up to 7.6% of the type 2 diabetes risk variance. CONCLUSIONS/INTERPRETATION: Our results demonstrate the utility of TOPMed imputation for identifying low-frequency variants in understudied populations, leading to the discovery of novel disease associations and the improvement of polygenic scores. DATA AVAILABILITY: Full summary statistics are available through the Common Metabolic Diseases Knowledge Portal ( https://t2d.hugeamp.org/downloads.html ) and through the GWAS catalog ( https://www.ebi.ac.uk/gwas/ , accession ID: GCST90255648). Polygenic score (PS) weights for each ancestry are available via the PGS catalog ( https://www.pgscatalog.org , publication ID: PGP000445, scores IDs: PGS003443, PGS003444 and PGS003445).


Asunto(s)
Diabetes Mellitus Tipo 2 , Salud Poblacional , Humanos , Estudio de Asociación del Genoma Completo , Diabetes Mellitus Tipo 2/genética , Medicina de Precisión , Genotipo , Hispánicos o Latinos/genética , Polimorfismo de Nucleótido Simple/genética
20.
Genet Epidemiol ; 46(1): 3-16, 2022 02.
Artículo en Inglés | MEDLINE | ID: mdl-34779012

RESUMEN

Hematological measures are important intermediate clinical phenotypes for many acute and chronic diseases and are highly heritable. Although genome-wide association studies (GWAS) have identified thousands of loci containing trait-associated variants, the causal genes underlying these associations are often uncertain. To better understand the underlying genetic regulatory mechanisms, we performed a transcriptome-wide association study (TWAS) to systematically investigate the association between genetically predicted gene expression and hematological measures in 54,542 Europeans from the Genetic Epidemiology Research on Aging cohort. We found 239 significant gene-trait associations with hematological measures; we replicated 71 associations at p < 0.05 in a TWAS meta-analysis consisting of up to 35,900 Europeans from the Women's Health Initiative, Atherosclerosis Risk in Communities Study, and BioMe Biobank. Additionally, we attempted to refine this list of candidate genes by performing conditional analyses, adjusting for individual variants previously associated with hematological measures, and performed further fine-mapping of TWAS loci. To facilitate interpretation of our findings, we designed an R Shiny application to interactively visualize our TWAS results by integrating them with additional genetic data sources (GWAS, TWAS from multiple reference panels, conditional analyses, known GWAS variants, etc.). Our results and application highlight frequently overlooked TWAS challenges and illustrate the complexity of TWAS fine-mapping.


Asunto(s)
Estudio de Asociación del Genoma Completo , Transcriptoma , Células Sanguíneas , Femenino , Predisposición Genética a la Enfermedad , Estudio de Asociación del Genoma Completo/métodos , Humanos , Fenotipo , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA