Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 41.516
Filtrar
Mais filtros

Intervalo de ano de publicação
1.
Cell ; 182(5): 1198-1213.e14, 2020 09 03.
Artigo em Inglês | MEDLINE | ID: mdl-32888493

RESUMO

Most loci identified by GWASs have been found in populations of European ancestry (EUR). In trans-ethnic meta-analyses for 15 hematological traits in 746,667 participants, including 184,535 non-EUR individuals, we identified 5,552 trait-variant associations at p < 5 × 10-9, including 71 novel associations not found in EUR populations. We also identified 28 additional novel variants in ancestry-specific, non-EUR meta-analyses, including an IL7 missense variant in South Asians associated with lymphocyte count in vivo and IL-7 secretion levels in vitro. Fine-mapping prioritized variants annotated as functional and generated 95% credible sets that were 30% smaller when using the trans-ethnic as opposed to the EUR-only results. We explored the clinical significance and predictive value of trans-ethnic variants in multiple populations and compared genetic architecture and the effect of natural selection on these blood phenotypes between populations. Altogether, our results for hematological traits highlight the value of a more global representation of populations in genetic studies.


Assuntos
Povo Asiático/genética , Mutação de Sentido Incorreto/genética , Polimorfismo de Nucleotídeo Único/genética , População Branca/genética , Genética , Estudo de Associação Genômica Ampla/métodos , Células HEK293 , Humanos , Interleucina-7/genética , Fenótipo
2.
Cell ; 183(7): 1962-1985.e31, 2020 12 23.
Artigo em Inglês | MEDLINE | ID: mdl-33242424

RESUMO

We report a comprehensive proteogenomics analysis, including whole-genome sequencing, RNA sequencing, and proteomics and phosphoproteomics profiling, of 218 tumors across 7 histological types of childhood brain cancer: low-grade glioma (n = 93), ependymoma (32), high-grade glioma (25), medulloblastoma (22), ganglioglioma (18), craniopharyngioma (16), and atypical teratoid rhabdoid tumor (12). Proteomics data identify common biological themes that span histological boundaries, suggesting that treatments used for one histological type may be applied effectively to other tumors sharing similar proteomics features. Immune landscape characterization reveals diverse tumor microenvironments across and within diagnoses. Proteomics data further reveal functional effects of somatic mutations and copy number variations (CNVs) not evident in transcriptomics data. Kinase-substrate association and co-expression network analysis identify important biological mechanisms of tumorigenesis. This is the first large-scale proteogenomics analysis across traditional histological boundaries to uncover foundational pediatric brain tumor biology and inform rational treatment selection.


Assuntos
Neoplasias Encefálicas/genética , Neoplasias Encefálicas/patologia , Proteogenômica , Neoplasias Encefálicas/imunologia , Criança , Variações do Número de Cópias de DNA/genética , Regulação Neoplásica da Expressão Gênica , Redes Reguladoras de Genes , Genoma Humano , Glioma/genética , Glioma/patologia , Humanos , Linfócitos do Interstício Tumoral/imunologia , Mutação/genética , Gradação de Tumores , Recidiva Local de Neoplasia/patologia , Fosfoproteínas/metabolismo , Fosforilação , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Transcriptoma/genética
3.
Cell ; 177(3): 587-596.e9, 2019 04 18.
Artigo em Inglês | MEDLINE | ID: mdl-31002795

RESUMO

Severe obesity is a rapidly growing global health threat. Although often attributed to unhealthy lifestyle choices or environmental factors, obesity is known to be heritable and highly polygenic; the majority of inherited susceptibility is related to the cumulative effect of many common DNA variants. Here we derive and validate a new polygenic predictor comprised of 2.1 million common variants to quantify this susceptibility and test this predictor in more than 300,000 individuals ranging from middle age to birth. Among middle-aged adults, we observe a 13-kg gradient in weight and a 25-fold gradient in risk of severe obesity across polygenic score deciles. In a longitudinal birth cohort, we note minimal differences in birthweight across score deciles, but a significant gradient emerged in early childhood and reached 12 kg by 18 years of age. This new approach to quantify inherited susceptibility to obesity affords new opportunities for clinical prevention and mechanistic assessment.


Assuntos
Peso Corporal , Herança Multifatorial/genética , Obesidade/patologia , Adolescente , Índice de Massa Corporal , Criança , Bases de Dados Factuais , Feminino , Estudo de Associação Genômica Ampla , Humanos , Recém-Nascido , Estudos Longitudinais , Masculino , Pessoa de Meia-Idade , Obesidade/genética , Fatores de Risco , Índice de Gravidade de Doença
4.
Physiol Rev ; 103(3): 2039-2055, 2023 07 01.
Artigo em Inglês | MEDLINE | ID: mdl-36634218

RESUMO

Genome-wide association studies (GWAS) aim to identify common genetic variants that are associated with traits and diseases. Since 2005, more than 5,000 GWAS have been published for almost as many traits. These studies have offered insights into the loci and genes underlying phenotypic traits, have highlighted genetic correlations across traits and diseases, and are beginning to demonstrate clinical utility by identifying individuals at increased risk for common diseases. GWAS have been widely utilized across cardiovascular diseases and associated phenotypic traits, with insights facilitated by multicenter registry studies and large biobank data sets. In this review, we describe how GWAS have informed the genetic architecture of cardiovascular diseases and the insights they have provided into disease pathophysiology, using archetypal conditions for both common and rare diseases. We also describe how biobank data sets can complement disease-specific studies, particularly for rarer cardiovascular diseases, and how findings from GWAS have the potential to impact on clinical care. Finally, we discuss the outstanding challenges facing research in this field and how they can be addressed.


Assuntos
Doenças Cardiovasculares , Estudo de Associação Genômica Ampla , Humanos , Doenças Cardiovasculares/genética , Fenótipo , Predisposição Genética para Doença , Estudos Multicêntricos como Assunto
5.
Annu Rev Genet ; 54: 189-211, 2020 11 23.
Artigo em Inglês | MEDLINE | ID: mdl-32867542

RESUMO

Canalization refers to the evolution of populations such that the number of individuals who deviate from the optimum trait, or experience disease, is minimized. In the presence of rapid cultural, environmental, or genetic change, the reverse process of decanalization may contribute to observed increases in disease prevalence. This review starts by defining relevant concepts, drawing distinctions between the canalization of populations and robustness of individuals. It then considers evidence pertaining to three continuous traits and six domains of disease. In each case, existing genetic evidence for genotype-by-environment interactions is insufficient to support a strong inference of decanalization, but we argue that the advent of genome-wide polygenic risk assessment now makes an empirical evaluation of the role of canalization in preventing disease possible. Finally, the contributions of both rare and common variants to congenital abnormality and adult onset disease are considered in light of a new kerplunk model of genetic effects.


Assuntos
Doenças Genéticas Inatas/genética , Genoma/genética , Genética Humana/métodos , Variação Genética/genética , Genótipo , Humanos , Filogenia
6.
Annu Rev Pharmacol Toxicol ; 64: 33-51, 2024 Jan 23.
Artigo em Inglês | MEDLINE | ID: mdl-37506333

RESUMO

Interindividual variability in genes encoding drug-metabolizing enzymes, transporters, receptors, and human leukocyte antigens has a major impact on a patient's response to drugs with regard to efficacy and safety. Enabled by both technological and conceptual advances, the field of pharmacogenomics is developing rapidly. Major progress in omics profiling methods has enabled novel genotypic and phenotypic characterization of patients and biobanks. These developments are paralleled by advances in machine learning, which have allowed us to parse the immense wealth of data and establish novel genetic markers and polygenic models for drug selection and dosing. Pharmacogenomics has recently become more widespread in clinical practice to personalize treatment and to develop new drugs tailored to specific patient populations. In this review, we provide an overview of the latest developments in the field and discuss the way forward, including how to address the missing heritability, develop novel polygenic models, and further improve the clinical implementation of pharmacogenomics.


Assuntos
Proteínas de Membrana Transportadoras , Farmacogenética , Humanos , Tecnologia
7.
Am J Hum Genet ; 111(6): 1006-1017, 2024 Jun 06.
Artigo em Inglês | MEDLINE | ID: mdl-38703768

RESUMO

We present shaPRS, a method that leverages widespread pleiotropy between traits or shared genetic effects across ancestries, to improve the accuracy of polygenic scores. The method uses genome-wide summary statistics from two diseases or ancestries to improve the genetic effect estimate and standard error at SNPs where there is homogeneity of effect between the two datasets. When there is significant evidence of heterogeneity, the genetic effect from the disease or population closest to the target population is maintained. We show via simulation and a series of real-world examples that shaPRS substantially enhances the accuracy of polygenic risk scores (PRSs) for complex diseases and greatly improves PRS performance across ancestries. shaPRS is a PRS pre-processing method that is agnostic to the actual PRS generation method, and as a result, it can be integrated into existing PRS generation pipelines and continue to be applied as more performant PRS methods are developed over time.


Assuntos
Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Herança Multifatorial , Polimorfismo de Nucleotídeo Único , Herança Multifatorial/genética , Humanos , Modelos Genéticos , Simulação por Computador , Pleiotropia Genética , Fenótipo
8.
Proc Natl Acad Sci U S A ; 121(4): e2308942121, 2024 Jan 23.
Artigo em Inglês | MEDLINE | ID: mdl-38241441

RESUMO

In the Antibody Mediated Prevention (AMP) trials (HVTN 704/HPTN 085 and HVTN 703/HPTN 081), prevention efficacy (PE) of the monoclonal broadly neutralizing antibody (bnAb) VRC01 (vs. placebo) against HIV-1 acquisition diagnosis varied according to the HIV-1 Envelope (Env) neutralization sensitivity to VRC01, as measured by 80% inhibitory concentration (IC80). Here, we performed a genotypic sieve analysis, a complementary approach to gaining insight into correlates of protection that assesses how PE varies with HIV-1 sequence features. We analyzed HIV-1 Env amino acid (AA) sequences from the earliest available HIV-1 RNA-positive plasma samples from AMP participants diagnosed with HIV-1 and identified Env sequence features that associated with PE. The strongest Env AA sequence correlate in both trials was VRC01 epitope distance that quantifies the divergence of the VRC01 epitope in an acquired HIV-1 isolate from the VRC01 epitope of reference HIV-1 strains that were most sensitive to VRC01-mediated neutralization. In HVTN 704/HPTN 085, the Env sequence-based predicted probability that VRC01 IC80 against the acquired isolate exceeded 1 µg/mL also significantly associated with PE. In HVTN 703/HPTN 081, a physicochemical-weighted Hamming distance across 50 VRC01 binding-associated Env AA positions of the acquired isolate from the most VRC01-sensitive HIV-1 strain significantly associated with PE. These results suggest that incorporating mutation scoring by BLOSUM62 and weighting by the strength of interactions at AA positions in the epitope:VRC01 interface can optimize performance of an Env sequence-based biomarker of VRC01 prevention efficacy. Future work could determine whether these results extend to other bnAbs and bnAb combinations.


Assuntos
Infecções por HIV , Soropositividade para HIV , HIV-1 , Humanos , Anticorpos Amplamente Neutralizantes , Anticorpos Neutralizantes , Anticorpos Anti-HIV , Epitopos/genética
9.
Hum Mol Genet ; 2024 Apr 27.
Artigo em Inglês | MEDLINE | ID: mdl-38679805

RESUMO

Late-Onset Alzheimer's Disease (LOAD) is a heterogeneous neurodegenerative disorder with complex etiology and high heritability. Its multifactorial risk profile and large portions of unexplained heritability suggest the involvement of yet unidentified genetic risk factors. Here we describe the "whole person" genetic risk landscape of polygenic risk scores for 2218 traits in 2044 elderly individuals and test if novel eigen-PRSs derived from clustered subnetworks of single-trait PRSs can improve the prediction of LOAD diagnosis, rates of cognitive decline, and canonical LOAD neuropathology. Network analyses revealed distinct clusters of PRSs with clinical and biological interpretability. Novel eigen-PRSs (ePRS) from these clusters significantly improved LOAD-related phenotypes prediction over current state-of-the-art LOAD PRS models. Notably, an ePRS representing clusters of traits related to cholesterol levels was able to improve variance explained in a model of the brain-wide beta-amyloid burden by 1.7% (likelihood ratio test P = 9.02 × 10-7). All associations of ePRS with LOAD phenotypes were eliminated by the removal of APOE-proximal loci. However, our association analysis identified modules characterized by PRSs of high cholesterol and LOAD. We believe this is due to the influence of the APOE region from both PRSs. We found significantly higher mean SNP effects for LOAD in the intersecting APOE region SNPs. Combining genetic risk factors for vascular traits and dementia could improve current single-trait PRS models of LOAD, enhancing the use of PRS in risk stratification. Our results are catalogued for the scientific community, to aid in generating new hypotheses based on our maps of clustered PRSs and associations with LOAD-related phenotypes.

10.
Hum Mol Genet ; 33(2): 198-210, 2024 Jan 07.
Artigo em Inglês | MEDLINE | ID: mdl-37802914

RESUMO

CYP2A6, a genetically variable enzyme, inactivates nicotine, activates carcinogens, and metabolizes many pharmaceuticals. Variation in CYP2A6 influences smoking behaviors and tobacco-related disease risk. This phenome-wide association study examined associations between a reconstructed version of our weighted genetic risk score (wGRS) for CYP2A6 activity with diseases in the UK Biobank (N = 395 887). Causal effects of phenotypic CYP2A6 activity (measured as the nicotine metabolite ratio: 3'-hydroxycotinine/cotinine) on the phenome-wide significant (PWS) signals were then estimated in two-sample Mendelian Randomization using the wGRS as the instrument. Time-to-diagnosis age was compared between faster versus slower CYP2A6 metabolizers for the PWS signals in survival analyses. In the total sample, six PWS signals were identified: two lung cancers and four obstructive respiratory diseases PheCodes, where faster CYP2A6 activity was associated with greater disease risk (Ps < 1 × 10-6). A significant CYP2A6-by-smoking status interaction was found (Psinteraction < 0.05); in current smokers, the same six PWS signals were found as identified in the total group, whereas no PWS signals were found in former or never smokers. In the total sample and current smokers, CYP2A6 activity causal estimates on the six PWS signals were significant in Mendelian Randomization (Ps < 5 × 10-5). Additionally, faster CYP2A6 metabolizer status was associated with younger age of disease diagnosis for the six PWS signals (Ps < 5 × 10-4, in current smokers). These findings support a role for faster CYP2A6 activity as a causal risk factor for lung cancers and obstructive respiratory diseases among current smokers, and a younger onset of these diseases. This research utilized the UK Biobank Resource.


Assuntos
Neoplasias Pulmonares , Doenças Respiratórias , Humanos , Nicotina/genética , Análise da Randomização Mendeliana , Fumar/efeitos adversos , Fumar/genética , Neoplasias Pulmonares/genética , Doenças Respiratórias/complicações , Citocromo P-450 CYP2A6/genética , Citocromo P-450 CYP2A6/metabolismo
11.
Hum Mol Genet ; 2024 Jun 16.
Artigo em Inglês | MEDLINE | ID: mdl-38879759

RESUMO

Venous thromboembolism (VTE) is a significant contributor to morbidity and mortality, with large disparities in incidence rates between Black and White Americans. Polygenic risk scores (PRSs) limited to variants discovered in genome-wide association studies in European-ancestry samples can identify European-ancestry individuals at high risk of VTE. However, there is limited evidence on whether high-dimensional PRS constructed using more sophisticated methods and more diverse training data can enhance the predictive ability and their utility across diverse populations. We developed PRSs for VTE using summary statistics from the International Network against Venous Thrombosis (INVENT) consortium genome-wide association studies meta-analyses of European- (71 771 cases and 1 059 740 controls) and African-ancestry samples (7482 cases and 129 975 controls). We used LDpred2 and PRS-CSx to construct ancestry-specific and multi-ancestry PRSs and evaluated their performance in an independent European- (6781 cases and 103 016 controls) and African-ancestry sample (1385 cases and 12 569 controls). Multi-ancestry PRSs with weights tuned in European-ancestry samples slightly outperformed ancestry-specific PRSs in European-ancestry test samples (e.g. the area under the receiver operating curve [AUC] was 0.609 for PRS-CSx_combinedEUR and 0.608 for PRS-CSxEUR [P = 0.00029]). Multi-ancestry PRSs with weights tuned in African-ancestry samples also outperformed ancestry-specific PRSs in African-ancestry test samples (PRS-CSxAFR: AUC = 0.58, PRS-CSx_combined AFR: AUC = 0.59), although this difference was not statistically significant (P = 0.34). The highest fifth percentile of the best-performing PRS was associated with 1.9-fold and 1.68-fold increased risk for VTE among European- and African-ancestry subjects, respectively, relative to those in the middle stratum. These findings suggest that the multi-ancestry PRS might be used to improve performance across diverse populations to identify individuals at highest risk for VTE.

12.
Hum Mol Genet ; 33(14): 1262-1272, 2024 Jul 06.
Artigo em Inglês | MEDLINE | ID: mdl-38676403

RESUMO

BACKGROUND: Genetic susceptibility to various chronic diseases has been shown to influence heart failure (HF) risk. However, the underlying biological pathways, particularly the role of leukocyte telomere length (LTL), are largely unknown. We investigated the impact of genetic susceptibility to chronic diseases and various traits on HF risk, and whether LTL mediates or modifies the pathways. METHODS: We conducted prospective cohort analyses on 404 883 European participants from the UK Biobank, including 9989 incident HF cases. Multivariable Cox regression was used to estimate associations between HF risk and 24 polygenic risk scores (PRSs) for various diseases or traits previously generated using a Bayesian approach. We assessed multiplicative interactions between the PRSs and LTL previously measured in the UK Biobank using quantitative PCR. Causal mediation analyses were conducted to estimate the proportion of the total effect of PRSs acting indirectly through LTL, an integrative marker of biological aging. RESULTS: We identified 9 PRSs associated with HF risk, including those for various cardiovascular diseases or traits, rheumatoid arthritis (P = 1.3E-04), and asthma (P = 1.8E-08). Additionally, longer LTL was strongly associated with decreased HF risk (P-trend = 1.7E-08). Notably, LTL strengthened the asthma-HF relationship significantly (P-interaction = 2.8E-03). However, LTL mediated only 1.13% (P < 0.001) of the total effect of the asthma PRS on HF risk. CONCLUSIONS: Our findings shed light onto the shared genetic susceptibility between HF risk, asthma, rheumatoid arthritis, and other traits. Longer LTL strengthened the genetic effect of asthma in the pathway to HF. These results support consideration of LTL and PRSs in HF risk prediction.


Assuntos
Predisposição Genética para Doença , Insuficiência Cardíaca , Leucócitos , Telômero , Humanos , Insuficiência Cardíaca/genética , Insuficiência Cardíaca/epidemiologia , Feminino , Leucócitos/metabolismo , Masculino , Pessoa de Meia-Idade , Telômero/genética , Doença Crônica , Idoso , Estudos Prospectivos , Homeostase do Telômero/genética , Fatores de Risco , Polimorfismo de Nucleotídeo Único , Adulto , Herança Multifatorial/genética , Estudo de Associação Genômica Ampla , População Branca/genética , População Europeia
13.
Trends Genet ; 39(2): 98-108, 2023 02.
Artigo em Inglês | MEDLINE | ID: mdl-36564319

RESUMO

Traditional classification of genetic diseases as monogenic and polygenic has lagged far behind scientific progress. In this opinion article, we propose and define a new terminology, genetically transitional disease (GTD), referring to cases where a large-effect mutation is necessary, but not sufficient, to cause disease. This leads to a working disease nosology based on gradients of four types of genetic architecture: monogenic, polygenic, GTD, and mixed. We present four scenarios under which GTD may occur; namely, subsets of traditionally Mendelian disease, modifiable Tier 1 monogenic conditions, variable penetrance, and situations where a genetic mutational spectrum produces qualitatively divergent pathologies. The implications of the new nosology in precision medicine are discussed, in which therapeutic options may target the molecular cause or the disease phenotype.


Assuntos
Medicina Genômica , Herança Multifatorial , Humanos , Fenótipo , Mutação , Herança Multifatorial/genética , Predisposição Genética para Doença
14.
Annu Rev Med ; 75: 233-245, 2024 Jan 29.
Artigo em Inglês | MEDLINE | ID: mdl-37751367

RESUMO

The MELD (model for end-stage liver disease) 3.0 score was developed to replace the MELD-Na score that is currently used to prioritize liver allocation for cirrhotic patients awaiting liver transplantation in the United States. The MELD 3.0 calculator includes new inputs from patient sex and serum albumin levels and has new weights for serum sodium, bilirubin, international normalized ratio, and creatinine levels. It is expected that use of MELD 3.0 scores will reduce overall waitlist mortality modestly and improve access for female liver transplant candidates. The utility of MELD 3.0 and PELDcre (pediatric end-stage liver disease, creatinine) scores for risk stratification in cirrhotic patients undergoing major abdominal surgery, placement of a transjugular intrahepatic portosystemic shunt, and other interventions requires further study. This article reviews the background of the MELD score and the rationale to create MELD 3.0 as well as potential implications of using this newer risk stratification tool in clinical practice.


Assuntos
Doença Hepática Terminal , Humanos , Feminino , Estados Unidos , Criança , Doença Hepática Terminal/cirurgia , Creatinina , Índice de Gravidade de Doença , Cirrose Hepática/cirurgia , Estudos Retrospectivos , Prognóstico
15.
Am J Hum Genet ; 110(11): 1888-1902, 2023 11 02.
Artigo em Inglês | MEDLINE | ID: mdl-37890495

RESUMO

Admixed individuals offer unique opportunities for addressing limited transferability in polygenic scores (PGSs), given the substantial trans-ancestry genetic correlation in many complex traits. However, they are rarely considered in PGS training, given the challenges in representing ancestry-matched linkage-disequilibrium reference panels for admixed individuals. Here we present inclusive PGS (iPGS), which captures ancestry-shared genetic effects by finding the exact solution for penalized regression on individual-level data and is thus naturally applicable to admixed individuals. We validate our approach in a simulation study across 33 configurations with varying heritability, polygenicity, and ancestry composition in the training set. When iPGS is applied to n = 237,055 ancestry-diverse individuals in the UK Biobank, it shows the greatest improvements in Africans by 48.9% on average across 60 quantitative traits and up to 50-fold improvements for some traits (neutrophil count, R2 = 0.058) over the baseline model trained on the same number of European individuals. When we allowed iPGS to use n = 284,661 individuals, we observed an average improvement of 60.8% for African, 11.6% for South Asian, 7.3% for non-British White, 4.8% for White British, and 17.8% for the other individuals. We further developed iPGS+refit to jointly model the ancestry-shared and -dependent genetic effects when heterogeneous genetic associations were present. For neutrophil count, for example, iPGS+refit showed the highest predictive performance in the African group (R2 = 0.115), which exceeds the best predictive performance for the White British group (R2 = 0.090 in the iPGS model), even though only 1.49% of individuals used in the iPGS training are of African ancestry. Our results indicate the power of including diverse individuals for developing more equitable PGS models.


Assuntos
Herança Multifatorial , População Branca , Humanos , Herança Multifatorial/genética , População Branca/genética , Fenótipo , População Negra/genética , Povo Asiático/genética , Estudo de Associação Genômica Ampla/métodos
16.
Am J Hum Genet ; 110(7): 1200-1206, 2023 07 06.
Artigo em Inglês | MEDLINE | ID: mdl-37311464

RESUMO

Genome-wide polygenic risk scores (GW-PRSs) have been reported to have better predictive ability than PRSs based on genome-wide significance thresholds across numerous traits. We compared the predictive ability of several GW-PRS approaches to a recently developed PRS of 269 established prostate cancer-risk variants from multi-ancestry GWASs and fine-mapping studies (PRS269). GW-PRS models were trained with a large and diverse prostate cancer GWAS of 107,247 cases and 127,006 controls that we previously used to develop the multi-ancestry PRS269. Resulting models were independently tested in 1,586 cases and 1,047 controls of African ancestry from the California Uganda Study and 8,046 cases and 191,825 controls of European ancestry from the UK Biobank and further validated in 13,643 cases and 210,214 controls of European ancestry and 6,353 cases and 53,362 controls of African ancestry from the Million Veteran Program. In the testing data, the best performing GW-PRS approach had AUCs of 0.656 (95% CI = 0.635-0.677) in African and 0.844 (95% CI = 0.840-0.848) in European ancestry men and corresponding prostate cancer ORs of 1.83 (95% CI = 1.67-2.00) and 2.19 (95% CI = 2.14-2.25), respectively, for each SD unit increase in the GW-PRS. Compared to the GW-PRS, in African and European ancestry men, the PRS269 had larger or similar AUCs (AUC = 0.679, 95% CI = 0.659-0.700 and AUC = 0.845, 95% CI = 0.841-0.849, respectively) and comparable prostate cancer ORs (OR = 2.05, 95% CI = 1.87-2.26 and OR = 2.21, 95% CI = 2.16-2.26, respectively). Findings were similar in the validation studies. This investigation suggests that current GW-PRS approaches may not improve the ability to predict prostate cancer risk compared to the PRS269 developed from multi-ancestry GWASs and fine-mapping.


Assuntos
Predisposição Genética para Doença , Neoplasias da Próstata , Humanos , Masculino , População Negra/genética , Estudo de Associação Genômica Ampla , Herança Multifatorial/genética , Neoplasias da Próstata/genética , Fatores de Risco , População Branca/genética
17.
Am J Hum Genet ; 110(1): 13-22, 2023 01 05.
Artigo em Inglês | MEDLINE | ID: mdl-36460009

RESUMO

Polygenic risk score (PRS) has demonstrated its great utility in biomedical research through identifying high-risk individuals for different diseases from their genotypes. However, the broader application of PRS to the general population is hindered by the limited transferability of PRS developed in Europeans to non-European populations. To improve PRS prediction accuracy in non-European populations, we develop a statistical method called SDPRX that can effectively integrate genome wide association study summary statistics from different populations. SDPRX automatically adjusts for linkage disequilibrium differences between populations and characterizes the joint distribution of the effect sizes of a variant in two populations to be both null, population specific, or shared with correlation. Through simulations and applications to real traits, we show that SDPRX improves the prediction performance over existing methods in non-European populations.


Assuntos
Estudo de Associação Genômica Ampla , Herança Multifatorial , Humanos , Herança Multifatorial/genética , Estudo de Associação Genômica Ampla/métodos , Predisposição Genética para Doença , Fatores de Risco , Genótipo
18.
Am J Hum Genet ; 110(5): 741-761, 2023 05 04.
Artigo em Inglês | MEDLINE | ID: mdl-37030289

RESUMO

The advent of large-scale genome-wide association studies (GWASs) has motivated the development of statistical methods for phenotype prediction with single-nucleotide polymorphism (SNP) array data. These polygenic risk score (PRS) methods use a multiple linear regression framework to infer joint effect sizes of all genetic variants on the trait. Among the subset of PRS methods that operate on GWAS summary statistics, sparse Bayesian methods have shown competitive predictive ability. However, most existing Bayesian approaches employ Markov chain Monte Carlo (MCMC) algorithms, which are computationally inefficient and do not scale favorably to higher dimensions, for posterior inference. Here, we introduce variational inference of polygenic risk scores (VIPRS), a Bayesian summary statistics-based PRS method that utilizes variational inference techniques to approximate the posterior distribution for the effect sizes. Our experiments with 36 simulation configurations and 12 real phenotypes from the UK Biobank dataset demonstrated that VIPRS is consistently competitive with the state-of-the-art in prediction accuracy while being more than twice as fast as popular MCMC-based approaches. This performance advantage is robust across a variety of genetic architectures, SNP heritabilities, and independent GWAS cohorts. In addition to its competitive accuracy on the "White British" samples, VIPRS showed improved transferability when applied to other ethnic groups, with up to 1.7-fold increase in R2 among individuals of Nigerian ancestry for low-density lipoprotein (LDL) cholesterol. To illustrate its scalability, we applied VIPRS to a dataset of 9.6 million genetic markers, which conferred further improvements in prediction accuracy for highly polygenic traits, such as height.


Assuntos
Estudo de Associação Genômica Ampla , Herança Multifatorial , Humanos , Herança Multifatorial/genética , Estudo de Associação Genômica Ampla/métodos , Teorema de Bayes , Polimorfismo de Nucleotídeo Único/genética , Fatores de Risco , Predisposição Genética para Doença
19.
Am J Hum Genet ; 110(11): 1903-1918, 2023 11 02.
Artigo em Inglês | MEDLINE | ID: mdl-37816352

RESUMO

Despite whole-genome sequencing (WGS), many cases of single-gene disorders remain unsolved, impeding diagnosis and preventative care for people whose disease-causing variants escape detection. Since early WGS data analytic steps prioritize protein-coding sequences, to simultaneously prioritize variants in non-coding regions rich in transcribed and critical regulatory sequences, we developed GROFFFY, an analytic tool that integrates coordinates for regions with experimental evidence of functionality. Applied to WGS data from solved and unsolved hereditary hemorrhagic telangiectasia (HHT) recruits to the 100,000 Genomes Project, GROFFFY-based filtration reduced the mean number of variants/DNA from 4,867,167 to 21,486, without deleting disease-causal variants. In three unsolved cases (two related), GROFFFY identified ultra-rare deletions within the 3' untranslated region (UTR) of the tumor suppressor SMAD4, where germline loss-of-function alleles cause combined HHT and colonic polyposis (MIM: 175050). Sited >5.4 kb distal to coding DNA, the deletions did not modify or generate microRNA binding sites, but instead disrupted the sequence context of the final cleavage and polyadenylation site necessary for protein production: By iFoldRNA, an AAUAAA-adjacent 16-nucleotide deletion brought the cleavage site into inaccessible neighboring secondary structures, while a 4-nucleotide deletion unfolded the downstream RNA polymerase II roadblock. SMAD4 RNA expression differed to control-derived RNA from resting and cycloheximide-stressed peripheral blood mononuclear cells. Patterns predicted the mutational site for an unrelated HHT/polyposis-affected individual, where a complex insertion was subsequently identified. In conclusion, we describe a functional rare variant type that impacts regulatory systems based on RNA polyadenylation. Extension of coding sequence-focused gene panels is required to capture these variants.


Assuntos
Proteína Smad4 , Telangiectasia Hemorrágica Hereditária , Humanos , Sequência de Bases , DNA , Leucócitos Mononucleares/patologia , Nucleotídeos , Poliadenilação/genética , RNA , Proteína Smad4/genética , Telangiectasia Hemorrágica Hereditária/genética , Sequenciamento Completo do Genoma
20.
Am J Hum Genet ; 110(5): 722-740, 2023 05 04.
Artigo em Inglês | MEDLINE | ID: mdl-37060905

RESUMO

Coronary artery disease (CAD) is a pandemic disease where up to half of the risk is explained by genetic factors. Advanced insights into the genetic basis of CAD require deeper understanding of the contributions of different cell types, molecular pathways, and genes to disease heritability. Here, we investigate the biological diversity of atherosclerosis-associated cell states and interrogate their contribution to the genetic risk of CAD by using single-cell and bulk RNA sequencing (RNA-seq) of mouse and human lesions. We identified 12 disease-associated cell states that we characterized further by gene set functional profiling, ligand-receptor prediction, and transcription factor inference. Importantly, Vcam1+ smooth muscle cell state genes contributed most to SNP-based heritability of CAD. In line with this, genetic variants near smooth muscle cell state genes and regulatory elements explained the largest fraction of CAD-risk variance between individuals. Using this information for variant prioritization, we derived a hybrid polygenic risk score (PRS) that demonstrated improved performance over a classical PRS. Our results provide insights into the biological mechanisms associated with CAD risk, which could make a promising contribution to precision medicine and tailored therapeutic interventions in the future.


Assuntos
Aterosclerose , Doença da Artéria Coronariana , Humanos , Aterosclerose/genética , Doença da Artéria Coronariana/genética , Doença da Artéria Coronariana/patologia , Fatores de Risco , Regulação da Expressão Gênica , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla/métodos , Polimorfismo de Nucleotídeo Único/genética
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa