Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 2.393
Filter
Add more filters

Publication year range
1.
Cell ; 185(10): 1676-1693.e23, 2022 05 12.
Article in English | MEDLINE | ID: mdl-35489334

ABSTRACT

Epidemiological studies reveal that marijuana increases the risk of cardiovascular disease (CVD); however, little is known about the mechanism. Δ9-tetrahydrocannabinol (Δ9-THC), the psychoactive component of marijuana, binds to cannabinoid receptor 1 (CB1/CNR1) in the vasculature and is implicated in CVD. A UK Biobank analysis found that cannabis was an risk factor for CVD. We found that marijuana smoking activated inflammatory cytokines implicated in CVD. In silico virtual screening identified genistein, a soybean isoflavone, as a putative CB1 antagonist. Human-induced pluripotent stem cell-derived endothelial cells were used to model Δ9-THC-induced inflammation and oxidative stress via NF-κB signaling. Knockdown of the CB1 receptor with siRNA, CRISPR interference, and genistein attenuated the effects of Δ9-THC. In mice, genistein blocked Δ9-THC-induced endothelial dysfunction in wire myograph, reduced atherosclerotic plaque, and had minimal penetration of the central nervous system. Genistein is a CB1 antagonist that attenuates Δ9-THC-induced atherosclerosis.


Subject(s)
Cannabis , Cardiovascular Diseases , Hallucinogens , Analgesics , Animals , Cannabinoid Receptor Agonists/pharmacology , Dronabinol/pharmacology , Endothelial Cells , Genistein/pharmacology , Genistein/therapeutic use , Inflammation/drug therapy , Mice , Receptor, Cannabinoid, CB1 , Receptors, Cannabinoid
2.
Cell ; 180(1): 188-204.e22, 2020 01 09.
Article in English | MEDLINE | ID: mdl-31883794

ABSTRACT

Glioblastomas exhibit vast inter- and intra-tumoral heterogeneity, complicating the development of effective therapeutic strategies. Current in vitro models are limited in preserving the cellular and mutational diversity of parental tumors and require a prolonged generation time. Here, we report methods for generating and biobanking patient-derived glioblastoma organoids (GBOs) that recapitulate the histological features, cellular diversity, gene expression, and mutational profiles of their corresponding parental tumors. GBOs can be generated quickly with high reliability and exhibit rapid, aggressive infiltration when transplanted into adult rodent brains. We further demonstrate the utility of GBOs to test personalized therapies by correlating GBO mutational profiles with responses to specific drugs and by modeling chimeric antigen receptor T cell immunotherapy. Our studies show that GBOs maintain many key features of glioblastomas and can be rapidly deployed to investigate patient-specific treatment strategies. Additionally, our live biobank establishes a rich resource for basic and translational glioblastoma research.


Subject(s)
Cell Culture Techniques/methods , Glioblastoma/metabolism , Organoids/growth & development , Adult , Aged , Aged, 80 and over , Animals , Biological Specimen Banks , Female , Glioblastoma/genetics , Glioblastoma/pathology , Humans , Male , Mice , Mice, Nude , Middle Aged , Models, Biological , Organoids/metabolism , Reproducibility of Results , Xenograft Model Antitumor Assays/methods
3.
Cell ; 182(5): 1214-1231.e11, 2020 09 03.
Article in English | MEDLINE | ID: mdl-32888494

ABSTRACT

Blood cells play essential roles in human health, underpinning physiological processes such as immunity, oxygen transport, and clotting, which when perturbed cause a significant global health burden. Here we integrate data from UK Biobank and a large-scale international collaborative effort, including data for 563,085 European ancestry participants, and discover 5,106 new genetic variants independently associated with 29 blood cell phenotypes covering a range of variation impacting hematopoiesis. We holistically characterize the genetic architecture of hematopoiesis, assess the relevance of the omnigenic model to blood cell phenotypes, delineate relevant hematopoietic cell states influenced by regulatory genetic variants and gene networks, identify novel splice-altering variants mediating the associations, and assess the polygenic prediction potential for blood traits and clinical disorders at the interface of complex and Mendelian genetics. These results show the power of large-scale blood cell trait GWAS to interrogate clinically meaningful variants across a wide allelic spectrum of human variation.


Subject(s)
Genetic Predisposition to Disease/genetics , Multifactorial Inheritance/genetics , Female , Gene Regulatory Networks/genetics , Genome-Wide Association Study/methods , Hematopoiesis/genetics , Humans , Male , Phenotype , Polymorphism, Single Nucleotide/genetics
4.
Cell ; 177(3): 597-607.e9, 2019 04 18.
Article in English | MEDLINE | ID: mdl-31002796

ABSTRACT

The melanocortin 4 receptor (MC4R) is a G protein-coupled receptor whose disruption causes obesity. We functionally characterized 61 MC4R variants identified in 0.5 million people from UK Biobank and examined their associations with body mass index (BMI) and obesity-related cardiometabolic diseases. We found that the maximal efficacy of ß-arrestin recruitment to MC4R, rather than canonical Gαs-mediated cyclic adenosine-monophosphate production, explained 88% of the variance in the association of MC4R variants with BMI. While most MC4R variants caused loss of function, a subset caused gain of function; these variants were associated with significantly lower BMI and lower odds of obesity, type 2 diabetes, and coronary artery disease. Protective associations were driven by MC4R variants exhibiting signaling bias toward ß-arrestin recruitment and increased mitogen-activated protein kinase pathway activation. Harnessing ß-arrestin-biased MC4R signaling may represent an effective strategy for weight loss and the treatment of obesity-related cardiometabolic diseases.


Subject(s)
Gain of Function Mutation/genetics , Obesity/pathology , Receptor, Melanocortin, Type 4/genetics , Signal Transduction , Adult , Aged , Body Mass Index , Coronary Artery Disease/complications , Coronary Artery Disease/metabolism , Coronary Artery Disease/pathology , Cyclic AMP/metabolism , Databases, Factual , Diabetes Mellitus, Type 2/complications , Diabetes Mellitus, Type 2/metabolism , Diabetes Mellitus, Type 2/pathology , Female , GTP-Binding Protein alpha Subunits, Gs/metabolism , Genetic Predisposition to Disease , Genotype , Humans , Male , Middle Aged , Obesity/complications , Obesity/metabolism , Polymorphism, Single Nucleotide , Receptor, Melanocortin, Type 4/chemistry , Receptor, Melanocortin, Type 4/metabolism , beta-Arrestins/metabolism
5.
Cell ; 177(3): 587-596.e9, 2019 04 18.
Article in English | MEDLINE | ID: mdl-31002795

ABSTRACT

Severe obesity is a rapidly growing global health threat. Although often attributed to unhealthy lifestyle choices or environmental factors, obesity is known to be heritable and highly polygenic; the majority of inherited susceptibility is related to the cumulative effect of many common DNA variants. Here we derive and validate a new polygenic predictor comprised of 2.1 million common variants to quantify this susceptibility and test this predictor in more than 300,000 individuals ranging from middle age to birth. Among middle-aged adults, we observe a 13-kg gradient in weight and a 25-fold gradient in risk of severe obesity across polygenic score deciles. In a longitudinal birth cohort, we note minimal differences in birthweight across score deciles, but a significant gradient emerged in early childhood and reached 12 kg by 18 years of age. This new approach to quantify inherited susceptibility to obesity affords new opportunities for clinical prevention and mechanistic assessment.


Subject(s)
Body Weight , Multifactorial Inheritance/genetics , Obesity/pathology , Adolescent , Body Mass Index , Child , Databases, Factual , Female , Genome-Wide Association Study , Humans , Infant, Newborn , Longitudinal Studies , Male , Middle Aged , Obesity/genetics , Risk Factors , Severity of Illness Index
6.
Cell ; 172(1-2): 373-386.e10, 2018 01 11.
Article in English | MEDLINE | ID: mdl-29224780

ABSTRACT

Breast cancer (BC) comprises multiple distinct subtypes that differ genetically, pathologically, and clinically. Here, we describe a robust protocol for long-term culturing of human mammary epithelial organoids. Using this protocol, >100 primary and metastatic BC organoid lines were generated, broadly recapitulating the diversity of the disease. BC organoid morphologies typically matched the histopathology, hormone receptor status, and HER2 status of the original tumor. DNA copy number variations as well as sequence changes were consistent within tumor-organoid pairs and largely retained even after extended passaging. BC organoids furthermore populated all major gene-expression-based classification groups and allowed in vitro drug screens that were consistent with in vivo xeno-transplantations and patient response. This study describes a representative collection of well-characterized BC organoids available for cancer research and drug development, as well as a strategy to assess in vitro drug response in a personalized fashion.


Subject(s)
Breast Neoplasms/pathology , Genetic Heterogeneity , Organoids/pathology , Tissue Banks , Animals , Antineoplastic Agents/pharmacology , Breast Neoplasms/genetics , Cells, Cultured , Drug Screening Assays, Antitumor/methods , Female , Humans , Mice , Mice, Nude , Organoids/drug effects , Precision Medicine/methods
7.
Am J Hum Genet ; 111(4): 680-690, 2024 04 04.
Article in English | MEDLINE | ID: mdl-38490208

ABSTRACT

We propose TetraHer, a method for estimating the liability heritability of binary phenotypes. TetraHer has five key features. First, it can be applied to data from complex pedigrees that contain multiple types of relationships. Second, it can correct for ascertainment of cases or controls. Third, it can accommodate covariates. Fourth, it can model the contribution of common environment. Fifth, it produces a likelihood that can be used for significance testing. We first demonstrate the validity of TetraHer on simulated data. We then use TetraHer to estimate liability heritability for 229 codes from the tenth International Classification of Diseases (ICD-10). We identify 107 codes with significant heritability (p < 0.05/229), which can be used in future analyses for investigating the genetic architecture of human diseases.


Subject(s)
Genome-Wide Association Study , Models, Genetic , Humans , Pedigree , Phenotype , Polymorphism, Single Nucleotide
8.
Am J Hum Genet ; 111(2): 295-308, 2024 02 01.
Article in English | MEDLINE | ID: mdl-38232728

ABSTRACT

Infectious agents contribute significantly to the global burden of diseases through both acute infection and their chronic sequelae. We leveraged the UK Biobank to identify genetic loci that influence humoral immune response to multiple infections. From 45 genome-wide association studies in 9,611 participants from UK Biobank, we identified NFKB1 as a locus associated with quantitative antibody responses to multiple pathogens, including those from the herpes, retro-, and polyoma-virus families. An insertion-deletion variant thought to affect NFKB1 expression (rs28362491), was mapped as the likely causal variant and could play a key role in regulation of the immune response. Using 121 infection- and inflammation-related traits in 487,297 UK Biobank participants, we show that the deletion allele was associated with an increased risk of infection from diverse pathogens but had a protective effect against allergic disease. We propose that altered expression of NFKB1, as a result of the deletion, modulates hematopoietic pathways and likely impacts cell survival, antibody production, and inflammation. Taken together, we show that disruptions to the tightly regulated immune processes may tip the balance between exacerbated immune responses and allergy, or increased risk of infection and impaired resolution of inflammation.


Subject(s)
Genetic Predisposition to Disease , Hypersensitivity , Inflammation , Humans , Genome-Wide Association Study , Hypersensitivity/genetics , Inflammation/genetics , NF-kappa B p50 Subunit/genetics , UK Biobank
9.
Am J Hum Genet ; 111(7): 1431-1447, 2024 07 11.
Article in English | MEDLINE | ID: mdl-38908374

ABSTRACT

Methods of estimating polygenic scores (PGSs) from genome-wide association studies are increasingly utilized. However, independent method evaluation is lacking, and method comparisons are often limited. Here, we evaluate polygenic scores derived via seven methods in five biobank studies (totaling about 1.2 million participants) across 16 diseases and quantitative traits, building on a reference-standardized framework. We conducted meta-analyses to quantify the effects of method choice, hyperparameter tuning, method ensembling, and the target biobank on PGS performance. We found that no single method consistently outperformed all others. PGS effect sizes were more variable between biobanks than between methods within biobanks when methods were well tuned. Differences between methods were largest for the two investigated autoimmune diseases, seropositive rheumatoid arthritis and type 1 diabetes. For most methods, cross-validation was more reliable for tuning hyperparameters than automatic tuning (without the use of target data). For a given target phenotype, elastic net models combining PGS across methods (ensemble PGS) tuned in the UK Biobank provided consistent, high, and cross-biobank transferable performance, increasing PGS effect sizes (ß coefficients) by a median of 5.0% relative to LDpred2 and MegaPRS (the two best-performing single methods when tuned with cross-validation). Our interactively browsable online-results and open-source workflow prspipe provide a rich resource and reference for the analysis of polygenic scoring methods across biobanks.


Subject(s)
Biological Specimen Banks , Genome-Wide Association Study , Multifactorial Inheritance , Humans , Multifactorial Inheritance/genetics , Phenotype , Diabetes Mellitus, Type 1/genetics , Polymorphism, Single Nucleotide , Machine Learning
10.
Am J Hum Genet ; 111(7): 1462-1480, 2024 07 11.
Article in English | MEDLINE | ID: mdl-38866020

ABSTRACT

Understanding the contribution of gene-environment interactions (GxE) to complex trait variation can provide insights into disease mechanisms, explain sources of heritability, and improve genetic risk prediction. While large biobanks with genetic and deep phenotypic data hold promise for obtaining novel insights into GxE, our understanding of GxE architecture in complex traits remains limited. We introduce a method to estimate the proportion of trait variance explained by GxE (GxE heritability) and additive genetic effects (additive heritability) across the genome and within specific genomic annotations. We show that our method is accurate in simulations and computationally efficient for biobank-scale datasets. We applied our method to common array SNPs (MAF ≥1%), fifty quantitative traits, and four environmental variables (smoking, sex, age, and statin usage) in unrelated white British individuals in the UK Biobank. We found 68 trait-E pairs with significant genome-wide GxE heritability (p<0.05/200) with a ratio of GxE to additive heritability of ≈6.8% on average. Analyzing ≈8 million imputed SNPs (MAF ≥0.1%), we documented an approximate 28% increase in genome-wide GxE heritability compared to array SNPs. We partitioned GxE heritability across minor allele frequency (MAF) and local linkage disequilibrium (LD) values, revealing that, like additive allelic effects, GxE allelic effects tend to increase with decreasing MAF and LD. Analyzing GxE heritability near genes highly expressed in specific tissues, we find significant brain-specific enrichment for body mass index (BMI) and basal metabolic rate in the context of smoking and adipose-specific enrichment for waist-hip ratio (WHR) in the context of sex.


Subject(s)
Gene-Environment Interaction , Genome-Wide Association Study , Multifactorial Inheritance , Polymorphism, Single Nucleotide , Humans , Multifactorial Inheritance/genetics , Male , Female , Quantitative Trait, Heritable , Phenotype , Models, Genetic , Quantitative Trait Loci
11.
Proc Natl Acad Sci U S A ; 121(39): e2412315121, 2024 Sep 24.
Article in English | MEDLINE | ID: mdl-39302970

ABSTRACT

Sex-differential selection (SDS), which occurs when the fitness effects of alleles differ between males and females, can have profound impacts on the maintenance of genetic variation, disease risk, and other key aspects of natural populations. Because the sexes mix their autosomal genomes each generation, quantifying SDS is not possible using conventional population genetic approaches. Here, we introduce a method that exploits subtle sex differences in haplotype frequencies resulting from SDS acting in the current generation. Using data from 300K individuals in the UK Biobank, we estimate the strength of SDS throughout the genome. While only a handful of loci under SDS are individually significant, we uncover highly polygenic signals of genome-wide SDS for both viability and fecundity. Selection coefficients of [Formula: see text] may be typical. Despite its ubiquity, SDS may impose a mortality load of less than 1%. An interesting life-history tradeoff emerges: Alleles that increase viability more strongly in females than males tend to increase fecundity more strongly in males than in females. Finally, we find marginal evidence of SDS on fecundity acting on alleles affecting arm fat-free mass. Taken together, our findings connect the long-standing evidence of SDS acting on human phenotypes with its impact on the genome.


Subject(s)
Fertility , Multifactorial Inheritance , Humans , Male , Female , Multifactorial Inheritance/genetics , Fertility/genetics , Selection, Genetic , Haplotypes , Alleles , Sex Characteristics , Genome-Wide Association Study , Genome, Human
12.
Hum Mol Genet ; 33(2): 198-210, 2024 Jan 07.
Article in English | MEDLINE | ID: mdl-37802914

ABSTRACT

CYP2A6, a genetically variable enzyme, inactivates nicotine, activates carcinogens, and metabolizes many pharmaceuticals. Variation in CYP2A6 influences smoking behaviors and tobacco-related disease risk. This phenome-wide association study examined associations between a reconstructed version of our weighted genetic risk score (wGRS) for CYP2A6 activity with diseases in the UK Biobank (N = 395 887). Causal effects of phenotypic CYP2A6 activity (measured as the nicotine metabolite ratio: 3'-hydroxycotinine/cotinine) on the phenome-wide significant (PWS) signals were then estimated in two-sample Mendelian Randomization using the wGRS as the instrument. Time-to-diagnosis age was compared between faster versus slower CYP2A6 metabolizers for the PWS signals in survival analyses. In the total sample, six PWS signals were identified: two lung cancers and four obstructive respiratory diseases PheCodes, where faster CYP2A6 activity was associated with greater disease risk (Ps < 1 × 10-6). A significant CYP2A6-by-smoking status interaction was found (Psinteraction < 0.05); in current smokers, the same six PWS signals were found as identified in the total group, whereas no PWS signals were found in former or never smokers. In the total sample and current smokers, CYP2A6 activity causal estimates on the six PWS signals were significant in Mendelian Randomization (Ps < 5 × 10-5). Additionally, faster CYP2A6 metabolizer status was associated with younger age of disease diagnosis for the six PWS signals (Ps < 5 × 10-4, in current smokers). These findings support a role for faster CYP2A6 activity as a causal risk factor for lung cancers and obstructive respiratory diseases among current smokers, and a younger onset of these diseases. This research utilized the UK Biobank Resource.


Subject(s)
Lung Neoplasms , Respiratory Tract Diseases , Humans , Nicotine/genetics , Mendelian Randomization Analysis , Smoking/adverse effects , Smoking/genetics , Lung Neoplasms/genetics , Respiratory Tract Diseases/complications , Cytochrome P-450 CYP2A6/genetics , Cytochrome P-450 CYP2A6/metabolism
13.
Hum Mol Genet ; 33(5): 465-474, 2024 Feb 18.
Article in English | MEDLINE | ID: mdl-37988592

ABSTRACT

Whole genome sequencing (WGS) from large clinically unselected cohorts provides a unique opportunity to assess the penetrance and expressivity of rare and/or known pathogenic mitochondrial variants in population. Using WGS from 179 862 clinically unselected individuals from the UK Biobank, we performed extensive single and rare variant aggregation association analyses of 15 881 mtDNA variants and 73 known pathogenic variants with 15 mitochondrial disease-relevant phenotypes. We identified 12 homoplasmic and one heteroplasmic variant (m.3243A>G) with genome-wide significant associations in our clinically unselected cohort. Heteroplasmic m.3243A>G (MAF = 0.0002, a known pathogenic variant) was associated with diabetes, deafness and heart failure and 12 homoplasmic variants increased aspartate aminotransferase levels including three low-frequency variants (MAF ~0.002 and beta~0.3 SD). Most pathogenic mitochondrial disease variants (n = 66/74) were rare in the population (<1:9000). Aggregated or single variant analysis of pathogenic variants showed low penetrance in unselected settings for the relevant phenotypes, except m.3243A>G. Multi-system disease risk and penetrance of diabetes, deafness and heart failure greatly increased with m.3243A>G level ≥ 10%. The odds ratio of these traits increased from 5.61, 12.3 and 10.1 to 25.1, 55.0 and 39.5, respectively. Diabetes risk with m.3243A>G was further influenced by type 2 diabetes genetic risk. Our study of mitochondrial variation in a large-unselected population identified novel associations and demonstrated that pathogenic mitochondrial variants have lower penetrance in clinically unselected settings. m.3243A>G was an exception at higher heteroplasmy showing a significant impact on health making it a good candidate for incidental reporting.


Subject(s)
Deafness , Diabetes Mellitus, Type 2 , Heart Failure , Mitochondrial Diseases , Humans , Penetrance , Diabetes Mellitus, Type 2/genetics , DNA, Mitochondrial/genetics , Mitochondrial Diseases/genetics , Deafness/genetics , Mutation
14.
Am J Hum Genet ; 110(10): 1628-1647, 2023 10 05.
Article in English | MEDLINE | ID: mdl-37757824

ABSTRACT

Pharmacogenomics (PGx) is an integral part of precision medicine and contributes to the maximization of drug efficacy and reduction of adverse drug event risk. Accurate information on PGx allele frequencies improves the implementation of PGx. Nonetheless, curating such information from published allele data is time and resource intensive. The limited number of allelic variants in most studies leads to an underestimation of certain alleles. We applied the Pharmacogenomics Clinical Annotation Tool (PharmCAT) on an integrated 200K UK Biobank genetic dataset (N = 200,044). Based on PharmCAT results, we estimated PGx frequencies (alleles, diplotypes, phenotypes, and activity scores) for 17 pharmacogenes in five biogeographic groups: European, Central/South Asian, East Asian, Afro-Caribbean, and Sub-Saharan African. PGx frequencies were distinct for each biogeographic group. Even biogeographic groups with similar proportions of phenotypes were driven by different sets of dominant PGx alleles. PharmCAT also identified "no-function" alleles that were rare or seldom tested in certain groups by previous studies, e.g., SLCO1B1∗31 in the Afro-Caribbean (3.0%) and Sub-Saharan African (3.9%) groups. Estimated PGx frequencies are disseminated via the PharmGKB (The Pharmacogenomics Knowledgebase: www.pharmgkb.org). We demonstrate that genetic biobanks such as the UK Biobank are a robust resource for estimating PGx frequencies. Improving our understanding of PGx allele and phenotype frequencies provides guidance for future PGx studies and clinical genetic test panel design, and better serves individuals from wider biogeographic backgrounds.


Subject(s)
Biological Specimen Banks , Pharmacogenetics , Humans , Pharmacogenetics/methods , Alleles , Precision Medicine/methods , Gene Frequency/genetics , Liver-Specific Organic Anion Transporter 1
15.
Am J Hum Genet ; 110(1): 161-165, 2023 01 05.
Article in English | MEDLINE | ID: mdl-36450278

ABSTRACT

The first release of UK Biobank whole-genome sequence data contains 150,119 genomes. We present an open-source pipeline for filtering, phasing, and indexing these genomes on the cloud-based UK Biobank Research Analysis Platform. This pipeline makes it possible to apply haplotype-based methods to UK Biobank whole-genome sequence data. The pipeline uses BCFtools for marker filtering, Beagle for genotype phasing, and Tabix for VCF indexing. We used the pipeline to phase 406 million single-nucleotide variants on chromosomes 1-22 and X at a cost of £2,309. The maximum time required to process a chromosome was 2.6 days. In order to assess phase accuracy, we modified the pipeline to exclude trio parents. We observed a switch error rate of 0.0016 on chromosome 20 in the White British trio offspring. If we exclude markers with nonmajor allele frequency < 0.1% after phasing, this switch error rate decreases by 80% to 0.00032.


Subject(s)
Biological Specimen Banks , Genome , Humans , Dogs , Animals , Genotype , Haplotypes/genetics , Polymorphism, Single Nucleotide/genetics , United Kingdom , Algorithms , Sequence Analysis, DNA/methods
16.
Am J Hum Genet ; 110(1): 23-29, 2023 01 05.
Article in English | MEDLINE | ID: mdl-36480927

ABSTRACT

We present LDAK-GBAT, a tool for gene-based association testing using summary statistics from genome-wide association studies that is computationally efficient, produces well-calibrated p values, and is significantly more powerful than existing tools. LDAK-GBAT takes approximately 30 min to analyze imputed data (2.9M common, genic SNPs), requiring less than 10 Gb memory. It shows good control of type 1 error given an appropriate reference panel. Across 109 phenotypes (82 from the UK Biobank, 18 from the Million Veteran Program, and nine from the Psychiatric Genetics Consortium), LDAK-GBAT finds on average 19% (SE: 1%) more significant genes than the existing tool sumFREGAT-ACAT, with even greater gains in comparison with MAGMA, GCTA-fastBAT, sumFREGAT-SKAT-O, and sumFREGAT-PCA.


Subject(s)
Genetic Testing , Genome-Wide Association Study , Phenotype , Polymorphism, Single Nucleotide/genetics
17.
Am J Hum Genet ; 110(4): 648-662, 2023 04 06.
Article in English | MEDLINE | ID: mdl-36977412

ABSTRACT

Several breast cancer susceptibility genes have been discovered, but more are likely to exist. To identify additional breast cancer susceptibility genes, we used the founder population of Poland and performed whole-exome sequencing on 510 women with familial breast cancer and 308 control subjects. We identified a rare mutation in ATRIP (GenBank: NM_130384.3: c.1152_1155del [p.Gly385Ter]) in two women with breast cancer. At the validation phase, we found this variant in 42/16,085 unselected Polish breast cancer-affected individuals and in 11/9,285 control subjects (OR = 2.14, 95% CI = 1.13-4.28, p = 0.02). By analyzing the sequence data of the UK Biobank study participants (450,000 individuals), we identified ATRIP loss-of-function variants among 13/15,643 breast cancer-affected individuals versus 40/157,943 control subjects (OR = 3.28, 95% CI = 1.76-6.14, p < 0.001). Immunohistochemistry and functional studies showed the ATRIP c.1152_1155del variant allele is weakly expressed compared to the wild-type allele, and truncated ATRIP fails to perform its normal function to prevent replicative stress. We showed that tumors of women with breast cancer who have a germline ATRIP mutation have loss of heterozygosity at the site of ATRIP mutation and genomic homologous recombination deficiency. ATRIP is a critical partner of ATR that binds to RPA coating single-stranded DNA at sites of stalled DNA replication forks. Proper activation of ATR-ATRIP elicits a DNA damage checkpoint crucial in regulating cellular responses to DNA replication stress. Based on our observations, we conclude ATRIP is a breast cancer susceptibility gene candidate linking DNA replication stress to breast cancer.


Subject(s)
Adaptor Proteins, Signal Transducing , Breast Neoplasms , DNA-Binding Proteins , Female , Humans , Adaptor Proteins, Signal Transducing/genetics , Biological Specimen Banks , Breast Neoplasms/genetics , Cell Cycle Proteins/genetics , DNA Damage , DNA-Binding Proteins/genetics , Poland/epidemiology , Replication Protein A/genetics , Replication Protein A/metabolism , United Kingdom/epidemiology
18.
Am J Hum Genet ; 110(3): 487-498, 2023 03 02.
Article in English | MEDLINE | ID: mdl-36809768

ABSTRACT

Genome-wide association studies (GWASs) have established the contribution of common and low-frequency variants to metabolic blood measurements in the UK Biobank (UKB). To complement existing GWAS findings, we assessed the contribution of rare protein-coding variants in relation to 355 metabolic blood measurements-including 325 predominantly lipid-related nuclear magnetic resonance (NMR)-derived blood metabolite measurements (Nightingale Health Plc) and 30 clinical blood biomarkers-using 412,393 exome sequences from four genetically diverse ancestries in the UKB. Gene-level collapsing analyses were conducted to evaluate a diverse range of rare-variant architectures for the metabolic blood measurements. Altogether, we identified significant associations (p < 1 × 10-8) for 205 distinct genes that involved 1,968 significant relationships for the Nightingale blood metabolite measurements and 331 for the clinical blood biomarkers. These include associations for rare non-synonymous variants in PLIN1 and CREB3L3 with lipid metabolite measurements and SYT7 with creatinine, among others, which may not only provide insights into novel biology but also deepen our understanding of established disease mechanisms. Of the study-wide significant clinical biomarker associations, 40% were not previously detected on analyzing coding variants in a GWAS in the same cohort, reinforcing the importance of studying rare variation to fully understand the genetic architecture of metabolic blood measurements.


Subject(s)
Genetic Predisposition to Disease , Genome-Wide Association Study , Humans , Biological Specimen Banks , Biomarkers , Lipids , United Kingdom , Polymorphism, Single Nucleotide
19.
Am J Hum Genet ; 110(5): 741-761, 2023 05 04.
Article in English | MEDLINE | ID: mdl-37030289

ABSTRACT

The advent of large-scale genome-wide association studies (GWASs) has motivated the development of statistical methods for phenotype prediction with single-nucleotide polymorphism (SNP) array data. These polygenic risk score (PRS) methods use a multiple linear regression framework to infer joint effect sizes of all genetic variants on the trait. Among the subset of PRS methods that operate on GWAS summary statistics, sparse Bayesian methods have shown competitive predictive ability. However, most existing Bayesian approaches employ Markov chain Monte Carlo (MCMC) algorithms, which are computationally inefficient and do not scale favorably to higher dimensions, for posterior inference. Here, we introduce variational inference of polygenic risk scores (VIPRS), a Bayesian summary statistics-based PRS method that utilizes variational inference techniques to approximate the posterior distribution for the effect sizes. Our experiments with 36 simulation configurations and 12 real phenotypes from the UK Biobank dataset demonstrated that VIPRS is consistently competitive with the state-of-the-art in prediction accuracy while being more than twice as fast as popular MCMC-based approaches. This performance advantage is robust across a variety of genetic architectures, SNP heritabilities, and independent GWAS cohorts. In addition to its competitive accuracy on the "White British" samples, VIPRS showed improved transferability when applied to other ethnic groups, with up to 1.7-fold increase in R2 among individuals of Nigerian ancestry for low-density lipoprotein (LDL) cholesterol. To illustrate its scalability, we applied VIPRS to a dataset of 9.6 million genetic markers, which conferred further improvements in prediction accuracy for highly polygenic traits, such as height.


Subject(s)
Genome-Wide Association Study , Multifactorial Inheritance , Humans , Multifactorial Inheritance/genetics , Genome-Wide Association Study/methods , Bayes Theorem , Polymorphism, Single Nucleotide/genetics , Risk Factors , Genetic Predisposition to Disease
20.
Am J Hum Genet ; 110(11): 1888-1902, 2023 11 02.
Article in English | MEDLINE | ID: mdl-37890495

ABSTRACT

Admixed individuals offer unique opportunities for addressing limited transferability in polygenic scores (PGSs), given the substantial trans-ancestry genetic correlation in many complex traits. However, they are rarely considered in PGS training, given the challenges in representing ancestry-matched linkage-disequilibrium reference panels for admixed individuals. Here we present inclusive PGS (iPGS), which captures ancestry-shared genetic effects by finding the exact solution for penalized regression on individual-level data and is thus naturally applicable to admixed individuals. We validate our approach in a simulation study across 33 configurations with varying heritability, polygenicity, and ancestry composition in the training set. When iPGS is applied to n = 237,055 ancestry-diverse individuals in the UK Biobank, it shows the greatest improvements in Africans by 48.9% on average across 60 quantitative traits and up to 50-fold improvements for some traits (neutrophil count, R2 = 0.058) over the baseline model trained on the same number of European individuals. When we allowed iPGS to use n = 284,661 individuals, we observed an average improvement of 60.8% for African, 11.6% for South Asian, 7.3% for non-British White, 4.8% for White British, and 17.8% for the other individuals. We further developed iPGS+refit to jointly model the ancestry-shared and -dependent genetic effects when heterogeneous genetic associations were present. For neutrophil count, for example, iPGS+refit showed the highest predictive performance in the African group (R2 = 0.115), which exceeds the best predictive performance for the White British group (R2 = 0.090 in the iPGS model), even though only 1.49% of individuals used in the iPGS training are of African ancestry. Our results indicate the power of including diverse individuals for developing more equitable PGS models.


Subject(s)
Multifactorial Inheritance , White People , Humans , Multifactorial Inheritance/genetics , White People/genetics , Phenotype , Black People/genetics , Asian People/genetics , Genome-Wide Association Study/methods
SELECTION OF CITATIONS
SEARCH DETAIL