Search | VHL Regional Portal

1.

Interaction molecular QTL mapping discovers cellular and environmental modifiers of genetic regulatory effects.

Kasela, Silva; Aguet, François; Kim-Hellmuth, Sarah; Brown, Brielin C; Nachun, Daniel C; Tracy, Russell P; Durda, Peter; Liu, Yongmei; Taylor, Kent D; Johnson, W Craig; Van Den Berg, David; Gabriel, Stacey; Gupta, Namrata; Smith, Joshua D; Blackwell, Thomas W; Rotter, Jerome I; Ardlie, Kristin G; Manichaikul, Ani; Rich, Stephen S; Barr, R Graham; Lappalainen, Tuuli.

Am J Hum Genet ; 111(1): 133-149, 2024 Jan 04.

Article in English | MEDLINE | ID: mdl-38181730

ABSTRACT

Bulk-tissue molecular quantitative trait loci (QTLs) have been the starting point for interpreting disease-associated variants, and context-specific QTLs show particular relevance for disease. Here, we present the results of mapping interaction QTLs (iQTLs) for cell type, age, and other phenotypic variables in multi-omic, longitudinal data from the blood of individuals of diverse ancestries. By modeling the interaction between genotype and estimated cell-type proportions, we demonstrate that cell-type iQTLs could be considered as proxies for cell-type-specific QTL effects, particularly for the most abundant cell type in the tissue. The interpretation of age iQTLs, however, warrants caution because the moderation effect of age on the genotype and molecular phenotype association could be mediated by changes in cell-type composition. Finally, we show that cell-type iQTLs contribute to cell-type-specific enrichment of diseases that, in combination with additional functional data, could guide future functional studies. Overall, this study highlights the use of iQTLs to gain insights into the context specificity of regulatory effects.

Subject(s)

Gene Expression Regulation , Quantitative Trait Loci , Humans , Quantitative Trait Loci/genetics , Genotype , Phenotype

2.

Author Correction: A genomic mutational constraint map using variation in 76,156 human genomes.

Chen, Siwei; Francioli, Laurent C; Goodrich, Julia K; Collins, Ryan L; Kanai, Masahiro; Wang, Qingbo; Alföldi, Jessica; Watts, Nicholas A; Vittal, Christopher; Gauthier, Laura D; Poterba, Timothy; Wilson, Michael W; Tarasova, Yekaterina; Phu, William; Grant, Riley; Yohannes, Mary T; Koenig, Zan; Farjoun, Yossi; Banks, Eric; Donnelly, Stacey; Gabriel, Stacey; Gupta, Namrata; Ferriera, Steven; Tolonen, Charlotte; Novod, Sam; Bergelson, Louis; Roazen, David; Ruano-Rubio, Valentin; Covarrubias, Miguel; Llanwarne, Christopher; Petrillo, Nikelle; Wade, Gordon; Jeandet, Thibault; Munshi, Ruchi; Tibbetts, Kathleen; O'Donnell-Luria, Anne; Solomonson, Matthew; Seed, Cotton; Martin, Alicia R; Talkowski, Michael E; Rehm, Heidi L; Daly, Mark J; Tiao, Grace; Neale, Benjamin M; MacArthur, Daniel G; Karczewski, Konrad J.

Nature ; 626(7997): E1, 2024 Feb.

Article in English | MEDLINE | ID: mdl-38225470

3.

A genomic mutational constraint map using variation in 76,156 human genomes.

Chen, Siwei; Francioli, Laurent C; Goodrich, Julia K; Collins, Ryan L; Kanai, Masahiro; Wang, Qingbo; Alföldi, Jessica; Watts, Nicholas A; Vittal, Christopher; Gauthier, Laura D; Poterba, Timothy; Wilson, Michael W; Tarasova, Yekaterina; Phu, William; Grant, Riley; Yohannes, Mary T; Koenig, Zan; Farjoun, Yossi; Banks, Eric; Donnelly, Stacey; Gabriel, Stacey; Gupta, Namrata; Ferriera, Steven; Tolonen, Charlotte; Novod, Sam; Bergelson, Louis; Roazen, David; Ruano-Rubio, Valentin; Covarrubias, Miguel; Llanwarne, Christopher; Petrillo, Nikelle; Wade, Gordon; Jeandet, Thibault; Munshi, Ruchi; Tibbetts, Kathleen; O'Donnell-Luria, Anne; Solomonson, Matthew; Seed, Cotton; Martin, Alicia R; Talkowski, Michael E; Rehm, Heidi L; Daly, Mark J; Tiao, Grace; Neale, Benjamin M; MacArthur, Daniel G; Karczewski, Konrad J.

Nature ; 625(7993): 92-100, 2024 Jan.

Article in English | MEDLINE | ID: mdl-38057664

ABSTRACT

The depletion of disruptive variation caused by purifying natural selection (constraint) has been widely used to investigate protein-coding genes underlying human disorders1-4, but attempts to assess constraint for non-protein-coding regions have proved more difficult. Here we aggregate, process and release a dataset of 76,156 human genomes from the Genome Aggregation Database (gnomAD)-the largest public open-access human genome allele frequency reference dataset-and use it to build a genomic constraint map for the whole genome (genomic non-coding constraint of haploinsufficient variation (Gnocchi)). We present a refined mutational model that incorporates local sequence context and regional genomic features to detect depletions of variation. As expected, the average constraint for protein-coding sequences is stronger than that for non-coding regions. Within the non-coding genome, constrained regions are enriched for known regulatory elements and variants that are implicated in complex human diseases and traits, facilitating the triangulation of biological annotation, disease association and natural selection to non-coding DNA analysis. More constrained regulatory elements tend to regulate more constrained protein-coding genes, which in turn suggests that non-coding constraint can aid the identification of constrained genes that are as yet unrecognized by current gene constraint metrics. We demonstrate that this genome-wide constraint map improves the identification and interpretation of functional human genetic variation.

Subject(s)

Genome, Human , Genomics , Models, Genetic , Mutation , Humans , Access to Information , Databases, Genetic , Datasets as Topic , Gene Frequency , Genome, Human/genetics , Mutation/genetics , Selection, Genetic

4.

Author Correction: Multiplexed CRISPR-based microfluidic platform for clinical testing of respiratory viruses and identification of SARS-CoV-2 variants.

Welch, Nicole L; Zhu, Meilin; Hua, Catherine; Weller, Juliane; Mirhashemi, Marzieh Ezzaty; Nguyen, Tien G; Mantena, Sreekar; Bauer, Matthew R; Shaw, Bennett M; Ackerman, Cheri M; Thakku, Sri Gowtham; Tse, Megan W; Kehe, Jared; Uwera, Marie-Martine; Eversley, Jacqueline S; Bielwaski, Derek A; McGrath, Graham; Braidt, Joseph; Johnson, Jeremy; Cerrato, Felecia; Moreno, Gage K; Krasilnikova, Lydia A; Petros, Brittany A; Gionet, Gabrielle L; King, Ewa; Huard, Richard C; Jalbert, Samantha K; Cleary, Michael L; Fitzgerald, Nicholas A; Gabriel, Stacey B; Gallagher, Glen R; Smole, Sandra C; Madoff, Lawrence C; Brown, Catherine M; Keller, Matthew W; Wilson, Malania M; Kirby, Marie K; Barnes, John R; Park, Daniel J; Siddle, Katherine J; Happi, Christian T; Hung, Deborah T; Springer, Michael; MacInnis, Bronwyn L; Lemieux, Jacob E; Rosenberg, Eric; Branda, John A; Blainey, Paul C; Sabeti, Pardis C; Myhrvold, Cameron.

Nat Med ; 30(1): 307, 2024 Jan.

Article in English | MEDLINE | ID: mdl-37946059

5.

High-throughput RNA isoform sequencing using programmed cDNA concatenation.

Al'Khafaji, Aziz M; Smith, Jonathan T; Garimella, Kiran V; Babadi, Mehrtash; Popic, Victoria; Sade-Feldman, Moshe; Gatzen, Michael; Sarkizova, Siranush; Schwartz, Marc A; Blaum, Emily M; Day, Allyson; Costello, Maura; Bowers, Tera; Gabriel, Stacey; Banks, Eric; Philippakis, Anthony A; Boland, Genevieve M; Blainey, Paul C; Hacohen, Nir.

Nat Biotechnol ; 42(4): 582-586, 2024 Apr.

Article in English | MEDLINE | ID: mdl-37291427

ABSTRACT

Full-length RNA-sequencing methods using long-read technologies can capture complete transcript isoforms, but their throughput is limited. We introduce multiplexed arrays isoform sequencing (MAS-ISO-seq), a technique for programmably concatenating complementary DNAs (cDNAs) into molecules optimal for long-read sequencing, increasing the throughput >15-fold to nearly 40 million cDNA reads per run on the Sequel IIe sequencer. When applied to single-cell RNA sequencing of tumor-infiltrating T cells, MAS-ISO-seq demonstrated a 12- to 32-fold increase in the discovery of differentially spliced genes.

Subject(s)

High-Throughput Nucleotide Sequencing , RNA Isoforms , DNA, Complementary/genetics , RNA Isoforms/genetics , High-Throughput Nucleotide Sequencing/methods , Protein Isoforms/genetics , Sequence Analysis, RNA/methods , Transcriptome , Gene Expression Profiling/methods , RNA/genetics

6.

Type 2 Diabetes Modifies the Association of CAD Genomic Risk Variants With Subclinical Atherosclerosis.

Hasbani, Natalie R; Westerman, Kenneth E; Kwak, Soo Heon; Chen, Han; Li, Xihao; Di Corpo, Daniel; Wessel, Jennifer; Bis, Joshua C; Sarnowski, Chloè; Wu, Peitao; Bielak, Lawrence F; Guo, Xiuqing; Heard-Costa, Nancy; Kinney, Gregory L; Mahaney, Michael C; Montasser, May E; Palmer, Nicholette D; Raffield, Laura M; Terry, James G; Yanek, Lisa R; Bon, Jessica; Bowden, Donald W; Brody, Jennifer A; Duggirala, Ravindranath; Jacobs, David R; Kalyani, Rita R; Lange, Leslie A; Mitchell, Braxton D; Smith, Jennifer A; Taylor, Kent D; Carson, April P; Curran, Joanne E; Fornage, Myriam; Freedman, Barry I; Gabriel, Stacey; Gibbs, Richard A; Gupta, Namrata; Kardia, Sharon L R; Kral, Brian G; Momin, Zeineen; Newman, Anne B; Post, Wendy S; Viaud-Martinez, Karine A; Young, Kendra A; Becker, Lewis C; Bertoni, Alain G; Blangero, John; Carr, John J; Pratte, Katherine; Psaty, Bruce M.

Circ Genom Precis Med ; 16(6): e004176, 2023 Dec.

Article in English | MEDLINE | ID: mdl-38014529

ABSTRACT

BACKGROUND: Individuals with type 2 diabetes (T2D) have an increased risk of coronary artery disease (CAD), but questions remain about the underlying pathology. Identifying which CAD loci are modified by T2D in the development of subclinical atherosclerosis (coronary artery calcification [CAC], carotid intima-media thickness, or carotid plaque) may improve our understanding of the mechanisms leading to the increased CAD in T2D. METHODS: We compared the common and rare variant associations of known CAD loci from the literature on CAC, carotid intima-media thickness, and carotid plaque in up to 29â670 participants, including up to 24â157 normoglycemic controls and 5513 T2D cases leveraging whole-genome sequencing data from the Trans-Omics for Precision Medicine program. We included first-order T2D interaction terms in each model to determine whether CAD loci were modified by T2D. The genetic main and interaction effects were assessed using a joint test to determine whether a CAD variant, or gene-based rare variant set, was associated with the respective subclinical atherosclerosis measures and then further determined whether these loci had a significant interaction test. RESULTS: Using a Bonferroni-corrected significance threshold of P<1.6×10-4, we identified 3 genes (ATP1B1, ARVCF, and LIPG) associated with CAC and 2 genes (ABCG8 and EIF2B2) associated with carotid intima-media thickness and carotid plaque, respectively, through gene-based rare variant set analysis. Both ATP1B1 and ARVCF also had significantly different associations for CAC in T2D cases versus controls. No significant interaction tests were identified through the candidate single-variant analysis. CONCLUSIONS: These results highlight T2D as an important modifier of rare variant associations in CAD loci with CAC.

Subject(s)

Atherosclerosis , Coronary Artery Disease , Diabetes Mellitus, Type 2 , Plaque, Atherosclerotic , Humans , Coronary Artery Disease/genetics , Diabetes Mellitus, Type 2/complications , Diabetes Mellitus, Type 2/genetics , Carotid Intima-Media Thickness , Risk Factors , Atherosclerosis/genetics , Genomics

7.

The functional impact of rare variation across the regulatory cascade.

Li, Taibo; Ferraro, Nicole; Strober, Benjamin J; Aguet, Francois; Kasela, Silva; Arvanitis, Marios; Ni, Bohan; Wiel, Laurens; Hershberg, Elliot; Ardlie, Kristin; Arking, Dan E; Beer, Rebecca L; Brody, Jennifer; Blackwell, Thomas W; Clish, Clary; Gabriel, Stacey; Gerszten, Robert; Guo, Xiuqing; Gupta, Namrata; Johnson, W Craig; Lappalainen, Tuuli; Lin, Henry J; Liu, Yongmei; Nickerson, Deborah A; Papanicolaou, George; Pritchard, Jonathan K; Qasba, Pankaj; Shojaie, Ali; Smith, Josh; Sotoodehnia, Nona; Taylor, Kent D; Tracy, Russell P; Van Den Berg, David; Wheeler, Matthew T; Rich, Stephen S; Rotter, Jerome I; Battle, Alexis; Montgomery, Stephen B.

Cell Genom ; 3(10): 100401, 2023 Oct 11.

Article in English | MEDLINE | ID: mdl-37868038

ABSTRACT

Each human genome has tens of thousands of rare genetic variants; however, identifying impactful rare variants remains a major challenge. We demonstrate how use of personal multi-omics can enable identification of impactful rare variants by using the Multi-Ethnic Study of Atherosclerosis, which included several hundred individuals, with whole-genome sequencing, transcriptomes, methylomes, and proteomes collected across two time points, 10 years apart. We evaluated each multi-omics phenotype's ability to separately and jointly inform functional rare variation. By combining expression and protein data, we observed rare stop variants 62 times and rare frameshift variants 216 times as frequently as controls, compared to 13-27 times as frequently for expression or protein effects alone. We extended a Bayesian hierarchical model, "Watershed," to prioritize specific rare variants underlying multi-omics signals across the regulatory cascade. With this approach, we identified rare variants that exhibited large effect sizes on multiple complex traits including height, schizophrenia, and Alzheimer's disease.

8.

Rare variants in long non-coding RNAs are associated with blood lipid levels in the TOPMed whole-genome sequencing study.

Wang, Yuxuan; Selvaraj, Margaret Sunitha; Li, Xihao; Li, Zilin; Holdcraft, Jacob A; Arnett, Donna K; Bis, Joshua C; Blangero, John; Boerwinkle, Eric; Bowden, Donald W; Cade, Brian E; Carlson, Jenna C; Carson, April P; Chen, Yii-Der Ida; Curran, Joanne E; de Vries, Paul S; Dutcher, Susan K; Ellinor, Patrick T; Floyd, James S; Fornage, Myriam; Freedman, Barry I; Gabriel, Stacey; Germer, Soren; Gibbs, Richard A; Guo, Xiuqing; He, Jiang; Heard-Costa, Nancy; Hildalgo, Bertha; Hou, Lifang; Irvin, Marguerite R; Joehanes, Roby; Kaplan, Robert C; Kardia, Sharon Lr; Kelly, Tanika N; Kim, Ryan; Kooperberg, Charles; Kral, Brian G; Levy, Daniel; Li, Changwei; Liu, Chunyu; Lloyd-Jone, Don; Loos, Ruth Jf; Mahaney, Michael C; Martin, Lisa W; Mathias, Rasika A; Minster, Ryan L; Mitchell, Braxton D; Montasser, May E; Morrison, Alanna C; Murabito, Joanne M.

Am J Hum Genet ; 110(10): 1704-1717, 2023 10 05.

Article in English | MEDLINE | ID: mdl-37802043

ABSTRACT

Long non-coding RNAs (lncRNAs) are known to perform important regulatory functions in lipid metabolism. Large-scale whole-genome sequencing (WGS) studies and new statistical methods for variant set tests now provide an opportunity to assess more associations between rare variants in lncRNA genes and complex traits across the genome. In this study, we used high-coverage WGS from 66,329 participants of diverse ancestries with measurement of blood lipids and lipoproteins (LDL-C, HDL-C, TC, and TG) in the National Heart, Lung, and Blood Institute (NHLBI) Trans-Omics for Precision Medicine (TOPMed) program to investigate the role of lncRNAs in lipid variability. We aggregated rare variants for 165,375 lncRNA genes based on their genomic locations and conducted rare-variant aggregate association tests using the STAAR (variant-set test for association using annotation information) framework. We performed STAAR conditional analysis adjusting for common variants in known lipid GWAS loci and rare-coding variants in nearby protein-coding genes. Our analyses revealed 83 rare lncRNA variant sets significantly associated with blood lipid levels, all of which were located in known lipid GWAS loci (in a ±500-kb window of a Global Lipids Genetics Consortium index variant). Notably, 61 out of 83 signals (73%) were conditionally independent of common regulatory variation and rare protein-coding variation at the same loci. We replicated 34 out of 61 (56%) conditionally independent associations using the independent UK Biobank WGS data. Our results expand the genetic architecture of blood lipids to rare variants in lncRNAs.

Subject(s)

RNA, Long Noncoding , Humans , RNA, Long Noncoding/genetics , Genome-Wide Association Study , Precision Medicine , Whole Genome Sequencing/methods , Lipids/genetics , Polymorphism, Single Nucleotide/genetics

9.

Proteome-Wide Association Studies for Blood Lipids and Comparison with Transcriptome-Wide Association Studies.

Zhang, Daiwei; Gao, Boran; Feng, Qidi; Manichaikul, Ani; Peloso, Gina M; Tracy, Russell P; Durda, Peter; Taylor, Kent D; Liu, Yongmei; Johnson, W Craig; Gabriel, Stacey; Gupta, Namrata; Smith, Joshua D; Aguet, Francois; Ardlie, Kristin G; Blackwell, Thomas W; Gerszten, Robert E; Rich, Stephen S; Rotter, Jerome I; Scott, Laura J; Zhou, Xiang; Lee, Seunggeun.

bioRxiv ; 2023 Aug 21.

Article in English | MEDLINE | ID: mdl-37662416

ABSTRACT

Blood lipid traits are treatable and heritable risk factors for heart disease, a leading cause of mortality worldwide. Although genome-wide association studies (GWAS) have discovered hundreds of variants associated with lipids in humans, most of the causal mechanisms of lipids remain unknown. To better understand the biological processes underlying lipid metabolism, we investigated the associations of plasma protein levels with total cholesterol (TC), triglycerides (TG), high-density lipoprotein cholesterol (HDL), and low-density lipoprotein cholesterol (LDL) in blood. We trained protein prediction models based on samples in the Multi-Ethnic Study of Atherosclerosis (MESA) and applied them to conduct proteome-wide association studies (PWAS) for lipids using the Global Lipids Genetics Consortium (GLGC) data. Of the 749 proteins tested, 42 were significantly associated with at least one lipid trait. Furthermore, we performed transcriptome-wide association studies (TWAS) for lipids using 9,714 gene expression prediction models trained on samples from peripheral blood mononuclear cells (PBMCs) in MESA and 49 tissues in the Genotype-Tissue Expression (GTEx) project. We found that although PWAS and TWAS can show different directions of associations in an individual gene, 40 out of 49 tissues showed a positive correlation between PWAS and TWAS signed p-values across all the genes, which suggests a high-level consistency between proteome-lipid associations and transcriptome-lipid associations.

10.

Author Correction: Multi-ancestry genome-wide association analyses improve resolution of genes and pathways influencing lung function and chronic obstructive pulmonary disease risk.

Shrine, Nick; Izquierdo, Abril G; Chen, Jing; Packer, Richard; Hall, Robert J; Guyatt, Anna L; Batini, Chiara; Thompson, Rebecca J; Pavuluri, Chandan; Malik, Vidhi; Hobbs, Brian D; Moll, Matthew; Kim, Wonji; Tal-Singer, Ruth; Bakke, Per; Fawcett, Katherine A; John, Catherine; Coley, Kayesha; Piga, Noemi Nicole; Pozarickij, Alfred; Lin, Kuang; Millwood, Iona Y; Chen, Zhengming; Li, Liming; Wijnant, Sara R A; Lahousse, Lies; Brusselle, Guy; Uitterlinden, Andre G; Manichaikul, Ani; Oelsner, Elizabeth C; Rich, Stephen S; Barr, R Graham; Kerr, Shona M; Vitart, Veronique; Brown, Michael R; Wielscher, Matthias; Imboden, Medea; Jeong, Ayoung; Bartz, Traci M; Gharib, Sina A; Flexeder, Claudia; Karrasch, Stefan; Gieger, Christian; Peters, Annette; Stubbe, Beate; Hu, Xiaowei; Ortega, Victor E; Meyers, Deborah A; Bleecker, Eugene R; Gabriel, Stacey B.

Nat Genet ; 55(10): 1778-1779, 2023 Oct.

Article in English | MEDLINE | ID: mdl-37749248

11.

Interaction molecular QTL mapping discovers cellular and environmental modifiers of genetic regulatory effects.

Kasela, Silva; Aguet, François; Kim-Hellmuth, Sarah; Brown, Brielin C; Nachun, Daniel C; Tracy, Russell P; Durda, Peter; Liu, Yongmei; Taylor, Kent D; Craig Johnson, W; Berg, David Van Den; Gabriel, Stacey; Gupta, Namrata; Smith, Joshua D; Blackwell, Thomas W; Rotter, Jerome I; Ardlie, Kristin G; Manichaikul, Ani; Rich, Stephen S; Graham Barr, R; Lappalainen, Tuuli.

bioRxiv ; 2023 Jun 29.

Article in English | MEDLINE | ID: mdl-37425716

ABSTRACT

Bulk tissue molecular quantitative trait loci (QTLs) have been the starting point for interpreting disease-associated variants, while context-specific QTLs show particular relevance for disease. Here, we present the results of mapping interaction QTLs (iQTLs) for cell type, age, and other phenotypic variables in multi-omic, longitudinal data from blood of individuals of diverse ancestries. By modeling the interaction between genotype and estimated cell type proportions, we demonstrate that cell type iQTLs could be considered as proxies for cell type-specific QTL effects. The interpretation of age iQTLs, however, warrants caution as the moderation effect of age on the genotype and molecular phenotype association may be mediated by changes in cell type composition. Finally, we show that cell type iQTLs contribute to cell type-specific enrichment of diseases that, in combination with additional functional data, may guide future functional studies. Overall, this study highlights iQTLs to gain insights into the context-specificity of regulatory effects.

12.

Rare variants in long non-coding RNAs are associated with blood lipid levels in the TOPMed Whole Genome Sequencing Study.

Wang, Yuxuan; Selvaraj, Margaret Sunitha; Li, Xihao; Li, Zilin; Holdcraft, Jacob A; Arnett, Donna K; Bis, Joshua C; Blangero, John; Boerwinkle, Eric; Bowden, Donald W; Cade, Brian E; Carlson, Jenna C; Carson, April P; Chen, Yii-Der Ida; Curran, Joanne E; de Vries, Paul S; Dutcher, Susan K; Ellinor, Patrick T; Floyd, James S; Fornage, Myriam; Freedman, Barry I; Gabriel, Stacey; Germer, Soren; Gibbs, Richard A; Guo, Xiuqing; He, Jiang; Heard-Costa, Nancy; Hildalgo, Bertha; Hou, Lifang; Irvin, Marguerite R; Joehanes, Roby; Kaplan, Robert C; Kardia, Sharon Lr; Kelly, Tanika N; Kim, Ryan; Kooperberg, Charles; Kral, Brian G; Levy, Daniel; Li, Changwei; Liu, Chunyu; Lloyd-Jone, Don; Loos, Ruth Jf; Mahaney, Michael C; Martin, Lisa W; Mathias, Rasika A; Minster, Ryan L; Mitchell, Braxton D; Montasser, May E; Morrison, Alanna C; Murabito, Joanne M.

medRxiv ; 2023 Jun 29.

Article in English | MEDLINE | ID: mdl-37425772

ABSTRACT

Long non-coding RNAs (lncRNAs) are known to perform important regulatory functions. Large-scale whole genome sequencing (WGS) studies and new statistical methods for variant set tests now provide an opportunity to assess the associations between rare variants in lncRNA genes and complex traits across the genome. In this study, we used high-coverage WGS from 66,329 participants of diverse ancestries with blood lipid levels (LDL-C, HDL-C, TC, and TG) in the National Heart, Lung, and Blood Institute (NHLBI) Trans-Omics for Precision Medicine (TOPMed) program to investigate the role of lncRNAs in lipid variability. We aggregated rare variants for 165,375 lncRNA genes based on their genomic locations and conducted rare variant aggregate association tests using the STAAR (variant-Set Test for Association using Annotation infoRmation) framework. We performed STAAR conditional analysis adjusting for common variants in known lipid GWAS loci and rare coding variants in nearby protein coding genes. Our analyses revealed 83 rare lncRNA variant sets significantly associated with blood lipid levels, all of which were located in known lipid GWAS loci (in a ±500 kb window of a Global Lipids Genetics Consortium index variant). Notably, 61 out of 83 signals (73%) were conditionally independent of common regulatory variations and rare protein coding variations at the same loci. We replicated 34 out of 61 (56%) conditionally independent associations using the independent UK Biobank WGS data. Our results expand the genetic architecture of blood lipids to rare variants in lncRNA, implicating new therapeutic opportunities.

13.

Genetic control of mRNA splicing as a potential mechanism for incomplete penetrance of rare coding variants.

Einson, Jonah; Glinos, Dafni; Boerwinkle, Eric; Castaldi, Peter; Darbar, Dawood; de Andrade, Mariza; Ellinor, Patrick; Fornage, Myriam; Gabriel, Stacey; Germer, Soren; Gibbs, Richard; Hersh, Craig P; Johnsen, Jill; Kaplan, Robert; Konkle, Barbara A; Kooperberg, Charles; Nassir, Rami; Loos, Ruth J F; Meyers, Deborah A; Mitchell, Braxton D; Psaty, Bruce; Vasan, Ramachandran S; Rich, Stephen S; Rienstra, Michael; Rotter, Jerome I; Saferali, Aabida; Shoemaker, Moore Benjamin; Silverman, Edwin; Smith, Albert Vernon; Mohammadi, Pejman; Castel, Stephane E; Iossifov, Ivan; Lappalainen, Tuuli.

Genetics ; 224(4)2023 08 09.

Article in English | MEDLINE | ID: mdl-37348055

ABSTRACT

Exonic variants present some of the strongest links between genotype and phenotype. However, these variants can have significant inter-individual pathogenicity differences, known as variable penetrance. In this study, we propose a model where genetically controlled mRNA splicing modulates the pathogenicity of exonic variants. By first cataloging exonic inclusion from RNA-sequencing data in GTEx V8, we find that pathogenic alleles are depleted on highly included exons. Using a large-scale phased whole genome sequencing data from the TOPMed consortium, we observe that this effect may be driven by common splice-regulatory genetic variants, and that natural selection acts on haplotype configurations that reduce the transcript inclusion of putatively pathogenic variants, especially when limiting to haploinsufficient genes. Finally, we test if this effect may be relevant for autism risk using families from the Simons Simplex Collection, but find that splicing of pathogenic alleles has a penetrance reducing effect here as well. Overall, our results indicate that common splice-regulatory variants may play a role in reducing the damaging effects of rare exonic variants.

Subject(s)

RNA Splice Sites , RNA Splicing , Penetrance , Exons , Genotype , RNA, Messenger/genetics , Alternative Splicing

14.

Gene expression in African Americans, Puerto Ricans and Mexican Americans reveals ancestry-specific patterns of genetic architecture.

Kachuri, Linda; Mak, Angel C Y; Hu, Donglei; Eng, Celeste; Huntsman, Scott; Elhawary, Jennifer R; Gupta, Namrata; Gabriel, Stacey; Xiao, Shujie; Keys, Kevin L; Oni-Orisan, Akinyemi; Rodríguez-Santana, José R; LeNoir, Michael A; Borrell, Luisa N; Zaitlen, Noah A; Williams, L Keoki; Gignoux, Christopher R; Burchard, Esteban González; Ziv, Elad.

Nat Genet ; 55(6): 952-963, 2023 06.

Article in English | MEDLINE | ID: mdl-37231098

ABSTRACT

We explored ancestry-related differences in the genetic architecture of whole-blood gene expression using whole-genome and RNA sequencing data from 2,733 African Americans, Puerto Ricans and Mexican Americans. We found that heritability of gene expression significantly increased with greater proportions of African genetic ancestry and decreased with higher proportions of Indigenous American ancestry, reflecting the relationship between heterozygosity and genetic variance. Among heritable protein-coding genes, the prevalence of ancestry-specific expression quantitative trait loci (anc-eQTLs) was 30% in African ancestry and 8% for Indigenous American ancestry segments. Most anc-eQTLs (89%) were driven by population differences in allele frequency. Transcriptome-wide association analyses of multi-ancestry summary statistics for 28 traits identified 79% more gene-trait associations using transcriptome prediction models trained in our admixed population than models trained using data from the Genotype-Tissue Expression project. Our study highlights the importance of measuring gene expression across large and ancestrally diverse populations for enabling new discoveries and reducing disparities.

Subject(s)

Black or African American , Hispanic or Latino , Mexican Americans , Humans , Black or African American/genetics , Genome-Wide Association Study , Hispanic or Latino/genetics , Mexican Americans/genetics , Phenotype , Polymorphism, Single Nucleotide , Transcriptome

15.

Canonical correlation analysis for multi-omics: Application to cross-cohort analysis.

Jiang, Min-Zhi; Aguet, François; Ardlie, Kristin; Chen, Jiawen; Cornell, Elaine; Cruz, Dan; Durda, Peter; Gabriel, Stacey B; Gerszten, Robert E; Guo, Xiuqing; Johnson, Craig W; Kasela, Silva; Lange, Leslie A; Lappalainen, Tuuli; Liu, Yongmei; Reiner, Alex P; Smith, Josh; Sofer, Tamar; Taylor, Kent D; Tracy, Russell P; VanDenBerg, David J; Wilson, James G; Rich, Stephen S; Rotter, Jerome I; Love, Michael I; Raffield, Laura M; Li, Yun.

PLoS Genet ; 19(5): e1010517, 2023 05.

Article in English | MEDLINE | ID: mdl-37216410

ABSTRACT

Integrative approaches that simultaneously model multi-omics data have gained increasing popularity because they provide holistic system biology views of multiple or all components in a biological system of interest. Canonical correlation analysis (CCA) is a correlation-based integrative method designed to extract latent features shared between multiple assays by finding the linear combinations of features-referred to as canonical variables (CVs)-within each assay that achieve maximal across-assay correlation. Although widely acknowledged as a powerful approach for multi-omics data, CCA has not been systematically applied to multi-omics data in large cohort studies, which has only recently become available. Here, we adapted sparse multiple CCA (SMCCA), a widely-used derivative of CCA, to proteomics and methylomics data from the Multi-Ethnic Study of Atherosclerosis (MESA) and Jackson Heart Study (JHS). To tackle challenges encountered when applying SMCCA to MESA and JHS, our adaptations include the incorporation of the Gram-Schmidt (GS) algorithm with SMCCA to improve orthogonality among CVs, and the development of Sparse Supervised Multiple CCA (SSMCCA) to allow supervised integration analysis for more than two assays. Effective application of SMCCA to the two real datasets reveals important findings. Applying our SMCCA-GS to MESA and JHS, we identified strong associations between blood cell counts and protein abundance, suggesting that adjustment of blood cell composition should be considered in protein-based association studies. Importantly, CVs obtained from two independent cohorts also demonstrate transferability across the cohorts. For example, proteomic CVs learned from JHS, when transferred to MESA, explain similar amounts of blood cell count phenotypic variance in MESA, explaining 39.0% ~ 50.0% variation in JHS and 38.9% ~ 49.1% in MESA. Similar transferability was observed for other omics-CV-trait pairs. This suggests that biologically meaningful and cohort-agnostic variation is captured by CVs. We anticipate that applying our SMCCA-GS and SSMCCA on various cohorts would help identify cohort-agnostic biologically meaningful relationships between multi-omics data and phenotypic traits.

Subject(s)

Canonical Correlation Analysis , Proteomics , Humans , Proteomics/methods , Multiomics , Cohort Studies

16.

Multi-ancestry genome-wide association analyses improve resolution of genes and pathways influencing lung function and chronic obstructive pulmonary disease risk.

Shrine, Nick; Izquierdo, Abril G; Chen, Jing; Packer, Richard; Hall, Robert J; Guyatt, Anna L; Batini, Chiara; Thompson, Rebecca J; Pavuluri, Chandan; Malik, Vidhi; Hobbs, Brian D; Moll, Matthew; Kim, Wonji; Tal-Singer, Ruth; Bakke, Per; Fawcett, Katherine A; John, Catherine; Coley, Kayesha; Piga, Noemi Nicole; Pozarickij, Alfred; Lin, Kuang; Millwood, Iona Y; Chen, Zhengming; Li, Liming; Wijnant, Sara R A; Lahousse, Lies; Brusselle, Guy; Uitterlinden, Andre G; Manichaikul, Ani; Oelsner, Elizabeth C; Rich, Stephen S; Barr, R Graham; Kerr, Shona M; Vitart, Veronique; Brown, Michael R; Wielscher, Matthias; Imboden, Medea; Jeong, Ayoung; Bartz, Traci M; Gharib, Sina A; Flexeder, Claudia; Karrasch, Stefan; Gieger, Christian; Peters, Annette; Stubbe, Beate; Hu, Xiaowei; Ortega, Victor E; Meyers, Deborah A; Bleecker, Eugene R; Gabriel, Stacey B.

Nat Genet ; 55(3): 410-422, 2023 03.

Article in English | MEDLINE | ID: mdl-36914875

ABSTRACT

Lung-function impairment underlies chronic obstructive pulmonary disease (COPD) and predicts mortality. In the largest multi-ancestry genome-wide association meta-analysis of lung function to date, comprising 580,869 participants, we identified 1,020 independent association signals implicating 559 genes supported by ≥2 criteria from a systematic variant-to-gene mapping framework. These genes were enriched in 29 pathways. Individual variants showed heterogeneity across ancestries, age and smoking groups, and collectively as a genetic risk score showed strong association with COPD across ancestry groups. We undertook phenome-wide association studies for selected associated variants as well as trait and pathway-specific genetic risk scores to infer possible consequences of intervening in pathways underlying lung function. We highlight new putative causal variants, genes, proteins and pathways, including those targeted by existing drugs. These findings bring us closer to understanding the mechanisms underlying lung function and COPD, and should inform functional genomics experiments and potentially future COPD therapies.

Subject(s)

Lung , Pulmonary Disease, Chronic Obstructive , Humans , Genome-Wide Association Study , Genetic Predisposition to Disease/genetics , Pulmonary Disease, Chronic Obstructive/genetics , Smoking/adverse effects , Smoking/genetics , Polymorphism, Single Nucleotide/genetics

17.

Whole Genome Analysis of Venous Thromboembolism: the Trans-Omics for Precision Medicine Program.

Seyerle, Amanda A; Laurie, Cecelia A; Coombes, Brandon J; Jain, Deepti; Conomos, Matthew P; Brody, Jennifer; Chen, Ming-Huei; Gogarten, Stephanie M; Beutel, Kathleen M; Gupta, Namrata; Heckbert, Susan R; Jackson, Rebecca D; Johnson, Andrew D; Ko, Darae; Manson, JoAnn E; McKnight, Barbara; Metcalf, Ginger A; Morrison, Alanna C; Reiner, Alexander P; Sofer, Tamar; Tang, Weihong; Wiggins, Kerri L; Boerwinkle, Eric; de Andrade, Mariza; Gabriel, Stacey B; Gibbs, Richard A; Laurie, Cathy C; Psaty, Bruce M; Vasan, Ramachandran S; Rice, Ken; Kooperberg, Charles; Pankow, James S; Smith, Nicholas L; Pankratz, Nathan.

Circ Genom Precis Med ; 16(2): e003532, 2023 04.

Article in English | MEDLINE | ID: mdl-36960714

ABSTRACT

BACKGROUND: Risk for venous thromboembolism has a strong genetic component. Whole genome sequencing from the TOPMed program (Trans-Omics for Precision Medicine) allowed us to look for new associations, particularly rare variants missed by standard genome-wide association studies. METHODS: The 3793 cases and 7834 controls (11.6% of cases were individuals of African, Hispanic/Latino, or Asian ancestry) were analyzed using a single variant approach and an aggregate gene-based approach using our primary filter (included only loss-of-function and missense variants predicted to be deleterious) and our secondary filter (included all missense variants). RESULTS: Single variant analyses identified associations at 5 known loci. Aggregate gene-based analyses identified only PROC (odds ratio, 6.2 for carriers of rare variants; P=7.4×10-14) when using our primary filter. Employing our secondary variant filter led to a smaller effect size at PROC (odds ratio, 3.8; P=1.6×10-14), while excluding variants found only in rare isoforms led to a larger one (odds ratio, 7.5). Different filtering strategies improved the signal for 2 other known genes: PROS1 became significant (minimum P=1.8×10-6 with the secondary filter), while SERPINC1 did not (minimum P=4.4×10-5 with minor allele frequency <0.0005). Results were largely the same when restricting the analyses to include only unprovoked cases; however, one novel gene, MS4A1, became significant (P=4.4×10-7 using all missense variants with minor allele frequency <0.0005). CONCLUSIONS: Here, we have demonstrated the importance of using multiple variant filtering strategies, as we detected additional genes when filtering variants based on their predicted deleteriousness, frequency, and presence on the most expressed isoforms. Our primary analyses did not identify new candidate loci; thus larger follow-up studies are needed to replicate the novel MS4A1 locus and to identify additional rare variation associated with venous thromboembolism.

Subject(s)

Genome-Wide Association Study , Venous Thromboembolism , Humans , Venous Thromboembolism/genetics , Precision Medicine , Genetic Predisposition to Disease , Gene Frequency

18.

Genetic control of mRNA splicing as a potential mechanism for incomplete penetrance of rare coding variants.

Einson, Jonah; Glinos, Dafni; Boerwinkle, Eric; Castaldi, Peter; Darbar, Dawood; de Andrade, Mariza; Ellinor, Patrick; Fornage, Myriam; Gabriel, Stacey; Germer, Soren; Gibbs, Richard; Hersh, Craig P; Johnsen, Jill; Kaplan, Robert; Konkle, Barbara A; Kooperberg, Charles; Nassir, Rami; Loos, Ruth J F; Meyers, Deborah A; Mitchell, Braxton D; Psaty, Bruce; Vasan, Ramachandran S; Rich, Stephen S; Rienstra, Michael; Rotter, Jerome I; Saferali, Aabida; Shoemaker, M Benjamin; Silverman, Edwin; Smith, Albert Vernon; Mohammadi, Pejman; Castel, Stephane E; Iossifov, Ivan; Lappalainen, Tuuli.

bioRxiv ; 2023 Jan 31.

Article in English | MEDLINE | ID: mdl-36778406

ABSTRACT

Exonic variants present some of the strongest links between genotype and phenotype. However, these variants can have significant inter-individual pathogenicity differences, known as variable penetrance. In this study, we propose a model where genetically controlled mRNA splicing modulates the pathogenicity of exonic variants. By first cataloging exonic inclusion from RNA-seq data in GTEx v8, we find that pathogenic alleles are depleted on highly included exons. Using a large-scale phased WGS data from the TOPMed consortium, we observe that this effect may be driven by common splice-regulatory genetic variants, and that natural selection acts on haplotype configurations that reduce the transcript inclusion of putatively pathogenic variants, especially when limiting to haploinsufficient genes. Finally, we test if this effect may be relevant for autism risk using families from the Simons Simplex Collection, but find that splicing of pathogenic alleles has a penetrance reducing effect here as well. Overall, our results indicate that common splice-regulatory variants may play a role in reducing the damaging effects of rare exonic variants.

19.

Mechanisms of response and resistance to combined decitabine and ipilimumab for advanced myeloid disease.

Penter, Livius; Liu, Yang; Wolff, Jacquelyn O; Yang, Lin; Taing, Len; Jhaveri, Aashna; Southard, Jackson; Patel, Manishkumar; Cullen, Nicole M; Pfaff, Kathleen L; Cieri, Nicoletta; Oliveira, Giacomo; Kim-Schulze, Seunghee; Ranasinghe, Srinika; Leonard, Rebecca; Robertson, Taylor; Morgan, Elizabeth A; Chen, Helen X; Song, Minkyung H; Thurin, Magdalena; Li, Shuqiang; Rodig, Scott J; Cibulskis, Carrie; Gabriel, Stacey; Bachireddy, Pavan; Ritz, Jerome; Streicher, Howard; Neuberg, Donna S; Hodi, F Stephen; Davids, Matthew S; Gnjatic, Sacha; Livak, Kenneth J; Altreuter, Jennifer; Michor, Franziska; Soiffer, Robert J; Garcia, Jacqueline S; Wu, Catherine J.

Blood ; 141(15): 1817-1830, 2023 04 13.

Article in English | MEDLINE | ID: mdl-36706355

ABSTRACT

The challenge of eradicating leukemia in patients with acute myelogenous leukemia (AML) after initial cytoreduction has motivated modern efforts to combine synergistic active modalities including immunotherapy. Recently, the ETCTN/CTEP 10026 study tested the combination of the DNA methyltransferase inhibitor decitabine together with the immune checkpoint inhibitor ipilimumab for AML/myelodysplastic syndrome (MDS) either after allogeneic hematopoietic stem cell transplantation (HSCT) or in the HSCT-naïve setting. Integrative transcriptome-based analysis of 304 961 individual marrow-infiltrating cells for 18 of 48 subjects treated on study revealed the strong association of response with a high baseline ratio of T to AML cells. Clinical responses were predominantly driven by decitabine-induced cytoreduction. Evidence of immune activation was only apparent after ipilimumab exposure, which altered CD4+ T-cell gene expression, in line with ongoing T-cell differentiation and increased frequency of marrow-infiltrating regulatory T cells. For post-HSCT samples, relapse could be attributed to insufficient clearing of malignant clones in progenitor cell populations. In contrast to AML/MDS bone marrow, the transcriptomes of leukemia cutis samples from patients with durable remission after ipilimumab monotherapy showed evidence of increased infiltration with antigen-experienced resident memory T cells and higher expression of CTLA-4 and FOXP3. Altogether, activity of combined decitabine and ipilimumab is impacted by cellular expression states within the microenvironmental niche of leukemic cells. The inadequate elimination of leukemic progenitors mandates urgent development of novel approaches for targeting these cell populations to generate long-lasting responses. This trial was registered at www.clinicaltrials.gov as #NCT02890329.

Subject(s)

Hematopoietic Stem Cell Transplantation , Leukemia, Myeloid, Acute , Myelodysplastic Syndromes , Humans , Ipilimumab/therapeutic use , Decitabine/therapeutic use , Myelodysplastic Syndromes/genetics , Leukemia, Myeloid, Acute/drug therapy , Leukemia, Myeloid, Acute/genetics , Leukemia, Myeloid, Acute/pathology , Recurrence

20.

Gene expression associations with body mass index in the Multi-Ethnic Study of Atherosclerosis.

Vargas, Luciana B; Lange, Leslie A; Ferrier, Kendra; Aguet, François; Ardlie, Kristin; Gabriel, Stacey; Gupta, Namrata; Smith, Joshua D; Blackwell, Thomas W; Ding, Jingzhong; Durda, Peter; Tracy, Russell P; Liu, Yongmei; Taylor, Kent D; Craig Johnson, W; Rich, Stephen S; Rotter, Jerome I; Lange, Ethan M; Konigsberg, Iain R.

Int J Obes (Lond) ; 47(2): 109-116, 2023 02.

Article in English | MEDLINE | ID: mdl-36463326

ABSTRACT

BACKGROUND/OBJECTIVES: Obesity, defined as excessive fat accumulation that represents a health risk, is increasing in adults and children, reaching global epidemic proportions. Body mass index (BMI) correlates with body fat and future health risk, yet differs in prediction by fat distribution, across populations and by age. Nonetheless, few genetic studies of BMI have been conducted in ancestrally diverse populations. Gene expression association with BMI was assessed in the Multi-Ethnic Study of Atherosclerosis (MESA) in four self-identified race and ethnicity (SIRE) groups to identify genes associated with obesity. SUBJECTS/METHODS: RNA-sequencing was performed on 1096 MESA participants (37.8% white, 24.3% Hispanic, 28.4% African American, and 9.5% Chinese American) and linear models were used to assess the association of expression from each gene for its effect on BMI, adjusting for age, sex, sequencing center, study site, five expression and four genetic principal components in each self-identified race group. Sample-size-weighted meta-analysis was performed to identify genes with BMI-associated expression across ancestry groups. RESULTS: Within individual SIRE groups, there were zero to three genes whose expression is significantly (p < 1.97 × 10-6) associated with BMI. Across all groups, 45 genes were identified by meta-analysis whose expression was significantly associated with BMI, explaining 29.7% of BMI variation. The 45 genes are expressed in a variety of tissues and cell types and are enriched for obesity-related processes including erythrocyte function, oxygen binding and transport, and JAK-STAT signaling. CONCLUSIONS: We have identified genes whose expression is significantly associated with obesity in a multi-ethnic cohort. We have identified novel genes associated with BMI as well as confirmed previously identified genes from earlier genetic analyses. These novel genes and their biological pathways represent new targets for understanding the biology of obesity as well as new therapeutic intervention to reduce obesity and improve global public health.

Subject(s)

Body Mass Index , Gene Expression , Obesity , Adult , Child , Humans , Atherosclerosis , Obesity/epidemiology , Obesity/genetics

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL