Búsqueda | BVS Bolivia

1.

Stimulating Wnt signaling reveals context-dependent genetic effects on gene regulation in primary human neural progenitors.

Matoba, Nana; Le, Brandon D; Valone, Jordan M; Wolter, Justin M; Mory, Jessica T; Liang, Dan; Aygün, Nil; Broadaway, K Alaine; Bond, Marielle L; Mohlke, Karen L; Zylka, Mark J; Love, Michael I; Stein, Jason L.

Nat Neurosci ; 2024 Sep 30.

Artículo en Inglés | MEDLINE | ID: mdl-39349663

RESUMEN

Gene regulatory effects have been difficult to detect at many non-coding loci associated with brain-related traits, likely because some genetic variants have distinct functions in specific contexts. To explore context-dependent gene regulation, we measured chromatin accessibility and gene expression after activation of the canonical Wnt pathway in primary human neural progenitors (n = 82 donors). We found that TCF/LEF motifs and brain-structure-associated and neuropsychiatric-disorder-associated variants were enriched within Wnt-responsive regulatory elements. Genetically influenced regulatory elements were enriched in genomic regions under positive selection along the human lineage. Wnt pathway stimulation increased detection of genetically influenced regulatory elements/genes by 66%/53% and enabled identification of 397 regulatory elements primed to regulate gene expression. Stimulation also increased identification of shared genetic effects on molecular and complex brain traits by up to 70%, suggesting that genetic variant function during neurodevelopmental patterning can lead to differences in adult brain and behavioral traits.

2.

Genetics of cell-type-specific post-transcriptional gene regulation during human neurogenesis.

Aygün, Nil; Vuong, Celine; Krupa, Oleh; Mory, Jessica; Le, Brandon D; Valone, Jordan M; Liang, Dan; Shafie, Beck; Zhang, Pan; Salinda, Angelo; Wen, Cindy; Gandal, Michael J; Love, Michael I; de la Torre-Ubieta, Luis; Stein, Jason L.

Am J Hum Genet ; 111(9): 1877-1898, 2024 Sep 05.

Artículo en Inglés | MEDLINE | ID: mdl-39168119

RESUMEN

The function of some genetic variants associated with brain-relevant traits has been explained through colocalization with expression quantitative trait loci (eQTL) conducted in bulk postmortem adult brain tissue. However, many brain-trait associated loci have unknown cellular or molecular function. These genetic variants may exert context-specific function on different molecular phenotypes including post-transcriptional changes. Here, we identified genetic regulation of RNA editing and alternative polyadenylation (APA) within a cell-type-specific population of human neural progenitors and neurons. More RNA editing and isoforms utilizing longer polyadenylation sequences were observed in neurons, likely due to higher expression of genes encoding the proteins mediating these post-transcriptional events. We also detected hundreds of cell-type-specific editing quantitative trait loci (edQTLs) and alternative polyadenylation QTLs (apaQTLs). We found colocalizations of a neuron edQTL in CCDC88A with educational attainment and a progenitor apaQTL in EP300 with schizophrenia, suggesting that genetically mediated post-transcriptional regulation during brain development leads to differences in brain function.

Asunto(s)

Neurogénesis , Neuronas , Sitios de Carácter Cuantitativo , Humanos , Neurogénesis/genética , Neuronas/metabolismo , Edición de ARN/genética , Poliadenilación/genética , Esquizofrenia/genética , Regulación de la Expresión Génica , Células-Madre Neurales/metabolismo , Células-Madre Neurales/citología , Encéfalo/metabolismo , Procesamiento Postranscripcional del ARN/genética

3.

Cross-site reproducibility of human cortical organoids reveals consistent cell type composition and architecture.

Glass, Madison R; Waxman, Elisa A; Yamashita, Satoshi; Lafferty, Michael; Beltran, Alvaro A; Farah, Tala; Patel, Niyanta K; Singla, Rubal; Matoba, Nana; Ahmed, Sara; Srivastava, Mary; Drake, Emma; Davis, Liam T; Yeturi, Meghana; Sun, Kexin; Love, Michael I; Hashimoto-Torii, Kazue; French, Deborah L; Stein, Jason L.

Stem Cell Reports ; 19(9): 1351-1367, 2024 Sep 10.

Artículo en Inglés | MEDLINE | ID: mdl-39178845

RESUMEN

While guided human cortical organoid (hCO) protocols reproducibly generate cortical cell types at one site, variability in hCO phenotypes across sites using a harmonized protocol has not yet been evaluated. To determine the cross-site reproducibility of hCO differentiation, three independent research groups assayed hCOs in multiple differentiation replicates from one induced pluripotent stem cell (iPSC) line using a harmonized miniaturized spinning bioreactor protocol across 3 months. hCOs were mostly cortical progenitor and neuronal cell types in reproducible proportions that were consistently organized in cortical wall-like buds. Cross-site differences were detected in hCO size and expression of metabolism and cellular stress genes. Variability in hCO phenotypes correlated with stem cell gene expression prior to differentiation and technical factors associated with seeding, suggesting iPSC quality and treatment are important for differentiation outcomes. Cross-site reproducibility of hCO cell type proportions and organization encourages future prospective meta-analytic studies modeling neurodevelopmental disorders in hCOs.

Asunto(s)

Diferenciación Celular , Corteza Cerebral , Células Madre Pluripotentes Inducidas , Organoides , Humanos , Organoides/citología , Organoides/metabolismo , Células Madre Pluripotentes Inducidas/citología , Células Madre Pluripotentes Inducidas/metabolismo , Reproducibilidad de los Resultados , Corteza Cerebral/citología , Corteza Cerebral/metabolismo , Neuronas/metabolismo , Neuronas/citología , Técnicas de Cultivo de Célula/métodos , Fenotipo

4.

Liver eQTL meta-analysis illuminates potential molecular mechanisms of cardiometabolic traits.

Broadaway, K Alaine; Brotman, Sarah M; Rosen, Jonathan D; Currin, Kevin W; Alkhawaja, Abdalla A; Etheridge, Amy S; Wright, Fred; Gallins, Paul; Jima, Dereje; Zhou, Yi-Hui; Love, Michael I; Innocenti, Federico; Mohlke, Karen L.

Am J Hum Genet ; 111(9): 1899-1913, 2024 Sep 05.

Artículo en Inglés | MEDLINE | ID: mdl-39173627

RESUMEN

Understanding the molecular mechanisms of complex traits is essential for developing targeted interventions. We analyzed liver expression quantitative-trait locus (eQTL) meta-analysis data on 1,183 participants to identify conditionally distinct signals. We found 9,013 eQTL signals for 6,564 genes; 23% of eGenes had two signals, and 6% had three or more signals. We then integrated the eQTL results with data from 29 cardiometabolic genome-wide association study (GWAS) traits and identified 1,582 GWAS-eQTL colocalizations for 747 eGenes. Non-primary eQTL signals accounted for 17% of all colocalizations. Isolating signals by conditional analysis prior to coloc resulted in 37% more colocalizations than using marginal eQTL and GWAS data, highlighting the importance of signal isolation. Isolating signals also led to stronger evidence of colocalization: among 343 eQTL-GWAS signal pairs in multi-signal regions, analyses that isolated the signals of interest resulted in higher posterior probability of colocalization for 41% of tests. Leveraging allelic heterogeneity, we predicted causal effects of gene expression on liver traits for four genes. To predict functional variants and regulatory elements, we colocalized eQTL with liver chromatin accessibility QTL (caQTL) and found 391 colocalizations, including 73 with non-primary eQTL signals and 60 eQTL signals that colocalized with both a caQTL and a GWAS signal. Finally, we used publicly available massively parallel reporter assays in HepG2 to highlight 14 eQTL signals that include at least one expression-modulating variant. This multi-faceted approach to unraveling the genetic underpinnings of liver-related traits could lead to therapeutic development.

Asunto(s)

Estudio de Asociación del Genoma Completo , Hígado , Sitios de Carácter Cuantitativo , Humanos , Alelos , Enfermedades Cardiovasculares/genética , Predisposición Genética a la Enfermedad , Hígado/metabolismo , Fenotipo , Polimorfismo de Nucleótido Simple

5.

Extensive co-regulation of neighboring genes complicates the use of eQTLs in target gene prioritization.

Tambets, Ralf; Kolde, Anastassia; Kolberg, Peep; Love, Michael I; Alasoo, Kaur.

HGG Adv ; 5(4): 100348, 2024 Oct 10.

Artículo en Inglés | MEDLINE | ID: mdl-39210598

RESUMEN

Identifying causal genes underlying genome-wide association studies (GWASs) is a fundamental problem in human genetics. Although colocalization with gene expression quantitative trait loci (eQTLs) is often used to prioritize GWAS target genes, systematic benchmarking has been limited due to unavailability of large ground truth datasets. Here, we re-analyzed plasma protein QTL data from 3,301 individuals of the INTERVAL cohort together with 131 eQTL Catalog datasets. Focusing on variants located within or close to the affected protein identified 793 proteins with at least one cis-pQTL where we could assume that the most likely causal gene was the gene coding for the protein. We then benchmarked the ability of cis-eQTLs to recover these causal genes by comparing three Bayesian colocalization methods (coloc.susie, coloc.abf, and CLPP) and five Mendelian randomization (MR) approaches (three varieties of inverse-variance weighted MR, MR-RAPS, and MRLocus). We found that assigning fine-mapped pQTLs to their closest protein coding genes outperformed all colocalization methods regarding both precision (71.9%) and recall (76.9%). Furthermore, the colocalization method with the highest recall (coloc.susie - 46.3%) also had the lowest precision (45.1%). Combining evidence from multiple conditionally distinct colocalizing QTLs with MR increased precision to 81%, but this was accompanied by a large reduction in recall to 7.1%. Furthermore, the choice of the MR method greatly affected performance, with the standard inverse-variance-weighted MR often producing many false positives. Our results highlight that linking GWAS variants to target genes remains challenging with eQTL evidence alone, and prioritizing novel targets requires triangulation of evidence from multiple sources.

Asunto(s)

Estudio de Asociación del Genoma Completo , Polimorfismo de Nucleótido Simple , Sitios de Carácter Cuantitativo , Humanos , Polimorfismo de Nucleótido Simple/genética , Teorema de Bayes , Análisis de la Aleatorización Mendeliana/métodos , Regulación de la Expresión Génica

6.

Assessing Etiologic Heterogeneity for Multinomial Outcome with Two-Phase Outcome-Dependent Sampling Design.

Reifeis, Sarah A; Hudgens, Michael G; Troester, Melissa A; Love, Michael I.

Am J Epidemiol ; 2024 Jul 16.

Artículo en Inglés | MEDLINE | ID: mdl-39010753

RESUMEN

Etiologic heterogeneity occurs when distinct sets of events or exposures give rise to different subtypes of disease. Inference about subtype-specific exposure effects from two-phase outcome-dependent sampling data requires adjustment for both confounding and the sampling design. Common approaches to inference for these effects do not necessarily appropriately adjust for these sources of bias, or allow for formal comparisons of effects across different subtypes. Herein, using inverse probability weighting (IPW) to fit a multinomial model is shown to yield valid inference with this sampling design for subtype-specific exposure effects and contrasts thereof. The IPW approach is compared to common regression-based methods for assessing exposure effect heterogeneity using simulations. The methods are applied to estimate subtype-specific effects of various exposures on breast cancer risk in the Carolina Breast Cancer Study.

7.

Response eQTLs, chromatin accessibility, and 3D chromatin structure in chondrocytes provide mechanistic insight into osteoarthritis risk.

Kramer, Nicole E; Coryell, Philip; D'Costa, Susan; Thulson, Eliza; Byun, Seyoun; Kim, HyunAh; Parkus, Sylvie M; Bond, Marielle L; Shine, Jacqueline; Chubinskaya, Susanna; Love, Michael I; Mohlke, Karen L; Diekman, Brian O; Loeser, Richard F; Phanstiel, Douglas H.

bioRxiv ; 2024 May 05.

Artículo en Inglés | MEDLINE | ID: mdl-38952796

RESUMEN

Osteoarthritis (OA) poses a significant healthcare burden with limited treatment options. While genome-wide association studies (GWAS) have identified over 100 OA-associated loci, translating these findings into therapeutic targets remains challenging. Integrating expression quantitative trait loci (eQTL), 3D chromatin structure, and other genomic approaches with OA GWAS data offers a promising approach to elucidate disease mechanisms; however, comprehensive eQTL maps in OA-relevant tissues and conditions remain scarce. We mapped gene expression, chromatin accessibility, and 3D chromatin structure in primary human articular chondrocytes in both resting and OA-mimicking conditions. We identified thousands of differentially expressed genes, including those associated with differences in sex and age. RNA-seq in chondrocytes from 101 donors across two conditions uncovered 3782 unique eGenes, including 420 that exhibited strong and significant condition-specific effects. Colocalization with OA GWAS signals revealed 13 putative OA risk genes, 10 of which have not been previously identified. Chromatin accessibility and 3D chromatin structure provided insights into the mechanisms and conditional specificity of these variants. Our findings shed light on OA pathogenesis and highlight potential targets for therapeutic development. Highlights: ∘ Comprehensive analysis of sex- and age-related global gene expression in human chondrocytes revealed differences that correlate with osteoarthritis ∘ First response eQTLs in chondrocytes treated with an OA-related stimulus ∘ Deeply sequenced Hi-C in resting and activated chondrocytes helps connect OA risk variants to their putative causal genes ∘ Colocalization analysis reveals 13 (including 10 novel) putative OA risk genes.

8.

The tidyomics ecosystem: enhancing omic data analyses.

Hutchison, William J; Keyes, Timothy J; Crowell, Helena L; Serizay, Jacques; Soneson, Charlotte; Davis, Eric S; Sato, Noriaki; Moses, Lambda; Tarlinton, Boyd; Nahid, Abdullah A; Kosmac, Miha; Clayssen, Quentin; Yuan, Victor; Mu, Wancen; Park, Ji-Eun; Mamede, Izabela; Ryu, Min Hyung; Axisa, Pierre-Paul; Paiz, Paulina; Poon, Chi-Lam; Tang, Ming; Gottardo, Raphael; Morgan, Martin; Lee, Stuart; Lawrence, Michael; Hicks, Stephanie C; Nolan, Garry P; Davis, Kara L; Papenfuss, Anthony T; Love, Michael I; Mangiola, Stefano.

Nat Methods ; 21(7): 1166-1170, 2024 Jul.

Artículo en Inglés | MEDLINE | ID: mdl-38877315

RESUMEN

The growth of omic data presents evolving challenges in data manipulation, analysis and integration. Addressing these challenges, Bioconductor provides an extensive community-driven biological data analysis platform. Meanwhile, tidy R programming offers a revolutionary data organization and manipulation standard. Here we present the tidyomics software ecosystem, bridging Bioconductor to the tidy R paradigm. This ecosystem aims to streamline omic analysis, ease learning and encourage cross-disciplinary collaborations. We demonstrate the effectiveness of tidyomics by analyzing 7.5 million peripheral blood mononuclear cells from the Human Cell Atlas, spanning six data frameworks and ten analysis tools.

Asunto(s)

Programas Informáticos , Humanos , Biología Computacional/métodos , Leucocitos Mononucleares/metabolismo , Leucocitos Mononucleares/citología , Genómica/métodos , Análisis de Datos

9.

The tidyomics ecosystem: Enhancing omic data analyses.

Hutchison, William J; Keyes, Timothy J; Crowell, Helena L; Serizay, Jacques; Soneson, Charlotte; Davis, Eric S; Sato, Noriaki; Moses, Lambda; Tarlinton, Boyd; Nahid, Abdullah A; Kosmac, Miha; Clayssen, Quentin; Yuan, Victor; Mu, Wancen; Park, Ji-Eun; Mamede, Izabela; Ryu, Min Hyung; Axisa, Pierre-Paul; Paiz, Paulina; Poon, Chi-Lam; Tang, Ming; Gottardo, Raphael; Morgan, Martin; Lee, Stuart; Lawrence, Michael; Hicks, Stephanie C; Nolan, Garry P; Davis, Kara L; Papenfuss, Anthony T; Love, Michael I; Mangiola, Stefano.

bioRxiv ; 2024 May 22.

Artículo en Inglés | MEDLINE | ID: mdl-38826347

RESUMEN

The growth of omic data presents evolving challenges in data manipulation, analysis, and integration. Addressing these challenges, Bioconductor1 provides an extensive community-driven biological data analysis platform. Meanwhile, tidy R programming2 offers a revolutionary standard for data organisation and manipulation. Here, we present the tidyomics software ecosystem, bridging Bioconductor to the tidy R paradigm. This ecosystem aims to streamline omic analysis, ease learning, and encourage cross-disciplinary collaborations. We demonstrate the effectiveness of tidyomics by analysing 7.5 million peripheral blood mononuclear cells from the Human Cell Atlas3, spanning six data frameworks and ten analysis tools.

10.

Cross-ancestry atlas of gene, isoform, and splicing regulation in the developing human brain.

Wen, Cindy; Margolis, Michael; Dai, Rujia; Zhang, Pan; Przytycki, Pawel F; Vo, Daniel D; Bhattacharya, Arjun; Matoba, Nana; Tang, Miao; Jiao, Chuan; Kim, Minsoo; Tsai, Ellen; Hoh, Celine; Aygün, Nil; Walker, Rebecca L; Chatzinakos, Christos; Clarke, Declan; Pratt, Henry; Peters, Mette A; Gerstein, Mark; Daskalakis, Nikolaos P; Weng, Zhiping; Jaffe, Andrew E; Kleinman, Joel E; Hyde, Thomas M; Weinberger, Daniel R; Bray, Nicholas J; Sestan, Nenad; Geschwind, Daniel H; Roeder, Kathryn; Gusev, Alexander; Pasaniuc, Bogdan; Stein, Jason L; Love, Michael I; Pollard, Katherine S; Liu, Chunyu; Gandal, Michael J.

Science ; 384(6698): eadh0829, 2024 May 24.

Artículo en Inglés | MEDLINE | ID: mdl-38781368

RESUMEN

Neuropsychiatric genome-wide association studies (GWASs), including those for autism spectrum disorder and schizophrenia, show strong enrichment for regulatory elements in the developing brain. However, prioritizing risk genes and mechanisms is challenging without a unified regulatory atlas. Across 672 diverse developing human brains, we identified 15,752 genes harboring gene, isoform, and/or splicing quantitative trait loci, mapping 3739 to cellular contexts. Gene expression heritability drops during development, likely reflecting both increasing cellular heterogeneity and the intrinsic properties of neuronal maturation. Isoform-level regulation, particularly in the second trimester, mediated the largest proportion of GWAS heritability. Through colocalization, we prioritized mechanisms for about 60% of GWAS loci across five disorders, exceeding adult brain findings. Finally, we contextualized results within gene and isoform coexpression networks, revealing the comprehensive landscape of transcriptome regulation in development and disease.

Asunto(s)

Empalme Alternativo , Encéfalo , Regulación del Desarrollo de la Expresión Génica , Trastornos Mentales , Humanos , Atlas como Asunto , Trastorno del Espectro Autista/genética , Encéfalo/metabolismo , Encéfalo/crecimiento & desarrollo , Encéfalo/embriología , Redes Reguladoras de Genes , Estudio de Asociación del Genoma Completo , Isoformas de Proteínas/genética , Isoformas de Proteínas/metabolismo , Sitios de Carácter Cuantitativo , Esquizofrenia/genética , Transcriptoma , Trastornos Mentales/genética

11.

Joint genotypic and phenotypic outcome modeling improves base editing variant effect quantification.

Ryu, Jayoung; Barkal, Sam; Yu, Tian; Jankowiak, Martin; Zhou, Yunzhuo; Francoeur, Matthew; Phan, Quang Vinh; Li, Zhijian; Tognon, Manuel; Brown, Lara; Love, Michael I; Bhat, Vineel; Lettre, Guillaume; Ascher, David B; Cassa, Christopher A; Sherwood, Richard I; Pinello, Luca.

Nat Genet ; 56(5): 925-937, 2024 May.

Artículo en Inglés | MEDLINE | ID: mdl-38658794

RESUMEN

CRISPR base editing screens enable analysis of disease-associated variants at scale; however, variable efficiency and precision confounds the assessment of variant-induced phenotypes. Here, we provide an integrated experimental and computational pipeline that improves estimation of variant effects in base editing screens. We use a reporter construct to measure guide RNA (gRNA) editing outcomes alongside their phenotypic consequences and introduce base editor screen analysis with activity normalization (BEAN), a Bayesian network that uses per-guide editing outcomes provided by the reporter and target site chromatin accessibility to estimate variant impacts. BEAN outperforms existing tools in variant effect quantification. We use BEAN to pinpoint common regulatory variants that alter low-density lipoprotein (LDL) uptake, implicating previously unreported genes. Additionally, through saturation base editing of LDLR, we accurately quantify missense variant pathogenicity that is consistent with measurements in UK Biobank patients and identify underlying structural mechanisms. This work provides a widely applicable approach to improve the power of base editing screens for disease-associated variant characterization.

Asunto(s)

Sistemas CRISPR-Cas , Edición Génica , Genotipo , Fenotipo , ARN Guía de Sistemas CRISPR-Cas , Humanos , Edición Génica/métodos , ARN Guía de Sistemas CRISPR-Cas/genética , Teorema de Bayes , Receptores de LDL/genética , Células HEK293

12.

Machine learning methods for predicting guide RNA effects in CRISPR epigenome editing experiments.

Mu, Wancen; Luo, Tianyou; Barrera, Alejandro; Bounds, Lexi R; Klann, Tyler S; Ter Weele, Maria; Bryois, Julien; Crawford, Gregory E; Sullivan, Patrick F; Gersbach, Charles A; Love, Michael I; Li, Yun.

bioRxiv ; 2024 Apr 19.

Artículo en Inglés | MEDLINE | ID: mdl-38659894

RESUMEN

CRISPR epigenomic editing technologies enable functional interrogation of non-coding elements. However, current computational methods for guide RNA (gRNA) design do not effectively predict the power potential, molecular and cellular impact to optimize for efficient gRNAs, which are crucial for successful applications of these technologies. We present "launch-dCas9" (machine LeArning based UNified CompreHensive framework for CRISPR-dCas9) to predict gRNA impact from multiple perspectives, including cell fitness, wildtype abundance (gauging power potential), and gene expression in single cells. Our launchdCas9, built and evaluated using experiments involving >1 million gRNAs targeted across the human genome, demonstrates relatively high prediction accuracy (AUC up to 0.81) and generalizes across cell lines. Method-prioritized top gRNA(s) are 4.6-fold more likely to exert effects, compared to other gRNAs in the same cis-regulatory region. Furthermore, launchdCas9 identifies the most critical sequence-related features and functional annotations from >40 features considered. Our results establish launch-dCas9 as a promising approach to design gRNAs for CRISPR epigenomic experiments.

13.

Diffsig: Associating Risk Factors with Mutational Signatures.

Park, Ji-Eun; Smith, Markia A; Van Alsten, Sarah C; Walens, Andrea; Wu, Di; Hoadley, Katherine A; Troester, Melissa A; Love, Michael I.

Cancer Epidemiol Biomarkers Prev ; 33(5): 721-730, 2024 May 01.

Artículo en Inglés | MEDLINE | ID: mdl-38426904

RESUMEN

BACKGROUND: Somatic mutational signatures elucidate molecular vulnerabilities to therapy, and therefore detecting signatures and classifying tumors with respect to signatures has clinical value. However, identifying the etiology of the mutational signatures remains a statistical challenge, with both small sample sizes and high variability in classification algorithms posing barriers. As a result, few signatures have been strongly linked to particular risk factors. METHODS: Here, we develop a statistical model, Diffsig, for estimating the association of one or more continuous or categorical risk factors with DNA mutational signatures. Diffsig takes into account the uncertainty associated with assigning signatures to samples as well as multiple risk factors' simultaneous effect on observed DNA mutations. RESULTS: We applied Diffsig to breast cancer data to assess relationships between five established breast-relevant mutational signatures and etiologic variables, confirming known mechanisms of cancer development. In simulation, our model was capable of accurately estimating expected associations in a variety of contexts. CONCLUSIONS: Diffsig allows researchers to quantify and perform inference on the associations of risk factors with mutational signatures. IMPACT: We expect Diffsig to provide more robust associations of risk factors with signatures to lead to better understanding of the tumor development process and improved models of tumorigenesis.

Asunto(s)

Neoplasias de la Mama , Mutación , Humanos , Factores de Riesgo , Femenino , Neoplasias de la Mama/genética , Modelos Estadísticos , Algoritmos

14.

Adipose tissue eQTL meta-analysis reveals the contribution of allelic heterogeneity to gene expression regulation and cardiometabolic traits.

Brotman, Sarah M; El-Sayed Moustafa, Julia S; Guan, Li; Broadaway, K Alaine; Wang, Dongmeng; Jackson, Anne U; Welch, Ryan; Currin, Kevin W; Tomlinson, Max; Vadlamudi, Swarooparani; Stringham, Heather M; Roberts, Amy L; Lakka, Timo A; Oravilahti, Anniina; Silva, Lilian Fernandes; Narisu, Narisu; Erdos, Michael R; Yan, Tingfen; Bonnycastle, Lori L; Raulerson, Chelsea K; Raza, Yasrab; Yan, Xinyu; Parker, Stephen C J; Kuusisto, Johanna; Pajukanta, Päivi; Tuomilehto, Jaakko; Collins, Francis S; Boehnke, Michael; Love, Michael I; Koistinen, Heikki A; Laakso, Markku; Mohlke, Karen L; Small, Kerrin S; Scott, Laura J.

bioRxiv ; 2023 Oct 27.

Artículo en Inglés | MEDLINE | ID: mdl-37961277

RESUMEN

Complete characterization of the genetic effects on gene expression is needed to elucidate tissue biology and the etiology of complex traits. Here, we analyzed 2,344 subcutaneous adipose tissue samples and identified 34K conditionally distinct expression quantitative trait locus (eQTL) signals in 18K genes. Over half of eQTL genes exhibited at least two eQTL signals. Compared to primary signals, non-primary signals had lower effect sizes, lower minor allele frequencies, and less promoter enrichment; they corresponded to genes with higher heritability and higher tolerance for loss of function. Colocalization of eQTL with conditionally distinct genome-wide association study signals for 28 cardiometabolic traits identified 3,605 eQTL signals for 1,861 genes. Inclusion of non-primary eQTL signals increased colocalized signals by 46%. Among 30 genes with ≥2 pairs of colocalized signals, 21 showed a mediating gene dosage effect on the trait. Thus, expanded eQTL identification reveals more mechanisms underlying complex traits and improves understanding of the complexity of gene expression regulation.

15.

Systematic investigation of allelic regulatory activity of schizophrenia-associated common variants.

McAfee, Jessica C; Lee, Sool; Lee, Jiseok; Bell, Jessica L; Krupa, Oleh; Davis, Jessica; Insigne, Kimberly; Bond, Marielle L; Zhao, Nanxiang; Boyle, Alan P; Phanstiel, Douglas H; Love, Michael I; Stein, Jason L; Ruzicka, W Brad; Davila-Velderrain, Jose; Kosuri, Sriram; Won, Hyejung.

Cell Genom ; 3(10): 100404, 2023 Oct 11.

Artículo en Inglés | MEDLINE | ID: mdl-37868037

RESUMEN

Genome-wide association studies (GWASs) have successfully identified 145 genomic regions that contribute to schizophrenia risk, but linkage disequilibrium makes it challenging to discern causal variants. We performed a massively parallel reporter assay (MPRA) on 5,173 fine-mapped schizophrenia GWAS variants in primary human neural progenitors and identified 439 variants with allelic regulatory effects (MPRA-positive variants). Transcription factor binding had modest predictive power, while fine-map posterior probability, enhancer overlap, and evolutionary conservation failed to predict MPRA-positive variants. Furthermore, 64% of MPRA-positive variants did not exhibit expressive quantitative trait loci signature, suggesting that MPRA could identify yet unexplored variants with regulatory potentials. To predict the combinatorial effect of MPRA-positive variants on gene regulation, we propose an accessibility-by-contact model that combines MPRA-measured allelic activity with neuronal chromatin architecture.

16.

Comprehensive evaluation of methods for differential expression analysis of metatranscriptomics data.

Cho, Hunyong; Qu, Yixiang; Liu, Chuwen; Tang, Boyang; Lyu, Ruiqi; Lin, Bridget M; Roach, Jeffrey; Azcarate-Peril, M Andrea; Aguiar Ribeiro, Apoena; Love, Michael I; Divaris, Kimon; Wu, Di.

Brief Bioinform ; 24(5)2023 09 20.

Artículo en Inglés | MEDLINE | ID: mdl-37738402

RESUMEN

Understanding the function of the human microbiome is important but the development of statistical methods specifically for the microbial gene expression (i.e. metatranscriptomics) is in its infancy. Many currently employed differential expression analysis methods have been designed for different data types and have not been evaluated in metatranscriptomics settings. To address this gap, we undertook a comprehensive evaluation and benchmarking of 10 differential analysis methods for metatranscriptomics data. We used a combination of real and simulated data to evaluate performance (i.e. type I error, false discovery rate and sensitivity) of the following methods: log-normal (LN), logistic-beta (LB), MAST, DESeq2, metagenomeSeq, ANCOM-BC, LEfSe, ALDEx2, Kruskal-Wallis and two-part Kruskal-Wallis. The simulation was informed by supragingival biofilm microbiome data from 300 preschool-age children enrolled in a study of childhood dental disease (early childhood caries, ECC), whereas validations were sought in two additional datasets from the ECC study and an inflammatory bowel disease study. The LB test showed the highest sensitivity in both small and large samples and reasonably controlled type I error. Contrarily, MAST was hampered by inflated type I error. Upon application of the LN and LB tests in the ECC study, we found that genes C8PHV7 and C8PEV7, harbored by the lactate-producing Campylobacter gracilis, had the strongest association with childhood dental disease. This comprehensive model evaluation offers practical guidance for selection of appropriate methods for rigorous analyses of differential expression in metatranscriptomics. Selection of an optimal method increases the possibility of detecting true signals while minimizing the chance of claiming false ones.

Asunto(s)

Benchmarking , Enfermedades Estomatognáticas , Niño , Humanos , Preescolar , Biopelículas , Simulación por Computador , Ácido Láctico

17.

Joint genotypic and phenotypic outcome modeling improves base editing variant effect quantification.

Ryu, Jayoung; Barkal, Sam; Yu, Tian; Jankowiak, Martin; Zhou, Yunzhuo; Francoeur, Matthew; Phan, Quang Vinh; Li, Zhijian; Tognon, Manuel; Brown, Lara; Love, Michael I; Lettre, Guillaume; Ascher, David B; Cassa, Christopher A; Sherwood, Richard I; Pinello, Luca.

medRxiv ; 2023 Sep 10.

Artículo en Inglés | MEDLINE | ID: mdl-37732177

RESUMEN

CRISPR base editing screens are powerful tools for studying disease-associated variants at scale. However, the efficiency and precision of base editing perturbations vary, confounding the assessment of variant-induced phenotypic effects. Here, we provide an integrated pipeline that improves the estimation of variant impact in base editing screens. We perform high-throughput ABE8e-SpRY base editing screens with an integrated reporter construct to measure the editing efficiency and outcomes of each gRNA alongside their phenotypic consequences. We introduce BEAN, a Bayesian network that accounts for per-guide editing outcomes and target site chromatin accessibility to estimate variant impacts. We show this pipeline attains superior performance compared to existing tools in variant classification and effect size quantification. We use BEAN to pinpoint common variants that alter LDL uptake, implicating novel genes. Additionally, through saturation base editing of LDLR, we enable accurate quantitative prediction of the effects of missense variants on LDL-C levels, which aligns with measurements in UK Biobank individuals, and identify structural mechanisms underlying variant pathogenicity. This work provides a widely applicable approach to improve the power of base editor screens for disease-associated variant characterization.

18.

Genetics of cell-type-specific post-transcriptional gene regulation during human neurogenesis.

Aygün, Nil; Krupa, Oleh; Mory, Jessica; Le, Brandon; Valone, Jordan; Liang, Dan; Love, Michael I; Stein, Jason L.

bioRxiv ; 2023 Sep 01.

Artículo en Inglés | MEDLINE | ID: mdl-37693528

RESUMEN

The function of some genetic variants associated with brain-relevant traits has been explained through colocalization with expression quantitative trait loci (eQTL) conducted in bulk post-mortem adult brain tissue. However, many brain-trait associated loci have unknown cellular or molecular function. These genetic variants may exert context-specific function on different molecular phenotypes including post-transcriptional changes. Here, we identified genetic regulation of RNA-editing and alternative polyadenylation (APA), within a cell-type-specific population of human neural progenitors and neurons. More RNA-editing and isoforms utilizing longer polyadenylation sequences were observed in neurons, likely due to higher expression of genes encoding the proteins mediating these post-transcriptional events. We also detected hundreds of cell-type-specific editing quantitative trait loci (edQTLs) and alternative polyadenylation QTLs (apaQTLs). We found colocalizations of a neuron edQTL in CCDC88A with educational attainment and a progenitor apaQTL in EP300 with schizophrenia, suggesting genetically mediated post-transcriptional regulation during brain development lead to differences in brain function.

19.

Chromatin loop dynamics during cellular differentiation are associated with changes to both anchor and internal regulatory features.

Bond, Marielle L; Davis, Eric S; Quiroga, Ivana Y; Dey, Anubha; Kiran, Manjari; Love, Michael I; Won, Hyejung; Phanstiel, Douglas H.

Genome Res ; 33(8): 1258-1268, 2023 08.

Artículo en Inglés | MEDLINE | ID: mdl-37699658

RESUMEN

Three-dimensional (3D) chromatin structure has been shown to play a role in regulating gene transcription during biological transitions. Although our understanding of loop formation and maintenance is rapidly improving, much less is known about the mechanisms driving changes in looping and the impact of differential looping on gene transcription. One limitation has been a lack of well-powered differential looping data sets. To address this, we conducted a deeply sequenced Hi-C time course of megakaryocyte development comprising four biological replicates and 6 billion reads per time point. Statistical analysis revealed 1503 differential loops. Gained loop anchors were enriched for AP-1 occupancy and were characterized by large increases in histone H3K27ac (over 11-fold) but relatively small increases in CTCF and RAD21 binding (1.26- and 1.23-fold, respectively). Linear modeling revealed that changes in histone H3K27ac, chromatin accessibility, and JUN binding were better correlated with changes in looping than RAD21 and almost as well correlated as CTCF. Changes to epigenetic features between-rather than at-boundaries were highly predictive of changes in looping. Together these data suggest that although CTCF and RAD21 may be the core machinery dictating where loops form, other features (both at the anchors and within the loop boundaries) may play a larger role than previously anticipated in determining the relative loop strength across cell types and conditions.

Asunto(s)

Cromatina , Histonas , Histonas/metabolismo , Factor de Unión a CCCTC/genética , Factor de Unión a CCCTC/metabolismo , Cromatina/genética , Cromosomas/metabolismo , Diferenciación Celular/genética

20.

Cell-Type Composition Affects Adipose Gene Expression Associations With Cardiometabolic Traits.

Brotman, Sarah M; Oravilahti, Anniina; Rosen, Jonathan D; Alvarez, Marcus; Heinonen, Sini; van der Kolk, Birgitta W; Fernandes Silva, Lilian; Perrin, Hannah J; Vadlamudi, Swarooparani; Pylant, Cortney; Deochand, Sonia; Basta, Patricia V; Valone, Jordan M; Narain, Morgan N; Stringham, Heather M; Boehnke, Michael; Kuusisto, Johanna; Love, Michael I; Pietiläinen, Kirsi H; Pajukanta, Päivi; Laakso, Markku; Mohlke, Karen L.

Diabetes ; 72(11): 1707-1718, 2023 11 01.

Artículo en Inglés | MEDLINE | ID: mdl-37647564

RESUMEN

Understanding differences in adipose gene expression between individuals with different levels of clinical traits may reveal the genes and mechanisms leading to cardiometabolic diseases. However, adipose is a heterogeneous tissue. To account for cell-type heterogeneity, we estimated cell-type proportions in 859 subcutaneous adipose tissue samples with bulk RNA sequencing (RNA-seq) using a reference single-nuclear RNA-seq data set. Cell-type proportions were associated with cardiometabolic traits; for example, higher macrophage and adipocyte proportions were associated with higher and lower BMI, respectively. We evaluated cell-type proportions and BMI as covariates in tests of association between >25,000 gene expression levels and 22 cardiometabolic traits. For >95% of genes, the optimal, or best-fit, models included BMI as a covariate, and for 79% of associations, the optimal models also included cell type. After adjusting for the optimal covariates, we identified 2,664 significant associations (P ≤ 2e-6) for 1,252 genes and 14 traits. Among genes proposed to affect cardiometabolic traits based on colocalized genome-wide association study and adipose expression quantitative trait locus signals, 25 showed a corresponding association between trait and gene expression levels. Overall, these results suggest the importance of modeling cell-type proportion when identifying gene expression associations with cardiometabolic traits.

Asunto(s)

Enfermedades Cardiovasculares , Estudio de Asociación del Genoma Completo , Humanos , Índice de Masa Corporal , Obesidad/genética , Expresión Génica , Enfermedades Cardiovasculares/genética

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA