GRAM: A GeneRAlized Model to predict the molecular effect of a non-coding variant in a cell-type specific manner.
Lou, Shaoke; Cotter, Kellie A; Li, Tianxiao; Liang, Jin; Mohsen, Hussein; Liu, Jason; Zhang, Jing; Cohen, Sandra; Xu, Jinrui; Yu, Haiyuan; Rubin, Mark A; Gerstein, Mark.
Affiliation
  • Lou S; Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America.
  • Cotter KA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, United States of America.
  • Li T; Department for BioMedical Research, University of Bern, CH, Bern, Switzerland.
  • Liang J; Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America.
  • Mohsen H; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, United States of America.
  • Liu J; Weill Institute for Cell and Molecular Biology, Cornell University, Ithaca, New York, United States of America.
  • Zhang J; Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America.
  • Cohen S; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, United States of America.
  • Xu J; Program in the History of Science and Medicine, Yale University, New Haven, Connecticut, United States of America.
  • Yu H; Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America.
  • Rubin MA; Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut, United States of America.
  • Gerstein M; Program in Computational Biology and Bioinformatics, Yale University, New Haven, Connecticut, United States of America.
PLoS Genet; 15(8): e1007860, 2019 Aug.
Article in English | MEDLINE | ID: mdl-31469829
There has been much effort to prioritize genomic variants with respect to their impact on "function". However, function is often not precisely defined: sometimes it is the disease association of a variant; on other occasions, it reflects a molecular effect on transcription or epigenetics. Here, we coupled multiple genomic predictors to build GRAM, a GeneRAlized Model, to predict a well-defined experimental target: the expression-modulating effect of a non-coding variant on its associated gene, in a transferable, cell-specific manner. First, we performed feature engineering: using LASSO, a regularized linear model, we found transcription factor (TF) binding to be the most predictive feature, especially for TFs that are hubs in the regulatory network; in contrast, evolutionary conservation, a popular feature in many other variant-impact predictors, contributes almost nothing. Moreover, TF binding inferred from in vitro SELEX is as effective as that from in vivo ChIP-Seq. Second, we implemented GRAM integrating only SELEX features and expression profiles; thus, the program combines a universal regulatory score with an easily obtainable modifier reflecting the particular cell type. We benchmarked GRAM on large-scale MPRA datasets, achieving AUROC scores of 0.72 in GM12878 and 0.66 in a multi-cell line dataset. We then evaluated the performance of GRAM on targeted regions using luciferase assays in the MCF7 and K562 cell lines. We noted that changing the insertion position of the construct relative to the reporter gene gave very different results, highlighting the importance of carefully defining the exact prediction target of the model. Finally, we illustrated the utility of GRAM in fine-mapping causal variants and developed a practical software pipeline to carry this out. In particular, we demonstrated in specific examples how the pipeline could pinpoint variants that directly modulate gene expression within a larger linkage-disequilibrium block associated with a phenotype of interest (e.g., for an eQTL).
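For readers unfamiliar with the AUROC figures quoted in the abstract, the following is a minimal sketch of how such a score can be computed from binary labels and predicted scores. It is not the authors' evaluation code: the function name `auroc` and the toy labels and scores are hypothetical, chosen only to illustrate the metric as the probability that a randomly chosen positive example is ranked above a randomly chosen negative one.

```python
def auroc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    labels: iterable of 0/1 values (1 = positive, e.g. an
            expression-modulating variant in a reporter assay).
    scores: iterable of predicted scores, same length as labels.
    Hypothetical helper for illustration; not part of GRAM.
    """
    pos = [s for l, s in zip(labels, scores) if l == 1]
    neg = [s for l, s in zip(labels, scores) if l == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")

    # Count positive/negative pairs ranked correctly; ties count as half.
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Toy data (made up): two positives, two negatives.
toy_labels = [1, 1, 0, 0]
toy_scores = [0.9, 0.4, 0.6, 0.1]
toy_auc = auroc(toy_labels, toy_scores)  # 3 of 4 pairs ranked correctly
```

On this scale, 0.5 corresponds to random ranking and 1.0 to perfect separation, which gives context for the reported values of 0.72 and 0.66.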
Subject(s)

Full text: 1 | Databases: MEDLINE | Main subject: Genetic Variation / Gene Expression Regulation / Sequence Analysis, DNA | Type of study: Prognostic_studies / Risk_factors_studies | Limits: Humans | Language: En | Journal: PLoS Genet | Journal subject: GENETICS | Year: 2019 | Document type: Article | Country of affiliation: United States
