Búsqueda | Portal Regional de la BVS

High-resolution genome-wide mapping of chromosome-arm-scale truncations induced by CRISPR-Cas9 editing.

Lazar, Nathan H; Celik, Safiye; Chen, Lu; Fay, Marta M; Irish, Jonathan C; Jensen, James; Tillinghast, Conor A; Urbanik, John; Bone, William P; Gibson, Christopher C; Haque, Imran S.

Nat Genet ; 56(7): 1482-1493, 2024 Jul.

Artículo en Inglés | MEDLINE | ID: mdl-38811841

RESUMEN

Clustered regularly interspaced short palindromic repeats (CRISPR)-CRISPR-associated protein 9 (Cas9) is a powerful tool for introducing targeted mutations in DNA, but recent studies have shown that it can have unintended effects such as structural changes. However, these studies have not yet looked genome wide or across data types. Here we performed a phenotypic CRISPR-Cas9 scan targeting 17,065 genes in primary human cells, revealing a 'proximity bias' in which CRISPR knockouts show unexpected similarities to unrelated genes on the same chromosome arm. This bias was found to be consistent across cell types, laboratories, Cas9 delivery methods and assay modalities, and the data suggest that it is caused by telomeric truncations of chromosome arms, with cell cycle and apoptotic pathways playing a mediating role. Additionally, a simple correction is demonstrated to mitigate this pervasive bias while preserving biological relationships. This previously uncharacterized effect has implications for functional genomic studies using CRISPR-Cas9, with applications in discovery biology, drug-target identification, cell therapies and genetic therapeutics.

Asunto(s)

Sistemas CRISPR-Cas , Edición Génica , Humanos , Edición Génica/métodos , Mapeo Cromosómico/métodos , Genoma Humano

A deep profile of gene expression across 18 human cancers.

Qiu, Wei; Dincer, Ayse B; Janizek, Joseph D; Celik, Safiye; Pittet, Mikael; Naxerova, Kamila; Lee, Su-In.

bioRxiv ; 2024 Mar 17.

Artículo en Inglés | MEDLINE | ID: mdl-38559197

RESUMEN

Clinically and biologically valuable information may reside untapped in large cancer gene expression data sets. Deep unsupervised learning has the potential to extract this information with unprecedented efficacy but has thus far been hampered by a lack of biological interpretability and robustness. Here, we present DeepProfile, a comprehensive framework that addresses current challenges in applying unsupervised deep learning to gene expression profiles. We use DeepProfile to learn low-dimensional latent spaces for 18 human cancers from 50,211 transcriptomes. DeepProfile outperforms existing dimensionality reduction methods with respect to biological interpretability. Using DeepProfile interpretability methods, we show that genes that are universally important in defining the latent spaces across all cancer types control immune cell activation, while cancer type-specific genes and pathways define molecular disease subtypes. By linking DeepProfile latent variables to secondary tumor characteristics, we discover that tumor mutation burden is closely associated with the expression of cell cycle-related genes. DNA mismatch repair and MHC class II antigen presentation pathway expression, on the other hand, are consistently associated with patient survival. We validate these results through Kaplan-Meier analyses and nominate tumor-associated macrophages as an important source of survival-correlated MHC class II transcripts. Our results illustrate the power of unsupervised deep learning for discovery of novel cancer biology from existing gene expression data.

Uncovering expression signatures of synergistic drug responses via ensembles of explainable machine-learning models.

Janizek, Joseph D; Dincer, Ayse B; Celik, Safiye; Chen, Hugh; Chen, William; Naxerova, Kamila; Lee, Su-In.

Nat Biomed Eng ; 7(6): 811-829, 2023 06.

Artículo en Inglés | MEDLINE | ID: mdl-37127711

RESUMEN

Machine learning may aid the choice of optimal combinations of anticancer drugs by explaining the molecular basis of their synergy. By combining accurate models with interpretable insights, explainable machine learning promises to accelerate data-driven cancer pharmacology. However, owing to the highly correlated and high-dimensional nature of transcriptomic data, naively applying current explainable machine-learning strategies to large transcriptomic datasets leads to suboptimal outcomes. Here by using feature attribution methods, we show that the quality of the explanations can be increased by leveraging ensembles of explainable machine-learning models. We applied the approach to a dataset of 133 combinations of 46 anticancer drugs tested in ex vivo tumour samples from 285 patients with acute myeloid leukaemia and uncovered a haematopoietic-differentiation signature underlying drug combinations with therapeutic synergy. Ensembles of machine-learning models trained to predict drug combination synergies on the basis of gene-expression data may improve the feature attribution quality of complex machine-learning models.

Asunto(s)

Perfilación de la Expresión Génica , Aprendizaje Automático , Humanos , Transcriptoma

PAUSE: principled feature attribution for unsupervised gene expression analysis.

Janizek, Joseph D; Spiro, Anna; Celik, Safiye; Blue, Ben W; Russell, John C; Lee, Ting-I; Kaeberlin, Matt; Lee, Su-In.

Genome Biol ; 24(1): 81, 2023 04 19.

Artículo en Inglés | MEDLINE | ID: mdl-37076856

RESUMEN

As interest in using unsupervised deep learning models to analyze gene expression data has grown, an increasing number of methods have been developed to make these models more interpretable. These methods can be separated into two groups: post hoc analyses of black box models through feature attribution methods and approaches to build inherently interpretable models through biologically-constrained architectures. We argue that these approaches are not mutually exclusive, but can in fact be usefully combined. We propose PAUSE ( https://github.com/suinleelab/PAUSE ), an unsupervised pathway attribution method that identifies major sources of transcriptomic variation when combined with biologically-constrained neural network models.

Asunto(s)

Perfilación de la Expresión Génica , Transcriptoma , Redes Neurales de la Computación

Unified AI framework to uncover deep interrelationships between gene expression and Alzheimer's disease neuropathologies.

Beebe-Wang, Nicasia; Celik, Safiye; Weinberger, Ethan; Sturmfels, Pascal; De Jager, Philip L; Mostafavi, Sara; Lee, Su-In.

Nat Commun ; 12(1): 5369, 2021 09 10.

Artículo en Inglés | MEDLINE | ID: mdl-34508095

RESUMEN

Deep neural networks (DNNs) capture complex relationships among variables, however, because they require copious samples, their potential has yet to be fully tapped for understanding relationships between gene expression and human phenotypes. Here we introduce an analysis framework, namely MD-AD (Multi-task Deep learning for Alzheimer's Disease neuropathology), which leverages an unexpected synergy between DNNs and multi-cohort settings. In these settings, true joint analysis can be stymied using conventional statistical methods, which require "harmonized" phenotypes and tend to capture cohort-level variations, obscuring subtler true disease signals. Instead, MD-AD incorporates related phenotypes sparsely measured across cohorts, and learns interactions between genes and phenotypes not discovered using linear models, identifying subtler signals than cohort-level variations which can be uniquely recapitulated in animal models and across tissues. We show that MD-AD exploits sex-specific relationships between microglial immune response and neuropathology, providing a nuanced context for the association between inflammatory genes and Alzheimer's Disease.

Asunto(s)

Enfermedad de Alzheimer/genética , Encéfalo/patología , Aprendizaje Profundo , Regulación de la Expresión Génica/inmunología , Microglía/inmunología , Anciano , Anciano de 80 o más Años , Enfermedad de Alzheimer/complicaciones , Enfermedad de Alzheimer/patología , Animales , Encéfalo/citología , Encéfalo/inmunología , Estudios de Cohortes , Conjuntos de Datos como Asunto , Femenino , Humanos , Masculino , Ratones , Microglía/patología , RNA-Seq , Factores Sexuales

A machine learning approach to integrate big data for precision medicine in acute myeloid leukemia.

Lee, Su-In; Celik, Safiye; Logsdon, Benjamin A; Lundberg, Scott M; Martins, Timothy J; Oehler, Vivian G; Estey, Elihu H; Miller, Chris P; Chien, Sylvia; Dai, Jin; Saxena, Akanksha; Blau, C Anthony; Becker, Pamela S.

Nat Commun ; 9(1): 42, 2018 01 03.

Artículo en Inglés | MEDLINE | ID: mdl-29298978

RESUMEN

Cancers that appear pathologically similar often respond differently to the same drug regimens. Methods to better match patients to drugs are in high demand. We demonstrate a promising approach to identify robust molecular markers for targeted treatment of acute myeloid leukemia (AML) by introducing: data from 30 AML patients including genome-wide gene expression profiles and in vitro sensitivity to 160 chemotherapy drugs, a computational method to identify reliable gene expression markers for drug sensitivity by incorporating multi-omic prior information relevant to each gene's potential to drive cancer. We show that our method outperforms several state-of-the-art approaches in identifying molecular markers replicated in validation data and predicting drug sensitivity accurately. Finally, we identify SMARCA4 as a marker and driver of sensitivity to topoisomerase II inhibitors, mitoxantrone, and etoposide, in AML by showing that cell lines transduced to have high SMARCA4 expression reveal dramatically increased sensitivity to these agents.

Asunto(s)

ADN Helicasas/genética , Resistencia a Antineoplásicos/genética , Leucemia Mieloide Aguda/genética , Aprendizaje Automático , Proteínas Nucleares/genética , Medicina de Precisión/métodos , Factores de Transcripción/genética , Algoritmos , Antineoplásicos/farmacología , Antineoplásicos/uso terapéutico , Biomarcadores de Tumor/metabolismo , Línea Celular , Conjuntos de Datos como Asunto , Etopósido/farmacología , Etopósido/uso terapéutico , Humanos , Leucemia Mieloide Aguda/tratamiento farmacológico , Inhibidores de Topoisomerasa II/farmacología , Inhibidores de Topoisomerasa II/uso terapéutico

Extracting a low-dimensional description of multiple gene expression datasets reveals a potential driver for tumor-associated stroma in ovarian cancer.

Celik, Safiye; Logsdon, Benjamin A; Battle, Stephanie; Drescher, Charles W; Rendi, Mara; Hawkins, R David; Lee, Su-In.

Genome Med ; 8(1): 66, 2016 06 10.

Artículo en Inglés | MEDLINE | ID: mdl-27287041

RESUMEN

Patterns in expression data conserved across multiple independent disease studies are likely to represent important molecular events underlying the disease. We present the INSPIRE method to infer modules of co-expressed genes and the dependencies among the modules from multiple expression datasets that may contain different sets of genes. We show that INSPIRE infers more accurate models than existing methods to extract low-dimensional representation of expression data. We demonstrate that applying INSPIRE to nine ovarian cancer datasets leads to a new marker and potential driver of tumor-associated stroma, HOPX, followed by experimental validation. The implementation of INSPIRE is available at http://inspire.cs.washington.edu .

Asunto(s)

Biomarcadores de Tumor/genética , Biología Computacional/métodos , Proteínas de Homeodominio/genética , Neoplasias Ováricas/genética , Proteínas Supresoras de Tumor/genética , Bases de Datos Genéticas , Femenino , Perfilación de la Expresión Génica , Regulación Neoplásica de la Expresión Génica , Proteínas de Homeodominio/metabolismo , Humanos , Proteínas Supresoras de Tumor/metabolismo , Aprendizaje Automático no Supervisado

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA