Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 7 de 7
Filtrar
Mais filtros








Base de dados
Intervalo de ano de publicação
1.
Nat Genet ; 56(7): 1482-1493, 2024 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-38811841

RESUMO

Clustered regularly interspaced short palindromic repeats (CRISPR)-CRISPR-associated protein 9 (Cas9) is a powerful tool for introducing targeted mutations in DNA, but recent studies have shown that it can have unintended effects such as structural changes. However, these studies have not yet looked genome wide or across data types. Here we performed a phenotypic CRISPR-Cas9 scan targeting 17,065 genes in primary human cells, revealing a 'proximity bias' in which CRISPR knockouts show unexpected similarities to unrelated genes on the same chromosome arm. This bias was found to be consistent across cell types, laboratories, Cas9 delivery methods and assay modalities, and the data suggest that it is caused by telomeric truncations of chromosome arms, with cell cycle and apoptotic pathways playing a mediating role. Additionally, a simple correction is demonstrated to mitigate this pervasive bias while preserving biological relationships. This previously uncharacterized effect has implications for functional genomic studies using CRISPR-Cas9, with applications in discovery biology, drug-target identification, cell therapies and genetic therapeutics.


Assuntos
Sistemas CRISPR-Cas , Edição de Genes , Humanos , Edição de Genes/métodos , Mapeamento Cromossômico/métodos , Genoma Humano
2.
bioRxiv ; 2024 Mar 17.
Artigo em Inglês | MEDLINE | ID: mdl-38559197

RESUMO

Clinically and biologically valuable information may reside untapped in large cancer gene expression data sets. Deep unsupervised learning has the potential to extract this information with unprecedented efficacy but has thus far been hampered by a lack of biological interpretability and robustness. Here, we present DeepProfile, a comprehensive framework that addresses current challenges in applying unsupervised deep learning to gene expression profiles. We use DeepProfile to learn low-dimensional latent spaces for 18 human cancers from 50,211 transcriptomes. DeepProfile outperforms existing dimensionality reduction methods with respect to biological interpretability. Using DeepProfile interpretability methods, we show that genes that are universally important in defining the latent spaces across all cancer types control immune cell activation, while cancer type-specific genes and pathways define molecular disease subtypes. By linking DeepProfile latent variables to secondary tumor characteristics, we discover that tumor mutation burden is closely associated with the expression of cell cycle-related genes. DNA mismatch repair and MHC class II antigen presentation pathway expression, on the other hand, are consistently associated with patient survival. We validate these results through Kaplan-Meier analyses and nominate tumor-associated macrophages as an important source of survival-correlated MHC class II transcripts. Our results illustrate the power of unsupervised deep learning for discovery of novel cancer biology from existing gene expression data.

3.
Nat Biomed Eng ; 7(6): 811-829, 2023 06.
Artigo em Inglês | MEDLINE | ID: mdl-37127711

RESUMO

Machine learning may aid the choice of optimal combinations of anticancer drugs by explaining the molecular basis of their synergy. By combining accurate models with interpretable insights, explainable machine learning promises to accelerate data-driven cancer pharmacology. However, owing to the highly correlated and high-dimensional nature of transcriptomic data, naively applying current explainable machine-learning strategies to large transcriptomic datasets leads to suboptimal outcomes. Here by using feature attribution methods, we show that the quality of the explanations can be increased by leveraging ensembles of explainable machine-learning models. We applied the approach to a dataset of 133 combinations of 46 anticancer drugs tested in ex vivo tumour samples from 285 patients with acute myeloid leukaemia and uncovered a haematopoietic-differentiation signature underlying drug combinations with therapeutic synergy. Ensembles of machine-learning models trained to predict drug combination synergies on the basis of gene-expression data may improve the feature attribution quality of complex machine-learning models.


Assuntos
Perfilação da Expressão Gênica , Aprendizado de Máquina , Humanos , Transcriptoma
4.
Genome Biol ; 24(1): 81, 2023 04 19.
Artigo em Inglês | MEDLINE | ID: mdl-37076856

RESUMO

As interest in using unsupervised deep learning models to analyze gene expression data has grown, an increasing number of methods have been developed to make these models more interpretable. These methods can be separated into two groups: post hoc analyses of black box models through feature attribution methods and approaches to build inherently interpretable models through biologically-constrained architectures. We argue that these approaches are not mutually exclusive, but can in fact be usefully combined. We propose PAUSE ( https://github.com/suinleelab/PAUSE ), an unsupervised pathway attribution method that identifies major sources of transcriptomic variation when combined with biologically-constrained neural network models.


Assuntos
Perfilação da Expressão Gênica , Transcriptoma , Redes Neurais de Computação
5.
Nat Commun ; 12(1): 5369, 2021 09 10.
Artigo em Inglês | MEDLINE | ID: mdl-34508095

RESUMO

Deep neural networks (DNNs) capture complex relationships among variables, however, because they require copious samples, their potential has yet to be fully tapped for understanding relationships between gene expression and human phenotypes. Here we introduce an analysis framework, namely MD-AD (Multi-task Deep learning for Alzheimer's Disease neuropathology), which leverages an unexpected synergy between DNNs and multi-cohort settings. In these settings, true joint analysis can be stymied using conventional statistical methods, which require "harmonized" phenotypes and tend to capture cohort-level variations, obscuring subtler true disease signals. Instead, MD-AD incorporates related phenotypes sparsely measured across cohorts, and learns interactions between genes and phenotypes not discovered using linear models, identifying subtler signals than cohort-level variations which can be uniquely recapitulated in animal models and across tissues. We show that MD-AD exploits sex-specific relationships between microglial immune response and neuropathology, providing a nuanced context for the association between inflammatory genes and Alzheimer's Disease.


Assuntos
Doença de Alzheimer/genética , Encéfalo/patologia , Aprendizado Profundo , Regulação da Expressão Gênica/imunologia , Microglia/imunologia , Idoso , Idoso de 80 Anos ou mais , Doença de Alzheimer/complicações , Doença de Alzheimer/patologia , Animais , Encéfalo/citologia , Encéfalo/imunologia , Estudos de Coortes , Conjuntos de Dados como Assunto , Feminino , Humanos , Masculino , Camundongos , Microglia/patologia , RNA-Seq , Fatores Sexuais
6.
Nat Commun ; 9(1): 42, 2018 01 03.
Artigo em Inglês | MEDLINE | ID: mdl-29298978

RESUMO

Cancers that appear pathologically similar often respond differently to the same drug regimens. Methods to better match patients to drugs are in high demand. We demonstrate a promising approach to identify robust molecular markers for targeted treatment of acute myeloid leukemia (AML) by introducing: data from 30 AML patients including genome-wide gene expression profiles and in vitro sensitivity to 160 chemotherapy drugs, a computational method to identify reliable gene expression markers for drug sensitivity by incorporating multi-omic prior information relevant to each gene's potential to drive cancer. We show that our method outperforms several state-of-the-art approaches in identifying molecular markers replicated in validation data and predicting drug sensitivity accurately. Finally, we identify SMARCA4 as a marker and driver of sensitivity to topoisomerase II inhibitors, mitoxantrone, and etoposide, in AML by showing that cell lines transduced to have high SMARCA4 expression reveal dramatically increased sensitivity to these agents.


Assuntos
DNA Helicases/genética , Resistencia a Medicamentos Antineoplásicos/genética , Leucemia Mieloide Aguda/genética , Aprendizado de Máquina , Proteínas Nucleares/genética , Medicina de Precisão/métodos , Fatores de Transcrição/genética , Algoritmos , Antineoplásicos/farmacologia , Antineoplásicos/uso terapêutico , Biomarcadores Tumorais/metabolismo , Linhagem Celular , Conjuntos de Dados como Assunto , Etoposídeo/farmacologia , Etoposídeo/uso terapêutico , Humanos , Leucemia Mieloide Aguda/tratamento farmacológico , Inibidores da Topoisomerase II/farmacologia , Inibidores da Topoisomerase II/uso terapêutico
7.
Genome Med ; 8(1): 66, 2016 06 10.
Artigo em Inglês | MEDLINE | ID: mdl-27287041

RESUMO

Patterns in expression data conserved across multiple independent disease studies are likely to represent important molecular events underlying the disease. We present the INSPIRE method to infer modules of co-expressed genes and the dependencies among the modules from multiple expression datasets that may contain different sets of genes. We show that INSPIRE infers more accurate models than existing methods to extract low-dimensional representation of expression data. We demonstrate that applying INSPIRE to nine ovarian cancer datasets leads to a new marker and potential driver of tumor-associated stroma, HOPX, followed by experimental validation. The implementation of INSPIRE is available at http://inspire.cs.washington.edu .


Assuntos
Biomarcadores Tumorais/genética , Biologia Computacional/métodos , Proteínas de Homeodomínio/genética , Neoplasias Ovarianas/genética , Proteínas Supressoras de Tumor/genética , Bases de Dados Genéticas , Feminino , Perfilação da Expressão Gênica , Regulação Neoplásica da Expressão Gênica , Proteínas de Homeodomínio/metabolismo , Humanos , Proteínas Supressoras de Tumor/metabolismo , Aprendizado de Máquina não Supervisionado
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA