Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 6 de 6
Filtrar
Mais filtros











Base de dados
Intervalo de ano de publicação
1.
Genome Med ; 13(1): 112, 2021 07 14.
Artigo em Inglês | MEDLINE | ID: mdl-34261540

RESUMO

Multi-omics data are good resources for prognosis and survival prediction; however, these are difficult to integrate computationally. We introduce DeepProg, a novel ensemble framework of deep-learning and machine-learning approaches that robustly predicts patient survival subtypes using multi-omics data. It identifies two optimal survival subtypes in most cancers and yields significantly better risk-stratification than other multi-omics integration methods. DeepProg is highly predictive, exemplified by two liver cancer (C-index 0.73-0.80) and five breast cancer datasets (C-index 0.68-0.73). Pan-cancer analysis associates common genomic signatures in poor survival subtypes with extracellular matrix modeling, immune deregulation, and mitosis processes. DeepProg is freely available at https://github.com/lanagarmire/DeepProg.


Assuntos
Biologia Computacional/métodos , Aprendizado Profundo , Aprendizado de Máquina , Software , Algoritmos , Bases de Dados Genéticas , Feminino , Regulação Neoplásica da Expressão Gênica , Genômica/métodos , Humanos , Modelos Teóricos , Neoplasias/diagnóstico , Neoplasias/etiologia , Neoplasias/metabolismo , Neoplasias/mortalidade , Prognóstico , Reprodutibilidade dos Testes , Navegador
2.
JCI Insight ; 6(2)2021 01 25.
Artigo em Inglês | MEDLINE | ID: mdl-33320836

RESUMO

The G/T transversion rs35705950, located approximately 3 kb upstream of the MUC5B start site, is the cardinal risk factor for idiopathic pulmonary fibrosis (IPF). Here, we investigate the function and chromatin structure of this -3 kb region and provide evidence that it functions as a classically defined enhancer subject to epigenetic programming. We use nascent transcript analysis to show that RNA polymerase II loads within 10 bp of the G/T transversion site, definitively establishing enhancer function for the region. By integrating Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq) analysis of fresh and cultured human airway epithelial cells with nuclease sensitivity data, we demonstrate that this region is in accessible chromatin that affects the expression of MUC5B. Through applying paired single-nucleus RNA- and ATAC-seq to frozen tissue from IPF lungs, we extend these findings directly to disease, with results indicating that epigenetic programming of the -3 kb enhancer in IPF occurs in both MUC5B-expressing and nonexpressing lineages. In aggregate, our results indicate that the MUC5B-associated variant rs35705950 resides within an enhancer that is subject to epigenetic remodeling and contributes to pathologic misexpression in IPF.


Assuntos
Fibrose Pulmonar Idiopática/genética , Mucina-5B/genética , Células A549 , Sítios de Ligação/genética , Linhagem Celular , Cromatina/genética , Cromatina/metabolismo , Elementos Facilitadores Genéticos , Epigênese Genética , Mutação com Ganho de Função , Regulação da Expressão Gênica , Predisposição Genética para Doença , Humanos , Fibrose Pulmonar Idiopática/metabolismo , Pulmão/metabolismo , Modelos Genéticos , Polimorfismo de Nucleotídeo Único , Proteínas Proto-Oncogênicas c-ets/metabolismo , RNA Polimerase II/metabolismo , Fator de Transcrição STAT3/metabolismo
3.
Clin Cancer Res ; 25(2): 463-472, 2019 01 15.
Artigo em Inglês | MEDLINE | ID: mdl-30242023

RESUMO

Although driver genes in hepatocellular carcinoma (HCC) have been investigated in various previous genetic studies, prevalence of key driver genes among heterogeneous populations is unknown. Moreover, the phenotypic associations of these driver genes are poorly understood. This report aims to reveal the phenotypic impacts of a group of consensus driver genes in HCC. We used MutSigCV and OncodriveFM modules implemented in the IntOGen pipeline to identify consensus driver genes across six HCC cohorts comprising 1,494 samples in total. To access their global impacts, we used The Cancer Genome Atlas (TCGA) mutations and copy-number variations to predict the transcriptomics data, under generalized linear models. We further investigated the associations of the consensus driver genes to patient survival, age, gender, race, and risk factors. We identify 10 consensus driver genes across six HCC cohorts in total. Integrative analysis of driver mutations, copy-number variations, and transcriptomic data reveals that these consensus driver mutations and their copy-number variations are associated with a majority (62.5%) of the mRNA transcriptome but only a small fraction (8.9%) of miRNAs. Genes associated with TP53, CTNNB1, and ARID1A mutations contribute to the tripod of most densely connected pathway clusters. These driver genes are significantly associated with patients' overall survival. Some driver genes are significantly linked to HCC gender (CTNNB1, ALB, TP53, and AXIN1), race (TP53 and CDKN2A), and age (RB1) disparities. This study prioritizes a group of consensus drivers in HCC, which collectively show vast impacts on the phenotypes. These driver genes may warrant as valuable therapeutic targets of HCC.


Assuntos
Carcinoma Hepatocelular/diagnóstico , Carcinoma Hepatocelular/genética , Predisposição Genética para Doença , Neoplasias Hepáticas/diagnóstico , Neoplasias Hepáticas/genética , Oncogenes , Fenótipo , Algoritmos , Biologia Computacional/métodos , Perfilação da Expressão Gênica , Regulação Neoplásica da Expressão Gênica , Redes Reguladoras de Genes , Estudos de Associação Genética , Humanos , Modelos Biológicos , Mutação , Transcriptoma
4.
AMIA Jt Summits Transl Sci Proc ; 2017: 197-206, 2018.
Artigo em Inglês | MEDLINE | ID: mdl-29888072

RESUMO

We propose an unsupervised multi-omics integration pipeline, using deep-learning autoencoder algorithm, to predict the survival subtypes in bladder cancer (BC). We used TCGA dataset comprising mRNA, miRNA and methylation to infer two survival subtypes. We then constructed a supervised classification model to predict the survival subgroups of any new individual sample. Our training data gave two subgroups with significant survival differences (p-value=8e-4), where high-risk survival subgroup was enriched with KRT6/14 overexpression and PI3K-Akt pathways. We tested the robustness of model by randomly splitting the main dataset into multiple training and test folds, which gave overall significant p-values. Then, we successfully inferred the subtypes for a subset of samples kept as test dataset (p-value=0.03). We further applied our pipeline to predict the survival subgroups from another validation dataset with miRNA data (p-value=0.02). Conclusively, present pipeline is an effective approach to infer the survival subtype of a new sample, exemplified by BC.

5.
Clin Cancer Res ; 24(6): 1248-1259, 2018 03 15.
Artigo em Inglês | MEDLINE | ID: mdl-28982688

RESUMO

Identifying robust survival subgroups of hepatocellular carcinoma (HCC) will significantly improve patient care. Currently, endeavor of integrating multi-omics data to explicitly predict HCC survival from multiple patient cohorts is lacking. To fill this gap, we present a deep learning (DL)-based model on HCC that robustly differentiates survival subpopulations of patients in six cohorts. We built the DL-based, survival-sensitive model on 360 HCC patients' data using RNA sequencing (RNA-Seq), miRNA sequencing (miRNA-Seq), and methylation data from The Cancer Genome Atlas (TCGA), which predicts prognosis as good as an alternative model where genomics and clinical data are both considered. This DL-based model provides two optimal subgroups of patients with significant survival differences (P = 7.13e-6) and good model fitness [concordance index (C-index) = 0.68]. More aggressive subtype is associated with frequent TP53 inactivation mutations, higher expression of stemness markers (KRT19 and EPCAM) and tumor marker BIRC5, and activated Wnt and Akt signaling pathways. We validated this multi-omics model on five external datasets of various omics types: LIRI-JP cohort (n = 230, C-index = 0.75), NCI cohort (n = 221, C-index = 0.67), Chinese cohort (n = 166, C-index = 0.69), E-TABM-36 cohort (n = 40, C-index = 0.77), and Hawaiian cohort (n = 27, C-index = 0.82). This is the first study to employ DL to identify multi-omics features linked to the differential survival of patients with HCC. Given its robustness over multiple cohorts, we expect this workflow to be useful at predicting HCC prognosis prediction. Clin Cancer Res; 24(6); 1248-59. ©2017 AACR.


Assuntos
Biomarcadores Tumorais , Aprendizado Profundo , Genômica , Neoplasias Hepáticas/genética , Neoplasias Hepáticas/metabolismo , Metabolômica , Proteômica , Algoritmos , Biologia Computacional/métodos , Perfilação da Expressão Gênica , Redes Reguladoras de Genes , Genômica/métodos , Humanos , Neoplasias Hepáticas/mortalidade , Neoplasias Hepáticas/patologia , Metabolômica/métodos , Prognóstico , Modelos de Riscos Proporcionais , Proteômica/métodos , Reprodutibilidade dos Testes , Transcriptoma
6.
Front Genet ; 7: 163, 2016.
Artigo em Inglês | MEDLINE | ID: mdl-27708664

RESUMO

The emerging single-cell RNA-Seq (scRNA-Seq) technology holds the promise to revolutionize our understanding of diseases and associated biological processes at an unprecedented resolution. It opens the door to reveal intercellular heterogeneity and has been employed to a variety of applications, ranging from characterizing cancer cells subpopulations to elucidating tumor resistance mechanisms. Parallel to improving experimental protocols to deal with technological issues, deriving new analytical methods to interpret the complexity in scRNA-Seq data is just as challenging. Here, we review current state-of-the-art bioinformatics tools and methods for scRNA-Seq analysis, as well as addressing some critical analytical challenges that the field faces.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA