Your browser doesn't support javascript.
loading
MODIG: integrating multi-omics and multi-dimensional gene network for cancer driver gene identification based on graph attention network model.
Zhao, Wenyi; Gu, Xun; Chen, Shuqing; Wu, Jian; Zhou, Zhan.
Afiliação
  • Zhao W; Institute of Drug Metabolism and Pharmaceutical Analysis and Zhejiang Provincial Key Laboratory of Anti-Cancer Drug Research, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
  • Gu X; Collaborative Innovation Center of Artificial Intelligence by MOE and Zhejiang Provincial Government, College of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China.
  • Chen S; Innovation Institute for Artificial Intelligence in Medicine, Zhejiang University, Hangzhou 310018, China.
  • Wu J; Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA 50011, USA.
  • Zhou Z; Institute of Drug Metabolism and Pharmaceutical Analysis and Zhejiang Provincial Key Laboratory of Anti-Cancer Drug Research, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China.
Bioinformatics ; 38(21): 4901-4907, 2022 10 31.
Article em En | MEDLINE | ID: mdl-36094338
ABSTRACT
MOTIVATION Identifying genes that play a causal role in cancer evolution remains one of the biggest challenges in cancer biology. With the accumulation of high-throughput multi-omics data over decades, it becomes a great challenge to effectively integrate these data into the identification of cancer driver genes.

RESULTS:

Here, we propose MODIG, a graph attention network (GAT)-based framework to identify cancer driver genes by combining multi-omics pan-cancer data (mutations, copy number variants, gene expression and methylation levels) with multi-dimensional gene networks. First, we established diverse types of gene relationship maps based on protein-protein interactions, gene sequence similarity, KEGG pathway co-occurrence, gene co-expression patterns and gene ontology. Then, we constructed a multi-dimensional gene network consisting of approximately 20 000 genes as nodes and five types of gene associations as multiplex edges. We applied a GAT to model within-dimension interactions to generate a gene representation for each dimension based on this graph. Moreover, we introduced a joint learning module to fuse multiple dimension-specific representations to generate general gene representations. Finally, we used the obtained gene representation to perform a semi-supervised driver gene identification task. The experiment results show that MODIG outperforms the baseline models in terms of area under precision-recall curves and area under the receiver operating characteristic curves. AVAILABILITY AND IMPLEMENTATION The MODIG program is available at https//github.com/zjupgx/modig. The code and data underlying this article are also available on Zenodo, at https//doi.org/10.5281/zenodo.7057241. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Redes Reguladoras de Genes / Neoplasias Tipo de estudo: Diagnostic_studies / Prognostic_studies Limite: Humans Idioma: En Ano de publicação: 2022 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Redes Reguladoras de Genes / Neoplasias Tipo de estudo: Diagnostic_studies / Prognostic_studies Limite: Humans Idioma: En Ano de publicação: 2022 Tipo de documento: Article