Your browser doesn't support javascript.
loading
DMIL-IsoFun: predicting isoform function using deep multi-instance learning.
Yu, Guoxian; Zhou, Guangjie; Zhang, Xiangliang; Domeniconi, Carlotta; Guo, Maozu.
Afiliação
  • Yu G; School of Software, Shandong University, Jinan 250101, China.
  • Zhou G; College of Computer and Information Sciences, Southwest University, Chongqing 400715, China.
  • Zhang X; Computer, Electrical, and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia.
  • Domeniconi C; School of Software, Shandong University, Jinan 250101, China.
  • Guo M; College of Computer and Information Sciences, Southwest University, Chongqing 400715, China.
Bioinformatics ; 37(24): 4818-4825, 2021 12 11.
Article em En | MEDLINE | ID: mdl-34282449
ABSTRACT
MOTIVATION Alternative splicing creates the considerable proteomic diversity and complexity on relatively limited genome. Proteoforms translated from alternatively spliced isoforms of a gene actually execute the biological functions of this gene, which reflect the functional knowledge of genes at a finer granular level. Recently, some computational approaches have been proposed to differentiate isoform functions using sequence and expression data. However, their performance is far from being desirable, mainly due to the imbalance and lack of annotations at isoform-level, and the difficulty of modeling gene-isoform relations.

RESULT:

We propose a deep multi-instance learning-based framework (DMIL-IsoFun) to differentiate the functions of isoforms. DMIL-IsoFun firstly introduces a multi-instance learning convolution neural network trained with isoform sequences and gene-level annotations to extract the feature vectors and initialize the annotations of isoforms, and then uses a class-imbalance Graph Convolution Network to refine the annotations of individual isoforms based on the isoform co-expression network and extracted features. Extensive experimental results show that DMIL-IsoFun improves the Smin and Fmax of state-of-the-art solutions by at least 29.6% and 40.8%. The effectiveness of DMIL-IsoFun is further confirmed on a testbed of human multiple-isoform genes, and maize isoforms related with photosynthesis. AVAILABILITY AND IMPLEMENTATION The code and data are available at http//www.sdu-idea.cn/codes.php?name=DMIL-Isofun. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Processamento Alternativo / Proteômica Tipo de estudo: Prognostic_studies / Risk_factors_studies Limite: Humans Idioma: En Revista: Bioinformatics Ano de publicação: 2021 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Processamento Alternativo / Proteômica Tipo de estudo: Prognostic_studies / Risk_factors_studies Limite: Humans Idioma: En Revista: Bioinformatics Ano de publicação: 2021 Tipo de documento: Article