Your browser doesn't support javascript.
loading
Compositional framework for multitask learning in the identification of cleavage sites of HIV-1 protease.
Singh, Deepak; Sisodia, Dilip Singh; Singh, Pradeep.
Afiliação
  • Singh D; Department of Computer Science and Engineering, National Institute of Technology, Raipur, C.G, India. Electronic address: dsingh.phd2016.cs@nitrr.ac.in.
  • Sisodia DS; Department of Computer Science and Engineering, National Institute of Technology, Raipur, C.G, India. Electronic address: dssisodia.cs@nitrr.ac.in.
  • Singh P; Department of Computer Science and Engineering, National Institute of Technology, Raipur, C.G, India. Electronic address: psingh.cs@nitrr.ac.in.
J Biomed Inform ; 102: 103376, 2020 02.
Article em En | MEDLINE | ID: mdl-31935461
ABSTRACT
Inadequate patient samples and costly annotated data generations result into the smaller dataset in the biomedical domain. Due to which the predictions with a trained model that usually reveal a single small dataset association are fail to derive robust insights. To cope with the data sparsity, a promising strategy of combining data from the different related tasks is exercised in various application. Motivated by, successful work in the various bioinformatics application, we propose a multitask learning model based on multi-kernel that exploits the dependencies among various related tasks. This work aims to combine the knowledge from experimental studies of the different dataset to build stronger predictive models for HIV-1 protease cleavage sites prediction. In this study, a set of peptide data from one source is referred as 'task' and to integrate interactions from multiple tasks; our method exploits the common features and parameters sharing across the data source. The proposed framework uses feature integration, feature selection, multi-kernel and multifactorial evolutionary algorithm to model multitask learning. The framework considered seven different feature descriptors and four different kernel variants of support vector machines to form the optimal multi-kernel learning model. To validate the effectiveness of the model, the performance parameters such as average accuracy, and area under curve have been evaluated on the suggested model. We also carried out Friedman and post hoc statistical test to substantiate the significant improvement achieved by the proposed framework. The result obtained following the extensive experiment confirms the belief that multitask learning in cleavage site identification can improve the performance.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Algoritmos / Protease de HIV / Biologia Computacional Tipo de estudo: Diagnostic_studies / Prognostic_studies Idioma: En Ano de publicação: 2020 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Algoritmos / Protease de HIV / Biologia Computacional Tipo de estudo: Diagnostic_studies / Prognostic_studies Idioma: En Ano de publicação: 2020 Tipo de documento: Article