Predicting membrane protein types using various decision tree classifiers based on various modes of general PseAAC for imbalanced datasets.
Sankari, E Siva; Manimegalai, D.
Affiliation
  • Sankari ES; Department of CSE, Government College of Engineering, Tirunelveli, Tamil Nadu, India. Electronic address: sivasankari@gcetly.ac.in.
  • Manimegalai D; Department of IT, National Engineering College, Kovilpatti, Tamil Nadu, India. Electronic address: megalai_nec@yahoo.co.in.
J Theor Biol; 435: 208-217, 2017 Dec 21.
Article in English | MEDLINE | ID: mdl-28941868
ABSTRACT
Predicting membrane protein types is an important and challenging research area in bioinformatics and proteomics. Membrane protein types are traditionally classified by biophysical methods. Because databases hold a large number of uncharacterized protein sequences, these traditional methods are very time consuming, expensive, and susceptible to errors. Hence, it is highly desirable to develop a robust, reliable, and efficient method to predict membrane protein types. Decision tree classifiers often handle imbalanced and large datasets well. Because the datasets used here are imbalanced, the performance of various decision tree classifiers, namely Decision Tree (DT), Classification And Regression Tree (CART), C4.5, Random tree, and REP (Reduced Error Pruning) tree, and of ensemble methods, namely Adaboost, RUS (Random Under Sampling) boost, Rotation forest, and Random forest, is analysed. Among these classifiers, Random forest performs well in less time, with a good accuracy of 96.35%. Another inference is that the RUS boost decision tree classifier is able to classify one or two samples in the class with very few samples, whereas the other classifiers, such as DT, Adaboost, Rotation forest, and Random forest, are not sensitive to the classes with fewer samples. The performance of the decision tree classifiers is also compared with that of SVM (Support Vector Machine) and Naive Bayes classifiers.
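The point about classes with very few samples can be illustrated with a minimal sketch. It is not the authors' code or data: the labels are made up, and the helper names `random_undersample` and `per_class_recall` are hypothetical. It shows only the random undersampling step that RUS boost builds on (the boosting part is omitted), and why overall accuracy can hide a class that is never predicted.

```python
import random
from collections import Counter, defaultdict


def random_undersample(samples, labels, seed=0):
    """Randomly drop samples so every class keeps as many as the rarest class."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for x, y in zip(samples, labels):
        by_class[y].append(x)
    n_min = min(len(xs) for xs in by_class.values())
    out_x, out_y = [], []
    for y, xs in by_class.items():
        for x in rng.sample(xs, n_min):
            out_x.append(x)
            out_y.append(y)
    return out_x, out_y


def per_class_recall(y_true, y_pred):
    """Fraction of each true class that was predicted correctly."""
    total = Counter(y_true)
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    return {c: hits[c] / total[c] for c in total}


# Toy, made-up labels: 95 samples of a common class, 5 of a rare class.
y_true = ["common"] * 95 + ["rare"] * 5
# A hypothetical classifier that ignores the rare class entirely.
y_pred = ["common"] * 100

# Overall accuracy looks high even though the rare class is never found.
accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
recall = per_class_recall(y_true, y_pred)

# Undersampling the training data gives both classes equal weight.
features = list(range(100))  # placeholder feature values
bal_x, bal_y = random_undersample(features, y_true)
balanced_counts = Counter(bal_y)

print(accuracy, recall, dict(balanced_counts))
```

Per-class recall makes the insensitivity to small classes visible, whereas a single accuracy figure does not. Training each boosting round on an undersampled set is one common way to make a classifier attend to the rare class.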

Full text: 1 Collections: 01-international Database: MEDLINE Main subject: Decision Trees / Databases, Protein / Membrane Proteins Type of study: Health_economic_evaluation / Prognostic_studies / Risk_factors_studies Language: En Journal: J Theor Biol Publication year: 2017 Document type: Article
