Your browser doesn't support javascript.
loading
Prediction of active sites of enzymes by maximum relevance minimum redundancy (mRMR) feature selection.
Gao, Yu-Fei; Li, Bi-Qing; Cai, Yu-Dong; Feng, Kai-Yan; Li, Zhan-Dong; Jiang, Yang.
Afiliação
  • Gao YF; Department of Surgery, China-Japan Union Hospital of Jilin University, Changchun, People's Republic of China. jy7555@163.com
Mol Biosyst ; 9(1): 61-9, 2013 Jan 27.
Article em En | MEDLINE | ID: mdl-23117653
ABSTRACT
Identification of catalytic residues plays a key role in understanding how enzymes work. Although numerous computational methods have been developed to predict catalytic residues and active sites, the prediction accuracy remains relatively low with high false positives. In this work, we developed a novel predictor based on the Random Forest algorithm (RF) aided by the maximum relevance minimum redundancy (mRMR) method and incremental feature selection (IFS). We incorporated features of physicochemical/biochemical properties, sequence conservation, residual disorder, secondary structure and solvent accessibility to predict active sites of enzymes and achieved an overall accuracy of 0.885687 and MCC of 0.689226 on an independent test dataset. Feature analysis showed that every category of the features except disorder contributed to the identification of active sites. It was also shown via the site-specific feature analysis that the features derived from the active site itself contributed most to the active site determination. Our prediction method may become a useful tool for identifying the active sites and the key features identified by the paper may provide valuable insights into the mechanism of catalysis.
Assuntos

Texto completo: 1 Bases de dados: MEDLINE Assunto principal: Biologia Computacional / Enzimas / Modelos Químicos Tipo de estudo: Prognostic_studies / Risk_factors_studies Idioma: En Revista: Mol Biosyst Assunto da revista: BIOLOGIA MOLECULAR / BIOQUIMICA Ano de publicação: 2013 Tipo de documento: Article

Texto completo: 1 Bases de dados: MEDLINE Assunto principal: Biologia Computacional / Enzimas / Modelos Químicos Tipo de estudo: Prognostic_studies / Risk_factors_studies Idioma: En Revista: Mol Biosyst Assunto da revista: BIOLOGIA MOLECULAR / BIOQUIMICA Ano de publicação: 2013 Tipo de documento: Article