Your browser doesn't support javascript.
loading
A critical assessment of the feature selection methods used for biomarker discovery in current metaproteomics studies.
Tang, Jing; Wang, Yunxia; Fu, Jianbo; Zhou, Ying; Luo, Yongchao; Zhang, Ying; Li, Bo; Yang, Qingxia; Xue, Weiwei; Lou, Yan; Qiu, Yunqing; Zhu, Feng.
Afiliação
  • Tang J; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, China.
  • Wang Y; Department of Bioinformatics, Chongqing Medical University, Chongqing, China.
  • Fu J; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, China.
  • Zhou Y; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, China.
  • Luo Y; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, China.
  • Zhang Y; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, China.
  • Li B; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, China.
  • Yang Q; School of Pharmaceutical Sciences, Chongqing University, Chongqing, China.
  • Xue W; College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, China.
  • Lou Y; School of Pharmaceutical Sciences, Chongqing University, Chongqing, China.
  • Qiu Y; School of Pharmaceutical Sciences, Chongqing University, Chongqing, China.
  • Zhu F; Zhejiang Provincial Key Laboratory for Drug Clinical Research and Evaluation, The First Affiliated Hospital, Zhejiang University, Hangzhou, Zhejiang, China.
Brief Bioinform ; 21(4): 1378-1390, 2020 07 15.
Article em En | MEDLINE | ID: mdl-31197323
ABSTRACT
Microbial community (MC) has great impact on mediating complex disease indications, biogeochemical cycling and agricultural productivities, which makes metaproteomics powerful technique for quantifying diverse and dynamic composition of proteins or peptides. The key role of biostatistical strategies in MC study is reported to be underestimated, especially the appropriate application of feature selection method (FSM) is largely ignored. Although extensive efforts have been devoted to assessing the performance of FSMs, previous studies focused only on their classification accuracy without considering their ability to correctly and comprehensively identify the spiked proteins. In this study, the performances of 14 FSMs were comprehensively assessed based on two key criteria (both sample classification and spiked protein discovery) using a variety of metaproteomics benchmarks. First, the classification accuracies of those 14 FSMs were evaluated. Then, their abilities in identifying the proteins of different spiked concentrations were assessed. Finally, seven FSMs (FC, LMEB, OPLS-DA, PLS-DA, SAM, SVM-RFE and T-Test) were identified as performing consistently superior or good under both criteria with the PLS-DA performing consistently superior. In summary, this study served as comprehensive analysis on the performances of current FSMs and could provide a valuable guideline for researchers in metaproteomics.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Proteômica Tipo de estudo: Prognostic_studies Idioma: En Revista: Brief Bioinform Assunto da revista: BIOLOGIA / INFORMATICA MEDICA Ano de publicação: 2020 Tipo de documento: Article País de afiliação: China

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Proteômica Tipo de estudo: Prognostic_studies Idioma: En Revista: Brief Bioinform Assunto da revista: BIOLOGIA / INFORMATICA MEDICA Ano de publicação: 2020 Tipo de documento: Article País de afiliação: China