Your browser doesn't support javascript.
loading
Improving the computational efficiency of recursive cluster elimination for gene selection.
Luo, Lin-Kai; Huang, Deng-Feng; Ye, Ling-Jun; Zhou, Qi-Feng; Shao, Gui-Fang; Peng, Hong.
Afiliación
  • Luo LK; Department of Automation, Xiamen University, Xiamen 361005, PR China. luolk@xmu.edu.cn
Article en En | MEDLINE | ID: mdl-20479497
ABSTRACT
The gene expression data are usually provided with a large number of genes and a relatively small number of samples, which brings a lot of new challenges. Selecting those informative genes becomes the main issue in microarray data analysis. Recursive cluster elimination based on support vector machine (SVM-RCE) has shown the better classification accuracy on some microarray data sets than recursive feature elimination based on support vector machine (SVM-RFE). However, SVM-RCE is extremely time-consuming. In this paper, we propose an improved method of SVM-RCE called ISVM-RCE. ISVM-RCE first trains a SVM model with all clusters, then applies the infinite norm of weight coefficient vector in each cluster to score the cluster, finally eliminates the gene clusters with the lowest score. In addition, ISVM-RCE eliminates genes within the clusters instead of removing a cluster of genes when the number of clusters is small. We have tested ISVM-RCE on six gene expression data sets and compared their performances with SVM-RCE and linear-discriminant-analysis-based RFE (LDA-RFE). The experiment results on these data sets show that ISVM-RCE greatly reduces the time cost of SVM-RCE, meanwhile obtains comparable classification performance as SVM-RCE, while LDA-RFE is not stable.
Asunto(s)

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Análisis por Conglomerados / Biología Computacional / Análisis de Secuencia por Matrices de Oligonucleótidos / Perfilación de la Expresión Génica / Bases de Datos Genéticas Tipo de estudio: Prognostic_studies Límite: Humans / Male Idioma: En Revista: ACM Trans Comput Biol Bioinform Asunto de la revista: BIOLOGIA / INFORMATICA MEDICA Año: 2011 Tipo del documento: Article

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Análisis por Conglomerados / Biología Computacional / Análisis de Secuencia por Matrices de Oligonucleótidos / Perfilación de la Expresión Génica / Bases de Datos Genéticas Tipo de estudio: Prognostic_studies Límite: Humans / Male Idioma: En Revista: ACM Trans Comput Biol Bioinform Asunto de la revista: BIOLOGIA / INFORMATICA MEDICA Año: 2011 Tipo del documento: Article