Your browser doesn't support javascript.
loading
Vicinal support vector classifier using supervised kernel-based clustering.
Yang, Xulei; Cao, Aize; Song, Qing; Schaefer, Gerald; Su, Yi.
Afiliação
  • Yang X; Department of Computing Science, Institute of High Performance Computing, Agency for Science, Technology and Research (A*STAR), 1 Fusionopolis Way, Singapore 138632, Singapore; School of Electrical and Electronics Engineering, Nanyang Technological University, 50 Nanyang Avenue, Singapore 639798, Si
  • Cao A; The Institute for Medicine and Public Health, Vanderbilt University, 2525 West End Avenue, Suite 600, Nashville, TN 37203-1738, USA.
  • Song Q; School of Electrical and Electronics Engineering, Nanyang Technological University, 50 Nanyang Avenue, Singapore 639798, Singapore.
  • Schaefer G; Department of Computer Science, Loughborough University, Leicestershire LE11 3TU, UK.
  • Su Y; Department of Computing Science, Institute of High Performance Computing, Agency for Science, Technology and Research (A*STAR), 1 Fusionopolis Way, Singapore 138632, Singapore.
Artif Intell Med ; 60(3): 189-96, 2014 Mar.
Article em En | MEDLINE | ID: mdl-24637294
ABSTRACT

OBJECTIVE:

Support vector machines (SVMs) have drawn considerable attention due to their high generalisation ability and superior classification performance compared to other pattern recognition algorithms. However, the assumption that the learning data is identically generated from unknown probability distributions may limit the application of SVMs for real problems. In this paper, we propose a vicinal support vector classifier (VSVC) which is shown to be able to effectively handle practical applications where the learning data may originate from different probability distributions.

METHODS:

The proposed VSVC method utilises a set of new vicinal kernel functions which are constructed based on supervised clustering in the kernel-induced feature space. Our proposed approach comprises two steps. In the clustering step, a supervised kernel-based deterministic annealing (SKDA) clustering algorithm is employed to partition the training data into different soft vicinal areas of the feature space in order to construct the vicinal kernel functions. In the training step, the SVM technique is used to minimise the vicinal risk function under the constraints of the vicinal areas defined in the SKDA clustering step.

RESULTS:

Experimental results on both artificial and real medical datasets show our proposed VSVC achieves better classification accuracy and lower computational time compared to a standard SVM. For an artificial dataset constructed from non-separated data, the classification accuracy of VSVC is between 95.5% and 96.25% (using different cluster numbers) which compares favourably to the 94.5% achieved by SVM. The VSVC training time is between 8.75s and 17.83s (for 2-8 clusters), considerable less than the 65.0s required by SVM. On a real mammography dataset, the best classification accuracy of VSVC is 85.7% and thus clearly outperforms a standard SVM which obtains an accuracy of only 82.1%. A similar performance improvement is confirmed on two further real datasets, a breast cancer dataset (74.01% vs. 72.52%) and a heart dataset (84.77% vs. 83.81%), coupled with a reduction in terms of learning time (32.07s vs. 92.08s and 25.00s vs. 53.31s, respectively). Furthermore, the VSVC results in the number of support vectors being equal to the specified cluster number, and hence in a much sparser solution compared to a standard SVM.

CONCLUSION:

Incorporating a supervised clustering algorithm into the SVM technique leads to a sparse but effective solution, while making the proposed VSVC adaptive to different probability distributions of the training data.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Análise por Conglomerados / Máquina de Vetores de Suporte Tipo de estudo: Diagnostic_studies Limite: Female / Humans Idioma: En Revista: Artif Intell Med Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2014 Tipo de documento: Article País de afiliação: Eslovênia

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Análise por Conglomerados / Máquina de Vetores de Suporte Tipo de estudo: Diagnostic_studies Limite: Female / Humans Idioma: En Revista: Artif Intell Med Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2014 Tipo de documento: Article País de afiliação: Eslovênia