Your browser doesn't support javascript.
loading
Comparative approaches for classification of diabetes mellitus data: Machine learning paradigm.
Maniruzzaman, Md; Kumar, Nishith; Menhazul Abedin, Md; Shaykhul Islam, Md; Suri, Harman S; El-Baz, Ayman S; Suri, Jasjit S.
Afiliação
  • Maniruzzaman M; Department of Statistics, University of Rajshahi, Rajshahi, Bangladesh. Electronic address: monir.stat91@gmail.com.
  • Kumar N; Department of Statistics, Bangabandhu Sheikh Mujibur Rahman Science and Technology University, Gopalganj, Bangladesh. Electronic address: nk.bru09@gmail.com.
  • Menhazul Abedin M; Statistics Discipline, Khulna University, Khulna, Bangladesh. Electronic address: menhaz70@gmail.com.
  • Shaykhul Islam M; Department of Statistics, University of Rajshahi, Rajshahi, Bangladesh. Electronic address: shaykhulrustat@gmail.com.
  • Suri HS; Diabetic Care Division, AtheroPoint LLC, Roseville, CA, USA. Electronic address: hsingh574@gmail.com.
  • El-Baz AS; Department of Bioengineering, J.B Speed School of Engineering, University of Louisville, Louisville, KY, USA. Electronic address: ayman_elbazz@yahoo.com.
  • Suri JS; Diabetic Care Division, AtheroPoint LLC, Roseville, CA, USA; Department of Electrical Engineering, Idaho State University (Affl.), Idaho, USA. Electronic address: jasjit.suri@atheropoint.com.
Comput Methods Programs Biomed ; 152: 23-34, 2017 Dec.
Article em En | MEDLINE | ID: mdl-29054258
ABSTRACT
BACKGROUND AND

OBJECTIVE:

Diabetes is a silent killer. The main cause of this disease is the presence of excessive amounts of metabolites such as glucose. There were about 387 million diabetic people all over the world in 2014. The financial burden of this disease has been calculated to be about $13,700 per year. According to the World Health Organization (WHO), these figures will more than double by the year 2030. This cost will be reduced dramatically if someone can predict diabetes statistically on the basis of some covariates. Although several classification techniques are available, it is very difficult to classify diabetes. The main objectives of this paper are as follows (i) Gaussian process classification (GPC), (ii) comparative classifier for diabetes data classification, (iii) data analysis using the cross-validation approach, (iv) interpretation of the data analysis and (v) benchmarking our method against others.

METHODS:

To classify diabetes, several classification techniques are used such as linear discriminant analysis (LDA), quadratic discriminant analysis (QDA), and Naive Bayes (NB). However, most of the medical data show non-normality, non-linearity and inherent correlation structure. So in this paper we adapted Gaussian process (GP)-based classification technique using three kernels namely linear, polynomial and radial basis kernel. We also investigate the performance of a GP-based classification technique in comparison to existing techniques such as LDA, QDA and NB. Performances are evaluated by using the accuracy (ACC), sensitivity (SE), specificity (SP), positive predictive value (PPV), negative predictive value (NPV) and receiver-operating characteristic (ROC) curves.

RESULTS:

Pima Indian diabetes dataset is taken as part of the study. This consists of 768 patients, of which 268 patients are diabetic and 500 patients are controls. Our machine learning system shows the performance of GP-based model as ACC 81.97%, SE 91.79%, SP 63.33%, PPV 84.91% and NPV 62.50% which are larger compared to other methods.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Contexto em Saúde: 2_ODS3 Base de dados: MEDLINE Assunto principal: Diabetes Mellitus / Aprendizado de Máquina Tipo de estudo: Prognostic_studies Limite: Humans Idioma: En Revista: Comput Methods Programs Biomed Ano de publicação: 2017 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Contexto em Saúde: 2_ODS3 Base de dados: MEDLINE Assunto principal: Diabetes Mellitus / Aprendizado de Máquina Tipo de estudo: Prognostic_studies Limite: Humans Idioma: En Revista: Comput Methods Programs Biomed Ano de publicação: 2017 Tipo de documento: Article