Your browser doesn't support javascript.
loading
Comparison of machine learning algorithms for the prediction of five-year survival in oral squamous cell carcinoma.
Alkhadar, Huda; Macluskey, Michaelina; White, Sharon; Ellis, Ian; Gardner, Alexander.
Afiliação
  • Alkhadar H; Unit of Cell and Molecular Biology, Dundee Dental School, University of Dundee, Dundee, UK.
  • Macluskey M; Department of Oral Surgery, Medicine and Pathology, Dundee Dental School, University of Dundee, Dundee, UK.
  • White S; Department of Oral Surgery, Medicine and Pathology, Dundee Dental School, University of Dundee, Dundee, UK.
  • Ellis I; Unit of Cell and Molecular Biology, Dundee Dental School, University of Dundee, Dundee, UK.
  • Gardner A; Department of Restorative Dentistry, Dundee Dental School, University of Dundee, Dundee, UK.
J Oral Pathol Med ; 50(4): 378-384, 2021 Apr.
Article em En | MEDLINE | ID: mdl-33220109
BACKGROUND/AIM: Machine learning analyses of cancer outcomes for oral cancer remain sparse compared to other types of cancer like breast or lung. The purpose of the present study was to compare the performance of machine learning algorithms in the prediction of global, recurrence-free five-year survival in oral cancer patients based on clinical and histopathological data. METHODS: Data were gathered retrospectively from 416 patients with oral squamous cell carcinoma. The data set was divided into training and test data set (75:25 split). Training performance of five machine learning algorithms (Logistic regression, K-nearest neighbours, Naïve Bayes, Decision tree and Random forest classifiers) for prediction was assessed by k-fold cross-validation. Variables used in the machine learning models were age, sex, pain symptoms, grade of lesion, lymphovascular invasion, extracapsular extension, perineural invasion, bone invasion and type of treatment. Variable importance was assessed and model performance on the testing data was assessed using receiver operating characteristic curves, accuracy, sensitivity, specificity and F1 score. RESULTS: The best performing model was the Decision tree classifier, followed by the Logistic Regression model (accuracy 76% and 60%, respectively). The Naïve Bayes model did not display any predictive value with 0% specificity. CONCLUSIONS: Machine learning presents a promising and accessible toolset for improving prediction of oral cancer outcomes. Our findings add to a growing body of evidence that Decision tree models are useful in models in predicting OSCC outcomes. We would advise that future similar studies explore a variety of machine learning models including Logistic regression to help evaluate model performance.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Neoplasias Bucais / Carcinoma de Células Escamosas / Neoplasias de Cabeça e Pescoço Idioma: En Ano de publicação: 2021 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Neoplasias Bucais / Carcinoma de Células Escamosas / Neoplasias de Cabeça e Pescoço Idioma: En Ano de publicação: 2021 Tipo de documento: Article