A comprehensive dataset of annotated oral cavity images for diagnosis of oral cancer and oral potentially malignant disorders.
Oral Oncol
; 156: 106946, 2024 Sep.
Article
em En
| MEDLINE
| ID: mdl-39002299
ABSTRACT
OBJECTIVES:
This study aims to address the critical gap of unavailability of publicly accessible oral cavity image datasets for developing machine learning (ML) and artificial intelligence (AI) technologies for the diagnosis and prognosis of oral cancer (OCA) and oral potentially malignant disorders (OPMD), with a particular focus on the high prevalence and delayed diagnosis in Asia. MATERIALS ANDMETHODS:
Following ethical approval and informed written consent, images of the oral cavity were obtained from mobile phone cameras and clinical data was extracted from hospital records from patients attending to the Dental Teaching Hospital, Peradeniya, Sri Lanka. After data management and hosting, image categorization and annotations were done by clinicians using a custom-made software tool developed by the research team.RESULTS:
A dataset comprising 3000 high-quality, anonymized images obtained from 714 patients were classified into four distinct categories healthy, benign, OPMD, and OCA. Images were annotated with polygonal shaped oral cavity and lesion boundaries. Each image is accompanied by patient metadata, including age, sex, diagnosis, and risk factor profiles such as smoking, alcohol, and betel chewing habits.CONCLUSION:
Researchers can utilize the annotated images in the COCO format, along with the patients' metadata, to enhance ML and AI algorithm development.Palavras-chave
Texto completo:
1
Coleções:
01-internacional
Base de dados:
MEDLINE
Assunto principal:
Neoplasias Bucais
Limite:
Adolescent
/
Adult
/
Aged
/
Aged80
/
Female
/
Humans
/
Male
/
Middle aged
Idioma:
En
Revista:
Oral Oncol
Assunto da revista:
NEOPLASIAS
Ano de publicação:
2024
Tipo de documento:
Article