Your browser doesn't support javascript.
loading
Diagnosis of malignancy in oropharyngeal confocal laser endomicroscopy using GPT 4.0 with vision.
Sievert, Matti; Aubreville, Marc; Mueller, Sarina Katrin; Eckstein, Markus; Breininger, Katharina; Iro, Heinrich; Goncalves, Miguel.
Afiliación
  • Sievert M; Department of Otorhinolaryngology, Head and Neck Surgery, Friedrich Alexander University of Erlangen-Nuremberg, Erlangen University Hospital, Erlangen, Germany.
  • Aubreville M; Technische Hochschule Ingolstadt, Ingolstadt, Germany.
  • Mueller SK; Department of Otorhinolaryngology, Head and Neck Surgery, Friedrich Alexander University of Erlangen-Nuremberg, Erlangen University Hospital, Erlangen, Germany.
  • Eckstein M; Institute of Pathology, Friedrich-Alexander-Universität Erlangen-Nürnberg, University Hospital, Erlangen, Germany.
  • Breininger K; Department of Artificial Intelligence in Biomedical Engineering, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany.
  • Iro H; Department of Otorhinolaryngology, Head and Neck Surgery, Friedrich Alexander University of Erlangen-Nuremberg, Erlangen University Hospital, Erlangen, Germany.
  • Goncalves M; Department of Otorhinolaryngology, Plastic and Aesthetic Operations, University Hospital Würzburg, Joseph-Schneider-Straße 11, 97080, Würzburg, Germany. Goncalves_M@ukw.de.
Eur Arch Otorhinolaryngol ; 281(4): 2115-2122, 2024 Apr.
Article en En | MEDLINE | ID: mdl-38329525
ABSTRACT

PURPOSE:

Confocal Laser Endomicroscopy (CLE) is an imaging tool, that has demonstrated potential for intraoperative, real-time, non-invasive, microscopical assessment of surgical margins of oropharyngeal squamous cell carcinoma (OPSCC). However, interpreting CLE images remains challenging. This study investigates the application of OpenAI's Generative Pretrained Transformer (GPT) 4.0 with Vision capabilities for automated classification of CLE images in OPSCC.

METHODS:

CLE Images of histological confirmed SCC or healthy mucosa from a database of 12 809 CLE images from 5 patients with OPSCC were retrieved and anonymized. Using a training data set of 16 images, a validation set of 139 images, comprising SCC (83 images, 59.7%) and healthy normal mucosa (56 images, 40.3%) was classified using the application programming interface (API) of GPT4.0. The same set of images was also classified by CLE experts (two surgeons and one pathologist), who were blinded to the histology. Diagnostic metrics, the reliability of GPT and inter-rater reliability were assessed.

RESULTS:

Overall accuracy of the GPT model was 71.2%, the intra-rater agreement was κ = 0.837, indicating an almost perfect agreement across the three runs of GPT-generated results. Human experts achieved an accuracy of 88.5% with a substantial level of agreement (κ = 0.773).

CONCLUSIONS:

Though limited to a specific clinical framework, patient and image set, this study sheds light on some previously unexplored diagnostic capabilities of large language models using few-shot prompting. It suggests the model`s ability to extrapolate information and classify CLE images with minimal example data. Whether future versions of the model can achieve clinically relevant diagnostic accuracy, especially in uncurated data sets, remains to be investigated.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Neoplasias de Cabeza y Cuello Tipo de estudio: Diagnostic_studies Límite: Humans Idioma: En Revista: Eur Arch Otorhinolaryngol Asunto de la revista: OTORRINOLARINGOLOGIA Año: 2024 Tipo del documento: Article País de afiliación: Alemania

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Neoplasias de Cabeza y Cuello Tipo de estudio: Diagnostic_studies Límite: Humans Idioma: En Revista: Eur Arch Otorhinolaryngol Asunto de la revista: OTORRINOLARINGOLOGIA Año: 2024 Tipo del documento: Article País de afiliación: Alemania
...