Your browser doesn't support javascript.
loading
Deep Learning Transformer Models for Building a Comprehensive and Real-time Trauma Observatory: Development and Validation Study.
Chenais, Gabrielle; Gil-Jardiné, Cédric; Touchais, Hélène; Avalos Fernandez, Marta; Contrand, Benjamin; Tellier, Eric; Combes, Xavier; Bourdois, Loick; Revel, Philippe; Lagarde, Emmanuel.
Afiliação
  • Chenais G; Unit 1219, Bordeaux Public Health Center, Institut National de la Santé et de la Recherche Médicale, Bordeaux, France.
  • Gil-Jardiné C; Unit 1219, Bordeaux Public Health Center, Institut National de la Santé et de la Recherche Médicale, Bordeaux, France.
  • Touchais H; Emergency Department, Bordeaux University Hospital, Bordeaux, France.
  • Avalos Fernandez M; Unit 1219, Bordeaux Public Health Center, Institut National de la Santé et de la Recherche Médicale, Bordeaux, France.
  • Contrand B; Unit 1219, Bordeaux Public Health Center, Institut National de la Santé et de la Recherche Médicale, Bordeaux, France.
  • Tellier E; Statistics in Systems Biology and Translational Medicine Team, University of Bordeaux, Institut National de Recherche en Sciences et Technologies du Numérique, Talence, France.
  • Combes X; Unit 1219, Bordeaux Public Health Center, Institut National de la Santé et de la Recherche Médicale, Bordeaux, France.
  • Bourdois L; Unit 1219, Bordeaux Public Health Center, Institut National de la Santé et de la Recherche Médicale, Bordeaux, France.
  • Revel P; Emergency Department, Bordeaux University Hospital, Bordeaux, France.
  • Lagarde E; Emergency Department, Bordeaux University Hospital, Bordeaux, France.
JMIR AI ; 2: e40843, 2023 Jan 12.
Article em En | MEDLINE | ID: mdl-38875539
ABSTRACT

BACKGROUND:

Public health surveillance relies on the collection of data, often in near-real time. Recent advances in natural language processing make it possible to envisage an automated system for extracting information from electronic health records.

OBJECTIVE:

To study the feasibility of setting up a national trauma observatory in France, we compared the performance of several automatic language processing methods in a multiclass classification task of unstructured clinical notes.

METHODS:

A total of 69,110 free-text clinical notes related to visits to the emergency departments of the University Hospital of Bordeaux, France, between 2012 and 2019 were manually annotated. Among these clinical notes, 32.5% (22,481/69,110) were traumas. We trained 4 transformer models (deep learning models that encompass attention mechanism) and compared them with the term frequency-inverse document frequency associated with the support vector machine method.

RESULTS:

The transformer models consistently performed better than the term frequency-inverse document frequency and a support vector machine. Among the transformers, the GPTanam model pretrained with a French corpus with an additional autosupervised learning step on 306,368 unlabeled clinical notes showed the best performance with a micro F1-score of 0.969.

CONCLUSIONS:

The transformers proved efficient at the multiclass classification of narrative and medical data. Further steps for improvement should focus on the expansion of abbreviations and multioutput multiclass classification.
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Idioma: En Revista: JMIR AI Ano de publicação: 2023 Tipo de documento: Article País de afiliação: França

Texto completo: 1 Base de dados: MEDLINE Idioma: En Revista: JMIR AI Ano de publicação: 2023 Tipo de documento: Article País de afiliação: França