Can reproducibility be improved in clinical natural language processing? A study of 7 clinical NLP suites.

Digan, William; Névéol, Aurélie; Neuraz, Antoine; Wack, Maxime; Baudoin, David; Burgun, Anita; Rance, Bastien

Digan, William; Névéol, Aurélie; Neuraz, Antoine; Wack, Maxime; Baudoin, David; Burgun, Anita; Rance, Bastien.

Afiliación

Digan W; INSERM, Centre de Recherche des Cordeliers, UMRS 1138, Université de Paris, Université Sorbonne Paris Cité, Paris, France.
Névéol A; Department of Medical Informatics, Hôpital Européen Georges Pompidou, Assistance publique-Hôpitaux de Paris, Paris, France.
Neuraz A; Université Paris Saclay, CNRS, LIMSI, Orsay, France.
Wack M; INSERM, Centre de Recherche des Cordeliers, UMRS 1138, Université de Paris, Université Sorbonne Paris Cité, Paris, France.
Baudoin D; Department of Medical Informatics, Necker Children's Hospital, Assistance publique-Hôpitaux de Paris, Paris, France.
Burgun A; INSERM, Centre de Recherche des Cordeliers, UMRS 1138, Université de Paris, Université Sorbonne Paris Cité, Paris, France.
Rance B; Department of Medical Informatics, Hôpital Européen Georges Pompidou, Assistance publique-Hôpitaux de Paris, Paris, France.

J Am Med Inform Assoc ; 28(3): 504-515, 2021 03 01.

Article en En | MEDLINE | ID: mdl-33319904

ABSTRACT

ABSTRACT

BACKGROUND:

The increasing complexity of data streams and computational processes in modern clinical health information systems makes reproducibility challenging. Clinical natural language processing (NLP) pipelines are routinely leveraged for the secondary use of data. Workflow management systems (WMS) have been widely used in bioinformatics to handle the reproducibility bottleneck.

OBJECTIVE:

To evaluate if WMS and other bioinformatics practices could impact the reproducibility of clinical NLP frameworks. MATERIALS AND

METHODS:

Based on the literature across multiple researcho fields (NLP, bioinformatics and clinical informatics) we selected articles which (1) review reproducibility practices and (2) highlight a set of rules or guidelines to ensure tool or pipeline reproducibility. We aggregate insight from the literature to define reproducibility recommendations. Finally, we assess the compliance of 7 NLP frameworks to the recommendations.

RESULTS:

We identified 40 reproducibility features from 8 selected articles. Frameworks based on WMS match more than 50% of features (26 features for LAPPS Grid, 22 features for OpenMinted) compared to 18 features for current clinical NLP framework (cTakes, CLAMP) and 17 features for GATE, ScispaCy, and Textflows.

DISCUSSION:

34 recommendations are endorsed by at least 2 articles from our selection. Overall, 15 features were adopted by every NLP Framework. Nevertheless, frameworks based on WMS had a better compliance with the features.

CONCLUSION:

NLP frameworks could benefit from lessons learned from the bioinformatics field (eg, public repositories of curated tools and workflows or use of containers for shareability) to enhance the reproducibility in a clinical setting.

Asunto(s)

Procesamiento de Lenguaje Natural; Reproducibilidad de los Resultados; Biología Computacional; Sistemas de Administración de Bases de Datos; Informática Médica

Palabras clave

containerization; meaningful use; natural language processing; reproducibility of results; workflow

Texto completo

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Bases de datos: MEDLINE Asunto principal: Procesamiento de Lenguaje Natural / Reproducibilidad de los Resultados Tipo de estudio: Guideline / Prognostic_studies / Qualitative_research Idioma: En Revista: J Am Med Inform Assoc Asunto de la revista: INFORMATICA MEDICA Año: 2021 Tipo del documento: Article País de afiliación: Francia

Texto completo

Imprimir

XML

PubMed Links

Buscar en Google