Búsqueda | Biblioteca Virtual en Salud

Processing of Short-Form Content in Clinical Narratives: Systematic Scoping Review.

Kugic, Amila; Martin, Ingrid; Modersohn, Luise; Pallaoro, Peter; Kreuzthaler, Markus; Schulz, Stefan; Boeker, Martin.

J Med Internet Res ; 26: e57852, 2024 Sep 26.

Artículo en Inglés | MEDLINE | ID: mdl-39325515

RESUMEN

BACKGROUND: Clinical narratives are essential components of electronic health records. The adoption of electronic health records has increased documentation time for hospital staff, leading to the use of abbreviations and acronyms more frequently. This brevity can potentially hinder comprehension for both professionals and patients. OBJECTIVE: This review aims to provide an overview of the types of short forms found in clinical narratives, as well as the natural language processing (NLP) techniques used for their identification, expansion, and disambiguation. METHODS: In the databases Web of Science, Embase, MEDLINE, EBMR (Evidence-Based Medicine Reviews), and ACL Anthology, publications that met the inclusion criteria were searched according to PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) guidelines for a systematic scoping review. Original, peer-reviewed publications focusing on short-form processing in human clinical narratives were included, covering the period from January 2018 to February 2023. Short-form types were extracted, and multidimensional research methodologies were assigned to each target objective (identification, expansion, and disambiguation). NLP study recommendations and study characteristics were systematically assigned occurrence rates for evaluation. RESULTS: Out of a total of 6639 records, only 19 articles were included in the final analysis. Rule-based approaches were predominantly used for identifying short forms, while string similarity and vector representations were applied for expansion. Embeddings and deep learning approaches were used for disambiguation. CONCLUSIONS: The scope and types of what constitutes a clinical short form were often not explicitly defined by the authors. This lack of definition poses challenges for reproducibility and for determining whether specific methodologies are suitable for different types of short forms. Analysis of a subset of NLP recommendations for assessing quality and reproducibility revealed only partial adherence to these recommendations. Single-character abbreviations were underrepresented in studies on clinical narrative processing, as were investigations in languages other than English. Future research should focus on these 2 areas, and each paper should include descriptions of the types of content analyzed.

Asunto(s)

Registros Electrónicos de Salud , Narración , Procesamiento de Lenguaje Natural , Humanos

O2 supplementation disambiguation in clinical narratives to support retrospective COVID-19 studies.

Abdulnazar, Akhila; Kugic, Amila; Schulz, Stefan; Stadlbauer, Vanessa; Kreuzthaler, Markus.

BMC Med Inform Decis Mak ; 24(1): 29, 2024 Jan 31.

Artículo en Inglés | MEDLINE | ID: mdl-38297364

RESUMEN

BACKGROUND: Oxygen saturation, a key indicator of COVID-19 severity, poses challenges, especially in cases of silent hypoxemia. Electronic health records (EHRs) often contain supplemental oxygen information within clinical narratives. Streamlining patient identification based on oxygen levels is crucial for COVID-19 research, underscoring the need for automated classifiers in discharge summaries to ease the manual review burden on physicians. METHOD: We analysed text lines extracted from anonymised COVID-19 patient discharge summaries in German to perform a binary classification task, differentiating patients who received oxygen supplementation and those who did not. Various machine learning (ML) algorithms, including classical ML to deep learning (DL) models, were compared. Classifier decisions were explained using Local Interpretable Model-agnostic Explanations (LIME), which visualize the model decisions. RESULT: Classical ML to DL models achieved comparable performance in classification, with an F-measure varying between 0.942 and 0.955, whereas the classical ML approaches were faster. Visualisation of embedding representation of input data reveals notable variations in the encoding patterns between classic and DL encoders. Furthermore, LIME explanations provide insights into the most relevant features at token level that contribute to these observed differences. CONCLUSION: Despite a general tendency towards deep learning, these use cases show that classical approaches yield comparable results at lower computational cost. Model prediction explanations using LIME in textual and visual layouts provided a qualitative explanation for the model performance.

Asunto(s)

COVID-19 , Compuestos de Calcio , Óxidos , Humanos , Estudios Retrospectivos , Oxígeno , Suplementos Dietéticos

Embedding-based terminology expansion via secondary use of large clinical real-world datasets.

Kugic, Amila; Pfeifer, Bastian; Schulz, Stefan; Kreuzthaler, Markus.

J Biomed Inform ; 147: 104497, 2023 11.

Artículo en Inglés | MEDLINE | ID: mdl-37777164

RESUMEN

A log-likelihood based co-occurrence analysis of â¼1.9 million de-identified ICD-10 codes and related short textual problem list entries generated possible term candidates at a significance level of p<0.01. These top 10 term candidates, consisting of 1 to 5-grams, were used as seed terms for an embedding based nearest neighbor approach to fetch additional synonyms, hypernyms and hyponyms in the respective n-gram embedding spaces by leveraging two different language models. This was done to analyze the lexicality of the resulting term candidates and to compare the term classifications of both models. We found no difference in system performance during the processing of lexical and non-lexical content, i.e. abbreviations, acronyms, etc. Additionally, an application-oriented analysis of the SapBERT (Self-Alignment Pretraining for Biomedical Entity Representations) language model indicates suitable performance for the extraction of all term classifications such as synonyms, hypernyms, and hyponyms.

Asunto(s)

Lenguaje , Procesamiento de Lenguaje Natural , Funciones de Verosimilitud , Análisis por Conglomerados

Term Candidate Generation to Enrich Clinical Terminologies with Large Language Models.

Kugic, Amila; Schulz, Stefan; Kreuzthaler, Markus.

Stud Health Technol Inform ; 316: 695-699, 2024 Aug 22.

Artículo en Inglés | MEDLINE | ID: mdl-39176890

RESUMEN

Annotated language resources derived from clinical routine documentation form an intriguing asset for secondary use case scenarios. In this investigation, we report on how such a resource can be leveraged to identify additional term candidates for a chosen set of ICD-10 codes. We conducted a log-likelihood analysis, considering the co-occurrence of approximately 1.9 million de-identified ICD-10 codes alongside corresponding brief textual entries from problem lists in German. This analysis aimed to identify potential candidates with statistical significance set at p < 0.01, which were used as seed terms to harvest additional candidates by interfacing to a large language model in a second step. The proposed approach can identify additional term candidates at suitable performance values: hypernyms MAP@5=0.801, synonyms MAP@5 = 0.723 and hyponyms MAP@5 = 0.507. The re-use of existing annotated clinical datasets, in combination with large language models, presents an interesting strategy to bridge the lexical gap in standardized clinical terminologies and real-world jargon.

Asunto(s)

Clasificación Internacional de Enfermedades , Procesamiento de Lenguaje Natural , Vocabulario Controlado , Humanos , Terminología como Asunto , Registros Electrónicos de Salud/clasificación , Alemania

Disambiguation of acronyms in clinical narratives with large language models.

Kugic, Amila; Schulz, Stefan; Kreuzthaler, Markus.

J Am Med Inform Assoc ; 31(9): 2040-2046, 2024 Sep 01.

Artículo en Inglés | MEDLINE | ID: mdl-38917444

RESUMEN

OBJECTIVE: To assess the performance of large language models (LLMs) for zero-shot disambiguation of acronyms in clinical narratives. MATERIALS AND METHODS: Clinical narratives in English, German, and Portuguese were applied for testing the performance of four LLMs: GPT-3.5, GPT-4, Llama-2-7b-chat, and Llama-2-70b-chat. For English, the anonymized Clinical Abbreviation Sense Inventory (CASI, University of Minnesota) was used. For German and Portuguese, at least 500 text spans were processed. The output of LLM models, prompted with contextual information, was analyzed to compare their acronym disambiguation capability, grouped by document-level metadata, the source language, and the LLM. RESULTS: On CASI, GPT-3.5 achieved 0.91 in accuracy. GPT-4 outperformed GPT-3.5 across all datasets, reaching 0.98 in accuracy for CASI, 0.86 and 0.65 for two German datasets, and 0.88 for Portuguese. Llama models only reached 0.73 for CASI and failed severely for German and Portuguese. Across LLMs, performance decreased from English to German and Portuguese processing languages. There was no evidence that additional document-level metadata had a significant effect. CONCLUSION: For English clinical narratives, acronym resolution by GPT-4 can be recommended to improve readability of clinical text by patients and professionals. For German and Portuguese, better models are needed. Llama models, which are particularly interesting for processing sensitive content on premise, cannot yet be recommended for acronym resolution.

Asunto(s)

Abreviaturas como Asunto , Procesamiento de Lenguaje Natural , Humanos , Lenguaje , Narración , Registros Electrónicos de Salud

Clinical Acronym Disambiguation via ChatGPT and BING.

Kugic, Amila; Kreuzthaler, Markus; Schulz, Stefan.

Stud Health Technol Inform ; 309: 78-82, 2023 Oct 20.

Artículo en Inglés | MEDLINE | ID: mdl-37869810

RESUMEN

Clinical texts are written with acronyms, abbreviations and medical jargon expressions to save time. This hinders full comprehension not just for medical experts but also laypeople. This paper attempts to disambiguate acronyms with their given context by comparing a web mining approach via the search engine BING and a conversational agent approach using ChatGPT with the aim to see, if these methods can supply a viable resolution for the input acronym. Both approaches are automated via application programming interfaces. Possible term candidates are extracted using natural language processing-oriented functionality. The conversational agent approach surpasses the baseline for web mining without plausibility thresholds in precision, recall and F1-measure, while scoring similarly only in precision for high threshold values.

Asunto(s)

Procesamiento de Lenguaje Natural , Programas Informáticos , Motor de Búsqueda , Comunicación , Escritura

Data-Driven Identification of Clinical Real-World Expressions Linked to ICD.

Kugic, Amila; Pfeifer, Bastian; Schulz, Stefan; Kreuzthaler, Markus.

Stud Health Technol Inform ; 302: 827-828, 2023 May 18.

Artículo en Inglés | MEDLINE | ID: mdl-37203508

RESUMEN

A semi-structured clinical problem list containing â¼1.9 million de-identified entries linked to ICD-10 codes was used to identify closely related real-world expressions. A log-likelihood based co-occurrence analysis generated seed-terms, which were integrated as part of a k-NN search, by leveraging SapBERT for the generation of an embedding representation.

Asunto(s)

Registros Electrónicos de Salud , Procesamiento de Lenguaje Natural , Funciones de Verosimilitud

Evaluating shallow and deep learning strategies for the 2018 n2c2 shared task on clinical text classification.

Oleynik, Michel; Kugic, Amila; Kasác, Zdenko; Kreuzthaler, Markus.

J Am Med Inform Assoc ; 26(11): 1247-1254, 2019 11 01.

Artículo en Inglés | MEDLINE | ID: mdl-31512729

RESUMEN

OBJECTIVE: Automated clinical phenotyping is challenging because word-based features quickly turn it into a high-dimensional problem, in which the small, privacy-restricted, training datasets might lead to overfitting. Pretrained embeddings might solve this issue by reusing input representation schemes trained on a larger dataset. We sought to evaluate shallow and deep learning text classifiers and the impact of pretrained embeddings in a small clinical dataset. MATERIALS AND METHODS: We participated in the 2018 National NLP Clinical Challenges (n2c2) Shared Task on cohort selection and received an annotated dataset with medical narratives of 202 patients for multilabel binary text classification. We set our baseline to a majority classifier, to which we compared a rule-based classifier and orthogonal machine learning strategies: support vector machines, logistic regression, and long short-term memory neural networks. We evaluated logistic regression and long short-term memory using both self-trained and pretrained BioWordVec word embeddings as input representation schemes. RESULTS: Rule-based classifier showed the highest overall micro F1 score (0.9100), with which we finished first in the challenge. Shallow machine learning strategies showed lower overall micro F1 scores, but still higher than deep learning strategies and the baseline. We could not show a difference in classification efficiency between self-trained and pretrained embeddings. DISCUSSION: Clinical context, negation, and value-based criteria hindered shallow machine learning approaches, while deep learning strategies could not capture the term diversity due to the small training dataset. CONCLUSION: Shallow methods for clinical phenotyping can still outperform deep learning methods in small imbalanced data, even when supported by pretrained embeddings.

Asunto(s)

Ensayos Clínicos como Asunto/métodos , Minería de Datos/métodos , Aprendizaje Automático , Procesamiento de Lenguaje Natural , Selección de Paciente , Clasificación , Aprendizaje Profundo , Humanos , Modelos Logísticos , Redes Neurales de la Computación

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

Detalles de la búsqueda