
Medication event extraction in clinical notes: Contribution of the WisPerMed team to the n2c2 2022 challenge.

Schäfer, Henning; Idrissi-Yaghir, Ahmad; Bewersdorff, Jeanette; Frihat, Sameh; Friedrich, Christoph M; Zesch, Torsten.

J Biomed Inform; 143: 104400, 2023 Jul.

Article in English | MEDLINE | ID: mdl-37211196

ABSTRACT

In this work, we describe the findings of the 'WisPerMed' team from their participation in Track 1 (Contextualized Medication Event Extraction) of the n2c2 2022 challenge. We tackle two tasks: (i) medication extraction, which involves extracting all mentions of medications from the clinical notes, and (ii) event classification, which involves classifying the medication mentions based on whether a change in the medication has been discussed. Clinical texts often exceed the maximum token length that transformer-based models can handle. To address this, various approaches are employed, such as ClinicalBERT with a sliding window and Longformer-based models. In addition, domain adaptation through masked language modeling and preprocessing steps such as sentence splitting are used to improve model performance. Since both tasks were treated as named entity recognition (NER) problems, a sanity check was performed in the second release to eliminate possible weaknesses in the medication detection itself. This check used the medication spans to remove false positive predictions and to assign each missed token the disposition type with the highest softmax probability. The effectiveness of these approaches is evaluated through multiple submissions to the tasks, as well as with post-challenge results, with a focus on the DeBERTa v3 model and its disentangled attention mechanism. Results show that the DeBERTa v3 model performs well in both the NER task and the event classification task.
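
The sliding window idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the team's implementation: the function name `sliding_windows`, the window size of 512 tokens, and the overlap of 128 tokens are assumptions chosen for illustration only. The sketch computes overlapping token ranges so that a note longer than the model limit can be processed piece by piece.

```python
from typing import List, Tuple


def sliding_windows(n_tokens: int, max_len: int = 512, overlap: int = 128) -> List[Tuple[int, int]]:
    """Return (start, end) token offsets of overlapping windows covering n_tokens.

    max_len and overlap are illustrative defaults, not values from the paper.
    """
    if max_len <= 0 or not 0 <= overlap < max_len:
        raise ValueError("need max_len > 0 and 0 <= overlap < max_len")
    if n_tokens <= 0:
        return []
    windows = []
    start = 0
    while True:
        end = min(start + max_len, n_tokens)
        windows.append((start, end))
        if end >= n_tokens:
            break
        # Step back so consecutive windows share `overlap` tokens of context.
        start = end - overlap
    return windows


if __name__ == "__main__":
    # A hypothetical 1000-token note split into windows of at most 512 tokens.
    for start, end in sliding_windows(1000):
        print(start, end)
```

Each window would then be passed to the model separately, and predictions for tokens in the overlapping regions would need to be reconciled, for example by keeping the prediction from the window in which the token has more surrounding context.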

Subjects

Language; Natural Language Processing

ROCOv2: Radiology Objects in COntext Version 2, an Updated Multimodal Image Dataset.

Rückert, Johannes; Bloch, Louise; Brüngel, Raphael; Idrissi-Yaghir, Ahmad; Schäfer, Henning; Schmidt, Cynthia S; Koitka, Sven; Pelka, Obioma; Abacha, Asma Ben; G Seco de Herrera, Alba; Müller, Henning; Horn, Peter A; Nensa, Felix; Friedrich, Christoph M.

Sci Data; 11(1): 688, 2024 Jun 26.

Article in English | MEDLINE | ID: mdl-38926396

ABSTRACT

Automated medical image analysis systems often require large amounts of training data with high-quality labels, which are difficult and time-consuming to generate. This paper introduces Radiology Objects in COntext version 2 (ROCOv2), a multimodal dataset consisting of radiological images and associated medical concepts and captions extracted from the PMC Open Access subset. It is an updated version of the ROCO dataset published in 2018 and includes 35,705 new images that have been added to PMC since 2018. It further provides manually curated concepts for imaging modalities, with additional anatomical and directional concepts for X-rays. The dataset consists of 79,789 images and has been used, with minor modifications, in the concept detection and caption prediction tasks of ImageCLEFmedical Caption 2023. The dataset is suitable for training image annotation models based on image-caption pairs, or for multi-label image classification using the Unified Medical Language System (UMLS) concepts provided with each image. In addition, it can be used for pre-training medical domain models and for evaluating deep learning models for multi-task learning.

Subjects

Multimodal Imaging; Radiology; Humans; Image Processing, Computer-Assisted; Unified Medical Language System
