Pesquisa | Biblioteca Virtual em Saúde

1.

Prompt Engineering Paradigms for Medical Applications: Scoping Review.

Zaghir, Jamil; Naguib, Marco; Bjelogrlic, Mina; Névéol, Aurélie; Tannier, Xavier; Lovis, Christian.

J Med Internet Res ; 26: e60501, 2024 Sep 10.

Artigo em Inglês | MEDLINE | ID: mdl-39255030

RESUMO

BACKGROUND: Prompt engineering, focusing on crafting effective prompts to large language models (LLMs), has garnered attention for its capabilities at harnessing the potential of LLMs. This is even more crucial in the medical domain due to its specialized terminology and language technicity. Clinical natural language processing applications must navigate complex language and ensure privacy compliance. Prompt engineering offers a novel approach by designing tailored prompts to guide models in exploiting clinically relevant information from complex medical texts. Despite its promise, the efficacy of prompt engineering in the medical domain remains to be fully explored. OBJECTIVE: The aim of the study is to review research efforts and technical approaches in prompt engineering for medical applications as well as provide an overview of opportunities and challenges for clinical practice. METHODS: Databases indexing the fields of medicine, computer science, and medical informatics were queried in order to identify relevant published papers. Since prompt engineering is an emerging field, preprint databases were also considered. Multiple data were extracted, such as the prompt paradigm, the involved LLMs, the languages of the study, the domain of the topic, the baselines, and several learning, design, and architecture strategies specific to prompt engineering. We include studies that apply prompt engineering-based methods to the medical domain, published between 2022 and 2024, and covering multiple prompt paradigms such as prompt learning (PL), prompt tuning (PT), and prompt design (PD). RESULTS: We included 114 recent prompt engineering studies. Among the 3 prompt paradigms, we have observed that PD is the most prevalent (78 papers). In 12 papers, PD, PL, and PT terms were used interchangeably. While ChatGPT is the most commonly used LLM, we have identified 7 studies using this LLM on a sensitive clinical data set. Chain-of-thought, present in 17 studies, emerges as the most frequent PD technique. While PL and PT papers typically provide a baseline for evaluating prompt-based approaches, 61% (48/78) of the PD studies do not report any nonprompt-related baseline. Finally, we individually examine each of the key prompt engineering-specific information reported across papers and find that many studies neglect to explicitly mention them, posing a challenge for advancing prompt engineering research. CONCLUSIONS: In addition to reporting on trends and the scientific landscape of prompt engineering, we provide reporting guidelines for future studies to help advance research in the medical field. We also disclose tables and figures summarizing medical prompt engineering papers available and hope that future contributions will leverage these existing works to better advance the field.

Assuntos

Processamento de Linguagem Natural , Humanos , Informática Médica/métodos

2.

Efficient Clinical Information Extraction from Breast Radiology Reports in French.

Zaghir, Jamil; Lokaj, Belinda; Kinkel, Karen; Djema, Amal-Dahila; Turbé, Hugues; Bjelogrlic, Mina; Durand de Gevigney, Valentin; Schmid, Jérôme; Lovis, Christian; Goldman, Jean-Philippe.

Stud Health Technol Inform ; 316: 1780-1784, 2024 Aug 22.

Artigo em Inglês | MEDLINE | ID: mdl-39176562

RESUMO

Radiology reports contain crucial patient information, in addition to images, that can be automatically extracted for secondary uses such as clinical support and research for diagnosis. We tested several classifiers to classify 1,218 breast MRI reports in French from two Swiss clinical centers. Logistic regression performed better for both internal (accuracy > 0.95 and macro-F1 > 0.86) and external data (accuracy > 0.81 and macro-F1 > 0.41). Automating this task will facilitate efficient extraction of targeted clinical parameters and provide a good basis for future annotation processes through automatic pre-annotation.

Assuntos

Neoplasias da Mama , Imageamento por Ressonância Magnética , Humanos , Feminino , Neoplasias da Mama/diagnóstico por imagem , França , Sistemas de Informação em Radiologia , Registros Eletrônicos de Saúde , Processamento de Linguagem Natural , Suíça , Mineração de Dados

3.

Unsupervised Extraction of Body-Text from Clinical PDF Documents.

Bensahla, Adel; Zaghir, Jamil; Gaudet-Blavignac, Christophe; Lovis, Christian.

Stud Health Technol Inform ; 316: 214-215, 2024 Aug 22.

Artigo em Inglês | MEDLINE | ID: mdl-39176711

RESUMO

Automatic extraction of body-text within clinical PDF documents is necessary to enhance downstream NLP tasks but remains a challenge. This study presents an unsupervised algorithm designed to extract body-text leveraging large volume of data. Using DBSCAN clustering over aggregate pages, our method extracts and organize text blocks using their content and coordinates. Evaluation results demonstrate precision scores ranging from 0.82 to 0.98, recall scores from 0.62 to 0.94, and F1-scores from 0.71 to 0.96 across various medical specialty sources. Future work includes dynamic parameter adjustments for improved accuracy and using larger datasets.

Assuntos

Processamento de Linguagem Natural , Algoritmos , Mineração de Dados/métodos , Humanos , Registros Eletrônicos de Saúde , Aprendizado de Máquina não Supervisionado

4.

Automatic Classification of Conclusions from Multi-Tracer Reports of PET Brain Imaging in Cognitive Impairment.

Goldman, Jean-Philippe; Jané, Pablo; Zaghir, Jamil; Pirazzo Andrade Teixeira, Eliluane; Peretti, Débora Elisa; Garibotto, Valentina; Lovis, Christian.

Stud Health Technol Inform ; 316: 560-564, 2024 Aug 22.

Artigo em Inglês | MEDLINE | ID: mdl-39176804

RESUMO

The goal of this paper is to build an automatic way to interpret conclusions from brain molecular imaging reports performed for investigation of cognitive disturbances (FDG, Amyloid and Tau PET) by comparing several traditional machine learning (ML) techniques-based text classification methods. Two purposes are defined: to identify positive or negative results in all three modalities, and to extract diagnostic impressions for Alzheimer's Disease (AD), Fronto-Temporal Dementia (FTD), Lewy Bodies Dementia (LBD) based on metabolism of perfusion patterns. A dataset was created by manual parallel annotation of 1668 conclusions of reports from the Nuclear Medicine and Molecular Imaging Division of Geneva University Hospitals. The 6 Machine Learning (ML) algorithms (Support Vector Machine (Linear and Radial Basis function), Naive Bayes, Logistic Regression, Random Forrest, and K-Nearest Neighbors) were trained and evaluated with a 5-fold cross-validation scheme to assess their performance and generalizability. The best classifier was SVM showing the following accuracies: FDG (0.97), Tau (0.94), Amyloid (0.98), Oriented Diagnostic (0.87 for a diagnosis among AD, FTD, LBD, undetermined, other), paving the way for a paradigm shift in the field of data handling in nuclear medicine research.

Assuntos

Disfunção Cognitiva , Tomografia por Emissão de Pósitrons , Humanos , Disfunção Cognitiva/diagnóstico por imagem , Disfunção Cognitiva/classificação , Encéfalo/diagnóstico por imagem , Aprendizado de Máquina , Doença de Alzheimer/diagnóstico por imagem , Doença de Alzheimer/classificação , Processamento de Linguagem Natural , Máquina de Vetores de Suporte , Sensibilidade e Especificidade , Suíça , Reprodutibilidade dos Testes

5.

Beyond Tokens: Fair Evaluation of French Large Language Models for Clinical Named Entity Recognition.

Zaghir, Jamil; Bjelogrlic, Mina; Goldman, Jean-Philippe; Bensahla, Adel; Zheng, Yuanyuan; Lovis, Christian.

Stud Health Technol Inform ; 316: 666-670, 2024 Aug 22.

Artigo em Inglês | MEDLINE | ID: mdl-39176830

RESUMO

Named Entity Recognition (NER) models based on Transformers have gained prominence for their impressive performance in various languages and domains. This work delves into the often-overlooked aspect of entity-level metrics and exposes significant discrepancies between token and entity-level evaluations. The study utilizes a corpus of synthetic French oncological reports annotated with entities representing oncological morphologies. Four different French BERT-based models are fine-tuned for token classification, and their performance is rigorously assessed at both token and entity-level. In addition to fine-tuning, we evaluate ChatGPT's ability to perform NER through prompt engineering techniques. The findings reveal a notable disparity in model effectiveness when transitioning from token to entity-level metrics, highlighting the importance of comprehensive evaluation methodologies in NER tasks. Furthermore, in comparison to BERT, ChatGPT remains limited when it comes to detecting advanced entities in French.

Assuntos

Processamento de Linguagem Natural , França , Humanos , Registros Eletrônicos de Saúde , Idioma , Neoplasias , Vocabulário Controlado

6.

MedFrenchmark, a Small Set for Benchmarking Generative LLMs in Medical French.

Quercia, Amandine; Zaghir, Jamil; Lovis, Christian; Gaudet-Blavignac, Christophe.

Stud Health Technol Inform ; 316: 601-605, 2024 Aug 22.

Artigo em Inglês | MEDLINE | ID: mdl-39176814

RESUMO

Generative Large Language Models (LLMs) have become ubiquitous in various fields, including healthcare and medicine. Consequently, there is growing interest in leveraging LLMs for medical applications, leading to the emergence of novel models daily. However, evaluation and benchmarking frameworks for LLMs are scarce, particularly those tailored for medical French. To address this gap, we introduce a minimal benchmark consisting of 114 open questions designed to assess the medical capabilities of LLMs in French. The proposed benchmark encompasses a wide range of medical domains, reflecting real-world clinical scenarios' complexity. A preliminary validation involved testing seven widely used LLMs with a parameter size of 7 billion. Results revealed significant variability in performance, emphasizing the importance of rigorous evaluation before deploying LLMs in medical settings. In conclusion, we present a novel and valuable resource for rapidly evaluating LLMs in medical French. By promoting greater accountability and standardization, this benchmark has the potential to enhance trustworthiness and utility in harnessing LLMs for medical applications.

Assuntos

Benchmarking , Simulação por Computador , França

7.

Exploring the Latest Highlights in Medical Natural Language Processing across Multiple Languages: A Survey.

Shaitarova, Anastassia; Zaghir, Jamil; Lavelli, Alberto; Krauthammer, Michael; Rinaldi, Fabio.

Yearb Med Inform ; 32(1): 230-243, 2023 Aug.

Artigo em Inglês | MEDLINE | ID: mdl-38147865

RESUMO

OBJECTIVES: This survey aims to provide an overview of the current state of biomedical and clinical Natural Language Processing (NLP) research and practice in Languages other than English (LoE). We pay special attention to data resources, language models, and popular NLP downstream tasks. METHODS: We explore the literature on clinical and biomedical NLP from the years 2020-2022, focusing on the challenges of multilinguality and LoE. We query online databases and manually select relevant publications. We also use recent NLP review papers to identify the possible information lacunae. RESULTS: Our work confirms the recent trend towards the use of transformer-based language models for a variety of NLP tasks in medical domains. In addition, there has been an increase in the availability of annotated datasets for clinical NLP in LoE, particularly in European languages such as Spanish, German and French. Common NLP tasks addressed in medical NLP research in LoE include information extraction, named entity recognition, normalization, linking, and negation detection. However, there is still a need for the development of annotated datasets and models specifically tailored to the unique characteristics and challenges of medical text in some of these languages, especially low-resources ones. Lastly, this survey highlights the progress of medical NLP in LoE, and helps at identifying opportunities for future research and development in this field.

Assuntos

Pesquisa Biomédica , Idioma , Processamento de Linguagem Natural , Bases de Dados Factuais , Armazenamento e Recuperação da Informação

8.

Caregivers Interactions with Clinical Autocomplete Tool: A Retrospective Study.

Zaghir, Jamil; Goldman, Jean-Philippe; Bjelogrlic, Mina; Gaudet-Blavignac, Christophe; Lovis, Christian.

Stud Health Technol Inform ; 295: 132-135, 2022 Jun 29.

Artigo em Inglês | MEDLINE | ID: mdl-35773825

RESUMO

Hospital caregivers report patient data while being under constant pressure. These records include structured information, with some of them being derived from a restricted list of terms. Finding the right term from a large terminology can be time-consuming, harming the clinician's productivity. To deal with this hurdle, an autocomplete system is employed, providing the closest terms after a prefix is typed. While this software application clearly smoothens the term searching, this paper studies the influences of the tool on caregivers' reporting, inspecting the evolution of their typing conduct over time.

Assuntos

Cuidadores , Software , Hospitais , Humanos , Estudos Retrospectivos

9.

Deep SNOMED CT Enabled Large Clinical Database About COVID-19.

Gaudet-Blavignac, Christophe; Ehrsam, Julien; Turbe, Hugues; Keszthelyi, Daniel; Zaghir, Jamil; Lovis, Christian.

Stud Health Technol Inform ; 294: 317-321, 2022 May 25.

Artigo em Inglês | MEDLINE | ID: mdl-35612084

RESUMO

In spring 2020, as the COVID-19 pandemic is in its first wave in Europe, the University hospitals of Geneva (HUG) is tasked to take care of all Covid inpatients of the Geneva canton. It is a crisis with very little tools to support decision-taking authorities, and very little is known about the Covid disease. The need to know more, and fast, highlighted numerous challenges in the whole data pipeline processes. This paper describes the decisions taken and processes developed to build a unified database to support several secondary usages of clinical data, including governance and research. HUG had to answer to 5 major waves of COVID-19 patients since the beginning of 2020. In this context, a database for COVID-19 related data has been created to support the governance of the hospital in their answer to this crisis. The principles about this database were a) a clearly defined cohort; b) a clearly defined dataset and c) a clearly defined semantics. This approach resulted in more than 28 000 variables encoded in SNOMED CT and 1 540 human readable labels. It covers more than 216 000 patients and 590 000 inpatient stays. This database is used daily since the beginning of the pandemic to feed the "Predict" dashboards of HUG and prediction reports as well as several research projects.

Assuntos

COVID-19 , Systematized Nomenclature of Medicine , Bases de Dados Factuais , Humanos , Pandemias , Semântica

10.

Classification of Oncology Treatment Responses from French Radiology Reports with Supervised Machine Learning.

Goldman, Jean-Philippe; Mottin, Luc; Zaghir, Jamil; Keszthelyi, Daniel; Lokaj, Belinda; Turbé, Hugues; Gobeil, Julien; Ruch, Patrick; Ehrsam, Julien; Lovis, Christian.

Stud Health Technol Inform ; 294: 849-853, 2022 May 25.

Artigo em Inglês | MEDLINE | ID: mdl-35612224

RESUMO

The present study shows first attempts to automatically classify oncology treatment responses on the basis of the textual conclusion sections of radiology reports according to the RECIST classification. After a robust and extended manual annotation of 543 conclusion sections (5-to-50-word long), and after the training of several machine learning techniques (from traditional machine learning to deep learning), the best results show an accuracy score of 0.90 for a two-class classification (non-progressive vs. progressive disease) and of 0.82 for a four-class classification (complete response, partial response, stable disease, progressive disease) both with Logistic Regression approach. Some innovative solutions are further suggested to improve these scores in the future.

Assuntos

Radiologia , Aprendizado de Máquina , Processamento de Linguagem Natural , Radiografia , Relatório de Pesquisa , Aprendizado de Máquina Supervisionado

11.

Performance of Machine Learning Methods to Classify French Medical Publications.

Zaghir, Jamil; Goldman, Jean-Philippe; Bjelogrlic, Mina; Keszthelyi, Daniel; Gaudet-Blavignac, Christophe; Turbé, Hugues; Lokaj, Belinda; Lovis, Christian.

Stud Health Technol Inform ; 294: 874-875, 2022 May 25.

Artigo em Inglês | MEDLINE | ID: mdl-35612232

RESUMO

Many medical narratives are read by care professionals in their preferred language. These documents can be produced by organizations, authorities or national publishers. However, they are often hardly findable using the usual query engines based on English such as PubMed. This work explores the possibility to automatically categorize medical documents in French following an automatic Natural Language Processing pipeline. The pipeline is used to compare the performance of 6 different machine learning and deep neural network approaches on a large dataset of peer-reviewed weekly published Swiss medical journal in French covering major topics in medicine over the last 15 years. An accuracy of 96% was achieved for 5-topic classification and 81% for 20-topic classification.

Assuntos

Aprendizado de Máquina , Processamento de Linguagem Natural , Idioma , Redes Neurais de Computação , PubMed

12.

A Lightweight and Interpretable Model to Classify Bundle Branch Blocks from ECG Signals.

Turbé, Hugues; Bjelogrlic, Mina; Namdar, Mehdi; Gaudet-Blavignac, Christophe; Zaghir, Jamil; Goldman, Jean-Philippe; Lokaj, Belinda; Lovis, Christian.

Stud Health Technol Inform ; 294: 43-47, 2022 May 25.

Artigo em Inglês | MEDLINE | ID: mdl-35612013

RESUMO

Automatic classification of ECG signals has been a longtime research area with large progress having been made recently. However these advances have been achieved with increasingly complex models at the expense of model's interpretability. In this research, a new model based on multivariate autoregressive model (MAR) coefficients combined with a tree-based model to classify bundle branch blocks is proposed. The advantage of the presented approach is to build a lightweight model which combined with post-hoc interpretability can bring new insights into important cross-lead dependencies which are indicative of the diseases of interest.

Assuntos

Bloqueio de Ramo , Eletrocardiografia , Algoritmos , Bloqueio de Ramo/diagnóstico , Humanos

13.

Real-world Patient Trajectory Prediction from Clinical Notes Using Artificial Neural Networks and UMLS-Based Extraction of Concepts.

Zaghir, Jamil; Rodrigues-Jr, Jose F; Goeuriot, Lorraine; Amer-Yahia, Sihem.

J Healthc Inform Res ; 5(4): 474-496, 2021 Dec.

Artigo em Inglês | MEDLINE | ID: mdl-35419508

RESUMO

As more data is generated from medical attendances and as Artificial Neural Networks gain momentum in research and industry, computer-aided medical prognosis has become a promising technology. A common approach to perform automated prognoses relies on textual clinical notes extracted from Electronic Health Records (EHRs). Data from EHRs are fed to neural networks that produce a set with the most probable medical problems to which a patient is subject in her/his clinical future, including clinical conditions, mortality, and readmission. Following this research line, we introduce a methodology that takes advantage of the unstructured text found in clinical notes by applying preprocessing, concepts extraction, and fine-tuned neural networks to predict the most probable medical problems to follow in a patient's clinical trajectory. Different from former works that focus on word embeddings and raw sets of extracted concepts, we generate a refined set of Unified Medical Language System (UMLS) concepts by applying a similarity threshold filter and a list of acceptable concept types. In our prediction experiments, our method demonstrated AUC-ROC performance of 0.91 for diagnosis codes, 0.93 for mortality, and 0.72 for readmission, determining an efficacy that rivals state-of-the-art works. Our findings contribute to the development of automated prognosis systems in hospitals where text is the main source of clinical history.

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA