Your browser doesn't support javascript.
loading
Privacy-preserving large language models for structured medical information retrieval.
Wiest, Isabella Catharina; Ferber, Dyke; Zhu, Jiefu; van Treeck, Marko; Meyer, Sonja K; Juglan, Radhika; Carrero, Zunamys I; Paech, Daniel; Kleesiek, Jens; Ebert, Matthias P; Truhn, Daniel; Kather, Jakob Nikolas.
Afiliação
  • Wiest IC; Department of Medicine II, Medical Faculty Mannheim, Heidelberg University, Mannheim, Germany.
  • Ferber D; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany.
  • Zhu J; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany.
  • van Treeck M; Department of Medical Oncology, National Center for Tumor Diseases (NCT), Heidelberg University Hospital, Heidelberg, Germany.
  • Meyer SK; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany.
  • Juglan R; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany.
  • Carrero ZI; Department of Surgery I, University Hospital Würzburg, Würzburg, Germany.
  • Paech D; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany.
  • Kleesiek J; Else Kroener Fresenius Center for Digital Health, Faculty of Medicine and University Hospital Carl Gustav Carus, TUD Dresden University of Technology, Dresden, Germany.
  • Ebert MP; German Cancer Research Center, Division of Radiology, Heidelberg, Germany.
  • Truhn D; University Hospital Bonn, Clinic for Neuroradiology, Bonn, Germany.
  • Kather JN; Institut für KI in der Medizin (IKIM), Universitätsmedizin Essen, Girardetstr. 2, 45131, Essen, Germany.
NPJ Digit Med ; 7(1): 257, 2024 Sep 20.
Article em En | MEDLINE | ID: mdl-39304709
ABSTRACT
Most clinical information is encoded as free text, not accessible for quantitative analysis. This study presents an open-source pipeline using the local large language model (LLM) "Llama 2" to extract quantitative information from clinical text and evaluates its performance in identifying features of decompensated liver cirrhosis. The LLM identified five key clinical features in a zero- and one-shot manner from 500 patient medical histories in the MIMIC IV dataset. We compared LLMs of three sizes and various prompt engineering approaches, with predictions compared against ground truth from three blinded medical experts. Our pipeline achieved high accuracy, detecting liver cirrhosis with 100% sensitivity and 96% specificity. High sensitivities and specificities were also yielded for detecting ascites (95%, 95%), confusion (76%, 94%), abdominal pain (84%, 97%), and shortness of breath (87%, 97%) using the 70 billion parameter model, which outperformed smaller versions. Our study successfully demonstrates the capability of locally deployed LLMs to extract clinical information from free text with low hardware requirements.

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article