Large language models for biomedicine: foundations, opportunities, challenges, and best practices.

Sahoo, Satya S; Plasek, Joseph M; Xu, Hua; Uzuner, Özlem; Cohen, Trevor; Yetisgen, Meliha; Liu, Hongfang; Meystre, Stéphane; Wang, Yanshan

Affiliations

Sahoo SS; Department of Population and Quantitative Health Sciences, School of Medicine, Case Western Reserve University, Cleveland, OH 44122, United States.
Plasek JM; Division of General Internal Medicine and Primary Care, Brigham and Women's Hospital, Harvard Medical School, Cambridge, MA 02115, United States.
Xu H; Section of Biomedical Informatics and Data Science, School of Medicine, Yale University, New Haven, CT 06510, United States.
Uzuner Ö; Department of Information Sciences and Technology, George Mason University, Fairfax, VA 22030, United States.
Cohen T; Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA 98109, United States.
Yetisgen M; Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, WA 98109, United States.
Liu H; Department of Health Data Science and Artificial Intelligence, The University of Texas Health Science Center at Houston, Houston, TX 77030, United States.
Meystre S; Dipartimento tecnologie innovative, Institute of Digital Technologies for Personalised Healthcare, University of Applied Sciences and Arts of Southern Switzerland, Lugano, 6962, Switzerland.
Wang Y; Department of Health Information Management, University of Pittsburgh, Pittsburgh, PA 15260, United States.

J Am Med Inform Assoc ; 31(9): 2114-2124, 2024 Sep 01.

Article in English | MEDLINE | ID: mdl-38657567

ABSTRACT

OBJECTIVES:

Generative large language models (LLMs) are a subset of transformer-based neural network architecture models. LLMs have successfully leveraged a combination of an increased number of parameters, improvements in computational efficiency, and large pre-training datasets to perform a wide spectrum of natural language processing (NLP) tasks. Using a few examples (few-shot) or no examples (zero-shot) for prompt-tuning has enabled LLMs to achieve state-of-the-art performance in a broad range of NLP applications. This article by the American Medical Informatics Association (AMIA) NLP Working Group characterizes the opportunities, challenges, and best practices for our community to leverage and advance the integration of LLMs in downstream NLP applications effectively. This integration can be accomplished through a variety of approaches, including augmented prompting, instruction prompt tuning, and reinforcement learning from human feedback (RLHF).

TARGET AUDIENCE:

Our focus is on making LLMs accessible to the broader biomedical informatics community, including clinicians and researchers who may be unfamiliar with NLP. Additionally, NLP practitioners may gain insight from the described best practices.

SCOPE:

We focus on 3 broad categories of NLP tasks, namely natural language understanding, natural language inferencing, and natural language generation. We review the emerging trends in prompt tuning, instruction fine-tuning, and evaluation metrics used for LLMs while drawing attention to several issues that impact biomedical NLP applications, including falsehoods in generated text (confabulation/hallucinations), toxicity, and dataset contamination leading to overfitting. We also review potential approaches to address some of these current challenges in LLMs, such as chain-of-thought prompting, and the phenomenon of emergent capabilities observed in LLMs that can be leveraged to address complex NLP challenges in biomedical applications.
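As an illustration of the zero-shot and few-shot prompting mentioned in the abstract, the following minimal Python sketch assembles a prompt with or without labeled examples. It is not taken from the article: the function name `build_prompt`, the `Text:`/`Label:` layout, and the sample sentences are all hypothetical, and no particular LLM or API is assumed.

```python
def build_prompt(task, query, examples=()):
    """Assemble a plain-text prompt (hypothetical layout).

    With no examples the prompt is zero-shot; with a handful of
    (text, label) pairs it becomes few-shot.
    """
    lines = [task, ""]
    for text, label in examples:
        # Each demonstration shows the model an input and its expected label.
        lines.append(f"Text: {text}")
        lines.append(f"Label: {label}")
        lines.append("")
    # The query is left unlabeled so the model completes the final label.
    lines.append(f"Text: {query}")
    lines.append("Label:")
    return "\n".join(lines)


# Illustrative task description and sentences (invented for this sketch).
TASK = "Decide whether the sentence mentions a medication. Answer yes or no."

zero_shot = build_prompt(TASK, "Patient was started on metformin.")

few_shot = build_prompt(
    TASK,
    "Patient was started on metformin.",
    examples=[
        ("She takes lisinopril daily.", "yes"),
        ("He reports mild headache.", "no"),
    ],
)
```

The resulting string would be sent to whichever model is in use; the only difference between the two settings is whether labeled demonstrations precede the query.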

Subjects

Natural Language Processing; Neural Networks, Computer; Medical Informatics; Humans

Keywords

clinical natural language processing; large language models; medical informatics applications; transfer learning; transformer neural networks

Full text: 1
Database: MEDLINE
Main subject: Natural Language Processing / Neural Networks, Computer
Limit: Humans
Language: En
Journal: J Am Med Inform Assoc
Journal subject: MEDICAL INFORMATICS
Publication year: 2024
Document type: Article
Affiliation country: United States