Pre-trained language models in medicine: A survey.

Luo, Xudong; Deng, Zhiqi; Yang, Binxia; Luo, Michael Y

Luo, Xudong; Deng, Zhiqi; Yang, Binxia; Luo, Michael Y.

Affiliation

Luo X; School of Computer Science and Engineering, Guangxi Normal University, Guilin 541004, China; Guangxi Key Lab of Multi-source Information Mining, Guangxi Normal University, Guilin 541004, China; Key Laboratory of Education Blockchain and Intelligent Technology, Ministry of Education, Guangxi Normal U
Deng Z; School of Computer Science and Engineering, Guangxi Normal University, Guilin 541004, China; Guangxi Key Lab of Multi-source Information Mining, Guangxi Normal University, Guilin 541004, China; Key Laboratory of Education Blockchain and Intelligent Technology, Ministry of Education, Guangxi Normal U
Yang B; School of Computer Science and Engineering, Guangxi Normal University, Guilin 541004, China; Guangxi Key Lab of Multi-source Information Mining, Guangxi Normal University, Guilin 541004, China; Key Laboratory of Education Blockchain and Intelligent Technology, Ministry of Education, Guangxi Normal U
Luo MY; Emmanuel College, Cambridge University, Cambridge, CB2 3AP, UK. Electronic address: myl41@cam.ac.uk.

Artif Intell Med ; 154: 102904, 2024 08.

Article de En | MEDLINE | ID: mdl-38917600

ABSTRACT

ABSTRACT

With the rapid progress in Natural Language Processing (NLP), Pre-trained Language Models (PLM) such as BERT, BioBERT, and ChatGPT have shown great potential in various medical NLP tasks. This paper surveys the cutting-edge achievements in applying PLMs to various medical NLP tasks. Specifically, we first brief PLMS and outline the research of PLMs in medicine. Next, we categorise and discuss the types of tasks in medical NLP, covering text summarisation, question-answering, machine translation, sentiment analysis, named entity recognition, information extraction, medical education, relation extraction, and text mining. For each type of task, we first provide an overview of the basic concepts, the main methodologies, the advantages of applying PLMs, the basic steps of applying PLMs application, the datasets for training and testing, and the metrics for task evaluation. Subsequently, a summary of recent important research findings is presented, analysing their motivations, strengths vs weaknesses, similarities vs differences, and discussing potential limitations. Also, we assess the quality and influence of the research reviewed in this paper by comparing the citation count of the papers reviewed and the reputation and impact of the conferences and journals where they are published. Through these indicators, we further identify the most concerned research topics currently. Finally, we look forward to future research directions, including enhancing models' reliability, explainability, and fairness, to promote the application of PLMs in clinical practice. In addition, this survey also collect some download links of some model codes and the relevant datasets, which are valuable references for researchers applying NLP techniques in medicine and medical professionals seeking to enhance their expertise and healthcare service through AI technology.

Sujet(s)
Mots clés

BERT; GPT; Healthcare; Medical science; Natural language processing; Pre-trained language model

Texte intégral

Ajouter à My VHL

Imprimer

XML

PubMed Links

Recherche sur Google

Texte intégral: 1 Collection: 01-internacional Base de données: MEDLINE Sujet principal: Traitement du langage naturel / Fouille de données Limites: Humans Langue: En Journal: Artif Intell Med Sujet du journal: INFORMATICA MEDICA Année: 2024 Type de document: Article Pays de publication: Pays-Bas

Texte intégral

Ajouter à My VHL

Imprimer

XML

PubMed Links

Recherche sur Google