Comparisons of Quality, Correctness, and Similarity Between ChatGPT-Generated and Human-Written Abstracts for Basic Research: Cross-Sectional Study.

Cheng, Shu-Li; Tsai, Shih-Jen; Bai, Ya-Mei; Ko, Chih-Hung; Hsu, Chih-Wei; Yang, Fu-Chi; Tsai, Chia-Kuang; Tu, Yu-Kang; Yang, Szu-Nian; Tseng, Ping-Tao; Hsu, Tien-Wei; Liang, Chih-Sung; Su, Kuan-Pin

Cheng, Shu-Li; Tsai, Shih-Jen; Bai, Ya-Mei; Ko, Chih-Hung; Hsu, Chih-Wei; Yang, Fu-Chi; Tsai, Chia-Kuang; Tu, Yu-Kang; Yang, Szu-Nian; Tseng, Ping-Tao; Hsu, Tien-Wei; Liang, Chih-Sung; Su, Kuan-Pin.

Afiliação

Cheng SL; Department of Nursing, Mackay Medical College, Taipei, Taiwan.
Tsai SJ; Department of Psychiatry, Taipei Veterans General Hospital, Taipei, Taiwan.
Bai YM; Division of Psychiatry, School of Medicine, National Yang-Ming University, Taipei, Taiwan.
Ko CH; Department of Psychiatry, Taipei Veterans General Hospital, Taipei, Taiwan.
Hsu CW; Division of Psychiatry, School of Medicine, National Yang-Ming University, Taipei, Taiwan.
Yang FC; Department of Psychiatry, Kaohsiung Medical University Hospital, Kaohsiung, Taiwan.
Tsai CK; Department of Psychiatry, College of Medicine, Kaohsiung Medical University, Kaohsiung, Taiwan.
Tu YK; Department of Psychiatry, Kaohsiung Municipal Siaogang Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan.
Yang SN; Department of Psychiatry, Kaohsiung Chang Gung Memorial Hospital, Kaohsiung, Taiwan.
Tseng PT; Department of Neurology, Tri-Service General Hospital, National Defense Medical Center, Taipei, Taiwan.
Hsu TW; Department of Neurology, Tri-Service General Hospital, National Defense Medical Center, Taipei, Taiwan.
Liang CS; Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan.
Su KP; Department of Dentistry, National Taiwan University Hospital, Taipei, Taiwan.

J Med Internet Res ; 25: e51229, 2023 12 25.

Article em En | MEDLINE | ID: mdl-38145486

ABSTRACT

ABSTRACT

BACKGROUND:

ChatGPT may act as a research assistant to help organize the direction of thinking and summarize research findings. However, few studies have examined the quality, similarity (abstracts being similar to the original one), and accuracy of the abstracts generated by ChatGPT when researchers provide full-text basic research papers.

OBJECTIVE:

We aimed to assess the applicability of an artificial intelligence (AI) model in generating abstracts for basic preclinical research.

METHODS:

We selected 30 basic research papers from Nature, Genome Biology, and Biological Psychiatry. Excluding abstracts, we inputted the full text into ChatPDF, an application of a language model based on ChatGPT, and we prompted it to generate abstracts with the same style as used in the original papers. A total of 8 experts were invited to evaluate the quality of these abstracts (based on a Likert scale of 0-10) and identify which abstracts were generated by ChatPDF, using a blind approach. These abstracts were also evaluated for their similarity to the original abstracts and the accuracy of the AI content.

RESULTS:

The quality of ChatGPT-generated abstracts was lower than that of the actual abstracts (10-point Likert scale mean 4.72, SD 2.09 vs mean 8.09, SD 1.03; P<.001). The difference in quality was significant in the unstructured format (mean difference -4.33; 95% CI -4.79 to -3.86; P<.001) but minimal in the 4-subheading structured format (mean difference -2.33; 95% CI -2.79 to -1.86). Among the 30 ChatGPT-generated abstracts, 3 showed wrong conclusions, and 10 were identified as AI content. The mean percentage of similarity between the original and the generated abstracts was not high (2.10%-4.40%). The blinded reviewers achieved a 93% (224/240) accuracy rate in guessing which abstracts were written using ChatGPT.

CONCLUSIONS:

Using ChatGPT to generate a scientific abstract may not lead to issues of similarity when using real full texts written by humans. However, the quality of the ChatGPT-generated abstracts was suboptimal, and their accuracy was not 100%.

Assuntos

Inteligência Artificial; Pesquisa; Humanos; Estudos Transversais; Pesquisadores; Idioma

Palavras-chave

AI-generated scientific content; ChatGPT; LLM; NLP; abstract; abstracts; academic research; artificial intelligence; extract; extraction; generation; generative; language model; language models; natural language processing; plagiarism; publication; publications; scientific research; text; textual

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Pesquisa / Inteligência Artificial Limite: Humans Idioma: En Revista: J Med Internet Res Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2023 Tipo de documento: Article País de afiliação: Taiwan

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google