Responses of Five Different Artificial Intelligence Chatbots to the Top Searched Queries About Erectile Dysfunction: A Comparative Analysis.

Sahin, Mehmet Fatih; Ates, Hüseyin; Keles, Anil; Özcan, Ridvan; Dogan, Çagri; Akgül, Murat; Yazici, Cenk Murat

Sahin, Mehmet Fatih; Ates, Hüseyin; Keles, Anil; Özcan, Ridvan; Dogan, Çagri; Akgül, Murat; Yazici, Cenk Murat.

Afiliação

Sahin MF; Faculty of Medicine Department of Urology, Tekirdag Namik Kemal University, Süleymanpasa, Tekirdag, 59020, Turkey. mfatihsahin@gmail.com.
Ates H; Faculty of Medicine Department of Urology, Tekirdag Namik Kemal University, Süleymanpasa, Tekirdag, 59020, Turkey.
Keles A; Faculty of Medicine Department of Urology, Tekirdag Namik Kemal University, Süleymanpasa, Tekirdag, 59020, Turkey.
Özcan R; Department of Urology, Bursa State Hospital, Nilüfer, Bursa, 16110, Turkey.
Dogan Ç; Faculty of Medicine Department of Urology, Tekirdag Namik Kemal University, Süleymanpasa, Tekirdag, 59020, Turkey.
Akgül M; Faculty of Medicine Department of Urology, Tekirdag Namik Kemal University, Süleymanpasa, Tekirdag, 59020, Turkey.
Yazici CM; Faculty of Medicine Department of Urology, Tekirdag Namik Kemal University, Süleymanpasa, Tekirdag, 59020, Turkey.

J Med Syst ; 48(1): 38, 2024 Apr 03.

Article em En | MEDLINE | ID: mdl-38568432

ABSTRACT

ABSTRACT

The aim of the study is to evaluate and compare the quality and readability of responses generated by five different artificial intelligence (AI) chatbots-ChatGPT, Bard, Bing, Ernie, and Copilot-to the top searched queries of erectile dysfunction (ED). Google Trends was used to identify ED-related relevant phrases. Each AI chatbot received a specific sequence of 25 frequently searched terms as input. Responses were evaluated using DISCERN, Ensuring Quality Information for Patients (EQIP), and Flesch-Kincaid Grade Level (FKGL) and Reading Ease (FKRE) metrics. The top three most frequently searched phrases were "erectile dysfunction cause", "how to erectile dysfunction," and "erectile dysfunction treatment." Zimbabwe, Zambia, and Ghana exhibited the highest level of interest in ED. None of the AI chatbots achieved the necessary degree of readability. However, Bard exhibited significantly higher FKRE and FKGL ratings (p = 0.001), and Copilot achieved better EQIP and DISCERN ratings than the other chatbots (p = 0.001). Bard exhibited the simplest linguistic framework and posed the least challenge in terms of readability and comprehension, and Copilot's text quality on ED was superior to the other chatbots. As new chatbots are introduced, their understandability and text quality increase, providing better guidance to patients.

Assuntos

Inteligência Artificial; Disfunção Erétil; Masculino; Humanos; Software; Benchmarking; Linguística

Palavras-chave

Artificial intelligence; Bard; Bing; ChatGPT; Chatbot; Copilot; Erectile dysfunction; Ernie bot

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Inteligência Artificial / Disfunção Erétil Idioma: En Ano de publicação: 2024 Tipo de documento: Article País de afiliação: Turquia

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google