Fuzzy ensemble of fined tuned BERT models for domain-specific sentiment analysis of software engineering dataset.

Anwar, Zeeshan; Afzal, Hammad; Altaf, Naima; Kadry, Seifedine; Kim, Jungeun

Anwar, Zeeshan; Afzal, Hammad; Altaf, Naima; Kadry, Seifedine; Kim, Jungeun.

Afiliación

Anwar Z; Department of Computer Software Engineering, National University of Sciences and Technology, Islamabad, Pakistan.
Afzal H; Department of Computer Software Engineering, National University of Sciences and Technology, Islamabad, Pakistan.
Altaf N; Department of Computer Software Engineering, National University of Sciences and Technology, Islamabad, Pakistan.
Kadry S; Department of Applied Data Science, Noroff University College, Kristiansand, Norway.
Kim J; Artificial Intelligence Research Center (AIRC), Ajman University, Ajman, United Arab Emirates.

PLoS One ; 19(5): e0300279, 2024.

Article en En | MEDLINE | ID: mdl-38805433

ABSTRACT

ABSTRACT

Software engineers post their opinions about various topics on social media that can be collectively mined using Sentiment Analysis. Analyzing this opinion is useful because it can provide insight into developers' feedback about various tools and topics. General-purpose sentiment analysis tools do not work well in the software domain because most of these tools are trained on movies and review datasets. Therefore, efforts are underway to develop domain-specific sentiment analysis tools for the Software Engineering (SE) domain. However, existing domain-specific tools for SE struggle to compute negative and neutral sentiments and can not be used on all SE datasets. This work uses a hybrid technique based on deep learning and a fine-tuned BERT model, i.e., Bert-Base, Bert-Large, Bert-LSTM, Bert-GRU, and Bert-CNN presented that is adapted as a domain-specific sentiment analysis tool for Community Question Answering datasets (named as Fuzzy Ensemble). Five different variants of fine-tuned BERT on the SE dataset are developed, and an ensemble of these fine-tuned models is taken using fuzzy logic. The trained model is evaluated on four publicly available benchmark datasets, i.e., Stack Overflow, JavaLib, Jira, and Code Review, using various evaluation metrics. The fuzzy Ensemble model is also compared with the state-of-the-art sentiment analysis tools for the software engineering domain, i.e., SentiStrength-SE, Senti4SD, SentiCR, and Generative Pre-Training Transformer (GPT). GPT mode is fine-tuned by the authors for domain-specific sentiment analysis. The Fuzzy Ensemble model covers the limitation of existing tools and improve accuracy to predict neutral sentiments even on diverse dataset. The fuzzy Ensemble model performs superior to state-of-the-art tools by achieving a maximum F1-score of 0.883.

Asunto(s)

Lógica Difusa; Programas Informáticos; Humanos; Medios de Comunicación Sociales; Aprendizaje Profundo

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Programas Informáticos / Lógica Difusa Límite: Humans Idioma: En Revista: PLoS One Asunto de la revista: CIENCIA / MEDICINA Año: 2024 Tipo del documento: Article País de afiliación: Pakistán Pais de publicación: Estados Unidos

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google