Improving sentiment classification using a RoBERTa-based hybrid model.
Semary, Noura A; Ahmed, Wesam; Amin, Khalid; Plawiak, Pawel; Hammad, Mohamed.
Affiliation
  • Semary NA; Department of Information Technology, Faculty of Computers and Information, Menoufia University, Shibin El Kom, Egypt.
  • Ahmed W; Department of Information Technology, Faculty of Computers and Information, Menoufia University, Shibin El Kom, Egypt.
  • Amin K; Department of Information Technology, Faculty of Computers and Artificial Intelligence, South Valley University, Hurghada, Egypt.
  • Plawiak P; Department of Information Technology, Faculty of Computers and Information, Menoufia University, Shibin El Kom, Egypt.
  • Hammad M; Department of Computer Science, Faculty of Computer Science and Telecommunications, Cracow University of Technology, Krakow, Poland.
Front Hum Neurosci ; 17: 1292010, 2023.
Article in English | MEDLINE | ID: mdl-38130432
ABSTRACT

Introduction:

Many attempts have been made to improve the performance of text-based sentiment analysis, most prominently through better classifiers and word embedding models. This work aims to develop a hybrid deep learning approach that combines the advantages of transformer models and sequence models while eliminating the shortcomings of sequence models.

Methods:

In this paper, we present a hybrid model that combines a transformer model with deep learning models to enhance the sentiment classification process. The robustly optimized BERT approach (RoBERTa) was selected to produce the representative vectors of the input sentences, and a Long Short-Term Memory (LSTM) model in conjunction with a Convolutional Neural Network (CNN) model was used to improve the proposed model's ability to comprehend the semantics and context of each input sentence. We tested the proposed model on two datasets covering different topics: the first is a Twitter dataset of reviews of US airlines, and the second is the IMDb movie reviews dataset. To overcome the class imbalance in the Twitter dataset, we propose using word embeddings in conjunction with the synthetic minority over-sampling technique (SMOTE).
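The abstract does not describe how SMOTE was configured, so the following is only a minimal sketch of the general idea, not the authors' implementation: each synthetic vector is placed at a random point on the segment between a minority-class embedding and one of its nearest minority-class neighbours. The function name `smote`, its parameters, and the toy 2-D vectors are all hypothetical and chosen for illustration; real sentence embeddings would have many more dimensions.

```python
import math
import random


def smote(minority, n_new, k=3, seed=0):
    """Return n_new synthetic vectors interpolated between minority-class neighbours.

    Illustrative sketch only: names and defaults are assumptions, not taken
    from the paper.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)  # cannot have more neighbours than other samples
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority samples by Euclidean distance to the base sample.
        others = [v for j, v in enumerate(minority) if j != i]
        others.sort(key=lambda v: math.dist(base, v))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # position along the segment from base to neighbour
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


# Toy 2-D "embeddings" for an under-represented class (made-up values).
minority_vectors = [[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
new_vectors = smote(minority_vectors, n_new=6, k=2, seed=42)
balanced_minority = minority_vectors + new_vectors
print(len(balanced_minority))  # 4 original + 6 synthetic = 10
```

Because every synthetic point lies between two existing minority points, oversampling in embedding space adds plausible variation rather than exact duplicates. In practice a maintained implementation, such as the `SMOTE` class in the imbalanced-learn package, would likely be preferable to hand-written code.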

Results:

With an accuracy of 96.28% on the IMDb reviews dataset and 94.2% on the Twitter reviews dataset, the proposed hybrid model outperforms the standard methods.

Discussion:

These results indicate that the proposed hybrid RoBERTa-(CNN+LSTM) method is an effective model for sentiment classification.

Full text: 1 Database: MEDLINE Language: En Journal: Front Hum Neurosci Year: 2023 Document type: Article Country of affiliation: Egypt