Pesquisa | Portal de Pesquisa da BVS

A Lightweight Sentiment Analysis Framework for a Micro-Intelligent Terminal.

Wei, Lin; Wang, Zhenyuan; Xu, Jing; Shi, Yucheng; Wang, Qingxian; Shi, Lei; Tao, Yongcai; Gao, Yufei.

Sensors (Basel) ; 23(2)2023 Jan 09.

Artigo em Inglês | MEDLINE | ID: mdl-36679538

RESUMO

Sentiment analysis aims to mine polarity features in the text, which can empower intelligent terminals to recognize opinions and further enhance interaction capabilities with customers. Considerable progress has been made using recurrent neural networks or pre-trained models to learn semantic representations. However, recently published models with complex structures require increasing computational resources to reach state-of-the-art (SOTA) performance. It is still a significant challenge to deploy these models to run on micro-intelligent terminals with limited computing power and memory. This paper proposes a lightweight and efficient framework based on hybrid multi-grained embedding on sentiment analysis (MC-GGRU). The gated recurrent unit model is designed to incorporate a global attention structure that allows contextual representations to be learned from unstructured text using word tokens. In addition, a multi-grained feature layer can further enrich sentence representation features with implicit semantics from characters. Through hybrid multi-grained representation, MC-GGRU achieves high inference performance with a shallow structure. The experimental results of five public datasets show that our method achieves SOTA for sentiment classification with a trade-off between accuracy and speed.

Assuntos

Semântica , Análise de Sentimentos , Idioma , Redes Neurais de Computação , Aprendizado de Máquina

Topic2features: a novel framework to classify noisy and sparse textual data using LDA topic distributions.

Wahid, Junaid Abdul; Shi, Lei; Gao, Yufei; Yang, Bei; Tao, Yongcai; Wei, Lin; Hussain, Shabir.

PeerJ Comput Sci ; 7: e677, 2021.

Artigo em Inglês | MEDLINE | ID: mdl-34458576

RESUMO

In supervised machine learning, specifically in classification tasks, selecting and analyzing the feature vector to achieve better results is one of the most important tasks. Traditional methods such as comparing the features' cosine similarity and exploring the datasets manually to check which feature vector is suitable is relatively time consuming. Many classification tasks failed to achieve better classification results because of poor feature vector selection and sparseness of data. In this paper, we proposed a novel framework, topic2features (T2F), to deal with short and sparse data using the topic distributions of hidden topics gathered from dataset and converting into feature vectors to build supervised classifier. For this we leveraged the unsupervised topic modelling LDA (latent dirichlet allocation) approach to retrieve the topic distributions employed in supervised learning algorithms. We made use of labelled data and topic distributions of hidden topics that were generated from that data. We explored how the representation based on topics affect the classification performance by applying supervised classification algorithms. Additionally, we did careful evaluation on two types of datasets and compared them with baseline approaches without topic distributions and other comparable methods. The results show that our framework performs significantly better in terms of classification performance compared to the baseline(without T2F) approaches and also yields improvement in terms of F1 score compared to other compared approaches.

RESUMO

Assuntos

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA