Your browser doesn't support javascript.
loading
TClustVID: A novel machine learning classification model to investigate topics and sentiment in COVID-19 tweets.
Satu, Md Shahriare; Khan, Md Imran; Mahmud, Mufti; Uddin, Shahadat; Summers, Matthew A; Quinn, Julian M W; Moni, Mohammad Ali.
Afiliación
  • Satu MS; Department of Management Information Systems, Noakhali Science & Technology University, Noakhali, 3814, Bangladesh.
  • Khan MI; Department of Computer Scienc & Engineering, Gono Bishwabidyalay, Savar, Dhaka, 1344, Bangladesh.
  • Mahmud M; Department of Computer Science, and Medical Technology Innovation Facility, Nottingham Trent University, Clifton Campus, Clifton, Nottingham - NG11 8NS, UK.
  • Uddin S; Complex Systems Research Group, Faculty of Engineering, The University of Sydney, Darlington, NSW 2008, Australia.
  • Summers MA; The Garvan Institute of Medical Research, Healthy Ageing Theme, Darlinghurst, NSW 2010, Australia.
  • Quinn JMW; The Garvan Institute of Medical Research, Healthy Ageing Theme, Darlinghurst, NSW 2010, Australia.
  • Moni MA; The Garvan Institute of Medical Research, Healthy Ageing Theme, Darlinghurst, NSW 2010, Australia.
Knowl Based Syst ; 226: 107126, 2021 Aug 17.
Article en En | MEDLINE | ID: mdl-33972817
ABSTRACT
COVID-19, caused by SARS-CoV2 infection, varies greatly in its severity but presents with serious respiratory symptoms with vascular and other complications, particularly in older adults. The disease can be spread by both symptomatic and asymptomatic infected individuals. Uncertainty remains over key aspects of the virus infectiousness (particularly the newly emerging variants) and the disease has had severe economic impacts globally. For these reasons, COVID-19 is the subject of intense and widespread discussion on social media platforms including Facebook and Twitter. These public forums substantially influence public opinions and in some cases can exacerbate the widespread panic and misinformation spread during the crisis. Thus, this work aimed to design an intelligent clustering-based classification and topic extracting model named TClustVID that analyzes COVID-19-related public tweets to extract significant sentiments with high accuracy. We gathered COVID-19 Twitter datasets from the IEEE Dataport repository and employed a range of data preprocessing methods to clean the raw data, then applied tokenization and produced a word-to-index dictionary. Thereafter, different classifications were employed on these datasets which enabled the exploration of the performance of traditional classification and TClustVID. Our analysis found that TClustVID showed higher performance compared to traditional methodologies that are determined by clustering criteria. Finally, we extracted significant topics from the clusters, split them into positive, neutral and negative sentiments, and identified the most frequent topics using the proposed model. This approach is able to rapidly identify commonly prevailing aspects of public opinions and attitudes related to COVID-19 and infection prevention strategies spreading among different populations.
Palabras clave

Texto completo: 1 Banco de datos: MEDLINE Tipo de estudio: Prognostic_studies Idioma: En Revista: Knowl Based Syst Año: 2021 Tipo del documento: Article País de afiliación: Bangladesh

Texto completo: 1 Banco de datos: MEDLINE Tipo de estudio: Prognostic_studies Idioma: En Revista: Knowl Based Syst Año: 2021 Tipo del documento: Article País de afiliación: Bangladesh