RESUMO
INTRODUCTION: Engagement with mobile health (mHealth) technologies among patients with hypertension is linked to reduced blood pressure and improved patient understanding of the condition. AIM: This study aimed to evaluate the psychometric properties (validity and reliability) of the TWente Engagement with Ehealth Technologies Scale (TWEETS) in an Italian cohort with hypertension. This study is the first attempt to evaluate the psychometric characteristics of the TWEETS in this population. METHODS: The study was conducted in three phases. The first phase encompassed the translation and cultural adaptation of the TWEETS to the Italian setting. The second phase involved an expert panel evaluating the instrument's face and content validities. The third phase was a cross-sectional study aiming to test construct validity and reliability. Adults diagnosed with hypertension were eligible for participation. Additional inclusion criteria included stable antihypertensive treatment for at least 2 weeks before enrolment and the provision of written informed consent. Patients were taught how to use two mHealth devices using the teach-back method. RESULTS: A total of 131 patients were enrolled. Exploratory and confirmatory factor analyses revealed that the TWEETS had a one-factor structure and a good level of fit. Cronbach's alpha coefficients suggested good internal consistency. CONCLUSIONS: The findings indicate that the TWEETS is a valuable tool for assessing the engagement of patients with hypertension with mHealth devices. Further assessment is needed in various cohorts to confirm the psychometric equivalence of the construct across different groups with hypertension.
RESUMO
The Internet's development has prompted social media to become an essential channel for disseminating disaster-related information. Increasing the accuracy of emotional polarity recognition in tweets is conducive to the government or rescue organizations understanding the public's demands and responding appropriately. Existing sentiment analysis models have some limitations of applicability. Therefore, this research proposes an IDBO-CNN-BiLSTM model combining the swarm intelligence optimization algorithm and deep learning methods. First, the Dung Beetle Optimization (DBO) algorithm is improved by adopting the Latin hypercube sampling, integrating the Osprey Optimization Algorithm (OOA), and introducing an adaptive Gaussian-Cauchy mixture mutation disturbance. The improved DBO (IDBO) algorithm is then utilized to optimize the Convolutional Neural Network-Bidirectional Long Short-Term Memory (CNN-BiLSTM) model's hyperparameters. Finally, the IDBO-CNN-BiLSTM model is constructed to classify the emotional tendencies of tweets associated with the Hurricane Harvey event. The empirical analysis indicates that the proposed model achieves an accuracy of 0.8033, outperforming other single and hybrid models. In contrast with the GWO, WOA, and DBO algorithms, the accuracy is enhanced by 2.89%, 2.82%, and 2.72%, respectively. This study proves that the IDBO-CNN-BiLSTM model can be applied to assist emergency decision-making in natural disasters.
RESUMO
Sarcasm in Sentiment Analysis (SA) is important due to the sense of sarcasm in sentences that differs from their literal meaning. Analysis of Arabic sarcasm still has many challenges like implicit indirect idioms to express the opinion, and lack of Arabic sarcasm corpus. In this paper, we proposed a new detecting model for sarcasm in Arabic tweets called the ArSa-Tweet model. It is based on implementing and developing Deep Learning (DL) models to classify tweets as sarcastic or not. The development of our proposed model consists of adding main improvements by applying robust preprocessing steps before feeding the data to the adapted DL models. The adapted DL models are LSTM, Multi-headed CNN-LSTM-GRU, BERT, AraBert-V01, and AraBert-V02. In addition, we proposed ArSa-data as a golden corpus that consists of Arabic tweets. A comparative process shows that our proposed ArSa-Tweet method has the most impact accuracy rate based on deploying the AraBert-V02 model, which obtains the best performance results in all accuracy metrics when compared with other methods.
RESUMO
OBJECTIVE: The authors sought to quantify the role of social media-related academic activity through use of the Altmetric score (a composite score based on social media attention from a variety of sources) and investigate its potential impact on the number of citations received at 3 years postpublication (articles published between January 2019 and December 2019). METHODS: Articles published in the top 12 neurosurgical journals according to Google Scholar (based on 5-year Web of Science impact factors, 2017-2021) were identified. Data collected included days since publication, Altmetric scores, and total number of tweets (posts), and 3-year citations were obtained from Google Scholar. A multiple linear regression model was created that featured a blocking method to stratify confounding variables from most to least contributing. Furthermore, the data were dichotomized by publications with ≥ 10 citations (top 25th percentile) and those with < 10 to analyze the impact of the score on total number of citations received at 3 years, using an independent-samples Mann-Whitney U-test. RESULTS: Among 6721 included articles, the mean Altmetric score was 3.76 ± 15.69 and the mean number of citations received was 9.61 ± 22.16. When accounting for relevant control variables, the Altmetric score was a significant predictor of the total number of citations accumulated at 3 years (variability of 10.17%). On statistical testing, the Altmetric score was significantly higher in publications with ≥ 10 citations (p < 0.001). CONCLUSIONS: The authors report a strong, statistically significant correlation between the Altmetric score and the number of citations received. To their knowledge, this is the first study to demonstrate the impact of social media academic activity on neurosurgery article citation dissemination, potentially influencing resident medical education.
RESUMO
BACKGROUND: Data from the social media platform X (formerly Twitter) can provide insights into the types of language that are used when discussing drug use. In past research using latent Dirichlet allocation (LDA), we found that tweets containing "street names" of prescription drugs were difficult to classify due to the similarity to other colloquialisms and lack of clarity over how the terms were used. Conversely, "brand name" references were more amenable to machine-driven categorization. OBJECTIVE: This study sought to use next-generation techniques (beyond LDA) from natural language processing to reprocess X data and automatically cluster groups of tweets into topics to differentiate between street- and brand-name data sets. We also aimed to analyze the differences in emotional valence between the 2 data sets to study the relationship between engagement on social media and sentiment. METHODS: We used the Twitter application programming interface to collect tweets that contained the street and brand name of a prescription drug within the tweet. Using BERTopic in combination with Uniform Manifold Approximation and Projection and k-means, we generated topics for the street-name corpus (n=170,618) and brand-name corpus (n=245,145). Valence Aware Dictionary and Sentiment Reasoner (VADER) scores were used to classify whether tweets within the topics had positive, negative, or neutral sentiments. Two different logistic regression classifiers were used to predict the sentiment label within each corpus. The first model used a tweet's engagement metrics and topic ID to predict the label, while the second model used those features in addition to the top 5000 tweets with the largest term-frequency-inverse document frequency score. RESULTS: Using BERTopic, we identified 40 topics for the street-name data set and 5 topics for the brand-name data set, which we generalized into 8 and 5 topics of discussion, respectively. Four of the general themes of discussion in the brand-name corpus referenced drug use, while 2 themes of discussion in the street-name corpus referenced drug use. From the VADER scores, we found that both corpora were inclined toward positive sentiment. Adding the vectorized tweet text increased the accuracy of our models by around 40% compared with the models that did not incorporate the tweet text in both corpora. CONCLUSIONS: BERTopic was able to classify tweets well. As with LDA, the discussion using brand names was more similar between tweets than the discussion using street names. VADER scores could only be logically applied to the brand-name corpus because of the high prevalence of non-drug-related topics in the street-name data. Brand-name tweets either discussed drugs positively or negatively, with few posts having a neutral emotionality. From our machine learning models, engagement alone was not enough to predict the sentiment label; the added context from the tweets was needed to understand the emotionality of a tweet.
Assuntos
Redes Neurais de Computação , Medicamentos sob Prescrição , Mídias Sociais , Mídias Sociais/estatística & dados numéricos , Humanos , Processamento de Linguagem NaturalRESUMO
Event of the disastrous scenarios are actively discussed on microblogging platforms like Twitter which can lead to chaotic situations. In the era of machine learning and deep learning, these chaotic situations can be effectively controlled by developing efficient methods and models that can assist in classifying real and fake tweets. In this research article, an efficient method named BERT Embedding based CNN model with RMSProp Optimizer is proposed to effectively classify the tweets related disastrous scenario. Tweet classification is carried out via some of the popular the machine learning algorithms such as logistic regression and decision tree classifiers. Noting the low accuracy of machine learning models, Convolutional Neural Network (CNN) based deep learning model is selected as the primary classification method. CNNs performance is improved via optimization of the parameters with gradient based optimizers. To further elevate accuracy and to capture contextual semantics from the text data, BERT embeddings are included in the proposed model. The performance of proposed method - BERT Embedding based CNN model with RMSProp Optimizer achieved an F1 score of 0.80 and an Accuracy of 0.83. The methodology presented in this research article is comprised of the following key contributions:â¢Identification of suitable text classification model that can effectively capture complex patterns when dealing with large vocabularies or nuanced language structures in disaster management scenarios.â¢The method explores the gradient based optimization techniques such as Adam Optimizer, Stochastic Gradient Descent (SGD) Optimizer, AdaGrad, and RMSprop Optimizer to identify the most appropriate optimizer that meets the characteristics of the dataset and the CNN model architecture.â¢"BERT Embedding based CNN model with RMSProp Optimizer" - a method to classify the disaster tweets and capture semantic representations by leveraging BERT embeddings with appropriate feature selection is presented and models are validated with appropriate comparative analysis.
RESUMO
The prevalence of cyberbullying has reached an alarming rate, affecting approximately 54% of teenagers who experience various forms of cyberbullying, including offensive hate speech, threats, and racism. This research introduces a comprehensive dataset and system for cyberbullying detection in Urdu tweets, leveraging a spectrum of machine learning approaches including traditional models and advanced deep learning techniques. The objectives of this study are threefold. Firstly, a dataset consisting of 12,500 annotated tweets in Urdu is created, and it is made publicly available to the research community. Secondly, annotation guidelines for Urdu text with appropriate labels for cyberbullying detection are developed. Finally, a series of experiments is conducted to assess the performance of machine learning and deep learning techniques in detecting cyberbullying. The results indicate that fastText deep learning models outperform other models in cyberbullying detection. This study demonstrates its efficacy in effectively detecting and classifying cyberbullying incidents in Urdu tweets, contributing to the broader effort of creating a safer digital environment.
RESUMO
BACKGROUND: During the COVID-19 pandemic, Twitter (recently rebranded as "X") was the most widely used social media platform with over 2 million cancer-related tweets. The increasing use of social media among patients and family members, providers, and organizations has allowed for novel methods of studying cancer communication. OBJECTIVE: This study aimed to examine pediatric cancer-related tweets to capture the experiences of patients and survivors of cancer, their caregivers, medical providers, and other stakeholders. We assessed the public sentiment and content of tweets related to pediatric cancer over a time period representative of the COVID-19 pandemic. METHODS: All English-language tweets related to pediatric cancer posted from December 11, 2019, to May 7, 2022, globally, were obtained using the Twitter application programming interface. Sentiment analyses were computed based on Bing, AFINN, and NRC lexicons. We conducted a supplemental nonlexicon-based sentiment analysis with ChatGPT (version 3.0) to validate our findings with a random subset of 150 tweets. We conducted a qualitative content analysis to manually code the content of a random subset of 800 tweets. RESULTS: A total of 161,135 unique tweets related to pediatric cancer were identified. Sentiment analyses showed that there were more positive words than negative words. Via the Bing lexicon, the most common positive words were support, love, amazing, heaven, and happy, and the most common negative words were grief, risk, hard, abuse, and miss. Via the NRC lexicon, most tweets were categorized under sentiment types of positive, trust, and joy. Overall positive sentiment was consistent across lexicons and confirmed with supplemental ChatGPT (version 3.0) analysis. Percent agreement between raters for qualitative coding was 91%, and the top 10 codes were awareness, personal experiences, research, caregiver experiences, patient experiences, policy and the law, treatment, end of life, pharmaceuticals and drugs, and survivorship. Qualitative content analysis showed that Twitter users commonly used the social media platform to promote public awareness of pediatric cancer and to share personal experiences with pediatric cancer from the perspective of patients or survivors and their caregivers. Twitter was frequently used for health knowledge dissemination of research findings and federal policies that support treatment and affordable medical care. CONCLUSIONS: Twitter may serve as an effective means for researchers to examine pediatric cancer communication and public sentiment around the globe. Despite the public mental health crisis during the COVID-19 pandemic, overall sentiments of pediatric cancer-related tweets were positive. Content of pediatric cancer tweets focused on health and treatment information, social support, and raising awareness of pediatric cancer.
RESUMO
BACKGROUND: Nonprofit organizations are increasingly using social media to improve their communication strategies with the broader population. However, within the domain of human service nonprofits, there is hesitancy to fully use social media tools, and there is limited scope among organizational personnel in applying their potential beyond self-promotion and service advertisement. There is a pressing need for greater conceptual clarity to support education and training on the varied reasons for using social media to increase organizational outcomes. OBJECTIVE: This study leverages the potential of Twitter (subsequently rebranded as X [X Corp]) to examine the online communication content within a sample (n=133) of nonprofit sexual assault (SA) centers in Canada. To achieve this, we developed a typology using a qualitative and supervised machine learning model for the automatic classification of tweets posted by these centers. METHODS: Using a mixed methods approach that combines machine learning and qualitative analysis, we manually coded 10,809 tweets from 133 SA centers in Canada, spanning the period from March 2009 to March 2023. These manually labeled tweets were used as the training data set for the supervised machine learning process, which allowed us to classify 286,551 organizational tweets. The classification model based on supervised machine learning yielded satisfactory results, prompting the use of unsupervised machine learning to classify the topics within each thematic category and identify latent topics. The qualitative thematic analysis, in combination with topic modeling, provided a contextual understanding of each theme. Sentiment analysis was conducted to reveal the emotions conveyed in the tweets. We conducted validation of the model with 2 independent data sets. RESULTS: Manual annotation of 10,809 tweets identified seven thematic categories: (1) community engagement, (2) organization administration, (3) public awareness, (4) political advocacy, (5) support for others, (6) partnerships, and (7) appreciation. Organization administration was the most frequent segment, and political advocacy and partnerships were the smallest segments. The supervised machine learning model achieved an accuracy of 63.4% in classifying tweets. The sentiment analysis revealed a prevalence of neutral sentiment across all categories. The emotion analysis indicated that fear was predominant, whereas joy was associated with the partnership and appreciation tweets. Topic modeling identified distinct themes within each category, providing valuable insights into the prevalent discussions surrounding SA and related issues. CONCLUSIONS: This research contributes an original theoretical model that sheds light on how human service nonprofits use social media to achieve their online organizational communication objectives across 7 thematic categories. The study advances our comprehension of social media use by nonprofits, presenting a comprehensive typology that captures the diverse communication objectives and contents of these organizations, which provide content to expand training and education for nonprofit leaders to connect and engage with the public, policy experts, other organizations, and potential service users.
Assuntos
Organizações sem Fins Lucrativos , Mídias Sociais , Mídias Sociais/estatística & dados numéricos , Humanos , Canadá , Aprendizado de MáquinaRESUMO
The automatic speech identification in Arabic tweets has generated substantial attention among academics in the fields of text mining and natural language processing (NLP). The quantity of studies done on this subject has experienced significant growth. This study aims to provide an overview of this field by conducting a systematic review of literature that focuses on automatic hate speech identification, particularly in the Arabic language. The goal is to examine the research trends in Arabic hate speech identification and offer guidance to researchers by highlighting the most significant studies published between 2018 and 2023. This systematic study addresses five specific research questions concerning the types of the Arabic language used, hate speech categories, classification techniques, feature engineering techniques, performance metrics, validation methods, existing challenges faced by researchers, and potential future research directions. Through a comprehensive search across nine academic databases, 24 studies that met the predefined inclusion criteria and quality assessment were identified. The review findings revealed the existence of many Arabic linguistic varieties used in hate speech on Twitter, with modern standard Arabic (MSA) being the most prominent. In identification techniques, machine learning categories are the most used technique for Arabic hate speech identification. The result also shows different feature engineering techniques used and indicates that N-gram and CBOW are the most used techniques. F1-score, precision, recall, and accuracy were also identified as the most used performance metric. The review also shows that the most used validation method is the train/test split method. Therefore, the findings of this study can serve as valuable guidance for researchers in enhancing the efficacy of their models in future investigations. Besides, algorithm development, policy rule regulation, community management, and legal and ethical consideration are other real-world applications that can be reaped from this research.
RESUMO
Purpose: The main aim of our study was to explore the utility of artificial intelligence (AI) in diagnosing autism spectrum disorder (ASD). The study primarily focused on using machine learning (ML) and deep learning (DL) models to detect ASD potential cases by analyzing text inputs, especially from social media platforms like Twitter. This is to overcome the ongoing challenges in ASD diagnosis, such as the requirement for specialized professionals and extensive resources. Timely identification, particularly in children, is essential to provide immediate intervention and support, thereby improving the quality of life for affected individuals. Methods: We employed natural language processing (NLP) techniques along with ML models like decision trees, extreme gradient boosting (XGB), k-nearest neighbors algorithm (KNN), and DL models such as recurrent neural networks (RNN), long short-term memory (LSTM), bidirectional long short-term memory (Bi-LSTM), bidirectional encoder representations from transformers (BERT and BERTweet). We extracted a dataset of 404,627 tweets from Twitter users using the platform's API and classified them based on whether they were written by individuals claiming to have ASD (ASD users) or by those without ASD (non-ASD users). From this dataset, we used a subset of 90,000 tweets (45,000 from each classification group) for the training and testing of these models. Results: The application of our AI models yielded promising results, with the predictive model reaching an accuracy of almost 88% when classifying texts that potentially originated from individuals with ASD. Conclusion: Our research demonstrated the potential of using AI, particularly DL models, in enhancing the accuracy of ASD detection and diagnosis. This innovative approach signifies the critical role AI can play in advancing early diagnostic techniques, enabling better patient outcomes and underlining the importance of early identification of ASD, especially in children.
RESUMO
BACKGROUND: Indoor tanning is a preventable risk factor for skin cancer. Statewide shutdowns during the COVID-19 pandemic resulted in temporary closures of tanning businesses. Little is known about how tanners reacted to losing access to tanning businesses. OBJECTIVE: This study aimed to analyze Twitter (subsequently rebranded as X) chatter about indoor tanning during the statewide pandemic shutdowns. METHODS: We collected tweets from March 15 to April 30, 2020, and performed a directed content analysis of a random sample of 20% (1165/5811) of tweets from each week. The 2 coders independently rated themes (κ=0.67-1.0; 94%-100% agreement). RESULTS: About half (589/1165, 50.6%) of tweets were by people unlikely to indoor tan, and most of these mocked tanners or the act of tanning (562/589, 94.9%). A total of 34% (402/1165) of tweets were posted by users likely to indoor tan, and most of these (260/402, 64.7%) mentioned missing tanning beds, often citing appearance- or mood-related reasons or withdrawal. Some tweets by tanners expressed a desire to purchase or use home tanning beds (90/402, 22%), while only 3.9% (16/402) mentioned tanning alternatives (eg, self-tanner). Very few tweets (29/1165, 2.5%) were public health messages about the dangers of indoor tanning. CONCLUSIONS: Findings revealed that during statewide shutdowns, half of the tweets about indoor tanning were mocking tanning bed users and the tanned look, while about one-third were indoor tanners reacting to their inability to access tanning beds. Future work is needed to understand emerging trends in tanning post pandemic.
RESUMO
BACKGROUND: Information on social media may affect peoples' contraceptive decision making. We performed an exploratory analysis of contraceptive content on Twitter (recently renamed X), a popular social media platform. METHODS: We selected a random subset of 1% of publicly available, English-language tweets related to reversible, prescription contraceptive methods posted between January 2014 and December 2019. We oversampled tweets for the contraceptive patch to ensure at least 200 tweets per method. To create the codebook, we identified common themes specific to tweet content topics, tweet sources, and tweets soliciting information or providing advice. All posts were coded by two team members, and differences were adjudicated by a third reviewer. Descriptive analyses were reported with accompanying qualitative findings. RESULTS: During the study period, 457,369 tweets about reversible contraceptive methods were published, with a random sample of 4,434 tweets used for final analysis. Tweets most frequently discussed contraceptive method decision-making (26.7%) and side effects (20.5%), particularly for long-acting reversible contraceptive methods and the depot medroxyprogesterone acetate shot. Tweets about logistics of use or adherence were common for short-acting reversible contraceptives. Tweets were frequently posted by contraceptive consumers (50.6%). A small proportion of tweets explicitly requested information (6.2%) or provided advice (4.2%). CONCLUSIONS: Clinicians should be aware that individuals are exposed to information through Twitter that may affect contraceptive perceptions and decision making, particularly regarding long-acting reversible contraceptives. Social media is a valuable source for studying contraceptive beliefs missing in traditional health research and may be used by professionals to disseminate accurate contraceptive information.
RESUMO
The COVID-19 pandemic has sparked numerous discussions on social media platforms, with users sharing their views on topics such as mask-wearing and vaccination. To facilitate the evaluation of neural models for stance detection and premise classification, we organized the Social Media Mining for Health (SMM4H) 2022 Shared Task 2. This competition utilized manually annotated posts on three COVID-19-related topics: school closures, stay-at-home orders, and wearing masks. In this paper, we extend the previous work and present newly collected data on vaccination from Twitter to assess the performance of models on a different topic. To enhance the accuracy and effectiveness of our evaluation, we employed various strategies to aggregate tweet texts with claims, including models with feature-level (early) fusion and dual-view architectures from the SMM4H 2022 Task 2 leaderboard. Our primary objective was to create a valuable dataset and perform an extensive experimental evaluation to support future research in argument mining in the health domain.
Assuntos
COVID-19 , Mídias Sociais , Humanos , Pandemias , Mineração de Dados , Coleta de DadosRESUMO
BACKGROUND: There has been a growth surge in the use of social media among individuals today. The widespread adoption of these platforms, coupled with their engaging features, presents a unique opportunity for the dissemination of health advocacy information. Social media is known as a powerful tool used to share health policy and advocacy efforts and disseminate health information to digital community members and networks. Yet, there is still a gap in the full exploitation of this powerful instrument, among health care professionals, for health advocacy campaigns. OBJECTIVE: This paper aims to describe the process of mobilizing social media platforms such as Twitter (rebranded to X Corp in 2023) for health advocacy of the digital community. Additionally, it aims to share the lessons and insights gained during this digital health advocacy engagement process. METHODS: We performed a comprehensive review of Twitter analytical data to examine the impact of our social media posts. We then consolidated these analytic reports with our meeting logs to describe our systematic, iterative, and collaborative design process to implement social media efforts and generate key lessons learned. RESULTS: Our review of monthly Twitter analytical reports and regular team meeting logs revealed several themes for successful and less successful practices in relation to our social media-based health advocacy efforts. The successful practices noted by the team included using personable, picture-based tweets; using a series of posts on a particular topic rather than an isolated post; leveraging team members' and partners' collaborations in shared posts; incorporating hashtags in tweets; using a balanced mix of texts and graphics in posts; using inclusive (nondestigmatizing) languages in tweeted posts; and use of polls to share tweets. Among the many lessons learned, we also experienced limitations including a lack of comprehensive statistics on Twitter usage for health care-related purposes such as health advocacy and limits in collating the estimates of the actual impact made on the intended digital community members by our posts. CONCLUSIONS: Twitter has been successfully used in promoting health advocacy content, and the social media team aims to explore other social media platforms that have a wider reach than Twitter. We will continue making necessary adjustments in strategies, techniques, and styles to engage the audience as we expand onto new platforms like Instagram and TikTok for health advocacy promotions.
RESUMO
Social media platforms empower us in several ways, from information dissemination to consumption. While these platforms are useful in promoting citizen journalism, public awareness, etc., they have misuse potential. Malicious users use them to disseminate hate speech, offensive content, rumor, etc. to promote social and political agendas or to harm individuals, entities, and organizations. Oftentimes, general users unconsciously share information without verifying it or unintentionally post harmful messages. Some of such content often gets deleted either by the platform due to the violation of terms and policies or by users themselves for different reasons, e.g., regret. There is a wide range of studies in characterizing, understanding, and predicting deleted content. However, studies that aim to identify the fine-grained reasons (e.g., posts are offensive, hate speech, or no identifiable reason) behind deleted content are limited. In this study, we address an existing gap by identifying and categorizing deleted tweets, especially within the Arabic context. We label them based on fine-grained disinformation categories. We have curated a dataset of 40K tweets, annotated with both coarse and fine-grained labels. Following this, we designed models to predict the likelihood of tweets being deleted and to identify the potential reasons for their deletion. Our experiments, conducted using a variety of classic and transformer models, indicate that performance surpasses the majority baseline (e.g., 25% absolute improvement for fine-grained labels). We believe that such models can assist in moderating social media posts even before they are published.
RESUMO
Coronavirus (COVID-19) is a newly discovered viral disease from the SARS-CoV-2 family. This has caused a moral panic resulting in the spread of informative and uninformative information about COVID-19 and its effects. Twitter is a popular social media platform used extensively during the current outbreak. This paper aims to predict informative tweets related to COVID-19 on Twitter using a novel set of fuzzy rules involving deep learning techniques. This study focuses on identifying informative tweets during the pandemic to provide the public with trustworthy information and forecast how quickly diseases could spread. In this case, we have implemented RoBERTa and CT-BERT models using the fuzzy methodology to identify COVID-19 patient tweets. The proposed architecture combines deep learning transformer models RoBERTa and CT-BERT with the fuzzy technique to categorize posts as INFORMATIVE or UNINFORMATIVE. We performed a comparative analysis of our method with machine learning models and deep learning approaches. The results show that our proposed model can classify informative and uninformative tweets with an accuracy of 91.40% and an F1-score of 91.94% using the COVID-19 English tweet dataset. The proposed model is accurate and ready for real-world application.
Assuntos
COVID-19 , Aprendizado Profundo , Humanos , SARS-CoV-2 , Aprendizado de Máquina , PandemiasRESUMO
Background: The World Health Organization, the Centers for Disease Control and Prevention, and the Gerontological Society of America have made efforts to raise awareness on ageist language and propose appropriate terms to denote the older adult population. The COVID-19 pandemic and older adults' vulnerability to the disease have perpetuated hostile ageist discourse on social media. This is an opportune time to understand the prevalence and use of ageist language and discuss the ways forward. Objective: This study aimed to understand the prevalence and situated use of ageist terms on Twitter. Methods: We collected 60.32 million tweets between March and July 2020 containing terms related to COVID-19. We then conducted a mixed methods study comprising a content analysis and a descriptive quantitative analysis. Results: A total of 58,930 tweets contained the ageist terms "old people" or "elderly." The more appropriate term "older adult" was found in 11,328 tweets. Twitter users used ageist terms (eg, "old people" and "elderly") to criticize ageist messages (17/60, 28%), showing a lack of understanding of appropriate terms to describe older adults. Highly hostile ageist content against older adults came from tweets that contained the derogatory terms "old people" (22/30, 73%) or "elderly" (13/30, 43%). Conclusions: The public discourse observed on Twitter shows a continued lack of understanding of appropriate terms to use when referring to older adults. Effort is needed to eliminate the perpetuation of ageist messages that challenge healthy aging. Our study highlights the need to inform the public about appropriate language use and ageism.
RESUMO
BACKGROUND: Twitter has become a popular platform for individuals to broadcast their daily experiences and opinions on a wide range of topics and emotions. Tweets from patients with cancer could offer insights into their needs. However, limited research has been conducted using Twitter data to understand the needs of patients with cancer despite the substantial amount of health-related data posted on the platform daily. OBJECTIVE: This study aimed to uncover the potential of using Twitter data to understand the perspectives and experiences of patients with thyroid cancer at a global level. METHODS: This retrospective descriptive study collected tweets relevant to thyroid cancer in 2020 using the Twitter scraping tool. Only English-language tweets were included, and data preprocessing was performed to remove irrelevant tweets, duplicates, and retweets. Both tweets and Twitter users were manually classified into various groups based on the content. Each tweet underwent sentiment analysis and was classified as either positive, neutral, or negative. RESULTS: A total of 13,135 tweets related to thyroid cancer were analyzed. The authors of the tweets included patients with thyroid cancer (3225 tweets, 24.6%), patient's families and friends (2449 tweets, 18.6%), medical journals and media (1733 tweets, 13.2%), health care professionals (1093 tweets, 8.3%), and medical health organizations (940 tweets, 7.2%), respectively. The most discussed topics related to living with cancer (3650 tweets, 27.8%), treatment (2891 tweets, 22%), diagnosis (1613 tweets, 12.3%), risk factors and prevention (1137 tweets, 8.7%), and research (953 tweets, 7.3%). An average of 36 tweets pertaining to thyroid cancer were posted daily. Notably, the release of a film addressing thyroid cancer and the public disclosure of a news reporter's personal diagnosis of thyroid cancer resulted in a significant escalation in the volume of tweets. From the sentiment analysis, 53.5% (7025/13,135) of tweets were classified as neutral statements and 32.7% (4299/13,135) of tweets expressed negative emotions. Tweets from patients with thyroid cancer had the highest proportion of negative emotion (1385/3225 tweets, 42.9%), particularly when discussing symptoms. CONCLUSIONS: This study provides new insights on using Twitter data as a valuable data source to understand the experiences of patients with thyroid cancer. Twitter may provide an opportunity to improve patient and physician engagement or apply as a potential research data source.