Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 3 de 3
Filtrar
Mais filtros

Base de dados
Assunto principal
Tipo de documento
Intervalo de ano de publicação
1.
PLOS Digit Health ; 3(7): e0000545, 2024 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-39078813

RESUMO

Manually labeling data for supervised learning is time and energy consuming; therefore, lexicon-based models such as VADER and TextBlob are used to automatically label data. However, it is argued that automated labels do not have the accuracy required for training an efficient model. Although automated labeling is frequently used for stance detection, automated stance labels have not been properly evaluated, in the previous works. In this work, to assess the accuracy of VADER and TextBlob automated labels for stance analysis, we first manually label a Twitter, now X, dataset related to M-pox stance detection. We then fine-tune different transformer-based models on the hand-labeled M-pox dataset, and compare their accuracy before and after fine-tuning, with the accuracy of automated labeled data. Our results indicated that the fine-tuned models surpassed the accuracy of VADER and TextBlob automated labels by up to 38% and 72.5%, respectively. Topic modeling further shows that fine-tuning diminished the scope of misclassified tweets to specific sub-topics. We conclude that fine-tuning transformer models on hand-labeled data for stance detection, elevates the accuracy to a superior level that is significantly higher than automated stance detection labels. This study verifies that automated stance detection labels are not reliable for sensitive use-cases such as health-related purposes. Manually labeled data is more convenient for developing Natural Language Processing (NLP) models that study and analyze mass opinions and conversations on social media platforms, during crises such as pandemics and epidemics.

2.
BMC Med Inform Decis Mak ; 23(1): 19, 2023 01 26.
Artigo em Inglês | MEDLINE | ID: mdl-36703133

RESUMO

The coronavirus disease 2019 (COVID-19) has developed into a pandemic. Data-driven techniques can be used to inform and guide public health decision- and policy-makers. In generalizing the spread of a virus over a large area, such as a province, it must be assumed that the transmission occurs as a stochastic process. It is therefore very difficult for policy and decision makers to understand and visualize the location specific dynamics of the virus on a more granular level. A primary concern is exposing local virus hot-spots, in order to inform and implement non-pharmaceutical interventions. A hot-spot is defined as an area experiencing exponential growth relative to the generalised growth of the pandemic. This paper uses the first and second waves of the COVID-19 epidemic in Gauteng Province, South Africa, as a case study. The study aims provide a data-driven methodology and comprehensive case study to expose location specific virus dynamics within a given area. The methodology uses an unsupervised Gaussian Mixture model to cluster cases at a desired granularity. This is combined with an epidemiological analysis to quantify each cluster's severity, progression and whether it can be defined as a hot-spot.


Assuntos
COVID-19 , Humanos , COVID-19/epidemiologia , Inteligência Artificial , África do Sul/epidemiologia , Big Data , Pandemias
3.
Artigo em Inglês | MEDLINE | ID: mdl-34299827

RESUMO

The impact of the still ongoing "Coronavirus Disease 2019" (COVID-19) pandemic has been and is still vast, affecting not only global human health and stretching healthcare facilities, but also profoundly disrupting societal and economic systems worldwide. The nature of the way the virus spreads causes cases to come in further recurring waves. This is due a complex array of biological, societal and environmental factors, including the novel nature of the emerging pathogen. Other parameters explaining the epidemic trend consisting of recurring waves are logistic-organizational challenges in the implementation of the vaccine roll-out, scarcity of doses and human resources, seasonality, meteorological drivers, and community heterogeneity, as well as cycles of strengthening and easing/lifting of the mitigation interventions. Therefore, it is crucial to be able to have an early alert system to identify when another wave of cases is about to occur. The availability of a variety of newly developed indicators allows for the exploration of multi-feature prediction models for case data. Ten indicators were selected as features for our prediction model. The model chosen is a Recurrent Neural Network with Long Short-Term Memory. This paper documents the development of an early alert/detection system that functions by predicting future daily confirmed cases based on a series of features that include mobility and stringency indices, and epidemiological parameters. The model is trained on the intermittent period in between the first and the second wave, in all of the South African provinces.


Assuntos
COVID-19 , Humanos , Memória de Curto Prazo , Redes Neurais de Computação , Pandemias , SARS-CoV-2
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA