How to apply zero-shot learning to text data in substance use research: An overview and tutorial with media data.
Addiction
; 119(5): 951-959, 2024 05.
Article
en En
| MEDLINE
| ID: mdl-38212974
ABSTRACT
A vast amount of media-related text data is generated daily in the form of social media posts, news stories or academic articles. These text data provide opportunities for researchers to analyse and understand how substance-related issues are being discussed. The main methods to analyse large text data (content analyses or specifically trained deep-learning models) require substantial manual annotation and resources. A machine-learning approach called 'zero-shot learning' may be quicker, more flexible and require fewer resources. Zero-shot learning uses models trained on large, unlabelled (or weakly labelled) data sets to classify previously unseen data into categories on which the model has not been specifically trained. This means that a pre-existing zero-shot learning model can be used to analyse media-related text data without the need for task-specific annotation or model training. This approach may be particularly important for analysing data that is time critical. This article describes the relatively new concept of zero-shot learning and how it can be applied to text data in substance use research, including a brief practical tutorial.
Palabras clave
Texto completo:
1
Base de datos:
MEDLINE
Asunto principal:
Trastornos Relacionados con Sustancias
/
Medios de Comunicación Sociales
Tipo de estudio:
Guideline
Idioma:
En
Revista:
Addiction
Asunto de la revista:
TRANSTORNOS RELACIONADOS COM SUBSTANCIAS
Año:
2024
Tipo del documento:
Article