Machine learning in systematic reviews: Comparing automated text clustering with Lingo3G and human researcher categorization in a rapid review.

Muller, Ashley Elizabeth; Ames, Heather Melanie R; Jardim, Patricia Sofia Jacobsen; Rose, Christopher James

Affiliation

Muller AE; Norwegian Institute of Public Health, Skøyen, Norway.
Ames HMR; Norwegian Institute of Public Health, Skøyen, Norway.
Jardim PSJ; Cochrane Consumer and Communication Group, Centre for Health Communication and Participation, School of Psychology and Public Health, La Trobe University, Bundoora, Victoria, Australia.
Rose CJ; Norwegian Institute of Public Health, Skøyen, Norway.

Res Synth Methods ; 13(2): 229-241, 2022 Mar.

Article in English | MEDLINE | ID: mdl-34919321

ABSTRACT

Systematic reviews are resource-intensive. The machine learning tools being developed mostly focus on the study identification process, but tools to assist in analysis and categorization are also needed. One possibility is to use unsupervised automatic text clustering, in which each study is automatically assigned to one or more meaningful clusters. Our main aim was to assess the usefulness of an automated clustering method, Lingo3G, in categorizing studies in a simplified rapid review, and then to compare its performance (precision and recall) with that of manual categorization. We randomly assigned all 128 studies in a review to be coded by a human researcher blinded to cluster assignment (mimicking two independent researchers) or by a human researcher non-blinded to cluster assignment (mimicking one researcher checking another's work). We compared time use, precision and recall of manual categorization versus automated clustering. Automated clustering and manual categorization organized studies by population and intervention/context. Automated clustering failed to identify two manually identified categories but identified one additional category not identified by the human researcher. We estimate that automated clustering has similar precision to both blinded and non-blinded researchers (e.g., 88% vs. 89%), but higher recall (e.g., 89% vs. 84%). Manual categorization required 49% more time than automated clustering. Using a specific clustering algorithm, automated clustering can help categorize studies and identify patterns across them in simpler systematic reviews. We found that the clustering was sensitive enough to group studies according to linguistic differences that often corresponded to the manual categories.
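
The abstract reports precision and recall but does not say how they were computed. As an illustration only, the following minimal Python sketch shows one common way to score category assignments when each study may belong to more than one category: micro-averaged precision and recall against a reference standard. The function name, study identifiers, and category labels are hypothetical and are not taken from the study.

```python
from typing import Dict, Set, Tuple


def micro_precision_recall(
    predicted: Dict[str, Set[str]],
    reference: Dict[str, Set[str]],
) -> Tuple[float, float]:
    """Micro-averaged precision and recall over (study, category) pairs.

    Both arguments map a study ID to the set of categories assigned to it.
    This is an illustrative scoring scheme, not the authors' procedure.
    """
    tp = fp = fn = 0
    for study_id in reference.keys() | predicted.keys():
        pred = predicted.get(study_id, set())
        ref = reference.get(study_id, set())
        tp += len(pred & ref)   # categories assigned and in the reference
        fp += len(pred - ref)   # categories assigned but not in the reference
        fn += len(ref - pred)   # reference categories that were missed
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Hypothetical toy data: three studies, made-up category labels.
reference = {
    "study_1": {"adolescents", "school-based"},
    "study_2": {"adults"},
    "study_3": {"adults", "primary care"},
}
predicted = {
    "study_1": {"adolescents"},
    "study_2": {"adults", "primary care"},
    "study_3": {"adults", "primary care"},
}

precision, recall = micro_precision_recall(predicted, reference)
print(f"precision={precision:.2f}, recall={recall:.2f}")
```

In this toy data, one reference category is missed and one extra category is assigned, so both measures fall below 1. Other averaging schemes (for example, per-category macro-averaging) would be equally reasonable and could give different values.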

Subjects

Algorithms; Machine Learning; Cluster Analysis; Humans; Research Design; Systematic Reviews as Topic

Keywords

Lingo3G; clustering; machine learning; scoping reviews; systematic review

Full text: 1 | Collections: 01-internacional | Database: MEDLINE | Main subject: Algorithms / Machine Learning | Study type: Prognostic_studies / Systematic_reviews | Limit: Humans | Language: English | Publication year: 2022 | Document type: Article
