Your browser doesn't support javascript.
loading
An evaluation of DistillerSR's machine learning-based prioritization tool for title/abstract screening - impact on reviewer-relevant outcomes.
Hamel, C; Kelly, S E; Thavorn, K; Rice, D B; Wells, G A; Hutton, B.
Afiliación
  • Hamel C; Clinical Epidemiology Program, Ottawa Hospital Research Institute, 501 Smyth Road, Box 201b, Ottawa, Ontario, K1H 8L6, Canada. cahamel@ohri.ca.
  • Kelly SE; Department of Medicine, University of Split, Split, Croatia. cahamel@ohri.ca.
  • Thavorn K; Cardiovascular Research Methods Centre, University of Ottawa Heart Institute, Ottawa, Ontario, Canada.
  • Rice DB; School of Epidemiology and Public Health, University of Ottawa, Ottawa, Ontario, Canada.
  • Wells GA; Clinical Epidemiology Program, Ottawa Hospital Research Institute, 501 Smyth Road, Box 201b, Ottawa, Ontario, K1H 8L6, Canada.
  • Hutton B; School of Epidemiology and Public Health, University of Ottawa, Ottawa, Ontario, Canada.
BMC Med Res Methodol ; 20(1): 256, 2020 10 15.
Article en En | MEDLINE | ID: mdl-33059590
ABSTRACT

BACKGROUND:

Systematic reviews often require substantial resources, partially due to the large number of records identified during searching. Although artificial intelligence may not be ready to fully replace human reviewers, it may accelerate and reduce the screening burden. Using DistillerSR (May 2020 release), we evaluated the performance of the prioritization simulation tool to determine the reduction in screening burden and time savings.

METHODS:

Using a true recall @ 95%, response sets from 10 completed systematic reviews were used to evaluate (i) the reduction of screening burden; (ii) the accuracy of the prioritization algorithm; and (iii) the hours saved when a modified screening approach was implemented. To account for variation in the simulations, and to introduce randomness (through shuffling the references), 10 simulations were run for each review. Means, standard deviations, medians and interquartile ranges (IQR) are presented.

RESULTS:

Among the 10 systematic reviews, using true recall @ 95% there was a median reduction in screening burden of 47.1% (IQR 37.5 to 58.0%). A median of 41.2% (IQR 33.4 to 46.9%) of the excluded records needed to be screened to achieve true recall @ 95%. The median title/abstract screening hours saved using a modified screening approach at a true recall @ 95% was 29.8 h (IQR 28.1 to 74.7 h). This was increased to a median of 36 h (IQR 32.2 to 79.7 h) when considering the time saved not retrieving and screening full texts of the remaining 5% of records not yet identified as included at title/abstract. Among the 100 simulations (10 simulations per review), none of these 5% of records were a final included study in the systematic review. The reduction in screening burden to achieve true recall @ 95% compared to @ 100% resulted in a reduced screening burden median of 40.6% (IQR 38.3 to 54.2%).

CONCLUSIONS:

The prioritization tool in DistillerSR can reduce screening burden. A modified or stop screening approach once a true recall @ 95% is achieved appears to be a valid method for rapid reviews, and perhaps systematic reviews. This needs to be further evaluated in prospective reviews using the estimated recall.
Asunto(s)
Palabras clave

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Inteligencia Artificial / Aprendizaje Automático Tipo de estudio: Diagnostic_studies / Observational_studies / Screening_studies Límite: Humans Idioma: En Revista: BMC Med Res Methodol Asunto de la revista: MEDICINA Año: 2020 Tipo del documento: Article País de afiliación: Canadá

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Inteligencia Artificial / Aprendizaje Automático Tipo de estudio: Diagnostic_studies / Observational_studies / Screening_studies Límite: Humans Idioma: En Revista: BMC Med Res Methodol Asunto de la revista: MEDICINA Año: 2020 Tipo del documento: Article País de afiliación: Canadá