Argument mining as rapid screening tool of COVID-19 literature quality: Preliminary evidence.

Brambilla, Gianfranco; Rosi, Antonella; Antici, Francesco; Galassi, Andrea; Giansanti, Daniele; Magurano, Fabio; Ruggeri, Federico; Torroni, Paolo; Cisbani, Evaristo; Lippi, Marco

Affiliation

Brambilla G; Istituto Superiore di Sanità, Rome, Italy.
Rosi A; Istituto Superiore di Sanità, Rome, Italy.
Antici F; Department of Computer Science and Engineering, University of Bologna, Bologna, Italy.
Galassi A; Department of Computer Science and Engineering, University of Bologna, Bologna, Italy.
Giansanti D; Istituto Superiore di Sanità, Rome, Italy.
Magurano F; Istituto Superiore di Sanità, Rome, Italy.
Ruggeri F; Department of Computer Science and Engineering, University of Bologna, Bologna, Italy.
Torroni P; Department of Computer Science and Engineering, University of Bologna, Bologna, Italy.
Cisbani E; Istituto Superiore di Sanità, Rome, Italy.
Lippi M; Department of Sciences and Methods for Engineering, University of Modena and Reggio Emilia, Reggio Emilia, Italy.

Front Public Health ; 10: 945181, 2022.

Article in English | MEDLINE | ID: mdl-35923956

ABSTRACT

Background:

The COVID-19 pandemic prompted the scientific community to share timely evidence, including preprints that had not yet been peer reviewed.

Purpose:

To develop an artificial intelligence system for analyzing the scientific literature by leveraging recent developments in the field of Argument Mining (AM).

Methodology:

Scientific quality criteria were borrowed from two selected Cochrane systematic reviews. Four independent reviewers blindly scored 40 papers for each review on a 1-5 scale. These scores were matched with the automatic analysis performed by an AM system named MARGOT, which detected claims and supporting evidence for the cited papers. Outcomes were evaluated with inter-rater indices (Cohen's Kappa, Krippendorff's Alpha, s* statistics).
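As an illustration of one of the inter-rater indices named above, the following is a minimal sketch of unweighted Cohen's Kappa for two raters. It is not the authors' code: the function name `cohen_kappa` and the example scores are hypothetical, and the study may have used weighted or otherwise adapted variants that this record does not specify.

```python
from collections import Counter


def cohen_kappa(rater_a, rater_b):
    """Unweighted Cohen's Kappa for two raters scoring the same items."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must score the same non-empty set of items")
    n = len(rater_a)

    # Observed agreement: fraction of items given the same score by both raters.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n

    # Chance agreement: product of the two raters' marginal frequencies,
    # summed over every score category.
    counts_a = Counter(rater_a)
    counts_b = Counter(rater_b)
    expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)

    if expected == 1.0:
        # Both raters used one single identical category; agreement is trivially perfect.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical 1-5 scores from two reviewers on eight papers (illustrative only).
reviewer_1 = [5, 4, 4, 2, 1, 3, 3, 5]
reviewer_2 = [5, 4, 3, 2, 1, 3, 2, 5]
kappa = cohen_kappa(reviewer_1, reviewer_2)
```

Values near 0 indicate agreement no better than chance and values near 1 indicate near-perfect agreement. Because 1-5 scores are ordinal, a weighted Kappa that penalizes large disagreements more than small ones may be a better fit; the unweighted form is shown here only for brevity.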

Results:

MARGOT performs differently on the two selected Cochrane reviews: the inter-rater indices show fair-to-moderate agreement of the most relevant MARGOT metrics with both the Cochrane and the skilled interval scores, with larger values for one of the two reviews.

Discussion and conclusions:

The noted discrepancy could stem from a limitation of the MARGOT system that can be improved; yet the level of agreement between human reviewers also suggests that the two reviews differ in the complexity of debating controversial arguments. These preliminary results encourage expanding and deepening the investigation to other topics and to a larger number of highly specialized reviewers, in order to reduce uncertainty in the evaluation process and thus support the retraining of AM systems.

Subjects

Artificial Intelligence; COVID-19; COVID-19/diagnosis; COVID-19/epidemiology; Humans; Pandemics; Reproducibility of Results; Research

Keywords

COVID-19; argument mining; artificial intelligence; inter-rater agreement; scientific literature quality assessment

Full text: 1 Database: MEDLINE Main subject: Artificial Intelligence / COVID-19 Study type: Diagnostic_studies / Screening_studies Limits: Humans Language: En Publication year: 2022 Document type: Article
