Evaluating human versus machine learning performance in classifying research abstracts.

Goh, Yeow Chong; Cai, Xin Qing; Theseira, Walter; Ko, Giovanni; Khor, Khiam Aik

Goh, Yeow Chong; Cai, Xin Qing; Theseira, Walter; Ko, Giovanni; Khor, Khiam Aik.

Afiliação

Goh YC; School of Mechanical and Aerospace Engineering, Nanyang Technological University, Singapore, Singapore.
Cai XQ; School of Mechanical and Aerospace Engineering, Nanyang Technological University, Singapore, Singapore.
Theseira W; School of Business, Singapore University of Social Sciences, Singapore, Singapore.
Ko G; School of Economics, Singapore Management University, Singapore, Singapore.
Khor KA; Talent Recruitment and Career Support (TRACS) Office and Bibliometrics Analysis, Nanyang Technological University, Singapore, Singapore.

Scientometrics ; 125(2): 1197-1212, 2020.

Article em En | MEDLINE | ID: mdl-32836529

ABSTRACT

ABSTRACT

We study whether humans or machine learning (ML) classification models are better at classifying scientific research abstracts according to a fixed set of discipline groups. We recruit both undergraduate and postgraduate assistants for this task in separate stages, and compare their performance against the support vectors machine ML algorithm at classifying European Research Council Starting Grant project abstracts to their actual evaluation panels, which are organised by discipline groups. On average, ML is more accurate than human classifiers, across a variety of training and test datasets, and across evaluation panels. ML classifiers trained on different training sets are also more reliable than human classifiers, meaning that different ML classifiers are more consistent in assigning the same classifications to any given abstract, compared to different human classifiers. While the top five percentile of human classifiers can outperform ML in limited cases, selection and training of such classifiers is likely costly and difficult compared to training ML models. Our results suggest ML models are a cost effective and highly accurate method for addressing problems in comparative bibliometric analysis, such as harmonising the discipline classifications of research from different funding agencies or countries.

Palavras-chave

Discipline classification; Supervised classification; Text classification

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Idioma: En Ano de publicação: 2020 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links