Use of a Machine-learning Method for Predicting Highly Cited Articles Within General Radiology Journals.

Rosenkrantz, Andrew B; Doshi, Ankur M; Ginocchio, Luke A; Aphinyanaphongs, Yindalon

Rosenkrantz, Andrew B; Doshi, Ankur M; Ginocchio, Luke A; Aphinyanaphongs, Yindalon.

Afiliação

Rosenkrantz AB; Department of Radiology, NYU Langone Medical Center, 660 First Avenue, 3rd Floor, New York, NY 10016. Electronic address: Andrew.Rosenkrantz@nyumc.org.
Doshi AM; Department of Radiology, NYU Langone Medical Center, 660 First Avenue, 3rd Floor, New York, NY 10016.
Ginocchio LA; Department of Radiology, NYU Langone Medical Center, 660 First Avenue, 3rd Floor, New York, NY 10016.
Aphinyanaphongs Y; Center for Healthcare Innovation and Delivery Science, NYU Langone Medical Center, New York, New York.

Acad Radiol ; 23(12): 1573-1581, 2016 12.

Article em En | MEDLINE | ID: mdl-27692588

RESUMO

RATIONALE AND OBJECTIVES: This study aimed to assess the performance of a text classification machine-learning model in predicting highly cited articles within the recent radiological literature and to identify the model's most influential article features. MATERIALS AND METHODS: We downloaded from PubMed the title, abstract, and medical subject heading terms for 10,065 articles published in 25 general radiology journals in 2012 and 2013. Three machine-learning models were applied to predict the top 10% of included articles in terms of the number of citations to the article in 2014 (reflecting the 2-year time window in conventional impact factor calculations). The model having the highest area under the curve was selected to derive a list of article features (words) predicting high citation volume, which was iteratively reduced to identify the smallest possible core feature list maintaining predictive power. Overall themes were qualitatively assigned to the core features. RESULTS: The regularized logistic regression (Bayesian binary regression) model had highest performance, achieving an area under the curve of 0.814 in predicting articles in the top 10% of citation volume. We reduced the initial 14,083 features to 210 features that maintain predictivity. These features corresponded with topics relating to various imaging techniques (eg, diffusion-weighted magnetic resonance imaging, hyperpolarized magnetic resonance imaging, dual-energy computed tomography, computed tomography reconstruction algorithms, tomosynthesis, elastography, and computer-aided diagnosis), particular pathologies (prostate cancer; thyroid nodules; hepatic adenoma, hepatocellular carcinoma, non-alcoholic fatty liver disease), and other topics (radiation dose, electroporation, education, general oncology, gadolinium, statistics). CONCLUSIONS: Machine learning can be successfully applied to create specific feature-based models for predicting articles likely to achieve high influence within the radiological literature.

Assuntos

Aprendizado de Máquina; Publicações Periódicas como Assunto/estatística & dados numéricos; Radiologia/estatística & dados numéricos; Área Sob a Curva; Teorema de Bayes; Bibliometria; Humanos; Fator de Impacto de Revistas; Publicações/estatística & dados numéricos; Editoração/estatística & dados numéricos

Palavras-chave

Radiology; bibliometrics; biomedical journals; machine learning

Texto completo

Adicionar na Minha BVS

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Publicações Periódicas como Assunto / Radiologia / Aprendizado de Máquina Tipo de estudo: Evaluation_studies / Prognostic_studies / Risk_factors_studies Limite: Humans Idioma: En Revista: Acad Radiol Assunto da revista: RADIOLOGIA Ano de publicação: 2016 Tipo de documento: Article País de publicação: Estados Unidos

Texto completo

Adicionar na Minha BVS

Imprimir

XML

PubMed Links

Buscar no Google