Predicting substantive biomedical citations without full text.
Hoppe, Travis A; Arabi, Salsabil; Hutchins, B Ian.
Affiliations
  • Hoppe TA; Office of the Director, National Center for Health Statistics, Centers for Disease Control and Prevention, Hyattsville, MD 20782.
  • Arabi S; Information School, School of Computer, Data, and Information Sciences, College of Letters and Science, University of Wisconsin-Madison, Madison, WI 53706.
  • Hutchins BI; Information School, School of Computer, Data, and Information Sciences, College of Letters and Science, University of Wisconsin-Madison, Madison, WI 53706.
Proc Natl Acad Sci U S A ; 120(30): e2213697120, 2023 07 25.
Article in English | MEDLINE | ID: mdl-37463199
ABSTRACT
Insights from biomedical citation networks can be used to identify promising avenues for accelerating research and its downstream bench-to-bedside translation. Citation analysis generally assumes that each citation documents substantive knowledge transfer that informed the conception, design, or execution of the main experiments. However, citations may exist for other reasons. In this paper, we take advantage of late-stage citations added during peer review because these are less likely to represent substantive knowledge flow. Using a large, comprehensive feature set of open access data, we train a predictive model to identify late-stage citations. The model relies only on the title, abstract, and citations to previous articles, not on the full text or future citation patterns, making it suitable for publications as soon as they are released and for those behind a paywall (the vast majority). We find that high prediction scores identify late-stage citations that were likely added during the peer review process as well as those more likely to be rhetorical, such as journal self-citations added during review. Conversely, our model gives low prediction scores to early-stage citations and citation classes that are known to represent substantive knowledge transfer. Using this model, we find that US federally funded biomedical research publications represent 30% of the predicted early-stage (and more likely to be substantive) knowledge transfer from basic studies to clinical research, even though these publications comprise only 10% of the literature. This is a threefold overrepresentation in this important type of knowledge flow.
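The threefold figure in the abstract follows from dividing the share of predicted early-stage knowledge transfer (30%) by the share of the literature (10%). A minimal sketch of that arithmetic, where the function name `overrepresentation` and its parameters are illustrative assumptions and not taken from the paper:

```python
def overrepresentation(flow_share_pct: float, literature_share_pct: float) -> float:
    """Ratio of a group's share of a knowledge flow to its share of the literature.

    Both arguments are percentages. The name and signature are hypothetical,
    chosen only to illustrate the arithmetic stated in the abstract.
    """
    if literature_share_pct <= 0:
        raise ValueError("literature_share_pct must be positive")
    return flow_share_pct / literature_share_pct


# Figures quoted in the abstract: 30% of the flow, 10% of the literature.
ratio = overrepresentation(30, 10)
print(f"{ratio:.1f}x overrepresentation")
```

A ratio of 1 would mean a group contributes to this knowledge flow in proportion to its share of the literature.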

Full text: 1 | Collection: 01-international | Database: MEDLINE | Main subject: Peer Review / Biomedical Research | Study type: Prognostic_studies / Risk_factors_studies | Language: En | Journal: Proc Natl Acad Sci U S A | Year: 2023 | Document type: Article
