Your browser doesn't support javascript.
loading
The changing landscape of text mining: a review of approaches for ecology and evolution.
Farrell, Maxwell J; Le Guillarme, Nicolas; Brierley, Liam; Hunter, Bronwen; Scheepens, Daan; Willoughby, Anna; Yates, Andrew; Mideo, Nicole.
Afiliación
  • Farrell MJ; Department of Ecology & Evolutionary Biology, University of Toronto, Toronto, Ontario, Canada.
  • Le Guillarme N; School of Biodiversity, One Health & Veterinary Medicine, University of Glasgow, Glasgow, UK.
  • Brierley L; MRC-University of Glasgow Centre for Virus Research, Glasgow, UK.
  • Hunter B; Université Grenoble Alpes, CNRS, LECA, Laboratoire d'Ecologie Alpine, Grenoble, France.
  • Scheepens D; MRC-University of Glasgow Centre for Virus Research, Glasgow, UK.
  • Willoughby A; Department of Health Data Science, University of Liverpool, Liverpool, UK.
  • Yates A; School of Life Sciences, University of Sussex, Brighton, UK.
  • Mideo N; Division of Biosciences, University College London, London, UK.
Proc Biol Sci ; 291(2027): 20240423, 2024 Jul.
Article en En | MEDLINE | ID: mdl-39082244
ABSTRACT
In ecology and evolutionary biology, the synthesis and modelling of data from published literature are commonly used to generate insights and test theories across systems. However, the tasks of searching, screening, and extracting data from literature are often arduous. Researchers may manually process hundreds to thousands of articles for systematic reviews, meta-analyses, and compiling synthetic datasets. As relevant articles expand to tens or hundreds of thousands, computer-based approaches can increase the efficiency, transparency and reproducibility of literature-based research. Methods available for text mining are rapidly changing owing to developments in machine learning-based language models. We review the growing landscape of approaches, mapping them onto three broad paradigms (frequency-based approaches, traditional Natural Language Processing and deep learning-based language models). This serves as an entry point to learn foundational and cutting-edge concepts, vocabularies, and methods to foster integration of these tools into ecological and evolutionary research. We cover approaches for modelling ecological texts, generating training data, developing custom models and interacting with large language models and discuss challenges and possible solutions to implementing these methods in ecology and evolution.
Asunto(s)
Palabras clave

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Procesamiento de Lenguaje Natural / Ecología / Evolución Biológica / Minería de Datos Idioma: En Año: 2024 Tipo del documento: Article

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Procesamiento de Lenguaje Natural / Ecología / Evolución Biológica / Minería de Datos Idioma: En Año: 2024 Tipo del documento: Article