Your browser doesn't support javascript.
loading
Dataset search in biodiversity research: Do metadata in data repositories reflect scholarly information needs?
Löffler, Felicitas; Wesp, Valentin; König-Ries, Birgitta; Klan, Friederike.
Afiliación
  • Löffler F; Heinz Nixdorf Chair for Distributed Information Systems, Department of Mathematics and Computer Science, Friedrich Schiller University Jena, Jena, Germany.
  • Wesp V; Heinz Nixdorf Chair for Distributed Information Systems, Department of Mathematics and Computer Science, Friedrich Schiller University Jena, Jena, Germany.
  • König-Ries B; Heinz Nixdorf Chair for Distributed Information Systems, Department of Mathematics and Computer Science, Friedrich Schiller University Jena, Jena, Germany.
  • Klan F; Michael-Stifel-Center for Data-Driven and Simulation Science, Jena, Germany.
PLoS One ; 16(3): e0246099, 2021.
Article en En | MEDLINE | ID: mdl-33760822
ABSTRACT
The increasing amount of publicly available research data provides the opportunity to link and integrate data in order to create and prove novel hypotheses, to repeat experiments or to compare recent data to data collected at a different time or place. However, recent studies have shown that retrieving relevant data for data reuse is a time-consuming task in daily research practice. In this study, we explore what hampers dataset retrieval in biodiversity research, a field that produces a large amount of heterogeneous data. In particular, we focus on scholarly search interests and metadata, the primary source of data in a dataset retrieval system. We show that existing metadata currently poorly reflect information needs and therefore are the biggest obstacle in retrieving relevant data. Our findings indicate that for data seekers in the biodiversity domain environments, materials and chemicals, species, biological and chemical processes, locations, data parameters and data types are important information categories. These interests are well covered in metadata elements of domain-specific standards. However, instead of utilizing these standards, large data repositories tend to use metadata standards with domain-independent metadata fields that cover search interests only to some extent. A second problem are arbitrary keywords utilized in descriptive fields such as title, description or subject. Keywords support scholars in a full text search only if the provided terms syntactically match or their semantic relationship to terms used in a user query is known.
Asunto(s)

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Investigación / Biodiversidad / Minería de Datos / Metadatos Tipo de estudio: Guideline Idioma: En Revista: PLoS One Asunto de la revista: CIENCIA / MEDICINA Año: 2021 Tipo del documento: Article País de afiliación: Alemania

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Investigación / Biodiversidad / Minería de Datos / Metadatos Tipo de estudio: Guideline Idioma: En Revista: PLoS One Asunto de la revista: CIENCIA / MEDICINA Año: 2021 Tipo del documento: Article País de afiliación: Alemania