Structure-based knowledge acquisition from electronic lab notebooks for research data provenance documentation.
Schröder, Max; Staehlke, Susanne; Groth, Paul; Nebe, J Barbara; Spors, Sascha; Krüger, Frank.
Affiliation
  • Schröder M; Institute of Communications Engineering, University of Rostock, Rostock, Germany. max.schroeder@uni-rostock.de.
  • Staehlke S; University Library, University of Rostock, Rostock, Germany. max.schroeder@uni-rostock.de.
  • Groth P; Department of Cell Biology, University Medical Center Rostock, Rostock, Germany.
  • Nebe JB; Informatics Institute, University of Amsterdam, Amsterdam, Netherlands.
  • Spors S; Department of Cell Biology, University Medical Center Rostock, Rostock, Germany.
  • Krüger F; Department Life, Light & Matter, University of Rostock, Rostock, Germany.
J Biomed Semantics ; 13(1): 4, 2022 01 31.
Article in English | MEDLINE | ID: mdl-35101121
BACKGROUND: Electronic Laboratory Notebooks (ELNs) are used to document experiments and investigations in the wet lab. Protocols in ELNs contain a detailed description of the conducted steps, including the information necessary to understand the procedure and the generated research data, as well as to reproduce the research investigation. The purpose of this study is to investigate whether such ELN protocols can be used to create semantic documentation of the provenance of research data using ontologies and linked data methodologies.

METHODS: Based on an ELN protocol of a biomedical wet-lab experiment, a retrospective provenance model of the generated research data, describing the details of the experiment in a machine-interpretable way, is manually engineered. Furthermore, an automated approach for knowledge acquisition from ELN protocols is derived from these results. This structure-based approach exploits the structure in the experiment's description, such as headings, tables, and links, to translate the ELN protocol into a semantic knowledge representation. To satisfy the Findable, Accessible, Interoperable, and Reusable (FAIR) guiding principles, a ready-to-publish bundle is created that contains the research data together with their semantic documentation.

RESULTS: While the manual modelling efforts serve as a proof of concept by employing one protocol, the automated structure-based approach demonstrates the potential for generalisation with seven ELN protocols. For each of those protocols, a ready-to-publish bundle is created and, by employing the SPARQL query language, it is illustrated that questions about the processes and the obtained research data can be answered.

CONCLUSIONS: The semantic documentation of research data obtained from the ELN protocols allows for the representation of the retrospective provenance of research data in a machine-interpretable way. Research Object Crate (RO-Crate) bundles including these models enable researchers not only to easily share the research data together with the corresponding documentation, but also to search the experiments and relate them to each other.
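
The structure-based idea described in the abstract (headings, tables, and links as cues for a semantic representation) can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the protocol is available as HTML, that each heading opens one experimental step, and that each link inside a step names a resource used by that step. The `http://example.org/` namespace, the sample protocol, and the helper names `extract_triples` and `steps_using` are hypothetical; the use of the W3C PROV-O terms `prov:Activity` and `prov:used` is also an assumption, since the abstract does not name the ontologies employed. Tables are not handled here.

```python
from html.parser import HTMLParser

PROV = "http://www.w3.org/ns/prov#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
BASE = "http://example.org/protocol/"  # hypothetical namespace

# Hypothetical two-step protocol, standing in for an ELN export.
SAMPLE = """
<h2>Cell seeding</h2>
<p>Cells were taken from <a href="http://example.org/material/cells">the stock</a>.</p>
<h2>Staining</h2>
<p>Stained with <a href="http://example.org/material/dye">the dye</a>.</p>
"""


class ProtocolParser(HTMLParser):
    """Turn headings and links of an HTML protocol into (s, p, o) triples."""

    def __init__(self):
        super().__init__()
        self.triples = []
        self._step = None          # IRI of the step opened by the latest heading
        self._in_heading = False
        self._count = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("h1", "h2", "h3"):
            # Assumption: each heading opens one experimental step.
            self._count += 1
            self._step = f"{BASE}step{self._count}"
            self._in_heading = True
            self.triples.append((self._step, RDF_TYPE, PROV + "Activity"))
        elif tag == "a" and self._step is not None:
            # Assumption: a link inside a step names a resource the step used.
            href = dict(attrs).get("href")
            if href:
                self.triples.append((self._step, PROV + "used", href))

    def handle_endtag(self, tag):
        if tag in ("h1", "h2", "h3"):
            self._in_heading = False

    def handle_data(self, data):
        # The heading text becomes the human-readable label of the step.
        if self._in_heading and data.strip():
            self.triples.append((self._step, RDFS_LABEL, data.strip()))


def extract_triples(html):
    """Parse an HTML protocol and return its triples as a list of tuples."""
    parser = ProtocolParser()
    parser.feed(html)
    parser.close()
    return parser.triples


def steps_using(triples, resource):
    """Answer a simple provenance question: which steps used this resource?"""
    return [s for s, p, o in triples if p == PROV + "used" and o == resource]


if __name__ == "__main__":
    for triple in extract_triples(SAMPLE):
        print(triple)
```

The `steps_using` helper stands in for the kind of question the abstract says is answered with SPARQL; in a fuller setting the triples would be loaded into an RDF store and queried there.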
Full text: 1 Collections: 01-international Database: MEDLINE Main subject: Documentation / Knowledge Bases Type of study: Observational_studies Language: English Year of publication: 2022 Document type: Article