Your browser doesn't support javascript.
loading
Information extraction pipelines for knowledge graphs.
Jaradeh, Mohamad Yaser; Singh, Kuldeep; Stocker, Markus; Both, Andreas; Auer, Sören.
Afiliação
  • Jaradeh MY; L3S Research Center, Leibniz University Hannover, Hanover, Germany.
  • Singh K; Zerotha-Research and Cerence GmbH, Aachen, Germany.
  • Stocker M; TIB Leibniz Information Centre for Science and Technology, Hanover, Germany.
  • Both A; Anhalt University of Applied Sciences, Bernburg, Germany.
  • Auer S; TIB Leibniz Information Centre for Science and Technology, Hanover, Germany.
Knowl Inf Syst ; 65(5): 1989-2016, 2023.
Article em En | MEDLINE | ID: mdl-36643405
ABSTRACT
In the last decade, a large number of knowledge graph (KG) completion approaches were proposed. Albeit effective, these efforts are disjoint, and their collective strengths and weaknesses in effective KG completion have not been studied in the literature. We extend Plumber, a framework that brings together the research community's disjoint efforts on KG completion. We include more components into the architecture of Plumber  to comprise 40 reusable components for various KG completion subtasks, such as coreference resolution, entity linking, and relation extraction. Using these components, Plumber dynamically generates suitable knowledge extraction pipelines and offers overall 432 distinct pipelines. We study the optimization problem of choosing optimal pipelines based on input sentences. To do so, we train a transformer-based classification model that extracts contextual embeddings from the input and finds an appropriate pipeline. We study the efficacy of Plumber for extracting the KG triples using standard datasets over three KGs DBpedia, Wikidata, and Open Research Knowledge Graph. Our results demonstrate the effectiveness of Plumber in dynamically generating KG completion pipelines, outperforming all baselines agnostic of the underlying KG. Furthermore, we provide an analysis of collective failure cases, study the similarities and synergies among integrated components and discuss their limitations.
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Ano de publicação: 2023 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Ano de publicação: 2023 Tipo de documento: Article