Your browser doesn't support javascript.
loading
Leveraging Text-to-Text Pretrained Language Models for Question Answering in Chemistry.
Tran, Dan; Pascazio, Laura; Akroyd, Jethro; Mosbach, Sebastian; Kraft, Markus.
Afiliación
  • Tran D; CARES, Cambridge Centre for Advanced Research and Education in Singapore, 1 Create Way, CREATE Tower, #05-05, Singapore 138602, Singapore.
  • Pascazio L; CARES, Cambridge Centre for Advanced Research and Education in Singapore, 1 Create Way, CREATE Tower, #05-05, Singapore 138602, Singapore.
  • Akroyd J; CARES, Cambridge Centre for Advanced Research and Education in Singapore, 1 Create Way, CREATE Tower, #05-05, Singapore 138602, Singapore.
  • Mosbach S; Department of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge CB3 0AS, U.K.
  • Kraft M; CMCL Innovations, Sheraton House, Castle Park, Cambridge CB3 0AX, U.K.
ACS Omega ; 9(12): 13883-13896, 2024 Mar 26.
Article en En | MEDLINE | ID: mdl-38559914
ABSTRACT
In this study, we present a question answering (QA) system for chemistry, named Marie, with the use of a text-to-text pretrained language model to attain accurate data retrieval. The underlying data store is "The World Avatar" (TWA), a general world model consisting of a knowledge graph that evolves over time. TWA includes information about chemical species such as their chemical and physical properties, applications, and chemical classifications. Building upon our previous work on KGQA for chemistry, this advanced version of Marie leverages a fine-tuned Flan-T5 model to seamlessly translate natural language questions into SPARQL queries with no separate components for entity and relation linking. The developed QA system demonstrates competence in providing accurate results for complex queries that involve many relation hops as well as showcasing the ability to balance correctness and speed for real-world usage. This new approach offers significant advantages over the prior implementation that relied on knowledge graph embedding. Specifically, the updated system boasts high accuracy and great flexibility in accommodating changes and evolution of the data stored in the knowledge graph without necessitating retraining. Our evaluation results underscore the efficacy of the improved system, highlighting its superior accuracy and the ability in answering complex questions compared to its predecessor.

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Idioma: En Revista: ACS Omega Año: 2024 Tipo del documento: Article País de afiliación: Singapur Pais de publicación: Estados Unidos

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Idioma: En Revista: ACS Omega Año: 2024 Tipo del documento: Article País de afiliación: Singapur Pais de publicación: Estados Unidos