Your browser doesn't support javascript.
loading
A representation model for biological entities by fusing structured axioms with unstructured texts.
Lou, Peiliang; Dong, YuXin; Jimeno Yepes, Antonio; Li, Chen.
Afiliación
  • Lou P; School of Computer Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China.
  • Dong Y; Key Laboratory of Intelligent Networks and Network Security (Xi'an Jiaotong University), Ministry of Education, Xi'an, Shaanxi 710049, China.
  • Jimeno Yepes A; School of Computer Science and Technology, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China.
  • Li C; IBM Research Australia, Southbank, VIC 3006, Australia.
Bioinformatics ; 37(8): 1156-1163, 2021 05 23.
Article en En | MEDLINE | ID: mdl-33107905
ABSTRACT
MOTIVATION Structured semantic resources, for example, biological knowledge bases and ontologies, formally define biological concepts, entities and their semantic relationships, manifested as structured axioms and unstructured texts (e.g. textual definitions). The resources contain accurate expressions of biological reality and have been used by machine-learning models to assist intelligent applications like knowledge discovery. The current methods use both the axioms and definitions as plain texts in representation learning (RL). However, since the axioms are machine-readable while the natural language is human-understandable, difference in meaning of token and structure impedes the representations to encode desirable biological knowledge.

RESULTS:

We propose ERBK, a RL model of bio-entities. Instead of using the axioms and definitions as a textual corpus, our method uses knowledge graph embedding method and deep convolutional neural models to encode the axioms and definitions respectively. The representations could not only encode more underlying biological knowledge but also be further applied to zero-shot circumstance where existing approaches fall short. Experimental evaluations show that ERBK outperforms the existing methods for predicting protein-protein interactions and gene-disease associations. Moreover, it shows that ERBK still maintains promising performance under the zero-shot circumstance. We believe the representations and the method have certain generality and could extend to other types of bio-relation. AVAILABILITY AND IMPLEMENTATION The source code is available at the gitlab repository https//gitlab.com/BioAI/erbk. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Asunto(s)

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Bases del Conocimiento / Aprendizaje Automático Tipo de estudio: Prognostic_studies Límite: Humans Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2021 Tipo del documento: Article País de afiliación: China

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Bases del Conocimiento / Aprendizaje Automático Tipo de estudio: Prognostic_studies Límite: Humans Idioma: En Revista: Bioinformatics Asunto de la revista: INFORMATICA MEDICA Año: 2021 Tipo del documento: Article País de afiliación: China
...