Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 9 de 9
Filter
Add more filters











Database
Language
Publication year range
1.
Neural Netw ; 179: 106479, 2024 Jun 20.
Article in English | MEDLINE | ID: mdl-39146716

ABSTRACT

Multi-Modal Entity Alignment (MMEA), aiming to discover matching entity pairs on two multi-modal knowledge graphs (MMKGs), is an essential task in knowledge graph fusion. Through mining feature information of MMKGs, entities are aligned to tackle the issue that an MMKG is incapable of effective integration. The recent attempt at neighbors and attribute fusion mainly focuses on aggregating multi-modal attributes, neglecting the structure effect with multi-modal attributes for entity alignment. This paper proposes an innovative approach, namely TriFac, to exploit embedding refinement for factorizing the original multi-modal knowledge graphs through a two-stage MMKG factorization. Notably, we propose triplet-aware graph neural networks to aggregate multi-relational features. We propose multi-modal fusion for aggregating multiple features and design three novel metrics to measure knowledge graph factorization performance on the unified factorized latent space. Empirical results indicate the effectiveness of TriFac, surpassing previous state-of-the-art models on two MMEA datasets and a power system dataset.

2.
Neural Netw ; 179: 106583, 2024 Jul 27.
Article in English | MEDLINE | ID: mdl-39111163

ABSTRACT

Entity alignment is a crucial task in knowledge graphs, aiming to match corresponding entities from different knowledge graphs. Due to the scarcity of pre-aligned entities in real-world scenarios, research focused on unsupervised entity alignment has become more popular. However, current unsupervised entity alignment methods suffer from a lack of informative entity guidance, hindering their ability to accurately predict challenging entities with similar names and structures. To solve these problems, we present an unsupervised multi-view contrastive learning framework with an attention-based reranking strategy for entity alignment, named AR-Align. In AR-Align, two kinds of data augmentation methods are employed to provide a complementary view for neighborhood and attribute, respectively. Next, a multi-view contrastive learning method is introduced to reduce the semantic gap between different views of the augmented entities. Moreover, an attention-based reranking strategy is proposed to rerank the hard entities through calculating their weighted sum of embedding similarities on different structures. Experimental results indicate that AR-Align outperforms most both supervised and unsupervised state-of-the-art methods on three benchmark datasets.

3.
Neural Netw ; 173: 106178, 2024 May.
Article in English | MEDLINE | ID: mdl-38367354

ABSTRACT

Entity alignment refers to discovering the entity pairs with the same realistic meaning in different knowledge graphs. This technology is of great significance for completing and fusing knowledge graphs. Recently, methods based on knowledge representation learning have achieved remarkable achievements in entity alignment. However, most existing approaches do not mine hidden information in the knowledge graph as much as possible. This paper suggests SCMEA, a novel cross-lingual entity alignment framework based on multi-aspect information fusion and bidirectional contrastive learning. SCMEA initially adopts diverse representation learning models to embed multi-aspect information of entities and integrates them into a unified embedding space with an adaptive weighted mechanism to overcome the missing information and the problem of different-aspect information are not uniform. Then, we propose a stacked relation-entity co-enhanced model to further improve the representations of entities, wherein relation representation is modeled using an Entity Collector with Global Entity Attention. Finally, a combined loss function based on improved bidirectional contrastive learning is introduced to optimize model parameters and entity representation, effectively mitigating the hubness problem and accelerating model convergence. We conduct extensive experiments to evaluate the alignment performance of SCMEA. The overall experimental results, ablation studies, and analysis performed on five cross-lingual datasets demonstrate that our model achieves varying degrees of performance improvement and verifies the effectiveness and robustness of the model.


Subject(s)
Knowledge , Learning
4.
Neural Netw ; 172: 106143, 2024 Apr.
Article in English | MEDLINE | ID: mdl-38309139

ABSTRACT

Entity alignment aims to construct a complete knowledge graph (KG) by matching the same entities in multi-source KGs. Existing researches on entity alignment mainly focuses on static multi-relational data in knowledge graphs. However, the relationships or attributes between entities often possess temporal characteristics as well. Neglecting these temporal characteristics can frequently lead to alignment errors. Compared to studying entity alignment in temporal knowledge graphs, there are relatively few efforts on entity alignment in cross-lingual temporal knowledge graphs. Therefore, in this paper, we put forward an entity alignment method for cross-lingual temporal knowledge graphs, namely CTEA. Based on GCN and TransE, CTEA combines entity embeddings, relation embeddings and attribute embeddings to design a joint embedding model, which is more conducive to generating transferable entity embedding. In the meantime, the distance calculation between elements and the similarity calculation of entity pairs are combined to enhance the reliability of cross-lingual entity alignment. Experiments shows that the proposed CTEA model improves Hits@m and MRR by about 0.8∼2.4 percentage points compared with the latest methods.


Subject(s)
Knowledge , Pattern Recognition, Automated , Reproducibility of Results
5.
Sensors (Basel) ; 23(16)2023 Aug 10.
Article in English | MEDLINE | ID: mdl-37631633

ABSTRACT

Cross-lingual entity alignment in knowledge graphs is a crucial task in knowledge fusion. This task involves learning low-dimensional embeddings for nodes in different knowledge graphs and identifying equivalent entities across them by measuring the distances between their representation vectors. Existing alignment models use neural network modules and the nearest neighbors algorithm to find suitable entity pairs. However, these models often ignore the importance of local structural features of entities during the alignment stage, which may lead to reduced matching accuracy. Specifically, nodes that are poorly represented may not benefit from their surrounding context. In this article, we propose a novel alignment model called SSR, which leverages the node embedding algorithm in graphs to select candidate entities and then rearranges them by local structural similarity in the source and target knowledge graphs. Our approach improves the performance of existing approaches and is compatible with them. We demonstrate the effectiveness of our approach on the DBP15k dataset, showing that it outperforms existing methods while requiring less time.

6.
Neural Netw ; 162: 83-98, 2023 May.
Article in English | MEDLINE | ID: mdl-36893693

ABSTRACT

Entity alignment refers to matching entities with the same realistic meaning in different knowledge graphs. The structure of a knowledge graph provides the global signal for entity alignment. But in the real world, a knowledge graph provides insufficient structural information in general. Moreover, the problem of knowledge graph heterogeneity is common. The semantic and string information can alleviate the problems caused by the sparse and heterogeneous nature of knowledge graphs, yet both of them have not been fully utilized by most existing work. Therefore, we propose an entity alignment model based on multiple information (EAMI), which employs structural, semantic and string information. EAMI learns the structural representation of a knowledge graph by using multi-layer graph convolutional networks. To acquire more accurate entity vector representation, we incorporate the attribute semantic representation into the structural representation. In addition, to further improve entity alignment, we study the entity name string information. There is no training required to calculate the similarity of entity names. Our model is tested on publicly available cross-lingual datasets and cross-resource datasets, and the experimental results demonstrate the effectiveness of our model.


Subject(s)
Knowledge , Pattern Recognition, Automated , Knowledge Bases , Learning , Semantics
7.
Methods ; 205: 133-139, 2022 09.
Article in English | MEDLINE | ID: mdl-35798258

ABSTRACT

Entity alignment aims at associating semantically similar entities in knowledge graphs from different sources. It is widely used in the integration and construction of professional medical knowledge. The existing deep learning methods lack term-level embedding representation, which limits the performance of entity alignment and causes a massive computational overhead. To address these problems, we propose a Siamese-based BERT (SiBERT) for Chinese medical entities alignment. SiBERT generates term-level embedding based on word embedding sequences to enhance the features of entities in similarity calculation. The process of entity alignment contains three steps. Specifically, the SiBERT is firstly pre-trained with synonym dictionary in the public domain, and transferred to the task of medical entity alignment. Secondly, four different categories of entities (disease, symptom, treatment, and examination) are labeled based on the standard terms selected from standard terms dataset. The entities and their standard terms form term pairs to train SiBERT. Finally, combined with the entity alignment algorithm, the most similar standard term is selected as the final result. To evaluate the effectiveness of our method, we conduct extensive experiments on real-world datasets. The experimental results illustrate that SiBERT network is superior to other compared algorithms both in alignment accuracy and computational efficiency.


Subject(s)
Algorithms , Deep Learning , China , Electronic Health Records , Semantics , Vocabulary, Controlled
8.
J Biomed Inform ; 113: 103628, 2021 01.
Article in English | MEDLINE | ID: mdl-33232839

ABSTRACT

Enriching terminology base (TB) is an important and continuous process, since formal term can be renamed and new term alias emerges all the time. As a potential supplementary for TB enrichment, electronic health record (EHR) is a fundamental source for clinical research and practise. The task to align the set of external terms in EHRs to TB can be regarded as entity alignment without structure information. Conventional approaches mainly use internal structural information of multiple knowledge bases (KBs) to map entities and their counterparts among KBs. However, the external terms in EHRs are independent clinical terms, which lack of interrelations. To achieve entity alignment in this case, we proposed a novel automatic TB enrichment approach, named semantic & structure embeddings-based relevancy prediction (S2ERP). To obtain the semantic embedding of external terms, we fed them with formal entity into a pre-trained language model. Meanwhile, a graph convolutional network was used to obtain the structure embeddings of the synonyms and hyponyms in TB. Afterwards, S2ERP combines both embeddings to measure the relevancy. Experimental results on clinical indicator TB, collected from 38 top-class hospitals of Shanghai Hospital Development Center, showed that the proposed approach outperforms baseline methods by 14.16% in Hits@1.


Subject(s)
Electronic Health Records , Knowledge Bases , China , Natural Language Processing , Semantics
9.
BMC Med Inform Decis Mak ; 20(Suppl 14): 331, 2020 12 15.
Article in English | MEDLINE | ID: mdl-33323114

ABSTRACT

BACKGROUND: Laboratory indicator test results in electronic health records have been applied to many clinical big data analysis. However, it is quite common that the same laboratory examination item (i.e., lab indicator) is presented using different names in Chinese due to the translation problem and the habit problem of various hospitals, which results in distortion of analysis results. METHODS: A framework with a recall model and a binary classification model is proposed, which could reduce the alignment scale and improve the accuracy of lab indicator normalization. To reduce alignment scale, tf-idf is used for candidate selection. To assure the accuracy of output, we utilize enhanced sequential inference model for binary classification. And active learning is applied with a selection strategy which is proposed for reducing annotation cost. RESULTS: Since our indicator standardization method mainly focuses on Chinese indicator inconsistency, we perform our experiment on Shanghai Hospital Development Center and select clinical data from 8 hospitals. The method achieves a F1-score 92.08[Formula: see text] in our final binary classification. As for active learning, the new strategy proposed performs better than random baseline and could outperform the result trained on full data with only 43[Formula: see text] training data. A case study on heart failure clinic analysis conducted on the sub-dataset collected from SHDC shows that our proposed method is practical in the application with good performance. CONCLUSION: This work demonstrates that the structure we proposed can be effectively applied to lab indicator normalization. And active learning is also suitable for this task for cost reduction. Such a method is also valuable in data cleaning, data mining, text extracting and entity alignment.


Subject(s)
Electronic Health Records , Heart Failure , China , Delivery of Health Care , Heart Failure/diagnosis , Humans , Reference Standards
SELECTION OF CITATIONS
SEARCH DETAIL