Graph Convolutional Multi-Label Hashing for Cross-Modal Retrieval.

Shen, Xiaobo; Chen, Yinfan; Liu, Weiwei; Zheng, Yuhui; Sun, Quan-Sen; Pan, Shirui

IEEE Trans Neural Netw Learn Syst; PP, 2024 Jul 19.

Article in English | MEDLINE | ID: mdl-39028597

ABSTRACT

Cross-modal hashing encodes the different modalities of multimodal data into a low-dimensional Hamming space for fast cross-modal retrieval. In multi-label cross-modal retrieval, multimodal data are often annotated with multiple labels, and some labels, e.g., "ocean" and "cloud", often co-occur. However, existing cross-modal hashing methods overlook label dependency, which is crucial for improving performance. To fill this gap, this article proposes graph convolutional multi-label hashing (GCMLH) for effective multi-label cross-modal retrieval. Specifically, GCMLH first generates a word embedding of each label and develops a label encoder to learn highly correlated label embeddings via a graph convolutional network (GCN). In addition, GCMLH develops a feature encoder for each modality and a feature fusion module to generate highly semantic features via GCN. GCMLH uses a teacher-student learning scheme to transfer knowledge from the teacher modules, i.e., the label encoder and the feature fusion module, to the student module, i.e., the feature encoder, such that the learned hash codes can well exploit multi-label dependency and multimodal semantic structure. Extensive empirical results on several benchmarks demonstrate the superiority of the proposed method over existing state-of-the-art methods.
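
The two generic building blocks named in the abstract, graph convolution over a label co-occurrence graph and retrieval by Hamming distance, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify GCMLH's architecture, loss functions, or teacher-student training, and none of that is reproduced here. All function names, the toy three-label graph, the identity weight matrix, and the example codes are assumptions chosen for illustration; the normalization shown is the symmetric form commonly used in GCNs.

```python
import math

def normalize_adjacency(adj):
    """Symmetric normalization D^{-1/2} (A + I) D^{-1/2}, a common GCN choice (assumed here)."""
    n = len(adj)
    a_hat = [[adj[i][j] + (1.0 if i == j else 0.0) for j in range(n)] for i in range(n)]
    deg = [sum(row) for row in a_hat]
    return [[a_hat[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)] for i in range(n)]

def matmul(a, b):
    """Plain-Python matrix product of two lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def gcn_layer(adj_norm, feats, weight):
    """One propagation step: ReLU(A_norm @ H @ W)."""
    out = matmul(matmul(adj_norm, feats), weight)
    return [[max(0.0, v) for v in row] for row in out]

def to_hash_code(vec):
    """Binarize a real-valued vector by sign: 1 if v >= 0, else 0."""
    return [1 if v >= 0 else 0 for v in vec]

def hamming(a, b):
    """Number of positions at which two binary codes differ."""
    return sum(x != y for x, y in zip(a, b))

def rank_by_hamming(query, database):
    """Database indices sorted by increasing Hamming distance to the query."""
    return sorted(range(len(database)), key=lambda i: hamming(query, database[i]))

# Hypothetical toy graph: labels 0 ("ocean") and 1 ("cloud") co-occur; label 2 is isolated.
adj = [[0, 1, 0],
       [1, 0, 0],
       [0, 0, 0]]
label_feats = [[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]]  # stand-in word embeddings
identity_w = [[1.0, 0.0], [0.0, 1.0]]                 # stand-in for a learned weight matrix

label_emb = gcn_layer(normalize_adjacency(adj), label_feats, identity_w)

# Hypothetical 4-bit codes: one image query against three text items.
query_code = [1, 0, 1, 1]
text_codes = [[1, 0, 1, 1], [0, 1, 0, 0], [1, 0, 0, 1]]
ranking = rank_by_hamming(query_code, text_codes)
```

In this toy setup, the propagation step averages the embeddings of the two co-occurring labels, so their rows in `label_emb` coincide, which is the sense in which a GCN can inject label dependency. Ranking by Hamming distance needs only bitwise comparisons, which is why hash codes make cross-modal retrieval fast.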

Full text: 1 | Collection: 01-international | Database: MEDLINE | Language: English | Journal: IEEE Trans Neural Netw Learn Syst | Year: 2024 | Document type: Article