Contrastive Transformer Hashing for Compact Video Representation.

Shen, Xiaobo; Zhou, Yue; Yuan, Yun-Hao; Yang, Xichen; Lan, Long; Zheng, Yuhui

Shen, Xiaobo; Zhou, Yue; Yuan, Yun-Hao; Yang, Xichen; Lan, Long; Zheng, Yuhui.

IEEE Trans Image Process ; 32: 5992-6003, 2023.

Article em En | MEDLINE | ID: mdl-37903046

ABSTRACT

ABSTRACT

Video hashing learns compact representation by mapping video into low-dimensional Hamming space and has achieved promising performance in large-scale video retrieval. It is challenging to effectively exploit temporal and spatial structure in an unsupervised setting. To fulfill this gap, this paper proposes Contrastive Transformer Hashing (CTH) for effective video retrieval. Specifically, CTH develops a bidirectional transformer autoencoder, based on which visual reconstruction loss is proposed. CTH is more powerful to capture bidirectional correlations among frames than conventional unidirectional models. In addition, CTH devises multi-modality contrastive loss to reveal intrinsic structure among videos. CTH constructs inter-modality and intra-modality triplet sets and proposes multi-modality contrastive loss to exploit inter-modality and intra-modality similarities simultaneously. We perform video retrieval tasks on four benchmark datasets, i.e., UCF101, HMDB51, SVW30, FCVID using the learned compact hash representation, and extensive empirical results demonstrate the proposed CTH outperforms several state-of-the-art video hashing methods.

Texto completo

Adicionar na Minha BVS

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Revista: IEEE Trans Image Process Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2023 Tipo de documento: Article

Texto completo

Adicionar na Minha BVS

Imprimir

XML

PubMed Links

Buscar no Google