Sequential Cross-Modal Hashing Learning via Multi-scale Correlation Mining

Published:2020-01-10 Issue:4 Volume:15 Page:1-20
ISSN:1551-6857
Container-title:ACM Transactions on Multimedia Computing, Communications, and Applications
language:en
Short-container-title:ACM Trans. Multimedia Comput. Commun. Appl.

Author:

Ye Zhaoda¹, Peng Yuxin¹

Affiliation:

1. Peking University, Beijing, China

Abstract

Cross-modal hashing aims to map heterogeneous multimedia data into a common Hamming space through hash functions, achieving fast and flexible cross-modal retrieval. Most existing cross-modal hashing methods learn hash functions by mining the correlations among multimedia data, but ignore an important property of such data: each modality has features of different scales, such as texture, object, and scene features in an image, which can provide complementary information for boosting the retrieval task. The correlations among multi-scale features are more abundant than the correlations between single features of multimedia data; they reveal finer underlying structures of the data and can be used for effective hash function learning. Therefore, we propose the Multi-scale Correlation Sequential Cross-modal Hashing (MCSCH) approach, whose main contributions can be summarized as follows: (1) A multi-scale feature guided sequential hashing learning method is proposed to share the information from features of different scales through an RNN-based network and generate the hash codes sequentially. The features of different scales are used to guide hash code generation, which can enhance the diversity of the hash codes and weaken the influence of errors in specific features, such as false object features caused by occlusion. (2) A multi-scale correlation mining strategy is proposed to align the features of different scales in different modalities and mine the correlations among the aligned features. These correlations reveal the finer underlying structure of multimedia data and can help to boost hash function learning. (3) A correlation evaluation network evaluates the importance of the correlations to select the worthwhile ones, and increases their impact on hash function learning.
Experiments on two widely-used 2-media datasets and a 5-media dataset demonstrate the effectiveness of our proposed MCSCH approach.
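
As a rough illustration of the retrieval setting the abstract describes (not the MCSCH method itself), the sketch below maps features from two modalities into a common Hamming space and ranks database items by Hamming distance to a query. Everything in it is an assumption made for illustration: the random-hyperplane functions stand in for learned hash functions, and the names `make_hash_function`, `hamming_distance`, and `retrieve`, the feature dimensions, and the 16-bit code length are all hypothetical.

```python
import random


def make_hash_function(input_dim, n_bits, seed):
    """Hypothetical stand-in for a learned hash function.

    Uses random hyperplanes followed by a sign test. A real cross-modal
    method would learn these projections from correlated training data.
    """
    rng = random.Random(seed)
    planes = [[rng.gauss(0.0, 1.0) for _ in range(input_dim)]
              for _ in range(n_bits)]

    def hash_fn(features):
        # One bit per hyperplane: 1 if the projection is non-negative, else 0.
        return tuple(
            1 if sum(w * x for w, x in zip(plane, features)) >= 0 else 0
            for plane in planes
        )

    return hash_fn


def hamming_distance(code_a, code_b):
    """Number of bit positions in which two equal-length codes differ."""
    return sum(a != b for a, b in zip(code_a, code_b))


def retrieve(query_code, database_codes, top_k=3):
    """Indices of the top_k database codes nearest to the query code."""
    ranked = sorted(
        range(len(database_codes)),
        key=lambda i: hamming_distance(query_code, database_codes[i]),
    )
    return ranked[:top_k]


# Toy setup: 8-dimensional "image" features and 5-dimensional "text" features
# are hashed by separate functions into the same 16-bit Hamming space.
N_BITS = 16
image_hash = make_hash_function(input_dim=8, n_bits=N_BITS, seed=0)
text_hash = make_hash_function(input_dim=5, n_bits=N_BITS, seed=1)

rng = random.Random(42)
image_features = [[rng.gauss(0.0, 1.0) for _ in range(8)] for _ in range(10)]
text_query = [rng.gauss(0.0, 1.0) for _ in range(5)]

image_codes = [image_hash(f) for f in image_features]
query_code = text_hash(text_query)

# Text-to-image retrieval: rank image codes by distance to the text code.
top = retrieve(query_code, image_codes, top_k=3)
print(top)
```

Because the two hash functions here are random and unrelated, the ranking carries no semantic meaning; the sketch only shows the mechanics of comparing codes from different modalities in one Hamming space, which is what makes retrieval fast once suitable hash functions have been learned.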

Funder

National Natural Science Foundation of China

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications, Hardware and Architecture

Link

https://dl.acm.org/doi/pdf/10.1145/3356338

References: 52 articles.

1. Mixed image-keyword query adaptive hashing over multilabel images;Liu Xianglong;ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM),2014

2. Hashing with Angular Reconstructive Embeddings

3. Image retrieval with query-adaptive hashing;Liu Dong;ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM),2013

4. Bit-Scalable Deep Hashing With Regularized Similarity Learning for Image Retrieval and Person Re-Identification

5. Iterative Quantization: A Procrustean Approach to Learning Binary Codes for Large-Scale Image Retrieval

Cited by 16 articles.

1. Semantic-alignment transformer and adversary hashing for cross-modal retrieval;Applied Intelligence;2024-06-10

2. Robust Image Hashing via CP Decomposition and DCT for Copy Detection;ACM Transactions on Multimedia Computing, Communications, and Applications;2024-04-25

3. Deep Neighborhood-aware Proxy Hashing with Uniform Distribution Constraint for Cross-modal Retrieval;ACM Transactions on Multimedia Computing, Communications, and Applications;2024-03-08

4. Supervised Hierarchical Online Hashing for Cross-modal Retrieval;ACM Transactions on Multimedia Computing, Communications, and Applications;2024-01-11

5. Supervised Consensus Anchor Graph Hashing for Cross Modal Retrieval;IEEE Access;2024