EconPapers    
Economics at your fingertips  
 

DSCEH: Dual-Stream Correlation-Enhanced Deep Hashing for Image Retrieval

Yulin Yang, Huizhen Chen, Rongkai Liu, Shuning Liu, Yu Zhan, Chao Hu () and Ronghua Shi
Additional contact information
Yulin Yang: School of Computer Science and Engineering, Central South University, Changsha 410083, China
Huizhen Chen: School of Computer Science and Engineering, Central South University, Changsha 410083, China
Rongkai Liu: School of Computer Science and Engineering, Central South University, Changsha 410083, China
Shuning Liu: School of Computer Science and Engineering, Central South University, Changsha 410083, China
Yu Zhan: China Telecom, Changsha 410083, China
Chao Hu: School of Electronic Information, Central South University, Changsha 410083, China
Ronghua Shi: School of Electronic Information, Central South University, Changsha 410083, China

Mathematics, 2024, vol. 12, issue 14, 1-16

Abstract: Deep Hashing is widely used for large-scale image-retrieval tasks to speed up the retrieval process. Current deep hashing methods are mainly based on the Convolutional Neural Network (CNN) or Vision Transformer (VIT). They only use the local or global features for low-dimensional mapping and only use the similarity loss function to optimize the correlation between pairwise or triplet images. Therefore, the effectiveness of deep hashing methods is limited. In this paper, we propose a dual-stream correlation-enhanced deep hashing framework (DSCEH), which uses the local and global features of the image for low-dimensional mapping and optimizes the correlation of images from the model architecture. DSCEH consists of two main steps: model training and deep-hash-based retrieval. During the training phase, a dual-network structure comprising CNN and VIT is employed for feature extraction. Subsequently, feature fusion is achieved through a concatenation operation, followed by similarity evaluation based on the class token acquired from VIT to establish edge relationships. The Graph Convolutional Network is then utilized to enhance correlation optimization between images, resulting in the generation of high-quality hash codes. This stage facilitates the development of an optimized hash model for image retrieval. In the retrieval stage, all images within the database and the to-be-retrieved images are initially mapped to hash codes using the aforementioned hash model. The retrieval results are subsequently determined based on the Hamming distance between the hash codes. We conduct experiments on three datasets: CIFAR-10, MSCOCO, and NUSWIDE. Experimental results show the superior performance of DSCEH, which helps with fast and accurate image retrieval.

Keywords: deephashing; convolutional neural network; vision transformer; graph convolutional network (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2024
References: View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2227-7390/12/14/2221/pdf (application/pdf)
https://www.mdpi.com/2227-7390/12/14/2221/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:12:y:2024:i:14:p:2221-:d:1436306

Access Statistics for this article

Mathematics is currently edited by Ms. Emma He

More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jmathe:v:12:y:2024:i:14:p:2221-:d:1436306