Masked Feature Compression for Object Detection

Dai, Chengjie; Song, Tiantian; Jin, Yuxuan; Ren, Yixiang; Yang, Bowei; Song, Guanghua

Masked Feature Compression for Object Detection

Chengjie Dai, Tiantian Song, Yuxuan Jin, Yixiang Ren, Bowei Yang and Guanghua Song ()
Additional contact information
Chengjie Dai: The School of Aeronautics and Astronautics, Zhejiang University, Hangzhou 310027, China
Tiantian Song: The Department of Mathematics, The University of Manchester, Manchester M13 9PL, UK
Yuxuan Jin: The School of Aeronautics and Astronautics, Zhejiang University, Hangzhou 310027, China
Yixiang Ren: The School of Aeronautics and Astronautics, Zhejiang University, Hangzhou 310027, China
Bowei Yang: The School of Aeronautics and Astronautics, Zhejiang University, Hangzhou 310027, China
Guanghua Song: The School of Aeronautics and Astronautics, Zhejiang University, Hangzhou 310027, China

Mathematics, 2024, vol. 12, issue 12, 1-20

Abstract: Deploying high-accuracy detection models on lightweight edge devices (e.g., drones) is challenging due to hardware constraints. To achieve satisfactory detection results, a common solution is to compress and transmit the images to a cloud server where powerful models can be used. However, the image compression process for transmission may lead to a reduction in detection accuracy. In this paper, we propose a feature compression method tailored for object detection tasks, and it can be easily integrated with existing learned image compression models. In the method, the encoding process consists of two steps. Firstly, we use a feature extractor to obtain the low-level feature, and then use a mask generator to obtain an object mask to select regions containing objects. Secondly, we use a neural network encoder to compress the masked feature. As for decoding, a neural network decoder is used to restore the compressed representation into the feature that can be directly inputted into the object detection model. The experimental results demonstrate that our method surpasses existing compression techniques. Specifically, when compared to one of the leading methods—TCM2023—our approach achieves a 25.3% reduction in compressed file size and a 6.9% increase in mAP0.5.

Keywords: image compression; feature compression; object detection (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2024
References: View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2227-7390/12/12/1848/pdf (application/pdf)
https://www.mdpi.com/2227-7390/12/12/1848/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:12:y:2024:i:12:p:1848-:d:1414494

Access Statistics for this article

Mathematics is currently edited by Ms. Emma He

More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().