Fine-Grained Image Classification Based on Cross-Attention Network
Zhiwen Zheng,
Juxiang Zhou,
Jianhou Gan,
Sen Luo and
Wei Gao
Additional contact information
Zhiwen Zheng: Yunnan Normal University, China
Juxiang Zhou: Yunnan Normal University, China
Jianhou Gan: Yunnan Normal University, China
Sen Luo: Yunnan Normal University, China
Wei Gao: Yunnan Normal University, China
International Journal on Semantic Web and Information Systems (IJSWIS), 2022, vol. 18, issue 1, 1-12
Abstract:
Due to the high similarity of fine-grained image subclasses, small inter-class changes and large intra-class changes are caused, which leads to the difficulty of fine-grained image classification task. However, existing convolutional neural networks have been unable to effectively solve this problem. Aiming at the above-mentioned fine-grained image classification problem, this paper proposes a multi-scale and multi-level ViT model. First, through data augmentation techniques, the accuracy of fine-grained image classification can be effectively improved. Secondly, the small-scale input and large-scale input of the model make the input image have more feature ex-pressions. The subsequent multi-layeredness effectively utilizes the results of the previous layer of ViT, so that the data of the previous layer can be more effectively used in the next layer of ViT. Finally, cross-attention allows the results of two scale inputs to be fused in a reasonable way. The proposed model is competitive with current mainstream state-of-the-art methods on multiple datasets.
Date: 2022
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
http://services.igi-global.com/resolvedoi/resolve.aspx?doi=10.4018/IJSWIS.315747 (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:igg:jswis0:v:18:y:2022:i:1:p:1-12
Access Statistics for this article
International Journal on Semantic Web and Information Systems (IJSWIS) is currently edited by Brij Gupta
More articles in International Journal on Semantic Web and Information Systems (IJSWIS) from IGI Global
Bibliographic data for series maintained by Journal Editor ().