Aerial small target detection algorithm based on cross-scale separated attention
Ju Liang,
Fan Wang,
Jia Chen,
Hai-Yan Huang and
Zu-Fan Dou
PLOS ONE, 2025, vol. 20, issue 11, 1-26
Abstract:
In UAV aerial photography scenarios, targets exhibit characteristics such as multi-scale distribution, a high proportion of small targets, complex occlusions, and strong background interference. These characteristics impose high demands on detection algorithms in terms of fine-grained feature extraction, cross-scale fusion capability, and occlusion resistance.The YOLOv11s model has significant limitations in practical applications: its feature extraction module has a single semantic representation, the traditional feature pyramid network has limited capability to detect multi-scale targets, and it lacks an effective feature compensation mechanism when targets are occluded.To address these issues, we propose a UAV aerial small target detection algorithm named UAS-YOLO (Universal Inverted Bottleneck with Adaptive BiFPN and Separated and Enhancement Attention module YOLO), which incorporates three key optimizations. First, an Adaptive Bidirectional Feature Pyramid Network (ABiFPN) is designed as the Neck structure. Through cross-scale connections and dynamic weighted fusion, ABiFPN adjusts weight allocation based on target scale characteristics, focusing on enhancing feature integration for scales related to small targets and improving multi-scale feature representation capability. Second, a Separated and Enhancement Attention Module (SEAM) is introduced to replace the original SPPF module. This module focuses on key target regions, enhances effective feature responses in unoccluded areas, and specifically compensates for information loss in occluded regions, thereby improving the detection stability of occluded small targets. Third, a Universal Inverted Bottleneck (UIB) structure is proposed, which is fused with the C3K2 module to form the C3K2_UIB module. By leveraging dynamic channel attention and spatial feature recalibration, C3K2_UIB suppresses background noise; although this increases parameters by 34%, it achieves improved detection accuracy through efficient feature selection, striking a balance between accuracy and complexity.Experimental results show that on the VisDrone2019 dataset and the TinyPerson dataset from Kaggle, the mean Average Precision (mAP) of the algorithm is increased by 4.9 and 2.1 percentage points, respectively. Moreover, it demonstrates greater advantages compared to existing advanced algorithms, effectively addressing the challenge of small target detection in complex UAV scenarios.
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0337318 (text/html)
https://journals.plos.org/plosone/article/file?id= ... 37318&type=printable (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:plo:pone00:0337318
DOI: 10.1371/journal.pone.0337318
Access Statistics for this article
More articles in PLOS ONE from Public Library of Science
Bibliographic data for series maintained by plosone ().