Improving Oriented Object Detection by Scene Classification and Task-Aligned Focal Loss
Xiaoliang Qian,
Shaoguan Gao,
Wei Deng and
Wei Wang
Additional contact information
Xiaoliang Qian: College of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, China
Shaoguan Gao: College of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, China
Wei Deng: College of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, China
Wei Wang: College of Electrical and Information Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, China
Mathematics, 2024, vol. 12, issue 9, 1-18
Abstract:
Oriented object detection (OOD) can precisely detect objects of arbitrary orientation in remote sensing images (RSIs). To date, two-stage OOD methods have attracted more attention because of their high detection accuracy. However, two-stage methods rely only on the features of each proposal for object recognition, which leads to misclassification because of the intra-class diversity, inter-class similarity and cluttered backgrounds in RSIs. To address this problem, an OOD model that incorporates scene classification is proposed. Because each foreground object has a strong contextual relationship with the scene of the RSI, a scene classification branch is added to the baseline OOD model, and the scene classification result of the input RSI is used to exclude impossible categories. To focus on hard instances and enhance the consistency between classification and regression, a task-aligned focal loss (TFL) that combines the classification difficulty with the regression loss is proposed; TFL assigns larger weights to hard instances and optimizes the classification and regression branches simultaneously. The ablation study demonstrates the effectiveness of the scene classification branch, TFL and their combination. Comparisons with 15 OOD methods on the DOTA dataset and 14 OOD methods on the DIOR-R dataset validate the superiority of the proposed method.
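The two ideas in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract gives neither the scene-to-category rules nor the TFL formula, so the table `SCENE_TO_CATEGORIES`, the function names, and the weighting form `(1 - p)^gamma * (classification loss + regression loss)` are all assumptions chosen only to show the general shape of scene-based category exclusion and a difficulty-weighted joint loss.

```python
import math

# Hypothetical compatibility table: scene name -> categories that may appear.
# These pairings are illustrative assumptions, not taken from the paper.
SCENE_TO_CATEGORIES = {
    "harbor": {"ship", "storage-tank"},
    "airport": {"plane", "storage-tank"},
}


def filter_by_scene(class_probs, scene):
    """Zero out categories the predicted scene rules out, then renormalize."""
    allowed = SCENE_TO_CATEGORIES[scene]
    kept = {c: (p if c in allowed else 0.0) for c, p in class_probs.items()}
    total = sum(kept.values())
    if total == 0.0:
        # No compatible category survived; fall back to the unfiltered scores.
        return dict(class_probs)
    return {c: p / total for c, p in kept.items()}


def focal_weighted_loss(p_true, reg_loss, gamma=2.0):
    """Assumed form of a difficulty-weighted joint loss.

    The focal weight (1 - p_true)^gamma grows as the classifier becomes less
    confident in the true class, and it scales both the cross-entropy term
    and the regression loss, so hard instances dominate both tasks.
    """
    p_true = min(max(p_true, 1e-7), 1.0 - 1e-7)  # clamp for a stable log
    weight = (1.0 - p_true) ** gamma
    cls_loss = -math.log(p_true)
    return weight * (cls_loss + reg_loss)
```

For example, calling `filter_by_scene({"ship": 0.4, "plane": 0.5, "storage-tank": 0.1}, "harbor")` removes the "plane" score and renormalizes the remaining two, and `focal_weighted_loss` returns a larger value for a proposal whose true-class probability is 0.2 than for one at 0.9 with the same regression loss.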
Keywords: oriented object detection; remote sensing image; scene classification branch; task-aligned focal loss
JEL-codes: C
Date: 2024
Downloads: (external link)
https://www.mdpi.com/2227-7390/12/9/1343/pdf (application/pdf)
https://www.mdpi.com/2227-7390/12/9/1343/ (text/html)
Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:12:y:2024:i:9:p:1343-:d:1385019
Mathematics is currently edited by Ms. Emma He
Bibliographic data for series maintained by MDPI Indexing Manager.