A Kind of Water Surface Multi-Scale Object Detection Method Based on Improved YOLOv5 Network
Zhongli Ma,
Yi Wan,
Jiajia Liu (),
Ruojin An and
Lili Wu
Additional contact information
Zhongli Ma: College of Automation, Chengdu University of Information Technology, Chengdu 610103, China
Yi Wan: College of Automation, Chengdu University of Information Technology, Chengdu 610103, China
Jiajia Liu: College of Automation, Chengdu University of Information Technology, Chengdu 610103, China
Ruojin An: College of Automation, Chengdu University of Information Technology, Chengdu 610103, China
Lili Wu: College of Automation, Chengdu University of Information Technology, Chengdu 610103, China
Mathematics, 2023, vol. 11, issue 13, 1-18
Abstract:
Visual-based object detection systems are essential components of intelligent equipment for water surface environments. The diversity of water surface target types, uneven distribution of sizes, and difficulties in dataset construction pose significant challenges for water surface object detection. This article proposes an improved YOLOv5 target detection method to address the characteristics of diverse types, large quantities, and multiple scales of actual water surface targets. The improved YOLOv5 model optimizes the extraction of bounding boxes using K-means++ to obtain a broader distribution of predefined bounding boxes, thereby enhancing the detection accuracy for multi-scale targets. We introduce the GAMAttention mechanism into the backbone network of the model to alleviate the significant performance difference between large and small targets caused by their multi-scale nature. The spatial pyramid pooling module in the backbone network is replaced to enhance the perception ability of the model in segmenting targets of different scales. Finally, the Focal loss classification loss function is incorporated to address the issues of overfitting and poor accuracy caused by imbalanced class distribution in the training data. We conduct comparative tests on a self-constructed dataset comprising ten categories of water surface targets using four algorithms: Faster R-CNN, YOLOv4, YOLOv5, and the proposed improved YOLOv5. The experimental results demonstrate that the improved model achieves the best detection accuracy, with an 8% improvement in mAP@0.5 compared to the original YOLOv5 in multi-scale water surface object detection.
Keywords: surface target detection; YOLOv5; multi-scale targets; spatial pyramid pooling; attention mechanism (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2023
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2227-7390/11/13/2936/pdf (application/pdf)
https://www.mdpi.com/2227-7390/11/13/2936/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:11:y:2023:i:13:p:2936-:d:1183715
Access Statistics for this article
Mathematics is currently edited by Ms. Emma He
More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().