Detection of River Floating Garbage Based on Improved YOLOv5
Xingshuai Yang,
Jingyi Zhao,
Li Zhao,
Haiyang Zhang,
Li Li,
Zhanlin Ji and
Ivan Ganchev
Additional contact information
Xingshuai Yang: College of Artificial Intelligence, North China University of Science and Technology, Tangshan 063210, China
Jingyi Zhao: College of Artificial Intelligence, North China University of Science and Technology, Tangshan 063210, China
Li Zhao: Research Institute of Information Technology, Tsinghua University, Beijing 100080, China
Haiyang Zhang: Department of Computing, Xi’an Jiaotong-Liverpool University, Suzhou 215000, China
Li Li: College of Artificial Intelligence, North China University of Science and Technology, Tangshan 063210, China
Zhanlin Ji: College of Artificial Intelligence, North China University of Science and Technology, Tangshan 063210, China
Ivan Ganchev: Telecommunications Research Centre (TRC), University of Limerick, V94 T9PX Limerick, Ireland
Mathematics, 2022, vol. 10, issue 22, 1-20
Abstract:
The random dumping of garbage in rivers has led to the continuous deterioration of water quality and affected people’s living environment. The accuracy of detecting garbage floating in rivers is strongly affected by factors such as floating speed, night/daytime natural light, and viewing angle and position. This paper proposes a novel detection model, called YOLOv5_CBS, for the detection of garbage objects floating in rivers, based on improvements to the YOLOv5 model. Firstly, a coordinate attention (CA) mechanism is added to the original C3 module (without compressing the number of channels in the bottleneck), forming a new C3-CA-Uncompress Bottleneck (CCUB) module that enlarges the receptive field and allows the model to pay more attention to important parts of the processed images. Then, the Path Aggregation Network (PAN) in YOLOv5 is replaced with a Bidirectional Feature Pyramid Network (BiFPN), as proposed by other researchers, to enhance the depth of information mining and improve the feature extraction capability and detection performance of the model. In addition, the Complete Intersection over Union (CIoU) loss function, which was originally used in YOLOv5 to calculate the location score of the compound loss, is replaced with the SCYLLA-IoU (SIoU) loss function, so as to speed up model convergence and improve regression precision. The results of experiments conducted on two datasets demonstrate that the proposed YOLOv5_CBS model outperforms the original YOLOv5 model, along with three other state-of-the-art models (Faster R-CNN, YOLOv3, and YOLOv4), when used for detecting garbage objects floating in rivers, in terms of recall, average precision, and F1 score, reaching respective values of 0.885, 90.85%, and 0.8669 on the private dataset, and 0.865, 92.18%, and 0.9006 on the Flow-Img public dataset.
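The CIoU and SIoU losses named in the abstract both extend the plain Intersection over Union (IoU) between a predicted and a ground-truth box, and the reported F1 score is the harmonic mean of precision and recall. The following is a minimal Python sketch of those two base quantities only, not the authors' implementation: it does not include the extra penalty terms of CIoU or SIoU, and the function names `iou` and `f1_score` as well as the `(x1, y1, x2, y2)` box layout are assumptions made for illustration.

```python
def iou(box_a, box_b):
    """Plain IoU of two axis-aligned boxes given as (x1, y1, x2, y2).

    Assumed box layout; this is the base term only, without the
    additional penalties that CIoU or SIoU add on top of it.
    """
    # Corners of the intersection rectangle
    ix1 = max(box_a[0], box_b[0])
    iy1 = max(box_a[1], box_b[1])
    ix2 = min(box_a[2], box_b[2])
    iy2 = min(box_a[3], box_b[3])

    # Clamp to zero when the boxes do not overlap
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)

    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter

    return inter / union if union > 0 else 0.0


def f1_score(precision, recall):
    """Harmonic mean of precision and recall (0.0 if both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)
```

For example, two 2x2 boxes offset by one unit in each direction overlap in a 1x1 square, so `iou((0, 0, 2, 2), (1, 1, 3, 3))` evaluates to 1/7.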
Keywords: computer vision; object detection; YOLOv5; coordinate attention; Bidirectional Feature Pyramid Network (BiFPN); SCYLLA-IoU (SIoU) loss
JEL-codes: C
Date: 2022
Citations: 2 (in EconPapers)
Downloads:
https://www.mdpi.com/2227-7390/10/22/4366/pdf (application/pdf)
https://www.mdpi.com/2227-7390/10/22/4366/ (text/html)
Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:10:y:2022:i:22:p:4366-:d:978440
Mathematics is currently edited by Ms. Emma He
Bibliographic data for series maintained by MDPI Indexing Manager.