Research on Multi-Modal Pedestrian Detection and Tracking Algorithm Based on Deep Learning

Zhao, Rui; Hao, Jutao; Huo, Huan

Rui Zhao, Jutao Hao and Huan Huo
Additional contact information
Rui Zhao: Faculty of Engineering and IT, University of Technology Sydney, Ultimo 2007, Australia
Jutao Hao: School of Electric Information Engineering, Shanghai Dianji University, Shuihua Rd., Shanghai 201306, China
Huan Huo: Faculty of Engineering and IT, University of Technology Sydney, Ultimo 2007, Australia

Future Internet, 2024, vol. 16, issue 6, 1-14

Abstract: Pedestrian detection for intelligent transportation has advanced significantly, but detecting pedestrians under complex lighting remains difficult. Conventional visible-light imaging depends strongly on lighting conditions: in good daytime lighting it supports strong detection results, whereas in low light it captures too little information about pedestrian targets, and detection performance drops markedly. Infrared imaging is a valuable supplement in this setting because it provides additional pedestrian information. This paper studies deep-learning-based pedestrian detection and tracking algorithms on multi-modal images. Building on the YOLOv4 algorithm and adding a channel stack fusion module, it proposes a multi-modal pedestrian detection algorithm for intelligent transportation that fuses visible-light and infrared image features to improve detection in complex road environments. Experiments show that, compared with the high-performing Visible-YOLOv4 algorithm, the proposed Double-YOLOv4-CSE algorithm improves accuracy by 5.0% and reduces the log-average miss rate by 6.9%. The research aims for the algorithm to run smoothly even on a low-configuration 1080 Ti GPU and to broaden its coverage at the application layer, making it affordable and practical for both urban and rural areas. This addresses the broader research problem of smart cities and remote endpoints with limited computational power.
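
The logarithmic average missing rate reported in the abstract is more commonly called the log-average miss rate. The abstract does not say how the authors compute it, so the following is only a minimal sketch of the conventional pedestrian-detection definition: the geometric mean of the miss rate sampled at nine false-positives-per-image (FPPI) values evenly spaced in log space between 0.01 and 1. The function name, the input format, and the rule for FPPI values the curve never reaches are assumptions made for illustration, not details taken from the paper.

```python
import math


def log_average_miss_rate(fppi, miss_rate, num_points=9):
    """Geometric mean of miss rates sampled at log-spaced FPPI values.

    fppi and miss_rate are paired operating points of a detector's
    miss-rate curve, sorted by ascending FPPI (hypothetical input format).
    """
    # Reference FPPI values evenly spaced in log space over [1e-2, 1e0].
    refs = [10 ** (-2 + 2 * i / (num_points - 1)) for i in range(num_points)]

    sampled = []
    for ref in refs:
        # Miss rate at the last operating point whose FPPI does not exceed ref.
        reached = [mr for fp, mr in zip(fppi, miss_rate) if fp <= ref]
        # Assumption: if the curve never gets this low in FPPI,
        # treat every pedestrian as missed at this reference point.
        sampled.append(reached[-1] if reached else 1.0)

    # Geometric mean; the small floor avoids log(0) for a perfect detector.
    logs = [math.log(max(mr, 1e-10)) for mr in sampled]
    return math.exp(sum(logs) / len(logs))


# Usage with made-up operating points (not results from the paper).
example = log_average_miss_rate([0.001, 0.05], [0.4, 0.1])
print(f"log-average miss rate: {example:.3f}")
```

Averaging in log space keeps the high miss rates at very low FPPI from dominating the score, which is why the metric is usually preferred over a plain arithmetic mean of the curve. Lower values are better.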

Keywords: deep learning; multimodal; smart cities; future internet; pedestrian detection; feature fusion; YOLOv4 algorithm
JEL-codes: O3
Date: 2024

Downloads:
https://www.mdpi.com/1999-5903/16/6/194/pdf (application/pdf)
https://www.mdpi.com/1999-5903/16/6/194/ (text/html)

Persistent link: https://EconPapers.repec.org/RePEc:gam:jftint:v:16:y:2024:i:6:p:194-:d:1406139

Future Internet is currently edited by Ms. Grace You
