FID-YOLO: A pedestrian detection model integrating multispectral information in complex environments
Di Yang,
Xilong Zhang and
Peng Wang
PLOS ONE, 2026, vol. 21, issue 3, 1-26
Abstract:
The advancement of pedestrian detection technology is of great importance for various applications such as intelligent driving, object tracking, and robot navigation. Many studies in this field have demonstrated that image quality significantly contributes to the precision of detection. However, unexpected factors such as adverse weather, occlusions, and scale variations, which extremely weaken the main features of the detected objects, leading to a decrease in detection accuracy. To address these problems, we propose a Feature-enriched Image Detection-YOLO (FID-YOLO), to improve pedestrian detection performance in complex environments by integrating visible and infrared light information. Specifically, we design an illumination-aware image fusion module for visible and infrared image information fusion to generate a new image within more information to enrich pedestrian features. Then, a cascaded feature aggregation module using reparameterization and channel shuffle is introduced to enhance the model’s understanding and generalization capabilities for complex scenes. Furthermore, we exploit a scale-adaptive feature detection head for YOLO detector, which solves the problem of detecting small objects at varying object scales. Experiments on M3FD and LLVIP datasets demonstrate that FID-YOLO outperforms the benchmark models in pedestrian detection. Additionally, we validate the indispensability of each proposed module through ablation experiments.
Date: 2026
References: Add references at CitEc
Citations:
Downloads: (external link)
https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0342054 (text/html)
https://journals.plos.org/plosone/article/file?id= ... 42054&type=printable (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:plo:pone00:0342054
DOI: 10.1371/journal.pone.0342054
Access Statistics for this article
More articles in PLOS ONE from Public Library of Science
Bibliographic data for series maintained by plosone ().