Improvement of Construction Workers’ Drowsiness Detection and Classification via Text-to-Image Augmentation and Computer Vision
Daegyo Jung,
Yejun Lee,
Kihyun Jeong,
Jeehee Lee,
Jinwoo Kim,
Hyunjung Park and
Jungho Jeon ()
Additional contact information
Daegyo Jung: Department of Architectural Engineering, Pusan National University, Busandaehak-ro 63beon-gil, Geumjeong-gu, Busan 46241, Republic of Korea
Yejun Lee: Department of Architectural Engineering, Pusan National University, Busandaehak-ro 63beon-gil, Geumjeong-gu, Busan 46241, Republic of Korea
Kihyun Jeong: Department of Architectural Engineering, Pusan National University, Busandaehak-ro 63beon-gil, Geumjeong-gu, Busan 46241, Republic of Korea
Jeehee Lee: School of Architecture and Building Science, Chung-Ang University, 84, Heukseok-ro, Dongjak-gu, Seoul 06974, Republic of Korea
Jinwoo Kim: Department of Architectural Engineering, Kumoh National Institute of Technology, Daehak-ro, Gumi-si 39177, Republic of Korea
Hyunjung Park: Department of Architecture, Silla University, Baekyang-daero 700beon-gil, Sasang-gu, Busan 46958, Republic of Korea
Jungho Jeon: Department of Architectural Engineering, Pusan National University, Busandaehak-ro 63beon-gil, Geumjeong-gu, Busan 46241, Republic of Korea
Sustainability, 2025, vol. 17, issue 20, 1-15
Abstract:
Detecting and classifying construction workers’ drowsiness is critical in the construction safety management domain. Research efforts to increase the reliability of drowsiness detection through image augmentation and computer vision approaches face two key challenges: the related size constraints and the number of manual tasks associated with creating input images necessary for training vision algorithms. Although text-to-image (T2I) has emerged as a promising alternative, the dynamic relationship between T2I-driven image characteristics (e.g., contextual relevance), different computer vision algorithms, and the resulting performance remains lacking. To address the gap, this study proposes T2I-centered computer vision approaches for enhanced drowsiness detection by creating four separate image sets (e.g., construction vs. non-construction) labeled using the polygon method, developing two detection models (YOLOv8 and YOLO11), and comparing the performance. The results showed that the use of construction domain-specific images for training both YOLOv8 and YOLO11 led to higher mAP@50 of 68.2% and 56.6%, respectively, compared to those trained using non-construction images (53.4% and 53.5%). Also, increasing the number of T2I-generated training images improved mAP@50 from 68.2% (baseline) to 95.3% for YOLOv8 and 56.6% to 93.3% for YOLO11. The findings demonstrate the effectiveness of leveraging the T2I augmentation approach for improved construction workers’ drowsiness detection.
Keywords: construction safety and health; drowsiness detection; construction workers; computer vision; text-to-image (search for similar items in EconPapers)
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56 (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2071-1050/17/20/9158/pdf (application/pdf)
https://www.mdpi.com/2071-1050/17/20/9158/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:17:y:2025:i:20:p:9158-:d:1772378
Access Statistics for this article
Sustainability is currently edited by Ms. Alexandra Wu
More articles in Sustainability from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().