Comparing Resampling Algorithms and Classifiers for Modeling Traffic Risk Prediction

Wang, Bo; Zhang, Chi; Wong, Yiik Diew; Hou, Lei; Zhang, Min; Xiang, Yujie

Comparing Resampling Algorithms and Classifiers for Modeling Traffic Risk Prediction

Bo Wang, Chi Zhang (), Yiik Diew Wong (), Lei Hou, Min Zhang and Yujie Xiang
Additional contact information
Bo Wang: School of Highway, Chang’an University, Xi’an 710064, China
Chi Zhang: School of Highway, Chang’an University, Xi’an 710064, China
Yiik Diew Wong: School of Civil and Environmental Engineering, Nanyang Technological University, Singapore 639798, Singapore
Lei Hou: School of Engineering, STEM College, RMIT University, Melbourne, VIC 3001, Australia
Min Zhang: College of Transportation Engineering, Chang’an University, Xi’an 710064, China
Yujie Xiang: School of Highway, Chang’an University, Xi’an 710064, China

IJERPH, 2022, vol. 19, issue 20, 1-23

Abstract: Road infrastructure has significant effects on road traffic safety and needs further examination. In terms of traffic crash prediction, recent studies have started to develop deep learning classification algorithms. However, given the uncertainty of traffic crashes, predicting the traffic risk potential of different road sections remains a challenge. To bridge this knowledge gap, this study investigated a real-world expressway and collected its traffic crash data between 2013 and 2020. Then, according to the time-spatial density ratio ( Pts ), road sections were assigned into three classes corresponding to low, medium, and high risk levels of traffic. Next, different classifiers were compared that were trained using the transformed and resampled feature data to construct a traffic crash risk prediction model. Last, but not least, partial dependence plots (PDPs) were employed to interpret the results and analyze the importance of individual features describing the geometry, pavement, structure, and weather conditions. The results showed that a variety of data balancing algorithms improved the performance of the classifiers, the ensemble classifier superseded the others in terms of the performance metrics, and the combined SMOTEENN and random forest algorithms improved the classification accuracy the most. In the future, the proposed traffic crash risk prediction method will be tested in more road maintenance and design safety assessment scenarios.

Keywords: traffic crash risk prediction; resampling algorithms; classifiers; performance evaluation measures; feature importance (search for similar items in EconPapers)
JEL-codes: I I1 I3 Q Q5 (search for similar items in EconPapers)
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
https://www.mdpi.com/1660-4601/19/20/13693/pdf (application/pdf)
https://www.mdpi.com/1660-4601/19/20/13693/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jijerp:v:19:y:2022:i:20:p:13693-:d:949571

Access Statistics for this article

IJERPH is currently edited by Ms. Jenna Liu

More articles in IJERPH from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().