MRI-Copula: A Hybrid Copula–Machine Learning Framework for Multivariate Risk Indexing in Urban Traffic Safety
Fayez Alanazi (),
Abdalziz Alruwaili and
Amir Shtayat
Additional contact information
Fayez Alanazi: Civil Engineering Department, College of Engineering, Jouf University, Sakaka 72388, Saudi Arabia
Abdalziz Alruwaili: Civil Engineering Department, College of Engineering, Jouf University, Sakaka 72388, Saudi Arabia
Amir Shtayat: Department of City Planning and Design, Jordan University of Science and Technology, Irbid 22110, Jordan
Sustainability, 2025, vol. 17, issue 20, 1-21
Abstract:
Predicting road crash severity remains a major challenge in transportation safety research, requiring models that combine predictive accuracy, interpretability, and computational efficiency. This study introduces a Multi-Risk Index based on Copula Integration (MRI-Copula)—a hybrid framework that integrates Categorical Boosting (CatBoost) with SHapley Additive exPlanations (SHAP) and Vine Copula dependence modeling to assess and predict crash severity. The approach leverages CatBoost–SHAP to quantify the marginal contribution of each risk factor while maintaining model transparency and employs copula-based tail dependence to capture the joint escalation of risk under extreme crash conditions. Using a dataset of 877 police-reported crashes from Jeddah, Saudi Arabia, the framework constructs three interpretable sub-indices—Environmental Risk Index (ERI), Behavioural Risk Index (BRI), and Systemic Risk Index (SRI)—representing distinct domains of crash causation. These indices are combined through a convex weighting parameter (α), optimized via cross-validation (optimal α = 0.80), ensuring a balanced integration of predictive and dependence-based information. Comparative evaluation across multiple classifiers—CatBoost, Light Gradient Boosting Machine (LightGBM), Histogram-based Gradient Boosting (HistGB), and Logistic Regression—demonstrated the robustness of the framework. The CatBoost + MRI-Copula configuration achieved the highest predictive performance (AUC = 0.986; F1 = 0.904), while LightGBM and HistGB offered comparable accuracy (AUC ≈ 0.958; F1 ≈ 0.89) at a fraction of the computational time (≤1 s versus 32 s for CatBoost), highlighting a trade-off between analytical precision and scalability. Consequently, the MRI-Copula framework provides a transparent and theoretically grounded foundation for data-driven road safety management. It bridges predictive analytics and decision support offering a scalable, interpretable, and policy-relevant tool for proactive crash risk mitigation.
Keywords: crash severity; vine copula; machine learning; CatBoost; SHAP; multivariate risk index; explainable AI (search for similar items in EconPapers)
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56 (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2071-1050/17/20/9210/pdf (application/pdf)
https://www.mdpi.com/2071-1050/17/20/9210/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:17:y:2025:i:20:p:9210-:d:1773504
Access Statistics for this article
Sustainability is currently edited by Ms. Alexandra Wu
More articles in Sustainability from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().