Identifying the Most Significant Features for Stress Prediction of Automobile Drivers: A Comprehensive Study
May Y. Al-Nashashibi (),
Nuha El-Khalili (),
Wa’el Hadi,
Abedal-Kareem Al-Banna () and
Ghassan Issa ()
Additional contact information
May Y. Al-Nashashibi: Computer Science Department, University of Petra, Amman, Jordan
Nuha El-Khalili: Software Engineering Department, University of Petra, Amman, Jordan
Wa’el Hadi: Information Security Department, University of Petra, Amman, Jordan
Abedal-Kareem Al-Banna: Data Science n Artificial Intelligence Department, University of Petra, Amman, Jordan
Ghassan Issa: School of IT, Skyline University, Sharjah, UAE
Journal of Information & Knowledge Management (JIKM), 2024, vol. 23, issue 02, 1-53
Abstract:
Objective: This paper used three feature selection methods on a Jordanian automobile drivers’ dataset to identify the most significant features for stress prediction algorithm performance. The dataset contains “stress†and “no-stress†classes with 30 features, categorised into physiological and contextual subsets.Methods: Eighteen classifiers from six prediction algorithm categories were evaluated: Rule-based, Tree-based, Ensemble-based, Function-based, Naïve Bayes-based and Lazy-based. Three Feature Subset Selection (FSS) methods were used: Gain Ratio, Chi-square and feature separation. Eight evaluation measures included F1, Accuracy, Specificity, Sensitivity, Kappa Statistics, Mean Absolute Error (MAE), Area Under Curve (AUC) and Precision Recall Curve Area (PRCA).Results: Among the classifiers, Lazy-based LocalKNN performed significantly well in F1, Accuracy, Kappa and MAE. Naïve Bayes-based Bayesian Network excelled in other measures. The original dataset with all features yielded the best overall performance, followed by the physiological-only subset. Gain Ratio and Chi-square FSS methods also showed promising results, though not significant.Conclusion: Four physiological (EMG, EMG Amplitude, Heart rate, Respiration Amplitude) and seven contextual (time range of driving, gender, age, driving skills, general accidents, last year’s accidents, stress frequency) features contributed to the best prediction outcomes. The study highlights the importance of proper feature selection and identifies optimal algorithms for specific measures.
Keywords: Stress prediction; feature selection methods; physiological features; contextual; automobile drivers; evaluation metrics (search for similar items in EconPapers)
Date: 2024
References: Add references at CitEc
Citations:
Downloads: (external link)
http://www.worldscientific.com/doi/abs/10.1142/S0219649223500648
Access to full text is restricted to subscribers
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:wsi:jikmxx:v:23:y:2024:i:02:n:s0219649223500648
Ordering information: This journal article can be ordered from
DOI: 10.1142/S0219649223500648
Access Statistics for this article
Journal of Information & Knowledge Management (JIKM) is currently edited by Professor Suliman Hawamdeh
More articles in Journal of Information & Knowledge Management (JIKM) from World Scientific Publishing Co. Pte. Ltd.
Bibliographic data for series maintained by Tai Tone Lim ().