A Novel Outlier-Robust Accuracy Measure for Machine Learning Regression Using a Non-Convex Distance Metric
Ahmad B. Hassanat (),
Mohammad Khaled Alqaralleh,
Ahmad S. Tarawneh (),
Khalid Almohammadi,
Maha Alamri,
Abdulkareem Alzahrani,
Ghada A. Altarawneh and
Rania Alhalaseh
Additional contact information
Ahmad B. Hassanat: Faculty of Information Technology, Mutah University, Karak 61710, Jordan
Mohammad Khaled Alqaralleh: Faculty of Information Technology, Mutah University, Karak 61710, Jordan
Ahmad S. Tarawneh: Faculty of Information Technology, Mutah University, Karak 61710, Jordan
Khalid Almohammadi: Department of Computer Science, Applied College, University of Tabuk, Tabuk 47512, Saudi Arabia
Maha Alamri: Department of Systems and Networking, Faculty of Computing and Information, Al-Baha University, Al-Baha 65779, Saudi Arabia
Abdulkareem Alzahrani: Department of Computer Science, Faculty of Computing and Information, Al-Baha University, Al-Baha 65779, Saudi Arabia
Ghada A. Altarawneh: Faculty of Business, Mutah University, Karak 61710, Jordan
Rania Alhalaseh: Faculty of Information Technology, Mutah University, Karak 61710, Jordan
Mathematics, 2024, vol. 12, issue 22, 1-20
Abstract:
Regression, a supervised machine learning approach, establishes relationships between independent variables and a continuous dependent variable. It is widely applied in areas like price prediction and time series forecasting. The performance of regression models is typically assessed using error metrics such as the Mean Squared Error (MSE), Mean Absolute Error (MAE), and Root Mean Squared Error (RMSE). However, these metrics present challenges including sensitivity to outliers (notably MSE and RMSE) and scale dependency, which complicates comparisons across different models. Additionally, traditional metrics sometimes yield values that are difficult to interpret across various problems. Consequently, there is a need for a metric that consistently reflects regression model performance, independent of the problem domain, data scale, and outlier presence. To overcome these shortcomings, this paper introduces a new regression accuracy measure based on the Hassanat distance, a non-convex distance metric. This measure is not only invariant to outliers but also easy to interpret as it provides an accuracy-like value that ranges from 0 to 1 (or 0–100%). We validate the proposed metric against traditional measures across multiple benchmarks, demonstrating its robustness under various model scenarios and data types. Hence, we suggest it as a new standard for assessing regression models’ accuracy.
Keywords: regression; machine learning; performance assessment; Hassanat distance (search for similar items in EconPapers)
JEL-codes: C (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.mdpi.com/2227-7390/12/22/3623/pdf (application/pdf)
https://www.mdpi.com/2227-7390/12/22/3623/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jmathe:v:12:y:2024:i:22:p:3623-:d:1525092
Access Statistics for this article
Mathematics is currently edited by Ms. Emma He
More articles in Mathematics from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().