Corr-SHAP: Correlation-Aware Sampling for Faithful SHAP Value Estimation

Hamdi, Ridha El; Charaabi, Hana; Hdhiri, Ibtissam; Njah, Mohamed

Acta Informatica Pragensia, vol. preprint

Abstract:

Background: SHapley Additive exPlanations (SHAP) methods are widely used to interpret machine learning models, yet most implementations assume feature independence. This assumption rarely holds in practice, especially when features are correlated, leading to biased and unstable attributions.

Objective: We introduce Corr-SHAP, a correlation-aware SHAP approach that produces more faithful and stable feature attributions by explicitly modeling feature dependencies. Our aim is to enhance the accuracy, robustness, and scalability of SHAP explanations for models trained on correlated data.

Methods: Corr-SHAP models feature correlations via a multivariate Gaussian approximation with a Ledoit-Wolf covariance estimator. We design a correlation-aware sampling distribution that penalizes redundant coalitions, improving computational efficiency in higher dimensions. To correct the induced bias, we employ a Self-Normalized Importance Sampling estimator, which re-weights samples by the ratio of the true Shapley kernel to the sampling probability. Our analysis establishes high-probability error bounds in terms of Effective Sample Size, extending convergence guarantees to correlated feature spaces.

Results: Across synthetic and real-world datasets, Corr-SHAP achieves Shapley value estimates that closely align with Kernel SHAP, while exhibiting substantially lower variance and more stable feature rankings. In correlated clusters, Corr-SHAP systematically down-weights redundant features, improving ranking fidelity without introducing bias. To further support scalability, we demonstrate that combining Corr-SHAP with Leverage-SHAP reduces variance in higher-dimensional settings.

Conclusion: Corr-SHAP provides a statistically grounded and computationally efficient framework for SHAP value estimation under feature correlation. By integrating correlation modeling, bias correction, and variance reduction, it scales beyond small toy problems and delivers explanations that are both accurate and reliable, making it a valuable tool for practitioners analyzing complex real-world datasets.
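
Illustration (not from the paper): the re-weighting step named in the Methods can be sketched in a few lines. The sketch below is only an assumption-laden illustration of generic self-normalized importance sampling, not the authors' implementation. It uses the standard Kernel SHAP weight for a coalition of size s out of d features, (d - 1) / (C(d, s) · s · (d - s)), and the Kish formula for effective sample size. The function names, the feature count `d = 4`, and the uniform proposal `uniform_q` are hypothetical stand-ins; the paper's correlation-aware sampling distribution is not specified in the abstract and is not reproduced here.

```python
import math
import random


def shapley_kernel(d, size):
    """Standard Kernel SHAP weight for a coalition of `size` out of `d` features (0 < size < d)."""
    return (d - 1) / (math.comb(d, size) * size * (d - size))


def snis_weights(d, coalitions, proposal_prob):
    """Importance ratios kernel(S) / q(S), rescaled so that they sum to 1."""
    raw = [shapley_kernel(d, len(s)) / proposal_prob(s) for s in coalitions]
    total = sum(raw)
    return [w / total for w in raw]


def effective_sample_size(weights):
    """Kish effective sample size: (sum of w)^2 / (sum of w^2)."""
    return sum(weights) ** 2 / sum(w * w for w in weights)


# Hypothetical setup: 4 features and a uniform proposal over all
# non-empty, non-full coalitions (a placeholder for any sampling scheme).
d = 4


def uniform_q(coalition):
    return 1.0 / (2 ** d - 2)


all_coalitions = [
    frozenset(i for i in range(d) if (mask >> i) & 1)
    for mask in range(1, 2 ** d - 1)
]

rng = random.Random(0)  # fixed seed so the draw is reproducible
sample = [rng.choice(all_coalitions) for _ in range(200)]

weights = snis_weights(d, sample, uniform_q)
ess = effective_sample_size(weights)
print(f"effective sample size: {ess:.1f} of {len(sample)} draws")
```

When the proposal matches the kernel closely the weights are nearly equal and the effective sample size approaches the number of draws; a poorly matched proposal concentrates weight on a few coalitions and the effective sample size drops, which is why error bounds are naturally stated in terms of it.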

Keywords: Explainable artificial intelligence; XAI; SHapley Additive exPlanations; Feature correlation; Model interpretability; Importance sampling; Variance reduction
Downloads:
http://aip.vse.cz/doi/10.18267/j.aip.306.html (text/html, free of charge)

Persistent link: https://EconPapers.repec.org/RePEc:prg:jnlaip:v:preprint:id:306

Ordering information: This journal article can be ordered from
Redakce Acta Informatica Pragensia, Katedra systémové analýzy, Vysoká škola ekonomická v Praze, nám. W. Churchilla 4, 130 67 Praha 3
http://aip.vse.cz

DOI: 10.18267/j.aip.306

Acta Informatica Pragensia is currently edited by Editorial Office

More articles in Acta Informatica Pragensia from Prague University of Economics and Business
Bibliographic data for series maintained by Stanislav Vojir.