Accuracy of explanations of machine learning models for credit decisions
Andres Alonso and
José Manuel Carbó
Additional contact information
José Manuel Carbó: Banco de España
No 2222, Working Papers from Banco de España
Abstract:
One of the biggest challenges for the application of machine learning (ML) models in finance is how to explain their results. In recent years, innovative interpretability techniques have appeared to assist in this task, although their usefulness is still a matter of debate within the industry. In this article we propose a novel framework to assess how accurate these techniques are. Our work is based on the generation of synthetic datasets. This allows us to define the importance of the variables, so we can calculate to what extent the explanations given by these techniques match the ground truth of our data. We perform an empirical exercise in which we apply two non-interpretable ML models (XGBoost and Deep Learning) to the synthetic datasets, , and then we explain their results using two popular interpretability techniques, SHAP and permutation Feature Importance (FI). We conclude that generating synthetic datasets shows potential as a useful approach for supervisors and practitioners who wish to assess interpretability techniques.
Keywords: synthetic datasets; artificial intelligence; interpretability; machine learning; credit assessment (search for similar items in EconPapers)
JEL-codes: C55 C63 G17 (search for similar items in EconPapers)
Pages: 45 pages
Date: 2022-06
New Economics Papers: this item is included in nep-big, nep-cmp and nep-pay
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://www.bde.es/f/webbde/SES/Secciones/Publicac ... 22/Files/dt2222e.pdf First version, June 2022 (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:bde:wpaper:2222
Access Statistics for this paper
More papers in Working Papers from Banco de España Contact information at EDIRC.
Bibliographic data for series maintained by Ángel Rodríguez. Electronic Dissemination of Information Unit. Research Department. Banco de España ().