Assessment of creditworthiness models privacy-preserving training with synthetic data

Mu\~noz-Cancino, Ricardo; Bravo, Cristi\'an; R\'ios, Sebasti\'an A.; Gra\~na, Manuel

Assessment of creditworthiness models privacy-preserving training with synthetic data

Ricardo Mu\~noz-Cancino, Cristi\'an Bravo, Sebasti\'an A. R\'ios and Manuel Gra\~na

Abstract: Credit scoring models are the primary instrument used by financial institutions to manage credit risk. The scarcity of research on behavioral scoring is due to the difficult data access. Financial institutions have to maintain the privacy and security of borrowers' information refrain them from collaborating in research initiatives. In this work, we present a methodology that allows us to evaluate the performance of models trained with synthetic data when they are applied to real-world data. Our results show that synthetic data quality is increasingly poor when the number of attributes increases. However, creditworthiness assessment models trained with synthetic data show a reduction of 3\% of AUC and 6\% of KS when compared with models trained with real data. These results have a significant impact since they encourage credit risk investigation from synthetic data, making it possible to maintain borrowers' privacy and to address problems that until now have been hampered by the availability of information.

Date: 2022-12
New Economics Papers: this item is included in nep-ban and nep-rmg
References: View references in EconPapers View complete reference list from CitEc
Citations:

Published in Hybrid Artificial Intelligent Systems. HAIS 2022. Lecture Notes in Computer Science(), vol 13469

Downloads: (external link)
http://arxiv.org/pdf/2301.01212 Latest version (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:arx:papers:2301.01212

Access Statistics for this paper

More papers in Papers from arXiv.org
Bibliographic data for series maintained by arXiv administrators ().