Regularization of Autoencoders for Bank Client Profiling Based on Financial Transactions

Filchenkov, Andrey; Khanzhina, Natalia; Tsai, Arina; Smetannikov, Ivan

Regularization of Autoencoders for Bank Client Profiling Based on Financial Transactions

Andrey Filchenkov, Natalia Khanzhina, Arina Tsai and Ivan Smetannikov
Additional contact information
Andrey Filchenkov: Machine Learning Lab, ITMO University, 49 Kronverksky Pr., 197101 St. Petersburg, Russia
Natalia Khanzhina: Machine Learning Lab, ITMO University, 49 Kronverksky Pr., 197101 St. Petersburg, Russia
Arina Tsai: Computer Technologies Department, Formerly ITMO University, 49 Kronverksky Pr., 197101 St. Petersburg, Russia
Ivan Smetannikov: Machine Learning Lab, ITMO University, 49 Kronverksky Pr., 197101 St. Petersburg, Russia

Risks, 2021, vol. 9, issue 3, 1-16

Abstract: Predicting if a client is worth giving a loan—credit scoring—is one of the most essential and popular problems in banking. Predictive models for this goal are built on the assumption that there is a dependency between the client’s profile before the loan approval and their future behavior. However, circumstances that cause changes in the client’s behavior may not depend on their will and cannot be predicted by their profile. Such clients may be considered “noisy” as their eventual belonging to the defaulters class results rather from random factors than from some predictable rules. Excluding such clients from the dataset may be helpful in building more accurate predictive models. In this paper, we report on primary results on testing the hypothesis that a client can become a defaulter in two scenarios: intentionally and unintentionally. We verify our hypothesis applying data driven regularized classification using an autoencoder to client profiles. To model an intention as a hidden variable, we propose an especially designed regularizer for the autoencoder. The regularizer aims to obtain a representation of defaulters that includes a cluster of intentional defaulters and unintentional defaulters as outliers. The outliers were detected by our model and excluded from the dataset. This improved the credit scoring model and confirmed our hypothesis.

Keywords: clustering; autoencoder; regularization; neural networks; machine learning; credit scoring; transaction profiling; defaulters (search for similar items in EconPapers)
JEL-codes: C G0 G1 G2 G3 K2 M2 M4 (search for similar items in EconPapers)
Date: 2021
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2227-9091/9/3/54/pdf (application/pdf)
https://www.mdpi.com/2227-9091/9/3/54/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jrisks:v:9:y:2021:i:3:p:54-:d:519306

Access Statistics for this article

Risks is currently edited by Mr. Claude Zhang

More articles in Risks from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().