Textual analysis and credit scoring: a new matrix factorization approach
Bingjie Dong, Ying Zhou, Xuanming Sun and Jingjing Zhu
Journal of the Operational Research Society, 2025, vol. 76, issue 6, 1189-1203
Abstract:
Credit scoring models are central to financial institutions’ credit decisions. This study examines how variables can be extracted from loan statements and whether such textual variables improve the accuracy of a default prediction model. We extract variables from loan statements using a combination of forward selection and non-negative matrix factorization, and build a credit scoring model on both loan statement and numerical data. In the comparative analysis, the credit scoring model that combines an optimal cut-off logistic regression with both types of data achieved the highest accuracy. Moreover, compared with a credit scoring model built with a deep learning method based on word vectors, the model in this study is more interpretable. The regression analysis shows that the variables extracted from the loan statements have a significant effect on default status.
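The two techniques named in the abstract, non-negative matrix factorization of loan statement text and a logistic regression cut-off chosen from the data, can be illustrated with a minimal sketch. This is not the authors' implementation. The toy term-count matrix `V`, the helper names `nmf` and `best_cutoff`, the use of Lee-Seung multiplicative updates, and the choice of accuracy as the cut-off criterion are all assumptions made for illustration; the abstract does not specify any of them, and the forward selection step is omitted.

```python
import random

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    cols_b = list(zip(*b))
    return [[sum(x * y for x, y in zip(row, col)) for col in cols_b] for row in a]

def transpose(m):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(col) for col in zip(*m)]

def frobenius_error(v, w, h):
    """Squared Frobenius norm of V - WH."""
    wh = matmul(w, h)
    return sum((v[i][j] - wh[i][j]) ** 2
               for i in range(len(v)) for j in range(len(v[0])))

def nmf(v, rank, n_iter=200, seed=0, eps=1e-9):
    """Factor a non-negative matrix V (statements x terms) as W @ H.

    W (statements x rank) holds per-statement topic weights, which could
    serve as textual variables; H (rank x terms) holds per-topic term weights.
    Uses multiplicative updates (an assumed choice, not taken from the paper).
    """
    rng = random.Random(seed)
    n, m = len(v), len(v[0])
    w = [[rng.random() + 0.1 for _ in range(rank)] for _ in range(n)]
    h = [[rng.random() + 0.1 for _ in range(m)] for _ in range(rank)]
    for _ in range(n_iter):
        # Update H: H <- H * (W^T V) / (W^T W H)
        wt = transpose(w)
        num, den = matmul(wt, v), matmul(matmul(wt, w), h)
        h = [[h[k][j] * num[k][j] / (den[k][j] + eps) for j in range(m)]
             for k in range(rank)]
        # Update W: W <- W * (V H^T) / (W H H^T)
        ht = transpose(h)
        num, den = matmul(v, ht), matmul(w, matmul(h, ht))
        w = [[w[i][k] * num[i][k] / (den[i][k] + eps) for k in range(rank)]
             for i in range(n)]
    return w, h

def best_cutoff(probs, labels):
    """Return (threshold, accuracy) maximising accuracy over observed scores.

    Accuracy is an assumed criterion; the paper may define "optimal" differently.
    """
    best_t, best_acc = 0.5, -1.0
    for t in sorted(set(probs)):
        acc = sum((p >= t) == bool(y) for p, y in zip(probs, labels)) / len(labels)
        if acc > best_acc:
            best_t, best_acc = t, acc
    return best_t, best_acc

# Hypothetical term counts: 4 loan statements x 4 terms, two latent topics.
V = [[2, 1, 0, 0],
     [4, 2, 0, 0],
     [0, 0, 1, 3],
     [0, 0, 2, 6]]
W, H = nmf(V, rank=2)
```

In a sketch like this, the rows of `W` would be appended to the numerical features before fitting the logistic regression, and `best_cutoff` would be applied to its predicted default probabilities on held-out data. In practice a library implementation such as scikit-learn's `NMF` would normally replace the hand-written updates.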
Date: 2025
Downloads: (external link)
http://hdl.handle.net/10.1080/01605682.2024.2416908 (text/html)
Access to full text is restricted to subscribers.
Persistent link: https://EconPapers.repec.org/RePEc:taf:tjorxx:v:76:y:2025:i:6:p:1189-1203
DOI: 10.1080/01605682.2024.2416908
Journal of the Operational Research Society is currently edited by Tom Archibald