Machine Learning for Bankruptcy Prediction in the American Stock Market: Dataset and Benchmarks

Gianfranco Lombardo, Mattia Pellegrino, George Adosoglou, Stefano Cagnoni, Panos M. Pardalos and Agostino Poggi
Additional contact information
Gianfranco Lombardo: Department of Engineering and Architecture, University of Parma, 43124 Parma, Italy
Mattia Pellegrino: Department of Engineering and Architecture, University of Parma, 43124 Parma, Italy
George Adosoglou: Department of Industrial and Systems Engineering, University of Florida, Gainesville, FL 32611, USA
Stefano Cagnoni: Department of Engineering and Architecture, University of Parma, 43124 Parma, Italy
Panos M. Pardalos: Department of Industrial and Systems Engineering, University of Florida, Gainesville, FL 32611, USA
Agostino Poggi: Department of Engineering and Architecture, University of Parma, 43124 Parma, Italy

Future Internet, 2022, vol. 14, issue 8, 1-23

Abstract: Predicting corporate bankruptcy is one of the fundamental tasks in credit risk assessment. In particular, since the 2007/2008 financial crisis, it has become a priority for most financial institutions, practitioners, and academics. The recent advancements in machine learning (ML) enabled the development of several models for bankruptcy prediction. The most challenging aspect of this task is dealing with the class imbalance due to the rarity of bankruptcy events in the real economy. Furthermore, a fair comparison in the literature is difficult to make because bankruptcy datasets are not publicly available and because studies often restrict their datasets to specific economic sectors and markets and/or time periods. In this work, we investigated the design and the application of different ML models to two different tasks related to default events: (a) estimating survival probabilities over time; (b) default prediction using time-series accounting data with different lengths. The entire dataset used for the experiments has been made available to the scientific community for further research and benchmarking purposes. The dataset pertains to 8262 different public companies listed on the American stock market between 1999 and 2018. Finally, in light of the results obtained, we critically discuss the most interesting metrics as proposed benchmarks for future studies.

Keywords: bankruptcy prediction; deep learning; multi-head; LSTM; machine learning; stock market
JEL-codes: O3
Date: 2022

Downloads: (external link)
https://www.mdpi.com/1999-5903/14/8/244/pdf (application/pdf)
https://www.mdpi.com/1999-5903/14/8/244/ (text/html)

Persistent link: https://EconPapers.repec.org/RePEc:gam:jftint:v:14:y:2022:i:8:p:244-:d:894540

Future Internet is currently edited by Ms. Grace You

Page updated 2025-03-19
Handle: RePEc:gam:jftint:v:14:y:2022:i:8:p:244-:d:894540