EconPapers    
Economics at your fingertips  
 

The Good, the Better and the Challenging:Insights into Predicting High-Growth Firms using Machine Learning

Sermet Pekin and Aykut Sengul

Working Papers from Research and Monetary Policy Department, Central Bank of the Republic of Turkey

Abstract: This study aims to classify high-growth firms using several machine learning algorithms, including K-Nearest Neighbors, Logistic Regression with L1 (Lasso) and L2 (Ridge) Regularization, XGBoost, Gradient Descent, Naive Bayes and Random Forest. Leveraging a dataset composed of financial metrics and firm characteristics between 2009 and 2022 with 1,318,799 unique firms (averaging 554,178 annually), we evaluate the performance of each model using metrics such as MCC, ROC AUC, accuracy, precision, recall and F1-score. In our study, ROC AUC values ranged from 0.53 to 0.87 for employee-high growth and from 0.53 to 0.91 for turnover-high growth, depending on the method used. Our findings indicate that XGBoost achieves the highest performance, followed by Random Forest and Logistic Regression, demonstrating their effectiveness in distinguishing between high-growth and non-high-growth firms. Conversely, KNN and Naive Bayes yield lower accuracy. Furthermore, our findings reveal that growth opportunity emerges as the most significant factor in our study. This research contributes valuable insights in identifying high-growth firms and underscores the potential of machine learning in economic prediction.

Keywords: High-growth firms; Machine learning; Prediction; Firm dynamics (search for similar items in EconPapers)
JEL-codes: C40 C55 C60 C81 L25 (search for similar items in EconPapers)
Date: 2024
New Economics Papers: this item is included in nep-big, nep-cmp, nep-ent and nep-sbm
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.tcmb.gov.tr/wps/wcm/connect/f6ba5939-e ... 26edd66d1e0d-pg3jixk (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:tcb:wpaper:2413

Access Statistics for this paper

More papers in Working Papers from Research and Monetary Policy Department, Central Bank of the Republic of Turkey Contact information at EDIRC.
Bibliographic data for series maintained by Sermet Pekin () and Ilker Cakar () and ().

 
Page updated 2025-03-23
Handle: RePEc:tcb:wpaper:2413