EconPapers    
Economics at your fingertips  
 

Unlocking Stock Return Predictions: Using Financial Statements with Random Forest and PCA

Yinan Jin ()
Additional contact information
Yinan Jin: Beijing University of Technology, College of Computer Science

A chapter in Proceedings of the International Workshop on Navigating the Digital Business Frontier for Sustainable Financial Innovation (ICDEBA 2024), 2025, pp 664-673 from Springer

Abstract: Abstract Financial statements are pivotal for forecasting the future performance of stocks. Harnessing the random forest machine learning model, this study aims to enhance the prediction of quarterly stock returns by focusing on twelve critical financial indicators. This paper utilized Principal Component Analysis (PCA) for dimensionality reduction and feature selection, aiming to optimize the model's predictive accuracy. The dataset encompassed quarterly financial statements and stock data for the 100 constituent stocks of the NASDAQ 100 index from 2010 to 2020. The PCA analysis revealed that reducing the input features to six dimensions significantly improved the model's predictive performance, as indicated by Mean Squared Error (MSE) and Mean Absolute Error (MAE). This finding suggests that an overabundance of components can introduce unnecessary complexity, potentially detracting from the model's predictive capabilities. The feature importance assessment, conducted using the random forest algorithm, identified Volatility, Revenue Growth Rate, and Return as the most influential predictors. Notably, the optimal predictive performance was achieved with the inclusion of seven and five top features, respectively, highlighting the non-linear relationship between the number of features and model performance. This comprehensive study underscores the utility of the random forest model in predicting stock returns and emphasizes the critical role of dimensionality reduction and feature selection refinement in enhancing predictive accuracy.

Keywords: Random Forest; PCA (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:

There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:advbcp:978-94-6463-652-9_69

Ordering information: This item can be ordered from
http://www.springer.com/9789464636529

DOI: 10.2991/978-94-6463-652-9_69

Access Statistics for this chapter

More chapters in Advances in Economics, Business and Management Research from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2026-05-22
Handle: RePEc:spr:advbcp:978-94-6463-652-9_69