Combining Machine Learning Classifiers for Stock Trading with Effective Feature Extraction
A. K. M. Amanat Ullah,
Fahim Imtiaz,
Miftah Uddin Md Ihsan,
Md. Golam Rabiul Alam and
Mahbub Majumdar
Papers from arXiv.org
Abstract:
The unpredictability and volatility of the stock market render it challenging to make a substantial profit using any generalised scheme. Many previous studies tried different techniques to build a machine learning model, which can make a significant profit in the US stock market by performing live trading. However, very few studies have focused on the importance of finding the best features for a particular trading period. Our top approach used the performance to narrow down the features from a total of 148 to about 30. Furthermore, the top 25 features were dynamically selected before each time training our machine learning model. It uses ensemble learning with four classifiers: Gaussian Naive Bayes, Decision Tree, Logistic Regression with L1 regularization, and Stochastic Gradient Descent, to decide whether to go long or short on a particular stock. Our best model performed daily trade between July 2011 and January 2019, generating 54.35% profit. Finally, our work showcased that mixtures of weighted classifiers perform better than any individual predictor of making trading decisions in the stock market.
Date: 2021-07, Revised 2023-08
New Economics Papers: this item is included in nep-big, nep-cmp, nep-fmk and nep-isf
References: View references in EconPapers View complete reference list from CitEc
Citations:
Published in Int. J. Computational Science and Engineering, Vol. 26 No.1, (2023)
Downloads: (external link)
http://arxiv.org/pdf/2107.13148 Latest version (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:arx:papers:2107.13148
Access Statistics for this paper
More papers in Papers from arXiv.org
Bibliographic data for series maintained by arXiv administrators ().