Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S&P 500

Krauss, Christopher; Do, Xuan Anh; Huck, Nicolas

Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S&P 500

Christopher Krauss, Xuan Anh Do and Nicolas Huck ()
Additional contact information
Christopher Krauss: FAU - Friedrich-Alexander Universität Erlangen-Nürnberg = University of Erlangen-Nuremberg
Xuan Anh Do: FAU - Friedrich-Alexander Universität Erlangen-Nürnberg = University of Erlangen-Nuremberg

Post-Print from HAL

Abstract: In recent years, machine learning research has gained momentum: new developments in the field of deep learning allow for multiple levels of abstraction and are starting to supersede well-known and powerful tree-based techniques mainly operating on the original feature space. All these methods can be applied to various fields, including finance. This paper implements and analyzes the effectiveness of deep neural networks (DNN), gradient-boosted-trees (GBT), random forests (RAF), and several ensembles of these methods in the context of statistical arbitrage. Each model is trained on lagged returns of all stocks in the S&P 500, after elimination of survivor bias. From 1992 to 2015, daily one-day-ahead trading signals are generated based on the probability forecast of a stock to outperform the general market. The highest k probabilities are converted into long and the lowest k probabilities into short positions, thus censoring the less certain middle part of the ranking. Empirical findings are promising. A simple, equal-weighted ensemble (ENS1) consisting of one deep neural network, one gradient-boosted tree, and one random forest produces out-of-sample returns exceeding 0.45 percent per day for k=10, prior to transaction costs. Irrespective of the fact that profits are declining in recent years, our findings pose a severe challenge to the semi-strong form of market efficiency.

Keywords: Gradient-boosting; Deep learning; Finance; Ensemble learning; Random forests (search for similar items in EconPapers)
Date: 2017
References: Add references at CitEc
Citations: View citations in EconPapers (155)

Published in European Journal of Operational Research, 2017, 259 (2), pp.689-702. ⟨10.1016/j.ejor.2016.10.031⟩

There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.

Related works:
Journal Article: Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S&P 500 (2017)
Working Paper: Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S&P 500 (2016)
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:hal:journl:hal-01515120

DOI: 10.1016/j.ejor.2016.10.031

Access Statistics for this paper

More papers in Post-Print from HAL
Bibliographic data for series maintained by CCSD ().