Exploiting social media with higher-order Factorization Machines: Statistical arbitrage on high-frequency data of the S&P 500
Johannes Stübinger and
No 13/2017, FAU Discussion Papers in Economics from Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics
Over the past 15 years,there have been a number of studies using text mining for predicting stock market data. Two recent publications employed support vector machines and second-order Factorization Machines, respectively, to this end. However, these approaches either completely neglect interactions between the features extracted from the text, or they only account for second-order interactions. In thispaper, weapply higher-order Factorization Machines, for which efficient training algorithms have only been available since 2016. As Factorization Machines require hyperparameters to be specified, we also introduce the novel adaptive-order algorithm for automatically determining them. Our studyis the first one tomake use of social media data for predicting high-frequency stock returns, namely the ones of the S&P 500 stock constituents. We show that, unlike a trading strategy employing support vector machines, Factorization-Machine-based strategies attain positive returns after transactions costs for the years 2014 and 2015. Especially the approach applying thea daptive-order algorithm outperforms classical approaches with respect to a multitude of criteria, and it features very favorable characteristics.
Keywords: finance; factorization machine; social media; statistical arbitrage; high-frequency data (search for similar items in EconPapers)
New Economics Papers: this item is included in nep-mst
References: View references in EconPapers View complete reference list from CitEc
Citations Track citations by RSS feed
Downloads: (external link)
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
Persistent link: https://EconPapers.repec.org/RePEc:zbw:iwqwdp:132017
Access Statistics for this paper
More papers in FAU Discussion Papers in Economics from Friedrich-Alexander University Erlangen-Nuremberg, Institute for Economics Contact information at EDIRC.
Bibliographic data for series maintained by ZBW - Leibniz Information Centre for Economics ().