EconPapers    
Economics at your fingertips  
 

A hybrid random forest to predict soccer matches in international tournaments

Groll Andreas (), Ley Cristophe, Schauberger Gunther and Hans Van Eetvelde
Additional contact information
Groll Andreas: TU Dortmund University, Faculty Statistics, Vogelpothsweg 87, 44227 Dortmund, Germany
Ley Cristophe: Ghent University, Department of Applied Mathematics, Computer Science and Statistics, Krijgslaan 281, S9, Campus Sterre, Ghent 9000, Belgium
Schauberger Gunther: Technische Universitaet Muenchen, Department of Sport and Health Sciences, Munich, Bavaria, Germany
Hans Van Eetvelde: Ghent University, Department of Applied Mathematics, Computer Science and Statistics, Krijgslaan 281, S9, Campus Sterre, Ghent 9000, Belgium

Journal of Quantitative Analysis in Sports, 2019, vol. 15, issue 4, 271-287

Abstract: In this work, we propose a new hybrid modeling approach for the scores of international soccer matches which combines random forests with Poisson ranking methods. While the random forest is based on the competing teams’ covariate information, the latter method estimates ability parameters on historical match data that adequately reflect the current strength of the teams. We compare the new hybrid random forest model to its separate building blocks as well as to conventional Poisson regression models with regard to their predictive performance on all matches from the four FIFA World Cups 2002–2014. It turns out that by combining the random forest with the team ability parameters from the ranking methods as an additional covariate the predictive power can be improved substantially. Finally, the hybrid random forest is used (in advance of the tournament) to predict the FIFA World Cup 2018. To complete our analysis on the previous World Cup data, the corresponding 64 matches serve as an independent validation data set and we are able to confirm the compelling predictive potential of the hybrid random forest which clearly outperforms all other methods including the betting odds.

Keywords: FIFA World Cup 2018; random forests; soccer; sports tournaments; team abilities (search for similar items in EconPapers)
Date: 2019
References: Add references at CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
https://doi.org/10.1515/jqas-2018-0060 (text/html)
For access to full text, subscription to the journal or payment for the individual article is required.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:bpj:jqsprt:v:15:y:2019:i:4:p:271-287:n:1

Ordering information: This journal article can be ordered from
https://www.degruyter.com/journal/key/jqas/html

DOI: 10.1515/jqas-2018-0060

Access Statistics for this article

Journal of Quantitative Analysis in Sports is currently edited by Mark Glickman

More articles in Journal of Quantitative Analysis in Sports from De Gruyter
Bibliographic data for series maintained by Peter Golla ().

 
Page updated 2025-03-19
Handle: RePEc:bpj:jqsprt:v:15:y:2019:i:4:p:271-287:n:1