Clustering, Forecasting and Cluster Forecasting: using k-medoids, k-NNs and random forests for cluster selection
Dinesh Reddy Vangumalli,
Konstantinos Nikolopoulos () and
Additional contact information
Dinesh Reddy Vangumalli: Oracle America Inc
Konstantia Litsiou: Manchester Metropolitan University
No 19016, Working Papers from Bangor Business School, Prifysgol Bangor University (Cymru / Wales)
Data analysts when facing a forecasting task involving a large number of time series, they regularly employ one of the following two methodological approaches: either select a single forecasting method for the entire dataset (aggregate selection), or use the best forecasting method for each time series (individual selection). There is evidence in the predictive analytics literature that the former is more robust than the latter, as in individual selection you tend to overfit models to the data. A third approach is to firstly identify homogeneous clusters within the dataset, and then select a single forecasting method for each cluster (cluster selection). This research examines the performance of three well-celebrated machine learning clustering methods: k-medoids, k-NN and random forests. We then forecast every cluster with the best possible method, and the performance is compared to that of aggregate selection. The aforementioned methods are very often used for classification tasks, but since in our case there is no set of predefined classes, the methods are used for pure clustering. The evaluation is performed in the 645 yearly series of the M3 competition. The empirical evidence suggests that: a) random forests provide the best clusters for the sequential forecasting task, and b) cluster selection has the potential to outperform aggregate selection.
Keywords: Clustering; k-medoids; Nearest Neighbors; Random Forests; Forecasting (search for similar items in EconPapers)
New Economics Papers: this item is included in nep-big, nep-cmp and nep-for
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1) Track citations by RSS feed
Downloads: (external link)
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
Persistent link: https://EconPapers.repec.org/RePEc:bng:wpaper:19016
Access Statistics for this paper
More papers in Working Papers from Bangor Business School, Prifysgol Bangor University (Cymru / Wales) Contact information at EDIRC.
Bibliographic data for series maintained by Alan Thomas ().