MASSIVELY DISTRIBUTED CONCEPT DRIFT HANDLING IN LARGE NETWORKS
István Hegedűs (),
Róbert Ormándi () and
Márk Jelasity ()
Additional contact information
István Hegedűs: University of Szeged, Szeged, H-6720, Hungary
Róbert Ormándi: University of Szeged, Szeged, H-6720, Hungary
Márk Jelasity: University of Szeged and Hungarian Academy of Sciences, Szeged, H-6720, Hungary
Advances in Complex Systems (ACS), 2013, vol. 16, issue 04n05, 1-28
Abstract:
Massively distributed data mining in large networks such as smart device platforms and peer-to-peer systems is a rapidly developing research area. One important problem here is concept drift, where global data patterns (movement, preferences, activities, etc.) change according to the actual set of participating users, the weather, the time of day, or as a result of events such as accidents or even natural catastrophes. In an important case — when the network is very large but only a few training samples can be obtained at each node locally — no efficient distributed solution is known that could follow concept drift efficiently. This case is characteristic of smart device platforms where each device stores only one local observation or data record related to a learning problem. Here we present two algorithms to handle concept drift. None of the algorithms collects data to a central location, instead models of the data perform random walks in the network, while being improved using an online learning algorithm. The first algorithm achieves adaptivity by maintaining young as well as old models in the network according to a fixed age distribution. The second one measures the performance of models locally, and discards them if they are judged outdated. We demonstrate through a thorough experimental analysis that our algorithms outperform the known competing methods if the number of independent local samples is limited relative to the speed of drift: a typical scenario in our targeted application domains. The two algorithms have different strengths: while the age distribution approach is very simple and efficient, explicit drift detection can be useful in monitoring applications to trigger control action.
Keywords: Adaptive classification; concept drift; gossip learning; P2P (search for similar items in EconPapers)
Date: 2013
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
http://www.worldscientific.com/doi/abs/10.1142/S0219525913500215
Access to full text is restricted to subscribers
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:wsi:acsxxx:v:16:y:2013:i:04n05:n:s0219525913500215
Ordering information: This journal article can be ordered from
DOI: 10.1142/S0219525913500215
Access Statistics for this article
Advances in Complex Systems (ACS) is currently edited by Frank Schweitzer
More articles in Advances in Complex Systems (ACS) from World Scientific Publishing Co. Pte. Ltd.
Bibliographic data for series maintained by Tai Tone Lim ().