EconPapers    
Economics at your fingertips  
 

Distributed document clustering algorithms: a recent survey

J.E. Judith and J. Jayakumari

International Journal of Enterprise Network Management, 2015, vol. 6, issue 3, 207-221

Abstract: Distributed data mining paradigm is an active research area due to the enormous volume of data that are to be processed from across a wide cluster of data nodes. Document clustering algorithms are widely applied in a variety of distributed environments like peer-to-peer networks, wireless sensor networks, etc. This paper entails a comprehensive review on most of the recent distributed document clustering algorithms that is ultimately making massive impacts on the technological realm. These algorithms are analysed based on few pivotal elements such as clustering quality, scale-up, speed-up and accuracy. Recent advances in technology have developed MapReduce-based distributed document clustering algorithms, which show dramatic improvements in the aforementioned analytical elements. Based on the review, intelligent discussions are presented for algorithm development and implementation.

Keywords: distributed documents; document clustering; speed-up; scale-up; MapReduce; clustering algorithms; data mining. (search for similar items in EconPapers)
Date: 2015
References: Add references at CitEc
Citations:

Downloads: (external link)
http://www.inderscience.com/link.php?id=71134 (text/html)
Access to full text is restricted to subscribers.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:ids:ijenma:v:6:y:2015:i:3:p:207-221

Access Statistics for this article

More articles in International Journal of Enterprise Network Management from Inderscience Enterprises Ltd
Bibliographic data for series maintained by Sarah Parker ().

 
Page updated 2025-03-19
Handle: RePEc:ids:ijenma:v:6:y:2015:i:3:p:207-221