An Efficient Grid-Based K-Prototypes Algorithm for Sustainable Decision-Making on Spatial Objects
Hong-Jun Jang,
Byoungwook Kim,
Jongwan Kim and
Soon-Young Jung
Additional contact information
Hong-Jun Jang: Department of Computer Science and Engineering, Korea University, Seoul 02841, Korea
Byoungwook Kim: Department of Computer Engineering, Dongguk University, Gyeongju 38066, Korea
Jongwan Kim: Smith Liberal Arts College, Sahmyook University, Seoul 01795, Korea
Soon-Young Jung: Department of Computer Science and Engineering, Korea University, Seoul 02841, Korea
Sustainability, 2018, vol. 10, issue 8, 1-20
Abstract:
Data mining plays a critical role in sustainable decision-making. Although the k-prototypes algorithm is one of the best-known algorithms for clustering both numeric and categorical data, clustering a large number of spatial objects with mixed numeric and categorical attributes is still inefficient due to complexity. In this paper, we propose an efficient grid-based k-prototypes algorithm, GK-prototypes, which achieves high performance for clustering spatial objects. The first proposed algorithm utilizes both maximum and minimum distance between cluster centers and a cell, which can reduce unnecessary distance calculation. The second proposed algorithm as an extension of the first proposed algorithm, utilizes spatial dependence; spatial data tends to be similar to objects that are close. Each cell has a bitmap index which stores the categorical values of all objects within the same cell for each attribute. This bitmap index can improve performance if the categorical data is skewed. Experimental results show that the proposed algorithms can achieve better performance than the existing pruning techniques of the k-prototypes algorithm.
Keywords: clustering; spatial data; grid-based k-prototypes; data mining; sustainability (search for similar items in EconPapers)
JEL-codes: O13 Q Q0 Q2 Q3 Q5 Q56 (search for similar items in EconPapers)
Date: 2018
References: View complete reference list from CitEc
Citations: View citations in EconPapers (2)
Downloads: (external link)
https://www.mdpi.com/2071-1050/10/8/2614/pdf (application/pdf)
https://www.mdpi.com/2071-1050/10/8/2614/ (text/html)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:gam:jsusta:v:10:y:2018:i:8:p:2614-:d:159983
Access Statistics for this article
Sustainability is currently edited by Ms. Alexandra Wu
More articles in Sustainability from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().