EconPapers    
Economics at your fingertips  
 

Spectral Clustering of Mixed-Type Data

Felix Mbuga and Cristina Tortora
Additional contact information
Felix Mbuga: Department of Mathematics and Statistics, San José State University, San Jose, CA 95116, USA
Cristina Tortora: Department of Mathematics and Statistics, San José State University, San Jose, CA 95116, USA

Stats, 2021, vol. 5, issue 1, 1-11

Abstract: Cluster analysis seeks to assign objects with similar characteristics into groups called clusters so that objects within a group are similar to each other and dissimilar to objects in other groups. Spectral clustering has been shown to perform well in different scenarios on continuous data: it can detect convex and non-convex clusters, and can detect overlapping clusters. However, the constraint on continuous data can be limiting in real applications where data are often of mixed-type, i.e., data that contains both continuous and categorical features. This paper looks at extending spectral clustering to mixed-type data. The new method replaces the Euclidean-based similarity distance used in conventional spectral clustering with different dissimilarity measures for continuous and categorical variables. A global dissimilarity measure is than computed using a weighted sum, and a Gaussian kernel is used to convert the dissimilarity matrix into a similarity matrix. The new method includes an automatic tuning of the variable weight and kernel parameter. The performance of spectral clustering in different scenarios is compared with that of two state-of-the-art mixed-type data clustering methods, k -prototypes and KAMILA, using several simulated and real data sets.

Keywords: cluster analysis; spectral clustering; mixed-type data (search for similar items in EconPapers)
JEL-codes: C1 C10 C11 C14 C15 C16 (search for similar items in EconPapers)
Date: 2021
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
https://www.mdpi.com/2571-905X/5/1/1/pdf (application/pdf)
https://www.mdpi.com/2571-905X/5/1/1/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jstats:v:5:y:2021:i:1:p:1-11:d:709232

Access Statistics for this article

Stats is currently edited by Mrs. Minnie Li

More articles in Stats from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jstats:v:5:y:2021:i:1:p:1-11:d:709232