Economics at your fingertips  

New Method Of Variable Selection For Binary Data Cluster Analysis

Jerzy Korzeniewski ()

Statistics in Transition new series, 2016, vol. 17, issue 2, 295-304

Abstract: Cluster analysis of binary data is a relatively poorly developed task in comparison with cluster analysis for data measured on stronger scales. For example, at the stage of variable selection one can use many methods arranged for arbitrary measurement scales but the results are usually of poor quality. In practice, the only methods dedicated for variable selection for binary data are the ones proposed by Brusco (2004), Dash et al. (2000) and Talavera (2000). In this paper the efficiency of these methods will be discussed with reference to the marketing type data of Dimitriadou et al. (2002). Moreover, the primary objective is a new proposal of variable selection method based on connecting the filtering of the input set of all variables with grouping of sets of variables similar with respect to similar groupings of objects. The new method is an attempt to link good features of two entirely different approaches to variable selection in cluster analysis, i.e. filtering methods and wrapper methods. The new method of variable selection returns best results when the classical k-means method of objects grouping is slightly modified.

Keywords: cluster analysis; market segmentation; selection of variables; binary data; k-means grouping (search for similar items in EconPapers)
Date: 2016
References: View references in EconPapers View complete reference list from CitEc
Citations: Track citations by RSS feed

Downloads: (external link) (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link:

Access Statistics for this article

Statistics in Transition new series is currently edited by Włodzimierz Okrasa

More articles in Statistics in Transition new series from Główny Urząd Statystyczny (Polska) Contact information at EDIRC.
Bibliographic data for series maintained by Beata Witek ().

Page updated 2023-04-12
Handle: RePEc:csb:stintr:v:17:y:2016:i:2:p:295-304