Finding Groups in Large Data Sets
Adrian Muller
No 02-18, CEPE Working paper series from CEPE Center for Energy Policy and Economics, ETH Zurich
Abstract:
This paper gives an overview of methods for finding groups in large data sets, such as household expenditure survey data. The methods fall into three categories: cluster analysis, dimension reduction, and basic exploratory methods. The emphasis is on a critical analysis of these methods and their potential drawbacks, especially the inputs that the researcher has to provide. Such inputs may impose structure that is not present in the data, thus defeating the purpose of revealing intrinsic patterns. In general, the more elaborate methods, such as cluster analysis, are delicate to apply, especially in the social sciences. Often it may be best to limit oneself to more transparent approaches, such as comparisons of basic statistics.
Pages: 20
Date: 2002-10
Citations: 4 (EconPapers)
Downloads:
http://www.cepe.ethz.ch/publications/workingPapers/CEPE_WP18.pdf (application/pdf)
Persistent link: https://EconPapers.repec.org/RePEc:cee:wpcepe:02-18