EconPapers    
Economics at your fingertips  
 

Computing exact clustering posteriors with subset convolution

Jukka Kohonen and Jukka Corander

Communications in Statistics - Theory and Methods, 2016, vol. 45, issue 10, 3048-3058

Abstract: An exponential-time exact algorithm is provided for the task of clustering n items of data into k clusters. Instead of seeking one partition, posterior probabilities are computed for summary statistics: the number of clusters and pairwise co-occurrence. The method is based on subset convolution and yields the posterior distribution for the number of clusters in O(n3n) operations or O(n32n) using fast subset convolution. Pairwise co-occurrence probabilities are then obtained in O(n32n) operations. This is considerably faster than exhaustive enumeration of all partitions.

Date: 2016
References: Add references at CitEc
Citations:

Downloads: (external link)
http://hdl.handle.net/10.1080/03610926.2014.894070 (text/html)
Access to full text is restricted to subscribers.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:taf:lstaxx:v:45:y:2016:i:10:p:3048-3058

Ordering information: This journal article can be ordered from
http://www.tandfonline.com/pricing/journal/lsta20

DOI: 10.1080/03610926.2014.894070

Access Statistics for this article

Communications in Statistics - Theory and Methods is currently edited by Debbie Iscoe

More articles in Communications in Statistics - Theory and Methods from Taylor & Francis Journals
Bibliographic data for series maintained by Chris Longhurst ().

 
Page updated 2025-03-20
Handle: RePEc:taf:lstaxx:v:45:y:2016:i:10:p:3048-3058