EconPapers    
Economics at your fingertips  
 

Detecting small clusters in the stochastic block model

Fei Ye (), Jingsong Xiao (), Weidong Ma (), Shiwen Jin () and Ying Yang ()
Additional contact information
Fei Ye: Capital University of Economics and Business
Jingsong Xiao: Tsinghua University
Weidong Ma: Tsinghua University
Shiwen Jin: Tsinghua University
Ying Yang: Tsinghua University

Statistical Papers, 2025, vol. 66, issue 2, No 7, 34 pages

Abstract: Abstract In the study of community detection, the stochastic block model (SBM) is frequently used as an ideal model. Many community detection strategies have been proposed and proved to be consistent under the SBM. However, almost all of these consistencies were established on the common assumption that all communities are balanced. When the communities are not balanced and some communities have small size, those strategies might be less efficient. In this paper, we consider the SBM with small clusters, under which the communities consist of several large clusters that have balanced sizes and some small clusters that have sizes of order smaller than the sizes of large clusters, and propose a two-step method to efficiently detect small clusters as well as the community structure of large clusters. In the first step, to get an initial estimator of the community structure, we treat the nodes in small clusters as outliers and utilize a robust community detecting method to classify the majority of nodes in large clusters correctly. In the second step, we pick out the nodes in small clusters using the entry-wise deviation, and update the community structure. We demonstrate that, under mild conditions, our method can consistently recover the large communities and identify each node in small clusters. Simulation results show that the proposed approach performs well whether the initial estimator is obtained by the semidefinite programming or the regularized spectral clustering. We also illustrate our method on real world networks.

Keywords: Stochastic block model; Community detection; Small clusters; Outlier detection; 62H30 (search for similar items in EconPapers)
Date: 2025
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
http://link.springer.com/10.1007/s00362-025-01660-7 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:stpapr:v:66:y:2025:i:2:d:10.1007_s00362-025-01660-7

Ordering information: This journal article can be ordered from
http://www.springer. ... business/journal/362

DOI: 10.1007/s00362-025-01660-7

Access Statistics for this article

Statistical Papers is currently edited by C. Müller, W. Krämer and W.G. Müller

More articles in Statistical Papers from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-03-20
Handle: RePEc:spr:stpapr:v:66:y:2025:i:2:d:10.1007_s00362-025-01660-7