EconPapers    
Economics at your fingertips  
 

GOLFS: feature selection via combining both global and local information for high dimensional clustering

Zhaoyu Xing, Yang Wan, Juan Wen () and Wei Zhong ()
Additional contact information
Zhaoyu Xing: Xiamen University
Yang Wan: ByteDance Ltd.
Juan Wen: Xiamen University
Wei Zhong: Xiamen University

Computational Statistics, 2024, vol. 39, issue 5, No 10, 2675 pages

Abstract: Abstract It is important to identify the discriminative features for high dimensional clustering. However, due to the lack of cluster labels, the regularization methods developed for supervised feature selection can not be directly applied. To learn the pseudo labels and select the discriminative features simultaneously, we propose a new unsupervised feature selection method, named GlObal and Local information combined Feature Selection (GOLFS), for high dimensional clustering problems. The GOLFS algorithm combines both local geometric structure via manifold learning and global correlation structure of samples via regularized self-representation to select the discriminative features. The combination improves the accuracy of both feature selection and clustering by exploiting more comprehensive information. In addition, an iterative algorithm is proposed to solve the optimization problem and the convergency is proved. Simulations and two real data applications demonstrate the excellent finite-sample performance of GOLFS on both feature selection and clustering.

Keywords: Feature selection; High dimensionality; $$l_{2{; }1}$$ l 2; 1 -norm; Manifold learning; Regularized self-representation; Spectral clustering (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
http://link.springer.com/10.1007/s00180-023-01393-x Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:compst:v:39:y:2024:i:5:d:10.1007_s00180-023-01393-x

Ordering information: This journal article can be ordered from
http://www.springer.com/statistics/journal/180/PS2

DOI: 10.1007/s00180-023-01393-x

Access Statistics for this article

Computational Statistics is currently edited by Wataru Sakamoto, Ricardo Cao and Jürgen Symanzik

More articles in Computational Statistics from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-03-20
Handle: RePEc:spr:compst:v:39:y:2024:i:5:d:10.1007_s00180-023-01393-x