EconPapers    
Economics at your fingertips  
 

Simple Poisson PCA: an algorithm for (sparse) feature extraction with simultaneous dimension determination

Luke Smallman (), William Underwood and Andreas Artemiou
Additional contact information
Luke Smallman: Cardiff University
William Underwood: University of Oxford
Andreas Artemiou: Cardiff University

Computational Statistics, 2020, vol. 35, issue 2, No 7, 559-577

Abstract: Abstract Dimension reduction tools offer a popular approach to analysis of high-dimensional big data. In this paper, we propose an algorithm for sparse Principal Component Analysis for non-Gaussian data. Since our interest for the algorithm stems from applications in text data analysis we focus on the Poisson distribution which has been used extensively in analysing text data. In addition to sparsity our algorithm is able to effectively determine the desired number of principal components in the model (order determination). The good performance of our proposal is demonstrated with both synthetic and real data examples.

Keywords: L0 penalty; Exponential family; Text data analysis; Dimension reduction (search for similar items in EconPapers)
Date: 2020
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
http://link.springer.com/10.1007/s00180-019-00903-0 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:compst:v:35:y:2020:i:2:d:10.1007_s00180-019-00903-0

Ordering information: This journal article can be ordered from
http://www.springer.com/statistics/journal/180/PS2

DOI: 10.1007/s00180-019-00903-0

Access Statistics for this article

Computational Statistics is currently edited by Wataru Sakamoto, Ricardo Cao and Jürgen Symanzik

More articles in Computational Statistics from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-03-20
Handle: RePEc:spr:compst:v:35:y:2020:i:2:d:10.1007_s00180-019-00903-0