EconPapers    
Economics at your fingertips  
 

VC-PCR: A prediction method based on variable selection and clustering

Rebecca Marion, Johannes Lederer, Bernadette Goevarts and Rainer von Sachs ()
Additional contact information
Rebecca Marion: Université catholique de Louvain, LIDAM/ISBA, Belgium
Johannes Lederer: University of Hamburg
Bernadette Goevarts: Université catholique de Louvain, LIDAM/ISBA, Belgium
Rainer von Sachs: Université catholique de Louvain, LIDAM/ISBA, Belgium

No 2024023, LIDAM Reprints ISBA from Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA)

Abstract: Sparse linear prediction methods suffer from decreased prediction accuracy when the predictor variables have cluster structure (e.g., highly correlated groups of variables). To improve prediction accuracy, various methods have been proposed to identify variable clusters from the data and integrate cluster information into a sparse modeling process. But none of these methods achieve satisfactory performance for prediction, variable selection and variable clustering performed simultaneously. This paper presents Variable Cluster Principal Component Regression (VC-PCR), a prediction method that uses variable selection and variable clustering in order to solve this problem. Experiments with real and simulated data demonstrate that, compared to competitor methods, VC-PCR is the only method that achieves simultaneously good prediction, variable selection, and clustering performance when cluster structure is present.

Keywords: Dimensionality reduction; latent variables; nonnegative matrix factorization; sparsity; variable clustering (search for similar items in EconPapers)
Pages: 25
Date: 2024-08-20
Note: In: Statistica Neerlandica, 2024
References: Add references at CitEc
Citations:

There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:aiz:louvar:2024023

DOI: 10.1111/stan.12358

Access Statistics for this paper

More papers in LIDAM Reprints ISBA from Université catholique de Louvain, Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA) Voie du Roman Pays 20, 1348 Louvain-la-Neuve (Belgium). Contact information at EDIRC.
Bibliographic data for series maintained by Nadja Peiffer ().

 
Page updated 2025-03-19
Handle: RePEc:aiz:louvar:2024023