EconPapers    
Economics at your fingertips  
 

Reproducible Feature Selection for High-Dimensional Measurement Error Models

Xin Zhou (), Yang Li (), Zemin Zheng (), Jie Wu () and Jiarui Zhang ()
Additional contact information
Xin Zhou: International Institute of Finance, School of Management, University of Science and Technology of China, Hefei 230026, China
Yang Li: International Institute of Finance, School of Management, University of Science and Technology of China, Hefei 230026, China
Zemin Zheng: International Institute of Finance, School of Management, University of Science and Technology of China, Hefei 230026, China
Jie Wu: School of Big Data and Statistics, Anhui University, Hefei 230601, China
Jiarui Zhang: Department of Mathematics, Hong Kong University of Science and Technology, Hong Kong 999077, China

INFORMS Journal on Computing, 2025, vol. 37, issue 5, 1350-1368

Abstract: The literature has witnessed an upsurge of interest in dealing with corrupted data in diverse operations research and optimization applications. Despite the substantial progress of feature selection, how to control the false discovery rate (FDR) under measurement errors remains largely unexplored, especially in the knockoffs framework. In this paper, we extend the recently developed knockoff procedures designed for clean data sets to deal with corrupted data. To be specific, we propose a new method called the double projection knockoff filter (DP-knockoff) for reproducible feature selection under additive measurement errors in the high-dimensional setup. Our key contribution is to show that the FDR of the proposed DP-knockoff can be asymptotically controlled within a user-specified level. This is nontrivial because there is no way to obtain the exact knockoff copies due to the unobservable measurement errors. We address this issue by resorting to certain bias-corrected test statistics. Our numerical studies and real data analysis demonstrate the effectiveness of the proposed procedure.

Keywords: measurement errors; false discovery rates; high dimensionality; feature selection; knockoffs; the nearest positive semi-definite projection (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:

Downloads: (external link)
http://dx.doi.org/10.1287/ijoc.2023.0282 (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:inm:orijoc:v:37:y:2025:i:5:p:1350-1368

Access Statistics for this article

More articles in INFORMS Journal on Computing from INFORMS Contact information at EDIRC.
Bibliographic data for series maintained by Chris Asher ().

 
Page updated 2025-10-15
Handle: RePEc:inm:orijoc:v:37:y:2025:i:5:p:1350-1368