EconPapers    
Economics at your fingertips  
 

Feature Screening for Ultrahigh Dimensional Mixed Data via Wasserstein Distance

Bing Tian and Hong Wang

International Statistical Review, 2025, vol. 93, issue 2, 267-287

Abstract: This article develops a novel feature screening procedure for ultrahigh dimensional mixed data based on Wasserstein distance, termed as Wasserstein‐SIS. To handle the mixture of continuous and discrete data, we use Wasserstein distance as a new marginal utility to measure the difference between the joint distribution and the product of marginal distributions. In theory, we establish the sure screening property under less restrictive assumptions on data types. The proposed procedure does not require model specification, gives a more effective geometric measure to compare the discrepancy between distributions and avoids introducing biases caused by the choice of slicing rules for continuous data. Numerical comparison indicates that the proposed Wasserstein‐SIS method performs better than existing methods in various models. A real data application also validates the better practicability of Wasserstein‐SIS.

Date: 2025
References: Add references at CitEc
Citations:

Downloads: (external link)
https://doi.org/10.1111/insr.12609

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:bla:istatr:v:93:y:2025:i:2:p:267-287

Ordering information: This journal article can be ordered from
http://www.blackwell ... bs.asp?ref=0306-7734

Access Statistics for this article

International Statistical Review is currently edited by Eugene Seneta and Kees Zeelenberg

More articles in International Statistical Review from International Statistical Institute Contact information at EDIRC.
Bibliographic data for series maintained by Wiley Content Delivery ().

 
Page updated 2025-08-06
Handle: RePEc:bla:istatr:v:93:y:2025:i:2:p:267-287