EconPapers    
Economics at your fingertips  
 

A commensurate univariate variable ranking method for classification

Nuo Xu, Xuan Huang, Thanh Nguyen and Jake Y. Chen

International Journal of Data Science, 2025, vol. 10, issue 2, 175-194

Abstract: To apply a variable ranking method for feature selection in classification, the notion of commensurateness is necessitated by the presence of different types of independent variables in a dataset. A commensurate ranking method is one that produces consistent and comparable ranking results among independent variables of different types, such as numeric vs. categorical and discrete vs. continuous. We invent a ranking method named conditional empirical expectation (CEE) and demonstrate it is the most commensurate among several representative ranking methods. Further, it has the highest statistical power as a test of independence when the categorical dependent variable is imbalanced. These properties make CEE uniquely suitable for fast feature selection for any datasets, especially those with high dimensionality of mixed types of variables. Its usage is demonstrated with a case study in facilitating preprocessing for classification.

Keywords: variable types; variable ranking; variable relevance; commensurate; statistical dependence. (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:

Downloads: (external link)
http://www.inderscience.com/link.php?id=149831 (text/html)
Access to full text is restricted to subscribers.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:ids:ijdsci:v:10:y:2025:i:2:p:175-194

Access Statistics for this article

More articles in International Journal of Data Science from Inderscience Enterprises Ltd
Bibliographic data for series maintained by Sarah Parker ().

 
Page updated 2025-11-18
Handle: RePEc:ids:ijdsci:v:10:y:2025:i:2:p:175-194