Double-weighted kNN: a simple and efficient variant with embedded feature selection
Almudena Moreno-Ribera and
Aida Calviño ()
Additional contact information
Almudena Moreno-Ribera: Complutense University of Madrid, Department of Statistics and Data Science
Aida Calviño: Complutense University of Madrid, Department of Statistics and Data Science
Journal of Marketing Analytics, 2025, vol. 13, issue 4, No 1, 989-999
Abstract:
Abstract Predictive modeling aims at providing estimates of an unknown variable, the target, from a set of known ones, the input. The k Nearest Neighbors (kNN) is one of the best-known predictive algorithms due to its simplicity and well behavior. However, this class of models has some drawbacks, such as the non-robustness to the existence of irrelevant input features or the need to transform qualitative variables into dummies, with the corresponding loss of information for ordinal ones. In this work, a kNN regression variant, easily adaptable for classification purposes, is suggested. The proposal allows dealing with all types of input variables while embedding feature selection in a simple and efficient manner, reducing the tuning phase. More precisely, making use of the weighted Gower distance, we develop a powerful tool to cope with these inconveniences. Finally, to boost the tool predictive power, a second weighting scheme is added to the neighbors. The proposed method is applied to a collection of 20 data sets, different in size, data type, and distribution of the target variable. Moreover, the results are compared with the previously proposed kNN variants, showing its supremacy, particularly when the weighting scheme is based on non-linear association measures.
Keywords: Gower distance; Ordinal variables; Machine learning; Regression; Weighting scheme (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1057/s41270-024-00302-5 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:pal:jmarka:v:13:y:2025:i:4:d:10.1057_s41270-024-00302-5
Ordering information: This journal article can be ordered from
http://www.springer. ... gement/journal/41270
DOI: 10.1057/s41270-024-00302-5
Access Statistics for this article
Journal of Marketing Analytics is currently edited by Maria Petrescu and Anjala Krishnen
More articles in Journal of Marketing Analytics from Palgrave Macmillan
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().