SNN-PDM: An Improved Probability Density Machine Algorithm Based on Shared Nearest Neighbors Clustering Technique
Shiqi Wu,
Hualong Yu,
Yan Gu,
Changbin Shao () and
Shang Gao
Additional contact information
Shiqi Wu: Jiangsu University of Science and Technology
Hualong Yu: Jiangsu University of Science and Technology
Yan Gu: Jiangsu University of Science and Technology
Changbin Shao: Jiangsu University of Science and Technology
Shang Gao: Jiangsu University of Science and Technology
Journal of Classification, 2024, vol. 41, issue 2, No 5, 289-312
Abstract:
Abstract Probability density machine (PDM) is a novel algorithm which was proposed recently for addressing class imbalance learning (CIL) problem. PDM can capture priori data distribution information well and present robust performance in various CIL applications. However, we also note that the PDM is sensitive to CIL data with varying density and/or small disjunctions which means there are two or multiple obvious sub-clusters within the same class, as on this kind of data, the estimation of conditional probability might be extremely inaccurate. To address this problem, we introduce the shared nearest neighbors (SNN) clustering technique into PDM procedure and propose a novel SNN-PDM algorithm. Specifically, the SNN can adapt varying density and capture small disjunctions existing in data distribution well. We evaluate the proposed algorithm on a large amount of CIL datasets, and the results show that the proposed SNN-PDM algorithm outperforms the PDM and several previous methods. Meanwhile, in comparison with PDM, the SNN-PDM has less time consumption.
Keywords: Class imbalance learning; Probability density machine; Shared nearest neighbors; KNN probability density estimation (search for similar items in EconPapers)
Date: 2024
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s00357-024-09474-2 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:jclass:v:41:y:2024:i:2:d:10.1007_s00357-024-09474-2
Ordering information: This journal article can be ordered from
http://www.springer. ... hods/journal/357/PS2
DOI: 10.1007/s00357-024-09474-2
Access Statistics for this article
Journal of Classification is currently edited by Douglas Steinley
More articles in Journal of Classification from Springer, The Classification Society
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().