EconPapers    
Economics at your fingertips  
 

Improving SVM Classification on Imbalanced Datasets by Introducing a New Bias

Haydemar Núñez (), Luis Gonzalez-Abril and Cecilio Angulo
Additional contact information
Haydemar Núñez: Universidad Central de Venezuela
Luis Gonzalez-Abril: Universidad de Sevilla
Cecilio Angulo: Technical University of Catalonia

Journal of Classification, 2017, vol. 34, issue 3, No 5, 427-443

Abstract: Abstract Support Vector Machine (SVM) learning from imbalanced datasets, as well as most learning machines, can show poor performance on the minority class because SVMs were designed to induce a model based on the overall error. To improve their performance in these kind of problems, a low-cost post-processing strategy is proposed based on calculating a new bias to adjust the function learned by the SVM. The proposed bias will consider the proportional size between classes in order to improve performance on the minority class. This solution avoids not only introducing and tuning new parameters, but also modifying the standard optimization problem for SVM training. Experimental results on 34 datasets, with different degrees of imbalance, show that the proposed method actually improves the classification on imbalanced datasets, by using standardized error measures based on sensitivity and g-means. Furthermore, its performance is comparable to well-known cost-sensitive and Synthetic Minority Over-sampling Technique (SMOTE) schemes, without adding complexity or computational costs.

Keywords: Support Vector Machine; Post-processing; Bias; Cost-sensitive strategy; SMOTE (search for similar items in EconPapers)
Date: 2017
References: View complete reference list from CitEc
Citations: View citations in EconPapers (4)

Downloads: (external link)
http://link.springer.com/10.1007/s00357-017-9242-x Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:jclass:v:34:y:2017:i:3:d:10.1007_s00357-017-9242-x

Ordering information: This journal article can be ordered from
http://www.springer. ... hods/journal/357/PS2

DOI: 10.1007/s00357-017-9242-x

Access Statistics for this article

Journal of Classification is currently edited by Douglas Steinley

More articles in Journal of Classification from Springer, The Classification Society
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-03-20
Handle: RePEc:spr:jclass:v:34:y:2017:i:3:d:10.1007_s00357-017-9242-x