Investigating the impact of undersampling and bagging: an empirical investigation for customer attrition modeling
Arno Caigny (),
Kristof Coussement (),
Matthijs Meire () and
Steven Hoornaert ()
Additional contact information
Arno Caigny: Univ. Lille, CNRS, UMR 9221 - LEM - Lille Economie Management
Kristof Coussement: Univ. Lille, CNRS, UMR 9221 - LEM - Lille Economie Management
Matthijs Meire: Univ. Lille, CNRS, UMR 9221 - LEM - Lille Economie Management
Steven Hoornaert: Univ. Lille, CNRS, UMR 9221 - LEM - Lille Economie Management
Annals of Operations Research, 2025, vol. 346, issue 3, No 16, 2421 pages
Abstract:
Abstract Given the growing interest in using AI and analytics to support CRM decision making, we discuss why undersampling and bagging are popular prediction techniques in customer churn prediction (CCP). The former helps in tackling the class imbalance problem and the latter improves model stability. However, extant CCP literature is unclear on the impact of undersampling on model stability and predictive performance, while bagging has difficulties in handling the class imbalance problem. Therefore, we extend existing CCP research to benchmark underbagging, which combines undersampling and bagging. Having both prediction techniques combined we recuperate customer data that would have been lost in undersampling by using them in multiple bags and passing an undersampled, more balanced training set to the classifier. In an extensive experiment including 11 real-life CCP datasets, underbagging is benchmarked against its constituents and other popular CCP classifiers in terms of predictive performance, profit and operational efficiency. Our results indicate that underbagging is a valid and reliable alternative framework for CCP prediction.
Keywords: Business analytics; Customer relationship management; Data science; Underbagging; Logistic regression (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s10479-025-06516-9 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:annopr:v:346:y:2025:i:3:d:10.1007_s10479-025-06516-9
Ordering information: This journal article can be ordered from
http://www.springer.com/journal/10479
DOI: 10.1007/s10479-025-06516-9
Access Statistics for this article
Annals of Operations Research is currently edited by Endre Boros
More articles in Annals of Operations Research from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().