Detecting and Mitigating Shortcut Learning Bias in Machine Learning: A Pathway to More Generalizable ML-based (IS) Research
Matthew Caron (),
Oliver Müller () and
Johannes Kriebel ()
Additional contact information
Matthew Caron: Paderborn University
Oliver Müller: Paderborn University
Johannes Kriebel: University of Hamburg
No 129, Working Papers Dissertations from Paderborn University, Faculty of Business Administration and Economics
Abstract:
Shortcut learning is a critical challenge in machine learning (ML) that arises when models rely on spurious patterns or superficial associations rather than meaningful relationships in the data. While this issue has been widely studied in computer vision and natural language processing, its impact on tabular and categorical data -- i.e., data common in ML-based research within Information Systems (IS) -- remains underexplored. To address this challenge, we propose a two-phase framework: detecting shortcut learning biases through advanced sampling strategies and mitigating these biases using methods like feature exclusion. Additionally, we emphasize the importance of transparent reporting to enhance reproducibility and provide insights into a model’s generalization capabilities. Using simulated and real-world data, we demonstrate the harmful effects of shortcut learning in tabular data. The results highlight how distribution shifts expose shortcut dependencies, a key focus of the detection phase in our framework. These shifts reveal how models relying on shortcuts fail to generalize beyond training data. While our mitigation strategy is exploratory, it demonstrates that addressing shortcut learning is feasible and underscores the need for further research into model-agnostic solutions. By encouraging comprehensive evaluations and transparent reporting, this work aims to advance the generalizability, reproducibility, and reliability of ML-based research in IS.
Keywords: Machine Learning; ML-Based Research; Shortcut Learning; Reproducibility; Generalizability (search for similar items in EconPapers)
JEL-codes: C8 (search for similar items in EconPapers)
Pages: 29
Date: 2025-02
New Economics Papers: this item is included in nep-big and nep-cmp
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
http://groups.uni-paderborn.de/wp-wiwi/RePEc/pdf/dispap/DP129.pdf (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:pdn:dispap:129
Access Statistics for this paper
More papers in Working Papers Dissertations from Paderborn University, Faculty of Business Administration and Economics Contact information at EDIRC.
Bibliographic data for series maintained by WP-WiWi-Info ().