The Forest or the Trees? Tackling Simpson's Paradox with Classification Trees
Galit Shmueli and
Inbal Yahav
Production and Operations Management, 2018, vol. 27, issue 4, 696-716
Abstract:
Studying causal effects is central to research in operations management in manufacturing and services, from evaluating prevention procedures, to effects of policies and new operational technologies and practices. The growing availability of micro†level data creates challenges for researchers and decision makers in terms of choosing the right level of data aggregation for inference and decisions. Simpson's paradox describes the case where the direction of a causal effect is reversed in the aggregated data compared to the disaggregated data. Detecting whether Simpson's paradox occurs in a dataset used for decision making is therefore critical. This study introduces the use of Classification and Regression Trees for automated detection of potential Simpson's paradoxes in data with few or many potential confounding variables, and even with large samples (big data). Our approach relies on the tree structure and the location of the cause vs. the confounders in the tree. We discuss theoretical and computational aspects of the approach and illustrate it using several real applications in e†governance and healthcare.
Date: 2018
References: Add references at CitEc
Citations: View citations in EconPapers (7)
Downloads: (external link)
https://doi.org/10.1111/poms.12819
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:bla:popmgt:v:27:y:2018:i:4:p:696-716
Ordering information: This journal article can be ordered from
http://onlinelibrary ... 1111/(ISSN)1937-5956
Access Statistics for this article
Production and Operations Management is currently edited by Kalyan Singhal
More articles in Production and Operations Management from Production and Operations Management Society
Bibliographic data for series maintained by Wiley Content Delivery ().