Machine Learning as a Tool for Hypothesis Generation*
Jens Ludwig and
Sendhil Mullainathan
The Quarterly Journal of Economics, 2024, vol. 139, issue 2, 751-827
Abstract:
While hypothesis testing is a highly formalized activity, hypothesis generation remains largely informal. We propose a systematic procedure to generate novel hypotheses about human behavior, which uses the capacity of machine learning algorithms to notice patterns people might not. We illustrate the procedure with a concrete application: judge decisions about whom to jail. We begin with a striking fact: the defendant’s face alone matters greatly for the judge’s jailing decision. In fact, an algorithm given only the pixels in the defendant’s mug shot accounts for up to half of the predictable variation. We develop a procedure that allows human subjects to interact with this black-box algorithm to produce hypotheses about what in the face influences judge decisions. The procedure generates hypotheses that are both interpretable and novel: they are not explained by demographics (e.g., race) or existing psychology research, nor are they already known (even if tacitly) to people or experts. Though these results are specific, our procedure is general. It provides a way to produce novel, interpretable hypotheses from any high-dimensional data set (e.g., cell phones, satellites, online behavior, news headlines, corporate filings, and high-frequency time series). A central tenet of our article is that hypothesis generation is a valuable activity, and we hope this encourages future work in this largely “prescientific” stage of science.
Date: 2024
References: Add references at CitEc
Citations: View citations in EconPapers (2)
Downloads: (external link)
http://hdl.handle.net/10.1093/qje/qjad055 (application/pdf)
Access to full text is restricted to subscribers.
Related works:
Working Paper: Machine Learning as a Tool for Hypothesis Generation (2023) 
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:oup:qjecon:v:139:y:2024:i:2:p:751-827.
Ordering information: This journal article can be ordered from
https://academic.oup.com/journals
Access Statistics for this article
The Quarterly Journal of Economics is currently edited by Robert J. Barro, Lawrence F. Katz, Nathan Nunn, Andrei Shleifer and Stefanie Stantcheva
More articles in The Quarterly Journal of Economics from President and Fellows of Harvard College
Bibliographic data for series maintained by Oxford University Press ().