Generalization guides human exploration in vast decision spaces
Charley M. Wu (),
Eric Schulz,
Maarten Speekenbrink,
Jonathan D. Nelson and
Björn Meder
Additional contact information
Charley M. Wu: Max Planck Institute for Human Development
Eric Schulz: Harvard University
Maarten Speekenbrink: University College London
Jonathan D. Nelson: University of Surrey
Björn Meder: Max Planck Institute for Human Development
Nature Human Behaviour, 2018, vol. 2, issue 12, 915-924
Abstract:
Abstract From foraging for food to learning complex games, many aspects of human behaviour can be framed as a search problem with a vast space of possible actions. Under finite search horizons, optimal solutions are generally unobtainable. Yet, how do humans navigate vast problem spaces, which require intelligent exploration of unobserved actions? Using various bandit tasks with up to 121 arms, we study how humans search for rewards under limited search horizons, in which the spatial correlation of rewards (in both generated and natural environments) provides traction for generalization. Across various different probabilistic and heuristic models, we find evidence that Gaussian process function learning—combined with an optimistic upper confidence bound sampling strategy—provides a robust account of how people use generalization to guide search. Our modelling results and parameter estimates are recoverable and can be used to simulate human-like performance, providing insights about human behaviour in complex environments.
Date: 2018
References: Add references at CitEc
Citations: View citations in EconPapers (14)
Downloads: (external link)
https://www.nature.com/articles/s41562-018-0467-4 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:nat:nathum:v:2:y:2018:i:12:d:10.1038_s41562-018-0467-4
Ordering information: This journal article can be ordered from
https://www.nature.com/nathumbehav/
DOI: 10.1038/s41562-018-0467-4
Access Statistics for this article
Nature Human Behaviour is currently edited by Stavroula Kousta
More articles in Nature Human Behaviour from Nature
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().