EconPapers    
Economics at your fingertips  
 

HOW INDIVIDUALS LEARN TO TAKE TURNS: EMERGENCE OF ALTERNATING COOPERATION IN A CONGESTION GAME AND THE PRISONER'S DILEMMA

Dirk Helbing, Martin Schönhof, Hans-Ulrich Stark and Janusz A. Hołyst
Additional contact information
Dirk Helbing: Institute for Transport & Economics, Dresden University of Technology, Andreas-Schubert-Str. 23, 01062 Dresden, Germany
Martin Schönhof: Institute for Transport & Economics, Dresden University of Technology, Andreas-Schubert-Str. 23, 01062 Dresden, Germany
Hans-Ulrich Stark: Institute for Transport & Economics, Dresden University of Technology, Andreas-Schubert-Str. 23, 01062 Dresden, Germany
Janusz A. Hołyst: Faculty of Physics and Center of Excellence for Complex Systems Research, Warsaw University of Technology, Koszykowa 75, PL-00-662 Warsaw, Poland

Advances in Complex Systems (ACS), 2005, vol. 08, issue 01, 87-116

Abstract: In many social dilemmas, individuals tend to generate a situation with low payoffs instead of a system optimum ("tragedy of the commons"). Is the routing of traffic a similar problem? In order to address this question, we present experimental results on humans playing a route choice game in a computer laboratory, which allow one to study decision behavior in repeated games beyond the Prisoner's Dilemma. We will focus on whether individuals manage to find a cooperative and fair solution compatible with the system-optimal road usage. We find that individuals tend towards a user equilibrium with equal travel times in the beginning. However, after many iterations, they often establish a coherent oscillatory behavior, as taking turns performs better than applying pure or mixed strategies. The resulting behavior is fair and compatible with system-optimal road usage. In spite of the complex dynamics leading to coordinated oscillations, we have identified mathematical relationships quantifying the observed transition process. Our main experimental discoveries for 2- and 4-person games can be explained with a novel reinforcement learning model for an arbitrary number of persons, which is based on past experience and trial-and-error behavior. Gains in the average payoff seem to be an important driving force for the innovation of time-dependent response patterns, i.e. the evolution of more complex strategies. Our findings are relevant for decision support systems and routing in traffic or data networks.

Keywords: Game theory; reinforcement learning; multi-agent simulation (search for similar items in EconPapers)
Date: 2005
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (24)

Downloads: (external link)
http://www.worldscientific.com/doi/abs/10.1142/S0219525905000361
Access to full text is restricted to subscribers

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:wsi:acsxxx:v:08:y:2005:i:01:n:s0219525905000361

Ordering information: This journal article can be ordered from

DOI: 10.1142/S0219525905000361

Access Statistics for this article

Advances in Complex Systems (ACS) is currently edited by Frank Schweitzer

More articles in Advances in Complex Systems (ACS) from World Scientific Publishing Co. Pte. Ltd.
Bibliographic data for series maintained by Tai Tone Lim ().

 
Page updated 2025-03-20
Handle: RePEc:wsi:acsxxx:v:08:y:2005:i:01:n:s0219525905000361