Algorithmic Collusion under Observed Demand Shocks
Zexin Ye
Papers from arXiv.org
Abstract:
This paper examines how the observability of demand shocks influences pricing patterns and market outcomes when firms delegate pricing decisions to Q-learning algorithms. Simulations show that demand observability induces Q-learning agents to adapt prices to demand fluctuations, giving rise to distinctive demand-contingent pricing patterns across the discount factor $\delta$, consistent with Rotemberg and Saloner (1986). When $\delta$ is high, they learn procyclical pricing, charging higher prices in higher demand states. In contrast, at low $\delta$, they lower prices during booms and raise them during downturns, exhibiting countercyclical pricing. Q-learning agents also autonomously sustain supracompetitive profits, indicating that demand observability does not hinder algorithmic collusion. I further explore how the information available to algorithms shapes their learned pricing behavior. Overall, the results suggest that, through pure trial and error, Q-learning algorithms internalize both the stronger deviation incentives during booms and the trade-off between short-term gains and long-term continuation values governed by the discount factor, thereby reproducing the cyclicality of pricing patterns predicted by collusion theory.
Date: 2025-02, Revised 2025-12
New Economics Papers: this item is included in nep-ain, nep-com, nep-cta, nep-ind and nep-reg
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
http://arxiv.org/pdf/2502.15084 Latest version (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:arx:papers:2502.15084
Access Statistics for this paper
More papers in Papers from arXiv.org
Bibliographic data for series maintained by arXiv administrators ().