Learning in Repeated Multiunit Pay-as-Bid Auctions

Galgana, Rigel; Golrezaei, Negin

Learning in Repeated Multiunit Pay-as-Bid Auctions

Rigel Galgana () and Negin Golrezaei ()
Additional contact information
Rigel Galgana: MIT Operations Research Center, Massachusetts Institute of Technology, Cambridge, Massachusetts 02142
Negin Golrezaei: MIT Sloan School of Management, Massachusetts Institute of Technology, Cambridge, Massachusetts 02142

Manufacturing & Service Operations Management, 2025, vol. 27, issue 1, 200-229

Abstract: Problem definition : Motivated by carbon emissions trading schemes (ETSs), Treasury auctions, procurement auctions, and wholesale electricity markets, which all involve the auctioning of homogeneous multiple units, we consider the problem of learning how to bid in repeated multiunit pay-as-bid (PAB) auctions. In each of these auctions, a large number of (identical) items are to be allocated to the largest submitted bids, where the price of each of the winning bids is equal to the bid itself. In this work, we study the problem of optimizing bidding strategies from the perspective of a single bidder. Methodology/results : Effective bidding in PAB auctions is complex due to the combinatorial nature of the action space. We show that a utility decoupling trick enables a polynomial time algorithm to solve the offline problem where competing bids are known in advance. Leveraging this structure, we design efficient algorithms for the online problem under both full information and bandit feedback settings that achieve an upper bound on regret of O ( M T log T ) and O ( M T 2 3 log T ) , respectively, where M is the number of units demanded by the bidder, and T is the total number of auctions. We accompany these results with a regret lower bound of Ω ( M T ) for the full information setting and Ω ( M 2 / 3 T 2 / 3 ) for the bandit setting. We also present additional findings on the characterization of PAB equilibria. Managerial implications : Although the Nash equilibria of PAB auctions possess nice properties such as winning bid uniformity and high welfare and revenue, they are not guaranteed under no-regret learning dynamics. Nevertheless, our simulations suggest that these properties hold anyways, regardless of Nash equilibrium existence. Compared with its uniform price counterpart, the PAB dynamics converge faster and achieve higher revenue, making PAB appealing whenever revenue holds significant social value—for example, ETSs and Treasury auctions.

Keywords: multiunit pay-as-bid auctions; bidding strategies; regret analysis; market dynamics (search for similar items in EconPapers)
Date: 2025
References: Add references at CitEc
Citations: View citations in EconPapers (1)

Downloads: (external link)
http://dx.doi.org/10.1287/msom.2023.0403 (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:inm:ormsom:v:27:y:2025:i:1:p:200-229

Access Statistics for this article

More articles in Manufacturing & Service Operations Management from INFORMS Contact information at EDIRC.
Bibliographic data for series maintained by Chris Asher ().