Menu Pricing of Large Language Models
Dirk Bergemann,
Alessandro Bonatti and
Alex Smolin
Papers from arXiv.org
Abstract:
We develop a framework for the optimal pricing and product design of LLMs in which a provider sells menus of token budgets to users who differ in their valuations across a continuum of tasks. Under a homogeneous production technology, we show that users' high-dimensional type profiles are summarized by a scalar index, reducing the seller's problem to one-dimensional screening. The optimal mechanism takes the form of committed-spend contracts: buyers pay for a budget that they allocate across token classes priced at marginal cost. We extend the analysis to environments with multiple differentiated models and to competition between a proprietary leader and an open-source fringe, showing that competitive pressure reshapes both the intensive and extensive margins of compute provision. Each element of our theory (token-budget menus, maximum- and minimum-spend plans, multi-model versioning, and linear API pricing) has a direct counterpart in the observed pricing practices of providers such as Anthropic, OpenAI, and GitHub.
Date: 2025-02, Revised 2026-03
New Economics Papers: this item is included in nep-cmp, nep-com, nep-des and nep-mic
References: View references in EconPapers View complete reference list from CitEc
Citations: View citations in EconPapers (5)
Downloads: (external link)
http://arxiv.org/pdf/2502.07736 Latest version (application/pdf)
Related works:
Working Paper: Menu Pricing of Large Language Models (2026) 
Working Paper: Menu Pricing of Large Language Models (2026) 
Working Paper: Menu Pricing of Large Language Models (2025) 
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:arx:papers:2502.07736
Access Statistics for this paper
More papers in Papers from arXiv.org
Bibliographic data for series maintained by arXiv administrators ().