The Economics of Large Language Models: Token Allocation, Fine-Tuning and Optimal Pricing
Dirk Bergemann,
Alessandro Bonatti and
Alex Smolin
No 20226, CEPR Discussion Papers from Centre for Economic Policy Research
Abstract:
We develop an economic framework to analyze the optimal pricing and product design of Large Language Models (LLM). Our framework captures several key features of LLMs: variable operational costs of processing input and output tokens; the ability to customize models through fine-tuning; and high-dimensional user heterogeneity in terms of task requirements and error sensitivity. In our model, a monopolistic seller offers multiple versions of LLMs through a menu of products. The optimal pricing structure depends on whether token allocation across tasks is contractible and whether users face scale constraints. Users with similar aggregate value-scale characteristics choose similar levels of fine-tuning and token consumption. The optimal mechanism can be implemented through menus of two-part tariffs, with higher markups for more intensive users. Our results rationalize observed industry practices such as tiered pricing based on model customization and usage levels.
Keywords: Large; Language; Models (search for similar items in EconPapers)
JEL-codes: D47 D82 D83 (search for similar items in EconPapers)
Date: 2025-05
References: Add references at CitEc
Citations:
Downloads: (external link)
https://cepr.org/publications/DP20226 (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:cpr:ceprdp:20226
Ordering information: This working paper can be ordered from
https://cepr.org/publications/DP20226
Access Statistics for this paper
More papers in CEPR Discussion Papers from Centre for Economic Policy Research 33 Great Sutton Street, London EC1V 0DX, UK.
Bibliographic data for series maintained by CEPR ().