EconPapers    
Economics at your fingertips  
 

Reddit's 'pulse' on US inflation: forecasting with large language models

Andrea Del Monaco (), Luigi Longo (), Juri Marcucci () and Irene Tafani ()
Additional contact information
Andrea Del Monaco: Bank of Italy
Luigi Longo: JRC - European Commission
Juri Marcucci: Bank of Italy
Irene Tafani: IMT School Lucca

No 1028, Questioni di Economia e Finanza (Occasional Papers) from Bank of Italy, Economic Research and International Relations Area

Abstract: We show that large language models (LLMs) can transform Reddit discussions into timely predictors of US inflation. Using inflation-related submissions and time-local comments from major economics-focused subreddits, we construct monthly narrative indicators that capture perceived price dynamics. Signals are generated by fine-tuning pre-trained models (BERT-, Qwen-, LLaMA-, and Gemma-type architectures) for labels produced by human annotators and ChatGPT and benchmarked against a non-fine-tuned LLaMA-70B model. Forecasting and nowcasting are implemented in pseudo-real time with strictly backward-looking transformations, recursive expanding windows, and explicit data-availability constraints. In a recursive pseudo out-of-sample evaluation with horizons up to 18 months, Reddit-LLM models and MSE-weighted forecast combinations improve point and density forecasts of headline CPI and core PCE relative to standard benchmarks, including autoregressive models augmented with Michigan survey expectations and inflation swaps. In real-time nowcasting, Reddit signals constructed using information available early in the month improve nowcasts and perform competitively with the Cleveland Fed Inflation Nowcast. Importantly, much of the predictive content can be captured with fine-tuned small language models (SLMs), which often deliver performances close to those of much larger LLMs at a fraction of the computational cost, supporting scalable and resource-efficient deployment.

Keywords: economic forecasting; social media; Reddit; inflation; text mining; text-as-data; text analysis; natural language processing; sentiment analysis; Big Data; large language models; ChatGPT; generative artificial intelligence (search for similar items in EconPapers)
JEL-codes: C32 C53 C55 E31 (search for similar items in EconPapers)
Date: 2026-06
References: Add references at CitEc
Citations:

Downloads: (external link)
https://www.bancaditalia.it/pubblicazioni/qef/2026-1028/QEF_1028_26.pdf (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:bdi:opques:qef_1028_26

Access Statistics for this paper

More papers in Questioni di Economia e Finanza (Occasional Papers) from Bank of Italy, Economic Research and International Relations Area Contact information at EDIRC.
Bibliographic data for series maintained by ().

 
Page updated 2026-06-19
Handle: RePEc:bdi:opques:qef_1028_26