A NEW SPRING FOR STATISTICAL METHODS: LARGE LANGUAGE MODELS (LLMS)
Kutluk Kağan Sümer ()
Additional contact information
Kutluk Kağan Sümer: İstanbul Üniversitesi
Eurasian Eononometrics, Statistics and Emprical Economics Journal, 2026, vol. 26, issue 26, 87-125
Abstract:
Large Language Models (LLMs) are the cornerstone of modern AI systems capable of humanlike reasoning, language understanding, and text generation. Their success relies not only on deep learning architectures but also on a comprehensive statistical foundation. This article provides an extensive examination of statistical techniques underlying LLMs, including probability theory, statistical learning theory, Bayesian inference, Markov chains, the Expectation–Maximization algorithm (EM), dimensionality reduction (PCA, SVD), probabilistic graphical models, variational inference, and sampling methods such as MCMC. It further explains how these methods are integrated within the Transformer architecture and contemporary LLM training pipelines. Applications in natural language processing, healthcare, finance, and law are also explored in detail.
Date: 2026
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
https://eurasianacademy.org/index.php/econstat/article/view/1732 (application/pdf)
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:eas:econst:v:26:y:2025:i:26:p:87-125
DOI: 10.17740/eas.stat.2025-V25-06
Access Statistics for this article
More articles in Eurasian Eononometrics, Statistics and Emprical Economics Journal from Eurasian Academy Of Sciences
Bibliographic data for series maintained by Kutluk Kagan Sumer ().