Tail index estimation for discrete heavy-tailed distributions with application to statistical inference for regular Markov chains
Patrice Bertail,
Stephan Clémençon and
Carlos Fernández
Additional contact information
Patrice Bertail: Université Paris Nanterre
Stephan Clémençon: Institut Polytechnique de Paris
Carlos Fernández: Université Paris Nanterre
TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, 2025, vol. 34, issue 3, No 6, 713 pages
Abstract:
It is the purpose of this paper to investigate the issue of estimating the regularity index $\beta > 0$ of a discrete heavy-tailed r.v. $S$, i.e. a r.v. $S$ valued in $\mathbb{N}^*$ such that $\mathbb{P}(S>n)=L(n)\cdot n^{-\beta}$ for all $n\ge 1$, where $L:\mathbb{R}^*_+\rightarrow \mathbb{R}_+$ is a slowly varying function. Such discrete probability laws, sometimes referred to as generalized Zipf’s laws, are commonly used to model rank-size distributions after a preliminary range segmentation in a wide variety of areas such as quantitative linguistics, social sciences or information theory. As a first go, we consider the situation where inference is based on independent copies $S_1, \ldots, S_n$ of the generic variable $S$. The estimator $\widehat{\beta}$ we propose can be derived by means of a suitable reformulation of the regularly varying condition, replacing $S$’s survivor function by its empirical counterpart. Under mild assumptions, a non-asymptotic bound for the deviation between $\widehat{\beta}$ and $\beta$ is established, as well as limit results (consistency and asymptotic normality). Beyond the i.i.d. case, the inference method proposed is extended to the estimation of the regularity index of a regenerative $\beta$-null-recurrent Markov chain. Since the parameter $\beta$ can then be viewed as the tail index of the (regularly varying) distribution of the return time of the chain $X$ to any (pseudo-) regenerative set, in this case, the estimator is constructed from the successive regeneration times. Because the durations between consecutive regeneration times are asymptotically independent, we can prove that the consistency of the proposed estimator is preserved. In addition to the theoretical analysis carried out, simulation results provide empirical evidence of the relevance of the inference technique proposed.
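As an illustration of the idea sketched in the abstract, note that slow variation of $L$ gives $L(cn)/L(n)\rightarrow 1$, hence $\mathbb{P}(S>n)/\mathbb{P}(S>cn)\rightarrow c^{\beta}$ for any fixed $c>1$. Plugging in the empirical survivor function suggests a simple log-ratio estimate of $\beta$. The Python sketch below is only an assumption about what such a reformulation may look like, not necessarily the estimator studied in the paper. The function names, the threshold `n=4`, the ratio `c=2` and the simulated law (with $L\equiv 1$) are all hypothetical choices made for the example.

```python
import math
import random


def sample_discrete_pareto(beta, size, rng):
    """Draw i.i.d. copies of S = ceil(U ** (-1 / beta)), U uniform on (0, 1].

    For every integer n >= 1, S > n iff U ** (-1 / beta) > n, so
    P(S > n) = n ** (-beta): a regularly varying tail with L = 1.
    """
    return [math.ceil((1.0 - rng.random()) ** (-1.0 / beta)) for _ in range(size)]


def empirical_survivor(sample, n):
    """Fraction of observations strictly greater than n."""
    return sum(1 for s in sample if s > n) / len(sample)


def log_ratio_estimate(sample, n, c=2):
    """Illustrative plug-in estimate: log(F(n) / F(c * n)) / log(c),
    where F is the empirical survivor function (not the paper's estimator)."""
    numerator = empirical_survivor(sample, n)
    denominator = empirical_survivor(sample, c * n)
    if numerator == 0 or denominator == 0:
        raise ValueError("threshold too large: no observations in the tail")
    return math.log(numerator / denominator) / math.log(c)


rng = random.Random(0)  # fixed seed so the run is reproducible
true_beta = 1.5
sample = sample_discrete_pareto(true_beta, 200_000, rng)
estimate = log_ratio_estimate(sample, n=4, c=2)
print(f"true beta = {true_beta}, estimate = {estimate:.3f}")
```

In this toy setting $L$ is constant, so any threshold works; with a non-trivial slowly varying $L$, the threshold `n` would have to grow with the sample size, and its choice trades bias against variance.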
Keywords: Generalized discrete Pareto distribution; Nonparametric estimation; Null-recurrent Markov chain; Regularity index; Zipf’s law; 60K35
Date: 2025
Downloads: (external link)
http://link.springer.com/10.1007/s11749-025-00975-9 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Persistent link: https://EconPapers.repec.org/RePEc:spr:testjl:v:34:y:2025:i:3:d:10.1007_s11749-025-00975-9
Ordering information: This journal article can be ordered from
http://www.springer. ... cs/journal/11749/PS2
DOI: 10.1007/s11749-025-00975-9
TEST: An Official Journal of the Spanish Society of Statistics and Operations Research is currently edited by Alfonso Gordaliza and Ana F. Militino
More articles in TEST: An Official Journal of the Spanish Society of Statistics and Operations Research from Springer, Sociedad de Estadística e Investigación Operativa
Bibliographic data for series maintained by Sonal Shukla and Springer Nature Abstracting and Indexing.