Tail index estimation for discrete heavy-tailed distributions with application to statistical inference for regular markov chains

Bertail, Patrice; Clémençon, Stephan; Fernández, Carlos

Patrice Bertail, Stephan Clémençon and Carlos Fernández
Additional contact information
Patrice Bertail: Université Paris Nanterre
Stephan Clémençon: Institut Polytechnique de Paris
Carlos Fernández: Université Paris Nanterre

TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, 2025, vol. 34, issue 3, No 6, 713 pages

Abstract: This paper investigates the estimation of the regularity index $$\beta >0$$ of a discrete heavy-tailed random variable S, i.e. a random variable S valued in $$\mathbb {N}^*$$ such that $$\mathbb {P}(S>n)=L(n)\cdot n^{-\beta }$$ for all $$n\ge 1$$, where $$L:\mathbb {R}^*_+\rightarrow \mathbb {R}_+$$ is a slowly varying function. Such discrete probability laws, sometimes referred to as generalized Zipf’s laws, are commonly used to model rank-size distributions after a preliminary range segmentation in a wide variety of areas, such as quantitative linguistics, social sciences or information theory. We first consider the situation where inference is based on independent copies $$S_1,\; \ldots ,\; S_n$$ of the generic variable S. The proposed estimator $$\widehat{\beta }$$ is derived from a suitable reformulation of the regular variation condition, replacing the survivor function of S by its empirical counterpart. Under mild assumptions, a non-asymptotic bound for the deviation between $$\widehat{\beta }$$ and $$\beta $$ is established, as well as limit results (consistency and asymptotic normality). Beyond the i.i.d. case, the proposed inference method is extended to the estimation of the regularity index of a regenerative $$\beta $$-null-recurrent Markov chain. Since the parameter $$\beta $$ can then be viewed as the tail index of the (regularly varying) distribution of the return time of the chain X to any (pseudo-)regenerative set, the estimator is constructed in this case from the successive regeneration times. Because the durations between consecutive regeneration times are asymptotically independent, we can prove that the consistency of the proposed estimator is preserved. In addition to the theoretical analysis, simulation results provide empirical evidence of the relevance of the proposed inference technique.
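
The abstract does not state the estimator explicitly, so the following is only a minimal sketch of one way to plug the empirical survivor function into the regular variation condition; it should not be read as the authors' exact construction. It relies on the fact that, because L is slowly varying, the ratio $$\mathbb {P}(S>e^{k})/\mathbb {P}(S>e^{k+1})$$ tends to $$e^{\beta }$$ as k grows, so the logarithm of the empirical ratio approximates $$\beta $$. The function names, the threshold parameter `k` and the simulated discrete Pareto sample are all assumptions introduced for illustration.

```python
import math
import random


def empirical_survival(sample, x):
    """Fraction of observations strictly greater than x."""
    return sum(1 for s in sample if s > x) / len(sample)


def tail_index_estimate(sample, k):
    """Log-ratio of the empirical survivor function at exp(k) and exp(k + 1).

    Illustrative plug-in estimate of beta (an assumption, not the paper's
    stated formula). The choice of k is left to the user.
    """
    lower = empirical_survival(sample, math.exp(k))
    upper = empirical_survival(sample, math.exp(k + 1))
    if upper == 0.0:
        raise ValueError("no observations beyond exp(k + 1); choose a smaller k")
    return math.log(lower) - math.log(upper)


def draw_discrete_pareto(beta, size, rng):
    """Draw S = ceil(U ** (-1 / beta)), so that P(S > n) = n ** (-beta) for n >= 1."""
    # 1 - rng.random() lies in (0, 1], which avoids raising zero to a negative power.
    return [math.ceil((1.0 - rng.random()) ** (-1.0 / beta)) for _ in range(size)]


# Usage on simulated data with a known index (hypothetical settings).
rng = random.Random(0)
sample = draw_discrete_pareto(beta=1.5, size=200_000, rng=rng)
print(tail_index_estimate(sample, k=2))
```

Because S takes integer values, the thresholds are effectively rounded down to integers, so even in this exact power-law example the estimate carries a small discretization bias at moderate k; larger k reduces that bias at the cost of fewer tail observations.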

Keywords: Generalized discrete Pareto distribution; Nonparametric estimation; Null-recurrent Markov chain; Regularity index; Zipf’s law; 60K35
Date: 2025

Downloads: (external link)
http://link.springer.com/10.1007/s11749-025-00975-9 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Persistent link: https://EconPapers.repec.org/RePEc:spr:testjl:v:34:y:2025:i:3:d:10.1007_s11749-025-00975-9

Ordering information: This journal article can be ordered from
http://www.springer. ... cs/journal/11749/PS2

DOI: 10.1007/s11749-025-00975-9

TEST: An Official Journal of the Spanish Society of Statistics and Operations Research is currently edited by Alfonso Gordaliza and Ana F. Militino

More articles in TEST: An Official Journal of the Spanish Society of Statistics and Operations Research from Springer, Sociedad de Estadística e Investigación Operativa
Bibliographic data for series maintained by Sonal Shukla and Springer Nature Abstracting and Indexing.