Bump hunting through density curvature features
José E. Chacón () and
Javier Fernández Serrano ()
Additional contact information
José E. Chacón: Universidad de Extremadura
Javier Fernández Serrano: Universidad Autónoma de Madrid
TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, 2023, vol. 32, issue 4, No 9, 1275 pages
Abstract:
Abstract Bump hunting deals with finding in sample spaces meaningful data subsets known as bumps. These have traditionally been conceived as modal or concave regions in the graph of the underlying density function. We define an abstract bump construct based on curvature functionals of the probability density. Then, we explore several alternative characterizations involving derivatives up to second order. In particular, a suitable implementation of Good and Gaskins’ original concave bumps is proposed in the multivariate case. Moreover, we bring to exploratory data analysis concepts like the mean curvature and the Laplacian that have produced good results in applied domains. Our methodology addresses the approximation of the curvature functional with a plug-in kernel density estimator. We provide theoretical results that assure the asymptotic consistency of bump boundaries in the Hausdorff distance with affordable convergence rates. We also present asymptotically valid and consistent confidence regions bounding curvature bumps. The theory is illustrated through several use cases in sports analytics with datasets from the NBA, MLB and NFL. We conclude that the different curvature instances effectively combine to generate insightful visualizations.
Keywords: Bump hunting; Concavity; Gaussian curvature; Kernel density derivative estimation; Laplacian; Mean curvature; Primary 62G05; 62G20; 60D05; 62R07 (search for similar items in EconPapers)
Date: 2023
References: Add references at CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s11749-023-00872-z Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:testjl:v:32:y:2023:i:4:d:10.1007_s11749-023-00872-z
Ordering information: This journal article can be ordered from
http://www.springer. ... cs/journal/11749/PS2
DOI: 10.1007/s11749-023-00872-z
Access Statistics for this article
TEST: An Official Journal of the Spanish Society of Statistics and Operations Research is currently edited by Alfonso Gordaliza and Ana F. Militino
More articles in TEST: An Official Journal of the Spanish Society of Statistics and Operations Research from Springer, Sociedad de Estadística e Investigación Operativa
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().