EconPapers    
Economics at your fingertips  
 

A Flexible Multivariate Distribution for Correlated Count Data

Kimberly F. Sellers, Tong Li, Yixuan Wu and Narayanaswamy Balakrishnan
Additional contact information
Kimberly F. Sellers: Department of Mathematics and Statistics, Georgetown University, Washington, DC 20057, USA
Tong Li: Department of Mathematics and Statistics, Georgetown University, Washington, DC 20057, USA
Yixuan Wu: Department of Mathematics and Statistics, Georgetown University, Washington, DC 20057, USA
Narayanaswamy Balakrishnan: Department of Mathematics and Statistics, McMaster University, Hamilton, ON L8S 4K1, Canada

Stats, 2021, vol. 4, issue 2, 1-19

Abstract: Multivariate count data are often modeled via a multivariate Poisson distribution, but it contains an underlying, constraining assumption of data equi-dispersion (where its variance equals its mean). Real data are oftentimes over-dispersed and, as such, consider various advancements of a negative binomial structure. While data over-dispersion is more prevalent than under-dispersion in real data, however, examples containing under-dispersed data are surfacing with greater frequency. Thus, there is a demonstrated need for a flexible model that can accommodate both data types. We develop a multivariate Conway–Maxwell–Poisson (MCMP) distribution to serve as a flexible alternative for correlated count data that contain data dispersion. This structure contains the multivariate Poisson, multivariate geometric, and the multivariate Bernoulli distributions as special cases, and serves as a bridge distribution across these three classical models to address other levels of over- or under-dispersion. In this work, we not only derive the distributional form and statistical properties of this model, but we further address parameter estimation, establish informative hypothesis tests to detect statistically significant data dispersion and aid in model parsimony, and illustrate the distribution’s flexibility through several simulated and real-world data examples. These examples demonstrate that the MCMP distribution performs on par with the multivariate negative binomial distribution for over-dispersed data, and proves particularly beneficial in effectively representing under-dispersed data. Thus, the MCMP distribution offers an effective, unifying framework for modeling over- or under-dispersed multivariate correlated count data that do not necessarily adhere to Poisson assumptions.

Keywords: multivariate Poisson; multivariate Bernoulli; multivariate geometric; Conway-Maxwell–Poisson; confounding; over-dispersion; under-dispersion; dependence (search for similar items in EconPapers)
JEL-codes: C1 C10 C11 C14 C15 C16 (search for similar items in EconPapers)
Date: 2021
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://www.mdpi.com/2571-905X/4/2/21/pdf (application/pdf)
https://www.mdpi.com/2571-905X/4/2/21/ (text/html)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:gam:jstats:v:4:y:2021:i:2:p:21-326:d:536861

Access Statistics for this article

Stats is currently edited by Mrs. Minnie Li

More articles in Stats from MDPI
Bibliographic data for series maintained by MDPI Indexing Manager ().

 
Page updated 2025-03-19
Handle: RePEc:gam:jstats:v:4:y:2021:i:2:p:21-326:d:536861