EconPapers    
Economics at your fingertips  
 

The computing of the Poisson multinomial distribution and applications in ecological inference and machine learning

Zhengzhi Lin, Yueyao Wang and Yili Hong ()
Additional contact information
Zhengzhi Lin: Virginia Tech
Yueyao Wang: Virginia Tech
Yili Hong: Virginia Tech

Computational Statistics, 2023, vol. 38, issue 4, No 13, 1877 pages

Abstract: Abstract The Poisson multinomial distribution (PMD) describes the distribution of the sum of n independent but non-identically distributed random vectors, in which each random vector is of length m with 0/1 valued elements and only one of its elements can take value 1 with a certain probability. Those probabilities are different for the m elements across the n random vectors, and form an $$n \times m$$ n × m matrix with row sum equals to 1. We call this $$n\times m$$ n × m matrix the success probability matrix (SPM). Each SPM uniquely defines a $${ \text {PMD}}$$ PMD . The $${ \text {PMD}}$$ PMD is useful in many areas such as, voting theory, ecological inference, and machine learning. The distribution functions of $${ \text {PMD}}$$ PMD , however, are usually difficult to compute and there is no efficient algorithm available for computing it. In this paper, we develop efficient methods to compute the probability mass function (pmf) for the PMD using multivariate Fourier transform, normal approximation, and simulations. We study the accuracy and efficiency of those methods and give recommendations for which methods to use under various scenarios. We also illustrate the use of the $${ \text {PMD}}$$ PMD via three applications, namely, in ecological inference, uncertainty quantification in classification, and voting probability calculation. We build an R package that implements the proposed methods, and illustrate the package with examples. This paper has online supplementary materials.

Keywords: Aggregated data inference; Classification; Multinomial distribution; Poisson binomial distribution; Political science; Uncertainty quantification (search for similar items in EconPapers)
Date: 2023
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
http://link.springer.com/10.1007/s00180-022-01299-0 Abstract (text/html)
Access to the full text of the articles in this series is restricted.

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:spr:compst:v:38:y:2023:i:4:d:10.1007_s00180-022-01299-0

Ordering information: This journal article can be ordered from
http://www.springer.com/statistics/journal/180/PS2

DOI: 10.1007/s00180-022-01299-0

Access Statistics for this article

Computational Statistics is currently edited by Wataru Sakamoto, Ricardo Cao and Jürgen Symanzik

More articles in Computational Statistics from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().

 
Page updated 2025-04-12
Handle: RePEc:spr:compst:v:38:y:2023:i:4:d:10.1007_s00180-022-01299-0