EconPapers    
Economics at your fingertips  
 

An approximate-copula distribution for statistical modeling

Sarah S Ji, Benjamin B Chu, Hua Zhou and Kenneth Lange

PLOS Computational Biology, 2026, vol. 22, issue 3, 1-21

Abstract: Copulas, generalized estimating equations, and generalized linear mixed models promote the analysis of grouped data where non-normal responses are correlated. Unfortunately, parameter estimation remains challenging in these three frameworks. Based on prior work of Tonda, we derive a new class of probability density functions that allow explicit calculation of moments, marginal and conditional distributions, and the score and observed information needed in maximum likelihood estimation. We also illustrate how the new distribution flexibly models longitudinal data following a non-Gaussian distribution. Finally, we conduct a tri-variate genome-wide association analysis on dichotomized systolic and diastolic blood pressure and body mass index data from the UK-Biobank, showcasing the modeling potential and computational scalability of the new distributional family.Author summary: Modeling correlated responses is computationally challenging beyond the Gaussian realm. For instance, how should repeated binary outcomes in longitudinal studies be modeled? When a dataset contains both continuous and discrete responses, how can their dependence be captured in a principled and efficient way? This paper introduces a new class of probability distributions that enables flexible modeling of correlated responses of mixed type. Inspired by statistical copulas, the proposed approach is designed to remain computationally efficient even in high-dimensional settings. We refer to this framework as an approximate copula model and show that it provides a promising alternative to classical methods such as generalized linear mixed models and generalized estimating equations. To demonstrate its flexibility and scalability, we apply the approximate-copula model to genome-wide association (GWAS) data involving a mixture of continuous, binary, and count responses.

Date: 2026
References: View references in EconPapers View complete reference list from CitEc
Citations:

Downloads: (external link)
https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1013922 (text/html)
https://journals.plos.org/ploscompbiol/article/fil ... 13922&type=printable (application/pdf)

Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.

Export reference: BibTeX RIS (EndNote, ProCite, RefMan) HTML/Text

Persistent link: https://EconPapers.repec.org/RePEc:plo:pcbi00:1013922

DOI: 10.1371/journal.pcbi.1013922

Access Statistics for this article

More articles in PLOS Computational Biology from Public Library of Science
Bibliographic data for series maintained by ploscompbiol ().

 
Page updated 2026-04-05
Handle: RePEc:plo:pcbi00:1013922