Mixed membership estimation for categorical data with weighted responses
Huan Qing ()
Additional contact information
Huan Qing: Chongqing University of Technology
TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, 2025, vol. 34, issue 3, No 4, 612-659
Abstract:
Abstract The Grade-of-Membership (GoM) model, which allows subjects to belong to multiple latent classes, is a powerful tool for inferring latent classes in categorical data. However, its application is limited to categorical data with nonnegative integer responses, as it assumes that the response matrix is generated from Bernoulli or Binomial distributions, making it inappropriate for datasets with continuous or negative weighted responses. To address this, this paper proposes a novel model named the Weighted-Grade-of-Membership (WGoM) model. Our WGoM is more general than GoM because it relaxes GoM’s distribution constraint by allowing the response matrix to be generated from distributions like Bernoulli, Binomial, Normal, and Uniform as long as the expected response matrix has a block structure related to subjects’ mixed memberships under the distribution. We show that WGoM can describe any response matrix with finite distinct elements. We then propose an algorithm to estimate the latent mixed memberships and other WGoM parameters. We derive the error bounds of the estimated parameters and show that the algorithm is statistically consistent. We also propose an efficient method for determining the number of latent classes K for categorical data with weighted responses by maximizing fuzzy-weighted modularity. The performance of our methods is validated through both synthetic and real-world datasets. The results demonstrate the accuracy and efficiency of our algorithm for estimating latent mixed memberships, as well as the high accuracy of our method for estimating K, indicating their high potential for practical applications.
Keywords: Categorical data; Latent class analysis; Mixed membership models; Spectral method; Fuzzy-weighted modularity; 62H30; 91C20; 62P15 (search for similar items in EconPapers)
Date: 2025
References: View complete reference list from CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s11749-025-00973-x Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:testjl:v:34:y:2025:i:3:d:10.1007_s11749-025-00973-x
Ordering information: This journal article can be ordered from
http://www.springer. ... cs/journal/11749/PS2
DOI: 10.1007/s11749-025-00973-x
Access Statistics for this article
TEST: An Official Journal of the Spanish Society of Statistics and Operations Research is currently edited by Alfonso Gordaliza and Ana F. Militino
More articles in TEST: An Official Journal of the Spanish Society of Statistics and Operations Research from Springer, Sociedad de Estadística e Investigación Operativa
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().