Rethinking Probabilistic Topic Modeling from the Point of View of Classical Non-Bayesian Regularization
Konstantin Vorontsov ()
Additional contact information
Konstantin Vorontsov: Federal Research Center “Computer Science and Control” of RAS and Institute of Artificial Intelligence of M.V.Lomonosov Moscow State University
A chapter in Data Analysis and Optimization, 2023, pp 397-422 from Springer
Abstract:
Abstract Probabilistic Topic Modeling with hundreds of its models and applications has been an efficient text analysis technique for almost 20 years. This research area has evolved mostly within the frame of the Bayesian learning theory. For a long time, the possibility of learning topic models with a simpler conventional (non-Bayesian) regularization remained underestimated and rarely used. The framework of Additive Regularization for Topic Modeling (ARTM) fills this gap. It dramatically simplifies the model inference and opens up new possibilities for combining topic models by just adding their regularizers. This makes the ARTM a tool for synthesizing models with desired properties and gives rise to developing the fast online algorithms in the BigARTM open-source environment equipped with a modular extensible library of regularizers. In this paper, a general iterative process is proposed that maximizes a smooth function on unit simplices. This process can be used as inference mechanism for a wide variety of topic models. This approach is believed to be useful not only for rethinking probabilistic topic modeling, but also for building the neural topic models increasingly popular in recent years.
Keywords: Probabilistic topic modeling; Additive regularization of topic models; EM-algorithm; BigARTM; Multimodal topic modeling; Hierarchical topic modeling; Hypergraph topic modeling; Sequential topic modeling; Topical embedding; Transactional data; Recommender systems; Latent Dirichlet allocation; Bayesian learning (search for similar items in EconPapers)
Date: 2023
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:spochp:978-3-031-31654-8_24
Ordering information: This item can be ordered from
http://www.springer.com/9783031316548
DOI: 10.1007/978-3-031-31654-8_24
Access Statistics for this chapter
More chapters in Springer Optimization and Its Applications from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().