Obtaining More Specific Topics and Detecting Weak Signals by Topic Word Selection
Laura Kölbl () and
Michael Grottke ()
Additional contact information
Laura Kölbl: Friedrich-Alexander-Universität Erlangen-Nürnberg
Michael Grottke: Friedrich-Alexander-Universität Erlangen-Nürnberg
A chapter in Reliability and Statistical Computing, 2020, pp 193-206 from Springer
Abstract:
Abstract With topic modeling methods, such as Latent Dirichlet Allocation (LDA), we can find topics in large text collections. To efficiently employ this information, there is a need for a method that automatically analyzes the topics with respect to their usefulness for applications like the detection of new innovations. This paper presents a novel method to automatically evaluate topics produced by LDA. The new approach puts the focus on finding topics with topic words that are not only coherent, but also specific. By using the documents associated with each word to calculate background topics, a baseline can be set for each topic word that helps assess whether its context fits the topic well. Experiments indicate that the resulting topics are more manageable in terms of their interpretability. Moreover, we show that the approach can be used to detect weak signals.
Keywords: Text mining; Topic modeling; Weak signals; Topic coherence (search for similar items in EconPapers)
Date: 2020
References: Add references at CitEc
Citations:
There are no downloads for this item, see the EconPapers FAQ for hints about obtaining it.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:ssrchp:978-3-030-43412-0_12
Ordering information: This item can be ordered from
http://www.springer.com/9783030434120
DOI: 10.1007/978-3-030-43412-0_12
Access Statistics for this chapter
More chapters in Springer Series in Reliability Engineering from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().