lsemantica: A command for text similarity based on latent semantic analysis
Carlo Schwarz
Stata Journal, 2019, vol. 19, issue 1, 129-142
Abstract:
In this article, I present the lsemantica command, which implements latent semantic analysis in Stata. Latent semantic analysis is a machine learning algorithm for word and text similarity comparison and uses truncated singular value decomposition to derive the hidden semantic relationships between words and texts. lsemantica provides a simple command for latent semantic analysis as well as complementary commands for text similarity comparison.
Keywords: lsemantica; machine learning; latent semantic analysis; latent semantic indexing; truncated singular value decomposition; text analysis; text similarity (search for similar items in EconPapers)
Date: 2019
Note: to access software from within Stata, net describe http://www.stata-journal.com/software/sj19-1/st0552/
References: Add references at CitEc
Citations: View citations in EconPapers (3)
Downloads: (external link)
http://hdl.handle.net/10.1177/1536867X19830910
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:tsj:stataj:v:19:y:2019:i:1:p:129-142
Ordering information: This journal article can be ordered from
http://www.stata-journal.com/subscription.html
DOI: 10.1177/1536867X19830910
Access Statistics for this article
Stata Journal is currently edited by Nicholas J. Cox and Stephen P. Jenkins
More articles in Stata Journal from StataCorp LLC
Bibliographic data for series maintained by Christopher F. Baum () and Lisa Gilmore ().