lsemantica: A command for text similarity based on latent semantic analysis

Carlo Schwarz

Stata Journal, 2019, vol. 19, issue 1, 129-142

Abstract: In this article, I present the lsemantica command, which implements latent semantic analysis in Stata. Latent semantic analysis is a machine learning algorithm for word and text similarity comparison and uses truncated singular value decomposition to derive the hidden semantic relationships between words and texts. lsemantica provides a simple command for latent semantic analysis as well as complementary commands for text similarity comparison.
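Illustration: the abstract describes latent semantic analysis as a truncated singular value decomposition followed by a similarity comparison. The sketch below shows that idea in Python with NumPy. It is not the lsemantica implementation or its syntax. The toy corpus, the helper names lsa_embed and cosine_similarity, and the choice of k = 2 components are all assumptions made for illustration.

```python
import numpy as np

# Hypothetical toy corpus (not taken from the article).
docs = [
    "stata command for text analysis",
    "text similarity with latent semantic analysis",
    "singular value decomposition of a matrix",
]

# Build a document-term count matrix: one row per document, one column per word.
vocab = sorted({w for d in docs for w in d.split()})
column = {w: j for j, w in enumerate(vocab)}
X = np.zeros((len(docs), len(vocab)))
for i, d in enumerate(docs):
    for w in d.split():
        X[i, column[w]] += 1.0


def lsa_embed(X, k):
    """Return k-dimensional document coordinates from a truncated SVD of X."""
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    # Keep only the k largest singular values and their left singular vectors.
    return U[:, :k] * s[:k]


def cosine_similarity(Z):
    """Pairwise cosine similarity between the rows of Z."""
    norms = np.linalg.norm(Z, axis=1, keepdims=True)
    norms[norms == 0] = 1.0  # avoid division by zero for all-zero rows
    Zn = Z / norms
    return Zn @ Zn.T


Z = lsa_embed(X, k=2)      # documents in the reduced semantic space
S = cosine_similarity(Z)   # S[i, j] is the similarity of documents i and j
```

In this sketch the first two documents share the words "text" and "analysis", so they end up closer to each other in the reduced space than either is to the third document. In practice, the number of retained components is a tuning choice and the term matrix is usually reweighted before the decomposition.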

Keywords: lsemantica; machine learning; latent semantic analysis; latent semantic indexing; truncated singular value decomposition; text analysis; text similarity
Date: 2019
Note: To access the software from within Stata, type: net describe http://www.stata-journal.com/software/sj19-1/st0552/
Citations: View citations in EconPapers (3)

Downloads: (external link)
http://hdl.handle.net/10.1177/1536867X19830910

Persistent link: https://EconPapers.repec.org/RePEc:tsj:stataj:v:19:y:2019:i:1:p:129-142

Ordering information: This journal article can be ordered from
http://www.stata-journal.com/subscription.html

DOI: 10.1177/1536867X19830910

Stata Journal is currently edited by Nicholas J. Cox and Stephen P. Jenkins

More articles in Stata Journal from StataCorp LLC
Bibliographic data for series maintained by Christopher F. Baum and Lisa Gilmore.

 
Page updated 2025-03-22
Handle: RePEc:tsj:stataj:v:19:y:2019:i:1:p:129-142