|
- 2019
lsemantica: A command for text similarity based on latent semantic analysisKeywords: st0552,lsemantica,machine learning,latent semantic analysis,latent semantic indexing,truncated singular value decomposition,text analysis,text similarity Abstract: In this article, I present the lsemantica command, which implements latent semantic analysis in Stata. Latent semantic analysis is a machine learning algorithm for word and text similarity comparison and uses truncated singular value decomposition to derive the hidden semantic relationships between words and texts. lsemantica provides a simple command for latent semantic analysis as well as complementary commands for text similarity comparison
|