|
计算机应用 2008
Application of a non-linear dimension reduction algorithm on document clustering
|
Abstract:
This paper presented a non-linear dimension reduction algorithm-Self-organizing Isometric Embedding (SIE) to compress high-dimensional document data. The algorithm was then validated in document clustering by being compared with the typical linear dimension reduction algorithm-Latent Semantic Indexing (LSI). Experimental results show that while significantly lowering the complexity, the performance of SIE is better than that of LSI and the benchmark.