%0 Journal Article
%T Semi-Supervised Web Mining Based on Bayes Latent Semantic Model
基于Bayes潜在语义模型的半监督Web挖掘
%A GONG Xiu-jun
%A SHI Zhong-zhi
%A
宫秀军
%A 史忠植
%J 软件学报
%D 2002
%I
%X With the increasing of information on Internet, Web mining has been the focus of data mining. Web classification predicts the labels of Web documents by learning lots of training examples with labels. It is very expensive to get these examples by manual. Web clustering groups the similar Web documents by a certain of metric of similarity. But the classical algorithms of clustering are aimless in searching the solution space and absent of semantic characters. In this paper, a semi-supervised learning strategy consists of tow stages is put forward.The fist atage,labels the documents the documents that include latent class variables by using Bayes latent semantic model.The second stage,based on the results from the first stage,labels the documents excluding latent class variables with the Naive Bayes models.Experimental results show that this algorithm has good precision and recall rate.
%K Bayes latent semantic analysis
%K semi-supervised learning
%K Naive Bayesian classifier
%K expectation maximization
%K Web mining
贝叶斯潜在语义分析
%K 半监督学习
%K 简单贝叶斯分类
%K 期望最大化算法
%K Web挖掘
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=7735F413D429542E610B3D6AC0D5EC59&aid=298F46271C3FCEA4&yid=C3ACC247184A22C1&vid=FC0714F8D2EB605D&iid=5D311CA918CA9A03&sid=D80AE3757B45ED33&eid=B7C6D333F9B9ED14&journal_id=1000-9825&journal_name=软件学报&referenced_num=24&reference_num=9