%0 Journal Article
%T A New Support Vector Machines Active Learning Approach and its Application in Text Classification
一种新的支持向量机主动学习策略及其在文本分类中的应用
%A LIU Hong TU Zhi-Qing HUANG Shang-Teng
%A
刘宏
%A 屠轶清
%A 黄上腾
%J 计算机科学
%D 2003
%I
%X There are two well-known characteristics about text classification. One is that the dimension of the sample space is very high, while the number of examples available usually is very small. The other is that the example vectors are sparse. Meanwhile, we find existing support vector machines active learning approaches are subject to the influence of outliers. Based on these observations, this paper presents a new hybrid active learning approach. In this approach, to select the unlabelled example(s) to query, the learner takes into account both sparseness and high-dimension characteristics of examples as well as its uncertainty about the examples' categorization. This way, the active learner needs less labeled examples, but still can get a good generalization performance more quickly than competing methods. Our empirical results indicate that this new approach is effective.
%K Active learning
%K Text classification
%K Orthogonalization
%K Support vector machines
支持向量机
%K 主动学习策略
%K 文本分类
%K 机器学习
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=64A12D73428C8B8DBFB978D04DFEB3C1&aid=589D2D08E6EBB7BB&yid=D43C4A19B2EE3C0A&vid=340AC2BF8E7AB4FD&iid=B31275AF3241DB2D&sid=4DB1E72614E68564&eid=4BB057F167CF3A60&journal_id=1002-137X&journal_name=计算机科学&referenced_num=0&reference_num=9