%0 Journal Article
%T Threshold Optimization with a Small Number of Samples in Adaptive Information Filtering
自适应信息过滤中使用少量正例进行阈值优化
%A XIA Ying-Ju
%A HUANG Xuan-Jing
%A HU Tian
%A WU Li-De
%A
夏迎炬
%A 黄萱菁
%A 胡恬
%A 吴立德
%J 软件学报
%D 2003
%I
%X One special challenge in adaptive information filtering is the problem of extremely sparse data. So it is very important to learn optimal threshold while filtering the input textual stream. In this paper, an algorithm is presented for the threshold optimization. The algorithm learns fast by using few positive samples. Moreover, most of the quantities the algorithm requires can be updated incrementally, so its memory and computational power requirements are low. It also has the merits of effective, robust, and practically useful. Fudan University's adaptive text filtering system used this algorithm for the first time and came in third in all runs of TREC10. Its T10U and T10F are 0.215 and 0.414 respectively.
%K adaptive information filtering
%K vector space model
%K threshold optimization
%K delivery ratio
%K relevance feedback
自适应信息过滤
%K 向量空间模型
%K 阈值优化
%K 检出率
%K 相关反馈
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=7735F413D429542E610B3D6AC0D5EC59&aid=7C2702C281225CEA&yid=D43C4A19B2EE3C0A&vid=F3583C8E78166B9E&iid=F3090AE9B60B7ED1&sid=A87AC493625B15AF&eid=3183893ED5218CD5&journal_id=1000-9825&journal_name=软件学报&referenced_num=5&reference_num=14