%0 Journal Article %T Design and implementation of search engine system for digital library
数字图书馆主题搜索引擎的设计与实现 %A LIN Qi-dong %A CHEN Chuan-bo %A ZHENG Le-dan %A ZHANG Yi-man %A
林其东 %A 陈传波 %A 郑乐丹 %A 张一曼 %J 计算机应用研究 %D 2009 %I %X This paper presents an overall system design for a topic-specific search engine for digital libraries. A preprocessing system selects high-quality seed sites, which yields the data that define each Web topic. Under the coordination of a system controller, the topic crawlers concurrently collect the Web resources recommended by the crawlers, then classify the text and identify the topics of the downloaded resources, which are stored in a Web topic resource database according to discipline classification. Users can then search the topic resources through an index over the whole information database. Based on the specific characteristics of digital libraries, the paper proposes a multi-threaded topic-specific crawler design and a novel URL pruning algorithm, EPR, and uses them to implement a prototype topic-specific search engine for digital libraries. The final system was built by extending the open-source Lucene platform. Experimental results show that the approach is effective and that the EPR algorithm in particular is both novel and valuable in a real application environment. %K digital library %K topic-specific %K crawler %K search engines %K algorithm-EPR
数字图书馆 %K 主题 %K 爬行器 %K 搜索引擎 %K EPR算法 %U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=A9D9BE08CDC44144BE8B5685705D3AED&aid=D8303FC68D346143A9D74DEF6602D839&yid=DE12191FBD62783C&vid=96C778EE049EE47D&iid=5D311CA918CA9A03&sid=EA008A08338B42DC&eid=FAE697EF7DB29B61&journal_id=1001-3695&journal_name=计算机应用研究&referenced_num=0&reference_num=16