An Improved Efficient Bayesian SMS Text Classifier

Keywords: SMS, text classification, Bayes, support vector machine, classification energy space

Abstract:

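To address the problem of SMS classification, this paper proposes the concept of a classification energy space. Each feature word is converted into an energy element in the classification energy space, and the energy feature vector of an SMS message is computed on that basis. By computing the neighborhood density of a message's energy feature vector and combining it with Bayes' formula, the classifier outputs the probability that the message belongs to each class. During classification, messages whose class probabilities differ only slightly are classified a second time with a support vector machine to improve classification performance. Experimental results show that the classifier model achieves good classification performance.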
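The two-stage decision described in the abstract can be illustrated with a minimal sketch. It is not the paper's method: the abstract does not define the energy space or the neighborhood density, so the first stage here is an ordinary multinomial Naive Bayes over word counts, and the second stage is an arbitrary callable standing in for the support vector machine. The names `NaiveBayes`, `classify`, `second_stage`, and the `margin` threshold of 0.2 are all assumptions made for illustration.

```python
import math
from collections import Counter


class NaiveBayes:
    """Multinomial Naive Bayes with Laplace smoothing.

    Assumption: a stand-in for the paper's first stage, which instead
    derives class probabilities from energy feature vectors.
    """

    def fit(self, docs, labels):
        # docs: list of token lists; labels: one class label per document.
        self.classes = sorted(set(labels))
        self.class_counts = Counter(labels)
        self.word_counts = {c: Counter() for c in self.classes}
        for words, label in zip(docs, labels):
            self.word_counts[label].update(words)
        self.vocab = {w for c in self.classes for w in self.word_counts[c]}
        return self

    def posteriors(self, words):
        # Log-space scores per class, normalized into probabilities.
        total_docs = sum(self.class_counts.values())
        log_scores = {}
        for c in self.classes:
            total_words = sum(self.word_counts[c].values())
            score = math.log(self.class_counts[c] / total_docs)
            for w in words:
                if w in self.vocab:  # ignore words never seen in training
                    score += math.log(
                        (self.word_counts[c][w] + 1)
                        / (total_words + len(self.vocab))
                    )
            log_scores[c] = score
        top = max(log_scores.values())
        unnormalized = {c: math.exp(s - top) for c, s in log_scores.items()}
        z = sum(unnormalized.values())
        return {c: v / z for c, v in unnormalized.items()}


def classify(words, bayes, second_stage, margin=0.2):
    """Return (label, stage).

    If the two most probable classes differ by less than `margin`
    (a hypothetical threshold), defer to `second_stage`, a callable
    standing in for the SVM re-classification step.
    """
    probs = bayes.posteriors(words)
    ranked = sorted(probs, key=probs.get, reverse=True)
    if len(ranked) > 1 and probs[ranked[0]] - probs[ranked[1]] < margin:
        return second_stage(words), "second_stage"
    return ranked[0], "bayes"
```

To use the sketch, fit `NaiveBayes` on tokenized messages with their labels, then call `classify` with any function that maps a token list to a label as the second stage. Only ambiguous messages reach that second classifier, so the more expensive model runs on a small fraction of the input.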
