全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...
-  2015 

微博新词发现及情感倾向判断分析
Analysis on new word detection and sentiment orientation in Micro-blog

DOI: 10.6040/j.issn.1671-9352.3.2014.024

Keywords: 广义后缀树,新词发现,微博,情感倾向分析,
Micro-blog
,generalized suffix tree,sentiment orientation analysis,new word detection

Full-Text   Cite this paper   Add to My Lib

Abstract:

摘要: 由于社交媒体的普及和灵活性,微博中涌现出越来越多的新词来表达情感态度,新词的发现和情感倾向已成为微博研究的热点问题。主要介绍COAE2014评测任务3的方法与技术。首先提出了一个广义后缀树的词串抽取方法,利用左右灵活度等指标发现潜在新词。然后根据上下文信息对前一步发现的潜在新词采用多重词典,基于模板,统计情感词共现手段判断其情感倾向。最后利用搜索引擎从语义角度进一步优化情感倾向结果。实验结果表明此方法对新词发现和情感倾向判断问题是有效的。
Abstract: Due to popularity and flexibility of social media, more increasingly created words were used to express people's feelings and attitudes. New word detection and sentiment orientation has become a hot issue in Micro-blog analysis. The methods and techniques used in Task 3 of COAE 2014 were introduced. Generalized suffix tree was employed in string extraction, which was determined as new words with metrics like left-right-flexibility of words etc. Then, with pattern-based and statistic-based methods combined with multiple lexicons, sentiment orientation of new words was decided. Search engine was also used to optimize result as a supplement from semantic perspective. Results have shown our methods effective in new word detection and sentiment orientation analysis

References

[1]  王立希, 王建东. 基于数据挖掘的新词发现[J].计算机应用研究, 2006,2(12):195-197. WANG Lixi, WANG Jiandong. Approach for lexicon updating based on data mining[J]. Application Research of Computers, 2006, 2(12):195-197.
[2]  黄轩, 李熔烽. 博客语料的新词发现方法[J]. 现代电子技术, 2013,36(2):144-149. HUANG Xuan, LI Rongfeng. Discovery method of new words in blog contents[J]. Modern Electronics Technique, 2013, 36(2):144-149.
[3]  郑家恒,李文花.基于构词法的网络新词自动识别初探[J].山西大学学报:自然科学版,2002,25(2):115-119. ZHENG Jiahuan, LI Wenhua. A study on automatic identification for internet new words according to word-building rule[J]. Journal of Shanxi University: Natural Science Edition, 2002, 25(2):115-119.
[4]  LIU Tao, LIU Bingquan, XU Zhiming, et al. Automatic domain-specific term extraction and its application in text classification[J]. Acta Electronica Sinica, 2007, 35(2):328-332.
[5]  林自芳,蒋秀凤.基于词内部模式的新词识别[J].计算机与现代化,2010(11):56-58. LIN Zifang, JIANG Xiufeng. A new method for Chinese new word identification based on inner pattern of word[J]. Computer and Modernization, 2010(11):56-58.
[6]  苏其龙. 微博新词发现研究[D]. 哈尔滨:哈尔滨工业大学, 2013. SU Qilong. Research on new word detection from Microblog data[D]. Harbin:Harbin Institute of Technology, 2013.
[7]  UKKONEN E. On-line construction of suffix trees[J]. Algorithmica, 1995, 14(3):249-260.
[8]  徐硕, 乔晓东, 朱礼军, 等. 广义后缀树及其在汉语科技词系统中的应用研究[J]. 数字图书馆论坛, 2013(004):37-41. XU Shuo, QIAO Xiaodong, ZHU Lijun, et al. Generalized suffix trees with its applications in Chinese scientific technical vocabulary system[J]. Digital Library Forum, 2013(004): 37-41.
[9]  赵妍妍, 秦兵, 刘挺. 文本情感分析[J]. 软件学报, 2010, 21(8):1834-1848. ZHAO Yanyan, QIN Bing, LIU Ting. Sentiment analysis[J]. Journal of Software, 2010, 21(8):1834-1848.
[10]  RAO D, RAVICHANDRAN D. Semi-supervised polarity lexicon induction[C]// Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2009: 675-682.
[11]  李钝, 乔保军, 曹元大, 等. 基于语义分析的词汇倾向识别研究[J]. 模式识别与人工智能, 2008, 21(4):482-487. LI Dun, QIAO Baojun, CAO Yuanda, et al. Word orientation recognition based on semantic analysis[J]. Pattern Recognition and Artificial Intelligence, 2008, 21(4):482-487.
[12]  田久乐, 赵蔚. 基于同义词词林的词语相似度计算方法[J]. 吉林大学学报: 信息科学版, 2010(006):602-608. TIAN Jiule, ZHAO Wei. Words similarity algorithm based on Tongyici Cilin in semantic web adaptive learning system[J]. Journal of Jilin University: Information Science Edition, 2010(006):602-608.
[13]  TURNEY P D. Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews[C]// Proceedings of the 40th Annual Meeting on Association for Computational Linguistics. Somerset: Association for Computational Linguistics, 2002: 417-424.
[14]  宋继华, 杨尔弘, 王强军. 中文信息处理教程[M]. 北京:高等教育出版社, 2011: 74-75. SONG Jihua, YANG Erhong, WANG Qiangjun. Chinese information processing tutorial[M]. Beijing: Higher Education Press, 2011: 74-75.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133