全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

Hot Word Extraction for Microblog Based on Massive Data Filtering
基于海量信息过滤的微博热词抽取方法

Keywords: Chinese microblog,user behavior models,massive data filtering,hot word extraction,power law distribution
中文微博
,用户行为模型,海量信息过滤,热词抽取,幂律分布

Full-Text   Cite this paper   Add to My Lib

Abstract:

This paper presents a Chinese microblog hot words extraction algorithm based on massive data Filtering. Firstly, it chooses the user behaviour characteristics and text characteristics to create user behavior models, and filters massive data to create topic-trees by a fast algorithm based on rules. Then, it uses hot words extraction algorithm to get the hot topic of topic-trees by word frequency feature. The experiment results show that the proposed algorithm can reduce the scale of the input data, with keeping lots of important information to extract hot words.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133