Application Research of Computers (计算机应用研究), 2011
Blog automatic summarization based on features information
Abstract:
To extract the summary of a Blog effectively, this paper first selects a number of the Blog's comments in a reasonable way. Then, taking into account word frequency within each sentence, it calculates the weight of every sentence in the Blog, combining this with structural information and the selected comments. However, this method tends to neglect minor topics. To overcome that drawback, the paper proposes a secondary summary extraction that exploits the paragraph structure of the Blog. Finally, an experiment on Blog data randomly downloaded from the Internet demonstrates that the method has better spreadability and generality.
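The pipeline outlined in the abstract can be illustrated with a minimal sketch. The abstract does not give the actual weighting formula, so everything specific here is an assumption made for illustration: the function names, the boost parameters (`title_boost`, `comment_boost`, `lead_boost`), the regex tokenizer, the use of the title and of the first sentence of each paragraph as "structural information", and the rule that the secondary pass adds the best sentence from every paragraph the first pass left uncovered.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase word tokens; a stand-in for real word segmentation."""
    return re.findall(r"[a-z0-9']+", text.lower())


def split_sentences(paragraph):
    """Naive splitter: break after ., ! or ? followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", paragraph.strip())
    return [p for p in parts if p]


def score_sentences(title, paragraphs, comments,
                    title_boost=0.5, comment_boost=0.5, lead_boost=0.2):
    """Return (paragraph_idx, sentence_idx, weight, sentence) tuples.

    Assumed weighting: mean word frequency of the sentence, scaled up by
    its overlap with the title and the selected comments, plus a small
    bonus for the first sentence of a paragraph.
    """
    freq = Counter(w for p in paragraphs for w in tokenize(p))
    title_words = set(tokenize(title))
    comment_words = {w for c in comments for w in tokenize(c)}
    scored = []
    for p_idx, paragraph in enumerate(paragraphs):
        for s_idx, sentence in enumerate(split_sentences(paragraph)):
            words = tokenize(sentence)
            if not words:
                continue
            base = sum(freq[w] for w in words) / len(words)
            title_share = sum(w in title_words for w in words) / len(words)
            comment_share = sum(w in comment_words for w in words) / len(words)
            bonus = 1.0 + title_boost * title_share + comment_boost * comment_share
            if s_idx == 0:
                bonus += lead_boost
            scored.append((p_idx, s_idx, base * bonus, sentence))
    return scored


def summarize(title, paragraphs, comments, k=2, secondary=True):
    """Pick the k heaviest sentences, then optionally run a secondary pass."""
    scored = score_sentences(title, paragraphs, comments)
    chosen = sorted(scored, key=lambda r: r[2], reverse=True)[:k]
    if secondary:
        # Secondary extraction (assumed rule): add the best sentence of
        # each paragraph that the primary pass did not cover.
        covered = {r[0] for r in chosen}
        for p_idx in range(len(paragraphs)):
            if p_idx not in covered:
                candidates = [r for r in scored if r[0] == p_idx]
                if candidates:
                    chosen.append(max(candidates, key=lambda r: r[2]))
    # Restore document order so the summary reads naturally.
    chosen.sort(key=lambda r: (r[0], r[1]))
    return [r[3] for r in chosen]
```

Calling `summarize(title, paragraphs, comments, secondary=False)` shows the drawback the abstract mentions: a short paragraph on a minor topic can be crowded out by sentences built from high-frequency words. With `secondary=True`, each paragraph contributes at least one sentence.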