Application Research of Computers (计算机应用研究), 2008
Automatic Blog recognition with DOM tree
Abstract:
In response to the rapid growth of Blog pages on the Internet, the paper analyzes the intrinsic features of Blog structure and techniques, combines them with DOM characteristics, and proposes an algorithm that automatically recognizes Blog pages by means of a DOM tree and pattern matching. Experiments show the feasibility of the algorithm.
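
The abstract does not list the features or patterns the paper actually uses, so the following is only a minimal sketch of the general idea (build a DOM tree, then match patterns against its nodes), not the authors' algorithm. The regular expressions in `BLOG_PATTERNS`, the scoring rule, the threshold, and all function and class names are assumptions made for illustration.

```python
import re
from html.parser import HTMLParser

# Hypothetical patterns (not taken from the paper): class/id names that
# blog templates commonly use for posts, comments, and navigation aids.
BLOG_PATTERNS = [
    re.compile(r"\b(post|entry|article)\b", re.I),
    re.compile(r"\bcomments?\b", re.I),
    re.compile(r"\b(archive|blogroll|trackback|permalink)\b", re.I),
]

# Elements that never have children, so the builder must not descend into them.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class Node:
    """One element of a simplified DOM tree."""

    def __init__(self, tag, attrs, parent=None):
        self.tag = tag
        self.attrs = dict(attrs)
        self.parent = parent
        self.children = []


class DomBuilder(HTMLParser):
    """Builds a simplified DOM tree of element nodes from HTML text."""

    def __init__(self):
        super().__init__()
        self.root = Node("#document", [])
        self.current = self.root

    def handle_starttag(self, tag, attrs):
        node = Node(tag, attrs, self.current)
        self.current.children.append(node)
        if tag not in VOID_TAGS:
            self.current = node

    def handle_startendtag(self, tag, attrs):
        # Self-closing element such as <br/>: add it without descending.
        self.current.children.append(Node(tag, attrs, self.current))

    def handle_endtag(self, tag):
        # Close the nearest open element with this tag; ignore stray end tags.
        node = self.current
        while node is not None and node.tag != tag:
            node = node.parent
        if node is not None and node.parent is not None:
            self.current = node.parent


def iter_nodes(node):
    """Yield every node of the tree in depth-first order."""
    yield node
    for child in node.children:
        yield from iter_nodes(child)


def blog_score(html):
    """Count how many of the assumed patterns match some node's class or id."""
    builder = DomBuilder()
    builder.feed(html)
    builder.close()
    matched = set()
    for node in iter_nodes(builder.root):
        label = " ".join(
            filter(None, (node.attrs.get("class"), node.attrs.get("id")))
        )
        for index, pattern in enumerate(BLOG_PATTERNS):
            if pattern.search(label):
                matched.add(index)
    return len(matched)


def is_blog(html, threshold=2):
    """Classify a page as a Blog page if enough distinct patterns match.

    The threshold of 2 is an arbitrary illustrative choice.
    """
    return blog_score(html) >= threshold
```

In this sketch a page counts as a Blog page once at least two distinct pattern groups appear somewhere in its DOM tree. A real recognizer would presumably also weigh where in the tree the matches occur, which the abstract hints at but does not specify.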