|
计算机应用研究 2012
Method of multi-protocol network log two-step clustering
|
Abstract:
To deal with large scale of Web log and related issues, and to provide brief data sources for the later log analysis, this paper proposed a method multi-protocol network log two-step clustering, which ploted every log into data grid and first clustering in the grid. Then according to similarity judgment, made the initial cluster grid secondary clustering. Finally output clustered log, some sparse data and outlier data. Through the test experiment, in the premise of ensuring the completeness and accuracy of log, and without affect the normal user network communication, the method can effectively compress log storage, reduce the time complexity and deal with actual dynamic data and realize incremental clustering.