|
计算机应用 2008
Mining algorithm of distributed sequential pattern
|
Abstract:
In order to mine sequential patterns in distributed environment, Distributed Sequential Pattern Mining (DSPM) algorithm based on prefixSpan was proposed. Sample dataset was detected to balance the workload. Mining tasks were decomposed and distributed to many other computers. Pesudo-projected techniques were used to reduce the cost and the parallel was advanced by multithreading. The experimental results show that DSPM algorithm can mine global sequential patterns effectively and quickly.