|
计算机应用研究 2012
Closed sequential pattern mining algorithm with no candidate sequence based on two-level index structure
|
Abstract:
Aiming at the defects of CloSpan algorithm when mining closed sequential pattern that it needs to maintain the candidate sequences in the first stage and do not make full use of the location information, exists repeatedly scanning database calculating database size, this paper put forward posCloSpan algorithm. By detecting the two-level index structure, the algorithm achieved forward pruning, avoided repeatedly scanning database. At the same time, it trimed non-closed sequences through detecting sup-sequence index table and sub-sequence index table, without saving candidate sequence. Experimental result shows that the algorithm can effectively reduce the time consumption in dealing with longer sequence and the data source that has a large number of duplicated project database.