|
计算机应用研究 2008
Structure-similarity cluster analysis on herbal instances based on XML
|
Abstract:
According to characters of herbalist instances, the article designed a clustering arithmetic which based on simulating anneal to optimize the herbalist instances in database. It provided a measurement which consulted usual editing distance between trees to estimate similar degree between instances described by XML(named for XML editing distance). If made full use of XML editing distance, the time complexity which calculated similar degree between XML data could keep in multinomial level, furthermore, the semantic of nodes in document described by XML and the nested relationship among nodes could be preserved. Finally, the test performing in Tamino database gets a good result and proves that it is a feasible and effective clustering arithmetic.