|
计算机应用 2006
COCA - a new way to auto-detect association based on entropy correlated coefficients
|
Abstract:
Self-managing and self-optimizing is currently a hot research field in database.To guarantee the accuracy of the estimates made by optimizer,this paper proposed a new method named COCA(entropy-COrrelated-Coefficient-based Auto-detection of association).In comparison with CORDS,COCA has the following features:(1) Fewer limitations.It overcomes the limitation that Chi-square test needs at least 80% of the cells in the contingency table have frequencies greater than 5.(2) More results.CORDS can tell the correlation between columns,while COCA can further discern the specific association degree for both directions.Experiments show that COCA is more robust and produces more statistical information,which is supportive to the creation of more effective and efficient histograms.