|
国际自动化与计算杂志 2011
Regression Analysis of the Number of Association Rules
Keywords: Association rules,regression analysis,multiple correlation coefficients,interest,support,confidence Abstract: The typical model, which involves the measures: support, confidence, and interest, is often adapted to mining association rules. In the model, the related parameters are usually chosen by experience; consequently, the number of useful rules is hard to estimate. If the number is too large, we cannot effectively extract the meaningful rules. This paper analyzes the meanings of the parameters and designs a variety of equations between the number of rules and the parameters by using regression method. Finally, we experimentally obtain a preferable regression equation. This paper uses multiple correlation coefficients to test the fitting effects of the equations and uses significance test to verify whether the coefficients of parameters are significantly zero or not. The regression equation that has a larger multiple correlation coefficient will be chosen as the optimally fitted equation. With the selected optimal equation, we can predict the number of rules under the given parameters and further optimize the choice of the three parameters and determine their ranges of values.
|