A mathematical model that
makes use of data mining and soft computing techniques is proposed to estimate
the software development effort. The proposed model works as follows: The
parameters that have impact on the development effort are divided into groups
based on the distribution of their values in the available dataset. The
linguistic terms are identified for the divided groups using fuzzy functions,
and the parameters are fuzzified. The fuzzified parameters then adopt
associative classification for generating association rules. The association
rules depict the parameters influencing the software development effort. As the
number of parameters that influence the effort is more, a large number of rules
get generated and can reduce the complexity, the generated rules are filtered
with respect to the metrics, support and confidence, which measures the
strength of the rule. Genetic algorithm is then employed for selecting set of
rules with high quality to improve the accuracy of the model. The datasets such
as Nasa93, Cocomo81, Desharnais, Maxwell, and Finnish-v2 are used for
evaluating the proposed model, and various evaluation metrics such as Mean
Magnitude of Relative Error, Mean Absolute Residuals, Shepperd and MacDonell’s
Standardized Accuracy, Enhanced Standardized Accuracy and Effect Size are
adopted to substantiate the effectiveness of the proposed methods. The results
infer that the accuracy of the model is influenced by the metrics support,
confidence, and the number of association rules considered for effort
prediction.
References
[1]
Jorgensen, M. (2004) A Review of Studies on Expert Estimation of Software Development Effort. Journal of Systems and Software, 70, 37-60. http://dx.doi.org/10.1016/s0164-1212(02)00156-5
[2]
Araujo, R.A., Soares, S. and Oliveria, A.L.I. (2012) Hybrid Morphological Methodology for Software Development Cost Estimation. Expert Systems with Applications, 39, 6129-6139. http://dx.doi.org/10.1016/j.eswa.2011.11.077
[3]
Fdez, J.A., Alcal, R. and Herrera, F.A. (2011) Fuzzy Association Rule Based Classification Model for High Dimensional Problems with Genetic Rule Selection and Lateral Tuning. IEEE Transactions on Fuzzy Systems, 19, 857-872. http://dx.doi.org/10.1109/TFUZZ.2011.2147794
[4]
Araujo, R.A., Oliveria, A.L.I., Soares, S. and Meira, S. (2012) An Evolutionary Morphological Approach for Software Development Cost Estimation. Neural Networks, 32, 285-291. http://dx.doi.org/10.1016/j.neunet.2012.02.040
[5]
Huang, X., Ho, D., Ren, J. and Capretz, L.F. (2007) Improving the COCOMO Model using A Neuro-Fuzzy Approach, Applied Soft Computing, 7, 29-40. http://dx.doi.org/10.1016/j.asoc.2005.06.007
[6]
Jorgensen, M. (2007) Forecasting of Software Development Work Effort: Evidence on Expert Judgement and Formal Models. International Journal of Forecasting, 23, 449-462. http://dx.doi.org/10.1016/j.ijforecast.2007.05.008
[7]
Jorgensen, M. (2005) Practical Guidelines for Expert-Judgment Based Effort Estimation. IEEE Software, 22, 57-63.
[8]
Tan, C.H., Yap, K.S. and Yap, H.J. (2012) Application of Genetic Algorithm for Fuzzy Rules Optimization on Semi Expert Judgment Automation Using Pittsburg Approach. Applied Soft Computing, 12, 2168-2177. http://dx.doi.org/10.1016/j.asoc.2012.03.018
[9]
Martin, C.L. (2008) Predictive Accuracy Comparison of Fuzzy Models for Software Development Effort of Small Programs. Journal of Systems and Software, 81, 949-960. http://dx.doi.org/10.1016/j.jss.2007.08.027
[10]
Oliveira, A.L.I., Braga, P.L., Lima, R.M.F. and Cornélio, M.L. (2010) GA-Based Method for Feature Selection and Parameters Optimization for Machine Learning Regression Applied to Software Effort Estimation. Information and Software Technology, 52, 1155-1166. http://dx.doi.org/10.1016/j.infsof.2010.05.009
[11]
Shepperd, M. and Macdonell, S.G. (2012) Evaluating Prediction Systems in Software Project Estimation. Information and Software Technologyl, 54, 820-827. http://dx.doi.org/10.1016/j.infsof.2011.12.008
[12]
Song, Q., Shepperd, M., Cartwright, M. and Mair, C. (2006) Software Defect Association Mining and Defect Correction Effort Prediction. IEEE Transactions on Software Engineering, 32, 69-82. http://dx.doi.org/10.1109/TSE.2006.1599417
[13]
Minku, L.L. and Yao, X. (2013) Ensembles and Locality: Insight on Improving Software Effort Estimation. Information and Software Technology, 55, 1512-1528. http://dx.doi.org/10.1016/j.infsof.2012.09.012
[14]
Menzies, T. (2013) Beyond Data Mining. IEEE Software, 30, 90-91. http://dx.doi.org/10.1109/ms.2013.49
[15]
Kocaguneli, E., Menzies, T., Bener, A.B. and Keung, J.W. (2012) Exploiting the Essential Assumptions of Analogy- Based Effort Estimation. IEEE Transactions on Software Engineering, 38, 425-438. http://dx.doi.org/10.1109/TSE.2011.27