OALib Journal期刊
ISSN: 2333-9721
费用：99美元

投递稿件

查看量	下载量

相关文章
更多...

Open Journal of Statistics 2022

Improvement of Misclassification Rates of Classifying Objects under Box Cox Transformation and Bootstrap Approach

DOI: 10.4236/ojs.2022.121007, PP. 98-108

Mst Sharmin Akter Sumy, Md Yasin Ali Parh, Ajit Kumar Majumder, Nayeem Bin Saifuddin

Keywords: Misclassification Rate, SVM, Box Cox Transformation, Bootstrapping

Full-Text Cite this paper Add to My Lib

Abstract:

Discrimination and classification rules are based on different types of assumptions. Also, all most statistical methods are based on some necessary assumptions. Parametric methods are the best choice if it follows all the underlying assumptions. When assumptions are violated, parametric approaches do not provide a better solution and nonparametric techniques are preferred. After Box-Cox transformation, when assumptions are satisfied, parametric methods provide fewer misclassification rates. With this problem in mind, our concern is to compare the classification accuracy of parametric and non-parametric approaches with the aid of Box-Cox transformation and Bootstrapping. We carried Support Vector Machines (SVMs) and different discrimination and classification rules to classify objects. The attention is to critically compare the SVMs with Linear discrimination Analysis (LDA), and Quadratic discrimination Analysis (QDA) for measuring the performance of these techniques before and after Box-Cox transformation using misclassification rates. From the apparent error rates, we observe that before Box-Cox transformation, SVMs perform better than existing classification techniques, on the other hand, after Box-Cox transformation, parametric techniques provide fewer misclassification rates compared to nonparametric method. We also investigated the performances of classification techniques using the Bootstrap approach and observed that Bootstrap-based classification techniques significantly reduce the classification error rate than the usual techniques of small samples. Thus, this paper proposes to apply classification techniques under the Bootstrap approach for classifying objects in case of small sample. A real and simulated datasets application is carried out to see the performance.

References

[1]	Xie, W., She, Y. and Guo, Q. (2021) Regression on Classification Based on Improved SVM Algorithm for Balanced Binary Decision Tree. Scientific Programming, 2021, Article ID: 5560465. https://doi.org/10.1155/2021/5560465
[2]	Conover, W.J. (1980) Practical Nonparametric Statistics. Wiley Series in Probability and Statistics, 2nd Edition. Wiley, Hoboken.
[3]	Johnson, R.A. and Wichern, D.W. (2002) Applied Multivariate Statistical Analysis. 5th Edition, Pearson Education (Singapore) Pte. Ltd., Singapore.
[4]	Reddy, B.V.R., Pagadala, B. and Rayalu, G.M. (2011) Analysis of Transformations and Their Applications in Statistics: Extended Box and Cox Transformation Regression. LAP LAMBERT Academic Publishing, Saarbrücken, Germany.
[5]	UCI (n.d.) Thoracic Surgery Data Set. https://archive.ics.uci.edu/ml/datasets/Thoracic+Surgery+Data
[6]	Campbell, C. and Ying, Y. (2011) Learning with Support Vector Machines (Synthesis Lectures on Artificial Intelligence and Machine Learning).1st Edition, Morgan & Claypool Publishers, San Rafael.
[7]	Izenman, A.J. (2008) Modern Multivariate Statistical Techniques: Regression, Classification, and Manifold Learning. Springer, New York. https://doi.org/10.1007/978-0-387-78189-1
[8]	Michie, D., Spiegelhalter, D.J. and Taylor, C.C. (1994) Machine Learning, Neural and Statistical Classification. Ellis Horwood, New York. http://www1.maths.leeds.ac.uk/~charles/statlog/whole.pdf
[9]	Camilo, L.M.M., Lima, K.M.G., Martin, F.L. (2019) Uncertainty Estimation and Misclassification Probability for Classification Models Based on Discriminant Analysis and Support Vector Machines. Analytica Chimica Acta, 1063, 40-46. https://doi.org/10.1016/j.aca.2018.09.022
[10]	Sumy, M.S.S, Parh, M.Y.A. and Hossain, M.S. (2021) Identifying and Classifying Traveler Archetypes from Google Travel Reviews. International Journal of Statistics and Applications, 11, 61-69.
[11]	Wahl, P. and Kronmal, R. (1977) Discriminant Functions When Covariances Are Unequal and Sample Sizes Are Moderate. Biometrics, 33, 479-484. https://doi.org/10.2307/2529362
[12]	Gareth, J., Witten, D., Hastie, T. and Tibshirani, R. (2013) An Introduction to Statistical Learning: With Applications in R. Springer, New York. https://doi.org/10.1007/978-1-4614-7138-7
[13]	Delgado, R. and Tibau, X. (2019) Why Cohen’s Kappa Should Be Avoided as Performance Measure in Classification. PLoS ONE, 14, Article ID: e0222916. https://doi.org/10.1371/journal.pone.0222916
[14]	Efron, B. (1979) Bootstrap Methods: Another Look at the Jackknife. The Annals of Statistics, 7, 1-26. https://doi.org/10.1214/aos/1176344552
[15]	Efron, B. and Tibshirani, R.J. (1993) An Introduction to the Bootstrap. 1st Edition, Chapman and Hall, London.
[16]	Bickel, P.J. and Freedman, D.A. (1981) Some Asymptotic Theory for the Bootstrap. The Annals of Statistics, 9, 1196-1217. https://doi.org/10.1214/aos/1176345637

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133