Discrimination and classification rules are based on different types of
assumptions. Also, all most statistical methods are based on some necessary
assumptions. Parametric methods are the best choice if it follows all the
underlying assumptions. When assumptions are violated, parametric approaches do
not provide a better solution and nonparametric techniques are preferred. After
Box-Cox transformation, when assumptions are satisfied, parametric methods provide fewer misclassification rates. With this problem in
mind, our concern is to compare the classification accuracy of parametric and
non-parametric approaches with the aid of Box-Cox transformation and
Bootstrapping. We carried Support Vector Machines (SVMs) and different
discrimination and classification rules to classify objects. The attention is
to critically compare the SVMs with Linear discrimination Analysis (LDA), and
Quadratic discrimination Analysis (QDA) for measuring the performance of these
techniques before and after Box-Cox transformation using misclassification
rates. From the apparent error rates, we observe that before Box-Cox
transformation, SVMs perform better than existing classification techniques, on
the other hand, after Box-Cox transformation, parametric techniques provide
fewer misclassification rates compared to nonparametric method. We also
investigated the performances of classification techniques using the Bootstrap
approach and observed that Bootstrap-based classification techniques
significantly reduce the classification error rate than the usual techniques of
small samples. Thus, this paper proposes to apply classification techniques
under the Bootstrap approach for classifying objects in case of small sample. A
real and simulated datasets application is carried out to see the performance.
References
[1]
Xie, W., She, Y. and Guo, Q. (2021) Regression on Classification Based on Improved SVM Algorithm for Balanced Binary Decision Tree. Scientific Programming, 2021, Article ID: 5560465. https://doi.org/10.1155/2021/5560465
[2]
Conover, W.J. (1980) Practical Nonparametric Statistics. Wiley Series in Probability and Statistics, 2nd Edition. Wiley, Hoboken.
Reddy, B.V.R., Pagadala, B. and Rayalu, G.M. (2011) Analysis of Transformations and Their Applications in Statistics: Extended Box and Cox Transformation Regression. LAP LAMBERT Academic Publishing, Saarbrücken, Germany.
[5]
UCI (n.d.) Thoracic Surgery Data Set.
https://archive.ics.uci.edu/ml/datasets/Thoracic+Surgery+Data
[6]
Campbell, C. and Ying, Y. (2011) Learning with Support Vector Machines (Synthesis Lectures on Artificial Intelligence and Machine Learning).1st Edition, Morgan & Claypool Publishers, San Rafael.
[7]
Izenman, A.J. (2008) Modern Multivariate Statistical Techniques: Regression, Classification, and Manifold Learning. Springer, New York.
https://doi.org/10.1007/978-0-387-78189-1
[8]
Michie, D., Spiegelhalter, D.J. and Taylor, C.C. (1994) Machine Learning, Neural and Statistical Classification. Ellis Horwood, New York.
http://www1.maths.leeds.ac.uk/~charles/statlog/whole.pdf
[9]
Camilo, L.M.M., Lima, K.M.G., Martin, F.L. (2019) Uncertainty Estimation and Misclassification Probability for Classification Models Based on Discriminant Analysis and Support Vector Machines. Analytica Chimica Acta, 1063, 40-46.
https://doi.org/10.1016/j.aca.2018.09.022
[10]
Sumy, M.S.S, Parh, M.Y.A. and Hossain, M.S. (2021) Identifying and Classifying Traveler Archetypes from Google Travel Reviews. International Journal of Statistics and Applications, 11, 61-69.
[11]
Wahl, P. and Kronmal, R. (1977) Discriminant Functions When Covariances Are Unequal and Sample Sizes Are Moderate. Biometrics, 33, 479-484.
https://doi.org/10.2307/2529362
[12]
Gareth, J., Witten, D., Hastie, T. and Tibshirani, R. (2013) An Introduction to Statistical Learning: With Applications in R. Springer, New York.
https://doi.org/10.1007/978-1-4614-7138-7
[13]
Delgado, R. and Tibau, X. (2019) Why Cohen’s Kappa Should Be Avoided as Performance Measure in Classification. PLoS ONE, 14, Article ID: e0222916.
https://doi.org/10.1371/journal.pone.0222916
[14]
Efron, B. (1979) Bootstrap Methods: Another Look at the Jackknife. The Annals of Statistics, 7, 1-26. https://doi.org/10.1214/aos/1176344552
[15]
Efron, B. and Tibshirani, R.J. (1993) An Introduction to the Bootstrap. 1st Edition, Chapman and Hall, London.
[16]
Bickel, P.J. and Freedman, D.A. (1981) Some Asymptotic Theory for the Bootstrap. The Annals of Statistics, 9, 1196-1217. https://doi.org/10.1214/aos/1176345637