The
objective of this study is to propose the Parametric Seven-Number Summary
(PSNS) as a significance test for normality and to verify its accuracy and
power in comparison with two well-known tests, such as Royston’s W test and
D’Agostino-Belanger-D’Agostino K-squared test. An experiment with 384
conditions was simulated. The conditions were generated by crossing 24 sample
sizes and 16 types of continuous distributions: one normal and 15 non-normal.
The percentage of success in maintaining the null hypothesis of normality
against normal samples and in rejecting the null hypothesis against non-normal
samples (accuracy) was calculated. In addition, the type II error against
normal samples and the statistical power against normal samples were computed.
Comparisons of percentage and means were performed using Cochran’s Q-test,
Friedman’s test, and repeated measures analysis of variance. With sample sizes of 150 or greater, high accuracy and mean power or
type II error (≥0.70 and ≥0.80, respectively) were achieved. All three normality tests were similarly accurate; however,
the PSNS-based test showed lower mean power
than K-squared and W tests, especially against non-normal samples of
symmetrical-platykurtic distributions, such as the uniform, semicircle, and
arcsine distributions. It is concluded that the PSNS-based omnibus test is
accurate and powerful for testing normality with samples of at least 150
observations.
References
[1]
Bowley, A.L. (1901) Elements of Statistics. P. S. King, London.
[2]
Friendly, M. (2007) A Brief History of Data Visualization. In: Chen, C., Härdle, W. and Unwin, A., Eds., Handbook of Computational Statistics: Data Visualization, Springer-Verlag, Heidelberg, 1-34.
[3]
Bowley, A.L. (1910) Elementary Manual of Statistics. P. S. King, London.
Tukey, J.W. (1972) Some Graphic and Semi-Graphic Displays. In: Bancroft, T.A. and Brown, S.A., Eds, Statistical Papers in Honor of George W. Snedecor, Iowa State University Press, Aimes, 293-316.
[9]
Tukey, J.A. (1977) Exploratory Data Analysis. Addison and Wesley, Reading.
[10]
Wickham, H. and Stryjewski, L. (2011) 40 Years of Boxplots.
http://vita.had.co.nz/papers/boxplots.pdf
[11]
Cox, N.J. (2009) Speaking Stata: Creating and Varying Box Plots. Stata Journal, 9, 478-496. https://doi.org/10.1177/1536867X0900900309
[12]
McGill, R., Tukey, J.W. and Larsen, W.A. (1978) Variations of Box Plots. The American Statistician, 32, 12-16. https://doi.org/10.1080/00031305.1978.10479236
[13]
Cleveland, W.S. (1985) Elements of Graphing Data. Wadsworth, Monterey.
[14]
Gotelli, N.J. and Ellison, A.M. (2004) A Primer of Ecological Statistics. Sinauer Associates, Inc., Sunderland.
[15]
Reimann, C., Filzmoser, P., Garrett, R.G. and Dutter, R. (2008) Statistical Data Analysis Explained: Applied Environmental Statistics with R. John Wiley and Sons, New York. https://doi.org/10.1002/9780470987605
[16]
Benjamini, Y. (1988) Opening the Box of a Boxplot. The American Statistician, 42, 257-262. https://doi.org/10.1080/00031305.1988.10475580
[17]
Hintze, J.L. and Nelson, R.D. (1998) Violin Plots: A Box-Plot Density Trace Synergism. The American Statistician, 52, 181-184.
https://doi.org/10.1080/00031305.1998.10480559
[18]
Kampstra, P. (2008) Beanplot: A Boxplot for Visual Comparison of Distributions. Journal of Statistical Software Code Snippets, 28, 1-9.
https://doi.org/10.18637/jss.v028.c01
[19]
Hyndman, R. (1996) Computing and Graphing Highest Density Regions. The American Statistician, 50, 120-126.
https://doi.org/10.1080/00031305.1996.10474359
[20]
Muth, S.Q., Potterat, J.J. and Rothenberg, R.B. (2000) Birds of Feather: Using a Rotational Box Plot to Assess Ascertainment Bias. International Journal of Epidemiology, 29, 899-904. https://doi.org/10.1093/ije/29.5.899
[21]
Phuyal, S., Rashid, M. and Sarkar, J., (2021) GIplot: An R Package for Visualizing the Summary Statistics of a Quantitative Variable.
https://cran.r-project.org/web/packages/GIplot/index.html
[22]
Sarkar, J. and Rashid, M. (2021) IVY Plots and Gaussian Interval Plots. Teaching Statistics, 43, 85-90. https://doi.org/10.1111/test.12257
[23]
Rousseeuw, P.J., Ruts, I. and Tukey, J.W. (1999) The Bagplot: A Bivariate Boxplot. The American Statistician, 53, 382-387.
https://doi.org/10.1080/00031305.1999.10474494
[24]
Mishra, P., Pandey, C.M., Singh, U., Gupta, A., Sahu, C. and Keshri, A. (2019) Descriptive Statistics and Normality Tests for Statistical Data. Annals of Cardiac Anaesthesia, 22, 67-72. https://doi.org/10.4103/aca.ACA_157_18
[25]
Pearson, K. (1900) On the Criterion that a Given System of Deviations from the Probable in the Case of a Correlated System of Variables Is Such That It Can Be Reasonably Supposed to Have Arisen from Random Sampling. The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science, Series 5, 50, 157-175. https://doi.org/10.1080/14786440009463897
[26]
Woolf, B. (1957) The Log-Likelihood Ratio Test (The G-Test); Methods and Tables for Tests of Heterogeneity in Contingency Tables. Annals of Human Genetics, 21, 397-409. https://doi.org/10.1111/j.1469-1809.1972.tb00293.x
[27]
Kolmogorov, A.N. (1933) Sulla Determinizione Empirica di una Legge di Distribuzione. Giornale dell Istituto Italiano degli Attuari, 4, 83-91.
[28]
Smirnov, N.V. (1939) On the Estimation of the Discrepancy between Empirical Curves of Distributions for Two Independent Samples. Moscow University Mathematics Bulletin, 2, 3-26.
[29]
Kuiper, N.H. (1960) Tests Concerning Random Points on a Circle. Proceedings of the Koninklijke Nederlandse Akademie van Wetenschappen, Series A, 63, 38-47.
https://doi.org/10.1016/S1385-7258(60)50006-0
[30]
Cramer, H. (1928) On the Composition of Elementary Errors. Scandinavian Actuarial Journal, 1, 13-74. https://doi.org/10.1080/03461238.1928.10416862
[31]
Von Mises, R.E. (1928) Wahrscheinlichkeit, Statistik und Wahrheit [Probability, Statistics and Truth]. Verlag von Julius Springer, Wien.
https://doi.org/10.1007/978-3-662-36230-3
[32]
Anderson, T.W. and Darling, D.A. (1952) Asymptotic Theory of Certain “Goodness-of-Fit” Criteria Based on Stochastic Processes. Annals of Mathematical Statistics, 23, 193-212. https://doi.org/10.1214/aoms/1177729437
[33]
Watson, G.S. (1961) Goodness-of-Fit Tests on a Circle. Biometrika, 48, 109-114.
https://doi.org/10.1093/biomet/48.1-2.109
[34]
Shapiro, S.S. and Wilk, M.B. (1965) An Analysis of Variance Test for Normality (Complete Samples). Biometrika, 52, 591-611.
https://doi.org/10.1093/biomet/52.3-4.591
[35]
Royston, J.P. (1992) Approximating the Shapiro-Wilk W-Test for Non-Normality. Statistics and Computing, 2, 117-119. https://doi.org/10.1007/BF01891203
[36]
Chen, L. and Shapiro, S.S. (1995) An Alternative Test for Normality Based on Normalized Spacings. Journal of Statistical Computation and Simulation, 53, 269-288.
https://doi.org/10.1080/00949659508811711
[37]
Shapiro, S.S. and Francia, R.S. (1972) An Approximate Analysis of Variance Test for Normality. Journal of the American Statistical Association, 67, 215-216.
https://doi.org/10.1080/01621459.1972.10481232
[38]
Royston, J.P. (1993) A Toolkit of Testing for Non-Normality in Complete and Censored Samples. Journal of the Royal Statistical Society. Series D (The Statistician), 42, 37-43. https://doi.org/10.2307/2348109
[39]
D’Agostino, R.B. (1971) An Omnibus Test of Normality for Moderate and Large Size Samples. Biometrika, 58, 341-348. https://doi.org/10.1093/biomet/58.2.341
[40]
D’Agostino, R.B. (1972) Small Sample Probability Points for the D Test of Normality. Biometrika, 59, 219-221. https://doi.org/10.2307/2334638
[41]
D’Agostino, R.B. and Pearson, E.S. (1973) Tests for Departure from Normality. Empirical Results for the Distributions of b2 and √b1. Biometrika, 60, 613-622.
https://doi.org/10.1093/biomet/60.3.613
[42]
Jarque, C.M. and Bera, A. (1987) A Test for Normality of Observations and Regression Residuals. International Statistical Review, 55, 163-172.
https://doi.org/10.2307/1403192
[43]
Urzua, C. (1996) On the Correct Use of Omnibus Tests for Normality. Economics Letters, 53, 247-251. https://doi.org/10.1016/S0165-1765(96)00923-8
[44]
D’Agostino, R.B., Belanger, A. and D'Agostino Jr., R.B. (1990) A Suggestion for Using Powerful and Informative Tests of Normality. The American Statistician, 44, 316-321. https://doi.org/10.1080/00031305.1990.10475751
[45]
Moore, D. (1986) Tests of Chi-Squared Type. In: D’Agostino, R.B. and Stephens, M.A., Eds., Goodness-of-Fit Techniques, Marcel Dekker, New York, 63-95.
https://doi.org/10.1201/9780203753064-3
[46]
Wilk, M.B. and Gnanadesikan, R. (1968) Probability Plotting Methods for the Analysis of Data. Biometrika, 55, 1-17. https://doi.org/10.2307/2334448
[47]
Estaki, M., Jiang, L., Bokulich, N.A., McDonald, D., González, A., Kosciolek, T., Martino, C., Zhu, Q., Birmingham, A., Vázquez-Baeza, Y., Dillon, M.R., Bolyen, E., Caporaso, J.G. and Knight, R. (2020) QIIME 2 Enables Comprehensive End-to-End Analysis of Diverse Microbiome Data and Comparative Studies with Publicly Available Data. Current Protocols in Bioinformatics, 70, e100.
https://doi.org/10.1002/cpbi.100
[48]
Milligan, C., Montufar, J., Regehr, J. and Ghanney, B. (2016) Road Safety Performance Measures and AADT Uncertainty from Short-Term Counts. Accident Analysis & Prevention, 97, 186-196. https://doi.org/10.1016/j.aap.2016.09.013
[49]
Stuart, A. and Ord, J.K. (1994) Kendall’s Advanced Theory of Statistics. Volume 1. Distribution Theory. Sixth Edition, Edward Arnold, London.
[50]
Hyndman, R.J. and Fan, Y. (1996) Sample Quantiles in Statistical Packages. The American Statistician, 50, 361-365.
https://doi.org/10.1080/00031305.1996.10473566
[51]
Blom, G. (1958) Statistical Estimates and Transformed Beta Variables. John Wiley and Sons, New York.
[52]
Yap, B.W. and Sim, C.H. (2011) Comparisons of Various Types of Normality Tests. Journal of Statistical Computation and Simulation, 81, 2141-2155.
https://doi.org/10.1080/00949655.2010.520163
[53]
Khatun, N. (2021) Applications of Normality Test in Statistical Analysis. Open Journal of Statistics, 11, 113-122. https://doi.org/10.4236/ojs.2021.111006
[54]
Alonso, J.C. and Montenegro, S. (2015) A Monte Carlo Study to Compare 8 Normality Tests for Least-Squares Residuals Following a First Order Autoregressive Process. Estudios Gerenciales, 31, 253-265.
https://doi.org/10.1016/j.estger.2014.12.003
[55]
Sanchez-Espigares, J.A., Grima, P. and Marco-Almagro, L. (2019) Graphical Comparison of Normality Tests for Unimodal Distribution Data. Journal of Statistical Computation and Simulation, 89, 145-154.
https://doi.org/10.1080/00949655.2018.1539085
[56]
Mukasa, E.S., Christospher, W., Ivan, B. and Kizito, M. (2021) The Effects of Parametric, Non-Parametric Tests and Processes in Inferential Statistics for Business Decision Making. Open Journal of Business and Management, 9, 1510-1526.
https://doi.org/10.4236/ojbm.2021.93081
[57]
Gupta, S.C. and Kapoor, V.K. (2020) Fundamentals of Mathematical Statistics. Twelfth Edition, Sultan Chand & Sons, New Delhi.
[58]
Cochran, W.G. (1950) The Comparison of Proportions in Matched Samples. Biometrika, 37, 256-266. https://doi.org/10.1093/biomet/37.3-4.256
[59]
Serlin, R.C., Carr, J. and Marascuilo, L.A. (1982) A Measure of Association for Selected Nonparametric Procedures. Psychological Bulletin, 92, 786-790.
https://doi.org/10.1037/0033-2909.92.3.786
[60]
Friedman, M. (1940) A Comparison of Alternative Tests of Significance for the Problem of m Rankings. The Annals of Mathematical Statistics, 11, 86-92.
https://doi.org/10.1214/aoms/1177731944
[61]
Kendall, M.G. and Babington-Smith, B. (1939) The Problem of m Rankings. The Annals of Mathematical Statistics, 10, 275-287.
https://doi.org/10.1214/aoms/1177732186
[62]
Kendall, M.G. and Gibbons, J.D. (1990) Rank Correlation Methods. Oxford University Press, New York.
[63]
Hopkins, K.D. (2006) A New View of Statistics. A Scale of Magnitudes for Effect Statistics. http://www.sportsci.org/resource/stats/effectmag.html
[64]
McNemar, Q. (1947) Note on the Sampling Error of the Difference between Correlated Proportions and Percentages. Psychometrika, 12, 153-157.
https://doi.org/10.1007/BF02295996
[65]
Benjamini, Y. and Yekutieli, D. (2001) The Control of the False Discovery Rate in Multiple Testing under Dependency. Annals of Statistics, 29, 1165-1188.
https://doi.org/10.1214/aos/1013699998
[66]
John, S. (1972) The Distribution of a Statistic Used for Testing Sphericity of Normal Distributions. Biometrika, 59, 169-173. https://doi.org/10.1093/biomet/59.1.169
[67]
Nagao, H. (1973) On Some Test Criteria for Covariance Matrix. Annals of Statistics, 1, 700-709. https://doi.org/10.1214/aos/1176342464
[68]
Sugiura, N. (1972) Locally Best Invariant Test for Sphericity and the Limiting Distributions. Annals of Mathematical Statistics, 43, 1312-1316.
https://doi.org/10.1214/aoms/1177692481
[69]
Greenhouse, S.W. and Geisser, S. (1959) On Methods in the Analysis of Profile Data. Psychometrika, 24, 95-112. https://doi.org/10.1007/BF02289823
[70]
Huynh, H. and Feldt, L.S. (1976) Estimation of the Box Correction for Degrees of Freedom from Sample Data in Randomized Block and Split-Plot Designs. Journal of Educational Statistics, 1, 69-82. https://doi.org/10.3102/10769986001001069
[71]
Cohen, J. (1988) Statistical Power Analysis for Behavioral Sciences. Second Edition, Lawrence Erlbaum Associates, Hillsdale.
Friedman, M. (1937) The Use of Ranks to Avoid the Assumption of Normality Implicit in the Analysis of Variance. Journal of the American Statistical Association, 32, 675-701. https://doi.org/10.1080/01621459.1937.10503522
[74]
Friedman, M. (1939) A Correction. The Use of Ranks to Avoid the Assumption of Normality Implicit in the Analysis of Variance. Journal of the American Statistical Association, 34, 109. https://doi.org/10.2307/2279169
[75]
Agresti, A. and Pendergast, J. (1986) Comparing Mean Ranks for Repeated Measures Data. Communications in Statistics—Theory and Methods, 15, 1417-1433.
https://doi.org/10.1080/03610928608829193
[76]
Wilcoxon, F. (1945) Individual Comparisons by Ranking Methods. Biometrics Bulletin, 1, 80-83. https://doi.org/10.2307/3001968
[77]
Microsoft Corporation (2019) Microsoft Excel 2019 for Windows.
https://office.microsoft.com/excel
[78]
Zaiontz, C. (2021) Real Statistics Resource Pack Software. Release 7.6.
http://www.real-statistics.com
[79]
Ramachandran, K.M. and Tsokos, C.P. (2020) Mathematical Statistics with Applications in R. Third Edition, Academic Press, Hoboken.
[80]
Fay, M.P., Sachs, M.C. and Miura, K. (2018) Measuring Precision in Bioassays: Rethinking Assay Validation. Statistics in Medicine, 37, 519-529.
https://doi.org/10.1002/sim.7528
[81]
Aberson, C.L. (2019) Applied Power Analysis for the Behavioral Science. Second Edition, Routledge, New York. https://doi.org/10.4324/9781315171500
[82]
Brysbaert, M. (2019) How Many Participants Do We Have to Include in Properly Powered Experiments? A Tutorial of Power Analysis with Reference Tables. Journal of Cognition, 2, 16. https://doi.org/10.5334/joc.72
[83]
Meredith, J. (1998) Building Operations Management Theory through Case and Field Research. Journal of Operations Management, 16, 441-454.
https://doi.org/10.1016/S0272-6963(98)00023-0
[84]
Morris, T.P., White, I.R. and Crowther, M.J. (2019) Using Simulation Studies to Evaluate Statistical Methods. Statistics in Medicine, 38, 2074-2102.
https://doi.org/10.1002/sim.8086
[85]
Overton, C.E., Stage, H.B., Ahmad, S., Curran-Sebastian, J., Dark, P., Das, R., Fearon, E., Felton, T., Fyles, M., Gent, N., Hall, I., House, T., Lewkowicz, H., Pang, X., Pellis, L., Sawko, R., Ustianowski, A., Vekaria, B. and Webb, L. (2020) Using Statistics and Mathematical Modelling to Understand Infectious Disease Outbreaks: COVID-19 as an Example. Infectious Disease Modelling, 5, 409-441.
https://doi.org/10.1016/j.idm.2020.06.008
[86]
Luong, A. and Bilodeau, C. (2018) Asymptotic Normality Distribution of Simulated Minimum Hellinger Distance Estimators for Continuous Models. Open Journal of Statistics, 8, 846-860. https://doi.org/10.4236/ojs.2018.85056
[87]
Johnson, N.L., Kotz, S. and Balakrishnan, N. (1995) Continuous Univariate Distributions. Second Editon, John Wiley and Sons, New York.