OALib Journal期刊
ISSN: 2333-9721
费用：99美元

投递稿件

查看量	下载量

相关文章
更多...

Journal of Data Analysis and Information Processing 2024

Calculation of Two-Tailed Exact Probability in the Wald-Wolfowitz One-Sample Runs Test

DOI: 10.4236/jdaip.2024.121006, PP. 89-114

José Moral De La Rubia

Keywords: Randomness, Nonparametric Test, Exact Probability, Small Samples, Quantiles

Full-Text Cite this paper Add to My Lib

Abstract:

The objectives of this paper are to demonstrate the algorithms employed by three statistical software programs (R, Real Statistics using Excel, and SPSS) for calculating the exact two-tailed probability of the Wald-Wolfowitz one-sample runs test for randomness, to present a novel approach for computing this probability, and to compare the four procedures by generating samples of 10 and 11 data points, varying the parameters n₀ (number of zeros) and n₁ (number of ones), as well as the number of runs. Fifty-nine samples are created to replicate the behavior of the distribution of the number of runs with 10 and 11 data points. The exact two-tailed probabilities for the four procedures were compared using Friedman’s test. Given the significant difference in central tendency, post-hoc comparisons were conducted using Conover’s test with Benjamini-Yekutielli correction. It is concluded that the procedures of Real Statistics using Excel and R exhibit some inadequacies in the calculation of the exact two-tailed probability, whereas the new proposal and the SPSS procedure are deemed more suitable. The proposed robust algorithm has a more transparent rationale than the SPSS one, albeit being somewhat more conservative. We recommend its implementation for this test and its application to others, such as the binomial and sign test.

References

[1]	Wald, A. and Wolfowitz, J. (1943) An Exact Test for Randomness in the Non-Parametric Case Based on Serial Correlation. The Annals of Mathematical Statistics, 14, 378-388. https://doi.org/10.1214/aoms/1177731358
[2]	Wald, A. and Wolfowitz, J. (1940) On a Test Whether Two Samples Are from the Same Population. The Annals of Mathematical Statistics, 11, 147-162. https://doi.org/10.1214/aoms/1177731909
[3]	Caeiro, F. (2022) Package “Randtests”. https://cran.r-project.org/web/packages/randtests/randtests.pdf
[4]	Zaiontz, C. (2016) One-Sample Runs Test. Real Statistics Using Excel. https://real-statistics.com/non-parametric-tests/one-sample-runs-test/
[5]	IBM Corporation (2013) IBM SPSS Statistics 22 Algorithms. https://www.sussex.ac.uk/its/pdfs/SPSS_Statistics_Algorithms_22.pdf
[6]	IBM Corporation (2022) SPSS Statistics 29. Documentation. https://www.ibm.com/docs/en/spss-statistics/29.0.0
[7]	Mehta, C.R. and Patel, N.R. (2013) IBM SPSS Exact Tests. https://www.ibm.com/docs/en/SSLVMB_27.0.0/pdf/en/IBM_SPSS_Exact_Tests.pdf
[8]	Mohajan, H.K. (2020) Quantitative Research: A Successful Investigation in Natural and Social Sciences. Journal of Economic Development, Environment and People, 9, 50-79.
[9]	Gunver, M.G., Senocak, M.S. and Vehid S. (2018) To Determine Skewness, Mean and Deviation with a New Approach on Continuous Data. Ponte: International Scientific Researches Journal, 73, 64-79. https://doi.org/10.21506/j.ponte.2018.2.5
[10]	Verma, J.P. and Abdel-Salam, A.-S.G. (2019) Testing Statistical Assumptions in Research. John Wiley and Sons, Hoboken.
[11]	Poncet, P. (2022) Package Modeest. Mode Estimation. https://cran.r-project.org/web/packages/modeest/modeest.pdf
[12]	Bickel, D.R. (2002) Robust Estimators of the Mode and Skewness of Continuous Data. Computational Statistics & Data Analysis, 39, 153-163. https://doi.org/10.1016/S0167-9473(01)00057-3
[13]	Conover, W.J. (1999) Practical Nonparametric Statistics. 3th Edition, John Wiley and Sons, Hoboken.
[14]	Diamantopoulos, A., Schlegelmilch, B. and Halkias, G. (2023) Simple Things First: One Variable, One Sample. In: Taking the Fear out of Data Analysis, Edward Elgar Publishing, Cheltenham, 163-184. https://doi.org/10.4337/9781803929842.00022
[15]	Alvo, M. (2022) Nonparametric Statistics. In: Statistical Inference and Machine Learning for Big Data, Springer International Publishing, Cham, 95-170.
[16]	Kvam, P., Vidakovic, B. and Ki, S.-J. (2022) Nonparametric Statistics with Applications to Science and Engineering with R. John Wiley and Sons, Hoboken.
[17]	Martín Andrés, A., álvarez Hernández, M. and Gayá Moreno, F. (2023) The Yates, Conover, and Mantel Statistics in 2 × 2 Tables Revisited (and Extended). Statistica Neerlandica. https://doi.org/10.1111/stan.12320
[18]	Benjamini, Y. and Yekutieli, D. (2001) The Control of False Discovery Rate in Multiple Testing under Dependency. Annals of Statistics, 29, 1165-1188. https://doi.org/10.1214/aos/1013699998
[19]	Maronna, R.A., Martin, R.D., Yohai, V.J. and Salibián-Barrera, M. (2019) Robust Statistics: Theory and Methods (with R). 2nd Edition, John Wiley & Sons, Hoboken.
[20]	Bind, M.A. and Rubin, D.B. (2020) When Possible, Report a Fisher-Exact p Value and Display Its Underlying Null Randomization Distribution. Proceedings of the National Academy of Sciences, 117, 19151-19158. https://doi.org/10.1073/pnas.1915454117
[21]	DiCiccio, C.J., DiCiccio, T.J. and Romano, J.P. (2020) Exact Tests via Multiple Data Splitting. Statistics & Probability Letters, 166, Article ID: 108865. https://doi.org/10.1016/j.spl.2020.108865
[22]	Frey, J. and Zhang, Y. (2019) Improved Exact Confidence Intervals for a Proportion Using Ranked-Set Sampling. Journal of the Korean Statistical Society, 48, 493-501. https://doi.org/10.1016/j.jkss.2019.05.003
[23]	Hodges, C.B., Stone, B.M., Johnson, P.K., Carter III, J.H, Sawyers, C.K., Roby, P.R. and Lindsey, H.M. (2023) Researcher Degrees of Freedom in Statistical Software Contribute to Unreliable Results: A Comparison of Nonparametric Analyses Conducted in SPSS, SAS, Stata, and R. Behavior Research Methods, 55, 2813-2837. https://doi.org/10.3758/s13428-022-01932-2
[24]	McGee, M. (2018) Case for Omitting Tied Observations in the Two-Sample T-Test and the Wilcoxon-Mann-Whitney Test. PLOS ONE, 13, e0200837. https://doi.org/10.1371/journal.pone.0200837
[25]	Rousselet, G., Pernet, C.R. and Wilcox, R.R. (2023) An Introduction to the Bootstrap: A Versatile Method to Make Inferences by Using Data-Driven Simulations. Meta-Psychology, 7, Article No. 2058. https://doi.org/10.15626/MP.2019.2058

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133