Inference Procedures on the Generalized Poisson Distribution from Multiple Samples: Comparisons with Nonparametric Models for Analysis of Covariance (ANCOVA) of Count Data
Count data that exhibit over dispersion (variance of
counts is larger than its mean) are commonly analyzed using discrete
distributions such as negative binomial, Poisson inverse Gaussian and other
models. The Poisson is characterized by the equality of mean and variance
whereas the Negative Binomial and the Poisson inverse Gaussian have variance
larger than the mean and therefore are more appropriate to model over-dispersed
count data. As an alternative to these two models, we shall use the generalized
Poisson distribution for group comparisons in the presence of multiple
covariates. This problem is known as the ANCOVA and is solved for continuous
data. Our objectives were to develop ANCOVA using the generalized Poisson
distribution, and compare its goodness of fit to that of the nonparametric
Generalized Additive Models. We used real life data to show that the model
performs quite satisfactorily when compared to the nonparametric Generalized
Additive Models.
References
[1]
Cox, D.R. (1983) Some Remarks on Overdispersion. Biometrika, 70, 269-274.
https://doi.org/10.1093/biomet/70.1.269
[2]
Hinde, J. and Demetrio, C.G.B. (1998) Overdispersion: Models and Estimation. Computational statistics and Data Analysis, 27, 151-170.
https://doi.org/10.1016/S0167-9473(98)00007-3
[3]
Hayat, M.J. and Higgins, M. (2014) Understanding Poisson Regression. Journal of Nursing Education, 53, 207-215. https://doi.org/10.3928/01484834-20140325-04
[4]
Joe, H. and Zhu, R. (2005) Generalized Poisson Distribution: The Property of Mixture of Poisson and Comparison with the Negative Binomial Distribution. Biometrical Journal, 47, 219-229. https://doi.org/10.1002/bimj.200410102
[5]
Consul, P.C. and Jain, G.C. (1970) On the Generalization of Poisson Distribution. Annals of Mathematical Statistics, 41, 1387.
Janardan, K.G. and Schaeffer, D.J. (1977) Models for the Analysis of Chromosomal Aberrations in Human Leukocytes. Biometrical Journal, 19, 599-612.
https://doi.org/10.1002/bimj.4710190804
[8]
Tanner, J.C. (1961) A Derivation of Borel Distribution. Biometrika, 40, 222-224.
https://doi.org/10.1093/biomet/48.1-2.222
[9]
Consul, P.C. and Shoukri, M.M. (1988) Some Chance Mechanisms Related to a Generalized Poisson Probability Model. American Journal of Mathematical and Management Sciences, 8, 181-202. https://doi.org/10.1080/01966324.1988.10737237
[10]
Jiang, H. and Wong, W.H. (2009) Statistical Inferences for Isoform Expression in RNA-Seq. Bioinformatics, 25, 1026-1032.
https://doi.org/10.1093/bioinformatics/btp113
[11]
Srivastava, S. and Chen, L. (2010) A Two-Parameter Generalized Poisson Model to Improve the Analysis of RNA-seq Data. Nucleic Acids Research, 38, e170.
https://doi.org/10.1093/nar/gkq670
[12]
Shoukri, M.M. and Mian, I.U.H. (1991) Some Aspects of Statistical Inference on the Lagrange (Generalized) Poisson Distribution. Communication in Statistics: Computations and Simulations, 20, 1115-1137.
https://doi.org/10.1080/03610919108812999
[13]
Cox, D.R. and Reid, N. (1987) Parameter Orthogonality and Approximate Conditional Inference. Journal of the Royal Statistical Society. Series B (Methodological), 49, 1-39. https://doi.org/10.1111/j.2517-6161.1987.tb01422.x
[14]
McCullagh, P. and Nelder, J.A. (1989) Generalized Linear Models. Chapman and Hall, London. https://doi.org/10.1007/978-1-4899-3242-6
[15]
Shoukri, M.M. and Al-Eid, M. (2020) Inference Procedures on the Ratio of Modified Generalized Poisson Distribution Means: Applications to RNA_SEQ Data. International Journal of Statistics in Medical Research, 9, 41-49.
https://doi.org/10.6000/1929-6029.2020.09.05
[16]
Consul, P.C. and Shoukri, M.M. (1984) Maximum Likelihood Estimation of the Generalized Poisson Distribution. Communications in Statistics, Theory and Methods, 13, 1533-1547. https://doi.org/10.1080/03610928408828776
[17]
Cochran, W.G. (1954) Some Methods for Strengthening the Common X2 Tests. Biometrics, 10, 417-451. https://doi.org/10.2307/3001616
[18]
Moran, P.A.P. (1970) On Asymptotically Optimal Test of Composite Hypotheses. Biometrika, 57, 47-55. https://doi.org/10.1093/biomet/57.1.47
[19]
Al-Gahtani, S., Shoukri, M. and Al-Eid, M. (2021) Predictors of the Aggregate of COVID-19 Cases and Its Case-Fatality: A Global Investigation Involving 120 Countries. Open Journal of Statistics, 11, 259-277. https://www.scirp.org/journal/ojs
https://doi.org/10.4236/ojs.2021.112014
[20]
Cockwell, P. and Fisher, L.-A. (2017) Global, Regional, and National Burden of Chronic Kidney Disease, 1990-2017: A Systematic Analysis for the Global Burden of Disease Study. The Lancet, 395, 709-733.
[21]
Rottoli, M., Bernante, P., Garelli, S. and Gianella, M. (2020) How Important Is Obesity as a Risk Factor for Respiratory Failure, Intensive Care Admission and Death in Hospitalized COVID-19 Patients? Results from a Single Italian Center. European Journal of Endocrinology, 183, 389-397. https://doi.org/10.1530/EJE-20-0541
Rashed, E.A., Kodera, S., Gomez-Tames, J. and Hirata, A. (2020) Correlation between COVID-19 Morbidity and Mortality Rates in Japan and Local Population Density, Temperature, and Absolute Humidity. International Journal of Environmental Research and Public Health, 17, 5447.
https://doi.org/10.3390/ijerph17155477
[24]
GBD Colorectal Cancer Collaborators (2019) The Global, Regional, and National Burden of Colorectal Cancer and Its Attributable Risk Factors in 195 Countries and Territories, 1990-2017: A Systematic Review for the Global Burden of Disease Study 2017. The Lancet Gastroenterology and Hepatology, 4, 913-933.
[25]
Hastie, T.J. and Tibshirani, R.J. (1990) Generalized Additive Models. Chapman & Hall/CRC, London.
[26]
Ruppert, D., Wand, M.P. and Carroll, R.J. (2003) Semiparametric Regression. Cambridge University Press, Cambridge.
https://doi.org/10.1017/CBO9780511755453
[27]
Wood, S.N. (2017) Generalized Additive Models: An Introduction with R. Second Edition, CRC Press, Boca Raton. https://doi.org/10.1201/9781315370279