Background: Bivariate count data are commonly encountered in medicine, biology, engineering, epidemiology and many other applications. The Poisson distribution has been the model of choice to analyze such data. In most cases mutual independence among the variables is assumed, however this fails to take into accounts the correlation between the outcomes of interests. A special bivariate form of the multivariate Lagrange family of distribution, names Generalized Bivariate Poisson Distribution, is considered in this paper. Objectives: We estimate the model parameters using the method of maximum likelihood and show that the model fits the count variables representing components of metabolic syndrome in spousal pairs. We use the likelihood local score to test the significance of the correlation between the counts. We also construct confidence interval on the ratio of the two correlated Poisson means. Methods: Based on a random sample of pairs of count data, we show that the score test of independence is locally most powerful. We also provide a formula for sample size estimation for given level of significance and given power. The confidence intervals on the ratio of correlated Poisson means are constructed using the delta method, the Fieller’s theorem, and the nonparametric bootstrap. We illustrate the methodologies on metabolic syndrome data collected from 4000 spousal pairs. Results: The bivariate Poisson model fitted the metabolic syndrome data quite satisfactorily. Moreover, the three methods of confidence interval estimation were almost identical, meaning that they have the same interval width.
References
[1]
Winkelmann, R. (2008) Econometric Analysis of Count Data. Springer Science & Business Media.
[2]
Cameron, A.C. and Trivedi, P.K. (2013) Regression Analysis of Count Data. 2nd Edition, Cambridge University Press. https://doi.org/10.1017/cbo9781139013567
[3]
Tzougas, G. and di Cerchiara, A.P. (2021) Bivariate Mixed Poisson Regression Models with Varying Dispersion. North American Actuarial Journal, 27, 211-241. https://doi.org/10.1080/10920277.2021.1978850
[4]
Shenton, L.R. and Consul, P.C. (1973) On Bivariate Lagrange and Borel-Tanner Distributions and Their Use in Queuing Theory. Sankhya, 35, 229-236.
[5]
Jain, G.C. and Singh, N. (1975) On Bivariate Power Series Distributions Associated with Lagrange Expansion. Journal of the American Statistical Association, 70, 951-954. https://doi.org/10.1080/01621459.1975.10480329
[6]
Shoukri, M.M. (1982) On the Generalization and Estimation for the Double Poisson Distribution. Trabajos de Estadistica y de InvestigacionOperativa, 33, 97-109. https://doi.org/10.1007/bf02888625
[7]
Kendall, M. and Ord, K. (1987) Advanced Theory of Statistics. 5th Edition, Griffen and Company.
[8]
Consul, P.C. and Jain, G.C. (1973) A Generalization of the Poisson Distribution. Technometrics, 15, 791-799. https://doi.org/10.1080/00401706.1973.10489112
[9]
Laaksonen, D.E. (2002) Metabolic Syndrome and Development of Diabetes Mellitus: Application and Validation of Recently Suggested Definitions of the Metabolic Syndrome in a Prospective Cohort Study. American Journal of Epidemiology, 156, 1070-1077. https://doi.org/10.1093/aje/kwf145
[10]
Lakka, H. (2002) The Metabolic Syndrome and Total and Cardiovascular Disease Mortality in Middle-Aged Men. Journal of the American Medical Association, 288, 2709-2716. https://doi.org/10.1001/jama.288.21.2709
[11]
Carmelli, D., Cardon, L.R. and Fabsitz, R. (1994) Clustering of Hypertension, Diabetes, and Obesity in Adult Male Twins: Same Genes or Same Environments? American Journal of Human Genetics, 55, 566-573.
[12]
Edwards, K.L., Newman, B., Mayer, E., Selby, J.V., Krauss, R.M. and Austin, M.A. (1997) Heritability of Factors of the Insulin Resistance Syndrome in Women Twins. Genetic Epidemiology, 14, 241-253. https://doi.org/10.1002/(sici)1098-2272(1997)14:3<241::aid-gepi3>3.0.co;2-8
[13]
Hong, Y., Pederson, N.L., et al. (1997) Genetic and Environmental Architecture of the Features of Insulin Resistance Syndrome. American Journal of Human Genetics, 60, 143-152.
[14]
Lee, K.E., Klein, B.E. and Klein, R. (2003) Familial Aggregation of Components of the Multiple Metabolic Syndrome in the Framingham Heart and Offspring Cohorts: Genetic Analysis Workshop Problem. BMC Genetics, 4, Article No. S94. https://doi.org/10.1186/1471-2156-4-s1-s94
[15]
Park, H.S., Park, J.Y. and Cho, S. (2006) Familial Aggregation of the Metabolic Syndrome in Korean Families with Adolescents. Atherosclerosis, 186, 215-221. https://doi.org/10.1016/j.atherosclerosis.2005.07.019
[16]
Borch-Johnson, K. (2007) The Metabolic Syndrome in a Global Perspective. Danish Medical Bulletin, 54, 157-159.
[17]
Cameron, A.J., Shaw, J.E. and Zimmet, P.Z. (2004) The Metabolic Syndrome: Prevalence in Worldwide Populations. Endocrinology and Metabolism Clinics of North America, 33, 351-375. https://doi.org/10.1016/j.ecl.2004.03.005
[18]
World Health Organization (1999) Definition, Diagnosis, and Classification of Diabetes Mellitus and Its Complications. Report of a WHO Consultation. Part 1: Diagnosis and Classification of Diabetes Mellitus.
[19]
Al-Gahtani, S., Shoukri, M. and Al-Eid, M. (2021) Predictors of the Aggregate of COVID-19 Cases and Its Case-Fatality: A Global Investigation Involving 120 Countries. Open Journal of Statistics, 11, 259-277. https://doi.org/10.4236/ojs.2021.112014