All Title Author
Keywords Abstract

Multilevel Modeling of Binary Outcomes with Three-Level Complex Health Survey Data

DOI: 10.4236/ojepi.2017.71004, PP. 27-43

Keywords: Health Care Utilization, Complex Health Survey with Sampling Weights, Simulations for Complex Survey, Pseudo Likelihood, Three-Level Data

Full-Text   Cite this paper   Add to My Lib


Complex survey designs often involve unequal selection probabilities of clus-ters or units within clusters. When estimating models for complex survey data, scaled weights are incorporated into the likelihood, producing a pseudo likeli-hood. In a 3-level weighted analysis for a binary outcome, we implemented two methods for scaling the sampling weights in the National Health Survey of Pa-kistan (NHSP). For NHSP with health care utilization as a binary outcome we found age, gender, household (HH) goods, urban/rural status, community de-velopment index, province and marital status as significant predictors of health care utilization (p-value < 0.05). The variance of the random intercepts using scaling method 1 is estimated as 0.0961 (standard error 0.0339) for PSU level, and 0.2726 (standard error 0.0995) for household level respectively. Both esti-mates are significantly different from zero (p-value < 0.05) and indicate consid-erable heterogeneity in health care utilization with respect to households and PSUs. The results of the NHSP data analysis showed that all three analyses, weighted (two scaling methods) and un-weighted, converged to almost identical results with few exceptions. This may have occurred because of the large num-ber of 3rd and 2nd level clusters and relatively small ICC. We performed a sim-ulation study to assess the effect of varying prevalence and intra-class correla-tion coefficients (ICCs) on bias of fixed effect parameters and variance components of a multilevel pseudo maximum likelihood (weighted) analysis. The simulation results showed that the performance of the scaled weighted estimators is satisfactory for both scaling methods. Incorporating simulation into the analysis of complex multilevel surveys allows the integrity of the results to be tested and is recommended as good practice.


[1]  Carle, A.C. (2009) Fitting Multilevel Models in Complex Survey Data with Design Weights: Recommendations. BMC Medical Research Methodology, 9, 49.
[2]  Goldstein, H. (2003) Multilevel Statistical Models. 3rd Edition.
[3]  Rabe-Hesketh, S. and Skrondal, A. (2006) Multilevel Modelling of Complex Survey Data. Journal of the Royal Statistical Society: Series A (Statistics in Society), 169, 805-827.
[4]  Grilli, L. and Pratesi, M. (2004) Weighted Estimation in Multilevel Ordinal and Binary Models in the Presence of Informative Sampling Designs. Survey Methodology, 30, 93-104.
[5]  Chambers, R.L. and Skinner, C.J. (2003) Analysis of Survey Data. John Wiley & Sons.
[6]  Longford, N.T. (1996) Model-Based Variance Estimation in Surveys with Stratified Clustered Designs. Australian Journal of Statistics, 38, 333-352.
[7]  Pfeffermann, D., Skinner, C.J., Holmes, D.J., Goldstein, H. and Rasbash, J. (1998) Weighting for Unequal Selection Probabilities in Multilevel Models. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 60, 23-40.
[8]  Asparouhov, T. (2006) General Multi-Level Modeling with Sampling Weights. Communications in Statistics—Theory and Methods, 35, 439-460.
[9]  Stapleton, L.M. (2002) The Incorporation of Sample Weights into Multilevel Structural Equation Models. Structural Equation Modeling, 9, 475-502.
[10]  Jenkins, F. (2008) Multilevel Analysis with Informative Weights. Proceedings of the Joint Statistical Meeting, ASA Section on Survey Research Methods, 2225-2233.
[11]  Potthoff, R.F., Woodbury, M.A. and Manton, K.G. (1992) Equivalent Sample Size and Equivalent Degrees of Freedom Refinements for Inference Using Survey Weights under Superpopulation Models. Journal of the American Statistical Association, 87, 383-396.
[12]  Mallick, M.D. (1992) Sample Design for the National Health Survey of Pakistan. Pakistan Journal of Medical Sciences, 31, 289-290.
[13]  Hadden, W.C., Pappas, G. and Khan, A.Q. (2003) Social Stratification, Development and Health in Pakistan: An Empirical Exploration of Relationships in Population-Based National Health Examination Survey Data. Social Science & Medicine, 57, 1863-1874.
[14]  Skrondal, R.-H. (2008) Multilevel and Longitudinal Modeling Using Stata. STATA Press, College Station.
[15]  Fatmi, Z., Hadden, W.C., Razzak, J.A., Qureshi, H.I., Hyder, A.A. and Pappas, G. (2007) Incidence, Patterns and Severity of Reported Unintentional Injuries in Pakistan for Persons Five Years and Older: Results of the National Health Survey of Pakistan 1990-94. BMC Public Health, 7, 152.
[16]  Pappas, G., Akhtar, T., Gergen, P.J., Hadden, W.C. and Khan, A.Q. (2001) Health Status of the Pakistani Population: A Health Profile and Comparison with the United States. American Journal of Public Health, 91, 93.
[17]  Burton, A., Altman, D.G., Royston, P. and Holder, R.L. (2006) The Design of Simulation Studies in Medical Statistics. Statistics in Medicine, 25, 4279-4292.
[18]  Collins, L.M., Schafer, J.L. and Kam, C.-M. (2001) A Comparison of Inclusive and Restrictive Strategies in Modern Missing Data Procedures. Psychological Methods, 6, 330.
[19]  Snijders, T.A.B. and Bosker, R.J. (1999) Multilevel Analysis. An Introduction to Basic and Advanced Multilevel Modeling. Sage, London.
[20]  Kovacevic, M.S. and Rai, S.N. (2003) A Pseudo Maximum Likelihood Approach to Multilevel Modelling of Survey Data. Communications in Statistics-Theory and Methods, 32, 103-121.
[21]  Moineddin, R., Matheson, F.I. and Glazier, R.H. (2007) A Simulation Study of Sample Size for Multilevel Logistic Regression Models. BMC Medical Research Methodology, 7, 34.
[22]  AbouZahr, C. and Boerma, T. (2005) Health Information Systems: The Foundations of Public Health. Bulletin of the World Health Organization, 83, 578-583.


comments powered by Disqus

Contact Us


微信:OALib Journal