全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

Growth Estimators and Confidence Intervals for the Mean of Negative Binomial Random Variables with Unknown Dispersion

DOI: 10.1155/2013/602940

Full-Text   Cite this paper   Add to My Lib

Abstract:

The negative binomial distribution becomes highly skewed under extreme dispersion. Even at moderately large sample sizes, the sample mean exhibits a heavy right tail. The standard normal approximation often does not provide adequate inferences about the data's expected value in this setting. In previous work, we have examined alternative methods of generating confidence intervals for the expected value. These methods were based upon Gamma and Chi Square approximations or tail probability bounds such as Bernstein's inequality. We now propose growth estimators of the negative binomial mean. Under high dispersion, zero values are likely to be overrepresented in the data. A growth estimator constructs a normal-style confidence interval by effectively removing a small, predetermined number of zeros from the data. We propose growth estimators based upon multiplicative adjustments of the sample mean and direct removal of zeros from the sample. These methods do not require estimating the nuisance dispersion parameter. We will demonstrate that the growth estimators' confidence intervals provide improved coverage over a wide range of parameter values and asymptotically converge to the sample mean. Interestingly, the proposed methods succeed despite adding both bias and variance to the normal approximation. 1. Introduction Confidence intervals are routinely applied to limited samples of data based upon their asymptotic properties. For instance, the central limit theorem states that the sample mean will have an approximately Normal distribution for large sample sizes provided that the data’s second moment is finite. This Normal approximation is a fundamental tool for inferences about the data’s expected value in a wide variety of settings. Confidence intervals for the mean are often based upon Normal quantiles even when the sample size is very moderate (e.g. 30 or 50). However, the Normal approximation’s quality cannot be ensured for highly skewed distributions [1]. In this setting, the sample mean may converge to the Normal in distribution at a much slower rate. The Negative Binomial distribution is known to have an extremely heavy right tail, especially under high dispersion. In previous work [2], we established that the Normal confidence interval significantly undercovers the mean at moderate sample sizes. We also suggested alternatives based upon Gamma and Chi Square approximations along with tail probability bounds such as Bernstein’s inequality. We now propose growth estimators for the mean. These estimators seek to account for the relative overrepresentation

References

[1]  R. R. Wilcox, Robust Estimation and Hypothesis Testing, Academic Press, Burlington, Mass, USA, 2005.
[2]  D. Shilane, S. N. Evans, and A. E. Hubbard, “Confidence intervals for negative binomial random variables of high dispersion,” International Journal of Biostatistics, vol. 6, no. 1, article 10, 2010.
[3]  J. M. Hilbe, Negative Binomial Regression, Cambridge University Press, Cambridge, UK, 2007.
[4]  S. J. Clark and J. N. Perry, “Estimation of the negative binomial parameter κ by maximum quasi-likelihood,” Biometrics, vol. 45, no. 1, pp. 309–316, 1989.
[5]  W. W. Piegorsch, “Maximum likelihood estimation for the negative binomial dispersion parameter,” Biometrics, vol. 46, no. 3, pp. 863–867, 1990.
[6]  J. Aragón, D. Eberly, and S. Eberly, “Existence and uniqueness of the maximum likelihood estimator for the two-parameter negative binomial distribution,” Statistics & Probability Letters, vol. 15, no. 5, pp. 375–379, 1992.
[7]  C. Ferreri, “On the ML-estimator of the positive and negative two-parameter binomial distribution,” Statistics & Probability Letters, vol. 33, no. 2, pp. 129–134, 1997.
[8]  E. P. Pieters, C. E. Gates, J. H. Matis, and W. L. Sterling, “Small sample comparison of different estimators of negative binomial parameters,” Biometrics, vol. 33, no. 4, pp. 718–723, 1977.
[9]  B. Efron and R. J. Tibshirani, An Introduction to the Bootstrap, vol. 57 of Monographs on Statistics and Applied Probability, Chapman and Hall, Boca Raton, Fla, USA, 1993.
[10]  G. Casella and R. L. Berger, Statistical Inference, Wadsworth & BrooksSoftware, Pacific Grove, Calif, USA, 1990.
[11]  C. J. Clopper and E. S. Pearson, “The use of confidence or fiducial limits illustrated in the case of the binomial,” Biometrika, vol. 26, pp. 404–413, 1934.
[12]  E. L. Crow and R. S. Gardner, “Confidence intervals for the expectation of a Poisson variable,” Biometrika, vol. 46, pp. 441–453, 1959.
[13]  T. E. Sterne, “Some remarks on confidence or fiducial limits,” Biometrika, vol. 41, pp. 275–278, 1954.
[14]  S. N. Bernstein, “Teoriya Veroiatnostei,” Publisher Unknown, 1934, (Russian).
[15]  G. Bennett, “Probability inequalities for the sum of independent random variables,” Journal of the American Statistical Association, vol. 57, pp. 33–45, 1962.
[16]  G. Bennett, “On the probability of large deviations from the expectation for sums of bounded, independent random variables,” Biometrika, vol. 50, pp. 528–535, 1963.
[17]  W. Hoeffding, “Probability inequalities for sums of bounded random variables,” Journal of the American Statistical Association, vol. 58, pp. 13–30, 1963.
[18]  A. C. Berry, “The accuracy of the Gaussian approximation to the sum of independent variates,” Transactions of the American Mathematical Society, vol. 49, pp. 122–136, 1941.
[19]  C.-G. Esseen, “On the Liapounoff limit of error in the theory of probability,” Arkiv f?r Matematik, vol. 28, no. 9, pp. 1–19, 1942.
[20]  C. G. Esseen, “A moment inequality with an application to the central limit theorem,” Scandinavian Actuarial Journal, vol. 39, pp. 160–170, 1956.
[21]  P. van Beek, “An application of Fourier methods to the problem of sharpening the Berry-Esseen inequality,” Zeitschrift für Wahrscheinlichkeitstheorie und verwandte Gebiete, vol. 23, pp. 187–196, 1972.
[22]  M. A. Rosenblum and M. J. van der Laan, “Confidence intervals for the population mean tailored to small sample sizes, with applications to survey sampling,” International Journal of Biostatistics, vol. 5, article 10, 2009.
[23]  M. J. van der Laan, D. B. Rubin, et al., “Estimating function based crossvalidation and learning,” Tech. Rep. 180, Division of Biostatistics, University of California-Berkeley, 2005.
[24]  A. Agresti and B. A. Coull, “Approximate is better than “exact” for interval estimation of binomial proportions,” The American Statistician, vol. 52, no. 2, pp. 119–126, 1998.
[25]  D. Shilane and D. Bean, “Two-sample inference in highly dispersed negative binomial models,” The American Statistician, 2011.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133