MODELS FOR ANALYZING OVER-DISPERSED HURDLE NEGATIVE BINOMIAL REGRESSION MODEL: AN APPLICATION TO MANUFACTURED CIGARETTE USE
DOI:
https://doi.org/10.13052/jrss2229-5666.1225%20Keywords:
Manufactured Cigarettes, Tobacco, Over Dispersed, Hurdle Negative Binomial, BangladeshAbstract
Our main aim is to identify the factors that influence the use of manufactured cigarettes among tobacco users especially those whose age is above fifteen. Among the tobacco users, a large portion of adult does not take manufactured cigarettes but take other tobacco. As a result, we need to construct a model that can handle the existence of excess zero counts and the over-dispersed phenomenon. Motivated by these facts, in this paper, we propose to apply the Hurdle Negative Binomial (HNB) regression model to discover the relationships between uses of manufactured cigarettes and social factors. The data were found to have excess zeros (35%); moreover, the variance is 47.122, which is much higher than its mean 5.933. With excess zeros and high variability of non-zero outcomes, the HNB model was found to be better fitted.
Downloads
References
Aguirre-Torres, V. and Gallant, A. R. ( 1983). The null and non-null asymptotic distribution of the Cox test for multivariate nonlinear regression: alternatives and a new distribution-free Cox test, Journal of Econometrics, 21, p. 5-33.
Arndt, C. (2004). Information Measures: Information and its Description in Science and Engineering, Springer, ISBN 978-3-540-40855-0.
Baetschmann, G. and Winkelmann, R. (2014). A Dynamic Hurdle Model for Zero- Inflated Count Data: With an Application to Health Care Utilization, University of Zurich, Working Paper Series, Paper No. 151, JEL Classification: C25, I10.
Bandopadhyaya, A. (1994). an estimation of the hazard rate of firms under Chapter 11 protection,ReviewofEconomicsandStatistics, 76(2),p. 346-350.
Bohara, A.K. and Krieg R.G. (1996). A Poisson hurdle model of migration frequency, The Journal of Regional Analysis and Policy,26(1), p. 37-45.
Burnham, K.P. and Anderson, D.R. (1998). Model Selection and Inference, Springer – Verlag, New York.
Cameron, A. C. and Trivedi, P. K. (1998). Regression Analysis of Count Data, Cambridge: Cambridge University Press.
Cameron, A.C. and Windmeijer, F. A.G. (1996). R-squared measures for count data regression models with applications to health care utilization, Journal of Business and Economic Statistics, 14(2), p. 209-220.
Cameron, A.C. and P.K. Trivedi (1986).Econometric models based on count data: comparisons and applications of some estimators and tests, Journal of Applied Econometrics, 1, p. 29-54.
Cameron, A.C. and Trivedi, P.K. (1990). Regression based tests for over- dispersion in the Poisson model, Journal of Econometrics, 46(3), p. 347-364
Cameron, A.C. and Trivedi, P.K. (1993). Tests of independence in parametric models with applications and illustrations, Journal of Business and Economic Statistics, 11, p. 29-43.
Cameron, A.C., Trivedi, P.K., Milne, F. and Piggott, J. (1988). A micro econometric model of the demand for health insurance and health care in Australia’, Review of Economic Studies, 55(1), p. 85-106.
Chandhiok, N., Dhillon, B. S., Kambo, I. and Saxena N.C. (2006). Determinants of antenatal care utilization in rural areas of India: A cross- sectional study from 28 districts (An ICMR task force study), J. Obstet. Gynecol. India, 56(1), p. 47-52.
Chao, A. and Shen, Tsung-Jen (2003). Nonparametric estimation of Shannon's index of diversity when there are unseen species in sample, Environ. Ecol. Stat., 10, p. 429–443.
Chen, X.., Hong, H. and Shum, M. (2007). Nonparametric likelihood ratio model selection tests between parametric likelihood and moment condition models, Journal of Econometrics, 141, p. 109-140.
Chernoff, H. (1954). On the distribution of the likelihood ratio, Annals of Mathematical Statistics, 25, p. 573-578.
Chipeta, M.G., Ngwira, B.M., Simoonga, C. and Kazembe L. N. (2014). Zero adjusted models with applications to analyzing helminths count data, BMC Research Notes, 7(1), p. 856.
Cover, T. M. and Thomas, J. A. (2006). Elements of Information Theory, 2nd Edition,Wiley Interscience, ISBN 0-471-24195-4.
Dawit Sekata (2015). Modeling the Number of Antenatal Care Service Visits Among Pregnant Women in Rural Ethiopia: Zero Inflated and Hurdle Model Specifications, International Journal of Healthcare Sciences, 3(1), p. 332-355.
Desmarais, Bruce A. and Jeffrey J. Harden (2013). Testing for Zero-Inflation in Count Models: Bias Correction for the Vuong Test, The Stata Journal, 13(4), p. 810–835.
Horvitz, D.G., and D. J. Thompson(1952). A generalization of sampling without replacement from a finite universe, J. Am. Stat. Assoc. 47, p. 663-685.
Linden, A. and Mantyniemi, S. (2011). Using the negative binomial distribution to model overdispersion in ecological count data, Ecology, 92(7), p. 1414–1421.
Liu,W.S. and Cela, J. (2008). Count Data Models in SAS®, Statistics and Data Analysis, SAS Global Forum, Statistics and Data Analysis, paper 371.
MacDonald, R. B. (2014, March 24). Overdispersion and Poisson regression, J Quant Criminol,p. 269-284.
Makalic, D. F. (November 22,2008). Model selection Tutorial#1:Akaike's Information Criterion, In Model selection with AIC, Melbourne.
Mullahy, J. (1986). Specification and testing of some modified count data models, Journal of Econometrics, 33, p. 341-365.
Simkhada, B., Teijlingen E. R., Porter M. and Simkhada P. (2008). Factor affecting the utilization of antenatal care in developing countries: systematic review of the literature, Journal of Advanced Nursing, 61(3), p. 244-260.
Winkelmann, R. (1994). Count Data Models: Econometric Theory and an Application to Labor Mobility, Springer-Verlag, Berlin.
Winkelmann, R. and Zimmermann (1991). A new approach for modeling economic count data, Economics Letters, 37, p.139-143.