A NON-STUDENT DISTRIBUTION OF JACK-KNIFE RESIDUALS AND IDENTIFICATION OF OUTLIERS
Keywords:
Jack Knife Residuals, External Studentized Residual, Outliers, T-Ratio, F- Ratio, Moments, Critical Points.Abstract
This paper proposes the exact distribution of Jack-Knife residual which is formally called as external studentized residual and used to evaluate the outliers in linear multiple regression analysis. The authors have proved that the Jackknife residuals do not follow student’s t-distribution and they have explored the relationship among Jack-Knife residual, t-ratio and F- ratio and have expressed the derived density function of the residual in terms of series expression form. Moreover, the new form of the distribution is symmetric, first two moments of the distribution are derived and the authors have computed the critical points of Jack Knife residual at 5% and 1% level of significance and for varying sample sizes and predictors. Finally, the numerical example shows that the results extracted from the proposed approach and classical approach are similar even though the proposed distribution of the Jackknife residuals is different.
Downloads
References
Ali S. Hadi (1992). A new measure of overall potential influence in linear
regression, Computational Statistics & Data Analysis, 14(1), p. 1-27.
Behnken, D. W. and Draper, N. R. (1972). Residuals and their tariance
patterns, Technometrics, 14, p. 101-111.
Belsley, D.A., Kuh, E. and Welsch, R.E. (1980). Regression Diagnostics:
Identifying Influential Data and Sources of Collinearity, Wiley, New York.
Cook, R.D. (1977). Detection of influential observation in linear regression,
Technometrics, 19, p. 15-18.
Chatterjee, S. and Hadi, A. S. (1988). Sensitivity Analysis in Linear
Regression, New York: John Wiley and Sons.
Cook, R. D. and Weisberg, S. (1982). Residuals and Influence in Regression,
Vol. 5, New York: Chapman and Hall.
Davies, R. B. and Hutton, B. (1975). The effects of errors in the independent
variables in linear regression, Biometrika, 62, p. 383-391.
Dı́az-Garcı́a, J. A., and González-Farı́as, G. (2004). A note on the Cook's
distance, Journal of Statistical Planning and Inference, 120(1), p. 119-136.
Eubank, R.L(1985). Diagnostics for smoothing splines, J. Roy. Statist. Soc.
Ser. B, 47, p. 332–341.
Hoaglin, D.C, and Welsch, R.E. (1978). The Hat matrix in regression and
ANOVA, The Amer. Statist., 32, p. 17-22.
Huber, P. J. (1975). Robustness and Designs, in A Survey of Statistical Design
and Linear Models. North-Holland, Amsterdam.
Kendall, M.G., Stuart, A. (1973). The Advanced Theory of Statistics, Volume
: Inference and Relationship, Griffin
Kim, C. (1996). Cook’s distance in spline smoothing, Statist. Probab. Lett., 31,
p. 139–144.
Kim, C. and Kim, W. (1998). Some diagnostics results in nonparametric
density estimation, Comm. Statist., Theory Methods, 27, p. 291–303.
Kim, C., Lee, Y. and Park, B.U. (2001). Cook’s distance in local polynomial
regression, Statist. Probab. Lett., 54, p. 33–40.
Lund, R. E. (1975). Tables for an approximate test for outliers in linear
models, Technometrics, 17, p. 473-476.
Silverman, B.W. (1985). Some aspects of the spline smoothing approach to
non-parametric regression curve fitting (with discussion), J. Roy. Statist. Soc.
Ser. B 47, p. 1–52.
Thomas, W (1991). Influence diagnostics for the cross-validated smoothing
parameter in spline smoothing, J. Amer. Statist. Assoc., 86, p. 693–698.
Weisberg, S. (1980). Applied linear regression, New York: Wiley.