Estimation of Average Paddy Production of Pira Nagar Village at Barabanki District in India
S. K. Yadav*, Shanya Baghel, Sugandha Saxena and Avinash Kumar Singh
Department of Statistics, Babasaheb Bhimrao Ambedkar University, Lucknow, India
E Mail: drskystats@gmail.com; baghel.shanya@gmail.com; saxenasugandha0911@gmail.com; avinashsingh543@gmail.com
Corresponding Author
The idea of the present paper is the use of the known information on study variable for enhanced estimation of average paddy production of Pira Nagar village at Barabanki District in India under the Simple Random Sampling Scheme. This known information is utilzed in the form of median of primary variable as it is readily available and does not require every unit of the population to be inquired. The Bias and MSE of the suggested estimator are derived up to approximation of degree one.The minimum value of the MSE of suggested estimator is also obtained by optimizing the characterizing scalar. The MSE has also been compared with the considered competing estimators both theoretically and empirically. The theoretical efficiency conditions of the suggested estimator to be better than the considered estimators are verified using natural population on primary data collected from Pira Nagar Village at Barabanki District of Uttar Pradesh state in India.
Keywords: Study variable, median of study variable, simple random sampling, bias, MSE, PRE
Cochran (1940) was the first who used the auxiliary information for enhancing the efficiency of the estimator of population mean. Since then a number of researchers have modified the usual ratio estimator, utilizing the auxiliary information in the form of its various parameters. Taking data on auxiliary variable requires extra cost. Sometimes it is not feasible to bear the extra cost and sometimes it becomes impossible to get information on auxiliary variable. In such situations we need to find an alternative to the auxiliary information. One of the solutions for such situations is to obtain some easily accessible chracteristics on study variable itself as supplementary information. For example Let the study variable is monthly salary of the workers working at a place. Most of the workers are unwilling to reveal their exact salary or in case of free lancers they don’t have a fixed salary. But they can tell whether their salary lies between 10000–15000, 15000–20000 and so on. Hence it is easier for them to tell the range within which their earning lies. In such kind of situations median can be obtained for the whole data and can be used for elevated estimation of average salary of the workers. In the present investigation, we make use of the known median of the main variable to obtain the estimate of average production of paddy crop at Pira Nagar Village of Barabanki District of Uttar Prdesh state in India.
Let be population containing N units which are distinct and may be identified. The problem is to estimate the population mean of the main variable Y with higher efficiency. The most suitable estimator for is the sample mean . When it costs high to get auxiliary information, we may consider additional information on Y and may suggest a modified ratio type estimator for enhanced estimation of . If we go for biased estimator, we can obtain much lesser MSE than the variance of and MSE/Variance of other existing biased and unbiased estimators of .
N: | Population Size |
n: | Sample Size |
f: : | Sampling Fraction |
: | All possible samples of size n |
Y: | Study variable |
M: | Median of the Y |
X: | Auxiliary variable |
: | Population means |
: | Sample means |
: | Correlation coefficient between X and Y |
: | Regression coefficient of Y on X |
: | Coefficient of Skewness of X |
: | Coefficient of Kurtosis of X |
: | Average of sample medians of Y |
m: | Sample median of Y |
: | Interquartile range |
B(): | Bias of the estimator |
V(): | Variance of the estimator |
MSE(): | Mean squared error of the estimator |
: | Quartiles of X |
, , : | Coefficient of variation of x, y and m respectively |
, : | Relative Covariances |
PRE(e,p): | Percentage relative efficiency of the proposed |
: | estimator(p) with respect to the existing estimator (e) |
Variance of Study Variable:
Variance of Auxiliary variable:
where
Mean of medians of possible samples
Variance of Sample Median of Y
Covariance of and
where
Covariance of and
Coefficient of Variations
Under this section, various estimators of along with their biases and MSEs are presented in Table 1. It is well known that in simple random sampling technique (, ) are unbiased estimators for (, ) respectively.
Table 1 Various existing estimators of along with their biases & MSEs
S.No. | Estimators | Bias | MSE/Variance |
1 | Sample Mean | – | |
2 | Watson(1937) | – | |
3 | Cochron(1940) | ||
4 | Goodman(1958) | – | |
5 | Chakrabarty (1979) | ||
6 | Sahai (1980) | ||
7 | Sisodia (1981) | ||
8 | Tuteja(1991) | ||
10 | Upadhyaya(1999) | ||
11 | Kadilar(2003) | ||
12 | Singh(2003) | ||
13 | Singh (2003) | ||
14 | Singh et al. | ||
15 | Yan and Tian (2010) | ||
16 | Yan and Tian (2010) | ||
17 | Yan and Tian (2010) | ||
18 | Yan and Tian (2010) | ||
19 | Pandey (2011) | ||
20 | Al-Omari (2012) | ||
21 | Al-Omari (2012) | ||
22 | Subramani and Kumarpandiyan (2012) | ||
23 | Subramani and Kumarpandiyan (2012) | ||
24 | Swain (2014) | ||
25 | Yadav and Mishra (2015) | ||
26 | Jerjuddin and Kishun (2016) | ||
27 | Soponviwatkul and Lawson (2017) | ||
28 | Soponviwatkul and Lawson (2017) | ||
29 | Ijaz and Ali (2018) | ||
30 | Ijaz and Ali (2018) | ||
31 | Yadav et al. (2019) | – |
Note: We will denote our measures in terms of Coefficient of Variation because it is a relative measure and most suitable to compare two series.
where,
a,b,c,d = Constants or Parametric Value
If the median M of Y is known so by utilizing it, we may suggest an elevated estimator of as,
(1) |
Where is chosen such that MSE(t) is minimum. The bias and MSE of t up to approximation of order one is given by,
(2) | ||
(3) |
where
Hence
(4) | ||
(5) |
Let
So,
(6) | ||
(7) | ||
(8) | ||
(9) | ||
(10) |
Hence the Estimator can be rewritten as,
(11) |
For Biasedness,
(from Equation (6), (3.1) and (8))
Hence Biasedness is :
(12) |
For MSE,
(second and higher order terms are ignored) | |
Hence,
(13) |
(from equation (8), (9) and (10))
Minimum Value of
For Minimum value of , we should have
So,
(14) |
For Minima,
Hence has a minimum value.
Minimum Value of MSE
Hence Minimum value of MSE(t) is obtained by putting the Value of in Equation (3.3)
(15) |
In this section the suggested estimator is being compared with the competing estimators of and the efficiency conditions are presented in Table 2.
Table 2 Efficiency comparison
S.No. | Condition | |
1 | ||
2 | ||
3 | ||
4 | ||
5 | ||
6 | ; | |
7 | ||
8 | ||
9 | ||
10 | ||
11 |
Under this section the efficiency conditions of the suggested estimator over competing estimators are verified using real data sets.
To verify the results we have obtained for the Paddy Production data from the Pira Nagar Village of Barabanki Distrcit.The details of the obtained data is as follows:
Village | Pira Nagar |
District | Barabanki |
Time | March 2018 |
Production | Paddy Production |
Type of Data | Primary Data |
Information Taken | Name of Resident |
Their Area of Cultivation (Unit in Hectares) | |
Yield obtained for each area ( Unit in Quintals): | |
one Quintal=100 Kilogram |
For our Numerical Justification we have taken:
Study Variable | Yield | denoted as Y |
Auxiliary Variable | Area of Cultivation | denoted as X |
Population Size | 52 | |
Sample Size | 3 |
Parameter | Value | Units in |
N | 52 | – |
n | 3 | – |
22100 | – | |
14.721 | Quintal | |
0.46227 | Hectare | |
M | 10 | Quintal |
0.8046229 | – | |
190.8668 | Quintal | |
0.15675383 | Hectare | |
4.401155 | – | |
0.8807379 | – | |
0.7335474 | – | |
4.401155 | – | |
8.1028894 | – | |
14.146291 | – | |
28.0769 | – | |
0.4040 | Hectare | |
0.5050 | Hectare | |
0.2525 | Hectare | |
12.0119 | Quintal | |
V(m) | 37.7429 | Quintal |
0.2536 | – | |
0.3774 | – |
On the basis of the data, The numerical values related to proposed estimators are obtained as follows:
t | 14.022176 |
B(t) | 1.5962096 |
9.8921161 | |
14.211079 |
We have constructed a table for numerical comparison of the suggested estimator with the estimators in competition. The following table gives:
• The values of the suggested and competing estimators on the Basis of the data obtained and sample taken.
• The Biases of competing and suggested Estimators.
• Mean Square Error (MSE) of competing and Proposed Estimators.
• The percentage relative efficiencies (PRE) of the suggested over competing estimators of .
Table 4 Biases, MSE’s and respective PRE’s
Estimator | Bias | MSE | PRE |
0 | 59.951751 | 421.866 | |
0 | =21.137907 | 148.74245 | |
0.40139256 | 21.837168 | 153.66298 | |
0 | 21.837168 | 153.66298 | |
1.1203056 | 21.137907 | 148.74245 | |
2.3629292 | 21.137907 | 148.74245 | |
0.63149751 | 35.223464 | 247.8592 | |
0.22328973 | 28.411333 | 199.92382 | |
0.078904053 | 57.591622 | 405.25862 | |
0.0076015701 | 21.138228 | 148.74471 | |
4.1946732 | 83.587598 | 588.18614 | |
0.06976912 | 21.163736 | 148.9242 | |
0.63958709 | 34.472784 | 242.5768 | |
0.091234028 | 57.215612 | 402.6127 | |
0.15151962 | 55.345217 | 389.4512 | |
0.47740469 | 43.707413 | 307.5587 | |
0.086059435 | 57.373669 | 403.7249 | |
0.65903753 | 30.58859 | 215.2447 | |
0.27660967 | 21.493713 | 151.2462 | |
0.65448695 | 29.277583 | 206.0194 | |
0.51534035 | 23.893597 | 168.13359 | |
0.62993932 | 27.185932 | 191.30097 | |
0.64921611 | 28.605068 | 201.2871 | |
0.22328973 | 28.411333 | 199.9238 | |
1.7460142 | 15.928206 | 112.0830 | |
0.33881397 | 49.086159 | 345.4077 | |
0.7940365 | 21.137907 | 148.74245 | |
0.8804347 | 21.137907 | 148.74245 | |
0.1809299 | 21.137907 | 148.74245 | |
1.888758 | 21.137907 | 148.74245 | |
21.137907 | 148.74245 | ||
(t) | 1.5962096 | 14.211079 |
From the Table 4 it is observed from all the estimators that,
6.1 Biasednesses are ranging from 2.3629 to 4.1947.
6.2 Mean Square Errors are ranging from 14.2111 to 83.5875.
6.3 The PREs of the suggested estimator over estimators in competition are ranging from 112.0830 to 588.1861.
6.4 The sample mean (), Usual Regression Estimator (Watson,1937) , Goodman and Hartely’s revised estimator (1958), ; are unbiased for .
6.5 Among the existing estimators, the MSE of the estimator of Yadav and Mishra(2015), is minimum i.e. 15.9282 and MSE of the estimator of Kadilar and Kingi (2003), is maximum i.e. 83.5875.
6.6 The suggested estimator is 1.12 more efficient than and 5.88 more efficient than .
6.7 Since Efficiency is stronger property than the unbiasedness. Hence here we prefer the biased estimator with minimum MSE instead of unbiased estimator with higher MSE.
6.8 It is observed from data that the given below inequalities hold good;
6.9
6.10 The proposed estimator has minimum MSE and comes out to be more efficient than the other estimators which was our aim of study.
7.1 We have applied our proposed estimator successfully for the estimation of Average Paddy production. We can also use it for other agricultural areas and productions with larger sample size and population size.
7.2 In case the exact values are not known but we may know the ranges within which our values may be supposed to lie, this Median based estimator will give a precise result.
7.3 The use of this estimator can be extended to other fields also and can be used as an alternative of SRSWOR sample mean when exact values of characteristic under study are not known.
7.4 Whenever taking the auxiliary information involves very high cost we can use the proposed estimator as an alternative of Ratio and Regression type estimators.
7.5 It has been shown theoretically as well as numerically that the suggested estimator is better than the competing estimators for the estimating .
7.6 Mostly the minimum MSE of any ratio type estimator is equal to the variance of the regression estimator but the suggested ratio type estimator has its MSE less than the variance of the usual regression estimator.
7.7 Therefore, the suggested estimator is highly recommended for the practical applications.
The authors are very much thankful to the Editior-in-Chief of JRSS and the learned referees for critically examining the paper which improved the quality of the earlier draft.
Al-Omari, A. I. (2012). Ratio estimation of the population mean using auxiliary information in simple random sampling and median ranked set sampling, Statistics and Probability Letters, 82, p. 1883–18890.
Bahl, S. and Tuteja, R.K. (1991). Ratio and product type exponential estimator, Information and Optimization Sciences, XII (I), p. 159–163.
Chakrabarty, R.P. (1979). Some ratio estimators, Journal of the Indian Society of Agricultural Statistics, 31(1), p. 49–57.
Cochran, W. G. (1940). The estimation of the yields of the cereal experiments by sampling for the ratio of grain to total produce, The Journal of Agric. Science, 30, p. 262–275.
Goodman, L. A. and Hartley, H.O. (1958).The precision of unbiased ratio-type estimators, Journal of the American Statistical Association, 53(282), p. 491–508.
Hartley, H.O. and Ross, A. (1954). Unbiased ratio estimators, Nature, August, 7, Vol. 174, p. 270–271.
Ijaz, M., and Ali, H. (2018). Some improved ratio estimators for estimating mean of finite population, Research and Reviews: Journal of Statistics and Mathematical Sciences, 4(2), 18–23.
Jerajuddin, M. and Kishun, J. (2016). Modified ratio estimators for population mean using size of the sample selected from population, IJSRSET, 2(2), p. 10–16.
Kadilar, C., and Cingi, H. (2003). A study on the chain ratio type estimator, Hacettepe Journal of Mathematics and Statistics, Vol. 32, p. 105–108.
Pandey, H., Yadav, S.K. and Shukla, A.K. (2011). An Improved General Class of Estimators, Estimating Population Mean using Auxiliary Information, International Journal of Statistics and Systems, Vol. 6(1), p. 1–7.
Quenouille, M. H. (1956). Notes on bias in estimation, Biometrika, 43, p. 353–60.
Sahai, A. and Ray, S.K. (1980). Efficient families of ratio and product type estimators, Biometrika, 67(1), p. 211–215.
Singh, G.N. (2003). On the improvement of product method of estimation in sample surveys. Jour. Ind. Soc. Agri. Statistics, 56(3), p. 267–275.
Singh, H. P., Tailor, R., Tailor, R. and Kakran, M. S. (2004): An improved estimator of population mean using power transformation, Journal of the Indian Society of Agricultural Statistics, 58(2), p. 223–230.
Singh, H.P. and Tailor, R. (2003). Use of known correlation coefficient in estimating the finite population mean, Statistics in Transition, 6(4), p. 555–560.
Sisodia, B.V.S. and Dwivedi, V.K. (1981). A modified ratio estimator using coefficient of variation of auxiliary variable, Journal of the Indian Society of Agricultural Statistics, 33, p. 13–18.
Soponviwatkal, K., and Lawson, N. (2017). New ratio estimators for estimating population mean in simple random sampling using a coefficient of variation, correlation coefficient and a regression coefficient, Gazi University Journal of Science, 30(4), p. 610–621.
Subramani, J. and Kumarapandiyan, G. (2012a). Modified ratio estimator for population mean using median of the auxiliary variable, Proceedings of National Conference on Recent developments in the applications of Reliability Theory and Survival Analysis “Application of Reliability Theory and Survival Analysis”, Bonfring Publisher, p. 224–231, ISBN:978-93-82338-25-3.
Subramani, J. and Kumarapandiyan, G. (2012b). Estimation of population mean using co-efficient of variation and median of an auxiliary variable. International Journal of Probability and Statistics, 1(4), p. 111–118.
Subramani, J. and Kumarapandiyan, G. (2012c): Modified ratio estimators for population mean using function of quartiles of auxiliary variable, Bonfring International Journal of Industrial Engineering and Management Science, 2(2), p. 19–23.
Sukhatme, P. V. and Sukhatme, B. V. (1970). Sample Theory of Surveys with Applications, Ames : Iowa State University Press.
Swain, A.K.P.C. (2014). On an improved ratio type estimator of finite population mean in sample surveys, Revista Investigacion Operacional, 35(1), p. 49–57.
Upadhyaya, L. N. and Singh, H. P. (1999). Use of transformed auxiliary variable in estimating the finite population mean, Biometrical Journal, 41(5), p. 627–636.
Watson, D.J. (1937). The estimation of leaf area in field crops, Jour. Agr. Sci., 27(3), p. 474–483.
Yadav, S.K. and Mishra, S.S. (2015). developing improved predictive estimator for finite population mean using auxiliary information, Statistika, 95(1), p. 76–85.
Yadav, S.K., Dixit, M. K., Dungana, H.N. and Mishra, S.S. (2019): Improved estimators for estimating average yield using auxiliary variable, International Journal of Mathematical, Engineering and Management Sciences, Vol. 4, No. 5, p. 1228–1238.
Yadav, S.K, Misra, S., Tiwari, V. and Shukla, A.K. (2010). Improved family of ratio-cum-dual to ratio estimators of finite population mean, IFRSA’s Int. Journal of Computing, 1(1), 17–23.
Yadav, S.K, Misra, S., Tiwari, V. and Shukla, A.K. (2014). Use of coefficient of skewness and quartile deviation of auxiliary variable for improved estimation of population mean, International Journal of Engineering Sciences and Research Technology, 3(1), p. 404–410.
Yan, Z. and Tian, B. (2010). Ratio method to the mean estimation using coefficient of skewness of auxiliary variable, ICICA, Part II, CCIS, 106, p. 103–110.
S. K. Yadav is a faculty in the Department of Statistics at Babasaheb Bhimrao Ambedkar University Lucknow, U.P., India. He earned his M.Sc. and Ph.D. degrees in Statistics from Lucknow University and qualified the National Eligibility Test. Dr. Yadav has published more than 40 research papers in national and international journals of repute indexed in Scopus/Web of science and two books from an international publisher. He is a referee for 20 reputed international journals. He has been awarded Young Scientist Award in 2016 for the contribution in the field survey sampling by Venus International Research Foundation, Chennai, India and best paper award in 2018 by MTMI, USA. He has presented papers in more than 20 national and international conferences and also delivered 18 invited talks in several conferences.
Shanya Baghel is pursuing her Ph.D in IIT(ISM) Dhanbad, Department of Mathematics and Computing as a JRF. She has completed her M.Phil. from Babasaheb Bhimrao Ambedkar University (BBAU), Lucknow, in 2020. She received her M.Sc. degree in Applied Statistics from BBAU, Lucknow in 2018. She attended University of Lucknow, India and received her B.Sc. in 2016. Her M.Phil work centers on Estimation of Population Mean using known Auxiliary Variable.
Sugandha Saxena is a postgraduate from Babasaheb Bhimrao Ambedkar University. She is an alumna of City Montessori School, Lucknow, India. She received a B.Sc. degree in Statistics and Mathematics from Mahila P.G. College, University of Lucknow. She completed her M.Sc. in Applied Statistics from Babasaheb Bhimrao Ambedkar University, Lucknow.
Avinash Kumar Singh received his M.Sc. degree in Applied Statistics form Babasaheb Bhimrao Ambedkar University, Lucknow, India. He attended the University of Lucknow, India and received his B.Sc. in Physics, Mathematics and Statistics.
Journal of Reliability and Statistical Studies, Vol. 13_1, 127–148.
doi: 10.13052/jrss0974-8024.1317
© 2020 River Publishers