A Generalized Class of Estimators for Finite Population Mean Using Two Auxiliary Variables in Sample Surveys
Housila Prasad Singh and Pragati Nigam*, †
School of Studies in Statistics, Vikram University, Ujjain-456010, M.P., India
*Faculty of Agricultural Sciences, Mandsaur University, Mandsaur, M.P., India
E-mail: pragatinigam1@gmail.com
†Corresponding Author
Received 08 December 2021; Accepted 05 January 2022; Publication 15 February 2022
In this paper we have suggested a generalized class of estimators for estimating the finite population mean of the study variable y using information on two auxiliary variables x and z. We have studied the properties of the proposed generalized class of estimators in simple random sampling without replacement scheme and in stratified random sampling up to the first order of approximation. It is shown that the suggested class of estimators is more efficient than the conventional unbiased estimator, ratio estimator, product estimator, traditional difference estimator, Srivastava (1967) estimator, Ray et al. (1979) estimator, Vos (1980) estimator, Upadhyaya et al. (1985) estimator, Rao (1991) estimator and Gupta and Shabbir (2008) estimator. Theoretical results are well supported through an empirical study.
Keywords: Auxiliary variable, study variable, bias, mean squared error, simple random sampling, stratified random sampling.
In sample surveys, the use of auxiliary variable(s) at the estimation stage played a prominent role in improving the precision of an estimate of the population mean. Various authors have paid their attention towards the estimation of population mean of the study variable y using information on a single auxiliary variable x and suggested a large number of estimators along with their properties in simple random sampling without (or with) replacement schemes for instance, see Murthy (1967), Singh (1986, 2003) and the references cited therein. In many survey situations of practical importance, adequate information on more than one of auxiliary variables is available. In such a situation Olkin (1958) was first to introduce multivariate ratio estimator for population mean of the study variable y using information on p(1) auxiliary variables. Later many authors including Raj (1965), Rao and Mudholkar (1967), Singh (1967, 1969), Shukla (1966), Srivastava (1965, 1967, 1971), Singh and Tailor (2005), Gupta and Shabbir (2008), Singh et al. (2009), Swain (2013) and Sharma and Singh (2014, 2015) etc. have developed estimators which utilize data from p(1) auxiliary variables. The properties of the estimators studied under simple random sampling with (or without) replacement i.e. SRSWR (or SRSWOR) scheme.
It is well established fact that the simple random sampling (SRS) procedure is employed when the population is homogeneous. However in practice, the populations encountered are not homogeneous (i.e. populations are heterogeneous). Thus in such a situation SRS procedure does not provide a sample which is good representative of the entire population. Hence we can say that when the population is heterogeneous, SRS procedure does not provide better estimate of the population mean . To cope up with this situation, we use stratified random sampling for selecting a good sample from the target population. Thus when the population is heterogeneous stratified random sampling is more appropriate and gives better estimate of the population mean. In a stratified random sampling design, we divide the population into groups known as strata and samples are selected from each group with pre-determined sample size. Several authors including Diana (1993), Kadilar and Cingi (2003), Shabbir and Gupta (2005), Singh and Vishwakarma (2008), Singh et al. (2008), Koyuncu and Kadilar (2009, 2010), Koyuncu (2013), Yadav et al. (2015a, 2015b) and Koyuncu (2016) etc. have suggested estimators for population mean of y using information on single auxiliary variable x in stratified random sampling. It is further noticed that various authors including Koyuncu and Kadilar (2009), Tailor et al. (2012), Singh and Kumar (2012), Olufadi (2013), Tailor and Chouhan (2014), Verma et al. (2015), Shabbir and Gupta (2015, 2016), Muneer et al. (2016), Malik and Singh (2017), Mishra et al. (2017) and Shabbir (2018) etc. have suggested several estimators for population mean of y using two auxiliary variables x and z in stratified random sampling.
In this paper we have made an effort to develop a generalized class of estimators for population mean of y using information on two auxiliary variables x and z. The properties of the suggested class of estimators are studied up to the first order of approximation in SRSWOR scheme as well as in stratified random sampling. Numerical examples are given in support of the present study.
Consider a finite population of N units. Let y and (x, z) be study variable and auxiliary variables respectively. Let and be the values of study variable y and auxiliary variables (x, z) on the ith unit of the population . Suppose a sample of size n is drawn by using SRSWOR scheme from the population for estimating the population mean of the study variable y. Let and , be the unbiased estimators of the population means and , respectively.
It is assumed that the population means of (x, z) are known. Further we denote
: the population coefficient of variation of y,
: the population coefficient of variation of x,
: the population coefficient of variation of z,
: the population correlation coefficient between y and x,
: the population correlation coefficient between y and z,
: the population correlation coefficient between x and z,
: the population covariance between y and x,
: the population covariance between y and z,
: the population covariance between x and z,
: the population mean square of y,
: the population mean square of x,
: the population mean square of z.
, , , and : is the sampling fraction.
Now we review some existing estimators.
The usual unbiased estimator for population mean is given by
(1) |
The variance/mean squared error (MSE) under SRSWOR is given by
(2) |
The usual ratio and product estimators for population mean are respectively defined by
(3) |
and
(4) |
To the first degree of approximation (fda), the MSEs of the estimators and are respectively given by
(5) | ||
(6) |
The generalized version of the estimators , and due to Srivastava (1967) is given by
(7) |
where is a suitably chosen constant.
To the fda, the is given by
(8) |
which is minimum when
(9) |
Substitution of (9) in (8) yields the minimum MSE of as
(10) |
The traditional difference estimator for is defined by
(11) |
where is a suitably chosen constant to be determined such that the MSE of is minimum.
The minimum MSE of is given by
Upadhyaya et al. (1985) suggested a class of estimators for population mean as
(13) |
where and are suitably chosen weights whose sum need not be ‘unity’ and is a design parameter.
The MSE of to the fda is given by
(14) |
where
The best values of for which the MSE of is minimized, are given by
(15) |
where
Thus the minimum MSE of is given by
(16) |
If we set in (13) we get an estimator for population mean of y as
(17) |
which includes the estimators due to Srivastava (1967), Ray et al. (1979) and Vos (1980).
Putting in (2) we get the MSE of to the fda as
which is minimized for
(19) |
Thus the minimum MSE of is given by
(20) |
Rao (1991) suggested difference-type estimator for population mean as
(21) |
where and are constants to be determined such that MSE of is minimum.
The bias and MSE of are respectively given by
(22) | ||
(23) |
where .
The at (2) is minimized for
Thus the minimum MSE of is given by
(24) |
Gupta and Shabbir (2008) proposed the following class of estimators for population mean as
(25) |
where () are suitably chosen constants such that the MSE of is minimum.
To the fda, the bias and MSE of are respectively given by
(26) | ||
(27) |
The at (27) is minimum when
where
Thus the minimum MSE of is given by
(28) |
The traditional difference estimator for population mean using two auxiliary variables x and z is defined by
(29) |
where and are constants to be determined such that is minimum.
It is obvious from (29) that the difference estimator is unbiased for population mean .
The variance/MSE of the estimator is given by
(30) |
where .
The at (2) is minimized for
Thus the minimum MSE of is given by
(31) |
where is the multiple correlation coefficient.
Motivated by Upadhyaya et al. (1985) we propose a generalized class of estimators based on two auxiliary variables x and z for population mean of y as
(32) |
where are weights whose sum need not be ‘unity’ and are design parameters. The constants may take positive or negative or positive-negative or negative-positive values to form product-type or ratio-type or product-cum-ratio-type or ratio-cum-product-type estimator.
To obtain the bias and MSE of the propounded estimator t, we write
such that
Expressing (32) in terms of e’s we have
We suppose that so that are expandable.
Expanding the right hand side of (3), multiplying out and ignoring terms of e’s having power greater than two, we have
or
(34) |
Taking expectation of both sides of (34) we get the bias of t to the fda as
(35) |
where
Squaring both sides of (34) and ignoring terms of e’s having power greater than two we have
(36) |
Taking expectation of both sides of (36) we get the MSE of t to the fda as
(37) |
where
are same as defined earlier.
Minimization of at (37) with respect to yields
(38) |
After simplification of (38), we get the optimum values of respectively as
(39) |
where
Substitution of (39) in (37) yields the minimum MSE of t as
(40) |
Now we state the following theorem.
Theorem-3.1 – The minimum MSE of t is greater than or equal to MSE(t) i.e.
(41) |
with equality holding if
Remark-3.1 – It is to be mentioned that the class of estimators ‘t’ at (32) will attained its minimum MSE at (40) only when the optimum values at (39) of the weights are known exactly, but in practice the exact values of the population parameters are rarely available. However in repeated surveys or studies based on multiphase sampling, where information regarding the same variates is gathered on several occasions, it is possible to guess quite precisely the values of certain population parameters such as . Further we mention that the good guess values of these population parameters can also be obtained from the past data or the experience gathered in due course of time or through a pilot sample survey. This problem has been discussed among others by Murthy (1967, pp. 96–99), Searls (1964), Srivastava (1966), Gleser and Healy (1976), Das and Tripathi (1978), Reddy (1978), Tripathi et al. (1983) and Srivenkataramana and Tracy (1984). Thus the values of such population parameters can be known exactly. We recall that the scalars are real. The values of the scalars are known (or can be known by the experimental practitioner) as the values of yield the form of the estimator. Thus the optimum values of the corresponding constants can be obtained quite accurately. Hence we conclude that in practice, an operational estimator can be derived from the suggested class of estimators ‘t’ with mean squared error smaller than the conventional estimators.
On the other hand if the values of the population parameters such as are not known (or cannot be made known) at all. In such situations, the practical utility of such estimators is limited. So in such circumstances one can estimate the value of these population parameters by their corresponding sample statistics. Hence the estimates say, of the corresponding optimum values can be obtained. Thus this also suggests that one can also obtain the operational (feasible) estimator from the proposed class of estimators ‘t’ having mean squared error fewer than the usual estimators.
It is observed from (2), (2) and (20) that the common minimum MSEs of the estimators and is same, i.e.
(42) |
Now we compare the efficiency of traditional difference estimator with usual unbiased estimator , ratio estimator and product estimator .
From (2), (5), (6) and (2), we have,
(43) | |
(44) | |
(45) |
It follows from (4), (43), (44) and (45) that the estimators and are more efficient than usual unbiased estimator , ratio estimator and product estimator .
(46) |
It follows from (4), (43), (44), (45) and (46) that the estimator due to Rao (1991) is more efficient than and .
The minimum MSE of the difference estimator given by (2) can be expressed as
(47) |
(48) |
(49) |
which shows that the traditional estimator is better than .
(50) |
It shows that the proposed class of estimators t is more efficient than the Upadhyaya et al. (1985) estimator .
Hence from (4), (43), (44), (45), (46), (4) and (49) it is observed that the suggested generalized class of estimators t is better than the estimators , , , , , and .
From (24) and (40) we have that if
(51) |
From (28) and (40) it is observed that if
(52) |
Further from (31) and (40) we note that if
Thus from (51), (52) and (4) it is observed that the proposed generalized class of estimators t is more efficient than and as long as the conditions (51), (52) and (4) are satisfied respectively.
For numerical comparisons of different estimators, we use the following data sets.
Data I: [Source: Singh and Chaudhary (1986), page 177] Data II: [Source: Abu-Dayyeh et al. (2003)] Data III: [Source: Steel and Torrie (1960)] Data IV: [Source: Cochran (1977)] Data V: [Source: Ahmed (1997)] Data VI: [Source: PCR (1998)]
Data | I | II | III | IV | V | VI |
N | 34 | 332 | 30 | 34 | 376 | 424 |
N | 20 | 80 | 6 | 15 | 159 | 169 |
856.41 | 1093.1 | 0.6860 | 4.92 | 316.65 | 646.215 | |
208.88 | 181.57 | 4.6537 | 2.59 | 141.13 | 4533.981 | |
199.44 | 143.37 | 0.8077 | 2.91 | 1075.31 | 325.0325 | |
0.86 | 0.7626 | 0.4803 | 1.01232 | 0.7721 | 1.509 | |
0.72 | 0.7684 | 0.2295 | 1.23187 | 0.845 | 1.342 | |
0.75 | 0.7616 | 0.7493 | 1.05351 | 0.7746 | 1.335 | |
0.45 | 0.973 | 0.7194 | 0.7326 | 0.9106 | 0.623 | |
0.45 | 0.862 | 0.04996 | 0.643 | 0.9094 | 0.907 | |
0.98 | 0.842 | 0.4074 | 0.6837 | 0.8614 | 0.682 |
Table 1 gives the PRE’s of and estimators with respect to for six data sets respectively.
Table 2 gives the PRE of with respect to for , for six data sets.
Table 3(a) depicts the PREs of proposed class of estimators with respect to at different values of () for data sets I, II, III.
Table 1 PRE’s of different estimators of population mean with respect to
Estimator | Data I | Data II | Data III | Data IV | Data V | Data VI |
105.55 | 1835.92 | 94.62 | 143.30 | 488.77 | 146.46 | |
40.74 | 25.15 | 71.44 | 23.45 | 23.86 | 34.49 | |
125.39 | 1877.19 | 103.33 | 215.84 | 585.45 | 163.43 | |
126.92 | 1877.75 | 106.40 | 219.66 | 585.67 | 164.24 | |
126.93 | 1877.75 | 106.42 | 219.89 | 585.67 | 164.25 | |
125.71 | 2127.83 | 174.04 | 235.09 | 907.16 | 563.97 |
Table 2 PRE’s of with respect to
Data I | Data II | Data III | Data IV | Data V | Data VI | ||||||
PRE | PRE | PRE | PRE | PRE | PRE | ||||||
1 | 127.66 | 1 | 1878.66 | 1 | 106.76 | 1 | 227.99 | 1 | 586.30 | 1 | 164.74 |
1 | 126.24 | 1 | 2049.28 | 1 | 106.20 | 1 | 215.95 | 1 | 588.70 | 1 | 163.54 |
Table 3(a): PRE’s of proposed class of estimators t for population mean with respect to (for data sets I, II, III)
Data I | Data II | Data III | ||||||
PRE | PRE | PRE | ||||||
16.56 | 16.56 | 17654.39 | 6.5 | 6.9 | 772384 | 4 | 4 | 1286.97 |
16.55 | 16.55 | 11434.21 | 6.5 | 6.8 | 234504.2 | 3 | 3 | 240.98 |
16.54 | 16.54 | 8469.43 | 6.5 | 6.7 | 139357.5 | 2 | 2 | 194.41 |
16.53 | 16.53 | 6734.46 | 6.5 | 6.6 | 99699.33 | 1 | 1 | 180.01 |
16.52 | 16.52 | 5595.56 | 6.5 | 6.5 | 77951.88 | 1 | 1 | 174.25 |
16.51 | 16.51 | 4790.57 | 6.4 | 6.4 | 34176.73 | 2 | 2 | 177.14 |
16.5 | 16.5 | 4191.43 | 6.3 | 6.3 | 22046.63 | 3 | 3 | 183.83 |
16 | 16 | 657.28 | 6.2 | 6.2 | 16359.33 | 4 | 4 | 196.20 |
10 | 10 | 153.84 | 6.1 | 6.1 | 13060.19 | 5 | 5 | 219.66 |
5 | 5 | 133.88 | 6 | 6 | 10906.9 | 6 | 6 | 273.01 |
4 | 4 | 131.95 | 5 | 5 | 4435.19 | 7 | 7 | 484.92 |
3 | 3 | 130.37 | 4 | 4 | 3039.74 | 4 | 6 | 9201.75 |
2 | 2 | 129.08 | 3 | 3 | 2477.50 | 3 | 5 | 341.57 |
1 | 1 | 128.03 | 2 | 2 | 2220.59 | 2 | 4 | 230.89 |
1 | 1 | 126.56 | 1 | 1 | 2130.16 | 1 | 3 | 195.37 |
2 | 2 | 126.10 | 1 | 1 | 2358.74 | 1 | 2 | 178.42 |
3 | 3 | 125.82 | 2 | 2 | 2796.94 | 2 | 1 | 174.06 |
4 | 4 | 125.72 | 3 | 3 | 3848.96 | 4 | 5 | 386.31 |
5 | 5 | 125.81 | 4 | 4 | 7795.28 | 4 | 4 | 253.76 |
8 | 8 | 127.58 | 4.5 | 4.5 | 20478.73 | 4 | 3 | 209.95 |
10 | 10 | 130.80 | 4.6 | 4.6 | 31462.35 | 4 | 2 | 189.32 |
12 | 12 | 137.87 | 4.7 | 4.7 | 69729.23 | 4 | 1 | 178.73 |
15 | 15 | 187.23 | 1 | 2 | 2404.66 | 3 | 6 | 1354.17 |
16 | 16 | 328.47 | 2 | 3 | 2902.67 | 3 | 5 | 341.57 |
16.1 | 16.1 | 378.07 | 4 | 5 | 9706.82 | 3 | 4 | 241.09 |
16.2 | 16.2 | 456.93 | 1 | 2 | 2135.10 | 3 | 3 | 204.27 |
16.3 | 16.3 | 601.84 | 3 | 4 | 2539.23 | 3 | 2 | 186.40 |
16.4 | 16.4 | 955.32 | 4 | 5 | 3182.99 | 2 | 6 | 788.06 |
16.5 | 16.5 | 3122.77 | 5 | 6 | 4877.70 | 2 | 5 | 310.28 |
* | * | * | 6 | 7 | 15453.26 | 2 | 4 | 230.89 |
Table 3(b) indicates the PREs of proposed class of estimators with respect to at different values of () for data sets IV, V, VI.
Table 3(b): PRE’s of proposed class of estimators t with respect to (for data sets IV, V, VI)
Data IV | Data V | Data VI | ||||||
PRE | PRE | PRE | ||||||
5.5 | 5.6 | 18686.57 | 13.8 | 13.8 | 146480.3 | 11.3 | 11.3 | 56544.35 |
5.5 | 5.5 | 9127.82 | 13.5 | 13.5 | 16447.9 | 11 | 11 | 7475.86 |
5 | 5 | 1011.91 | 13 | 13 | 6808.17 | 10 | 10 | 2084.82 |
4 | 4 | 443.93 | 12 | 12 | 3275.97 | 8 | 8 | 986.33 |
3 | 3 | 323.39 | 10 | 10 | 1749.71 | 5 | 5 | 659.03 |
2 | 2 | 273.50 | 8 | 8 | 1285.83 | 4 | 4 | 616.64 |
1 | 1 | 248.82 | 5 | 5 | 1012.38 | 3 | 3 | 588.91 |
1 | 1 | 235.59 | 4 | 4 | 967.68 | 2 | 2 | 572.19 |
2 | 2 | 244.97 | 3 | 3 | 936.87 | 1 | 1 | 564.63 |
3 | 3 | 274.84 | 2 | 2 | 917.52 | 1 | 1 | 575.12 |
4 | 4 | 372.70 | 1 | 1 | 908.24 | 2 | 2 | 594.85 |
5 | 5 | 2084.42 | 1 | 1 | 918.17 | 3 | 3 | 627.61 |
5.1 | 5.1 | 8806.16 | 2 | 2 | 938.25 | 4 | 4 | 679.08 |
5.5 | 5.4 | 6136.23 | 3 | 3 | 970.32 | 5 | 5 | 760.58 |
5.5 | 5.2 | 3812.23 | 4 | 4 | 1017.20 | 6 | 6 | 897.22 |
5.5 | 5.1 | 3242.17 | 5 | 5 | 1083.64 | 7 | 7 | 1156.67 |
5.5 | 5 | 2838.5 | 6 | 6 | 1177.62 | 8 | 8 | 1802.22 |
5.5 | 4.5 | 1851.49 | 7 | 7 | 1313.39 | 9 | 9 | 5816.34 |
5.5 | 4 | 1470.75 | 8 | 8 | 1518.41 | 9.3 | 9.3 | 24933.39 |
5.5 | 3 | 1210.17 | 10 | 10 | 2479.08 | 9 | 9.4 | 100422 |
5.5 | 2 | 1286.14 | 12 | 12 | 13846.51 | * | * | * |
5.5 | 1 | 2344.68 | 12.1 | 12.1 | 18724.25 | * | * | * |
5.4 | 5.6 | 5179.15 | 12.2 | 12.2 | 29101.79 | * | * | * |
5.3 | 5.6 | 3067.52 | 12.3 | 12.3 | 66293.65 | * | * | * |
5.2 | 5.6 | 2208.87 | * | * | * | * | * | * |
5.1 | 5.6 | 1743.17 | * | * | * | * | * | * |
We measured Percent Relative Efficiencies (PREs) of various estimators along with our proposed generalized class of estimators t with respect to. It is observed that from the entries of the Tables 1, 2, 3(a) and 3(b) that the suggested generalized class of estimators t gives the largest PRE (17654.39%, 772384.00%, 9201.75%, 18686.57%, 146480.30%, and 100422.00%) for data set I to IV respectively. Using the proposed generalized class of estimators t over other existing estimators, there is considerable gain in efficiency. Thus there is ample room to pick up the scalars in order to obtain estimators better than the existing estimators. Finally our recommendation is in favor of the proposed generalized class of estimators t for its use in practice.
We consider a finite population of N units divided into L strata with the th stratum having units such that . Let and respectively be the observations of study variable y and auxiliary variables (x, z) for the th population unit in the th stratum. A simple random sample of size is drawn without replacement from the th stratum such that .
Let and be the sample means corresponding to the population means and of the variables y, x and z respectively, where , and be the sample means corresponding to the population means , and in the th stratum respectively with known stratum weight .
Further we denote
In the following section we have presented review of some existing estimators with their properties.
The conventional stratified sample mean estimator for population mean of y is defined by
(54) |
whose variance/MSE is given by
(55) |
The combined ratio estimator for is given by
(56) |
To the fda, the MSE of is given by
(57) |
The combined product estimator for is defined by
(58) |
The MSE of to the fda is given by
(59) |
Following the approach adopted by Srivastava (1967), we define a class of estimators for population mean as
(60) |
We mention that for reduces to while for it reduces to the product estimator . If we set , then reduces to usual unbiased estimator .
The MSE of to the fda is given by
(61) |
which is minimum when
(62) |
Thus the corresponding minimum MSE of is given by
(63) |
which is same as the minimum MSE of the difference estimator
(64) |
i.e.
(65) |
The stratified version of Upadhyaya et al. (1985) estimator is given by
(66) |
The MSE of to the fda is given by
(67) |
where
The is minimized for
(68) |
Thus the corresponding minimum MSE of is given by
(69) |
where
For in (66), the class of estimators reduces to the estimator
(70) |
To the fda, the MSE of is given by
(71) |
which is minimized for
(72) |
Thus the corresponding minimum MSE of is given by
(73) |
Stratified version of Rao (1991) estimator for population mean is given by
(74) |
where are suitably chosen constants such that is minimum.
The optimum values of along with minimum MSE of are respectively given by
(75) |
and
(76) |
Gupta and Shabbir (2008) suggested the following estimator for as,
(77) |
The MSE of to the fda is given by
(78) |
where
The MSE of is minimum when
(79) |
Thus the minimum MSE of is given by
(80) |
The usual difference estimator using two auxiliary variables in stratified random sampling is defined by
(81) |
where and are constants whose values are to be obtained.
The MSE of is given by
(82) |
which is minimized for
(83) |
Thus the corresponding minimum MSE of is given by
(84) |
Motivated by Upadhyaya et al. (1985), we propose a generalized class of estimators based on two auxiliary variables for population mean in stratified random sampling as
(85) |
where are appropriately elected weights whose sum need not be unity and are design parameters. The constants may take positive or negative or positive-negative or negative-positive values to form product-type or ratio-type or product-cum-ratio-type or ratio-cum-product-type estimator.
To obtain the bias and MSE of the proposed estimator , we write, and such that ,
Expressing (85) in terms of e’s we have
(86) |
We assume that so that are expandable. Expanding the right hand side of (85), multiplying out and ignoring terms of e’s having power greater than two, we have
or
Taking expectation of both sides of (8), we get the bias of to the fda as
(88) |
Squaring both sides of (8), ignoring terms of e’s having power greater than two and then taking expectation of both sides we get the MSE of to the fda as
(89) |
where
and are same as defined earlier.
Minimization of at (89) with respect to yields
(90) |
Solving (90), we get the optimum values of respectively as
(91) |
where
Thus the corresponding minimum MSE of is given by
Thus we state the following theorem.
Theorem-8.1 – The MSE of is always greater than equal to the minimum MSE of i.e.
with equality holding if
*A remark similar to Remark 3.1 follows here.
From (55), (57), (59) and (65), we have
(93) | |
(94) | |
(95) |
Expressions (9), (9) and (9) clearly indicates that the usual difference estimator and Srivastava (1967) estimator are better than the estimators and .
(96) |
From (9)–(9), we have the following inequalities:
(97) | ||
(98) | ||
(99) |
It follows from (9), (9) and (9) that the Rao (1991) estimator is more precise than , , and Srivastava’s (1967) estimator .
(100) |
It follows that the Upadhyaya et al. (1985) estimator is more efficient than and .
(101) |
which shows that the proposed generalized class of estimators is more efficient than Upadhyaya et al. (1985) estimator . Hence the estimator is more precise than the estimators , , and .
(102) |
which shows that the difference estimator is more efficient than the estimator .
From (76), (7), (84) and (8),we have
• if
(103) |
• if
(104) |
• if
(105) |
It is observed from (9), (9) and (9) that the proposed generalized class of estimators is more efficient than the estimators , and as long as the conditions (9), (9) and (9) are satisfied respectively.
To examine the performance of the proposed generalized class of estimators over existing estimators, we use the data sets given below
Data I: Source: [Murthy (1967), P. 228] N = 80, n = 22
Data II: [Source: Koyuncu and Kadilar (2009)] N = 923, 180
Table 4 presents the PRE’s of and estimators with respect to for two data sets respectively.
Table 5 shows the PRE of with respect to for , for two data sets.
Table 4 PRE’s of different estimators of population mean with respect to
Estimator | Data I | Data II |
14.42 | 1025.10 | |
5.89 | 24.22 | |
235.83 | 1141.85 | |
235.91 | 1143.02 | |
183.44 | 1109.11 | |
273.99 | 2621.61 |
Table 5 PRE’s of the estimator with respect to
Data I | Data II | ||
PRE | PRE | ||
1 | 238.07 | 1 | 1146.96 |
1 | 235.84 | 1 | 1260.08 |
Table 6 depicts the PRE of proposed estimator wrt at different values of and , for two data sets.
Table 6 PRE’s of the proposed estimator with respect to for different values of
Data I | Data II | ||||
PRE | PRE | ||||
1 | 1 | 275.29 | 1 | 1 | 2626.83 |
2 | 2 | 277.33 | 2 | 2 | 2720.91 |
3 | 3 | 280.47 | 3 | 3 | 3216.10 |
4 | 4 | 284.92 | 4 | 4 | 4826.13 |
5 | 5 | 291.07 | 5 | 5 | 20279.5 |
8 | 8 | 327.57 | 5.2 | 5.2 | 96957.81 |
10 | 10 | 391.89 | 1 | 1 | 3590.02 |
12 | 12 | 642.36 | 2 | 2 | 6380.74 |
13 | 13 | 1645.27 | 2.1 | 2.1 | 7060.90 |
13.1 | 13.1 | 2078.85 | 2.2 | 2.2 | 7938.96 |
13.2 | 13.2 | 2887.43 | 2.5 | 2.5 | 13252.63 |
13.3 | 13.3 | 4928.01 | 2.8 | 2.8 | 51458.14 |
13.4 | 13.4 | 20017.07 | 1 | 1 | 3067.82 |
1 | 1 | 274.03 | 2 | 1 | 2923.05 |
2 | 2 | 274.74 | 1 | 2 | 1002.942 |
3 | 3 | 276.39 | 3 | 1 | 2734.047 |
4 | 4 | 279.07 | 4 | 1 | 2871.735 |
5 | 5 | 282.99 | * | * | * |
8 | 8 | 306.25 | * | * | * |
10 | 10 | 342.15 | * | * | * |
12 | 12 | 434.69 | * | * | * |
14 | 14 | 1047.46 | * | * | * |
14.1 | 14.1 | 1180.86 | * | * | * |
14.3 | 14.3 | 1634.81 | * | * | * |
14.5 | 14.5 | 2885.63 | * | * | * |
14.6 | 14.6 | 4977.54 | * | * | * |
14.7 | 14.7 | 22036.36 | * | * | * |
1 | 1 | 274.16 | * | * | * |
1 | 2 | 275.62 | * | * | * |
3 | 1 | 277.21 | * | * | * |
4 | 1 | 279.03 | * | * | * |
5 | 5 | 259.76 | * | * | * |
It is observed from Tables 4, 5 and 6 that for various values of the proposed generalized class of estimators is more efficient than the estimators , , , and , with considerable gain in efficiency. The proposed generalized class of estimators yields the largest percent relative efficiency 22036.60% for data set I while it is 96957.81% for data set II. It is further observed from Table 6 that there is enough scope of selecting the scalars in acquiring efficient estimators (from the suggested generalized class of estimators ) than the existing estimators. Thus we conclude that the proposed generalized class of estimators can be used in practice just by selecting the appropriate values of .
This article considers the problem of estimating the population mean of the study variable y using information on two auxiliary variables x and z. We have proposed a generalized class of estimators for the population mean using information on two supplementary variables x and z. Expressions of bias and mean square error up to the fda have been obtained in SRSWOR as well as in stratified random sampling. It is interesting to mention that the envisaged class of estimators includes several existing estimators. Thus the properties of the proposed generalized class of estimators unify results at one place. We have proved theoretically that the proposed generalized class of estimators is more efficient than the several existing estimators in both sampling designs SRSWOR and stratified random sampling.
Empirical studies are carried out to throw light on the merits of the envisaged generalized class of estimators over some existing competitors. Larger gain in efficiency is observed by using the proposed generalized class of estimators over some existing estimators in both the sampling designs: SRSWOR and stratified random sampling. Results incorporated in this study are very sound and quite illuminating. Thus it is recommended that the proposed study is useful in practice.
Authors are thankful to the learned referees for their valuable suggestions regarding improvement of the paper.
Abu-Dayyeh, W. A., Ahmed, M. S., Ahmed, R. A. and Mutlak, H. A. (2003). Some estimators of a finite population mean using auxiliary information, Applied Mathematics and Computation, 139, 287–298.
Ahmed, M. S. (1997). The general class of chain estimators for ratio of two means using double sampling, Communications in Statistics-Theory and Methods, 26(9), 2249–2254.
Cochran, W. G. (1977). Sampling Techniques, Jhon Wiley and Sons, New York.
Das, A.K. and Tripathi, T. P. (1978). Use of auxiliary information in estimating the finite population variance, Sankhya, C, 40, 139–148.
Diana, G. (1993). A class of estimators of the population mean in stratified random sampling, Statistica, 53(1), 59–66.
Gleser, L. J. and Healy, J. D. (1976). Estimating the mean of a normal distribution with known coefficient of variation, Journal of American Statistical Association, 71, 977–981.
Gupta, S. and Shabbir, J. (2008). On estimating in estimating the population mean in simple random sampling, Journal of Applied Statistics, 35(5), 559–566.
Kadilar, C. and Cingi, H. (2003). Ratio estimator in stratified sampling, Biometrical Journal, 45, 218–225.
Koyuncu, N. (2013). Families of estimators for population mean using information on auxiliary attribute in stratified random sampling, Gazi University Journal of Sciences, 26(2), 181–193.
Koyuncu, N. (2016). Improved exponential type estimators for finite population mean in stratified random sampling, Pakistan Journal of Statistics and Operation Research, 12(3), 429–441.
Koyuncu, N. and Kadilar C. (2009). Family of estimators of population mean using two auxiliary variables in stratified sampling, Communications in Statistics-Theory and Methods, 38, 2398–2417.
Koyuncu, N. and Kadilar, C. (2010). On the family of estimators of population mean in stratified random sampling, Communications in Statistics-Theory and Methods, 26(2), 427–443.
Malik, S. and Singh, R. (2017). A new estimator for population mean using two auxiliary variables in stratified random sampling, Journal of Information and Optimization Sciences, 38(8), 1243–1252.
Mishra, M., Singh, B. P. and Singh, R. (2017). Estimation of population mean using two auxiliary variables in the stratified random sampling, Journal of Reliability and Statistical Studies, 10(1), 59–68.
Munner, S., Shabbir, J. and Khalil, A. (2016). Estimation of finite population mean in simple random sampling and stratified random sampling using two auxiliary variables, Communications in Statistics-Theory and Methods, DOI:10.1080/03610926.2015.1035394.
Murthy, M. N. (1967). Sampling Theory and Methods, Statistical Publishing Society, India.
Olkin, I. (1958). Multivariate Ratio Estimation for Finite Population Biometrika, 45, 154–165.
Olufadi, Y. (2013). Dual to ratio-cum-product estimator in simple and stratified random sampling, Pakistan Journal of Statistics and Operation Research, 9(3), 305–319.
PCR (1998). Population Census Report of Multan District, Punjab Bureau of Statistics, Lahore, Pakistan.
Raj, D. (1965). On a method of using multiauxiliary information sample surveys, Journal of American Statistical Association, 60, 270–277.
Rao, P. S. R. S. and Mudholkar, G. D. (1967). Generalized multivariate estimator for the mean of finite populations, Journal of American Statistical Association, 62, 1009–1012.
Rao, T. J. (1991). On certain method of improving rations and regression estimators, Communications in Statistics, Theory and Methods, 20(10), 3325–3340.
Ray, S. K., Sahai, A. and Sahai, A. (1979). A note on ratio and product-type estimators, Annals of the Institute of Statistical Mathematics, 31, 141–144.
Reddy, V. N. (1978). A study on the use of prior knowledge on certain population parameters in estimation, Sankhya, C, 40, 29–34.
Searls, D. T. (1964). The utilization of a known coefficient of variation in the estimation procedure, Journal of American Statistical Association, 59, 1225–226.
Shabbir, J. (2018). Efficient utilisation of two auxiliary variables in stratified double sampling, Communications in Statistics-Theory and Methods, 47(1), 92–101.
Shabbir, J. and Gupta, S. (2005). Improved ratio estimators in stratified sampling, American Journal of Mathematics and Management Sciences, 25(3–4), 293–311.
Shabbir, J. and Gupta, S. (2015). Estimation of finite population mean using two auxiliary variables in stratified two-phase sampling, Communications in Statistics-Simulation and Computation, DOI:10.1080/03610918.2014.995817.
Shabbir, J. and Gupta, S. (2016). Estimation of population coefficient of variation in simple and stratified random sampling under two-phase sampling scheme when using two auxiliary variables, Communications in Statistics-Theory and Methods, DOI:10.1080/03610926.2016.1175627.
Sharma, P. and Singh, R. (2014). Improved ratio-type estimators using two auxiliary variables under second order approximation, Mathematical Journal of Interdisciplinary Sciences, 2(2), 193–204.
Sharma, P. and Singh, R. (2015). A class of exponential ratio estimators of finite population mean using two auxiliary variables, Pakistan Journal of Statistics and Operation Research, 11(2), 221–229.
Shukla, G. K. (1966). An alternative multivariate ratio estimator for finite population, Calcutta Statistical Association Bulletin, 15, 127–134.
Singh, D. and Chaudhary, F. S. (1986). Theory and Analysis of Sample Survey Designs, Jhon Wiley and Sons, New York.
Singh, H. P. (1986). A Generalized class of estimators of ratio, product and mean using supplementary information on an auxiliary character in PPSWR sampling scheme, Gujarat Statistical Review, 13(2), 1–30.
Singh, H. P. and Tailor, R. (2005). Estimation of finite population mean using known correlation coefficient between auxiliary characters, Statistica, anno LXV, 4, 407–418.
Singh, H. P. and Vishwakarma, G. K. (2008). A family of estimators of population mean using auxiliary formation in stratified sampling, Communications in Statistics-Theory and Methods, 37(7), 1038–1050.
Singh, H. P., Upadhyaya, L. N. and Tailor, R. (2009). Ratio-cum-product type exponential estimator, Statistica, anno LXIX, 4, 300–310.
Singh, M. P. (1967). Ratio-cum-product method of estimation, Metrika, 12(1), 34–62.
Singh, M. P. (1969). Comparison of some ratio-cum-product estimators, Sankhya, series B, 31, 375–378.
Singh, R. and Kumar, M. (2012). Improved estimators of population mean using two auxiliary variables in stratified random sampling, Pakistan Journal of Statistics and Operation Research, 8, 65–72.
Singh, R., Chouhan, P. and Sawan, N. (2008). On linear combination of ratio-product-type exponential estimator for estimating finite population mean, Statistics in Transition, 9(1), 105–115.
Singh, S. (2003). Advanced sampling theory with applications- How michel “selected” Amy, Kluwer Academic Publishers, I edition volume I and II.
Srivastava, S. K. (1965). An estimation of the mean of a finite population using several auxiliary variables, Journal of the Indian Statistical Association 3, 189–194.
Srivastava, S. K. (1966). A note on Olkin’s multivariate ratio estimator, Journal of Indian Statistical Association, 4, 202–208.
Srivastava, S. K. (1967). An estimator using auxiliary information in sample surveys, Culcutta Statistical Association, 16, 121–132.
Srivastava, S. K. (1971). A generalized estimator for the mean of a finite population using multiauxiliary information, Journal of American Statistical Association, 66, 404–407.
Srivenkataramana, T. and Tracy, D. S. (1984). Positive and negative valued auxiliary variates in surveys, Metron, XLII, (3–4), 3–14.
Steel, R. G. D. and Torrie, J, H. (1960). Principles and Procedures of Statistics, McGraw Hill Book Co.
Swain, A. K. P. C. (2013). On some modified ratio and product-type estimators. Revisited, Investigacion operational, 34(1), 35–57.
Tailor, R. and Chouhan, S. (2014). Ratio-cum-product-type exponential estimator of finite population mean in stratified random sampling, Communications in Statistics-Theory and Methods, 43(2), 343–354.
Tailor, R., Chouhan, S., Tailor, R. and Garg, N. (2012). A ratio-cum-product estimator of population mean in stratified random sampling using two auxiliary variables, Statistica, anno LXXII, n.3, 287–297.
Tripathi, T. P., Maiti, P. and Sharma, S. D. (1983). Use of prior information on some parameter in estimating population mean, Sankhaya, A, 45(3), 372–376.
Upadhyaya, L. N., Singh, H. P., and Vos, J. W. E. (1985). On the estimation of population means and ratios using supplementary information, Statistica Neerlandica, 39, 309–318.
Verma, H. K., Sharma, P. and Singh, R. (2015). Some families of estimators using two auxiliary variables in a stratified random sampling, Revista Investigacion Operacional, 36(2), 140–150.
Vos, J. W. E. (1980). Mixing of direct, ratio and product method estimators, Statistica Neerlandica, 34, 209–218.
Yadav, S. K., Mishra, S., Kadilar, C. and Shukla, A. K. (2015a). Improved dual to ratio cum dual to product estimator in the stratified random sampling, American Journal of Operational Research, 5(3), 57–63.
Yadav, S. K., Mishra, S., Mishra, S. S. and Shukla, A. K. (2015b). Searching efficient estimator of population mean in stratified random sampling, American Journal of Operational Research, 5(4), 75–81.
Housila Prasad Singh, born on 08/09/1957 in a village of Varanasi district of Uttar Pradesh. He did his M.Sc. (Statistics) in 1979 from Banaras Hindu University, Varanasi, U.P.. He obtained his M.Phil. (Applied Mathematics) in 1981 and Ph.D. (Applied Statistics) in 1985 from Indian School of Mines, Dhanbad, Bihar (Now Indian Institute of Technology, Dhanbad, Jharkhand). Currently he is a professor of Statistics, Vikram University, Ujjain, M.P.. He has 37 years of teaching experience and 41 years of research experience. He has been Head, School of Studies in Statistics, Dean, Faculty of Science and Executive Council Member of Vikram University, Ujjain, M.P. He has guided many students for their M.Sc. (02), M.Phil (18) and Ph.D. (23) degrees. He has been visiting scientist at University of Windsor, Windsor, Canada. He has published more than 510 research papers in journals of national and international repute. One of his research papers submitted for the award of “Dr. Radha Krishnan Samman 1992” has been appreciated by valuers. He has been awarded ‘Best Scientist Research Publication’ award (2009–10) and Out Standing Research Faculty by research faculty awards by Careers 360 as “One of the 10 Knowledge Producers in India” for academic year 2017–18. He has written two book reviews out of them one is published in Computational Statistics and Data Analysis (2000) and the other one in the Journal of Royal Statistical Society, Sr. A (2006). He is also the author of the Book “Randomness and Optimal Estimation in Data Sampling”, American Research Press, Rehoboth, USA. His area of research interest is Sampling Theory and Statistical Inference. Google Scholar (Citations by this date) is Citations-5653, h-index-35 and i10-index-134.
Pragati Nigam, born on 27/11/1992 in Ujjain, M.P. She did her M.Sc. (Statistics) in 2015 (achieved II rank in the University) and Ph.D. (Statistics) in 2021 under the guidance of Prof. H. P. Singh, from School of Studies in Statistics, Vikram University, Ujjain. Presently she is working as Assistant Professor of Statistics at Faculty of Agricultural Sciences, Mandsaur University, Mandsaur. Dr. Pragati published 5 research papers and 1 book chapter (MKSES Publications) in national and international journals of repute.
Journal of Reliability and Statistical Studies, Vol. 15, Issue 1 (2022), 61–104.
doi: 10.13052/jrss0974-8024.1514
© 2022 River Publishers
2 Some Existing Estimators of SRS
3 Suggested Generalized Class of Estimators in Simple Random Sampling
6 Estimation of Population Mean Under Stratified Random Sampling
7 Reviewing Some Existing Estimators in Stratified Random Sampling
8 Suggested Class of Estimators for Population Mean in Stratified Random Sampling