A Generalized Class of Estimators for Finite Population Mean Using Two Auxiliary Variables in Sample Surveys

Housila Prasad Singh and Pragati Nigam^{*, †}

School of Studies in Statistics, Vikram University, Ujjain-456010, M.P., India
*Faculty of Agricultural Sciences, Mandsaur University, Mandsaur, M.P., India
E-mail: pragatinigam1@gmail.com
^†Corresponding Author

Received 08 December 2021; Accepted 05 January 2022; Publication 15 February 2022

Abstract

In this paper we have suggested a generalized class of estimators for estimating the finite population mean $\bar{Y}$ of the study variable y using information on two auxiliary variables x and z. We have studied the properties of the proposed generalized class of estimators in simple random sampling without replacement scheme and in stratified random sampling up to the first order of approximation. It is shown that the suggested class of estimators is more efficient than the conventional unbiased estimator, ratio estimator, product estimator, traditional difference estimator, Srivastava (1967) estimator, Ray et al. (1979) estimator, Vos (1980) estimator, Upadhyaya et al. (1985) estimator, Rao (1991) estimator and Gupta and Shabbir (2008) estimator. Theoretical results are well supported through an empirical study.

Keywords: Auxiliary variable, study variable, bias, mean squared error, simple random sampling, stratified random sampling.

1 Introduction

In sample surveys, the use of auxiliary variable(s) at the estimation stage played a prominent role in improving the precision of an estimate of the population mean. Various authors have paid their attention towards the estimation of population mean $\bar{Y}$ of the study variable y using information on a single auxiliary variable x and suggested a large number of estimators along with their properties in simple random sampling without (or with) replacement schemes for instance, see Murthy (1967), Singh (1986, 2003) and the references cited therein. In many survey situations of practical importance, adequate information on more than one of auxiliary variables is available. In such a situation Olkin (1958) was first to introduce multivariate ratio estimator for population mean $\bar{Y}$ of the study variable y using information on p( $>$ 1) auxiliary variables. Later many authors including Raj (1965), Rao and Mudholkar (1967), Singh (1967, 1969), Shukla (1966), Srivastava (1965, 1967, 1971), Singh and Tailor (2005), Gupta and Shabbir (2008), Singh et al. (2009), Swain (2013) and Sharma and Singh (2014, 2015) etc. have developed estimators which utilize data from p( $>$ 1) auxiliary variables. The properties of the estimators studied under simple random sampling with (or without) replacement i.e. SRSWR (or SRSWOR) scheme.

It is well established fact that the simple random sampling (SRS) procedure is employed when the population is homogeneous. However in practice, the populations encountered are not homogeneous (i.e. populations are heterogeneous). Thus in such a situation SRS procedure does not provide a sample which is good representative of the entire population. Hence we can say that when the population is heterogeneous, SRS procedure does not provide better estimate of the population mean $\bar{Y}$ . To cope up with this situation, we use stratified random sampling for selecting a good sample from the target population. Thus when the population is heterogeneous stratified random sampling is more appropriate and gives better estimate of the population mean. In a stratified random sampling design, we divide the population into groups known as strata and samples are selected from each group with pre-determined sample size. Several authors including Diana (1993), Kadilar and Cingi (2003), Shabbir and Gupta (2005), Singh and Vishwakarma (2008), Singh et al. (2008), Koyuncu and Kadilar (2009, 2010), Koyuncu (2013), Yadav et al. (2015a, 2015b) and Koyuncu (2016) etc. have suggested estimators for population mean $\bar{Y}$ of y using information on single auxiliary variable x in stratified random sampling. It is further noticed that various authors including Koyuncu and Kadilar (2009), Tailor et al. (2012), Singh and Kumar (2012), Olufadi (2013), Tailor and Chouhan (2014), Verma et al. (2015), Shabbir and Gupta (2015, 2016), Muneer et al. (2016), Malik and Singh (2017), Mishra et al. (2017) and Shabbir (2018) etc. have suggested several estimators for population mean $\bar{Y}$ of y using two auxiliary variables x and z in stratified random sampling.

In this paper we have made an effort to develop a generalized class of estimators for population mean $\bar{Y}$ of y using information on two auxiliary variables x and z. The properties of the suggested class of estimators are studied up to the first order of approximation in SRSWOR scheme as well as in stratified random sampling. Numerical examples are given in support of the present study.

2 Some Existing Estimators of SRS

Consider a finite population $Ω = {Ω_{1}, Ω_{2}, \dots Ω_{N}}$ of N units. Let y and (x, z) be study variable and auxiliary variables respectively. Let $y_{i}$ and $(x_{i}, z_{i})$ be the values of study variable y and auxiliary variables (x, z) on the ith unit $Ω_{i}$ of the population $Ω$ . Suppose a sample of size n is drawn by using SRSWOR scheme from the population $Ω$ for estimating the population mean $\bar{Y}$ of the study variable y. Let $\bar{y} = \frac{1}{n} \sum_{i = 1}^{n} y_{i}$ and $(\bar{x} = \frac{1}{n} \sum_{i = 1}^{n} x_{i}$ , $\bar{z} = \frac{1}{n} \sum_{i = 1}^{n} z_{i})$ be the unbiased estimators of the population means $\bar{Y} = \frac{1}{N} \sum_{i = 1}^{N} y_{i}$ and $(\bar{X} = \frac{1}{N} \sum_{i = 1}^{N} x_{i}$ , $\bar{Z} = \frac{1}{N} \sum_{i = 1}^{N} z_{i})$ respectively.

It is assumed that the population means $(\bar{X}, \bar{Z})$ of (x, z) are known. Further we denote

$C_{y} = \frac{S_{y}}{\bar{Y}}$ : the population coefficient of variation of y,

$C_{x} = \frac{S_{x}}{\bar{X}}$ : the population coefficient of variation of x,

$C_{z} = \frac{S_{z}}{\bar{Z}}$ : the population coefficient of variation of z,

$ρ_{y x} = \frac{S_{y x}}{S_{y} S_{x}}$ : the population correlation coefficient between y and x,

$ρ_{y z} = \frac{S_{y z}}{S_{y} S_{z}}$ : the population correlation coefficient between y and z,

$ρ_{x z} = \frac{S_{x z}}{S_{x} S_{z}}$ : the population correlation coefficient between x and z,

$S_{y x} = \frac{1}{N - 1} \sum_{i = 1}^{N} (y_{i} - \bar{Y}) (x_{i} - \bar{X})$ : the population covariance between y and x,

$S_{y z} = \frac{1}{N - 1} \sum_{i = 1}^{N} (y_{i} - \bar{Y}) (z_{i} - \bar{Z})$ : the population covariance between y and z,

$S_{x z} = \frac{1}{N - 1} \sum_{i = 1}^{N} (x_{i} - \bar{X}) (z_{i} - \bar{Z})$ : the population covariance between x and z,

$S_{y}^{2} = \frac{1}{N - 1} \sum_{i = 1}^{N} {(y_{i} - \bar{Y})}^{2}$ : the population mean square of y,

$S_{x}^{2} = \frac{1}{N - 1} \sum_{i = 1}^{N} {(x_{i} - \bar{X})}^{2}$ : the population mean square of x,

$S_{z}^{2} = \frac{1}{N - 1} \sum_{i = 1}^{N} {(z_{i} - \bar{Z})}^{2}$ : the population mean square of z.

$K_{y x} = \frac{ρ_{y x} C_{y}}{C_{x}}$ , $K_{y z} = \frac{ρ_{y z} C_{y}}{C_{z}}$ , $K_{x z} = \frac{ρ_{x z} C_{x}}{C_{z}}$ , $K_{z x} = \frac{ρ_{z x} C_{z}}{C_{x}}$ and $f = \frac{n}{N}$ : is the sampling fraction.

Now we review some existing estimators.

The usual unbiased estimator for population mean $\bar{Y}$ is given by

{\hat{\bar{Y}}}_{0} = \bar{y} = \frac{1}{n} \sum_{i = 1}^{n} y_{i}

(1)

The variance/mean squared error (MSE) under SRSWOR is given by

Var ({\hat{\bar{Y}}}_{0}) = MSE ({\hat{\bar{Y}}}_{0}) = (\frac{1 - f}{n}) {\bar{Y}}^{2} C_{y}^{2} = (\frac{1 - f}{n}) S_{y}^{2}

(2)

The usual ratio and product estimators for population mean $\bar{Y}$ are respectively defined by

{\hat{\bar{Y}}}_{R} = \bar{y} (\frac{\bar{X}}{\bar{x}})

(3)

and

{\hat{\bar{Y}}}_{P} = \bar{y} (\frac{\bar{x}}{\bar{X}})

(4)

To the first degree of approximation (fda), the MSEs of the estimators ${\hat{\bar{Y}}}_{R}$ and ${\hat{\bar{Y}}}_{P}$ are respectively given by

$MSE ({\hat{\bar{Y}}}_{R})$	$= (\frac{1 - f}{n}) {\bar{Y}}^{2} [C_{y}^{2} + C_{x}^{2} (1 - 2 K_{y x})]$	(5)
$MSE ({\hat{\bar{Y}}}_{P})$	$= (\frac{1 - f}{n}) {\bar{Y}}^{2} [C_{y}^{2} + C_{x}^{2} (1 + 2 K_{y x})]$	(6)

The generalized version of the estimators ${\hat{\bar{Y}}}_{0}$ , ${\hat{\bar{Y}}}_{R}$ and ${\hat{\bar{Y}}}_{P}$ due to Srivastava (1967) is given by

{\hat{\bar{Y}}}_{α_{1}} = \bar{y} {(\frac{\bar{x}}{\bar{X}})}^{α_{1}}

(7)

where $α_{1}$ is a suitably chosen constant.

To the fda, the $MSE ({\hat{\bar{Y}}}_{α_{1}})$ is given by

MSE ({\hat{\bar{Y}}}_{α_{1}}) = \frac{(1 - f)}{n} {\bar{Y}}^{2} [C_{y}^{2} + α_{1} C_{x}^{2} (α_{1} + 2 K_{y x})]

(8)

which is minimum when

α_{1} = - K_{y x} α_{1 (o p t)}, say

(9)

Substitution of (9) in (8) yields the minimum MSE of ${\hat{\bar{Y}}}_{α_{1}}$ as

${MSE}_{\min} ({\hat{\bar{Y}}}_{α_{1}})$	$= \frac{(1 - f)}{n} {\bar{Y}}^{2} [C_{y}^{2} - K_{y x}^{2} C_{x}^{2}]$
	$= \frac{(1 - f)}{n} {\bar{Y}}^{2} C_{y}^{2} [1 - ρ_{y x}^{2}] = \frac{(1 - f)}{n} S_{y}^{2} [1 - ρ_{y x}^{2}]$	(10)

The traditional difference estimator for $\bar{Y}$ is defined by

{\hat{\bar{Y}}}_{D 1} = \bar{y} + d_{0} (\bar{X} - \bar{x}),

(11)

where $d_{0}$ is a suitably chosen constant to be determined such that the MSE of ${\hat{\bar{Y}}}_{D_{1}}$ is minimum.

The minimum MSE of ${\hat{\bar{Y}}}_{D 1}$ is given by

{MSE}_{\min} ({\hat{\bar{Y}}}_{D 1}) = (\frac{1 - f}{n}) {\bar{Y}}^{2} C_{y}^{2} (1 - ρ_{y x}^{2}) = (\frac{1 - f}{n}) S_{y}^{2} (1 - ρ_{y x}^{2})

Upadhyaya et al. (1985) suggested a class of estimators for population mean $\bar{Y}$ as

{\hat{\bar{Y}}}_{USV} = w_{0} \bar{y} + w_{1} \bar{y} {(\frac{\bar{x}}{\bar{X}})}^{α_{1}}

(13)

where $w_{0}$ and $w_{1}$ are suitably chosen weights whose sum need not be ‘unity’ and $α_{1}$ is a design parameter.

The MSE of ${\hat{\bar{Y}}}_{USV}$ to the fda is given by

$MSE ({\hat{\bar{Y}}}_{USV})$	$= {\bar{Y}}^{2} [1 + w_{0}^{2} A_{0 (s r s)} + w_{1}^{2} A_{1 (s r s)}$
	$+ 2 w_{0} w_{1} A_{3 (s r s)} - 2 w_{0} - 2 w_{1} A_{6 (s r s)}]$	(14)

where

$A_{0 (s r s)}$	$= [1 + (\frac{1 - f}{n}) C_{y}^{2}]$
$A_{1 (s r s)}$	$= [1 + (\frac{1 - f}{n}) {C_{y}^{2} + 4 α_{1} ρ_{y x} C_{y} C_{x} + α_{1} (2 α_{1} - 1) C_{x}^{2}}]$
$A_{3 (s r s)}$	$= [1 + (\frac{1 - f}{n}) {C_{y}^{2} + 2 α_{1} ρ_{y x} C_{y} C_{x} + \frac{α_{1} (α_{1} - 1)}{2} C_{x}^{2}}]$
$A_{6 (s r s)}$	$= [1 + (\frac{1 - f}{n}) {α_{1} ρ_{y x} C_{y} C_{x} + \frac{α_{1} (α_{1} - 1)}{2} C_{x}^{2}}]$

The best values of $(w_{0}, w_{1})$ for which the MSE of ${\hat{\bar{Y}}}_{USV}$ is minimized, are given by

w_{0} = \frac{Δ_{0}^{*}}{Δ^{*}}, w_{1} = \frac{Δ_{1}^{*}}{Δ^{*}}

(15)

where

$Δ^{*}$	$= \| \begin{matrix} A_{0 (s r s)} & A_{3 (s r s)} \\ A_{3 (s r s)} & A_{1 (s r s)} \end{matrix} \| = (A_{0 (s r s)} A_{1 (s r s)} - A_{3 (s r s)}^{2})$
$Δ_{0}^{*}$	$= \| \begin{matrix} 1 & A_{3 (s r s)} \\ A_{6 (s r s)} & A_{1 (s r s)} \end{matrix} \| = (A_{1 (s r s)} - A_{3 (s r s)} A_{6 (s r s)})$
$Δ_{1}^{*}$	$= \| \begin{matrix} A_{0 (s r s)} & 1 \\ A_{3 (s r s)} & A_{6 (s r s)} \end{matrix} \| = (A_{0 (s r s)} A_{6 (s r s)} - A_{3 (s r s)})$

Thus the minimum MSE of ${\hat{\bar{Y}}}_{USV}$ is given by

{MSE}_{\min} ({\hat{\bar{Y}}}_{USV}) = {\bar{Y}}^{2} [1 - \frac{{A_{1 (s r s)} - 2 A_{3 (s r s)} A_{6 (s r s)} + A_{0 (s r s)} A_{6 (s r s)}^{2}}}{(A_{0 (s r s)} A_{1 (s r s)} - A_{3 (s r s)}^{2})}]

(16)

If we set $w_{0} + w_{1} = 1 \Rightarrow w_{1} = (1 - w_{0})$ in (13) we get an estimator for population mean $\bar{Y}$ of y as

{\hat{\bar{Y}}}^{*} = w_{0} \bar{y} + (1 - w_{0}) \bar{y} {(\frac{\bar{x}}{\bar{X}})}^{α_{1}}

(17)

which includes the estimators due to Srivastava (1967), Ray et al. (1979) and Vos (1980).

Putting $w_{1} = (1 - w_{0})$ in (2) we get the MSE of ${\hat{\bar{Y}}}^{*}$ to the fda as

$MSE ({\hat{\bar{Y}}}^{*})$	$= {\bar{Y}}^{2} [1 + A_{1 (s r s)} - 2 A_{6 (s r s)} + w_{0}^{2} (A_{0 (s r s)} + A_{1 (s r s)}$
	$- 2 A_{3 (s r s)}) - 2 w_{0} (1 + A_{1 (s r s)} - A_{3 (s r s)} - A_{6 (s r s)})]$

which is minimized for

w_{0} = \frac{(1 + A_{1 (s r s)} - A_{3 (s r s)} - A_{6 (s r s)})}{(A_{0 (s r s)} + A_{1 (s r s)} - 2 A_{3 (s r s)})} = (1 + \frac{K_{y x}}{α_{1}}) = w_{0 (o p t)}, say

(19)

Thus the minimum MSE of ${\hat{\bar{Y}}}^{*}$ is given by

${MSE}_{\min} ({\hat{\bar{Y}}}^{*})$
$= {\bar{Y}}^{2} [1 + A_{1 (s r s)} - 2 A_{6 (s r s)} - \frac{{(1 + A_{1 (s r s)} - A_{3 (s r s)} - A_{6 (s r s)})}^{2}}{(A_{0 (s r s)} + A_{1 (s r s)} - 2 A_{3 (s r s)})}]$
$= (\frac{1 - f}{n}) {\bar{Y}}^{2} C_{y}^{2} (1 - ρ_{y x}^{2}) = (\frac{1 - f}{n}) S_{y}^{2} (1 - ρ_{y x}^{2})$	(20)

Rao (1991) suggested difference-type estimator for population mean $\bar{Y}$ as

{\hat{\bar{Y}}}_{R a o} = α_{1} \bar{y} + α_{2} (\bar{X} - \bar{x}),

(21)

where $α_{1}$ and $α_{2}$ are constants to be determined such that MSE of ${\hat{\bar{Y}}}_{R a o}$ is minimum.

The bias and MSE of ${\hat{\bar{Y}}}_{R a o}$ are respectively given by

$B ({\hat{\bar{Y}}}_{R a o})$	$= \bar{Y} (α_{1} - 1),$	(22)
$MSE ({\hat{\bar{Y}}}_{R a o})$	$= {\bar{Y}}^{2} [1 + α_{1}^{2} {1 + (\frac{1 - f}{n}) C_{y}^{2}} + α_{2}^{2} (\frac{1 - f}{n}) \frac{C_{x}^{2}}{R^{2}}$
	$- 2 α_{1} α_{2} (\frac{1 - f}{n}) \frac{K_{y x} C_{x}^{2}}{R} - 2 α_{1}]$	(23)

where $R = \frac{\bar{Y}}{\bar{X}}$ .

The $MSE ({\hat{\bar{Y}}}_{R a o})$ at (2) is minimized for

$α_{1}$	$= {1 + (\frac{1 - f}{n}) (C_{y}^{2} - K_{y x}^{2} C_{x}^{2})}^{- 1} = α_{1 (o p t)}, say$
$α_{2}$	$= - R K_{y x} {1 + (\frac{1 - f}{n}) (C_{y}^{2} - K_{y x}^{2} C_{x}^{2})}^{- 1} = α_{2 (o p t)}, say$

Thus the minimum MSE of ${\hat{\bar{Y}}}_{R a o}$ is given by

{MSE}_{\min} ({\hat{\bar{Y}}}_{R a o}) = \frac{(\frac{1 - f}{n}) {\bar{Y}}^{2} (C_{y}^{2} - K_{y x}^{2} C_{x}^{2})}{1 + (\frac{1 - f}{n}) (C_{y}^{2} - K_{y x}^{2} C_{x}^{2})}

(24)

Gupta and Shabbir (2008) proposed the following class of estimators for population mean $\bar{Y}$ as

{\hat{\bar{Y}}}_{G S} = [α_{3} \bar{y} + α_{4} (\bar{X} - \bar{x})] (\frac{\bar{X}}{\bar{x}})

(25)

where ( $α_{3}, α_{4}$ ) are suitably chosen constants such that the MSE of ${\hat{\bar{Y}}}_{G S}$ is minimum.

To the fda, the bias and MSE of ${\hat{\bar{Y}}}_{G S}$ are respectively given by

$B ({\hat{\bar{Y}}}_{G S})$	$= \bar{Y} [α_{3} {1 + (\frac{1 - f}{n}) C_{x}^{2} (1 - K_{y x})} + \frac{α_{4}}{R} C_{x}^{2} - 1]$	(26)
$MSE ({\hat{\bar{Y}}}_{G S})$	$= {\bar{Y}}^{2} [\begin{matrix} 1 + α_{3}^{2} {1 + \frac{(1 - f)}{n} [C_{y}^{2} + C_{x}^{2} (3 - 4 K_{y x})]} \\ + α_{4}^{2} \frac{(1 - f)}{n} \frac{C_{x}^{2}}{R^{2}} + \frac{2 α_{3} α_{4}}{R} \frac{(1 - f)}{n} C_{x}^{2} (2 - K_{y x}) \\ - 2 α_{3} {1 + \frac{(1 - f)}{n} C_{x}^{2} (1 - K_{y x}) \\ - 2 α_{4} \frac{(1 - f)}{n} \frac{C_{x}^{2}}{R}} \end{matrix}]$	(27)

The $M S E ({\hat{\bar{Y}}}_{G S})$ at (27) is minimum when

$α_{3}$	$= \frac{(a_{2} a_{4} - a_{3} a_{5})}{(a_{1} a_{2} - a_{3}^{2})} = α_{3 (o p t)}, say$
$α_{4}$	$= \frac{(a_{1} a_{5} - a_{3} a_{4})}{(a_{1} a_{2} - a_{3}^{2})} = α_{4 (o p t)}, say$

where

$a_{1}$	$= [1 + (\frac{1 - f}{n}) {C_{y}^{2} + C_{x}^{2} (3 - 4 K_{y x})}]$
$a_{2}$	$= (\frac{1 - f}{n}) \frac{C_{x}^{2}}{R^{2}}$
$a_{3}$	$= (\frac{1 - f}{n}) \frac{C_{x}^{2}}{R} (2 - K_{y x})$
$a_{4}$	$= [1 + (\frac{1 - f}{n}) C_{x}^{2} (1 - K_{y x})]$
$a_{5}$	$= (\frac{1 - f}{n}) \frac{C_{x}^{2}}{R}$

Thus the minimum MSE of ${\hat{\bar{Y}}}_{G S}$ is given by

{MSE}_{\min} ({\hat{\bar{Y}}}_{G S}) = {\bar{Y}}^{2} {1 - \frac{(a_{2} a_{4}^{2} - 2 a_{3} a_{4} a_{5} + a_{1} a_{5}^{2})}{(a_{1} a_{2} - a_{3}^{2})}}

(28)

The traditional difference estimator for population mean $\bar{Y}$ using two auxiliary variables x and z is defined by

{\hat{\bar{Y}}}_{D 2} = {\bar{y} + k_{1} (\bar{X} - \bar{x}) + k_{2} (\bar{Z} - \bar{z})}

(29)

where $k_{1}$ and $k_{2}$ are constants to be determined such that $MSE ({\hat{\bar{Y}}}_{D 2})$ is minimum.

It is obvious from (29) that the difference estimator ${\hat{\bar{Y}}}_{D 2}$ is unbiased for population mean $\bar{Y}$ .

The variance/MSE of the estimator ${\hat{\bar{Y}}}_{D 2}$ is given by

$Var ({\hat{\bar{Y}}}_{D 2})$	$= MSE ({\hat{\bar{Y}}}_{D 2})$
	$= (\frac{1 - f}{n}) {\bar{Y}}^{2} [C_{y}^{2} + k_{1}^{2} \frac{C_{x}^{2}}{R^{2}} + k_{2}^{2} \frac{C_{z}^{2}}{R^{2}} + 2 k_{1} k_{2} \frac{K_{x z} C_{z}^{2}}{R R^{*}}$
	$- 2 k_{1} \frac{K_{y x} C_{x}^{2}}{R} - 2 k_{2} \frac{K_{y z} C_{z}^{2}}{R^{*}}]$	(30)

where $R^{*} = \frac{\bar{Y}}{\bar{Z}}$ .

The $MSE ({\hat{\bar{Y}}}_{D 2})$ at (2) is minimized for

$k_{1}$	$= \frac{R C_{y} (ρ_{y x} - ρ_{y z} ρ_{x z})}{C_{x} (1 - ρ_{x z}^{2})} = k_{1 (o p t)}, say$
$k_{2}$	$= \frac{R^{*} C_{y} (ρ_{y z} - ρ_{y x} ρ_{x z})}{C_{z} (1 - ρ_{x z}^{2})} = k_{2 (o p t)}, say$

Thus the minimum MSE of ${\hat{\bar{Y}}}_{D 2}$ is given by

{MSE}_{\min} ({\hat{\bar{Y}}}_{D 2}) = (\frac{1 - f}{n}) {\bar{Y}}^{2} C_{y}^{2} (1 - R_{y . x z}^{2}) = (\frac{1 - f}{n}) S_{y}^{2} (1 - R_{y . x z}^{2})

(31)

where $R_{y . x z}^{2} = \frac{ρ_{y x}^{2} + ρ_{y z}^{2} - 2 ρ_{y x} ρ_{y z} ρ_{x z}}{1 - ρ_{x z}^{2}}$ is the multiple correlation coefficient.

3 Suggested Generalized Class of Estimators in Simple Random Sampling

Motivated by Upadhyaya et al. (1985) we propose a generalized class of estimators based on two auxiliary variables x and z for population mean $\bar{Y}$ of y as

t = w_{0} \bar{y} + w_{1} \bar{y} {(\frac{\bar{x}}{\bar{X}})}^{α_{1}} + w_{2} \bar{y} {(\frac{\bar{z}}{\bar{Z}})}^{α_{2}}

(32)

where $(w_{0}, w_{1}, w_{2})$ are weights whose sum need not be ‘unity’ and $(α_{1}, α_{2})$ are design parameters. The constants $(α_{1}, α_{2})$ may take positive $(+, +)$ or negative $(-, -)$ or positive-negative $(+, -)$ or negative-positive $(-, +)$ values to form product-type or ratio-type or product-cum-ratio-type or ratio-cum-product-type estimator.

To obtain the bias and MSE of the propounded estimator t, we write

\bar{y} = \bar{Y} (1 + e_{0}), \bar{x} = \bar{X} (1 + e_{1}) and \bar{z} = \bar{Z} (1 + e_{2})

such that $E (e_{0}) = E (e_{1}) = E (e_{2}) = 0$

$E (e_{0}^{2})$	$= (\frac{1 - f}{n}) C_{y}^{2}, E (e_{1}^{2}) = (\frac{1 - f}{n}) C_{x}^{2},$
$E (e_{2}^{2})$	$= (\frac{1 - f}{n}) C_{z}^{2}$
$E (e_{0} e_{1})$	$= (\frac{1 - f}{n}) ρ_{y x} C_{y} C_{x} = (\frac{1 - f}{n}) K_{y x} C_{x}^{2},$
$E (e_{0} e_{2})$	$= (\frac{1 - f}{n}) ρ_{y z} C_{y} C_{z} = (\frac{1 - f}{n}) K_{y z} C_{z}^{2}$
$E (e_{1} e_{2})$	$= (\frac{1 - f}{n}) ρ_{x z} C_{x} C_{z} = (\frac{1 - f}{n}) K_{x z} C_{x}^{2} = (\frac{1 - f}{n}) K_{z x} C_{z}^{2} .$

Expressing (32) in terms of e’s we have

t = \bar{Y} [w_{0} (1 + e_{0}) + w_{1} (1 + e_{0}) {(1 + e_{1})}^{α_{1}} + w_{2} (1 + e_{0}) {(1 + e_{2})}^{α_{2}}]

We suppose that $| e_{i} | ≪ 1$ so that ${(1 + e_{i})}^{α_{i}}, i = 1, 2$ are expandable.

Expanding the right hand side of (3), multiplying out and ignoring terms of e’s having power greater than two, we have

$t$	$≅ \bar{Y} [w_{0} (1 + e_{0}) + w_{1} {1 + e_{0} + α_{1} e_{1} + α_{1} e_{0} e_{1} + \frac{α_{1} (α_{1} - 1)}{2} e_{1}^{2}}$
	$+ w_{2} {1 + e_{0} + α_{2} e_{2} + α_{2} e_{0} e_{2} + \frac{α_{2} (α_{2} - 1)}{2} e_{2}^{2}}]$

(t - \bar{Y}) ≅ \bar{Y} [\begin{matrix} w_{0} (1 + e_{0}) \\ + w_{1} {1 + e_{0} + α_{1} e_{1} + α_{1} e_{0} e_{1} + \frac{α_{1} (α_{1} - 1)}{2} e_{1}^{2}} \\ + w_{2} {1 + e_{0} + α_{2} e_{2} + α_{2} e_{0} e_{2} + \frac{α_{2} (α_{2} - 1)}{2} e_{2}^{2}} - 1 \end{matrix}]

(34)

Taking expectation of both sides of (34) we get the bias of t to the fda as

$B (t)$	$= \bar{Y} [w_{0} + w_{1} {1 + (\frac{1 - f}{n}) \frac{α_{1}}{2} (α_{1} + 2 K_{y x} - 1) C_{x}^{2}}$
	$+ w_{2} {1 + (\frac{1 - f}{n}) \frac{α_{2}}{2} (α_{2} + 2 K_{y z} - 1) C_{z}^{2}} - 1]$
	$= \bar{Y} [w_{0} + w_{2} A_{6 (s r s)} + w_{3} A_{7 (s r s)} - 1]$	(35)

where

$A_{6 (s r s)}$	$= [1 + (\frac{1 - f}{n}) \frac{α_{1}}{2} (α_{1} + 2 K_{y x} - 1) C_{x}^{2}]$
$A_{7 (s r s)}$	$= [1 + (\frac{1 - f}{n}) \frac{α_{2}}{2} (α_{2} + 2 K_{y z} - 1) C_{z}^{2}]$

Squaring both sides of (34) and ignoring terms of e’s having power greater than two we have

{(t - \bar{Y})}^{2} = {\bar{Y}}^{2} [\begin{matrix} 1 + w_{0}^{2} (1 + 2 e_{0} + e_{0}^{2}) \\ + w_{1}^{2} {\begin{matrix} 1 + 2 e_{0} + 2 α_{1} e_{1} \\ + e_{0}^{2} + 4 α_{1} e_{0} e_{1} \\ + α_{1} (2 α_{1} - 1) e_{1}^{2} \end{matrix}} \\ + w_{2}^{2} {\begin{matrix} 1 + 2 e_{0} + 2 α_{2} e_{2} \\ + e_{0}^{2} + 4 α_{2} e_{0} e_{2} \\ + α_{2} (2 α_{2} - 1) e_{2}^{2} \end{matrix}} \\ + 2 w_{0} w_{1} {\begin{matrix} 1 + 2 e_{0} + α_{1} e_{1} + e_{0}^{2} \\ + 2 α_{1} e_{0} e_{1} + \frac{α_{1} (α_{1} - 1)}{2} e_{1}^{2} \end{matrix}} \\ + 2 w_{0} w_{2} {\begin{matrix} 1 + 2 e_{0} + α_{2} e_{2} + e_{0}^{2} + 2 α_{2} e_{0} e_{2} \\ + \frac{α_{2} (α_{2} - 1)}{2} e_{2}^{2} \end{matrix}} \\ + 2 w_{1} w_{2} {\begin{matrix} 1 + 2 e_{0} + α_{1} e_{1} + α_{2} e_{2} \\ + e_{0}^{2} + 2 α_{1} e_{0} e_{1} \\ + 2 α_{2} e_{0} e_{2} + α_{1} α_{1} e_{1} e_{0} \\ + \frac{α_{1} (α_{1} - 1)}{2} e_{1}^{2} \\ + \frac{α_{2} (α_{2} - 1)}{2} e_{2}^{2} \end{matrix}} - 2 w_{0} \\ - 2 w_{1} {1 + e_{0} + α_{1} e_{1} + α_{1} e_{0} e_{1} + \frac{α_{1} (α_{1} - 1)}{2} e_{1}^{2}} \\ - 2 w_{2} {1 + e_{0} + α_{2} e_{2} + α_{2} e_{0} e_{2} + \frac{α_{2} (α_{2} - 1)}{2} e_{2}^{2}} \end{matrix}]

(36)

Taking expectation of both sides of (36) we get the MSE of t to the fda as

MSE (t) = {\bar{Y}}^{2} [\begin{matrix} 1 + w_{0}^{2} A_{0 (s r s)} + w_{1}^{2} A_{1 (s r s)} + w_{2}^{2} A_{2 (s r s)} \\ + 2 w_{0} w_{1} A_{3 (s r s)} + 2 w_{0} w_{2} A_{4 (s r s)} + 2 w_{1} w_{2} A_{5 (s r s)} \\ - 2 w_{0} - 2 w_{1} A_{6 (s r s)} - 2 w_{2} A_{7 (s r s)} \end{matrix}]

(37)

where

$A_{2 (s r s)}$	$= [1 + (\frac{1 - f}{n}) {C_{y}^{2} + 4 α_{2} ρ_{y z} C_{y} C_{z} + α_{2} (2 α_{2} - 1) C_{z}^{2}}]$
$A_{4 (s r s)}$	$= [1 + (\frac{1 - f}{n}) {C_{y}^{2} + 2 α_{2} ρ_{y z} C_{y} C_{z} + \frac{α_{2} (α_{2} - 1)}{2} C_{z}^{2}}]$
$A_{5 (s r s)}$	$= [1 + (\frac{1 - f}{n}) {\begin{matrix} C_{y}^{2} + 2 α_{1} ρ_{y x} C_{y} C_{x} + 2 α_{2} ρ_{y z} C_{y} C_{z} \\ + α_{1} α_{2} ρ_{x z} C_{x} C_{z} \\ + \frac{α_{1} (α_{1} - 1)}{2} C_{x}^{2} + \frac{α_{2} (α_{2} - 1)}{2} C_{z}^{2} \end{matrix}}]$

$(A_{0 (s r s)}, A_{1 (s r s)}, A_{3 (s r s)}, A_{6 (s r s)} and A_{7 (s r s)})$ are same as defined earlier.

Minimization of $M S E (t)$ at (37) with respect to $(w_{0}, w_{1}, w_{2})$ yields

[\begin{matrix} A_{0 (s r s)} & A_{3 (s r s)} & A_{4 (s r s)} \\ A_{3 (s r s)} & A_{1 (s r s)} & A_{5 (s r s)} \\ A_{4 (s r s)} & A_{5 (s r s)} & A_{2 (s r s)} \end{matrix}] [\begin{matrix} w_{0} \\ w_{1} \\ w_{2} \end{matrix}] = [\begin{matrix} 1 \\ A_{6 (s r s)} \\ A_{7 (s r s)} \end{matrix}]

(38)

After simplification of (38), we get the optimum values of $(w_{0}, w_{1}, w_{2})$ respectively as

w_{00} = \frac{Δ_{0}}{Δ}, w_{10} = \frac{Δ_{1}}{Δ}, w_{20} = \frac{Δ_{2}}{Δ};

(39)

where

$Δ$	$= \| \begin{matrix} A_{0 (s r s)} & A_{3 (s r s)} & A_{4 (s r s)} \\ A_{3 (s r s)} & A_{1 (s r s)} & A_{5 (s r s)} \\ A_{4 (s r s)} & A_{5 (s r s)} & A_{2 (s r s)} \end{matrix} \|$
	$= A_{0 (s r s)} (A_{1 (s r s)} A_{2 (s r s)} - A_{5 (s r s)}^{2}) - A_{3 (s r s)} (A_{2 (s r s)} A_{3 (s r s)}$
	$- A_{4 (s r s)} A_{5 (s r s)}) + A_{4 (s r s)} (A_{3 (s r s)} A_{5 (s r s)} - A_{1 (s r s)} A_{4 (s r s)})$
$Δ_{0}$	$= \| \begin{matrix} 1 & A_{3 (s r s)} & A_{4 (s r s)} \\ A_{6 (s r s)} & A_{1 (s r s)} & A_{5 (s r s)} \\ A_{7 (s r s)} & A_{5 (s r s)} & A_{2 (s r s)} \end{matrix} \|$
	$= (A_{1 (s r s)} A_{2 (s r s)} - A_{5 (s r s)}^{2}) - A_{3 (s r s)} (A_{2 (s r s)} A_{6 (s r s)}$
	$- A_{5 (s r s)} A_{7 (s r s)}) + A_{4 (s r s)} (A_{5 (s r s)} A_{6 (s r s)} - A_{1 (s r s)} A_{7 (s r s)})$
$Δ_{1}$	$= \| \begin{matrix} A_{0 (s r s)} & 1 & A_{4 (s r s)} \\ A_{3 (s r s)} & A_{6 (s r s)} & A_{5 (s r s)} \\ A_{4 (s r s)} & A_{7 (s r s)} & A_{2 (s r s)} \end{matrix} \|$
	$= A_{0 (s r s)} (A_{2 (s r s)} A_{6 (s r s)} - A_{5 (s r s)} A_{7 (s r s)}) - (A_{2 (s r s)} A_{3 (s r s)}$
	$- A_{4 (s r s)} A_{5 (s r s)}) + A_{4 (s r s)} (A_{3 (s r s)} A_{7 (s r s)} - A_{4 (s r s)} A_{6 (s r s)})$
$Δ_{2}$	$= \| \begin{matrix} A_{0 (s r s)} & A_{3 (s r s)} & 1 \\ A_{3 (s r s)} & A_{1 (s r s)} & A_{6 (s r s)} \\ A_{4 (s r s)} & A_{5 (s r s)} & A_{7 (s r s)} \end{matrix} \|$
	$= A_{0 (s r s)} (A_{1 (s r s)} A_{7 (s r s)} - A_{5 (s r s)} A_{6 (s r s)}) - A_{3 (s r s)} (A_{3 (s r s)} A_{7 (s r s)}$
	$- A_{4 (s r s)} A_{6 (s r s)}) + (A_{3 (s r s)} A_{5 (s r s)} - A_{1 (s r s)} A_{4 (s r s)})$

Substitution of (39) in (37) yields the minimum MSE of t as

{MSE}_{\min} (t) = {\bar{Y}}^{2} [1 - \frac{Δ_{0}}{Δ} - \frac{A_{6 (s r s)} Δ_{1}}{Δ} - \frac{A_{7 (s r s)} Δ_{2}}{Δ}]

(40)

Now we state the following theorem.

Theorem-3.1 – The minimum MSE of t is greater than or equal to MSE(t) i.e.

M S E (t) \geq {MSE}_{\min} (t) = {\bar{Y}}^{2} [1 - \frac{(Δ_{0} + A_{6 (s r s)} Δ_{1} + A_{7 (s r s)} Δ_{2})}{Δ}]

(41)

with equality holding if

w_{0 i} = \frac{Δ_{i}}{Δ}, i = 0, 1, 2 .

Remark-3.1 – It is to be mentioned that the class of estimators ‘t’ at (32) will attained its minimum MSE at (40) only when the optimum values $(w_{00}, w_{10}, w_{20})$ at (39) of the weights $(w_{0}, w_{1}, w_{2})$ are known exactly, but in practice the exact values of the population parameters $(C_{y}, C_{x}, C_{z}, ρ_{y x}, ρ_{y z}, ρ_{x z})$ are rarely available. However in repeated surveys or studies based on multiphase sampling, where information regarding the same variates is gathered on several occasions, it is possible to guess quite precisely the values of certain population parameters such as $(C_{y}, C_{x}, C_{z}, ρ_{y x}, ρ_{y z}, ρ_{x z})$ . Further we mention that the good guess values of these population parameters can also be obtained from the past data or the experience gathered in due course of time or through a pilot sample survey. This problem has been discussed among others by Murthy (1967, pp. 96–99), Searls (1964), Srivastava (1966), Gleser and Healy (1976), Das and Tripathi (1978), Reddy (1978), Tripathi et al. (1983) and Srivenkataramana and Tracy (1984). Thus the values of such population parameters $(C_{y}, C_{x}, C_{z}, ρ_{y x}, ρ_{y z}, ρ_{x z})$ can be known exactly. We recall that the scalars $(α_{1}, α_{2})$ are real. The values of the scalars $(α_{1}, α_{2})$ are known (or can be known by the experimental practitioner) as the values of $(α_{1}, α_{2})$ yield the form of the estimator. Thus the optimum values $(w_{00}, w_{10}, w_{20})$ of the corresponding constants $(w_{0}, w_{1}, w_{2})$ can be obtained quite accurately. Hence we conclude that in practice, an operational estimator can be derived from the suggested class of estimators ‘t’ with mean squared error smaller than the conventional estimators.

On the other hand if the values of the population parameters such as $(C_{y}, C_{x}, C_{z}, ρ_{y x}, ρ_{y z}, ρ_{x z})$ are not known (or cannot be made known) at all. In such situations, the practical utility of such estimators is limited. So in such circumstances one can estimate the value of these population parameters by their corresponding sample statistics. Hence the estimates $({\hat{w}}_{00}, {\hat{w}}_{10}, {\hat{w}}_{20})$ say, of the corresponding optimum values $(w_{00}, w_{10}, w_{20})$ can be obtained. Thus this also suggests that one can also obtain the operational (feasible) estimator from the proposed class of estimators ‘t’ having mean squared error fewer than the usual estimators.

4 Efficiency Comparison

It is observed from (2), (2) and (20) that the common minimum MSEs of the estimators ${\hat{\bar{Y}}}_{α_{1}}, {\hat{\bar{Y}}}_{D 1}$ and ${\hat{\bar{Y}}}^{*}$ is same, i.e.

${MSE}_{\min} ({\hat{\bar{Y}}}_{α_{1}})$	$= {MSE}_{\min} ({\hat{\bar{Y}}}_{D 1})$
	$= {MSE}_{\min} ({\hat{\bar{Y}}}^{*}) = (\frac{1 - f}{n}) {\bar{Y}}^{2} (C_{y}^{2} - K_{y x}^{2} C_{x}^{2})$	(42)

Now we compare the efficiency of traditional difference estimator with usual unbiased estimator ${\hat{\bar{Y}}}_{0} = \bar{y}$ , ratio estimator ${\hat{\bar{Y}}}_{R}$ and product estimator ${\hat{\bar{Y}}}_{P}$ .

From (2), (5), (6) and (2), we have,

$MSE ({\hat{\bar{Y}}}_{0} = \bar{y}) - {MSE}_{\min} ({\hat{\bar{Y}}}_{D 1}) = (\frac{1 - f}{n}) {\bar{Y}}^{2} K_{y x}^{2} C_{x}^{2} \geq 0$	(43)
$MSE ({\hat{\bar{Y}}}_{R}) - {MSE}_{\min} ({\hat{\bar{Y}}}_{D 1}) = (\frac{1 - f}{n}) {\bar{Y}}^{2} C_{x}^{2} {(1 - K_{y x})}^{2} \geq 0$	(44)
$MSE ({\hat{\bar{Y}}}_{P}) - {MSE}_{\min} ({\hat{\bar{Y}}}_{D 1}) = (\frac{1 - f}{n}) {\bar{Y}}^{2} C_{x}^{2} {(1 + K_{y x})}^{2} \geq 0$	(45)

It follows from (4), (43), (44) and (45) that the estimators ${\hat{\bar{Y}}}_{α_{1}}, {\hat{\bar{Y}}}_{D 1}$ and ${\hat{\bar{Y}}}^{*}$ are more efficient than usual unbiased estimator $\bar{y}$ , ratio estimator ${\hat{\bar{Y}}}_{R}$ and product estimator ${\hat{\bar{Y}}}_{P}$ .

From (2) and (4), we have,

$[{MSE}_{\min} ({\hat{\bar{Y}}}_{α_{1}}) = {MSE}_{\min} ({\hat{\bar{Y}}}_{D 1}) = {MSE}_{\min} ({\hat{\bar{Y}}}^{*})] - {MSE}_{\min} ({\hat{\bar{Y}}}_{R a o})$
$= {(\frac{1 - f}{n})}^{2} {\bar{Y}}^{2} \frac{{(C_{y}^{2} - K_{y x}^{2} C_{x}^{2})}^{2}}{{1 + (\frac{1 - f}{n}) (C_{y}^{2} - K_{y x}^{2} C_{x}^{2})}}$
$\geq 0$	(46)

It follows from (4), (43), (44), (45) and (46) that the estimator ${\hat{\bar{Y}}}_{R a o}$ due to Rao (1991) is more efficient than $\bar{y}, {\hat{\bar{Y}}}_{R}, {\hat{\bar{Y}}}_{P}, {\hat{\bar{Y}}}_{α_{1}}, {\hat{\bar{Y}}}_{D 1}, {\hat{\bar{Y}}}_{α_{1}}$ and ${\hat{\bar{Y}}}^{*}$ .

The minimum MSE of the difference estimator ${\hat{\bar{Y}}}_{D 1}$ given by (2) can be expressed as

${MSE}_{\min} ({\hat{\bar{Y}}}_{D 1})$
$= {\bar{Y}}^{2} [1 + A_{1 (s r s)} - 2 A_{1 (s r s)} - \frac{{(1 + A_{1 (s r s)} - A_{3 (s r s)} - A_{6 (s r s)})}^{2}}{(A_{0 (s r s)} + A_{1 (s r s)} - 2 A_{3 (s r s)})}]$	(47)

From (16) and (4), we have

${MSE}_{\min} ({\hat{\bar{Y}}}_{D 1}) - {MSE}_{\min} ({\hat{\bar{Y}}}_{USV})$
$= {\bar{Y}}^{2} \frac{{[\begin{matrix} A_{1 (s r s)} (1 - A_{0 (s r s)}) + A_{3 (s r s)} (A_{3 (s r s)} - 1) \\ + A_{6 (s r s)} (A_{0 (s r s)} - A_{3 (s r s)}) \end{matrix}]}^{2}}{(A_{0 (s r s)} A_{1 (s r s)} - A_{3 (s r s)}^{2}) (A_{0 (s r s)} + A_{1 (s r s)} - 2 A_{3 (s r s)})} \geq 0$	(48)

From (2) and (31) we have

{MSE}_{\min} ({\hat{\bar{Y}}}_{D 1}) - {MSE}_{\min} ({\hat{\bar{Y}}}_{D 2}) = (\frac{1 - f}{n}) {\bar{Y}}^{2} C_{y}^{2} \frac{{(ρ_{y z} - ρ_{y x} ρ_{x z})}^{2}}{(1 - ρ_{x z}^{2})}

(49)

which shows that the traditional estimator ${\hat{\bar{Y}}}_{D 2}$ is better than ${\hat{\bar{Y}}}_{D 1}$ .

From (16) and (40) we have

${MSE}_{\min} ({\hat{\bar{Y}}}_{USV}) - {MSE}_{\min} (t)$
$= \frac{{\bar{Y}}^{2}}{Δ Δ_{1}} {[\begin{matrix} A_{7 (s r s)} (A_{0 (s r s)} A_{1 (s r s)} - A_{3 (s r s)}^{2}) \\ + (A_{3 (s r s)} A_{5 (s r s)} - A_{1 (s r s)} A_{4 (s r s)}) \\ + A_{6 (s r s)} (A_{3 (s r s)} A_{4 (s r s)} - A_{0 (s r s)} A_{5 (s r s)}) \end{matrix}]}^{2} \geq 0$	(50)

It shows that the proposed class of estimators t is more efficient than the Upadhyaya et al. (1985) estimator ${\hat{\bar{Y}}}_{USV}$ .

Hence from (4), (43), (44), (45), (46), (4) and (49) it is observed that the suggested generalized class of estimators t is better than the estimators $\bar{y}$ , ${\hat{\bar{Y}}}_{R}$ , ${\hat{\bar{Y}}}_{P}$ , ${\hat{\bar{Y}}}_{α_{1}}$ , ${\hat{\bar{Y}}}_{D 1}$ , ${\hat{\bar{Y}}}^{*}$ and ${\hat{\bar{Y}}}_{USV}$ .

From (24) and (40) we have that ${MSE}_{\min} (t) < {MSE}_{\min} ({\hat{\bar{Y}}}_{R a o})$ if

[1 - \frac{(Δ_{0} + A_{6 (s r s)} Δ_{1} + A_{7 (s r s)} Δ_{2})}{Δ}] < \frac{(\frac{1 - f}{n}) (C_{y}^{2} - K_{y x}^{2} C_{x}^{2})}{{1 + (\frac{1 - f}{n}) (C_{y}^{2} - K_{y x}^{2} C_{x}^{2})}}

(51)

From (28) and (40) it is observed that ${MSE}_{\min} (t) < {MSE}_{\min} ({\hat{\bar{Y}}}_{G S})$ if

\frac{a_{2} a_{4}^{2} - 2 a_{3} a_{4} a_{5} + a_{1} a_{5}^{2}}{(a_{1} a_{2} - a_{3}^{2})} < \frac{(Δ_{0} + A_{6 (s r s)} Δ_{1} + A_{7 (s r s)} Δ_{2})}{Δ}

(52)

Further from (31) and (40) we note that ${MSE}_{\min} (t) < {MSE}_{\min} ({\hat{\bar{Y}}}_{D 2})$ if

[1 - \frac{(Δ_{0} + A_{6 (s r s)} Δ_{1} + A_{7 (s r s)} Δ_{2})}{Δ}] < (\frac{1 - f}{n}) C_{y}^{2} (1 - R_{y . x z}^{2})

Thus from (51), (52) and (4) it is observed that the proposed generalized class of estimators t is more efficient than ${\hat{\bar{Y}}}_{R a o}, {\hat{\bar{Y}}}_{G S}$ and ${\hat{\bar{Y}}}_{D 2}$ as long as the conditions (51), (52) and (4) are satisfied respectively.

5 Empirical Study

For numerical comparisons of different estimators, we use the following data sets.

Data I: [Source: Singh and Chaudhary (1986), page 177] Data II: [Source: Abu-Dayyeh et al. (2003)] Data III: [Source: Steel and Torrie (1960)] Data IV: [Source: Cochran (1977)] Data V: [Source: Ahmed (1997)] Data VI: [Source: PCR (1998)]

Data	I	II	III	IV	V	VI
N	34	332	30	34	376	424
N	20	80	6	15	159	169
$\bar{Y}$	856.41	1093.1	0.6860	4.92	316.65	646.215
$\bar{X}$	208.88	181.57	4.6537	2.59	141.13	4533.981
$\bar{Z}$	199.44	143.37	0.8077	2.91	1075.31	325.0325
$C_{y}$	0.86	0.7626	0.4803	1.01232	0.7721	1.509
$C_{x}$	0.72	0.7684	0.2295	1.23187	0.845	1.342
$C_{z}$	0.75	0.7616	0.7493	1.05351	0.7746	1.335
$ρ_{y x}$	0.45	0.973	0.7194	0.7326	0.9106	0.623
$ρ_{y z}$	0.45	0.862	0.04996	0.643	0.9094	0.907
$ρ_{x z}$	0.98	0.842	0.4074	0.6837	0.8614	0.682

Table 1 gives the PRE’s of ${\hat{\bar{Y}}}_{R}, {\hat{\bar{Y}}}_{P}, {\hat{\bar{Y}}}_{D 1}, {\hat{\bar{Y}}}_{R a o}, {\hat{\bar{Y}}}_{G S}$ and ${\hat{\bar{Y}}}_{D 2}$ estimators with respect to $\bar{y}$ for six data sets respectively.

Table 2 gives the PRE of ${\hat{\bar{Y}}}_{USV}$ with respect to $\bar{y}$ for $α_{1} = (- 1, 1)$ , for six data sets.

Table 3(a) depicts the PREs of proposed class of estimators $t$ with respect to $\bar{y}$ at different values of ( $α_{1}, α_{2}$ ) for data sets I, II, III.

Table 1 PRE’s of different estimators of population mean $\bar{Y}$ with respect to $\bar{y}$

Estimator	Data I	Data II	Data III	Data IV	Data V	Data VI
${\hat{\bar{Y}}}_{R}$	105.55	1835.92	94.62	143.30	488.77	146.46
${\hat{\bar{Y}}}_{P}$	40.74	25.15	71.44	23.45	23.86	34.49
${\hat{\bar{Y}}}_{D 1}$	125.39	1877.19	103.33	215.84	585.45	163.43
${\hat{\bar{Y}}}_{R a o}$	126.92	1877.75	106.40	219.66	585.67	164.24
${\hat{\bar{Y}}}_{G s}$	126.93	1877.75	106.42	219.89	585.67	164.25
${\hat{\bar{Y}}}_{D 2}$	125.71	2127.83	174.04	235.09	907.16	563.97

Table 2 PRE’s of ${\hat{\bar{Y}}}_{USV}$ with respect to $\bar{y}$

Data I		Data II		Data III		Data IV		Data V		Data VI
$α_{1}$	PRE	$α_{1}$	PRE	$α_{1}$	PRE	$α_{1}$	PRE	$α_{1}$	PRE	$α_{1}$	PRE
$-$ 1	127.66	$-$ 1	1878.66	$-$ 1	106.76	$-$ 1	227.99	$-$ 1	586.30	$-$ 1	164.74
1	126.24	1	2049.28	1	106.20	1	215.95	1	588.70	1	163.54

Table 3(a): PRE’s of proposed class of estimators t for population mean $\bar{Y}$ with respect to $\bar{y}$ (for data sets I, II, III)

Data I	Data II	Data III
$α_{1}$	$α_{2}$	PRE	$α_{1}$	$α_{2}$	PRE	$α_{1}$	$α_{2}$	PRE
$-$ 16.56	$-$ 16.56	17654.39	$-$ 6.5	$-$ 6.9	772384	$-$ 4	$-$ 4	1286.97
$-$ 16.55	$-$ 16.55	11434.21	$-$ 6.5	$-$ 6.8	234504.2	$-$ 3	$-$ 3	240.98
$-$ 16.54	$-$ 16.54	8469.43	$-$ 6.5	$-$ 6.7	139357.5	$-$ 2	$-$ 2	194.41
$-$ 16.53	$-$ 16.53	6734.46	$-$ 6.5	$-$ 6.6	99699.33	$-$ 1	$-$ 1	180.01
$-$ 16.52	$-$ 16.52	5595.56	$-$ 6.5	$-$ 6.5	77951.88	1	1	174.25
$-$ 16.51	$-$ 16.51	4790.57	$-$ 6.4	$-$ 6.4	34176.73	2	2	177.14
$-$ 16.5	$-$ 16.5	4191.43	$-$ 6.3	$-$ 6.3	22046.63	3	3	183.83
$-$ 16	$-$ 16	657.28	$-$ 6.2	$-$ 6.2	16359.33	4	4	196.20
$-$ 10	$-$ 10	153.84	$-$ 6.1	$-$ 6.1	13060.19	5	5	219.66
$-$ 5	$-$ 5	133.88	$-$ 6	$-$ 6	10906.9	6	6	273.01
$-$ 4	$-$ 4	131.95	$-$ 5	$-$ 5	4435.19	7	7	484.92
$-$ 3	$-$ 3	130.37	$-$ 4	$-$ 4	3039.74	$-$ 4	6	9201.75
$-$ 2	$-$ 2	129.08	$-$ 3	$-$ 3	2477.50	$-$ 3	5	341.57
$-$ 1	$-$ 1	128.03	$-$ 2	$-$ 2	2220.59	$-$ 2	4	230.89
1	1	126.56	$-$ 1	$-$ 1	2130.16	$-$ 1	3	195.37
2	2	126.10	1	1	2358.74	1	2	178.42
3	3	125.82	2	2	2796.94	2	1	174.06
4	4	125.72	3	3	3848.96	$-$ 4	5	386.31
5	5	125.81	4	4	7795.28	$-$ 4	4	253.76
8	8	127.58	4.5	4.5	20478.73	$-$ 4	3	209.95
10	10	130.80	4.6	4.6	31462.35	$-$ 4	2	189.32
12	12	137.87	4.7	4.7	69729.23	$-$ 4	1	178.73
15	15	187.23	1	2	2404.66	$-$ 3	6	1354.17
16	16	328.47	2	3	2902.67	$-$ 3	5	341.57
16.1	16.1	378.07	4	5	9706.82	$-$ 3	4	241.09
16.2	16.2	456.93	$-$ 1	$-$ 2	2135.10	$-$ 3	3	204.27
16.3	16.3	601.84	$-$ 3	$-$ 4	2539.23	$-$ 3	2	186.40
16.4	16.4	955.32	$-$ 4	$-$ 5	3182.99	$-$ 2	6	788.06
16.5	16.5	3122.77	$-$ 5	$-$ 6	4877.70	$-$ 2	5	310.28
*	*	*	$-$ 6	$-$ 7	15453.26	$-$ 2	4	230.89

Table 3(b) indicates the PREs of proposed class of estimators $t$ with respect to $\bar{y}$ at different values of ( $α_{1}, α_{2}$ ) for data sets IV, V, VI.

Table 3(b): PRE’s of proposed class of estimators t with respect to $\bar{y}$ (for data sets IV, V, VI)

Data IV			Data V			Data VI
$α_{1}$	$α_{2}$	PRE	$α_{1}$	$α_{2}$	PRE	$α_{1}$	$α_{2}$	PRE
$-$ 5.5	$-$ 5.6	18686.57	$-$ 13.8	$-$ 13.8	146480.3	$-$ 11.3	$-$ 11.3	56544.35
$-$ 5.5	$-$ 5.5	9127.82	$-$ 13.5	$-$ 13.5	16447.9	$-$ 11	$-$ 11	7475.86
$-$ 5	$-$ 5	1011.91	$-$ 13	$-$ 13	6808.17	$-$ 10	$-$ 10	2084.82
$-$ 4	$-$ 4	443.93	$-$ 12	$-$ 12	3275.97	$-$ 8	$-$ 8	986.33
$-$ 3	$-$ 3	323.39	$-$ 10	$-$ 10	1749.71	$-$ 5	$-$ 5	659.03
$-$ 2	$-$ 2	273.50	$-$ 8	$-$ 8	1285.83	$-$ 4	$-$ 4	616.64
$-$ 1	$-$ 1	248.82	$-$ 5	$-$ 5	1012.38	$-$ 3	$-$ 3	588.91
1	1	235.59	$-$ 4	$-$ 4	967.68	$-$ 2	$-$ 2	572.19
2	2	244.97	$-$ 3	$-$ 3	936.87	$-$ 1	$-$ 1	564.63
3	3	274.84	$-$ 2	$-$ 2	917.52	1	1	575.12
4	4	372.70	$-$ 1	$-$ 1	908.24	2	2	594.85
5	5	2084.42	1	1	918.17	3	3	627.61
5.1	5.1	8806.16	2	2	938.25	4	4	679.08
$-$ 5.5	$-$ 5.4	6136.23	3	3	970.32	5	5	760.58
$-$ 5.5	$-$ 5.2	3812.23	4	4	1017.20	6	6	897.22
$-$ 5.5	$-$ 5.1	3242.17	5	5	1083.64	7	7	1156.67
$-$ 5.5	$-$ 5	2838.5	6	6	1177.62	8	8	1802.22
$-$ 5.5	$-$ 4.5	1851.49	7	7	1313.39	9	9	5816.34
$-$ 5.5	$-$ 4	1470.75	8	8	1518.41	9.3	9.3	24933.39
$-$ 5.5	$-$ 3	1210.17	10	10	2479.08	9	9.4	100422
$-$ 5.5	$-$ 2	1286.14	12	12	13846.51	*	*	*
$-$ 5.5	$-$ 1	2344.68	12.1	12.1	18724.25	*	*	*
$-$ 5.4	$-$ 5.6	5179.15	12.2	12.2	29101.79	*	*	*
$-$ 5.3	$-$ 5.6	3067.52	12.3	12.3	66293.65	*	*	*
$-$ 5.2	$-$ 5.6	2208.87	*	*	*	*	*	*
$-$ 5.1	$-$ 5.6	1743.17	*	*	*	*	*	*

We measured Percent Relative Efficiencies (PREs) of various estimators along with our proposed generalized class of estimators t with respect to $\bar{y}$ . It is observed that from the entries of the Tables 1, 2, 3(a) and 3(b) that the suggested generalized class of estimators t gives the largest PRE (17654.39%, 772384.00%, 9201.75%, 18686.57%, 146480.30%, and 100422.00%) for data set I to IV respectively. Using the proposed generalized class of estimators t over other existing estimators, there is considerable gain in efficiency. Thus there is ample room to pick up the scalars $(α_{1}, α_{2})$ in order to obtain estimators better than the existing estimators. Finally our recommendation is in favor of the proposed generalized class of estimators t for its use in practice.

6 Estimation of Population Mean Under Stratified Random Sampling

We consider a finite population $Ω = {Ω_{1}, Ω_{2}, \dots, Ω_{N}}$ of N units divided into L strata with the $h$ th stratum $(h = 1, 2, \dots, L)$ having $N_{h}$ units such that $\sum_{i = 1}^{L} N_{h} = N$ . Let $y_{h i}$ and $(x_{h i}, z_{h i}) (i = 1, 2, \dots, N_{h})$ respectively be the observations of study variable y and auxiliary variables (x, z) for the $i$ th population unit in the $h$ th stratum. A simple random sample of size $n_{h}$ is drawn without replacement from the $h$ th stratum such that $\sum_{i = 1}^{L} n_{h} = n$ .

Let ${\bar{y}}_{(s t)} = \sum_{h = 1}^{L} W_{h} {\bar{y}}_{h}, {\bar{x}}_{(s t)} = \sum_{h = 1}^{L} W_{h} {\bar{x}}_{h}$ and ${\bar{z}}_{(s t)} = \sum_{h = 1}^{L} W_{h} {\bar{z}}_{h}$ be the sample means corresponding to the population means $\bar{Y} = \sum_{h = 1}^{L} W_{h} {\bar{Y}}_{h},$ $\bar{X} = \sum_{h = 1}^{L} W_{h} {\bar{X}}_{h}$ and $\bar{Z} = \sum_{h = 1}^{L} W_{h} {\bar{Z}}_{h}$ of the variables y, x and z respectively, where ${\bar{y}}_{h} = \frac{1}{n_{h}} \sum_{i = 1}^{n_{h}} y_{h i}$ , ${\bar{x}}_{h} = \frac{1}{n_{h}} \sum_{i = 1}^{n_{h}} x_{h i}$ and ${\bar{z}}_{h} = \frac{1}{n_{h}} \sum_{i = 1}^{n_{h}} z_{h i}$ be the sample means corresponding to the population means ${\bar{Y}}_{h} = \sum_{i = 1}^{N_{h}} \frac{y_{h i}}{N_{h}}$ , ${\bar{X}}_{h} = \sum_{i = 1}^{N_{h}} \frac{x_{h i}}{N_{h}}$ and ${\bar{Z}}_{h} = \sum_{i = 1}^{N_{h}} \frac{z_{h i}}{N_{h}}$ in the $h$ th stratum respectively with known stratum weight $W_{h} = \frac{N_{h}}{N}$ .

Further we denote

C_{y h} = \frac{S_{y h}}{{\bar{Y}}_{h}}, C_{x h} = \frac{S_{x h}}{{\bar{X}}_{h}}, C_{z h} = \frac{S_{z h}}{{\bar{Z}}_{h}}, ρ_{y x h} = \frac{S_{y x h}}{S_{y h} S_{x h}},

ρ_{y z h} = \frac{S_{y z h}}{S_{y h} S_{z h}}, ρ_{x z h} = \frac{S_{x z h}}{S_{x h} S_{z h}},

S_{y h}^{2} = \frac{1}{N_{h} - 1} \sum_{i = 1}^{N_{h}} {(y_{h i} - {\bar{Y}}_{h})}^{2}, S_{x h}^{2} = \frac{1}{N_{h} - 1} \sum_{i = 1}^{N_{h}} {(x_{h i} - {\bar{X}}_{h})}^{2},

S_{z h}^{2} = \frac{1}{N_{h} - 1} \sum_{i = 1}^{N_{h}} {(z_{i h} - {\bar{Z}}_{h})}^{2},

S_{y x h} = \frac{1}{N_{h} - 1} \sum_{i = 1}^{N_{h}} (y_{h i} - {\bar{Y}}_{h}) (x_{h i} - {\bar{X}}_{h}),

S_{y z h} = \frac{1}{N_{h} - 1} \sum_{i = 1}^{N_{h}} (y_{h i} - {\bar{Y}}_{h}) (z_{h i} - {\bar{Z}}_{h}),

S_{x z h} = \frac{1}{N_{h} - 1} \sum_{i = 1}^{N_{h}} (x_{h i} - {\bar{X}}_{h}) (z_{h i} - {\bar{Z}}_{h}), V_{200} = \sum_{h = 1}^{L} γ_{h} W_{h}^{2} S_{y h}^{2},

V_{020} = \sum_{h = 1}^{L} γ_{h} W_{h}^{2} S_{x h}^{2}, V_{002} = \sum_{h = 1}^{L} γ_{h} W_{h}^{2} S_{z h}^{2},

V_{110} = \sum_{h = 1}^{L} γ_{h} W_{h}^{2} S_{y x h}, V_{101} = \sum_{h = 1}^{L} γ_{h} W_{h}^{2} S_{y z h},

V_{011} = \sum_{h = 1}^{L} γ_{h} W_{h}^{2} S_{x z h}, γ_{h} = (\frac{1 - f_{h}}{n_{h}}) .

In the following section we have presented review of some existing estimators with their properties.

7 Reviewing Some Existing Estimators in Stratified Random Sampling

The conventional stratified sample mean estimator for population mean $\bar{Y}$ of y is defined by

{\hat{\bar{Y}}}_{0 (s t)} = {\bar{y}}_{(s t)} = \sum_{h = 1}^{L} W_{h} {\bar{y}}_{h}

(54)

whose variance/MSE is given by

V a r ({\hat{\bar{Y}}}_{0 (s t)}) = M S E ({\hat{\bar{Y}}}_{0 (s t)}) = V_{200} = \sum_{h = 1}^{L} γ_{h} W_{h}^{2} S_{y h}^{2}

(55)

The combined ratio estimator for $\bar{Y}$ is given by

{\hat{\bar{Y}}}_{R (s t)} = {\bar{y}}_{(s t)} (\frac{\bar{X}}{{\bar{x}}_{(s t)}})

(56)

To the fda, the MSE of ${\hat{\bar{Y}}}_{R (s t)}$ is given by

M S E ({\hat{\bar{Y}}}_{R (s t)}) = (V_{200} + R_{1}^{2} V_{020} - 2 R_{1} V_{110})

(57)

The combined product estimator for $\bar{Y}$ is defined by

{\hat{\bar{Y}}}_{P (s t)} = {\bar{y}}_{(s t)} (\frac{{\bar{x}}_{(s t)}}{\bar{X}})

(58)

The MSE of ${\hat{\bar{Y}}}_{P (s t)}$ to the fda is given by

M S E ({\hat{\bar{Y}}}_{P (s t)}) = (V_{200} + R_{1}^{2} V_{020} + 2 R_{1} V_{110})

(59)

Following the approach adopted by Srivastava (1967), we define a class of estimators for population mean $\bar{Y}$ as

{\hat{\bar{Y}}}_{S (s t)} = {\bar{y}}_{(s t)} {(\frac{{\bar{x}}_{(s t)}}{\bar{X}})}^{α_{1}}

(60)

We mention that for $α_{1} = - 1, {\hat{\bar{Y}}}_{S (s t)}$ reduces to ${\hat{\bar{Y}}}_{R (s t)}$ while for $α_{1} = 1$ it reduces to the product estimator ${\hat{\bar{Y}}}_{P (s t)}$ . If we set $α_{1} = 0$ , then ${\hat{\bar{Y}}}_{S (s t)}$ reduces to usual unbiased estimator ${\bar{y}}_{(s t)}$ .

The MSE of ${\hat{\bar{Y}}}_{S (s t)}$ to the fda is given by

M S E ({\hat{\bar{Y}}}_{S (s t)}) = (V_{200} + α_{1}^{2} R_{1}^{2} V_{020} + 2 α_{1} R_{1} V_{110})

(61)

which is minimum when

α_{1} = - \frac{V_{110}}{R_{1} V_{020}} = α_{1 (o p t)}, say

(62)

Thus the corresponding minimum MSE of ${\hat{\bar{Y}}}_{S (s t)}$ is given by

{MSE}_{\min} ({\hat{\bar{Y}}}_{S (s t))}) = (V_{200} - \frac{V_{110}^{2}}{V_{020}})

(63)

which is same as the minimum MSE of the difference estimator

{\hat{\bar{Y}}}_{D 1 (s t)} = {\bar{y}}_{(s t)} + d (\bar{X} - {\bar{x}}_{(s t)})

(64)

i.e.

{MSE}_{\min} ({\hat{\bar{Y}}}_{S (s t)}) = {MSE}_{\min} ({\hat{\bar{Y}}}_{D 1 (s t)}) = (V_{200} - \frac{V_{110}^{2}}{V_{020}})

(65)

The stratified version of Upadhyaya et al. (1985) estimator is given by

{\hat{\bar{Y}}}_{U S V (s t)} = w_{0} {\bar{y}}_{(s t)} + w_{1} {\bar{y}}_{(s t)} {(\frac{{\bar{x}}_{(s t)}}{\bar{X}})}^{α_{1}}

(66)

The MSE of ${\hat{\bar{Y}}}_{U S V (s t)}$ to the fda is given by

$MSE ({\hat{\bar{Y}}}_{U S V (s t)})$	$= [{\bar{Y}}^{2} + w_{0}^{2} A_{0 (s t)} + w_{1}^{2} A_{1 (s t)}$
	$+ 2 w_{0} w_{1} A_{3 (s t)} - 2 w_{0} {\bar{Y}}^{2} - 2 w_{1} A_{6 (s t)}]$	(67)

where

$A_{0 (s t)}$	$= ({\bar{Y}}^{2} + V_{200})$
$A_{1 (s t)}$	$= [{\bar{Y}}^{2} + V_{200} + 4 α_{1} R_{1} V_{110} + α_{1} (2 α_{1} - 1) R_{1}^{2} V_{020}]$
$A_{3 (s t)}$	$= [{\bar{Y}}^{2} + V_{200} + 2 α_{1} R_{1} V_{110} + α_{1} \frac{(α_{1} - 1)}{2} R_{1}^{2} V_{020}]$
$A_{6 (s t)}$	$= [{\bar{Y}}^{2} + α_{1} R_{1} V_{110} + α_{1} \frac{(α_{1} - 1)}{2} R_{1}^{2} V_{020}]$

The $M S E ({\hat{\bar{Y}}}_{U S V (s t)})$ is minimized for

w_{0} = \frac{Δ_{0 (s t)}^{*}}{Δ_{(s t)}^{*}}, w_{1} = \frac{Δ_{1 (s t)}^{*}}{Δ_{(s t)}^{*}}

(68)

Thus the corresponding minimum MSE of ${\hat{\bar{Y}}}_{U S V (s t)}$ is given by

${MSE}_{\min} ({\hat{\bar{Y}}}_{U S V (s t)})$
$= [{\bar{Y}}^{2} - \frac{{A_{1 (s t)} {\bar{Y}}^{4} - 2 A_{3 (s t)} A_{6 (s t)} {\bar{Y}}^{2} + A_{0 (s t)} A_{6 (s t)}^{2}}}{(A_{0 (s t)} A_{1 (s t)} - A_{3 (s t)}^{2})}]$	(69)

where

$Δ_{(s t)}^{*}$	$= (A_{0 (s t)} A_{1 (s t)} - A_{3 (s t)}^{2})$
$Δ_{0 (s t)}^{*}$	$= (A_{1 (s t)} - A_{3 (s t)} A_{6 (s t)})$
$Δ_{1 (s t)}^{*}$	$= (A_{0 (s t)} A_{6 (s t)} - A_{3 (s t)})$

For $w_{0} + w_{1} = 1 \Rightarrow w_{1} = (1 - w_{0})$ in (66), the class of estimators ${\hat{\bar{Y}}}_{U S V (s t)}$ reduces to the estimator

{\hat{\bar{Y}}}_{U S V (s t)}^{*} = w_{0} {\bar{y}}_{s t} + (1 - w_{0}) {\bar{y}}_{s t} {(\frac{{\bar{x}}_{s t}}{\bar{X}})}^{α_{1}}

(70)

To the fda, the MSE of ${\hat{\bar{Y}}}_{U S V (s t)}^{*}$ is given by

MSE ({\hat{\bar{Y}}}_{U S V (s t)}^{*}) = [\begin{matrix} V_{100} + A_{1 (s t)} - 2 A_{6 (s t)} \\ + w_{0}^{2} (A_{0 (s t)} + A_{1 (s t)} - 2 A_{3 (s t)}) \\ - 2 w_{0} ({\bar{Y}}^{2} + A_{1 (s t)} - A_{3 (s t)} - A_{6 (s t)}) \end{matrix}]

(71)

which is minimized for

w_{0 (o p t)}

= \frac{({\bar{Y}}^{2} + A_{1 (s t)} - A_{3 (s t)} - A_{6 (s t)})}{(A_{0 (s t)} + A_{1 (s t)} - 2 A_{3 (s t)})} = \frac{(V_{110} + α_{1} R_{1} V_{020})}{α_{1} R_{1} V_{020}}

(72)

Thus the corresponding minimum MSE of ${\hat{\bar{Y}}}_{U S V (s t)}^{*}$ is given by

${MSE}_{\min} ({\hat{\bar{Y}}}_{U S V (s t)}^{*})$
$= [{\bar{Y}}^{2} + A_{1 (s t)} - 2 A_{6 (s t)} - \frac{{({\bar{Y}}^{2} + A_{1 (s t)} - A_{3 (s t)} - A_{6 (s t)})}^{2}}{(A_{0 (s t)} + A_{1 (s t)} - 2 A_{3 (s t)})}]$
$= (V_{200} - \frac{V_{110}^{2}}{V_{020}}) = {MSE}_{\min} ({\hat{\bar{Y}}}_{D 1 (s t)}^{*})$	(73)

Stratified version of Rao (1991) estimator for population mean $\bar{Y}$ is given by

{\hat{\bar{Y}}}_{R a o (s t)} = α_{1} {\bar{y}}_{s t} + α_{2} (\bar{X} - {\bar{x}}_{s t})

(74)

where $(α_{1}, α_{2})$ are suitably chosen constants such that $MSE ({\hat{\bar{Y}}}_{R a o (s t)})$ is minimum.

The optimum values of $(α_{1}, α_{2})$ along with minimum MSE of ${\hat{\bar{Y}}}_{R a o (s t)}$ are respectively given by

\begin{matrix} α_{1 (o p t)} = {\frac{{\bar{Y}}^{2} V_{020}}{V_{020} ({\bar{Y}}^{2} + V_{200}) - V_{110}^{2}}} \\ α_{2 (o p t)} = - {\frac{{\bar{Y}}^{2} V_{110}}{V_{020} ({\bar{Y}}^{2} + V_{200}) - V_{110}^{2}}} \end{matrix}}

(75)

and

{MSE}_{\min} ({\hat{\bar{Y}}}_{R a o (s t)}) = \frac{{\bar{Y}}^{2} {V_{020} V_{200} - V_{110}^{2}}}{{V_{020} ({\bar{Y}}^{2} + V_{200}) - V_{110}^{2}}}

(76)

Gupta and Shabbir (2008) suggested the following estimator for $\bar{Y}$ as,

{\hat{\bar{Y}}}_{G S (s t)} = {α_{3} {\bar{y}}_{s t} + α_{4} (\bar{X} - {\bar{x}}_{s t})} (\frac{\bar{X}}{{\bar{x}}_{s t}})

(77)

The MSE of ${\hat{\bar{Y}}}_{G S (s t)}$ to the fda is given by

$MSE ({\hat{\bar{Y}}}_{G S (s t)})$	$= [{\bar{Y}}^{2} + α_{3}^{2} a_{1 (s t)} + α_{4}^{2} a_{2 (s t)} + 2 α_{3} α_{4} a_{3 (s t)}$
	$- 2 α_{3} a_{4 (s t)} - 2 α_{4} a_{5 (s t)}]$	(78)

where

$a_{1 (s t)}$	$= [{\bar{Y}}^{2} + V_{200} + 3 R_{1}^{2} V_{020} - 4 R_{1} V_{110}]$
$a_{2 (s t)}$	$= V_{020}$
$a_{3 (s t)}$	$= (2 R_{1} V_{020} - V_{110})$
$a_{4 (s t)}$	$= (R_{1}^{2} V_{020} - R_{1} V_{110} + {\bar{Y}}^{2})$
$a_{5 (s t)}$	$= R_{1} V_{020}$

The MSE of ${\hat{\bar{Y}}}_{G S (s t)}$ is minimum when

\begin{matrix} α_{3 (o p t)} = \frac{(a_{2 (s t)} a_{4 (s t)} - a_{3 (s t)} a_{5 (s t)})}{(a_{1 (s t)} a_{2 (s t)} - a_{3 (s t)}^{2})} \\ α_{4 (o p t)} = \frac{(a_{1 (s t)} a_{5 (s t)} - a_{3 (s t)} a_{4 (s t)})}{(a_{1 (s t)} a_{2 (s t)} - a_{3 (s t)}^{2})} \end{matrix}}

(79)

Thus the minimum MSE of ${\hat{\bar{Y}}}_{G S (s t)}$ is given by

${MSE}_{\min} ({\hat{\bar{Y}}}_{G S (s t)})$
$= [{\bar{Y}}^{2} - \frac{(a_{2 (s t)} a_{4 (s t)}^{2} - 2 a_{3 (s t)} a_{4 (s t)} a_{5 (s t)} + a_{4 (s t)} a_{5 (s t)}^{2})}{(a_{1 (s t)} a_{2 (s t)} - a_{3 (s t)}^{2})}]$	(80)

The usual difference estimator using two auxiliary variables in stratified random sampling is defined by

{\hat{\bar{Y}}}_{D 2 (s t)} = {\bar{y}}_{s t} + k_{1} (\bar{X} - {\bar{x}}_{s t}) + k_{2} (\bar{Z} - {\bar{z}}_{s t})

(81)

where $k_{1}$ and $k_{2}$ are constants whose values are to be obtained.

The MSE of ${\hat{\bar{Y}}}_{D 2 (s t)}$ is given by

M S E ({\hat{\bar{Y}}}_{D 2 (s t)}) = [\begin{matrix} V_{200} + k_{1}^{2} V_{020} + k_{2}^{2} V_{002} \\ + 2 k_{1} k_{2} V_{011} - 2 k_{1} V_{110} - 2 k_{2} V_{101} \end{matrix}]

(82)

which is minimized for

\begin{matrix} k_{1 (o p t)} = \frac{(V_{002} V_{110} - V_{011} V_{101})}{(V_{020} V_{002} - V_{011}^{2})} \\ k_{2 (o p t)} = \frac{(V_{020} V_{101} - V_{011} V_{110})}{(V_{020} V_{002} - V_{011}^{2})} \end{matrix}}

(83)

Thus the corresponding minimum MSE of ${\hat{\bar{Y}}}_{D 2 (s t)}$ is given by

{MSE}_{\min} ({\hat{\bar{Y}}}_{D 2 (s t)}) = [V_{200} - \frac{(V_{110}^{2} V_{002} - 2 V_{011} V_{101} V_{110} + V_{020} V_{101}^{2})}{(V_{020} V_{002} - V_{011}^{2})}]

(84)

8 Suggested Class of Estimators for Population Mean in Stratified Random Sampling

Motivated by Upadhyaya et al. (1985), we propose a generalized class of estimators based on two auxiliary variables $(x, z)$ for population mean $\bar{Y}$ in stratified random sampling as

t_{(s t)} = w_{0} {\bar{y}}_{s t} + w_{1} {\bar{y}}_{s t} {(\frac{{\bar{x}}_{s t}}{\bar{X}})}^{α_{1}} + w_{2} {\bar{y}}_{s t} {(\frac{{\bar{z}}_{s t}}{\bar{Z}})}^{α_{2}}

(85)

where $(w_{0}, w_{1}, w_{2})$ are appropriately elected weights whose sum need not be unity and $(α_{1}, α_{2})$ are design parameters. The constants $(α_{1}, α_{2})$ may take positive $(+, +)$ or negative $(-, -)$ or positive-negative $(+, -)$ or negative-positive $(-, +)$ values to form product-type or ratio-type or product-cum-ratio-type or ratio-cum-product-type estimator.

To obtain the bias and MSE of the proposed estimator $t_{(s t)}$ , we write, ${\bar{y}}_{s t} = \bar{Y} (1 + e_{0 (s t)}), {\bar{x}}_{s t} = \bar{X} (1 + e_{1 (s t)})$ and ${\bar{z}}_{s t} = \bar{Z} (1 + e_{2 (s t)})$ such that $E (e_{0 (s t)}) = E (e_{1 (s t)}) = E (e_{2 (s t)}) = 0$ ,

$E (e_{0 (s t)}^{2})$	$= \frac{1}{{\bar{Y}}^{2}} \sum_{h = 1}^{L} W_{h}^{2} γ_{h} S_{y h}^{2}, E (e_{1 (s t)}^{2}) = \frac{1}{{\bar{X}}^{2}} \sum_{h = 1}^{L} W_{h}^{2} γ_{h} S_{x h}^{2},$
$E (e_{2 (s t)}^{2})$	$= \frac{1}{{\bar{Z}}^{2}} \sum_{h = 1}^{L} W_{h}^{2} γ_{h} S_{z h}^{2},$
$E (e_{0 (s t)} e_{1 (s t)})$	$= \frac{1}{\bar{Y} \bar{X}} \sum_{h = 1}^{L} W_{h}^{2} γ_{h} S_{y x h},$
$E (e_{0 (s t)} e_{2 (s t)})$	$= \frac{1}{\bar{Y} \bar{Z}} \sum_{h = 1}^{L} W_{h}^{2} γ_{h} S_{y z h} and$
$E (e_{1 (s t)} e_{2 (s t)})$	$= \frac{1}{\bar{X} \bar{Z}} \sum_{h = 1}^{L} W_{h}^{2} γ_{h} S_{x z h} .$

Expressing (85) in terms of e’s we have

t_{(s t)} = \bar{Y} [\begin{matrix} w_{0} (1 + e_{0 (s t)}) + w_{1} (1 + e_{0 (s t)}) {(1 + e_{1 (s t)})}^{α_{1}} \\ + w_{2} (1 + e_{0 (s t)}) {(1 + e_{1 (s t)})}^{α_{2}} \end{matrix}]

(86)

We assume that $| e_{i (s t)} | ≪ 1, i = 1, 2$ so that ${(1 + e_{i (s t)})}^{α_{i}}, i = 1, 2$ are expandable. Expanding the right hand side of (85), multiplying out and ignoring terms of e’s having power greater than two, we have

t_{(s t)} ≅ \bar{Y} [\begin{matrix} w_{0} (1 + e_{0 (s t)}) \\ + w_{1} {1 + e_{0 (s t)} + α_{1} e_{1 (s t)} + α_{1} e_{0 (s t)} e_{1 (s t)} \\ + \frac{α_{1} (α_{1} - 1)}{2} e_{1 (s t)}^{2}} \\ + w_{2} {1 + e_{0 (s t)} + α_{2} e_{2 (s t)} + α_{2} e_{0 (s t)} e_{2 (s t)} \\ + \frac{α_{2} (α_{2} - 1)}{2} e_{2 (s t)}^{2}} \end{matrix}]

(t_{(s t)} - \bar{Y}) ≅ \bar{Y} [\begin{matrix} w_{0} (1 + e_{0 (s t)}) \\ + w_{1} {1 + e_{0 (s t)} + α_{1} e_{1 (s t)} + α_{1} e_{0 (s t)} e_{1 (s t)} \\ + \frac{α_{1} (α_{1} - 1)}{2} e_{1 (s t)}^{2}} \\ + w_{2} {1 + e_{0 (s t)} + α_{2} e_{2 (s t)} + α_{2} e_{0 (s t)} e_{2 (s t)} \\ + \frac{α_{2} (α_{2} - 1)}{2} e_{2 (s t)}^{2}} - 1 \end{matrix}]

Taking expectation of both sides of (8), we get the bias of $t_{(s t)}$ to the fda as

$B (t_{(s t)})$	$= [\bar{Y} (w_{0} - 1) + w_{1} {\bar{Y} + \frac{α_{1} (α_{1} - 1)}{2} R_{1} \frac{V_{020}}{\bar{X}} + α_{1} \frac{V_{110}}{\bar{X}}}$
	$+ w_{2} {\bar{Y} + \frac{α_{2} (α_{2} - 1)}{2} R_{2} \frac{V_{002}}{\bar{Z}} + α_{2} \frac{V_{101}}{\bar{Z}}}]$	(88)

Squaring both sides of (8), ignoring terms of e’s having power greater than two and then taking expectation of both sides we get the MSE of $t_{(s t)}$ to the fda as

MSE (t_{(s t)}) = [\begin{matrix} {\bar{Y}}^{2} + w_{0}^{2} A_{0 (s t)} + w_{1}^{2} A_{1 (s t)} + w_{2}^{2} A_{2 (s t)} + 2 w_{0} w_{1} A_{3 (s t)} \\ + 2 w_{0} w_{2} A_{4 (s t)} + 2 w_{1} w_{2} A_{5 (s t)} - 2 w_{0} {\bar{Y}}^{2} \\ - 2 w_{1} A_{6 (s t)} - 2 w_{2} A_{7 (s t)} \end{matrix}]

(89)

where

$A_{2 (s t)}$	$= [{\bar{Y}}^{2} + V_{200} + 4 α_{2} R_{2} V_{101} + α_{2} (2 α_{2} - 1) R_{2}^{2} V_{002}]$
$A_{4 (s t)}$	$= [{\bar{Y}}^{2} + V_{200} + 2 α_{2} R_{2} V_{101} + α_{2} \frac{(α_{2} - 1)}{2} R_{2}^{2} V_{002}]$
$A_{5 (s t)}$	$= [{\bar{Y}}^{2} + V_{200} + 2 α_{1} R_{1} V_{110} + 2 α_{2} R_{2} V_{101} + α_{1} α_{2} R_{1} R_{2} V_{011}$
	$+ α_{1} \frac{(α_{1} - 1)}{2} R_{1}^{2} V_{020} + α_{2} \frac{(α_{2} - 1)}{2} R_{2}^{2} V_{002}]$
$A_{7 (s t)}$	$= [{\bar{Y}}^{2} + α_{2} R_{2} V_{101} + α_{2} \frac{(α_{2} - 1)}{2} R_{2}^{2} V_{002}]$

$(A_{0 (s t)}, A_{1 (s t)}, A_{3 (s t)}$ and $A_{6 (s t)})$ are same as defined earlier.

Minimization of $M S E (t_{(s t)})$ at (89) with respect to $(w_{0}, w_{1}, w_{2})$ yields

[\begin{matrix} A_{0 (s t)} & A_{3 (s t)} & A_{4 (s t)} \\ A_{3 (s t)} & A_{1 (s t)} & A_{5 (s t)} \\ A_{4 (s t)} & A_{5 (s t)} & A_{2 (s t)} \end{matrix}] [\begin{matrix} w_{0} \\ w_{1} \\ w_{2} \end{matrix}] = [\begin{matrix} {\bar{Y}}^{2} \\ A_{6 (s t)} \\ A_{7 (s t)} \end{matrix}]

(90)

Solving (90), we get the optimum values of $(w_{0}, w_{1}, w_{2})$ respectively as

w_{00} = \frac{Δ_{0 (s t)}}{Δ_{(s t)}}, w_{10} = \frac{Δ_{1 (s t)}}{Δ_{(s t)}}, w_{20} = \frac{Δ_{2 (s t)}}{Δ_{(s t)}} .

(91)

where

$Δ_{(s t)}$	$= \| \begin{matrix} A_{0 (s t)} & A_{3 (s t)} & A_{4 (s t)} \\ A_{3 (s t)} & A_{1 (s t)} & A_{5 (s t)} \\ A_{4 (s t)} & A_{5 (s t)} & A_{2 (s t)} \end{matrix} \|$
	$= A_{0 (s t)} (A_{1 (s t)} A_{2 (s t)} - A_{5 (s t)}^{2})$
	$- A_{3 (s t)} (A_{2 (s t)} A_{3 (s t)} - A_{4 (s t)} A_{5 (s t)})$
	$+ A_{4 (s t)} (A_{3 (s t)} A_{5 (s t)} - A_{1 (s t)} A_{4 (s t)})$
$Δ_{0 (s t)}$	$= \| \begin{matrix} {\bar{Y}}^{2} & A_{3 (s t)} & A_{4 (s t)} \\ A_{6 (s t)} & A_{1 (s t)} & A_{5 (s t)} \\ A_{7 (s t)} & A_{5 (s t)} & A_{2 (s t)} \end{matrix} \|$
	$= {\bar{Y}}^{2} (A_{1 (s t)} A_{2 (s t)} - A_{5 (s t)}^{2})$
	$- A_{3 (s t)} (A_{2 (s t)} A_{6 (s t)} - A_{5 (s t)} A_{7 (s t)})$
	$+ A_{4 (s t)} (A_{5 (s t)} A_{6 (s t)} - A_{1 (s t)} A_{7 (s t)})$
$Δ_{1 (s t)}$	$= \| \begin{matrix} A_{0 (s t)} & {\bar{Y}}^{2} & A_{4 (s t)} \\ A_{3 (s t)} & A_{6 (s t)} & A_{5 (s t)} \\ A_{4 (s t)} & A_{7 (s t)} & A_{2 (s t)} \end{matrix} \|$
	$= A_{0 (s t)} (A_{2 (s t)} A_{6 (s t)} - A_{5 (s t)} A_{7 (s t)})$
	$- {\bar{Y}}^{2} (A_{2 (s t)} A_{3 (s t)} - A_{4 (s t)} A_{5 (s t)})$
	$+ A_{4 (s t)} (A_{3 (s t)} A_{7 (s t)} - A_{4 (s t)} A_{6 (s t)})$
$Δ_{2 (s t)}$	$= \| \begin{matrix} A_{0 (s t)} & A_{3 (s t)} & {\bar{Y}}^{2} \\ A_{3 (s t)} & A_{1 (s t)} & A_{6 (s t)} \\ A_{4 (s t)} & A_{5 (s t)} & A_{7 (s t)} \end{matrix} \|$
	$= A_{0 (s t)} (A_{1 (s t)} A_{7 (s t)} - A_{5 (s t)} A_{6 (s t)})$
	$- A_{3 (s t)} (A_{3 (s t)} A_{7 (s t)} - A_{4 (s t)} A_{6 (s t)})$
	$+ {\bar{Y}}^{2} (A_{3 (s t)} A_{5 (s t)} - A_{1 (s t)} A_{4 (s t)})$

Thus the corresponding minimum MSE of $t_{(s t)}$ is given by

{MSE}_{\min} (t_{(s t)}) = [{\bar{Y}}^{2} - \frac{Δ_{0 (s t)} {\bar{Y}}^{2}}{Δ_{(s t)}} - \frac{A_{6 (s t)} Δ_{1 (s t)}}{Δ_{(s t)}} - \frac{A_{7 (s t)} Δ_{2 (s t)}}{Δ_{(s t)}}]

Thus we state the following theorem.

Theorem-8.1 – The MSE of $t_{s t}$ is always greater than equal to the minimum MSE of $t_{(s t)}$ i.e.

$M S E (t_{(s t)})$	$\geq {MSE}_{\min} (t_{(s t)}$
	$= [{\bar{Y}}^{2} - \frac{Δ_{0 (s t)} {\bar{Y}}^{2}}{Δ_{(s t)}} - \frac{A_{6 (s t)} Δ_{1 (s t)}}{Δ_{(s t)}} - \frac{A_{7 (s t)} Δ_{2 (s t)}}{Δ_{(s t)}}]$

with equality holding if

w_{0 i} = \frac{Δ_{i}}{Δ}, i = 0, 1, 2 .

*A remark similar to Remark 3.1 follows here.

9 Comparison of the Proposed Class of Estimator with Some Existing Estimators in Stratified Random Sampling

From (55), (57), (59) and (65), we have

$MSE ({\bar{y}}_{(s t)}) - [{MSE}_{\min} ({\hat{\bar{Y}}}_{D 1 (s t)}) = {MSE}_{\min} ({\hat{\bar{Y}}}_{S (s t)})]$
$= \frac{V_{110}^{2}}{V_{020}} \geq 0$	(93)
$MSE ({\bar{y}}_{R (s t)}) - [{MSE}_{\min} ({\hat{\bar{Y}}}_{D 1 (s t)}) = {MSE}_{\min} ({\hat{\bar{Y}}}_{S (s t)})]$
$= \frac{{(R_{1} V_{020} - V_{110})}^{2}}{V_{020}} \geq 0$	(94)
$M S E ({\bar{y}}_{P (s t)}) - [{MSE}_{\min} ({\hat{\bar{Y}}}_{D 1 (s t)}) = {MSE}_{\min} ({\hat{\bar{Y}}}_{S (s t)})]$
$= \frac{{(R_{1} V_{020} + V_{110})}^{2}}{V_{020}} \geq 0$	(95)

Expressions (9), (9) and (9) clearly indicates that the usual difference estimator ${\hat{\bar{Y}}}_{D 1 (s t)}$ and Srivastava (1967) estimator ${\hat{\bar{Y}}}_{S (s t)}$ are better than the estimators ${\bar{y}}_{s t}, {\hat{\bar{Y}}}_{R (s t)}$ and ${\hat{\bar{Y}}}_{P (s t)}$ .

From (65) and (76), we have

$[{MSE}_{\min} ({\hat{\bar{Y}}}_{D 1 (s t)}) = {MSE}_{\min} ({\hat{\bar{Y}}}_{S (s t)})] - {MSE}_{\min} ({\hat{\bar{Y}}}_{R a o (s t)})$
$= \frac{{(V_{200} V_{020} - V_{110}^{2})}^{2}}{({\bar{Y}}^{2} V_{020} + V_{020} V_{200} - V_{110}^{2})} \geq 0$	(96)

From (9)–(9), we have the following inequalities:

${MSE}_{\min} ({\hat{\bar{Y}}}_{R a o (s t)})$	$\leq [{MSE}_{\min} ({\hat{\bar{Y}}}_{D 1 (s t)}) = {MSE}_{\min} ({\hat{\bar{Y}}}_{S (s t)})]$
	$\leq MSE ({\bar{y}}_{s t})$	(97)
${MSE}_{\min} ({\hat{\bar{Y}}}_{R a o (s t)})$	$\leq [{MSE}_{\min} ({\hat{\bar{Y}}}_{D 1 (s t)}) = {MSE}_{\min} ({\hat{\bar{Y}}}_{S (s t)})]$
	$\leq MSE ({\hat{\bar{Y}}}_{R (s t)})$	(98)
${MSE}_{\min} ({\hat{\bar{Y}}}_{R a o (s t)})$	$\leq [{MSE}_{\min} ({\hat{\bar{Y}}}_{D 1 (s t)}) = {MSE}_{\min} ({\hat{\bar{Y}}}_{S (s t)})]$
	$\leq MSE ({\hat{\bar{Y}}}_{P (s t)})$	(99)

It follows from (9), (9) and (9) that the Rao (1991) estimator ${\hat{\bar{Y}}}_{R a o (s t)}$ is more precise than ${\bar{y}}_{s t}, {\hat{\bar{Y}}}_{R (s t)}$ , ${\hat{\bar{Y}}}_{P (s t)}$ , ${\hat{\bar{Y}}}_{D 1 (s t)}$ and Srivastava’s (1967) estimator ${\hat{\bar{Y}}}_{S (s t)}$ .

From (65) and (7), we have

$[{MSE}_{\min} ({\hat{\bar{Y}}}_{D 1 (s t)}) = {MSE}_{\min} ({\hat{\bar{Y}}}_{S (s t)}) = {MSE}_{\min} ({\hat{\bar{Y}}}_{U S V (s t)}^{*})]$
$- {MSE}_{\min} ({\hat{\bar{Y}}}_{U S V (s t)})$
$= \frac{\begin{matrix} {\bar{Y}}^{2} {[\begin{matrix} A_{1 (s t)} ({\bar{Y}}^{2} - A_{0 (s t)}) + A_{3 (s t)} (A_{3 (s t)} - {\bar{Y}}^{2}) \\ + A_{6 (s t)} (A_{0 (s t)} - A_{3 (s t)}) \end{matrix}]}^{2} \end{matrix}}{(A_{0 (s t)} A_{1 (s t)} - A_{3 (s t)}^{2}) (A_{0 (s t)} + A_{1 (s t)} - 2 A_{3 (s t)})} \geq 0$	(100)

It follows that the Upadhyaya et al. (1985) estimator ${\hat{\bar{Y}}}_{U S V (s t)}$ is more efficient than ${\hat{\bar{Y}}}_{D 1 (s t)}, {\hat{\bar{Y}}}_{S (s t)}$ and ${\hat{\bar{Y}}}_{U S V (s t)}^{*}$ .

From (7) and (8), we have

${MSE}_{\min} ({\hat{\bar{Y}}}_{U S V (s t)}) - {MSE}_{\min} (t_{(s t)})$
$= \frac{{\bar{Y}}^{2}}{Δ_{(s t)} Δ_{1 (s t)}} {[\begin{matrix} A_{7 (s t)} (A_{0 (s t)} A_{1 (s t)} - A_{3 (s t)}^{2}) \\ + (A_{3 (s t)} A_{5 (s t)} - A_{1 (s t)} A_{4 (s t)}) \\ + A_{6 (s t)} (A_{3 (s t)} A_{4 (s t)} - A_{0 (s t)} A_{5 (s t)}) \end{matrix}]}^{2}$
$\geq 0$	(101)

which shows that the proposed generalized class of estimators $t_{(s t)}$ is more efficient than Upadhyaya et al. (1985) estimator ${\hat{\bar{Y}}}_{U S V (s t)}$ . Hence the estimator $t_{(s t)}$ is more precise than the estimators ${\bar{y}}_{s t}, {\hat{\bar{Y}}}_{R (s t)}$ , ${\hat{\bar{Y}}}_{P (s t)}$ , ${\hat{\bar{Y}}}_{D 1 (s t)}, {\hat{\bar{Y}}}_{S (s t)}$ and ${\hat{\bar{Y}}}_{U S V (s t)}^{*}$ .

From (73) and (84) we have

$[{MSE}_{\min} ({\hat{\bar{Y}}}_{D 1 (s t)}) = {MSE}_{\min} ({\hat{\bar{Y}}}_{S (s t)}) = {MSE}_{\min} ({\hat{\bar{Y}}}_{U S V (s t)}^{*})]$
$- {MSE}_{\min} ({\hat{\bar{Y}}}_{D 2 (s t)}) = \frac{(V_{020} V_{101} - V_{110} V_{011})}{V_{020} (V_{020} V_{002} - V_{011}^{2})} \geq 0$	(102)

which shows that the difference estimator ${\hat{\bar{Y}}}_{D 2 (s t)}$ is more efficient than the estimator ${\hat{\bar{Y}}}_{D 1 (s t)}$ .

From (76), (7), (84) and (8),we have

• ${MSE}_{\min} (t_{(s t)}) < {MSE}_{\min} ({\hat{\bar{Y}}}_{R a o (s t)})$ if

$\frac{{\bar{Y}}^{4} V_{020}}{[V_{020} ({\bar{Y}}^{2} + V_{200}) - V_{110}^{2}]}$
$< [\frac{(Δ_{0 (s t)} {\bar{Y}}^{2} + A_{6 (s t)} Δ_{1 (s t)} + A_{7 (s t)} Δ_{2 (s t)})}{Δ_{(s t)}}]$	(103)

• ${MSE}_{\min} (t_{(s t)}) < {MSE}_{\min} ({\hat{\bar{Y}}}_{G S (s t)})$ if

$[\frac{(a_{2 (s t)} a_{4 (s t)}^{2} - 2 a_{3 (s t)} a_{4 (s t)} a_{5 (s t)} + a_{4 (s t)} a_{5 (s t)}^{2})}{(a_{1 (s t)} a_{2 (s t)} - a_{3 (s t)}^{2})}]$
$< [\frac{(Δ_{0 (s t)} {\bar{Y}}^{2} + A_{6 (s t)} Δ_{1 (s t)} + A_{7 (s t)} Δ_{2 (s t)})}{Δ_{(s t)}}]$	(104)

• ${MSE}_{\min} (t_{(s t)}) < {MSE}_{\min} ({\hat{\bar{Y}}}_{D 2 (s t)})$ if

$[{\bar{Y}}^{2} + \frac{(V_{110}^{2} V_{002} - 2 V_{011} V_{101} V_{110} + V_{020} V_{101}^{2})}{(V_{020} V_{002} - V_{011}^{2})}]$
$< [V_{200} + \frac{(Δ_{0 (s t)} {\bar{Y}}^{2} + A_{6 (s t)} Δ_{1 (s t)} + A_{7 (s t)} Δ_{2 (s t)})}{Δ_{(s t)}}]$	(105)

It is observed from (9), (9) and (9) that the proposed generalized class of estimators $t_{(s t)}$ is more efficient than the estimators ${\hat{\bar{Y}}}_{R a o (s t)}$ , ${\hat{\bar{Y}}}_{G S (s t)}$ and ${\hat{\bar{Y}}}_{D 2 (s t)}$ as long as the conditions (9), (9) and (9) are satisfied respectively.

10 Numerical Illustration

To examine the performance of the proposed generalized class of estimators $t_{(s t)}$ over existing estimators, we use the data sets given below

Data I: Source: [Murthy (1967), P. 228] N = 80, n = 22

$N_{1} = 19$	$N_{2} = 32$	$N_{3} = 29$	$n_{1} = 5$	$n_{2} = 9$	$n_{3} = 8$
${\bar{Y}}_{1} = 2967.95$	${\bar{Y}}_{2} = 4657.63$	${\bar{Y}}_{3} = 7212.97$	${\bar{X}}_{1} = 65.16$	${\bar{X}}_{2} = 139.97$	${\bar{X}}_{3} = 589.41$
${\bar{Z}}_{1} = 349.68$	${\bar{Z}}_{2} = 706.59$	${\bar{Z}}_{3} = 2098.69$	$C_{y 1} = 0.25509$	$C_{y 2} = 0.14366$	$C_{y 3} = 0.11848$
$C_{x 1} = 0.17158$	$C_{x 2} = 0.31693$	$C_{x 3} = 0.38415$	$C_{z 1} = 0.3130$	$C_{z 2} = 0.15457$	$C_{z 3} = 0.30386$
$ρ_{y x 1} = 0.81$	$ρ_{y x 2} = 0.89$	$ρ_{y x 3} = 0.98$	$ρ_{y z 1} = 0.94$	$ρ_{y z 2} = 0.93$	$ρ_{y z 3} = 0.98$
$ρ_{x z 1} = 0.90$	$ρ_{x z 2} = 0.85$	$ρ_{x z 3} = 0.97$

Data II: [Source: Koyuncu and Kadilar (2009)] N = 923, $n =$ 180

$N_{1} = 127$	$N_{2} = 117$	$N_{3} = 103$	$N_{4} = 170$	$N_{5} = 205$	$N_{6} = 201$
$n_{1} = 31$	$n_{2} = 21$	$n_{3} = 29$	$n_{4} = 38$	$n_{5} = 22$	$n_{6} = 39$
${\bar{Y}}_{1} = 703.74$	${\bar{Y}}_{2} = 413.0$	${\bar{Y}}_{3} = 513.17$	${\bar{Y}}_{4} = 424.66$	${\bar{Y}}_{5} = 267.03$	${\bar{Y}}_{6} = 393.84$
${\bar{X}}_{1} = 20804.59$	${\bar{X}}_{2} = 9211.79$	${\bar{X}}_{3} = 14309.30$	${\bar{X}}_{4} = 9478.85$	${\bar{X}}_{5} = 5569.95$	${\bar{X}}_{6} = 12997.59$
${\bar{Z}}_{1} = 498.28$	${\bar{Z}}_{2} = 318.33$	${\bar{Z}}_{3} = 413.36$	${\bar{Z}}_{4} = 311.32$	${\bar{Z}}_{5} = 227.20$	${\bar{Z}}_{6} = 313.71$
$S_{y 1} = 883.84$	$S_{y 2} = 644.92$	$S_{y 3} = 1033.46$	$S_{y 4} = 810.58$	$S_{y 5} = 403.65$	$S_{y 6} = 711.72$
$S_{x 1} = 30486.7$	$S_{x 2} = 15180.77$	$S_{x 3} = 27549.78$	$S_{x 4} = 18218.93$	$S_{x 5} = 8497.77$	$S_{x 6} = 2394.14$
$S_{z 1} = 555.58$	$S_{z 2} = 365.46$	$S_{z 3} = 612.95$	$S_{z 4} = 458.03$	$S_{z 5} = 260.85$	$S_{z 6} = 397.05$
$ρ_{y x 1} = 0.936$	$ρ_{y x 2} = 0.996$	$ρ_{y x 3} = 0.994$	$ρ_{y x 4} = 0.983$	$ρ_{y x 5} = 0.989$	$ρ_{y x 6} = 0.965$
$ρ_{y z 1} = 0.979$	$ρ_{y z 2} = 0.976$	$ρ_{y z 3} = 0.984$	$ρ_{y z 4} = 0.983$	$ρ_{y z 5} = 0.964$	$ρ_{y z 6} = 0.983$
$ρ_{x z 1} = 0.9396$	$ρ_{x z 2} = 0.9696$	$ρ_{x z 3} = 0.977$	$ρ_{x z 4} = 0.964$	$ρ_{x z 5} = 0.9676$	$ρ_{x z 6} = 0.996$

Table 4 presents the PRE’s of ${\hat{\bar{Y}}}_{R (s t)}, {\hat{\bar{Y}}}_{P (s t)}, {\hat{\bar{Y}}}_{D 1 (s t)}, {\hat{\bar{Y}}}_{R a o (s t)}, {\hat{\bar{Y}}}_{G S (s t)}$ and ${\hat{\bar{Y}}}_{D 2 (s t)}$ estimators with respect to ${\bar{y}}_{(s t)}$ for two data sets respectively.

Table 5 shows the PRE of ${\hat{\bar{Y}}}_{U S V (s t)}$ with respect to ${\bar{y}}_{(s t)}$ for $α_{1} = - 1, 1$ , for two data sets.

Table 4 PRE’s of different estimators of population mean $\bar{Y}$ with respect to ${\bar{y}}_{s t}$

Estimator	Data I	Data II
${\hat{\bar{Y}}}_{R (s t)}$	14.42	1025.10
${\hat{\bar{Y}}}_{P (s t)}$	5.89	24.22
${\hat{\bar{Y}}}_{D 1 (s t)}$	235.83	1141.85
${\hat{\bar{Y}}}_{R a o (s t)}$	235.91	1143.02
${\hat{\bar{Y}}}_{G S (s t)}$	183.44	1109.11
${\hat{\bar{Y}}}_{D 2 (s t)}$	273.99	2621.61

Table 5 PRE’s of the estimator ${\hat{\bar{Y}}}_{U S V (s t)}$ with respect to ${\bar{y}}_{s t}$

Data I		Data II
$α_{1}$	PRE	$α_{1}$	PRE
$-$ 1	238.07	$-$ 1	1146.96
1	235.84	1	1260.08

Table 6 depicts the PRE of proposed estimator $t_{(s t)}$ wrt ${\bar{y}}_{(s t)}$ at different values of $α_{1}$ and $α_{2}$ , for two data sets.

Table 6 PRE’s of the proposed estimator $t_{(s t)}$ with respect to ${\bar{y}}_{s t}$ for different values of $(α_{1}, α_{2})$

Data I	Data II
$α_{1}$	$α_{2}$	PRE	$α_{1}$	$α_{2}$	PRE
$-$ 1	$-$ 1	275.29	$-$ 1	$-$ 1	2626.83
$-$ 2	$-$ 2	277.33	$-$ 2	$-$ 2	2720.91
$-$ 3	$-$ 3	280.47	$-$ 3	$-$ 3	3216.10
$-$ 4	$-$ 4	284.92	$-$ 4	$-$ 4	4826.13
$-$ 5	$-$ 5	291.07	$-$ 5	$-$ 5	20279.5
$-$ 8	$-$ 8	327.57	$-$ 5.2	$-$ 5.2	96957.81
$-$ 10	$-$ 10	391.89	1	1	3590.02
$-$ 12	$-$ 12	642.36	2	2	6380.74
$-$ 13	$-$ 13	1645.27	2.1	2.1	7060.90
$-$ 13.1	$-$ 13.1	2078.85	2.2	2.2	7938.96
$-$ 13.2	$-$ 13.2	2887.43	2.5	2.5	13252.63
$-$ 13.3	$-$ 13.3	4928.01	2.8	2.8	51458.14
$-$ 13.4	$-$ 13.4	20017.07	$-$ 1	1	3067.82
1	1	274.03	2	$-$ 1	2923.05
2	2	274.74	$-$ 1	2	1002.942
3	3	276.39	3	$-$ 1	2734.047
4	4	279.07	4	$-$ 1	2871.735
5	5	282.99	*	*	*
8	8	306.25	*	*	*
10	10	342.15	*	*	*
12	12	434.69	*	*	*
14	14	1047.46	*	*	*
14.1	14.1	1180.86	*	*	*
14.3	14.3	1634.81	*	*	*
14.5	14.5	2885.63	*	*	*
14.6	14.6	4977.54	*	*	*
14.7	14.7	22036.36	*	*	*
$-$ 1	1	274.16	*	*	*
$-$ 1	2	275.62	*	*	*
3	$-$ 1	277.21	*	*	*
4	$-$ 1	279.03	*	*	*
$-$ 5	5	259.76	*	*	*

It is observed from Tables 4, 5 and 6 that for various values of $(α_{1}, α_{2})$ the proposed generalized class of estimators $t_{(s t)}$ is more efficient than the estimators ${\bar{y}}_{(s t)} {\hat{\bar{Y}}}_{R (s t)}, {\hat{\bar{Y}}}_{P (s t)}$ , ${\hat{\bar{Y}}}_{D 1 (s t)}, {\hat{\bar{Y}}}_{R a o (s t)}$ , ${\hat{\bar{Y}}}_{G S (s t)}$ , ${\hat{\bar{Y}}}_{D 2 (s t)}$ and ${\hat{\bar{Y}}}_{U S V (s t)}$ , with considerable gain in efficiency. The proposed generalized class of estimators $t_{(s t)}$ yields the largest percent relative efficiency 22036.60% for data set I while it is 96957.81% for data set II. It is further observed from Table 6 that there is enough scope of selecting the scalars $(α_{1}, α_{2})$ in acquiring efficient estimators (from the suggested generalized class of estimators $t_{(s t)}$ ) than the existing estimators. Thus we conclude that the proposed generalized class of estimators $t_{(s t)}$ can be used in practice just by selecting the appropriate values of $(α_{1}, α_{2})$ .

11 Discussion and Conclusion

This article considers the problem of estimating the population mean $\bar{Y}$ of the study variable y using information on two auxiliary variables x and z. We have proposed a generalized class of estimators for the population mean $\bar{Y}$ using information on two supplementary variables x and z. Expressions of bias and mean square error up to the fda have been obtained in SRSWOR as well as in stratified random sampling. It is interesting to mention that the envisaged class of estimators includes several existing estimators. Thus the properties of the proposed generalized class of estimators unify results at one place. We have proved theoretically that the proposed generalized class of estimators is more efficient than the several existing estimators in both sampling designs SRSWOR and stratified random sampling.

Empirical studies are carried out to throw light on the merits of the envisaged generalized class of estimators over some existing competitors. Larger gain in efficiency is observed by using the proposed generalized class of estimators over some existing estimators in both the sampling designs: SRSWOR and stratified random sampling. Results incorporated in this study are very sound and quite illuminating. Thus it is recommended that the proposed study is useful in practice.

Acknowledgement

Authors are thankful to the learned referees for their valuable suggestions regarding improvement of the paper.

References

Abu-Dayyeh, W. A., Ahmed, M. S., Ahmed, R. A. and Mutlak, H. A. (2003). Some estimators of a finite population mean using auxiliary information, Applied Mathematics and Computation, 139, 287–298.

Ahmed, M. S. (1997). The general class of chain estimators for ratio of two means using double sampling, Communications in Statistics-Theory and Methods, 26(9), 2249–2254.

Cochran, W. G. (1977). Sampling Techniques, Jhon Wiley and Sons, New York.

Das, A.K. and Tripathi, T. P. (1978). Use of auxiliary information in estimating the finite population variance, Sankhya, C, 40, 139–148.

Diana, G. (1993). A class of estimators of the population mean in stratified random sampling, Statistica, 53(1), 59–66.

Gleser, L. J. and Healy, J. D. (1976). Estimating the mean of a normal distribution with known coefficient of variation, Journal of American Statistical Association, 71, 977–981.

Gupta, S. and Shabbir, J. (2008). On estimating in estimating the population mean in simple random sampling, Journal of Applied Statistics, 35(5), 559–566.

Kadilar, C. and Cingi, H. (2003). Ratio estimator in stratified sampling, Biometrical Journal, 45, 218–225.

Koyuncu, N. (2013). Families of estimators for population mean using information on auxiliary attribute in stratified random sampling, Gazi University Journal of Sciences, 26(2), 181–193.

Koyuncu, N. (2016). Improved exponential type estimators for finite population mean in stratified random sampling, Pakistan Journal of Statistics and Operation Research, 12(3), 429–441.

Koyuncu, N. and Kadilar C. (2009). Family of estimators of population mean using two auxiliary variables in stratified sampling, Communications in Statistics-Theory and Methods, 38, 2398–2417.

Koyuncu, N. and Kadilar, C. (2010). On the family of estimators of population mean in stratified random sampling, Communications in Statistics-Theory and Methods, 26(2), 427–443.

Malik, S. and Singh, R. (2017). A new estimator for population mean using two auxiliary variables in stratified random sampling, Journal of Information and Optimization Sciences, 38(8), 1243–1252.

Mishra, M., Singh, B. P. and Singh, R. (2017). Estimation of population mean using two auxiliary variables in the stratified random sampling, Journal of Reliability and Statistical Studies, 10(1), 59–68.

Munner, S., Shabbir, J. and Khalil, A. (2016). Estimation of finite population mean in simple random sampling and stratified random sampling using two auxiliary variables, Communications in Statistics-Theory and Methods, DOI:10.1080/03610926.2015.1035394.

Murthy, M. N. (1967). Sampling Theory and Methods, Statistical Publishing Society, India.

Olkin, I. (1958). Multivariate Ratio Estimation for Finite Population Biometrika, 45, 154–165.

Olufadi, Y. (2013). Dual to ratio-cum-product estimator in simple and stratified random sampling, Pakistan Journal of Statistics and Operation Research, 9(3), 305–319.

PCR (1998). Population Census Report of Multan District, Punjab Bureau of Statistics, Lahore, Pakistan.

Raj, D. (1965). On a method of using multiauxiliary information sample surveys, Journal of American Statistical Association, 60, 270–277.

Rao, P. S. R. S. and Mudholkar, G. D. (1967). Generalized multivariate estimator for the mean of finite populations, Journal of American Statistical Association, 62, 1009–1012.

Rao, T. J. (1991). On certain method of improving rations and regression estimators, Communications in Statistics, Theory and Methods, 20(10), 3325–3340.

Ray, S. K., Sahai, A. and Sahai, A. (1979). A note on ratio and product-type estimators, Annals of the Institute of Statistical Mathematics, 31, 141–144.

Reddy, V. N. (1978). A study on the use of prior knowledge on certain population parameters in estimation, Sankhya, C, 40, 29–34.

Searls, D. T. (1964). The utilization of a known coefficient of variation in the estimation procedure, Journal of American Statistical Association, 59, 1225–226.

Shabbir, J. (2018). Efficient utilisation of two auxiliary variables in stratified double sampling, Communications in Statistics-Theory and Methods, 47(1), 92–101.

Shabbir, J. and Gupta, S. (2005). Improved ratio estimators in stratified sampling, American Journal of Mathematics and Management Sciences, 25(3–4), 293–311.

Shabbir, J. and Gupta, S. (2015). Estimation of finite population mean using two auxiliary variables in stratified two-phase sampling, Communications in Statistics-Simulation and Computation, DOI:10.1080/03610918.2014.995817.

Shabbir, J. and Gupta, S. (2016). Estimation of population coefficient of variation in simple and stratified random sampling under two-phase sampling scheme when using two auxiliary variables, Communications in Statistics-Theory and Methods, DOI:10.1080/03610926.2016.1175627.

Sharma, P. and Singh, R. (2014). Improved ratio-type estimators using two auxiliary variables under second order approximation, Mathematical Journal of Interdisciplinary Sciences, 2(2), 193–204.

Sharma, P. and Singh, R. (2015). A class of exponential ratio estimators of finite population mean using two auxiliary variables, Pakistan Journal of Statistics and Operation Research, 11(2), 221–229.

Shukla, G. K. (1966). An alternative multivariate ratio estimator for finite population, Calcutta Statistical Association Bulletin, 15, 127–134.

Singh, D. and Chaudhary, F. S. (1986). Theory and Analysis of Sample Survey Designs, Jhon Wiley and Sons, New York.

Singh, H. P. (1986). A Generalized class of estimators of ratio, product and mean using supplementary information on an auxiliary character in PPSWR sampling scheme, Gujarat Statistical Review, 13(2), 1–30.

Singh, H. P. and Tailor, R. (2005). Estimation of finite population mean using known correlation coefficient between auxiliary characters, Statistica, anno LXV, 4, 407–418.

Singh, H. P. and Vishwakarma, G. K. (2008). A family of estimators of population mean using auxiliary formation in stratified sampling, Communications in Statistics-Theory and Methods, 37(7), 1038–1050.

Singh, H. P., Upadhyaya, L. N. and Tailor, R. (2009). Ratio-cum-product type exponential estimator, Statistica, anno LXIX, 4, 300–310.

Singh, M. P. (1967). Ratio-cum-product method of estimation, Metrika, 12(1), 34–62.

Singh, M. P. (1969). Comparison of some ratio-cum-product estimators, Sankhya, series B, 31, 375–378.

Singh, R. and Kumar, M. (2012). Improved estimators of population mean using two auxiliary variables in stratified random sampling, Pakistan Journal of Statistics and Operation Research, 8, 65–72.

Singh, R., Chouhan, P. and Sawan, N. (2008). On linear combination of ratio-product-type exponential estimator for estimating finite population mean, Statistics in Transition, 9(1), 105–115.

Singh, S. (2003). Advanced sampling theory with applications- How michel “selected” Amy, Kluwer Academic Publishers, I edition volume I and II.

Srivastava, S. K. (1965). An estimation of the mean of a finite population using several auxiliary variables, Journal of the Indian Statistical Association 3, 189–194.

Srivastava, S. K. (1966). A note on Olkin’s multivariate ratio estimator, Journal of Indian Statistical Association, 4, 202–208.

Srivastava, S. K. (1967). An estimator using auxiliary information in sample surveys, Culcutta Statistical Association, 16, 121–132.

Srivastava, S. K. (1971). A generalized estimator for the mean of a finite population using multiauxiliary information, Journal of American Statistical Association, 66, 404–407.

Srivenkataramana, T. and Tracy, D. S. (1984). Positive and negative valued auxiliary variates in surveys, Metron, XLII, (3–4), 3–14.

Steel, R. G. D. and Torrie, J, H. (1960). Principles and Procedures of Statistics, McGraw Hill Book Co.

Swain, A. K. P. C. (2013). On some modified ratio and product-type estimators. Revisited, Investigacion operational, 34(1), 35–57.

Tailor, R. and Chouhan, S. (2014). Ratio-cum-product-type exponential estimator of finite population mean in stratified random sampling, Communications in Statistics-Theory and Methods, 43(2), 343–354.

Tailor, R., Chouhan, S., Tailor, R. and Garg, N. (2012). A ratio-cum-product estimator of population mean in stratified random sampling using two auxiliary variables, Statistica, anno LXXII, n.3, 287–297.

Tripathi, T. P., Maiti, P. and Sharma, S. D. (1983). Use of prior information on some parameter in estimating population mean, Sankhaya, A, 45(3), 372–376.

Upadhyaya, L. N., Singh, H. P., and Vos, J. W. E. (1985). On the estimation of population means and ratios using supplementary information, Statistica Neerlandica, 39, 309–318.

Verma, H. K., Sharma, P. and Singh, R. (2015). Some families of estimators using two auxiliary variables in a stratified random sampling, Revista Investigacion Operacional, 36(2), 140–150.

Vos, J. W. E. (1980). Mixing of direct, ratio and product method estimators, Statistica Neerlandica, 34, 209–218.

Yadav, S. K., Mishra, S., Kadilar, C. and Shukla, A. K. (2015a). Improved dual to ratio cum dual to product estimator in the stratified random sampling, American Journal of Operational Research, 5(3), 57–63.

Yadav, S. K., Mishra, S., Mishra, S. S. and Shukla, A. K. (2015b). Searching efficient estimator of population mean in stratified random sampling, American Journal of Operational Research, 5(4), 75–81.

Biographies

Housila Prasad Singh, born on 08/09/1957 in a village of Varanasi district of Uttar Pradesh. He did his M.Sc. (Statistics) in 1979 from Banaras Hindu University, Varanasi, U.P.. He obtained his M.Phil. (Applied Mathematics) in 1981 and Ph.D. (Applied Statistics) in 1985 from Indian School of Mines, Dhanbad, Bihar (Now Indian Institute of Technology, Dhanbad, Jharkhand). Currently he is a professor of Statistics, Vikram University, Ujjain, M.P.. He has 37 years of teaching experience and 41 years of research experience. He has been Head, School of Studies in Statistics, Dean, Faculty of Science and Executive Council Member of Vikram University, Ujjain, M.P. He has guided many students for their M.Sc. (02), M.Phil (18) and Ph.D. (23) degrees. He has been visiting scientist at University of Windsor, Windsor, Canada. He has published more than 510 research papers in journals of national and international repute. One of his research papers submitted for the award of “Dr. Radha Krishnan Samman 1992” has been appreciated by valuers. He has been awarded ‘Best Scientist Research Publication’ award (2009–10) and Out Standing Research Faculty by research faculty awards by Careers 360 as “One of the 10 Knowledge Producers in India” for academic year 2017–18. He has written two book reviews out of them one is published in Computational Statistics and Data Analysis (2000) and the other one in the Journal of Royal Statistical Society, Sr. A (2006). He is also the author of the Book “Randomness and Optimal Estimation in Data Sampling”, American Research Press, Rehoboth, USA. His area of research interest is Sampling Theory and Statistical Inference. Google Scholar (Citations by this date) is Citations-5653, h-index-35 and i10-index-134.

Pragati Nigam, born on 27/11/1992 in Ujjain, M.P. She did her M.Sc. (Statistics) in 2015 (achieved II rank in the University) and Ph.D. (Statistics) in 2021 under the guidance of Prof. H. P. Singh, from School of Studies in Statistics, Vikram University, Ujjain. Presently she is working as Assistant Professor of Statistics at Faculty of Agricultural Sciences, Mandsaur University, Mandsaur. Dr. Pragati published 5 research papers and 1 book chapter (MKSES Publications) in national and international journals of repute.

Journal of Reliability and Statistical Studies, Vol. 15, Issue 1 (2022), 61–104.
doi: 10.13052/jrss0974-8024.1514
© 2022 River Publishers