Inverted Topp-Leone Distribution: Contribution to a Family of J-Shaped Frequency Functions in Presence of Random Censoring

Hiba Zeyada Muhammed^1,* and Essam Abd Elsalam Muhammed²

¹Department of Mathematical Statistics, Faculty of Graduate Studies for Statistical Research, Cairo University, Egypt

²Department of Management and Financial, High Institute of Computer and Information Technology, Elshorouk Academy, Egypt

E-mail: hiba_stat@cu.edu.eg; essamabdelsalam16@gmail.com

*Corresponding Author

Received 31 July 2021; Accepted 11 November 2021; Publication 06 December 2021

Abstract

In this paper, Bayesian and non-Bayesian estimation of the inverted Topp-Leone distribution shape parameter are studied when the sample is complete and random censored. The maximum likelihood estimator (MLE) and Bayes estimator of the unknown parameter are proposed. The Bayes estimates (BEs) have been computed based on the squared error loss (SEL) function and using Markov Chain Monte Carlo (MCMC) techniques. The asymptotic, bootstrap (p,t), and highest posterior density intervals are computed. The Metropolis Hasting algorithm is proposed for Bayes estimates. Monte Carlo simulation is performed to compare the performances of the proposed methods and one real data set has been analyzed for illustrative purposes.

Keywords: Inverted Topp leone distribution, moments, order statistic, maximum likelihood estimation, Bayesian estimation, MCMC, highest posterior density interval, asymptotic confidence interval, bootstrap confidence interval, random censoring.

1 Introduction

The Topp-Leone (TL) distribution was originally proposed by [1] as an alternative to beta distribution and it has applied for some family data. In recent years, the TL distribution has received huge attention in the literature; see for example; [2] showed that the TL distribution exhibits bathtub failure rate function with widespread applications in reliability. Moreover, [3] showed that TL distribution possesses some attractive reliability properties such as bath tub-shape hazard rate, decreasing reversed hazard rate, upside-down mean residual life, and increasing expected inactivity time. Recently, [4] derived admissible minimax estimates for the shape parameter of the TL distribution under squared and linear-exponential loss functions. Recently, [5] introduced the inverted Topp-Leone (IVT) distribution as a J-shaped distribution. Which is useful for modeling lifetime Phenomena and she studied more of its properties.

In this paper, we study classical and Bayesian estimation for the shape parameter of the IVT distribution when the sample is complete and random censored.

The random censoring can be expressed as follows: When the units under test lose or remove from the test before its failure this data is called random censoring. To more illustrate, in clinical trials or medical tests, some patients retreat or leave the test before finishing it.

The work in this paper is organized as follows: In Section 2 we introduce the IVT distribution. The maximum likelihood estimator (MLE) of the unknown parameter, the Bayes estimator, and the confidence interval based on complete data will be introduced in Section 3, in Section 4 we introduce the maximum likelihood estimators (MLEs) of the unknown parameters, the Bayes estimators and the confidence intervals based on random censoring data. Finally, the paper is concluded in Section 5.

2 Inverted Topp-Leone Distribution

The Topp Leone distribution is defined with the following pdf and cdf respectively,

F (t; β) = t^{β} {(2 - t)}^{β}

(1)

And

f (t; β) = β (2 - 2 t) {(2 t - t^{2})}^{β - 1}

(2)

For $0 < t < 1$ and $β > 0 .$

Assume $X = \frac{1}{T}$ the pdf and cdf of $X$ are given respectively, as

f (x; β) = 2 β (x - 1) x^{- 2 β - 1} {(2 x - 1)}^{β - 1},

(3)

And

F (x; β) = 1 - x^{- 2 β} {(2 x - 1)}^{β} .

(4)

For $1 < x < \infty$ and $β > 0 .$

In this case, the distribution of X is called inverted Topp-Leone (IVT) distribution denoted by $(β)$ . It can be showed that the pdf (3) satisfies the following generalized Pearson system of differential equation

\frac{\overset{´}{f (x)}}{f (x)} = \frac{a_{0} + a_{1} x + a_{2} x^{2}}{b_{0} + b_{1} x + b_{2} x^{2} + b_{3} x^{3}}

where $a_{0} = - 2 β - 2$ , $a_{1} = 6 β$ , $a_{2} = - 2 β - 2$ , $b_{0} = 0$ , $b_{1} = 1$ , $b_{2} = - 3$ and $b_{3} = 2$ .

The IVT distribution may be considered as a J- shaped because $f (x) > 0$ , $\frac{d f (x)}{d x} < 0$ and for some values $\frac{d^{2} f (x)}{d x^{2}} > 0$ . And it can be noted that from Figure 1. Also, Figure 2 shows the cdf of IVT distribution for different values for the parameter $β$ .

The mode of the $IVT (β)$ is given by $1 + \sqrt{\frac{3}{2 β + 2}} .$

The quantiles of the $IVT (β)$ distribution is given by

x_{q} = q^{\frac{- 1}{β}} (1 + \sqrt{1 - q^{\frac{1}{β}}}), 0 < q < 1 .

Figure 1 The pdf for IVT distribution.

Figure 2 The cdf for IVT distribution.

The median is a special case from the quantile function, when $q = \frac{1}{2},$

x_{0.5} = {(0.5)}^{\frac{- 1}{β}} (1 + \sqrt{1 - {(0.5)}^{\frac{1}{β}}})

And the inter-quartile range (IQR) is given as

IQR = {(\frac{3}{4})}^{\frac{- 1}{β}} (1 + \sqrt{1 - {(\frac{3}{4})}^{\frac{1}{β}}}) - {(\frac{1}{4})}^{\frac{- 1}{β}} (1 + \sqrt{1 - {(\frac{1}{4})}^{\frac{1}{β}}}) .

The $k^{t h}$ moment about origin $\overset{´}{μ_{k}}$ is given by the following theorem.

Theorem 1: the $k^{t h}$ -moment about zero $\overset{´}{μ_{k}}$ $X$ is given by

\overset{´}{μ_{k}} = β \sum_{j = 0}^{\infty} \frac{c (k, j) {(- 1)}^{j}}{j - k + β}

(5)

For $k = 1 \dots n$ and $β \neq 1$

Where

$c (k, 0)$	$= 2^{k}, c (k, 1) = k 2^{k - 1} and$
$c (k, j)$	$= \frac{k 2^{k - 2 j}}{j!} \prod_{i = 1}^{j - 1} (k - j - i), j \geq 2 .$

The survival function for the failure time X follows IVT distribution is defined as

R (x) = x^{- 2 β} {(2 x - 1)}^{β}

(6)

For fixed x means the probability of survival up to time x. Figure 3 shows the survival function for the IVT distribution with different parameter values.

Moreover, for the IVT distribution, the hazard function is easily obtained as

h (x) = 2 β \frac{x - 1}{x (2 x - 1)} .

(7)

and it has different shapes according to the values of the parameter $β$ as shown in Figure 4.

The reversed hazard function for the IVT distribution is given as follows

r (x) = \frac{2 β (x - 1) x^{- 2 β - 1} {(2 x - 1)}^{β - 1}}{1 - x^{- 2 β} {(2 x - 1)}^{β}} .

(8)

and it has different shapes as shown in Figure 5 according to to the variability in the parameter $β$ .

Figure 3 The reliability function for IVT distribution.

Figure 4 The hazard function for IVT distribution.

Figure 5 The reversed hazard function for IVT distribution.

2.1 Distributions and Moments of Order Statistics from IVT Distribution

Let $X_{1}, X_{2}, \dots, X_{n}$ be independent and identically distributed random variables drawn from IVT distribution. Let $X_{(r)}; r = 1, 2, \dots, n,$ be the $r^{t h}$ order statistic, then the pdf of $X_{(r)}$ is defined as

f_{n} (x) = C_{n, r} {[F (x)]}^{r - 1} {[1 - F (x)]}^{n - r} f (x)

Where $x = x_{(r)}$ , and $C_{n, r} = \frac{n!}{r! (n - r)!}$ .

$f_{n} (x)$	$= 2 β C_{n, r} (x - 1) x^{2 β (n - r - 1) - 1} {(2 x - 1)}^{β (n - r - 1) - 1}$
	$\cdot {[1 - x^{2 β} {(2 x - 1)}^{β}]}^{β (n - r - 1) - 1}$

Special cases for $X_{(1)}$ and $X_{(n)}$ are respectively considered as

f_{n} (x) = 2 n β (x - 1) x^{2 β (n - 2) - 1} {(2 x - 1)}^{n β - 1}, x = x_{(1)}

and

$f_{n} (x)$	$= 2 n β (x - 1) x^{- 2 β - 1} {(2 x - 1)}^{β - 1} {[1 - x^{2 β} {(2 x - 1)}^{β}]}^{n - 1},$
$x$	$= x_{(n)}$

The joint pdf of $x_{(r)}$ and $x_{(s)}$ , $1 \leq r < s \leq n$ , for a sample of size n

$f_{n} (x, y)$	$= C_{n, r, s} {[F (x)]}^{r - 1} {[1 - F (y)]}^{n - s}$
	$\times {[F (y) - F (x)]}^{s - r - 1} f (x) f (y)$

Where $x = x_{(r)}, y = x_{(s)}$ and $C_{n, r, s} = \frac{n!}{r! (s - r - 1)! (n - s)!}$ .

$f_{n} (x, y)$	$= 4 β^{2} C_{n, r, s} {[1 - x^{2 β} {(2 x - 1)}^{β}]}^{r - 1} {[y^{2 β} {(2 y - 1)}^{β}]}^{n - s}$
	$\cdot {[x^{2 β} {(2 x - 1)}^{β} - y^{2 β} {(2 y - 1)}^{β}]}^{s - r - 1}$
	$\cdot (x - 1) x^{- 2 β - 1} {(2 x - 1)}^{β - 1} (y - 1) y^{- 2 β - 1} {(2 y - 1)}^{β - 1},$
	$1 < x < y < \infty$

In the following theorems, the moments and product moments about origin will be introduced

Theorem 2: the $k^{t h}$ -moment about zero $μ_{r : n}^{k}$ of $r^{t h}$ order statistic $X_{(r)}$ is given by

μ_{r : n}^{k} = C_{n, r} \sum_{j = 0}^{\infty} c (k, j) {(- 1)}^{\frac{1}{β}} B (\frac{j - k}{β} + 1, n - r + 1)

(9)

For $k = 1 \dots n$ and $r = 1 \dots n$ ,

Where

$c (k, 0)$	$= 2^{k}, c (k, 1) = k 2^{k - 1} and$
$c (k, j)$	$= \frac{k 2^{k - 2 j}}{j!} \prod_{i = 1}^{j - 1} (k - j - i), j \geq 2,$

and $B (\cdot, \cdot)$ is the beta function.

The $k^{t h}$ moment of $X_{(1)}$ is given as

μ_{1 : n}^{k} = n! \sum_{j = 0}^{\infty} c (k, j) {(- 1)}^{\frac{1}{β}} \frac{Γ (\frac{j - k}{β} + 1)}{Γ (\frac{j - k}{β} + n + 1)} .

The $k^{t h}$ moment of $X_{(n)}$ is given as

μ_{n : n}^{k} = n \sum_{j = 0}^{\infty} c (k, j) {(- 1)}^{\frac{1}{β}} \frac{Γ (\frac{j - k}{β} + n)}{Γ (\frac{j - k}{β} + n + 1)} .

Theorem 3: the $k^{t h}$ and $L^{t h}$ -product moments about zero $μ_{r : n}^{(k, L)}$ of $X_{(r)}$ and $X_{(s)}$ are given by

$μ_{r : n}^{(k, L)}$	$= C_{n, r, s} \sum_{j_{3}}^{s - r - 1} \sum_{j_{2}}^{\infty} \sum_{j_{1}}^{\infty} c (k, j_{1}) c (L, j_{2}) \frac{(\binom{s - r - 1}{j_{3}}) {(- 1)}^{\frac{j_{1} + j_{2}}{β} + j_{3}}}{\frac{j_{1} - k}{β} + j_{3} + r}$
	$\cdot B (\frac{j_{1} + j_{2} - k - L}{β} + s, n - s + 1) .$	(10)

For $k = 1 \dots n$ , $L = 1 \dots n$ and $r = 1 \dots n$ .

Where $c (k, j_{1}), c (L, j_{2})$ as given in (9).

3 Estimation Based on Complete Samples for IVT Distribution Shape Parameter

3.1 Maximum Likelihood Estimation

Suppose that $X_{1}, X_{2}, \dots, X_{n}$ is a simple random sample of size n drawn from $IVT (β)$ . In this section, the shape parameter of the IVT distribution will be estimated using the MLE as follows.

The likelihood function is given by

l (x; β) = \prod_{i = 1}^{n} 2 β (x_{i} - 1) x_{i}^{- (2 β + 1)} {(2 x_{i} - 1)}^{β - 1}

(11)

The natural logarithm of the likelihood function is given as

L (x; β) \propto n l o g (β) - (2 β + 1) \sum_{i = 1}^{n} \log (x_{i}) + (β - 1) \sum_{i = 1}^{n} \log (2 x_{i} - 1)

After differentiating the $L (x; β)$ and equating to zero the MLE for $β$ can be expressed in closed form as follows

\hat{β} = \frac{n}{\sum_{i = 1}^{n} \log ? (x_{i}^{2} / (2 x_{i} - 1))} .

3.2 Bayesian Estimation

In this section, we have discussed the Bayesian estimation procedure for the parameter of the IVT distribution and we get the Bayesian estimate of the unknown parameter under the squared error loss (SEL) function. We assume that the unknown parameter of the IVT distribution have gamma prior distribution and can be written with proportional as follows;

π (β | a_{1}, b_{1}) \propto β^{a_{1} - 1} e^{- β b_{1}}, β > 0, a_{1}, b_{1} > 0

(12)

Hyper-parameters determination: The hyper-parameters involved in priors (12) can be easily evaluated if we consider that prior mean and prior variance are known. The prior mean and prior variance will be obtained from the maximum likelihood estimate of $(β)$ by equating the mean and variance of $({\hat{β}}^{j})$ with the mean and variance of the considered priors (gamma prior), where $j = 1, 2, \dots, k$ and k is the number of random samples generated from the model in Section 3.1. Thus, on equating mean and variance of $({\hat{β}}^{j})$ with the mean and variance of gamma priors, we get ([6])

\frac{1}{k} \sum_{j = 1}^{k} {\hat{β}}^{j} = \frac{a_{1}}{b_{1}} & \frac{1}{k - 1} \sum_{j = 1}^{k} {({\hat{β}}^{j} - \frac{1}{k} \sum_{j = 1}^{k} {\hat{β}}^{j})}^{2} = \frac{a_{1}}{b_{1}^{2}}

Now, on solving the above two equations, the estimated hyper-parameters can be written as

$a_{1}$	$= \frac{{(\frac{1}{k} \sum_{j = 1}^{k} {\hat{β}}^{j})}^{2}}{\frac{1}{k - 1} \sum_{j = 1}^{k} {({\hat{β}}^{j} - \frac{1}{k} \sum_{j = 1}^{k} {\hat{β}}^{j})}^{2}} &$
$b_{1}$	$= \frac{\frac{1}{k} \sum_{j = 1}^{k} {\hat{β}}^{j}}{\frac{1}{k - 1} \sum_{j = 1}^{k} {({\hat{β}}^{j} - \frac{1}{k} \sum_{j = 1}^{k} {\hat{β}}^{j})}^{2}}$

Based on the likelihood function (11) and the gamma prior (12), the joint posterior density function of $β$ given the data can be written as

π (β | \underline{x}) = \frac{π (β) L (β | \underline{x})}{\int_{0}^{\infty} π (β) L (β | \underline{x}) d β} .

Then, the joint posterior density function can be written as

π (β | \underline{x}) = \frac{1}{k (β)} \prod_{i = 1}^{n} 2 (x_{i} - 1) x_{i}^{- (2 β + 1)} {(2 x_{i} - 1)}^{β - 1} β^{a_{1}} e^{- β b_{1}}

(13)

Where,

k (β) = \int_{0}^{\infty} \prod_{i = 1}^{n} 2 (x_{i} - 1) x_{i}^{- (2 β + 1)} {(2 x_{i} - 1)}^{β - 1} β^{a_{1}} e^{- β b_{1}} d β,

Thus, the Bayes estimate of $β$ based on SEL function is given by

$\tilde{β}$	$= E (β \| \underline{x})$
$\tilde{β}$	$= \frac{\int_{0}^{\infty} β π (β) L (β \| \underline{x}) d β}{\int_{0}^{\infty} π (β) L (β \| \underline{x}) d β}$	(14)

It should be noted that the ratio of integral in (3.2) cannot be obtained in closed forms. So, we use the MCMC approximation method to generate samples from (13) and to calculate the BE of $β$ and also to construct associated HPD intervals.

Markov Chain Monte Carlo (MCMC) is considered to be a computer-driven sampling technique. It permits one to characterize a distribution without knowing all of the distribution mathematical properties by random sampling values out of the distribution ([7]). We use Metropolis Hasting (M-H) method with normal proposal distribution to generate random numbers from (13). Thus, we perform the following steps of the M-H algorithm to draw samples from the posterior density (13) and in turn compute the Bayes estimate of $β$ and construct the corresponding HPD intervals ([8]):

I. Set initial values $θ^{(0)},$ M $=$ burn-in.

II. For i $=$ 1,…,N, repeat the following steps.

∙ Set $θ = θ^{(i - 1)}$ .

∙ Generate new candidate parameter values $ω$ from $N_{1} (\log (θ), S_{θ}) .$

∙ Set $θ^{'} = \exp (ω)$ .

∙ Calculate $A = \min (1, \frac{π (θ^{'} | \underline{\underline{x}})}{π (θ | \underline{x})})$

∙ Update $θ^{(i)} = θ^{'}$ with probability A; otherwise set $θ^{(i)} = θ .$

The approximate Bayes estimate of $θ^{(i)} = {(β^{(i)})}^{'}$ , $i = 1, 2, \dots$ ,N with respect to SEL function is given by

{\tilde{θ}}_{B S} = \frac{1}{N - M} \sum_{i = M + 1}^{N} θ^{(i)},

Where ${\tilde{θ}}_{BS}$ is Bayes estimate under SEL function and M is the burn-in-period (that is, a number of iterations before the stationary distribution is achieved).

3.3 Interval Estimation for IVT Distribution Shape Parameter

In this section, we propose different confidence intervals. One is based on the asymptotic distribution of $β$ , two different bootstrap confidence intervals, and finally, HPD intervals.

The Asymptotic Confidence Interval

The second derivative for $L (x; β)$ is trivially obtained as

\frac{d^{2} L}{d β^{2}} = - \frac{n}{β^{2}} .

The observed Fisher information matrix is given by

I (\hat{β}) = - {\frac{d^{2} L}{d β^{2}} |}_{β = \hat{β}} = \frac{n}{{\hat{β}}^{2}}

The asymptotic variance of $\hat{β}$ is

V (\hat{β}) = \frac{1}{I (\hat{β})} = \frac{{\hat{β}}^{2}}{n} .

The sampling distribution of $\frac{\hat{β} - β}{\sqrt{V (\hat{β})}}$ can be approximated by a standard normal distribution.

The large sample $(1 - α) 100 %$ confidence interval for $β$ is given by

({\hat{β}}_{L}, {\hat{β}}_{U}) = \hat{β} \mp Z_{\frac{α}{2}} \sqrt{V (\hat{β})} .

where $Z_{\frac{α}{2}}$ is the standard normal random variable and $(1 - α)$ is the confidence coefficient.

Bootstrap Confidence Intervals

The bootstrap confidence intervals are approximate confidence intervals but in general are better approximate than standard intervals. A parametric bootstrap interval provides much more information about the population value of the quantity of interest than does a point estimate. The parametric bootstrap methods are of two types: –

(i) The percentile bootstrap method (Boot-p) was proposed by [9].

(ii) The Bootstrap-t method (Boot-t) was proposed by [10].

Percentile Bootstrap (Boot-P) Confidence Interval

The boot-p method is rather simple and constructs confidence intervals directly from the percentiles of the bootstrap distribution of the estimated parameters. It is given by the following steps:

I. A complete sample is generated from the original data $T = (t_{1}, t_{2} \dots t_{n})$ and the MLE $\hat{θ} = {(\hat{β})}^{'}$ of the parameter $θ = {(β)}^{'}$ is computed.

II. Again, an independent complete bootstrap sample $T^{*} = (t_{1}^{*}, t_{2}^{*} \dots t_{n}^{*})$ is generated by using $\hat{θ}$ .

III. Now, compute the bootstrap MLE ${\hat{θ}}^{*}$ of parameter $θ$ based on $T^{*}$ , as in step-1.

IV. Repeat steps 2–3, B times representing B bootstrap MLE’s ${\hat{θ}}^{*}$ ’s based on B different bootstrap samples, i $=$ 1,2,… B.

V. Arrange all ${\hat{θ}}^{*}$ ’s in an ascending order to obtain the bootstrap sample i.e. ${\hat{θ}}^{*}_{(1)} \leq {\hat{θ}}^{*}_{(2)} \leq \dots \leq {\hat{θ}}^{*}_{(B)}$ . An approximate $100 (1 - ω) %$ boot-p confidence interval for $θ$ is obtained by $({\hat{θ}}^{*}_{[(\frac{ω}{2}) \times B]}, {\hat{θ}}^{*}_{[(1 - \frac{ω}{2}) \times B]})$ .

Where, $\frac{ω}{2}$ is the quantity that helps to determine the bootstrap point.

Bootstrap-t (Boot-t) Confidence Intervals

The bootstrap-t confidence interval is given by the following steps:

I. Steps 1 and 2 of boot-p and boot-t methods are the same.

II. Compute the bootstrap-t statistic $T^{*} = \frac{{\hat{θ}}^{*}_{b} - \hat{θ}}{\sqrt{v ({\hat{θ}}^{*}_{b})}}$ for ${\hat{θ}}^{*}_{b}$ where b $=$ 1, 2,…B.

III. To obtain a set of bootstrap statistics $T^{*}_{i}; i = 1, 2, \dots, B$ repeat steps 2–3, B times.

IV. Let $T^{*}_{(1)} \leq T^{*}_{(2)} \leq \dots \leq T^{*}_{(B)}$ be the ordered values of $T^{*}_{i}$ ; $i = 1, 2, \dots, B$ .

V. Now, the approximate $100 (1 - ω) %$ boot-t confidence interval for parameter $θ$ is obtained by

(\hat{θ} - {\hat{T}}^{*}_{[(1 - \frac{ω}{2}) \times B]} \sqrt{v (\hat{θ})}, \hat{θ} - {\hat{T}}^{*}_{[(\frac{ω}{2}) \times B]} \sqrt{v (\hat{θ})})

Highest Posterior Density (HPD) Intervals

The HPD intervals for the unknown parameters can be constructed by using the following algorithm: let $θ_{(1)}, θ_{(2)}, \dots, θ_{(n)}$ be the corresponding ordered MCMC sample, to construct the HPD interval, let $θ_{(j)}$ be the j $^{th}$ smallest of ${θ_{(i)}}$ and denote $D_{j} (n) = (θ_{(j)}, θ_{(j + [(1 - φ) n])})$ , where $0 < φ < 1$ For $j = 1, 2, \dots, n - [(1 - φ) n]$ be the HPD intervals then the best HPD interval that has the smallest interval width from $D_{j} {(n)}^{,} s$ . So, we can say $D_{j^{*}} (n) = (θ_{(j^{*})}, θ_{(j^{*} + [(1 - φ) n])})$ , be HPD interval for the unknown parameters have the smallest interval width among all $D_{j^{*}} {(n)}^{'}$ s. Where $j^{*}$ is chosen so that

θ_{(j^{*} + [(1 - φ) n])} - θ_{(j^{*})} = \min_{1 \leq j \leq n - [(1 - φ) n]} (θ_{(j + [(1 - φ) n])} - θ_{(j)})

Where $θ_{(j)} = β_{(j)}$ , and $θ_{(i)} = β_{(i)}$ then the HPD intervals for the unknown parameters can be constructed.

3.4 Simulation Study

A simulation study was carried to check the performance of the accuracy of point and interval estimates for several cases, for which estimate the one parameter of IVT distribution ( $β$ ) for the number of replications $(m = 1000)$ , for different sample sizes (n) as $n = 25$ , 50, 80, 100 and different parameters values. All the computations are performed using statistical software R.

The simulations results for MLE are summarized in Tables 2, 3, 4, and 5 and obtained by the following steps:

i. Specify initial values for parameter ( $β$ ) as (0.5), (0.8), (1.2) and (1.9).

ii. Specify the sample size n. as $n = 25, 50, 80, 100$ .

iii. Generate n standard uniform variates i.e. $U \sim Uniform (0, 1)$ .

iv. Generated complete samples of size n from IVT ( $β$ ) distribution by using the formula $x = U^{\frac{- 1}{β}} (1 + \sqrt{1 - U^{\frac{1}{β}}})$

v. Obtain the maximum likelihood estimates (MLEs).

vi. Obtain the mean, bias, mean squared error (MSE), asymptotic and bootstrap confidence intervals (CI’s) for the unknown parameters, average interval lengths (AILs), and coverage probability (CP) for the different sample size.

vii. Repeat steps 1–5 1000 times.

And the simulation results for the Bayesian estimate are summarized in Tables 2, 3, 4, and 5 which are obtained by the following steps:

i. Step I, ii, iii, iv, and v of the MLE simulation are the same

ii. By using the M-H algorithm shown in Section 3.2 under the informative prior and the non-informative prior and repeat the chain N times (N $=$ 10000) to obtain MCMC samples.

∙ For informative prior, we compute the hyperparameters for all simulation cases as in Table 1.

∙ For non-informative prior (P-II) we assume that hyper-parameter values are $a_{1} = b_{1} = 0$ .

iii. Compute the approximate Bayes estimator of $β$ under SEL function is given by

{\tilde{β}}_{SEL} = \frac{1}{N - M} \sum_{i = M + 1}^{N} g_{S E L} (β^{(i)}), i = 1, 2, \dots, N .

Where M ( $=$ 2000) is the burn-in-period (that is, a number of iterations before the stationary distribution is achieved).

Repeat step i–iii (1000) times to obtain the mean, bias, mean squared error (MSE), HPD intervals for the unknown parameters, average interval lengths (AILs), and coverage probability (CP) for the different sample sizes.

Table 1 The hyper parameters values under complete data

		Initial Values

Hyper-Parameters		$β_{0} = 0.5$	$β_{0} = 0.8$	$β_{0} = 1.2$	$β_{0} = 1.9$
30	a1	23.97	23.95	23.94	23.88
	b1	45.85	28.75	19.21	12.00
50	a1	48.92	48.93	48.99	48.97
	b1	95.59	59.47	40.02	25.21
80	a1	78.91	78.86	79.05	79.04
	b1	155.47	97.51	64.86	41.17
100	a1	98.90	98.97	99.06	98.90
	b1	195.76	123.19	82.52	51.58

Table 2 Average estimated values, MSEs, bias, asymptotic and bootstrap (t-p) CI intervals of MLEs and BEs of IVT distribution parameters under complete data

images

Table 3 Average estimated values, MSEs, bias, asymptotic and bootstrap (t-p) CI intervals of MLEs and BEs of IVT distribution parameters under complete data

images

Table 4 Average estimated values, MSEs, bias, asymptotic and bootstrap (t-p) CI intervals of MLEs and BEs of IVT distribution parameters under complete data

images

From tabulated values in Tables 2, 3, 4, and 5 it can be noticed that:

i. As expected, with respect to MSEs, higher values of n lead to better estimates.

ii. It is also noticed that the maximum likelihood estimates compete well with non-informative Bayes estimates, and the performance of the Bayes estimates obtained under informative prior is better than the non-informative Bayes estimates.

iii. It can also be noticed that under informative prior the AILs and associated CPs of HPD intervals are better than those of non-informative priors, bootstrap (p, t), and asymptotic confidence intervals respectively.

Table 5 Average estimated values, MSEs, bias, asymptotic and bootstrap (t-p) CI intervals of MLEs and BEs of IVT distribution parameters under complete data

images

3.5 Application to Real Data Set

In this section, the IVT distribution will be fitted to a real data set, to show how the IVT distribution can be applied in practice. moreover the IVT distribution will also compare with other inverted distributions that are fitted this data such as: inverse exponential (IE), inverse Rayleigh (IR), inverse Lindley(IL). And they will be introduced below as

The cdf, pdf of the inverse exponential (IE) distribution are respectively as

F_{I E} (x) = e^{- \frac{λ}{x}} x > 0 and λ > 0 and f_{I E} (x) = \frac{λ}{x^{2}} e^{- \frac{λ}{x}} .

The cdf, pdf of the inverse Rayleigh (IR) distribution are respectively as:

$F_{I R} (x)$	$= e^{- {(\frac{σ}{x})}^{2}} x > 0 and σ > 0 and$
$f_{I R} (x)$	$= \frac{2 σ^{2}}{x^{3}} e^{- {(\frac{σ}{x})}^{2}} .$

The cdf, pdf of the inverse Lindley (IL) distribution are respectively as

$F_{I L} (x)$	$= [1 + \frac{θ}{1 + θ} \frac{1}{x}] e^{- \frac{θ}{x}} x > 0 and θ > 0 and$
$f_{I L} (x)$	$= \frac{θ^{2}}{1 + θ} (\frac{1 + x}{x^{3}}) e^{- \frac{θ}{x}} .$

The data set consists of 100 observations of breaking stress of carbon fibers in (Gba) which are listed as follows:
1.061, 1.117, 1.162, 1.183, 1.187, 1.192, 1.196, 1.213, 1.215, 1.219, 1.220, 1.224, 1.225, 1.228, 1.237, 1.240, 1.244, 1.259, 1.261, 1.263, 1.276, 1.310, 1.321, 1.329, 1.331, 1.337, 1.351, 1.359, 1.388, 1.408, 1.449, 1.449, 1.450, 1.459, 1.471, 1.475, 1.477, 1.480, 1.489, 1.501, 1.507, 1.515, 1.530,1.530, 1.533, 1.544, 1.544, 1.552, 1.556, 1.562, 1.566, 1.585, 1.586, 1.599, 1.602, 1.614, 1.616, 1.617, 1.628, 1.684, 1.711, 1.718, 1.733, 1.738, 1.743, 1.759, 1.777, 1.794, 1.799, 1.806, 1.814, 1.816, 1.828, 1.830, 1.884, 1.892, 1.944, 1.972, 1.984, 1.987, 2.020, 2.030, 2.029, 2.035, 2.037, 2.043, 2.046, 2.059, 2.111, 2.165, 2.686, 2.778, 2.972, 3.504, 3.863, 5.306

Figure 6 shows that the empirical date compared by the inverted distributions namely IVT, IE, IR and IL.

Figure 6 Empirical distribution for lifetimes for carbon fibers data.

Table 6 MLEs, AIC, BIC, AICC and HQIC values, and Kolmogorov-Smirnov statistics for carbon fibers data

		Measures

Model	MLE	p-value	K-S	-2log L	AIC	BIC	AICc	HQIC
IVT	5.6313	0.1197	0.1211	90.3201	92.3201	94.8844	90.3617	93.3566
IE	1.5680	0.0000	0.4528	290.4326	292.4326	294.9969	290.4742	293.4691
IR	1.5293	0.0000	0.3407	173.0542	175.0542	177.6185	173.0958	176.0907
IL	2.0773	0.0000	0.4350	280.3121	282.3121	284.8765	280.3538	283.3487

4 Estimation Based on Random Censored Samples for IVT Distribution Shape Parameter

4.1 Model Assumption and Description

The random censoring can be described as follows: if we have n units under test, Let their lifetime is $T_{1}, T_{2}, \dots, T_{n}$ which are independent and identically distributed (iid) random variables with pdf $f_{T} (t), t > 0$ and cdf $F_{T} (t) t > 0$ , their random censoring times are $C_{1}, C_{2}, \dots, C_{n}$ which are iid with pdf $g_{C} (c), c > 0$ and cdf $F_{C} (c), c > 0$ , assume $T_{i} {}^{'}s$ and $C_{i} {}^{'}s$ be mutually independent. Note that, between $T_{i} {}^{'}s$ and $C_{i} {}^{'}s$ , only one will be observed. Further, let the actual observed time be $X_{i} = \min (T_{i}, C_{i}), i = 1, \dots, n$ , and the indicator variable $δ_{i}$ are defined as

δ_{i} = {\begin{matrix} 1; & T_{i} \leq C_{i} \\ 0; & T_{i} > C_{i} \end{matrix}

(15)

The censored data ( $X_{i}$ ) is known as the random censoring samples. The likelihood function under random censoring is given by [11]

L = \prod_{i = 1}^{n} {[f_{T} (x) S_{C} (x_{i})]}^{δ_{i}} {[g_{C} (x_{i}) R_{T} (x_{i})]}^{1 - δ_{i}}

(16)

Where, $R_{T} (x_{i}) = 1 - F_{T} (t)$ and $S_{C} (x_{i}) = 1 - F_{C} (c)$ .

4.2 Maximum Likelihood Estimation

In this section, we obtain the MLEs for the unknown parameters of the IVT distribution. Let the lifetime T and censoring time C follow IVT $(β_{1})$ and IVT $(β_{2})$ respectively. Then the likelihood function for the unknown parameters under random censoring becomes:

$L$	$= \prod_{i = 1}^{n} {[2 β_{1} (x_{i} - 1) x_{i}^{- 2 β_{1} - 2 β_{2} - 1} {(2 x_{i} - 1)}^{β_{1} + β_{2} - 1}]}^{δ_{i}}$
	${[2 β_{2} (x_{i} - 1) x^{- 2 β_{2} - 2 β_{1} - 1} {(2 x_{i} - 1)}^{β_{2} + β_{1} - 1}]}^{1 - δ_{i}}$	(17)

where $\sum_{i = 1}^{n} δ_{i} = r$ is the observed number of uncensored lifetimes or failures.

Then, the corresponding log-likelihood function can be written as

$l$	$= r l o g (2 β_{1}) + \sum_{i = 1}^{n} δ_{i} \log (x_{i} - 1) + (- 2 β_{2} - 2 β_{1} - 1) \sum_{i = 1}^{n} δ_{i} \log (x_{i})$
	$+ (β_{1} + β_{2} - 1) \sum_{i = 1}^{n} δ_{i} \log (2 x_{i} - 1) + (n - r) l o g (2 β_{2})$
	$+ \sum_{i = 1}^{n} (1 - δ_{i}) \log (x_{i} - 1) + (- 2 β_{1} - 2 β_{2} - 1) \sum_{i = 1}^{n} (1 - δ_{i}) \log (x_{i})$
	$+ (β_{2} + β_{1} - 1) \sum_{i = 1}^{n} (1 - δ_{i}) \log (2 x_{i} - 1)$	(18)

Differentiating (18) with respect to $β_{1}$ and $β_{2}$ gets:

$\frac{\partial l}{\partial β_{1}}$	$= \frac{r}{β_{1}} - 2 \sum_{i = 1}^{n} δ_{i} \log (x_{i}) + \sum_{i = 1}^{n} δ_{i} \log (2 x_{i} - 1)$
	$- 2 \sum_{i = 1}^{n} (1 - δ_{i}) \log (x_{i})$
	$+ \sum_{i = 1}^{n} (1 - δ_{i}) \log (2 x_{i} - 1)$	(19)
$\frac{\partial l}{\partial β_{2}}$	$= \frac{n - r}{β_{2}} - 2 \sum_{i = 1}^{n} δ_{i} \log (x_{i}) + \sum_{i = 1}^{n} δ_{i} \log (2 x_{i} - 1)$
	$- 2 \sum_{i = 1}^{n} (1 - δ_{i}) \log (x_{i})$
	$+ \sum_{i = 1}^{n} (1 - δ_{i}) \log (2 x_{i} - 1)$	(20)

Equating the first derivatives in (19) and (20) to zero and solving for $β_{1}$ and $β_{2}$ to get the MLEs ${\hat{β}}_{1}$ and ${\hat{β}}_{2}$ of $β_{1}$ and $β_{2}$ , respectively in closed form as follows

{\hat{β}}_{1} = \frac{r}{\begin{matrix} 2 \sum_{i = 1}^{n} δ_{i} \log (x) - \sum_{i = 1}^{n} δ_{i} \log (2 x - 1) \\ + 2 \sum_{i = 1}^{n} (1 - δ_{i}) \log (x) - \sum_{i = 1}^{n} (1 - δ_{i}) \log (2 x - 1) \end{matrix}}

and

{\hat{β}}_{2} = \frac{n - r}{\begin{matrix} 2 \sum_{i = 1}^{n} δ_{i} \log (x) - \sum_{i = 1}^{n} δ_{i} \log (2 x - 1) \\ + 2 \sum_{i = 1}^{n} (1 - δ_{i}) \log (x) - \sum_{i = 1}^{n} (1 - δ_{i}) \log (2 x - 1) \end{matrix}}

4.3 Bayes Estimation for IVT Shape Parameter

In this section, we have discussed the Bayesian estimation procedure for the parameters of the IVT distribution based random censoring samples and we get BEs of the unknown parameters under the squared error loss (SEL) function. We assume that the unknown parameter of the IVT distribution has the independent gamma prior and can be written with proportional as follows;

$π (β_{1} \| a_{1}, b_{1})$	$\propto β_{1}^{a_{1} - 1} e^{- β_{1} b_{1}}, β_{1} > 0, a_{1}, b_{1} > 0$	(21)
$π (β_{2} \| a_{2}, b_{2})$	$\propto β_{2}^{a_{2} - 1} e^{- β_{2} b_{2}}, β_{2} > 0, a_{2}, b_{2} > 0$	(22)

Therefore, the joint prior density of $β_{1}$ and $β_{2}$ can be written with proportional as follows:

π (β_{1}, β_{2}) \propto β_{1}^{a_{1} - 1} β_{2}^{a_{2} - 1} e^{- (β_{1} b_{1} + β_{2} b_{2})} β_{1}, β_{2} > 0, a_{1}, a_{2}, b_{1}, b_{2} > 0

(23)

Hyper-parameters determination: As in Section 3.2, the hyper-parameters involved in priors (21) and (22) can be easily evaluated, if we consider that prior mean and prior variance are known. The prior mean and prior variance will be obtained from the maximum likelihood estimates of $(β_{1}, β_{2})$ by equating the mean and variance of $({\hat{β}}_{1}^{j}, {\hat{β}}_{2}^{j})$ with the mean and variance of the considered priors (gamma prior), where $j = 1, 2, \dots, k$ and k is the number of random samples generated from the model in Section 4.2. Thus, on equating mean and variance of $({\hat{β}}_{1}^{j}, {\hat{β}}_{2}^{j})$ with the mean and variance of gamma priors, we get

\frac{1}{k} \sum_{j = 1}^{k} {\hat{β}}_{1}^{j} = \frac{a_{1}}{b_{1}} & \frac{1}{k - 1} \sum_{j = 1}^{k} {({\hat{β}}_{1}^{j} - \frac{1}{k} \sum_{j = 1}^{k} {\hat{β}}_{1}^{j})}^{2} = \frac{a_{1}}{b_{1}^{2}}

Now, on solving the above two equations, the estimated hyper-parameters can be written as

$a_{1}$	$= \frac{{(\frac{1}{k} \sum_{j = 1}^{k} {\hat{β}}_{1}^{j})}^{2}}{\frac{1}{k - 1} \sum_{j = 1}^{k} {({\hat{β}}_{1}^{j} - \frac{1}{k} \sum_{j = 1}^{k} {\hat{β}}_{1}^{j})}^{2}} &$
$b_{1}$	$= \frac{\frac{1}{k} \sum_{j = 1}^{k} {\hat{β}}_{1}^{j}}{\frac{1}{k - 1} \sum_{j = 1}^{k} {({\hat{β}}_{1}^{j} - \frac{1}{k} \sum_{j = 1}^{k} {\hat{β}}_{1}^{j})}^{2}}$

A similar procedure for determining the hyperparameters ( $a_{2}, b_{2}$ ) can be used for $β_{2}$ .

Based on the likelihood function (4.2) and the joint prior density (23), the joint posterior density $β_{1}$ and $β_{2}$ given the data can be written as

π (β_{1}, β_{2} | \underline{x}) = \frac{π (β_{1}, β_{2}) L (β_{1}, β_{2} | \underline{x})}{\int_{0}^{\infty} \int_{0}^{\infty} π (β_{1}, β_{2}) L (β_{1}, β_{2} | \underline{x}) d β_{1} d β_{2}} .

Then, the joint posterior function can be written as

$π (β_{1}, β_{2} \| \underline{x})$	$= \frac{1}{k (β_{1}, β_{2})} 4 β_{1}^{r + a_{1} - 1} β_{2}^{(n - r) + a_{2} - 1} e^{- (β_{1} b_{1} + β_{2} b_{2})}$
	$\prod_{i = 1}^{n} {(x_{i} - 1)}^{δ_{i}} \prod_{i = 1}^{n} {(x_{i} - 1)}^{1 - δ_{i}}$
	$\prod_{i = 1}^{n} x_{i}^{(- 2 β_{1} - 2 β_{2} - 1) δ_{i}} \prod_{i = 1}^{n} x_{i}^{(- 2 β_{2} - 2 β_{1} - 1) (1 - δ_{i})}$
	$\prod_{i = 1}^{n} {(2 x_{i} - 1)}^{(β_{2} - β_{1} - 1) δ_{i}}$
	$\prod_{i = 1}^{n} {(2 x_{i} - 1)}^{(β_{1} - β_{2} - 1) (1 - δ_{i})}$	(24)

Where,

$k (β_{1}, β_{2})$	$= \int_{0}^{\infty} \int_{0}^{\infty} 4 β_{1}^{r + a_{1} - 1} β_{2}^{(n - r) + a_{2} - 1} e^{- (β_{1} b_{1} + β_{2} b_{2})}$
	$\cdot \prod_{i = 1}^{n} {(x_{i} - 1)}^{δ_{i}} \cdot \prod_{i = 1}^{n} {(x_{i} - 1)}^{1 - δ_{i}}$
	$\cdot \prod_{i = 1}^{n} x_{i}^{(- 2 β_{1} - 2 β_{2} - 1) δ_{i}} \prod_{i = 1}^{n} x_{i}^{(- 2 β_{2} - 2 β_{1} - 1) (1 - δ_{i})}$
	$\cdot \prod_{i = 1}^{n} {(2 x_{i} - 1)}^{(β_{2} - β_{1} - 1) δ_{i}}$
	$\cdot \prod_{i = 1}^{n} {(2 x_{i} - 1)}^{(β_{1} - β_{2} - 1) (1 - δ_{i})} d β_{1}, d β_{2},$

Thus, the Bayes estimate of $g (β_{1}, β_{2})$ based on SEL function is given by

${\tilde{g}}_{B S} (β_{1}, β_{2})$	$= E (g (β_{1}, β_{2}) \| \underline{x})$
${\tilde{g}}_{B S} (β_{1}, β_{2})$	$= \frac{\int_{0}^{\infty} \int_{0}^{\infty} g (β_{1}, β_{2}) π (β_{1}, β_{2}) L (β_{1}, β_{2} \| \underline{x}) d β_{1} d β_{2}}{\int_{0}^{\infty} \int_{0}^{\infty} π (β_{1}, β_{2}) L (β_{1}, β_{2} \| \underline{x}) d β_{1} d β_{2}}$ (25)

It should be noted that the ratio of integral in (4.3) cannot be obtained in closed forms. So, we use the MCMC approximation method to generate samples from (24) and to calculate the BEs of $β_{1}$ and $β_{2}$ and also to construct associated HPD intervals. where we use the M-H method with normal proposal distribution.

4.4 Interval Estimation Based on Random Censoring Samples

In this section, we propose different confidence intervals. One is based on the asymptotic distribution of $β_{1}$ and $β_{2}$ , two different bootstrap confidence intervals and finally, HPD intervals.

Asymptotic Confidence Intervals

The asymptotic variance-covariance matrix of the MLEs of $β_{1}$ and $β_{2}$ can be obtained by inverting the observed information matrix, and is given as follows:

{[\begin{matrix} - E (\frac{\partial^{2} ℓ}{\partial β_{1}^{2}}) & - E (\frac{\partial^{2} ℓ}{\partial β_{1} \partial β_{2}}) \\ - E (\frac{\partial^{2} ℓ}{\partial β_{2} \partial β_{1}}) & - E (\frac{\partial^{2} ℓ}{\partial β_{2}^{2}}) \end{matrix}]}_{(θ = \hat{θ})}^{- 1}

= [\begin{matrix} V_{11} & V_{12} \\ V_{21} & V_{22} \end{matrix}]

Where $\hat{θ} = {({\hat{β}}_{1}, {\hat{β}}_{2})}^{'}$ and $θ = {(β_{1}, β_{2})}^{'}$ . The elements of the observed information matrix for $β_{1}$ and $β_{2}$ are given as follows:

{\frac{\partial^{2} l}{\partial β_{1}^{2}} |}_{θ = \hat{θ}} = - \frac{r}{{\hat{β}}_{1}^{2}}

Then the observed fisher information

I ({\hat{β}}_{1}) = - {\frac{\partial^{2} l}{\partial β_{1}^{2}} |}_{θ = \hat{θ}} = \frac{r}{{\hat{β}}_{1}^{2}}

The asymptotic variance of ${\hat{β}}_{1}$ is

V ({\hat{β}}_{1}) = \frac{1}{I ({\hat{β}}_{1})} = \frac{{\hat{β}}_{1}^{2}}{r}

and

{\frac{\partial^{2} l}{\partial β_{2}^{2}} |}_{θ = \hat{θ}} = - \frac{(n - r)}{β_{2}^{2}}

I ({\hat{β}}_{1}) = - {\frac{\partial^{2} l}{\partial β_{2}^{2}} |}_{θ = \hat{θ}} = \frac{n - r}{{\hat{β}}_{2}^{2}}

The asymptotic variance of ${\hat{β}}_{2}$ is

V ({\hat{β}}_{2}) = \frac{1}{I ({\hat{β}}_{2})} = \frac{{\hat{β}}_{2}^{2}}{n - r}

The sampling distribution of $\frac{{\hat{β}}_{i} - β_{i}}{\sqrt{v ({\hat{β}}_{i})}}$ where $i = 1, 2$ , can be approximated by a standard normal distribution. The large $(1 - α) 100 %$ confidence intervals for $β_{1}$ and $β_{2}$ are given by

({\hat{β}}_{i L}, {\hat{β}}_{i U}) = {\hat{β}}_{i} \pm Z_{\frac{α}{2}} \sqrt{v ({\hat{β}}_{i})} where i = 1, 2 .

Bootstrap Confidence Intervals

As in Section 3.3, two types of parametric bootstrap methods are considered

(iii) Percentile bootstrap method (Boot-p)

(iv) Bootstrap-t method (Boot-t)

Percentile Bootstrap (Boot-P) Confidence Interval

It given by the following steps:

i. A randomly censored sample is generated from the original data $T = (t_{1}, t_{2} \dots t_{n})$ and the MLE $\hat{θ} = ({\hat{β}}_{1}$ , ${\hat{β}}_{2})^{'}$ of the parameter $θ = {(β_{1}, β_{2})}^{'}$ is computed.

ii. Again, an independent randomly censored bootstrap sample $T^{*} = (t_{1}^{*}, t_{2}^{*} \dots t_{n}^{*})$ is generated by using $\hat{θ}$ .

iii. Now, compute the bootstrap MLE ${\hat{θ}}^{*}$ of parameter $θ$ based on $T^{*}$ , as in step-1.

iv. Repeat steps 2–3, B times representing B bootstrap MLE’s ${\hat{θ}}^{*}$ ’s based on B different bootstrap samples, i $=$ 1,2,… B.

v. Arrange all ${\hat{θ}}^{*}$ ’s in an ascending order to obtain the bootstrap sample i.e. ${\hat{θ}}^{*}_{(1)} \leq {\hat{θ}}^{*}_{(2)} \leq \dots \leq {\hat{θ}}^{*}_{(B)}$ . An approximate $100 (1 - ω) %$ boot-p confidence interval for $θ$ is obtained by $({\hat{θ}}^{*}_{[(\frac{ω}{2}) \times B]}, {\hat{θ}}^{*}_{[(1 - \frac{ω}{2}) \times B]})$ .

Where, $\frac{ω}{2}$ is the quantity that helps to determine the bootstrap point.

Bootstrap-t (Boot-t) Confidence Intervals

The bootstrap-t confidence interval is given by the following steps:

i. Steps 1 and 2 of boot-p and boot-t methods are the same.

ii. Compute the bootstrap-t statistic $T^{*} = \frac{{\hat{θ}}^{*}_{b} - \hat{θ}}{\sqrt{v ({\hat{θ}}^{*}_{b})}}$ for ${\hat{θ}}^{*}_{b}$ where b $=$ 1, 2,… B.

iii. To obtain a set of bootstrap statistics $T^{*}_{i}; i = 1, 2, \dots, B$ repeat steps 2–3, B times.

iv. Let $T^{*}_{(1)} \leq T^{*}_{(2)} \leq \dots \leq T^{*}_{(B)}$ be the ordered values of $T^{*}_{i}; i = 1, 2, \dots, B$ .

v. Now, the approximate $100 (1 - ω) %$ boot-t confidence interval for parameter $θ$ is obtained by

(\hat{θ} - {\hat{T}}^{*}_{[(1 - \frac{ω}{2}) \times B]} \sqrt{v (\hat{θ})}, \hat{θ} - {\hat{T}}^{*}_{[(\frac{ω}{2}) \times B]} \sqrt{v (\hat{θ})})

Highest Posterior Density (HPD) Intervals

As in Section 3.3, the HPD intervals for the unknown parameters can be constructed. let $θ_{(1)}, θ_{(2)}, \dots, θ_{(n)}$ be the corresponding ordered MCMC sample, to construct the HPD interval, let $θ_{(j)}$ be the j $^{th}$ smallest of ${θ_{(i)}}$ and denote $D_{j} (n) = (θ_{(j)}, θ_{(j + [(1 - φ) n])})$ , where $0 < φ < 1$ . For $j = 1, 2, \dots, n - [(1 - φ) n]$ be the HPD intervals then the best HPD interval that has the smallest interval width from $D_{j} (n)$ ’s. So, we can say $D_{j^{*}} (n) = (θ_{(j^{*})}, θ_{(j^{*} + [(1 - φ) n])})$ , be HPD interval for the unknown parameters have the smallest interval width among all $D_{j^{*}} (n)$ ’s. Where $j^{*}$ is chosen so that

θ_{(j^{*} + [(1 - φ) n])} - θ_{(j^{*})} = \min_{1 \leq j \leq n - [(1 - φ) n]} (θ_{(j + [(1 - φ) n])} - θ_{(j)})

Where $θ_{(j)} = β_{1 (j)}, β_{2 (j)}$ , and $θ_{(i)} = β_{1 (i)}, β_{2 (i)}$ then the HPD intervals for the unknown parameters can be constructed.

4.5 Simulation Study

A simulation study was carried to check the performance of the accuracy of point and interval estimates for several cases, for which estimate the two parameters of IVT distribution ( $β_{1}$ and $β_{2}$ ) for number of replications $(m = 1000)$ , for different sample sizes (n) as $n = 25$ , 50, 80, 100 and different parameters values. All the computations are performed using statistical software R.

The simulations results for MLEs are summarized in Tables 8, 9, 10, and 11 and obtained by the following steps:

i. Specify initial values for parameters ( $β_{1}$ and $β_{2}$ ) as (0.5, 0.3), (0.8, 0.9), (1.2, 1) and (1.9, 1.5).

ii. Specify the sample size n. as $n = 25, 50, 80, 100 .$

iii. Generate n standard uniform variates i.e. $U \sim Uniform (0, 1)$ .

iv. Generated samples of size n from IVT ( $β_{1}$ ) distribution (lifetimes) and IVT $(β_{2})$ (censoring times) distribution by using the formula

$t$	$= U^{\frac{- 1}{β_{1}}} (1 + \sqrt{1 - U^{\frac{1}{β_{1}}}}) and$
$c$	$= U^{\frac{- 1}{β_{2}}} (1 + \sqrt{1 - U^{\frac{1}{β_{2}}}}), respectively .$

v. Calculate the times $x_{i} = \min (T_{i}, C_{i})$ and the censorship indicators $δ_{i}$ , which are equal to 1 if $T_{i} < C_{i}$ and 0 otherwise.

vi. Obtain the maximum likelihood estimates (MLEs).

vii. Obtain the mean, bias, mean squared error (MSE), asymptotic and bootstrap confidence intervals (CI’s) for the unknown parameters, average interval lengths (AILs), and coverage probability (CP) for the different sample size.

viii. Repeat steps 1–5 1000 times.

And the simulation results for Bayesian estimates are summarized in Tables 8, 9, 10, and 11 which are obtained by the following steps:

iv. Step I, ii, iii, iv, and v of the MLEs simulation are the same

v. By using the M-H algorithm shown in Section 4.2 under the informative prior and the non-informative prior and repeat the chain N times (N $=$ 10000) to obtain MCMC samples.

∙ For informative prior, we compute the hyperparameters for all simulation cases as in Table 7.

∙ For non-informative prior (P-II) we assume that hyper-parameter values are $a_{1} = b_{1} =$ $a_{2} = b_{2} = a_{3} =$ $b_{3} = a_{4} = b_{4} = a_{5} = b_{5} = 0$ .

vi. Compute the approximate Bayes estimator of $g (β_{1}, β_{2})$ under SEL is given by

{\tilde{g}}_{SEL} (β_{1}, β_{2}) = \frac{1}{N - M} \sum_{i = M + 1}^{N} g_{SEL} (β_{1}^{(i)}, β_{2}^{(i)},), i = 1, 2, \dots, N .

Where M ( $=$ 2000) is the burn-in-period (that is, a number of iterations before the stationary distribution is achieved).

vii. Repeat step i–iii 1000 times to obtain the mean, bias, mean squared error (MSE), HPD intervals for the unknown parameters, average interval lengths (AILs), and coverage probability (CP) for the different sample size.

Table 7 The Hyper Parameters Values under random censoring data

		Initial Values

		$β_{01} = 0.5$ ,	$β_{01} = 0.8$ ,	$β_{01} = 1.2$ ,	$β_{01} = 1.9$ ,
Hyper-Parameters		$β_{02} = 0.3$	$β_{02} = 0.9$	$β_{02} = 1$	$β_{02} = 1.5$
30	a1	18.0	13.61	15.74	16.23
	a2	10.9	15.28	13.28	12.73
	b1	34.9	16.49	8.23	8.23
	b2	34.7	16.34	8.18	8.18
50	a1	30.7	22.81	22.81	27.64
	a2	18.3	26.12	26.12	21.41
	b1	60.2	27.87	14.15	14.15
	b2	60.2	27.86	14.17	14.17
80	a1	49.6	36.91	42.88	43.81
	a2	29.5	42.19	36.21	35.25
	b1	97.8	45.93	22.98	22.98
	b2	97.8	45.90	22.97	22.97
100	a1	61.3	46.57	54.36	55.37
	a2	37.7	52.37	44.60	43.60
	b1	121.7	57.66	28.86	28.86
	b2	121.7	57.61	28.84	28.84

Table 8 Average estimated values, MSEs, bias, asymptotic and bootstrap (t-p) CI intervals of MLEs and BEs of IVT distribution parameters under random censoring data

images

Table 9 Average estimated values, MSEs, bias, asymptotic and bootstrap (t-p) CI intervals of MLEs and BEs of IVT distribution parameters under random censoring data

images

Table 10 Average estimated values, MSEs, bias, asymptotic and bootstrap (t-p) CI intervals of MLEs and BEs of IVT distribution parameters under random censoring data

images

Table 11 Average estimated values, MSEs, bias, asymptotic and bootstrap (t-p) CI intervals of MLEs and BEs of IVT distribution parameters under random censoring data

images

From the results in Tables 8–11 the following conclusion can be made:

i. Similar to the complete case based on MSEs, higher values of $n$ lead to better estimates.

ii. The CPs of the MLEs are better than those of the CPs of Bayes estimates obtained under informative prior and the non-informative Bayes estimates, respectively.

iii. The MSEs of the MLEs are less than the BEs under the SEL function.

iv. It can be noticed that under informative prior the AILs and of HPD intervals are better than those of non-informative priors, Bootstrap (t – p), and MLEs.

v. Estimates obtained by the MLEs and BEs are almost unbiased.

4.6 Application to Real Data

In this section, the IVT distribution will be fitted to a real data set, to show how the IVT distribution can be applied in practice. These data are taken from a lung cancer study described by [12]. These data show remission times (in days) of a group of 15 patients. The data set is given as: (8, 10, 11, 25*, 42, 72, 82, 100*, 110, 118, 126, 144, 228, 314, 411). The observations with (*) sign the censored times. For this data set, the unknown parameter ( $β$ ) of the IVT distribution will be estimated by the maximum-likelihood method, and with this, the estimate (MLE), the values of the Kolmogorov-Smirnov (KS) statistic (the distance between the empirical CDFs and the fitted CDFs), Akaike information criterion (AIC ), Bayesian information criterion (BIC) and Hannan-Quinn information criterion (HQIC) are calculated. These results are summarized in Table 12:

Table 12 The Values of Goodness of Fit Test for Lung Cancer Data Set to the IVT distribution

						k-s

Distribution	$B$	$-$ 2log L	AIC	BIC	HQIC	D-statistics	p-value
IVT	0.277	299.65	301.7	302.2	301.5	0.3408	0.0751
IVT*	0.309	42.71	44.71	43.4	41.98	0.5452	0.4137
Note: (*) indicates the censoring times’ distribution.

From Table 12, the null hypothesis is not rejected, these lung cancer data may be modeled by the IVT distribution.

Moreover, MLE and Bayesian estimation methods are applied for estimating the model unknown parameter. For calculation of BEs, the hyper-parameters $a_{1}, b_{1}, a_{2}$ and $b_{2}$ are chosen such that the expected value $M_{β_{1}}$ of $β_{1}$ is 0.2437 with a variance $V_{β_{1}} = 0.0046$ giving $a_{1} = 13$ and $b_{1} = 53.34$ , the expected value $M_{β_{2}}$ of $β_{2}$ is 0.0375 with a variance $V_{β_{2}} = 0.0007$ giving $a_{2} = 2$ and $b_{2} = 53.33$ . these results are listed in Table 13.

The empirical distribution for lifetimes and for censoring times for the lung cancer data are represented in Figures 7 and 8 respectively.

Table 13 The MLEs and BEs of the parameters from lung cancer data set

		BEs Under SEL Function		Confidence Intervals

					AILs (HPD Interval)

Parameter	MLEs	P-I	P-II	AILs (Asy CI)	P-I	P-II
${\hat{β}}_{1}$	0.2437	0.2355	0.1761	0.2672 (0.1341,0.4012)	0.1284 (0.881,0.3165)	0.0533 (0.1494,0.2027)
${\hat{β}}_{2}$	0.0375	0.0344	0.0321	0.1083 (0.0074,0.1158)	0.1083 (0.0232,0.0476)	0.0108 (0.027,0.0378)

Note: AILs- Average interval lengths.

Figure 7 Empirical distribution for lifetimes for lung cancer data.

Furthermore, the inverted distributions defined in Section 3.5 can be used to fit this data also with numerical results listed in Table 14 and Figure 9.

Figure 8 Empirical distribution for censoring times for lung cancer data.

Table 14 MLEs, AIC, BIC AICC and HQIC values, and Kolmogorov- Smirnov statistics for Lung Cancer Data lifetimes

		Measures

Model	MLE	p-value	K-S	$-$ 2log L	AIC	BIC	AICc	HQIC
IL	32.725	0.1052	0.3013	$-$ 195.5445	181.3550	182.0630	179.6216	181.3474
IE	36.90926	0.1051	0.3014	$-$ 179.3469	181.3469	182.0550	179.6136	181.3394
IR	0.824718	0.0000	0.5911	$-$ 179.3550	211.8932	212.6012	210.1599	211.8857

Figure 9 Empirical distribution for different lifetimes for lung cancer data.

5 Conclusion

In this paper, we have obtained the maximum likelihood estimates and Bayes estimates for the unknown parameter of the IVT distribution based on complete and random censoring data, the confidence intervals, HPD intervals, and bootstrap (p-t) intervals are also obtained. We perform some simulations to see the performances of the MLEs and BEs incomplete and random censoring data. One real data set has been re-analyzed based on random censoring data.

References

[1] Topp, C.W. and Leone, F.C. (1955). A family of J-shaped frequency functions, Journal of the American Statistical Association, 50, 209–219.

[2] Nadarajah, S. and Kotz, S. (2003). Moments of some J-shaped distributions, Journal of Applied Statistics, 30, 311–317.

[3] Ghitany, M.E., Kotz, S. and Xie, M. (2005). On some reliability measures and their stochastic ordering for the Topp–Leone distribution, Journal of Applied Statistics, 32, 715–722.

[4] Bayoud, H. (2016). Admissible minimax estimators for the shape parameter of Topp–Leone distribution, Communications in Statistics-Theory and Methods, doi: 10.1080/03610926.2013.818700.

[5] Muhammed, H.Z. (2019). On The Inverted Topp-Leone Distribution, international journal of reliability and applications, 20, 17–28.

[6] Dey, S., Singh, S., Tripathi, Y.M. and Asgharzadeh, A. (2016). Estimation and prediction for a progressively censored generalized inverted exponential distribution. Statistical Methodology, 132, 185–202.

[7] Ravenzwaaij, D.V., Cassey, P. and Brown, S.D. (2018). A simple introduction to Markov Chain Monte-Carlo sampling. Psychonomic Bulletin Review, 25, 143–154.

[8] Dey, S. and Pradhan, B. (2014). Generalized inverted exponential distribution under hybrid censoring. Statistical Methodology, 18, 101–114.

[9] Efron, B., and Tibshirani, R. J. (1993). An Introduction to the Bootstrap. New York: Chapman and Hall.

[10] Hall, P. (1988). Theoretical comparison of bootstrap confidence intervals. The Annals of Statistics, 927–953.

[11] Lawless, J. F. (2011). Statistical Models And Methods for Lifetime Data, Second edition. John Wiley & Sons, Inc, Canada.

[12] Kalbfleisch, J. D. and Prentice, R. L. (1980). The Statistical Analysis of Failure Time Data. New York: Wiley.

Biographies

Hiba Zeyada Muhammed received a bachelor’s degree in Statistics from the Faculty of Science at Cairo University in 2006, a master’s degree in Statistics from Cairo University in 2009, and philosophy of doctorate in statistics from Cairo University in 2013, respectively. She is currently working as an Associative Professor at the Department of Mathematical Statistics, Faculty of Graduate Studies for Statistical Research, Cairo University. Her research areas include Reliability, life testing, bivariate and multivariate analysis, copula modeling and ranked set sampling. She has been serving as a reviewer for many highly-respected journals.

Essam Abd Elsalam Muhammed received a bachelor’s degree in applied Statistics from the Faculty of Commerce at kafr El-sheikh University in 2015, a master’s degree in Statistics from Cairo University in 2020, and in the preparatory year of the doctorate in statistics at Cairo University, respectively. He is currently working as a teaching assistant at High Institute of Computer and Information Technology, Elshorouk Academy. his research areas include Reliability and life testing.

Journal of Reliability and Statistical Studies, Vol. 14, Issue 2 (2021), 615–650.
doi: 10.13052/jrss0974-8024.14212
© 2021 River Publishers