Inverted Topp-Leone Distribution: Contribution to a Family of J-Shaped Frequency Functions in Presence of Random Censoring
Hiba Zeyada Muhammed1,* and Essam Abd Elsalam Muhammed2
1Department of Mathematical Statistics, Faculty of Graduate Studies for Statistical Research, Cairo University, Egypt
2Department of Management and Financial, High Institute of Computer and Information Technology, Elshorouk Academy, Egypt
E-mail: hiba_stat@cu.edu.eg; essamabdelsalam16@gmail.com
*Corresponding Author
Received 31 July 2021; Accepted 11 November 2021; Publication 06 December 2021
In this paper, Bayesian and non-Bayesian estimation of the inverted Topp-Leone distribution shape parameter are studied when the sample is complete and random censored. The maximum likelihood estimator (MLE) and Bayes estimator of the unknown parameter are proposed. The Bayes estimates (BEs) have been computed based on the squared error loss (SEL) function and using Markov Chain Monte Carlo (MCMC) techniques. The asymptotic, bootstrap (p,t), and highest posterior density intervals are computed. The Metropolis Hasting algorithm is proposed for Bayes estimates. Monte Carlo simulation is performed to compare the performances of the proposed methods and one real data set has been analyzed for illustrative purposes.
Keywords: Inverted Topp leone distribution, moments, order statistic, maximum likelihood estimation, Bayesian estimation, MCMC, highest posterior density interval, asymptotic confidence interval, bootstrap confidence interval, random censoring.
The Topp-Leone (TL) distribution was originally proposed by [1] as an alternative to beta distribution and it has applied for some family data. In recent years, the TL distribution has received huge attention in the literature; see for example; [2] showed that the TL distribution exhibits bathtub failure rate function with widespread applications in reliability. Moreover, [3] showed that TL distribution possesses some attractive reliability properties such as bath tub-shape hazard rate, decreasing reversed hazard rate, upside-down mean residual life, and increasing expected inactivity time. Recently, [4] derived admissible minimax estimates for the shape parameter of the TL distribution under squared and linear-exponential loss functions. Recently, [5] introduced the inverted Topp-Leone (IVT) distribution as a J-shaped distribution. Which is useful for modeling lifetime Phenomena and she studied more of its properties.
In this paper, we study classical and Bayesian estimation for the shape parameter of the IVT distribution when the sample is complete and random censored.
The random censoring can be expressed as follows: When the units under test lose or remove from the test before its failure this data is called random censoring. To more illustrate, in clinical trials or medical tests, some patients retreat or leave the test before finishing it.
The work in this paper is organized as follows: In Section 2 we introduce the IVT distribution. The maximum likelihood estimator (MLE) of the unknown parameter, the Bayes estimator, and the confidence interval based on complete data will be introduced in Section 3, in Section 4 we introduce the maximum likelihood estimators (MLEs) of the unknown parameters, the Bayes estimators and the confidence intervals based on random censoring data. Finally, the paper is concluded in Section 5.
The Topp Leone distribution is defined with the following pdf and cdf respectively,
(1) |
And
(2) |
For and
Assume the pdf and cdf of are given respectively, as
(3) |
And
(4) |
For and
In this case, the distribution of X is called inverted Topp-Leone (IVT) distribution denoted by . It can be showed that the pdf (3) satisfies the following generalized Pearson system of differential equation
where , , , , , and .
The IVT distribution may be considered as a J- shaped because , and for some values . And it can be noted that from Figure 1. Also, Figure 2 shows the cdf of IVT distribution for different values for the parameter .
The mode of the is given by
The quantiles of the distribution is given by
The median is a special case from the quantile function, when
And the inter-quartile range (IQR) is given as
The moment about origin is given by the following theorem.
Theorem 1: the -moment about zero is given by
(5) |
For and
Where
The survival function for the failure time X follows IVT distribution is defined as
(6) |
For fixed x means the probability of survival up to time x. Figure 3 shows the survival function for the IVT distribution with different parameter values.
Moreover, for the IVT distribution, the hazard function is easily obtained as
(7) |
and it has different shapes according to the values of the parameter as shown in Figure 4.
The reversed hazard function for the IVT distribution is given as follows
(8) |
and it has different shapes as shown in Figure 5 according to to the variability in the parameter .
Let be independent and identically distributed random variables drawn from IVT distribution. Let be the order statistic, then the pdf of is defined as
Where , and .
Special cases for and are respectively considered as
and
The joint pdf of and , , for a sample of size n
Where and .
In the following theorems, the moments and product moments about origin will be introduced
Theorem 2: the -moment about zero of order statistic is given by
(9) |
For and ,
Where
and is the beta function.
The moment of is given as
The moment of is given as
Theorem 3: the and -product moments about zero of and are given by
(10) |
For , and .
Where as given in (9).
Suppose that is a simple random sample of size n drawn from . In this section, the shape parameter of the IVT distribution will be estimated using the MLE as follows.
The likelihood function is given by
(11) |
The natural logarithm of the likelihood function is given as
After differentiating the and equating to zero the MLE for can be expressed in closed form as follows
In this section, we have discussed the Bayesian estimation procedure for the parameter of the IVT distribution and we get the Bayesian estimate of the unknown parameter under the squared error loss (SEL) function. We assume that the unknown parameter of the IVT distribution have gamma prior distribution and can be written with proportional as follows;
(12) |
Hyper-parameters determination: The hyper-parameters involved in priors (12) can be easily evaluated if we consider that prior mean and prior variance are known. The prior mean and prior variance will be obtained from the maximum likelihood estimate of by equating the mean and variance of with the mean and variance of the considered priors (gamma prior), where and k is the number of random samples generated from the model in Section 3.1. Thus, on equating mean and variance of with the mean and variance of gamma priors, we get ([6])
Now, on solving the above two equations, the estimated hyper-parameters can be written as
Based on the likelihood function (11) and the gamma prior (12), the joint posterior density function of given the data can be written as
Then, the joint posterior density function can be written as
(13) |
Where,
Thus, the Bayes estimate of based on SEL function is given by
(14) |
It should be noted that the ratio of integral in (3.2) cannot be obtained in closed forms. So, we use the MCMC approximation method to generate samples from (13) and to calculate the BE of and also to construct associated HPD intervals.
Markov Chain Monte Carlo (MCMC) is considered to be a computer-driven sampling technique. It permits one to characterize a distribution without knowing all of the distribution mathematical properties by random sampling values out of the distribution ([7]). We use Metropolis Hasting (M-H) method with normal proposal distribution to generate random numbers from (13). Thus, we perform the following steps of the M-H algorithm to draw samples from the posterior density (13) and in turn compute the Bayes estimate of and construct the corresponding HPD intervals ([8]):
I. Set initial values M burn-in.
II. For i 1,…,N, repeat the following steps.
∙ Set .
∙ Generate new candidate parameter values from
∙ Set .
∙ Calculate
∙ Update with probability A; otherwise set
The approximate Bayes estimate of , ,N with respect to SEL function is given by
Where is Bayes estimate under SEL function and M is the burn-in-period (that is, a number of iterations before the stationary distribution is achieved).
In this section, we propose different confidence intervals. One is based on the asymptotic distribution of , two different bootstrap confidence intervals, and finally, HPD intervals.
The second derivative for is trivially obtained as
The observed Fisher information matrix is given by
The asymptotic variance of is
The sampling distribution of can be approximated by a standard normal distribution.
The large sample confidence interval for is given by
where is the standard normal random variable and is the confidence coefficient.
The bootstrap confidence intervals are approximate confidence intervals but in general are better approximate than standard intervals. A parametric bootstrap interval provides much more information about the population value of the quantity of interest than does a point estimate. The parametric bootstrap methods are of two types: –
(i) The percentile bootstrap method (Boot-p) was proposed by [9].
(ii) The Bootstrap-t method (Boot-t) was proposed by [10].
The boot-p method is rather simple and constructs confidence intervals directly from the percentiles of the bootstrap distribution of the estimated parameters. It is given by the following steps:
I. A complete sample is generated from the original data and the MLE of the parameter is computed.
II. Again, an independent complete bootstrap sample is generated by using .
III. Now, compute the bootstrap MLE of parameter based on , as in step-1.
IV. Repeat steps 2–3, B times representing B bootstrap MLE’s ’s based on B different bootstrap samples, i 1,2,… B.
V. Arrange all ’s in an ascending order to obtain the bootstrap sample i.e. . An approximate boot-p confidence interval for is obtained by .
Where, is the quantity that helps to determine the bootstrap point.
The bootstrap-t confidence interval is given by the following steps:
I. Steps 1 and 2 of boot-p and boot-t methods are the same.
II. Compute the bootstrap-t statistic for where b 1, 2,…B.
III. To obtain a set of bootstrap statistics repeat steps 2–3, B times.
IV. Let be the ordered values of ; .
V. Now, the approximate boot-t confidence interval for parameter is obtained by
The HPD intervals for the unknown parameters can be constructed by using the following algorithm: let be the corresponding ordered MCMC sample, to construct the HPD interval, let be the j smallest of and denote , where For be the HPD intervals then the best HPD interval that has the smallest interval width from . So, we can say , be HPD interval for the unknown parameters have the smallest interval width among all s. Where is chosen so that
Where , and then the HPD intervals for the unknown parameters can be constructed.
A simulation study was carried to check the performance of the accuracy of point and interval estimates for several cases, for which estimate the one parameter of IVT distribution () for the number of replications , for different sample sizes (n) as , 50, 80, 100 and different parameters values. All the computations are performed using statistical software R.
The simulations results for MLE are summarized in Tables 2, 3, 4, and 5 and obtained by the following steps:
i. Specify initial values for parameter () as (0.5), (0.8), (1.2) and (1.9).
ii. Specify the sample size n. as .
iii. Generate n standard uniform variates i.e. .
iv. Generated complete samples of size n from IVT () distribution by using the formula
v. Obtain the maximum likelihood estimates (MLEs).
vi. Obtain the mean, bias, mean squared error (MSE), asymptotic and bootstrap confidence intervals (CI’s) for the unknown parameters, average interval lengths (AILs), and coverage probability (CP) for the different sample size.
vii. Repeat steps 1–5 1000 times.
And the simulation results for the Bayesian estimate are summarized in Tables 2, 3, 4, and 5 which are obtained by the following steps:
i. Step I, ii, iii, iv, and v of the MLE simulation are the same
ii. By using the M-H algorithm shown in Section 3.2 under the informative prior and the non-informative prior and repeat the chain N times (N 10000) to obtain MCMC samples.
∙ For informative prior, we compute the hyperparameters for all simulation cases as in Table 1.
∙ For non-informative prior (P-II) we assume that hyper-parameter values are .
iii. Compute the approximate Bayes estimator of under SEL function is given by
Where M (2000) is the burn-in-period (that is, a number of iterations before the stationary distribution is achieved).
Repeat step i–iii (1000) times to obtain the mean, bias, mean squared error (MSE), HPD intervals for the unknown parameters, average interval lengths (AILs), and coverage probability (CP) for the different sample sizes.
Table 1 The hyper parameters values under complete data
Initial Values | |||||
Hyper-Parameters | |||||
30 | a1 | 23.97 | 23.95 | 23.94 | 23.88 |
b1 | 45.85 | 28.75 | 19.21 | 12.00 | |
50 | a1 | 48.92 | 48.93 | 48.99 | 48.97 |
b1 | 95.59 | 59.47 | 40.02 | 25.21 | |
80 | a1 | 78.91 | 78.86 | 79.05 | 79.04 |
b1 | 155.47 | 97.51 | 64.86 | 41.17 | |
100 | a1 | 98.90 | 98.97 | 99.06 | 98.90 |
b1 | 195.76 | 123.19 | 82.52 | 51.58 |
Table 2 Average estimated values, MSEs, bias, asymptotic and bootstrap (t-p) CI intervals of MLEs and BEs of IVT distribution parameters under complete data
Table 3 Average estimated values, MSEs, bias, asymptotic and bootstrap (t-p) CI intervals of MLEs and BEs of IVT distribution parameters under complete data
Table 4 Average estimated values, MSEs, bias, asymptotic and bootstrap (t-p) CI intervals of MLEs and BEs of IVT distribution parameters under complete data
From tabulated values in Tables 2, 3, 4, and 5 it can be noticed that:
i. As expected, with respect to MSEs, higher values of n lead to better estimates.
ii. It is also noticed that the maximum likelihood estimates compete well with non-informative Bayes estimates, and the performance of the Bayes estimates obtained under informative prior is better than the non-informative Bayes estimates.
iii. It can also be noticed that under informative prior the AILs and associated CPs of HPD intervals are better than those of non-informative priors, bootstrap (p, t), and asymptotic confidence intervals respectively.
Table 5 Average estimated values, MSEs, bias, asymptotic and bootstrap (t-p) CI intervals of MLEs and BEs of IVT distribution parameters under complete data
In this section, the IVT distribution will be fitted to a real data set, to show how the IVT distribution can be applied in practice. moreover the IVT distribution will also compare with other inverted distributions that are fitted this data such as: inverse exponential (IE), inverse Rayleigh (IR), inverse Lindley(IL). And they will be introduced below as
The cdf, pdf of the inverse exponential (IE) distribution are respectively as
The cdf, pdf of the inverse Rayleigh (IR) distribution are respectively as:
The cdf, pdf of the inverse Lindley (IL) distribution are respectively as
The data set consists of 100 observations of breaking stress of carbon fibers in (Gba) which are listed as follows:
1.061, 1.117, 1.162, 1.183, 1.187, 1.192, 1.196, 1.213, 1.215, 1.219, 1.220, 1.224, 1.225, 1.228, 1.237, 1.240, 1.244, 1.259, 1.261, 1.263, 1.276, 1.310, 1.321, 1.329, 1.331, 1.337, 1.351, 1.359, 1.388, 1.408, 1.449, 1.449, 1.450, 1.459, 1.471, 1.475, 1.477, 1.480, 1.489, 1.501, 1.507, 1.515, 1.530,1.530, 1.533, 1.544, 1.544, 1.552, 1.556, 1.562, 1.566, 1.585, 1.586, 1.599, 1.602, 1.614, 1.616, 1.617, 1.628, 1.684, 1.711, 1.718, 1.733, 1.738, 1.743, 1.759, 1.777, 1.794, 1.799, 1.806, 1.814, 1.816, 1.828, 1.830, 1.884, 1.892, 1.944, 1.972, 1.984, 1.987, 2.020, 2.030, 2.029, 2.035, 2.037, 2.043, 2.046, 2.059, 2.111, 2.165, 2.686, 2.778, 2.972, 3.504, 3.863, 5.306
Figure 6 shows that the empirical date compared by the inverted distributions namely IVT, IE, IR and IL.
Table 6 MLEs, AIC, BIC, AICC and HQIC values, and Kolmogorov-Smirnov statistics for carbon fibers data
Measures | ||||||||
Model | MLE | p-value | K-S | -2log L | AIC | BIC | AICc | HQIC |
IVT | 5.6313 | 0.1197 | 0.1211 | 90.3201 | 92.3201 | 94.8844 | 90.3617 | 93.3566 |
IE | 1.5680 | 0.0000 | 0.4528 | 290.4326 | 292.4326 | 294.9969 | 290.4742 | 293.4691 |
IR | 1.5293 | 0.0000 | 0.3407 | 173.0542 | 175.0542 | 177.6185 | 173.0958 | 176.0907 |
IL | 2.0773 | 0.0000 | 0.4350 | 280.3121 | 282.3121 | 284.8765 | 280.3538 | 283.3487 |
The random censoring can be described as follows: if we have n units under test, Let their lifetime is which are independent and identically distributed (iid) random variables with pdf and cdf , their random censoring times are which are iid with pdf and cdf , assume and be mutually independent. Note that, between and , only one will be observed. Further, let the actual observed time be , and the indicator variable are defined as
(15) |
The censored data () is known as the random censoring samples. The likelihood function under random censoring is given by [11]
(16) |
Where, and .
In this section, we obtain the MLEs for the unknown parameters of the IVT distribution. Let the lifetime T and censoring time C follow IVT and IVT respectively. Then the likelihood function for the unknown parameters under random censoring becomes:
(17) |
where is the observed number of uncensored lifetimes or failures.
Then, the corresponding log-likelihood function can be written as
(18) |
Differentiating (18) with respect to and gets:
(19) | ||
(20) |
Equating the first derivatives in (19) and (20) to zero and solving for and to get the MLEs and of and , respectively in closed form as follows
and
In this section, we have discussed the Bayesian estimation procedure for the parameters of the IVT distribution based random censoring samples and we get BEs of the unknown parameters under the squared error loss (SEL) function. We assume that the unknown parameter of the IVT distribution has the independent gamma prior and can be written with proportional as follows;
(21) | ||
(22) |
Therefore, the joint prior density of and can be written with proportional as follows:
(23) |
Hyper-parameters determination: As in Section 3.2, the hyper-parameters involved in priors (21) and (22) can be easily evaluated, if we consider that prior mean and prior variance are known. The prior mean and prior variance will be obtained from the maximum likelihood estimates of by equating the mean and variance of with the mean and variance of the considered priors (gamma prior), where and k is the number of random samples generated from the model in Section 4.2. Thus, on equating mean and variance of with the mean and variance of gamma priors, we get
Now, on solving the above two equations, the estimated hyper-parameters can be written as
A similar procedure for determining the hyperparameters () can be used for .
Based on the likelihood function (4.2) and the joint prior density (23), the joint posterior density and given the data can be written as
Then, the joint posterior function can be written as
(24) |
Where,
Thus, the Bayes estimate of based on SEL function is given by
(25) |
It should be noted that the ratio of integral in (4.3) cannot be obtained in closed forms. So, we use the MCMC approximation method to generate samples from (24) and to calculate the BEs of and and also to construct associated HPD intervals. where we use the M-H method with normal proposal distribution.
In this section, we propose different confidence intervals. One is based on the asymptotic distribution of and , two different bootstrap confidence intervals and finally, HPD intervals.
The asymptotic variance-covariance matrix of the MLEs of and can be obtained by inverting the observed information matrix, and is given as follows:
Where and . The elements of the observed information matrix for and are given as follows:
Then the observed fisher information
The asymptotic variance of is
and
The asymptotic variance of is
The sampling distribution of where , can be approximated by a standard normal distribution. The large confidence intervals for and are given by
As in Section 3.3, two types of parametric bootstrap methods are considered
(iii) Percentile bootstrap method (Boot-p)
(iv) Bootstrap-t method (Boot-t)
It given by the following steps:
i. A randomly censored sample is generated from the original data and the MLE , of the parameter is computed.
ii. Again, an independent randomly censored bootstrap sample is generated by using .
iii. Now, compute the bootstrap MLE of parameter based on , as in step-1.
iv. Repeat steps 2–3, B times representing B bootstrap MLE’s ’s based on B different bootstrap samples, i 1,2,… B.
v. Arrange all ’s in an ascending order to obtain the bootstrap sample i.e. . An approximate boot-p confidence interval for is obtained by .
Where, is the quantity that helps to determine the bootstrap point.
The bootstrap-t confidence interval is given by the following steps:
i. Steps 1 and 2 of boot-p and boot-t methods are the same.
ii. Compute the bootstrap-t statistic for where b 1, 2,… B.
iii. To obtain a set of bootstrap statistics repeat steps 2–3, B times.
iv. Let be the ordered values of .
v. Now, the approximate boot-t confidence interval for parameter is obtained by
As in Section 3.3, the HPD intervals for the unknown parameters can be constructed. let be the corresponding ordered MCMC sample, to construct the HPD interval, let be the j smallest of and denote , where . For be the HPD intervals then the best HPD interval that has the smallest interval width from ’s. So, we can say , be HPD interval for the unknown parameters have the smallest interval width among all ’s. Where is chosen so that
Where , and then the HPD intervals for the unknown parameters can be constructed.
A simulation study was carried to check the performance of the accuracy of point and interval estimates for several cases, for which estimate the two parameters of IVT distribution ( and ) for number of replications , for different sample sizes (n) as , 50, 80, 100 and different parameters values. All the computations are performed using statistical software R.
The simulations results for MLEs are summarized in Tables 8, 9, 10, and 11 and obtained by the following steps:
i. Specify initial values for parameters ( and ) as (0.5, 0.3), (0.8, 0.9), (1.2, 1) and (1.9, 1.5).
ii. Specify the sample size n. as
iii. Generate n standard uniform variates i.e. .
iv. Generated samples of size n from IVT () distribution (lifetimes) and IVT (censoring times) distribution by using the formula
v. Calculate the times and the censorship indicators , which are equal to 1 if and 0 otherwise.
vi. Obtain the maximum likelihood estimates (MLEs).
vii. Obtain the mean, bias, mean squared error (MSE), asymptotic and bootstrap confidence intervals (CI’s) for the unknown parameters, average interval lengths (AILs), and coverage probability (CP) for the different sample size.
viii. Repeat steps 1–5 1000 times.
And the simulation results for Bayesian estimates are summarized in Tables 8, 9, 10, and 11 which are obtained by the following steps:
iv. Step I, ii, iii, iv, and v of the MLEs simulation are the same
v. By using the M-H algorithm shown in Section 4.2 under the informative prior and the non-informative prior and repeat the chain N times (N 10000) to obtain MCMC samples.
∙ For informative prior, we compute the hyperparameters for all simulation cases as in Table 7.
∙ For non-informative prior (P-II) we assume that hyper-parameter values are .
vi. Compute the approximate Bayes estimator of under SEL is given by
Where M (2000) is the burn-in-period (that is, a number of iterations before the stationary distribution is achieved).
vii. Repeat step i–iii 1000 times to obtain the mean, bias, mean squared error (MSE), HPD intervals for the unknown parameters, average interval lengths (AILs), and coverage probability (CP) for the different sample size.
Table 7 The Hyper Parameters Values under random censoring data
Initial Values | |||||
, | , | , | , | ||
Hyper-Parameters | |||||
30 | a1 | 18.0 | 13.61 | 15.74 | 16.23 |
a2 | 10.9 | 15.28 | 13.28 | 12.73 | |
b1 | 34.9 | 16.49 | 8.23 | 8.23 | |
b2 | 34.7 | 16.34 | 8.18 | 8.18 | |
50 | a1 | 30.7 | 22.81 | 22.81 | 27.64 |
a2 | 18.3 | 26.12 | 26.12 | 21.41 | |
b1 | 60.2 | 27.87 | 14.15 | 14.15 | |
b2 | 60.2 | 27.86 | 14.17 | 14.17 | |
80 | a1 | 49.6 | 36.91 | 42.88 | 43.81 |
a2 | 29.5 | 42.19 | 36.21 | 35.25 | |
b1 | 97.8 | 45.93 | 22.98 | 22.98 | |
b2 | 97.8 | 45.90 | 22.97 | 22.97 | |
100 | a1 | 61.3 | 46.57 | 54.36 | 55.37 |
a2 | 37.7 | 52.37 | 44.60 | 43.60 | |
b1 | 121.7 | 57.66 | 28.86 | 28.86 | |
b2 | 121.7 | 57.61 | 28.84 | 28.84 |
Table 8 Average estimated values, MSEs, bias, asymptotic and bootstrap (t-p) CI intervals of MLEs and BEs of IVT distribution parameters under random censoring data
Table 9 Average estimated values, MSEs, bias, asymptotic and bootstrap (t-p) CI intervals of MLEs and BEs of IVT distribution parameters under random censoring data
Table 10 Average estimated values, MSEs, bias, asymptotic and bootstrap (t-p) CI intervals of MLEs and BEs of IVT distribution parameters under random censoring data
Table 11 Average estimated values, MSEs, bias, asymptotic and bootstrap (t-p) CI intervals of MLEs and BEs of IVT distribution parameters under random censoring data
From the results in Tables 8–11 the following conclusion can be made:
i. Similar to the complete case based on MSEs, higher values of lead to better estimates.
ii. The CPs of the MLEs are better than those of the CPs of Bayes estimates obtained under informative prior and the non-informative Bayes estimates, respectively.
iii. The MSEs of the MLEs are less than the BEs under the SEL function.
iv. It can be noticed that under informative prior the AILs and of HPD intervals are better than those of non-informative priors, Bootstrap (t – p), and MLEs.
v. Estimates obtained by the MLEs and BEs are almost unbiased.
In this section, the IVT distribution will be fitted to a real data set, to show how the IVT distribution can be applied in practice. These data are taken from a lung cancer study described by [12]. These data show remission times (in days) of a group of 15 patients. The data set is given as: (8, 10, 11, 25*, 42, 72, 82, 100*, 110, 118, 126, 144, 228, 314, 411). The observations with (*) sign the censored times. For this data set, the unknown parameter () of the IVT distribution will be estimated by the maximum-likelihood method, and with this, the estimate (MLE), the values of the Kolmogorov-Smirnov (KS) statistic (the distance between the empirical CDFs and the fitted CDFs), Akaike information criterion (AIC ), Bayesian information criterion (BIC) and Hannan-Quinn information criterion (HQIC) are calculated. These results are summarized in Table 12:
Table 12 The Values of Goodness of Fit Test for Lung Cancer Data Set to the IVT distribution
k-s | |||||||
Distribution | 2log L | AIC | BIC | HQIC | D-statistics | p-value | |
IVT | 0.277 | 299.65 | 301.7 | 302.2 | 301.5 | 0.3408 | 0.0751 |
IVT* | 0.309 | 42.71 | 44.71 | 43.4 | 41.98 | 0.5452 | 0.4137 |
Note: (*) indicates the censoring times’ distribution. |
From Table 12, the null hypothesis is not rejected, these lung cancer data may be modeled by the IVT distribution.
Moreover, MLE and Bayesian estimation methods are applied for estimating the model unknown parameter. For calculation of BEs, the hyper-parameters and are chosen such that the expected value of is 0.2437 with a variance giving and , the expected value of is 0.0375 with a variance giving and . these results are listed in Table 13.
The empirical distribution for lifetimes and for censoring times for the lung cancer data are represented in Figures 7 and 8 respectively.
Table 13 The MLEs and BEs of the parameters from lung cancer data set
BEs Under SEL Function | Confidence Intervals | |||||
AILs (HPD Interval) | ||||||
Parameter | MLEs | P-I | P-II | AILs (Asy CI) | P-I | P-II |
0.2437 | 0.2355 | 0.1761 | 0.2672 (0.1341,0.4012) | 0.1284 (0.881,0.3165) | 0.0533 (0.1494,0.2027) | |
0.0375 | 0.0344 | 0.0321 | 0.1083 (0.0074,0.1158) | 0.1083 (0.0232,0.0476) | 0.0108 (0.027,0.0378) | |
Note: AILs- Average interval lengths. |
Furthermore, the inverted distributions defined in Section 3.5 can be used to fit this data also with numerical results listed in Table 14 and Figure 9.
Table 14 MLEs, AIC, BIC AICC and HQIC values, and Kolmogorov- Smirnov statistics for Lung Cancer Data lifetimes
Measures | ||||||||
Model | MLE | p-value | K-S | 2log L | AIC | BIC | AICc | HQIC |
IL | 32.725 | 0.1052 | 0.3013 | 195.5445 | 181.3550 | 182.0630 | 179.6216 | 181.3474 |
IE | 36.90926 | 0.1051 | 0.3014 | 179.3469 | 181.3469 | 182.0550 | 179.6136 | 181.3394 |
IR | 0.824718 | 0.0000 | 0.5911 | 179.3550 | 211.8932 | 212.6012 | 210.1599 | 211.8857 |
In this paper, we have obtained the maximum likelihood estimates and Bayes estimates for the unknown parameter of the IVT distribution based on complete and random censoring data, the confidence intervals, HPD intervals, and bootstrap (p-t) intervals are also obtained. We perform some simulations to see the performances of the MLEs and BEs incomplete and random censoring data. One real data set has been re-analyzed based on random censoring data.
[1] Topp, C.W. and Leone, F.C. (1955). A family of J-shaped frequency functions, Journal of the American Statistical Association, 50, 209–219.
[2] Nadarajah, S. and Kotz, S. (2003). Moments of some J-shaped distributions, Journal of Applied Statistics, 30, 311–317.
[3] Ghitany, M.E., Kotz, S. and Xie, M. (2005). On some reliability measures and their stochastic ordering for the Topp–Leone distribution, Journal of Applied Statistics, 32, 715–722.
[4] Bayoud, H. (2016). Admissible minimax estimators for the shape parameter of Topp–Leone distribution, Communications in Statistics-Theory and Methods, doi: 10.1080/03610926.2013.818700.
[5] Muhammed, H.Z. (2019). On The Inverted Topp-Leone Distribution, international journal of reliability and applications, 20, 17–28.
[6] Dey, S., Singh, S., Tripathi, Y.M. and Asgharzadeh, A. (2016). Estimation and prediction for a progressively censored generalized inverted exponential distribution. Statistical Methodology, 132, 185–202.
[7] Ravenzwaaij, D.V., Cassey, P. and Brown, S.D. (2018). A simple introduction to Markov Chain Monte-Carlo sampling. Psychonomic Bulletin Review, 25, 143–154.
[8] Dey, S. and Pradhan, B. (2014). Generalized inverted exponential distribution under hybrid censoring. Statistical Methodology, 18, 101–114.
[9] Efron, B., and Tibshirani, R. J. (1993). An Introduction to the Bootstrap. New York: Chapman and Hall.
[10] Hall, P. (1988). Theoretical comparison of bootstrap confidence intervals. The Annals of Statistics, 927–953.
[11] Lawless, J. F. (2011). Statistical Models And Methods for Lifetime Data, Second edition. John Wiley & Sons, Inc, Canada.
[12] Kalbfleisch, J. D. and Prentice, R. L. (1980). The Statistical Analysis of Failure Time Data. New York: Wiley.
Hiba Zeyada Muhammed received a bachelor’s degree in Statistics from the Faculty of Science at Cairo University in 2006, a master’s degree in Statistics from Cairo University in 2009, and philosophy of doctorate in statistics from Cairo University in 2013, respectively. She is currently working as an Associative Professor at the Department of Mathematical Statistics, Faculty of Graduate Studies for Statistical Research, Cairo University. Her research areas include Reliability, life testing, bivariate and multivariate analysis, copula modeling and ranked set sampling. She has been serving as a reviewer for many highly-respected journals.
Essam Abd Elsalam Muhammed received a bachelor’s degree in applied Statistics from the Faculty of Commerce at kafr El-sheikh University in 2015, a master’s degree in Statistics from Cairo University in 2020, and in the preparatory year of the doctorate in statistics at Cairo University, respectively. He is currently working as a teaching assistant at High Institute of Computer and Information Technology, Elshorouk Academy. his research areas include Reliability and life testing.
Journal of Reliability and Statistical Studies, Vol. 14, Issue 2 (2021), 615–650.
doi: 10.13052/jrss0974-8024.14212
© 2021 River Publishers
2 Inverted Topp-Leone Distribution
2.1 Distributions and Moments of Order Statistics from IVT Distribution
3 Estimation Based on Complete Samples for IVT Distribution Shape Parameter
3.1 Maximum Likelihood Estimation
3.3 Interval Estimation for IVT Distribution Shape Parameter
The Asymptotic Confidence Interval
Bootstrap Confidence Intervals
Percentile Bootstrap (Boot-P) Confidence Interval
Bootstrap-t (Boot-t) Confidence Intervals
Highest Posterior Density (HPD) Intervals
3.5 Application to Real Data Set
4 Estimation Based on Random Censored Samples for IVT Distribution Shape Parameter
4.1 Model Assumption and Description
4.2 Maximum Likelihood Estimation
4.3 Bayes Estimation for IVT Shape Parameter
4.4 Interval Estimation Based on Random Censoring Samples
Asymptotic Confidence Intervals
Bootstrap Confidence Intervals
Percentile Bootstrap (Boot-P) Confidence Interval
Bootstrap-t (Boot-t) Confidence Intervals