Statistical Inference for Multi State Systems under the Generalized Modified Weibull Class
Andreas Makrides
Laboratory of Statistics and Data Analysis, Department of Statistics and Actuarial-Financial Mathematics, University of the Aegean, Greece
E-mail: amakridis@aegean.gr
Received 11 October 2021; Accepted 01 May 2022; Publication 22 July 2022
Multi state systems can be seen as semi-Markov processes by considering an arbitrary distribution function for sojourn times. Especially, in this work, the Modified Weibull distribution is employed to be the distribution of sojourn times with a shape parameter such that is member of a distributions family that is closed under minima. Parameters estimators are provided and the proposed methodology is evaluated using a detailed simulation procedure.
Keywords: Multi-state system, semi-Markov processes, H-class of distributions, Modified Weibull distribution, parameter estimation.
There is a great interest in developing generalized families of distributions using a function corresponding to a classical distribution, as the baseline (parent) distribution for the generalization. Such generalizations are quite popular primarily because some phenomena cannot be satisfactorily described by classical distributions, a defect that can be resolved if additional complexity is introduced into the parent distribution. Indeed, it is not uncommon that the tail and skewed behavior cannot be easily captured which affects the accuracy in terms of both description and prediction. It is the class of such families of distributions that improves the goodness-of-fit and consequently the overall modelling process.
One of the first such families introduced by [2] is the Gompertz-Verhulst family which itself belongs to the so called exponentiated family and is used among others, for the analysis of the growth curve mortality. Following this first family, several other families were proposed like the skewed family [1], the Marshall-Olkin extended (MOE) family [13], the Beta G family [9], the Gamma generated (GG) family [16] and the exponentiated exponential – Poisson family [15] just to mention a few. In this work we focus on the 4-parameter Modified Weibull Poisson (MWP) distribution [8] for developing a new general H-class of distributions with MWP as the baseline parent distribution.
Consider the Modified Weibull distribution introduced in [10] defined by
(1) |
and
(2) |
If is a random variable with zero-truncated Poisson mass distribution with parameter then the conditional distribution of the minimum ordered statistic of a random sample from (1) given , is given by
(3) |
Summing over all values of we obtain the marginal distribution given below
(4) |
The above distribution is known as the Modified Weibull Poisson (MWP) distribution with cumulative distribution function
(5) |
Under the assumption that the parameter is such that the term a general class of distributions with MWP as a baseline distribution can be considered by using a parent continuous distribution function, say . Hence, based on an arbitrary parent distribution we introduce a family of distributions with Modified Weibull Poisson as the baseline distribution, defined by
(6) |
where is the shape parameter of the proposed class. Note that additional distributional parameters associated with the parent distribution may also be involved in (6).
The present work concentrates on the family in (6) using a parent continuous distribution function and discuss some of its properties. Parameter estimators for (6) are provided, under a multi state system (see [12]), that are assumed to not be constant over time evolution. Asymptotic results regarding the proposed parameter estimates are also provided. The performance of the proposed methodology is investigated by simulated results.
The manuscript is structured in 6 sections. The second section establishes a family of distributions with the Modified Weibull distribution as the baseline distribution. In the third section we discuss the semi-Markov setting that is used in order to estimate, in Section 4, the parameters involved. The semi-Markov transition matrix, in addition with some reliability indices are established in Section 5. Finally, the accuracy of the proposed methodology is evaluating in Section 6.
Let us define the general family of distribution functions with shape parameter given by
(7) |
which meets the conditions according to the Lebesgue measure, with pdf . Typical members of the family are classical distributions like the exponential and Weibull. The main feature of the family (7) is that the cdf of the minimum ordered statistic of a random sample from (7) falls into the same family (see [3]; [4]).
Observe that clearly the MWP distribution given in (5) is a member of the class (7). In what follows we introduce a generalized cfamily using an arbitrary function , with the MWP distribution as the parent (baseline) distribution function, defined in (6).
Observe that the proposed generalized H-class of distributions consists of distributions (each having a different H function) falling within the class (7) and each of which is based on the parent MWP distribution which is also a member of (7).Thus, focusing on a single member of (7) we create a generalized family of distributions by adding extra complexity into the baseline MWP distirbution and at the same time staying within the class (7).
Remark 1 It is remarkable that the exponential distribution is obtained when the function is the identity.
Observe that the H-class in (6) generates a family of distributions which extends greatly the applicability of the Modified Weibull Poisson distribution covering among others, classical problems in engineering, reliability and safety as well as in any other field where the time-to-event is of primary interest.
Assume that has pdf denoted by . It is easy to see that the density function of a typical member of the H-class (6) is
(8) |
where the pdf associated with .
Recall that the baseline distribution of the (6) is the MWP distribution given in (5) with parameters and which is denoted by MWP() and it is obtained if in (6) we take .
Taking the Weibull distribution as a parent distribution (i.e. setting in the baseline distribution), we have the Weibull Poisson distribution
(9) |
and
(10) |
Observe that the Exponential Poisson distribution is obtained if i.e. if we take and in the baseline distribution.
As expected, irrespectively of the parent distribution, the resulting distribution is a member of the H-class of distributions given in (6). The result is summarized below:
Proposition 1 For any specific parent continuous distribution the resulting creates a new class of distributions like (6).
Proposition 2 Assume that the cdf of the r.v. falls into the class (6). Then, the reliability function is equal to
(11) |
and the instantaneous failure rate defined as
(12) |
The result is immediate from the definitions of the reliability and hazard functions and the expressions (6) and (8).
In this section we establish that the H-class in (6) is closed under minima which is a significant property which plays a key role in the statistical inference of the multi-state setting of the next section. More precisely, the above property is important for establishing the expressions for the quantities of interest of the SMM (see Proposition 3). Although it is not a necessary condition, it provides the ability to obtain a closed form for the expressions of the main characteristics of the proposed model.
Theorem 1 If are i.i.d. r.v’s from (6), then the cdf of satisfies the property (7).
For the required cdf we can easily see that
which belongs to H-class in (6) with shape parameter .
For the Weibull distribution which ibelongs to the above family, the cdf of the minimum becomes
(13) |
Remark 2 The results of this section can be generalized by dropping the assumption of identically distributed random variables. Indeed, if one considers the case of independent random variables which though are not necessarily identically distributed (inid) and assumes a random sample with the cdf of , being given by
(14) |
then, Theorem 1 still holds with belonging to the H-class (6) with parameter , that is
(15) |
In the following section we focus on the inid case for multi-state systems with (finite) number of states and sojourn times (the time spend on state before moving to state ) having a cdf belonging to the family (6) with shape parameter , .
Let the semi-Markov process (SMP) where
• (cf. [11]) is the state space,
• represent the jump times,
• are the visited states
• are the successive sojourn times with and
• (16)
is the process that counts the jumps in .
Observe that is equivalent to
It is easily showed that is a Markov chain (MC).
Under the assumption that SMP is ergodic (interested readers are referred to [11]), the main features of the model are the initial law
and the semi-Markov kernel
Define also by
the associated the transition probabilities while the conditional sojourn time distribution functions are given below:
Let us consider some random variables with c.d.f. A specific system is considered in this work which has the property that the state of the system visited directly after state is the state for which is minimized. Several remarks need to be done here.
Remark 3
(a) The motivation of the framework that we have just described comes from the fact that one could see theses as potential sojourn times in before jumping to for various reasons (minimum cost, minimum waiting time, first come first served, etc.), there is an interest in choosing the minimum of them.
(b) Note that this framework has been considered also in [4], but for different family of distributions.
(c) This could be an interesting and rich framework for modelling time varying parameters with a similar approach like in [6].
Under this setting, we can write
and
which is independent of the state and represents the unconditional cdf of the sojourn time in state irrespectively of the state to be visited next. We finally assume that the associated pdf is denoted by .
The following Proposition from [5] holds true for the class of distributions (6).
Proposition 3
(17) | |
(18) | |
(19) |
and
(20) |
We proceed now to statistical inference by focusing on sample paths, , under two different settings: the uncensored case where all sojourn times are observed and the censored case with the sojourn time in the last visited state being right censored with censoring time denoted by .
Having available a semi-Markov process with censored sample paths, and for the family of distribution in (6), the general expression of the likelihood function is given by
(21) | |||
where
• : number of trajectories beginning from state ,
• : number of visits to state of the trajectory up to observation time ,
• : number of transitions from state to of the trajectory up to observation time ,
• ,
• : sojourn time in state during the visit, of the trajectory,
• is the trajectory’s observed censored time,
• is the number of visits in state , as the last visit, during the trajectories; it holds that
• : observed censored sojourn time in state during the visit, .
The maximization of the likelihood provides the estimator of which is equal to
(22) |
while the estimator of the initial law by
(23) |
In case of no censoring the sample paths are , , , , and the associated uncensored likelihood function can be considered as a particular case of the censored likelihoood defined earlier in (21). As a result the expression of the estimator of in this case, is a simplified version of the one given above for the censored case. Indeed, the resulting estimator is
(24) |
where
The initial probabilities can be estimated using the following expression
(25) |
Using the proper expression among the previous ones, for the parameter estimates, the following estimators can be easily obtained:
(26) | |
(27) |
and
(28) |
It is straithforward that for the uncesored setting, the estimator of the parameter for MWP, is simplified to
(29) |
In the censored case the estimator of becomes
(30) |
Under the censoring setting and for the general case of H-class of distributions, the estimator of the parameter is
(31) |
where for , one can consider any distribution function.
The purpose of this section is to remind definitions and results on Markov renewal function, semi-Markov transition probabilities and some reliability measures and to point out how we can estimate these measures. As it will be clear in the sequel, the estimators obtained in the previous section for various cases will furnish corresponding estimators of the reliability measures, Markov renewal function and semi-Markov transition probabilities,.
The Markov renewal function, , , with and belonging to the state space , is given by ([7]; [11])
(32) |
where is the convolution of .
Since the aforementioned is defined as a function of infinite terms, in practice we use the sum where is a large enough integer such that , for a sufficient small .
For two states and , the semi-Markov transition matrix is ([11])
Consider now two disjoint subsets of the state space, say and corresponding to the up- and down-states the union of which is the entire state space. To simplify matters, let and
The reliability function of the system, evaluated at is equal to ([14])
where: is the value of obtained using the restriction of the kernel to the up-states is the restriction of the initial distribution to the up-states is a vector of s.
After obtaining the reliability function, the failure rate can be easily obtained:
Similarly, under the present setting, the availability and maintainability are given by ([14], [11])
(34) | |||
where: is a vector of s; is the value of obtained using the restriction of the kernel to the down-states is the restriction of the initial distribution to the down-states
The mean time to failure (MTTF) is given by:
(35) |
where: is the restriction to of the mean sojourn time in state , is the restriction to of the Markov transition matrix A similar expression holds also true for the mean time to repair (MTTR).
Note that, for all , taken into account the parameter estimates obtained in the previous section for various cases, we can obtain the corresponding plug-in estimators of and Finally, for any state , is estimated by
or
and we can also obtain the plug-in estimator of the MTTF.
The accuracy of the estimating procedure is examined using simulations in R. A semi-Markov process, with 3 states, for several values of the number of trajectories, , in both cases of censoring or not, is considered. The sojourn times are taken randomly from a Modified Weibull Poisson distribution with fixed parameters and which are chosen arbitrarily:
(36) |
The total observation time is assumed to be and we record the results of the estimated parameters of interest. As for the initial law, is simulated from the discrete Uniform distribution with parameters 1 and . For the cases where there exist censored paths, using the Uniform distribution, the trajectories with censored sojourn time in the first visited state, are chosen. Randomly we cut the interval that is computed as the first/last sojourn time in two parts, where the second part is considered to be the censored sojourn time in the first/last visited state. Note that modifications of the method described above could be considered.
The two tables below provides the target values of the parameters and the markov chain transition probabilities
Figure 1 Squared errors of for censored trajectories at the beginning and/or at the end, for .
Figure 2 Squared errors of for censored trajectories at the beginning and/or at the end, for .
Figure 3 Estimators for the initial law, for censored trajectories at the beginning and/or at the end, for .
Figure 4 Squared errors of for censored trajectories at the beginning and/or at the end, for .
Figures 1 and 2 present the squared errors (S.E.) of the estimators and respectively, as the number of trajectories increases from 1 to 100. Observe that in almost all cases the estimators of both parameters are very good with respect to the squared errors. However, the squared errors of the markov chain transition probabilities, , are smaller as compared to the ones of the parameters . Figure 3 presents the estimate of the initial law function which is closed to the true value of
According to Figure 4, the estimated values for the semi-Markov process transition probabilities are very close to the real values with the squared errors to be less than
Figure 5 Squared errors of for uncensored trajectories, for .
Figure 6 Squared errors of for uncensored trajectories, .
The estimators of the parameters for the baseline distribution, MWP, and the markov chain transition probabilities, , behave very well even in the case of no censoring (see Figures 5 and 6).
Figure 7 Estimators for the initial law, for uncensored trajectories, .
Figure 8 Squared errors of for uncensored trajectories, for .
As for the estimator of the initial law (see Figures 3 and 7), in both cases of censoring or not censoring, is close to the vector which is the true value, especially when the number of trajectories is large enough (greater than 30). Figure 8 proves that estimators of the semi-Markov transition probabilities are very accurate with a squared error to be almost zero.
A new generalized class of distributions based on the Modified Weibull distribution is proposed in this work where for any specific parent distribution, a new class of distributions is obtained. The aforementioned opens up the way to make inference on a semi-Markov model by allowing a variety of distributions for sojourn times. The main contribution of this work is the fact that using the proposed generalized class we do not limit the problem to a restricted family of distributions. The proposed methodology is examined using simulations providing a comparison between real and estimated parameters with respect to the squared errors. The results are both encouraging and reliable in all cases.
[1] Azzalini, A. (1985). A Class of Distributions Which Includes the Normal Ones. Scandinavian Journal of Statistics, 12, 171–178.
[2] Ahuja, J., C. and Nash, Stanley W. (1967). The generalized Gompertz-Verhulst family of distributions. Sankhya, Series A, 29, 141–156.
[3] Balasubramanian, K., Beg, M. I. and Bapat, R. B. (1991). On families of distributions closed under extrema, Sankhya: The Indian Journal of Statistics A, 53, 375–388.
[4] Barbu, V., S., Karagrigoriou, A., Makrides, A. (2017). Semi-Markov Modelling for Multi-State Systems, Meth. & Comput. Appl. Prob., 19, 1011–1028.
[5] Barbu, V., S., Karagrigoriou, A., Makrides, A. (2019). Estimation and reliability for a special type of semi-Markov process, J. Math. Stat., 15(1), 259–272.
[6] Barbu, V., S., Karagrigoriou, A., Makrides, A. (2020). Statistical inference for a general class of distributions with time-varying parameters, J. Appl. Stat., 1–20.
[7] Barbu, V., S., Limnios N. (2004). Discrete time semi-Markov processes for reliability and survival analysis – a nonparametric estimation approach, Parametric and Semiparametric Models with Applications to Reliability, Survival Analysis and Quality of Life, eds. Narayanaswamy Balakrishnan, Mikhail Nikulin, Mounir Mesbah, Nikolaos Limnios, Birkhäuser, Collection Statistics for Industry and Technology, Boston, 487–502.
[8] Delgarm, L. and Zadkarami, M., R. (2015). A new generalization of lifetime distributions, Comput. Stat. 30, 1185–1198.
[9] Eugene N., Lee C., and Famoye F. (2002). Beta-normal distribution and its applications, Commun. Stat. Theory Methods 31, 497–512.
[10] Lai, C., D., Xie, M. and Murthy, D. (2003). A modified Weibull distribution. IEEE Trans. Reliab., 52, 33–37.
[11] Limnios, N. and Oprişan, G. (2001). Semi-Markov Processes and Reliability, Birkhäuser, Boston.
[12] Lisnianski, A., Frenkel, Karagrigoriou A. (2017). Recent advances in multi-state systems reliability: Theory and applications, Springer.
[13] Marshall A., W., Olkin I. (1967). A multivariate exponential distribution. J. Am. Stat. Assoc., 62, 30–44.
[14] Ouhbi, B. and Limnios, N. (1996). Non-parametric estimation for semi-Markov kernels with application to reliability analysis, Appl. Stoch. Models Data Anal., 12, 209–220.
[15] Ristic M., M. and Nadarajah S. (2014). A new lifetime distribution. J. Stat. Comput. Simul. 84, 135–150.
[16] Zografos, K. and Balakrishnan, N. (2009). On families of beta- and generalized gamma-generated distributions and associated inference, Stat. Methodol., 6, 344–362.
Andreas Makrides is a Post Doc researcher at the Department of Statistics and Actuarial – Financial Mathematics, University of the Aegean, Samos, Greece (advisor: Prof. Alex Karagrigoriou).
His research interests lie among others in Stochastic Modeling, Stochastic Processes, Applied Probability, Mathematical Statistics, Semi-Markov Processes, Reliability Theory, Multi-State Systems, Entropy and Divergence, Goodness of Fit Tests and Control Charts – Statistical Quality Control.
He has worked as a Post Doc researcher (2018–2019) at the Laboratoire de Mathématiques Raphaël Salem, University of Rouen, France.
He received his Ph.D in Statistics (2016) from the Department of Mathematics and Statistics, University of Cyprus, his M.Sc. in Statistics and Modeling (with excellence, 2010) from the Department of Mathematics of Aristotle University of Thessaloniki, Greece and his B.Sc. (Hons, with excellence, 2008) from the Department of Mathematics of Aristotle University of Thessaloniki, Greece.
Andreas has been the representative of both Cyprus and France to the 21st EYSM, July 29–Aug 2, 2019, Belgrade, Serbia (selected by the Regional Committee of the Bernoulli Society). He is the recipient of two excellence awards: (1) IKY (State Scholarships Foundation in Greece) awards, for academic excellence in undergraduate studies during the academic years 2005–2006, 2006–2007, 2007–2008. (2) 1st ranked postgraduate candidate, M.Sc “Statistics and Modeling, Department of Mathematics, Aristotle University of Thessaloniki, Greece, 2008. IKYK (State Scholarships Foundation in Cyprus) Fellowship, for the graduate studies.
He is also a reviewer for several scientific journals.
Furthermore, he has teaching experience since he has been working from 2017 as an Associated Lecturer at the University of Cyprus, the Cyprus University of Technology and the Uclan University, Cyprus on various courses both undergraduate and postgraduate.
Journal of Reliability and Statistical Studies, Vol. 15, Issue 2 (2022), 411–430.
doi: 10.13052/jrss0974-8024.1521
© 2022 River Publishers
2 The H-Class of Distributions
2.1 Basic Statistical and Reliability Functions
2.2 H-class: A Class Closed Under Minima
3 The Semi-Markov Model – The General Setting
4 The Semi-Markov Model – Estimation with and Without Censoring
4.1 The Case of Modified Weibull Poisson Distribution
4.2 The Case of the General H-class
6.1 Censoring at the Beginning and/or at the End