Stochastic Aspects of Proportional Vitalities Model

: In this paper, a family of models requiring proportional mean life vitalities is considered. The problem of estimation of the parameter(s) of the model is studied in two cases of known and unknown baselines along with some simulation studies to detect the adequacy of ﬁtting. Closure and preservation properties of some ageing classes and stochastic orders are derived.


Introduction
A large segment of statistical methodology deals with the modeling and analysis of data representing the time until the occurrence of an event (cf. Nanda et al. [1]). In industrial and biomedical studies, these events represent the time to failure of a machine, an organ, an individual, or the completion of a certain task. These times are referred to as survival times in biomedical studies and as lifetimes or failure times in actuarial and engineering studies (cf. Henley and Kumamoto [2], Fleming [3] and Miller [4]). Reliability measures are ordinarily considered in lifetime sciences as characteristic quantities to quantify the amount of uncertainty during the lifetime process of an item to presignify its life span. Endless numbers of models are being introduced in the literature to model failure time data. The proportional hazard rates (PHR) model, used to model left-truncated and right-censored failure times, is a reputable model that has been greatly developed in the literature (see, e.g., Cox [5]). To concentrate on the average amounts in place of probability values Oakes and Dasu [6] introduced the proportional mean residual life (PMRL) model for the analysis of reliability and survival data in the situations where left-truncated or right-censored observations are available. Nanda et al. [7] studied some reliability aspects and further stochastic properties including a number of stochastic comparisons and ageing properties in the PMRL model. In contrast to the PHR and the PMRL models, there have been proposed specific models to account for right-truncated and left-censored observations. The proportional reversed hazard rates (PRHR) model has been introduced as an alternative model for the PHR model by Gupta et al. [8] and some stochastic aspects along with further inferential procedures about it have been studied by Gupta and Gupta [9]. The proportional mean past lifetime (PMPL) model that focuses on the mean inactivity time of items has been considered by Asadi and Berred [10] and several stochastic properties of it have been developed by Rezaei [11].
The advantage of the current investigation is to establish another model based on the vitality function of a distribution which is a common measure of life expectancy among demographers. The vitality function of a distribution is the conditional mean of the random variable on the right tail of distribution (see, e.g., Navarro et al. [12], Sunoj et al. [13] and Abdul-Sathar et al. [14]). The proposed model can be used for modelling right-censored and left-truncated distributions. and the hazard rate (HR) function of X is The mean residual life (MRL) of X after time point t is dx.
If the support of X is (l X , u X ), thenF(t) = 0 for all t > u X . In such a case, m X (t), v X (t) and h X (t) are not defined for t > u X . But it is customary, in such cases, to define m X (t) = v X (t) = h X (t) = 0. The vitality function has a relationship with the HR and the MRL functions as follows: where For the random variable X with a finite mean such that F(0 − ) = 0, the survival function (SF) of X (F = 1 − F) can be retrieved from vitality function v X by the inversion formulā Demographers have used the vitality function or life expectancy or expectation of life function for centuries in studies of human populations (see, e.g, Vaupel et al. [15], Yashin et al. [16] and Sharrow et al. [17]). The vitality function is monotonically increasing over [0, ∞). With this in mind, it is a valid conjecture that the curve of the ratio of the parametric or non-parametric estimated vitality functions based on two sets of data will indicate significant departures from a horizontal line with less probability than the case where hazard rate or mean residual life functions are utilized.
The regression models proposed for lifetime data generally consider the assumption of the PH model or the assumption of PMRL. For instance, suppose that we use the PMRL model, i.e., m(t) = θm 0 (t), for all t ≥ 0 and for some unknown θ > 0, where m 0 and m are, respectively, the baseline and the response mean residual life functions assumed to be unknown. In the cases that two samples from the underlying distributions of m and m 0 is available, there is a principle regarding the shape of the two MRL functions. The MRL functions may exhibit quite different behaviours, such as monotonically increasing, monotonically decreasing, bathtub shaped, upside-down bathtub shaped and roller-coaster shaped depending on the fluctuations of data. The data analyst observes that a strong assumption in the PMRL model exists as the behaviour of the MRL curves must be the same under the setup of the model. For example, consider that according to data in a two sample problem the shape of the estimated mean residual lives are a lot different, thus, it is unlikely to find an appropriate value for θ to incorporate that possibility and this leads to an inadequacy for fitting the model to data.
The survival models have developed rapidly in the literature to model time to event data (cf. Hosmer Jr and Lemeshow [18] and Liu [19]). When a semi-parametric model is proposed, there may exist some challenging issues regarding that model. For instance, one may question the suitability of the model to model data in many practical situations and that how well does it fit the data. The main goal of this paper is to introduce a semi-parametric model based on the concept of vitality function, called proportional vitalities (PVIT) model. In the context of the PVIT, the involved age-specific vitality function is inherently monotonically increasing. The credibility of modelling lifetime data using the PVIT model may be worthy.
The entire structure of this paper is organized as follows. In Section 2, the model is proposed. In Section 3, making inferences about the parameter(s) of the model is discussed. In Section 4, we lay the foundations of several stochastic orderings and aging properties. In Section 5, a brief conclusion of the paper is emphasized and some generalizations for future work are outlined.

The PVIT Model
First of all, in what follows in the paper, we refer the readers for the definitions of the utilized stochastic orders to Shaked and Shanthikumar [20] and for the concepts of aging notions that will be used in the sequel to Barlow and Proschan [21]. The property of Totally positive of order 2 (TP 2 ) for bivariate functions is also adopted from Karlin [22]. To introduce the model, some motivations shall be provided. An examples of parametric distributions is brought to clarify the structure of the model. The following definition is stated.
where ξ is a positive constant that we call it vitality growth parameter.
We can infer that v X (t) = E(X | X > t) is the expected life-length of an individual randomly drawn from the individuals grouped at the age t. The PVIT model induces that the ratio v Y (t)/v X (t) is t-free which means that the vitality functions of X and Y are relatively free of their underlying aging processes. From a mathematical point of view, purporting that relative mean life-lengths of survivor individuals in both populations remain unchanged when the age of survival is variable. In particular, for almost surely positive random variables, In view of (3), if X and Y satisfy the PVIT model as described in Definition 1, then The identity (7) may not give a closed form for the SF of the random variable Y in terms of SF of X, whereas in the PMRL model the inversion formula provides an explicit expression for the SF of the response variable in terms of a combination of the SF of the underlying distribution function with that of its equilibrium distributions (see, e.g., Nanda et al. [23]). On that account, we are capable of proving that (7) presents a proper SF if the following conditions hold: In what follows, the PVIT model is characterized in terms of the relationship between the ratios of survival (or hazard rate) functions of the length-biased distributions and the ratio of those of the underlying distributions. Recall thatX andŶ with PDF'sf (t) = t f (t)/E(X), t ≥ 0 andĝ(t) = tg(t)/E(Y), t ≥ 0 (provided that the expectations exist and are finite) are said to follow length-biased distributions associated with f and g, respectively (cf. Gupta and Keating [24]). Length-biased sampling arises when a component already in use is sampled at a fixed time and then allowed to fail (cf. Scheaffer [25]). The concept of length-biased sampling and the arising hazard rates, as well as cumulative distribution functions and the relations with other reliability measures, are appreciated in the literature (cf. Gupta and Keating [24]). Proposition 1. The non-negative random variables X and Y satisfy the PVIT model, if and only if one of the following assertions holds; Proof of Proposition 1. By Proposition 1 in Izadkhah et al. [26], for all t ≥ 0, we have and , for all t ≥ 0 which is equivalent to (i). By a further application of Proposition 1 of Izadkhah et al. [26], one has, for all t ≥ 0, This, together with the recent identities above, establishes the equivalence of (ii) and the PVIT model.
For studies in reliability, biometry and survival analysis, the length-biased distributions have been frequently appropriate for quite a number of sampling plans. Suppose that two random samples on X and Y cannot be reached due to an ungovernable biased sampling procedure. In spite of that, imagine that the available data come from the associated length-biased distributions. Proposition 1 states that in the context of the PVIT model, the ratio of survival functions (or the hazard rates) of the original distributions could be estimated under length-biased sampling as effectively as acting under random (unbiased) sampling. The following example indicates that by adding the vitality parameter ξ to the exponential distribution, the well-known Hall-Wellner family of distributions is generated.
Example 1. Suppose that X is distributed with mean 1/λ in accordance with exponential distribution. In an easy way, we get v X ( which is valid for A > −1 and B > 0. When A > 0, A = 0 and −1 < A < 0, it gives respectively a Pareto, an exponential (at limit) and a rescaled beta distribution. Here, by choosing A = ξ − 1 and B = ξ/λ, it is clarified that the distribution of Y is in the Hall-Wellner family. Now, we discuss a characterization result within the class of distributions satisfying the PVIT model. Theorem 1. Let X and Y be two non-negative random variables with finite means and hazard rate functions h X and h Y , respectively, such that lim t→∞ th X (t) = lim t→∞ th Y (t) = ∞. Then, X and Y satisfy the PVIT model if and only if they are equal in distribution.
Proof of Theorem 1. By the construction of the model, In a similar manner, lim t→∞ m Y (t)/t = 0. It thus concludes that ξ = 1, i.e., v Y (t) = v X (t), for all t ≥ 0, which means that X and Y have equal distributions.

Estimation of the Vitality Growth Parameter
In this section, statistical inferences about the unknown parameter(s) of the model are made. In the PVIT model, as introduced in Definition 1, the baseline vitality function v X may have either an accurate form (known) or an unknown feature. In this setting, we shall first consider the case when X has the Parametric distribution so that v X is fully known. Then, the case where v X is unknown but a random sample on X and also a random sample on Y are available will be considered.

One-Sample Case with Specified v X
Here, by simulating a data set from Pareto distribution we initiate a process for estimation of the growth parameter ξ in the PVIT model. We generate a sample of size 50 from the Pareto distribution having survival functionF X (t) = 1/(1 + t) 3 , t ≥ 0 and denote them by t 1 , . . . , t 50 , as presented in the Appendix. The vitality function is specified as v X (t) = (3t + 1)/2, t ≥ 0. Now, assume that Y as another random variable has vitality function v Y (t) = ξv X (t), in which ξ = 3. To perform a linear regression analysis, set z i = ξv X (t i ) + i , where i 's represent a generated random sample from N(0, 0.01). The method of least square provides that The data and the calculated values are given in Appendix A.

Two-Sample Case when v X is Unspecified
In this sub-section, we address the problem of estimating the parameter ξ = (ξ 1 , . . . , ξ p ) in model (4) when, X 1 , X 2 , . . . , X n is an independent random sample from F (the baseline) and Y 1 , Y 2 , . . . , Y N drawn independently from the populations with distribution functions G 1 , G 2 , . . . , G p , (the output) respectively, so that Y 1 , . . . , Y N 1 are adopted independently from G 1 , and Y N 1 +1 , . . . , Y N 1 +N 2 are taken independently from G 2 , . . . and Y , . . . , Y N p taken independently from G p . To this end, several steps will be taken to develop the procedure. Take into consideration that F n is the empirical distribution function of X, defined as where I is the indicator (heavy-side) function and x (i) denotes the observed the ith order statistic in the sample drawn from F, and accordinglyF n = 1 − F n represents the empirical SF. Suppose v F n stands for the vitality function of the distribution (8). It can be acquired from (2) that One disadvantage with the empirical distribution is that it assigns positive probability mass only to a finite number of points x (1) , x (2) , . . . , x (n) , say, which may be undesirable when modeling continuous variables. Given the observed values x (1) , x (2) , . . . , x (n) of the sample X 1 , X 2 , . . . , X n from F, the smoothed empirical distribution is defined as a mixture distribution with the following SF whereF δ i is a SF of a random variable distributed at a two-sided neighborhood of x (i) . Note that ifF δ i is the SF of a degenerate random variable at x (i) , then (10) corresponds to (8). We will take F n and f n as the CDF and the PDF associated with the SF (10). To show initiative in the case of our study, we are inclined to use triangular distribution. It is said that X * follows the triangular distribution with the vector of parameters δ = (a, c, b) in which a < c < d ∈ R (denoted by X * TA(δ)) when it has density The SF associated with (11) is derived as In the case of (10), suppose thatF δ i , i = 2, . . . , n − 1 is the SF of the triangular distribution with parameter δ i = (x (i−1) , x (i) , x (i+1) ). In parallel, assume thatF δ 1 is the survival functions associated with triangular distribution with parameter δ 1 = (0, x (1) , x (2) ) and thatF δ n is the SF of a symmetric triangular distribution with δ n = (x (n−1) , x (n) , 2x (n) − x (n−1) ). We now derive the vitality function of a random variable with SF (10) as it has already been fully characterized. Therefore, a smoothed estimation of vitality function is derived as Computation of the integrals in (13) is not difficult as it can be easily accomplished by replacing the specified amounts of parameters of the survival functions, i.e., δ i as in (12). For, i = 2, . . . , n − 1 when t < x (i−1) we have for t ∈ [x (i−1) , x (i) ) : and eventually . The cases when i = 1, n readily follow. Consider a setting by which Y i is a random variable with vitality function v Y i (t) = ξ i v X (t), for i = 1, 2, . . . , p. In situations where v X is unspecified, we are able to estimate it with the estimator (13) by applying data on X. In accordance with (7), we take Y = (Y 1 , Y 2 , . . . , Y N ) T as a sample of independent random variables as described before. Consider Y j = (Y 1+S j−1 , . . . , Y S j ) T , which has likelihood function (13), then the resulting model is completely parametric. The method of maximum likelihood can be used to estimate ξ. The full likelihood function obtains: from which the likelihood equations becomes: The above equations can be solved using the numerical Newton-Raphson method of iteration.

Some Closure and Preservation Properties
In this section, the closure (preservation) property of some stochastic orders (aging classes) under the formation of the model is studied. The problem has attracted the attention of many researchers in the recent past decades (cf. Amini-Seresht and Zhang [28], Amini-Seresht and Khaledi [29], Barmalzan and Najafabadi [30]). The following result indicates that the model by restricting the domain of the parameter ξ induces the hazard rate (mean residual life) ordering property between the underlying random variables. Theorem 2. X ≤ hr (≤ mrl )Y, if and only if, ξ ≥ 1.

Proof of Theorem 2.
From the relationships among the HR and the MRL orders, it suffices to prove that ξ ≥ 1 implies X ≤ hr Y, and that X ≤ mrl Y implicates ξ ≥ 1. Under the set up of the model, by (1) which is non-negative for ξ ≥ 1 as v X (t) > 0 for all t ≥ 0 because v X (t) is increasing in t ≥ 0 (see, e.g., Proposition 2.4 of Nanda and Jain [31]) and also ξv X (t)/(ξv X (t) − t) is decreasing in ξ, for any t ≥ 0. This indicates that when ξ ≥ 1, X ≤ hr Y. Now, we prove that X ≤ mrl Y implies that ξ ≥ 1. From (2), for random variables X and Y that satisfy the PVIT model, we can get m Y (t) = ξm X (t) + (ξ − 1)t, for all t ≥ 0 and thus ξ = (t + m Y (t))/(t + m X (t)), which is greater than 1 by definition.
However, the following example is an indication that the hazard rate order in Theorem 2 cannot be replaced by the likelihood ratio order.
Example 2. Let X and Y have, respectively, exponential distribution and Lomax distribution with survival functionsF(t) = exp(−t), t ≥ 0 andḠ(t) = 2 t+2 i.e., vitality of Y has been evolved with respect to vitality of X according to the PVIT model with a frailty growth identified by ξ = 2. By some routine calculations, it is perceived that X lr Y.
In the recent past decade, Righter et al. [32] propounded some fresh ageing notions on the basis of scaled conditional lifetime. The scaled conditional life of a random variable X is the total life relative to the current age, conditioned on the current age, and is given by , for all t with F(t) < 1, is called the residual life of an item with age t, in which d = means equality in distribution. For further studies of the family of distributions of X SC (t), see Righter et al. [32], Belzunce et al. [33] and the references therein. A new class of life distributions could be as the following. Recall that a function η is anti-star-shaped on (a, b), if (1/t)η(t) is non-increasing in t ∈ (a, b).

Definition 2.
The non-negative random variable X is said to have decreeing mean scaled conditional life (denoted as X ∈ DMSCL) if E[X SC (t)] is non-increasing in t > 0, or equivalently, v X (t) is an anti-star-shaped function in t > 0.
It has been investigated by the authors that some ageing properties of X are inherited by the random variable Y in the context of the model. Before stating our findings about that problem, we need to prove the following definition and key lemma. Definition 3. The non-negative random variable X is said to have (i) increasing failure rate (IFR) whenever the hazard rate function h X of X is non-decreasing. (ii) increasing mean residual life (IMRL) whenever the mean residual life function m X of X is non-decreasing. (iii) increasing failure rate in average (IFRA) whenever (1/x) x 0 h X (u)du is non-decreasing in x > 0. (iv) new worse than used (NWU) property whenever P(X > x)P(X > t) ≤ P(X > x + t), for all x ≥ 0 and for all t ≥ 0. (v) new worse than used in expectation (NWUE) whenever E(X) ≤ E(X − t | X > t), for all t ≥ 0 (vi) increasing generalized failure rate (IGFR) whenever xh X (x) is non-decreasing in x > 0.
The decreasing failure rate (DFR) and the decreasing failure rate in average (DFRA) aging paths are defined by reversing the required monotonicity behaviour in Definition 3(i) and Definition 3(ii), respectively. The new better than used in expectation (NBUE) property is also defined by revering the side of the inequality in Definition 3(v). Lemma 1. Let X ∈ DMSCL such that ξ ∈ (0, 1] (ξ ∈ [1, ∞)). Then the function δ given by is non-decreasing (non-increasing) in x ≥ 0 for all ξ in the specified intervals.
In the situations where X and Y with common support (0, ∞) satisfy the IFR (or even the ultimately IFR) property, the assumption of Theorem 1 is fulfilled. Furthermore, when X and Y possess the (ultimately) DMRL ageing property, then trivially both m X (t)/t and m Y (t)/t tend to zero when t is closing to infinity and X and Y are equal in distribution in this case either. It can be shown that when X and Y have IFRA property then for all t > 0 : and the requirement lim t→∞ t 0 h X (x)dx = lim t→∞ t 0 h Y (y)dy = ∞ together with (16) makes the assumption of Theorem 1 satisfied. Therefore, in all of these cases the model is valid if and only if distributions of X and Y coincide each other and as a result preservation property for the foregoing positive aging classes is an ineffective study. In spite of that, we can prove the following results. For the IGFR class of distributions, we refer the readers to Lariviere and Porteus [34].
Proof of Theorem 3. To prove (i), from the relationship between m Y and m X in Theorem 2, we have m Y (t) = ξm X (t) + (ξ − 1) which is non-negative for all t ≥ 0, by assumption. For proving (ii) To prove the assertion (iv), denote first by l Y (t) = th Y (t) and l X (t) = th X (t) the PHRs of X and Y, respectively. From the construction of the model it follows that where δ is as defined in Lemma 1. From assumption, since X ∈ IGFR thus l X is non-decreasing, and according to Theorem 20 in Kayid et al. [35], X ∈ DMSCL. On using Lemma 1, when ξ ≤ 1, δ(t, ξ) is non-decreasing in t ≥ 0, which results Y ∈ IGFR.
The next result establishes preservation of some other ageing properties. Theorem 4. Let X and Y be related as in the model such that X ∈ DMSCL. Then, (i) For any ξ ≥ 1, X ∈ DFR implies that Y ∈ DFR.
Proof of Theorem 4. To prove (i), observe that h Y (t) = δ(t, ξ)h X (t) for all t ≥ 0. By applying Lemma 1, the result follows immediately. For the sake of proving (ii), notice that X ∈ DFRA, if and only if, dx ≥ 0, for all t ≥ 0. Since ξ ≥ 1 and X ∈ DMSCL thus Lemma 1 concludes that δ(x, ξ) is non-negative and non-increasing in x ≥ 0. Therefore, which clarifies that Y ∈ DFRA. To demonstrate (iii), be mindful of that X ∈ NWU if and only if, where δ is as given in Lemma 1 and I[u ≤ x] is the indicator function of the set [u ≤ x]. It is clear from assumption by using Lemma 1 that ξ t,x,ξ (u) is non-increasing in u ≥ 0, for all x ≥ 0 and for all t ≥ 0, with ξ ≥ 1. On that account, Lemma 7.1(b) in Barlow and Proschan [21] implies that where In contrast, since δ(x, ξ) is non-negative and non-increasing by assumption, thus which is non-negative from (17) and the result follows.

Conclusions
The PVIT model has some complementary role with respect to the so called PHR model. The random varaibles X and Y are said to have PHR when h Y (t) = θh X (t) for all t > 0 and for some constant θ > 0 (see, e.g., Nanda and Das [36]). As in the PHR model, the data from lifetime of units that deteriorate with age could be used but in the PVIT model those units that recuperate with age can be modelled. In a regression analysis problem, the vitality growth parameter ξ may be expressed in terms of some covaraites, i.e., ξ = exp(b t z) where b is a p × 1 vector of regression parameters, and z is a p × 1 vector of covariates. On the ith individual is available a certain vector of observed covariates z i . As a result, the factor z i in the study can be used to quantify some positive/negative events. For example, in the PHR model z i , as a covariate, has two levels for the ith individual, z i = 0 describes a non-smoker and z i = 1 a smoker. In the PVIT model, z i = 0 if the ith individual does not get enough exercises weekly, and z i = 1 if he/she does have bodily activity weekly enough. It is well-known that if somebody is a smoker, the hazard rate of his/her lifetime increases, but if the person is an athlete, his/her lifetime is improved in the sense of vitality function, relatively.
Specifically, in the current investigation, some simulation studies were carried out to detect the accuracy of the PVIT model based on some data sets. The numerical iteration method of Newton-Raphson has been adopted to estimate the parameters of the model. Several reliability properties in the PVIT model, including some stochastic orders and some aging properties, are discussed. In the future of this study, time dependent proportional vitalities model will be considered.