A Comparison of MLE for Some Index Distributions Based on Censored Samples

Yunhan Liu; Changchun Gao; Xiaofeng Liu; Ping Luo; Jianguo Ren

doi:10.3390/math12203264

,

and

¹

Glorious Sun School of Business and Management, Donghua University, Shanghai 200051, China

²

Department of Statistics, Shanghai University of Finance and Economics Zhejiang College, Jinhua 321000, China

³

Department of Mathematics, Shanghai University of Finance and Economics Zhejiang College, Jinhua 321000, China

^*

Author to whom correspondence should be addressed.

Mathematics2024, 12(20), 3264;https://doi.org/10.3390/math12203264

This article belongs to the Special Issue Decision Making under Uncertainty in Soft Computing

Version Notes

Order Reprints

Abstract

This paper elucidates the prerequisites for maximum likelihood estimation (MLE) of parameters within the exponential and scale parameter families. Estimation of these parameters is predicated on data derived from censored samples and seeks to adhere to stochastic ordering principles. The study establishes that for two independent normal distributions and a two-parameter exponential distribution discernible by the distinct parameter sets, the MLEs of the parameters evince a stochastically ordered relationship when evaluated using full datasets. Furthermore, this research is extended to corroborate the persistence of stochastic ordering in the MLEs of such parameters under conditions of fixed censoring of samples.

Keywords:

usual random order; censored samples; maximum likelihood estimator; location family; scale family

MSC:

90B25; 60E05

1. Introduction

In medical research, finance, service industries, and business analytics, the exponential distribution and MLE are pivotal. Among other things, they assess treatment outcomes amid censored survival data, gauge risks for insurance purposes, optimize service delivery in call centers, ensure stochastic order for decision-making, facilitate statistical inferences from incomplete datasets, and validate model accuracy. This approach is paramount for efficient resource management and informed strategic planning across sectors.

In a series of examples, such as life-test or reliability tests, there is prior information available about the parameters. An experimenter can use this prior information to judge that a product is more reliable than its competitive product. In this scenario, the expectation is that parameter estimation will also effectively capture the comparative magnitudes of the parameters. Numerous inferential methods for estimating global parameters under these ordered constraints have been explored. The literature provides comprehensive details on inferential techniques for order constraints, as presented by Barlow (1972) [1] and expanded upon by Nuesch (1999) [2], among others.

N. Balakrishnan and Jiemi (2001) [3] proposed the random order problem for maximum likelihood estimation of parameters under the full sample, and proposed three conditions for single parameter distribution based on the maximum likelihood estimate of the full sample. The random order theory is a set of theories about the “size” of random variables. In a certain sense, it can be used to compare one random variable (vector) in terms of its size or number of variables with respect to another random variable (vector) [4,5,6]. Random order is a theoretical tool and method for decision-making in random environments, and is also a typical representative of uncertain variable comparison [7,8,9,10]. The study of random order has a wide and profound practical background, making it of great significance in both theory and application [11,12,13,14,15].

The life test is divided into the complete life test and the censored life test according to the failure of the sample, with the latter being the most widely used. Bartholomew (1957) [16] mentioned some problems and solutions that may arise during life test. The exponential distribution is one of the more widely used distributions for modeling life in reliability theory and practical reliability engineering [17,18,19]. Many useful results on exponential distributions can be found in [20,21]. The problem of estimating the mean of the exponential distribution is very important, and different maximum likelihood estimates can exist depending on the censored sample.

Consider a population that has a probability density function

f (x, θ)

, where the parameter

θ

belongs to a subset

Θ

of

(- \infty, + \infty)

. One important aspect of statistical inference is obtaining a point estimator of

θ

. Many different methods of estimation are known in the literature, for example moment estimation, maximum likelihood estimation, least-squares estimation, etc. [22]. Various properties of these approaches, such as unbiasedness, weak and strong consistency, and asymptotic normality, have been discussed; in this paper, our aim is as follows: based on censored samples, we discuss the conditions under which the maximum likelihood estimates of the parameters of the family of exponential distributions satisfy random ordering [23,24,25,26].

By “order preserving”, we refer to the following concept: suppose that X and Y are independent of each other from two exponentially distributed aggregates, with

X_{1}

,

X_{2}

, …,

X_{n}

a sample from the aggregate

f (x, θ_{1})

and

Y_{1}

,

Y_{2}

, …,

Y_{n}

a sample from population

f (x, θ_{2})

. Assume that

θ_{1}

,

θ_{2} \in Θ

, and further that

θ_{1} < θ_{2}

. We denote the point estimators of

θ_{1}

and

θ_{2}

obtained from applying the same estimation method based on

X = X_{1}, X_{2}, \dots, X_{n}

and

Y = Y_{1}, Y_{2}, \dots, Y_{n}

by

\hat{θ_{1}} \equiv \hat{θ_{1}} (X)

and

\hat{θ_{2}} \equiv \hat{θ_{2}} (Y)

, respectively. Let

X_{1}

,

X_{2}

, …,

X_{r}

and

Y_{1}

,

Y_{2}

, …,

Y_{s}

be Type-I censored samples, where

r \leq n

,

s \leq n

. Because the real parameter values

θ_{1}

and

θ_{2}

satisfy

θ_{1} < θ_{2}

, it is desirable that a certain order exist between their point estimators

\hat{θ_{1}}

and

\hat{θ_{2}}

. In general, it cannot be true that

\hat{θ_{1}} \leq \hat{θ_{2}}

point-wise due to the randomness of samples. Then, it is of interest to investigate what kind of order may exist between

\hat{θ_{1}}

and

\hat{θ_{2}}

. Because of its popularity and importance, a natural candidate for possible order between

\hat{θ_{1}}

and

\hat{θ_{2}}

is the usual stochastic order ≤.

Definition 1.

If the random variables X and Y have distribution functions

F_{1}

and

F_{2}

, we say that X is stochastically less than or equal to Y if

P (X > t) \leq P (Y > t)

,

\forall t \in (- \infty, + \infty)

, which is denoted by

X \overset{s t}{\leq} Y

or

F_{1} \overset{s t}{\leq} F_{2}

.

In comparison to the various stochastic orders known in the literature, the order ≤ or ≥ is not a restrictive one; however, even this order does not always hold between

\hat{θ_{1}}

and

\hat{θ_{2}}

, as the following example indicates.

Studying the order preservation of maximum likelihood estimates obtained from censored samples can verify consistency, rank parameters, perform variable selection, and evaluate robustness, helping us to better understand and apply maximum likelihood estimation methods. Consistency means that the maximum likelihood estimates converge to the true parameters as the sample size approaches infinity. By studying the order preservation of maximum likelihood estimates under truncated samples, we can further verify the consistency of maximum likelihood estimates in finite samples. Order preservation studies allow for parameter ranking. Through the analysis of order preservation after removing samples, we can compare the order of parameter estimates under different models or hypotheses, identify important parameters or model features, and help to understand the influence and importance of parameters. In regression models with a large number of predictors, analyzing the order preservation of parameter estimates after sample removal can help in selecting important variables based on the order preservation of parameter estimates, which not only reduces model complexity but also enhances model interpretability and prediction accuracy. In the presence of outliers or extreme observations, analyzing the order preservation after removing samples can quantify the sensitivity of maximum likelihood estimates to outliers and help to evaluate the robustness and stability of parameter estimates.

The rest of the paper is organized as follows. We demonstrate in Section 2 that the maximum likelihood estimators (MLEs) of the single-parameter exponential distribution, double-parameter exponential distribution, and normal distribution, which are part of the exponential family, exhibit stochastic order relations under complete sampling. In Section 3, we present the general form of the exponential family distribution and the specific conditions required for the location and scale parameter families to satisfy stochastic order in censored samples, followed by a numerical analysis. Section 4 details the conditions under which the MLEs of the double-parameter exponential distribution meet the criteria for stochastic order in the context of censored samples. Section 5 provided illustrative examples to clarify these concepts.

2. Comparison of MLE under Complete Samples

2.1. One-Parameter Exponential Distribution

Let

g (x, θ) \geq 0

be an integrable function on

(a, b)

for each

θ \in Θ \subseteq (- \infty, + \infty)

and let

h (θ)

be defined by

\frac{1}{h (θ)} = \int_{a}^{b} g (x, θ) d x < \infty .

(1)

In addition, we denote

f (x, θ) \equiv h (θ) g (x, θ) .

(2)

Here,

f (x, θ)

is a valid probability density function on

(a, b)

. Suppose that

θ_{1} < θ_{2} \in Θ

and that the random variables X and Y have density functions

f (x, θ_{1})

and

f (y, θ_{2})

, respectively. Furthermore, let

X_{1}, X_{2}, \dots, X_{r}

and

Y_{1}, Y_{2}, \dots, Y_{s}

, be censored samples under

f (x, θ_{1})

and

f (y, θ_{2})

, respectively. The following result provides the conditions under which

X \overset{s t}{\leq} Y

, and the maximum likelihood estimators

\hat{θ_{1}}

and

\hat{θ_{2}}

for

θ_{1}

and

θ_{2}

satisfy

\hat{θ_{1}} \overset{s t}{\leq} \hat{θ_{2}}

. Throughout this article, it is always assumed that the maximum likelihood estimator belonging to the parameter space

Θ

exists and is unique. The conditions that guarantee this are not provided explicitly.

Lemma 1.

Let

f (x, θ) = h (θ) g (x, θ)

be defined by (1) and (2).

Suppose that:

1.: $h (θ)$ is log-concave, i.e., $\frac{d^{2}}{d θ^{2}} ln h (θ) \leq 0$ , $\forall θ \in Θ$ ;
2.: $g (x, θ)$ is log-concave in $θ \in Θ$ , i.e., $\frac{𝜕^{2}}{𝜕 θ^{2}} ln g (x, θ) \leq 0$ , $\forall θ \in Θ$ , $\forall x \in (a, b)$ ;
3.: $\frac{𝜕^{2}}{𝜕 x 𝜕 θ} ln g (x, θ) \geq 0$ , $\forall θ \in Θ$ , $\forall x \in (a, b)$ .

If

θ_{1} < θ_{2} \in Θ

and at least one of the inequalities in 1–3 is strict, then there is

(a) X \overset{s t}{\leq} Y

and

(b) \hat{θ_{1}} \overset{s t}{\leq} \hat{θ_{2}}

.

The above lemma provides a general condition for the MLE to have the property of preserving the usual stochastic orders. It is given that when conditions 1, 2, and 3 are satisfied, two mutually independent exponentially distributed random variables exist under the equivalent carve-outs under the usual random order.

2.2. Two-Parameter Exponential Distribution

Theorem 1.

Let the samples

X_{1}, X_{2}, \dots, X_{n}

follow a two-parameter exponential distribution with probability density function

f (x, μ_{1}, θ_{1}) = \frac{1}{θ_{1}} exp \{- \frac{x - μ_{1}}{θ_{1}}\}, x \geq μ_{1},

and let the samples

Y_{1}, Y_{2}, \dots, Y_{n}

follow a two-parameter exponential distribution with probability density function

f (y, μ_{2}, θ_{2}) = \frac{1}{θ_{2}} exp \{- \frac{y - μ_{2}}{θ_{2}}\}, y \geq μ_{2} .

If

μ_{1} \leq μ_{2} and θ_{1} \leq θ_{2},

then the maximum likelihood estimators

{\hat{μ}}_{1}

and

{\hat{θ}}_{1}

obtained from

X_{1}, X_{2}, \dots, X_{n}

satisfy the stochastic order relationship with the maximum likelihood estimators

{\hat{μ}}_{2}

and

{\hat{θ}}_{2}

obtained from

Y_{1}, Y_{2}, \dots, Y_{n}

, denoted as

{\hat{μ}}_{1} \overset{s t}{\leq} {\hat{μ}}_{2} and {\hat{θ}}_{1} \overset{s t}{\leq} {\hat{θ}}_{2} .

Proof.

The likelihood function of

μ, θ

is

L (x_{1}, x_{2}, \dots, x_{n}, μ, θ) = \frac{1}{θ^{n}} exp (- \sum_{i = 1}^{n} \frac{x_{i} - μ}{θ}),

and its log-likelihood function is

ln L (x_{1}, x_{2}, \dots, x_{n}, μ, θ) = - n ln θ - \sum_{i = 1}^{n} \frac{x_{i} - μ}{θ} .

(3)

Observing the expression of

ln L (x_{1}, x_{2}, \dots, x_{n}, μ, θ)

, for any fixed

θ

, in order to maximize the likelihood function

μ

must be as large as possible with

μ \leq x_{(1)}

. The likelihood function

L (x_{1}, x_{2}, \dots, x_{n}, ν, θ)

reaches its maximum; hence,

\hat{μ} = X_{1}

,and then

\hat{θ} = S / n

, where

S = (X^{(1)} - X^{(1)}) \sum_{i = 1}^{n}

. Thus, we have

\begin{matrix} {\hat{μ}}_{1} = X_{(1)}, {\hat{θ}}_{1} = \frac{S_{1}}{n}, S_{1} = \sum_{i = 1}^{n} (X_{(i)} - X_{(1)}), \end{matrix}

(4)

\begin{matrix} {\hat{μ}}_{2} = X_{(2)}, {\hat{θ}}_{2} = \frac{S_{2}}{n}, S_{2} = \sum_{i = 1}^{n} (Y_{(i)} - Y_{(1)}) . \end{matrix}

(5)

Knowing

\hat{μ} = X_{(1)} \sim W (1, \frac{θ}{n}, μ)

, we aim to find

{\hat{μ}}_{1} \overset{s t}{\leq} {\hat{μ}}_{2}

, that is,

\forall t, p ({\hat{μ}}_{1} > t) \leq p ({\hat{μ}}_{2} > t)

,

\int_{t}^{\infty} \frac{n}{θ_{1}} exp \{- \frac{n (x - μ_{1})}{θ_{1}}\} d x \leq \int_{t}^{\infty} \frac{n}{θ_{2}} exp \{- \frac{n (y - μ_{2})}{θ_{2}}\} d y .

From

μ_{1} \leq μ_{2}, θ_{1} \leq θ_{2},

we know that the above formula holds, that is,

μ_{1} \overset{s t}{\leq} {\hat{μ}}_{2}

.

Given that

S = \sum_{i = 1}^{n} (X_{(i)} - X_{(1)}) \sim Γ (n - 1, θ)

, we want to obtain

{\hat{θ}}_{1} \overset{s t}{\leq} {\hat{θ}}_{2}

, that is,

\forall t, p ({\hat{θ}}_{1} > t) \leq p ({\hat{θ}}_{2} > t) \Leftrightarrow p (\frac{S_{1}}{n} > t) \leq p (\frac{S_{2}}{n} > t) .

Because

p (\frac{S_{1}}{n} > t) = \int_{t}^{\infty} \frac{θ_{1}^{n - 1} {(\frac{x}{n})}^{n - 2}}{n Γ (n - 1)} exp \{\frac{- θ_{1} x}{n}\} d x = \frac{θ_{1}^{n - 1}}{(n - 2)! n^{n - 1}} \int_{t}^{\infty} exp \{\frac{- θ_{1} x}{n}\} x^{n - 2} d x,

p (\frac{S_{2}}{n} > t) = \int_{t}^{\infty} \frac{θ_{2}^{n - 1} {(\frac{y}{n})}^{n - 2}}{n Γ (n - 1)} exp \{\frac{- θ_{2} y}{n}\} d y = \frac{θ_{2}^{n - 1}}{(n - 2)! n^{n - 1}} \int_{t}^{\infty} exp \{\frac{- θ_{2} y}{n}\} y^{n - 2} d y,

and

θ_{1} \leq θ_{2}

, we have

p (\frac{S_{1}}{n} > t) \leq p (\frac{S_{2}}{n} > t);

(6)

therefore,

{\hat{θ}}_{1} \overset{s t}{\leq} {\hat{θ}}_{2}

. This completes the proof of Theorem 1. □

2.3. Normal Distribution

Theorem 2.

Let the samples

X_{1}, X_{2}, \dots, X_{n}

follow a normal distribution with probability density function

f (x, μ_{1}, σ_{1}) = \frac{1}{\sqrt{2 π} σ_{1}} exp \{- \frac{{(x - μ_{1})}^{2}}{2 σ_{1}^{2}}\}, - \infty < x < \infty,

and let the samples

Y_{1}, Y_{2}, \dots, Y_{n}

follow a normal distribution with probability density function

f (y, μ_{2}, σ_{2}) = \frac{1}{\sqrt{2 π} σ_{2}} exp \{- \frac{{(y - μ_{2})}^{2}}{2 σ_{2}^{2}}\}, - \infty < y < \infty .

If

μ_{1} \leq μ_{2}

and

σ_{1}^{2} \leq σ_{2}^{2}

, then the maximum likelihood estimators

{\hat{μ}}_{1}

and

{\hat{σ}}_{1}^{2}

obtained from

X_{1}, X_{2}, \dots, X_{n}

satisfy the stochastic order relationship with the maximum likelihood estimators

{\hat{μ}}_{2}

and

{\hat{σ}}_{2}^{2}

obtained from

Y_{1}, Y_{2}, \dots, Y_{n}

, which we denote as

{\hat{μ}}_{1} \overset{s t}{\leq} {\hat{μ}}_{2}

and

{\hat{σ}}_{1}^{2} \overset{s t}{\leq} {\hat{σ}}_{2}^{2}

.

Proof.

For all t, the probability

p (\bar{X} > t) \leq p (\bar{Y} > t)

if and only if

p (\frac{\bar{X} - μ_{1}}{\frac{σ_{1}}{\sqrt{n}}} > \frac{t - μ_{1}}{\frac{σ_{1}}{\sqrt{n}}}) \leq p (\frac{\bar{Y} - μ_{2}}{\frac{σ_{2}}{\sqrt{n}}} > \frac{t - μ_{2}}{\frac{σ_{2}}{\sqrt{n}}}),

where the random variable

\frac{\bar{X} - μ}{\frac{σ}{\sqrt{n}}}

follows a standard normal distribution

N (0, 1)

. Therefore,

p (\frac{\bar{X} - μ_{1}}{\frac{σ_{1}}{\sqrt{n}}} > \frac{t - μ_{1}}{\frac{σ_{1}}{\sqrt{n}}}) = \int_{\frac{t - μ_{1}}{\frac{σ_{1}}{\sqrt{n}}}}^{\infty} \frac{1}{\sqrt{2 π}} exp \{- \frac{x^{2}}{2}\} d x,

(7)

p (\frac{\bar{Y} - μ_{2}}{\frac{σ_{2}}{\sqrt{n}}} > \frac{t - μ_{2}}{\frac{σ_{2}}{\sqrt{n}}}) = \int_{\frac{t - μ_{2}}{\frac{σ_{2}}{\sqrt{n}}}}^{\infty} \frac{1}{\sqrt{2 π}} exp \{- \frac{y^{2}}{2}\} d y .

(8)

Given that

μ_{1} \leq μ_{2}

and

σ_{1}^{2} \leq σ_{2}^{2}

, it follows that

\frac{t - μ_{1}}{\frac{σ_{1}}{\sqrt{n}}} \geq \frac{t - μ_{2}}{\frac{σ_{2}}{\sqrt{n}}},

(9)

which implies

p (\frac{\bar{X} - μ_{1}}{\frac{σ_{1}}{\sqrt{n}}} > \frac{t - μ_{1}}{\frac{σ_{1}}{\sqrt{n}}}) \leq p (\frac{\bar{Y} - μ_{2}}{\frac{σ_{2}}{\sqrt{n}}} > \frac{t - μ_{2}}{\frac{σ_{2}}{\sqrt{n}}}),

(10)

indicating that

{\hat{μ}}_{1} \overset{s t}{\leq} {\hat{μ}}_{2}

.

To achieve

σ_{1}^{2} \overset{s t}{\leq} σ_{2}^{2}

, it suffices to show that for all t,

p ({\hat{σ}}_{1}^{2} > t) \leq p ({\hat{σ}}_{2}^{2} > t)

. To this end, we can consider

p (\frac{n {\hat{σ}}_{1}^{2}}{σ_{1}^{2}} > \frac{n t}{σ_{1}^{2}}) \leq p (\frac{n {\hat{σ}}_{2}^{2}}{σ_{2}^{2}} > \frac{n t}{σ_{2}^{2}}) .

(11)

Because

{\hat{σ}}^{2} = \frac{1}{n} \sum_{i = 1}^{n} {(X_{i} - \bar{X})}^{2}

, it follows that

\frac{n {\hat{σ}}^{2}}{σ^{2}}

follows a chi-squared distribution with n degrees of freedom. Therefore,

p (\frac{n {\hat{σ}}_{1}^{2}}{σ_{1}^{2}} > \frac{n t}{σ_{1}^{2}}) = \int_{\frac{n t}{σ_{1}^{2}}}^{\infty} \frac{x^{\frac{n}{2} - 1} e^{- \frac{x}{2}}}{2^{\frac{n}{2}} Γ (\frac{n}{2})} d x,

(12)

p (\frac{n {\hat{σ}}_{2}^{2}}{σ_{2}^{2}} > \frac{n t}{σ_{2}^{2}}) = \int_{\frac{n t}{σ_{2}^{2}}}^{\infty} \frac{y^{\frac{n}{2} - 1} e^{- \frac{y}{2}}}{2^{\frac{n}{2}} Γ (\frac{n}{2})} d y .

(13)

Given that

σ_{1}^{2} \leq σ_{2}^{2}

, it follows that

\frac{n t}{σ_{1}^{2}} \geq \frac{n t}{σ_{2}^{2}}

; thus,

\int_{\frac{n t}{σ_{1}^{2}}}^{\infty} \frac{x^{\frac{n}{2} - 1} e^{- \frac{x}{2}}}{2^{\frac{n}{2}} Γ (\frac{n}{2})} d x \leq \int_{\frac{n t}{σ_{2}^{2}}}^{\infty} \frac{y^{\frac{n}{2} - 1} e^{- \frac{y}{2}}}{2^{\frac{n}{2}} Γ (\frac{n}{2})} d y,

(14)

which implies

p (\frac{n {\hat{σ}}_{1}^{2}}{σ_{1}^{2}} > \frac{n t}{σ_{1}^{2}}) \leq p (\frac{n {\hat{σ}}_{2}^{2}}{σ_{2}^{2}} > \frac{n t}{σ_{2}^{2}}),

(15)

confirming that

{\hat{σ}}_{1}^{2} \overset{s t}{\leq} {\hat{σ}}_{2}^{2}

. □

3. Comparison of MLEs under Censored Samples

3.1. Generalization of the Exponential Distribution

Theorem 3.

Assuming that the exponential distributions of populations X and Y are independent, the parameters are

θ_{1}

and

θ_{2}

, respectively. Suppose that

X_{1}

,

X_{2}

,…,

X_{n}

are lifetimes following the exponential distribution with parameter

θ_{1}

, while

Y_{1}

,

Y_{2}

,…,

Y_{n}

are lifetimes following the exponential distribution with parameter

θ_{2}

. Given

(0 < t < + \infty)

, let

r =

be the number of

X_{i}

that satisfies

X_{i} \leq t

and

s =

be the number of

Y_{i}

that satisfies

Y_{i} \leq t

,

r \leq n

,

s \leq n

. Moreover,

\hat{θ_{1}}

and

\hat{θ_{2}}

are maximum likelihood estimators of

θ_{1}

and

θ_{2}

. Suppose that:

(I): $\int_{t}^{b} g (x, θ) d x$ increases with respect to θ, while $g (x, θ)$ and its derivatives are continuous and $\int_{t}^{b} \frac{𝜕}{𝜕 θ} g (x, θ) d x$ decreases with respect to θ;
(II): $r \leq s$ , $△_{x} = \frac{[\int_{t}^{b} \frac{𝜕}{𝜕 θ} g (x, θ) d x] |_{θ = \hat{θ_{1}}}}{\int_{t}^{b} g (x, \hat{θ_{1}}) d x}$ , $△_{z} = \frac{[\int_{t}^{b} \frac{𝜕}{𝜕 θ} g (z, θ) d z] |_{θ = \hat{θ_{2}}}}{\int_{t}^{b} g (z, \hat{θ_{2}}) d z}$ , $\frac{Δ_{z}}{Δ_{x}} \geq \frac{n - r}{n - s}$ ;

If

θ_{1} < θ_{2} \in Θ

, then there is (a)

X \overset{s t}{\leq} Y

and (b)

\hat{θ_{1}} \overset{s t}{\leq} \hat{θ_{2}}

.

Proof.

We shall first prove (a). For any

t \in (a, b)

, we have

\frac{f (t, θ_{2})}{f (t, θ_{1})} = \frac{h (θ_{2}) g (t, θ_{2})}{h (θ_{1}) g (t, θ_{1})} .

(16)

Note that condition (I) implies that

\frac{𝜕}{𝜕 t} ln g (t, θ_{2}) \geq \frac{𝜕}{𝜕 t} ln g (t, θ_{1}), \forall t \in (a, b),

(17)

which is equivalent to

\frac{𝜕}{𝜕 t} ln (\frac{g (t, θ_{2})}{g (t, θ_{1})}) \geq 0 .

(18)

From (1) and (2), we can see that

f (t, θ_{2}) / f (t, θ_{1})

increases in

t \in (a, b)

. This means that X is smaller than Y in the likelihood ratio order denoted by

X \overset{l r}{\leq} Y

. This order further yields

X \overset{s t}{\leq} Y

; hence, (a) is proved.

Because

X_{1} \dots, X_{r}

and

Y_{1} \dots, Y_{s}

are censored samples, they are independent and identically distributed; in addition,

X_{i} \overset{s t}{\leq} Y_{i}

is known from conclusion (a), and there is a random variable

Z_{i} (i = 1, 2, \dots, s)

known by the coupling method that makes

X_{i} \leq Z_{i}

point by point, where

Z_{i}

and

Y_{i}

have the same distribution.

If n products are put into the timing truncation experiment, the truncation time is t and the timing truncation sample

0 \leq x_{1} \leq x_{2} \dots \leq x_{r} \leq t

is obtained, that is, r products fail in the time interval

[0, t]

and

n - r

do not fail at time

x_{r}

. The probability of a product failure in the interval

(x_{i}, x_{i} + d x_{i})

is approximately

f (x_{i}, θ) d x_{i}

,

(i = 1, 2 \dots r)

. The probability of the remaining

n - r

product life exceeding

x_{r}

is

\int_{t}^{b} {f (x, θ) d x)}^{n - r}

.

Therefore, the likelihood function of the sample is

L (x, θ) = \prod_{i = 1}^{r} f (x_{i}, θ) {(\int_{t}^{b} f (x, θ) d x)}^{n - r} = {[h (θ)]}^{n} \prod_{i = 1}^{r} g (x_{i}, θ) {(\int_{t}^{b} g (x, θ) d x)}^{n - r},

i.e.,

\frac{𝜕}{𝜕 θ} ln L (x, θ) = n \frac{h^{'} (θ)}{h (θ)} + \sum_{i = 1}^{r} \frac{g_{θ}^{'} (x_{i}, θ)}{g (x_{i}, θ)} + (n - r) \frac{\frac{𝜕}{𝜕 θ} \int_{t}^{b} g (x, θ) d x}{\int_{t}^{b} g (x, θ) d x} .

(19)

Thus, we can know that

\hat{θ_{1}} (X)

is determined by

n \frac{h^{'} (θ_{1})}{h (θ_{1})} + \sum_{i = 1}^{r} \frac{g_{θ}^{'} (x_{i}, θ_{1})}{g (x_{i}, θ_{1})} + (n - r) \frac{\frac{𝜕}{𝜕 θ} \int_{t}^{b} g (x, θ) d x}{\int_{t}^{b} g (x, θ) d x}, |_{θ = θ_{1}} = 0

(20)

while

\hat{θ_{2}} (Z)

is determined by

n \frac{h^{'} (θ_{2})}{h (θ_{2})} + \sum_{i = 1}^{s} \frac{g_{θ}^{'} (z_{i}, θ_{2})}{g (z_{i}, θ_{2})} + (n - s) \frac{\frac{𝜕}{𝜕 θ} \int_{t}^{b} g (z, θ) d z}{\int_{t}^{b} g (z, θ) d z} . |_{θ = θ_{2}} = 0

(21)

To prove

(b)

, it is sufficient to show that

\hat{θ_{1}} (X) \leq \hat{θ_{2}} (Z)

is pointwise, that is,

p (\hat{θ_{1}} (X) \leq \hat{θ_{2}} (Z)) = 1 .

(22)

Using the coupling method, we obtain

\hat{θ_{1}} \equiv \hat{θ_{1}} (X) \overset{s t}{\leq} \hat{θ_{2}} (Y) \equiv \hat{θ_{2}}

.

Now, we shall prove that

\hat{θ_{1}} \leq \hat{θ_{2}} (Z)

pointwise. Let us first assume the contrary, that is, that

\hat{θ_{1}} (X) > \hat{θ_{2}} (Z)

holds on a set of positive probabilities. From condition 1, the function

h^{'} (θ) / h (θ)

decreases in

θ \in Θ

; hence, it follows that on the set

\hat{θ_{1}} > \hat{θ_{2}} (Z)

we have

\frac{h^{'} (\hat{θ_{1}})}{h (\hat{θ_{1}})} \leq \frac{h^{'} (\hat{θ_{2}})}{h (\hat{θ_{2}})} .

(23)

Inequalities (19), (21), and (23) imply that

\frac{1}{n} \sum_{i = 1}^{r} \frac{g_{θ}^{'} (x_{i}, \hat{θ_{1}})}{g (x_{i}, \hat{θ_{1}})} + \frac{(n - r)}{n} \frac{[\int_{t}^{b} \frac{𝜕}{𝜕 θ} g (x, θ) d x] |_{θ = \hat{θ_{1}}}}{\int_{t}^{b} g (x, \hat{θ_{1}}) d x} \geq \frac{1}{n} \sum_{i = 1}^{s} \frac{g_{θ}^{'} (z_{i}, \hat{θ_{2}})}{g (z_{i}, \hat{θ_{2}})} + \frac{(n - s)}{n} \frac{[\int_{t}^{b} \frac{𝜕}{𝜕 θ} g (z, θ) d z] |_{θ = \hat{θ_{2}}}}{.} \int_{t}^{b} g (z, \hat{θ_{2}}) d z

(24)

From condition (I), it can be seen that

\frac{𝜕}{𝜕 θ} ln g (z, θ) \geq \frac{𝜕}{𝜕 θ} ln g (x, θ), \forall θ \in Θ, \forall x < z \in (a, b),

and consequently that

\frac{g \overset{´}{_{θ}} (x_{i}, θ)}{g (x_{i}, θ)} \leq \frac{g \overset{´}{_{θ}} (z_{i}, θ)}{g (z_{i}, θ)}, \forall θ \in Θ, \forall x < z \in (a, b) .

(25)

Because

X_{i} \leq Z_{i}

pointwise

(1 \leq i \leq n)

, from (25) it follows that

\frac{g \overset{´}{_{θ}} (X_{i}, \hat{θ_{1}})}{g (X_{i}, \hat{θ_{1}})} \leq \frac{g \overset{´}{_{θ}} (Z_{i}, \hat{θ_{1}})}{g (Z_{i}, \hat{θ_{1}})} .

(26)

On the other hand, it is true that

\frac{g \overset{´}{_{θ}} (Z_{i}, \hat{θ_{1}})}{g (Z_{i}, \hat{θ_{1}})} \leq \frac{g \overset{´}{_{θ}} (Z_{i}, \hat{θ_{2}} (Z))}{g (Z_{i}, \hat{θ_{2}})} .

(27)

Because Lemma 1 implies that

g {\overset{´}{}}_{θ} (t, θ) / g (t, θ)

decreases in

θ \in Θ

for any

t \in (a, b)

and

\hat{θ_{1}} > \hat{θ_{2}} (Z)

, by combining (26) and (27) we obtain

\frac{g \overset{´}{_{θ}} (X_{i}, \hat{θ_{1}})}{g (X_{i}, \hat{θ_{1}})} \leq \frac{g \overset{´}{_{θ}} (Z_{i}, \hat{θ_{2}} (Z))}{g (Z_{i}, \hat{θ_{2}})},

and consequently,

\sum_{i = 1}^{n} \frac{g \overset{´}{_{θ}} (X_{i}, \hat{θ_{1}})}{g (X_{i}, \hat{θ_{1}})} \leq \sum_{i = 1}^{n} \frac{g \overset{´}{_{θ}} (Z_{i}, \hat{θ_{2}} (Z))}{g (Z_{i}, \hat{θ_{2}} (Z))} .

(28)

Because of

r \leq s

, we have

\frac{1}{n} \sum_{i = 1}^{r} \frac{g \overset{´}{_{θ}} (x_{i}, \hat{θ_{1}})}{g (x_{i}, \hat{θ_{1}})} \leq \frac{1}{n} \sum_{i = 1}^{s} \frac{g \overset{´}{_{θ}} (z_{i}, \hat{θ_{2}})}{g (z_{i}, \hat{θ_{2}})} .

(29)

According to condition (I),

\int_{x}^{b} g (x, θ) d x

is increased with respect to

θ

and

\hat{θ_{1}} > \hat{θ_{2}}

; thus, we have

\int_{t}^{b} g (x, \hat{θ_{1}}) d x \geq \int_{t}^{b} g (z, \hat{θ_{2}}) d z .

Also from condition (I),

\int_{x}^{b} \frac{𝜕}{𝜕 θ} g (x, θ) d x

decreases with respect to

θ

and

\hat{θ_{1}} > \hat{θ_{2}}

; thus, we have

[\int_{t}^{b} \frac{𝜕}{𝜕 θ} g (x, θ) d x] |_{θ = \hat{θ_{1}}} \leq \int_{t}^{b} \frac{𝜕}{𝜕 θ} g (z, θ) d z] |_{θ = \hat{θ_{2}}} .

From Condition (II),

\frac{Δ_{z}}{Δ_{x}} \geq \frac{n - r}{n - s}

; thus, it is apparent that

\frac{n - r}{n} \frac{[\int_{t}^{b} \frac{𝜕}{𝜕 θ} g (x, θ) d x] |_{θ = \hat{θ_{1}}}}{\int_{t}^{b} g (x, \hat{θ_{1}}) d x} \leq \frac{n - s}{n} \frac{\int_{t}^{b} \frac{𝜕}{𝜕 θ} g (z, θ) d z] |_{θ = \hat{θ_{2}}}}{\int_{t}^{b} g (z, \hat{θ_{2}}) d z} .

(30)

Because at least one of conditions (I) or (II) is strictly established, we can combine (29) and (30) to obtain

\frac{1}{n} \sum_{i = 1}^{r} \frac{g_{θ}^{'} (x_{i}, \hat{θ_{1}})}{g (x_{i}, \hat{θ_{1}})} + \frac{(n - r)}{n} \frac{[\int_{t}^{b} \frac{𝜕}{𝜕 θ} g (x, θ) d x] |_{θ = \hat{θ_{1}}}}{\int_{t}^{b} g (x, \hat{θ_{1}}) d x} < \frac{1}{n} \sum_{i = 1}^{s} \frac{g_{θ}^{'} (z_{i}, \hat{θ_{2}})}{g (z_{i}, \hat{θ_{2}})} + \frac{(n - s)}{n} \frac{[\int_{t}^{b} \frac{𝜕}{𝜕 θ} g (z, θ) d z] |_{θ = \hat{θ_{2}}}}{\int_{t}^{b} g (z, \hat{θ_{2}}) d z} .

(31)

This contradicts (24), meaning that the hypothesis does not hold; thus,

\hat{θ_{1}} (X) \leq \hat{θ_{2}} (Z)

is further obtained by

\hat{θ_{1}} \overset{s t}{\leq} \hat{θ_{2}}

, and

(b)

is proved. □

Example 1.

For the gamma density function

f (x, α) = \frac{λ^{α}}{Γ (α)} x^{α - 1} e^{- λ α}, x > 0, α > 0

, where

λ > 0

is known, it can be verified that conditions (I) and (II) in Theorem 1 are satisfied. Let

h (α) = \frac{λ^{α}}{Γ (α)}

; then,

\frac{d^{2}}{d α^{2}} ln h (α) = - \frac{d}{d α} \frac{Γ^{'} (α)}{d α Γ (α)} = - \sum_{j = 0}^{\infty} \frac{1}{{(α + j)}^{2}} < 0, \forall α > 0 .

Therefore, the conditions in Theorem 3 are all satisfied, and if

α_{1} \leq α_{2}

, then there is

\hat{α_{1}} \overset{s t}{\leq} \hat{α_{2}}

.

Example 2

(Beta Distribution). Consider the beta density function

f (x, α) = \frac{Γ (α + β)}{Γ (α) Γ (β)} x^{α - 1} {(1 - x)}^{β - 1}, 0 < x < 1, α > 0

with known

β > 0

. This is an exponential family density with

h (α) = \frac{Γ (α + β)}{Γ (α) Γ (β)}

. It can be verified that all the conditions of Theorem 3 are satisfied. In particular, we have

\frac{d^{2}}{d α^{2}} ln h (α) = \frac{d}{d α} [\frac{Γ^{'} (α + β)}{Γ (α + β)} - \frac{Γ^{'} (α)}{Γ (α)}]

= \sum_{j = 1}^{\infty} \frac{1}{{(α + β + j)}^{2}} - \sum_{j = 1}^{\infty} \frac{1}{{(α + j)}^{2}} < 0 .

Hence,

0 < α_{1} < α_{2}

implies that

\hat{α_{1}} \overset{s t}{\leq} \hat{α_{2}}

. Similarly, assuming that α is known, it can be shown that

0 < β_{1} < β_{2}

implies the maximum likelihood estimators

\hat{β_{1}} \overset{s t}{\leq} \hat{β_{2}}

.

3.2. Location and Scale Family

It can be seen that Theorem 1 cannot be applied to the exponential density function

f (x, θ) = \frac{1}{θ} e^{- \frac{x}{θ}}, x > 0, θ > 0

.

To better study the randomness of the maximum likelihood estimator under censored samples, Theorem 3 needs to be further improved.

Assuming that

g (x)

is a probability density function defined on the interval

(0, \infty)

, then for any

θ \in Θ \subseteq (0, + \infty), f (x, θ) \equiv \frac{1}{θ} g (\frac{x}{θ})

, the density function set

{f (x, θ) = \frac{1}{θ} g (\frac{x}{θ}), θ \in Θ \subseteq (0, + \infty)}

(32)

is called the scale distribution family.

Theorem 4.

Let

{f (x, θ) \equiv \frac{1}{θ} g (\frac{x}{θ}), θ \in Θ}

:

(i): $\frac{x g^{'} (x)}{g (x)}$ is strictly increasing with respect to $x > 0$ ;
(ii): $r \leq s$ , $△_{x} = \frac{\frac{𝜕}{𝜕 θ} [\int_{t}^{b} g (\frac{x}{θ}) d x] |_{θ = \hat{θ_{1}}}}{\int_{t}^{b} g (\frac{x}{\hat{θ_{1}}}) d x}$ , $△_{y} = \frac{\frac{𝜕}{𝜕 θ} [\int_{t}^{b} g (\frac{y}{θ}) d y] |_{θ = \hat{θ_{2}}}}{\int_{t}^{b} g (\frac{y}{\hat{θ_{2}}}) d y}$ , $\frac{Δ_{y}}{Δ_{x}} \geq \frac{(n - r) \hat{θ_{1}}}{(n - s) \hat{θ_{2}}}$ ;

If

θ_{1} < θ_{2} \in Θ

, then (a)

X \overset{s t}{\leq} Y

; further, if

g (x)

is either strictly log-concave or strictly log-convex, then (b)

\hat{θ_{1}} \overset{s t}{\leq} \hat{θ_{2}}

.

Proof.

We shall first prove (a).

Let

X_{1} \dots X_{r}

be an independent and identically distributed random variable with the common probability density

f (x, θ_{1})

. Furthermore, let

Y_{1} \dots Y_{s}

be an independent and identically distributed random variable with common probability density

f (y, θ_{2})

.

Using the coupling method, it can be seen that there is an independent and identically distributed random variable

Z_{i} (1 \leq i \leq s)

such that

X_{i} \leq Z_{i}

pointwise; moreover,

Z_{i}

has the same distribution as

Y_{i} (1 \leq i \leq s)

. Then,

X_{1} \dots X_{r}

and

Z_{1} \dots Z_{s}

can be proved as in Theorem 1, providing

X \overset{s t}{\leq} Y

.

Without loss of generality, we simply assume that

X_{i} \leq Y_{i}

pointwise; thus, we need to show that

\hat{θ_{1}} \leq \hat{θ_{2}}

pointwise.

Using the contradiction method, suppose that

\hat{θ_{1}} > \hat{θ_{2}}

; then, in the set

{\hat{θ_{1}} > \hat{θ_{2}}}

, we have

\frac{X_{i}}{\hat{θ_{1}}} < \frac{Y_{i}}{\hat{θ_{2}}}

.

From (32), under the condition of censored samples (assuming the censored number is r),

\hat{θ_{1}}

is determined by

n + \sum_{i = 1}^{r} \frac{g \overset{´}{(} \frac{x_{i}}{θ_{1}})}{g (\frac{x_{i}}{θ_{1}})} \frac{x_{i}}{θ_{1}} + (n - r) \frac{\frac{𝜕}{𝜕 θ} [\int_{t}^{b} g (\frac{x}{θ}) d x] |_{θ = θ_{1}}}{\int_{t}^{b} g (\frac{x}{θ_{1}}) d x} θ_{1} = 0

(33)

and

\hat{θ_{2}}

is determined by

n + \sum_{i = 1}^{s} \frac{g \overset{´}{(} \frac{y_{i}}{θ_{2}})}{g (\frac{y_{i}}{θ_{2}})} \frac{y_{i}}{θ_{2}} + (n - s) \frac{\frac{𝜕}{𝜕 θ} [\int_{t}^{b} g (\frac{y}{θ}) d y] |_{θ = θ_{2}}}{\int_{t}^{b} g (\frac{y}{θ_{2}}) d y} θ_{2} = 0,

(34)

that is, it always holds that

\sum_{i = 1}^{r} \frac{g \overset{´}{(} \frac{x_{i}}{\hat{θ_{1}}})}{g (\frac{x_{i}}{\hat{θ_{1}}})} \frac{x_{i}}{\hat{θ_{1}}} + (n - r) \frac{\frac{𝜕}{𝜕 θ} [\int_{t}^{b} g (\frac{x}{θ}) d x] |_{θ = \hat{θ_{1}}}}{\int_{t}^{b} g (\frac{x}{\hat{θ_{1}}}) d x} \hat{θ_{1}} = \sum_{i = 1}^{s} \frac{g \overset{´}{(} \frac{y_{i}}{\hat{θ_{2}}})}{g (\frac{y_{i}}{\hat{θ_{2}}})} \frac{y_{i}}{\hat{θ_{2}}} + (n - s) \frac{\frac{𝜕}{𝜕 θ} [\int_{t}^{b} g (\frac{y}{θ}) d y] |_{θ = \hat{θ_{2}}}}{\int_{t}^{b} g (\frac{y}{\hat{θ_{2}}}) d y} \hat{θ_{2}} .

(35)

Because

\frac{x g' (x)}{g (x)}

is strictly increasing and

\frac{X_{i}}{\hat{θ_{1}}} < \frac{Y_{i}}{\hat{θ_{2}}}

, we have

\frac{g' (\frac{x_{i}}{\hat{θ_{1}}})}{g (\frac{x_{i}}{\hat{θ_{1}}})} \frac{x_{i}}{\hat{θ_{1}}} < \frac{g' (\frac{y_{i}}{\hat{θ_{2}}})}{g (\frac{y_{i}}{\hat{θ_{2}}})} \frac{y_{i}}{\hat{θ_{2}}}

(36)

and

r \leq s

; thus,

\sum_{i = 1}^{r} \frac{g' (\frac{x_{i}}{\hat{θ_{1}}})}{g (\frac{x_{i}}{\hat{θ_{1}}})} \frac{x_{i}}{\hat{θ_{1}}} < \sum_{i = 1}^{s} \frac{g' (\frac{y_{i}}{\hat{θ_{2}}})}{g (\frac{y_{i}}{\hat{θ_{2}}})} \frac{y_{i}}{\hat{θ_{2}}} .

(37)

From the condition

\frac{Δ_{y}}{Δ_{x}} \geq \frac{(n - r) \hat{θ_{1}}}{(n - s) \hat{θ_{2}}}

we can obtain

(n - r) \frac{\frac{𝜕}{𝜕 θ} [\int_{t}^{b} g (\frac{x}{θ}) d x] |_{θ = \hat{θ_{1}}}}{\int_{t}^{b} g (\frac{x}{\hat{θ_{1}}}) d x} \hat{θ_{1}} \leq (n - s) \frac{\frac{𝜕}{𝜕 θ} [\int_{t}^{b} g (\frac{y}{θ}) d y] |_{θ = \hat{θ_{2}}}}{\int_{t}^{b} g (\frac{y}{\hat{θ_{2}}}) d y} \hat{θ_{2}} .

(38)

Inequalities (37) and (38) imply that

\sum_{i = 1}^{r} \frac{g \overset{´}{(} \frac{x_{i}}{\hat{θ_{1}}})}{g (\frac{x_{i}}{\hat{θ_{1}}})} \frac{x_{i}}{\hat{θ_{1}}} + (n - r) \frac{\frac{𝜕}{𝜕 θ} [\int_{t}^{b} g (\frac{x}{θ}) d x] |_{θ = \hat{θ_{1}}}}{\int_{t}^{b} g (\frac{x}{\hat{θ_{1}}}) d x} \hat{θ_{1}} < \sum_{i = 1}^{s} \frac{g \overset{´}{(} \frac{y_{i}}{\hat{θ_{2}}})}{g (\frac{y_{i}}{\hat{θ_{2}}})} \frac{y_{i}}{\hat{θ_{2}}} + (n - s) \frac{\frac{𝜕}{𝜕 θ} [\int_{t}^{b} g (\frac{y}{θ}) d y] |_{θ = \hat{θ_{2}}}}{\int_{t}^{b} g (\frac{y}{\hat{θ_{2}}}) d y} \hat{θ_{2}} .

(39)

As (35) and (39) are contradictory, the hypothesis does not hold; therefore,

\hat{θ_{1}} \overset{s t}{\leq} \hat{θ_{2}}

is proved. □

Example 3.

Consider the function

f (x, θ) = \frac{x^{α}}{Γ (α) θ^{α}} e^{- \frac{x}{θ}}, x > 0, θ > 0

with known

α > 0

and

f (x, θ)

satisfying the conditions in Theorem 4. Therefore, if

0 < θ_{1} \leq θ_{2}

, then there is

\hat{θ_{1}} \overset{s t}{\leq} \hat{θ_{2}}

, In particular, when

α = 1

, it is an exponential distribution, which is also true.

Example 4.

The logistic distribution with density

f (x, μ) = \frac{exp (- (x - μ))}{{[1 + exp (- (x - μ))]}^{2}}

satisfies the conditions of Theorem 4; thus, if

μ_{1} < μ_{2}

, then

η_{1} \leq η_{2}

.

Now, let us investigate the scale family. Suppose that

g (x)

is a probability density function on

(0, \infty)

. For any

θ > 0

,

f (x, θ) = \frac{g (x / θ)}{θ}

forms a density on

(0, \infty)

. We refer to the collection of density functions

{f (x, θ) = \frac{g (x / θ)}{θ}, θ \in (0, \infty)}

as a scale family of distribution.

The scale parameter can be reduced to a location parameter by logarithmic transform. Therefore, the following result follows immediately from Theorem 4.

3.3. Numerical Examples

As part of this study, we conducted a simulation analysis on two exponential populations with parameters

θ = 2

and

θ = 5

. We employed censored data with a predetermined number of failures

r = 35

to obtain the maximum likelihood estimator

\hat{θ}

. For each of these two exponential distributions, 50 experimental units were tested and the required sample data were recorded to apply this method. This experiment was repeated 1000 times. Based on the 1000 values of

\hat{θ}

corresponding to each population under this method, the empirical cumulative distribution functions (ECDFs) of

\hat{θ}

were calculated.

The empirical cumulative distribution functions (ECDFs) of

\hat{θ}

associated with

θ = 2

and

θ = 5

obtained by applying censoring with

r = 35

are plotted in Figure 1.

Figure 1. Censoring with

r = 35

.

3.4. Data Analysis

We used numerical simulation to analyze a practical application as a case study. The Monte Carlo method was used to randomly generate samples from the exponential distributions of

θ_{1} = 2

and

θ_{2} = 5

with sample sizes of

n =

10, 50, 100, 200, and 500. The maximum likelihood estimates of parameters

θ_{1}

and

θ_{2}

, denoted as

{\hat{θ}}_{1}

and

{\hat{θ}}_{2}

, were obtained from each set of simulated data. The entire program was repeated 5000 times, with the bias and mean square error (MSE) used to evaluate the maximum likelihood estimates. The calculation formulas are as follows:

B i a s (θ_{j}) = \frac{1}{5000} \sum_{i = 1}^{5000} ({\hat{θ}}_{j} - θ_{j}), j = 1, 2, \dots

and

M S E (θ_{j}) = \frac{1}{5000} \sum_{i = 1}^{5000} {({\hat{θ}}_{j} - θ_{j})}^{2}, j = 1, 2, \dots .

Table 1 shows the maximum likelihood estimates, biases, and mean square errors for parameters

θ_{1} = 2

and

θ_{2} = 5

of the exponential distribution. It can be observed that the maximum likelihood estimates for both parameters approach the true values as the sample size increases. Furthermore, the biases and mean square errors decrease as the sample size increases, indicating that the maximum likelihood estimation exhibits good stability.

Table 1. Monte Carlo simulation results of the exponential distribution.

4. Two-Parameter Exponential Distribution

Theorem 5.

In addition, we performed a tailing experiment on two sets of samples with a capacity of n. Here, the tail number is m, the tailed samples

0 < x_{1} \leq x_{2} \leq \dots \leq x_{m}

obey the two-parameter exponential distribution

f (x, μ_{1}, θ_{1}) = \frac{1}{θ_{1}} exp \{- \frac{x - μ_{1}}{θ_{1}}\}, x \geq μ_{1},

and the tailed samples

0 < y_{1} \leq y_{2} \leq \dots \leq y_{m}

obey the two-parameter exponential distribution

f (y, μ_{2}, θ_{2}) = \frac{1}{θ_{2}} exp \{- \frac{y - μ_{2}}{θ_{2}}\}, y \geq μ_{2} .

If

μ_{1} \leq μ_{2}, θ_{1} \leq θ_{2},

then there is

{\hat{μ}}_{1} \overset{s t}{\leq} {\hat{μ}}_{2}, {\hat{θ}}_{1} \overset{s t}{\leq} {\hat{θ}}_{2},

where

{\hat{μ}}_{1}, {\hat{θ}}_{1}

are the censored maximum likelihood estimators obtained from

0 < x_{1} \leq x_{2} \leq \dots \leq x_{m}

and

{\hat{μ}}_{2}, {\hat{θ}}_{2}

are the censored maximum likelihood estimators obtained from

0 < y_{1} \leq y_{2} \leq \dots \leq y_{m}

.

Proof.

The likelihood function of

μ_{1}, θ_{1}

is as follows:

\begin{matrix} L (x_{1}, x_{2}, \dots, x_{m}, μ_{1}, θ_{1}) & = \prod_{i = 1}^{r} f (x_{i}, μ_{1}, θ_{1}) {(\int_{x_{m}}^{\infty} f (x, μ_{1}, θ_{1}) d x)}^{n - r} \\ = \frac{1}{θ_{1}^{r}} exp \{- \sum_{i = 1}^{r} \frac{x_{i} - μ_{1}}{θ_{1}}\} {(exp \{- \frac{x_{r} - μ_{1}}{θ_{1}}\})}^{n - r} \\ = \frac{1}{θ_{1}^{r}} exp \{- \frac{1}{θ_{1}} [\sum_{i = 1}^{r} x_{i} + (n - r) x_{r} - n μ_{1}]\} \end{matrix}

and its log-likelihood function is

\frac{𝜕 ln L (x_{1}, x_{2}, \dots, x_{m}, μ_{1}, θ_{1})}{𝜕 θ_{1}} = - \frac{r}{θ_{1}} + \frac{1}{θ_{1}^{2}} [\sum_{i = 1}^{r} x_{i} + (n - r) x_{r} - n μ_{1}] = 0 .

(40)

In order to make the likelihood function

L (x_{1}, x_{2}, \dots, x_{m}, μ_{1}, θ_{1})

reach the maximum value, we have

{\hat{μ}}_{1} = X_{1}

according to the context, and

{\hat{θ}}_{1} = \frac{1}{r} [\sum_{i = 1}^{r} X_{i} + (n - r) X_{r} - n X_{1}];

(41)

Similarly,

{\hat{θ}}_{2} = \frac{1}{r} [\sum_{i = 1}^{r} Y_{i} + (n - r) Y_{r} - n Y_{1}] .

(42)

Because

\begin{matrix} X_{1} & \sim f (x, μ_{1}, θ_{1}) = \frac{1}{θ_{1}} exp \{- \frac{x - μ_{1}}{θ_{1}}\} \\ Y_{1} & \sim f (y, μ_{2}, θ_{2}) = \frac{1}{θ_{2}} exp \{- \frac{y - μ_{2}}{θ_{2}}\} \end{matrix}

\begin{matrix} [\sum_{i = 1}^{r} X_{i} + (n - r) X_{r} - n X_{1}] & \sim Γ (r - 1, θ_{1}) \\ [\sum_{i = 1}^{r} Y_{i} + (n - r) Y_{r} - n Y_{1}] & \sim Γ (r - 1, θ_{2}) \end{matrix}

are given, we can use a proof method similar to that in Theorem 4 to obtain

{\hat{μ}}_{1} \overset{s t}{\leq} {\hat{μ}}_{2}, {\hat{θ}}_{1} \overset{s t}{\leq} {\hat{θ}}_{2} .

□

5. Illustrative Example

Assume that we have two types of batteries and that their lifetimes follow exponential distributions with parameters

λ_{1} = \frac{1}{100}

(i.e., the average lifetime is 100 h) and

λ_{2} = \frac{1}{120}

(i.e., the average lifetime is 120 h). We want to test the stochastic order of the lifetimes of the two types of batteries.

We can draw 50 samples from each distribution, but only record the lifetime data exceeding 80 h (censoring point

t = 80

h).

The following are the sample data (in hours) drawn from the two distributions, listing only the samples exceeding 80 h:

Battery Type 1:

λ_{1} = \frac{1}{100}

Samples: 82, 85, 90, 84, 81, 83, 85, 89, 88, 87, …

Battery Type 2:

λ_{2} = \frac{1}{120}

Samples: 81, 86, 92, 83, 85, 87, 90, 82, 84, 88, ….

For the exponential distribution, the MLE

λ

can be calculated using the following formula:

\hat{λ} = \frac{n}{\sum_{i = 1}^{n} x_{i}}

where n is the sample size and

x_{i}

are the sample data.

{\hat{λ}}_{1} = \frac{50}{82 + 85 + 90 + \dots}

{\hat{λ}}_{2} = \frac{50}{81 + 86 + 92 + \dots}

Assuming the calculations yield

{\hat{λ}}_{1} = \frac{1}{85},

{\hat{λ}}_{2} = \frac{1}{90},

we then need to verify whether

{\hat{λ}}_{1} \leq {\hat{λ}}_{2}

holds. Because

\frac{1}{85} > \frac{1}{90}

, it is implied that

{\hat{λ}}_{1} > {\hat{λ}}_{2}

, meaning that Battery Type 1 has a higher failure rate. As we are interested in the lifespan, we can conclude that Battery Type 2 has a longer lifespan.

Based on the maximum likelihood estimates, we can conclude that Battery Type 2 has a longer lifespan, satisfying the condition of stochastic order

{\hat{λ}}_{1} \leq {\hat{λ}}_{2}

.

This practical example demonstrates how to compare the lifespan distributions of batteries with different parameters using the MLEs from censored samples and infer differences in battery performance through stochastic order.

6. Conclusions

In this study, we have successfully validated two crucial theorems which establish that maximum likelihood estimators (MLEs) for parameters associated with location and exponential distribution families maintain stochastic ordering when estimated from censored samples provided that certain conditions are met. Furthermore, we provide practical examples that demonstrate the applicability of these theorems in real-world scenarios.

Author Contributions

Conceptualization, Y.L.; methodology, J.R., X.L. and P.L.; software (R language 4.3): J.R.; formal analysis, C.G.; data curation, P.L.; writing—original draft, J.R. and X.L.; writing—editing, C.G. and J.R.; project administration, Y.L., C.G. and J.R.; funding acquisition, J.R. All authors have read and agreed to the published version of the manuscript.

Funding

This paper was supported by the Shanghai University of Finance and Economics, Zhejiang College Educational Committee (Grant No. 2020 GR007).

Data Availability Statement

The data that support the findings of this study are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Brunk, H.D.; Barlow, R.E.; Bartholomew, D.J.; Bremner, J.M. International Statistical Review. In Statistical Inference under Order Restrictions; Wiley: Hoboken, NJ, USA, 1972; Volume 41, pp. 395–412. [Google Scholar]
Nuesch, P.E. Order Restricted Statistical Inference. J. Appl. Econom. 1991, 6, 105–107. [Google Scholar]
Viveros, R.; Balakrishnan, N. Interval estimation of parameters of life from progressively censored data. Technometrics 1994, 36, 84–91. [Google Scholar] [CrossRef]
Balakrishnan, N.; Brain, C.; Mi, J. Stochastic order and MLE of the mean of the exponential distribution. Methodol. Comput. Appl. Probab. 2001, 4, 83–93. [Google Scholar] [CrossRef]
Balakrishnan, N.; Mi, J. Order-preserving property of maximum likelihood estimator. J. Stat. Plan. Inference 2001, 98, 89–99. [Google Scholar] [CrossRef]
Bain, L.J.; Engelhardt, M. Interval estimation for the two-parameter double exponential distribution. Technometrics 1973, 15, 875–887. [Google Scholar] [CrossRef]
Kng, F.; Fei, H. Limit theorems for the maximum likelihood estimate under general multiply Type II censoring. Ann. Inst. Stat. Math. 1996, 48, 731–755. [Google Scholar]
Cohen, C.A.; Whitten, B. Modified maximum likelihood and modified moment estimators for the three-parameter Weibull distribution. Commun. Stat.-Theory Methods 1982, 11, 2631–2656. [Google Scholar] [CrossRef]
Prajapati, D.; Mitra, S.; Kundu, D. A new decision theoretic sampling plan for type-I and type-I hybrid censored samples from the exponential distribution. Sankhya B 2019, 81, 251–288. [Google Scholar] [CrossRef]
Krishnamoorthy, K.; Xia, Y. Confidence intervals for a two-parameter exponential distribution: One-and two-sample problems. Commun. Stat.-Theory Methods. 2018, 47, 935–952. [Google Scholar] [CrossRef]
Rényi, A. On the theory of order statistics. Acta Math. Acad. Sci. Hung. 1953, 4, 48–89. [Google Scholar] [CrossRef]
Kundu, D.; Kannan, N.; Balakrishnan, N. Analysis of progressively censored competing risks data. Handb. Stat. 2003, 23, 331–348. [Google Scholar]
Guo, M.-Y.; Zhang, J.; Yan, R. Stochastic comparisons of second largest order statistics with dependent heterogeneous random variables. Commun. Stat.-Theory Methods 2024, 1–19. [Google Scholar] [CrossRef]
Qiu, G.; Raqab, M. On weighted extropy of ranked set sampling and its comparison with simple random sampling counterpart. Commun. Stat.-Theory Methods 2024, 53, 378–395. [Google Scholar] [CrossRef]
Crescenzo, A.D.; Paolillo, L.; Suárez-Llorens, A. Stochastic comparisons, differential entropy and varentropy for distributions induced by probability density functions. Metrika 2024, 1–17. [Google Scholar] [CrossRef]
Bartholomew, D.J. A problem in life testing. J. Am. Stat. Assoc. 1957, 52, 350–355. [Google Scholar] [CrossRef]
Al-Athari, M.F.M. Estimation of the mean of truncated exponential distribution. J. Math. Stat. 2008, 4, 284. [Google Scholar] [CrossRef]
Weißbach, R.; Wied, D. Truncating the exponential with a uniform distribution. Stat. Pap. 2022, 63, 1247–1270. [Google Scholar] [CrossRef]
Hannon, P.M.; Dahiya, R.C. Estimation of parameters for the truncated exponential distribution. Commun. Stat.-Theory Methods. 1999, 28, 2591–2612. [Google Scholar] [CrossRef]
Hu, Y.-H.; Emura, T. Maximum likelihood estimation for a special exponential family under random double-truncation. Comput. Stat. 2015, 30, 1199–1229. [Google Scholar] [CrossRef]
Sabti, A.N.; Ansseif, A.A.l.; Shakir, A.M. Estimating the Reliability for the Sequential System of Two Truncated Exponential Distribution. Ind. Eng. Manag. Syst. 2021, 20, 455–463. [Google Scholar] [CrossRef]
Raschke, M. Inference for the truncated exponential distribution. Stoch. Environ. Res. Risk Assess. 2012, 26, 127–138. [Google Scholar] [CrossRef]
Blumenthal, S.; Dahiya, R.C. Estimating scale and truncation parameters for the truncated exponential distribution with type-I censored sampling. Commun. Stat.-Theory Methods 2005, 34, 1–21. [Google Scholar] [CrossRef]
Suich, R.; Rutemiller, H.C. Point Estimation of the Parameter of a Truncated Exponential Distribution. IEEE Trans. Reliab. 1982, 31, 393–397. [Google Scholar] [CrossRef]
Akahira, M. Statistical Estimation for Truncated Exponential Families; Springer: Berlin/Heidelberg, Germany, 2017. [Google Scholar]
Kumar, D.; Dey, S.; Nadarajah, S. Extended exponential distribution based on order statistics. Commun. Stat.-Theory Methods 2017, 46, 9166–9184. [Google Scholar] [CrossRef]

Figure 1. Censoring with

r = 35

.

Table 1. Monte Carlo simulation results of the exponential distribution.

Par.	n	MLE	Bias	MSE
$θ_{1} = 2$	10	2.7759	0.1114	0.2273
	50	2.2050	0.0251	0.0170
	100	2.0848	0.0117	0.0056
	200	2.0243	0.0042	0.0023
	500	1.9884	−0.0002	0.0008
$θ_{2} = 5$	10	5.9381	0.1550	0.7082
	50	5.4196	0.1529	0.3287
	100	5.0841	0.0140	0.0325
	200	5.0356	0.0077	0.0130
	500	4.9924	−0.0072	0.0052

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

A Comparison of MLE for Some Index Distributions Based on Censored Samples

Abstract

1. Introduction

2. Comparison of MLE under Complete Samples

2.1. One-Parameter Exponential Distribution

2.2. Two-Parameter Exponential Distribution

2.3. Normal Distribution

3. Comparison of MLEs under Censored Samples

3.1. Generalization of the Exponential Distribution

3.2. Location and Scale Family

3.3. Numerical Examples

3.4. Data Analysis

4. Two-Parameter Exponential Distribution

5. Illustrative Example

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics