Abstract
In many life-testing and reliability studies, the experimenter might not always obtain complete information on failure times for all experimental units. Among the different censoring schemes, the progressive censoring scheme has received considerable attention in the last few years. The aim of this paper is to simplify the computation of the entropy of progressively Type II censored samples. We propose an indirect approach that uses a decomposition of the entropy of progressively Type II censored samples to simplify the calculation. Some recurrence relations for the entropy of progressively Type II censored samples are derived to facilitate this calculation. An efficient computational method is derived that reduces the computation of the entropy of progressively Type II censored samples to a sum of entropies of smallest order statistics of varying sample sizes. We compute the entropy of collections of progressively Type II censored samples for some known distributions.
1. Introduction
Information theory provides an intuitive tool to measure the uncertainty of random variables and the information shared by them, in which the entropy and the mutual information are two critical concepts.
Let X be a random variable with cumulative distribution function (cdf) F(x) and probability density function (pdf) f(x). The differential entropy of the random variable X is defined by Cover and Thomas [1] to be

H(X) = −∫ f(x) log f(x) dx.
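As a quick numerical illustration of this definition (our sketch, not part of the paper), the integral can be approximated by simple quadrature; for the standard exponential density f(x) = e^{−x}, the closed-form entropy is 1.

```python
import math

def differential_entropy(pdf, lo, hi, pts=200_000):
    """Approximate H(X) = -integral of f(x) log f(x) dx by the trapezoidal rule."""
    h = (hi - lo) / pts
    total = 0.0
    for i in range(pts + 1):
        x = lo + i * h
        f = pdf(x)
        g = -f * math.log(f) if f > 1e-300 else 0.0
        total += g / 2 if i in (0, pts) else g
    return total * h

# standard exponential: closed-form entropy is 1
h_exp = differential_entropy(lambda x: math.exp(-x), 0.0, 60.0)
print(round(h_exp, 4))  # 1.0
```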
Let us consider a life-testing experiment where n units are kept under observation until failure. These units could be systems, components, or computer chips in reliability experiments, or they could be patients put under certain drug or clinical conditions. Suppose the life lengths of these n units are independent and identically distributed random variables with a common cdf F(x) and pdf f(x). Data collected from such experiments are called the order statistics sample
X_{1:n} ≤ X_{2:n} ≤ … ≤ X_{n:n},
where X_{r:n} is called the rth order statistic (OS).
For some reason, suppose that we have to terminate the experiment before all items have failed. For example, individuals in a clinical trial may drop out of the study, or the study may have to be terminated for lack of funds. In an industrial experiment, units may break accidentally. There are, however, many situations in which the removal of units prior to failure is pre-planned. One of the main reasons for this is to save time and cost associated with testing. Data obtained from such experiments are called censored data.
The most common censoring schemes are Type I and Type II censoring. In conventional Type I censoring, the experiment continues up to a prespecified time T; any failures that occur after T are not observed. The termination point T of the experiment is assumed to be independent of the failure times. In conventional Type II censoring, the experimenter terminates the experiment after a prespecified number of items fail; in this scenario, only that number of the smallest lifetimes is observed. In Type I censoring, the number of failures observed is random and the endpoint of the experiment is fixed, whereas in Type II censoring the endpoint is random while the number of failures is fixed.
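A small simulation sketch (our own, with illustrative values n = 10, T = 1.0, r = 4) makes the contrast concrete: Type I censoring fixes the endpoint and observes a random number of failures, while Type II censoring fixes the number of failures and observes a random endpoint.

```python
import random

random.seed(7)
n = 10
lifetimes = sorted(random.expovariate(1.0) for _ in range(n))

# Type I: observe only failures occurring before the fixed time T
T = 1.0
type1 = [t for t in lifetimes if t <= T]   # random number of observations

# Type II: observe only the r smallest lifetimes
r = 4
type2 = lifetimes[:r]                      # always exactly r observations

print(len(type2))  # 4
```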
Park [2] studied the entropy of Type II censored samples. Park [3] considered testing exponentiality based on the Kullback-Leibler information with Type II censored data. The entropy of a single order statistic and of a complete sample of order statistics has been studied by Wong and Chen [4] and Ebrahimi et al. [5].
Here we consider progressively Type II censored schemes. Among the different censoring schemes, the progressive censoring scheme has received considerable attention in the last few years, particularly in reliability analysis. It is a more general censoring mechanism than the traditional Type I and Type II censoring [6]. The recent review article by Balakrishnan [7] provides details on progressive censoring schemes and on their different applications. This paper is concerned with simplifying the calculation of the entropy of progressively Type II censored data from an i.i.d. random sample of size n. The extension to progressively Type II censored data is not straightforward, because the joint entropy of progressively Type II censored data is an n-dimensional integral; besides, the removals cause additional complications.
Following Balakrishnan and Aggarwala [8], progressively Type II censored samples can be described as follows. Let n units be placed in test at time zero.
- The number of observed failures m and the censoring scheme (R_1, R_2, …, R_m) are fixed prior to the test.
- At the first failure, R_1 units are randomly removed from the remaining n − 1 surviving units.
- At the second failure, R_2 units are randomly removed from the remaining n − R_1 − 2 units.
- The test continues until the mth failure, when all remaining R_m = n − m − R_1 − … − R_{m−1} units are removed from the experiment, so the life testing stops at the mth failure.
- The observed failure times X_{1:m:n} ≤ X_{2:m:n} ≤ … ≤ X_{m:m:n} constitute the progressively Type II censored OS.
- If R_1 = … = R_{m−1} = 0, then R_m = n − m, which corresponds to Type II censoring.
- If m = n, then R_1 = … = R_m = 0, which corresponds to the usual order statistics.
Thus, the usual OS and Type II censoring become special cases of progressively Type II censored samples, so any result established for progressive Type II censoring is a generalization of the corresponding result for OS and for Type II censoring.
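The sampling mechanism in the steps above can be sketched directly (an illustrative simulation of ours; the scheme (2, 0, 1, 0, 2) and seed are arbitrary): simulate n lifetimes and, at the ith observed failure, withdraw R_i of the surviving units at random.

```python
import random

def progressive_type2_sample(lifetimes, scheme):
    """Observed failure times under progressive Type II censoring.

    lifetimes: lifetimes of the n units on test
    scheme:    removal numbers (R_1, ..., R_m) with n = m + sum(scheme)
    """
    alive = sorted(lifetimes)          # pending failure times of surviving units
    observed = []
    for R in scheme:
        x = alive.pop(0)               # next failure among the survivors
        observed.append(x)
        for _ in range(R):             # withdraw R surviving units at random
            alive.pop(random.randrange(len(alive)))
    return observed

random.seed(1)
n, scheme = 10, (2, 0, 1, 0, 2)        # m = 5 failures observed, 5 units withdrawn
data = [random.expovariate(1.0) for _ in range(n)]
obs = progressive_type2_sample(data, scheme)
print(len(obs), obs == sorted(obs))    # 5 True
```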
The likelihood function may be written as [8]

f(x_1, …, x_m) = c ∏_{i=1}^{m} f(x_i)[1 − F(x_i)]^{R_i},

where c = n(n − R_1 − 1)(n − R_1 − R_2 − 2) ⋯ (n − R_1 − ⋯ − R_{m−1} − m + 1). The joint entropy contained in (X_{1:m:n}, …, X_{i:m:n}), i.e., a collection of the first i progressively Type II censored OS, is defined to be

H(X_{1:m:n}, …, X_{i:m:n}) = −∫ ⋯ ∫ f_{1…i}(x_1, …, x_i) log f_{1…i}(x_1, …, x_i) dx_1 ⋯ dx_i,
where f_{1…i}(x_1, …, x_i) is the density function of (X_{1:m:n}, …, X_{i:m:n}). To our knowledge, Balakrishnan et al. [9] generalized the result of Park [3] on testing exponentiality based on the Kullback-Leibler information with Type II censored data to progressively Type II censored data, and obtained an approximation to the joint entropy of progressively Type II censored samples based on nonparametric estimation. Hence, the exact values of the joint entropy of progressively Type II censored samples have not been obtained. Several applications of entropy, such as characterization, goodness-of-fit tests based on censored data, parameter estimation, and quantization theory, are known; see, for example, [3,9].
In the case of progressive Type II censoring, difficulties arise from the removals as well as from the expression of the joint density, which involves integration over i random variables, so simplifying the calculation of the joint entropy is attractive. In this article we focus on the properties of the joint entropy of progressively Type II censored OS. In Section 2 we develop the idea of Park [2] on the decomposition of the entropy of OS to introduce an indirect approach for decomposing the entropy of progressively Type II censored OS. In Section 3 we derive recurrence relations for the entropy of progressively Type II censored samples, which prove helpful in calculating the entropy. In Section 4 we derive an efficient computational method that reduces the r-dimensional integrals in the calculation to a sum of entropies of smallest OS of varying sample sizes, so that no multidimensional integration is required. In Section 5 we apply our results to compute the entropy of collections of progressively Type II censored samples from the normal and logistic distributions.
2. Decomposition of the Joint Entropy
Park [2] and Wong and Chen [4] have shown that the total entropy of an i.i.d. random sample of size n is decreased if the sample is ordered. Park [2] showed how much the entropy of an i.i.d. random sample of size n is decreased by ordering, through the following identity for the entropy of the ordered data
In view of Equation (4), and noting that a progressively Type II censored sample can be seen as an ordered sample
with the removals (R_1, …, R_m), we have the following result for the entropy of the progressively Type II censored OS sample.
Lemma 2.1.
where , and .
Since the progressively Type II censored samples form a Markov chain [8], we have the following results.
Lemma 2.2.
Next we show the following decomposition of the entropy of progressively Type II censored OS.
Lemma 2.3.
PROOF. The result follows from the additive property of the entropy measure and Lemma 2.1.
We see from Equations (5) and (6) that the entropy of r progressively censored data can be obtained from the conditional entropy terms, so we consider the study of these terms. Let X_{1:m:n}, …, X_{m:m:n} be a progressively Type II censored sample with censoring scheme (R_1, …, R_m). The entropy in a collection of the first i progressively Type II censored OS is defined by Equation (3), and can be written as
where f_{1…i} is the joint pdf of the first i order statistics of the progressively Type II censored sample.
Using the Markov chain property of the order statistics from progressive Type II censored samples, we have the following decomposition for the score function:
where is the pdf of given . The following decomposition follows from the strong additivity of the entropy
where is the average of the conditional information in given .
On the other hand, in view of the result of Balakrishnan and Aggarwala [8], the conditional density is the joint density of a progressively Type II censored sample of size n − i − R_1 − … − R_i, with censoring scheme (R_{i+1}, …, R_m), drawn from the parent distribution truncated from the left at x_i, with density f(x)/(1 − F(x_i)). Therefore it can be written as the double integral
where
and is defined by
where
and
Since we already know the entropy of the complete sample, the entropy can now be easily derived from Equations (6) and (8).
EXAMPLE 2.1. For the exponential density f(x) = λe^{−λx}, x > 0, the memoryless property implies that the left-truncated distribution is again exponential, so that
Thus in that case
where H(f) = 1 − log λ is the entropy of a single observation from the exponential density f(x) = λe^{−λx}.
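These exponential facts are easy to check numerically (our verification sketch, not from the paper): the entropy of a single Exp(λ) observation is 1 − log λ, and the minimum of n such lifetimes is again Exp(nλ).

```python
import math

def entropy_exponential(lam):
    """Closed-form differential entropy of Exp(lam): H = 1 - log(lam)."""
    return 1.0 - math.log(lam)

def entropy_numeric(lam, hi=100.0, pts=400_000):
    """Trapezoidal approximation of -∫ f log f dx for f(x) = lam*exp(-lam*x)."""
    h = hi / pts
    total = 0.0
    for i in range(pts + 1):
        x = i * h
        f = lam * math.exp(-lam * x)
        g = -f * math.log(f) if f > 1e-300 else 0.0
        total += g / 2 if i in (0, pts) else g
    return total * h

lam, n = 2.0, 5
print(round(entropy_numeric(lam), 4), round(entropy_exponential(lam), 4))
# the minimum of n iid Exp(lam) variables is Exp(n*lam):
print(round(entropy_exponential(n * lam), 4))  # 1 - log(10) ≈ -1.3026
```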
REMARK 2.1. We note that all of Park’s results concerning the entropy of the minimum order statistic work for the case of progressively Type II censored samples, since the first progressively Type II censored order statistic coincides with the usual sample minimum, X_{1:m:n} = X_{1:n}.
3. Recurrence Relations
Recurrence relations between the cdf (pdf) of OS and progressive Type II censored OS have been studied by many authors for the purpose of simplifying the calculation of moments of OS and progressive Type II censored OS.
The standard recurrence relation for the moments of OS was obtained by Cole [11], and can be written as
where μ_{r:n} denotes the moment of the usual OS X_{r:n}.
This result can be derived directly from the corresponding recurrence relation between the cdfs of OS. Kamps and Cramér (Lemma 4 of [12]) obtained the corresponding recurrence relation for generalized OS as
Since the generalized OS includes the progressive Type II censored OS, it is clear that the case of progressive Type II censoring is subsumed in the above result. By setting for , for and , we have
Using Equation (14) and the decomposition of the entropy in Equation (8) we have the following results for the entropy in the progressive censoring scheme.
RELATION 3.1
where and .
PROOF. From Equation (8) we have
On the other hand, Equation (14) yields
Combining Equations (16) and (17) and simplifying, the relation follows.
The following relation shows that the entropy of the first r progressively Type II censored OS from a sample of size n + 1 can be obtained as a linear combination of entropies of collections of progressively Type II censored OS from a sample of size n.
RELATION 3.2
where
PROOF. For a sample of size n + 1, the general decomposition of the entropy under progressive Type II censoring takes the form
4. Computational Method for Calculating
In this section we provide another approach to simplify the calculation of the entropy in a collection of progressively Type II censored OS. We reduce the r-dimensional integral in the calculation to a sum of entropies of smallest OS of varying sample sizes, so that no multidimensional integration is required.
Lemma 4.1. Let X_1, …, X_n be an i.i.d. random sample of size n from pdf f(x) with cdf F(x) and hazard function h(x) = f(x)/(1 − F(x)), and let X_{1:n} ≤ … ≤ X_{n:n} be the OS corresponding to this sample. Park [2] obtained the entropy of the smallest order statistic as

H(X_{1:n}) = 1 − log n − E[log h(X_{1:n})].
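Since X_{1:n} has the one-dimensional density f_{1:n}(x) = n(1 − F(x))^{n−1} f(x), its entropy needs only a single quadrature. A sketch of ours (integration limits and tolerances are our choices), checked against the exponential case, where X_{1:n} ~ Exp(n) and hence H(X_{1:n}) = 1 − log n:

```python
import math

def entropy_min_os(pdf, cdf, lo, hi, n, pts=200_000):
    """Entropy of X_{1:n} via its density f_{1:n}(x) = n*(1-F(x))**(n-1)*f(x)."""
    h = (hi - lo) / pts
    total = 0.0
    for i in range(pts + 1):
        x = lo + i * h
        f1 = n * (1.0 - cdf(x)) ** (n - 1) * pdf(x)
        g = -f1 * math.log(f1) if f1 > 1e-300 else 0.0
        total += g / 2 if i in (0, pts) else g
    return total * h

# check against the exponential case: X_{1:n} ~ Exp(n), so H = 1 - log n
n = 5
h_num = entropy_min_os(lambda x: math.exp(-x),
                       lambda x: 1.0 - math.exp(-x), 0.0, 20.0, n)
print(round(h_num, 4), round(1 - math.log(n), 4))  # both -0.6094
```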
Theorem 4.1. Let X_{1:m:n}, …, X_{m:m:n} be a progressively Type II censored sample with censoring scheme (R_1, …, R_m). The entropy in the collection of the first r progressively Type II censored OS can be written as
where , , and
in which empty products are defined as 1.
PROOF. By the Markov chain properties of progressively Type II censored samples, one can write
where the kernel is the conditional pdf of X_{i:m:n} given X_{i−1:m:n}, which is also the density of the first order statistic of a sample of reduced size drawn from the truncated density. Therefore, we have
where the summand is the expected entropy of X_{i:m:n} given X_{i−1:m:n}, i.e.,
By Lemma 4.1, and noting that, conditional on X_{i−1:m:n}, X_{i:m:n} has the same pdf as the first order statistic from a random sample of reduced size with the truncated pdf, Equation (27) can be written as
where
By interchanging the order of integration and simplifying, we have
Therefore, Equation (31) can be written as
Thus, by using Equations (26) and (31), the entropy can be expressed as a summation of single integrals as
where is defined above. From Theorem 1 in Balakrishnan et al. [13], we have the following relation for
where the coefficients are as defined above.
We reexpress Equation (33) as
where the term involved is the entropy of the usual smallest order statistic in a sample of the indicated size. Using Equations (23) and (34) in Equation (32), the result follows.
We have written a program in the algebraic manipulation package MATHEMATICA [14] for computing the quantities in Theorem 4.1 and Lemma 4.1 above. For a predetermined progressively Type II censoring scheme, the program returns the numerical values of the entropy. The electronic version of the computer program can be obtained by contacting the corresponding author.
REMARK 4.1. The entropy of the smallest usual order statistic is known for well-known distributions; see, for example, Park [2] and Asadi et al. [15].
5. Illustrative Examples
The entropy of the smallest OS has the following expression [2].
EXAMPLE 5.1. For the normal distribution, the entropy of the smallest OS takes the form
where E[X_{1:n}^2] is the second moment of the smallest order statistic of the standard normal distribution; see Park [2]. We use Theorem 4.1 and Equation (36) to calculate the entropies given in Table 1.
Discussion
Table 1 provides the values of the entropy for n = 5, 10 and for different censoring schemes and values of r. The entries were computed using Theorem 4.1, Equation (36), and MATHEMATICA [14]. For r < m, the table gives the values of the entropy in a collection of the first r OS from a progressively Type II censored sample. For r = m, the table gives the entropy of the complete progressively Type II censored sample. The table includes the case R = (0, …, 0, n − m), which corresponds to the Type II censored sample, and the case m = n, which corresponds to the complete sample.
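The r = 1 entries of Table 1 involve only the smallest order statistic, so they can be cross-checked by one-dimensional quadrature of f_{1:n}(x) = n(1 − Φ(x))^{n−1} φ(x) (our verification sketch; the integration limits are our choice):

```python
import math

def norm_pdf(x):
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)

def norm_cdf(x):
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def entropy_min_normal(n, lo=-12.0, hi=8.0, pts=200_000):
    """Entropy of the minimum of n standard normals by the trapezoidal rule."""
    h = (hi - lo) / pts
    total = 0.0
    for i in range(pts + 1):
        x = lo + i * h
        f1 = n * (1.0 - norm_cdf(x)) ** (n - 1) * norm_pdf(x)
        g = -f1 * math.log(f1) if f1 > 1e-300 else 0.0
        total += g / 2 if i in (0, pts) else g
    return total * h

print(f"{entropy_min_normal(5):.4f}")   # ≈ 1.0096 (Table 1, n = 5, r = 1)
print(f"{entropy_min_normal(10):.4f}")  # ≈ 0.8724 (Table 1, n = 10, r = 1)
```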
EXAMPLE 5.2. For the logistic distribution, the entropy of the smallest OS takes the form
where B(·, ·) is the beta function and ψ(·) is the digamma function; see Asadi et al. [15]. We use Theorem 4.1 and Equation (37) to calculate the entropies given in Table 2.
Table 2 provides the values of the entropy for n = 5, 10 and for different censoring schemes and values of r. The entries were computed using Theorem 4.1, Equation (37), and MATHEMATICA [14]. For r < m, the table gives the values of the entropy in a collection of the first r OS from a progressively Type II censored sample. For r = m, the table gives the entropy of the complete progressively Type II censored sample.
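As with the normal case, the r = 1 entries of Table 2 can be verified independently. For the standard logistic, X_{1:n} has density n e^{−nx}/(1 + e^{−x})^{n+1}, and a short calculation of ours using F(X_{1:n}) ~ Beta(1, n) gives the closed form H(X_{1:n}) = −log n + n(ψ(1) − ψ(n)) + (n + 1)(ψ(n + 1) − ψ(1)):

```python
import math

EULER_GAMMA = 0.5772156649015329

def digamma_int(k):
    """psi(k) for integer k >= 1, using psi(k) = -gamma + H_{k-1}."""
    return -EULER_GAMMA + sum(1.0 / j for j in range(1, k))

def entropy_min_logistic(n):
    """Closed-form entropy of the minimum of n standard logistic variables."""
    p1, pn, pn1 = digamma_int(1), digamma_int(n), digamma_int(n + 1)
    return -math.log(n) + n * (p1 - pn) + (n + 1) * (pn1 - p1)

print(f"{entropy_min_logistic(5):.5f}")   # 1.67390 (Table 2, n = 5, r = 1)
print(f"{entropy_min_logistic(10):.5f}")  # 1.62638 (Table 2, n = 10, r = 1)
```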
Table 1.
The entropy in a collection of order statistics from a progressively Type II censored sample from a normal distribution with unit standard deviation.

| n | m | Censoring scheme | r | Entropy |
|---|---|---|---|---|
| 5 | 3 | (2,0,0) | 1 | 1.0096 |
| 5 | 3 | (2,0,0) | 2 | 2.11423 |
| 5 | 3 | (2,0,0) | 3 | 3.40163 |
| 5 | 3 | (0,0,2) | 1 | 1.0096 |
| 5 | 3 | (0,0,2) | 2 | 1.87510 |
| 5 | 3 | (0,0,2) | 3 | 3.18129 |
| 5 | 3 | (1,1,0) | 1 | 1.0096 |
| 5 | 3 | (1,1,0) | 2 | 1.97448 |
| 5 | 3 | (1,1,0) | 3 | 3.27328 |
| 5 | 5 | (0,0,0,0,0) | 1 | 1.0096 |
| 5 | 5 | (0,0,0,0,0) | 2 | 1.8751 |
| 5 | 5 | (0,0,0,0,0) | 3 | 2.76331 |
| 5 | 5 | (0,0,0,0,0) | 5 | 5.01551 |
| 10 | 5 | (0,0,0,0,5) | 1 | 0.872403 |
| 10 | 5 | (0,0,0,0,5) | 2 | 1.52808 |
| 10 | 5 | (0,0,0,0,5) | 3 | 2.12015 |
| 10 | 5 | (0,0,0,0,5) | 4 | 2.69953 |
| 10 | 5 | (0,0,0,0,5) | 5 | 3.29964 |
| 10 | 5 | (5,0,0,0,0) | 1 | 0.872403 |
| 10 | 5 | (5,0,0,0,0) | 2 | 1.78936 |
| 10 | 5 | (5,0,0,0,0) | 3 | 2.69933 |
| 10 | 5 | (5,0,0,0,0) | 4 | 3.70963 |
| 10 | 5 | (5,0,0,0,0) | 5 | 4.96641 |
| 10 | 5 | (3,2,0,0,0) | 1 | 0.872403 |
| 10 | 5 | (3,2,0,0,0) | 2 | 1.65948 |
| 10 | 5 | (3,2,0,0,0) | 3 | 2.58920 |
| 10 | 5 | (3,2,0,0,0) | 4 | 3.60857 |
| 10 | 5 | (3,2,0,0,0) | 5 | 4.86899 |
| 10 | 10 | (0,0,0,0,0,0,0,0,0,0) | 1 | 0.872403 |
| 10 | 10 | (0,0,0,0,0,0,0,0,0,0) | 2 | 1.52808 |
| 10 | 10 | (0,0,0,0,0,0,0,0,0,0) | 3 | 2.12015 |
| 10 | 10 | (0,0,0,0,0,0,0,0,0,0) | 4 | 2.69953 |
| 10 | 10 | (0,0,0,0,0,0,0,0,0,0) | 5 | 3.29974 |
| 10 | 10 | (0,0,0,0,0,0,0,0,0,0) | 6 | 3.95362 |
| 10 | 10 | (0,0,0,0,0,0,0,0,0,0) | 7 | 4.57641 |
| 10 | 10 | (0,0,0,0,0,0,0,0,0,0) | 8 | 5.50748 |
| 10 | 10 | (0,0,0,0,0,0,0,0,0,0) | 10 | 7.62962 |
Table 2.
The entropy in a collection of order statistics from a progressively Type II censored sample from the logistic distribution.

| n | m | Censoring scheme | r | Entropy |
|---|---|---|---|---|
| 5 | 3 | (2,0,0) | 1 | 1.67390 |
| 5 | 3 | (2,0,0) | 2 | 3.30409 |
| 5 | 3 | (2,0,0) | 3 | 5.18643 |
| 5 | 3 | (0,0,2) | 1 | 1.67390 |
| 5 | 3 | (0,0,2) | 2 | 3.07589 |
| 5 | 3 | (0,0,2) | 3 | 4.95024 |
| 5 | 3 | (1,1,0) | 1 | 1.6739 |
| 5 | 3 | (1,1,0) | 2 | 3.16708 |
| 5 | 3 | (1,1,0) | 3 | 5.04210 |
| 5 | 5 | (0,0,0,0,0) | 1 | 1.67390 |
| 5 | 5 | (0,0,0,0,0) | 2 | 3.07587 |
| 5 | 5 | (0,0,0,0,0) | 3 | 4.46788 |
| 5 | 5 | (0,0,0,0,0) | 5 | 7.9208 |
| 10 | 5 | (0,0,0,0,5) | 1 | 1.62638 |
| 10 | 5 | (0,0,0,0,5) | 2 | 2.89455 |
| 10 | 5 | (0,0,0,0,5) | 3 | 4.03011 |
| 10 | 5 | (0,0,0,0,5) | 4 | 5.11611 |
| 10 | 5 | (0,0,0,0,5) | 5 | 6.20260 |
| 10 | 5 | (5,0,0,0,0) | 1 | 1.62638 |
| 10 | 5 | (5,0,0,0,0) | 2 | 3.10523 |
| 10 | 5 | (5,0,0,0,0) | 3 | 4.52211 |
| 10 | 5 | (5,0,0,0,0) | 4 | 6.05999 |
| 10 | 5 | (5,0,0,0,0) | 5 | 7.96778 |
| 10 | 5 | (3,2,0,0,0) | 1 | 1.62638 |
| 10 | 5 | (3,2,0,0,0) | 2 | 2.99964 |
| 10 | 5 | (3,2,0,0,0) | 3 | 4.43974 |
| 10 | 5 | (3,2,0,0,0) | 4 | 5.97958 |
| 10 | 5 | (3,2,0,0,0) | 5 | 7.88009 |
| 10 | 10 | (0,0,0,0,0,0,0,0,0,0) | 1 | 1.62638 |
| 10 | 10 | (0,0,0,0,0,0,0,0,0,0) | 2 | 2.89455 |
| 10 | 10 | (0,0,0,0,0,0,0,0,0,0) | 3 | 4.03011 |
| 10 | 10 | (0,0,0,0,0,0,0,0,0,0) | 4 | 5.11611 |
| 10 | 10 | (0,0,0,0,0,0,0,0,0,0) | 5 | 6.20260 |
| 10 | 10 | (0,0,0,0,0,0,0,0,0,0) | 6 | 7.33016 |
| 10 | 10 | (0,0,0,0,0,0,0,0,0,0) | 7 | 8.54041 |
| 10 | 10 | (0,0,0,0,0,0,0,0,0,0) | 8 | 9.88642 |
| 10 | 10 | (0,0,0,0,0,0,0,0,0,0) | 10 | 13.44020 |
Acknowledgements
The author would like to express deep thanks to the Editor-in-Chief and the referees for their helpful comments and suggestions which led to a considerable improvement in the presentation of this paper.
References
- Cover, T.M.; Thomas, J.A. Elements of Information Theory; Wiley: Hoboken, NJ, USA, 2005. [Google Scholar]
- Park, S. The entropy of consecutive order statistics. IEEE Trans. Inform. Theor. 1995, 41, 2003–2007. [Google Scholar] [CrossRef]
- Park, S. Testing exponentiality based on the Kullback-Leibler information with the type II censored data. IEEE Trans. Reliab. 2005, 54, 22–26. [Google Scholar] [CrossRef]
- Wong, K.M.; Chen, S. The entropy of ordered sequences and order statistics. IEEE Trans. Inform. Theor. 1990, 36, 276–284. [Google Scholar] [CrossRef]
- Ebrahimi, N.; Soofi, E.S.; Zahedi, H. Information properties of order statistics and spacings. IEEE Trans. Inform. Theor. 2004, 50, 177–183. [Google Scholar] [CrossRef]
- Nelson, W. Applied Life Data Analysis; John Wiley and Sons: New York, NY, USA, 1982. [Google Scholar]
- Balakrishnan, N. Progressive censoring methodology: An appraisal (with discussions). Test 2007, 16, 211–259. [Google Scholar] [CrossRef]
- Balakrishnan, N.; Aggarwala, R. Progressive Censoring: Theory, Methods, and Applications; Birkhauser: Boston, MA, USA, 2000; p. 15. [Google Scholar]
- Balakrishnan, N.; Habibi Rad, A.; Arghami, N.R. Testing exponentiality based on Kullback-Leibler information with progressively Type-II censored data. IEEE Trans. Reliab. 2007, 56, 301–307. [Google Scholar] [CrossRef]
- Csiszár, I. Information Theory; Academic Press: London, UK, 1981; p. 170. [Google Scholar]
- Cole, R.H. Relations between moments of order statistics. Ann. Math. Stat. 1951, 22, 308–310. [Google Scholar] [CrossRef]
- Kamps, U.; Cramer, E. On distributions of generalized order statistics. Statistics 2000, 35, 269–280. [Google Scholar] [CrossRef]
- Balakrishnan, N.; Childs, A.; Chandrasekar, B. An efficient computational method for moments of order statistics under progressive Type-II censoring. Stat. Probab. Lett. 2002, 60, 359–365. [Google Scholar] [CrossRef]
- Mathematica; Version 7.0; Wolfram Research, Inc.: Champaign, IL, USA, 2008.
- Asadi, M.; Ebrahimi, N.; Hamedani, G.G.; Soofi, E. Information measures of Pareto distributions and order statistics. In Advances on Distribution Theory, Order Statistics and Inference; Balakrishnan, N., Castillo, E., Sarabia, J.M., Eds.; Birkhauser: Boston, MA, USA, 2006. [Google Scholar]
© 2011 by the author; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).