Multistage Estimation of the Scale Parameter of Rayleigh Distribution with Simulation

Yousef, Ali; Hassan, Emad E. H.; Amin, Ayman A.; Hamdy, Hosny I.

doi:10.3390/sym12111925

Open AccessArticle

Multistage Estimation of the Scale Parameter of Rayleigh Distribution with Simulation

¹

Department of Mathematics, Faculty of Engineering, Kuwait College of Science and Technology, Doha District POBox 27235, Kuwait

²

Faculty of Management Sciences, October University for Modern Sciences and Arts, 6th October City, 12566 Cairo, Egypt

³

Faculty of Commerce, Menoufia University, Gamal Abd El-Nasir, Qism Shebeen El-Kom, Shibin el Kom, Menofia Governorate, Menoufia, Egypt

^*

Author to whom correspondence should be addressed.

Symmetry 2020, 12(11), 1925; https://doi.org/10.3390/sym12111925

Submission received: 3 November 2020 / Revised: 18 November 2020 / Accepted: 20 November 2020 / Published: 22 November 2020

(This article belongs to the Special Issue Skewed (Asymmetrical) Probability Distributions and Applications across Disciplines)

Download Versions Notes

Abstract

:

This paper discusses the sequential estimation of the scale parameter of the Rayleigh distribution using the three-stage sequential sampling procedure proposed by Hall (Ann. Stat. 1981, 9, 1229–1238). Both point and confidence interval estimation are considered via a unified optimal decision framework, which enables one to make the maximum use of the available data and, at the same time, reduces the number of sampling operations by using bulk samples. The asymptotic characteristics of the proposed sampling procedure are fully discussed for both point and confidence interval estimation. Since the results are asymptotic, Monte Carlo simulation studies are conducted to provide the feel of small, moderate, and large sample size performance in typical situations using the Microsoft Developer Studio software. The procedure enjoys several interesting asymptotic characteristics illustrated by the asymptotic results and supported by simulation.

Keywords:

asymptotic regret; coverage probability; loss function; three-stage procedure

1. Introduction

Let

X_{1}

,

X_{2}, X_{3}, \dots

be independent and identically distributed random variables following a Rayleigh distribution with unknown scale parameter

σ

of the form:

f (x; σ) = \frac{x}{σ^{2}} e^{- x^{2} / 2 σ^{2}}, x > 0 and σ > 0 .

The survival or reliability function is

e^{- x^{2} / 2 σ^{2}}

and the hazard function is

x / σ^{2}

for all

x > 0

and

σ > 0

. An important characteristic of the Rayleigh distribution is that its failure rate is a linear function of time. The reliability function decreases at a much higher rate than the exponential distribution’s reliability function, whose hazard rate is constant (see Kodlin [1]). This distribution relates to several distributions, such as generalized extreme value, Weibull, and Chi-square, and hence its applicability in real-life situations is significant. This is a particular case of a two-parameter Weibull distribution with a shape parameter equal to 2 and scale parameter

σ \sqrt{2}

.

Rayleigh distribution was introduced by Lord Rayleigh [2] and plays an important role in various research areas, such as acoustics, communication engineering, clinical studies, applied statistics, life-testing experiments, reliability analysis, and survival analysis. For instance, Palovko [3] discussed its application in life testing, especially in electro-vacuum devices. Gross and Clark [4] and Lee and Wang [5] discussed its usage in clinical studies dealing with cancer patients. Dyer and Whisenand [6] used it in communication engineering. Siddiqui [7] discussed its usage in electromagnetic wave propagation through a scattering medium. Others, such as Siddiqui [7], Hirano [8], and Howlader and Hossian [9], discussed several aspects of the Rayleigh distribution. It was shown from [10,11,12,13] that both Rayleigh and Weibull distributions are suitable probability distributions for evaluating wind energy potentials. They provide the most accurate and adequate wind, analyzing and interpreting the actual wind speed data and predicting the prevailing wind profile. Several authors have contributed to this model, such as Sinha and Howlader [14], Ariyawansa and Templeton [15], Howlader [16], Lalitha and Mishra [17], and Abd Elfattah et al. [18].

Regarding estimation, [19,20,21] have carried out extensive studies concerning the estimation, prediction, and several other inferences concerning the Rayleigh distribution. From [22], the

r^{t h}

moment around the origin is:

E (X^{r}) = σ^{r} 2^{\frac{r}{2}} Γ (\frac{r}{2} + 1) .

Substituting for

r = 1

and

r = 2

, yield that the population mean is

E (X) = \sqrt{π / 2} σ

and the population variance

V a r (X) = \frac{(4 - π)}{2} σ^{2}

. The mode equals

σ

, and the median is

σ \sqrt{2 \ln (2)}

. All the moments are unknown but finite. Moreover, the differential entropy of the random variable

X

, Shannon [23] entropy, measures the amount of uncertainty or missing information and is defined by means of its underlying distribution

f (x)

as

h (X) = - \int_{S}^{} f (x) \log f (x) d x,

where

S

is the support of

f (x)

. The entropy of the Rayleigh distribution is defined as:

h (σ) = 1 + \log (σ / \sqrt{2}) + γ / 2,

where

γ \approx 0.5772

is the Euler’s constant, which will be of interest in this research. Assume a preliminary random sample

X_{1}, X_{2}, \dots, X_{n}

of size;

n

becomes available, from which we calculate the sample mean

{\bar{X}}_{n} = \sum_{1}^{n} X_{i} / n

for

n \geq 1

and propose the estimate

{\bar{T}}_{n} = \sqrt{2 / π} {\bar{X}}_{n}

as an unbiased point estimate of the unknown parameter

σ

. For the convenience of calculation, we continue to use:

E ({\bar{T}}_{n}) = σ, E {({\bar{T}}_{n} - σ)}^{2} = \frac{(4 - π) σ^{2}}{π n}, E {({\bar{T}}_{n} - σ)}^{3} = \frac{2 (π - 3) σ^{3}}{π n^{2}}, and E {({\bar{T}}_{n} - σ)}^{4} = \frac{(32 - 3 π^{2}) σ^{4}}{π^{2} n^{3}} .

This paper aims to estimate sequentially the scale parameter 𝜎 of the Rayleigh distribution using a multistage sampling procedure, the three-stage procedure, that was presented by Hall [24]. For more details regarding the three-stage procedure and its properties, see Section 3. We tackle two types of estimation problems—point estimation under a squared-error loss function plus linear sampling cost and confidence interval estimation. In the following section, we set up the estimation problems.

2. Estimation Problems

2.1. Minimum Risk Point Estimation

To obtain a point estimate for

σ

, we assume that the cost incurred in estimating the scale parameter

σ

by the corresponding sample measure

{\bar{T}}_{n}

is given by the following squared-error loss function with a linear sampling cost, given by:

L_{n} (A) = A^{2} {({\bar{T}}_{n} - σ)}^{2} + n,

(1)

where

A^{2}

represents the cost per estimation unit and

A > 0

will be determined shortly. The cost function in (1) is similar to those considered by Degroot [25], Chow and Yu [26], Martinsek [27], and Hamdy [28]. The risk associated with the cost function in (1) is given by,

R_{n} (A) = E (L_{n} (A)) = A^{2} σ^{2} \frac{(4 - π)}{π n} + n .

(2)

Minimizing the risk in (2) concerning the sample size

n

yields the optimal sample size:

n \geq n_{p o i n t}^{*} = A σ \sqrt{\frac{(4 - π)}{π}},

(3)

where

n_{p o i n t}^{*} \to \infty, a s A \to \infty .

We elaborate more on the physical entity of

A

in the following subsections. The optimal sample size in (3) is unknown because

σ

is unknown. Therefore, we resort to multistage sampling procedures, developed over the last 50 years, to estimate the unknown scale parameter

σ

via the estimation of

n^{*} .

2.2. Fixed-Width Confidence Interval

Assume further that a fixed

2 d

width confidence interval for

σ

of the form

I_{n} = ({\bar{T}}_{n} - d \leq σ \leq {\bar{T}}_{n} + d)

is required, such that its coverage probability is at least a 100

(1 - α) %

uniformly over

σ > 0

.

For large

n

, the central limit theorem justifies that the quantity

Q = \frac{\sqrt{π n} ({\bar{T}}_{n} - σ)}{σ \sqrt{4 - π}}

follows the standard normal distribution, defined by cumulative distribution function (CDF)

Φ (.)

.

P (| Q | \leq \frac{d \sqrt{n π}}{σ \sqrt{(4 - π)}}) \geq (1 - α) = 2 Φ (a) - 1,

where

a

is the upper

α / 2

percentage, the cutoff point of the standard normal distribution.

It follows that the optimal sample size required to achieve the above objectives must satisfy:

n \geq n_{c o n f}^{*} = \frac{a^{2} σ^{2}}{d^{2}} \frac{(4 - π)}{π} .

(4)

2.3. A Unified Decision Framework

Since sequential sampling is utilized to perform inference, authors usually specify one decision rule for each research objective. It could be a point estimation with a specified cost function or a fixed-width confidence interval whose coverage probability is at least the nominal value, or testing hypotheses regarding the population parameters. If the interest is in defining one decision rule to achieve more than one objective, and at the same time to make the maximum use of the available data, we have to have

n_{p o i n t}^{*} = n_{c o n f}^{*},

which implies that:

A = \frac{a^{2} σ}{d^{2}} \sqrt{\frac{(4 - π)}{π}} = k^{2} σ \sqrt{\frac{π}{(4 - π)}},

(5)

where

k^{2} = \frac{a^{2}}{d^{2}} \frac{(4 - π)}{π}

. As

A \to \infty

k \to \infty

. Therefore, the constant

A

is chosen as in (5) to perform inference through a unified framework. In fact, in sequential point estimation problems where cost functions are assumed to assess the encountered risk, the constant

A

is assumed to be known and is permitted to go to infinity to check if the estimation risk is still finite and bounded. However, by knowing

A

we restrict the sampling population. In other words, by doing so we assume that

σ

is not entirely unknown. The constant

A

in (5) is partially known because it depends on the unknown parameter

σ .

The constant

A \to \infty

, as the width of the interval

d \to

0, is a common practice in the sequential estimation when we study the asymptotic characteristics of the fixed-width interval. The constant

A^{2}

can be thought of as

A^{2} = \frac{a^{2}}{d^{2}} \times n_{p o i n t}^{*}

, where:

A^{2} = F i s h e r ’ s i n f o r m a t i o n \times o p t i m a l s a m p l i n g c o s t .

Meanwhile, we continue to use the representation

n^{*} = k^{2} σ^{2}

to define the three-stage stopping rules in the following subsections. Therefore, we proceed to use the following optimal sample size,

n \geq n^{*} = k^{2} σ^{2},

(6)

to perform the necessary inference. The parameter

σ^{2}

in (6) is unknown, then no fixed sample size procedure can estimate the scale parameter uniformly over the parameter space; see Dantzig [29]. Therefore, we resort to a three-stage sampling procedure to achieve the required objectives.

Henceforth, we continue to use the asymptotic sample size defined in (6) to propose the following three-stage sampling procedure to estimate the unknown scale parameter

σ

via the estimation of

n^{*}

.

3. Multistage Sampling

Multistage sequential sampling procedures have been developed over the past few decades to achieve several popular characteristics lacking in classical inference theory. This goes back to Abraham Wald in 1947, who introduced the idea of one-by-one sequential sampling through the sequential probability ratio test (SPRT) to minimize the cost of inspection and transportation. Since the publication of the one-by-one sequential sampling procedure, attention was mainly directed to multistage sampling under optimal decision frames. The aim is to achieve several optimal objectives, including minimizing the risk associated with point estimation, maintaining the coverage probability of at least the desired nominal value, or controlling the type I and type II error probabilities. This was not the case in classical inference.

Multistage came out to motivate researchers to perform inference through different sampling techniques. Stein [30,31] created the foundation of two-stage sampling, also referred to as double sampling, which led to an exact solution for a fundamental statistical inference problem. Additionally, Seelbinder [32] and Cox [33] introduced the idea of group sampling in two stages. Although the procedure enjoys many asymptotic requirements, it still suffers from a lack of asymptotic efficiency. The procedure could lead to oversampling, mostly when the initial sample chosen is much smaller than the optimal sample size. Anscombe [34], Robbins [35], Chow, and Robbins [36] devised purely one-by-one sequential sampling procedures to perform inference subjected to some optimality criteria. The one-by-one sequential sampling procedure surpasses two-stage sampling in achieving all asymptotic characteristics. However, practically it is inefficient since it takes quite some time to terminate the sampling course.

Hall [24], in his sophisticated influential work, introduced the idea of sampling in three stages to overcome all the deficiencies portrayed in both two-stage and purely one-by-one sequential sampling. By doing so, he combined both the asymptotic characteristics of the purely one-by-one sequential sampling of Anscombe [34], Robbins [35], and Chow and Robbins [36] and the operational saving made possible by Stein [30] and Cox [33] bulk sampling.

Hall’s results were emphasized specifically to a fixed-width confidence interval for the normal mean. Other successful attempts were made for non-normal distributions. Since the publication of Hall’s paper, research in multistage sampling has moved in several directions. Some have utilized the three-stage sampling technique to perform point estimation for the normal mean under different cost functions or generate inference for other distributions. Others have tried to improve the inference quality by protecting the inference against type II error probability, studying the characteristic operating curve, or/and discussing three-stage sampling’s sensitivity when the underlying distribution departs from normality. For more details, see Mukhopadhyay [37,38,39], Mukhopadhyay et al. [40], Mukhopadhyay and Mauromoustakos [41], Hamdy and Palotta [42], Hamdy et al. [43], Hamdy [28], Hamdy et al. [44], Mukhopadhyay and Padmanabhan [45], Takada [46], Hamdy [47], Al-Mahmeed and Hamdy [48], AlMahmeed et al. [49], Costanzo et al. [50], Yousef et al. [51], Yousef [52], Hamdy et al. [53], Yousef [54], Yousef and Hamdy [55,56], and Yousef [57].

The extension of Hall’s results to tackle hypothesis testing problems of the normal mean was developed by Liu [58]. At the same time, Son et al. [59] proposed a three-stage sampling sequential procedure that yields both a fixed-width confidence interval and a hypothesis test for the normal while controlling the type II error probability. Their procedure also provided second-order approximations to the operating characteristic curves of the inference.

Tahir [60] addressed a sequential procedure to tackle a point estimation problem for the Rayleigh distribution parameter square, subject to a weighted squared-error loss plus cost of sampling. He found a second-order asymptotic expansion for the incurred regret and found that the asymptotic regret is negative for a range of parameter values.

The main objective of this paper is the estimation of the unknown scale parameter

σ .

We tackle two estimation problems—point estimation under a squared-error loss function with linear sampling cost and confidence interval, where we find a fixed-width confidence interval with a coverage probability of at least

100 (1 - α) %

. We use the three-stage procedure to find all the asymptotic results that enhanced finding the asymptotic regret and the asymptotic confidence interval. We use Monte Carlo simulation to verify the asymptotic results. To the best of our knowledge, none of the existing papers in the literature on sequential estimation conduct this study.

In the following lines, we state the three-stage procedure as follows:

P i l o t S t u d y P h a s e :

The pilot study phase starts with selecting an initial random sample

T_{1}

,

T_{2}

,

T_{3}, \dots, T_{m}

of size

m (\geq 2

) from the Rayleigh distribution and calculate the sample average

{\bar{T}}_{m}

to initiate the process. We propose to estimate

σ

by the corresponding sample measure

{\bar{T}}_{m}

.

M a i n S t u d y P h a s e :

During the main study phase, we only estimate a portion

0 < δ < 1,

of

n^{*}

to avoid the possibility of over-sampling in the pilot study phase. The required stopping rule is:

N_{1} = m a x {m, [δ k^{2} {\bar{T}}_{m}^{2}] + 1},

(7)

where

[x]

is the integer-valued function.

If

m \geq [δ k^{2} {\bar{T}}_{m}^{2}] + 1

, then we stop at this stage. Otherwise, we continue to observe an additional sample of size

[δ k^{2} {\bar{T}}_{m}^{2}] + 1 - m

—say,

T_{m + 1}

,

T_{m + 2}

,

T_{m + 3}, \dots, T_{N_{1}}

. Hence, we update the estimate

{\bar{T}}_{N_{1}}

based on the collected

N_{1}

samples to define the main study phase. Note that in this stage,

\hat{σ} = {\bar{T}}_{N_{1}}

.

T h e F i n e T u n i n g P h a s e

: The primary study phase is determined through the following stopping rule:

N = \max {N_{1}, [k^{2} {\bar{T}}_{N_{1}}^{2}] + 1} .

(8)

If

N_{1} \geq [k^{2} {\bar{T}}_{N_{1}}^{2}] + 1

, we stop at this stage. Otherwise, we continue to sample an additional sample of size

[k^{2} {\bar{T}}_{N_{1}}^{2}] + 1 - N_{1}

—say,

T_{N_{1} + 1}

,

T_{N_{1} + 2}

,

T_{N_{1} + 3}

,

\dots

,

T_{N}

. Upon the realization of

N

, we terminate the sampling course and propose the estimate

{\bar{T}}_{N} = \sqrt{2 / π} {\bar{X}}_{N}

for the unknown scale parameter

σ .

In the following subsection, we present the stopping rules (7) and (8). These results were developed under the following assumption set forward by Hall [24] to develop a three-stage sequential sampling procedure theory. That is,

Assumption A:

Let

ξ

(>0) such that

l i m S u p (\frac{m}{ξ (m)}) < δ

as

ξ (m) \to \infty

, and

ξ (m) = O (m^{r})

, for

r > 1 .

Theorem 1 below provides the asymptotic results of the main study phase:

Theorem 1.

Under assumption A, for the three-stage sampling procedure (7) and (8) as

d \to 0,

we have:

(i): $E ({\bar{T}}_{N_{1}}) = σ - \frac{2 (4 - π) σ}{π} {(δ n^{*})}^{- 1} + o (d^{2})$ ,
(ii): $E ({\bar{T}}_{N_{1}}^{2}) = σ^{2} - \frac{3 (4 - π)}{π} σ^{2} {(δ n^{*})}^{- 1} + o (d^{2})$ ,
(iii): $V a r ({\bar{T}}_{N_{1}}) = \frac{(4 - π)}{π} σ^{2} {(δ n^{*})}^{- 1} + o (d^{2})$ ,
(iv): $E ({\bar{T}}_{N_{1}}^{4}) = σ^{4} - \frac{2 (4 - π)}{π} σ^{4} {(δ n^{*})}^{- 1} + o (d^{2})$ ,
(v): $V a r ({\bar{T}}_{N_{1}}^{2}) = \frac{4 (4 - π)}{π} σ^{4} {(δ n^{*})}^{- 1} + o (d^{2})$ .

Proof.

(i)

write

E ({\bar{T}}_{N_{1}}) = E ({\bar{T}}_{N_{1}} - σ + σ) = σ + E (N_{1}^{- 1} \sum_{i = 1}^{N_{1}} (T_{i} - σ))

, then, conditional on the

σ -

field generated by the random variables

X_{1}

,

X_{2}

,

X_{3}, \dots, X_{m},

we have:

E ({\bar{T}}_{N_{1}}) = σ + E N_{1}^{- 1} E (\sum_{i = 1}^{m} (T_{i} - σ) + \sum_{i = m + 1}^{N_{1}} (T_{i} - σ)) | T_{1}, T_{2}, T, \dots, T_{m} .

Given

T_{1}, T_{2}, T_{3}, \dots, T_{m}

we have

\sum_{i = 1}^{m} (T_{i} - σ)

is constant and

\sum_{i = m + 1}^{N_{1}} (T_{i} - σ | T_{1}, T_{2}, \dots, T_{m}) = (N_{1} - m) E (T_{i} - σ) = 0

by Wald’s first equation [61].

Therefore, we have

E ({\bar{T}}_{N_{1}}) = σ + m E (\frac{{\bar{T}}_{m} - σ}{N_{1}})

.

Next, expand

N_{1}^{- 1}

around

δ n^{*}

in stochastic Taylor series to obtain:

\begin{matrix} N_{1}^{- 1} = {(δ n^{*})}^{- 1} - (N_{1} - δ n^{*}) {(δ n^{*})}^{- 2} + (N_{1} - δ n^{*})^{2} ν^{- 3}, \\ = {(δ n^{*})}^{- 1} - δ k ({\bar{T}}_{m}^{2} - σ^{2}) {(δ n^{*})}^{- 2} + {(δ k)}^{2} ({\bar{T}}_{m}^{2} - σ^{2})^{2} ν^{- 3}, \end{matrix}

where

ν

is a random variable between

N_{1}

and

δ n^{*}

.

It follows that:

\begin{matrix} E ({\bar{T}}_{N_{1}}) & = σ + m E {({\bar{T}}_{m} - σ) ({(δ n^{*})}^{- 1} - m δ k^{2} ({\bar{T}}_{m}^{2} - σ^{2}) {(δ n^{*})}^{- 2} + δ^{2} k^{4} {({\bar{T}}_{m}^{2} - σ^{2})}^{2} ν^{- 3})} \\ = σ + E {({\bar{T}}_{m} - σ) {(δ n^{*})}^{- 1}} - E {m δ k^{2} {(δ n^{*})}^{- 2} E {({\bar{T}}_{m} - σ)}^{3}} + E {2 σ m δ k^{2} {(δ n^{*})}^{- 2} E {({\bar{T}}_{m} - σ)}^{2}} \\ + m δ^{2} k^{4} E {({\bar{T}}_{m} - σ) ({\bar{T}}_{m}^{2} - σ^{2})^{2} ν^{- 3}}, \end{matrix}

It follows that:

E ({\bar{T}}_{N_{1}}) = σ + I - I I + I I I + I V .

By assumption A,

m / n^{*} \approx δ

. Then, as

m \to \infty

,

I = 0,

and

I I = m δ k^{2} {(δ n^{*})}^{- 2} \frac{2 (π - 3) σ^{3}}{m^{2}} = o (d^{2}

). Next, recall

I I I

:

\begin{matrix} I I I = 2 σ m δ k^{2} {(δ n^{*})}^{- 2} E {({\bar{T}}_{m} - σ)}^{2} = 2 σ m δ k^{2} {(δ n^{*})}^{- 2} \frac{(4 - π) σ^{2}}{π m} \\ = \frac{2 (4 - π) σ^{2}}{π} {(δ n^{*})}^{- 1} + o (d^{2}) . \end{matrix}

Next, recall

I V

:

\begin{matrix} I V = & m δ^{2} k^{4} E {({\bar{T}}_{m} - σ) {({\bar{T}}_{m}^{2} - σ^{2})}^{2} ν^{- 3}} = m δ^{2} k^{4} E {{({\bar{T}}_{m} - σ)}^{5} ν^{- 3} + \\ 2 σ {({\bar{T}}_{m} - σ)}^{4} ν^{- 3} + 4 σ^{2} {({\bar{T}}_{m} - σ)}^{3} ν^{- 3}} = o (d^{2}) as m \to \infty . \end{matrix}

where we consider the two cases

ν \leq δ n^{*},

then

m δ^{2} k^{4} E {

(

{\bar{T}}_{m} - σ)^{r} ν^{- 3}}

\leq

m δ^{2} k^{4} E (

(

{\bar{T}}_{m} - σ)^{r} {(δ n^{*})}^{- 1} = o (d^{2})

for

r = 5, 4, 3

, as

m \to \infty .

Second, if

ν \leq m \leq N_{1}

, then

m δ^{2} k^{4} E {

(

{\bar{T}}_{m} - σ)^{r} ν^{- 3}}

\leq

m δ^{2} k^{4} E (

(

{\bar{T}}_{m} - σ)^{r} m^{- 3}) = o (d^{2})

as

m \to \infty

for

r = 5, 4, 3 .

We have also used assumption A.

The proof of

(i)

is complete.

Similar arguments can be used to justify

(i i)

and

(i v)

. Part

(i i i)

follows from

(i)

and

(i i) .

Part

(v)

follows from

(i i)

and (

i v) .

We omit details for brevity. The proof is complete. □

Theorem 2 below provides the asymptotic mean and variance for the final random sample size.

Theorem 2.

Under assumption (A), for the three-stage procedure (7) and (8) and as

d \to 0

, we have:

(i): $E (N) = n^{*} - \frac{3 (4 - π)}{π} δ^{- 1} + \frac{1}{2} + o (1)$ ,
(ii): $V a r (N) = \frac{4 (4 - π)}{π} δ^{- 1} n^{*} + o (d)$ .

Proof.

(i)

write the random variable

N

as

=

[k^{2} {\bar{T}}_{N_{1}}^{2}

] +1

, a . s .

except possibly on a set

ζ = (N_{1} < m) \cup^{} (k^{2} {\bar{T}}_{N_{1}}^{2} < δ k^{2} {\bar{T}}_{m}^{2} + 1

), of measure zero, such that

\int_{ζ}^{} d P = o (1)

.

Therefore,

(N) =

E (k^{2} {\bar{T}}_{N_{1}}^{2}) + E (β_{N_{1}}) + o (1)

. The continuous random variable

β_{N_{1}} = 1 - (k^{2} {\bar{T}}_{N_{1}}^{2} - [k^{2} {\bar{T}}_{N_{1}}^{2}])

has a standard uniform distribution (see Hall [24], and for large

m

see Anscombe [62]) central limit Theorem suggests that

{\bar{T}}_{N_{1}}^{2}

is normally distributed.

Hence,

E (N) = E (k^{2} {\bar{T}}_{N_{1}}^{2}) + 1 / 2 + o (1)

. By using the Theorem 1 part

(i i),

we get the result. The proof of part

(i)

is complete.

Part

(i i)

follows immediately from Anscombe [62] central limit theorem, since

\sqrt{N_{1}} ({\bar{T}}_{N_{1}}^{2} - σ^{2}) \to N (0, \frac{4 (4 - π) σ^{4}}{π})

and

\sqrt{\frac{δ}{n^{*}}} (N - n^{*}) \to N (0, \frac{(4 - π) σ^{2}}{π})

together with the uniform integrability of

{(\sqrt{\frac{δ}{n^{*}}} (N - n^{*}))}^{2}

. The proof of

(i i)

is complete. □

Theorem 2 shows that the average random sample size is always less than the optimal sample size. That is

E (N) < n^{*}

for all values of

n^{*}

. Moreover,

\lim_{d \to 0} E (N / n^{*}) = 1

, which means the procedure attains first-order asymptotic efficiency and

\lim_{d \to 0} E (N - n^{*}) < \infty,

which indicates that the procedure attains asymptotic second-order efficiency in the sense of [63]. Part

(i i)

shows that the variance increases as

n^{*}

increases.

The following Theorem 3 gives the second-order asymptotic expansion of the moments of a real-valued continuously differentiable function of the stopping time random variable

N

.

Theorem 3.

Let

h (> 0)

be a real-valued continuously differentiable and bounded function, such that

\sup_{n > m} | h^{‴} (n) | = O | h^{‴} (n^{*}) |

, then as

m \to \infty

\begin{matrix} E (h (N)) = h (n^{*}) & + {- \frac{3 (4 - π)}{π} δ^{- 1} + \frac{1}{2}} h^{'} (n^{*}) \\ + {\frac{4 (4 - π)}{π} δ^{- 1} n^{*}} h^{″} (n^{*}) . + o (d^{- 2} (h^{‴} (n^{*}))) . \end{matrix}

Proof.

The proof is a direct substitution of Theorem 2 parts

(i

) and

(i i)

in Taylor expansion of

h (N),

while we use the assumption that

h^{‴}

is bounded. The proof is complete. □

Theorem 4 below gives the asymptotic characteristics of the fine-tuning phase under the Assumption

A

.

Theorem 4.

For the three-stage rules (7) and (8), and as

d \to 0,

(i): $E ({\bar{T}}_{N}) = σ - \frac{2 (4 - π) σ}{π n^{*}} + o (d^{2}),$
(ii): $E ({\bar{T}}_{N}^{2}) = σ^{2} + (δ - 4) \frac{(4 - π) σ^{2}}{π n^{*}} + o (d^{2})$ ,
(iii): $V a r ({\bar{T}}_{N}) = \frac{δ (4 - π) σ^{2}}{π n^{*}} + o (d^{2}) .$

Proof.

Part

(i)

write

E ({\bar{T}}_{N}) = E ({\bar{T}}_{N}) = σ + E {N^{- 1} \sum_{i = 1}^{N} (T_{i} - σ)}

.

Next, condition on the

σ

- field generated by

T_{1}

,

T_{2},

…,

T_{N_{1}}

. It follows that:

E ({\bar{T}}_{N}) = σ + E {N^{- 1} N_{1} ({\bar{T}}_{N} - σ)},

then expand

N^{- 1}

in Taylor series around

n^{*}

as:

N^{- 1} = {(n^{*})}^{- 1} - (N - n^{*}) {(n^{*})}^{- 2} + {(N - n^{*})}^{2} {(ν)}^{- 3},

where

ν

is a random variable between

N

and

n^{*}

.

N^{- 1} = {(n^{*})}^{- 1} - k^{2} ({\bar{T}}_{N_{1}}^{2} - σ^{2}) {(n^{*})}^{- 2} + k^{4} {({\bar{T}}_{N_{1}}^{2} - σ^{2})}^{2} {(ν)}^{- 3} .

Therefore,

\begin{matrix} E ({\bar{T}}_{N}) = & E {N_{1} ({\bar{T}}_{N_{1}} - σ) {{(n^{*})}^{- 1} - k^{2} ({\bar{T}}_{N_{1}}^{2} - σ^{2}) {(n^{*})}^{- 2} + k^{4} {({\bar{T}}_{N_{1}}^{2} - σ^{2})}^{2} {(ν)}^{- 3}}} \\ = I - I I + I I I . \end{matrix}

I = E {N_{1} ({\bar{T}}_{N_{1}} - σ) {(n^{*})}^{- 1}} = 0,

by Wald’s first equation [61]. Then, recall II,

\begin{matrix} I I & = k^{2} {(n^{*})}^{- 2} E {N_{1} ({\bar{T}}_{N_{1}} - σ) ({\bar{T}}_{N_{1}} - σ) ({\bar{T}}_{N_{1}} - σ + 2 σ)} \\ = k^{2} {(n^{*})}^{- 2} E {N_{1} {({\bar{T}}_{N_{1}} - σ)}^{3}} + k^{2} {(n^{*})}^{- 2} 2 σ E {N_{1} {({\bar{T}}_{N_{1}} - σ)}^{2}} . \end{matrix}

The first term of II,

k^{2} {(n^{*})}^{- 2} E {N_{1} {({\bar{T}}_{N_{1}} - σ)}^{3}} = {(n^{*})}^{- 2} E {N_{1}^{- 2} (\sum_{i = 1}^{N_{1}} (T_{i} - σ))^{3}} .

Condition on the

σ -

field generated by

T_{1}

,

T_{2},

…,

T_{m}

and expand

(\sum_{i = 1}^{N_{1}} ({\bar{T}}_{N_{1}} - σ))^{3}

.

We have,

{(n^{*})}^{- 2} E {N_{1}^{- 2} {(\sum_{i = 1}^{N_{1}} (T_{i} - σ))}^{3}} = {(n^{*})}^{- 2} E {N_{1}^{- 2} E (\sum_{i = 1}^{m} (T_{i} - σ) + {\sum_{i = m + 1}^{N_{1}} (T_{i} - σ))}^{3} | T_{1}, T_{2}, …, T_{m_{1}}} = k^{2} {(n^{*})}^{- 2} E N_{1}^{- 2} E ({(\sum_{i = 1}^{m} (T_{i} - σ))}^{3} + 3 (\sum_{i = 1}^{m} (T_{i} - σ))^{2} \sum_{i = m + 1}^{N_{1}} (T_{i} - σ) + + 3 (\sum_{i = 1}^{m} (T_{i} - σ) (\sum_{i = m + 1}^{N_{1}} (T_{i} - σ))^{2} + (\sum_{i = m + 1}^{N_{1}} (T_{i} - σ))^{2}) | T_{1}, T_{2},, \dots, T_{m})

\begin{matrix} = k^{2} {(n^{*})}^{- 2} E {N_{1}^{- 2} m^{3} {({\bar{T}}_{m} - σ)}^{3}} + 3 k^{2} {(n^{*})}^{- 2} m E {N_{1}^{- 2} {m ({\bar{T}}_{m} - σ) (N_{1} - m)} \\ + k^{2} {(n^{*})}^{- 2} m E {N_{1}^{- 2} (N_{1} - m) \frac{2 (3 - π) σ^{3}}{π {(N_{1} - m)}^{2}}} . \end{matrix}

We have used Wald’s first equation [61] to prove that the second term in the expansion is zero.

= A + B + C,

where,

A = k^{2} {(n^{*})}^{- 2} E {N_{1}^{- 2} m^{3} {({\bar{T}}_{m} - σ)}^{3}} \leq \frac{2 (3 - π)}{π m n^{*}} σ = o (d^{2}) a s m \to \infty .

B = 3 k^{2} {(n^{*})}^{- 2} m E {N_{1}^{- 2} {m ({\bar{T}}_{m} - σ) (N_{1} - m)} \leq 3 k^{2} {(n^{*})}^{- 2} m E ({\bar{T}}_{m} - σ) = 0,

and

C = k^{2} {(n^{*})}^{- 2} m E {N_{1}^{- 2} (N_{1} - m) \frac{2 (3 - π) σ^{3}}{π {(N_{1} - m)}^{2}}} \leq \frac{2 (3 - π) σ}{π m n^{*}} = o (d^{2}) .

The second term in II,

\begin{matrix} k^{2} {(n^{*})}^{- 2} 2 σ E {N_{1} ({\bar{T}}_{N_{1}} - σ)^{2}} = k^{2} {(n^{*})}^{- 2} 2 σ E {N_{1}^{- 1} (E {(\sum_{i = 1}^{m} (T_{i} - σ) + \sum_{i = m + 1}^{N_{1}} (T_{i} - σ))}^{2} | T_{1}, T_{2}, \dots, T_{m})} \\ = k^{2} {(n^{*})}^{- 2} 2 σ E {N_{1}^{- 1} E {(\sum_{i = 1}^{m} (T_{i} - σ))}^{2}} + k^{2} {(n^{*})}^{- 2} 2 σ E {(N_{1}^{- 1} (N_{1} - m) \frac{σ^{2} (4 - π)}{π ({(N_{1} - m)}^{2}}} \\ = D + E, \end{matrix}

D = k^{2} {(n^{*})}^{- 2} 2 σ E {N_{1}^{- 1} m^{2} E ({\bar{T}}_{m} - σ)^{2}} = \frac{2 σ (4 - π) σ^{2}}{π n^{*}} as m \to \infty .

Here, we have used the fact

N_{1}^{- 1} \approx {(δ n)}^{- 1}

and

m / n^{*} \approx δ

under assumption A.

Similar arguments prove that

E = o (d^{2}),

where we have used the fact that

\frac{1}{N_{1} (N_{1} - m)} \leq \frac{1}{m^{2}}

.

It remains to evaluate the remainder term in

I I I

, which is:

I I I = k^{4} E {N_{1} ({\bar{T}}_{N_{1}} - σ) {({\bar{T}}_{N_{1}}^{2} - σ^{2})^{2} {(ν)}^{- 3}}} = o (d^{2}) .

Arguments similar to those used above and the fact that the random variable

ν

is between

N

and

n^{*}

can be used to justify the rate of convergence of

I I I

. We omit any further details for brevity.

This completes the proof of

(i) .

□

Likewise,

(i i)

can be asserted along the above lines if we write:

E ({\bar{T}}_{N}^{2}) = σ^{2} + E {N^{- 2} (\sum_{i = 1}^{N} (T_{i} - σ))^{2} + 2 σ E {N^{- 1} \sum_{i = 1}^{N} (T_{i} - σ)} .

Theorem 2 provides

2 σ E {N^{- 1} \sum_{i = 1}^{N} (T_{i} - σ)} = - \frac{2 (4 - π) σ^{2}}{π n^{*}}

.

Therefore,

E ({\bar{T}}_{N}^{2}) = σ^{2} + E {{N^{- 2} (\sum_{i = 1}^{N} (T_{i} - σ))}^{2}} - \frac{2 (4 - π) σ^{2}}{π n^{*}} + o (d^{2}) .

Likewise, condition on the

σ - f i e l d

generated by

T_{1}

,

T_{2},

…,

T_{N_{1}}

to obtain:

\begin{matrix} E ({\bar{T}}_{N}^{2}) = σ^{2} + E {N^{- 2} E {(\sum_{i = 1}^{N_{1}} (T_{i} - σ) + \sum_{i = N_{1} + 1}^{N} (T_{i} - σ))}^{2}} | T_{1}, T_{2}, \dots, T_{N_{1}} \\ = σ^{2} - \frac{2 (4 - π) σ^{2}}{π n^{*}} + E {N^{- 2} {(\sum_{i = 1}^{N_{1}} (T_{i} - σ))}^{2}} + \frac{σ^{2} (4 - π)}{π} E {N^{- 2} (N - N_{1})} + o (d^{2}) . \end{matrix}

The term

\frac{σ^{2} (4 - π)}{π} E {N^{- 2} (N - N_{1})} \leq \frac{σ^{2} (4 - π)}{π} E (\frac{1}{N}) = o (d^{2})

, while, by using Wald’s, second equation [61]:

E {N^{- 2} {\sum_{i = 1}^{N_{1}} (T_{i} - σ)}^{2}} \approx {(n^{*})}^{- 2} E (N_{1}) V a r (T_{i}) = \frac{δ (4 - π) σ^{2}}{π n^{*}} + o (d^{2}) .

This completes the proof. □

Part (i) of Theorem 4 shows that

{\bar{T}}_{N}

is an asymptotically unbiased estimator of

σ

. Meanwhile part (iii) shows that the variance decreases as

n^{*}

increases.

Theorem 5.

Let

g > 0

be a continuously differentiable real-valued function in a neighborhood around

σ

, such that

S u p_{n \geq m} | g^{‴} (n) | = O | g^{‴} (n^{*}) |

, then as

m \to \infty

E (g ({\bar{T}}_{N})) = g (σ) - σ \frac{(4 - π)}{2 π n^{*}} {4 g^{'} (σ) - δ σ {g^{'}}^{’ (σ)}} + o (d^{- 2} g^{‴} (n^{*})) .

Proof.

The proof is instantaneous if we expand

g ({\bar{T}}_{N})

in Taylor series around

σ

, and substitute

(i)

and (i i i)

of Theorem 4, together with the assumption that the function

g (> 0)

and its derivatives are bounded. The proof is complete. □

3.1. Three-Stage Minimum Risk Point Estimation

The asymptotic regret

ω (d)

encountered in the estimation of

σ

by the corresponding three-stage point estimate

{\bar{T}}_{N}

is given by:

ω (d) = E (R_{N} (d)) - R {(d)}_{n}^{*} = \frac{a^{2} n^{*}}{d^{2}} E {({\bar{T}}_{N} - σ)}^{2} + E (N) - 2 n^{*} .

By using Theorems 2 and 4, as

d \to 0,

we get:

ω (d) = n^{*} (δ - 1) + \frac{3 (4 - π) δ^{- 1}}{π} + o (1) .

The asymptotic regret

ω (d) < 0

(negative regret), which reflects that the three-stage procedure produces estimates for the Rayleigh distribution scale parameter better than using the fixed sample size technique. Additionally, the regret of using the three-stage procedure to estimate the scale parameter compared to using the fixed sample size (classical inference) is less than a non-vanishing finite quantity

\frac{3 (4 - π)}{π} δ^{- 1} + \frac{1}{2},

0 < δ < 1

. Simon [64] called this quantity the cost of ignorance, of not knowing the scale parameter. The issue of negative regret was discussed by Martinsek [27]. Table 1 below shows the Rayleigh distribution characteristics’ mathematical representation and the three-stage estimates for the mode, the median, the reliability, the hazard function at a specific time, and the entropy.

3.2. Three-Stage Fixed-Width Confidence Interval

Once the sampling procedure is terminated, we propose the fixed

2 d

width three-stage confidence interval

I_{N} = {\bar{T}}_{N} \pm d

for the scale parameter

σ

.

The coverage probability of the interval is calculated as:

\begin{matrix} P (σ \in I_{N}) = \sum_{n = m}^{\infty} P (| {\bar{T}}_{N} - σ | \leq d, N = n) \\ = \sum_{n = m}^{\infty} P (| {\bar{T}}_{N} - σ | \leq d | N = n) P (N = n) . \end{matrix}

Hence,

P (σ \in I_{N}) = \sum_{n = m}^{\infty} P (| {\bar{T}}_{n} - σ | \leq d | N = n) P (N = n) .

Since the stopping variable

N

depends on the scale parameter estimate

{\bar{T}}_{N}

, then

N

and

{\bar{T}}_{N}

are not stochastically independent. Therefore, we use Monte Carlo simulation to study the characteristic of

P (σ \in I_{N})

when the sample size varies from small, moderate, and large.

4. Simulation Study

Monte Carlo simulation is conducted to evaluate the three-stage procedure’s performance when the sample size varies from small, moderate, and large. A FORTRAN program is coded using Microsoft Developer Studio software to generate a series of simulations. For each experimental situation, 50,000 replicate samples were used. Random samples from the Rayleigh distribution were generated, and a three-stage sampling rule (7), (8) was implemented to estimate all the parameters in concern;

\hat{σ}

and its standard error;

\bar{N}

the estimated values of

n^{*}

and their standard error; the mean and the variance of the Rayleigh distribution and their standard errors; the regret; and, finally, the estimated value of the coverage probability. The optimal sample sizes are chosen typically

n^{*} =

25, 50, 100, 150, 200, 250, 300, 400, and 500.

For constructing a fixed-width confidence interval for the scale parameter

σ,

we take

α = 0.05

, and, accordingly,

a = 1.96

. Additionally, we consider different values for the initial sample size,

m =

5, 10, and 15, and the portion of the initial sample used for estimation,

δ =

0.3, 0.5, and 0.8. Mukhopadhyay [41] noted that if the design factor

δ

is chosen near zero or one, then a three-stage procedure would be more like Stein’s two-stage procedure. Therefore, a three-stage procedure is better implemented with

δ =

0.4, 0.5, or 0.6. Hall [36] mentioned that in practice, it seems a reasonable compromise to choose

δ =

0.5.

The simulation process is performed as follows:

For the

i

-th sample generated for a particular combination of

σ, m

,

δ

,

n^{*},

and

a,

we have:

First. Generate an initial sample of size

m (\geq 2),

say

T_{1, i}, T_{2, i}, \dots, T_{m, i}

from Rayleigh distribution with scale parameter

σ

and calculate

{\bar{T}}_{m}

as an initial estimate of

σ

.

Second. Apply the three-stage sampling procedure as presented in (7) and (8) to determine the stopping sample size at this iteration, whether in the first or second stage

N_{i}^{*}

.

Third. Record the resultant values of stage

N_{i}^{*}

and

T_{i}^{*}

.Hence, for each experimental combination we have two vectors of size

s =

50,000

(N_{1}^{*}, N_{2}^{*}, \dots, N_{s}^{*})

and

({\bar{T}}_{1}^{*}, {\bar{T}}_{2}^{*}, \dots, {\bar{T}}_{s}^{*})

. Define:

\bar{N} = s^{- 1} \sum_{i}^{s} N_{i}^{*} and \bar{T} = s^{- 1} \sum_{i}^{s} {\bar{T}}_{i}^{*},

where

\bar{N}

and

\bar{T}

are, respectively, the estimated mean sample size and the estimated mean of the estimator of the population scale parameter across replicates. Thus,

\hat{σ} = \bar{T}

may be regarded as an estimate of the expected value of the estimator of the scale parameter. The standard errors are:

S_{\bar{N}} = {s (s - 1)}^{- 1 / 2} {\sum_{i}^{s} {({\bar{N}}_{i} - \bar{N})}^{2}}^{1 / 2}, S_{\hat{σ}} = {s (s - 1)}^{- 1 / 2} {\sum_{i}^{s} {({\bar{T}}_{i} - \bar{T})}^{2}}^{1 / 2} .

Similar arguments can be calculated for other parameter estimates.

Fourth. The simulated regret is

\hat{ω} (d) = A^{2} s^{- 1} {\sum_{1}^{s} {({\bar{T}}_{i} - σ)}^{2}} + c \bar{N} - R (n^{*})

.

Fifth. The simulated coverage probability is:

\hat{(1 - α)} = \frac{# σ \in ({\bar{T}}_{i} \pm d)}{s}, i = 1, \dots, s .

For brevity, Table 2 below demonstrates the simulation results evaluated at

m = 10

,

δ =

0.5, and

1 - α =

0.95 for each respective

n^{*}

as follows:

\bar{N}

is the simulated estimate for the optimal sample size, with a standard error given by

S (\bar{N})

.

\hat{σ}

is the simulated estimate for the scale parameter

σ

with standard error

S (\hat{σ})

.

\hat{μ}

is the simulated estimate for the population mean of the distribution with standard error

S (\hat{μ})

.

\hat{v a r} (x)

is the simulated estimate for the variance of the distribution with standard error

\hat{S v a r} (x)

.

\hat{m e d} (x)

stands for the simulated estimate for the population median with standard error

S \hat{m e d} (x)

.

\hat{E n t}

stands for the simulated estimate for the population entropy with standard error and

\hat{S E n t}

.

\hat{ω}

is the simulated estimates for the asymptotic regret and finally

1 - \hat{α}

is the simulated estimate for the asymptotic coverage probability.

From these results, we observe that the final random sample size

N

is very close to the optimal sample size

n^{*}

—i.e.,

\bar{N} / n^{*} \approx 1

(first-order asymptotic efficiency)—and

N

is less than

n^{*},

which refers to early stopping with standard error increases as

n^{*}

increases. Additionally,

\bar{N} - n^{*}

is bounded by a finite number that is unrelated to

n^{*}

(second-order asymptotic efficiency). Besides, as

n^{*}

increases the estimate of the scale parameter gets significantly closer to the actual value with decreasing standard errors. Moreover, the simulated coverage probability is always less than the desired nominal value (asymptotic consistency in the sense of [30,36,63]), and this might be because of the early stopping sampling. The regret is a non-vanishing finite quantity with negative values. The negativity in the regret goes due to the dependency between the final random sample size

N

and the estimate of the scale parameter

{\bar{T}}_{N}

Furthermore, it may refer to early stopping.

5. Conclusions

This paper proposes a unified decision framework to estimate the scale parameter of the Rayleigh distribution and several related parameters. Within this optimal decision structure, a three-stage sampling procedure with a bona fide stopping rule is defined to determine the optimal sample size required to perform inference. The procedure enjoys the asymptotic characteristics set forward by Chow and Robbins [36] and Anscombe [34] as well the operational saving made possible by sampling in batches, as in Stein [30] and Cox [33]. Asymptotic characteristics of the three-stage sampling scale parameter estimate and its higher-order moments are presented in Theorems 1–5. The asymptotic regret associated with minimizing the expected cost of the squared-error loss function with the linear sampling cost is also discussed. Monte Carlo simulation was performed to give a proper feel of the inference performance in typical real-life situations. This current problem is different from those considered previously in the case of the normal and exponential distribution. The independence between the stopping variable

N

and the nuisance parameter estimates are apparent. In the Rayleigh distribution case, the stopping variable

N

depends on the scale parameter estimate, and thus the proofs took different directions.

Author Contributions

Conceptualization, A.Y., E.E.H.H., and H.I.H.; methodology, A.Y., E.E.H.H., and H.I.H.; software, A.Y., A.A.A., and H.I.H.; validation, A.Y., E.E.H.H.; formal analysis, A.Y. and H.I.H.; investigation, A.Y., E.E.H.H., A.A.A., and H.I.H.; resources, A.Y.; data curation, A.Y. and H.I.H.; writing—original draft preparation, A.Y.; writing—review and editing, A.Y., E.E.H.H., and H.I.H.; visualization, A.Y. and H.I.H.; supervision, A.Y. and H.I.H.; project administration, A.Y.; funding acquisition, A.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

References

Kodlin, D. A new response time distribution. Biometrics 1967, 23, 227–239. [Google Scholar] [CrossRef] [PubMed]
Rayleigh, F.R.S. On the resultant of a large number of vibrations of the same pitch and arbitrary phase. Lond. Edinb. Dublin Philos. Mag. 1880, 10, 73–78. [Google Scholar] [CrossRef] [Green Version]
Palovko, A.M. Fundamentals of Reliability Theory; Academic Press: New York, NY, USA, 1968. [Google Scholar]
Gross, A.J.; Clark, V.A. Survival Distributions, Reliability Application in Biomedical Science; Wiley: New York, NY, USA, 1975. [Google Scholar]
Lee, E.T.; Wang, J.W. Statistical Methods for Survival Data Analysis, 3rd ed.; Wiley & Sons Inc.: Hoboken, NJ, USA, 2003. [Google Scholar]
Dyer, D.D.; Whisenand, C.W. Best linear unbiased estimator of the parameter of the Rayleigh distribution: Part-II optimum theory for selected order statistics. IEEE Trans. Reliab. 1965, 60, 229–231. [Google Scholar] [CrossRef]
Siddiqui, M.M. Some problems connected with Rayleigh distributions. J. Res. Natl. Bur. Stand. 1962, 66D, 167–174. [Google Scholar] [CrossRef]
Hirano, K. Rayleigh Distributions; Wiley: New York, NY, USA, 1986. [Google Scholar]
Howlader, H.A.; Hossain, A. On Bayesian estimation and prediction from Rayleigh distribution based on type-II censored data. Commun. Stat. Theory Methods 1995, 24, 2251–2259. [Google Scholar] [CrossRef]
Rosen, K.; Van Buskirk, R.; Garbesi, K. Wind energy potential of coastal Eritrea: An analysis of sparse wind data. Sol. Energy 1999, 66, 201–213. [Google Scholar] [CrossRef]
Rehman, S.; Halawani, T.O.; Husain, T. Weibull parameters for wind speed distribution in Saudi Arabia. Sol. Energy 1994, 53, 473–479. [Google Scholar] [CrossRef]
Celik, A.N. Energy output estimation for small-scale wind power generators using Weibull-representative wind data. J. Wind Eng. Ind. Aerodyn. 2003, 91, 693–707. [Google Scholar] [CrossRef]
Pishgar-Komleh, S.H.; Keyhani, A.; Sefeedpari, P. Wind speed and power density analysis based on Weibull and Rayleigh distributions (a case study: Firouzkooh county of Iran). Renew. Sustain. Energy Rev. 2015, 42, 313–322. [Google Scholar] [CrossRef]
Sinha, S.K.; Howlader, H.A. Credible and HPD intervals of the parameter and reliability of Rayleigh distribution. IEEE Trans. Reliab. 1983, 32, 217–220. [Google Scholar] [CrossRef]
Ariyawansa, K.A.; Templeton, J.G.C. Structural inference on the parameter of the Rayleigh distribution from doubly censored samples. Stat. Hefte 1984, 25, 181–199. [Google Scholar] [CrossRef]
Howlader, H.A. HPD prediction intervals for Rayleigh distribution. IEEE Trans. Reliab. 1985, 34, 121–123. [Google Scholar] [CrossRef]
Lalitha, S.; Mishra, A. Modified maximum likelihood estimation for Rayleigh distribution. Commun. Stat. Theory Methods 1996, 25, 389–401. [Google Scholar] [CrossRef]
Elfattah, A.M.; AbdHassan, A.S.; Ziedan, D.M. Efficiency of maximum likelihood estimators under different censored sampling schemes for Rayleigh Distribution. Interstat Electron. J. 2006, 1, 1–16. [Google Scholar]
Dey, S.; Das, M.K. A note on prediction interval for a Rayleigh distribution: Bayesian approach. Am. J. Math. Manag. Sci. 2007, 27, 43–48. [Google Scholar] [CrossRef]
Dey, S. Comparison of Bayes estimators of the parameter and reliability function for Rayleigh distribution under different loss functions. Malays. J. Math. Sci. 2009, 3, 249–266. [Google Scholar]
Dey, S.; Dey, T. Bayesian estimation of the parameter and reliability of a Rayleigh distribution using records. Model Assist. Stat. Appl. 2012, 7, 81–90. [Google Scholar] [CrossRef]
Johnson, N.L.; Kotz, S.; Balakrishnan, N. Continuous Univariate Distributions, 2nd ed.; John Wiley & Sons: New York, NY, USA, 1994; Volume 1, ISBN 0-471-58495-9. [Google Scholar]
Shannon, C.E. Mathematical theory of communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef] [Green Version]
Hall, P. Asymptotic theory and triple sampling of sequential estimation of a mean. Ann. Stat. 1981, 9, 1229–1238. [Google Scholar] [CrossRef]
Degroot, M.H. Optimal Statistical Decisions; McGraw-Hill: New York, NY, USA, 1970. [Google Scholar]
Chow, Y.S.; Yu, K.F. The performance of a sequential procedure for the estimation of the mean. Ann. Stat. 1981, 9, 189–198. [Google Scholar] [CrossRef]
Martinsek, A.T. Negative regret, optional stopping and the elimination of outliers. J. Am. Stat. Assoc. 1988, 83, 160–163. [Google Scholar] [CrossRef]
Hamdy, H.I. Remarks on the asymptotic theory of triple stage estimation of the normal mean. Scand. J. Stat. 1988, 15, 303–310. [Google Scholar]
Dantzig, G.B. On the Non-existence of tests of student’s hypothesis having power function independent of σ. Ann. Math. Stat. 1940, 11, 186–192. [Google Scholar] [CrossRef]
Stein, C. A two-sample test for a linear hypothesis whose power is independent of the variance. Ann. Math. Stat. 1945, 16, 243–258. [Google Scholar] [CrossRef]
Stein, C. Some problems in sequential estimation (abstract). Econometrica 1949, 17, 77–78. [Google Scholar]
Seelbinder, B.M. On Stein’s two stage sampling scheme. Ann. Math. Stat. 1953, 24, 640–900. [Google Scholar] [CrossRef]
Cox, D.R. Estimation by double sampling. Biometrika 1952, 39, 217–227. [Google Scholar] [CrossRef]
Anscombe, F.J. Sequential estimation. J. R. Stat. Soc. 1953, 15, 1–21. [Google Scholar] [CrossRef]
Robbins, H. Sequential Estimation of the Mean of a Normal Population. Probability and Statistics (Harald Cramer Volume); Almquist and Wiksell: Uppsala, Sweden, 1959; pp. 235–245. [Google Scholar]
Chow, Y.S.; Robbins, H. On the asymptotic theory of fixed-width sequential confidence intervals for the mean. Ann. Math. Stat. 1965, 36, 1203–1212. [Google Scholar] [CrossRef]
Mukhopadhyay, N. A note on three-stage and sequential point estimation procedures for a normal mean. J. Seq. Anal. 1985, 4, 311–319. [Google Scholar] [CrossRef]
Mukhopadhyay, N. Sequential estimation problems for negative exponential populations. Commun. Stat. Theory Methods 1988, 17, 2471–2506. [Google Scholar] [CrossRef]
Mukhopadhyay, N. Some properties of a three-stage procedure with applications in sequential analysis. Sankhya 1990, 52, 218–231. [Google Scholar]
Mukhopadhyay, N.; Hamdy, H.I.; AlMahmeed, M.; Costanza, M.C. Three stage point estimation procedures for a normal mean. J. Seq. Anal. 1987, 6, 21–36. [Google Scholar] [CrossRef]
Mukhopadhyay, N.; Mauromoustakos, A. Three stage estimation procedures of the negative exponential distribution. Metrika 1987, 34, 83–93. [Google Scholar] [CrossRef]
Hamdy, H.I.; Pallotta, W.J. Triple sampling procedure for estimating the scale parameter of Pareto distribution. Commun. Stat. Theory Methods 1987, 16, 2155–2164. [Google Scholar] [CrossRef]
Hamdy, H.I.; Mukhopadhyay, N.; Costanza, M.C.; Son, M.S. Triple stage point estimation for the exponential location parameter. Ann. Inst. Stat. Math. 1988, 40, 785–797. [Google Scholar] [CrossRef]
Hamdy, H.I.; AlMahmeed, M.; Nigm, A.; Son, M.S. Three stage estimation for the exponential location parameters. Metron 1989, 47, 279–294. [Google Scholar]
Mukhopadhyay, N.; Padmanabhan, A.R. A note on three-stage confidence intervals for the difference of locations: The exponential case. Metrika 1993, 40, 121–128. [Google Scholar] [CrossRef]
Takada, Y. Three stage estimation procedure of the multivariate normal mean. Sankhya 1993, 55, 124–129. [Google Scholar]
Hamdy, H.I. Performance of fixed width confidence intervals under type II errors: The exponential case. S. Afr. Stat. J. 1997, 31, 259–269. [Google Scholar]
AlMahmeed, M.; Hamdy, H.I. Sequential estimation of linear models in three stages. Metrika 1990, 37, 19–36. [Google Scholar] [CrossRef]
AlMahmeed, M.; AlHessainan, A.; Son, M.S.; Hamdy, H.I. Three stage estimation for the mean of a one-parameter exponential family. Korean Commun. Stat. 1998, 5, 539–557. [Google Scholar]
Costanza, M.C.; Hamdy, H.I.; Haugh, L.D.; Son, M.S. Type II error performance of triple sampling fixed precision confidence intervals for the normal mean. Metron 1995, 53, 69–82. [Google Scholar]
Yousef, A.; Kimber, A.; Hamdy, H.I. Sensitivity of Normal-Based Triple sampling sequential point estimation to the normality assumption. J. Stat. Plan. Inference 2013, 143, 1606–1618. [Google Scholar] [CrossRef] [Green Version]
Yousef, A.S. Construction a three-stage asymptotic coverage probability for the mean using Edgeworth second order approximation. In Selected Papers on the International Conference on Mathematical Sciences and Statistics 2013; Springer: Singapore, 2014; pp. 53–67. [Google Scholar] [CrossRef]
Hamdy, H.I.; Son, S.M.; Yousef, S.A. Sensitivity analysis of multistage sampling to departure of an underlying distribution from normality with computer simulations. J. Seq. Anal. 2015, 34, 532–558. [Google Scholar] [CrossRef]
Yousef, A. A Note on a three-stage sequential confidence interval for the mean when the underlying distribution departs away from normality. Int. Appl. Math. Stat. 2018, 57, 57–69. [Google Scholar]
Yousef, A.; Hamdy, H. Three-stage estimation for the mean and variance of the normal distribution with application to inverse coefficient of variation. Mathematics 2019, 7, 831. [Google Scholar] [CrossRef] [Green Version]
Yousef, A.; Hamdy, H. Three-stage sequential estimation of the inverse coefficient of variation of the normal distribution. Computation 2019, 7, 69. [Google Scholar] [CrossRef] [Green Version]
Yousef, A. Performance of three-stage sequential estimation of the inverse coefficient of variation under type II error probability: A Monte Carlo simulation study. Front. Phys. 2020. [Google Scholar] [CrossRef]
Liu, W. A k-stage sequential sampling procedure for estimation of a normal mean. J. Stat. Plan. Inference 1995, 65, 109–127. [Google Scholar] [CrossRef]
Son, M.S.; Haugh, H.I.; Hamdy, H.I.; Costanza, M.C. Controlling type II error while constructing triple sampling fixed precision confidence intervals for the normal mean. Ann. Inst. Stat. Math. 1997, 49, 681–692. [Google Scholar] [CrossRef]
Tahir, M. Sequential estimation of the square of the Rayleigh parameter. J. Math. Stat. 2014, 10, 275–280. [Google Scholar] [CrossRef]
Wald, A. Sequential Analysis; Wiley: New York, NY, USA, 1947. [Google Scholar]
Anscombe, F.J. Large sample theory of sequential estimation. Math. Proc. Camb. Philos. Soc. 1952, 48, 600–607. [Google Scholar] [CrossRef]
Ghosh, M.; Mukhopadhyay, N. Consistency and asymptotic efficiency of two-stage and sequential estimation procedures. Sankhya 1981, 43, 220–227. [Google Scholar]
Simon, G. On the cost of not knowing the variance when making a fixed-width confidence interval for the mean. Ann. Math. Stat. 1968, 39, 1946–1952. [Google Scholar] [CrossRef]

Table 1. Point estimation of other distribution parameters.

Distribution Characteristic	Mathematical Representation	Three Stage Point Estimate
The Mode	$σ$	${\bar{T}}_{N}$
The Median	$σ \sqrt{2 l n (2)}$	${\bar{T}}_{N} \sqrt{2 l n (2)}$
Reliability at time $T_{0}$	$e^{- (T_{0} / 2 T_{N}^{2})}$	$e^{- (T_{0} / 2 T_{N}^{2})}$
Hazard Function at time $T_{0}$	$T_{0} / σ^{2}$	$T_{0} / {\bar{T}}_{N}^{2}$
Entropy, $γ = 0.5772$	$1 + l o g (σ / \sqrt{2}) + γ / 2$	$1 + l o g ({\bar{T}}_{N} / \sqrt{2}) + γ / 2$

Table 2. Three-stage estimation results with

m = 10

,

δ = 0.5,

and

α = 0.05

.

Table 2. Three-stage estimation results with

m = 10

,

δ = 0.5,

and

α = 0.05

.

$n^{*}$	25	50	100	150	200	250	300	400	500
$\bar{N}$	22.79	48.84	98.95	148.75	198.82	248.68	298.89	399.10	499.04
$S (\bar{N})$	0.040	0.049	0.069	0.085	0.099	0.110	0.121	0.141	0.156
$\hat{σ}$	9.588	9.859	9.944	9.962	9.970	9.975	9.981	9.987	9.989
$S (\hat{σ})$	0.006	0.004	0.002	0.002	0.002	0.002	0.001	0.001	0.001
$\hat{μ}$	12.537	12.537	12.537	12.485	12.495	12.519	12.521	12.527	12.529
$S (\hat{μ})$	0.010	0.010	0.010	0.004	0.003	0.003	0.003	0.002	0.002
$\hat{v a r} (x)$	6.281	6.459	6.515	6.526	6.532	6.535	6.539	6.543	6.544
$\hat{S v a r} (x)$	0.004	0.003	0.002	0.001	0.001	0.001	0.001	0.001	0.001
$\hat{m e d} (x)$	10.795	11.111	11.209	11.228	11.230	11.232	11.225	11.212	11.188
$S \hat{m e d} (x)$	0.007	0.005	0.003	0.003	0.002	0.002	0.002	0.002	0.002
$\hat{E n t}$	3.1932	3.2262	3.237	3.240	3.241	3.241	3.242	3.243	3.243
$\hat{S E n t}$	0.001	0.000	0.000	0.000	0.000	0.000	0.000	0.000	0.000
$\hat{ω}$	−27.21	−51.16	−101.1	−80.27	−148.79	−85.89	−165.37	−169.95	−178.12
$1 - \hat{α}$	0.8823	0.927	0.940	0.942	0.944	0.946	0.948	0.945	0.949

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Yousef, A.; Hassan, E.E.H.; Amin, A.A.; Hamdy, H.I. Multistage Estimation of the Scale Parameter of Rayleigh Distribution with Simulation. Symmetry 2020, 12, 1925. https://doi.org/10.3390/sym12111925

AMA Style

Yousef A, Hassan EEH, Amin AA, Hamdy HI. Multistage Estimation of the Scale Parameter of Rayleigh Distribution with Simulation. Symmetry. 2020; 12(11):1925. https://doi.org/10.3390/sym12111925

Chicago/Turabian Style

Yousef, Ali, Emad E. H. Hassan, Ayman A. Amin, and Hosny I. Hamdy. 2020. "Multistage Estimation of the Scale Parameter of Rayleigh Distribution with Simulation" Symmetry 12, no. 11: 1925. https://doi.org/10.3390/sym12111925

APA Style

Yousef, A., Hassan, E. E. H., Amin, A. A., & Hamdy, H. I. (2020). Multistage Estimation of the Scale Parameter of Rayleigh Distribution with Simulation. Symmetry, 12(11), 1925. https://doi.org/10.3390/sym12111925

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Multistage Estimation of the Scale Parameter of Rayleigh Distribution with Simulation

Abstract

1. Introduction

2. Estimation Problems

2.1. Minimum Risk Point Estimation

2.2. Fixed-Width Confidence Interval

2.3. A Unified Decision Framework

3. Multistage Sampling

3.1. Three-Stage Minimum Risk Point Estimation

3.2. Three-Stage Fixed-Width Confidence Interval

4. Simulation Study

5. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI