Empirical Bayes Decision for a Generalized Exponential Distribution with Contaminated Data

Han Qiu; Jiaqing Chen; Zihao Yuan; Yangxin Huang

doi:10.3390/sym15020511

,

and

¹

Department of Statistics, College of Science, Wuhan University of Technology, Wuhan 430070, China

²

Department of Epidemiology and Biostatistics, College of Public Health, University of South Florida, Tampa, FL 33612, USA

^*

Author to whom correspondence should be addressed.

Symmetry2023, 15(2), 511;https://doi.org/10.3390/sym15020511

This article belongs to the Special Issue Mathematical Models: Methods and Applications

Version Notes

Order Reprints

Abstract

The two-sided and one-sided empirical Bayes test (EBT) rules for the parameter of a generalized exponential distribution with contaminated data (errors in variables) are constructed by a deconvolution kernel method, respectively. Under the type of the supersmooth error distributions and the supersmooth errors with the error level can be controlled situations, the asymptotically optimal uniformly over a class of prior distributions and uniform rates of convergence of the corresponding regret for the proposed EBT rules are obtained with suitable conditions. The example study shows that the assumptions and conditions of the main results of this paper are satisfied easily by calculating.

Keywords:

generalized exponential distribution; empirical Bayes decision; deconvolution kernel method; contaminated data

1. Introduction

Much of the more recent literature has looked at the empirical Bayes test (EBT). EBT for the parameter of some common distributions are investigated [1,2,3,4]. Under non-identical components case, empirical Bayes testing for a lifetime guarantee is considered for the double parameter exponential distribution [5]. Merging Bayesian and empirical Bayes posterior distributions in total variation is discussed [6]. A double empirical Bayes decision is obtained for multi-experiment studies by means of an empirical Bayes analytical method [7]. In order to study the relationship between empirical Bayes posterior distributions and false discovery rate control, a spike and slab empirical Bayes multiple testing is constructed [8]. An empirical Bayes multiple testing procedure for the sparse sequence model is investigated [9]. In earlier times, on the EBT for the continuous one-parameter exponential family has lots of work and is asymptotically optimal and the optimal convergence rates of EBT are obtained [10,11,12,13,14]. Most of the studies have discussed the empirical Bayes decision problem in the case of non-contaminated data, which is not the case for pure data cases. However, in practical application problems, contaminated data (errors in variables) are involved in many fields, and it has been widely studied [15,16,17,18,19]. In recent literature, the one-sided empirical Bayes decision problem is investigated for the continuous one-parameter exponential family with contaminated data [20].

Suppose that the random variable X has the generalized exponential distribution (GED) with probability density function (PDF) of the following forms [21]

f (x | θ) = (μ x + θ) exp (- θ x - μ x^{2} / 2),

(1)

where

θ

is a unknown parameter with the natural space

Θ = {θ > 0 | \int_{Ω}^{} f (x | θ) d x = 1}

. In this article, we assume

μ > 0

is a known constant, and the sample space is

Ω = {x | x > 0}

.

GED is also called linear exponential distribution, it is a combinatorial distribution, and the exponential and Rayleigh distributions are considered as special cases of GED when

θ = 0

and

μ = 0

, respectively. The hazard function of GED is a linear function about time and age in linear exponential models, and it is one of the reasonable models for lifetime distributions of random phenomena. Progressive type-II censored competing risks data when the lifetimes are assumed to be a linear exponential distribution [21]. Recurrence relations for single and product moments of generalized order statistics have been derived with the linear exponential distribution [22]. The linear exponential distribution has been used in the area of reliability and life-testing see, for example, Broadbent [23] and Bain [24].

The two-sided and one-sided EBT rules are constructed for GED with contaminated data in this article. Deconvolution kernel method is employed to develop the two-sided and one-sided EBT rules with contaminated data, respectively. For errors in the variables model, deconvolution kernel method can eliminate the effect of the additive noise kernel density estimation. Furthermore, under the supersmooth error distributions and the supersmooth errors, the error level can be controlled, the asymptotically optimal and uniform rates of convergence are obtained with suitable conditions.

In practical problems, we often encounter measurement errors due to observation conditions, so the analysis of contaminated data is very important. For the pair random variables

(X, θ)

, assume that

θ

has a prior distribution

G (θ)

, X is one dimensional real random variable with a marginal density function

f_{X | θ} (x)

when

θ

is given,

(X, θ)

is not directly observable. We observe only Y, where

Y = X + ε

, and

ε

are the random error. Suppose that

ε

follows a known distribution

F_{ε}

on

(- \infty, \infty)

, and independent on

(X, θ)

.

Firstly, we consider the two-sided test problem as follows

H_{0} : θ_{1} ⩽ θ ⩽ θ_{2} \leftrightarrow H_{1} : θ < θ_{1} o r θ > θ_{2}

(2)

where

θ_{1}

and

θ_{2}

are known constants.

Define

θ_{0} = (θ_{1} + θ_{2}) / 2

,

γ_{0} = (θ_{2} - θ_{1}) / 2

, then (2) is equivalent to

H_{0}^{*} : | θ - θ_{0} | ⩽ γ_{0} \leftrightarrow H_{1}^{*} : | θ - θ_{0} | > γ_{0}

(3)

For hypothesis test (3), let

i = 0, 1 .

taking 0–1 weighted square loss function in the following

\begin{matrix} L_{i} (θ, d_{i}) = (1 - i) a [{(θ - θ_{0})}^{2} - γ_{0}^{2}] I_{[| θ - θ_{0} | > γ_{0}]} + i a [γ_{0}^{2} - {(θ - θ_{0})}^{2}] I_{[| θ - θ_{0} | ⩽ γ_{0}]}, \end{matrix}

where

a > 0

is a constant, and

d = {d_{0}, d_{1}}

is the decision space,

d_{0}

indicates accepting

H_{0}^{*}

,

d_{1}

indicates rejecting

H_{0}^{*}

.

When

i = 0

, then we obtain

L_{0} (θ, d_{0}) = a [{(θ - θ_{0})}^{2} - γ_{0}^{2}] I_{[|θ - θ_{0}| γ_{0}]}

; when

i = 1

then we have

L_{1} (θ, d_{1}) = a [γ_{0}^{2} - {(θ - θ_{0})}^{2}] I_{[[θ - θ_{0} ∣ \leq γ_{0}]}

.

Let the parameter

θ

be distributed according to an unknown prior

G (θ)

, and assume that

G (θ)

belongs to the following class of distributions

ϑ = {G : G is a prior on Ω such that sup_{x} |f_{X}^{(m)} (x)| \leq B},

(4)

where

f_{x}^{m} (x)

denotes the m order derivative of

f_{X} (x) = \int_{Θ}^{} f_{X | θ} (x) d G (θ)

, which is the marginal density of X, and

m ⩾ 2

is an integer,

B > 0

is a constant.

We define the randomized decision rule for hypothesis test (3) as follows

δ (y) = P (a c c e p t i n g H_{0}^{*} | Y = y) .

(5)

Then, the Bayes risk of

δ (y)

is given by

\begin{matrix} R (δ (y), G (θ)) & = \int_{- \infty}^{\infty} \int_{Θ}^{} [L_{0} (θ, d_{0}) δ (y) + L_{1} (θ, d_{1}) (1 - δ (y)] f_{Y | θ} (y) d G (θ) d y \\ = a \int_{- \infty}^{\infty} β_{G} (y) δ (y) d y + C_{G}, \end{matrix}

(6)

where

C_{G} = \int_{Θ}^{} L_{1} (θ, d_{1}) d G (θ), β_{G} (y) = \int_{Θ}^{} [{(θ - θ_{0})}^{2} - γ_{0}^{2}] f_{Y | θ} (y) d G (θ)

(7)

with

f_{Y} (y) = \int_{Θ}^{} f_{Y | θ} (y) d G (θ) .

(8)

and

f_{Y | θ} (y)

denotes the density of Y given

θ

, i.e.,

f_{Y | θ} (y) = \int_{}^{} f_{X | θ} (y - x) d F_{ε} (x)

.

Let

P_{X} (x) = \int_{Θ}^{} e^{- θ x - \frac{1}{2} μ x^{2}} d G (θ)

, and

P_{X}^{(1)} (x) = - \int_{Θ}^{} (μ x + θ) e^{- θ x - \frac{1}{2} μ x^{2}} d G (θ) = - f_{X} (x),

thus, we have

\int_{x}^{\infty} f_{X} (x) d x = P_{X} (x) .

From (7), we obtain

\begin{matrix} β_{G} (y) & = \int_{- \infty}^{\infty} f_{X}^{(2)} (y - x) d F_{ε} (x) + \int_{- \infty}^{\infty} Q (y - x) f_{X}^{(1)} (y - x) d F_{ε} (x) \\ + \int_{- \infty}^{\infty} ϕ (y - x) f_{X} (y - x) d F_{ε} (x) - μ \int_{- \infty}^{\infty} Q (y - x) p_{X} (y - x) d F_{ε} (x), \end{matrix}

(9)

where

f_{X} (x) = \int_{θ} f_{X ∣ θ} (x) d G (θ)

is the marginal PDF of random variable X,

f_{X}^{(1)} (x)

and

f_{X}^{(2)} (x)

denote the first order and the second order derivative of

f_{X} (x)

, respectively.

In (9), let

\begin{matrix} Q (y - x) & = 2 u (y - x) + 2 θ_{0} a n d \\ ϕ (y - x) & = μ^{2} {(y - x)}^{2} + 2 μ θ_{0} (y - x) + 3 μ + θ_{0}^{2} - γ_{0}^{2} . \end{matrix}

So from (9), we define the best Bayes decision minimizing

R (δ (y), G (θ))

as follows

δ_{G} (y) = \{\begin{matrix} 1, & if β_{G} (y) \leq 0, \\ 0, & elsewhere . \end{matrix}

(10)

A test is called a Bayes test with respect to

G (θ)

if

R (δ_{G}, G) = inf_{δ^{'}} R (δ^{'}, G) = a \int_{- \infty}^{\infty} β_{G} (y) δ_{G} (y) d y + C_{G} .

(11)

Since

G (θ)

is unknown in this paper,

δ_{G} (y)

is unavailable to use, so this leads us to use the empirical Bayes approach in the following.

The rest of this article is organized as follows. In Section 2, the two-sided EBT rule for GED with contaminated data is proposed; Section 3 is devoted to obtaining asymptotic properties and the uniform convergence rate of two-sided EBT rule; the main results of two-sided EBT are proved in Section 4; Section 5 investigated one-sided EBT rule for GED with contaminated data; an example study is presented in Section 6.

2. The Proposed Two-Sided EBT Rule of GED with Contaminated Data

It is well-known that we usually make the following assumptions in the empirical Bayes framework, let

(Y_{1}, θ_{1})

,

(Y_{2}, θ_{2})

, ⋯,

(Y_{n}, θ_{n})

, and

(Y, θ)

be independent pair of random variables, the parameters

θ_{i}

(

1 \leq i \leq n

) and

θ

have a common prior distribution

G (θ)

;

Y_{i}

(

1 \leq i \leq n

) and Y are distributed according to the same marginal distribution

F_{Y}

with density function

f_{Y} (y) = \int_{Θ} f_{Y ∣ θ} (y) d G (θ)

,

Y_{1}

, ⋯,

Y_{n}

denotes historical samples and Y is called the present sample.

Deconvolution is a very important problem. It is often encountered when modeling unobservable data or to estimate conditional moments useful in likelihood calculations. When dealing with non-parametric estimation of priors or in measurement error models, the sample data are noisy because of the measurement error; deconvolution kernel method is adopted to eliminate the effect of the additive noise kernel density estimation. In order to obtain the empirical Bayes decision, we employ the deconvolution kernel method in the following by Fan [17,18,19].

Let

φ_{Y} (t)

and

φ_{ε} (t)

be the characteristic function (c.f.) of Y and

ε

, respectively. Note that

f_{X} (x) = \frac{1}{2 π} \int_{- \infty}^{\infty} exp (- i t x) \frac{φ_{Y} (t)}{φ_{ε} (t)} d t

(12)

Thus, a deconvoluted kernel density estimation of

f_{X}^{(r)} (x) (r = 0, 1, 2)

is defined by

f_{n}^{(r)} (x) = \frac{1}{2 π} \int_{- \infty}^{\infty} exp (- i t x) {(- i t)}^{r} φ_{K} (t h_{n}) \frac{φ_{n} (t)}{φ_{ε} (t)} d t,

(13)

where

0 < h_{n} \to 0

as

n \to \infty

and

φ_{n} (t) = \frac{1}{n} \sum_{j = 1}^{n} exp (i t Y_{j})

is called the empirical c.f. of random variable Y. Note that

f_{X}^{(0)} (x) = f_{X} (x)

and

f_{n}^{(0)} (x) = f_{n} (x)

.

We can also rewrite (13) as kernel type of estimation as follows

f_{n}^{(r)} (x) = \frac{1}{n h_{n}^{1 + r}} \sum_{j = 1}^{n} K_{n r} (\frac{x - Y_{j}}{h_{n}}), r = 0, 1, 2 .

(14)

where

K_{n r} (x) = \frac{1}{2 π} \int_{- \infty}^{\infty} exp (- i t x) {(- i t)}^{r} \frac{φ_{K} (t)}{φ_{ε} (t / h_{n})} d t

We define an estimator of the

p_{X} (x)

of the random variable X by

p_{n} (x) = \int_{- M_{n}}^{x} f_{n} (t) d t,

(15)

where

f_{n} (x)

is the kernel density estimator given by (13), and

M_{n} (\to \infty)

is a sequence of constants.

Hence, we define an estimator of the

β_{G} (y)

as follows

\begin{matrix} β_{n} (y) = & \int_{- \infty}^{\infty} f_{n}^{(2)} (y - x) d F_{ε} (x) + \int_{- \infty}^{\infty} Q (y - x) f_{n}^{(1)} (y - x) d F_{ε} (x) \\ + \int_{- \infty}^{\infty} ϕ (y - x) f_{n} (y - x) d F_{ε} (x) - μ \int_{- \infty}^{\infty} Q (y - x) p_{n} (y - x) d F_{ε} (x) . \end{matrix}

(16)

Furthermore, an empirical Byes test rule is defined as

δ_{n} (y) = \{\begin{matrix} 1, & if β_{n} (y) \leq 0 \\ 0, & elsewhere . \end{matrix}

(17)

In the following, let E be the expectation with respect to the joint distribution of

(Y_{1}, Y_{2}, \dots, Y_{n})

. Then, the overall Bayes risk of

δ_{n} (y)

would be

R (δ_{n}, G) = a \int_{- \infty}^{\infty} β_{G} (y) E [δ_{n} (y)] d y + C_{G} .

(18)

By the definition, for any

G (θ) \in ϑ

, if

sup_{G \in ϑ} (R (δ_{n}, G) - R (δ_{G}, G)) = O (n^{- q})

, where

q > 0

, then

δ_{n} (y)

is called asymptotically optimal uniformly with uniform convergence rate

O (n^{- q})

.

3. Asymptotic Properties of Two-Sided EBT Rule

In this section, asymptotic properties of

R (δ_{n}, G) - R (δ_{G}, G)

be investigated, some assumptions on the kernel function

K (x)

and the error variable

ε

are given in the following.

(A1): The $K (x)$ is a symmetric function about zero on $(- \infty, + \infty)$ and satisfies
$\int_{- \infty}^{\infty} K (x) d x = 1$ , $\int_{- \infty}^{\infty} x^{j} K (x) d x = 0$ for $j = 1, \dots, (r - 1)$ and $\int_{- \infty}^{\infty} x^{r} K (x) d x \neq 0$ for some integer $r > 0$ ,
(A2): $φ_{K} (t)$ is a symmetric function, having $m + 2$ bounded integral derivatives on $(- \infty, + \infty)$ ,
(A3): $φ_{K} (t) = 1 + O ({| t |}^{m}) as t \to 0$ ,
(A4): The characteristic function of $ε$ satisfies $φ_{ε} (t) \neq 0$ for any t,
(A5): $\int_{Θ} \int_{- \infty}^{\infty} θ^{2} f_{X ∣ θ} (y - x) d F_{ε} (x) d G (θ) < \infty$ uniformly in y, and
(A6): for some $0 < λ < 1$ , $\int_{- \infty}^{\infty} {|β_{G} (y)|}^{1 - λ} d y < \infty$ and
$\int_{- \infty}^{\infty} {|β_{G} (y)|}^{1 - λ} {[\int_{- \infty}^{\infty} | Q (y - x) | d F_{ε} (x)]}^{λ} d y < \infty$ and
$\int_{- \infty}^{\infty} {|β_{G} (y)|}^{1 - λ} {[\int_{- \infty}^{\infty} | ϕ (y - x) | d F_{ε} (x)]}^{λ} d y < \infty$ , where $β_{G} (y)$ is given by (9).

Next, theorem below about the two-sided EBT establish the rates of convergence of the regret

R (δ_{n}, G) - R (δ_{G}, G)

, where

R (δ_{G}, G)

and

R (δ_{n}, G)

are given by (11) and (18), respectively.

Theorem 1.

For some integer

m \geq 2

and constants

0 < λ < 1

, ϑ is defined by (4). Suppose that

K (x)

and

F_{ε} (x)

are such that (A1)–(A6) hold, and the following conditions are satisfied:

(B1): $φ_{K} (t) = 0$ for $| t | \geq 1$ ,
(B2): $|φ_{s} (t)| {| t |}^{- β_{0}} exp ({| t |}^{- β} / γ) \geq γ_{0}$ as $| t | \to \infty$ for some positive constants $β$ , $γ$ , $γ_{0}$ and a constant $β_{0}$ .

Then, by the choosing the bandwidth

h_{n} = {(4 / γ)}^{1 / β} {(log n)}^{- 1 / β}

, we obtain

sup_{G \in ϑ} (R (δ_{n}, G) - R (δ_{G}, G)) = O ({(log n)}^{- λ (m - 2) / β}) .

(19)

Remark 1.

If its characteristic function

φ_{ε} (t)

satisfies condition (B2) of Theorem 1, then the distribution of a random variable ε is called supersmooth of order β. The common examples of supersmooth distributions are normal, Cauchy, mixture normal, etc. In practice, the conditions of Theorem 1 are easy to verify. It can be seen from the result of the Theorem 1 that the rate of convergence of EBT is very slow for very common error distributions, such as normal. Fan [17,18] pointed out the supersmooth error distribution will result in a worse convergence rate than of the smooth distribution.

It appears that the optimal rate of convergence for Gaussian deconvolution is extremely slow. Since the normal distribution is frequently used in applications, we need to study how to large a noise level is acceptable. Thus, considering the following model, let us assume that the data

Y_{1}, \dots, Y_{n}

are independent identical distribution samples from

Y = X + ε,

(20)

where

ε = σ_{0} \tilde{ε}

,

σ_{0}

parameterizes the noise level.

Theorem 2.

For some integer

m \geq 2

and constants

0 < λ < 1

, ϑ is defined by (4). Suppose that

K (x)

and

F_{ε} (x)

are such that (A1)–(A6) hold with

ε = σ_{0} \tilde{ε}

. Then, let

σ_{0} = O (n^{- 1 / (2 m + 1)})

and by choosing the bandwidth

h_{n} = O (n^{- 1 / (2 m + 1)})

, we have

sup_{G \in ϑ} (R_{n} - R_{G}) = O (n^{- λ (m - 2) / (2 m + 1)}) .

Remark 2.

Although all the data are contaminated with supersmooth errors, the results of Theorem 2 can also be as good as that of the uncontaminated data case. Suppose that all the data are contaminated with supersmooth errors, while the error level can be controlled, namely,

φ_{ε} (t) = φ_{\tilde{ε}} (σ_{0} t)

. Fan [19] had been considered model (20). Theorem 2 indicates that the convergence rate is also very slow. The result of the following Lemma 3 is as good as ordinary smooth errors distribution but the result of the following Lemma 4 cause to the worse convergence rate of empirical Bayes estimator.

4. Proofs

In this section, first we need some lemmas to prove the main results of this paper. Lemmas 1 and 2 are due to Fan [17,18], Lemmas 3, and 4 are due to Fan [19]. The proof of Lemma 4 can be found in Johns and Van Ryzin [10]. Theorems 1 and 2 shall be proved, since the proofs of Theorems 1 and 2 are similar, only Theorem 1 is proved in detail. In the following,

c, c_{1}, c_{2}, \dots

always stand for some positive constants and may be different even with the same notations.

Lemma 1.

Let

f_{n}^{(r)} (x)

be given by (14), under the assumptions (A1)–(A4) and the conditions (B1)–(B2) of Theorem 1 are satisfied, by the choosing the bandwidth

h_{n} = {(4 / γ)}^{1 / β} {(log n)}^{- 1 / β}

, we have

sup_{x} sup_{G \in ϑ} E {(f_{n}^{(r)} (x) - f_{X}^{(r)} (x))}^{2} \leq c {(log n)}^{- 2 (m - r) / β},

(21)

where

f_{X}^{(r)} (x)

denotes the r order derivative of

f_{X} (x)

and ϑ is given by (4).

Lemma 2.

Let

p_{n} (x)

be given by (15), suppose that

φ_{K} (t)

is a symmetric function, having

m + 3

bounded integrable derivatives on

(- \infty, \infty)

, and satisfying

φ_{K} (t) = 1 + O ({| t |}^{m + 1})

as

t \to 0

. Under the assumption (A4) and the conditions (B1)–(B2) of Theorem 1 hold, with the choice

h_{n} = {(4 / γ)}^{- 1 / β} {(log n)}^{- 1 / β}

of the bandwidth and

M_{n} = n^{1 / 3}

, we have

sup_{P_{X} \in Ω^{*}} sup_{G \in ϑ} E {(p_{n} (x) - p_{X} (x))}^{2} \leq c {(log n)}^{- 2 (m + 1) / β}

(22)

where ϑ is given by (4), and

Ω^{*} = \{p : p_{X}^{'} (x) \in ϑ, p (- n) + 1 - p (n) = o ({(log n)}^{- (m + 1) / β})\}

.

Lemma 3.

Let

f_{X}^{(r)} (x)

be given by (14). If the assumptions of (A1)–(A4) hold, let

σ_{0} = O (n^{- 1 / (2 m + 1)})

, then by choosing the bandwidth

h_{n} = O (n^{- 1 / (2 m + 1)})

, we have

sup_{x \in Ω} sup_{G \in ϑ} E_{n} {(f_{n}^{(r)} (x) - f_{X}^{(r)} (x))}^{2} \leq c (n^{- 2 (m - r) / (2 m + 1)}),

(23)

where

f_{X}^{(r)} (x)

denote the r order derivative of

f_{X} (x)

and ϑ is given by (4).

Lemma 4.

Let

p_{n} (x)

is given by (15), suppose that

ϕ_{K}^{″} (\cdot)

and

ϕ_{ε}^{″} (\cdot)

are bounded, respectively. Let

K (x)

satisfy (A1)–(A4) with

m = 2

, and

σ_{0} = O (n^{- 1 / 5})

, then we have

sup_{x \in Ω} sup_{G \in ϑ} E_{n} {(p_{n} (x) - p_{X} (x))}^{2} \leq c n^{- 1} .

(24)

Lemma 5.

Let

R (δ_{G}, G)

and

R (δ_{n}, G)

be defined by (11) and (18), respectively, then

0 \leq R (δ_{n}, G) - R (δ_{G}, G) \leq a \int_{- \infty}^{\infty} |β_{G} (y)| p (|β_{n} (y) - β_{G} (y)| \geq |β_{G} (y)|) d y,

where

β_{G} (y)

and

β_{n} (y)

are given by (9) and (16), respectively.

Proof of Theorem 1.

By Lemma 5 and by the Markov inequality, for any

0 < λ < 1

,

\begin{matrix} 0 \leq R (δ_{n}, G) - R (δ_{G}, G) \leq a \int_{- \infty}^{\infty} |β_{G} (y)| p (|β_{n} (y) - β_{G} (y)| \geq |β_{G} (y)|) d y \\ \leq a \int_{- \infty}^{\infty} {|β_{G} (y)|}^{1 - λ} E {|β_{n} (y) - β_{G} (y)|}^{λ} d y . \end{matrix}

(25)

By applying the

C_{r}

-inequality followed by Lyapunov’s inequality and using Fubini’s Theorem, we obtain

\begin{matrix} E {|β_{n} (y) - β_{G} (y)|}^{λ} & \leq c_{1} \{E {|\int_{- \infty}^{\infty} (f_{n}^{(2)} (y - x) - f_{G}^{(2)} (y - x)) d F_{ε} (x)|}^{λ} \\ + E {|\int_{- \infty}^{\infty} Q (y - x) (f_{n}^{(1)} (y - x) - f_{G}^{(1)} (y - x)) d F_{ε} (x)|}^{λ} \\ + E {|\int_{- \infty}^{\infty} ϕ (y - x) (f_{n} (y - x) - f_{G} (y - x)) d F_{ε} (x)|}^{λ} \\ - μ E {|\int_{- \infty}^{\infty} Q (y - x) (p_{n} (y - x) - p_{G} (y - x)) d F_{ε} (x)|}^{λ}\} \\ \leq c_{1} {[\int_{- \infty}^{\infty} E |f_{n}^{(2)} (y - x) - f_{G}^{(2)} (y - x)| d F_{ε} (x)]}^{λ} \\ + c_{1} {[\int_{- \infty}^{\infty} | Q (y - x) | E |f_{n}^{(1)} (y - x) - f_{G}^{(1)} (y - x)| d F_{ε} (x)]}^{λ} \\ + c_{1} {[\int_{- \infty}^{\infty} | ϕ (y - x) | E |f_{n} (y - x) - f_{G} (y - x)| d F_{ε} (x)]}^{λ} \\ + c_{2} {[\int_{- \infty}^{\infty} | Q (y - x) | E |p_{n} (y - x) - p_{G} (y - x)| d F_{ε} (x)]}^{λ} . \end{matrix}

(26)

Furthermore, by (25) and (26), we obtain

\begin{matrix} sup_{G \in ϑ} (R (δ_{n}, G) - R (δ_{G}, G)) \leq \int_{- \infty}^{\infty} {|β_{G} (y)|}^{1 - λ} sup_{G \in ϑ} E {|β_{n} (y) - β_{G} (y)|}^{λ} d y \\ \leq c_{1} \int_{- \infty}^{\infty} {|β_{G} (y)|}^{1 - λ} \times {[\int_{- \infty}^{\infty} sup_{G \in ϑ} E |f_{n}^{(2)} (y - x) - f_{G}^{(2)} (y - x)| d F_{ε} (x)]}^{λ} d y \\ + c_{1} \int_{- \infty}^{\infty} {|β_{G} (y)|}^{1 - λ} \times {[\int_{- \infty}^{\infty} | Q (y - x) | sup_{G \in ϑ} E |f_{n}^{(1)} (y - x) - f_{G}^{(1)} (y - x)| d F_{ε} (x)]}^{λ} d y \\ + c_{1} \int_{- \infty}^{\infty} {|β_{G} (y)|}^{1 - λ} \times {[\int_{- \infty}^{\infty} | ϕ (y - x) | sup_{G \in ϑ} E |f_{n} (y - x) - f_{G} (y - x)| d F_{ε} (x)]}^{λ} d y \\ + c_{2} \int_{- \infty}^{\infty} {|β_{G} (y)|}^{1 - λ} \times {[\int_{- \infty}^{\infty} | Q (y - x) | sup_{G \in ϑ} E |p_{n} (y - x) - p_{G} (y - x)| d F_{ε} (x)]}^{λ} d y \\ = A_{n} + B_{n} + C_{n} + D_{n} \end{matrix}

(27)

From Lemmas 1 and 2, by the assumption conditions of Theorem 1, we have

\begin{matrix} A_{n} & \leq c_{3} {(log n)}^{- λ (m - 2) / β} \int_{- \infty}^{\infty} {|β_{G} (y)|}^{1 - λ} d y \leq c {(log n)}^{- λ (m - 2) / β}, \\ B_{n} & \leq c_{4} {(log n)}^{- λ (m - 1) / β} \int_{- \infty}^{\infty} {|β_{G} (y)|}^{1 - λ} {[\int_{- \infty}^{\infty} | Q (y - x) | d F_{ε} (x)]}^{λ} d y \end{matrix}

(28)

\begin{matrix} \leq c {(log n)}^{- λ (m - 1) / β}, \end{matrix}

(29)

\begin{matrix} C_{n} & \leq c_{5} {(log n)}^{- λ m / β} \int_{- \infty}^{\infty} {|β_{G} (y)|}^{1 - λ} {[\int_{- \infty}^{\infty} | ϕ (y - x) | d F_{ε} (x)]}^{λ} d y \leq c {(log n)}^{- λ m / β}, \\ D_{n} & \leq c_{6} {(log n)}^{- λ (m + 1) / β} \int_{- \infty}^{\infty} {|β_{G} (y)|}^{1 - λ} {[\int_{- \infty}^{\infty} | Q (y - x) | d F_{ε} (x)]}^{λ} d y \end{matrix}

(30)

\begin{matrix} \leq c {(log n)}^{- λ (m + 1) / β} \end{matrix}

(31)

Substituted (27) by (28) to (31), we obtain

sup_{G \in ϑ} (R (δ_{n}, G) - R (δ_{G}, G)) = O ({(log n)}^{- λ (m - 2) / β})

So the proof of Theorem 1 was completed. □

Proof of Theorem 2.

The proof is similar to that of Theorem 1 above, except that we let Lemmas 3 and 4 in the place of Lemmas 1 and 2 in the proof of Theorem 1, respectively. □

5. One-Sided EBT Rule and Its Asymptotic Properties

In this section, we study one-sided EBT for the parameter

θ

of GED with contaminated data. Considering the problem of testing the hypotheses

H_{0}^{'} : θ \leq θ_{0}

versus

H_{1}^{'} : θ > θ_{0}

, where

θ_{0}

be a known positive constant. Let linear loss function of testing the hypotheses as follows

L_{0} (θ, d_{0}^{'}) = a (θ - θ_{0}) I (θ > θ_{0}), L_{1} (θ, d_{1}^{'}) = a (θ - θ_{0}) I (θ \leq θ_{0}),

(32)

where a is a positive constant and

d = \{d_{0}^{'}, d_{1}^{'}\}

is the action space,

d_{0}^{'}

indicates accepting

H_{0}^{'}

,

d_{1}^{'}

indicates rejecting

H_{0}^{'}

,

I_{[A]}

is the indicator of the set A.

The same as above, assume that X is not directly observable and because of measurement error or the nature of environment, we can only observe

Y = X + ε

, where the error variable

ε

has a known distribution

F_{ε}

on

(- \infty, \infty)

. It is assumed that

ε

and

(X, θ)

are independent. It is assumed that the parameter

θ

is a realization of a random variable having an unknown prior distribution

G (θ)

over the natural parameter space

Θ

. Let randomized decision rule for the preceding testing problem is

δ^{*} (y) = P \{a c c e p t i n g H_{0}^{'} ∣ Y = y\}

. For one-sided test, we assume that

G (θ)

belongs to the following class of distributions

ϑ^{*} = \{G : G is a prior on Ω such that sup_{x} |f_{X}^{(m)} (x)| \leq B\},

(33)

where

f_{X}^{(m)} (x)

denotes the m order derivative of

f_{X} (x) = \int_{Θ} f_{X ∣ θ} (x) d G (θ)

, which is the marginal density of X, and

m \geq 1

is an integer,

B > 0

is a constant.

Let

R (δ^{*}, G)

denotes the Bayes risk of the test

δ^{*}

when G is the prior distribution, it can be expressed as

\begin{matrix} R (δ^{*} (y), G (θ)) & = \int_{Θ} \int_{Ω} [L_{0} (θ, d_{0}) δ^{*} (y) + L_{1} (θ, d_{1}) (1 - δ^{*} (y))] f_{Y ∣ θ} (y) d y d G (θ) \\ = a \int_{Ω} β_{G}^{*} (y) δ^{*} (y) d y + C_{G}, \end{matrix}

(34)

where

C_{G} = \int_{Θ} L_{1} (θ, d_{1}) d G (θ), β_{G}^{*} (y) = \int_{Θ} (θ - θ_{0}) f_{Y ∣ θ} (y) d G (θ) .

(35)

From (35), we obtain

\begin{matrix} β_{G}^{*} (y) = & μ \int_{- \infty}^{\infty} p_{X} (y - x) d F_{ε} (x) - \int_{- \infty}^{\infty} τ (y - x) f_{X} (y - x) d F_{ε} (x) \\ - \int_{- \infty}^{\infty} f_{X}^{(1)} (y - x) d F_{ε} (x), \end{matrix}

(36)

where

τ (y - x) = μ (y - x) + θ_{0}

.

Therefore, the Bayes test

δ_{G}^{*}

can be presented as

δ_{G}^{*} (y) = \{\begin{matrix} 1, if β_{G}^{*} (y) \leq 0 \\ 0, if β_{G}^{*} (y) > 0 \end{matrix} .

(37)

The Bayes risk of

δ_{G}^{*} (y)

is

R (δ_{G}^{*}, G) = inf_{δ^{*}} R (δ^{*}, G) = a \int_{Ω} β_{G}^{*} (y) δ_{G}^{*} (y) d y + C_{G} .

(38)

Thus, we defined the estimation of

β_{G}^{*} (y)

as

\begin{matrix} β_{n}^{*} (y) & = μ \int_{- \infty}^{\infty} p_{n} (y - x) d F_{ε} (x) - \int_{- \infty}^{\infty} τ (y - x) f_{n} (y - x) d F_{ε} (x) \\ - \int_{- \infty}^{\infty} f_{n}^{(1)} (y - x) d F_{ε} (x) . \end{matrix}

(39)

Furthermore, one-sided empirical Byes test rule is defined by

δ_{n}^{*} (y) = \{\begin{matrix} 1, & if β_{n}^{*} (y) \leq 0 \\ 0, & elsewhere . \end{matrix}

(40)

Then, the overall Bayes risks of

δ_{n}^{*} (y)

would be

R (δ_{n}^{*}, G) = a \int_{Ω} β_{n}^{*} (y) E [δ_{n}^{*} (y)] d y + C_{G} .

(41)

It is necessary state that Lemmas 1–4 still hold over a class of new prior distributions

ϑ^{*}

for one-sided EB decision problem. So by Lemmas 1–5, Theorem below establish the rates of convergence of the regret

R (δ_{n}^{*}, G) - R (δ_{G}^{*}, G)

, where

R (δ_{G}^{*}, G)

and

R (δ_{n}^{*}, G)

are given by (38) and (41), respectively. For one-sided EBT, we assume that the following conditions are satisfied:

(C1): $\int_{Θ} \int_{- \infty}^{\infty} | θ | f_{X ∣ θ} (y - x) d F_{ε} (x) d G (θ) < \infty$ uniformly in y, and
(C2): for some $0 < λ < 1$ , $\int_{- \infty}^{\infty} {|β_{G}^{*} (y)|}^{1 - λ} d y < \infty$ and

\int_{- \infty}^{\infty} {|β_{G}^{*} (y)|}^{1 - λ} {[\int_{- \infty}^{\infty} | τ (y - x) | d F_{ε} (x)]}^{λ} d y < \infty

, where

β_{G}^{*} (y)

is given by (36).

Theorem 3.

For any

0 < λ < 1

, let

ϑ^{*}

be defined by (33), suppose that

K (x)

, such that (A1)–(A4) and (B1)–(B2) of Theorem 1 hold and satisfying conditions (C1) and (C2). Then, by choosing the bandwidth

h_{n} = {(4 / γ)}^{1 / β} {(log n)}^{- 1 / β}

, we obtain

sup_{G \in g^{*}} (R (δ_{n}^{*}, G) - R (δ_{G}^{*}, G)) = O ({(log n)}^{- λ (m - 1) / β}) .

(42)

Theorem 4.

For any

0 < λ < 1

and some integer

m \geq 1

, let

ϑ^{*}

be defined by (33), suppose that

K (x)

and

F_{ε} (x)

are such that (A1)–(A4) hold with

ε = σ_{0} \tilde{ε}

, and satisfying conditions (C1) and (C2). Then, let

σ_{0} = O (n^{- 1 / (2 m + 1)})

and by choosing the bandwidth

h_{n} = O (n^{- 1 / (2 m + 1)})

, we have

sup_{G \in ϑ^{*}} (R (δ_{n}^{*}, G) - R (δ_{G}^{*}, G)) = O (n^{- λ (m - 1) / (2 m + 1)}) .

(43)

Remark 3.

For one-sided EBT, Similar to Theorem 1, the supersmooth distribution of a random variable ε is also considered to Theorem 3, its characteristic function

φ_{ε} (t)

satisfies condition (B2). Under all the data are contaminated while the error level can be controlled situation, for model (20), by Lemmas 1–5, Theorem 4, Theorem 4 obtained the rate of convergence of one-sided EBT, this result can also be as good as that of the uncontaminated data case.

Proof of Theorem 3.

By Lemma 5 and by the Markov inequality, for

0 < λ < 1

,

\begin{matrix} 0 \leq R (δ_{n}^{*}, G) - R (δ_{G}^{*}, G) \leq a \int_{- \infty}^{\infty} |β_{G}^{*} (y)| p (|β_{n}^{*} (y) - β_{G}^{*} (y)| \geq |β_{G}^{*} (y)|) d y \\ \leq a \int_{- \infty}^{\infty} {|β_{G}^{*} (y)|}^{1 - λ} E {|β_{n}^{*} (y) - β_{G}^{*} (y)|}^{λ} d y . \end{matrix}

(44)

By applying the

C_{r}

-inequality followed by Lyapunov’s inequality and using Fubini’s Theorem, we obtain

\begin{matrix} E {|β_{n}^{*} (y) - β_{G}^{*} (y)|}^{λ} \leq & c_{1} \{E {|\int_{- \infty}^{\infty} (f_{n}^{(1)} (y - x) - f_{G}^{(1)} (y - x)) d F_{ε} (x)|}^{λ} \\ + E {|\int_{- \infty}^{\infty} τ (y - x) (f_{n} (y - x) - f_{G} (y - x)) d F_{ε} (x)|}^{λ} \\ - μ E {|\int_{- \infty}^{\infty} (p_{n} (y - x) - p_{G} (y - x)) d F_{ε} (x)|}^{λ}\} \\ \leq & c_{1} {[\int_{- \infty}^{\infty} E |f_{n}^{(1)} (y - x) - f_{G}^{(1)} (y - x)| d F_{ε} (x)]}^{λ} \\ + c_{1} {[\int_{- \infty}^{\infty} | τ (y - x) | E |f_{n} (y - x) - f_{G} (y - x)| d F_{ε} (x)]}^{λ} \\ + c_{2} {[\int_{- \infty}^{\infty} E |p_{n} (y - x) - p_{G} (y - x)| d F_{ε} (x)]}^{λ} . \end{matrix}

(45)

Furthermore, by (44) and (45), we obtain

\begin{matrix} sup_{G \in ϑ^{*}} (R (δ_{n}^{*}, G) - R (δ_{G}^{*}, G)) \leq \int_{- \infty}^{\infty} {|β_{G}^{*} (y)|}^{1 - λ} sup_{G \in ϑ^{*}} E {|β_{n}^{*} (y) - β_{G}^{*} (y)|}^{λ} d y \\ \leq & c_{1} \int_{- \infty}^{\infty} {|β_{G}^{*} (y)|}^{1 - λ} \times {[\int_{- \infty}^{\infty} sup_{G \in ϑ^{*}} E |f_{n}^{(1)} (y - x) - f_{G}^{(1)} (y - x)| d F_{ε} (x)]}^{λ} d y \\ + c_{1} \int_{- \infty}^{\infty} {|β_{G}^{*} (y)|}^{1 - λ} \times {[\int_{- \infty}^{\infty} | τ (y - x) | sup_{G \in ϑ^{*}} E |f_{n} (y - x) - f_{G} (y - x)| d F_{ε} (x)]}^{λ} d y \\ + c_{2} \int_{- \infty}^{\infty} {|β_{G}^{*} (y)|}^{1 - λ} \times {[\int_{- \infty}^{\infty} sup_{G \in ϑ^{*}} E |p_{n} (y - x) - p_{G} (y - x)| d F_{ε} (x)]}^{λ} d y \\ = A_{n} + B_{n} + C_{n} . \end{matrix}

(46)

From Lemmas 1 and 2, by the assumption conditions of Theorem 3, we have

\begin{matrix} A_{n} \leq c_{3} {(log n)}^{- λ (m - 1) / β} \int_{- \infty}^{\infty} {|β_{G}^{*} (y)|}^{1 - λ} d y \leq c {(log n)}^{- λ (m - 1) / β}, \end{matrix}

(47)

\begin{matrix} B_{n} \leq c_{4} {(log n)}^{- λ m / β} \int_{- \infty}^{\infty} {|β_{G}^{*} (y)|}^{1 - λ} {[\int_{- \infty}^{\infty} | τ (y - x) | d F_{ε} (x)]}^{λ} d y \leq c {(log n)}^{- λ m / β}, \end{matrix}

(48)

\begin{matrix} C_{n} \leq c_{6} {(log n)}^{- λ (m + 1) / β} \int_{- \infty}^{\infty} {|β_{G}^{*} (y)|}^{1 - λ} d y \leq c {(log n)}^{- λ (m + 1) / β} . \end{matrix}

(49)

Substituted (46) by (47) to (49), we obtain

sup_{G \in ϑ^{*}} (R (δ_{n}^{*}, G) - R (δ_{G}^{*}, G)) = O ({(log n)}^{- λ (m - 1) / β}) .

So the proof of Theorem 3 was completed. □

Proof of Theorem 4.

The proof is similar to that of Theorem 3 above, except that we let Lemmas 3 and 4 in the place of Lemmas 1 and 2 in the proof of Theorem 3, respectively. □

6. An Example Study

In this section, an example study is presented to verify the GED and the prior distribution which satisfies theorems in this paper exist. Suppose that the probability density function of random variable X as follows

f (x ∣ θ) = (θ + 2 x) e^{- (θ x + x^{2})},

(50)

where

θ

is a given parameter, and the sample space is

Ω = {x ∣ x > 0}

, the parameter space is

Θ = {θ ∣ θ > 0}

. Let the prior distribution of parameter

θ

is

g (θ) = \frac{1}{Γ (r)} θ^{- (r + 1)} e^{- 1 / θ},

(51)

where r is a positive known parameter and

θ

is a positive unknown parameter. By calculating we obtain

f_{X} (x) = \int_{0}^{\infty} f (x ∣ θ) g (θ) d θ = - e^{- x^{2}} [\frac{r}{{(x + 1)}^{r + 1}} + \frac{2 x}{{(x + 1)}^{r}}] = - e^{- x^{2}} q (x),

where

q (x) = \frac{r}{{(x + 1)}^{r + 1}} + \frac{2 x}{{(x + 1)}^{r}}

.

Obviously,

f_{X}^{(m)} (x)

is existence, and

f_{X}^{(m)} (x) = \frac{- e^{- x^{2}} p (x)}{{(x + 1)}^{2^{m} (r + 1)}}

, where

p (x)

is polynomial with respect to x and

\partial (p (x)) \leq 2^{m - 1} (r + 1) - 1

. Since

lim_{x \to \infty} f_{X}^{(m)} (x) = 0

,

|f_{X}^{(m)} (x)|

is bounded on

x \in Ω

, where

m \geq 1

is an integer. Thus,

G (θ) \in ϑ

and

G (θ) \in ϑ^{*}

are satisfied.

Let the supersmooth error distribution

F_{ε}

be N(0,1), it is easy to check that

φ_{ε} (t)

satisfies the condition (B2) of Theorem 1. Moreover, we can take

b_{n} = \sqrt{2} {(log n)}^{- 1 / 2}

.

For the two-sided EBT case, we used the following kernel function

K (x) = \frac{6144 sin x}{π x^{5}} + \frac{18320 cos x}{π x^{6}} - \frac{3225600 sin x}{π x^{7}} + \dots - \frac{3360 \times 13!}{π x^{16}} cos x,

(52)

where

- \infty < x < \infty

, and we choose the Fourier transform of the above kernel is

φ_{K} (t) = {(1 - t^{2})}^{4} I_{[| t | \leq 1]} .

Then, the deconvolution kernel density estimators (14) are the following kernels, for the type of supersmooth error distribution case,

K_{n l} (x) = \frac{1}{2 π} \int_{- \infty}^{\infty} (cos t x - i sin t x) {(i t)}^{l} {(1 - t^{4})}^{4} exp (\frac{t^{2}}{2 h_{n}^{2}}) d t

(53)

Similar to [20], it is easily shown that assumptions and conditions of Theorems 1 and 2 are satisfied with the above specifications.

For one-sided EBT case, we choose

φ_{K} (t) = {(1 - t^{2})}^{3} I_{[| t | \leq 1]}

, then the Fourier transform of

φ_{K} (t)

is a second order kernel as follows

K (x) = \frac{48 cos x}{π x^{4}} (1 - \frac{15}{x^{2}}) - \frac{144 sin x}{π x^{5}} (2 - \frac{15}{x^{2}}), - \infty < x < \infty .

(54)

The corresponding deconvolution kernel density estimators (14) are kernel in the following

K_{n l} (x) = \frac{{(- 1)}^{l}}{π} \int_{0}^{1} t^{l} {(cos t x)}^{1 - l} {(sin t x)}^{l} {(1 - t^{2})}^{3} exp (\frac{t^{2}}{2 h_{n}^{2}}) d t, l = 0, 1 .

(55)

Then, similar to literature [20], it is easily shown that assumptions and conditions of Theorem 3 and 4 are satisfied with the above specifications.

Actually, we can take

φ_{K} (t) = {(1 - t^{2})}^{k} I_{[| t | ⩽ 1]}

, when

k \geq 4

, at the same time it may suit for one-sided and two-sided EBT. However, if

k = 3

, the second order kernel (53) only satisfies kernel conditions of Theorems 3 and 4.

7. Conclusions

In this paper, we had studied the empirical Bayes decision for the parameter of a generalized exponential distribution with contaminated data, two-sided and one-sided empirical Bayes test rules were constructed by a deconvolution kernel method, respectively. For the type of the supersmooth error distributions the asymptotically optimal uniformly over a class of prior distributions and uniform rates of convergence of the corresponding regret for the proposed EBT rules are obtained under the conditions of Theorems 1 and 3. Furthermore, we also investigated the supersmooth errors with the error level can be controlled case,

Y = X + ε

, where

ε = σ_{0} \tilde{ε}

,

σ_{0}

parameterizes the noise level, that is,

φ_{ε} (t) = φ_{\tilde{ε}} (σ_{0} t)

, and obtained Theorems 2 and 4. As an example, let the supersmooth error distribution

F_{ε}

be N(0,1), we proved the assumptions and conditions of the main results of this paper are satisfied easily by calculating.

In many practical problems, not all the observations are contaminated, but there may be a partially contaminated case. Suppose that only

100 p % (0 < p < 1)

of the data are measured with error and the remaining data are error free. We consider the mode

Y = X + ε

, taking

P (ε = 0) = 1 - p

and

P (ε = ε^{*}) = p

, where

ε^{*}

is an error variable with distribution

F_{ε^{*}}

and the characteristic function

φ_{ε^{*}}

. Thus, the characteristic function of

ε

is denoted by

φ_{ε} (t) = (1 - p) + p φ_{ε^{*}} (t)

. In this regard, we can consider extending the current research work to this situation, which is believed to be a very interesting topic.

Author Contributions

Conceptualization, J.C.; Data curation, H.Q.; Formal analysis, H.Q.; Funding acquisition, J.C.; Investigation, H.Q.; Methodology, H.Q.; Project administration, Z.Y. and Y.H.; Supervision, J.C.; visualization, Z.Y.; Validation, Y.H.; Writing—original draft, H.Q.; Writing—review and editing, H.Q. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the National Natural Science Foundation of China under Grant 81671633 to J. Chen.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All data in this paper have been presented in the manuscript.

Acknowledgments

Many thanks to reviewers for their positive feedback, valuable comments and constructive suggestions that helped improve the quality of this article. Many thanks to editors’ great help and coordination for the publish of this paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

Karunamuni, R.; Li, J.; Wu, J. Robust empirical Bayes tests for continuous distributions. J. Stat. Plan. Inference 2010, 140, 268–282. [Google Scholar] [CrossRef]
Chen, L.S.; Yang, M.C. Empirical Bayes testing for equivalence. J. Stat. Plan. Inference 2011, 141, 2670–2681. [Google Scholar] [CrossRef]
Yuan, M.; Wei, L. Two-sided empirical Bayes test for location parameter in the gamma distribution. Commun. Stat.-Theory Methods 2017, 46, 4215–4225. [Google Scholar] [CrossRef]
Yuan, M.; Zhang, Q.; Wei, L.S. One-sided empirical Bayes test for location parameter in Gamma distribution. Appl. Math. J. Chin. Univ. 2018, 33, 287–297. [Google Scholar] [CrossRef]
Chen, L.S. Empirical Bayes testing for guarantee lifetime: Non identical components case. Commun. Stat.-Theory Methods 2017, 46, 683–705. [Google Scholar] [CrossRef]
Petrone, S.; Rousseau, J.; Scricciolo, C. Bayes and empirical Bayes: Do they merge? Biometrika 2014, 101, 285–302. [Google Scholar] [CrossRef]
Tansey, W.; Wang, Y.; Rabadan, R.; Blei, D. Double empirical bayes testing. Int. Stat. Rev. 2020, 88, S91–S113. [Google Scholar] [CrossRef]
Castillo, I.; Roquain, É. On spike and slab empirical Bayes multiple testing. Ann. Statist. 2020, 48, 2548–2574. [Google Scholar] [CrossRef]
Abraham, K.; Castillo, I.; Roquain, É. Empirical Bayes cumulative ℓ-value multiple testing procedure for sparse sequences. Electron. J. Stat. 2022, 16, 2033–2081. [Google Scholar]
Johns, M., Jr.; Van Ryzin, J. Convergence rates for empirical Bayes two-action problems II. Continuous case. Ann. Math. Stat. 1972, 43, 934–947. [Google Scholar] [CrossRef]
Singh, R.S.; Laisheng, W. Nonparametric empirical bayes procedures, asymptotic optimality And rates Of convergence For two-tail tests In exponential family. J. Nonparametric Stat. 2000, 475–501. [Google Scholar] [CrossRef]
Pensky, M. Rates of convergence of empirical Bayes tests for a normal mean. J. Stat. Plan. Inference 2003, 111, 181–196. [Google Scholar] [CrossRef]
Liang, T.C. On optimal convergence rate of empirical Bayes tests. Stat. Probab. Lett. 2004, 68, 189–198. [Google Scholar] [CrossRef]
Gupta, S.S.; Li, J. On empirical Bayes procedures for selecting good populations in a positive exponential family. J. Stat. Plan. Inference 2005, 129, 3–18. [Google Scholar] [CrossRef]
Carroll, R.J.; Hall, P. Optimal rates of convergence for deconvolving a density. J. Am. Stat. Assoc. 1988, 83, 1184–1186. [Google Scholar] [CrossRef]
Stefanski, L.A. Rates of convergence of some estimators in a class of deconvolution problems. Stat. Probab. Lett. 1990, 9, 229–235. [Google Scholar] [CrossRef]
Fan, J. On the optimal rates of convergence for nonparametric deconvolution problems. Ann. Stat. 1991, 19, 1257–1272. [Google Scholar] [CrossRef]
Fan, J. Global behavior of deconvolution kernel estimates. Stat. Sin. 1991, 1, 541–551. [Google Scholar]
Fan, J. Deconvolution with supersmooth distributions. Can. J. Stat. 1992, 20, 155–169. [Google Scholar] [CrossRef]
Karunamuni, R.J.; Zhang, S. Empirical Bayes two-action problem for the continuous one-parameter exponential family with errors in variables. J. Stat. Plan. Inference 2003, 113, 437–449. [Google Scholar] [CrossRef]
Davies, K.F.; Volterman, W. Progressively Type-II censored competing risks data from the linear exponential distribution. Commun. Stat.-Theory Methods 2022, 51, 1444–1460. [Google Scholar] [CrossRef]
Ahmad, A.E.B.A. Single and product moments of generalized order statistics from linear exponential distribution. Commun. Stat.-Theory Methods 2008, 37, 1162–1172. [Google Scholar] [CrossRef]
Broadbent, S. Simple mortality rates. Appl. Stat. 1958, 7, 86–95. [Google Scholar] [CrossRef]
Bain, L.J. Analysis for the linear failure-rate life-testing distribution. Technometrics 1974, 16, 551–559. [Google Scholar] [CrossRef]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Empirical Bayes Decision for a Generalized Exponential Distribution with Contaminated Data

Abstract

1. Introduction

2. The Proposed Two-Sided EBT Rule of GED with Contaminated Data

3. Asymptotic Properties of Two-Sided EBT Rule

4. Proofs

5. One-Sided EBT Rule and Its Asymptotic Properties

6. An Example Study

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics