Likelihood Ratio Testing under Measurement Errors

Broniatowski, Michel; Jurečková, Jana; Kalina, Jan

doi:10.3390/e20120966

Open AccessArticle

Likelihood Ratio Testing under Measurement Errors

by

Michel Broniatowski

^1,*,

Jana Jurečková

^2,3 and

Jan Kalina

⁴

¹

Faculté de Mathématiques, Laboratoire de Probabilité, Statistique et Modélisation, Université Pierre et Marie Curie (Sorbonne Université), 4 place Jussieu, 75252 Paris CEDEX 05, France

²

Institute of Information Theory and Automation, The Czech Academy of Sciences, Pod Vodárenskou věží 4, 182 08 Prague 8, Czech Republic

³

Faculty of Mathematics and Physics, Charles University, Sokolovská 83, 186 75 Prague 8, Czech Republic

⁴

Institute of Computer Science, The Czech Academy of Sciences, Pod Vodárenskou věží 2, 182 07 Prague 8, Czech Republic

^*

Author to whom correspondence should be addressed.

Entropy 2018, 20(12), 966; https://doi.org/10.3390/e20120966

Submission received: 13 November 2018 / Revised: 6 December 2018 / Accepted: 7 December 2018 / Published: 13 December 2018

(This article belongs to the Special Issue New Developments in Statistical Information Theory Based on Entropy and Divergence Measures)

Download Review Reports Versions Notes

Abstract

:

We consider the likelihood ratio test of a simple null hypothesis (with density

f_{0}

) against a simple alternative hypothesis (with density

g_{0}

) in the situation that observations

X_{i}

are mismeasured due to the presence of measurement errors. Thus instead of

X_{i}

for

i = 1, \dots, n,

we observe

Z_{i} = X_{i} + \sqrt{δ} V_{i}

with unobservable parameter

δ

and unobservable random variable

V_{i}

. When we ignore the presence of measurement errors and perform the original test, the probability of type I error becomes different from the nominal value, but the test is still the most powerful among all tests on the modified level. Further, we derive the minimax test of some families of misspecified hypotheses and alternatives. The test exploits the concept of pseudo-capacities elaborated by Huber and Strassen (1973) and Buja (1986). A numerical experiment illustrates the principles and performance of the novel test.

Keywords:

measurement errors; robust testing; two-sample test; misspecified hypothesis and alternative; 2-alternating capacities

1. Introduction

Measurement technologies are often affected by random errors; if the goal of the experiment is to compare two probability distributions using data, then the conclusion can be distorted if the data are affected by some measurement errors. If the data are mismeasured due to the presence of measurement errors, the statistical inference performed with them is biased and trends or associations in the data are deformed. This is common for a broad spectrum of applications e.g., in engineering, physics, biomedicine, molecular genetics, chemometrics, econometrics etc. Some observations can be even undetected, e.g., in measurements of magnetic or luminous flux in analytical chemistry when the flux intensity falls below some flux limit. Actually, we can hardly imagine real data free of measurement errors; the question is how severe the measurement errors are and what their influence on the data analysis is [1,2,3].

A variety of functional models have been proposed for handling measurement errors in statistical inference. Technicians, geologists, and other specialists are aware of this problem, and try to reduce the effect of measurement errors with various ad hoc procedures. However, this effect cannot be completely eliminated or substantially reduced unless we have some additional knowledge on the behavior of measurement errors.

There exists a rich literature on the statistical inference in the error-in-variables (EV) models as is evidenced by the monographs of Fuller [4], Carroll et al. [5], and Cheng and van Ness [6], and the references therein. The monographs [4] and [6] deal mostly with classical Gaussian set up while [5] discusses numerous inference procedure under semi-parametric set up. Nonparametric methods in EV models are considered in [7,8] and in references therein, and in [9], among others. The regression quantile theory in the area of EV models was started by He and Liang [10]. Arias [11] used an instrumental variable estimator for quantile regression, considering biases arising from unmeasured ability and measurement errors. The papers dealing with practical aspects of measurement error models include [12,13,14,15,16], among others. Recent developments in treating the effect of measurement errors on econometric models was presented in [17] or [18] The advantage of rank and signed rank procedures in the measurement errors models was discovered recently in [19,20,21,22,23,24]. The problem of interest in the present paper is to study how the measurement errors can affect the conclusion of the likelihood ratio test.

The distribution function of measurement errors is considered unknown, up to zero expectation and unit variance. When we use the the likelihood ratio test while ignoring the possible measurement errors, we can suffer a loss in both errors of the first and second kind. However, we show that under a small variance of measurement errors, the original likelihood ratio test is still most powerful, only on a slightly changed significance level.

On the other hand, we may consider the situation that

H_{0}

or

H_{1}

are classes of distributions of random variables

Z + \sqrt{δ} V .

Hence, both hypothesis and alternative are composite as families

H_{0}

and

H_{1};

if they are bounded by alternating Choquet capacities of order 2, then we can look for a minimax test based on the ratio of the capacities, and/over on the ratio of the pair of the least favorable distributions of

H_{0}

and

H_{1}

, respectively (cf. Huber and Strassen [25]).

2. Likelihood Ratio Test under Measurement Errors

Our primary goal is to test the null hypothesis

H_{0}

that independent observations

X = {(X_{1}, \dots, X_{n})}^{⊤}

come from a population with a density f against the alternative

H_{1}

that the true density is

g,

where f and g are fixed densities of our interest. For the identifiability, we shall assume that f and g are continuous and symmetric around 0. Although the alternative is the main concern of the experimenter, some measurement errors or just the nature may cause the situation that the true alternative should be considered as composite. Specifically,

X_{1}, \dots, X_{n},

can be affected by additive measurement errors, what appears in numerous fields, as illustrated in Section 1.

Hence the alternative is

H_{1, δ}

under which the observations are

Z_{i, δ} = X_{i} + \sqrt{δ} V_{i},

identically distributed with continuous density

g_{δ} .

Here, both under the hypothesis and under the alternative,

V_{i}

are independent random variables, unobservable with unknown distribution, independent of

X_{i}; i = 1, \dots, n .

The parameter

δ > 0

is also unknown, only we assume that

E V_{i} = 0

and

E V_{i}^{2} = 1,

for simplicity. The mismeasured, hence unobservable,

X_{i}

are assumed to have the density g under the alternative. Quite analogously, the mismeasured observations lead to a composite hypothesis

H_{0, δ}

under which the density of observations

Z_{i, δ} = X_{i} + \sqrt{δ} V_{i}

is

f_{δ}

while the

X_{i}

are assumed to have density f.

If we knew

f_{δ}

and

g_{δ},

we would use the Neyman-Pearson critical region

W = \{z : \sum_{i = 1}^{n} ln (\frac{g_{δ} (z_{i})}{f_{δ} (z_{i})}) \geq u\}

(1)

with u determined so that

P_{f_{δ}} \{\sum_{i = 1}^{n} ln (\frac{g_{δ} (z_{i})}{f_{δ} (z_{i})}) \geq u\} = α,

with a significance level

α .

Evidently

\begin{matrix} \int I [\sum_{i = 1}^{n} ln (\frac{g_{δ} (z_{i})}{f_{δ} (z_{i})}) \geq u] \prod_{i = 1}^{n} g_{δ} (z_{i}) d z_{i} = \int I [\sum_{i = 1}^{n} ln (\frac{g (x_{i})}{f (x_{i})}) \geq u] \prod_{i = 1}^{n} g (x_{i}) d x_{i} \\ \int I [\sum_{i = 1}^{n} ln (\frac{g_{δ} (z_{i})}{f_{δ} (z_{i})}) \geq u] \prod_{i = 1}^{n} f_{δ} (z_{i}) d z_{i} = \int I [\sum_{i = 1}^{n} ln (\frac{g (x_{i})}{f (x_{i})}) \geq u] \prod_{i = 1}^{n} f (x_{i}) d x_{i} . \end{matrix}

Indeed, notice that

\begin{matrix} E_{g_{δ}} \{I [\sum_{i = 1}^{n} ln (\frac{g_{δ} (Z_{i})}{f_{δ} (Z_{i})}) \geq u] | V_{1} = v_{1}, \dots, V_{n} = v_{n}\} \\ = E_{g} \{I [\sum_{i = 1}^{n} ln (\frac{g (X_{i})}{f (X_{i})}) \geq u] | V_{1} = v_{1}, \dots, V_{n} = v_{n}\} \forall v_{i} \in R, i = 1, \dots, n, \end{matrix}

where the expectations are considered with respect to the conditional distribution; a similar equality holds for

f_{δ} .

Combining the integration transmission in the conditional distribution, we obtain

\begin{matrix} \int I [\sum_{i = 1}^{n} ln (\frac{g_{δ} (x_{i} + \sqrt{δ} V_{i})}{f_{δ} (x_{i} + \sqrt{δ} V_{i})}) \geq u] \prod_{i = 1}^{n} f (x_{i}) d x_{i} \\ \neq \int I [\sum_{i = 1}^{n} ln (\frac{g (x_{i})}{f (x_{i})}) \geq u] \prod_{i = 1}^{n} f (x_{i}) d x_{i} = α, \end{matrix}

(2)

hence the size of the critical region W when used for testing

H_{0}

against

H_{1}

differs from

α .

Then we ask how the critical region W in (1) behaves when it is used as a test of

H_{0} .

This problem we shall try to attack with an expansion of

f_{δ}, g_{δ}

in

δ

close to zero.

Approximations of Densities

Put

f = f_{0}, g = g_{0}

the densities of X under the hypotheses and alternative, respectively. For the identifiability, we shall assume that

f_{0}

and

g_{0}

are continuous and symmetric around 0. Denote

f_{δ}

the density of

Z_{δ} = X + \sqrt{δ} V .

This means that X is affected by an additive measurement error

\sqrt{δ} V,

where V is independent of X and

E V = 0, E V^{2} = 1, E V^{4} < \infty .

Notice that if densities of X and V are strongly unimodal, then that of Z is also strongly unimodal (see [26]). Under some additional conditions on

f_{0}, g_{0},

we shall derive approximations of

f_{δ}

and

g_{δ}

for small

δ > 0 .

More precisely, we assume that both

f_{0}

and

g_{0}

have differentiable and integrable derivatives up to order 5. Then we have the following expansion of

f_{δ}

and a parallel result for

g_{δ}

:

Theorem 1.

Assume that

f_{0}

and

g_{0}

are symmetric around 0, strongly unimodal with differentiable and integrable derivatives, up to the order 5. Then, as

δ ↓ 0,

\begin{matrix} f_{δ} (z) = f_{0} (x + \sqrt{δ} V) = f_{0} (x) + \frac{δ}{2} \frac{d^{2}}{d z^{2}} f_{0} (x) + \frac{δ^{2}}{4!} \frac{d^{4}}{d z^{4}} f_{0} (x) E (V^{4}) + o (δ^{2}), \\ g_{δ} (z) = g_{0} (x + \sqrt{δ} V) = g_{0} (x) + \frac{δ}{2} \frac{d^{2}}{d z^{2}} g_{0} (x) + \frac{δ^{2}}{4!} \frac{d^{4}}{d z^{4}} g_{0} (x) E (V^{4}) + o (δ^{2}) \end{matrix}

(3)

Proof.

Let

φ (u, δ) = E {e^{i u Z}}

be the characteristic function of

Z .

Then

\begin{matrix} φ (u, δ) = E {e^{i u X}} E {e^{i u \sqrt{δ} V}} = φ (0, 0) φ_{V} (u \sqrt{δ}) \\ = φ (u, 0) [1 + \frac{1}{2} δ {(i u)}^{2} + \frac{1}{4!} δ^{2} {(i u)}^{4} E (V^{4}) + o (δ^{2})] \\ = φ (u, 0) [1 - \frac{δ}{2} u^{2} + \frac{1}{4!} δ^{2} u^{4} E (V^{4}) + o (δ^{2})], \end{matrix}

where

φ_{V}

denotes the characteristic function of V. Taking the inverse Fourier transform on both sides, we obtain (3), taking the above assumptions on V into account. □

Consider the problem of testing the hypothesis

H_{0}

that the observations are distributed according to density

f_{0}

against the alternative

H_{1}

that they are distributed according to density

g_{0} .

Parallelly, we consider the hypothesis

H_{0, δ}

that observations are distributed according the

g_{δ}

against the alternative

H_{1, δ}

that the true density is

g_{δ} .

Let

Φ (x)

be the likelihood ratio test with critical region

W = \{x : \sum_{i = 1}^{n} ln (\frac{g_{0} (x_{i})}{f_{0} (x_{i})}) > u\}

and the significance level

α,

and

Φ^{*} = Φ^{*} (z)

be the test with critical region

W^{*} = \{z : \sum_{i = 1}^{n} ln (\frac{g_{0} (z_{i}))}{f_{0} (z_{i})}) > u\}

based on observations

z_{i} = x_{i} + \sqrt{δ} V_{i}, i = 1, \dots, n .

We know neither

δ

nor

V,

hence the test

Φ^{*}

is just an application of the critical region W for contaminated data

Z_{1}, \dots, Z_{n} .

Thus, due to our lack of information, we use the test

Φ

even for testing

H_{0, δ}

against

H_{1, δ},

and the performance of this test is of interest. This is described in the following theorem:

Theorem 2. (Assume the conditions of Theorem 1).

Then, as

δ ↓ 0,

the test

Φ^{*}

is the most powerful even for testing

H_{0, δ}

against

H_{1, δ},

with a modified significance level satisfying

α_{δ} \leq α + \frac{δ}{2} | f_{0}^{'} (0) | + \frac{δ^{2}}{24} E V^{4} | f_{0}^{(3)} (0) | + O (δ) .

Proof.

\begin{matrix} E_{f_{0}} Φ^{*} (X) = \int I [ln (\frac{g_{0} (x + \sqrt{δ} V)}{f_{0} (x + \sqrt{δ} V)}) > u] f_{0} (x) d x \\ = \int I [ln (\frac{g_{0} (x + \sqrt{δ} V)}{f_{0} (x + \sqrt{δ} V)}) > u] \frac{f_{0} (x)}{f_{0} (x + \sqrt{δ} V)} f_{0} (x + \sqrt{δ} V) d x \\ = \int I [ln (\frac{g_{0} (x)}{f_{0} (x)}) > u] \frac{f_{0} (x - \sqrt{δ} V)}{f_{0} (x)} f_{0} (x) d x . \end{matrix}

If

f_{0}

is symmetric, then the derivative

f_{0}^{(k)}

is symmetric for k even and skew-symmetric for k odd,

k = 1, \dots, 4 .

Moreover, because

| f_{0}^{'} (x) |

and

| f_{0}^{(3)} (x) |

are integrable, then

{lim}_{x \to \pm \infty} | f_{0}^{'} (x) | = 0

and

{lim}_{x \to \pm \infty} | f_{0}^{(3)} (x) | = 0 .

Hence, using the expansion (3), we obtain

\begin{matrix} E_{f_{0}} Φ^{*} (X) = E_{f_{0}} Φ (X) + \int I [ln (\frac{g_{0} (x)}{f_{0} (x)}) > u] (\frac{δ}{2} f_{0}^{″} (x) + \frac{δ^{2}}{24} E V^{4} f^{(4)} (x) d x) + o (δ^{2}) \\ \leq E_{f_{0}} Φ (X) + \frac{δ}{2} | f_{0}^{'} (0) | + \frac{δ^{2}}{24} E V^{4} | f_{0}^{(3)} (0) | + o (δ^{2}) = α + O (δ) a s δ ↓ 0 . \end{matrix}

□

3. Robust Testing

If the observations are missmeasured or contaminated, we observe

Z_{δ} = Z + \sqrt{δ} V

with unknown

δ

and unobservable V instead of Z. Hence, instead of simple

f_{0}

and

g_{0},

we are led to composite hypothesis and alternative

H

and

K

. Following [25], we can try to find suitable 2-alternating capacities, dominating

H

and

K

and to construct a pertaining minimax test. As before, we assume that Z and V are independent,

E V = 0, E V^{2} = 1

, and

E V^{4} < \infty .

Moreover, we assume that

f_{0}

and

g_{0}

are symmetric, strongly unimodal and differentiable up to order 5, with derivatives integrable and increasing distribution functions

F_{0}

and

G_{0},

respectively. The measurement errors V are assumed to satisfy

1 \leq E V^{4} \leq K

(4)

with a fixed

K, 0 < K < \infty .

Hence the distribution of V is restricted to have the tails lighter than t-distribution with 4 degrees of freedom. We shall construct a pair of 2-alternating capacities around specific subfamilies of

f_{0}

and

g_{0} .

Let us determine the capacity around

g_{0}

; that for

f_{0}

is analogous. By Theorem 1 we have

g_{δ} (z) = g_{0} (z) + \frac{δ}{2} \frac{d^{2}}{d z^{2}} g_{0} (z) + \frac{δ^{2}}{4!} \frac{d^{4}}{d z^{4}} g_{0} (z) E (V^{4}) + o (δ^{2}), a s δ ↓ 0 .

We shall concentrate on the following family

K^{*}

of densities (similarly for

f_{0}

):

K^{*} = \{g_{δ, κ}^{*} : g_{δ, κ}^{*} (z) = g_{0} (z) + \frac{δ}{2} g_{0}^{″} (z) + κ \frac{δ^{2}}{24} g_{0}^{(4)} (z) | δ \leq Δ, 1 \leq κ \leq K\}

(5)

with fixed suitable

Δ, K > 0 .

Indeed, under our assumptions, each

g_{δ, κ}^{*} \in K^{*}

is a positive and symmetric density satisfying

sup_{δ \leq Δ, κ \leq K} sup_{z \in {R} |g_{δ, κ}^{*} (z) - g_{0} (z)| \leq C K Δ^{2} + o (Δ^{2})

for some

C, 0 < C < \infty .

Let

G_{δ, κ}^{*} (B), B \in B,

be the probability distribution induced by density

g_{δ, κ}^{*} \in K^{*},

with

B

being the Borel

σ

-algebra. Then the set function

w (B) = \{\begin{matrix} sup \{G^{*} (B) : G^{*} \in K^{*}\} & if & B \neq \emptyset \\ 0 & if & B = \emptyset \end{matrix}

(6)

is a pseudo-capacity in the sense of Buja [27], i.e., satisfying

(a): $w (\emptyset) = 0, w (Ω) = 1$
(b): $w (A) \leq w (B) \forall A \subset B$
(c): $w (A_{n}) ↑ w (A) \forall A_{n} ↑ A$
(d): $w (A_{n}) ↓ w (A) \forall A_{n} ↓ A \neq \emptyset$
(e): $w (A \cup B) + w (A \cap B) \leq w (A) + w (B) .$

Analogously, consider a density

f_{0},

symmetric around 0 and satisfying the assumptions of Theorem 1 as a simple hypothesis. Construct the family

H^{*}

of densities and the corresponding family of distributions

\{F_{δ, κ}^{*} (\cdot), δ \leq Δ, κ \leq K\}

similarly as above. Then the set function

v (B) = \{\begin{matrix} sup \{F^{*} (B) : F^{*} \in H^{*}\} & if & B \neq \emptyset \\ 0 & if & B = \emptyset \end{matrix}

(7)

is a pseudo-capacity in the sense of Buja [27].

Buja [27] showed that on any Polish space exists a (possibly different) topology which generates the same Borel algebra and on which every pseudo-capacity is a 2-alternating capacity in the sense of [25].

Let us now consider the problem of testing the hypothesis

H = \{F^{*} \in H^{*} | F^{*} (\cdot) \leq v (\cdot)\}

against the alternative

K = \{G^{*} \in K^{*} | G^{*} (\cdot) \leq w (\cdot)\},

based on an independent random sample

Z_{1}, \dots, Z_{n} .

Assume that

H^{*}

and

K^{*}

satisfy (5). Then, following [27] and [25], we have the main theorem providing the minimax test of

H

against

K

with significance level

α \in (0, 1) :

Theorem 3.

The test

ϕ (z_{1}, \dots, z_{n}) = \begin{matrix} 1 & if & \prod_{i = 1}^{n} π (z_{i}) > C \\ γ & if & \prod_{i = 1}^{n} π (z_{i}) = C \\ 0 & if & \prod_{i = 1}^{n} π (z_{i}) < C \end{matrix}

where

π (\cdot)

is a version of

\frac{d w}{d v} (\cdot)

and

C

and γ are chosen so that

E_{v} ϕ (Z) = α,

is a minimax test of

H

against

K

of level

α .

4. Numerical Illustration

We assume to observe independent observations

Z_{1, δ}, \dots, Z_{n, δ}

for

i = 1, \dots, n

, where

Z_{i, δ} = X_{i} + \sqrt{δ} V_{i}

as described in Section 3, where

X_{1}, \dots, X_{n}

are independent identically distributed (with a distribution function F) but unobserved. Let us further denote by

Φ

the distribution function of

N (0, 1)

and by

Φ_{σ}^{*}

the distribution function of

N (0, σ^{2})

. The primary task here is to test

H_{0} : F \equiv Φ

against

H_{1} : F (x) = (1 - λ) Φ (x) + λ Φ_{σ}^{*} (x), x \in R,

with a fixed

σ > 1

and

λ \in (0, 1)

. We perform all the computations using the R software [28].

To describe our approach to computing the test, we will need the notation for the set of pseudo-distribution functions corresponding to the set of pseudo-densities

H^{*}

denotes as

{\tilde{H}}^{*} = \{F_{δ, κ}^{*} : F_{δ, κ}^{*} (z) = Φ (z) + \frac{δ}{2} f_{0}^{'} (z) + κ \frac{δ^{2}}{24} f_{0}^{(3)} (z) | δ \leq Δ, 1 \leq κ \leq K\},

where

Φ

denotes the distribution function of

N (0, 1)

distribution. Under the alternative, the set analogous to

K^{*}

is defined as

{\tilde{K}}^{*} = \{G_{δ, κ}^{*} : G_{δ, κ}^{*} (z) = G_{0} (z) + \frac{δ}{2} g_{0}^{'} (z) + κ \frac{δ^{2}}{24} g_{0}^{(3)} (z) | 0 \leq δ \leq Δ, 1 \leq κ \leq K\} .

Our task is to approximate

v ((- \infty, z)) = sup {F_{δ, κ}^{*} (z); F_{δ, κ}^{*} \in {\tilde{H}}^{*}}, z \in R,

(8)

and

w ((- \infty, z)) = sup {G_{δ, κ}^{*} (z); G_{δ, κ}^{*} \in {\tilde{K}}^{*}}, z \in R .

(9)

Here, the functions

F_{δ, κ}^{*} (z)

and

G_{δ, κ}^{*} (z)

are evaluated over a grid with step 0.05. Then, the maximization in (8) and (9) is performed for values of z over the grid and over four boundary values of

{(δ, κ)}^{T}

, which are equal to

{(0, 0)}^{T}

,

{(0, K)}^{T}

,

{(Δ, 0)}^{T}

, and

{(Δ, K)}^{T}

. Additional computations with 10 randomly selected pairs of

{(δ, κ)}^{T}

over

δ \in [0, Δ]

and

κ \in [0, K]

revealed that the optimum is attained in one of the boundary values. Further, the Radon-Nikodym derivatives of V and W are estimated by a finite difference approximation in order to compute the test statistic.

The test rejects

H_{0}

if the test statistics

\prod_{i = 1}^{n} π (z_{i})

exceeds a critical value, which (as well as the p-value) can be approximated by a Monte Carlo simulation, i.e., by a repeated random generating random variables

X_{1}, \dots, X_{n}

under

H_{0}

, and we generate them 10,000 times here.

We perform the following particular numerical study. We compute the critical value of the

α

-test for

n = 20

(or

n = 40

),

λ = 0.25

,

σ^{2} = 3

,

Δ = 0.2

,

K = 1.1

, and

α = 0.05

. Further, we are interested in evaluating the probability of rejecting this test for data generated from

F (x) = (1 - \tilde{λ}) Φ (x) + \tilde{λ} Φ_{\tilde{σ}}^{*} (x), x \in R,

(10)

with different values of

\tilde{λ}

and

{\tilde{σ}}^{2}

. Its values are shown in Table 1 (for

n = 20

) and Table 2 (for

n = 40

), which are approximated using (again) 10,000 randomly generated variables from (10). The boldface numbers are equal to the power of the test (under the simple

H_{1}

). The proposed test seems meaningful, while its power is increased for

n = 40

compared to

n = 20

; in addition, the power increases with an increasing

\tilde{λ}

if

{\tilde{σ}}^{2}

is retained; and the power also increases with an increasing

{\tilde{σ}}^{2}

if

\tilde{λ}

is retained.

5. Conclusions

The likelihood ratio test of

f_{0}

against

g_{0}

is considered in the situation that observations

X_{i}

are mismeasured due to the presence of measurement errors. Thus instead of

X_{i}

for

i = 1, \dots, n,

we observe

Z_{i} = X_{i} + \sqrt{δ} V_{i}

with unobservable parameter

δ

and unobservable random variable

V_{i}

. When we ignore the presence of measurement errors and perform the original test, the probability of type I error becomes different from the nominal value, but the test is still the most powerful among all tests on the modified level.

Under some assumptions on

f_{0}

and

g_{0}

and for

δ < Δ, E V^{4} \leq K,

we further construct a minimax likelihood ratio test of some families of distributions of the

Z_{i} = X_{i} + \sqrt{δ} V_{i},

based on the capacities of the Huber-Strassen type. The test treats the composite null and alternative hypotheses, which cover all possible measurement errors satisfying the assumptions. The advantage of the novel test is that it keeps the probability of type I error below the desired value (

α = 0.05

) across all possible measurement errors. The test is performed in a straightforward way, while the user must specify particular (not excessively large) values of

Δ

and K. We do not consider this a limiting requirement, because parameters corresponding to the severity of measurement errors are commonly chosen in a similar way in numerous measurement error models [5,23] or robust optimization procedures [29]. The critical value of the test can be approximated by a simulation. The numerical experiment in Section 4 illustrates the principles and performance of the novel test.

Author Contributions

Methodology, M.B. and J.J.; Software, J.K.; Writing—Original Draft Preparation, M.B. and J.J.; Writing—Review & Editing, M.B., J.J. and J.K.; Funding Acquisition, M.B.

Funding

The research of Jana Jurečková was supported by the Grant 18-01137S of the Czech Science Foundation. The research of Jan Kalina was supported by the Grant 17-01251S of the Czech Science Foundation.

Acknowledgments

The authors would like to thank two anonymous referees for constructive advice.

Conflicts of Interest

The authors declare no conflict of interest.

References

Boyd, A.; Lankford, H.; Loeb, S.; Wyckoff, J. Measuring test measurement error: A general approach. J. Educ. Behav. Stat. 2013, 38, 629–663. [Google Scholar] [CrossRef]
Brakenhoff, T.B.; Mitroiu, M.; Keogh, R.H.; Moons, K.G.M.; Groenwold, R.H.H.; van Smeden, M. Measurement error is often neglected in medical literature: A systematic review. J. Clin. Epidemiol. 2018, 98, 89–97. [Google Scholar] [CrossRef] [PubMed]
Edwards, J.K.; Cole, S.R.; Westreich, D. All your data are always missing: Incorporating bias due to measurement error into the potential outcomes framework. Int. J. Epidemiol. 2015, 44, 1452–1459. [Google Scholar] [CrossRef] [PubMed]
Fuller, W.A. Measurement Error Models; John Wiley & Sons: New York, NY, USA, 1987. [Google Scholar]
Carroll, R.J.; Ruppert, D.; Stefanski, L.A.; Crainiceanu, C.M. Measurement Error in Nonlinear Models: A Modern Perspective, 2nd ed.; Chapman & Hall/CRC: Boca Raton, FL, USA, 2006. [Google Scholar]
Cheng, C.L.; van Ness, J.W. Statistical Regression with Measurement Error; Arnold: London, UK, 1999. [Google Scholar]
Carroll, R.J.; Maca, J.D.; Ruppert, D. Nonparametric regression in the presence of measurement error. Biometrika 1999, 86, 541–554. [Google Scholar] [CrossRef] [Green Version]
Carroll, R.J.; Delaigle, A.; Hall, P. Non-parametric regression estimation from data contaminated by a mixture of Berkson and classical errors. J. R. Stat. Soc. B 2007, 69, 859–878. [Google Scholar] [CrossRef]
Fan, J.; Truong, Y.K. Nonparametric regression estimation involving errors-in-variables. Ann. Stat. 1993, 21, 23–37. [Google Scholar] [CrossRef]
He, X.; Liang, H. Quantile regression estimate for a class of linear and partially linear errors-in-variables models. Stat. Sin. 2000, 10, 129–140. [Google Scholar]
Arias, O.; Hallock, K.F.; Sosa-Escudero, W. Individual heterogeneity in the returns to schooling: Instrumental variables quantile regression using twins data. Empir. Econ. 2001, 26, 7–40. [Google Scholar] [CrossRef]
Hyk, W.; Stojek, Z. Quantifying uncertainty of determination by standard additions and serial dilutions methods taking into account standard uncertainties in both axes. Anal. Chem. 2013, 85, 5933–5939. [Google Scholar] [CrossRef]
Kelly, B.C. Some aspects of measurement error in linear regression of astronomical data. Astrophys. J. 2007, 665, 1489–1506. [Google Scholar] [CrossRef]
Marques, T.A. Predicting and correcting bias caused by measurement error in line transect sampling using multiplicative error model. Biometrics 2004, 60, 757–763. [Google Scholar] [CrossRef] [PubMed]
Rocke, D.M.; Lorenzato, S. A two-component model for measurement error in analytical chemistry. Technometrics 1995, 37, 176–184. [Google Scholar] [CrossRef]
Akritas, M.G.; Bershady, M.A. Linear regression for astronomical data with measurement errors and intrinsic scatter. Astrophys. J. 1996, 470, 706–728. [Google Scholar] [CrossRef]
Hausman, J. Mismeasured variables in econometric analysis: Problems from the right and problems from the left. J. Econ. Perspect. 2001, 15, 57–67. [Google Scholar] [CrossRef]
Hyslop, D.R.; Imbens, Q.W. Bias from classical and other forms of measurement error. J. Bus. Econ. Stat. 2001, 19, 475–481. [Google Scholar] [CrossRef]
Jurečková, J.; Picek, J.; Saleh, A.K.M.E. Rank tests and regression rank scores tests in measurement error models. Comput. Stat. Data Anal. 2010, 54, 3108–3120. [Google Scholar] [CrossRef]
Jurečková, J.; Koul, H.L.; Navrátil, R.; Picek, J. Behavior of R-estimators under Measurement Errors. Bernoulli 2016, 22, 1093–1112. [Google Scholar] [CrossRef]
Navrátil, R.; Saleh, A.K.M.E. Rank tests of symmetry and R-estimation of location parameter under measurement errors. Acta Univ. Palacki. Olomuc. Fac. Rerum Nat. Math. 2011, 50, 95–102. [Google Scholar]
Navrátil, R. Rank tests and R-estimates in location model with measurement errors. In Proceedings of Workshop of the Jaroslav Hájek Center and Financial Mathematics in Practice I; Masaryk University: Brno, Czech Republic, 2012; pp. 37–44. [Google Scholar]
Saleh, A.K.M.E.; Picek, J.; Kalina, J. R-estimation of the parameters of a multiple regression model with measurement errors. Metrika 2012, 75, 311–328. [Google Scholar] [CrossRef]
Sen, P.K.; Jurečková, J.; Picek, J. Rank tests for corrupted linear models. J. Indian Stat. Assoc. 2013, 51, 201–230. [Google Scholar]
Huber, P.; Strassen, V. Minimax tests and the Neyman-Pearson lemma for capacities. Ann. Stat. 1973, 2, 251–273. [Google Scholar] [CrossRef]
Ibragimov, I.A. On the composition of unimodal distributions. Theor. Probab. Appl. 1956, 1, 255–260. [Google Scholar] [CrossRef]
Buja, A. On the Huber-Strassen theorem. Probab. Theory Relat. Fields 1986, 73, 149–152. [Google Scholar] [CrossRef]
R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2017; Available online: https://www.R-project.org/ (accessed on 15 September 2018).
Xanthopoulos, P.; Pardalos, P.M.; Trafalis, T.B. Robust Data Mining; Springer: New York, NY, USA, 2013. [Google Scholar]

Table 1. Probability of rejecting the test in the simulation with

n = 20

.

Table 1. Probability of rejecting the test in the simulation with

n = 20

.

Value of $\tilde{λ}$	Value of ${\tilde{σ}}^{2}$
Value of $\tilde{λ}$	3	4	5	6
0.25	0.39	0.52	0.61	0.67
0.35	0.50	0.67	0.75	0.81
0.45	0.61	0.76	0.85	0.89

Table 2. Probability of rejecting the test in the simulation with

n = 40

.

Table 2. Probability of rejecting the test in the simulation with

n = 40

.

Value of $\tilde{λ}$	Value of ${\tilde{σ}}^{2}$
Value of $\tilde{λ}$	3	4	5	6
0.25	0.55	0.73	0.82	0.87
0.35	0.72	0.86	0.93	0.96
0.45	0.82	0.94	0.97	0.99

© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Broniatowski, M.; Jurečková, J.; Kalina, J. Likelihood Ratio Testing under Measurement Errors. Entropy 2018, 20, 966. https://doi.org/10.3390/e20120966

AMA Style

Broniatowski M, Jurečková J, Kalina J. Likelihood Ratio Testing under Measurement Errors. Entropy. 2018; 20(12):966. https://doi.org/10.3390/e20120966

Chicago/Turabian Style

Broniatowski, Michel, Jana Jurečková, and Jan Kalina. 2018. "Likelihood Ratio Testing under Measurement Errors" Entropy 20, no. 12: 966. https://doi.org/10.3390/e20120966

APA Style

Broniatowski, M., Jurečková, J., & Kalina, J. (2018). Likelihood Ratio Testing under Measurement Errors. Entropy, 20(12), 966. https://doi.org/10.3390/e20120966

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Likelihood Ratio Testing under Measurement Errors

Abstract

1. Introduction

2. Likelihood Ratio Test under Measurement Errors

Approximations of Densities

3. Robust Testing

4. Numerical Illustration

5. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI