Regression Estimation with Errors in the Variables via the Laplace Transform

Huijun Guo; Qingqun Bai

doi:10.3390/axioms12100992

Abstract

This paper considers nonparametric regression estimation with errors in the variables. It is a standard assumption that the characteristic function of the covariate error does not vanish on the real line. This assumption is rather strong. In this paper, we assume the covariate error distribution is a convolution of uniform distributions, the characteristic function of which contains zeros on the real line. Our regression estimator is constructed via the Laplace transform. We prove its strong consistency and show its convergence rate. It turns out that zeros in the characteristic function have no effect on the convergence rate of our estimator.

Keywords:

nonparametric regression; errors in variables; Laplace transform; strong consistency; convergence rate

MSC:

62G08; 62G20

1. Introduction

This paper considers a regression model with errors in the variables. Suppose observations

(W_{1}, Y_{1}), \dots, (W_{n}, Y_{n})

are i.i.d. (independent and identically distributed) random variables generated by the model

\begin{matrix} W_{j} = X_{j} + δ_{j}, Y_{j} = m (X_{j}) + ϵ_{j}, j = 1, \dots, n . \end{matrix}

(1)

The i.i.d. random variables

δ_{j}

are independent of

X_{j}

and

Y_{j}

.

ϵ_{j}

are independent of

X_{j}

,

E ϵ_{j} = 0

and

E ϵ_{j}^{2} < + \infty

. The functions

f_{δ}

(known) and

f_{X}

(unknown) stand for the densities of

δ_{j}

and

X_{j}

, respectively. The goal is to estimate the regression function

m (x)

from the observations

(W_{1}, Y_{1}), \dots, (W_{n}, Y_{n})

. Errors-in-variables regression problems have been extensively studied in the literature, see, for example, ([1,2,3,4,5,6,7]). Regression models with errors in the variables play an important role in many areas of science and social science ([8,9,10]).

Nadaraya and Watson ([11,12]) propose a kernel regression estimator for the classical regression model

(δ_{j} = 0)

. Since the Fourier transform can transform a complex convolution to an ordinary product, it is a common method to deal with the deconvolution problem. Fan and Truong [4] generalize the Nadaraya–Watson regression estimator from the classical regression model to the regression model (1) via the Fourier transform. They study the convergence rate by assuming the integer order derivatives of

f_{X}

and m to be bounded. Compared to integer-order derivatives, it is more precise to describe the smoothness by the Hölder condition. Meister [6] shows the convergence rate under the local Hölder condition.

The above references on model (1) both assume that the characteristic function of the covariate errors

δ_{j}

does not have zeros on the real line. The assumption is rather strong. For example, if

f_{δ}

is of uniform density on [−1, 1], it vanishes at

v = k π, k = \pm 1, \pm 2, \dots

in the Fourier domain. Delaigle and Meister [1] consider the regression model (1) with a Fourier-oscillating noise, which means the Fourier transform of

f_{δ}

vanishes periodically. They show that if

f_{X}

and m are compact, then they can be estimated with the standard rate, as in the case where

f_{δ}

does not vanish in the Fourier domain. Guo and Liu ([13,14,15]) extend Delaigle and Meister [1]’s work to multivariate cases.

The compactness is the cost of eliminating the zero points effect in the Fourier domain. Belomestny and Goldenshluger [16] apply the Laplace transform to construct a deconvolution density estimator without assuming the density to be compact. They provide sufficient conditions under which the zeros of the corresponding characteristic function have no effect on the estimation accuracy. Goldenshluger and Kim [17] also construct a deconvolution density estimator via the Laplace transform; they study how zero multiplicity affects the estimation accuracy. Motivated by the above work, we apply the Laplace transform to study the regression model (1) with errors following a convolution of uniform distributions.

The organization of the paper is as follows. In Section 2, we present some knowledge about the covariate error distribution and functional classes. Section 3 introduces the kernel regression estimator via the Laplace transform. The consistency and convergence rate of our estimator are discussed in Section 4 and Section 5, respectively.

2. Preparation

This section will introduce the covariate error distribution and functional classes.

For a integrable function f, the bilateral Laplace transform [18] is defined by

\hat{f} (z) : = \int_{- \infty}^{+ \infty} f (t) e^{- z t} d t .

The Laplace transform

\hat{f} (z)

is an analytic function in the convergence region

Σ_{f}

, which is a vertical strip:

Σ_{f} : = {z \in C : σ_{f}^{-} < Re (z) < σ_{f}^{+}}, for some - \infty \leq σ_{f}^{-} < σ_{f}^{+} \leq + \infty .

The inverse Laplace transform is given by the formula

\begin{matrix} f (t) = \frac{1}{2 π i} \int_{s - i \infty}^{s + i \infty} \hat{f} (z) e^{z t} d z = \frac{1}{2 π} \int_{- \infty}^{+ \infty} \hat{f} (s + i v) e^{(s + i v) t} d v, s \in (σ_{f}^{-}, σ_{f}^{+}) . \end{matrix}

Let the covariate error distribution be a

γ

-fold convolution of the uniform distribution on

[- θ, θ], θ > 0

. This means

δ = Z_{1} + \dots + Z_{γ},

where

Z_{i} (i = 1, 2, \dots, γ)

are i.i.d and

Z_{i} \sim U (- θ, θ)

with density

f_{Z}

. Hence,

\begin{matrix} {\hat{f}}_{δ} (z) = {[{\hat{f}}_{Z} (z)]}^{γ} = {[\frac{sinh (θ z)}{θ z}]}^{γ} = \frac{{(1 - e^{2 θ z})}^{γ}}{{(- 2 θ z)}^{γ} e^{γ θ z}}, z \in C . \end{matrix}

(2)

Here,

{\hat{f}}_{δ} (z)

is the product of two functions; the function

{(1 - e^{2 θ z})}^{γ}

has zeros only on the imaginary axis, the function

\frac{1}{{(- 2 θ z)}^{γ} e^{γ θ z}}

does not have zeros for the analyticity of

{(- 2 θ z)}^{γ} e^{γ θ z}

. The zeros of

{\hat{f}}_{δ} (z)

are

z_{k} = \frac{i k π}{θ}

, where

k \in Z ∖ {0}

.

Now, we introduce some functional classes.

Definition 1.

For

A > 0

,

σ > 0

, and

β > 0

, a function

f : R \to R

is said to satisfy the local Hölder condition with smoothness parameter β if f is k times continuously differentiable and

| f^{(k)} (y) - f^{(k)} (\tilde{y}) | \leq A | y - \tilde{y} |^{β_{0}}, \forall y, \tilde{y} \in [x - σ, x + σ],

(3)

where

β = k + β_{0}

and

0 < β_{0} \leq 1

. All these functions are denoted by

H_{σ, β; x} (A)

.

If (3) holds for any

y, \tilde{y} \in R

, f satisfies the Hölder condition with smoothness parameter

β

. All these functions are denoted by

H_{β} (A)

.

Clearly, k in Definition 1 equals

\max {l \in N : l < β}

. In later discussions,

⌊ β ⌋ : = \max {l \in N : l < β}

.

Example 1.

Function

\begin{matrix} f_{1} (x) : = \{\begin{matrix} 1 - | x |, & | x | \leq 1, \\ 0, & | x | > 1 . \end{matrix} \end{matrix}

Then,

f_{1} \in H_{1} (A)

and

f_{1} \in H_{σ, 1; x} (A)

.

It is easy to see that

f \in H_{β} (A)

must be contained in

H_{σ, β; x} (A)

for each

x \in R

. However, the reverse is not necessarily true.

Example 2

([19]). Consider the function

\begin{matrix} f_{2} (x) : = \sum_{l = 0}^{\infty} (1 - 2^{l} | x - 2 l |) χ_{l} (x), \end{matrix}

where

χ_{l} (x)

is the indicator function on the interval

[2 l - 2^{- l}, 2 l + 2^{- l}]

for a non-negative integer l. Then,

f_{2} \in H_{σ, 1; x} (A)

for each

x \in R

. However,

f_{2} \notin H_{1} (A)

.

Note that (3) is a local Hölder condition around

x \in R

. When we consider the pointwise estimation, it is natural to assume the unknown function to satisfy a local smoothness condition.

Definition 2.

Let

r > 0

and

B > 0

be real numbers. We say that a function f belongs to the functional class

M_{r} (B)

if

\begin{matrix} \max {‖ f ‖_{\infty}, max_{0 < r_{1} \leq r} \int_{- \infty}^{+ \infty} | x |^{r_{1}} | f (x) | d x} \leq B . \end{matrix}

We denote

F_{σ, β, r; x} (A, B) = H_{σ, β; x} (A) \cap M_{r} (B)

.

3. Kernel Estimator

This section will construct the kernel regression estimator. Two kernels K and

L_{s, h}

will be used.

Assume that the kernel

K : R \to R

satisfies the following conditions:

(i)

\int_{- 1}^{1} K (x) d x = 1

,

K \in C^{\infty} (R)

and supp

(K) \subseteq [- 1, 1]

;

(ii) There exists a fixed positive integer

k_{0}

such that

\begin{matrix} \int_{- 1}^{1} x^{j} K (x) d x = 0, j = 1, \dots, k_{0} . \end{matrix}

Example 3

([20]). Function

\begin{matrix} K (x) = a φ (x), \end{matrix}

where

\begin{matrix} φ (x) : = \{\begin{matrix} e^{- \frac{1}{1 - x^{2}}}, & | x | < 1 \\ 0, & | x | \geq 1 \end{matrix} \end{matrix}

and

a = {(\int_{- \infty}^{+ \infty} φ (x) d x)}^{- 1}

. Then, the kernel

K (x)

satisfies conditions (i) and (ii) with

k_{0} = 1

.

Motivated by Belomestny and Goldenshluger [16], we will construct the regression estimator via the Laplace transform. Note that

{\hat{f}}_{δ} (- z)

does not have zeros out of the imaginary axis. Then, the kernel

L_{s, h}

is defined by the inverse Laplace transform

\begin{matrix} L_{s, h} (t) & : = \frac{1}{2 π i} \int_{s - i \infty}^{s + i \infty} \frac{\hat{K} (z h)}{{\hat{f}}_{δ} (- z)} e^{z t} d z \\ = \frac{1}{2 π} \int_{- \infty}^{+ \infty} \frac{\hat{K} ((s + i v) h)}{{\hat{f}}_{δ} (- s - i v)} e^{(s + i v) t} d v, \end{matrix}

(4)

where

s \neq 0

,

h > 0

and

\hat{K} (\cdot)

is the Laplace transform of kernel K with the convergence region

Σ_{K} = C

. There is a complex-valued improper integral in (4). One can use the property of the Laplace transform to compute it, see [18].

The following lemma provides a infinite series of kernel

L_{s, h} (t)

. It is a specific form of Lemma 2 in [16]. In order to explain the construction of the estimator, we give the details of the proof.

Lemma 1.

Let (2) hold and

\int_{- \infty}^{+ \infty} | \hat{K} {(i v h) | | v |}^{γ} d v < \infty

.

(a) If

s > 0

, then

\begin{matrix} L_{+, h} (t) : = L_{s, h} (t) = \frac{{(2 θ)}^{γ}}{2 π} \sum_{l_{1} = 0}^{\infty} \dots \sum_{l_{γ} = 0}^{\infty} \int_{- \infty}^{+ \infty} \hat{K} (i v h) {(i v)}^{γ} e^{i v [t - γ θ - 2 θ (l_{1} + \dots + l_{γ})]} d v . \end{matrix}

(b) If

s < 0

, then

\begin{matrix} L_{-, h} (t) : = L_{s, h} (t) = \frac{{(- 2 θ)}^{γ}}{2 π} \sum_{l_{1} = 0}^{\infty} \dots \sum_{l_{γ} = 0}^{\infty} \int_{- \infty}^{+ \infty} \hat{K} (i v h) {(i v)}^{γ} e^{i v [t + γ θ + 2 θ (l_{1} + \dots + l_{γ})]} d v . \end{matrix}

Proof.

(a) If

s > 0

, we have

\begin{matrix} \begin{matrix} \frac{1}{1 - e^{- 2 θ (s + i v)}} = \sum_{l = 0}^{\infty} e^{- 2 θ l (s + i v)} . \end{matrix} \end{matrix}

Therefore,

\begin{matrix} \begin{matrix} \frac{1}{{[1 - e^{- 2 θ (s + i v)}]}^{γ}} = {[\sum_{l = 0}^{\infty} e^{- 2 θ l (s + i v)}]}^{γ} = \sum_{l_{1} = 0}^{\infty} \dots \sum_{l_{γ} = 0}^{\infty} e^{- 2 θ (s + i v) (l_{1} + \dots + l_{γ})} . \end{matrix} \end{matrix}

By (2) and (4),

\begin{matrix} \begin{matrix} L_{+, h} (t) : = L_{s, h} (t) & = \frac{1}{2 π} \int_{- \infty}^{+ \infty} \frac{\hat{K} ((s + i v) h) {[2 θ (s + i v)]}^{γ}}{{[1 - e^{- 2 θ (s + i v)}]}^{γ}} e^{(s + i v) (t - γ θ)} d v \\ = \frac{{(2 θ)}^{γ}}{2 π} \sum_{l_{1} = 0}^{\infty} \dots \sum_{l_{γ} = 0}^{\infty} \int_{- \infty}^{+ \infty} \hat{K} ((s + i v) h) {(s + i v)}^{γ} e^{(s + i v) [t - γ θ - 2 θ (l_{1} + \dots + l_{γ})]} d v \\ = \frac{{(2 θ)}^{γ}}{2 π} \sum_{l_{1} = 0}^{\infty} \dots \sum_{l_{γ} = 0}^{\infty} \int_{- \infty}^{+ \infty} \hat{K} (i v h) {(i v)}^{γ} e^{i v [t - γ θ - 2 θ (l_{1} + \dots + l_{γ})]} d v . \end{matrix} \end{matrix}

(b) If

s < 0

, then

\begin{matrix} \begin{matrix} {[1 - e^{- 2 θ (s + i v)}]}^{- γ} & = {[\frac{1 - e^{2 θ (s + i v)}}{- e^{2 θ (s + i v)}}]}^{- γ} = {(- 1)}^{γ} e^{2 γ θ (s + i v)} {[1 - e^{2 θ (s + i v)}]}^{- γ} \\ = {(- 1)}^{γ} e^{2 γ θ (s + i v)} \sum_{l_{1} = 0}^{\infty} \dots \sum_{l_{γ} = 0}^{\infty} e^{2 θ (s + i v) (l_{1} + \dots + l_{γ})} . \end{matrix} \end{matrix}

Similarly,

\begin{matrix} \begin{matrix} L_{-, h} (t) : = L_{s, h} (t) = \frac{{(- 2 θ)}^{γ}}{2 π} \sum_{l_{1} = 0}^{\infty} \dots \sum_{l_{γ} = 0}^{\infty} \int_{- \infty}^{+ \infty} \hat{K} (i v h) {(i v)}^{γ} e^{i v [t + γ θ + 2 θ (l_{1} + \dots + l_{γ})]} d v . \end{matrix} \end{matrix}

This ends the proof. □

The truncation is used to deal with infinite series. Select parameter N so that

\frac{N}{γ} \in N_{+}

. The cut-off kernels are defined by

\begin{matrix} L_{+, h}^{(N)} (t) : = \frac{{(2 θ)}^{γ}}{2 π} \sum_{l_{1} = 0}^{\frac{N}{γ}} \dots \sum_{l_{γ} = 0}^{\frac{N}{γ}} \int_{- \infty}^{+ \infty} \hat{K} (i v h) {(i v)}^{γ} e^{i v [t - γ θ - 2 θ (l_{1} + \dots + l_{γ})]} d v, \end{matrix}

(5)

\begin{matrix} L_{-, h}^{(N)} (t) : = \frac{{(- 2 θ)}^{γ}}{2 π} \sum_{l_{1} = 0}^{\frac{N}{γ}} \dots \sum_{l_{γ} = 0}^{\frac{N}{γ}} \int_{- \infty}^{+ \infty} \hat{K} (i v h) {(i v)}^{γ} e^{i v [t + γ θ + 2 θ (l_{1} + \dots + l_{γ})]} d v . \end{matrix}

(6)

Denote

\begin{matrix} L_{s, h}^{(N)} (t) : = \{\begin{matrix} L_{+, h}^{(N)} (t), & s > 0, \\ L_{-, h}^{(N)} (t), & s < 0 . \end{matrix} \end{matrix}

Motivated by the Nadaraya–Watson regression estimator, we define the regression estimator of

m (x)

as

\begin{matrix} {\tilde{m}}_{s, h}^{(N)} (x) : = \frac{{\tilde{p}}_{s, h}^{(N)} (x)}{{\tilde{f}}_{X, s, h}^{(N)} (x)}, \end{matrix}

(7)

where

\begin{matrix} {\tilde{f}}_{X, s, h}^{(N)} (x) : = \frac{1}{n} \sum_{j = 1}^{n} L_{s, h}^{(N)} (W_{j} - x) and {\tilde{p}}_{s, h}^{(N)} (x) : = \frac{1}{n} \sum_{j = 1}^{n} Y_{j} L_{s, h}^{(N)} (W_{j} - x) . \end{matrix}

(8)

In what follows, we will write

{\tilde{m}}_{+, h}^{(N)} (x)

and

{\tilde{m}}_{-, h}^{(N)} (x)

for the estimator (7) associated with

s > 0

and

s < 0

, respectively. Finally, our regression estimator is denoted by

\begin{matrix} {\tilde{m}}_{h}^{(N)} (x) : = \{\begin{matrix} {\tilde{m}}_{+, h}^{(N)} (x), & x \geq 0, \\ {\tilde{m}}_{-, h}^{(N)} (x), & x < 0 . \end{matrix} \end{matrix}

(9)

4. Strong Consistency

In this section, we investigate the consistency of the regression estimator (9). Roughly speaking, consistency means that the estimator

{\tilde{m}}_{h}^{(N)} (x)

converges to

m (x)

as the sample size tends to infinity.

Theorem 1

(Strong consistency). Consider the model (1) with (2). Suppose

f_{X}, p : = m f_{X} \in M_{r} (B) (r > \frac{1}{2})

,

E | Y_{1} |^{8 (γ + 1)} < + \infty

and kernel function K satisfies condition (i). If x is the Lebesgue point of both

f_{X}

and p

(f_{X} (x) \neq 0)

, then

{\tilde{m}}_{h}^{(N)} (x)

satisfies

\begin{matrix} lim_{n \to \infty} {\tilde{m}}_{h}^{(N)} (x) \overset{a . s .}{=} m (x) \end{matrix}

with

h = n^{- \frac{1}{6 (γ + 1)}}

and

n^{\frac{1}{3 (γ + 1)}} \leq N \leq 2 n^{\frac{1}{3 (γ + 1)}}

.

Proof.

(1)

We consider the estimator

{\tilde{m}}_{+, h}^{(N)} (x)

for

x \geq 0

.

Note that

{\tilde{m}}_{+, h}^{(N)} (x) = \frac{{\tilde{p}}_{+, h}^{(N)} (x)}{{\tilde{f}}_{X, +, h}^{(N)} (x)}

,

m (x) = \frac{p (x)}{f_{X} (x)}

and

f_{X} (x) \neq 0

. Then, it is sufficient to prove

lim_{n \to \infty} {\tilde{p}}_{+, h}^{(N)} (x) \overset{a . s .}{=} p (x)

and

lim_{n \to \infty} {\tilde{f}}_{X, +, h}^{(N)} (x) \overset{a . s .}{=} f_{X} (x)

.

Now, we prove

lim_{n \to \infty} {\tilde{p}}_{+, h}^{(N)} (x) \overset{a . s .}{=} p (x)

. For any

ϵ > 0

,

P [| {\tilde{p}}_{+, h}^{(N)} (x) - p (x) | > \sqrt{ϵ}] \leq P [| {\tilde{p}}_{+, h}^{(N)} (x) - E {\tilde{p}}_{+, h}^{(N)} (x) | > \frac{\sqrt{ϵ}}{2}] + χ_{(\frac{\sqrt{ϵ}}{2}, \infty)} (| E {\tilde{p}}_{+, h}^{(N)} (x) - p (x) |) .

By Markov’s inequality, we obtain

\begin{matrix} P [| {\tilde{p}}_{+, h}^{(N)} (x) - p (x) | > \sqrt{ϵ}] \leq c_{1} ϵ^{- s} E | {\tilde{p}}_{+, h}^{(N)} (x) - E {\tilde{p}}_{+, h}^{(N)} {(x) |}^{2 s} + χ_{(\frac{\sqrt{ϵ}}{2}, \infty)} (| E {\tilde{p}}_{+, h}^{(N)} (x) - p (x) |) \end{matrix}

(10)

for

s : = 4 (γ + 1)

. This motivates us to derive an upper bound on

E | {\tilde{p}}_{+, h}^{(N)} (x) - E {\tilde{p}}_{+, h}^{(N)} {(x) |}^{2 s}

. Combining (5) with (8), we have

\begin{matrix} {\tilde{p}}_{+, h}^{(N)} (x) & = \frac{1}{n} \sum_{j = 1}^{n} Y_{j} L_{+, h}^{(N)} (W_{j} - x) \\ = \frac{{(2 θ)}^{γ}}{2 π n} \sum_{j = 1}^{n} \int_{- \infty}^{+ \infty} \hat{K} (i v h) {(i v)}^{γ} (Y_{j} e^{i v W_{j}}) e^{- i v (x + γ θ)} [\sum_{l_{1} = 0}^{\frac{N}{γ}} \dots \sum_{l_{γ} = 0}^{\frac{N}{γ}} e^{- i 2 θ v (l_{1} + \dots + l_{γ})}] d v, \end{matrix}

and

\begin{matrix} E {\tilde{p}}_{+, h}^{(N)} (x) = \frac{{(2 θ)}^{γ}}{2 π n} \sum_{j = 1}^{n} \int_{- \infty}^{+ \infty} \hat{K} (i v h) {(i v)}^{γ} E (Y_{j} e^{i v W_{j}}) e^{- i v (x + γ θ)} [\sum_{l_{1} = 0}^{\frac{N}{γ}} \dots \sum_{l_{γ} = 0}^{\frac{N}{γ}} e^{- i 2 θ v (l_{1} + \dots + l_{γ})}] d v . \end{matrix}

(11)

We obtain

\begin{matrix} E | {\tilde{p}}_{+, h}^{(N)} (x) - E {\tilde{p}}_{+, h}^{(N)} {(x) |}^{2 s} = {[\frac{{(2 θ)}^{γ}}{2 π n}]}^{2 s} E | \sum_{j = 1}^{n} \int_{- \infty}^{+ \infty} \hat{K} (i v h) {(i v)}^{γ} Ψ_{j} (v) Φ_{+, N} (x, v) d v |^{2 s}, \end{matrix}

where

Ψ_{j} (v) : = Y_{j} e^{i v W_{j}} - E (Y_{j} e^{i v W_{j}}), Φ_{+, N} (x, v) : = e^{- i v (x + γ θ)} [\sum_{l_{1} = 0}^{\frac{N}{γ}} \dots \sum_{l_{γ} = 0}^{\frac{N}{γ}} e^{- i 2 θ v (l_{1} + \dots + l_{γ})}]

.

Thus,

\begin{matrix} E | {\tilde{p}}_{+, h}^{(N)} (x) - E {\tilde{p}}_{+, h}^{(N)} {(x) |}^{2 s} \\ = {[\frac{{(2 θ)}^{γ}}{2 π n}]}^{2 s} \sum_{j_{1} = 1}^{n} \dots \sum_{j_{2 s} = 1}^{n} & E [(\int_{- \infty}^{+ \infty} \dots \int_{- \infty}^{+ \infty} \prod_{k = 1}^{s} \hat{K} (i v_{2 k - 1} h) \hat{K} (- i v_{2 k} h) {(i v_{2 k - 1})}^{γ} {(- i v_{2 k})}^{γ} \\ \times Φ_{+, N} (x, v_{2 k - 1}) & Φ_{+, N} (x, - v_{2 k}) Ψ_{j_{2 k - 1}} (v_{2 k - 1}) Ψ_{j_{2 k}} (- v_{2 k}) d v_{1} \dots d v_{2 s})] \\ = {[\frac{{(2 θ)}^{γ}}{2 π n}]}^{2 s} \sum_{j_{1} = 1}^{n} \dots \sum_{j_{2 s} = 1}^{n} & \int_{- \infty}^{+ \infty} \dots \int_{- \infty}^{+ \infty} [\prod_{k = 1}^{s} \hat{K} (i v_{2 k - 1} h) \hat{K} (- i v_{2 k} h) {(i v_{2 k - 1})}^{γ} {(- i v_{2 k})}^{γ} \\ \times Φ_{+, N} (x, v_{2 k - 1}) & Φ_{+, N} (x, - v_{2 k})] E [\prod_{k = 1}^{s} Ψ_{j_{2 k - 1}} (v_{2 k - 1}) Ψ_{j_{2 k}} (- v_{2 k})] d v_{1} \dots d v_{2 s} \\ \leq {[\frac{{(2 θ)}^{γ}}{2 π n}]}^{2 s} \sum_{j_{1} = 1}^{n} \dots \sum_{j_{2 s} = 1}^{n} & \int_{- \infty}^{+ \infty} \dots \int_{- \infty}^{+ \infty} [\prod_{k = 1}^{s} | \hat{K} (i v_{2 k - 1} h) | | \hat{K} (- i v_{2 k} h) | | v_{2 k - 1} |^{γ} {| v_{2 k} |}^{γ}] \\ \times {(\frac{N}{γ} + 1)}^{2 γ s} & | E [\prod_{k = 1}^{s} Ψ_{j_{2 k - 1}} (v_{2 k - 1}) Ψ_{j_{2 k}} (- v_{2 k})] | d v_{1} \dots d v_{2 s} . \end{matrix}

(12)

Let

# A

denote the number of elements contained in the set A. If

# {j_{1}, \dots, j_{2 s}} > s

, at least one of

Ψ_{j_{l}}

is independent of all other

Ψ_{j_{l^{'}}}

,

l^{'} \neq l

. Hence,

\begin{matrix} E [\prod_{k = 1}^{s} Ψ_{j_{2 k - 1}} (v_{2 k - 1}) Ψ_{j_{2 k}} (- v_{2 k})] = 0 . \end{matrix}

On the other hand, if

# {j_{1}, \dots, j_{2 s}} = s_{1}

for

s_{1} \leq s

, by Jensen’s inequality, we obtain

\begin{matrix} | E [\prod_{k = 1}^{s} Ψ_{j_{2 k - 1}} (v_{2 k - 1}) Ψ_{j_{2 k}} (- v_{2 k})] | & = | E {[Ψ_{j_{1}^{'}} (t_{1}^{'})]}^{λ_{1}} \dots {[Ψ_{j_{s_{1}}^{'}} (t_{s_{1}}^{'})]}^{λ_{s_{1}}} | \leq E | Ψ_{1} (t_{1}) |^{2 s} \\ \leq E (| Y_{1} | + E | Y_{1} {|)}^{2 s} \leq 4^{s} E {| Y_{1} |}^{2 s}, \end{matrix}

where

λ_{1} + \dots + λ_{s_{1}} = 2 s

. Let

J_{n} = {(j_{1}, \dots, j_{2 s}) : # {j_{1}, \dots, j_{2 s}} \leq s, j_{i} \in {1, \dots, n}, i = 1, \dots, 2 s}

. Then,

\begin{matrix} E | {\tilde{p}}_{+, h}^{(N)} (x) - E {\tilde{p}}_{+, h}^{(N)} {(x) |}^{2 s} \leq {[\frac{{(2 θ)}^{γ}}{2 π n}]}^{2 s} (c_{2} N^{2 γ s}) (4^{s} E | Y_{1} |^{2 s}) \sum_{j \in J_{n}} {(\int_{- \infty}^{+ \infty} | \hat{K} {(i v h) | | v |}^{γ} d v)}^{2 s} . \end{matrix}

Since

{| v |}^{k} | \hat{K} (i v) | \leq c (k)

for all k, we obtain that

\int_{- \infty}^{+ \infty} | \hat{K} {(i v h) | | v |}^{γ} d v \leq c_{3} h^{- (γ + 2)}

for

k = γ + 2

. This, with

# J_{n} \leq c_{4} n^{s}

, leads to

\begin{matrix} E | {\tilde{p}}_{+, h}^{(N)} (x) - E {\tilde{p}}_{+, h}^{(N)} {(x) |}^{2 s} \leq c_{5} n^{- s} N^{2 γ s} h^{- 2 s (γ + 2)} \leq c_{6} n^{\frac{- s}{3 (γ + 1)}} . \end{matrix}

(13)

Inserting this into (10), we obtain

\begin{matrix} P [| {\tilde{p}}_{+, h}^{(N)} (x) - p (x) | > \sqrt{ϵ}] \leq c_{7} n^{\frac{- s}{3 (γ + 1)}} + χ_{(\frac{\sqrt{ϵ}}{2}, \infty)} (| E {\tilde{p}}_{+, h}^{(N)} (x) - p (x) |) . \end{matrix}

(14)

Note that

(W_{j}, Y_{j})

are identically distributed. Then, it follows from (11) and

E (Y_{j} e^{i v W_{j}}) = E (Y_{j} e^{i v X_{j}}) E (e^{i v δ_{j}})

that

\begin{matrix} E {\tilde{p}}_{+, h}^{(N)} (x) \\ = \frac{{(2 θ)}^{γ}}{2 π} \int_{- \infty}^{+ \infty} \hat{K} (i v h) {(i v)}^{γ} E (Y_{1} e^{i v X_{1}}) E (e^{i v δ_{1}}) e^{- i v (x + γ θ)} [\sum_{l_{1} = 0}^{\frac{N}{γ}} \dots \sum_{l_{γ} = 0}^{\frac{N}{γ}} e^{- i 2 θ v (l_{1} + \dots + l_{γ})}] d v, \end{matrix}

(15)

where

\begin{matrix} \sum_{l_{1} = 0}^{\frac{N}{γ}} \dots \sum_{l_{γ} = 0}^{\frac{N}{γ}} {[e^{- i 2 θ v}]}^{l_{1} + \dots + l_{γ}} = {[\sum_{l = 0}^{\frac{N}{γ}} e^{- i v (2 θ l)}]}^{γ} = {[\frac{1 - {(e^{- i 2 θ v})}^{\frac{N}{γ} + 1}}{1 - e^{- i 2 θ v}}]}^{γ} . \end{matrix}

(16)

By (2), we have

\begin{matrix} E {\tilde{p}}_{+, h}^{(N)} (x) & = \frac{1}{2 π} \int_{- \infty}^{+ \infty} E (Y_{1} e^{i v X_{1}}) \hat{K} (i v h) e^{- i v x} {[1 - {(e^{- i 2 θ v})}^{\frac{N}{γ} + 1}]}^{γ} d v \\ = \frac{1}{2 π} \int_{- \infty}^{+ \infty} E [e^{i v X_{1}} \cdot E (Y_{1} | X_{1})] \hat{K} (i v h) e^{- i v x} \sum_{l = 0}^{γ} (\begin{matrix} γ \\ l \end{matrix}) {(- 1)}^{l} {(e^{- i 2 θ v})}^{l (\frac{N}{γ} + 1)} d v \\ = \sum_{l = 0}^{γ} (\begin{matrix} γ \\ l \end{matrix}) {(- 1)}^{l} \frac{1}{2 π} \int_{- \infty}^{+ \infty} [\int_{- \infty}^{+ \infty} p (t) e^{i v t} d t] \hat{K} (i v h) e^{- i v [x + 2 θ l (\frac{N}{γ} + 1)]} d v \\ = \sum_{l = 0}^{γ} (\begin{matrix} γ \\ l \end{matrix}) {(- 1)}^{l} \frac{1}{h} \int_{- \infty}^{+ \infty} p (t) K (\frac{t - x - 2 θ l (\frac{N}{γ} + 1)}{h}) d t \\ = \int_{- \infty}^{+ \infty} \frac{1}{h} K (\frac{t - x}{h}) p (t) d t + T_{+, \frac{N}{γ}} (p; x), \end{matrix}

(17)

where

\begin{matrix} T_{+, \frac{N}{γ}} (p; x) & : = \sum_{l = 1}^{γ} (\begin{matrix} γ \\ l \end{matrix}) {(- 1)}^{l} \frac{1}{h} \int_{- \infty}^{+ \infty} p (t) K (\frac{t - x - 2 θ l (\frac{N}{γ} + 1)}{h}) d t \\ = \sum_{l = 1}^{γ} (\begin{matrix} γ \\ l \end{matrix}) {(- 1)}^{l} \int_{- 1}^{1} K (y) p (y h + x + 2 θ l (\frac{N}{γ} + 1)) d y . \end{matrix}

Hence,

\begin{matrix} | E {\tilde{p}}_{+, h}^{(N)} (x) - p (x) | \leq | \int_{- \infty}^{+ \infty} \frac{1}{h} K (\frac{t - x}{h}) p (t) d t - p (x) | + | T_{+, \frac{N}{γ}} (p; x) | \\ = | \int_{- 1}^{1} K (y) [p (x + y h) - p (x)] d y | + | T_{+, \frac{N}{γ}} (p; x) | . \end{matrix}

(18)

Since

p \in M_{r} (B)

and considering the boundedness of K,

\begin{matrix} | T_{+, \frac{N}{γ}} (p; x) | \leq c_{8} \sum_{l = 1}^{γ} (\begin{matrix} γ \\ l \end{matrix}) \int_{- 1}^{1} |p (y h + x + 2 θ l (\frac{N}{γ} + 1))| d y \leq \frac{c_{9} B}{h {(x + 2 θ N \cdot \frac{1}{γ})}^{r}} \leq \frac{c_{10} B}{h N^{r}} \end{matrix}

(19)

holds for an h that is small enough. It follows from

r > \frac{1}{2}

,

h = n^{- \frac{1}{6 (γ + 1)}}

and

N \geq n^{\frac{1}{3 (γ + 1)}}

that

\begin{matrix} | T_{+, \frac{N}{γ}} (p; x) | \overset{n \to \infty}{\to} 0 . \end{matrix}

Note that the kernel function K satisfies condition (i) and

p \in L (R)

, then

\begin{matrix} | \int_{- 1}^{1} K (y) [p (x + y h) - p (x)] d y | \overset{h \to 0}{\to} 0 \end{matrix}

holds for each Lebesgue point x of p. Hence, for an n that is sufficiently large, the term

χ_{(\frac{\sqrt{ϵ}}{2}, \infty)} (| E {\tilde{p}}_{+, h}^{(N)} (x) - p (x) |)

vanishes. This, with (14), shows

\begin{matrix} P [| {\tilde{p}}_{+, h}^{(N)} (x) - p (x) | > \sqrt{ϵ}] \leq c_{7} n^{\frac{- s}{3 (γ + 1)}} \end{matrix}

(20)

for an n that is large enough. Since

s = 4 (γ + 1)

, we have

\begin{matrix} \sum_{n = 1}^{\infty} P [| {\tilde{p}}_{+, h}^{(N)} (x) - p (x) | > \sqrt{ϵ}] < \infty . \end{matrix}

For any

ϵ > 0

, it follows from the Borel–Cantelli lemma that

\begin{matrix} P \{\bar{lim_{n \to \infty}} [| {\tilde{p}}_{+, h}^{(N)} (x) - p (x) | > \sqrt{ϵ}]\} = 0 . \end{matrix}

Thus,

\begin{matrix} lim_{n \to \infty} {\tilde{p}}_{+, h}^{(N)} (x) \overset{a . s .}{=} p (x) . \end{matrix}

When putting

Y_{j} \equiv 1

almost surely, we have

\begin{matrix} lim_{n \to \infty} {\tilde{f}}_{X, +, h}^{(N)} (x) \overset{a . s .}{=} f_{X} (x) . \end{matrix}

Hence,

\begin{matrix} lim_{n \to \infty} {\tilde{m}}_{+, h}^{(N)} (x) \overset{a . s .}{=} m (x) . \end{matrix}

(2) We consider the estimator

{\tilde{m}}_{-, h}^{(N)} (x)

for

x < 0

. Inserting (6) into (8), we obtain

\begin{matrix} {\tilde{p}}_{-, h}^{(N)} (x) & = \frac{1}{n} \sum_{j = 1}^{n} Y_{j} L_{-, h}^{(N)} (W_{j} - x) \\ = \frac{{(- 2 θ)}^{γ}}{2 π n} \sum_{j = 1}^{n} \int_{- \infty}^{+ \infty} \hat{K} (i v h) {(i v)}^{γ} (Y_{j} e^{i v W_{j}}) e^{- i v (x - γ θ)} [\sum_{l_{1} = 0}^{\frac{N}{γ}} \dots \sum_{l_{γ} = 0}^{\frac{N}{γ}} e^{i 2 θ v (l_{1} + \dots + l_{γ})}] d v, \end{matrix}

and

\begin{matrix} E {\tilde{p}}_{-, h}^{(N)} (x) = \frac{{(- 2 θ)}^{γ}}{2 π n} \sum_{j = 1}^{n} \int_{- \infty}^{+ \infty} \hat{K} (i v h) {(i v)}^{γ} E (Y_{j} e^{i v W_{j}}) e^{- i v (x - γ θ)} [\sum_{l_{1} = 0}^{\frac{N}{γ}} \dots \sum_{l_{γ} = 0}^{\frac{N}{γ}} e^{i 2 θ v (l_{1} + \dots + l_{γ})}] d v . \end{matrix}

(21)

We obtain

\begin{matrix} E | {\tilde{p}}_{-, h}^{(N)} (x) - E {\tilde{p}}_{-, h}^{(N)} {(x) |}^{2 s} = {[\frac{{(2 θ)}^{γ}}{2 π n}]}^{2 s} E | \sum_{j = 1}^{n} \int_{- \infty}^{+ \infty} \hat{K} (i v h) {(i v)}^{γ} Ψ_{j} (v) Φ_{-, N} (x, v) d v |^{2 s}, \end{matrix}

where

Φ_{-, N} (x, v) : = e^{- i v (x - γ θ)} [\sum_{l_{1} = 0}^{\frac{N}{γ}} \dots \sum_{l_{γ} = 0}^{\frac{N}{γ}} e^{i 2 θ v (l_{1} + \dots + l_{γ})}]

. Similar to (12) and (13), we obtain

\begin{matrix} E | {\tilde{p}}_{-, h}^{(N)} (x) - E {\tilde{p}}_{-, h}^{(N)} {(x) |}^{2 s} \leq c_{11} n^{\frac{- s}{3 (γ + 1)}} . \end{matrix}

By (21), we have

\begin{matrix} E {\tilde{p}}_{-, h}^{(N)} (x) & = \frac{{(- 2 θ)}^{γ}}{2 π} \int_{- \infty}^{+ \infty} \hat{K} (i v h) {(i v)}^{γ} E (Y_{1} e^{i v X_{1}}) E (e^{i v δ_{1}}) e^{- i v (x - γ θ)} \\ \times [\sum_{l_{1} = 0}^{\frac{N}{γ}} \dots \sum_{l_{γ} = 0}^{\frac{N}{γ}} e^{i 2 θ v (l_{1} + \dots + l_{γ})}] d v, \end{matrix}

where

\begin{matrix} \sum_{l_{1} = 0}^{\frac{N}{γ}} \dots \sum_{l_{γ} = 0}^{\frac{N}{γ}} e^{i 2 θ v (l_{1} + \dots + l_{γ})} = {[\frac{1 - {(e^{i 2 θ v})}^{\frac{N}{γ} + 1}}{1 - e^{i 2 θ v}}]}^{γ} . \end{matrix}

By

\frac{1}{{(1 - e^{i 2 θ v})}^{γ}} = \frac{1}{{(- e^{i 2 θ v})}^{γ} {(1 - e^{- i 2 θ v})}^{γ}}

and (2), we have that

\frac{{(- i 2 θ v)}^{γ} e^{i v (γ θ)}}{{(1 - e^{i 2 θ v})}^{γ}} = \frac{1}{{\hat{f}}_{δ} (- i v)}

. So,

\begin{matrix} E {\tilde{p}}_{-, h}^{(N)} (x) = \frac{1}{2 π} \int_{- \infty}^{+ \infty} \hat{K} (i v h) E (Y_{1} e^{i v X_{1}}) e^{- i v x} {[1 - {(e^{i 2 θ v})}^{\frac{N}{γ} + 1}]}^{γ} d v . \end{matrix}

Similar to (17), we obtain

\begin{matrix} E {\tilde{p}}_{-, h}^{(N)} (x) = \int_{- \infty}^{+ \infty} \frac{1}{h} K (\frac{t - x}{h}) p (t) d t + T_{-, \frac{N}{γ}} (p; x), \end{matrix}

where

\begin{matrix} T_{-, \frac{N}{γ}} (p; x) & : = \sum_{l = 1}^{γ} (\begin{matrix} γ \\ l \end{matrix}) {(- 1)}^{l} \frac{1}{h} \int_{- \infty}^{+ \infty} p (t) K (\frac{t - x + 2 θ l (\frac{N}{γ} + 1)}{h}) d t \\ = \sum_{l = 1}^{γ} (\begin{matrix} γ \\ l \end{matrix}) {(- 1)}^{l} \int_{- 1}^{1} K (y) p (y h + x - 2 θ l (\frac{N}{γ} + 1)) d y . \end{matrix}

Thus, we have

\begin{matrix} | E {\tilde{p}}_{-, h}^{(N)} (x) - p (x) | \leq | \int_{- 1}^{1} K (y) [p (x + y h) - p (x)] d y | + | T_{-, \frac{N}{γ}} (p; x) | . \end{matrix}

(22)

Since

p \in M_{r} (B)

and considering the boundedness of K,

\begin{matrix} | T_{-, \frac{N}{γ}} (p; x) | \leq c_{12} \sum_{l = 1}^{γ} (\begin{matrix} γ \\ l \end{matrix}) \int_{- 1}^{1} |p (y h + x - 2 θ l (\frac{N}{γ} + 1))| d y \leq \frac{c_{13} B}{h {(- x + 2 θ N \cdot \frac{1}{γ})}^{r}} \leq \frac{c_{14} B}{h N^{r}} \end{matrix}

(23)

holds for an h that is small enough.

Similar to

x \geq 0

, we get

\begin{matrix} lim_{n \to \infty} {\tilde{m}}_{-, h}^{(N)} (x) \overset{a . s .}{=} m (x) . \end{matrix}

This completes the proof. □

Remark 1.

Theorem 1 shows the strong consistency of kernel estimator

{\tilde{m}}_{h}^{(N)} (x)

. It is different from the work of Meister [6] in that the density function of our covariate error δ contains zeros in the Fourier domain. Our covariate error belongs to the Fourier oscillating noise considered by Delaigle and Meister [1]. Compared to their work, we construct a regression estimator via the Laplace transform without assuming

f_{X}

and m to be compact.

5. Convergence Rate

In this section, we focus on the convergence rate in the weak sense. Meister [6] introduces the weak convergence rate by modifying the concept of weak consistency. A regression estimator

{\hat{m}}_{n} (x)

is said to attain the weak convergence rate

ε_{n}

if

lim_{C \to \infty} (\underset{n \to \infty}{lim sup} sup_{(m, f_{X}) \in P} P [| {\hat{m}}_{n} {(x) - m (x) |}^{2} \geq C \cdot ε_{n}]) = 0 .

The set

P

is the collection of all pairs

(m, f_{X})

that satisfy some conditions. The order of limits is first

n \to \infty

, and then

C \to \infty

. Here, C is independent of n.

Define the set

\begin{matrix} P_{β, r; x} : = {(m, f_{X}) : f_{X}, m f_{X} \in F_{σ, β, r; x} (A, B), | m (x) | \leq C_{1}, f_{X} (x) \geq C_{2}, ‖ m (\cdot) ‖_{\infty} \leq C_{3}}, \end{matrix}

where

C_{1}, C_{2}, C_{3} > 0

.

The following Lemma is used to prove the theorem in this section.

Lemma 2

([6]). If

p : = m f_{X}

,

{\hat{m}}_{n} (x) = \frac{p_{n} (x)}{f_{X, n} (x)}

,

| m (x) | < + \infty

and

f_{X} (x) \neq 0

. Then, for a small enough

ϵ > 0

,

\begin{matrix} P [| {\hat{m}}_{n} {(x) - m (x) |}^{2} > ϵ] \leq P [| p_{n} {(x) - p (x) |}^{2} > c_{1} ϵ] + P [| f_{X, n} (x) - f_{X} {(x) |}^{2} > c_{2} ϵ] \end{matrix}

with two positive constants,

c_{1}

and

c_{2}

.

Theorem 2.

Consider the model (1) with (2). Assume that

(m, f_{X}) \in P_{β, r; x}

with

r = 2 γ - 2

if

γ > 1

, and

r > 0

if

γ = 1

. Suppose kernel K satisfies conditions (i), (ii) with

k_{0} \geq β

. Let

h = n^{\frac{- 1}{2 β + 2 γ + 1}}

,

N \geq n^{\frac{β + 1}{r (2 β + 2 γ + 1)}}

. Then,

lim_{C \to \infty} (\underset{n \to \infty}{lim sup} sup_{(m, f_{X}) \in P_{β, r; x}} P [| {\tilde{m}}_{h}^{(N)} {(x) - m (x) |}^{2} \geq C \cdot ε_{n}]) = 0,

where

ε_{n} = n^{\frac{- 2 β}{2 β + 2 γ + 1}}

.

Proof.

(1) We assume that

x \geq 0

and consider the estimator

{\tilde{m}}_{+, h}^{(N)} (x)

. Applying Lemma 2 and Markov’s inequality, we obtain

\begin{matrix} P [| {\tilde{m}}_{+, h}^{(N)} {(x) - m (x) |}^{2} \geq C \cdot ε_{n}] \leq \frac{c_{3}}{C ε_{n}} (E | {\tilde{p}}_{+, h}^{(N)} {(x) - p (x) |}^{2} + E {| {\tilde{f}}_{X, +, h}^{(N)} (x) - f_{X} (x) |}^{2}), \end{matrix}

(24)

where

c_{3}

is the larger of

\frac{1}{c_{1}}

and

\frac{1}{c_{2}}

, and

c_{1}, c_{2}

appear in Lemma 2. Then,

E | {\tilde{p}}_{+, h}^{(N)} {(x) - p (x) |}^{2} = var [{\tilde{p}}_{+, h}^{(N)} (x)] + {| E {\tilde{p}}_{+, h}^{(N)} (x) - p (x) |}^{2},

(25)

and

E | {\tilde{f}}_{X, +, h}^{(N)} (x) - f_{X} {(x) |}^{2} = var [{\tilde{f}}_{X, +, h}^{(N)} (x)] + {| E {\tilde{f}}_{X, +, h}^{(N)} (x) - f_{X} (x) |}^{2} .

(26)

First, we estimate

| E {\tilde{p}}_{+, h}^{(N)} {(x) - p (x) |}^{2}

and

| E {\tilde{f}}_{X, +, h}^{(N)} (x) - f_{X} {(x) |}^{2}

. By (18), we have

\begin{matrix} | E {\tilde{p}}_{+, h}^{(N)} {(x) - p (x) |}^{2} \leq 2 (| \int_{- 1}^{1} K (y) [p (x + y h) - p (x)] d y |^{2} + | T_{+, \frac{N}{γ}} (p; x) |^{2}) . \end{matrix}

(27)

By Taylor expansion of p with the degree

⌊ β ⌋ - 1

, there exists

0 < η < 1

such that

\begin{matrix} |\int_{- 1}^{1} K (y) [p (x + y h) - p (x)] d y| \\ = |\int_{- 1}^{1} K (y) [\sum_{j = 1}^{⌊ β ⌋} \frac{{(y h)}^{j}}{j!} p^{(j)} (x) + \frac{{(y h)}^{⌊ β ⌋}}{⌊ β ⌋!} (p^{(⌊ β ⌋)} (x + η y h) - p^{(⌊ β ⌋)} (x))] d y| \\ \leq | \int_{- 1}^{1} K (y) \sum_{j = 1}^{⌊ β ⌋} \frac{{(y h)}^{j}}{j!} p^{(j)} (x) d y | + | \int_{- 1}^{1} K (y) \frac{{(y h)}^{⌊ β ⌋}}{⌊ β ⌋!} (p^{(⌊ β ⌋)} (x + η y h) - p^{(⌊ β ⌋)} (x)) d y | . \end{matrix}

Since kernel K satisfies condition (ii) and

β \leq k_{0}

, we have

\begin{matrix} \int_{- 1}^{1} K (y) \sum_{j = 1}^{⌊ β ⌋} \frac{{(y h)}^{j}}{j!} p^{(j)} (x) d y = 0 . \end{matrix}

By

p \in H_{σ, β; x} (A)

, we find that

\begin{matrix} |\int_{- 1}^{1} K (y) \frac{{(y h)}^{⌊ β ⌋}}{⌊ β ⌋!} (p^{(⌊ β ⌋)} (x + η y h) - p^{(⌊ β ⌋)} (x)) d y| \\ \leq \int_{- 1}^{1} | K (y) | \frac{{| y h |}^{⌊ β ⌋}}{⌊ β ⌋!} |p^{(⌊ β ⌋)} (x + η y h) - p^{(⌊ β ⌋)} (x)| d y \\ \leq \int_{- 1}^{1} | K (y) | \frac{{A | y h |}^{β} {∥ η |}^{β - ⌊ β ⌋}}{⌊ β ⌋!} d y \\ \leq c_{4} A h^{β} \end{matrix}

(28)

holds for an h that is small enough. Equations (19) and (28) imply the following upper bound:

\begin{matrix} | E {\tilde{p}}_{+, h}^{(N)} {(x) - p (x) |}^{2} \leq c_{5} (A^{2} h^{2 β} + \frac{B^{2}}{h^{2} N^{2 r}}) . \end{matrix}

(29)

Now, we estimate the term

| E {\tilde{f}}_{X, +, h}^{(N)} (x) - f_{X} {(x) |}^{2}

. By (8) and (5),

\begin{matrix} E {\tilde{f}}_{X, +, h}^{(N)} (x) = E [L_{+, h}^{(N)} (W_{1} - x)] \\ = \frac{{(2 θ)}^{γ}}{2 π} \int_{- \infty}^{+ \infty} \hat{K} (i v h) {(i v)}^{γ} E (e^{i v X_{1}}) E (e^{i v δ_{1}}) e^{- i v (x + γ θ)} [\sum_{l_{1} = 0}^{\frac{N}{γ}} \dots \sum_{l_{γ} = 0}^{\frac{N}{γ}} e^{- i 2 θ v (l_{1} + \dots + l_{γ})}] d v . \end{matrix}

(30)

Note that

E (e^{i v X_{1}}) = \int_{- \infty}^{+ \infty} f_{X} (t) e^{i v t} d t

. Then, similar arguments to (15)–(17) show

\begin{matrix} E {\tilde{f}}_{X, +, h}^{(N)} (x) = \int_{- \infty}^{+ \infty} \frac{1}{h} K (\frac{t - x}{h}) f_{X} (t) d t + T_{+, \frac{N}{γ}} (f_{X}; x) . \end{matrix}

(31)

Similar to (27)–(29), we have

\begin{matrix} | E {\tilde{f}}_{X, +, h}^{(N)} (x) - f_{X} {(x) |}^{2} \leq c_{6} (A^{2} h^{2 β} + \frac{B^{2}}{h^{2} N^{2 r}}) . \end{matrix}

(32)

Now, we estimate

var [{\tilde{p}}_{+, h}^{(N)} (x)]

and

var [{\tilde{f}}_{X, +, h}^{(N)} (x)]

. By (8), we have

\begin{matrix} var [{\tilde{p}}_{+, h}^{(N)} (x)] & \leq \frac{1}{n} E [| Y_{1} |^{2} {| L_{+, h}^{(N)} (W_{1} - x) |}^{2}] \\ = \frac{1}{n} E [E (| Y_{1} |^{2} | L_{+, h}^{(N)} (W_{1} - x) |^{2} | X_{1})] \\ = \frac{1}{n} \int_{- \infty}^{+ \infty} E (| Y_{1} |^{2} | X_{1} = t) E | L_{+, h}^{(N)} (t + δ_{1} - x) |^{2} f_{X} (t) d t . \end{matrix}

Note that

var (Y_{1} | X_{1} = t) = E (| Y_{1} |^{2} | X_{1} = t) - m^{2} (t)

. It follows from

‖ var (Y_{1} | X_{1} = \cdot) ‖_{\infty} = E ϵ_{j}^{2}

and

{‖ m (\cdot) ‖}_{\infty} \leq C_{3}

that

‖ E (| Y_{1} |^{2} | X_{1} = \cdot {) ‖}_{\infty} \leq ‖ var (Y_{1} | X_{1} = \cdot) ‖_{\infty} + {‖ m^{2} (\cdot) ‖}_{\infty} \leq c_{7}

. Then,

\begin{matrix} var [{\tilde{p}}_{+, h}^{(N)} (x)] & \leq \frac{c_{7}}{n} \int_{- \infty}^{+ \infty} E {| L_{+, h}^{(N)} (t + δ_{1} - x) |}^{2} f_{X} (t) d t \\ = \frac{c_{7}}{n} E [E (| L_{+, h}^{(N)} (W_{1} - x) |^{2} | X_{1})] \\ = \frac{c_{7}}{n} \int_{- \infty}^{+ \infty} | L_{+, h}^{(N)} (ω - x) |^{2} f_{W} (ω) d ω . \end{matrix}

(33)

It follows from (5) that

\begin{matrix} L_{+, h}^{(N)} (t) = \frac{{(2 θ)}^{γ}}{h^{γ + 1}} \sum_{l_{1} = 0}^{\frac{N}{γ}} \dots \sum_{l_{γ} = 0}^{\frac{N}{γ}} K^{(γ)} (\frac{t - γ θ - 2 θ (l_{1} + \dots + l_{γ})}{h}) . \end{matrix}

Therefore,

\begin{matrix} var [{\tilde{p}}_{+, h}^{(N)} (x)] & \leq \frac{c_{7}}{n} \int_{- \infty}^{+ \infty} {|\frac{{(2 θ)}^{γ}}{h^{γ + 1}} \sum_{l_{1} = 0}^{\frac{N}{γ}} \dots \sum_{l_{γ} = 0}^{\frac{N}{γ}} K^{(γ)} (\frac{ω - x - γ θ - 2 θ (l_{1} + \dots + l_{γ})}{h})|}^{2} \\ \times f_{W} (ω) d ω \\ \leq \frac{c_{7} {(2 θ)}^{2 γ}}{n h^{2 γ + 2}} \int_{- \infty}^{+ \infty} {[\sum_{l_{1} = 0}^{\frac{N}{γ}} \dots \sum_{l_{γ} = 0}^{\frac{N}{γ}} |K^{(γ)} (\frac{ω - x - γ θ - 2 θ (l_{1} + \dots + l_{γ})}{h})|]}^{2} \\ \times f_{W} (ω) d ω . \end{matrix}

(34)

Let

\begin{matrix} C_{l, γ} : = (\begin{matrix} l + γ - 1 \\ γ - 1 \end{matrix}), \end{matrix}

where

C_{l, γ}

is the number of weak compositions of l in

γ

parts [21]. Note that

\begin{matrix} \sum_{l_{1} = 0}^{\frac{N}{γ}} \dots \sum_{l_{γ} = 0}^{\frac{N}{γ}} |K^{(γ)} (\frac{ω - x - γ θ - 2 θ (l_{1} + \dots + l_{γ})}{h})| \leq \sum_{l = 0}^{N} C_{l, γ} |K^{(γ)} (\frac{ω - x - γ θ - 2 θ l)}{h})| . \end{matrix}

(35)

Then,

\begin{matrix} var [{\tilde{p}}_{+, h}^{(N)} (x)] \leq \frac{c_{7} {(2 θ)}^{2 γ}}{n h^{2 γ + 2}} & \sum_{l = 0}^{N} \sum_{j = 0}^{N} C_{l, γ} C_{j, γ} \int_{- \infty}^{+ \infty} | K^{(γ)} (\frac{ω - x - θ (γ + 2 l)}{h}) | \\ \times | K^{(γ)} & (\frac{ω - x - θ (γ + 2 j)}{h}) | f_{W} (ω) d ω . \end{matrix}

(36)

By supp

(K) \subseteq [- 1, 1]

, we have supp

[K^{(γ)} (\frac{ω - x - θ (γ + 2 l)}{h})] \subseteq [x + θ (γ + 2 l) - h, x + θ (γ + 2 l) + h]

. Denote

I_{+, l} (x) : = [x + θ (γ + 2 l) - h, x + θ (γ + 2 l) + h]

. If

h < θ

, the intervals

I_{+, l} (x)

and

I_{+, j} (x)

are disjointed for

l \neq j

. For an h that is small enough, we obtain

\begin{matrix} var [{\tilde{p}}_{+, h}^{(N)} (x)] & \leq \frac{c_{7} {(2 θ)}^{2 γ}}{n h^{2 γ + 2}} \sum_{l = 0}^{N} C_{l, γ}^{2} \int_{- \infty}^{+ \infty} {|K^{(γ)} (\frac{ω - x - θ (γ + 2 l)}{h})|}^{2} f_{W} (ω) d ω \\ \leq \frac{c_{8} {(2 θ)}^{2 γ}}{n h^{2 γ + 1}} \sum_{l = 0}^{N} \frac{C_{l, γ}^{2}}{h} \int_{I_{+, l} (x)} f_{W} (ω) d ω . \end{matrix}

(37)

Denote

ξ_{+, l} : = x + θ (γ + 2 l)

. By supp

(f_{δ}) \subseteq [- γ θ, γ θ]

and

f_{δ} \leq \frac{c_{9}}{θ}

,

\begin{matrix} \frac{1}{h} \int_{I_{+, l} (x)}^{} f_{W} (ω) d ω & \leq \frac{c_{9}}{θ h} \int_{I_{+, l} (x)}^{} [\int_{- γ θ}^{γ θ} f_{X} (ω - t) d t] d ω \\ = \frac{c_{9}}{θ h} \int_{- \infty}^{+ \infty} f_{X} (u) [\int_{- \infty}^{+ \infty} χ_{(u - γ θ, u + γ θ)} (ω) χ_{(ξ_{+, l} - h, ξ_{+, l} + h)} (ω) d ω] d u . \end{matrix}

Since

h < γ θ

, we have

\begin{matrix} \begin{matrix} \frac{1}{h} \int_{I_{+, l} (x)}^{} f_{W} (ω) d ω & \leq \frac{c_{9}}{θ} [\int_{- h}^{h} (1 + \frac{t}{h}) f_{X} (t + ξ_{+, l} - γ θ) d t + 2 \int_{h - γ θ}^{- h + γ θ} f_{X} (t + ξ_{+, l}) d t \\ + \int_{- h}^{h} (1 - \frac{t}{h}) f_{X} (t + ξ_{+, l} + γ θ) d t] \\ \leq \frac{c_{10}}{θ} [\int_{- h}^{h} f_{X} (t + x + 2 θ l) d t + \int_{- γ θ}^{γ θ} f_{X} (t + x + θ (γ + 2 l)) d t \\ + \int_{- h}^{h} f_{X} (t + x + 2 θ (γ + l)) d t] . \end{matrix} \end{matrix}

(38)

This, with (37), leads to

\begin{matrix} var [{\tilde{p}}_{+, h}^{(N)} (x)] & \leq \frac{c_{8} c_{10} {(2 θ)}^{2 γ}}{n h^{2 γ + 1}} \sum_{l = 0}^{N} \frac{C_{l, γ}^{2}}{θ} [\int_{- h}^{h} f_{X} (t + x + 2 θ l) d t + \int_{- γ θ}^{γ θ} f_{X} (t + x + θ (γ + 2 l)) d t \\ + \int_{- h}^{h} f_{X} (t + x + 2 θ (γ + l)) d t] . \end{matrix}

(39)

When

γ > 1

, we obtain

\begin{matrix} \sum_{l = 0}^{N} \frac{C_{l, γ}^{2}}{θ} \int_{- h}^{h} f_{X} (t + x + 2 θ l) d t & = \frac{1}{θ} \int_{x - h}^{x + h} f_{X} (t) d t + \sum_{l = 1}^{N} \frac{C_{l, γ}^{2}}{θ} \int_{x + 2 θ l - h}^{x + 2 θ l + h} f_{X} (t) d t \\ \leq \frac{1}{θ} + c_{11} B θ^{- 2 γ + 1} \end{matrix}

(40)

by

f_{X} \in M_{2 γ - 2} (B)

and similar arguments to [16]. Similarly,

\begin{matrix} \sum_{l = 0}^{N} \frac{C_{l, γ}^{2}}{θ} \int_{- γ θ}^{γ θ} f_{X} (t + x + θ (γ + 2 l)) d t \leq \frac{1}{θ} + c_{12} B θ^{- 2 γ + 1}, \end{matrix}

and

\begin{matrix} \sum_{l = 0}^{N} \frac{C_{l, γ}^{2}}{θ} \int_{- h}^{h} f_{X} (t + x + 2 θ (γ + l)) d t \leq \frac{1}{θ} + c_{13} B θ^{- 2 γ + 1} . \end{matrix}

When

γ = 1

, we have that

\begin{matrix} \sum_{l = 0}^{N} \frac{1}{θ} \int_{- h}^{h} f_{X} (t + x + 2 θ l) d t = \sum_{l = 0}^{N} \frac{1}{θ} \int_{x + 2 θ l - h}^{x + 2 θ l + h} f_{X} (t) d t \leq \frac{1}{θ} \int_{x - h}^{x + 2 θ N + h} f_{X} (t) d t \leq \frac{1}{θ} \end{matrix}

(41)

holds for

h < θ

. Similar to (41), for

h < θ

, we have

\begin{matrix} \sum_{l = 0}^{N} \frac{1}{θ} \int_{- θ}^{θ} f_{X} (t + x + θ (2 l + 1)) d t \leq \frac{1}{θ} and \sum_{l = 0}^{N} \frac{1}{θ} \int_{- h}^{h} f_{X} (t + x + 2 θ (l + 1)) d t \leq \frac{1}{θ} . \end{matrix}

Hence,

\begin{matrix} var [{\tilde{p}}_{+, h}^{(N)} (x)] \leq \frac{c_{14} {(2 θ)}^{2 γ}}{n h^{2 γ + 1}} (B θ^{- 2 γ + 1} + \frac{3}{θ}) \leq c_{15} (B θ + θ^{2 γ - 1}) {(n h^{2 γ + 1})}^{- 1} . \end{matrix}

(42)

Similar to estimate

var [{\tilde{p}}_{+, h}^{(N)} (x)]

, we have

var [{\tilde{f}}_{X, +, h}^{(N)} (x)] \leq \frac{1}{n} E [| L_{+, h}^{(N)} (W_{1} - x) |^{2}] \leq c_{16} (B θ + θ^{2 γ - 1}) {(n h^{2 γ + 1})}^{- 1} .

(43)

By (29), (32), (42) and (43) with (24)–(26), we obtain

\begin{matrix} P [| {\tilde{m}}_{+, h}^{(N)} {(x) - m (x) |}^{2} \geq C \cdot ε_{n}] \leq \frac{c_{17}}{C ε_{n}} [A^{2} h^{2 β} + \frac{B^{2}}{h^{2} N^{2 r}} + (B θ + θ^{2 γ - 1}) {(n h^{2 γ + 1})}^{- 1}] . \end{matrix}

(44)

Since

h = n^{\frac{- 1}{2 β + 2 γ + 1}}

and

N \geq n^{\frac{β + 1}{r (2 β + 2 γ + 1)}}

,

\begin{matrix} P [| {\tilde{m}}_{+, h}^{(N)} {(x) - m (x) |}^{2} \geq C \cdot ε_{n}] & \leq \frac{c_{17} (A^{2} + B^{2} + B θ + θ^{2 γ - 1})}{C ε_{n}} \cdot n^{\frac{- 2 β}{2 β + 2 γ + 1}} . \end{matrix}

Note that

ε_{n} = n^{\frac{- 2 β}{2 β + 2 γ + 1}}

. Then,

sup_{(m, f_{X}) \in P_{β, r; x}} P [| {\tilde{m}}_{+, h}^{(N)} {(x) - m (x) |}^{2} \geq C \cdot ε_{n}] \leq c_{17} (A^{2} + B^{2} + B θ + θ^{2 γ - 1}) C^{- 1} .

(45)

This leads to the result of Theorem 2 for

x \geq 0

.

(2)

We consider the estimator

{\tilde{m}}_{-, h}^{(N)} (x)

for

x < 0

. By (22), (23) and (28), we have

\begin{matrix} | E {\tilde{p}}_{-, h}^{(N)} {(x) - p (x) |}^{2} & \leq 2 (| \int_{- 1}^{1} K (y) [p (x + y h) - p (x)] d y |^{2} + | T_{-, \frac{N}{γ}} (p; x) |^{2}) \\ \leq c_{18} (A^{2} h^{2 β} + \frac{B^{2}}{h^{2} N^{2 r}}) . \end{matrix}

Similar arguments to (30)–(32) show

\begin{matrix} | E {\tilde{f}}_{X, -, h}^{(N)} (x) - f_{X} {(x) |}^{2} & \leq c_{19} (A^{2} h^{2 β} + \frac{B^{2}}{h^{2} N^{2 r}}) . \end{matrix}

Similar to (33),

\begin{matrix} var [{\tilde{p}}_{-, h}^{(N)} (x)] \leq \frac{c_{20}}{n} \int_{- \infty}^{+ \infty} | L_{-, h}^{(N)} (ω - x) |^{2} f_{W} (ω) d ω, \end{matrix}

and from (6),

\begin{matrix} L_{-, h}^{(N)} (t) = \frac{{(- 2 θ)}^{γ}}{h^{γ + 1}} \sum_{l_{1} = 0}^{\frac{N}{γ}} \dots \sum_{l_{γ} = 0}^{\frac{N}{γ}} K^{(γ)} (\frac{t + γ θ + 2 θ (l_{1} + \dots + l_{γ})}{h}) . \end{matrix}

Similar arguments to (34)–(37) show

\begin{matrix} var [{\tilde{p}}_{-, h}^{(N)} (x)] & \leq \frac{c_{20} {(2 θ)}^{2 γ}}{n h^{2 γ + 2}} \int_{- \infty}^{+ \infty} {[\sum_{l_{1} = 0}^{\frac{N}{γ}} \dots \sum_{l_{γ} = 0}^{\frac{N}{γ}} |K^{(γ)} (\frac{ω - x + γ θ + 2 θ (l_{1} + \dots + l_{γ})}{h})|]}^{2} \\ \times f_{W} (ω) d ω \\ \leq \frac{c_{21} {(2 θ)}^{2 γ}}{n h^{2 γ + 1}} \sum_{l = 0}^{N} \frac{C_{l, γ}^{2}}{h} \int_{I_{-, l} (x)} f_{W} (ω) d ω \end{matrix}

holds for an h that is small enough, where

I_{-, l} (x) : = [x - θ (γ + 2 l) - h, x - θ (γ + 2 l) + h]

. Denote

ξ_{-, l} : = x - θ (γ + 2 l)

. Similar to (38),

\begin{matrix} \begin{matrix} \frac{1}{h} \int_{I_{-, l} (x)}^{} f_{W} (ω) d ω & \leq \frac{c_{22}}{θ} [\int_{- h}^{h} (1 + \frac{t}{h}) f_{X} (t + ξ_{-, l} - γ θ) d t + 2 \int_{h - γ θ}^{- h + γ θ} f_{X} (t + ξ_{-, l}) d t \\ + \int_{- h}^{h} (1 - \frac{t}{h}) f_{X} (t + ξ_{-, l} + γ θ) d t] \\ \leq \frac{c_{23}}{θ} [\int_{- h}^{h} f_{X} (t + x - 2 θ (γ + l)) d t + \int_{- γ θ}^{γ θ} f_{X} (t + x - θ (γ + 2 l)) d t \\ + \int_{- h}^{h} f_{X} (t + x - 2 θ l)) d t] . \end{matrix} \end{matrix}

By similar arguments to (39)–(42), we have

\begin{matrix} var [{\tilde{p}}_{-, h}^{(N)} (x)] \leq c_{24} (B θ + θ^{2 γ - 1}) {(n h^{2 γ + 1})}^{- 1}, \end{matrix}

and

var [{\tilde{f}}_{X, -, h}^{(N)} (x)] \leq \frac{1}{n} E [| L_{-, h}^{(N)} (W_{1} - x) |^{2}] \leq c_{25} (B θ + θ^{2 γ - 1}) {(n h^{2 γ + 1})}^{- 1} .

Similar to (45),

sup_{(m, f_{X}) \in P_{β, r; x}} P [| {\tilde{m}}_{-, h}^{(N)} {(x) - m (x) |}^{2} \geq C \cdot ε_{n}] \leq c_{26} (A^{2} + B^{2} + B θ + θ^{2 γ - 1}) C^{- 1} .

This leads to the result of Theorem 2 for

x < 0

.

This completes the proof. □

Remark 2.

Our convergence rate is the same as that in the ordinary smoothness case of Meister [6], where the density function of the covariate error does not vanish in the Fourier domain. Compared to Delaigle and Meister [1], we do not assume

f_{X}

and m to be compact.

Remark 3.

Belomestny and Goldenshluger [16] consider the density deconvolution problem with non-standard error distributions. They assume the density function to be estimated satisfies the Hölder condition. It is natural to assume a local smooth condition in point estimation. Hence,

f_{X}

and

m f_{X}

are assumed to satisfy the local Hölder condition in our discussion.

Remark 4.

Theorem 1 shows the strong consistency of the regression estimator without the smoothness assumption. The main tool used is the Borel–Cantelli lemma which requires a convergent series. It is easy to see from (13) and (20) that the choice of h is not unique. Theorem 2 gives a weak convergence rate, which is defined by modifying the weak consistency. It is natural to assume the smoothness condition when discussing the convergence rate. In Theorem 2, the choice of h is related to the smoothness index β. It follows from our proof (44) that the choice of h is unique in the sense of a constant difference.

Remark 5.

In our discussion,

{\hat{f}}_{δ} (i v) = {[\frac{sinh (i θ v)}{i θ v}]}^{γ} = {[\frac{sin (θ v)}{θ v}]}^{γ}

. Substituting this into the proof of Theorem 3.5 in [6], one can obtain the optimality of convergence rate in our Theorem 2. This means that there does not exist an estimator

\tilde{m} (x)

of the regression function

m (x)

based on i.i.d data

(W_{1}, Y_{1}), \dots, (W_{n}, Y_{n})

generated by model (1) with (2), which satisfies

lim_{C \to \infty} (\underset{n \to \infty}{lim sup} sup_{(m, f_{X}) \in P_{β, r; x}} P [| \tilde{m} {(x) - m (x) |}^{2} \geq C \cdot \circ (n^{\frac{- 2 β}{2 β + 2 γ + 1}})]) = 0 .

It would be interesting to study the numerical illustration of our estimation. We shall investigate this in the future.

Author Contributions

Writing—original draft preparation, H.G. and Q.B.; Writing—review and editing, H.G. All authors have read and agreed to the published version of the manuscript.

Funding

This paper is supported by the National Natural Science Foundation of China (No. 12001132), the Guangxi Colleges and Universities Key Laboratory of Data Analysis and Computation, and the Center for Applied Mathematics of Guangxi (GUET).

Data Availability Statement

Not applicable.

Acknowledgments

The authors would like to thank the editor and reviewers for their important comments.

Conflicts of Interest

The authors declare no conflict of interest.

References

Delaigle, A.; Meister, A. Nonparametric function estimation under Fourier-oscillating noise. Stat. Sin. 2011, 21, 1065–1092. [Google Scholar] [CrossRef]
Dong, H.; Otsu, T.; Taylor, L. Bandwidth selection for nonparametric regression with errors-in-variables. Econom. Rev. 2023, 42, 393–419. [Google Scholar] [CrossRef]
Di Marzio, M.; Fensore, S.; Taylor, C.C. Kernel regression for errors-in-variables problems in the circular domain. Stat. Methods Appl. 2023. [Google Scholar] [CrossRef]
Fan, J.Q.; Truong, Y.K. Nonparametric regression with errors in variables. Ann. Stat. 1993, 21, 1900–1925. [Google Scholar] [CrossRef]
Hu, Z.R.; Ke, Z.T.; Liu, J.S. Measurement error models: From nonparametric methods to deep neural networks. Stat. Sci. 2022, 37, 473–493. [Google Scholar] [CrossRef]
Meister, A. Deconvolution Problems in Nonparametric Statistics; Springer: Berlin, Germany, 2009. [Google Scholar]
Song, W.X.; Ayub, K.; Shi, J.H. Extrapolation estimation for nonparametric regression with measurement error. Scand. J. Stat. 2023. [Google Scholar] [CrossRef]
Carroll, R.J.; Delaigle, A.; Hall, P. Non-parametric regression estimation from data contaminated by a mixture of Berkson and classical errors. J. R. Stat. Soc. Ser. B Stat. Methodol. 2007, 69, 859–878. [Google Scholar] [CrossRef] [PubMed]
Zhou, S.; Pati, D.; Wang, T.Y.; Yang, Y.; Carroll, R.J. Gaussian processes with errors in variables: Theory and computation. J. Mach. Learn. Res. 2023, 24, 1–53. [Google Scholar]
Delaigle, A.; Hall, P.; Jamshidi, F. Confidence bands in non-parametric errors-in-variables regression. J. R. Stat. Soc. Ser. B Stat. Methodol. 2015, 77, 149–169. [Google Scholar] [CrossRef]
Nadaraya, E.A. On estimating regression. Theory Probab. Its Appl. 1964, 9, 141–142. [Google Scholar] [CrossRef]
Watson, G.S. Smooth regression analysis. Sankhyā Indian J. Stat. 1964, 26, 359–372. [Google Scholar]
Guo, H.J.; Liu, Y.M. Strong consistency of wavelet estimators for errors-in-variables regression model. Ann. Inst. Stat. Math. 2017, 69, 121–144. [Google Scholar] [CrossRef]
Guo, H.J.; Liu, Y.M. Convergence rates of multivariate regression estimators with errors-in-variables. Numer. Funct. Anal. Optim. 2017, 38, 1564–1588. [Google Scholar] [CrossRef]
Guo, H.J.; Liu, Y.M. Regression estimation under strong mixing data. Ann. Inst. Stat. Math. 2019, 71, 553–576. [Google Scholar] [CrossRef]
Belomestny, D.; Goldenshluger, A. Density deconvolution under general assumptions on the distribution of measurement errors. Ann. Stat. 2021, 49, 615–649. [Google Scholar] [CrossRef]
Goldenshluger, A.; Kim, T. Density deconvolution with non-standard error distributions: Rates of convergence and adaptive estimation. Electron. J. Stat. 2021, 15, 3394–3427. [Google Scholar] [CrossRef]
Oppenheim, A.V.; Willsky, A.S.; Nawab, H.S. Signals & Systems, 2nd ed.; Prentice Hall: Upper Saddle River, NJ, USA, 1996. [Google Scholar]
Liu, Y.M.; Wu, C. Point-wise estimation for anisotropic densities. J. Multivar. Anal. 2019, 171, 112–125. [Google Scholar] [CrossRef]
Stein, E.M.; Shakarchi, R. Real Analysis: Measure Theory, Integration, and Hilbert Spaces; Princeton University Press: Princeton, NJ, USA, 2005. [Google Scholar]
Stanley, R.P. Enumerative Combinatorics; Cambridge University Press: Cambridge, UK, 1997; Volume 1. [Google Scholar]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Article Metrics

Citations

Article Access Statistics

Journal Statistics

Multiple requests from the same IP address are counted as one view.