Fractal Perturbation of the Nadaraya–Watson Estimator

Dah-Chin Luor; Chiao-Wen Liu

doi:10.3390/fractalfract6110680

and

¹

Department of Data Science and Analytics, School of Intelligent Science and Technology, I-Shou University, Dashu District, Kaohsiung City 84001, Taiwan

²

Department of Applied Science, School of Academic Studies, R.O.C. Naval Academy, Zuoying District, Kaohsiung City 813000, Taiwan

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Fractal Fract.2022, 6(11), 680;https://doi.org/10.3390/fractalfract6110680

This article belongs to the Special Issue Recent Advances in Fractal Interpolation Functions and Their Applications in AI

Version Notes

Order Reprints

Abstract

One of the main tasks in the problems of machine learning and curve fitting is to develop suitable models for given data sets. It requires to generate a function to approximate the data arising from some unknown function. The class of kernel regression estimators is one of main types of nonparametric curve estimations. On the other hand, fractal theory provides new technologies for making complicated irregular curves in many practical problems. In this paper, we are going to investigate fractal curve-fitting problems with the help of kernel regression estimators. For a given data set that arises from an unknown function m, one of the well-known kernel regression estimators, the Nadaraya–Watson estimator

\hat{m}

, is applied. We consider the case that m is Hölder-continuous of exponent

β

with

0 < β \leq 1

, and the graph of m is irregular. An estimation for the expectation of

| \hat{m} {- m |}^{2}

is established. Then a fractal perturbation

f_{[\hat{m}]}

corresponding to

\hat{m}

is constructed to fit the given data. The expectations of

| f_{[\hat{m}]} - \hat{m} |^{2}

and

| f_{[\hat{m}]} {- m |}^{2}

are also estimated.

Keywords:

kernel regression estimators; Nadaraya–Watson estimator; fractal interpolation; curve fitting

1. Introduction

One of the main tasks in the problems of machine learning, curve fitting, signal analysis, and many statistical applications is to develop suitable models for given data sets. In many real-world applications, it requires to generate a function to interpolate or to approximate the data arising from some unknown function. In data-fitting problems, interpolation is usually applied when the data are noise-free, and regression is considered if we have noisy observations.

The theory of nonparametric modeling of a regression has been developed by many researchers. Several types of estimators and their statistical properties have been studied in the literature. The class of kernel estimators is one of the main types of nonparametric curve estimations, and the Nadaraya–Watson estimator, the Priestley–Chao estimator, and the Gasser–Müller estimator are widely used in applications. See [1,2,3,4,5,6] and references given in these books. In [7,8], the authors investigated the differences between several types of kernel regression estimators, and there is no answer to which of these estimators is the best since each of them has advantages and disadvantages.

Fractal theory provides another technology for making complicated curves and fitting experimental data. A fractal interpolation function (FIF) is a continuous function interpolating a given set of points, and the graph of a FIF is the attractor of an iterated function system. The concept of FIFs was introduced by Barnsley ([9,10]), and it has been developed to be the basis of an approximation theory for nondifferentiable functions. FIFs can also be applied to model discrete sequences ([11,12,13]). Various types of FIFs and their approximation properties were discussed in [14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44], and the references given in the literature. See also the book [45] for recent developments. In [46,47,48,49,50], the construction of FIFs for random data sets is given, and some statistical properties of such FIFs were investigated. In [51], the authors made a topological–geometric contribution for the development and applications of fractal models, which present periodic changes.

For a given data set that arises from an unknown function m, the purpose of this paper is not to establish a fractal function that interpolates points in the data set, but we aim to find a fractal function that has good approximation for these data points. In [52], the authors trained SVM by the chosen training data and then applied the SVM model to calculate the interpolation points used to construct a linear FIF. In this paper, we consider the Nadaraya–Watson estimator

\hat{m}

for some sample data chosen from a given data set, and establish an estimation for the expectation of

| \hat{m} {- m |}^{2}

. Then a FIF

f_{[\hat{m}]}

corresponding to

\hat{m}

is constructed to fit the given data set, and the expectations of

| f_{[\hat{m}]} - \hat{m} |^{2}

and

| f_{[\hat{m}]} {- m |}^{2}

are also estimated.

Throughout this paper, let

D = {(t_{i}, y_{i}) \in R \times R : i = 0, 1, \dots, N}

be a given data set, where N is an integer greater than or equal to 2, and

t_{0} < t_{1} < \dots < t_{N}

. We take

t_{0} = 0

and

t_{N} = 1

for convenience. Let

I = [0, 1]

and

I_{i} = [t_{i - 1}, t_{i}]

for

i = 1, \dots, N

. Let

C [I]

denote the set of all real-valued continuous functions defined on I. The set of functions in

C [I]

that interpolate all points in

D

is denoted by

C_{D} [I]

. Define

{∥ f ∥}_{\infty} = {max}_{t \in I} | f (t) |

for

f \in C [I]

. It is known that

(C [I], ∥ \cdot ∥_{\infty})

is a Banach space, and

C_{D} [I]

is a complete metric space, where the metric is induced by

{∥ \cdot ∥}_{\infty}

.

2. Construction of Fractal Interpolation Functions

In this section, we establish a fractal perturbation of a given function in

C [I]

. The construction given here has been treated in the literature (see [47]). We show the details here to make our paper more self-contained.

Let

u \in C [I]

and

D = {(t_{i}, y_{i}) : y_{i} = u (t_{i}), i = 0, 1, \dots, N}

, where

0 = t_{0} < t_{1} < \dots < t_{N} = 1

. Assume that the data points in

D

are non-collinear. For

i = 1, \dots, N

, let

L_{i} : I \to I_{i}

be a homeomorphism such that

L_{i} (0) = t_{i - 1}

and

L_{i} (1) = t_{i}

. Define

M_{i} : I \times R \to R

by

M_{i} (t, y) = s_{i} y + u (L_{i} (t)) - s_{i} p (t),

(1)

where

- 1 < s_{i} < 1

and p is a continuous function on I such that

p (0) = u (0)

and

p (1) = u (1)

. Then

M_{i} (0, u (0)) = y_{i - 1}

,

M_{i} (1, u (1)) = y_{i}

, and

| M_{i} (t, y) - M_{i} (t, y^{*}) | = | s_{i} | | y - y^{*} | for all t \in I and y, y^{*} \in R .

(2)

Define

W_{i} : I \times R \to I_{i} \times R

by

W_{i} (t, y) = (L_{i} (t), M_{i} (t, y))

for

i = 1, \dots, N

. For

h \in C_{D} [I]

, let

G_{h} = {(t, h (t)) : t \in I}

. Then

W_{i} (G_{h}) = {(L_{i} (t), M_{i} (t, h (t))) : t \in I}

. Since

L_{i} : I \to I_{i}

is a homeomorphism,

W_{i} (G_{h})

can be written as

W_{i} (G_{h}) = {(t, M_{i} (L_{i}^{- 1} (t), h (L_{i}^{- 1} (t)))) : t \in I_{i}} .

Hence

W_{i} (G_{h})

is the graph of the continuous function

h_{i} : I_{i} \to R

defined by

h_{i} (t) = M_{i} (L_{i}^{- 1} (t), h (L_{i}^{- 1} (t)))

. Define a mapping

T : C_{D} [I] \to C_{D} [I]

by

T (h) (t) = h_{i} (t) = s_{i} h (L_{i}^{- 1} (t)) + u (t) - s_{i} p (L_{i}^{- 1} (t)), t \in I_{i} .

(3)

By

(3)

we see that, for

g, h \in C_{D} [I]

and

t \in I_{i}

,

| T (g) (t) - T (h) (t) | \leq | s_{i} | | g (L_{i}^{- 1} (t)) - h (L_{i}^{- 1} (t)) | .

Then

{∥ T (g) - T (h) ∥}_{\infty} \leq max_{i = 1, \dots, N} | s_{i} | \{max_{z \in I} | g (z) - h (z) |\} \leq s {∥ g - h ∥}_{\infty} .

Here

s = max {| s_{1} |, \dots, | s_{N} |}

. Since

0 \leq s < 1

, we have the following theorem ([47], Theorem 2.1).

Theorem 1.

The operator T given by

(3)

is a contraction mapping on

C_{D} [I]

.

Definition 1.

The fixed point

f_{[u]}

of T in

C_{D} [I]

is called a fractal interpolation function (FIF) on I corresponding to the continuous function u.

The FIF

f_{[u]}

given in Definition 1 satisfies the following equation for

i = 1, \dots, N

:

f_{[u]} (t) = s_{i} \{f_{[u]} (L_{i}^{- 1} (t)) - p (L_{i}^{- 1} (t))\} + u (t), t \in I_{i} .

(4)

If

s_{i} = 0

for all i, then

f_{[u]} = u

. Therefore,

f_{[u]}

can be treated as a fractal perturbation of u.

3. The Nadaraya–Watson Estimator

Let

D = {(t_{i}, y_{i}) \in R \times R : i = 0, 1, \dots, N}

be a given data set, where

0 = t_{0} < t_{1} < \dots < t_{N} = 1

. Suppose that

Y_{i} = m (t_{i}) + ϵ_{i}, for i = 0, 1, \dots, N,

(5)

where

m : [0, 1] \to R

is an unknown function, and each

y_{i}

is an observation of

Y_{i}

. Here, all

ϵ_{i}

are independent stochastic disturbance terms with zero expectation, E

[ϵ_{i}] = 0

, and finite variance, Var

[ϵ_{i}] \leq σ^{2} < \infty

. In this section, we consider the Nadaraya–Watson estimator

\hat{m}

for

D

and establish an estimation for the expectation of

| \hat{m} {- m |}^{2}

.

Consider the case that m is Hölder continuous of exponent

β

with

0 < β \leq 1

, and the graph of m is irregular. Then, m satisfies the inequality with

0 < β \leq 1

and

λ > 0

:

| m (t) - m (t^{'}) | \leq λ | t - t^{'} |^{β}, t, t^{'} \in I .

(6)

The Nadaraya–Watson estimator

\hat{m}

of m is defined by

\hat{m} (t) = \frac{\sum_{i = 0}^{N} k_{d} (t - t_{i}) Y_{i}}{\sum_{j = 0}^{N} k_{d} (t - t_{j})}, where k_{d} (z) = \frac{1}{d} k (\frac{z}{d}) .

(7)

Here

d > 0

is a bandwidth, and k is an integrable function defined on

R

.

The function k is called a kernel and is usually assumed to be bounded and satisfies some integrable conditions. Some widely used kernels are given in ([2], p. 41) and ([5], p. 3), and the estimations using different kernels are usually numerically similar (see [6]). In this paper, we assume that there are positive numbers

C_{1}

,

C_{2}

,

η

, and R such that the kernel k satisfies the condition

C_{1} χ_{[- η, η]} (z) \leq k (z) \leq C_{2} χ_{[- R, R]} (z), z \in R .

(8)

Condition (8) and its multidimensional form was considered in ([5], Theorem 1.7) and ([1], Theorem 5.1).

A new estimation for the bias of

\hat{m}

was obtained in [53]. Here, we give an estimation for E

[{(\hat{m} (t) - m (t))}^{2}]

in the following Theorem 2. Similar results were studied in [1,2,5], and other literature. The convergence rate of upper estimation obtained in Theorem 2 is the same as the known results.

The Nadaraya–Watson estimator

\hat{m}

given in (7) can be written in the form

\hat{m} (t) = \sum_{i = 0}^{N} W_{i} (t) Y_{i}, where W_{i} (t) = \frac{k_{d} (t - t_{i})}{\sum_{j = 0}^{N} k_{d} (t - t_{j})} .

(9)

Then

\sum_{i = 0}^{N} W_{i} (t) = 1

for all t and

E [\hat{m} (t)] = \sum_{i = 0}^{N} W_{i} (t) E [Y_{i}] = \sum_{i = 0}^{N} W_{i} (t) m (t_{i}) .

(10)

In the following lemma, we give a lower bound for

\sum_{j = 0}^{N} k_{d} (t - t_{j})

. Define

a_{N} = min_{1 \leq k \leq N} t_{k} - t_{k - 1}, A_{N} = max_{1 \leq k \leq N} t_{k} - t_{k - 1} .

(11)

Lemma 1.

Let

0 = t_{0} < t_{1} < \dots < t_{N} = 1

. Suppose that

k : R \to R

and there are positive numbers

C_{1}

and η such that

C_{1} χ_{[- η, η]} (z) \leq k (z)

for

z \in R

. Let

d > 0

and let

A_{N}

and

k_{d}

be defined in (11) and (7), respectively. Assume that

A_{N} < 2 d η

and

A_{N} \leq \frac{α}{N}

for some

α > 0

. Then for

0 \leq t \leq 1

,

\sum_{j = 0}^{N} k_{d} (t - t_{j}) \geq \frac{C_{1} η N}{α} .

(12)

Proof.

For

0 \leq t \leq 1

, the condition

C_{1} χ_{[- η, η]} (z) \leq k (z)

implies that

\sum_{j = 0}^{N} k_{d} (t - t_{j}) = \frac{1}{d} \sum_{j = 0}^{N} k (\frac{t - t_{j}}{d}) \geq \frac{C_{1}}{d} \sum_{j = 0}^{N} χ_{[- η, η]} (\frac{t - t_{j}}{d}) = \frac{C_{1}}{d} | E_{η} (t) |,

where

E_{η} (t) = {t_{j} : j = 0, 1, \dots, N, and | \frac{t - t_{j}}{d} | \leq η}

and

| E_{η} (t) |

is the number of elements of

E_{η} (t)

. Since

| \frac{t - t_{j}}{d} | \leq η

if and only if

t_{j} \in [t - d η, t + d η] \cap [0, 1]

, we have

E_{η} (t) = {t_{j} : j = 0, 1, \dots, N, and t_{j} \in [t - d η, t + d η] \cap [0, 1]

}.

For

t \in [d η, 1 - d η]

, we have

[t - d η, t + d η] \subseteq [0, 1]

, and by the condition

A_{N} < 2 d η

, we see that

| E_{η} (t) | \geq [\frac{2 d η}{A_{N}}] \geq 1

and this implies

| E_{η} (t) | \geq \frac{d η}{A_{N}}

. For

t \in [0, d η)

, we have

[t - d η, t + d η] \cap [0, 1] = [0, t + d η]

and

t_{0} = 0 \in E_{η} (t)

. Hence

| E_{η} (t) | \geq [\frac{t + d η}{A_{N}}] + 1 \geq 1

and

| E_{η} (t) | \geq \frac{d η}{A_{N}}

. For

t \in (1 - d η, 1]

, we have

[t - d η, t + d η] \cap [0, 1] = [t - d η, 1]

and

t_{N} = 1 \in E_{η} (t)

. Hence

| E_{η} (t) | \geq [\frac{1 - t + d η}{A_{N}}] + 1 \geq 1

and

| E_{η} (t) | \geq \frac{d η}{A_{N}}

. Then the condition

A_{N} \leq \frac{α}{N}

implies

(12)

. □

Theorem 2.

Let

D

be a given data set and assume that m satisfies

(6)

. Suppose that k satisfies

(8)

and

\hat{m}

is defined by

(7)

. Assume that

A_{N} < 2 d η

and

A_{N} \leq \frac{α}{N}

for some

α > 0

. Then we have

E [{(\hat{m} (t) - m (t))}^{2}] \leq λ^{2} R^{2 β} d^{2 β} + (\frac{α C_{2} σ^{2}}{C_{1} η}) \frac{1}{N d} .

(13)

Proof.

We see that

E [{(\hat{m} (t) - m (t))}^{2}] = {E [\hat{m} (t)] - m (t)}^{2} + E [\hat{m} {(t)}^{2}] - {(E [\hat{m} (t)])}^{2} .

(14)

By (6) and (9)–(10), we have

| E [\hat{m} (t)] - m (t) | = |\sum_{i = 0}^{N} W_{i} (t) (m (t_{i}) - m (t))| \leq λ \sum_{i = 0}^{N} W_{i} (t) {| t_{i} - t |}^{β} .

Condition

(8)

implies that

k (\frac{t - t_{i}}{d}) = 0

if

| \frac{t - t_{i}}{d} | > R

. Therefore,

| E [\hat{m} (t)] - m (t) | \leq λ d^{β} \frac{\sum_{i = 0}^{N} k (\frac{t - t_{i}}{d}) {| \frac{t - t_{i}}{d} |}^{β}}{\sum_{j = 0}^{N} k (\frac{t - t_{j}}{d})} \leq λ R^{β} d^{β} .

(15)

On the other hand, by

(8)

and

(12)

, we also have

sup_{i, t} W_{i} (t) = sup_{i, t} \frac{k (\frac{t - t_{i}}{d})}{\sum_{j = 0}^{N} k (\frac{t - t_{j}}{d})} \leq \frac{α C_{2}}{C_{1} η N d} .

(16)

By (9), (10) and (5), we have

E [\hat{m} {(t)}^{2}] - {(E [\hat{m} (t)])}^{2} = E [{(\hat{m} (t) - E [\hat{m} (t)])}^{2}] = E [{(\sum_{i = 0}^{N} W_{i} (t) ϵ_{i})}^{2}] .

Since all

ϵ_{i}

are independent and satisfy

E [ϵ_{i}] = 0

and

Var [ϵ_{i}] \leq σ^{2} < \infty

, the condition

\sum_{i = 0}^{N} W_{i} (t) = 1

and estimation

(16)

imply that

E [\hat{m} {(t)}^{2}] - {(E [\hat{m} (t)])}^{2} = \sum_{i = 0}^{N} W_{i} {(t)}^{2} E [ϵ_{i}^{2}] \leq σ^{2} (sup_{i, t} W_{i} (t)) \sum_{i = 0}^{N} W_{i} (t) \leq (\frac{α C_{2} σ^{2}}{C_{1} η}) \frac{1}{N d} .

Then by (14) and (15), we have (13). □

For a given kernel k which satisfies

(8)

, estimation

(13)

shows that

C_{1}

and

η

should be chosen so that

C_{1} η

is as large as possible. The minimizer

d^{*}

with respect to d of the right-hand side of

(13)

can be obtained by setting

E (d) = λ^{2} R^{2 β} d^{2 β} + (\frac{α C_{2} σ^{2}}{C_{1} η N}) d^{- 1}

, and then solve the equation

E^{'} (d) = (2 β) λ^{2} R^{2 β} d^{2 β - 1} - (\frac{α C_{2} σ^{2}}{C_{1} η N}) d^{- 2} = 0 .

We have

d^{*} = {(\frac{α C_{2} σ^{2}}{2 β C_{1} η λ^{2} R^{2 β}})}^{\frac{1}{2 β + 1}} N^{\frac{- 1}{2 β + 1}}

(17)

and the upper estimate given in

(13)

can be reduced to

C^{*} N^{- 2 β / (2 β + 1)}

, where

C^{*}

depends on

α

,

β

,

λ

,

σ^{2}

,

η

, R,

C_{1}

, and

C_{2}

.

4. Fractal Perturbation of the Nadaraya–Watson Estimator

In this section, we consider FIFs

f_{[\hat{m}]}

corresponding to the function

\hat{m}

and we establish estimations for the expectation of

| f_{[\hat{m}]} - \hat{m} |^{2}

and

| f_{[\hat{m}]} {- m |}^{2}

. Suppose that k is continuous and we replace each

Y_{i}

in

(7)

by

y_{i}

. Then

\hat{m} \in C [I]

. By the construction given in Section 2 with

u = \hat{m}

, we have a FIF

f_{[\hat{m}]}

on I that satisfies the equation for

i = 1, \dots, N

:

f_{[\hat{m}]} (t) = s_{i} \{f_{[\hat{m}]} (L_{i}^{- 1} (t)) - p (L_{i}^{- 1} (t))\} + \hat{m} (t), t \in I_{i} .

(18)

Here, p is chosen to be the linear polynomial such that

p (0) = \hat{m} (0)

and

p (1) = \hat{m} (1)

. Then we replace

y_{i}

by

Y_{i}

for each i and consider

f_{[\hat{m}]} (t)

a random variable for every

t \in I

. We are interested in estimations for

∥ E [| f_{[\hat{m}]} {- m |}^{2} {] ∥}_{\infty}

.

Theorem 3.

Suppose that k is continuous and k satisfies

(8)

with

R = 1

and

C_{2} = 1

. Suppose that m satisfies

(6)

and

\hat{m}

is defined by

(7)

. Let

M = max {| m (t_{i}) | : i = 0, 1, \dots, N}

. Assume that

A_{N} < 2 d η

,

A_{N} \leq \frac{α}{N}

, and

a_{N} \geq \frac{τ}{N}

for some

α > 0

and

τ > 0

, where

A_{N}

and

a_{N}

are defined in

(11)

. Suppose that

0 < s = max {| s_{1} |, \dots, | s_{N} |} < 2^{- 1 / 2}

and

∥ E [| f_{[\hat{m}]} - \hat{m} |^{2} {] ∥}_{\infty} < \infty

. Then we have

\begin{matrix} ∥ E [| f_{[\hat{m}]} - \hat{m} |^{2} {] ∥}_{\infty} & \leq (\frac{72 s^{2} α^{2} (M^{2} + σ^{2})}{(1 - 2 s^{2}) C_{1}^{2} η^{2} τ^{2}}) \frac{{(N d + τ)}^{2}}{{(N d)}^{2}}, \end{matrix}

(19)

\begin{matrix} ∥ E [| f_{[\hat{m}]} {- m |}^{2} {] ∥}_{\infty} & \leq (\frac{144 s^{2} α^{2} (M^{2} + σ^{2})}{(1 - 2 s^{2}) C_{1}^{2} η^{2} τ^{2}}) \frac{{(N d + τ)}^{2}}{{(N d)}^{2}} + 2 λ^{2} d^{2 β} + (\frac{2 α σ^{2}}{C_{1} η}) \frac{1}{N d} . \end{matrix}

(20)

Proof.

For

t \in I_{i}

,

(18)

implies

| f_{[\hat{m}]} (t) - \hat{m} {(t) |}^{2} \leq 2 s_{i}^{2} \{| f_{[\hat{m}]} (L_{i}^{- 1} (t)) - \hat{m} (L_{i}^{- 1} (t)) |^{2} + {| \hat{m} (L_{i}^{- 1} (t)) - p (L_{i}^{- 1} (t)) |}^{2}\},

and we have

sup_{t \in I_{i}} E [| f_{[\hat{m}]} (t) - \hat{m} (t) |^{2}] \leq 2 s_{i}^{2} (sup_{z \in I} E [| f_{[\hat{m}]} (z) - \hat{m} {(z) |}^{2}] + sup_{z \in I} E [| \hat{m} (z) - p (z) |^{2}]) .

Then

∥ E [| f_{[\hat{m}]} - \hat{m} |^{2} {] ∥}_{\infty} \leq 2 s^{2} {∥ E [| f_{[\hat{m}]} - \hat{m} |^{2} {] ∥}_{\infty} + ∥ E [| \hat{m} {- p |}^{2} {] ∥}_{\infty}}

and therefore

∥ E [| f_{[\hat{m}]} - \hat{m} |^{2} {] ∥}_{\infty} \leq \frac{2 s^{2}}{1 - 2 s^{2}} ∥ E [| \hat{m} {- p |}^{2} {] ∥}_{\infty} .

(21)

Since p is the linear polynomial with

p (0) = \hat{m} (0)

and

p (1) = \hat{m} (1)

, we have

p (t) = \hat{m} (0) + (\hat{m} (1) - \hat{m} (0)) t, t \in I,

(22)

and then

| \hat{m} (t) - p (t) | = | (\hat{m} (t) - \hat{m} (0)) (1 - t) + (\hat{m} (t) - \hat{m} (1)) t | .

The convexity of the square function

x \mapsto x^{2}

implies that

| \hat{m} {(t) - p (t) |}^{2} \leq (1 - t) | \hat{m} (t) - \hat{m} {(0) |}^{2} + t {| \hat{m} (t) - \hat{m} (1) |}^{2}

and therefore

E [| \hat{m} {(t) - p (t) |}^{2}] \leq (1 - t) E [| \hat{m} (t) - \hat{m} {(0) |}^{2}] + t E [| \hat{m} (t) - \hat{m} (1) |^{2}], t \in I .

(23)

By

(9)

,

\hat{m} (t) - \hat{m} (1) = \sum_{r = 0}^{N} (W_{r} (t) - W_{r} (1)) Y_{r}

. By

(8)

with

R = 1

, we see that if

t_{r} < 1 - d

, then

\frac{1 - t_{r}}{d} > 1

and

k (\frac{1 - t_{r}}{d}) = 0

. This implies

W_{r} (1) = 0

. For

t \in I

, if

t_{r} \notin [t - d, t + d]

, then

| \frac{t - t_{r}}{d} | > 1

and

k (\frac{t - t_{r}}{d}) = 0

. This implies

W_{r} (t) = 0

. Then

\hat{m} (t) - \hat{m} (1) = \sum_{r \in B_{t}} (W_{r} (t) - W_{r} (1)) Y_{r},

(24)

where

B_{t} = {r : t_{r} \in [t - d, t + d] or t_{r} \in [1 - d, 1]}

. Let

ξ = [\frac{d}{a_{N}}]

. Then the number of elements in

B_{t}

is less than

3 (ξ + 1)

.

By

(12)

and

(8)

with

C_{2} = 1

, we have

| W_{r} (t) - W_{r} (1) | \leq \frac{k_{d} (t - t_{r})}{\sum_{j = 0}^{N} k_{d} (t - t_{j})} + \frac{k_{d} (1 - t_{r})}{\sum_{j = 0}^{N} k_{d} (1 - t_{j})} \leq \frac{2 α}{C_{1} η N d} .

(25)

By

(5)

we also have

E [Y_{r}^{2}] = m {(t_{r})}^{2} + σ^{2}

for

r = 0, 1, \dots, N

. Condition

(6)

shows that m is continuous and therefore m is bounded on I. Then for

t \in I

,

\begin{matrix} E [| \hat{m} (t) - \hat{m} (1) |^{2}] & \leq \{\sum_{r \in B_{t}} {(W_{r} (t) - W_{r} (1))}^{2}\} \{\sum_{r \in B_{t}} E [Y_{r}^{2}]\} \\ \leq {(\frac{2 α}{C_{1} η N d})}^{2} (M^{2} + σ^{2}) {(3 ξ + 3)}^{2} . \end{matrix}

We also have the same estimate for

E [| \hat{m} (t) - \hat{m} (0) |^{2}]

.

By the condition

a_{N} \geq \frac{τ}{N}

, we have

ξ \leq \frac{d}{a_{N}} \leq \frac{N d}{τ}

, and then

(23)

can be reduced to

E [| \hat{m} (t) - p (t) |^{2}] \leq (\frac{36 α^{2} (M^{2} + σ^{2})}{C_{1}^{2} η^{2} τ^{2}}) \frac{{(N d + τ)}^{2}}{{(N d)}^{2}}, t \in I .

(26)

Thus,

(19)

can be obtained by

(21)

and

(26)

. Moreover, we have

(20)

by

(13)

,

(19)

, and the inequality

∥ E [| f_{[\hat{m}]} {- m |}^{2} {] ∥}_{\infty} \leq 2 ∥ E [| f_{[\hat{m}]} - \hat{m} |^{2} {] ∥}_{\infty} + 2 ∥ E [| \hat{m} {- m |}^{2} {] ∥}_{\infty} .

□

For a given kernel k, which satisfies condition

(8)

, estimation

(20)

shows that

C_{1}

and

η

should be chosen so that

C_{1} η

is as large as possible. If we choose

d = d^{*}

, where

d^{*}

is given by

(17)

with

C_{2} = 1

and

R = 1

, then

(20)

can be reduced to

∥ E [| f_{[\hat{m}]} {- m |}^{2} {] ∥}_{\infty} \leq A {(1 + D N^{\frac{- 2 β}{2 β + 1}})}^{2} + C^{*} N^{\frac{- 2 β}{2 β + 1}},

(27)

where

A = \frac{144 s^{2} α^{2} (M^{2} + σ^{2})}{(1 - 2 s^{2}) C_{1}^{2} η^{2} τ^{2}}

and D depends on

λ

,

α

,

β

,

C_{1}

,

η

,

τ

,

σ^{2}

, and

C^{*}

depends on

λ

,

α

,

β

,

C_{1}

,

η

,

σ^{2}

. Moreover, the constant M can be estimated by

\tilde{M} = max {| y_{0} |, | y_{1} |, \dots, | y_{N} |}

.

The right-hand side of

(27)

tends to A when

N \to \infty

. In fact, if d is chosen so that

d \to 0

and

N d \to \infty

as

N \to \infty

, then the right-hand side of

(20)

tends to A as

N \to \infty

. Moreover,

A \to 0

as

s \to 0

.

Example 1.

The data set we used in this example is the Crude Oil WTI Futures daily highest price from 2021/7/19 to 2022/8/17. These data are opened and they can be obtained from the website https://www.investing.com/commodities/crude-oil-historical-data. There are 287 raw data and we chose 11 data as our sample subset

S

. These data points are shown in Figure 1. We set

S = {(t_{i}, w_{i}) : i = 0, 1, \dots, 10}

, where

t_{i} = \frac{i}{10}

and

w_{i}

are the Crude Oil WTI Futures daily highest prices in 2021/7/19, 8/26, 10/6, 11/16, 12/28, 2022/2/2, 3/9, 4/19, 5/30, 7/5, and 8/17.

Figure 1. Raw data and sample data.

Let

\hat{m}

be defined by

(7)

, with each

Y_{i}

being replaced by

w_{i}

,

\hat{m} (t) = \frac{\sum_{i = 0}^{10} k (\frac{t - 0.1 \times i}{d}) w_{i}}{\sum_{j = 0}^{10} k (\frac{t - 0.1 \times j}{d})},

(28)

and choose k to be the Epanechnikov kernel

k (t) = 0.75 (1 - t^{2}) χ_{{| t | \leq 1}}

. Let

N = 10

and choose

R = 1

,

C_{2} = 1

,

η = \frac{1}{\sqrt{3}}

,

C_{1} = 0.5

in

(8)

. We estimate M by

max {w_{0}, w_{1}, \dots, w_{10}}

, and set

α = 1

and

τ = 1

in Theorem 3. Assume that

β = 0.5

in this example. The values of

σ^{2}

and λ are estimated by the sample variance and

max \{\frac{| w_{i} - w_{j} |}{\sqrt{| t_{i} - t_{j} |}} : i, j = 0, 1, \dots, 10, i \neq j\}

, respectively. By

(17)

, we set

d = 0.092

.

We construct a FIF

f_{[\hat{m}]}

by the method given in Section 2 with linear functions

L_{i}

and the linear polynomial p such that

L_{i} (0) = \frac{i - 1}{10}

,

L_{i} (1) = \frac{i}{10}

, and

p (0) = \hat{m} (0)

,

p (1) = \hat{m} (1)

. The chosen values

s_{1}, \dots, s_{10}

are given in Table 1.

Table 1. The values of

s_{k}

.

The graphs of raw data and

\hat{m}

are shown in Figure 2. The graphs of

\hat{m}

and

f_{[\hat{m}]}

are shown in Figure 3. The graphs of raw data and

f_{[\hat{m}]}

are shown in Figure 4.

Figure 2. Raw data and

\hat{m}

.

Figure 3.

\hat{m}

and

f_{[\hat{m}]}

.

Figure 4. Raw data and

f_{[\hat{m}]}

.

5. Conclusions

The purpose of this paper is to construct a fractal interpolation function (FIF) that has good approximation for a given data set. We consider the Nadaraya–Watson estimator

\hat{m}

for some sample data chosen from a given data set, and then apply

\hat{m}

to construct a FIF

f_{[\hat{m}]}

to fit the given set of data points. The Nadaraya–Watson estimator is widely used in data-fitting problems, and its fractal perturbation is considered in our paper. The expectations of mean squared errors of such approximation are also estimated. By the figures given in Example 1, we may see the quality of curve fitting by a FIF, which is constructed from

\hat{m}

with 11 sample points to fit the 287 raw data points. We see that the error of approximation can be decreased by choosing more sample data.

In this paper, we construct a FIF to fit a given data set with the help of the Nadaraya–Watson estimator. In fact, the Priestley–Chao estimator, the Gasser–Müller estimator, and other types of kernel regression estimators can also be used in our approach. Nonparametric regression has been studied for a long time. Several types of models with their theoretical results and applications are widely developed by many researchers. Fractal perturbations of these models are worth investigating in the field of fractal curve fitting.

Author Contributions

Conceptualization, D.-C.L.; methodology, D.-C.L.; software, C.-W.L.; validation, D.-C.L. and C.-W.L.; formal analysis, D.-C.L.; investigation, C.-W.L.; resources, C.-W.L.; data curation, C.-W.L.; writing—original draft preparation, C.-W.L.; writing—review and editing, D.-C.L.; project administration, D.-C.L.; funding acquisition, D.-C.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Ministry of Science and Technology, R.O.C. grant number MOST 110-2115-M-214-002.

Data Availability Statement

The data set used in this paper can be obtained in the webpage https://www.investing.com/commodities/crude-oil-historical-data.

Conflicts of Interest

The authors declare no conflict of interest.

References

Györfi, L.; Kohler, M.; Krzyzak, A.; Walk, H. A Distribution-Free Theory of Nonparametric Regression; Springer: New York, NY, USA, 2002. [Google Scholar]
Härdle, W.; Müller, M.; Sperlich, S.; Werwatz, A. Nonparametric and Semiparametric Models; Springer: New York, NY, USA, 2004. [Google Scholar]
Hart, J.D. Nonparametric Smoothing and Lack-of-Fit Tests; Springer: New York, NY, USA, 1997. [Google Scholar]
Li, Q.; Racine, J.S. Nonparametric Econometrics; Princeton University Press: Mercer County, NJ, USA, 2007. [Google Scholar]
Tsybakov, A.B. Introduction to Nonparametric Estimation; Springer: New York, NY, USA, 2009. [Google Scholar]
Wasserman, L. All of Nonparametric Statistics; Springer: New York, NY, USA, 2006. [Google Scholar]
Chu, C.-K.; Marron, J.S. Choosing a kernel regression estimator. Stat. Sci. 1991, 6, 404–436. [Google Scholar] [CrossRef]
Jones, M.C.; Davies, S.J.; Park, B.U. Versions of kernel-type regression estimators. J. Amer. Statist. Assoc. 1994, 89, 825–832. [Google Scholar] [CrossRef]
Barnsley, M.F. Fractal functions and interpolation. Constr. Approx. 1986, 2, 303–329. [Google Scholar] [CrossRef]
Barnsley, M.F. Fractals Everywhere; Academic Press: Orlando, FL, USA, 1988. [Google Scholar]
Marvasti, M.A.; Strahle, W.C. Fractal geometry analysis of turbulent data. Signal Process. 1995, 41, 191–201. [Google Scholar] [CrossRef]
Mazel, D.S. Representation of discrete sequences with three-dimensional iterated function systems. IEEE Trans. Signal Process. 1994, 42, 3269–3271. [Google Scholar] [CrossRef]
Mazel, D.S.; Hayes, M.H. Using iterated function systems to model discrete sequences. IEEE Trans. Signal Process. 1992, 40, 1724–1734. [Google Scholar] [CrossRef]
Balasubramani, N. Shape preserving rational cubic fractal interpolation function. J. Comput. Appl. Math. 2017, 319, 277–295. [Google Scholar] [CrossRef]
Balasubramani, N.; Guru Prem Prasad, M.; Natesan, S. Shape preserving α-fractal rational cubic splines. Calcolo 2020, 57, 21. [Google Scholar] [CrossRef]
Barnsley, M.F.; Elton, J.; Hardin, D.; Massopust, P. Hidden variable fractal interpolation functions. SIAM J. Math. Anal. 1989, 20, 1218–1242. [Google Scholar] [CrossRef]
Barnsley, M.F.; Massopust, P.R. Bilinear fractal interpolation and box dimension. J. Approx. Theory 2015, 192, 362–378. [Google Scholar] [CrossRef]
Chand, A.K.B.; Kapoor, G.P. Generalized cubic spline fractal interpolation functions. SIAM J. Numer. Anal. 2006, 44, 655–676. [Google Scholar] [CrossRef]
Chand, A.K.B.; Navascués, M.A. Natural bicubic spline fractal interpolation. Nonlinear Anal. 2008, 69, 3679–3691. [Google Scholar] [CrossRef]
Chand, A.K.B.; Navascués, M.A. Generalized Hermite fractal interpolation. Rev. Real Acad. Cienc. Zaragoza 2009, 64, 107–120. [Google Scholar]
Chand, A.K.B.; Tyada, K.R. Constrained shape preserving rational cubic fractal interpolation functions. Rocky Mt. J. Math. 2018, 48, 75–105. [Google Scholar] [CrossRef]
Chand, A.K.B.; Vijender, N.; Viswanathan, P.; Tetenov, A.V. Affine zipper fractal interpolation functions. BIT Numer. Math. 2020, 60, 319–344. [Google Scholar] [CrossRef]
Chand, A.K.B.; Viswanathan, P. A constructive approach to cubic Hermite fractal interpolation function and its constrained aspects. BIT Numer. Math. 2013, 53, 841–865. [Google Scholar] [CrossRef]
Chandra, S.; Abbas, S.; Verma, S. Bernstein super fractal interpolation function for countable data systems. Numer. Algorithms 2022. [Google Scholar] [CrossRef]
Dai, Z.; Wang, H.-Y. Construction of a class of weighted bivariate fractal interpolation functions. Fractals 2022, 30, 2250034. [Google Scholar] [CrossRef]
Katiyar, S.K.; Chand, A.K.B. Shape preserving rational quartic fractal functions. Fractals 2019, 27, 1950141. [Google Scholar] [CrossRef]
Katiyar, S.K.; Chand, A.K.B.; Kumar, G.S. A new class of rational cubic spline fractal interpolation function and its constrained aspects. Appl. Math. Comput. 2019, 346, 319–335. [Google Scholar] [CrossRef]
Luor, D.-C. Fractal interpolation functions with partial self similarity. J. Math. Anal. Appl. 2018, 464, 911–923. [Google Scholar] [CrossRef]
Massopust, P.R. Fractal Functions, Fractal Surfaces, and Wavelets; Academic Press: San Diego, CA, USA, 1994. [Google Scholar]
Massopust, P.R. Interpolation and Approximation with Splines and Fractals; Oxford University Press: New York, NY, USA, 2010. [Google Scholar]
Miculescu, R.; Mihail, A.; Pacurar, C.M. A fractal interpolation scheme for a possible sizeable set of data. J. Fractal Geom. 2022. [Google Scholar] [CrossRef]
Navascués, M.A. Fractal approximation. Complex Anal. Oper. Theory 2010, 4, 953–974. [Google Scholar] [CrossRef]
Navascués, M.A. Fractal bases of L_p spaces. Fractals 2012, 20, 141–148. [Google Scholar] [CrossRef]
Navascués, M.A.; Chand, A.K.B. Fundamental sets of fractal functions. Acta Appl. Math. 2008, 100, 247–261. [Google Scholar] [CrossRef]
Navascués, M.A.; Pacurar, C.; Drakopoulos, V. Scale-free fractal interpolation. Fractal Fract. 2022, 6, 602. [Google Scholar] [CrossRef]
Prasad, S.A. Super coalescence hidden-variable fractal interpolation functions. Fractals 2021, 29, 2150051. [Google Scholar] [CrossRef]
Ri, S.; Drakopoulos, V. Generalized fractal interpolation curved lines and surfaces. Nonlinear Stud. 2021, 28, 427–488. [Google Scholar]
Tyada, K.R.; Chand, A.K.B.; Sajid, M. Shape preserving rational cubic trigonometric fractal interpolation functions. Math. Comput. Simul. 2021, 190, 866–891. [Google Scholar] [CrossRef]
Vijender, N. Fractal perturbation of shaped functions: Convergence independent of scaling. Mediterr. J. Math. 2018, 15, 211. [Google Scholar] [CrossRef]
Viswanathan, P. A revisit to smoothness preserving fractal perturbation of a bivariate function: Self-Referential counterpart to bicubic splines. Chaos Solitons Fractals 2022, 157, 111885. [Google Scholar] [CrossRef]
Viswanathan, P.; Chand, A.K.B. Fractal rational functions and their approximation properties. J. Approx. Theory 2014, 185, 31–50. [Google Scholar] [CrossRef]
Viswanathan, P.; Chand, A.K.B. α-fractal rational splines for constrained interpolation. Electron. Trans. Numer. Anal. 2014, 41, 420–442. [Google Scholar]
Viswanathan, P.; Navascués, M.A.; Chand, A.K.B. Associate fractal functions in L^p-spaces and in one-sided uniform approximation. J. Math. Anal. Appl. 2016, 433, 862–876. [Google Scholar] [CrossRef]
Wang, H.-Y.; Yu, J.-S. Fractal interpolation functions with variable parameters and their analytical properties. J. Approx. Theory 2013, 175, 1–18. [Google Scholar] [CrossRef]
Banerjee, S.; Gowrisankar, A. Frontiers of Fractal Analysis Recent Advances and Challenges; CRC Press: Boca Raton, FL, USA, 2022. [Google Scholar]
Kumar, M.; Upadhye, N.S.; Chand, A.K.B. Linear fractal interpolation function for data set with random noise. Fractals, 2022; accepted. [Google Scholar] [CrossRef]
Luor, D.-C. Fractal interpolation functions for random data sets. Chaos Solitons Fractals 2018, 114, 256–263. [Google Scholar] [CrossRef]
Luor, D.-C. Statistical properties of linear fractal interpolation functions for random data sets. Fractals 2018, 26, 1850009. [Google Scholar] [CrossRef]
Luor, D.-C. Autocovariance and increments of deviation of fractal interpolation functions for random datasets. Fractals 2018, 26, 1850075. [Google Scholar] [CrossRef]
Luor, D.-C. On the distributions of fractal functions that interpolate data points with Gaussian noise. Chaos Solitons Fractals 2020, 135, 109743. [Google Scholar] [CrossRef]
Caldarola, F.; Maiolo, M. On the topological convergence of multi-rule sequences of sets and fractal patterns. Soft Comput. 2020, 24, 17737–17749. [Google Scholar] [CrossRef]
Wang, H.-Y.; Li, H.; Shen, J.-Y. A novel hybrid fractal interpolation-SVM model for forecasting stock price indexes. Fractals 2019, 27, 1950055. [Google Scholar] [CrossRef]
Tosatto, S.; Akrour, R.; Peters, J. An upper bound of the bias of Nadaraya-Watson kernel regression under Lipschitz assumptions. Stats 2021, 4, 1–17. [Google Scholar] [CrossRef]

$Fractalfract 06 00680 g001$

Figure 1. Raw data and sample data.

$Fractalfract 06 00680 g002$

Figure 2. Raw data and

\hat{m}

.

$Fractalfract 06 00680 g003$

Figure 3.

\hat{m}

and

f_{[\hat{m}]}

.

$Fractalfract 06 00680 g004$

Figure 4. Raw data and

f_{[\hat{m}]}

.

Table 1. The values of

s_{k}

.

Table 1. The values of

s_{k}

.

k	1	2	3	4	5	6	7	8	9	10
$s_{k}$	0.02	−0.03	0.08	−0.16	0.05	−0.26	−0.36	−0.06	−0.14	0.06

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Fractal Perturbation of the Nadaraya–Watson Estimator

Abstract

1. Introduction

2. Construction of Fractal Interpolation Functions

3. The Nadaraya–Watson Estimator

4. Fractal Perturbation of the Nadaraya–Watson Estimator

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics