Some Upper Bounds for RKHS Approximation by Bessel Functions

Tian, Mingdang; Sheng, Baohuai; Wang, Shuhua

doi:10.3390/axioms11050233

by Mingdang Tian ¹, Baohuai Sheng ¹,* and Shuhua Wang ²

¹ Department of Economic Statistics, School of International Business, Zhejiang Yuexiu University, Shaoxing 312000, China

² School of Information Engineering, Jingdezhen Ceramic University, Jingdezhen 333403, China

* Author to whom correspondence should be addressed.

Axioms 2022, 11(5), 233; https://doi.org/10.3390/axioms11050233

Submission received: 19 April 2022 / Revised: 8 May 2022 / Accepted: 9 May 2022 / Published: 17 May 2022

(This article belongs to the Special Issue Numerical Computation, Approximation of Functions and Applied Mathematics)

Abstract

A reproducing kernel Hilbert space (RKHS) approximation problem arising from learning theory is investigated. Some K-functionals and moduli of smoothness with respect to RKHSs are defined with Fourier–Bessel series and Fourier–Bessel transforms, respectively. Their equivalence is shown, with which an upper bound estimate for the best RKHS approximation is provided. The convergence rate is bounded with the defined modulus of smoothness, which shows that the RKHS approximation can attain the same approximation ability as that of the Fourier–Bessel series and the Fourier–Bessel transform. In particular, it is shown that for an RKHS produced by the Bessel operator, the convergence rate reduces to the bound of a corresponding convolution operator approximation. The investigations show some new applications of Bessel functions. The results obtained can be used to bound the approximation error in learning theory.

Keywords:

Bessel function; Fourier–Bessel series; Fourier–Bessel transform; K-functional; modulus of smoothness; semigroup of operators; reproducing kernel Hilbert space (RKHS); best approximation error; learning theory

1. Introduction

The error analysis in learning theory shows that the learning rate of the kernel regularized regression depends upon the approximation ability of the kernel function spaces (see, for example, [1,2,3]).

Let X be a complete metric space and

μ

be a Borel measure on X. Denote by

L_{μ}^{2} (X)

the Hilbert space consisting of (real) square integrable functions with the inner product

{〈 f, g 〉}_{L_{μ}^{2} (X)} = \int_{X} f (x) g (x) d μ (x), f, g \in L_{μ}^{2} (X) .

Suppose that

K : X \times X \to R = (- \infty, + \infty)

is continuous, symmetric and strictly positive definite, i.e., for any integer

m \geq 1

and any finite set

\{x_{1}, x_{2}, \dots, x_{m}\} \subset X,

the matrix

{(K (x_{i}, x_{j}))}_{i, j = 1}^{m}

is positive definite.

Assume that

K \in L_{μ \times μ}^{2} (X \times X)

, i.e.,

\int_{X} \int_{X} {|K (x, t)|}^{2} d μ (x) d μ (t) < + \infty .

Then the linear operator

L_{K} : L_{μ}^{2} (X) \to L_{μ}^{2} (X)

defined by

\begin{matrix} L_{K} (f, x) = \int_{X} K (x, t) f (t) d μ (t), x \in X \end{matrix}

(1)

is positive, and its range lies in

C (X)

. Take

L_{K}^{\frac{1}{2}}

to be the linear operator on

L_{μ}^{2} (X)

satisfying

L_{K}^{\frac{1}{2}} \circ L_{K}^{\frac{1}{2}} = L_{K}

and

L_{K}^{- \frac{1}{2}}

, the inverse of

L_{K}^{\frac{1}{2}}

. Additionally, define

H_{K} = L_{K}^{\frac{1}{2}} (L_{μ}^{2} (X))

. Then

(H_{K}, ∥ \cdot ∥_{H_{K}})

is a reproducing kernel Hilbert space associated with

K_{x} (y) = K (x, y)

, i.e., (see [1,4,5,6,7]),

\begin{matrix} f (x) = {〈 f, K_{x} 〉}_{H_{K}}, f \in H_{K}, x \in X, \end{matrix}

(2)

where the inner product

{〈 \cdot, \cdot 〉}_{H_{K}}

is induced by a norm defined as

\begin{matrix} {∥ f ∥}_{H_{K}} = {∥ L_{K}^{- \frac{1}{2}} f ∥}_{L_{μ}^{2} (X)}, f \in H_{K}, \end{matrix}

(3)

i.e.,

\begin{matrix} ∥ L_{K}^{\frac{1}{2}} {f ∥}_{H_{K}} = {∥ f ∥}_{L_{μ}^{2} (X)}, f \in L_{μ}^{2} (X) . \end{matrix}

(4)

One of the targets of learning theory is to find an unknown function

f : X \to R

from the random observations

{(x_{i}, y_{i})}_{i = 1}^{m}

drawn i.i.d. (independently and identically distributed) according to an unknown probability

ρ (x, y) = ρ_{X} (x) ρ (y | x)

defined on

X \times R

(see [1,6]). A usual algorithm to realize this aim is to solve the following kernel regularized optimization problem:

\begin{matrix} f_{z, λ} = \arg\min_{f \in H_{K}} \frac{1}{m} \sum_{i = 1}^{m} {(f (x_{i}) - y_{i})}^{2} + λ {∥ f ∥}_{H_{K}}^{2}, \end{matrix}

(5)

where

H_{K}

is taken as the hypothesis space,

λ > 0

is a parameter which balances the relationship between the empirical error term

\sum_{i = 1}^{m} {(f (x_{i}) - y_{i})}^{2}

and the penalty term

{∥ f ∥}_{H_{K}}^{2}

. Let

f_{ρ} (x) = \int_{R} y d ρ (y | x)

be the regression function. Then

f_{ρ}

is the least-squares-best predictor (see Section 9.4 in Section 9 of [8]), i.e.,

E ({(f_{ρ} (\cdot) - y)}^{2}) = inf_{g \in L_{ρ_{X}}^{2} (X)} E ({(g (\cdot) - y)}^{2}) .

It is known that the convergence analysis of model (5) reduces to bounding the convergence rate of the error

∥ f_{z, λ} - f_{ρ} ∥_{L_{ρ_{X}}^{2} (X)}

, which depends upon the decay of the best approximation

I {(f, γ)}_{L_{ρ_{X}}^{2} (X)}

defined as (see e.g., [1,2,6])

\begin{matrix} I {(f, γ)}_{L_{ρ_{X}}^{2} (X)} & = & inf_{g \in H_{K}, {∥ g ∥}_{H_{K}} \leq γ} {∥ f - g ∥}_{L_{ρ_{X}}^{2} (X)}, γ > 0 \end{matrix}

(6)

as

γ \to + \infty .

Formula (6) deals with a decay rate which depends upon the approximation property of

H_{K}

. Many mathematicians have performed investigations on it. For example, D. X. Zhou gives the decay of (6) with the RKHS interpolation theory (see [2,3]). P.X. Ye gives the decay using convolution operators in the Euclidean space

R^{d}

(see [9]). H.W. Sun gives a decay for (6) with the help of operator theory in a Hilbert space (see [10]). It is known that the Fourier–Bessel series is a good approximation tool and has been studied by many mathematicians (see, for example, [11,12,13,14,15,16]). Additionally, approximation by RBF networks of Delsarte translates has been studied, and in essence such RBF approximation reduces to approximation by Fourier–Bessel transforms (see, for example, [17,18,19,20]). It is therefore of interest to investigate the decay of

I {(f, γ)}_{L_{ρ_{X}}^{2} (X)}

with both the Fourier–Bessel series and the Fourier–Bessel transforms.

Let

α > - \frac{1}{2}

and

1 \leq p \leq + \infty

be given real numbers, and

L^{p} (R_{+}, d μ_{α})

denote the space of all measurable real functions on

R_{+} = [0, + \infty)

such that

\begin{matrix} {∥ f ∥}_{p, α} = \{\begin{matrix} {(\int_{R_{+}} {| f (x) |}^{p} d μ_{α} (x))}^{\frac{1}{p}} < + \infty, & 1 \leq p < + \infty, \\ \operatorname{ess\,sup}_{x \in R_{+}} | f (x) | < + \infty, & p = + \infty, \end{matrix} \end{matrix}

where

d μ_{α} (x) = \frac{x^{2 α + 1}}{2^{α} Γ (α + 1)} d x .

The normalized Bessel function

j_{α} (z)

of the first kind and order

α

is

\begin{matrix} j_{α} (z) & = & Γ (α + 1) \sum_{n = 0}^{+ \infty} \frac{{(- 1)}^{n} {(\frac{z}{2})}^{2 n}}{n! Γ (n + α + 1)} \\ & = & 2^{α} Γ (α + 1) \frac{J_{α} (z)}{z^{α}}, z \in R_{+}, \end{matrix}

(7)

where

J_{α} (z) = {(\frac{z}{2})}^{α} \sum_{n = 0}^{+ \infty} \frac{{(- 1)}^{n} {(\frac{z}{2})}^{2 n}}{n! Γ (n + α + 1)}

is the Bessel function of first kind and order

α

, and

Γ (α + 1)

is the Gamma function.
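The series in (7) can be summed directly for moderate arguments. The following is a minimal sketch, not part of the original paper: the function name `j_alpha`, the tolerance and the term limit are illustrative choices. It uses the fact that consecutive terms of the series differ by the factor -(z/2)^2 / ((n + 1)(n + α + 1)), and compares the result with the closed form j_{1/2}(z) = sin(z)/z, which follows from J_{1/2}(z) = \sqrt{2/(π z)} sin z.

```python
import math

def j_alpha(alpha, z, tol=1e-17, max_terms=500):
    """Normalized Bessel function j_alpha(z), summed from the power series in (7)."""
    q = -(z / 2.0) ** 2
    term = 1.0   # the n = 0 term equals 1 after multiplying by Gamma(alpha + 1)
    total = 1.0
    for n in range(max_terms):
        # consecutive terms differ by the factor -(z/2)^2 / ((n + 1)(n + alpha + 1))
        term *= q / ((n + 1) * (n + alpha + 1))
        total += term
        if abs(term) < tol * max(1.0, abs(total)):
            break
    return total

# For alpha = 1/2 the closed form is j_{1/2}(z) = sin(z)/z.
for z in (0.5, 1.0, 2.0, 5.0):
    print(z, j_alpha(0.5, z), math.sin(z) / z)
```

The direct summation loses accuracy for large z because of cancellation between large alternating terms, so this sketch is only meant for arguments of moderate size.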

For

f \in L^{1} (R_{+}, d μ_{α})

, the usual Fourier–Bessel transform

F_{B}^{(α)} (f)

is defined as

F_{B}^{(α)} (f) (λ) = \int_{R_{+}} f (x) j_{α} (λ x) d μ_{α}, λ \in R_{+} .

In the present paper, some investigations on the decay of

I {(f, γ)}_{L_{ρ_{X}}^{2} (X)}

in the case that

H_{K}

are constructed with

j_{α} (z) (z \in [0, 1])

and

F_{B}^{(α)} (f)

are provided. Some K-functionals and moduli of smoothness are defined with the help of the semigroup of operators, and their equivalences are shown, with which the error for the decay is bounded. The results obtained are two kinds of upper bound estimates associated with Fourier–Bessel series and Fourier–Bessel transforms, respectively.

The paper is organized as follows. In Section 2, some notions and results on the Fourier–Bessel series and Fourier–Bessel transforms are provided, with which two kinds of RKHSs are constructed; the corresponding best RKHS approximation problem in this setting is restated. Some K-functionals and moduli of smoothness associated with Fourier–Bessel series and Fourier–Bessel transforms are provided and their equivalence is shown, with which some upper bounds for the best approximation are established in Section 3 and Section 4, respectively. All the proofs of the propositions, theorems and lemmas are given in Section 5. Some further analysis of the results is given in Section 6, from which one can see the value of the present work. A general proposition on the strong equivalence of K-functionals and moduli of smoothness is given in Appendix A.

2. Preliminaries

Let

λ_{1}, λ_{2}, \dots,

be the positive zeros of

J_{α} (u)

arranged in increasing order. It is well known that

j_{α} (λ_{n} x), n = 1, 2, \dots,

form a complete orthogonal system in

L_{α}^{2} = \{f : {∥ f ∥}_{L_{α}^{2}} = {(\int_{0}^{1} x^{2 α + 1} {| f (x) |}^{2} d x)}^{\frac{1}{2}} < + \infty\}

(see, for example, [12,16,21]), i.e.,

\int_{0}^{1} u^{2 α + 1} j_{α} (λ_{n} u) j_{α} (λ_{m} u) d u = {∥ j_{α} (λ_{n} \cdot) ∥}_{L_{α}^{2}}^{2} δ_{m, n} .

Take

j_{α}^{*} (λ_{i} x) = \frac{j_{α} (λ_{i} x)}{∥ j_{α} (λ_{i} \cdot) ∥_{L_{α}^{2}}}

. Then

\begin{matrix} \int_{0}^{1} u^{2 α + 1} j_{α}^{*} (λ_{n} u) j_{α}^{*} (λ_{m} u) d u = δ_{m, n}, \end{matrix}

(8)

{j_{α}^{*} (λ_{i} x)}_{i = 1}^{\infty}

forms an orthonormal basis of

L_{α}^{2}

and for any

f \in L_{α}^{2}

, we have the Fourier–Bessel series expansion

\begin{matrix} f (x) = \sum_{i = 1}^{+ \infty} a_{i} (f) j_{α}^{*} (λ_{i} x), x \in [0, 1], \end{matrix}

(9)

where

a_{i} (f) = \int_{0}^{1} x^{2 α + 1} f (x) j_{α}^{*} (λ_{i} x) d x

and

\begin{matrix} {∥ f ∥}_{L_{α}^{2}} = {(\sum_{i = 1}^{+ \infty} {|a_{i} (f)|}^{2})}^{\frac{1}{2}} . \end{matrix}

(10)
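The orthonormality relation (8) and the Parseval identity (10) can be checked numerically in a special case. The sketch below is an illustration under stated assumptions, not the authors' computation: it fixes α = 1/2, where j_{1/2}(z) = sin(z)/z, the zeros are λ_n = nπ, and the weight x^{2α+1} equals x^2. The helper names `simpson`, `basis` and `inner` are hypothetical. For f(x) = 1 the coefficients have the closed form a_n(f) = \sqrt{2} (-1)^{n+1} / (nπ), and the sum of their squares is 1/3, which equals the integral of x^2 over [0, 1].

```python
import math

def simpson(func, a, b, n=2000):
    """Composite Simpson rule on [a, b]; n must be even."""
    h = (b - a) / n
    total = func(a) + func(b)
    for k in range(1, n):
        total += (4.0 if k % 2 else 2.0) * func(a + k * h)
    return total * h / 3.0

def basis(n):
    """Orthonormal function j*_{1/2}(lambda_n x) with lambda_n = n*pi (alpha = 1/2)."""
    lam = n * math.pi
    scale = math.sqrt(2.0) * lam   # reciprocal of the L^2_alpha norm of j_{1/2}(lambda_n .)
    return lambda x: scale * (1.0 if x == 0.0 else math.sin(lam * x) / (lam * x))

def inner(f, g):
    """Inner product of L^2_alpha for alpha = 1/2, i.e. with weight x^2 on [0, 1]."""
    return simpson(lambda x: x * x * f(x) * g(x), 0.0, 1.0)

# Gram matrix of the first three basis functions, expected to be the identity by (8).
gram = [[inner(basis(n), basis(m)) for m in range(1, 4)] for n in range(1, 4)]

# Fourier-Bessel coefficients of f(x) = 1, numerically and in closed form.
one = lambda x: 1.0
coeffs = [inner(one, basis(n)) for n in range(1, 6)]
exact = [math.sqrt(2.0) * (-1) ** (n + 1) / (n * math.pi) for n in range(1, 6)]

# Parseval identity (10): the partial sums of a_n^2 approach ||f||^2 = 1/3.
parseval_partial = sum(2.0 / (n * math.pi) ** 2 for n in range(1, 20001))
norm_sq = simpson(lambda x: x * x, 0.0, 1.0)
print(gram[0], coeffs[:2], parseval_partial, norm_sq)
```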

Lemma 1.

We have the following results:

(i): Let $Λ \subset N$ . Then

$\begin{matrix} ∥ \sum_{i \in Λ} c_{i} j_{α}^{*} (λ_{i} x) ∥_{L_{α}^{2}} = {(\sum_{i \in Λ} c_{i}^{2})}^{\frac{1}{2}} . \end{matrix}$

(11)
(ii): The generalized translation operator $T_{x}$ on $L_{α}^{2}$ defined as

$\begin{matrix} T_{x} (f) (y) = \frac{Γ (α + 1)}{\sqrt{π} Γ (α + \frac{1}{2})} \int_{0}^{π} f (\sqrt{x^{2} + y^{2} - 2 x y cos θ}) {(sin θ)}^{2 α} d θ, x, y \in [0, 1] \end{matrix}$

has the expansion of

$\begin{matrix} T_{x} (f) (y) = \sum_{i = 1}^{+ \infty} a_{i}^{*} (f) j_{α}^{*} (λ_{i} x) j_{α}^{*} (λ_{i} y), x, y \in [0, 1], \end{matrix}$

(12)

where $a_{i}^{*} (f) = \int_{0}^{1} x^{2 α + 1} f (x) j_{α} (λ_{i} x) d x,$ and

$\begin{matrix} ∥ T_{h} {(f) (\cdot) ∥}_{L_{α}^{2}} \leq {∥ f ∥}_{L_{α}^{2}}, \forall h \in [0, 1] . \end{matrix}$

(13)
(iii): The zeros ${λ_{1}, λ_{2}, \dots,}$ satisfy

$\begin{matrix} λ_{n} = n π + \frac{α π}{2} - \frac{π}{4} + O (\frac{1}{n}) . \end{matrix}$

(14)

Proof.

See Section 5. □

Inequality (13) is a theoretical basis for defining the moduli of smoothness with translation operators

T_{x} (f) (y)

.
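The asymptotic formula (14) for the zeros can be compared with numerically computed zeros. The sketch below is a hypothetical illustration: the scan step, the bisection depth and the names `j_alpha` and `bessel_zeros` are assumptions, not taken from the paper. It uses the fact that the positive zeros of J_α and of j_α coincide, since j_α(z) = 2^α Γ(α + 1) J_α(z) / z^α. For α = 1/2 the zeros are exactly nπ, so the remainder term in (14) vanishes; for α = 0 the gap to nπ - π/4 shrinks as n grows.

```python
import math

def j_alpha(alpha, z, tol=1e-17, max_terms=500):
    """Normalized Bessel function j_alpha(z), summed from the power series in (7)."""
    q = -(z / 2.0) ** 2
    term, total = 1.0, 1.0
    for n in range(max_terms):
        term *= q / ((n + 1) * (n + alpha + 1))
        total += term
        if abs(term) < tol * max(1.0, abs(total)):
            break
    return total

def bessel_zeros(alpha, count, step=0.05):
    """First `count` positive zeros of j_alpha: sign-change scan followed by bisection."""
    zeros = []
    a, fa = step, j_alpha(alpha, step)
    while len(zeros) < count:
        b = a + step
        fb = j_alpha(alpha, b)
        if fa * fb < 0.0:
            lo, hi, flo = a, b, fa
            for _ in range(60):
                mid = 0.5 * (lo + hi)
                fmid = j_alpha(alpha, mid)
                if flo * fmid <= 0.0:
                    hi = mid               # the sign change lies in [lo, mid]
                else:
                    lo, flo = mid, fmid    # the sign change lies in [mid, hi]
            zeros.append(0.5 * (lo + hi))
        a, fa = b, fb
    return zeros

def leading_term(alpha, n):
    """Leading part of (14): n*pi + alpha*pi/2 - pi/4."""
    return n * math.pi + alpha * math.pi / 2.0 - math.pi / 4.0

zeros0 = bessel_zeros(0.0, 4)
gaps0 = [z - leading_term(0.0, n) for n, z in enumerate(zeros0, 1)]
zeros_half = bessel_zeros(0.5, 3)
print(zeros0, gaps0, zeros_half)
```

The scan assumes that no zero falls exactly on a grid point and that consecutive zeros are more than one step apart, which holds for the orders used here.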

Let

{h_{i}}_{i = 1}^{+ \infty}

be a given sequence of positive real numbers such that the series on the right-hand side of

\begin{matrix} K_{x}^{(α)} (y) = K^{(α)} (x, y) = \sum_{i = 1}^{+ \infty} h_{i} j_{α}^{*} (λ_{i} x) j_{α}^{*} (λ_{i} y), x, y \in [0, 1], \end{matrix}

(15)

converges uniformly for all

x, y \in [0, 1]

. It is therefore a Mercer kernel. Then

\begin{matrix} L_{K^{(α)}} (f, x) = \sum_{i = 1}^{+ \infty} h_{i} a_{i} (f) j_{α}^{*} (λ_{i} x), x \in [0, 1] . \end{matrix}

(16)

Take

\begin{matrix} L_{K^{(α)}}^{\frac{1}{2}} (f, x) = \sum_{i = 1}^{+ \infty} \sqrt{h_{i}} a_{i} (f) j_{α}^{*} (λ_{i} x), x \in [0, 1] . \end{matrix}

(17)

Then it is easy to verify that

L_{K^{(α)}} = L_{K^{(α)}}^{\frac{1}{2}} \circ L_{K^{(α)}}^{\frac{1}{2}}

, and

\begin{matrix} H_{K^{(α)}} = L_{K^{(α)}}^{\frac{1}{2}} (L_{α}^{2}) = \{g \in L_{α}^{2} : {∥ g ∥}_{K^{(α)}} = {∥ L_{K^{(α)}}^{- \frac{1}{2}} (g) ∥}_{L_{α}^{2}} = {(\sum_{i = 1}^{+ \infty} \frac{{| a_{i} (g) |}^{2}}{h_{i}})}^{\frac{1}{2}} < + \infty\} \end{matrix}

is an RKHS in

L_{α}^{2}

associated with the reproducing kernel

K^{(α)} (x, y)

and an inner product

{〈 \cdot, \cdot 〉}_{K^{(α)}}

defined as

\begin{matrix} {〈 f, g 〉}_{K^{(α)}} = \sum_{i = 1}^{+ \infty} \frac{a_{i} (f) a_{i} (g)}{h_{i}}, f, g \in H_{K^{(α)}} . \end{matrix}

Since

\begin{matrix} a_{i} (K_{x}^{(α)} (\cdot)) & = & \int_{0}^{1} y^{2 α + 1} K^{(α)} (x, y) j_{α}^{*} (λ_{i} y) d y \\ = & \int_{0}^{1} y^{2 α + 1} (\sum_{k = 1}^{+ \infty} h_{k} j_{α}^{*} (λ_{k} x) j_{α}^{*} (λ_{k} y)) j_{α}^{*} (λ_{i} y) d y \\ = & h_{i} j_{α}^{*} (λ_{i} x), \end{matrix}

we have

\begin{matrix} {〈 f, K_{x}^{(α)} (\cdot) 〉}_{K^{(α)}} & = & \sum_{i = 1}^{+ \infty} \frac{a_{i} (f) a_{i} (K_{x}^{(α)} (\cdot))}{h_{i}} \\ = & \sum_{i = 1}^{+ \infty} \frac{a_{i} (f) h_{i} j_{α}^{*} (λ_{i} x)}{h_{i}} \\ = & \sum_{i = 1}^{+ \infty} a_{i} (f) j_{α}^{*} (λ_{i} x) = f (x) . \end{matrix}

Equality (6) becomes

\begin{matrix} I {(f, γ)}_{L_{α}^{2}} & = & inf_{g \in H_{K^{(α)}}, {∥ g ∥}_{K^{(α)}} \leq γ} {∥ f - g ∥}_{L_{α}^{2}}, γ > 0 \end{matrix}

(18)

as

γ \to + \infty .

Let

C_{*} (R)

be the class of even

C^{\infty}

-functions on

R = (- \infty, + \infty)

. Denote by

A_{*} (R)

the space of even

C^{\infty}

-functions on R which are rapidly decreasing together with all their derivatives, i.e.,

\forall p, k \in N, sup_{x \geq 0} (| x^{p} f^{(k)} (x) |) < + \infty,

where

N

is the set of natural numbers.

Let

D_{*, a}

denote the space of even

C^{\infty}

-functions on R with support in

[- a, a], a \geq 0

and

D_{*} (R) = ⋃_{a \geq 0} D_{*, a} .

Additionally, define the generalized translation operator

T_{x}

on

L^{1} (R_{+}, d μ_{α})

as

\begin{matrix} T_{x} (f) (y) = \frac{Γ (α + 1)}{\sqrt{π} Γ (α + \frac{1}{2})} \int_{0}^{π} f (\sqrt{x^{2} + y^{2} - 2 x y cos θ}) {(sin θ)}^{2 α} d θ, x, y \in R_{+} . \end{matrix}

and define a convolution on

L^{1} (R_{+}, d μ_{α})

by

(f *_{B} g) (x) = \int_{R_{+}} T_{x} (f) (y) g (y) d μ_{α} (y), f, g \in L^{1} (R_{+}, d μ_{α}), x \in R_{+} .

For the Bessel operator

l_{α} = \frac{d^{2}}{d x^{2}} + \frac{2 α + 1}{x} \frac{d}{d x}

we have (see p. 12 or p. 177 of [22])

\begin{matrix} (- l_{α}) (j_{α} (λ \cdot)) (x) = λ^{2} j_{α} (λ x), {(- l_{α})}^{- 1} (j_{α} (λ \cdot)) (x) = \frac{1}{λ^{2}} j_{α} (λ x), λ, x \in R_{+} . \end{matrix}

(19)

and therefore

{(- l_{α})}^{\mp \frac{1}{2}} (j_{α} (λ \cdot)) (x) = λ^{\mp 1} j_{α} (λ x), x \in R_{+} .
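The eigenfunction relation in (19) can be checked with finite differences. The sketch below is illustrative only: the step size `h`, the sample points and the helper names `j_alpha` and `bessel_operator` are assumptions. It approximates l_α u = u'' + ((2α + 1)/x) u' by central differences for u(x) = j_α(λx) and compares -l_α u with λ^2 u.

```python
import math

def j_alpha(alpha, z, tol=1e-17, max_terms=500):
    """Normalized Bessel function j_alpha(z), summed from the power series in (7)."""
    q = -(z / 2.0) ** 2
    term, total = 1.0, 1.0
    for n in range(max_terms):
        term *= q / ((n + 1) * (n + alpha + 1))
        total += term
        if abs(term) < tol * max(1.0, abs(total)):
            break
    return total

def bessel_operator(f, alpha, x, h=1e-3):
    """Central-difference approximation of l_alpha f = f'' + (2*alpha + 1)/x * f' at x > 0."""
    d1 = (f(x + h) - f(x - h)) / (2.0 * h)
    d2 = (f(x + h) - 2.0 * f(x) + f(x - h)) / (h * h)
    return d2 + (2.0 * alpha + 1.0) / x * d1

alpha, lam = 1.0, 2.5
u = lambda s: j_alpha(alpha, lam * s)

# Residual of (-l_alpha) u = lam^2 u at a few sample points; limited by the O(h^2) scheme.
residuals = [abs(-bessel_operator(u, alpha, x) - lam ** 2 * u(x)) for x in (0.3, 0.8, 1.5)]
print(residuals)
```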

Moreover, we have the following lemma.

Lemma 2.

The following statements hold:

(i): $D_{*} (R)$ is dense in $A_{*} (R)$ ;
(ii): Both $D_{*} (R)$ and $A_{*} (R)$ are dense in $L^{p} (R_{+}, d μ_{α}), 1 \leq p < + \infty,$ and

$\begin{matrix} D_{*} (R) \subset A_{*} (R) \subset L^{p} (R_{+}, d μ_{α}), 1 \leq p < + \infty; \end{matrix}$

(20)
(iii): If $f \in A_{*} (R)$ , then $F_{B}^{(α)} (f) \in A_{*} (R)$ and $T_{x} (f) \in A_{*} (R)$ ;
(iv): $F_{B}^{(α)}$ is a topological isomorphism from $A_{*} (R)$ to itself and ${(F_{B}^{(α)})}^{- 1} = F_{B}^{(α)}$ ;
(v): There hold

$\begin{matrix} F_{B}^{(α)} (f *_{B} g) = F_{B}^{(α)} (f) F_{B}^{(α)} (g), f, g \in L^{1} (R_{+}, d μ_{α}), \end{matrix}$

(21)

$\begin{matrix} (f *_{B} g) (x) = \int_{R_{+}} F_{B}^{(α)} (f) (λ) F_{B}^{(α)} (g) (λ) j_{α} (λ x) d μ_{α} (λ) \end{matrix}$

(22)

and

$\begin{matrix} F_{B}^{(α)} (T_{x} (f)) (λ) = j_{α} (λ x) F_{B}^{(α)} (f) (λ), f \in L^{1} (R_{+}, d μ_{α}) . \end{matrix}$

(23)

It follows

$\begin{matrix} T_{x} (f, y) = \int_{R_{+}} F_{B}^{(α)} (f) (λ) j_{α} (λ x) j_{α} (λ y) d μ_{α} (λ), f \in L^{1} (R_{+}, d μ_{α}) . \end{matrix}$

(24)
(vi): If $f, F_{B}^{(α)} (f) \in L^{1} (R_{+}, d μ_{α})$ , then

$\begin{matrix} f (x) = \int_{R_{+}} F_{B}^{(α)} (f) (λ) j_{α} (λ x) d μ_{α} (λ), a . e . x \in R_{+}; \end{matrix}$

(25)
(vii): Let $f \in A_{*} (R)$ or $f \in L^{2} (R_{+}, d μ_{α})$ . Then

$\begin{matrix} \int_{R_{+}} {|f (x)|}^{2} d μ_{α} = \int_{R_{+}} {|F_{B}^{(α)} (f) (λ)|}^{2} d μ_{α} (λ); \end{matrix}$

(26)
(viii): There hold the following relations

$\begin{matrix} F_{B}^{(α)} (l_{α}^{p} (f)) (λ) = {(- 1)}^{p} λ^{2 p} F_{B}^{(α)} (f) (λ), f \in L^{1} (R_{+}, d μ_{α}), \end{matrix}$

(27)

$\begin{matrix} ∥ T_{x} {(f) ∥}_{p, α} \leq {∥ f ∥}_{p, α}, f \in L^{p} (R_{+}, d μ_{α}), 1 \leq p < + \infty, \end{matrix}$

(28)

$\begin{matrix} T_{x} (j_{α} (λ \cdot)) (y) = j_{α} (λ x) j_{α} (λ y), \forall x, y, λ \in R_{+} . \end{matrix}$

(29)

Proposition 2.1 of [23] shows that if

ϕ \in L^{1} (R_{+}, d μ_{α})

satisfies

F_{B}^{(α)} (ϕ) \geq 0

and

F_{B}^{(α)} (ϕ) \in L^{1} (R_{+}, d μ_{α})

, then

\begin{matrix} K (ϕ, x, y) = K_{x} (ϕ, y) = T_{x} (ϕ, y) = \int_{R_{+}} F_{B}^{(α)} (ϕ) (λ) j_{α} (λ x) j_{α} (λ y) d μ_{α} (λ), x, y \in R_{+}, \end{matrix}

defines a Mercer kernel on

R_{+}

. We make the following assumption.

Assumption 1.

Let

ϕ \in L^{1} (R_{+}, d μ_{α})

satisfy

F_{B}^{(α)} (ϕ) > 0, F_{B}^{(α)} (ϕ) \in L^{1} (R_{+}, d μ_{α})

and for any

μ > 0

there is a real number

a \in R_{+}

such that

\begin{matrix} \{λ \in R_{+} : F_{B}^{(α)} (ϕ) (λ) \geq \frac{1}{μ}\} \subset [0, a] . \end{matrix}

(30)

We point out that functions

ϕ

satisfying Assumption 1 exist, and give two examples.

Example 1.

For

t \in (0, + \infty)

the function

p_{t} : [0, + \infty) \to R_{+}

defined by

\begin{matrix} p_{t} (x) = \frac{2^{α + 1} Γ (α + \frac{3}{2})}{\sqrt{π}} \frac{t}{{(t^{2} + x^{2})}^{α + \frac{3}{2}}} \end{matrix}

satisfies

∥ p_{t} ∥_{L^{1} (R_{+}, d μ_{α})} = 1, p_{t} *_{B} p_{s} = p_{t + s}

and

F_{B}^{(α)} (p_{t}) (λ) = e^{- t λ}

for

λ \in R_{+}

(see Problem 5. VIII 2 in Section 5.VIII Problems of [22]).

Example 2.

For

t, s \in (0, + \infty)

the function

k_{t} : R_{+} \to R_{+}

defined by

\begin{matrix} k_{t} (x) = \frac{e^{- \frac{x^{2}}{4 t}}}{{(2 t)}^{α + 1}} \end{matrix}

satisfies

∥ k_{t} ∥_{L^{1} (R_{+}, d μ_{α})} = 1, k_{t} *_{B} k_{s} = k_{t + s}

and

F_{B}^{(α)} (k_{t}) (λ) = e^{- t λ^{2}}

for

λ \in R_{+}

(see Problem 5. VIII 1 in Section 5.VIII Problems of [22]).
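The transform formula of Example 2 can be checked by numerical quadrature. The sketch below is hypothetical and only covers Example 2: the Poisson-type kernel of Example 1 decays slowly and would need a more careful quadrature. The names `simpson`, `weight`, `k` and `fourier_bessel`, the truncation point `upper` and the number of nodes are assumptions. It evaluates F_B^{(α)}(k_t)(λ) = \int k_t(x) j_α(λx) dμ_α(x) with dμ_α(x) = x^{2α+1} / (2^α Γ(α + 1)) dx and compares it with e^{-tλ^2}.

```python
import math

def j_alpha(alpha, z, tol=1e-17, max_terms=500):
    """Normalized Bessel function j_alpha(z), summed from the power series in (7)."""
    q = -(z / 2.0) ** 2
    term, total = 1.0, 1.0
    for n in range(max_terms):
        term *= q / ((n + 1) * (n + alpha + 1))
        total += term
        if abs(term) < tol * max(1.0, abs(total)):
            break
    return total

def simpson(func, a, b, n=2000):
    """Composite Simpson rule on [a, b]; n must be even."""
    h = (b - a) / n
    total = func(a) + func(b)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * func(a + i * h)
    return total * h / 3.0

def weight(alpha, x):
    """Density of d mu_alpha with respect to dx."""
    return x ** (2.0 * alpha + 1.0) / (2.0 ** alpha * math.gamma(alpha + 1.0))

def k(t, alpha, x):
    """Gaussian-type kernel k_t of Example 2."""
    return math.exp(-x * x / (4.0 * t)) / (2.0 * t) ** (alpha + 1.0)

def fourier_bessel(f, alpha, lam, upper=12.0, n=2000):
    """Fourier-Bessel transform of f at lam, with the integral truncated to [0, upper]."""
    return simpson(lambda x: f(x) * j_alpha(alpha, lam * x) * weight(alpha, x), 0.0, upper, n)

t = 0.5
for alpha, lam in ((0.0, 1.0), (0.5, 0.7), (1.0, 1.5)):
    value = fourier_bessel(lambda x: k(t, alpha, x), alpha, lam)
    print(alpha, lam, value, math.exp(-t * lam * lam))
```

Evaluating the transform at λ = 0 returns the integral of k_t against dμ_α, which should be close to 1 in agreement with the normalization stated in Example 2.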

Define

\begin{matrix} H_{K (ϕ)} = \{g \in L^{2} (R_{+}, d μ_{α}) \cap C_{*} (R) : & \frac{F_{B}^{(α)} (g)}{{(F_{B}^{(α)} (ϕ))}^{\frac{1}{2}}} \in L^{2} (R_{+}, d μ_{α}), \\ & g (u) = \int_{R_{+}} F_{B}^{(α)} (g) (λ) j_{α} (λ u) d μ_{α} (λ)\} \end{matrix}

with norm

{∥ g ∥}_{H_{K (ϕ)}} = {(\int_{R_{+}} \frac{{| F_{B}^{(α)} (g) (λ) |}^{2}}{F_{B}^{(α)} (ϕ) (λ)} d μ_{α} (λ))}^{\frac{1}{2}} .

Define an inner product on

H_{K (ϕ)}

as

{〈 g, f 〉}_{K (ϕ)} = \int_{R_{+}} \frac{F_{B}^{(α)} (f) (λ) F_{B}^{(α)} (g) (λ)}{F_{B}^{(α)} (ϕ) (λ)} d μ_{α}, f, g \in H_{K (ϕ)} .

It is known that

K (ϕ, x, y)

is a reproducing kernel of

H_{K (ϕ)}

(see [24]), i.e.,

\begin{matrix} {〈 g, K (ϕ, x, \cdot) 〉}_{K (ϕ)} = g (x), g \in H_{K (ϕ)}, x \in R_{+} . \end{matrix}

(31)

We have

\begin{matrix} L_{K (ϕ)} (f, x) & = & \int_{R_{+}} K_{x} (ϕ, u) f (u) d μ_{α} (u) \\ = & \int_{R_{+}} F_{B}^{(α)} (ϕ) (λ) F_{B}^{(α)} (f) (λ) j_{α} (λ x) d μ_{α} (λ), f \in L^{1} (R_{+}, d μ_{α}) . \end{matrix}

(32)

Define for a given real number

r \in R

an operator as

\begin{matrix} L_{K (ϕ)}^{r} (f, x) = \int_{R_{+}} {(F_{B}^{(α)} (ϕ) (λ))}^{r} F_{B}^{(α)} (f) (λ) j_{α} (λ x) d μ_{α} (λ), f \in L^{1} (R_{+}, d μ_{α}) . \end{matrix}

(33)

Then it is easy to show that

L_{K (ϕ)} = L_{K (ϕ)}^{\frac{1}{2}} (L_{K (ϕ)}^{\frac{1}{2}}) = L_{K (ϕ)}^{\frac{1}{2}} \circ L_{K (ϕ)}^{\frac{1}{2}}

,

\begin{matrix} L_{K (ϕ)}^{\frac{1}{2}} (L^{2} (R_{+}, d μ_{α})) = \{g \in L^{2} (R_{+}, d μ_{α}) : {(\int_{R_{+}} \frac{{| F_{B}^{(α)} (g) (λ) |}^{2}}{F_{B}^{(α)} (ϕ) (λ)} d μ_{α} (λ))}^{\frac{1}{2}} < + \infty\} = H_{K (ϕ)}, \end{matrix}

and

{∥ f ∥}_{K (ϕ)} = {∥ L_{K (ϕ)}^{- \frac{1}{2}} (f) ∥}_{L^{2} (R_{+}, d μ_{α})}, f \in H_{K (ϕ)} .

In this case, the decay (6) becomes

\begin{matrix} I {(f, γ)}_{L^{2} (R_{+}, d μ_{α})} & = & inf_{g \in H_{K (ϕ)}, {∥ g ∥}_{K (ϕ)} \leq γ} {∥ f - g ∥}_{L^{2} (R_{+}, d μ_{α})}, f \in L^{2} (R_{+}, d μ_{α}) \end{matrix}

(34)

as

γ \to + \infty .

If

F_{B}^{(α)} (ϕ) (λ) = \frac{1}{λ^{2}}

, then we define the corresponding RKHS

\begin{matrix} H_{K (ϕ)}^{*} = L_{K (ϕ)}^{\frac{1}{2}} (A_{*} (R)) = \{g \in A_{*} (R) : {(\int_{R_{+}} λ^{2} {|F_{B}^{(α)} (g) (λ)|}^{2} d μ_{α} (λ))}^{\frac{1}{2}} < + \infty\} \end{matrix}

and for

g \in H_{K (ϕ)}^{*}

, there holds

\begin{matrix} {∥ g ∥}_{K (ϕ)} & = & ∥ L_{K (ϕ)}^{- \frac{1}{2}} {(g) ∥}_{L^{2} (R_{+}, d μ_{α})} \\ = & {(\int_{R_{+}} λ^{2} {|F_{B}^{(α)} (g) (λ)|}^{2} d μ_{α} (λ))}^{\frac{1}{2}} \\ = & ∥ {(- l_{α})}^{\frac{1}{2}} {g ∥}_{L^{2} (R_{+}, d μ_{α})} . \end{matrix}

We have by (34) that

\begin{matrix} I {(f, γ)}_{L^{2} (R_{+}, d μ_{α})} & = & inf_{∥ {(- l_{α})}^{\frac{1}{2}} {g ∥}_{L^{2} (R_{+}, d μ_{α})} \leq γ} {∥ f - g ∥}_{L^{2} (R_{+}, d μ_{α})} \end{matrix}

(35)

as

γ \to + \infty .

3. An Upper Bound Estimate with Fourier–Bessel Series

To bound the decay of (18), we define a K-functional

\begin{matrix} D_{H_{K^{(α)}}} {(f, t)}_{L_{α}^{2}} = inf_{g \in H_{K^{(α)}}} ({∥ f - g ∥}_{L_{α}^{2}} + t {∥ g ∥}_{K^{(α)}}), f \in L_{α}^{2}, t > 0 \end{matrix}

(36)

and a modulus of smoothness

\begin{matrix} ω_{H_{K^{(α)}}} {(f, t)}_{L_{α}^{2}} = {∥ (T_{K^{(α)}} (t) - I) f ∥}_{L_{α}^{2}}, f \in L_{α}^{2}, t > 0, \end{matrix}

(37)

where

T_{K^{(α)}} (t) f (x) = \sum_{i = 1}^{\infty} e^{- \frac{t}{\sqrt{h_{i}}}} a_{i} (f) j_{α}^{*} (λ_{i} x), x \in [0, 1] .

Then we have the following Proposition 1, whose proof can be found in Section 5.

Proposition 1.

The following equivalence relation holds:

\begin{matrix} D_{H_{K^{(α)}}} {(f, t)}_{L_{α}^{2}} \sim ω_{H_{K^{(α)}}} {(f, t)}_{L_{α}^{2}}, f \in L_{α}^{2}, t > 0 . \end{matrix}

(38)

Proof.

See Section 5. □
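In terms of Fourier–Bessel coefficients, both quantities in Proposition 1 become sums over i, which makes the easy direction of (38) simple to observe numerically. The sketch below is an illustration under assumptions: the sequences `h` and `a` are hypothetical sample data, and `k_functional_upper` only minimizes over truncations of f, so it gives an upper estimate of the K-functional (36) rather than its exact value. The inequality ω(f, t) ≤ ∥f - g∥ + t ∥g∥_{K^{(α)}} holds for every g in the RKHS, because |e^{-s} - 1| ≤ 1 and 1 - e^{-s} ≤ s for s ≥ 0; the reverse bound with a constant is the content of Proposition 1 and is not verified here.

```python
import math

def l2_norm(a):
    """Norm in L^2_alpha computed from coefficients, as in (10)."""
    return math.sqrt(sum(x * x for x in a))

def rkhs_norm(b, h):
    """RKHS norm: square root of the sum of b_i^2 / h_i."""
    return math.sqrt(sum(x * x / hi for x, hi in zip(b, h)))

def modulus(a, h, t):
    """Modulus of smoothness (37): norm of (T(t) - I) f on the coefficient side."""
    return math.sqrt(sum(((math.exp(-t / math.sqrt(hi)) - 1.0) * x) ** 2
                         for x, hi in zip(a, h)))

def k_functional_upper(a, h, t, cutoffs):
    """Upper estimate of the K-functional (36), taking g among truncations of f."""
    best = float("inf")
    for m in cutoffs:
        g = a[:m] + [0.0] * (len(a) - m)
        resid = [x - y for x, y in zip(a, g)]
        best = min(best, l2_norm(resid) + t * rkhs_norm(g, h))
    return best

N = 400
h = [1.0 / (i * math.pi) ** 2 for i in range(1, N + 1)]   # hypothetical h_i, decreasing to 0
a = [i ** -1.5 for i in range(1, N + 1)]                  # hypothetical coefficients a_i(f)

for t in (0.001, 0.01, 0.1):
    print(t, modulus(a, h, t), k_functional_upper(a, h, t, range(0, N + 1, 10)))
```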

Theorem 1.

There is a constant

C > 0

such that

\begin{matrix} I {(f, γ)}_{L_{α}^{2}} \leq C ω_{H_{K^{(α)}}} {(f, \frac{{∥ f ∥}_{L_{α}^{2}}}{γ})}_{L_{α}^{2}}, f \in L_{α}^{2} \end{matrix}

(39)

as

γ \to + \infty .

Proof.

See Section 5. □

Substituting

h_{i} = \frac{1}{λ_{i}^{2}}

into (15), we obtain the kernel

\begin{matrix} K_{x}^{*} (y) = K^{*} (x, y) = \sum_{i = 1}^{+ \infty} \frac{1}{λ_{i}^{2}} j_{α}^{*} (λ_{i} x) j_{α}^{*} (λ_{i} y), x, y \in [0, 1] . \end{matrix}

It follows that

\begin{matrix} H_{K^{*}} & = & L_{K^{*}}^{\frac{1}{2}} (L_{α}^{2}) \\ = & {g \in L_{α}^{2} : ∥ g ∥_{K^{*}} = {(\sum_{i = 1}^{+ \infty} λ_{i}^{2} {| a_{i} (g) |}^{2})}^{\frac{1}{2}} < + \infty}, \end{matrix}

which shows that

{∥ g ∥}_{K^{*}} = {∥ {(- l_{α})}^{\frac{1}{2}} (g) ∥}_{L_{α}^{2}}

and

\begin{matrix} D_{H_{K^{*}}} {(f, t)}_{L_{α}^{2}} = inf_{g \in H_{K^{*}}} ({∥ f - g ∥}_{L_{α}^{2}} + t {∥ {(- l_{α})}^{\frac{1}{2}} (g) ∥}_{L_{α}^{2}}), f \in L_{α}^{2}, t > 0 \end{matrix}

and

\begin{matrix} ω_{H_{K^{*}}} {(f, t)}_{L_{α}^{2}} = {∥ (T_{K^{*}} (t) - I) f ∥}_{L_{α}^{2}}, f \in L_{α}^{2}, t > 0, \end{matrix}

where

T_{K^{*}} (t) f (x) = \sum_{i = 1}^{\infty} e^{- t λ_{i}} a_{i} (f) j_{α}^{*} (λ_{i} x), x \in [0, 1] .

We have two corollaries.

Corollary 1.

For any

f \in L_{α}^{2}

, there holds

\begin{matrix} D_{H_{K^{*}}} {(f, t)}_{L_{α}^{2}} \sim ω_{H_{K^{*}}} {(f, t)}_{L_{α}^{2}}, f \in L_{α}^{2}, t > 0 \end{matrix}

Corollary 2.

For any

f \in L_{α}^{2}

, there holds

\begin{matrix} I {(f, γ)}_{L_{α}^{2}} \leq C ω_{H_{K^{*}}} {(f, \frac{{∥ f ∥}_{L_{α}^{2}}}{γ})}_{L_{α}^{2}}, γ \to + \infty . \end{matrix}

4. An Upper Bound Estimate with the Fourier–Bessel Transform

To bound

I {(f, γ)}_{L^{2} (R_{+}, d μ_{α})}

, we define a K-functional

D_{K (ϕ)} {(f, t)}_{L^{2} (R_{+}, d μ_{α})}

and a modulus

ω_{K (ϕ)} {(f, t)}_{L^{2} (R_{+}, d μ_{α})}

respectively corresponding to

H_{K (ϕ)}

as

\begin{matrix} D_{K (ϕ)} {(f, t)}_{L^{2} (R_{+}, d μ_{α})} & = & inf_{g \in H_{K (ϕ)}} ({∥ f - g ∥}_{L^{2} (R_{+}, d μ_{α})} + t {∥ g ∥}_{K (ϕ)}) \\ & = & inf_{g \in L_{K (ϕ)}^{\frac{1}{2}} (L^{2} (R_{+}, d μ_{α}))} ({∥ f - g ∥}_{L^{2} (R_{+}, d μ_{α})} + t {∥ L_{K (ϕ)}^{- \frac{1}{2}} (g) ∥}_{L^{2} (R_{+}, d μ_{α})}), f \in L^{2} (R_{+}, d μ_{α}), t > 0, \end{matrix}

and

ω_{K (ϕ)} {(f, t)}_{L^{2} (R_{+}, d μ_{α})} = {∥ (T_{K (ϕ)} (t) - I) f ∥}_{L^{2} (R_{+}, d μ_{α})}, f \in L^{2} (R_{+}, d μ_{α}), t > 0,

where

T_{K (ϕ)} (t) f (x) = \int_{R_{+}} e^{- \frac{t}{\sqrt{F_{B}^{(α)} (ϕ) (λ)}}} F_{B}^{(α)} (f) (λ) j_{α} (λ x) d μ_{α} (λ) .

The K-functional and the modulus are equivalent, i.e., we have the following proposition.

Proposition 2.

Let

ϕ \in L^{1} (R_{+}, d μ_{α})

satisfy Assumption 1. Then there holds the equivalence

\begin{matrix} D_{K (ϕ)} {(f, t)}_{L^{2} (R_{+}, d μ_{α})} \sim ω_{K (ϕ)} {(f, t)}_{L^{2} (R_{+}, d μ_{α})}, f \in L^{2} (R_{+}, d μ_{α}), t > 0 . \end{matrix}

(40)

We now give an upper bound estimate for (34).

Theorem 2.

Under the conditions of Proposition 2, there is a constant

C > 0

such that

\begin{matrix} I {(f, γ)}_{L^{2} (R_{+}, d μ_{α})} & \leq & C ω_{K (ϕ)} {(f, \frac{{∥ f ∥}_{L^{2} (R_{+}, d μ_{α})}}{γ})}_{L^{2} (R_{+}, d μ_{α})}, f \in L^{2} (R_{+}, d μ_{α}) \end{matrix}

(41)

as

γ \to + \infty .

For

F_{B}^{(α)} (ϕ) (λ) = \frac{1}{λ^{2}}

we define a K-functional on

L^{2} (R_{+}, d μ_{α})

as

\begin{matrix} D_{l_{α}^{\frac{1}{2}}} {(f, t)}_{L^{2} (R_{+}, d μ_{α})} = inf_{g \in H_{K (ϕ)}^{*}} ({∥ f - g ∥}_{L^{2} (R_{+}, d μ_{α})} + t {∥ {(- l_{α})}^{\frac{1}{2}} g ∥}_{L^{2} (R_{+}, d μ_{α})}), t > 0 . \end{matrix}

Define a modulus of smoothness as

\begin{matrix} ω_{l_{α}^{\frac{1}{2}}} {(f, t)}_{L^{2} (R_{+}, d μ_{α})} = {∥ (T_{l_{α}^{\frac{1}{2}}} (t) - I) f ∥}_{L^{2} (R_{+}, d μ_{α})}, t > 0, \end{matrix}

where

T_{l_{α}^{\frac{1}{2}}} (t) f (x) = \int_{R_{+}} e^{- λ t} F_{B}^{(α)} (f) (λ) j_{α} (λ x) d μ_{α} (λ) .

Then we have the following two corollaries.

Corollary 3.

There holds the equivalent relation

\begin{matrix} D_{l_{α}^{\frac{1}{2}}} {(f, t)}_{L^{2} (R_{+}, d μ_{α})} \sim ω_{l_{α}^{\frac{1}{2}}} {(f, t)}_{L^{2} (R_{+}, d μ_{α})}, f \in L^{2} (R_{+}, d μ_{α}), t > 0 . \end{matrix}

Corollary 4.

There is a constant

C > 0

such that

\begin{matrix} I {(f, γ)}_{L^{2} (R_{+}, d μ_{α})} \leq C ω_{l_{α}^{\frac{1}{2}}} {(f, \frac{{∥ f ∥}_{L^{2} (R_{+}, d μ_{α})}}{γ})}_{L^{2} (R_{+}, d μ_{α})}, f \in L^{2} (R_{+}, d μ_{α}) . \end{matrix}

(42)

We give further computations for

T_{l_{α}^{\frac{1}{2}}} (t) f (x)

. By Example 1, we know

F_{B}^{(α)} (p_{t}) (λ) = e^{- λ t}

, which, together with (21), gives

\begin{matrix} T_{l_{α}^{\frac{1}{2}}} (t) f (x) & = & \int_{R_{+}} F_{B}^{(α)} (p_{t}) (λ) F_{B}^{(α)} (f) (λ) j_{α} (λ x) d μ_{α} (λ) \\ = & \int_{R_{+}} F_{B}^{(α)} (f *_{B} p_{t}) (λ) j_{α} (λ x) d μ_{α} (λ) \\ = & (f *_{B} p_{t}) (x), x \in R_{+}, \end{matrix}

which with (42) shows that

\begin{matrix} ω_{l_{α}^{\frac{1}{2}}} {(f, t)}_{L^{2} (R_{+}, d μ_{α})} = {∥ (f *_{B} p_{t}) - f ∥}_{L^{2} (R_{+}, d μ_{α})}, t > 0 . \end{matrix}

(43)

Substituting (43) into (42) gives

\begin{matrix} I {(f, γ)}_{L^{2} (R_{+}, d μ_{α})} \leq C {∥ (f *_{B} p_{t}) - f ∥}_{L^{2} (R_{+}, d μ_{α})} |_{t = \frac{{∥ f ∥}_{L^{2} (R_{+}, d μ_{α})}}{γ}}, f \in L^{2} (R_{+}, d μ_{α}) . \end{matrix}

(44)

Inequality (44) shows that the decay of

I {(f, γ)}_{L^{2} (R_{+}, d μ_{α})}

is controlled by the approximation order of convolution operator

f *_{B} p_{t}

for

t = \frac{{∥ f ∥}_{L^{2} (R_{+}, d μ_{α})}}{γ}

.

For

F_{B}^{(α)} (ϕ) (λ) = \frac{1}{λ^{4}}

we define

\begin{matrix} H_{K^{♯} (ϕ)} & = & L_{K^{♯} (ϕ)}^{\frac{1}{2}} (A_{*} (R)) \\ & = & \{g \in A_{*} (R) : {(\int_{R_{+}} λ^{4} {|F_{B}^{(α)} (g) (λ)|}^{2} d μ_{α} (λ))}^{\frac{1}{2}} < + \infty\} . \end{matrix}

(45)

Then

\begin{matrix} {∥ g ∥}_{K^{♯} (ϕ)} & = & {(\int_{R_{+}} λ^{4} {|F_{B}^{(α)} (g) (λ)|}^{2} d μ_{α} (λ))}^{\frac{1}{2}} \\ & = & {∥ (- l_{α}) g ∥}_{L^{2} (R_{+}, d μ_{α})} . \end{matrix}

(46)

Define a K-functional on

L^{2} (R_{+}, d μ_{α})

as

\begin{matrix} D_{l_{α}} {(f, t)}_{L^{2} (R_{+}, d μ_{α})} = inf_{g \in H_{K^{♯} (ϕ)}} ({∥ f - g ∥}_{L^{2} (R_{+}, d μ_{α})} + t {∥ (- l_{α}) g ∥}_{L^{2} (R_{+}, d μ_{α})}), t > 0 . \end{matrix}

Define a modulus of smoothness as

\begin{matrix} ω_{l_{α}} {(f, t)}_{L^{2} (R_{+}, d μ_{α})} = {∥ (T_{l_{α}} (t) - I) f ∥}_{L^{2} (R_{+}, d μ_{α})}, t > 0, \end{matrix}

where

T_{l_{α}} (t) f (x) = \int_{R_{+}} e^{- λ^{2} t} F_{B}^{(α)} (f) (λ) j_{α} (λ x) d μ_{α} (λ) .

Then we have the following two corollaries.

Corollary 5.

There holds

\begin{matrix} D_{l_{α}} {(f, t)}_{L^{2} (R_{+}, d μ_{α})} \sim ω_{l_{α}} {(f, t)}_{L^{2} (R_{+}, d μ_{α})}, f \in L^{2} (R_{+}, d μ_{α}), t > 0 . \end{matrix}

Corollary 6.

There is a constant

C > 0

such that

\begin{matrix} I {(f, γ)}_{L^{2} (R_{+}, d μ_{α})} \leq C ω_{l_{α}} {(f, \frac{{∥ f ∥}_{L^{2} (R_{+}, d μ_{α})}}{γ})}_{L^{2} (R_{+}, d μ_{α})}, f \in L^{2} (R_{+}, d μ_{α}) . \end{matrix}

(47)

Additionally, by Example 2, we know

F_{B}^{(α)} (k_{t}) (λ) = e^{- λ^{2} t}

, which, together with (21), gives

\begin{matrix} T_{l_{α}} (t) f (x) & = & \int_{R_{+}} F_{B}^{(α)} (k_{t}) (λ) F_{B}^{(α)} (f) (λ) j_{α} (λ x) d μ_{α} (λ) \\ = & \int_{R_{+}} F_{B}^{(α)} (f *_{B} k_{t}) (λ) j_{α} (λ x) d μ_{α} (λ) \\ = & (f *_{B} k_{t}) (x), x \in R_{+}, \end{matrix}

which, with (47), shows that

\begin{matrix} ω_{l_{α}} {(f, t)}_{L^{2} (R_{+}, d μ_{α})} = {∥ (f *_{B} k_{t}) - f ∥}_{L^{2} (R_{+}, d μ_{α})}, t > 0 . \end{matrix}

(48)

Substituting (48) into (47), we have

\begin{matrix} I {(f, γ)}_{L^{2} (R_{+}, d μ_{α})} \leq C {∥ (f *_{B} k_{t}) - f ∥}_{L^{2} (R_{+}, d μ_{α})} |_{t = \frac{{∥ f ∥}_{L^{2} (R_{+}, d μ_{α})}}{γ}}, f \in L^{2} (R_{+}, d μ_{α}) . \end{matrix}

(49)

We know by (49) that the decay of

I {(f, γ)}_{L^{2} (R_{+}, d μ_{α})}

is controlled by the approximation order of the convolution operator

f *_{B} k_{t}

for

t = \frac{{∥ f ∥}_{L^{2} (R_{+}, d μ_{α})}}{γ} .
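For a concrete f, the modulus in (48) can be evaluated in two ways and compared. The sketch below is a hypothetical illustration: it takes f = k_s from Example 2, so that f *_B k_t = k_{s+t} by the semigroup property stated there, and the names `modulus_spatial` and `modulus_frequency`, the truncation point and the node count are assumptions. On the frequency side it uses the Plancherel identity (26) together with the elementary integral \int_{R_+} e^{-cλ^2} dμ_α(λ) = (2c)^{-(α+1)}, which follows from the Gamma function integral.

```python
import math

def simpson(func, a, b, n=4000):
    """Composite Simpson rule on [a, b]; n must be even."""
    h = (b - a) / n
    total = func(a) + func(b)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * func(a + i * h)
    return total * h / 3.0

def weight(alpha, x):
    """Density of d mu_alpha with respect to dx."""
    return x ** (2.0 * alpha + 1.0) / (2.0 ** alpha * math.gamma(alpha + 1.0))

def k(t, alpha, x):
    """Gaussian-type kernel k_t of Example 2."""
    return math.exp(-x * x / (4.0 * t)) / (2.0 * t) ** (alpha + 1.0)

def modulus_spatial(alpha, s, t, upper=40.0):
    """Norm of k_s *_B k_t - k_s = k_{s+t} - k_s, integrated numerically in the x variable."""
    integrand = lambda x: (k(s + t, alpha, x) - k(s, alpha, x)) ** 2 * weight(alpha, x)
    return math.sqrt(simpson(integrand, 0.0, upper))

def modulus_frequency(alpha, s, t):
    """Same norm via Plancherel: integral of (exp(-(s+t) lam^2) - exp(-s lam^2))^2 d mu_alpha."""
    p = -(alpha + 1.0)
    val = (4.0 * (s + t)) ** p - 2.0 * (2.0 * (2.0 * s + t)) ** p + (4.0 * s) ** p
    return math.sqrt(max(val, 0.0))

alpha, s = 0.5, 1.0
for t in (0.5, 0.1, 0.01):
    print(t, modulus_spatial(alpha, s, t), modulus_frequency(alpha, s, t))
```

Both columns agree up to quadrature error and decrease as t decreases, which is the behaviour that (49) translates into a decay rate for the best approximation.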

5. Proofs

Proof of Lemma 1.

Formula (11) follows from the orthonormality of

{j_{α}^{*} (λ_{i} x)}_{i = 1}^{+ \infty}

. Formula (13) can be found in [11] or Lemma 1 in [12]. Formula (14) can be found in [16]. □

Proof of Lemma 2.

Proof of (i). See Proposition 2.III.1 on p. 51 of [22].

Proof of (ii). See Corollary 4.III.2 on p. 104 and Corollary 4.III.3 on p. 105 of [22].

Proof of (iii). See Theorem 5.III.1 on p. 127 and Proposition 5.II.4 on p. 129 of [22].

Proof of (iv). See Theorem 5.III.1 on p. 127 and (5.III.3) on p. 128 of [22].

Proof of (v). See Proposition 5.II.2 on p. 120 of [22] and (4.III.10) in Proposition 4.III.4 of [22].

Proof of (vi). See Theorem 5.II.2 on p. 126 of [22].

Proof of (vii). See (5.III.5) and (5.III.6) in Proposition 5.III.2 on p. 128 and (5.V.2) on p. 139 of [22], and Proposition 2.2 in [25].

Proof of (viii). Formula (27) can be found in (5.II.12) of Proposition 5.II.3 on p. 122 of [22]; (28) in (4.II.9) of Proposition 4.II.2 on p. 94 of [22]; (29) in (4.II.8) on p. 93 of [22]. □

Proof of Proposition 1.

We prove it with the help of Proposition A1 in Appendix A.

It is easy to see that

T_{K^{(α)}} (t)

satisfies (A1) and (A2). Simple computations show

\begin{matrix} E f (x) & = & lim_{t \to 0} \frac{T_{K^{(α)}} (t) f (x) - f (x)}{t} \\ = & \sum_{i = 1}^{+ \infty} a_{i} (f) lim_{t \to 0} \frac{(e^{- \frac{t}{\sqrt{h_{i}}}} - 1)}{t} j_{α}^{*} (λ_{i} x) \\ = & \sum_{i = 1}^{+ \infty} (- \frac{1}{\sqrt{h_{i}}}) a_{i} (f) j_{α}^{*} (λ_{i} x) \end{matrix}

and

\begin{matrix} t E T_{K^{(α)}} (t) f (x) = \sum_{i = 1}^{+ \infty} (- \frac{t}{\sqrt{h_{i}}}) e^{- \frac{t}{\sqrt{h_{i}}}} a_{i} (f) j_{α}^{*} (λ_{i} x) . \end{matrix}

It follows

\begin{matrix} ∥ t E T_{K^{(α)}} {(t) f ∥}_{L_{α}^{2}} & = & {(\sum_{i = 1}^{+ \infty} {|(- \frac{t}{\sqrt{h_{i}}}) e^{- \frac{t}{\sqrt{h_{i}}}}|}^{2} a_{i}^{2} (f))}^{\frac{1}{2}} \\ \leq & {(\sum_{i = 1}^{+ \infty} a_{i}^{2} (f))}^{\frac{1}{2}} = {∥ f ∥}_{L_{α}^{2}} . \end{matrix}

(50)

Combining (50) and (A5), we obtain (38). □

Proof of Theorem 1.

Because

h_{i} \to 0^{+} (i \to + \infty),

defining

\begin{matrix} f_{μ}^{(α)} (x) = \sum_{\frac{1}{h_{i}} < μ} a_{i} (f) j_{α}^{*} (λ_{i} x), \end{matrix}

(51)

we have for any

g \in H_{K^{(α)}}

that

\begin{matrix} f (x) - f_{μ}^{(α)} (x) = \sum_{\frac{1}{h_{i}} \geq μ} a_{i} (f) j_{α}^{*} (λ_{i} x) = \sum_{\frac{1}{h_{i}} \geq μ} a_{i} (f - g) j_{α}^{*} (λ_{i} x) + \sum_{\frac{1}{h_{i}} \geq μ} a_{i} (g) j_{α}^{*} (λ_{i} x) \end{matrix}

and

\begin{matrix} ∥ f - f_{μ}^{(α)} ∥_{L_{α}^{2}} & \leq & {(\sum_{\frac{1}{h_{i}} \geq μ} {|a_{i} (f - g)|}^{2})}^{\frac{1}{2}} + {(\sum_{\frac{1}{h_{i}} \geq μ} {|a_{i} (g)|}^{2})}^{\frac{1}{2}} \\ \leq & {∥ f - g ∥}_{L_{α}^{2}} + {(\sum_{\frac{1}{h_{i}} \geq μ} \frac{h_{i}}{h_{i}} {|a_{i} (g)|}^{2})}^{\frac{1}{2}} \\ \leq & {∥ f - g ∥}_{L_{α}^{2}} + \frac{1}{\sqrt{μ}} {(\sum_{\frac{1}{h_{i}} \geq μ} \frac{1}{h_{i}} {| a_{i} (g) |}^{2})}^{\frac{1}{2}} \\ \leq & {∥ f - g ∥}_{L_{α}^{2}} + \frac{1}{\sqrt{μ}} {∥ g ∥}_{K^{(α)}} . \end{matrix}

(52)

Since

g \in H_{K^{(α)}}

is arbitrary, we have

\begin{matrix} ∥ f - f_{μ}^{(α)} ∥_{L_{α}^{2}} \leq inf_{g \in H_{K^{(α)}}} ({∥ f - g ∥}_{L_{α}^{2}} + \frac{1}{\sqrt{μ}} {∥ g ∥}_{K^{(α)}}) . \end{matrix}

(53)

Take

h_{μ}^{(α)} (x) = \sum_{\frac{1}{h_{i}} < μ} \frac{a_{i} (f)}{\sqrt{h_{i}}} j_{α}^{*} (λ_{i} x)

. Then

f_{μ}^{(α)} (x) = L_{K^{(α)}}^{\frac{1}{2}} (h_{μ}^{(α)}, x) \in H_{K^{(α)}}

and

\begin{matrix} {∥ f_{μ}^{(α)} ∥}_{K^{(α)}} & = & {∥ h_{μ}^{(α)} ∥}_{L_{α}^{2}} \\ & = & {(\sum_{\frac{1}{h_{i}} < μ} \frac{{|a_{i} (f)|}^{2}}{h_{i}})}^{\frac{1}{2}} \\ & \leq & \sqrt{μ} {(\sum_{\frac{1}{h_{i}} < μ} {|a_{i} (f)|}^{2})}^{\frac{1}{2}} \leq \sqrt{μ} {∥ f ∥}_{L_{α}^{2}} . \end{matrix}

Take

\sqrt{μ} {∥ f ∥}_{L_{α}^{2}} = γ

. Then

\frac{1}{\sqrt{μ}} = \frac{{∥ f ∥}_{L_{α}^{2}}}{γ}

. By the definition of

I {(f, γ)}_{L_{α}^{2}}

, we have (39). □
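The two estimates used in this proof, the truncation bound (52) and the norm bound for the truncated function, can be checked numerically in coefficient space. The sketch below is only an illustration: the sequences h_i = i^{-2}, a_i(f) = 1/i and a_i(g) = i^{-3} and the value μ = 100 are hypothetical, and the RKHS norm is assumed to be the weighted coefficient norm (\sum_i |a_i(g)|^2 / h_i)^{1/2} appearing in (52).

```python
import math

def truncate(a, h, mu):
    """Coefficients of f_mu: keep a_i where 1/h_i < mu, set the rest to zero."""
    return [a_i if 1.0 / h_i < mu else 0.0 for a_i, h_i in zip(a, h)]

def l2_norm(c):
    """L^2 norm of a function given by its coefficients (Parseval)."""
    return math.sqrt(sum(x * x for x in c))

def rkhs_norm(c, h):
    """Assumed RKHS norm: (sum |a_i|^2 / h_i)^(1/2)."""
    return math.sqrt(sum(x * x / h_i for x, h_i in zip(c, h)))

n = 500
h = [1.0 / i ** 2 for i in range(1, n + 1)]      # assumed h_i -> 0+
a_f = [1.0 / i for i in range(1, n + 1)]         # assumed a_i(f)
a_g = [1.0 / i ** 3 for i in range(1, n + 1)]    # assumed g in the RKHS
mu = 100.0

f_mu = truncate(a_f, h, mu)
# Left and right sides of (52): ||f - f_mu|| <= ||f - g|| + mu^(-1/2) ||g||_K
lhs = l2_norm([x - y for x, y in zip(a_f, f_mu)])
rhs = l2_norm([x - y for x, y in zip(a_f, a_g)]) + rkhs_norm(a_g, h) / math.sqrt(mu)
# Norm bound: ||f_mu||_K <= sqrt(mu) ||f||
norm_bound_ok = rkhs_norm(f_mu, h) <= math.sqrt(mu) * l2_norm(a_f)
```

Choosing a different g changes only the right-hand side, which is why the infimum over g can be taken in (53).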

Proof of Proposition 2.

It is easy to see that

T_{K (ϕ)} (t)

satisfies (A1) and (A2). Simple computations show

\begin{matrix} E f (x) & = & lim_{t \to 0} \frac{T_{K (ϕ)} (t) f (x) - f (x)}{t} \\ = & lim_{t \to 0} \frac{\int_{R_{+}} (e^{- \frac{t}{\sqrt{F_{B}^{(α)} (ϕ) (λ)}}} - 1) F_{B}^{(α)} (f) (λ) j_{α} (λ x) d μ_{α} (λ)}{t} \\ = & \int_{R_{+}} (- \frac{1}{\sqrt{F_{B}^{(α)} (ϕ) (λ)}}) F_{B}^{(α)} (f) (λ) j_{α} (λ x) d μ_{α} (λ) \end{matrix}

and

\begin{matrix} (t E T_{K (ϕ)} (t) f) (x) & = & \int_{R_{+}} (- \frac{t}{\sqrt{F_{B}^{(α)} (ϕ) (λ)}}) e^{- \frac{t}{\sqrt{F_{B}^{(α)} (ϕ) (λ)}}} F_{B}^{(α)} (f) (λ) j_{α} (λ x) d μ_{α} (λ) . \end{matrix}

Since

f \in L^{2} (R_{+}, d μ_{α})

, we know by (26) that

F_{B}^{(α)} (f) (\cdot) \in L^{2} (R_{+}, d μ_{α})

. Additionally, since

\begin{matrix} |(- \frac{t}{\sqrt{F_{B}^{(α)} (ϕ) (λ)}}) e^{- \frac{t}{\sqrt{F_{B}^{(α)} (ϕ) (λ)}}}| \leq 1, \forall t \geq 0, \end{matrix}

we know

h_{t} (\cdot) = (- \frac{t}{\sqrt{F_{B}^{(α)} (ϕ) (\cdot)}}) e^{- \frac{t}{\sqrt{F_{B}^{(α)} (ϕ) (\cdot)}}} F_{B}^{(α)} (f) (\cdot) \in L^{2} (R_{+}, d μ_{α}) .

It is easy to see that

\begin{matrix} (t E T_{K (ϕ)} (t) f) (x) = F_{B}^{(α)} (h_{t}) (x) . \end{matrix}

It follows by (26) again that

\begin{matrix} {∥ t E T_{K (ϕ)} (t) f ∥}_{L^{2} (R_{+}, d μ_{α})} & = & {∥ F_{B}^{(α)} (h_{t}) ∥}_{L^{2} (R_{+}, d μ_{α})} \\ & = & {∥ h_{t} ∥}_{L^{2} (R_{+}, d μ_{α})} \\ & \leq & {∥ F_{B}^{(α)} (f) ∥}_{L^{2} (R_{+}, d μ_{α})} = {∥ f ∥}_{L^{2} (R_{+}, d μ_{α})} . \end{matrix}

(54)

By the same method, we have

\begin{matrix} ∥ T_{K (ϕ)} {(t) f ∥}_{L^{2} (R_{+}, d μ_{α})}^{2} & = & \int_{R_{+}} {(e^{- \frac{t}{\sqrt{F_{B}^{(α)} (ϕ) (λ)}}})}^{2} {|F_{B}^{(α)} (f) (λ)|}^{2} d μ_{α} (λ) \\ \leq & \int_{R_{+}} {|F_{B}^{(α)} (f) (λ)|}^{2} d μ_{α} (λ) = {∥ f ∥}_{L^{2} (R_{+}, d μ_{α})}^{2} . \end{matrix}

(55)

Collecting (54), (55) and (A6), we have (40). □

Proof of Theorem 2.

Define

ℜ_{μ, λ} = {λ \in R_{+} : \frac{1}{F_{B}^{(α)} (ϕ) (λ)} < μ}

and

f_{*} (x) = \int_{ℜ_{μ, λ}} F_{B}^{(α)} (f) (λ) j_{α} (λ x) d μ_{α} (λ) .

Then

\begin{matrix} f (x) - f_{*} (x) = \int_{R_{+} \ ℜ_{μ, λ}} F_{B}^{(α)} (f) (λ) j_{α} (λ x) d μ_{α} (λ) . \end{matrix}

It follows that for any

g \in H_{K (ϕ)}

, there holds

\begin{matrix} f (x) - f_{*} (x) \\ = & \int_{R_{+} \ ℜ_{μ, λ}} F_{B}^{(α)} (f - g) (λ) j_{α} (λ x) d μ_{α} (λ) + \int_{R_{+} \ ℜ_{μ, λ}} F_{B}^{(α)} (g) (λ) j_{α} (λ x) d μ_{α} (λ) . \end{matrix}

Denote the characteristic function of

R_{+} \ ℜ_{μ, λ}

by

χ_{R_{+} \ ℜ_{μ, λ}} (λ)

. Then

\begin{matrix} f (x) - f_{*} (x) \\ = & \int_{R_{+}} χ_{R_{+} \ ℜ_{μ, λ}} (λ) F_{B}^{(α)} (f - g) (λ) j_{α} (λ x) d μ_{α} (λ) \\ + \int_{R_{+}} χ_{R_{+} \ ℜ_{μ, λ}} (λ) F_{B}^{(α)} (g) (λ) j_{α} (λ x) d μ_{α} (λ) \\ = & F_{B}^{(α)} (g_{μ}) (x) + F_{B}^{(α)} (b_{μ}) (x), \end{matrix}

(56)

where

g_{μ} (λ) = χ_{R_{+} \ ℜ_{μ, λ}} (λ) F_{B}^{(α)} (f - g) (λ), b_{μ} (λ) = χ_{R_{+} \ ℜ_{μ, λ}} (λ) F_{B}^{(α)} (g) (λ) .

Since

ϕ

satisfies Assumption 1, by (30) we know

g_{μ} \in D_{*} (R) \subset A_{*} (R) \subset L^{2} (R_{+}, d μ_{α})

. By (26), we have

\begin{matrix} ∥ F_{B}^{(α)} (g_{μ}) ∥_{L^{2} (R_{+}, d μ_{α})} = {∥ g_{μ} ∥}_{L^{2} (R_{+}, d μ_{α})} . \end{matrix}

(57)

By the same method, we have

\begin{matrix} ∥ F_{B}^{(α)} (b_{μ}) ∥_{L^{2} (R_{+}, d μ_{α})} = {∥ b_{μ} ∥}_{L^{2} (R_{+}, d μ_{α})} . \end{matrix}

(58)

It follows from (56), (57) and (58) that

\begin{matrix} ∥ f - f_{*} ∥_{L^{2} (R_{+}, d μ_{α})} \\ \leq & ∥ χ_{R_{+} \ ℜ_{μ, λ}} (\cdot) F_{B}^{(α)} {(f - g) (\cdot) ∥}_{L^{2} (R_{+}, d μ_{α})} + {∥ χ_{R_{+} \ ℜ_{μ, λ}} (\cdot) F_{B}^{(α)} (g) (\cdot) ∥}_{L^{2} (R_{+}, d μ_{α})} \end{matrix}

\begin{matrix} = & {(\int_{R_{+} \ ℜ_{μ, λ}} {|F_{B}^{(α)} (f - g) (λ)|}^{2} d μ_{α})}^{\frac{1}{2}} + {(\int_{R_{+} \ ℜ_{μ, λ}} {|F_{B}^{(α)} (g) (λ)|}^{2} d μ_{α})}^{\frac{1}{2}} \\ \leq & {(\int_{R_{+}} {|F_{B}^{(α)} (f - g) (λ)|}^{2} d μ_{α})}^{\frac{1}{2}} + {(\int_{R_{+} \ ℜ_{μ, λ}} {|F_{B}^{(α)} (g) (λ)|}^{2} d μ_{α})}^{\frac{1}{2}} . \end{matrix}

By (26) and the definition of

ℜ_{μ, λ}

, we have

\begin{matrix} {∥ f - f_{*} ∥}_{L^{2} (R_{+}, d μ_{α})} \\ \leq & {∥ f - g ∥}_{L^{2} (R_{+}, d μ_{α})} + {(\int_{R_{+} \ ℜ_{μ, λ}} \frac{F_{B}^{(α)} (ϕ) (λ)}{F_{B}^{(α)} (ϕ) (λ)} {|F_{B}^{(α)} (g) (λ)|}^{2} d μ_{α})}^{\frac{1}{2}} \\ \leq & {∥ f - g ∥}_{L^{2} (R_{+}, d μ_{α})} + {(\sup_{λ \in R_{+} \ ℜ_{μ, λ}} F_{B}^{(α)} (ϕ) (λ))}^{\frac{1}{2}} {(\int_{R_{+} \ ℜ_{μ, λ}} \frac{{|F_{B}^{(α)} (g) (λ)|}^{2}}{F_{B}^{(α)} (ϕ) (λ)} d μ_{α})}^{\frac{1}{2}} \\ \leq & {∥ f - g ∥}_{L^{2} (R_{+}, d μ_{α})} + {(\sup_{λ \in R_{+} \ ℜ_{μ, λ}} F_{B}^{(α)} (ϕ) (λ))}^{\frac{1}{2}} {(\int_{R_{+}} \frac{{|F_{B}^{(α)} (g) (λ)|}^{2}}{F_{B}^{(α)} (ϕ) (λ)} d μ_{α})}^{\frac{1}{2}} \\ \leq & {∥ f - g ∥}_{L^{2} (R_{+}, d μ_{α})} + \frac{1}{\sqrt{μ}} {∥ g ∥}_{H_{K (ϕ)}} . \end{matrix}

Because of the arbitrariness of

g \in H_{K (ϕ)}

, we have

\begin{matrix} ∥ f - f_{*} ∥_{L^{2} (R_{+}, d μ_{α})} \leq inf_{g \in H_{K (ϕ)}} ({∥ f - g ∥}_{L^{2} (R_{+}, d μ_{α})} + \frac{1}{\sqrt{μ}} {∥ g ∥}_{H_{K (ϕ)}}) . \end{matrix}

(59)

Let

h_{*} (x) = \int_{ℜ_{μ, λ}} \frac{F_{B}^{(α)} (f) (λ) j_{α} (λ x)}{\sqrt{F_{B}^{(α)} (ϕ) (λ)}} d μ_{α}

. Then by (20) we have

h_{*} \in L^{2} (R_{+}, d μ_{α})

and

f_{*} (x) = L_{K (ϕ)}^{\frac{1}{2}} (h_{*}, x) = \int_{ℜ_{μ, λ}} F_{B}^{(α)} (f) (λ) j_{α} (λ x) d μ_{α} (λ) .

Therefore,

f_{*} \in H_{K (ϕ)}

. It follows that

\begin{matrix} ∥ f_{*} ∥_{K (ϕ)} & = & ∥ h_{*} ∥_{L^{2} (R_{+}, d μ_{α})} \\ = & {(\int_{ℜ_{μ, λ}} \frac{| F_{B}^{(α)} {(f) (λ) |}^{2}}{F_{B}^{(α)} (ϕ) (λ)} d μ_{α})}^{\frac{1}{2}} \\ \leq & \sqrt{μ} ∥ F_{B}^{(α)} {(f) ∥}_{L^{2} (R_{+}, d μ_{α})} = \sqrt{μ} {∥ f ∥}_{L^{2} (R_{+}, d μ_{α})} . \end{matrix}

(60)

Take

\sqrt{μ} {∥ f ∥}_{L^{2} (R_{+}, d μ_{α})} = γ

. Then

\sqrt{μ} = \frac{γ}{{∥ f ∥}_{L^{2} (R_{+}, d μ_{α})}}

. Collecting (60) and (59), together with the definition of

I {(f; γ)}_{L^{2} (R_{+}, d μ_{α})}

we arrive at

\begin{matrix} I {(f; γ)}_{L^{2} (R_{+}, d μ_{α})} & \leq & inf_{g \in H_{K (ϕ)}} ({∥ f - g ∥}_{L^{2} (R_{+}, d μ_{α})} + \frac{{∥ f ∥}_{L^{2} (R_{+}, d μ_{α})}}{γ} {∥ g ∥}_{H_{K (ϕ)}}) \\ = & D {(f, \frac{{∥ f ∥}_{L^{2} (R_{+}, d μ_{α})}}{γ})}_{L^{2} (R_{+}, d μ_{α})} \\ \sim & ω {(f, \frac{{∥ f ∥}_{L^{2} (R_{+}, d μ_{α})}}{γ})}_{L^{2} (R_{+}, d μ_{α})} . \end{matrix}

□
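The last step of the chain of estimates leading to (59) uses only the definition of ℜ_{μ, λ}. As a worked step, assuming that F_{B}^{(α)} (ϕ) (λ) > 0 on R_{+} (which the quotients in the proof already presuppose), the maximum (more precisely, the supremum) over the complement is controlled as follows:

```latex
λ \in R_{+} \setminus ℜ_{μ, λ}
\;\Longrightarrow\; \frac{1}{F_{B}^{(α)}(ϕ)(λ)} \geq μ
\;\Longrightarrow\; F_{B}^{(α)}(ϕ)(λ) \leq \frac{1}{μ},
\qquad\text{hence}\qquad
\Big(\sup_{λ \in R_{+} \setminus ℜ_{μ, λ}} F_{B}^{(α)}(ϕ)(λ)\Big)^{\frac{1}{2}}
\leq \frac{1}{\sqrt{μ}} .
```

The same definition, read on ℜ_{μ, λ} itself, gives 1 / F_{B}^{(α)} (ϕ) (λ) < μ there, which is the inequality used in (60).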

6. Further Discussions

We now give some comments on the results obtained in the present paper.

A more general problem arising from learning theory is to bound the decay rate of the function (see [2])

\begin{matrix} I (a, R) = \inf_{{∥ b ∥}_{H} \leq R} ∥ a - b ∥, a \in B, R \to + \infty, \end{matrix}

(61)

where

(B, ∥ \cdot ∥)

is a Banach space and

(H, ∥ \cdot ∥_{H})

is a dense subspace with

∥ b ∥ \leq {∥ b ∥}_{H}

for

b \in H .

It is known that the approximation ability of a function class is determined by the smoothness of its functions. So the decay of

I (a, R)

is influenced by the smoothness of the functions in

H .

Smale and Zhou (see [2]) give the first estimate for the decay of (61) in the case that

a \in {(B, H)}_{θ, \infty}

, which is a particular Besov space (in fact, it is the interpolation space of B and H). This work was improved in [9]. For

B = H^{s} (R^{d}) (s > 0)

(the Sobolev space, see [2] for the definition) and the reproducing kernel Hilbert space

H = H_{K_{σ}}

, Zhou gives the estimate (see [3])

\begin{matrix} \inf_{{∥ g ∥}_{K_{σ}} \leq R} ∥ f - g ∥ \leq B_{d, s} {(\log R)}^{- s} \end{matrix}

(62)

if

R \geq A {∥ f ∥}_{L^{2} (R^{d})}

, where

K_{σ}

is the Gaussian kernel

K_{σ} (x, y) = exp {- \frac{{∥ x - y ∥}^{2}}{σ^{2}}}, x, y \in {[0, 1]}^{d}, σ > 0 .

The tool used is RKHS function interpolation.
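To make the objects in (62) concrete, the following sketch evaluates the Gaussian kernel K_σ and the rate factor (\log R)^{-s}. It is only an illustration: the function names, the sample points, σ = 0.5 and s = 2 are hypothetical, and the constant B_{d, s} from (62) is omitted because its value is not given here.

```python
import math

def gaussian_kernel(x, y, sigma):
    """K_sigma(x, y) = exp(-||x - y||^2 / sigma^2) for points given as sequences."""
    sq_dist = sum((xi - yi) ** 2 for xi, yi in zip(x, y))
    return math.exp(-sq_dist / sigma ** 2)

def log_rate(R, s):
    """The factor (log R)^(-s) from (62), without the constant B_{d,s}."""
    return math.log(R) ** (-s)

# Kernel values on [0, 1]^2: identical points and opposite corners.
k_same = gaussian_kernel([0.2, 0.7], [0.2, 0.7], sigma=0.5)
k_far = gaussian_kernel([0.0, 0.0], [1.0, 1.0], sigma=0.5)

# The rate decays only logarithmically as the radius R grows.
rates = [log_rate(R, s=2.0) for R in (1e1, 1e3, 1e6)]
```

Even when R grows by five orders of magnitude, the factor shrinks slowly, which reflects the logarithmic nature of the bound (62).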

The most commonly used tool in approximation theory is the K-functional, and the most helpful relation is the strong equivalence between a K-functional and a corresponding modulus of smoothness (see, for example, [26]). The approximation ability of a function class is most commonly described by a Jackson inequality expressed with a K-functional or a modulus of smoothness (see also [26]). As far as we know from the literature, no Jackson inequality has been established for the decay of (6), and there is little description of the smoothness of an RKHS. Recent research shows that any RKHS has some smoothness: viewed through fractional derivatives and orthogonal series, the well-known K-functional ([27])

\begin{matrix} D_{H_{K}} {(f, λ)}_{L_{ρ_{X}}^{2} (X)} = \inf_{g \in H_{K}} (∥ f - g ∥ + λ {∥ g ∥}_{H_{K}}), λ > 0, \end{matrix}

(63)

is equivalent to a modulus of smoothness, where X is chosen as some compact sets, for example,

X = S^{d - 1} = {x \in R^{d} : ∥ x ∥ = 1}

and

X = B^{d} = {x \in R^{d} : ∥ x ∥ \leq 1}

. It is valuable to extend these results to RKHSs defined on a noncompact set. The set X used in the present paper is

X = R^{1}

, which is noncompact and has properties essentially different from those of a compact set (see, for example, [5]). Moreover, this is the first time that a Jackson inequality has been established to describe the decay of (6). An advantage of this manuscript is the use of Bessel series and Bessel transforms, which transforms the RKHS approximation problem into the classical Fourier–Bessel approximation problem and gives the decay rate with Fourier–Bessel approximation techniques.

The Jackson inequalities in Theorems 1 and 2 show that the RKHSs constructed with Bessel series and Bessel transforms have the same approximation ability as the Bessel series and Bessel transforms themselves.

The moduli of smoothness defined in this manuscript are first-order moduli. It would be valuable to define higher-order moduli of smoothness and establish the corresponding Jackson inequality to describe the decay of (6).

Author Contributions

Formal analysis, B.S.; Investigation, S.W.; Writing—original draft, M.T. All authors have read and agreed to the published version of the manuscript.

Funding

This work is partially supported by the NSF (Project No. 61877039), the NSFC/RGC Joint Research Scheme (Project No. 12061160462 and N_-CityU102/20) of China, the NSF (Project No. LY19F020013) of Zhejiang Province, and the Science and Technology Project in Jiangxi Province Department of Education (Project No. GJJ211334).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

It is known that the moduli of smoothness defined by a semi-group of operators have the same properties as those of the usual moduli of smoothness defined by the difference of the function (see Chapter Two of [28]) and have been used to describe the degree of approximation in approximation theory (see, for example, [27,29,30,31,32]). We restate here a proposition for a general strong equivalent relation.

Let

(B, ∥ \cdot ∥_{B})

be a normed linear space,

{\{T (t) : (B, {∥ \cdot ∥}_{B}) \to (B, {∥ \cdot ∥}_{B})\}}_{t > 0}

be a strongly continuous semi-group of operators satisfying

\begin{matrix} T (s + t) = T (s) T (t), lim_{t \to 0^{+}} T (t) = I, \end{matrix}

(A1)

and

\begin{matrix} {∥ T (t) f ∥}_{B} \leq {∥ f ∥}_{B}, f \in B, t > 0 . \end{matrix}

(A2)

The infinitesimal generator E is given by

\begin{matrix} E f = \lim_{t \to 0^{+}} \frac{T (t) f - f}{t} \ (\text{in } B), \end{matrix}

(A3)

whenever the limit exists.

D (E)

is the domain of E. Then we have the following proposition.

Proposition A1.

(Theorem 5.1 of [33]) Let

T (t)

satisfy (A1), (A2) and (A3),

\begin{matrix} T (t) f \in D (E) \text{ for all } f \in B, \end{matrix}

(A4)

and there exists a positive constant N independent of t and

T (t)

such that

\begin{matrix} t {∥ E T (t) ∥}_{B} \leq N \ (N \text{ is a constant independent of } t), \quad E T (t) : B \to B \text{ for } t \geq 0 . \end{matrix}

(A5)

Then for

r \in N

and

t > 0

, there holds

\begin{matrix} ω_{r} {(f, t)}_{B} = {∥ {(T (t) - I)}^{r} f ∥}_{B} \sim inf_{g \in D (E^{r})} ({∥ f - g ∥}_{B} + t^{r} {∥ E^{r} g ∥}_{B}) = K_{E^{r}} {(f, t^{r})}_{B}, \end{matrix}

(A6)

where

\begin{matrix} {(T (s) - I)}^{r} f = \sum_{k = 1}^{r} \binom{r}{k} {(- 1)}^{r - k} T (k s) f + {(- 1)}^{r} f . \end{matrix}
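The expansion of (T (s) - I)^{r} combines the binomial theorem with the semigroup property T (s)^{k} = T (k s) from (A1). A minimal sketch, assuming that T (t) acts on a single eigenvector as multiplication by e^{- t λ} (the scalar λ and the function names are hypothetical, chosen only for illustration), compares the two sides numerically:

```python
import math

def difference_direct(lam, s, r):
    """(T(s) - I)^r on an eigenvector: the scalar (e^{-s*lam} - 1)^r."""
    return (math.exp(-s * lam) - 1.0) ** r

def difference_expanded(lam, s, r):
    """Same scalar via sum_{k=1}^r C(r,k) (-1)^(r-k) T(ks) + (-1)^r."""
    total = (-1.0) ** r
    for k in range(1, r + 1):
        total += math.comb(r, k) * (-1.0) ** (r - k) * math.exp(-k * s * lam)
    return total

checks = [
    (difference_direct(lam, s, r), difference_expanded(lam, s, r))
    for lam in (0.5, 2.0)
    for s in (0.1, 1.0)
    for r in (1, 2, 3, 4)
]
```

For a general f the same identity holds coefficient by coefficient whenever T (t) is diagonal in an orthonormal basis, as is the case for the semigroups used in Propositions 1 and 2.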

References

1. Cucker, F.; Smale, S. On the mathematical foundations of learning. Bull. Amer. Math. Soc. 2001, 39, 1–49.
2. Smale, S.; Zhou, D.X. Estimating the approximation error in learning theory. Anal. Appl. 2003, 1, 17–41.
3. Zhou, D.X. Density problem and approximation error in learning theory. Abstr. Appl. Anal. 2013, 715683.
4. Aronszajn, N. Theory of reproducing kernels. Trans. Amer. Math. Soc. 1950, 68, 337–404.
5. Sun, H.W. Mercer theorem for RKHS on noncompact sets. J. Complex. 2005, 21, 337–349.
6. Cucker, F.; Zhou, D.X. Learning Theory: An Approximation Theory Viewpoint; Cambridge University Press: Cambridge, UK, 2007.
7. Ferreira, J.C.; Menegatto, V.A. Reproducing kernel Hilbert spaces associated with kernels on topological spaces. Funct. Anal. Appl. 2012, 46, 89–91.
8. Williams, D. Probability with Martingales; Cambridge University Press: Cambridge, UK, 1990.
9. Ye, P.X. Some Approximation Problems in Learning Theory, Post-Doctoral Research Work Report; Chinese Academy of Sciences: Beijing, China, 2003.
10. Sun, H.W. Behavior of a functional in learning theory. Front. Math. China 2007, 2, 455–465.
11. Abilov, V.A.; Abilova, F.V. Approximation of functions by Fourier-Bessel sums. Izv. Vyssh. Uchebn. Zaved. Math. 2001, 8, 3–9.
12. Abilov, V.A.; Abilova, F.V.; Kerimov, M.K. Some issues concerning approximation of functions by Fourier-Bessel sums. Comput. Math. Math. Phys. 2013, 53, 867–873.
13. Abilov, V.A.; Abilova, F.V.; Kerimov, M.K. Sharp estimates for the convergence rate of Fourier-Bessel series. Comput. Math. Math. Phys. 2015, 55, 907–916.
14. Abilov, V.A.; Abilova, F.V.; Kerimov, M.K. On sharp estimates of the convergence of double Fourier-Bessel series. Comput. Math. Math. Phys. 2017, 57, 1735–1740.
15. Abilov, V.A.; Kerimov, M.K. Some estimates for the error in mixed Fourier-Bessel expansions of functions of two variables. Comput. Math. Math. Phys. 2006, 46, 1465–1486.
16. Hochstadt, H. The mean convergence of Fourier-Bessel series. SIAM Rev. 1967, 9, 211–218.
17. Arteaga, C.; Marrero, I. Universal approximation by radial basis function networks of Delsarte translates. Neural Netw. 2013, 46, 299–305.
18. Arteaga, C.; Marrero, I. Approximation in weighted p-mean by RBF networks of Delsarte translates. J. Math. Anal. Appl. 2014, 414, 450–460.
19. Dai, F.; Wang, H.P. Interpolation by weighted Paley-Wiener spaces associated with the Dunkl transform. J. Math. Anal. Appl. 2012, 390, 556–572.
20. Marrero, I. Radial basis function neural networks of Hankel translates as universal approximation. Anal. Appl. 2019, 17, 897–930.
21. Vladimirov, V.S. Equations of Mathematical Physics; Marcel Dekker: New York, NY, USA, 1971.
22. Triméche, K. Generalized Harmonic Analysis and Wavelet Packets; Gordon and Breach Science Publishers: Singapore, 2001.
23. Sheng, B.H. The weighted norm for some Mercer kernel matrices. Acta Math. Sci. 2013, 33A, 6–15. (In Chinese)
24. Sheng, B.H.; Zuo, L. Error analysis of the kernel regularized regression based on refined convex losses and RKBSs. Int. J. Wavelets Multiresolut. Inform. Process 2021, 19, 2150012.
25. Quadih, S.E.; Daher, R. Estimates for the generalized Fourier-Bessel transform in the space $L_{α, n}^{2}$. Internat. J. Math. Model Comput. 2016, 6, 269–275.
26. Ditzian, Z.; Totik, V. Moduli of Smoothness; Springer: New York, NY, USA, 1987.
27. Sheng, B.H.; Wang, J.L. On the K-functional in learning theory. Anal. Appl. 2020, 18, 423–446.
28. Butzer, P.L.; Berens, H. Semi-Group of Operators and Approximation; Springer: New York, NY, USA, 1967.
29. Dai, F.; Ditzian, Z. Strong converse inequality for Poisson sums. Proc. Amer. Math. Soc. 2005, 133, 2609–2611.
30. Dai, F.; Ditzian, Z. Cesaro summability and Marchaud inequality. Constr. Approx. 2007, 25, 73–88.
31. Ditzian, Z. New moduli of smoothness on the unit ball and other domains, introduction and main properties. Constr. Approx. 2014, 40, 1–36.
32. Ditzian, Z. New moduli of smoothness on the unit ball, applications and computability. J. Approx. Theory 2014, 180, 49–76.
33. Ditzian, Z.; Ivanov, K.G. Strong converse inequalities. J. Anal. Math. 1993, 61, 61–111.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
