The Schwartz Space: Tools for Quantum Mechanics and Infinite Dimensional Analysis

Becnel, Jeremy; Sengupta, Ambar

doi:10.3390/math3020527

Open AccessArticle

The Schwartz Space: Tools for Quantum Mechanics and Infinite Dimensional Analysis

by

Jeremy Becnel

^1,* and

Ambar Sengupta

²

¹

Department of Mathematics, Stephen F. Austin State University, PO Box 13040 SFA Station, Nacogdoches, TX 75962, USA

²

Department of Mathematics, Louisiana State University, Baton Rouge, LA 70803, USA

^*

Author to whom correspondence should be addressed.

Mathematics 2015, 3(2), 527-562; https://doi.org/10.3390/math3020527

Submission received: 17 April 2015 / Accepted: 3 June 2015 / Published: 16 June 2015

(This article belongs to the Special Issue Mathematical physics)

Download Versions Notes

Abstract

:

An account of the Schwartz space of rapidly decreasing functions as a topological vector space with additional special structures is presented in a manner that provides all the essential background ideas for some areas of quantum mechanics along with infinite-dimensional distribution theory.

Keywords:

quantum mechanics; Schwartz space; test functions

1. Introduction

In 1950–1951, Laurent Schwartz published a two volumes work Théorie des Distributions [1,2], where he provided a convenient formalism for the theory of distributions. The purpose of this paper is to present a self-contained account of the main ideas, results, techniques, and proofs that underlie the approach to distribution theory that is central to aspects of quantum mechanics and infinite dimensional analysis. This approach develops the structure of the space of Schwartz test functions by utilizing the operator

T = - \frac{d^{2}}{d x^{2}} + \frac{x^{2}}{4} + \frac{1}{2} .

(1)

This operator arose in quantum mechanics as the Hamiltonian for a harmonic oscillator and, in that context as well as in white noise analysis, the operator N = T − 1 is called the number operator. The physical context provides additional useful mathematical tools such as creation and annihilation operators, which we examine in detail.

In this paper we include under one roof all the essential necessary notions of this approach to the test function space

S (R^{n})

. The relevant notions concerning topological vector spaces are presented so that the reader need not wade through the many voluminous available works on this subject. We also describe in brief the origins of the relevant notions in quantum mechanics.

We present

the essential notions and results concerning topological vector spaces;
a detailed analysis of the creation operator C, the annihilation operator A, the number operator N and the harmonic oscillator Hamiltonian T;
a detailed account of the Schwartz space $S (R^{d})$ , and its topology, as a decreasing intersection of subspaces $S_{p} (R^{d})$ , for p ∈ {0, 1, 2, …}:

$S (R^{d}) = \underset{p \geq 0}{\cap} S_{p} (R^{d}) \subset \dots \subset S_{2} (R^{d}) \subset S_{1} (R^{d}) \subset L^{2} (R^{d})$
an exact characterization of the functions in the space $S_{p} (R)$ ;
summary of notions from spectral theory and quantum mechanics;

Our exposition of the properties of T and of

S (R)

follows Simon’s paper [3], but we provide more detail and our notational conventions are along the lines now standard in infinite-dimensional distribution theory.

The classic work on spaces of smooth functions and their duals is that of Schwartz [1,2]. Our purpose is to present a concise and coherent account of the essential ideas and results of the theory. Of the results that we discuss, many can be found in other works such as [1–6], which is not meant to be a comprehensive list. We have presented portions of this material previously in [7], but also provide it here for convenience. The approach we take has a direct counterpart in the theory of distributions over infinite dimensional spaces [8,9].

2. Basic Notions and Framework

In this section we summarize the basic notions, notation, and results that we discuss in more detail in later sections. Here, and later in this paper, we work mainly with the case of functions of one variable and then describe the generalization to the multi-dimensional case.

We use the letter W to denote the set of all non-negative integers:

W = {0, 1, 2, 3, \dots} .

(2)

2.1. The Schwartz Space

The Schwartz space

S (R)

is the linear space of all functions f : R → C which have derivatives of all orders and which satisfy the condition

p_{a, b} (f) \overset{def}{=} \sup_{x \in R} | x^{a} f^{b} (x) | < \infty

for all a, b ∈ W = {0, 1, 2, …}. The finiteness condition for all a ≥ 1 and b ∈ W, implies that x^af^b(x) actually goes to 0 as |x| → ∞, for all a, b ∈ W, and so functions of this type are said to be rapidly decreasing.

2.2. The Schwartz Topology

The functions p_a,b are semi-norms on the vector space

S (R)

, in the sense that

p_{a, b} (f + g) \leq p_{a, b} (f) + p_{a, b} (g)

and

p_{a, b} (z f) = | z | p_{a, b} (f)

for all

f, g \in S (R)

, and z ∈ C. For this semi-norm, an open ball of radius r centered at some

f \in S (R)

is given by

B_{p}_{_{a, b}} (f; r) = {g \in S (R) : p_{a, b} (g - f) < r} .

(3)

Thus each p_a,b specifies a topology τ_a,b on

S (R)

. A set is open according to τ_a,b if it is a union of open balls.

One way to generate the standard Schwartz topology τ on

S (R)

is to "combine" all the topologies τ_a,b. We will demonstrate how to generate a "smallest" topology containing all the sets of τ_a,b for all a, b ∈ W. However, there is a different approach to the topology on

S (R)

that is very useful, which we describe in detail below.

2.3. The Operator T

The operator

T = - \frac{d^{2}}{d x^{2}} + \frac{x^{2}}{4} + \frac{1}{2}

(4)

plays a very useful role in working with the Schwartz space. As we shall see, there is an orthonormal basis

{ϕ_{n}}_{n \in W}

of L²(R,dx), consisting of eigenfunctions ϕ_n of T:

T ϕ_{n} = (n + 1) ϕ_{n} .

(5)

The functions ϕ_n, called the Hermite functions are actually in the Schwartz space

S (R)

. Let B be the bounded linear operator on L²(R) given on each f ∈ L²(R) by

B f = \sum_{n \in W} {(n + 1)}^{- 1} 〈 f, ϕ_{n} 〉 ϕ_{n} .

(6)

It is readily checked that the right side does converge and, in fact,

{‖ B f ‖}_{L^{2}}^{2} = \sum_{n \in W} {(n + 1)}^{- 2} {| 〈 f, ϕ_{n} 〉 |}_{L^{2}}^{2} \leq \sum_{n \in W} {| 〈 f, ϕ_{n} 〉 |}_{L^{2}}^{2} = {‖ f ‖}_{L^{2}}^{2} .

(7)

Note that B and T are inverses of each other on the linear span of the vectors ϕ_n:

T B f = f and B T f = f for all f \in ℒ,

(8)

where

ℒ = linear span of the vector ϕ_{n}, for n \in W .

(9)

2.4. The L² Approach

For any p ≥ 0, the image of B^p consists of all f ∈ L²(R) for which

\sum_{n \in W} {(n + 1)}^{2 p} {| 〈 f, ϕ_{n} 〉 |}^{2} < \infty .

Let

S_{p} (R) = B^{p} (L^{2} (R)) .

(10)

This is a subspace of L²(R), and on

S_{p} (R)

there is an inner-product 〈·, ·〉_p given by

{〈 f, g 〉}_{p} \overset{def}{=} \sum_{n \in W} {(n + 1)}^{2 p} 〈 f, ϕ_{n} 〉 〈 ϕ_{n}, g 〉 = {〈 B^{- p} f, B^{- p} g 〉}_{L^{2}}

(11)

which makes it a Hilbert space, having

ℒ

, and hence also

S (R)

, a dense subspace. We will see later that functions in

S_{p} (R)

are p-times differentiable.

We will prove that the intersection

\cap_{p}_{\in W} S_{p} (R)

is exactly equal to S(R). In fact,

S (R) = \underset{p \in W}{\cap} S_{p} (R) \subset \dots S_{2} (R) \subset S_{1} (R) \subset S_{0} (R) = L^{2} (R) .

(12)

We will also prove that the topology on

S (R)

generated by the norms ║·║_p coincides with the standard topology. Furthermore, the elements

{(n + 1)}^{- p} ϕ_{n} \in S_{p} (R)

form an orthonormal basis of

S_{p} (R)

, and

\sum_{n \in W} {‖ {(n + 1)}^{- (p + 1)} ϕ_{n} ‖}_{p}^{2} = \sum_{n \geq 1} \frac{n^{2 p}}{n^{2 (p + 1)}} < \infty,

showing that the inclusion map

S_{p}_{+ 1} (R) \to S_{p} (R)

is Hilbert-Schmidt.

The topological vector space

S (R)

has topology generated by a complete metric [10], and has a countable dense subset given by all finite linear combinations of the vectors ϕ_n with rational coefficients.

2.5. Coordinatization as a Sequence Space

All of the results described above follow readily from the identification of

S (R)

with a space of sequences. Let {ϕ_n}_n_∈_W be the orthonormal basis of L²(R) mentioned above, where W = {0, 1, 2, …}. Then we have the set C^W; an element a ∈ C^W is a map W → C : n ⟼a_n. So we shall often write such an element a as (a_n)_n_∈_W.

We have then the coordinatizing map

I : L^{2} (R) \to C^{W} : f \mapsto {(〈 f, ϕ_{n} 〉)}_{n \in W} .

(13)

For each p ∈ W let E_p be the subset of C^W consisting of all (a_n)_n_∈_W such that

\sum_{n \in W} {(n + 1)}^{2 p} {| a_{n} |}^{2} < \infty .

On E_p define the inner-product 〈·, ·〉_p by

{〈 a, b 〉}_{p} = \sum_{n \in W} {(n + 1)}^{2 p} a_{n} {\bar{b}}_{n} .

(14)

This makes E_p a Hilbert space, essentially the Hilbert space L²(W, µ_p), where µ_p is the measure on W given by µ_p({n}) = (n + 1)²^p for all n ∈ W.

The definition, Equation (10), for

S_{p} (R)

shows that it is the set of all f ∈ L²(R) for which I(f) belongs to E_p.

We will prove in Theorem 16 that I maps

S (R)

exactly onto

E \overset{def}{=} \underset{p \in W}{\cap} E_{p} .

(15)

This will establish essentially all of the facts mentioned above concerning the spaces

S_{p} (R)

.

Note the chain of inclusions:

E = \underset{p \in W}{\cap} E_{p} \subset \dots \subset E_{2} \subset E_{1} \subset E_{0} = L^{2} (W, μ_{0}) .

(16)

2.6. The Multi-Dimensional Setting

In the multidimensional setting, the Schwartz space

S (R^{d})

consists of all infinitely differentiable functions f on R^d for which

\sup_{x \in R^{d}} | x_{1}^{k_{1}} \dots x_{d}^{k_{d}} \frac{\partial^{m_{1} + \dots + m_{k}} f (x)}{\partial x_{1}^{m_{1}} \dots \partial x_{d}^{m_{d}}} | < \infty,

for all (k₁, …, k_d) ∈ W ^d and m = (m₁, …, m_d) ∈ W^d. For this setting, it is best to use some standard notation:

| k | = k_{1} + \dots + k_{d} for k = (k_{1}, \dots, k_{d}) \in W^{d}

(17)

x^{k} = x^{k_{1}} \dots x^{k_{d}}

(18)

D^{m} = \frac{\partial^{| m |}}{\partial x_{1}^{m_{1}} \dots \partial x_{d}^{m_{d}}} .

(19)

For the multi-dimensional case, we use the indexing set W ^d whose elements are d–tuples j = (j₁, …, j_d), with j₁, …, j_d ∈ W, and counting measure µ₀ on W^d. The sequence space is replaced by

C^{W^{d}}

; a typical element

a \in C^{W^{d}}

, is a map

a : W^{d} \to C : j = (j_{1}, \dots, j_{d}) \mapsto a_{j} = a_{j_{1} \dots j_{d}} .

(20)

The orthonormal basis (ϕ_n)_n_∈_W of L²(R) yields an orthonormal basis of L²(R^d) consisting of the vectors

ϕ_{j} = ϕ_{j_{1}} \oplus \dots \oplus ϕ_{j_{d}} : (x_{1}, \dots, x_{d}) \mapsto ϕ_{j_{1}} (x_{1}) \dots ϕ_{j_{d}} (x_{d}) .

(21)

The coordinatizing map I is replaced by the map

I_{d} : L^{2} (R^{d}) \to C^{W^{d}}

(22)

where

I_{d} {(f)}_{j} = {〈 f, ϕ_{j} 〉}_{L^{2} (R^{d})} .

(23)

Replace the operator T by

T_{d} \overset{def}{=} T^{\oplus d} = (- \frac{\partial^{2}}{\partial x_{d}^{2}} + \frac{x_{d}^{2}}{4} + 1) \dots (- \frac{\partial^{2}}{\partial x_{1}^{2}} + \frac{x_{1}^{2}}{4} + 1) .

(24)

Then

T_{d} ϕ_{j} = (j_{1} + 1) \dots (j_{d} + 1) ϕ_{j}

for all j ∈ W^d.

In place of B, we now have the bounded operator B_d on L²(R^d) given by

B_{d} f \sum_{j \in W^{d}} {[(j_{1} + 1) \dots (j_{d} + 1)]}^{- 1} 〈 f, ϕ_{j} 〉 ϕ_{j}

(25)

Again, T_d and B_d are inverses of each other on the linear subspace

ℒ

_d of L²(R^d) spanned by the vectors ϕ_j.

The space E_p is now the subset of

C^{W^{d}}

consisting of all

a \in C^{W^{d}}

for which

{‖ a ‖}_{p}^{2} \overset{def}{=} \sum_{j \in W^{d}} {[(j_{1} + 1) \dots (j_{d} + 1)]}^{2 p} {| a_{j} |}^{2} < \infty .

(26)

Thus,

E_{p} \overset{def}{=} {a \in C^{W^{d}} : {‖ a ‖}_{p}^{2} < \infty}

(27)

This is a Hilbert space with inner-product

{〈 a, b 〉}_{p} = \sum_{j \in W^{d}} {[(j_{1} + 1) \dots (j_{d} + 1)]}^{2 p} a_{j} {\bar{b}}_{j} .

(28)

Again we have the chain of spaces

E \overset{def}{=} \cap_{p \in W} E_{p} \subset \dots E_{2} \subset E_{1} \subset E_{0} = L^{2} (W^{d}, μ_{0}),

(29)

with the inclusion E_p₊₁ → E_p being Hilbert-Schmidt.

To go back to functions on R^d, define

S_{p} (R^{d})

to be the range of B_d. Thus

S_{p} (R^{d})

is the set of all f ∈ L²(R^d) for which

\sum_{j \in W^{d}} {[(j_{1} + 1) \dots (j_{d} + 1)]}^{2 p} {| 〈 f, ø_{j} 〉 |}^{2} < \infty

The inner-product 〈·, ·〉_p comes back to an inner-product, also denoted 〈·, ·〉_p, on

S_{p} (R^{d})

and is given by

{〈 f, g 〉}_{p} = {〈 B_{d}^{- p} f, B_{d}^{- p} g 〉}_{L^{2} (R^{d})} .

(30)

The intersection

\cap_{p \in W} S_{p} (R^{d})

equals

S (R^{d})

. Moreover, the topology on

S (R^{d})

is the smallest one generated by the inner-products obtained from 〈·, ·〉_p, with p running over W.

3. Topological Vector Spaces

The Schwartz space is a topological vector space, i.e., it is a vector space equipped with a Hausdorff topology with respect to which the vector space operations (addition, and multiplication by scalar) are continuous. In this section we shall go through a few of the basic notions and results for topological vector spaces.

Let V be a real vector space. A vector topology τ on V is a topology such that addition V × V → V : (x, y) ⟼ x + y and scalar multiplication R × V → V : (t, x) ⟼ tx are continuous. If V is a complex vector space we require that C × V → V : (α, x) ⟼ αx be continuous.

It is useful to observe that when V is equipped with a vector topology, the translation maps

t_{x} : V \to V : y \mapsto y + x

are continuous, for every x ∈ V, and are hence also homeomorphisms since

t_{x}^{- 1} = t_{- x}

.

A topological vector space is a vector space equipped with a Hausdorff vector topology. A local base of a vector topology τ is a family of open sets {U_α}_α_∈_I containing 0 such that if W is any open set containing 0 then W contains some U_α. If U is any open set and x any point in U then U − x is an open neighborhood of 0 and hence contains some U_α, and so U itself contains a neighborhood x + U_α of x:

I f U i s o p e n a n d x \in U t h e n x + U_{α} \subset U, f o r s o m e α \in I .

(31)

Doing this for each point x of U, we see that each open set is the union of translates of the local base sets U_α.

3.1. Local Convexity and the Minkowski Functional

A vector topology τ on V is locally convex if for any neighborhood W of 0 there is a convex open set B with 0 ∈ B ⊂ W. Thus, local convexity means that there is a local base of the topology τ consisting of convex sets. The principal consequence of having a convex local base is the Hahn-Banach theorem which guarantees that continuous linear functionals on subspaces of V extend to continuous linear functionals on all of V. In particular, if V ≠ {0} is locally convex then there exist non-zero continuous linear functionals on V.

Let B be a convex open neighborhood of 0. Continuity of R × V → V : (s, x) ⟼ sx at s = 0 shows that for each x the multiple sx lies in B if s is small enough, and so t⁻¹x lies in B if t is large enough. The smallest value of t for which t⁻¹x is just outside B is clearly a measure of how large x is relative to B. The Minkowski functional µ_B is the function on V given by

μ_{B} (x) = \inf {t > 0 : t^{- 1} x \in B} .

Note that 0 ≤ µ_B(x) < ∞. The definition of µ_B shows that µ_B(kx) = kµ_B(x) for any k ≥ 0. Convexity of B can be used to show that

μ_{B} (x + y) \leq μ_{B} (x) + μ_{B} (y) .

If B is symmetric, i.e., B = −B, then µ_B(kx) = |k|µ_B(x) for all real k. If V is a complex vector space and B is balanced in the sense that αB = B for all complex numbers α with |α| = 1, then µ_B(kx) = |k|µ_B(x) for all complex k. Note that in general it could be possible that µ_B(x) is 0 without x being 0; this would happen if B contains the entire ray {tx : t ≥ 0}.

3.2. Semi-Norms

A typical vector topology on V is specified by a semi-norm on V, i.e., a function µ : V → R such that

μ (x + y) \leq μ (x) + μ (y), μ (t x) = | t | μ (x)

(32)

for all x, y ∈ V and t ∈ R (complex t if V is a complex vector space). Note that then, using t = 0, we have µ(0) = 0 and, using −x for y, we have µ(x) ≥ 0. For such a semi-norm, an open ball around x is the set

B_{μ} (x; r) = {y \in V : μ (y - x) < r},

(33)

and the topology τ_µ consists of all sets which can be expressed as unions of open balls. These balls are convex and so the topology τ_µ is locally convex. If µ is actually a norm, i.e., µ(x) is 0 only if x is 0, then τ_µ is Hausdorff.

A consequence of the triangle inequality Equation (32) is that a semi-norm µ is uniformly continuous with respect to the topology it generates. This follows from the inequality

| μ (x) - μ (y) | \leq μ (x - y),

(34)

which implies that µ, as a function on V, is continuous with respect to the topology τ_µ it generates. Now suppose µ is continuous with respect to a vector topology τ. Then the open balls {y ∈ V : µ(y − x) < r} are open in the topology τ and so τ_µ ⊂ τ.

3.3. Topologies Generated by Families of Topologies

Let {τ_α}_α_∈_I be a collection of topologies on a space. It is natural and useful to consider the the least upper bound topology τ, i.e., the smallest topology containing all sets of ∪_α_∈_Iτ_α. In our setting, we work with each τ_α a vector topology on a vector space V.

Theorem 1. The least upper bound topology τ of a collection {τ_α}_α_∈_I of vector topologies is again a vector topology. If

{W_{α}_{, i}}_{i}_{\in I_{α}}

is a local base for τ_α then a local base for τ is obtained by taking all finite intersections of the form

W_{α}_{_{1}, i_{1}} \cap \dots \cap W_{α_{n}}_{, i_{n}}

.

Proof. Let B be the collection of all sets which are of the form

W_{α}_{_{1}, i_{1}} \cap \dots \cap W_{α_{n}}_{, i_{n}}

.

Let τ′ be the collection of all sets which are unions of translates of sets in B (including the empty union). Our first objective is to show that τ′ is a topology on V. It is clear that τ′ is closed under unions and contains the empty set. We have to show that the intersection of two sets in τ′ is in τ′. To this end, it will suffice to prove the following:

\begin{array}{l} If C_{1} and C_{2} are sets in B, and x is a point in \\ the intersection of the translates a + C_{1} and b + C_{2}, \\ then x + C \subset (a + C_{1}) \cap (b + C_{2}) for some C in B . \end{array}

(35)

Clearly, it suffices to consider finitely many topologies τ_α. Thus, consider vector topologies τ₁, …, τ_n on V.

Let

B_{n}

be the collection of all sets of the form B₁ ∩⋯∩ B_n with B_i in a local base for τ_i, for each i ∈ {1, …, n}. We can check that if D,

D^{'} \in B_{n}

then there is an

E \in B_{n}

with E ⊂ D ∩ D′.

Working with B_i drawn from a given local base for τ_i, let z be a point in the intersection B₁∩⋯∩B_n. Then there exist sets

{B^{'}}_{i}

, with each

{B^{'}}_{i}

being in the local base for τ_i, such that

z + {B^{'}}_{i} \subset B_{i}

(this follows from our earlier observation Equation (31)). Consequently,

z + \cap_{i = 1}^{n} {B^{'}}_{i} \subset \cap_{i = 1}^{n} B_{i} .

Now consider sets C₁ an C₂, both in

B_{n}

. Consider a, b ∈ V and suppose x ∈ (a + C₁) ∩ (b + C₂). Then since x − a ∈ C₁ there is a set

{C^{'}}_{1} \in B_{n}

with

x - a + {C^{'}}_{1} \subset C_{1}

; similarly, there is a

{C^{'}}_{2} \in B_{n}

with

x - b + {C^{'}}_{2} \subset C_{2}

. So

x + {C^{'}}_{1} \subset a + C_{1}

and

x + {C^{'}}_{1} \subset a + C_{1}

. So

x + C \subset (a + C_{1}) \cap (b + C_{2}),

where

C \in B_{n}

satisfies C ⊂ C₁ ∩ C₂. This establishes Equation (35), and shows that the intersection of two sets in τ′ is in τ′.

Thus τ′ is a topology. The definition of τ′ makes it clear that τ′ contains each τ_α. Furthermore, if any topology σ contains each τ_α then all the sets of τ′ are also open relative to σ. Thus τ′ = τ, the topology generated by the topologies τ_α.

Observe that we have shown that if W ∈ τ contains 0 then W ⊃ B for some

B \in B

.

Next we have to show that τ is a vector topology. The definition of τ shows that τ is translation invariant, i.e., translations are homeomorphisms. So, for addition, it will suffice to show that addition V × V ⟼V : (x, y) ⟼x + y is continuous at (0, 0). Let W ∈ τ contain 0. Then there is a

B \in B

with 0 ∈ B ⊂ W. Suppose B = B₁ ∩⋯∩ B_n, where each B_i is in the given local base for τ_i. Since τ_i is a vector topology, there are open sets D_i,

{D^{'}}_{i} \in τ_{i}

, both containing 0, with

D_{i} + {D^{'}}_{i} \subset B_{i}

. Then choose C_i,

{C^{'}}_{i}

in the local base for τ_i with C_i ⊂ D_i and

{C^{'}}_{i} \subset {D^{'}}_{i}

. Then

C_{i} + {C^{'}}_{i} \subset B_{i}

. Now let C = C₁∩⋯∩C_n, and

C^{'} = {C^{'}}_{1} \cap \dots \cap {C^{'}}_{n}

. Then C,

C^{'} \in B

and

C + C^{'} \subset B

. Thus, addition is continuous at (0, 0).

Now consider the multiplication map R × V → V : (t, x) ⟼tx. Let (s, y), (t, x) ∈ R × V. Then

s y - t x = (s - t) x + t (y - x) + (s - t) (y - x) .

Suppose F ∈ τ contains tx. Then

F \supset t x + W^{'},

for some

W^{'} \in B

. Using continuity of the addition map

V \times V \times V \to V : (a, b, c) \mapsto a + b + c

at (0, 0, 0), we can choose

W_{1}, W_{2}, W_{3} \in B

with W₁ + W₂ + W₃ ⊂ W′. Then we can choose

W \in B

, such that

W \subset W_{1} \cap W_{2} \cap W_{3} .

Then

W \in B

and

W + W + W \subset W^{'} .

Suppose W = B₁ ∩⋯∩ B_n, where each B_i is in the given local base for the vector topology τ_i. Then for s close enough to t, we have (s − t)x ∈ B_i for each i, and hence (s − t)x ∈ W. Similarly, if y is τ–close enough to x then t(y − x) ∈ W. Lastly, if s − t is close enough to 0 and y is close enough to x then (s − t)(y − x) ∈ W. So sy − tx ∈ W′, and so sy ∈ F, when s is close enough to t and y is τ–close enough to x. □

The above result makes it clear that if each τ_α has a convex local base then so is τ. Note also that if at least one τ_α is Hausdorff then so is τ.

A family of topologies {τ_α}_α_∈_I is directed if for any α, β ∈ I there is a γ ∈ I such that τ_α ∪ τ_β ⊂ τ_γ. In this case every open neighborhood of 0 in the generated topology contains an open neighborhood in one of the topologies τ_γ.

3.4. Topologies Generated by Families of Semi-Norms

We are concerned mainly with the topology τ generated by a family of semi-norms {µ_α}_α_∈_I; this is the smallest topology containing all sets of

\cup_{α \in I} τ_{μ_{α}}

. An open set in this topology is a union of translates of finite intersections of balls of the form

B_{µ_{i}} (0; r_{i})

. Thus, any open neighborhood of f contains a set of the form

B_{µ_{1}} (f; r_{1}) \cap \dots \cap B_{µ_{n}} (f; r_{n}) .

This topology is Hausdorff if for any non-zero x ∈ V there is some norm µ_α for which µ_α(x) is not zero.

The description of the neighborhoods in the topology τ shows that a sequence f_n converges to f with respect to τ if and only if µ_α(f_n − f) → 0, as n → ∞, for all α ∈ I.

We will need to examine when two families of semi-norms give rise to the same topology:

Theorem 2. Let τ be the topology on V generated by a family of semi-norms

ℳ

= {µ_i}_i_∈_I, and τ′ the topology generated by a family of semi-norms

ℳ^{'} = {{µ^{'}}_{j}}_{j}_{\in J}

. Suppose each µ_i is bounded above by a linear combination of the

{µ^{'}}_{j}

. Then τ ⊂ τ′.

Proof. Let µ ∈ M. Then there exist

{µ^{'}}_{j}, \dots, \in ℳ^{'}

, and real numbers c₁, …, c_n > 0, such that

μ \leq c_{1} {μ^{'}}_{1} + \dots + c_{n} {μ^{'}}_{n} .

Now consider any x, y ∈ V. Then

| μ (x) - μ (y) | \leq μ (x - y) \leq \sum_{i = 1}^{n} | c_{i} | {μ^{'}}_{i} (x - y) .

So µ is continuous with respect to the topology generated by

{µ^{'}}_{1}, \dots, {µ^{'}}_{n}

. Thus, τ_µ ⊂ τ′. Since this is true for all µ ∈

ℳ

, we have τ ⊂ τ′. □

3.5. Completeness

A sequence (x_n)_n_∈N in a topological vector space V is Cauchy if for any neighborhood U of 0 in V, the difference x_n − x_m lies in U when n and m are large enough. The topological vector space V is complete if every Cauchy sequence converges.

Theorem 3. Let {τ_α}_α_∈_I be a directed family of Hausdorff vector topologies on V, and τ the generated topology. If each τ_α is complete then so is τ.

Proof. Let (x_n)_n_≥1 be a sequence in V, which is Cauchy with respect to τ. Then clearly it is Cauchy with respect to each τ_α. Let x_α = lim_n_→_∞ x_n, relative to τ_α. If τ_α ⊂ τ_γ then the sequence (x_n)_n_≥1 also converges to x_γ relative to the topology τ_α, and so x_γ = x_α. Consider α, β ∈ I, and choose γ ∈ I such that τ_α ∪ τ_β ⊂ τ_γ. This shows that x_α = x_γ = x_β, i.e., all the limits are equal to each other. Let x denote the common value of this limit. We have to show that x_n → x in the topology τ. Let W ∈ τ contain x. Since the family {τ_α}_α_∈_I which generates τ is directed, it follows that there is a β ∈ I and a B_β ∈ τ_β with x ∈ B_β ⊂ W. Since (x_n)_n_≥1 converges to x with respect to τ_β, it follows x_n ∈ B_β for large n. So x_n → x with respect to τ. □

3.6. Metrizability

Suppose the topology τ on the topological vector space V is generated by a countable family of semi-norms µ₁, µ₂, …. For any x, y ∈ V define

d (x, y) = \sum_{n \geq 1} 2^{- n} d_{n} (x, y)

(36)

where

d_{n} (x, y) = \min {1, μ_{n} (x - y)} .

Then d is a metric, it is translation invariant, and generates the topology τ [10].

4. The Schwartz Space $S (R)$

Our objective in this section is to show that the Schwartz space is complete, in the sense that every Cauchy sequence converges. Recall that

S (R)

is the set of all C^∞ functions f on R for which

p_{a, b} (f) \overset{def}{=} {‖ f ‖}_{a, b} \sup_{x \in R} | x^{a} D^{b} f (x | < \infty

(37)

for all a, b ∈ W = {0, 1, 2, …}. The functions p_a,b are semi-norms, with ║ · ║_0,0, being just the sup-norm. Thus the family of semi-norms given above specify a Hausdorff vector topology on

S (R)

. We will call this the Schwartz topology on

S (R)

.

Theorem 4. The topology on

S (R)

generated by the family of semi-norms ║·║_a,b for all a, b ∈ {0, 1, 2, …}, is complete.

Proof. Let (f_n)_n_≥1 be a Cauchy sequence on

S (R)

. Then this sequence is Cauchy in each of the semi-norms ║·║_a,b, and so each sequence of functions x^aD^bf_n(x) is uniformly convergent. Let

g_{b} (x) = \lim_{n \to \infty} D^{b} f_{n} (x) .

(38)

Let f = g₀. Using a Taylor theorem argument it follows that g_b is D^bf. For instance, for b = 1, observe first that

f_{n} (y) = f_{n} (x) + \int_{0}^{1} \frac{d f_{n} ((1 - t) x + t y))}{d t} d t = f_{n} (x) + \int_{0}^{1} {f^{'}}_{n} ((1 - t) x + t y) (y - x) d t,

and so, letting n → ∞, we have

f (y) = f (x) + \int_{0}^{1} g_{1} ((1 - t) x + t y) (y - x) d t,

which implies that f′(x) exists and equals g₁(x).

In this way, we have x^aD^bf_n(x) → x^aD^bf(x) pointwise. Note that our Cauchy hypothesis implies that the sequence of functions x^aD^bf_n(x) is Cauchy in sup-norm, and so the convergence

x^{a} D^{b} f_{n} (x) \to x^{a} D^{b} f (x)

is uniform. In particular, the sup-norm of x^aD^bf(x) is finite, since it is the limit of a uniformly convergent sequence of bounded functions. Thus

f \in S (R)

.

Finally, we have to check that f_n converges to f in the topology of

S (R)

. We have noted above that x^aD^bf_n(x) → x^aD^bf(x) uniformly. Thus f_n → f relative to the semi-norm ║·║_a,b. Since this holds for every a, b ∈ {0, 1, 2, 3, …}, we have f_n → f in the topology of

S (R)

. □

Now let’s take a quick look at the Schwartz space

S (R^{d})

. First some notation. A multi-index a is an element of {0, 1, 2, …}^d, i.e., it is a mapping

a : {1, \dots, d} \to {0, 1, 2, \dots} : j \mapsto a_{j} .

If a is a multi-index, we write |a| to mean the sum a₁+⋯+a_d, x^a to mean the product

x_{1}^{a_{1}} \dots x_{d}^{a_{d}}

, and D^a to mean the differential operator

x_{x_{1}}^{a_{1}} \dots x_{x_{d}}^{a_{d}}

. The space

S (R^{d})

consists of all C^∞ functions f on Rd such that each function x^aD^bf(x) is bounded. On

S (R^{d})

we have the semi-norms

{‖ f ‖}_{a, b} = \sup_{x \in R^{d}} | x^{a} D^{b} f (x) |

for each pair of multi-indices a and b. The Schwartz topology on

S (R^{d})

is the smallest topology making each semi-norm ║·║_a,b continuous. This makes

S (R^{d})

a topological vector space.

The argument for the proof of the preceding theorem goes through with minor alterations and shows that:

Theorem 5. The topology on

S (R^{d})

generated by the family of semi-norms ║·║_a,b for all a, b ∈ {0, 1, 2, …}^d, is complete.

5. Hermite Polynomials, Creation and Annihilation Operators

We shall summarize the definition and basic properties of Hermite polynomials (our approach is essentially that of Hermite’s original [11]). We repeat for convenience of reference much of the presentation in Section 2.1 of [7].

A central role is played by the Gaussian kernel

p (x) = \frac{1}{\sqrt{2 π}} e^{- x^{2} / 2} .

(39)

Properties of translates of p are obtained from

e^{x y - \frac{y^{2}}{2}} = \frac{p (x - y)}{p (x)} .

(40)

Expanding the right side in a Taylor series we have

e^{x y - \frac{y^{2}}{2}} = \frac{p (x - y)}{p (x)} = \sum_{n = 0}^{\infty} \frac{1}{n!} H_{n} (x) y^{n},

(41)

where the Taylor coefficients, denoted H_n(x), are

H_{n} (x) = \frac{1}{p (x)} {(- \frac{d}{d x})}^{n} p (x) .

(42)

This is the n–th Hermite polynomial and is indeed an n–th degree polynomial in which xⁿ has coefficient 1, facts which may be checked by induction.

Observe the following

\begin{matrix} \int_{R} \frac{p (x - y)}{p (x)} \frac{p (x - z)}{p (x)} p (x) d x = e^{- \frac{y^{2} + z^{2}}{2}} \int_{R} e^{x (y + z)} p (x) d x \\ = e^{- \frac{y^{2} + z^{2}}{2} + \frac{{(y + z)}^{2}}{2}} \\ = e^{y z} . \end{matrix}

Going over to the Taylor series and comparing the appropriate Taylor coefficients (differentiation with respect to y and z can be carried out under the integral) we have

{〈 H_{n}, H_{m} 〉}_{L^{2} (p (x) d x)} = n! δ_{n m} .

(43)

Thus an orthonormal set of functions is given by

h_{n} (x) = \frac{1}{\sqrt{n!}} H_{n} (x) .

(44)

Because these are orthogonal polynomials, the n–th one being exactly of degree n, their span contains all polynomials. It can be shown that the span is in fact dense in L²(p(x)dx). Thus the polynomials above constitute an orthonormal basis of L²(p(x)dx).

Next, consider the derivative of H_n:

\begin{matrix} {H^{'}}_{n} (x) = {(- 1)}^{n} p {(x)}^{- 1} p^{(n + 1)} (x) - {(- 1)}^{n} p {(x)}^{- 1} p^{'} (x) p {(x)}^{- 1} p^{n} (x) \\ = - H_{n + 1} (x) + x H_{n} (x) . \end{matrix}

So

(- \frac{d}{d x} + x) h_{n} (x) = \sqrt{n + 1} h_{n + 1} (x) .

(45)

The operator

(- \frac{d}{d x} + x)

is called the creation operator in L²(R;p(x)dx).

Officially, we can take the creation operator to have domain consisting of all functions f which can be expanded in L²(p(x)dx) as ∑_n_≥0 a_nh_n, with each a_n a complex number, and satisfying the condition ∑_n_≥0(n + 1)|a_n|² < ∞; the action of the operator on f yields the function

\sum_{n \geq 0} \sqrt{n + 1} a_{n} h_{n + 1}

. This makes the creation operator unitarily equivalent to a multiplication operator (in the sense discussed later in subsection A.5) and hence a closed operator (see A.1 for definition). For the type of smooth functions f we will mostly work with, the effect of the operator on f will in fact be given by application of

(- \frac{d}{d x} + x)

to f.

Next, from the fundamental generating relation Equation (41) we have :

y e^{x y - y^{2} / 2} = \lim_{ϵ ↓ 0} \sum_{n \geq 0} \frac{1}{n!} \frac{1}{ϵ} [H_{n} (x + ϵ) - H_{n} (x)] y^{n} .

(46)

Using Equation (41) again on the left we have

\sum_{n \geq 1} \frac{1}{(n - 1)!} H_{n - 1} (x) y^{n} = \lim_{ϵ ↓ 0} \sum_{n \geq 0} \frac{1}{n!} \frac{1}{ϵ} [H_{n} (x + ϵ) - H_{n} (x)] y^{n} .

(47)

Letting y = 0 allows us to equate the n = 0 terms, and then, successively, the higher order terms. From this we see that

{H^{'}}_{n} (x) = n H_{n - 1} (x)

(48)

where H₋₁ = 0. Thus:

\frac{d}{d x} h_{n} (x) = \sqrt{n} h_{n - 1} (x) .

(49)

The operator

\frac{d}{d x}

is the annihilation operator in L²(R;p(x)dx). As with the creation operator, we may define it in a more specific way, as a closed operator on a specified domain.

6. Hermite Functions, Creation and Annihilation Operators

In the preceding section we studied Hermite polynomials in the setting of the Gaussian space L²(R;p(x)dx). Let us translate the concepts and results back to the usual space L²(R; dx).

To this end, consider the isomorphism:

U : L^{2} (R, p (x) d x) \to L^{2} (R, d x) : f \mapsto \sqrt{p} f .

(50)

Then the orthonormal basis polynomials h_n go over to the functions ϕ_n given by

ϕ_{n} (x) = {(- 1)}^{n} \frac{1}{\sqrt{n!}} {(2 π)}^{- 1 / 4} e^{x^{2} / 4} \frac{d^{n} e^{- x^{2} / 2}}{d x^{n}} .

(51)

The family {ϕ_n}_n_≥0 forms an orthonormal basis for L²(R, dx).

We now determine the annihilation and creation operators on L²(R, dx). If f ∈ L²(R, dx) is differentiable and has derivative f′ also in L²(R, dx), we have:

\begin{matrix} (U \frac{d}{d x} U^{- 1}) f (x) = \sqrt{p (x)} \frac{d}{d x} [p {(x)}^{- 1 / 2} f (x)] \\ = f^{'} (x) + p {(x)}^{1 / 2} (- 1 / 2) p {(x)}^{- 3 / 2} p^{'} (x) f (x) \\ = f^{'} (x) + \frac{1}{2} x f (x) . \end{matrix}

So, on L²(R, dx), the annihilator operator is

A = \frac{d}{d x} + \frac{1}{2} x

(52)

which will satisfy

A ϕ_{n} = \sqrt{n} ϕ_{n - 1}

(53)

where ϕ₋₁ = 0. For the moment, we proceed by taking the domain of A to be the Schwartz space

S (R)

.

Next,

\begin{matrix} (U (- \frac{d}{d x} + x) U^{- 1}) f (x) = f^{'} (x) + x f (x) - \frac{1}{2} x f (x) \\ = (- \frac{d}{d x} + \frac{1}{2} x) f (x) \end{matrix}

Thus the creation operator is

C = A^{*} = - \frac{d}{d x} + \frac{1}{2} x .

(54)

The reason we have written A* is that, as is readily checked, we have the adjoint relation

〈 A f, g 〉 = 〈 f, (- \frac{d}{d x} + \frac{1}{2} x) g 〉

(55)

with the inner-product being the usual one on L²(R, dx). Again, for the moment, we take the domain of C to be the Schwartz space

S (R)

(though, technically, in that case we should not write C as A^∗, since the latter, if viewed as the L²–adjoint operator, has a larger domain).

For this we have

C ϕ_{n} = \sqrt{n + 1} ϕ_{n + 1} .

(56)

Observe also that

A C = \frac{1}{4} x^{2} - \frac{d^{2}}{d x^{2}} + \frac{1}{2} I and C A = \frac{1}{4} x^{2} - \frac{d^{2}}{d x^{2}} - \frac{1}{2} I

(57)

which imply:

[A, C] = A C - C A = I, the identity .

(58)

Next observe that

C A ϕ_{n} = \sqrt{n} \sqrt{n} ϕ_{n} = n ϕ_{n}

(59)

and so CA is called the number operator N:

N = A * A = C A = - \frac{d^{2}}{d x^{2}} + \frac{x^{2}}{4} - \frac{1}{2} the n u m b e r o p e r a t e r .

(60)

As noted above in Equation (59), the number operator N has the eigenfunctions ϕ_n:

N ϕ_{n} = n ϕ_{n} .

(61)

Integration by parts (see Lemma 10) shows that

〈 f, g^{'} 〉 = - 〈 f^{'}, g 〉

for every

f, g \in S (R)

, and so also

〈 f, g^{″} 〉 = - 〈 f^{″}, g 〉 .

It follows that the operator N satisfies

〈 N f, g 〉 = 〈 f, N g 〉

(62)

for every

f, g \in S (R)

.

Now consider the case of R^d. For each j ∈ {1, …, d}, there are creation, annihilation, and number operators:

A_{j} = \frac{\partial}{\partial x_{j}} + \frac{1}{2} x_{j}, C_{j} = - \frac{\partial}{\partial x_{j}} + \frac{1}{2} x_{j}, N_{j} = C_{j} A_{j} .

(63)

These map

S (R^{d})

into itself and, as is readily verified, satisfy the commutation relations

[A_{j}, C_{k}] = δ_{j k} I, [N_{j}, A_{k}] = - δ_{j k} A_{j}, [N_{j}, C_{k}] = δ_{j k} C_{j} .

(64)

Now let us be more specific about the precise definition of the creation and annihilation operators. The basis {ϕ_n}_n≥0 for L²(R) yields an orthonormal basis

{ϕ_{m}}_{m}_{\in W^{d}}

of L²(R^d) given by

ϕ_{m} = ϕ_{m_{1}} (x_{1}) \dots ϕ_{m_{d}} (x_{d})

. For convenience we say ϕ_m=0 if some m_j <0. Given its effect on the orthonormal basis

{ϕ_{m}}_{m}_{\in W^{d}}

the operator C_k has the form:

ϕ_{m} \mapsto \sqrt{m_{k} + 1 ϕ_{m^{'}}},

where

{m^{'}}_{i} = m_{i}

for all i ∈ {1, …, d} except when i = k, in which case

{m^{'}}_{k} = m_{k} + 1

. The domain of C_k is the set

D (C_{k})

given by

D (C_{k}) = {f \in L^{2} (R) | \sum_{m \in W^{d}} (m_{k} + 1) {| a_{m} |}^{2} < \infty where a_{m} = 〈 f, ϕ_{m} 〉} .

The operator C_k is then officially defined by specifying its action on a typical element of its domain:

C_{k} (\sum_{m \in W^{d}} a_{m} ϕ_{m}) = \sum_{m \in W^{d}} a_{m} \sqrt{m_{m} + 1} ϕ_{m^{'}},

(65)

where m′ is as before. The operator C_k is essentially the composite of a multiplication operator and a bounded linear map taking ϕ_m → ϕ_m_′ where m′ is as defined above. (See subsection A.5 for precise formulation of a multiplication operator.) Noting this, it can be readily checked that C_k is a closed operator using the following argument: Let T be a bounded linear operator and M_h a multiplication operator (any closed operator will do); we show that the composite M_hT is a closed operator. Suppose x_n → x. Since T is a bounded linear operator, Tx_n → Tx. Now suppose also that M_h(Tx_n) → y. Since M_h is closed, it follows then that

T x \in D (M_{h})

and y = M_hTx.

The operators A_k and N_k are defined analogously.

Proposition 6. Let

ℒ

₀ be the vector subspace of L²(R^d) spanned by the basis vectors

{ϕ_{m}}_{m}_{\in W^{d}}

. Then for k ∈ {1, 2, …, d}, C_k|

ℒ

₀ and A_k|

ℒ

₀ have closures given by C_k and A_k, respectively (see subsection A.4 for the notion of closure).

Proof. We need to show that the graph of C_k, denoted Gr(C_k), is equal to the closure of the graph of C_k|

ℒ

₀, i.e., to

\bar{G r (C_{k} | ℒ_{0})}

(see to subsection A.1 for the notion of graph). It is clear that

G r (C_{k} | ℒ_{0}) \subseteq G r (C_{k})

. Using this and the fact that Ck is a closed operator, we have

\bar{G r (C_{k} | ℒ_{0})} \subseteq \bar{G r (C_{k})} = G r (C_{k}) .

Going in the other direction, take (f, C_kf) ∈ Gr(C_k). Now

\sum_{m \in W^{d}} a_{m} ϕ_{m}

where a_m = 〈f, ϕ_m〉. Let f_N be given by

f_{N} = \sum_{m \in W_{N}^{d}} a_{m} ϕ_{m} where W_{N}^{d} = {m \in W^{d} | 0 \leq m_{1} \leq N, \dots, 0 \leq m_{d} \leq N} .

Observe that f_N ∈

ℒ

₀. Moreover

\lim_{N \to \infty} f_{N} = f and \lim_{N \to \infty} C_{k} f_{N} = \lim_{N \to \infty} \sum_{m \in W_{N}^{d}} (m_{k} + 1) a_{m} ϕ_{m} = C_{k} f

in L²(R^d). Thus

(f, C_{k} f) \in \bar{G r (C_{k} | L_{0})}

and so we have

G r (C_{k}) \subseteq \bar{G r (C_{k} | L_{0})}

.

The proof for A_k follows similarly. □

Linking this new definition for C_k with our earlier formulas Equation (63) we have:

Proposition 7. If

f \in S (R^{d})

then

C_{k} f = - \frac{\partial f}{\partial x_{k}} + \frac{x_{k}}{2} f, a n d A_{k} f = \frac{\partial f}{\partial x_{k}} + \frac{x_{k}}{2} f .

Proof. Let

g = \frac{\partial f}{\partial x_{k}} + \frac{x_{k}}{2} f

. Since

f \in S (R^{d})

, we have g ∈ L²(R^d). So we can write g as

g = \sum_{j \in W^{d}} a_{j} ϕ_{j}

where a_j=〈g, ϕ_j〉. Let us examine these a_j’s more closely. Observe

\begin{matrix} a_{j} = 〈 g, ϕ_{j} 〉 = 〈 - \frac{\partial f}{\partial x_{k}} + \frac{x_{k}}{2} f, ϕ_{j} 〉 \\ = 〈 f, \frac{\partial ϕ_{j}}{\partial x_{k}} + \frac{x_{k}}{2} ϕ_{j} 〉 \\ = 〈 f, \sqrt{j k ϕ_{j^{″}}} 〉 \end{matrix}

where

{j^{″}}_{i} = j_{i}

for all i ∈ {1, …, d} except when i = k, in which case

{j^{″}}_{k} = j_{k} - 1

.

Bringing this information back to our expression for g we see that

\begin{matrix} g = \sum_{j \in W^{d}} \sqrt{j k 〈 f, ϕ_{j^{″}} 〉} ϕ_{j} \\ = \sum_{m \in W^{d}} \sqrt{m_{k} + 1 〈 f, ϕ_{m} 〉} ϕ_{m^{″}} where m^{″} is as defined above \\ = C_{k} f by (65) . \end{matrix}

The second equality is obtained by letting m = j″ and noting that ϕ_j_″ = 0 when

{j^{″}}_{k}

is −1. The proof follows similarly for A_k. □

7. Properties of the Functions in $S_{p} (R)$

Our aim here is to obtain a complete characterization of the functions in

S_{p} (R)

. We will prove that

S_{p} (R)

consists of all square-integrable functions f for which all derivatives f⁽^k⁾ exist for k ∈ {1, 2, …, p} and

\sup_{x \in R} | x^{a} f^{(b)} (x) | < \infty

for all a, b ∈ {0, 1, …, p − 1} with a + b ≤ p − 1.

A significant tool we will use is the Fourier transform:

\hat{f} (p) = ℱ f (p) = {(2 π)}^{- 1 / 2} \int_{R} e^{- i p x} f (x) d x .

(66)

This is meaningful whenever f is in L¹(R), but we will work mainly with f in

S (R)

. We will use the following standard facts:

$ℱ$ maps $S (R)$ onto itself and satisfies the Plancherel identity:

$\int_{R} {| f (x) |}^{2} d x = \int_{R} {| \hat{f} (p) |}^{2} d p$

(67)
for any $f \in S (R)$ ,

$f (x) = {(2 π)}^{- 1 / 2} \int_{R} e^{i p x} \hat{f} (p) d p$

(68)
if $f \in S (R)$ then

p \hat{f} (p) = - i ℱ (f^{'}) (p) .

(69)

Consequently, we have

\begin{array}{l} {‖ f ‖}_{\sup} \leq {(2 π)}^{- 1 / 2} \int_{R} | \hat{f} (p) | d p \\ = {(2 π)}^{- 1 / 2} \int_{R} {(1 + p^{2})}^{1 / 2} | \hat{f} (p) | {(1 + p^{2})}^{- 1 / 2} d p \\ = {(2 π)}^{- 1 / 2} {[\int_{R} (1 + p^{2}) {| \hat{f} (p) |}^{2} d p]}^{1 / 2} π^{1 / 2} by Cauchy-Schwartz \\ \leq 2^{- 1 / 2} [{‖ \hat{f} ‖}_{L^{2}} + {‖ p \hat{f} ‖}_{L^{2} (R, d p)}] \\ \leq 2^{- 1 / 2} [{‖ f ‖}_{L^{2}} + {‖ f^{'} ‖}_{L^{2}}] by Plancherel and Equation (69) . \end{array}

(70)

For the purposes of this section it is necessary to be precise about domains. So we take now A and C to be closed operators in L²(R), with common domain

D (C) = D (A) = {f \in L^{2} (R) | \sum_{n \geq 0} n {| 〈 f, ϕ_{n} 〉 |}^{2} < \infty}

and

C f = \sum_{n \geq 0} 〈 f, ϕ_{n} 〉 \sqrt{n + 1} ϕ_{n + 1}, A f = \sum_{n \geq 0} 〈 f, ϕ_{n} 〉 \sqrt{n + 1} ϕ_{n - 1} .

Moreover, define operators C₁ and A₁ on the common domain

D_{1} = all differentiable f \in L^{2} with f^{'} \in L^{2} and x f (x) \in L^{2} (d x)

(71)

and

C_{1} f (x) = [- \frac{d}{d x} + \frac{1}{2} x] f (x), A_{1} f (x) = [\frac{d}{d x} + \frac{1}{2} x] f (x) for all f \in D_{1} .

(72)

We will prove below that C and C₁ (and A and A₁) are, in fact, equal.

For a function

f = \sum_{n \geq 0} a_{n} ϕ_{n} \in L^{2} (R),

we will use the notation f_N for the partial sum:

f_{N} \sum_{n = 0}^{N} a_{n} ϕ_{n} .

Observe the following about the derivatives

{f^{'}}_{N}

:

Lemma 8. If

f \in D (C)

, then

{{f^{'}}_{N}}

is Cauchy in L²(R).

Proof. Note that

\begin{array}{l} {f^{'}}_{N} = (\frac{A - C}{2}) f_{N} . \\ So {‖ {f^{'}}_{N} - {f^{'}}_{M} ‖}_{L^{2}} \leq \frac{1}{2} {‖ A f_{N} - A f_{M} ‖}_{L^{2}} + {‖ C f_{N} - C f_{M} ‖}_{L^{2}} . \end{array}

Now for M < N we have

\begin{matrix} {‖ A f_{N} - A f_{M} ‖}_{L^{2}}^{2} = {‖ \sum_{n = M + 1}^{N} a_{n} \sqrt{n} ϕ_{n - 1} ‖}_{L^{2}}^{2} \\ = \sum_{n = M + 1}^{N} {| a_{n} |}^{2} n . \end{matrix}

Likewise,

\begin{matrix} {‖ C f_{N} - C f_{M} ‖}_{L^{2}}^{2} = {‖ \sum_{n = M + 1}^{N} a_{n} \sqrt{n + 1} ϕ_{n + 1} ‖}_{L^{2}}^{2} \\ = \sum_{n = M + 1}^{N} {| a_{n} |}^{2} (n + 1) . \end{matrix}

Since

f \in D (C)

, we know

\sum_{n = M + 1}^{N} | a_{n} |^{2} (n + 1)

tends to 0 as M goes to infinity. Thus

{{f^{'}}_{N}}

is Cauchy in L²(R). □

Lemma 9. If

f \in D (C)

then f is, up to equality almost everywhere, bounded, continuous and {f_N} converges uniformly to f, i.e., ║f−f_N║_sup→0 as N→∞.

Proof. It is enough to show that ║f−f_N║_sup→0 as M,N→∞. Note that

║ f_{M} - f_{N} ║_{\sup} \leq 2^{- 1 / 2} (║ f_{N} - f_{M} ║_{L^{2}} + ║ {f^{'}}_{N} - {f^{'}}_{M} ║_{L^{2}})

by Equation (70). Since f ∈ L²(R) we have

║ f_{N} - f_{M} ║_{L^{2}} \to 0

as M, N → ∞ and by Lemma 8 we have that

║ {f^{'}}_{N} - {f^{'}}_{M} ║_{L^{2}} \to 0

as M, N → ∞ Therefore {f_N} converges uniformly to f. □

Next we establish an integration-by-parts formula:

Lemma 10. If f,g ∈ L²(R) are differentiable with derivatives also in L²(R) then

\int_{R} f^{'} (x) g (x) d x = - \int_{R} f (x) g^{'} (x) d x .

(73)

Proof. The derivative of fg, being f′g + fg′, is in L¹. So the fundamental theorem of calculus applies to give:

\int_{a}^{b} f^{'} (x) g (x) d x + \int_{a}^{b} f (x) g^{'} (x) d x = f (b) g (b) - (a) g (a)

(74)

for all real numbers a < b.

Now fg ∈ L¹, and so

\lim_{N \to \infty} \int_{N}^{\infty} f (x) g (x) d x = \lim_{N \to \infty} \int_{- \infty}^{- N} f (x) g (x) d x = 0.

Consequently, there exist a_N < −N < N < b_N with

f (a_{N}) g (a_{N}) \to 0 and f (b_{N}) g (b_{N}) \to 0 as N \to \infty .

Plugging into Equation (74) we obtain the desired result. □

Next we have the first step to showing that C₁ equals C:

Lemma 11. If f is in the domain of C₁ then f is in the domain of C and

C f = C_{1} f a n d A f = A_{1} f .

(75)

Proof. Let f be in the domain of C₁. Then we may assume that f is differentiable and both f and the derivative f′ are in L²(R). We have then

\begin{array}{l} 〈 C_{1} f, ϕ_{n} 〉 = \int_{R} ϕ_{n} (x) [- \frac{d}{d x} + \frac{1}{2} x] f (x) d x \\ = \int_{R} f (x) [\frac{d}{d x} + \frac{1}{2} x] ϕ_{n} (x) d x by Equation (73) \\ = \sqrt{n} 〈 f, ϕ_{n - 1} 〉 by Equation (53) . \end{array}

(76)

Then

| | C_{1} f | |^{2} = \sum_{n \geq 0} {| 〈 C_{1} f, ϕ_{n} 〉 |}^{2} = \sum_{n \geq 1} n {| 〈 f, ϕ_{n - 1} 〉 |}^{2} .

(77)

Because this sum is finite, it follows that f is in the domain

D (C)

of C. Moreover,

\begin{array}{l} C f = \sum_{n \geq 0} \sqrt{n + 1} 〈 f, ϕ_{n + 1} 〉 \\ = \sum_{m \geq 0} 〈 C_{1} f, ϕ_{m} 〉 ϕ_{m} by Equation (76) \\ = C_{1} f . \end{array}

The argument showing Af = A₁f is similar. □

We can now prove:

Theorem 12. The operators C and C₁ are equal, and the operators A and A₁ are equal. Thus, a function f ∈ L²(R) is in the domain of C (which is the same as the domain of A) if and only if f is, up to equality almost everywhere, a differentiable function with derivative f′ also in L²(R) and with

\int_{R} {| x f (x) |}^{2} d x < \infty

.

Proof. In view of Lemma 11, it will suffice to prove that

D (C) \subset D (C_{1})

. Let f ∈

D (C)

. Then

\sum_{n \geq 0} n {| 〈 f, ϕ_{n} 〉 |}^{2} < \infty .

This implies that the sequences {C₁f_N}_N_≥0 and {A₁f_N}_N_≥0 are Cauchy, where f_N is the partial sum

f_{N} = \sum_{n = 0}^{N} 〈 f, ϕ_{n} 〉 ϕ_{n} .

Now

\frac{1}{2} (A_{1} - C_{1}) f_{N} = f_{N}^{'}, and \frac{1}{2} (A_{1} + C_{1}) f_{N} (x) = x f_{N} (x) .

So the sequences of functions

{f_{N}^{'}}_{N \geq 0}

and {h_N}_N_≥0, where

h_{N} (x) = x f_{N} (x),

are also L²–Cauchy. Now, as shown in Lemma 9, we can take f to be the uniformly convergent pointwise limit of the sequence of continuous functions f_N.

By Lemma 8, the sequence of derivatives

f_{N}^{'}

is Cauchy in L²(R). Let

g = \lim_{N \to \infty} f_{N}^{'}

in L²(R). Observe that

f_{N} (y) = f_{N} (x) + \int_{0}^{1} f_{N}^{'} ((x + t (y - x)) (y - x) d t .

(78)

Now

{\int_{0}^{1} | f_{N}^{'} (x + t (y - x)) (y - x) - g (x + t (y - x)) (y - x) | d t \leq \sqrt{| y - x |} ‖ f_{N}^{'} - g ‖}_{L^{2}}

by the Cauchy-Schwartz inequality. Since

{‖ f_{N}^{'} - g ‖}_{L^{2}} \to 0

as N → ∞, we have

\int_{0}^{1} f_{N}^{'} (x + t (y - x)) (y - x) d t \to \int_{0}^{1} g (x + t (y - x)) (y - x) d t .

Because

{f_{N}}_{N \geq 0}

converges to f uniformly by Lemma 9, taking the limit as N → ∞ in Equation (78) we obtain

f (y) = f (x) + \int_{0}^{1} g ((x + t (y - x)) (y - x) d t .

Therefore f′ = g ∈ L²(R). Lastly, we have, by Fatou’s Lemma:

\int_{R} {| x f (x) |}^{2} d x \leq \underset{N \to \infty}{\lim \inf} \int_{R} {| x f_{N} (x) |}^{2} d x < \infty,

because the sequence {g_N}_N_≥0 is convergent. Thus we have established that

f \in D (C_{1})

. □

Finally we can characterize the space S_p(R):

Theorem 13. Suppose f ∈ S_p(R), where p ≥ 1. Then f is (up to equality almost every where) a 2p times differentiable function and

\sup_{x \in R} | x^{a} f^{(b)} (x) | < \infty

for every a, b ∈ {0, 1, 2, …} with a + b < 2p. Moreover, S_p(R) consists of all 2p times differentiable functions for which the functions x ⟼ x^af⁽^b⁾(x) are in L²(R) for every a, b ∈ {0, 1, 2, …} with a + b ≤ 2p.

Proof. Consider f ∈ S₁(R). Then

f = \sum_{n \geq 0} a_{n} ϕ_{n} with \sum_{n \geq 0} n^{2} {| a_{n} |}^{2} < \infty .

(79)

In particular,

f \in D (C)

. Moreover,

C f = \sum_{n \geq 0} a_{n} \sqrt{n + 1} ϕ_{n + 1} A f = \sum_{n \geq 0} a_{n} \sqrt{n ϕ_{n - 1}} .

From these expressions and Equation (79) it is clear that Cf and Af both belong to

D (C)

. Thus,

B_{1} B_{2} f \in L^{2} (R) for all B_{2} \in {C, A, I} .

Similarly, we can check that if f ∈ S_p(R), where p ≥ 2, then

B_{1} B_{2} f \in S_{p - 1} (R) for all B_{1}, B_{2} \in {C, A, I} .

Thus, inductively, we see that

B_{1} \dots B_{2 p} f \in L^{2} (R) for all B_{1}, \dots, B_{2 p} \in {C, A, I} .

(This really means that f is in the domain of each product operator B₁ ···B₂_p.) Now the operators

\frac{d}{d x}

and multiplication by x are simple linear combinations of A and C. So for any a, b ∈ {0, 1, 2, …} with a + b ≤ 2p we can write the operator

x^{a} {(\frac{d}{d x})}^{b}

as a linear combination of operators B₁…B₂_p with B₁, …, B₂_p ∈ {C, A, I}.

Conversely, suppose f is 2p times differentiable and the functions x ⟼ x^af⁽^b⁾(x) are in L²(R) forevery a, b ∈ {0, 1, 2, … } with a + b ≤ 2p. Then f is in the domain of C²^p and so

\sum_{n \geq 0} {| 〈 f, ϕ_{n} 〉 |}^{2} n^{2 p} < \infty .

Thus f ∈ S_p(R).

The preceding facts show that if f ∈ S_p(R) then for every B₁, …, B₂_p ∈ {C, A, I}, the element B₁ ····B₂_p₋₁f is in the domain of C, and so, in particular, is bounded. Thus,

\sup_{x \in R} | x^{a} f^{(b)} (x) | < \infty

for all a, b ∈ {0, 1, 2, …} with a + b ≤ 2p − 1. □

We do not carry out a similar study for S_p(R^d), but from the discussions in the following sections, it will be clear that:

S_p(R^d) is a Hilbert space with inner-product given by

${〈 f, g 〉}_{p} = 〈 f, T_{d}^{2 p} g 〉$
as a Hilbert space, S_p (R^d) is the d–fold tensor product of S_p(R) with itself.

8. Inner-Products on S(R) from N

For f ∈ L²(R), define

{‖ f ‖}_{t} = {\sum_{n \geq 0} {(n + 1)}^{2 t} {| 〈 f, ϕ_{n} 〉 |}^{2}}^{1 / 2}

(80)

for every t > 0. More generally, define

{〈 f, g 〉}_{t} = \sum_{n \geq 0} {(n + 1)}^{2 t} {〈 f, ϕ_{n} 〉}_{L^{2}} {〈 ϕ_{n}, g 〉}_{L^{2}},

(81)

for all f, g in the subspace of L²(R) consisting of functions F for which ||F ||_t < ∞.

Theorem 14. Let f ∈ S(R). Then for every t > 0 we have ||f||_t < ∞. Moreover, for every integer m ≥ 0, we also have

N^{m} f = \sum_{n \geq 0} n^{m} 〈 f, ϕ_{n} 〉 ϕ_{n},

(82)

where on the left N^m is the differential operator

- \frac{d^{2}}{d x^{2}} + \frac{x^{2}}{4} - \frac{1}{2}

applied n times, and on the right the series is taken in the sense of L²(R, dx). Furthermore,

{‖ f ‖}_{m / 2}^{2} = 〈 f, {(N + 1)}^{m} f 〉 .

(83)

This result will be strengthened and a converse proved later.

Proof. Let m ≥ 0 be an integer. Since f ∈ S(R), it is readily seen that N f is also in S(R), and thus, inductively, so is N^mf. Then we have

\begin{array}{l} 〈 f, N^{m} f 〉 = \sum_{n \geq 0} 〈 f, ϕ_{n} 〉 〈 ϕ_{n}, N^{m} f 〉 \\ = \sum_{n \geq 0} 〈 f, ϕ_{n} 〉 〈 N^{m} ϕ_{n}, f 〉 by Equation (62) \\ = \sum_{n \geq 0} 〈 f, ϕ_{n} 〉 〈 n^{m} ϕ_{n}, f 〉 \\ = \sum_{n \geq 0} n^{m} {| 〈 f, ϕ_{n} 〉 |}^{2} . \end{array}

Thus we have proven the relation

〈 f, N^{m} f 〉 = \sum_{n \geq 0} n^{m} {| 〈 f, ϕ_{n} 〉 |}^{2} .

(84)

An exactly similar argument shows

〈 f, {(N + 1)}^{m} f 〉 = \sum_{n \geq 0} {(n + 1)}^{m} {| 〈 f, ϕ_{n} 〉 |}^{2} = {‖ f ‖}_{m / 2}^{2} .

(85)

So if t > 0, choosing any integer m ≥ t we have

{‖ f ‖}_{t / 2}^{2} \leq {‖ f ‖}_{m / 2}^{2} = 〈 f, {(N + 1)}^{m} f 〉 < \infty .

Observe that the series

\sum_{n \geq 0} n^{m} 〈 f, ϕ_{n} 〉 ϕ_{n}

(86)

is convergent in L²(R, dx) since

\sum_{n \geq 0} n^{2 m} {| 〈 f, ϕ_{n} 〉 |}^{2} = 〈 N^{2 m} f, f 〉 < \infty .

So for any g ∈ L²(R, dx) we have, by an argument similar to the calculations done above:

\begin{array}{l} 〈 N^{m} f, g 〉 = \sum_{n \geq 0} n^{m} 〈 f, ϕ_{n} 〉 〈 ϕ_{n}, g 〉 \\ = \sum_{n \geq 0} 〈 n^{m} 〈 f, ϕ_{n} 〉 ϕ_{n}, g 〉 \\ = 〈 \sum_{n \geq 0} n^{m} 〈 f, ϕ_{n} 〉 ϕ_{n}, g 〉 . \end{array}

This proves the statement about N^mf. □

We have similar observations concerning C^mf and A^mf. First observe that since C and A are operators involving

\frac{d}{d x}

and x, they map S(R) into itself. Also,

〈 A f, g 〉 = 〈 f, C g 〉,

for all f, g ∈ S(R), as already noted. Using this, for f ∈ S(R), we have

\begin{array}{l} 〈 ϕ_{n + m}, C^{m} f 〉 = 〈 A^{m} ϕ_{n + m}, f 〉 \\ = \sqrt{(n + m) (n + m - 1) \dots (n + 1)} 〈 ϕ_{n}, f 〉 . \end{array}

Therefore,

C^{m} f = \sum_{n \geq 0} {[\frac{(n + m)!}{n!}]}^{1 / 2} 〈 f, ϕ_{n} 〉 ϕ_{n + m} .

(87)

Similarly,

A^{m} f = \sum_{n \geq 0} {[\frac{n!}{(n - m)!}]}^{1 / 2} 〈 f, ϕ_{n} 〉 ϕ_{n - m} .

(88)

More generally, if B₁, …, B_k are such that each B_i is either A or C then

B_{1 \dots} B_{k} f = \sum_{n \geq 0} θ_{n, k} 〈 f, ϕ_{n} 〉 ϕ_{n + r},

(89)

where the integer r is the excess number of C’s over the A’s in the sequence B₁, …, B_k, and θ_n,k is a real number determined by n and k. We do have the upper bound

θ_{n, k}^{2} \leq {(n + k)}^{k} \leq {[(n + 1) k]}^{k} = {(n + 1)}^{k} k^{k} .

(90)

Note also that

{‖ B_{1 \dots} B_{k} f ‖}^{2} = 〈 {(B_{1 \dots} B_{k})}^{*} B_{1 \dots} B_{k} f, f 〉 = \sum_{n \geq 0} θ_{n, k}^{2} {| 〈 f, ϕ_{n} 〉 |}^{2} .

(91)

Let’s look at the case of R^d. The functions ϕ_n generate an orthonormal basis by tensor products. In more detail, if a ∈ W^d is a multi-index, define ϕ_a ∈ L²(R^d) by

ϕ_{a} (x) = ϕ_{a_{1}} (x_{1}) \dots ϕ_{a_{d}} (x_{d}) .

Now, for each t > 0, and f ∈ L²(R^d), define

{‖ f ‖}_{t} \overset{def}{=} {{\sum_{a \in W^{d}} [(a_{1} + 1) \dots (a_{d} + 1)]}^{2 t} {| 〈 f, ϕ_{a} 〉 |}^{2}}^{1 / 2},

(92)

and then define

{〈 f, g 〉}_{t} = \sum_{a \in W^{d}} {[(a_{1} + 1) \dots (a_{d} + 1)]}^{2 t} 〈 f, ϕ_{a} 〉 〈 ϕ_{a}, g 〉,

(93)

for all f, g in the subspace of L²(R^d) consisting of functions F for which ║F║_t < ∞.

Let T_d be the operator on

S (R^{d})

given by

T_{d} = (N_{d} + 1) \dots (N_{1} + 1) .

Then, for every non-negative integer m, we have

{‖ f ‖}_{m / 2}^{2} = 〈 f, T_{d}^{m} f 〉 .

The other results of this section also extend in a natural way to R^d.

9. L²–Type Norms on $S (R)$

For integers a, b ≥ 0, and f ∈

S (R)

, define

{‖ f ‖}_{a, b, 2} = {‖ x^{a} D^{b} f (x) ‖}_{L^{2} (R, d x)} .

(94)

Recall the operators

A = \frac{d}{d x} + \frac{1}{2} x, C = - \frac{d}{d x} + \frac{1}{2} x, N = C A

and the norms

{‖ f ‖}_{m} = 〈 f, {(N + 1)}^{m} f 〉 .

The purpose of this section is to prove the following:

Theorem 15. The system of semi-norms given by ║f║_a,b,₂ and the system given by the norms ║f║_m generate the same topology on

S (R)

.

Proof. Let a, b be non-negative integers. Then

\begin{array}{l} {‖ f ‖}_{a, b, 2} = {‖ {(A + C)}^{a} 2^{- b} {(A - C)}^{b} f ‖}_{L^{2}} \\ \leq a linear combination of terms \\ of the form {‖ B_{1} \dots B_{k} f ‖}_{L^{2}}, \end{array}

where each B_i is either A or C, and k = a + b. Writing c_n = ⟨f, ϕ_n⟩, we have where

\begin{array}{l} {‖ B_{1} \dots B_{k} f ‖}_{L^{2}}^{2} = ‖ \sum_{n \geq 0} c_{n} θ_{n, k} ϕ_{n + r} ‖ \\ = \sum_{n \geq 0} {| c_{n} |}^{2} θ_{n, k}^{2}, \end{array}

where

r = # {j : B_{j} = C} - # {j : B_{j} = A},

and, as noted earlier in Equation (90),

θ_{n, k}^{2} \leq (n + 1) {}^{k}k^{k} .

So

{‖ B_{1} \dots B_{k} f ‖}_{L^{2}}^{2} \leq {\sum_{n \geq 0} | c_{n} |}^{2} (n + 1) {}^{k}k^{k} = k^{k} {‖ f ‖}_{k / 2}^{2} \leq k^{k} {‖ f ‖}_{k}^{2} .

(95)

Thus ║f║_a,b,₂ is bounded above by a multiple of the norm ║f║_a₊_b.

It follows, that the topology generated by the semi-norms || · ||_a,b,₂ is contained in the topology generated by the norms || · ||_k.

Now we show the converse inclusion. From

{‖ f ‖}_{k}^{2} = {〈 f, {(N + 1)}^{2 k} f 〉}_{L^{2}} \leq {‖ f ‖}_{L^{2}}^{2} + {‖ {(N + 1)}^{2 k} f ‖}_{L^{2}}^{2}

and the expression of N as a differential operator we see that

{‖ f ‖}_{k}^{2}

is bounded above by a linear combination of

{‖ f ‖}_{a, b, 2}^{2}

for appropriate a and b. It follows then that the topology generated by the norms ║·║_k is contained in the topology generated by the semi-norms ║·║_a,b,₂. □

Now consider R^d. Let a, b ∈ W^d be multi-indices, where W = {0, 1, 2, …}. Then for f ∈ S(R^d) define

{‖ f ‖}_{a, b, 2} = {\int_{R^{d}} {| x^{a} D^{b} f (x) |}^{2} d x} \frac{1}{2}

These specify semi-norms and they generate the same topology as the one generated by the norms || · ||_m, with m ∈ W. The argument is a straightforward modification of the one used above.

10. Equivalence of the Three Topologies

We will demonstrate that the topology generated by the family of norms || · ||_k, or, equivalently, by the semi-norms ║ · ║_a,b,₂, is the same as the Schwartz topology on.

S (R)

.

Recall from Equation (70) that we have, for f ∈

S (R)

{‖ f ‖}_{\sup} \leq 2^{- 1 / 2} ({‖ f ‖}_{L^{2}} + {‖ f^{'} ‖}_{L^{2}}) .

Putting in x^aD^bf(x) in place of f(x) we then have

{‖ f ‖}_{a, b} \leq a linear combination of {‖ f ‖}_{a, b, 2}, and {‖ f ‖}_{a, b + 1, 2} .

(96)

Next we bound the semi-norms ║f║_a,b,₂ by the semi-norms ║f║_a,b. To this end, observe first

\begin{array}{l} {‖ f ‖}_{L^{2}}^{2} = {\int_{R} (1 + x^{2})}^{- 1} (1 + x^{2}) {| f (x) |}^{2} d x \\ \leq π {‖ (1 + x^{2}) {| f (x) |}^{2} ‖}_{\sup} \\ \leq π ({‖ f ‖}_{\sup}^{2} + {‖ x f (x) ‖}_{\sup}^{2}) \\ \leq π {({‖ f ‖}_{\sup} + {‖ x f (x) ‖}_{\sup})}^{2} . \end{array}

So for any integers a, b ≥ 0, we have

{‖ f ‖}_{a, b, 2} = {‖ x^{a} D^{b} f ‖}_{L^{2}} \leq π^{1 / 2} ({‖ f ‖}_{a, b} + {‖ f ‖}_{a + 1, b}) .

(97)

Thus, the topology generated by the semi-norms ║ · ║_a,b,₂ coincides with the Schwartz topology.

Now lets look at the situation for R^d. The same result holds in this case and the arguments are similar. The appropriate Sobolev inequalities require using (1 + |p|²)^d instead of 1 + p². For

f \in S (R^{d})

, we have the Fourier transform given by

ℱ (f) (p) = \hat{f} (p) = {(2 π)}^{- d / 2} \int_{R^{d}} e^{- i 〈 p, x 〉} f (x) d x .

(98)

Again, this preserves the L² norm, and transforms derivatives into multiplications:

p_{i} \hat{f} (p) = - i ℱ (\frac{\partial f}{\partial x_{i}}) (p) .

Repeated application of this shows that

{| p |}^{2} \hat{f} (p) = - ℱ (Δ f) (p),

(99)

where

Δ = \sum_{j = 1}^{n} \frac{\partial^{2}}{\partial x_{j}^{2}}

is the Laplacian. Iterating this gives, for each r ∈ {0, 1, 2, …} and f ∈

S (R^{d})

,

{| p |}^{2 r} \hat{f} (p) = {(- 1)}^{r} ℱ (Δ^{r} f) (p),

(100)

which in turn implies, by the Plancherel formula Equation (67), the identity:

{\int_{R^{d}} | {| p |}^{2 r} | \hat{f} (p) | |}^{2} d p = {\int_{R^{d}} | Δ^{r} f (x) |}^{2} d x .

(101)

Then we have, for any m > d/4,

\begin{array}{l} {‖ f ‖}_{\sup} \leq {(2 π)}^{- d / 2} \int_{R^{d}} | \hat{f} (p) | d p \\ = {(2 π)}^{- d / 2} \int_{R^{d}} {(1 + {| p |}^{2})}^{m} | \hat{f} (p) | {(1 + {| p |}^{2})}^{- m} d p \\ = K {[\int_{R^{d}} {(1 + {| p |}^{2})}^{2 m} {| \hat{f} (p) |}^{2} d p]}^{1 / 2} by Cauchy-Schwartz, \end{array}

where

K = {(2 π)}^{- d / 2} {[\int_{R^{d}} \frac{d p}{{(1 + | p |^{2})}^{2 m}}]}^{1 / 2} < \infty

The function (1 + s)ⁿ/(1 + sⁿ), for s ≥ 0, attains a maximum value of 2ⁿ⁻¹, and so we have the inequality (1 + s)²^m ≤ 2²^m−¹(1 + s²^m), which leads to

{(1 + {| p |}^{2})}^{2 m} \leq 2^{2 m - 1} (1 + {| p |}^{4 m}) .

Then, from Equation (102), we have

{‖ f ‖}_{\sup}^{2} \leq K^{2} 2^{2 m - 1} ({‖ f ‖}_{L^{2}}^{2} + {‖ Δ^{m} f ‖}_{L^{2}}^{2}) .

(102)

This last quantity is clearly bounded above by a linear combination of ║f║₀_,b,₂ for certain multi-indices b. Thus ║f║_sup is bounded above by a linear combination of ║f║₀_,b,₂ for certain multi-indices b. It follows that ║x^a D^bf║_sup is bounded above by a linear combination of ║f║_a_′,_b_′,₂ for certain multi-indices a′, b′.

For the inequality going the other way, the reasoning used above for Equation (97) generalizes readily, again with (1 + x²) replaced by (1 + |x|²)^d. Thus, on

S (R^{d})

the topology generated by the family of semi-norms ║ · ║_a,b,₂ coincides with the Schwartz topology.

Now we return to Equation (102) for some further observations. First note that

Δ = \frac{1}{4} \sum_{j = 1}^{d} {(C_{j} - A_{j})}^{2}

and so Δ^m consists of a sum of multiples of (3d)^m terms each a product of 2m elements drawn from the set {A₁, C₁,…, A_d, C_d}. Consequently, by Equation (95)

{‖ Δ^{m} f ‖}_{L^{2}}^{2} \leq c_{d, m}^{2} {‖ f ‖}_{m}^{2},

(103)

for some positive constant c_d,m. Combining this with Equation (102), we see that for m > d/4, there is a constant k_d,m such that

{‖ f ‖}_{\sup} \leq k_{d, m} {‖ f ‖}_{m}

(104)

holds for all

f \in S (R^{d})

.

Now consider

f \in S_{p} (R^{d})

, with p > d/4. Let

f_{N} = \sum_{j \in W^{d}, | j | \leq N} 〈 f, ϕ_{j} 〉 ϕ_{j} .

Then f_N → f in L² and so a subsequence

{f_{N_{k}}}_{k \geq 1}

converges pointwise almost everywhere to f. It follows then that the essential supremum ║f║_∞ is bounded above as follows:

{‖ f ‖}_{\infty} \leq \underset{N \to \infty}{\lim \sup} {| f_{N} |}_{\sup} .

Note that f_N → f also in the ║ · ║_p–norm. It follows then from Equation (104) that

{‖ f ‖}_{\infty} \leq k_{d, p} {‖ f ‖}_{p}

(105)

holds for all

f \in S_{p} (R^{d})

with p > d/4. Replacing f by the difference f − f_N in Equation (105), we see that f is the L^∞–limit of a sequence of continuous functions which, being Cauchy in the sup-norm, has a continuous limit; thus f is a.e. equal to a continuous function, and may thus be redefined to be continuous.

11. Identification of $S (R)$ with a Sequence Space

Suppose a₀, a₁, … form a sequence of complex numbers such that

{\sum_{n \geq 0} {(n + 1)}^{m} | a_{n} |}^{2} < \infty, for every integer m \geq 0.

(106)

We will show that the sequence of functions given by

s_{n} = \sum_{j = 0}^{n} a_{j} ϕ_{j}

converges in the topology of

S (R)

to a function

f \in S (R)

for which a_n = ⟨f, ϕ_n⟩ for every n ≥ 0.

All the hard work has already been done. From Equation (106) we see that (s_n)_n≥₀ is Cauchy in each norm ║·║_m. So it is Cauchy in the Schwartz topology of

S (R)

, and hence convergent to some

f \in S (R)

. In particular, s_n → f in L². Taking inner-products with ϕ_j we see that a_j = ⟨f, ϕ_j⟩.

Thus we have

Theorem 16. Let W = {0, 1, 2, …}, and define

F : L^{2} (R) \to C^{W}

by requiring that

F {(f)}_{n} = {〈 f, ϕ_{n} 〉}_{L^{2}}

for all n ∈ W. Then the image of

S (R)

under F is the set of all a ∈ C^W for which

{‖ a ‖}_{m}^{2} \overset{def}{=} {\sum_{n \geq 0} {(n + 1)}^{2 m} | a_{n} |}^{2} < \infty

for every integer m ≥ 0. Moreover, if

F (S (R))

is equipped with the topology generated by the norms ║ · ║_m then F is a homeomorphism.

A. Spectral Theory in Brief

In this section we present a self contained summary of the concepts and results of spectral theory that are relevant for the purposes of this article.

Let H be a complex Hilbert space. A linear operator on H is a linear map

A : D_{A} \to H,

where D_A is a subspace of H. Usually, we work with densely defined operators, i.e., operators A for which D_A is dense.

A.1. Graph and Closed Operators

The graph of the operator A is the set of all ordered pairs (x, Ax) with x running over the domain of A:

G r (A) = {(x, A x) : x \in D_{A}} .

(107)

Thus Gr(A) is A viewed as a set of ordered pairs, and is thus A itself taken as a mapping in the set-theoretic sense. The operator A is said to be closed if its graph is a closed subset of H⊕H; put another way, this means that if (x_n)_n≥₁ is any sequence in H which converges to a limit x and if lim_n→∞ Ax_n = y also exists then x is in the domain of A and y = Ax.

A.2. The Adjoint A^∗

If A is a densely defined operator on H then there is an adjoint operator A* defined as follows. Let D_A_∗ be the set of all y ∈ H for which the map

f_{y} : D_{A} \to C : x \mapsto 〈 A x, y 〉

is bounded linear. Clearly, D_A_∗ is a subspace of H. The bounded linear functional f_y extends to a bounded linear functional f_y on H. So there exists a vector z ∈ H such that f_y(x) = ⟨z, x⟩ for all x ∈ H. Since D_A is dense in H, the element z is uniquely determined by x and A. Denote z by A*y. Thus, A^*y is the unique vector in H for which

〈 x, A * y 〉 = 〈 A x, y 〉

(108)

holds for all x ∈ D_A. Using the definition of A* for a densely-defined operator A it is readily seen that A* is a closed operator.

A.3. Self-Adjoint Operators

The operator A is self-adjoint if it is densely defined and A = A*. Thus, if A is self-adjoint then D_A = D_A* and

〈 x, A y 〉 = 〈 A x, y 〉

(109)

for all x, y ∈ D_A. Note that a self-adjoint operator A, being equal to its adjoint A*, is automatically a closed operator.

A.4. Closure, and Essentially Self-Adjoint Operators

Consider a densely-defined linear operator S on H. Assume that the closure of the graph of S is the graph of some operator

\bar{S}

. Then

\bar{S}

is called the closure of S. We say that S is essentially self-adjoint if its closure is a self-adjoint operator. In particular, S must then be a symmetric operator, i.e., it satisfies

〈 S x, y 〉 = 〈 x, S y 〉

(110)

for all x, y ∈ H. A symmetric operator may not, in general, be essentially self-adjoint.

A.5. The Multiplication Operator

Let us turn to a canonical example. Let

(X, ℱ, µ)

be a sigma-finite measure space. Consider the Hilbert space L²(µ). Let f : X → C be a measurable function. Define the operator M_f on L²(µ) by setting

M_{f} g = f g,

(111)

with the domain of M_f given by

D (M_{f}) = {g \in L^{2} (μ) : f g \in L^{2} (μ)} .

(112)

Let us check that D(M_f) is dense in L²(µ). By sigma-finiteness of µ, there is an increasing sequence of measurable sets X_n such that ∪_n_≥1X_n=X and µ (X_n) < ∞. For any h ∈ L²(µ) let

h_{n} = 1_{X_{n} \cap {| f | \leq n}} h

. Than

\int | f h_{n} | d μ \leq n \int | h | 1_{X_{n}} d μ \leq n μ {(X_{n})}^{1 / 2} {‖ h ‖}_{L^{2}} < \infty

and so hn ∈ D(M_f). On the other hand,

{‖ h_{n} - h ‖}_{L^{2}}^{2} \to 0

by dominated convergence. So D(M_f) is dense in H.

It may be shown that

M_{f}^{*} = M_{\bar{f}} .

(113)

Thus M_f is self-adjoint if f is real-valued.

A very special case of the preceding example is obtained by taking X to be a finite set, say X = {1, 2, …, d}, and µ as counting measure on the set of all subsets of X. In this case, L²(µ) = C^d, and the operator M_f, viewed as a linear map

M_{f} : C^{d} \to C^{d}

is given by the diagonal matrix

(\begin{matrix} f_{1} & 0 & 0 & \dots & 0 \\ 0 & f_{2} & 0 & \dots & 0 \\ 0 & 0 & f_{3} & \dots & 0 \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ \\ 0 & 0 & 0 & \dots & f_{d} \end{matrix}) .

(114)

Now take the case where µ is counting measure on the sigma-algebra of all subsets of a countable set X. Let f be any real-valued function on X. Let

D_{f}^{0}

be the subspace of L²(µ) consisting of all functions g for which {g ≠ 0} is a finite set, and let

D_{f}^{0}

be the restriction of M_f to

D_{f}^{0}

. Then it is readily checked that

D_{f}^{0}

is essentially self-adjoint. Consequently, the restriction of M_f to any subspace of D_f larger than

D_{f}^{0}

is also essentially self-adjoint.

A.6. The Spectral Theorem

The spectral theorem for a self-adjoint operator A on a separable complex Hilbert space H says that there is a sigma-finite measure space

(X, ℱ, µ)

, a unitary isomorphism

U : H \to L^{2} (μ)

and a measurable real-valued function f on X such that

A = U^{- 1} M_{f} U .

(115)

Expressing A in this way is called a diagonalization of A (the terminology being motivated by Equation (114)).

A.7. The Functional Calculus

If g is any measurable function on R we can then form the operator

g (A) \overset{def}{=} U^{- 1} M_{g ° f} U .

(116)

If g is a polynomial then g(A) works out to be what it should be, a polynomial in A. Another example, is the function g(x) = e^ikx, where k is any constant; this gives the operator e^ikA.

A.8. The Spectrum

The essential range of f is the smallest closed subset of R whose complement U satisfies µ (f⁻¹(U))=0. It consists of all λ ∈ R for which the operator M_f− λI=M_f_−λ has a bounded inverse (wich is

M_{{(f - λ)}^{- 1}}

). This essential range forms the spectrum σ(A) of the operator A. Thus σ(A) is the set of all real numbers λ for which the operator A− λI has a bounded linear operator as inverse.

A.9. The Spectral Measure

Associate to each Borel set E ⊂ R the operator

{P^{'}}_{E} = M_{1_{f^{- 1} (E)}}

on L²(X, µ). This is readily checked to be an orthogonal projection operator. Hence, so is the operator

P^{A} (E) = U^{- 1} {P^{'}}_{E} U .

Moreover, it can be checked that the association E ↦ P^A(E) is a projection-valued measure, i.e., P^A(∅) = 0, P^A(R) = I, P^A(E ∩ F ) = P^A(E)P^A(F), and for any disjoint Borel sets E₁, E₂, … and any vector x ∈ H we have

P^{A} (\cup_{n \geq 1} E_{n}) x = \sum_{n \geq 1} P^{A} (E_{n}) x .

(117)

This is called the spectral measure for the operator A, and is uniquely determined by the operator A.

A.10. The Number Operator

Let us examine an example. Let W = {0, 1, 2, …}, and let µ be counting measure on W. On W we have the function

N^{'} : W \to R : n \mapsto n .

Correspondingly we have the multiplication operator M_N_′ on the Hilbert space L²(W, µ). Now consider the Hilbert space L²(R). We have the unitary isomorphism

U : L^{2} (R) \to L^{2} (W, μ) : f \mapsto {(〈 f, ϕ_{n} 〉)}_{n \geq 0} .

Consider the operator N on L²(R) given by

N = U^{- 1} M_{N^{'}} U .

Then

N f = \sum_{n \in W} n 〈 f, ϕ_{n} 〉 ϕ_{n}

and the domain of N is

D_{N} = {f \in L^{2} (R) : {\sum_{n \in W} n^{2} | 〈 f, ϕ_{n} 〉 |}^{2} < \infty} .

Comparing with Equation (82) we see that

(N f) (x) = (- \frac{d^{2}}{d x^{2}} + \frac{x^{2}}{4} - \frac{1}{2}) f (x)

for every

f \in S (R)

.

Thus the self-adjoint operator N extends the differential operator

- \frac{d^{2}}{d x^{2}} + \frac{x^{2}}{4} - \frac{1}{2}

, and, notationally, we will often not make a distinction. In view of the observation made at the end of subsection A.5, the differential operator

- \frac{d^{2}}{d x^{2}} + \frac{x^{2}}{4} - \frac{1}{2}

on the domain

S (R)

is essentially self-adjoint, with closure equal to the operator N.

The operator U above helps realize the operator N as the multiplication operator M_N′, and is thus an explicit realization of the fact guaranteed by the spectral theorem.

B. Explanation of Physics Terminology

In quantum theory, one associates to each physical system a complex Hilbert space.

ℋ [4]

Each state of the system is represented by a bounded self-adjoint operator ρ ≥ 0 for which tr(ρ) = 1. An observable is represented by a self-adjoint operator A on

ℋ

. The relationship of the mathematical formalism with physics is obtained by declaring that

tr (P^{A} (E) ρ)

is the probability that in state ρ the observable A has value in the Borel set E ⊂ R. Here, P^A is the spectral measure for the self-adjoint operator A.

The states form a convex set, any convex linear combination of any two states being also clearly a state. There are certain states which cannot be expressed as a convex linear combination of distinct states. These are called pure states. A pure state is always given by the orthogonal projection onto a ray (1–dimensional subspace of

ℋ

). If ϕ is any unit vector on such a ray then the orthogonal projection onto the ray is given by: P_ϕ_:ψ ↦ ⟨ψ, ϕ⟩ϕ and then the probability of the observable A having value in a Borel set E in the state P_ϕ then works out to be

〈 P^{A} (E) ϕ, ϕ 〉 .

Suppose, for instance, the spectrum of A consists of eigenvalues λ₁, λ₂, …, with Au_n = λ_nu_n for an orthonormal basis {u_n}_n≥₁ of

ℋ

. Then the probability that the observable represented by A has value in E in state P_ϕ is

{\sum_{{n : λ_{n} \in E}} | 〈 u_{n}, ϕ 〉 |}^{2} .

Thus the spectrum σ(A) here consists of all the possible values of A that could be realized.

To every system there is a special observable H called the Hamiltonian. The physical significance of this observable is that it describes the energy of the system. There is a second significance to this observable: if ρ is the state of the system at a given time then time t later the system evolves to the state

ρ_{t} = e^{- i \frac{t}{ℏ} H} ρ e^{i \frac{t}{ℏ} H},

where ℏ is Planck’s constant.

A basic system considered in quantum mechanics is the harmonic oscillator. One may think of this crudely as a ball attached to a spring, but the model is used widely, for instance also for the quantum theory of fields. The Hilbert space for the harmonic oscillator is L²(R). The Hamiltonian operator, up to scaling and addition of the constant

- \frac{1}{2}

, is

H = - \frac{d^{2}}{d x^{2}} + \frac{x^{2}}{4} - \frac{1}{2} .

The energy levels are then the spectrum of this operator. In this case the spectrum consists of all the eigenvalues 0, 1, 2, … The creation operator bumps an eigenstate of energy n up to a state of energy n + 1; an annihilation operator lowers the energy by 1 unit.

In many applications, the eigenstates represent quanta, i.e., excitations of the system. Thus raising the energy by one unit corresponds to the creation of an excited state, while lowering the energy by one unit corresponds to annihilating an excited state.

Acknowledgments

A first version of this paper was written when Becnel was supported by National Security Agency Young Investigators Grant (H98230-10-1-0182) and a Stephen F. Austin State University Faculty Research Grant; Sengupta was supported by National Science Foundation Grant (DMS-0201683) and is currently supported by National Security Agency Grant (H98230-13-1-0210).

The authors are also very grateful to the three referees for their remarks and comments.

Author Contributions

This work is a collaboration between the authors Becnel and Sengupta.

Conflicts of Interest

The authors declare no conflict of interest.

References

Schwartz, L. Théorie des Distributions; Herman: Paris, France, 1950; Volume 1. [Google Scholar]
Schwartz, L. Théorie des Distributions; Herman: Paris, France, 1951; Volume 2. [Google Scholar]
Simon, B. Distributions and Their Hermite Expansions. J. Math. Phys. 1971, 12, 140. [Google Scholar] [CrossRef]
Von Neumann, J. Mathematical Foundations of Quantum Mechanics; Princeton University Press: Princeton, New Jersey, USA, 1957. [Google Scholar]
Glimm, J.; Jaffe, A. Quantum Physics; Springer Verlag: New York, NY, USA, 1987. [Google Scholar]
Gelfand, I.M.; Vilenkin, N. Generalized Functions; Academic Press: New York, NY, USA, 1964; Volume IV. [Google Scholar]
Becnel, J.; Sengupta, A. White Noise Analysis: Background and a Recent Application. In Infinite Dimensional Stochastic Analysis: In Honor of Hui-Hsiung Kuo; World Scientific Publishing Company: Singapore, 2008. [Google Scholar]
Hida, T.; Kuo, H.-H.; Potthoff, J.; Streit, L. White Noise: An Infinite Dimensional Calculus; Kluwer Academic Publishers: Norwell, MA, USA, 1993. [Google Scholar]
Kuo, H.-H. White Noise Distribution Theory; CRC Press: Boca Raton, Florida, USA, 1996. [Google Scholar]
Becnel, J. Equality of Topologies and Borel Fields for Countably-Hilbert Spaces. Proc. Am. Math. Soc. 2006, 134, 313–321. [Google Scholar]
Hermite, C. Sur un Nouveau Développement en Série des Fonctions. Comptes rendus de l’Academie des Sciences 1864, 14, 93–266. [Google Scholar]

© 2015 by the authors; licensee MDPI, Basel, Switzerland This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Becnel, J.; Sengupta, A. The Schwartz Space: Tools for Quantum Mechanics and Infinite Dimensional Analysis. Mathematics 2015, 3, 527-562. https://doi.org/10.3390/math3020527

AMA Style

Becnel J, Sengupta A. The Schwartz Space: Tools for Quantum Mechanics and Infinite Dimensional Analysis. Mathematics. 2015; 3(2):527-562. https://doi.org/10.3390/math3020527

Chicago/Turabian Style

Becnel, Jeremy, and Ambar Sengupta. 2015. "The Schwartz Space: Tools for Quantum Mechanics and Infinite Dimensional Analysis" Mathematics 3, no. 2: 527-562. https://doi.org/10.3390/math3020527

APA Style

Becnel, J., & Sengupta, A. (2015). The Schwartz Space: Tools for Quantum Mechanics and Infinite Dimensional Analysis. Mathematics, 3(2), 527-562. https://doi.org/10.3390/math3020527

Article Menu

The Schwartz Space: Tools for Quantum Mechanics and Infinite Dimensional Analysis

Abstract

1. Introduction

2. Basic Notions and Framework

2.1. The Schwartz Space

2.2. The Schwartz Topology

2.3. The Operator T

2.4. The L2 Approach

2.5. Coordinatization as a Sequence Space

2.6. The Multi-Dimensional Setting

3. Topological Vector Spaces

3.1. Local Convexity and the Minkowski Functional

3.2. Semi-Norms

3.3. Topologies Generated by Families of Topologies

3.4. Topologies Generated by Families of Semi-Norms

3.5. Completeness

3.6. Metrizability

4. The Schwartz Space S ( R )

5. Hermite Polynomials, Creation and Annihilation Operators

6. Hermite Functions, Creation and Annihilation Operators

7. Properties of the Functions in S p ( R )

8. Inner-Products on S(R) from N

9. L2–Type Norms on S ( R )

10. Equivalence of the Three Topologies

11. Identification of S ( R ) with a Sequence Space

A. Spectral Theory in Brief

A.1. Graph and Closed Operators

A.2. The Adjoint A∗

A.3. Self-Adjoint Operators

A.4. Closure, and Essentially Self-Adjoint Operators

A.5. The Multiplication Operator

A.6. The Spectral Theorem

A.7. The Functional Calculus

A.8. The Spectrum

A.9. The Spectral Measure

A.10. The Number Operator

B. Explanation of Physics Terminology

Acknowledgments

Author Contributions

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

2.4. The L² Approach

4. The Schwartz Space $S (R)$

7. Properties of the Functions in $S_{p} (R)$

9. L²–Type Norms on $S (R)$

11. Identification of $S (R)$ with a Sequence Space

A.2. The Adjoint A^∗