Non-Relativistic Quantum Electrodynamics and the Coulomb Interaction

Woolley, R. Guy

doi:10.3390/physics8010020

Open AccessReview

Non-Relativistic Quantum Electrodynamics and the Coulomb Interaction

by

R. Guy Woolley

School of Science and Technology, Nottingham Trent University, Nottingham NG11 8NS, UK

Physics 2026, 8(1), 20; https://doi.org/10.3390/physics8010020

Submission received: 16 September 2025 / Revised: 3 November 2025 / Accepted: 6 November 2025 / Published: 12 February 2026

(This article belongs to the Special Issue Quantum Theory 100 Years Later: Advances on Foundations and Applications)

Download

Browse Figures

Versions Notes

Abstract

This review explores the foundations of non-relativistic quantum electrodynamics (QED) and its application to atoms and molecules. It follows the traditional route of placing classical electrodynamics in an Hamiltonian framework, followed by Dirac’s canonical quantisation algorithm. The properties of the resulting quantum Hamiltonian are reviewed from a non-perturbative perspective. It discusses the gauge invariance of the S-matrix, the Coulomb interaction, and the challenges posed by infinities in classical and quantum electrodynamics. The paper examines the mathematical frameworks used to address these issues, including the use of distributions and the Colombeau algebra. The review also highlights the limitations of the Coulomb Hamiltonian in explaining molecular structure and chemistry, emphasizing the need for additional theoretical modifications to bridge quantum mechanics and chemical phenomena.

Keywords:

quantum electrodynamics; atoms; molecules

1. Introduction

Lorentz invariant quantum electrodynamics is a theory of electrons and photons; nuclei are treated largely as classical spectators [1]. It is a mathematical idealisation of a scattering experiment in which the S-matrix calculated through the Feynman rules for perturbation theory is related to experimental cross-sections of scattered particles. Atoms and molecules are characterised minimally by the specification of a definite number of nuclei and electrons, particles with electric charge, which clearly at a fundamental level require electrodynamics for their description. There is no Lorentz and gauge invariant account of atoms, molecules, condensed matter etc. interacting with the electromagnetic field. The non-relativistic Hamiltonian focusses essentially on the bound states associated with the Coulombic interactions of electrons and nuclei that are coupled to electromagnetic radiation, and cannot be viewed simply as a defined limit of the Lorentz invariant theory. Of course, the generic molecule is defined precisely in terms of a ‘classical structure’ and this chemical fact sets a comprehensive molecular quantum theory [2,3,4,5,6] apart from the well established quantum theory of the atom. Although bound states can be incorporated in an S-matrix theory it is not possible as a practical procedure to obtain bound states through perturbation theory starting from the continuum. Importantly, the non-relativistic Hamiltonian is of interest in its own right; for example the question as to whether it has a ground state, which cannot be answered by perturbation theories, is important for understanding the stability of matter [7].

We begin with a classical description in the knowledge that the canonical quantisation scheme due to Dirac, based on a correspondence between the classical Poisson brackets (P.B.s) and quantum commutators, is a standard procedure for obtaining a quantum theory from a classical analogue that has been cast in Hamiltonian form. It has long been recognised however that the scheme is based on an asymptotic correspondence (’

ℏ \to 0

’) which may not be reliable, since the resulting quantum theory may or may not turn out to be satisfactory. There is no classical limit for spin so it is added to non-relativistic quantum theory ad hoc. The classical theory is thus a recognizable starting point towards a quantum theory, the required endpoint. A very recent review of the development and applications of molecular QED based on the Power-Zienau-Woolley (PZW) Hamiltonian, particularly in optical physics, is complementary to this account [8].

The outline of the paper is as follows. In Section 2, we summarise the fundamental equations of classical electromagnetism. In view of some elementary relations in vector calculus and the Maxwell equations, the field variables

(E, B)

, can be expressed in terms of the field ‘potentials’

(ϕ, a)

which are only defined up to a ‘gauge transformation’. Likewise the charge-current density for the charged particles can be related to so-called electric, (

P

), and magnetic,

(M)

, polarisation fields, which have a similar arbitrary character. The field potentials and the polarisation fields are best thought of as ‘auxiliary’ or ‘working’ variables as they cannot be chosen so as to describe a specific experimental setup. In this context the independence of the physical quantities from the auxiliary variables is usually referred to as gauge invariance. A conventional account of the electric polarisation field follows in Section 3.

The Hamiltonian formulation of the classical electrodynamics of charged particles and the electromagnetic field can be obtained via the intermediate step of a Lagrangian and appeal to the Principle of Least Action. This is reviewed in Section 4. We meet for the first time the functional scalar product of the electric polarisation field and the vector potential,

F = \int_{ℜ^{3}} P \cdot a d x

(1)

which is a quantity with dimensions of the mechanical variable action that turns out to be of fundamental significance in both classical and quantum electrodynamics. There are several subtle changes of viewpoint here; the original equations of motion, modelled on macroscopic classical electrodynamics, describe the electromagnetic fields associated with prescribed sources through Maxwell’s equations, while Newton’s laws are used to describe the motion of charged particles in space in a prescribed electromagnetic field. The Lagrangian formalism based on Section 4, Equation (44), however, describes a closed system of charges and field for which

\partial L / \partial t = 0

, so that by the usual arguments the Hamiltonian H is the constant energy of the whole system. It is important to note that the customary starting point for Lagrangian electrodynamics involves symbols for the electric charges

{e_{n}}

and masses

{m_{n}}

of the particles which are parameters that cannot be assumed to have the experimentally determined values at this stage of the formalism.

In the next section, Section 5, we describe the transition to a Hamiltonian scheme using Dirac’s method for dealing with a Lagrangian that leads to constraints, equations between the Hamiltonian variables. An important consequence of transforming to the Hamiltonian formalism is that the scalar potential

ϕ

is eliminated, and gauge transformations involve only the vector potential

a

. The classical Hamiltonian for a collection of charges interacting with electromagnetic radiation may be put in the conventional form

H = H_{charges} + H_{rad} + H_{int} = E (a constant)

(2)

where

H_{charges}

is the standard Coulomb Hamiltonian for charged particles,

H_{rad}

describes the free electromagnetic field, and

H_{int}

describes their interaction (cf. Equations (105)–(107)). Every term in the interaction Hamiltonian is gauge-dependent; nevertheless the classical equations of motion that follow from H are properly gauge-invariant.

The quantisation of the classical Hamiltonian for electrodynamics is described in Section 6; it follows from the application of the usual canonical quantisation rules for the particle and field variables. They are to be regarded as linear operators on some Hilbert space; the classical P.B.s are transformed into commutators which are of a purely algebraic nature,

A \to A; i ℏ {A, B} \to [A, B]; [A, B] = A B - B A .

(3)

The question arises as to how the Dirac constraint

Γ

, Equation (64), which is a ‘weak’ equality, should be dealt with in quantum theory. Two different methods are described; in the first, the classical variables in

Γ

are interpreted as operators, and physical states of the system, {

Ψ

} are required to satisfy the condition

Γ Ψ = 0 .

(4)

This leads to the identification of the operator that generates gauge transformations, and the operator relationship between the general Hamiltonian (arbitrary

g

obtained from (Equation (78))) and the familiar Coulomb gauge Hamiltonian (

g^{⊥} = 0

) with interaction terms of the form Equation (120)—the Power-Zienau-Woolley (PZW) transformation operator (138).

The condition (4) however is difficult to implement for practical calculations; these are usually based on the quantisation of the reduced classical Hamiltonian obtained with Dirac’s method of handling the constraint

Γ

. This leads to the PZW Hamiltonian (141). In the quantum mechanics of systems with a finite number of degrees of freedom it is a theorem that the Hilbert space is essentially unique, and different representations are related by unitary transformation. The theorem is not valid for field theories like QED which have an infinite number of degrees of freedom, so that the physical Hilbert space is not known a priori. One has to make a choice (guess); the usual choice is the free field Fock space.

Some of the properties of the Hamiltonian for non-relativistic QED are summarized in Section 7. The field operators like the Coulomb gauge vector potential,

A

, are represented as Fourier integrals satisfying the field commutation relations. Firstly, the infinite energy of the field vacuum state is recalled. Although the offending infinity is commonly waved away, its occurence points to a deeper problem in the formalism that reappears when interactions between charges and the field are addressed. This is illustrated with a simple ’chain’ calculation using the Recursion Method which has the aim of representing the Hamiltonian as an infinite dimensional tri-diagonal matrix. The usual reponse to appearance of infinities is simply to cut-off the large momentum contributions to sums (integrals) over photon momenta. The resulting Hamiltonian is then properly defined as a linear, self-adjoint operator on Fock space, and a considerable mathematical literature devoted to non-perturbative methods of studying its properties has developed in the twenty first century.

In the literature of atomic, molecular and optical physics as well as theoretical chemistry perturbation methods, based on scattering theory (the diagram technique for the S-matrix) or response theory, are more or less universal. The two popular forms for the Hamiltonian either involve the Coulomb gauge (

P^{⊥} = 0

) or the ‘multipolar’ (or Poincaré) gauge with the polarisation field expressed as a (truncated) multipole series. While there has been much discussion as to which is to be preferred, the reality is that

P^{⊥}

is arbitrary, so that there are infinitely many possible choices, and a ’beauty contest’ between just two familiar formalisms does not address the deeper issue. What are the conditions that guarantee that the results of a calculation are independent of the choice of

P^{⊥}

? This is considered in Section 8 with some remarks on the conditions for the invariance of the S-matrix.

In the limit of point charges the unitarity of the PZW transformation is lost, and gauge invariance cannot be assumed. The origin of the problem is the occurrence of the Dirac ‘delta function’,

δ

, in the original definitions of the charge and current density for point charges from which the polarisation fields are derived, and in the commutation relations of the electric and magnetic fields. The

δ

must be handled with great care since it is a distribution; there is no continuous function with the properties Dirac postulated when he introduced the ‘delta function’. All the fields are distributions and they occur in quadratic combinations in the Hamiltonian.The particular difficulty here is that the product of two distributions is not generally defined. A sketch is given in Section 8 of how this problem can be overcome in non-relativistic QED by construction of a Colombeau algebra; a brief account of distribution theory and Colombeau’s extension of it is given in Appendix A.

The unperturbed Hamiltonian in the S-matrix theory is composed of just those terms in the full Hamiltonian that have no coupling between the charges and the electromagnetic field. This is the topic for the final Section 9. The properties of the field are well-known and need not be repeated. Some work has to be done to show in a gauge-invariant fashion that the Hamiltonian for the charges is the familiar Coulomb Hamiltonian. In the special case of one nucleus this is the basis for the quantum theory of the atom, one of the great successes of the early work in quantum theory. With more than one nucleus one has the Isolated Molecule model which is usually approached through versions of the ‘Born-Oppenheimer approximation’ discussed here from a modern viewpoint. The crucial step in their approach was to modify the Coulomb Hamiltonian by simply replacing the nuclear position operators with fixed classical position vectors which were then treated as parameters in the resulting purely electronic Hamiltonian. This leads directly to the idea of a ‘potential energy surface’, a cornerstone in chemistry. Calculations that make no reference to the Born-Oppenheimer idea remain limited to only the simplest molecules. How all of this relates to a quantum mechanical understanding of chemistry is an open question.

2. Classical Electromagnetism

The microscopic classical electrodynamics of a collection of charged particles with charge-current density (

j_{0}, j

) is modelled on the Maxwell equations for the electromagnetic fields (

E, B

) associated with this charge-density (For simplicity the space-time variables

x, t

are suppressed)

\begin{matrix} \nabla \cdot B & = 0 \\ \nabla \land E + \frac{\partial B}{\partial t} & = 0 \\ ϵ_{0} \nabla \cdot E & = j_{0} \\ \nabla \land B - \frac{1}{c^{2}} \frac{\partial E}{\partial t} & = μ_{0} j, \end{matrix}

(5)

where c is the speed of light,

ϵ_{0}

is the vacuum permittivity, and

μ_{0}

is the vacuum permeability, and the Lorentz force density

F = j_{0} E + j \land B

(6)

required for Newton’s mechanics for the particles in the field (

E, B

).

The classical Maxwell equations for the field of a point charge at rest at the origin of the coordinates reduce to

\begin{matrix} ϵ_{0} \nabla \cdot E = & j_{0} \\ j_{0} (x) = & e δ^{3} (x) \end{matrix}

(7)

where

δ

is the Dirac ‘delta function’ (see Appendix A). The ‘solution’ that vanishes at ∞ for the electric field at the point

x

, E(x),is

E (x) = \frac{e \hat{x}}{4 π ϵ_{0} x^{2}}

(8)

where

\hat{x}

is a unit vector. This leads to a divergent energy integral

ε = \frac{1}{2} ϵ_{0} \int E (x) \cdot E (x) d^{3} x = \infty,

(9)

a troubling result that is a harbinger of much of the subsequent story.

The third equation in Equation (5) is usually referred to as Gauss’s Law; in the following it will turn out to play a fundamental role in the discussion of gauge invariance. The fields (

E, B

) describe six degrees of freedom (at each space-time point) which are not all independent by virtue of the Maxwell equations. This redundancy is reduced by the introduction of a pair of auxiliary variables called field potentials as follows. In view of standard vector identities and the Maxwell equations, the fields (

E, B

) can be reconstructed from the fields (

ϕ, a

) according to

B = \nabla \land a, E = - \frac{\partial a}{\partial t} - \nabla ϕ .

(10)

It is easily seen that the potentials (

ϕ, a)

are arbitrary since equally good ones (

U, A)

can be defined by the relationships

\begin{matrix} a \to & A = a - \nabla f \\ ϕ \to & U = ϕ + \frac{\partial f}{\partial t} \end{matrix}

(11)

for any suitably smooth scalar field f that is consistent with the behaviour of the field-strengths (

E, B)

at infinity. This arbitrary character of the potentials is referred to as their gauge dependence and Equation (11) defines a gauge transformation.

The general solutions of Equations (10) and (11) can be expressed in terms of the field-strengths by setting [9]

\begin{matrix} A (x) = & \int_{ℜ^{3}} \frac{\nabla^{'} \land B^{'}}{4 π | x^{'} - x |} d x^{'} \\ U (x) = & \int_{ℜ^{3}} \frac{\nabla^{'} \cdot E^{'}}{4 π | x^{'} - x |} d x^{'} \\ = & \int_{ℜ^{3}} \frac{j_{0}^{'}}{4 π ϵ_{0} | x^{'} - x |} d x^{'} \\ f (x) = & - \int_{ℜ^{3}} \frac{𝒳^{'}}{4 π | x^{'} - x |} d x^{'} \end{matrix}

(12)

with an arbitrary choice

𝒳 = \nabla \cdot a

for the divergence of the vector potential; any such linear condition on the vector potential defines a gauge condition. Then, from the last line in Equation (12), we have that f is a solution of the Poisson equation with an arbitrary source term

\nabla^{2} f = 𝒳 .

(13)

This arbitrariness can be removed by making a definite choice for

𝒳

. A particularly important choice of gauge leads to the ‘Coulomb’ or ’radiation’ gauge characterised by the condition,

𝒳 = \nabla \cdot A = 0

(14)

so that

f = 0

. In the particular case of the free field (

j_{0} = 0

) we also have

U = 0

, from Equation (12), and so the field-strengths in this case can be described by the potentials (Throughout the review, we use

A

for the transverse Coulomb gauge vector potential)

\begin{matrix} ϕ = 0 \end{matrix}

(15)

\begin{matrix} a = A . \end{matrix}

(16)

Thus the free field is described completely by the transverse vector field

A

, that is, with two degrees of freedom that correspond to the two independent polarisation states of the radiation. On the other hand the general potentials

(ϕ, a)

in the presence of sources are still four degrees of freedom so there is a redundancy in the chosen auxiliary variables. The Hamiltonian scheme described in Section 5 removes this redundancy.

In a similar way, the charge-current density can be represented by a pair of vector fields according to

j_{0} = - \nabla \cdot P, j = \frac{d P}{d t} + \nabla \land M

(17)

and again there is a transformation rule

\begin{matrix} P \to \tilde{P} = & P + \nabla \land U \end{matrix}

(18)

\begin{matrix} M \to \tilde{M} = & M - \frac{d U}{d t} + \nabla u \end{matrix}

(19)

where u and

U

are arbitrary fields (with appropriate derivatives).

P

and

M

are conventionally referred to as the ‘electric polarisation’ and ‘magnetisation’ fields respectively. The terminology is traditional though these two fields have nothing to do with the classical electromagnetism of bulk condensed matter. Any two pairs {

P, M

} and {

\tilde{P}, \tilde{M}

} related by Equations (18) and (19) will satisfy Equation (17) and so cannot be distinguished. The polarisation fields have no explicit time dependence and are treated as functions of the coordinates and velocities of the charged particles.

3. The Electric Polarisation Field

In early formulations of the electrodynamics of atoms and molecules, the electric polarisation field was thought of as derived from a multipole expansion of an atomic/molecular charge density made about some privileged point

O

within the charge density which, for example, could be the centre-of-mass of the atom. Commonly, the expansion would be restricted to just the leading terms [10]

P = (d + Q : \nabla \dots) δ^{3} (\cdot; O)

(20)

where

d

and

Q

are the electric dipole and quadrupole moments of the charge density respectively and the delta function locates the point about which the expansion is made. Later it was shown that the complete multipole series can be summed up into an integral which again retains the arbitrary origin [11]

P = \sum_{n} e_{n} r_{n} \int_{0}^{1} δ^{3} (\cdot; O + λ r_{n}) d λ

(21)

where

r_{n} = x_{n} - O

; Taylor series expansion of the integral [12] about

O

leads back to Equation (20).

The integral can also be recognised as a parametric form for the sum of line integrals of the Dirac delta function taken over straight paths from the arbitrary origin to the particle positions (It is often convenient to omit the space point

x

and write formulae with · as a simple place-holder.

δ^{3} (\cdot; y)

is the three-dimensional Dirac delta function in which

y

appears as a parameter; later on we will need to view the Dirac delta as a distribution. In this expression the integral is taken along a path

C

; on such a curve,

z

is a definite function

z (r, x^{'}, λ)

of the particle coordinates and a real parameter which varies between upper and lower limits corresponding to the endpoints of the line integral. Provided

\partial z / \partial λ

is bounded and piecewise continuous on

C

, the integral is well defined. The path

C

is a purely spatial curve

P = \sum_{n} e_{n} \int_{O}^{x_{n}} δ^{3} (\cdot; z) d z .

(22)

In Equations (20)–(22) the symbol · is to be understood as a placeholder for the space variable

x

in the delta function. Evaluation of this integral is straightforward; for the straight line path from

X_{1}

to

X_{2}

one finds [13]

P (x) = \frac{e \hat{r} δ^{2} (1 - cos (ϑ))}{{| X_{1} - x |}^{2}}, | X_{1} - x | \leq | r |

(23)

and otherwise 0, where

r = X_{2} - X_{1}

, and

ϑ

is the angle between the vectors

r

and

X_{1} - x

.

P

has dimensions of

Q / L^{2}

since

δ^{2} (θ)

is dimensionless. This particular path however has no physical significance, and any other path is just as valid. Let

C_{1}

and

C_{2}

be two distinct paths from the charge at

X_{1}

to the charge at

X_{2}

with

C_{2}

the straight line path between the two charges so that

C_{1}

–

C_{2}

is a closed loop. Then formally

P (x; C_{1}) = P (x; C_{2}) + e (\int_{Σ_{12}} \nabla_{z} \land δ^{3} (z - x) d S)

(24)

where

Σ_{12}

is a surface bounded by the closed path

z

formed from

C_{1}

and

C_{2}

. The arbitrariness in

P

is carried by the surface integral.

The electric polarisation field,

P

, of N charged particles {

e_{i}, i = 1 \dots N

} is any solution of the divergence equation

\nabla \cdot P = - j_{0} .

(25)

For point charges at positions {

X_{i} i = 1 \dots N

} it is customary to take

j_{0}

to be given by

j_{0} = \sum_{i}^{N} e_{i} δ^{3} (\cdot; X_{i})

(26)

where the {

X_{i}

} are parameters. Since Equation (25) is linear we may write

P = \sum_{i}^{N} P_{i}

(27)

so that

\frac{1}{e_{i}} \nabla \cdot P_{i} = - δ^{3} (\cdot; X_{i}) .

(28)

This equation is, to within a constant, the defining equation for the Green’s function or fundamental solution for the divergence equation,

\nabla \cdot g (.; x^{'}) = - δ^{3} (\cdot; x^{'}) .

(29)

so if we can find a Green’s function

g

we have

P_{i} = e_{i} g (\cdot; X_{i}) .

(30)

Equation (28) is also, to within a constant, the same as Gauss’s Law for the electric field intensity,

E

, in the Maxwell equations, so what is said about

g

here also applies to

E

. The correspondence between their Green’s function solutions is just

E \leftrightarrow - \frac{1}{ϵ_{0}} P

. The difference between the two cases is that there is only the divergence Equation (25) to specify the electric polarisation field. This is enough to fix the longitudinal component (and likewise Gauss’s Law determines

E^{‖}

) but the transverse component of the polarisation field is left undetermined, whereas the physical electric field is constrained by the other Maxwell equations. Thus in addition to Gauss’s Law we have for example

\nabla \land E + \frac{\partial B}{\partial t} = 0

(31)

which must be satisfied by the transverse component

E^{⊥}

.

The literature identifies the vector-valued function

g {(x; x^{'})}^{‖} = \nabla_{x} (\frac{1}{4 π | x - x^{'} |})

(32)

valid for

x \neq x^{'}

as a Green’s function for Equation (29); it is not defined when

x

and

x^{'}

coincide. The solution set of Equation (29) is much more general than purely (32). A transverse vector field defined by

g {(x; x^{'})}^{⊥} =

Curl_xf(x,x′) where

f

is any differentiable vector field in the variable

x

can be added to

g^{‖}

along with any solution

g_{0}

of the homogeneous equation associated with Equation (29).

The line integral form

g {(x; x^{'})}^{C} = \nabla_{x} (\frac{1}{4 π | x - O |}) + \int_{C_{[O]}}^{x^{'}} δ^{3} (x - z) d z

(33)

for paths

C_{[O]}

from some origin

O

to the field point

x^{'}

is particularly important in electrodynamics. If the Dirac delta function is multiplied by the unit dyadic and then decomposed into longitudinal and transverse components [14,15]

δ_{α β} δ^{3} (x - y) = δ_{α β}^{‖} (x - y) + δ_{α β}^{⊥} (x - y)

(34)

Equation (33) becomes, in component form (

α, β = 1, 2, 3

)

\begin{matrix} g {(x; x^{'})}_{α}^{C} = & \nabla_{x, α} (\frac{1}{4 π | x - O |}) + \int_{C_{[O]}}^{x^{'}} δ_{α β}^{‖} (x - z) d z_{β} \\ + & \int_{C_{[O]}}^{x^{'}} δ_{α β}^{⊥} (x - z) d z_{β} . \end{matrix}

(35)

The first two terms combine to give precisely

g {(x; x^{'})}^{‖}

(32) and the third term is purely transverse by construction.

If one chooses

g = g^{‖}

the polarisation field (27) is that appropriate to electrostatics because its Curl vanishes in accordance with the Maxwell equations (zero magnetic field). When radiation is involved (moving charges) the polarisation field may have a transverse component and this can be accommodated with the use of the line integral form (33) for

g

. The Coulomb gauge version of electrodynamics corresponds to choosing the purely longitudinal polarisation field. Since the origin

O

and the choice of path

C_{[O]}

in Equation (33) are arbitrary, this freedom is an expression of gauge symmetry.

A useful simplification for the line integral Green’s function follows from the recognition that the arbitrary origin

O

should not appear in the final result. For an overall neutral system of charges this can be achieved by a reordering of the terms in the charge density so that the limits in every line integral are associated with coordinates of charges, and terms involving

O

no longer appear [16]. Thus for the neutral two-particle system, the function

g (x; x^{'}; x^{″}) = \int_{x^{'}}^{x^{″}} δ^{3} (x - z) d z

(36)

derived directly from the Green’s function, and the charge density

j_{0} (x^{'} - X) = e δ^{3} (x^{'} - X_{1}) - e δ^{3} (x^{'} - X_{2})

(37)

yields the polarisation field in the well-known form (38)

P = e \int_{X_{1}}^{X_{2}} δ (\cdot; z) d z .

(38)

The vector

g

was used by Dirac [17] in a manifestly gauge-invariant formulation of quantum electrodynamics (The quantity

c_{r} (x, x^{'})

in ref. [17], Dirac’ s equations [14,15,18], is essentially

g

); he considered the example of a single electron located at a point

X

and examined the electric field around it. At a point

x

in space this turns out to exceed the electric field of the vacuum state by an amount

e ϵ_{0}^{- 1} g (x : X)

. The choice of

g

specified in Equation (32) leads to the result that the excess field is precisely the Coulomb field of the charge; a more general choice such as Equation (33) leads to the Coulomb field plus a field of pure electromagnetic radiation as the excess. Furthermore the line integral form implies that the excess electric field is concentrated purely on the path

C

ending at the charge.

Dirac interpreted the electric field associated with the path

C

as a single Faraday line of force extending from the charge to the reference point

r

, which he took to be spatial infinity. He also noted that a closed path would describe a state of the electromagnetic field that is connected with the particles because the elementary charge e occurs in the coefficient of the integral. He further conjectured that a novel quantum electrodynamics might be constructed using the lines of force (the paths

C

) as the basic dynamical variables from which our conventional notions of charged particles and electromagnetic fields would be derived. This radical idea has recently been pursued in a different direction inspired by modern string theory [19,20] which has an obvious visual relationship to Faraday’s pictorial representation. Faraday’s picture of electromagnetism in which the lines of force were physical objects (strings of electric flux) held sway for many years in the nineteenth century and only really gave way to the Maxwellian picture of charged particles as sources and currents leading to electromagnetic fields after Heaviside had cast Maxwell’s theory into the vector equations that are now so familiar.

4. Classical Electrodynamics in Lagrangian Form

The Principle of Least Action asserts that a dynamical system can be characterised by a function of coordinates and velocities called the Lagrangian,

L_{0}

, such that the integral

S = \int L_{0} d t

(39)

has its minimum value for the actual motion of the system. The extremum can be found by the usual Calculus of Variations with the restriction that the variations vanish at the endpoints; the condition for the minimum

δ S = 0

(40)

leads to the Euler-Lagrange equations, which are the equations of motion. The Lagrangian formalism can be extended to work with second (or higher) derivatives of the coordinates with a method originally due to Ostrogradsky [21].

The potential role of the involvement of the particle acceleration in electrodynamics is suggested by the fact that in the point-particle limit it is known that the vector potential for the interacting system contains a term proportional to it because of self-interaction [22]. Consider the simple one-dimensional system described by the following Lagrangian

L = \frac{1}{2} m {\dot{x}}^{2} + a x \ddot{x}

(41)

in which the particle acceleration makes an explicit appearance. L is a highly simplified abstraction of the dynamics of a charged particle interacting with its own electromagnetic field. Application of Ostrogradsky’s method [21] yields the equation of motion as

(m - 2 a) \ddot{x} = 0

(42)

that is, L describes free motion of a particle with a modified mass. In this ‘toy’ example we see that Equation (41) may be rewritten as

L = \frac{1}{2} (m - 2 a) {\dot{x}}^{2} + a \frac{d}{d t} (x \dot{x})

(43)

which leads directly to Equation (42) in the usual approach since a total time derivative makes no contribution to the variation of the action S and may be dropped. The actual situation in classical electrodynamics is unfortunately much more complicated [23,24].

The Lagrangian variables for electrodynamics are customarily identified as the ‘coordinates’

{x_{n}, ϕ, a}

and their time derivatives. The conventional Lagrangian for the complete system of N charges + field may be written [15,25,26] in an obvious notation

L_{0} = L_{charges} + L_{int} + L_{field}

(44)

where

\begin{matrix} L_{charges} = & \frac{1}{2} \sum_{n} m_{n} {| {\dot{x}}_{n} |}^{2} \end{matrix}

(45)

\begin{matrix} L_{int} = & - \int_{ℜ^{3}} j_{0} ϕ d x + \int_{ℜ^{3}} j \cdot a d x \end{matrix}

(46)

\begin{matrix} L_{field} = & \frac{1}{2} ϵ_{0} \int_{ℜ^{3}} [(\dot{a} + \nabla ϕ) \cdot (\dot{a} + \nabla ϕ) \\ - c^{2} \nabla \land a \cdot \nabla \land a] d x . \end{matrix}

(47)

Implementing Equation (40) with Equations (44)–(47) yields the expected equations of motion for the charged particles and the electromagnetic field provided only that the field and particle variables can be varied independently in the derivation of the Euler-Lagrange equations. There is no requirement for the potentials to be restricted by a specific gauge condition (see below) since the action

S

is gauge-invariant.

If we make the gauge transformation (11) in this Lagrangian, Equation (44) is augmented by an additional term

L^{'} = - \int_{ℜ^{3}} j_{0} \frac{\partial f}{\partial t} d x - \int_{ℜ^{3}} j \cdot \nabla f d x .

(48)

Varying f in the corresponding term in the action integral then yields

δ S^{'} = \int_{ℜ^{4}} (\frac{d j_{0}}{d t} + \nabla \cdot j) δ f d x

(49)

which must vanish for the actual motion. This requires that

\nabla \cdot j + \frac{d j_{0}}{d t} = 0

(50)

which is the equation of continuity for electric charge. If we integrate Equation (50) over a small volume

Ω

with boundary surface S we have

\int_{Ω} \nabla \cdot j (x) = \int_{S} j \cdot d S = - \int_{Ω} \frac{d j_{0}}{d t} d x = - \frac{d q_{Ω}}{d t},

(51)

that is, the flux of charge through the surface S is precisely equal to the rate of change of charge inside the volume

Ω

; no net charge is created or destroyed. This is the law of conservation of electric charge, expressed locally. Taking the integral in Equation (51) over all space leads to

\begin{matrix} \int_{ℜ^{3}} \nabla \cdot j d x & = - \frac{d Q}{d t} = 0 \\ \to Q = \int_{ℜ^{3}} j_{0} d x = constant . \end{matrix}

(52)

Thus not only does the Lagrangian (44) lead to the equations of motion but it also encodes a fundamental conservation law. In the modern viewpoint the argument is reversed; the fundamental property of conservation of electric charge is incorporated by coupling the conserved quantity, the electric charge of the particles described by the charge-current density (

j_{0}, j

), to a commensurate set of ‘gauge variables’, (

ϕ, a

), in a Lagrangian formalism, with the Euler-Lagrange equations providing the equations of motion.

Using the freedom to change the field potentials according to Equation (11) it is straightforward to show using the equation of continuity (50), that a more general form of Equation (46) is [27]

L_{int} = - \int_{ℜ^{3}} j_{0} ϕ d x + \int_{ℜ^{3}} j \cdot a d x - \frac{d F}{d t}

(53)

where the action F is,

F = \int_{ℜ^{3}} P \cdot a d x .

(54)

A total time-derivative in the Lagrangian does not affect the equations of motion obtained from the Principle of Least Action. In other words, if the potentials (

ϕ, a

) are transformed to new potentials (

U, A

) according to Equation (11) then the corresponding

L_{int}

is obtained by simply replacing (

ϕ, a

) by (

U, A

) in every term including the time derivative; the interaction Lagrangian (53) is form-invariant under such a gauge transformation. However if the time derivative in Equation (53) is explicitly evaluated the result can be put in the form [28]

L_{int}^{'} = \int_{ℜ^{3}} P \cdot E d x + \int_{ℜ^{3}} M \cdot B d x .

(55)

The other parts of the Lagrangian are not altered by these transformations.

According to Equations (17)–(19) the transverse component of

P

is quite arbitrary and all possible paths in Equation (21) should be taken on an equal footing. Obviously any physical quantity should be independent of any particular choice of path; this is no more than a restatement of the requirement for physical quantities to be gauge-invariant. For the case of a single point charge e with position

q

we have

P (x) = e \int_{r}^{q} δ^{3} (z - x) d z + e \nabla_{x} \frac{1}{4 π | x - r |}

(56)

so that

F (q) = e \int_{r}^{q} a (z) \cdot d z + e \int_{ℜ^{3}} a \cdot \nabla_{x} \frac{1}{4 π | x - r |} d x .

(57)

The general vector potential may be written in terms of the unique Coulomb gauge vector potential, and an arbitrary longitudinal vector field

a = A + \nabla f

(58)

according to Equation (11), and so the action F is

F (q) = \int_{r}^{q} d ω + e f (q) .

(59)

The differential 1-form

d ω = e A (z) \cdot d z

(60)

is defined in terms of an infinitesimal path element

d z

along a path

C

from the arbitrary reference point

r

to the particle’s position

q

.

Its most important property is demonstrated by evaluating

F (q)

for two different paths

C_{1}

and

C_{2}

connecting

r

to the position of the charge

q

. We have

\begin{matrix} Δ F_{1, 2} = & F {(q)}_{1} - F {(q)}_{2} \\ = & \int_{r, C_{1}}^{q} d ω - \int_{r, C_{2}}^{q} d ω \\ = & \int_{r, C_{1}}^{q} d ω + \int_{q}^{r, C_{2}} d ω \equiv e \oint d ω \\ = & e \int_{S_{12}} B \cdot d S \end{matrix}

(61)

where

S_{12}

is the area bounded by the curves

C_{1}

and

C_{2}

. Thus if

q

lies in a region where the magnetic field is non-zero, the action

F (q)

does not have a definite value. This is not important for the classical Lagrangian; whatever path is specified for the evaluation of the time-derivative, it ends up in the modified interaction Lagrangian (55), and the equations of motion are unaffected. The differential 1-form

d ω

turns out to be of fundamental significance in the Hamiltonian formalism particularly for quantum charges interacting with either classical or quantised electromagnetic fields.

5. Classical Electrodynamics in Hamiltonian Form

The Lagrangian description is based on a configuration space in which the familiar Euclidean notions of distance and angles derived from a metric are valid. The Hamiltonian description introduces ‘momenta’ defined as

p = \partial L / \partial \dot{x}

, and regards x and p as independent variables on an equal footing. The resulting ‘phase-space’ has no natural metric with which to define distances and angles; its geometry is altogether more abstract, so-called symplectic geometry. A characteristic feature of the Hamiltonian description is its use of the P.B. in, for example, the statement of Hamilton’s equations of motion. The P.B. provides a rule for differentiation of functions of the phase-space variables, that is, their variation under infinitesimal displacement. We must compare the value of the function at one point,

z = (x, p)

, with its value at an infinitesimally displaced point,

z + d z

; infinitesimal displacements are evaluated with the differential operators

\partial / \partial x, \partial / \partial p

.

The assumption in the Lagrangian formalism that the particle and field variables can be varied independently in the action for the derivation of their equations of motion, translates into the statement in the Hamiltonian scheme that their mutual P.B.s vanish. In the case of a field described by a ‘field coordinate’

σ

, the corresponding canonical ‘momentum’ variable is defined as a functional derivative,

π = δ L / δ \dot{σ}

. Thus taking the customary Lagrangian coordinates for electrodynamics (Section 4), the corresponding Hamiltonian momentum variables are

p_{n} = \frac{\partial L}{\partial {\dot{x}}_{n}}, π = \frac{δ L}{δ \dot{a}}, π_{0} = \frac{δ L}{δ \dot{ϕ}} .

(62)

An important fact about the Lagrangian (53) is that the time-derivative of the scalar potential

ϕ

is absent which means its corresponding momentum,

π_{0}

, is null and

ϕ

can play no role in the dynamics; such a Lagrangian is said to be degenerate because its associated Hessian matrix is singular. On passing to the Hamiltonian in the usual way one cannot then eliminate all the velocities in favour of the momenta. Lagrangian degeneracy implies the presence of degrees of freedom which are not all linearly independent. The Hamiltonian formulation for a degenerate Lagrangian is due to Dirac; it is most simply understood as a mathematical procedure for systematically removing the redundancies among the variables. Dirac’s method is well described in the literature [1,29,30,31,32,33,34] and its application to non-relativistic electrodynamics is too [12,18,34,35].

The Hamiltonian that results from the Lagrangian (44) with the assumption that the charge-current density (

j_{0}, j)

describes point charged particles is [35]

\begin{matrix} H = & \sum_{n}^{N} \frac{1}{2 m_{n}} {(p_{n} - e_{n} a (x_{n}))}^{2} \\ + & \frac{1}{2} ϵ_{0}^{- 1} \int_{ℜ^{3}} ({| π |}^{2} + ϵ_{0}^{2} c^{2} {| B |}^{2}) d x + \int_{ℜ^{3}} w Γ d x \end{matrix}

(63)

where

Γ = \nabla \cdot π + j_{0} \approx 0

(64)

and

w (x)

is an arbitrary coefficient. The particle and field conjugate momenta, {

p_{n}

},

π (x)

, have canonical P.B.s with their coordinate partners

\begin{matrix} {x_{n}^{r}, p_{m}^{s}} & = δ_{n m} δ_{r, s} \\ {a {(x, t)}^{r}, π {(x^{'}, t)}^{s}} & = δ_{r s} δ^{3} (x - x^{'}) . \end{matrix}

(65)

The canonical momenta in these relations may be expressed in terms of their conjugate positions as derivatives (functional derivatives for the field variable)

\begin{matrix} p_{n} & \to - \frac{\partial}{\partial x_{n}} \\ π & \to - \frac{δ}{δ a} . \end{matrix}

(66)

At this stage both

a

and

π

have three components with no gauge specified for the vector potential; the scalar potential is a redundant variable in the canonical formalism and has been eliminated.

Γ

is an equation of constraint which we write with Dirac’s ‘weak’ equality symbol ≈ to emphasise its special status. The Hamiltonian equation of motion in P.B. notation,

\dot{Ω} = {Ω, H}

(67)

for any dynamical variable

Ω

is valid only when the equation of constraint (64) is valid.

The P.B. of H and

Γ

vanishes ‘strongly’ that is, as an ordinary equation (or definition),

{Γ (x), H} = 0

(68)

and so

Γ (x)

is a symmetry of the system—it is in fact responsible for gauge transformations of the vector potential. To see this, take the general linear superposition of

Γ

with a suitably smooth field f

G = \int_{ℜ^{3}} Γ f d x,

(69)

as the generator of a canonical transformation of the dynamical variables; for the vector potential there results

a (x, t) \to a {(x, t)}^{'} = a (x, t) + \nabla f (x)

(70)

as in Equation (11), while the particle momentum p_n transforms as

p_{n} \to p_{n}^{'} = p_{n} - e_{n} \nabla_{n} f (x_{n})

(71)

and so compensates in the Hamiltonian H for the change in Equation (70) to leave H invariant. The particle coordinates {

x_{n}

} and the field canonical momentum,

π

, are left unchanged since their P.B.s with G vanish. If we put

j_{0} = 0

in Equation (64) these equations are applicable to the electromagnetic field in a volume where there are no charges, that is, the free-field; the corresponding quantities for the free-field will be denoted by adding a subscript 0 to

Γ

and G, so that for example the canonical transformation with

G_{0}

again gives (70), and obviously there is nothing to be said about particle variables. The relationship between

G_{0}

and G is actually another canonical transformation as will be described later (see Equation (118)).

For a classical theory this scheme is an essentially complete replacement of the Maxwell-Lorentz account of charged particles and the electromagnetic field. The classical Hamiltonian incorporates the possibility of an ‘external free field’ since the field variables can have contributions from an electromagnetic field due to sources that are far from the volume of physical space that the ‘system’ (the collection of N charged particles) is supposed to reside in. Looking towards quantisation however one has to recognise that the occurrence of the arbitrary coefficient w in the Hamiltonian (63) is problematic since it is not clear how w could be interpreted as an operator, although Equation (64) may be imposed as a condition that picks out physical states.

Dirac’s method offers a solution to this problem by demonstrating that we are free to introduce a second equation of constraint subject only to the condition that it should have a non-zero P.B. with Equation (64). Then it is possible to redefine the P.B.s of all the dynamical variables so that the two equations of constraint can be taken as ordinary equations and the equations of motion for physical quantities are preserved.

The modified P.B.s are called ‘Dirac-brackets’; to distinguish them from the classical P.B. of two phase-space variables A and B, we write a Dirac-bracket as

{[A, B]}^{*}

. The Dirac-brackets have the same algebraic properties as the usual P.B.s; they are antisymmetric, associative, obey Jacobi’s identity, and satisfy the product rule

{[f g, h]}^{*} = {[f, h]}^{*} g + f {[g, h]}^{*}

(72)

which is a non-commutative version of the familiar Leibniz product rule in calculus. They are to be used in exactly the same way as the standard P.B.s and the equation of motion of a dynamical variable

Ω

takes the usual form

\dot{Ω} = {[Ω, H]}^{*} .

(73)

A possible second constraint introduces the electric polarisation field through the action F discussed in Section 4

F = \int_{ℜ^{3}} P \cdot a d x \approx 0,

(74)

since the pair

(a, π)

have a non-vanishing P.B. [18]. A simpler constraint equation to work with, independent of the charges, that can be derived from Equations (27) and (30) is

a [g] = \int_{ℜ^{3}} a (x) \cdot g (x : x^{'}) d x \approx 0

(75)

where the square brackets denote functional dependence. With the introduction of the Dirac-brackets, the two constraint equations become ordinary equations:

\begin{matrix} \nabla \cdot π & = - j_{0} \\ a [g] & = \int_{ℜ^{3}} a (x) \cdot g (x : x^{'}) d x = 0 . \end{matrix}

(76)

The first equation in Equation (76) is essentially Gauss’s Law, while the second, which should be understood as a gauge condition on the vector potential,

a

, makes (74) an ordinary equation,

F = 0

.

Equation (76) implies that the last term in Equation (63) may be dropped provided the reduced Hamiltonian is used with the Dirac-brackets instead of the original P.B.s. Once the equations of constraint are interpreted as ordinary equations, the field canonical variable,

π

, is seen to be proportional to the electric field by virtue of Gauss’s Law, and so we make the identification

π = - ϵ_{0} E .

(77)

The reduced Hamiltonian scheme then reads

\begin{matrix} H [g] = & \sum_{n}^{N} \frac{1}{2 m_{n}} {(p_{n} - e_{n} a (x_{n}))}^{2} \\ + & \frac{1}{2} ϵ_{0} \int_{ℜ^{3}} ({| E |}^{2} + c^{2} {| B |}^{2}) d x \end{matrix}

(78)

with Dirac-brackets

\begin{array}{l} {[x_{n}^{r}, p_{m}^{s}]}^{*} = δ_{n m} δ_{r s}, \end{array}

(79)

\begin{array}{l} {[a {(x)}^{r}, E {(x^{'})}^{s}]}^{*} = - ϵ_{0}^{- 1} (δ_{r s} δ^{3} (x - x^{'}) - \nabla_{x}^{r} g {(x^{'} : x)}^{s}), \end{array}

(80)

\begin{array}{l} {[p_{n}^{r}, E {(x)}^{s}]}^{*} = ϵ_{0}^{- 1} e_{n} \nabla_{x_{n}}^{r} g {(x : x_{n})}^{s} . \end{array}

(81)

However it is important to keep in mind in the following that

E

, as essentially the conjugate momentum to the vector potential

a

, is related to it by the Dirac-bracket (80) and that the scalar potential has been eliminated. Thereby nothing has been lost.

The notation

H [g]

reminds us that the Hamiltonian expressed in terms of the original canonical variables is also a functional of

g

. Note that a change of gauge will no longer be implemented as a canonical transformation since the Dirac-brackets are different in every gauge if the variables depend on

g

, while the form of the Hamiltonian (78) remains fixed. It is evident that the only changes in gauge that are possible in the Hamiltonian theory are those involving the vector potential

a

. Choosing a particular form for the polarisation field

P

, that is

g

, fixes a particular vector potential

a

through the condition

F = 0

. Obviously without a scalar potential it makes no sense to return to the gauge transformations of the original Maxwell equations (10).

Since a particle coordinate variable,

x_{n}

, has vanishing P.B.s with both constraints, its Hamiltonian equation of motion yields the velocity,

{\dot{x}}_{n}

, as

\begin{matrix} {\dot{x}}_{n} = & {[x_{n}, H]}^{*} \equiv {x_{n}, H} = \frac{\partial H}{\partial p_{n}} \\ = \frac{1}{m_{n}} (p_{n} - e_{n} a (x_{n})) = \frac{1}{m_{n}} {\bar{p}}_{n} . \end{matrix}

(82)

{\bar{p}}_{n}

is independent of the gauge of the vector potential. Thus the Hamiltonian structure (78)–(81) can be written in explicitly gauge-invariant form (that is no dependence on

g

), with

H = \sum_{n} \frac{1}{2 m_{n}} {| {\bar{p}}_{n} |}^{2} + \frac{1}{2} ϵ_{0} \int_{ℜ^{3}} ({| E |}^{2} + c^{2} {| B |}^{2}) d x .

(83)

Evidently the Coulomb interaction between the charges is left implicit in the Hamiltonian (83). At the classical level this is unimportant since Newton’s law of motion with the Lorentz force for the charges, and Maxwell’s equations with

j_{0}

and

j

as sources, may be derived formally from Hamilton’s equations of motion with Equation (83) as the Hamiltonian generating the motion. Of course one of these equations is Gauss’s Law relating the longitudinal electric field to the sources,

j_{0}

. Hamilton’s equations for the charge would be expected to be a pair of first-order differential equations for its position and momentum variables; However once their self-interactions are made explicit they are seen to be pathological since they include a term proportional to the particle’s acceleeration,

\ddot{x}

which leads to a runaway solution for the orbit [22].

Superficially the Hamiltonian (83) appears to describe ‘free’ charges and the electromagnetic field. However their interaction is carried through the Dirac-bracket relations of the modified momentum components

\begin{matrix} {[{\bar{p}}_{n}^{t}, {\bar{p}}_{n}^{r}]}^{*} = & e_{n} ϵ_{r t s} B {(x_{n})}^{s}, \\ [{\bar{p}}_{n}^{r}, E {(x)}^{s}]^{*} = & e_{n} ϵ_{0}^{- 1} δ_{r s} δ^{3} (x - x_{n}), \\ [x_{n}^{i}, {\bar{p}}_{m}^{j}]^{*} = & δ_{n m} δ_{i j} . \end{matrix}

(84)

As expected, the fundamental Dirac-bracket for the field strengths is independent of the Green’s function

g (x : x^{'})

(equivalently, is gauge-invariant),

{[B {(x)}^{r}, E {(x^{'})}^{s}]}^{*} = - ϵ_{0}^{- 1} ϵ_{r u s} \nabla_{x}^{u} δ^{3} (x - x^{'}) .

(85)

Here

ϵ_{r u s}

is the usual antisymmetric Levi-Civita symbol.

We noted earlier that the significance of the P.B. is that it provides the rule for differentiation of a function of the phase-space variables. According to the last Dirac bracket in (84) we may still identify

\bar{p}

as the generator of an infinitesimal translation of the particle

x \to x + d x

(86)

through an infinitesimal canonical transformation with the relation

d x = {x, \bar{p} \cdot d x}^{*} .

(87)

An infinitesimal translation

d x

of a general phase-space function

Ω

is given by

Ω (x + d x) = Ω (x) + {Ω, \bar{p} \cdot d x}^{*} .

(88)

If one transports

Ω

around an infinitesimal rectangle with sides

d x, d x^{'}

the result after one complete circuit is a change in

Ω

of

δ Ω = {Ω, {{\bar{p}}^{r}, {\bar{p}}^{s}}^{*}}^{*} d x^{r} d x^{' s} .

(89)

With the aid of Equation (84), this becomes

δ Ω = e {Ω, B (x) \cdot d σ}^{*}

(90)

where the area

d σ

is

d σ = d x \land d x^{'} .

(91)

A non-zero value for Equation (90) implies that translation of

Ω

by

d x

followed by a translation of

d x^{'}

is not the same as translation first by

d x^{'}

followed by

d x

; it is a basic geometrical fact that successive translations on curved surfaces do not commute, so we conclude that classical electrodynamics involves a curved phase-space characterised in some way by the vector potential.

Corresponding to the infinitesimal version (90) there is a finite integrated form involving the integral

e \int_{S} B \cdot d S

(92)

where the integral is taken over a surface

S

bounded by a closed curve

P

. By Stokes theorem this is also

e \oint_{P} a (x) \cdot d x = \oint_{P} d ω

(93)

where

B (x) = \nabla \land a (x)

(94)

expresses the usual relationship between the magnetic field and a vector potential. The close connection with Equations (58)–(60) is evident. In the terms of differential geometry the 1-form

d ω

is the ’connection’ that specifies how to make infinitesimal displacements in the phase-space, and the magnetic field

B

is the associated ’curvature’ of the space.

An alternative approach to the formulation of an Hamiltonian theory of electrodynamics originated in the work of Fermi [36]. Fermi didn’t actually write down a Lagrangian as an intermediate step towards the Hamiltonian; however his method amounts to subtracting the Lorentz gauge condition

\nabla \cdot a + \frac{\partial ϕ}{\partial t} = 0

(95)

from the original Lagrangian L (44) used here. Thereby the time derivative,

\dot{ϕ}

, of the scalar potential,

ϕ

, is introduced, and one can define non-vanishing field canonical momenta

(π_{0}, π)

by the usual calculus [15,37]. A variant is to replace the 0 on the right-hand side (RHS) of Equation (95) with an arbitrary function and add the whole combination to L with a Lagrange multiplier as a coefficient [38]. Equation (95) is an equation of constraint and one must verify that it remains valid for all times, and is consistent with the equations of motion (as Fermi did), so one is essentially back with Dirac’s method; of course the final answer for the Hamiltonian is the same independently of how it is developed and in the end the scalar potential is eliminated.

The Coulomb gauge condition (14) can be related to the polarisation field by choosing the gauge condition in the form

F^{‖} \equiv \int_{ℜ^{3}} P^{‖} \cdot A d x = 0

(96)

since vector fields orthogonal to

P^{‖}

are purely transverse. One thus obtains the usual P.B. of the Coulomb gauge vector potential and the transverse electric field strength, proportional to the transverse delta function [14]

\begin{matrix} {[A {(x)}^{r}, E {(x^{'})}^{⊥ s}]}^{*} = & - ϵ_{0}^{- 1} (δ_{r s} δ^{3} (x - x^{'}) \\ - \nabla_{x}^{r} \nabla_{x^{'}}^{s} \frac{1}{4 π | x - x^{'} |}) \\ \equiv & - ϵ_{0}^{- 1} δ_{r s}^{⊥} (x - x^{'}) . \end{matrix}

(97)

We may also write Equation (81) as

{[p_{n}^{r}, E {(x)}^{s}]}^{*} = - ϵ_{0}^{- 1} {[p_{n}^{r}, P {(x)}^{s}]}^{*} .

(98)

This suggests that we should separate the electric field vector

E

according to

E = {\tilde{E}}^{⊥} - ϵ_{0}^{- 1} P

(99)

where

{\tilde{E}}^{⊥}

is independent of the particle variables. The longitudinal part of the electric field is due purely to the charges (

E^{‖} = ϵ_{0}^{- 1} P^{‖}

) but in the polarisation field description the transverse field is shared between the field variable

\tilde{E}

and the transverse part of the polarisation field for the charges (

ϵ_{0}^{- 1} P^{⊥}

). Since the latter is arbitrary so to is

\tilde{E}

, while their difference, Equation (99) is of course definite.

The electromagnetic field contribution to H may then be expanded as

\begin{matrix} \frac{1}{2} ϵ_{0} \int_{ℜ^{3}} ({| E |}^{2} & + c^{2} {| B |}^{2}) d x = \frac{1}{2} ϵ_{0} \int_{ℜ^{3}} ({| {\tilde{E}}^{⊥} |}^{2} + c^{2} {| B |}^{2}) d x \\ - & \int_{ℜ^{3}} P \cdot {\tilde{E}}^{⊥} d x + \frac{1}{2 ϵ_{0}} \int_{ℜ^{3}} {| P |}^{2} d x . \end{matrix}

(100)

The classical Hamiltonian, expressed in this way, contains the Hamiltonian for free radiation, and terms that are linear and quadratic in

P

. With the path-dependent form for the polarisation field, the linear term can be seen as a particle-field interaction in which the transverse component,

{\tilde{E}}^{⊥}

, is integrated along paths

C

between pairs of charges. The familiar electric dipole interaction

V = - d \cdot {\tilde{E}}^{⊥}

(101)

is recovered if it is assumed that

{\tilde{E}}^{⊥}

is effectively constant along the path, and the path is of finite length. These assumptions remove the path-dependence. The quadratic term is the functional scalar product of the polarisation field with itself and involves only the particle coordinates; the evaluation of its contribution to the Hamiltonian (100),

E_{P} = \frac{1}{2 ϵ_{0}} \int_{ℜ^{3}} {| P |}^{2} d x,

(102)

is a delicate matter and is left for now; it will be discussed later. The full classical Hamiltonian (83) may be written in terms of the particle variables, the Coulomb gauge vector potential

A

and its conjugate

{\tilde{E}}^{⊥}

, and the electric polarisation field

P

, while maintaining complete freedom in the vector potential

a

. The general vector potential satisfying the gauge condition (76), may be expressed in terms of the Coulomb gauge vector potential,

A

, and the Green’s function

g (x^{'}, x)

as

a (x) = A (x) - \nabla_{x} \int_{ℜ^{3}} A (x^{'}) \cdot g (x^{'}, x) d x^{'} .

(103)

There then results

H [g] = H_{charges} + H_{rad} + H {[g^{⊥}]}_{int},

(104)

where the three terms are defined as

\begin{matrix} H_{charges} & = \sum_{n} \frac{p_{n}^{2}}{2 m_{n}} + \frac{1}{2 ϵ_{0}} \int_{ℜ^{3}} {| P |}^{2} d x \\ = T + E_{P}, \end{matrix}

(105)

H_{rad} = \frac{1}{2} ϵ_{0} \int_{ℜ^{3}} ({\tilde{E}}^{⊥} \cdot {\tilde{E}}^{⊥} + c^{2} B \cdot B) d x,

(106)

\begin{matrix} H {[g^{⊥}]}_{int} & = - \sum_{n} \frac{e_{n}}{2 m_{n}} (p_{n} \cdot a (x_{n}) + a (x_{n}) \cdot p_{n} \\ - e_{n} a (x_{n}) \cdot a (x_{n})) \\ - \int_{ℜ^{3}} P {(x)}^{⊥} \cdot {\tilde{E}}^{⊥} (x) d x . \end{matrix}

(107)

The Dirac bracket relations for the particle and field variables are

\begin{matrix} {[x_{n}^{r}, p_{m}^{s}]}^{*} & = δ_{n m} δ_{r s}, \end{matrix}

(108)

\begin{matrix} {[A {(x, t)}^{r}, \tilde{E} {(x^{'}, t)}^{⊥ s}]}^{*} & = - ϵ_{0}^{- 1} δ_{r s}^{⊥} (x - x^{'}) . \end{matrix}

(109)

Only the arbitrary transverse component of

g

contributes here; it occurs in every term in the ‘perturbation’

H {[g^{⊥}]}_{int}

.

As an example of Equation (103), suppose we specify the straight-line path

z = r + λ D, D = q - r, 0 \leq λ \leq 1

in

g (x^{'} : x)

; then evaluation of the gradient in Equation (103) yields

a (q) = \int_{0}^{1} λ B (r + λ D) \land D d λ .

(110)

Direct computation shows that

a (q)

is indeed a vector potential since it satisfies

\nabla_{q} \land a (q) = B (q)

. The gauge condition (75) can then be put in the simple form

D \cdot a (q) = 0 .

(111)

Thus in this gauge the vector potential at the position,

q

, of a charge is such [39] that its component along the straight line connecting

q

to the fixed point

r

vanishes, by Equation (111).

If we take Equation (103) at the position

x_{n}

of a charge

e_{n}

and multiply through with

e_{n}

we may interpret the result as the transformation

\begin{matrix} e_{n} A (x_{n}) \to e_{n} a (x_{n}) = & e_{n} A (x_{n}) - \nabla_{x_{n}} \int_{ℜ^{3}} P \cdot A d x \\ = & e_{n} A (x_{n}) + {p_{n}, F [0]} \end{matrix}

(112)

in P.B. notation, with the ‘0’ indicates that

g^{⊥} = 0

since this implies

a \to A

.

F [0] = \int_{ℜ^{3}} P \cdot A d x .

(113)

Although the first line of Equation (112) has the form of a gauge transformation, it is more instructive to associate Equation (99) with Equation (112), and view these changes together as a classical canonical transformation leading to modified particle and field canonical momenta. This is because we know from general arguments that the modification of the field potentials leading to the Lagrangian (53) becomes a finite canonical transformation of the Hamiltonian based on the integral

F [0]

. According to Equation (59) it may be expressed as a line integral over the 1-form

d ω

with the vector potential in the Coulomb gauge (

f = 0

here).

The differential 1-form

d ω

may be taken as the generator of an infinitesimal canonical transformation of a phase-space variable

Ω

, according to the usual rule

Ω \to Ω^{'} = Ω + d Ω,

(114)

with

d Ω

determined by the P.B. (or Dirac-bracket as required)

d Ω = e {Ω, d ω} .

(115)

Composition of this continuous transformation along some path

C

ending at the particle with charge e, leads to a finite canonical transformation which may be expressed using

F [0]

; indeed if we define the Lie derivative operator

L_{F [0]}

by

L_{F [0]} Ω = {Ω, F [0]}

(116)

then

e^{L_{F [0]}} Ω

is the new phase-space function obtained by transforming

Ω

using the power series expansion of the exponential according to

\begin{matrix} \tilde{Ω} & = e^{L_{F [0]}} Ω \\ = Ω + {Ω, F [0]} + \frac{1}{2!} {{Ω, F [0]}, F [0]} + \dots . \end{matrix}

(117)

It is readily verified that the P.B. relations are preserved under such a transformation so it is canonical.

A simple illustration of the relationship (117) is afforded by the Gauss’s Law constraints,

G_{0}

and G defined by Equation (69). Recall that as weak equations the vector potential is not constrained by a gauge condition, and its conjugate acts as the functional derivative operator (66). Then the relationship

G = e^{L_{F [0]}} G_{0}

(118)

is easily established, using Equations (17) and (66) with Equation (117).

In the notation of Equation (104) the Coulomb gauge Hamiltonian is

H [0]

; the Hamiltonian

H [g^{⊥}]

for an arbitrary

g^{⊥}

displayed in Equations (105)–(107) is obtained by setting

Ω = H [0],

(119)

in Equation (117). In this case the ⋯ in the series (117) are zero since with this choice of

Ω

the second order term is purely a function of the ‘position’ variables for the particles and field, and so has vanishing P.B. with the generator

F [0]

.

In the Coulomb gauge we have

g^{⊥} = 0

and the interaction operator (107) reduces to the familiar form

H {[0]}_{int} = - \sum_{n} \frac{e_{n}}{m_{n}} p_{n} \cdot A (x_{n}) + \sum_{n} \frac{e_{n}^{2}}{2 m_{n}} A (x_{n}) \cdot A (x_{n}) .

(120)

This defines the Coulomb gauge Hamiltonian from Equation (104). The field variables in

H_{rad}

are purely transverse and describe radiation; they have Fourier expansions in terms of running waves [22] in a box of volume

Ω

,

\begin{matrix} A (x) & = \sqrt{\frac{1}{2 ϵ_{0} Ω c}} \sum_{k, μ} \frac{\hat{ϵ} {(k)}_{μ}}{\sqrt{k}} (a_{k, μ} e^{i k \cdot x} + a_{k, μ}^{*} e^{- i k \cdot x}), \end{matrix}

(121)

\begin{matrix} π (x) & = - i \sqrt{\frac{ϵ_{0} c}{2 Ω}} \sum_{k, μ} \sqrt{k} \hat{ϵ} {(k)}_{μ} (a_{k, μ} e^{i k \cdot x} - a_{k, μ}^{*} e^{- i k \cdot x}) . \end{matrix}

(122)

The {

\hat{ϵ} {(k)}_{μ}, μ = 1, 2

} are the usual rectangular polarisation unit vectors, orthogonal to the wavevector

k

. The Fourier coefficient

a_{k, μ}

and its complex conjugate are not a canonical pair; they have a non-zero P.B.

{a_{k, μ}, a_{k, ν}^{*}} = - i δ_{μ ν} δ_{k, k^{'}} .

(123)

They can be related to canonically conjugate oscillator variables (

X {(k)}_{μ}, P {(k)}_{μ}

) for the field modes through the relation

a_{k, μ} = \sqrt{\frac{ω}{2}} X {(k)}_{μ} + i \sqrt{\frac{1}{2 ω}} P {(k)}_{μ}

(124)

where

ω = k c

. The P.B.s of the particle variables with the Fourier coefficients are zero.

A gauge-invariant theory guarantees charge conservation and at non-relativistic energies there are no physical processes that can modify the value of the charge e; this is true in both classical and quantum theories. The charge parameter

e_{n}

would therefore be expected to be the experimentally observed charge of a particle n. The situation with the mass parameter for a particle is quite different since the conventional Lagrangian includes a charge-field interaction that leads to an arbitrary ‘electromagnetic mass’ additional to the ‘mechanical mass’ m; this is the problem of self-interaction. It is possible for the electromagnetic mass due to self-interaction to become arbitrarily large and this requires m to be negative so that the observed mass = mechanical mass + electromagnetic mass has its observed (positive) value. This pathology certainly occurs in the point charge limit, and is the origin of so-called ‘runaway’ solutions to the classical equations of motion for a point charged particle interacting with its own electromagnetic field; it reappears in a different form in the quantum theory [22,23,24,40].

6. Quantisation

The closed curve

P

in Equation (93) can be taken to be a closed trajectory of a charged particle in an electromagnetic field; such an integral was considered in the years covering the transition from the Old Quantum Theory to Quantum Mechanics with a motivation that came from a quite different area of theoretical physics due to Weyl [41]. Schrödinger observed that it could be fitted in with the Bohr-Sommerfeld quantisation rule

\oint p \cdot d x = N h

(125)

and considered [42] several elementary situations involving a charge in an electromagnetic field based on the quantisation of a modified action integral

\oint (p - e A (x)) \cdot d x \equiv \oint {\bar{p}}_{n} \cdot d x .

(126)

The quantisation of action integrals was interpreted as part of a ‘particle’ picture of subatomic processes; in the late version of the Old Quantum Theory a corresponding ‘wave’ picture could be accessed through the de Broglie wavefunction associated with the particle. Thereby a phase is attached to the particle that is determined by its action integrals; this step was taken by London [43] and it was eventually recognised that the line integral of the field potential played a fundamental role in the quantum mechanics of charged particles in electromagnetic fields through a modification of the phase of the Schrödinger wavefunction. Thus was born modern gauge theory.

The quantisation of the classical Hamiltonian formalism for electrodynamics described in Section 5 follows from the application of the usual canonical quantisation rules to the particle and field variables which become operators subject to a non-commutative algebra defined by the commutation relations. The resulting formal quantum theory is then a Heisenberg representation in which the state vectors are fixed in time, and the operators carry the time dependence encoded in the operator forms of the equations of motion. For practical calculations it is customary to transform to the Schrödinger representation in which the Hamiltonian is time independent and the states vary in time according to

i ℏ \frac{d}{dt} | Ψ 〉 = H | Ψ 〉 .

(127)

Formal quantisation can be approached in two different ways, depending on how the Dirac constraint for the interacting system of charges and field (64), is dealt with in the quantum theory.

1.: Classical variables such as $π$ and $j_{0}$ are reinterpreted as Hilbert space operators $π$ and $j_{0}$ respectively, and no gauge condition is imposed. The vector potential operator then has a longitudinal degree of freedom in addition to the two transverse degrees of freedom that describe polarised photons; similarly its conjugate $π$ also has three degrees of freedom. Since the commutation relations fix the Hilbert space of states, the Hilbert space will be ‘too large’ and at the outset the calculations will involve the extra degrees of freedom; an extra condition on the state space is thus required to pick out the physically significant states

$(\nabla \cdot π + j_{0}) | Ψ 〉 = 0 .$

(128)

Their mutual commutator remains canonical

$[a {(x)}^{r}, π {(x^{'})}_{s}] = i ℏ δ_{r s} δ^{3} (x - x^{'})$

(129)

and so the vector potential operator’s canonical conjugate, $π$ , may be realised as a functional derivative

$π = - i ℏ \frac{δ}{δ a} .$

(130)

The Hamiltonian operator is given by the canonical quantisation of Equation (63).
2.: The canonical Poisson-brackets are redefined as Dirac-brackets by the imposition of a gauge condition for the vector potential so that $Ω = 0$ is valid as an ordinary equation (one of the Maxwell equations). The reduced Hamiltonian and the Dirac-brackets given by Equations (78)–(85) are then reinterpreted as operator relations on a Hilbert space. Two such choices, corresponding to taking $P$ as either the purely longitudinal form (96) (the Coulomb gauge) or the line integral form (22), and the multipolar expansion associated with it (the ‘multipolar’ or Poincaré gauge), have been widely used in practical calculations [15,44,45,46].

In the first method we can define the generator,

G

, of a unitary gauge transformation operator as the quantised form of the classical variable G (69), when charges are present. The resulting unitary operator is

\begin{matrix} U_{j_{0}} [f] & = exp (- i G / ℏ) \\ = exp (- \frac{i}{ℏ} \int_{ℜ^{3}} f (x) (\nabla \cdot π (x) + j_{0} (x)) d x) . \end{matrix}

(131)

In a representation which is diagonal in the particle coordinates and the vector potential, a state

| Ψ 〉

is a wavefunctional

〈 {x_{n}}, a (x) | Ψ 〉 = Ψ ({x_{n}}, [a (x)])

(132)

and under the action of

U_{j_{0}}

Ψ \to Ψ^{'} = U_{j_{0}} Ψ

(133)

where

\begin{matrix} Ψ^{'} & = exp (- \frac{i}{ℏ} \sum_{n} e_{n} f (x_{n})) Ψ ({x_{n}}, [a (x) - \nabla f (x)]) \\ = exp (- \frac{i}{ℏ} \sum_{n} e_{n} f (x_{n})) Ψ ({x_{n}}, [a {(x)}^{'}]) \end{matrix}

(134)

that is, in the transformed representation the state is a functional of the translated vector potential and acquires a phase determined by the charges present. Provided that charge is conserved there is unrestricted validity for the quantum-mechanical superposition principle because the phase factor is the same for all possible states. Conversely one cannot have superpositions of states associated with different charge densities since their relative phases could be changed by a gauge transformation; this is the quantum-mechanical formulation of the principle of charge conservation in terms of the states of the system.

The states {

Φ

} of the free-field transform in the same way under the operator

G_{0}

obtained from Equation (131) with

j_{0} = 0

, except that there is no phase factor. By analogy with the classical canonical transformation (118) we assume that

G_{0}

and

G

are unitary equivalent, that is,

G = W G_{0} W^{- 1}, Ψ = W Φ .

(135)

The operator

W

is easily found by direct computation to be

W = W [P] = exp \{\frac{i}{ℏ} \int_{ℜ^{3}} P \cdot a d x\} .

(136)

This means that if

Φ [a]

is a physical state of the free field, then

Ψ [a]

with

W

given by Equation (136) is a physical state of the interacting system, by Equation (128).

In quantum electrodynamics the same unitary transformation applied to the Hamiltonian corresponds to Equation (117). The original idea of Power and Zienau [10] was that the electric polarisation field,

P

(20), could be regarded as a source representation of the whole atom (or molecule) and it was this quantity that should be coupled to the radiation field rather than the conventional interaction through individual charged particles. They proposed a unitary transformation of the usual gauge-fixed atom/molecule-electromagnetic field Hamiltonian

H [0]

in the Coulomb gauge with an operator

U

,

U = e^{i F / ℏ}

(137)

where

F = \int_{ℜ^{3}} P (x) \cdot A (x) d x .

(138)

Here

A (x)

is the Coulomb gauge vector potential operator for the field, and

P

is the electric polarisation field operator.

Equation (138) will be recognised as the fully quantised form of the classical generator F discussed in Section 5 with the vector potential

a

chosen specifically in the Coulomb gauge. The (formally) unitary transformation equations corresponding to the classical canonical transformation (117) are

\begin{matrix} H_{U} & = U H [0] U^{- 1} \\ Ψ_{U} & = U Ψ . \end{matrix}

(139)

This transformation is known as the Power-Zienau-Woolley (PZW) transformation in non-relativistic quantum electrodynamics [44,45,47,48]. The result of the transformation,

H

, is precisely what one would have obtained if the classical Hamiltonian

H [g]

were reinterpreted as a quantum operator with the replacements

x_{n} \to x_{n}, p_{n} \to p_{n}, a (x) \to a (x), {\tilde{E}}^{⊥} \to {\tilde{E}}^{⊥}

(140)

It is conventional to follow the second approach to quantisation since it is obviously easier to formulate calculations; introducing the polarisation fields using Equation (100) the general non-relativistic quantum Hamiltonian for electrodynamics obtained from the canonical quantisation of Equation (104), may be written for a closed system of

N \geq 1

spinless charges in a radiation field (

E^{⊥}, B

) as,

\begin{matrix} H_{P} = & \sum_{n = 1}^{N} \frac{| p_{n} |^{2}}{2 m_{n}} + \frac{1}{2} ϵ_{0} \int (| {\tilde{E}}^{⊥} |^{2} + c^{2} {| B |}^{2}) d^{3} x \\ - & \int P \cdot {\tilde{E}}^{⊥} d^{3} x - \int M \cdot B d^{3} x + \int \int X : B B d^{3} x d^{3} x^{'} \\ + & \frac{1}{2 ϵ_{0}} \int P \cdot P d^{3} x . \end{matrix}

(141)

This is the general form of the PZW Hamiltonian [6,16].

In Equation (141), the first term accounts for the total kinetic energy for N free charges, and the second term is the usual Hamiltonian for free radiation. The next three terms couple the charges to the radiation, while the last term has no dependence on the field nor on the particles’ motion; it is of a purely static nature. One must keep in mind the involvement of the charges in the transverse electric field described by Equation (99). Using Equation (110) with Equation (107) leads directly to the explicit forms for

M

and

X

.

M

is a magnetisation density linear in the charge e that involves the particles’ position and momentum variables, and

X

is a generalised diamagnetic susceptibility tensor that is proportional to

e^{2}

. Their particular forms depend on the choice made for the electric polarisation field

P

which is also linear in the charge e.

In quantum mechanics, systems with a finite number of degrees of freedom, the Stone-von Neumann theorem guarantees that there is essentially a unique Hilbert space and that different representations of the conjugate operators (

p_{n}

,

q_{n}

) are related by unitary transformation. Thus the physical interpretation is also unique. For a quantised field with an infinite number of degrees of freedom this is no longer true and different representations are generally unitary inequivalent and lead to different physical pictures. The Hilbert space for

H

is a priori unknown; we have to choose. On the grounds of simplicity and experience the usual choice is the free electromagnetic field’s Fock space.

The Hamiltonian scheme is completed by giving the equal-time commutators of the dynamical variables, which for QED are

\begin{matrix} [\tilde{E} {(x)}^{r}, \tilde{E} {(x^{'})}^{s}] = & [B {(x)}^{r}, B {(x^{'})}^{s}] = 0, \end{matrix}

(142)

\begin{matrix} [q_{m}^{r}, p_{n}^{s}] = & i ℏ δ_{m n} δ_{r s}, \end{matrix}

(143)

\begin{matrix} [\tilde{E} {(x)}^{r}, B {(x^{'})}^{s}] = & i ℏ ϵ_{0}^{- 1} ϵ_{r s t} \nabla_{x^{'}}^{t} δ^{3} (x - x^{'}) . \end{matrix}

(144)

Routine calculation yields the equations of motion as the Maxwell equations for the fields associated with the polarisation fields (

P, M

), and the Lorentz force law for the particle motion in the fields (

E, B

) [6]. Of course these must be solved in a self-consistent manner for the closed system, and one learns from the conventional calculations that both classical and quantum formulations lead to infinite quantities, which physically is a nonsense. We explore in the next section some ideas about the origin of the infinities which can be traced to invalid assumptions in the calculations, and what might be done to ameliorate them.

7. The Hamiltonian

In the usual development of non-relativistic QED in atomic, molecular and optical physics, the transverse electromagnetic field variables

A

(the Coulomb gauge vector potential), and the fields

E^{⊥}, B

are represented as Fourier series derived from the standing waves in a ’box’ of finite volume

Ω

(the usual quantisation of Equations (121) and (122)). On passing to the continuum limit these quantities satisfy the bracket relation (144). As operator valued quantities the transverse electric field operator for example is given the Fourier expansion [7,34]

E {(x)}^{⊥} = \int (\hat{E} (k) e^{i k \cdot x} + h . c .) d^{3} k,

(145)

where

\hat{E} (k) = i \sqrt{\frac{ℏ k c}{{(2 π)}^{3} 2 ϵ_{0}}} c (k) .

(146)

The vector

c (k)

satisfies

k \cdot c = 0

and so can be expressed in terms of components with respect to the usual polarisation unit vectors {

\hat{ϵ} {(k)}_{μ}

}. The components are the familiar annihilation,

c {(k)}_{μ}

, and creation,

c {(k)}_{μ}^{+}

, operators for a photon with momentum

k

and polarisation

μ, (μ = 1, 2)

, with commutator

[c {(k)}_{μ}, c {(k^{'})}_{ν}^{+}] = δ (k - k^{'}) δ_{μ ν} .

(147)

There are similar expansions for the transverse vector potential

A (x) = \int (\hat{A} (k) e^{i k \cdot x} + h . c .) d^{3} k

(148)

where the Fourier coefficient is

\hat{A} (k) = \sqrt{\frac{ℏ}{{(2 π)}^{3} 2 ϵ_{0} k c}} c (k) .

(149)

and for the magnetic field operator

B

(

\hat{k} = k / | k |

)

B (x) = \int (\hat{B} (k) e^{i k \cdot x} + h . c .) d^{3} k

(150)

with coefficient

\hat{B} (k) = i \sqrt{\frac{ℏ k}{{(2 π)}^{3} 2 ϵ_{0} c}} \hat{k} \land c (k) .

(151)

In terms of the photon operators the free field Hamiltonian is [34]

H_{0} = \sum_{μ} \int ℏ c k (c {(k)}_{μ}^{+} c {(k)}_{μ} + \frac{1}{2} δ^{3} (0)) d^{3} k

(152)

so that even in the vacuum state the field energy is infinite,

E_{0} = \frac{1}{2} ℏ c δ^{3} (0) \int k d^{3} k .

(153)

The delta function arises from the integration over all space (

R^{3})

, and the momentum integral diverges for large k. Although the practical response is simply to wave away the offending infinite contribution, this does not really dispose of the underlying reasons for its occurrence, which manifest themselves again when interactions are introduced.

To see this we make a simple innocent calculation modelled on quantum mechanics. For simplicity we consider a single charge

e, m

with canonical operators

x, p

. The full Hamiltonian in the Coulomb gauge is then

\begin{matrix} H = & \frac{{| p |}^{2}}{2 m} + \int ℏ k c c {(k)}^{+} \cdot c (k) d^{3} k - \frac{e}{m} p \cdot A (x) \\ + & \frac{e^{2}}{2 m} A (x) \cdot A (x), \end{matrix}

(154)

where the zero point energy of the free-field has been dropped in the conventional way. This Hamiltonian is easily extended to the many-particle case with appropriate sums over the particle variables and the inclusion of the contribution of Equation (102).

In quantum mechanics the Hamiltonian,

H

, is taken to be a self-adjoint operator on a Hilbert space,

H

. An orthonormal basis for the space can be constructed from a three-term recurrence relation generated from a specified initial state

| u_{0} 〉

in the space,

H | u_{n} 〉 = a_{n} | u_{n} 〉 + b_{n + 1} | u_{n + 1} 〉 + b_{n - 1} | u_{n - 1} 〉

(155)

with starting coefficients,

\begin{matrix} a_{0} = & 〈 u_{0} | H | u_{0} 〉, 〈 u_{0} | u_{0} 〉 = 1, \\ b_{1} = & 〈 u_{1} | H | u_{0} 〉, b_{- 1} = 0, b_{0} = 1, \\ | u_{1} 〉 = & b_{1}^{- 1} (H - a_{0}) | u_{0} 〉 . \end{matrix}

(156)

Imposing the condition

〈 u_{1} | u_{1} 〉 = 1

leads to

b_{1}^{2} = 〈 u_{0} | H^{2} | u_{0} 〉 - a_{0}^{2}

(157)

that is, the first off-diagonal coefficient is the variance of the Hamiltonian in the initial state. In the basis {

| u_{i} 〉

},

H

becomes a symmetric tri-diagonal matrix T, whose diagonal elements are the {

a_{n}

}, and the sub-diagonals are populated by the {

b_{i}

}; this is the Recursion Method [49,50,51].

The construction (155) was discussed briefly in ref. [6] for the case where

| u_{0} 〉

is chosen as a product of a normalised state of the particle,

| N 〉

, and the photon Fock space vacuum,

| Ψ_{0} 〉

,

| u_{0} 〉 = | N, Ψ_{0} 〉, | u_{0} 〉 \in H .

(158)

The normalisation requirement for the particle state

| N 〉

is satisfied minimally by a wave-packet

ϕ {(x)}_{N} = \int f {(k)}_{N} e^{i k \cdot x} d^{3} k

(159)

for any square integrable function

f {(k)}_{N}

. Since the recurrence relation introduces arbitrarily high powers, n, of the Hamiltonian there will be matrix elements of

{(\frac{{| p |}^{2}}{2 m})}^{n}

to evaluate, so

f {(k)}_{N}

must tend to zero faster than any polynomial to ensure finite matrix elements. Straightforward calculations show that both

a_{0}

and

b_{1}

contain infinities because of the vector potential. The free-field Hamiltonian and the

p \cdot A (x)

interaction make no contribution to

a_{0}

and one is left with

a_{0} = {(\frac{{| p |}^{2}}{2 m})}_{N} + \frac{α}{π} (\frac{ℏ^{2}}{m}) \int_{0}^{\infty} k d k,

(160)

where the second term comes from

A (x) \cdot A (x)

and

α

is the fine structure constant. This term also contributes to

b_{1}

at order

α^{2}

, and additionally the squaring of

p \cdot A (x)

is similarly divergent in the continuum limit,

\begin{matrix} 〈 Ψ_{0}, N | {(\frac{e}{m})}^{2} {(p \cdot A (x))}^{2} | N, Ψ_{0} 〉 \\ = (\frac{8 α}{3 π}) (\frac{ℏ^{2}}{m}) {(\frac{{| p |}^{2}}{2 m})}_{N} \int_{0}^{\infty} k d k . \end{matrix}

(161)

Thus the recurrence breaks down at the first step and the full Hamiltonian including interactions,

H

, as commonly understood is shown to be not a well defined operator on the usual choice of Hilbert (Fock) space.

The difficulties in perturbation theory are of two different sorts. Firstly, the involvement of intermediate states with virtual photons of unrestricted momentum is allowed, and hence there are energies far beyond the regime of validity of the non-relativistic theory. These are the ‘ultraviolet’ divergences dealt with, for example, by a maximum momentum cut-off so as to suppress their contributions. Secondly, charged particles in the field can be associated with an arbitrarily large number of virtual photons with energy close to zero and an infrared cut-off must be imposed.

With the full apparatus of covariant QED and an invariant method of calculation (for example, Feynman diagrams) one can extract finite values for particular observable quantities. When that is done for interacting electrons and photons the agreement with experiment is remarkable, perhaps the most accurate quantities that can be calculated in physics [1]. Nevertheless the occurrence of infinities is an ugly feature which hints at underlying problems in the formalism of QED. Furthermore there are important questions in QED which cannot be answered using a perturbation expansion, for example, the demonstration of the existence of a ground state for interacting charges and field required for an explanation of the stability of bulk matter in the presence of the field, and the nature of the excitations. Such questions cannot even be formulated in the Lorentz invariant formalism, and in any case require analytical techniques that are not based on perturbation methods. The use of a cut-off is a realisation of the notion that high momentum (high energy) states must be eliminated in order to construct an ‘effective’ theory that is adequate for the low-energy physics of interest. This can be better achieved with the systematic use of Feshbach projection (also known as Löwdin’s partitioning technique) which can reduce the problem to an investigation of only a limited portion of the energy spectrum. Over the past several decades a mathematical approach to non-relativistic QED has been developed using the techniques of modern functional analysis; there is now a considerable research literature, and several monographs available too [7,52,53].

The use of the Coulomb gauge condition is the normal choice in the mathematical literature, though as we will see the PZW transformation makes an appearance. The full Hamiltonian for charged particles interacting with the quantised electromagnetic field can be written in the form

H_{λ} = H_{0} + λ H_{1} + λ^{2} H_{2}

(162)

where the coupling constant

λ

is proportional to the fundamental charge e. Here

H_{0}

is the same as the unperturbed Hamiltonian used in the perturbation theory approach, that is, the sum of the first two terms and the last term in Equation (141). The terms in Equation (162) involving

λ

and

λ^{2}

are, respectively the familiar

p \cdot A

and

{| A |}^{2}

terms in this gauge, the quantised version of Equation (120). The nuclei are treated as spin-zero particles, while the electrons are properly regarded as spin

\frac{1}{2}

fermions with the ‘semi-relativistic’ Pauli interaction for the electrons sometimes included in the interaction Hamiltonian; it is of order

λ

. Importantly if this is done, the operator

B

in it is the quantised field operator. Thus the terms in Equation (162) are explicitly

\begin{matrix} H_{0} & = \sum_{i} \frac{| p_{i} |^{2}}{2 m_{i}} + \sum_{i, j} \frac{e_{i} e_{j}}{4 π ϵ_{0} | x_{i} - x_{j} |} + \sum_{k, μ} ℏ ω c_{k, μ}^{+} c_{k, μ}, \\ λ H_{1} & = \sum_{i} \frac{e_{i}}{m_{i}} p_{i} \cdot A (x_{i}) - \frac{e ℏ}{2 m_{e}} \sum_{τ} σ_{τ} \cdot B (x_{τ}), \\ λ^{2} H_{2} & = \sum_{i} \frac{e_{i}^{2}}{m_{i}} A (x_{i}) \cdot A (x_{i}) \end{matrix}

(163)

where the

i, j

sums are over electrons and nuclei, while those over

τ

are restricted to the electrons; the

σ_{τ}

are the usual Pauli matrices. We suppose there are N electrons (so

τ = 1, \dots N

) and M nuclei with charges

e Z_{M}

; the total charge is then

Q = e (- N + \sum_{n = 1}^{M} Z_{n}) .

(164)

The total linear momentum of the system of charges and photons is

P = \sum_{i} p_{i} + ℏ \sum_{k, μ} k c_{k, μ}^{+} c_{k, μ} .

(165)

It commutes with

H_{λ}

, which is an expression of the translation invariance of the whole system. If

H (P)

is the Hamiltonian at fixed total momentum

P

, the full Hamiltonian may be written as a direct integral (this is the conventional symbol for the total momentum; it must not be confused with the electric polarisation field which is not involved in this discussion),

H_{λ} = \int_{ℜ^{3}}^{\oplus} H (P) d P .

(166)

It is then sufficient to analyse the properties of

H (P)

for some fixed

P

, and in particular it is essential to establish whether

H (P)

has an eigenvalue (that is a bound state) at the bottom of its spectrum. The obvious physical interpretation of such a state is a stable atom/molecule dressed with a cloud of photons in motion [54,55].

When the radiation field is involved one may reasonably surmise that the overall charge Q of the particles will be a crucial parameter to be considered, not least because of the infrared singularity for a charged particle in QED, and the observation that if not up close, a charged molecule looks much like a charged ‘particle’, and the (spatial) far-field is related to the

k \to 0

limit of the modes. If

Q = 0

, the photons see an electrically neutral charge distribution and the resulting vector potential (which determines the fields) decays faster than

1 / | x |

which can be accommodated in the Fock space description. Then there is no infrared divergence, and a stable ground state is found for some range of values [54] of

| P | \geq 0

. The situation is more delicate if

Q \neq 0

; classically the radiation field reduces to the free field if the ion is at rest (being at rest means

{\dot{R}}_{cm} = 0

where

{\dot{R}}_{cm}

is the velocity of the ion’s centre-of-mass (cm)). In the quantum theory account the equivalent condition is expressed in terms of an expectation value of the momentum being zero. Otherwise for

| P | \neq 0

and

Q \neq 0

there is no ground state unless an infrared cut-off is applied. When electrons and nuclei interact through purely the Coulombic part of the electromagnetic field, the specification of the total momentum is not required since it is physically reasonable that neutral and positively charged species are much more likely to be stable than ones with an excess of electrons.

In order to make the vector potential a well defined operator in the Fock space of the free field, its mode expansion must be modified by the inclusion of an ultraviolet cut-off in the Fourier expansion; thus we write

A {(x)}_{𝒳} = \sqrt{\frac{ℏ}{2 ϵ_{0} Ω c}} \sum_{k, μ} 𝒳 (k) \frac{\hat{ϵ} {(k)}_{μ}}{\sqrt{k}} (c_{k, μ} e^{i k \cdot x} + c_{k, μ}^{+} e^{- i k \cdot x}) .

(167)

The precise form of

𝒳 (k)

is often unimportant but typical examples are:

\begin{matrix} 𝒳 (k) & \leq \{\begin{matrix} 1 for k near 0, \\ {(\frac{k}{κ})}^{- 3} for large k, \end{matrix} \\ 𝒳 (k) & = exp (- k^{2} / κ^{2}), \\ 𝒳 (k) & = \{\begin{matrix} 0 if k \geq κ \\ 1 if k < κ, \end{matrix} \end{matrix}

(168)

where

α^{2} ≪ κ λ_{c} ≪ 1

. They define a non-relativistic regime where such effects as pair production and polarisation of the vacuum, which result in charge renormalisation in standard QED, cannot occur, while giving an energy,

ℏ c κ

much greater than the typical ionisation energies of the atomic system [56].

The electrons are treated in a fully quantum mechanical way (as fermions) using 2-component wavefunctions; in early work the nuclei were regarded as fixed classical sources of a Coulomb field. This is unimportant in atoms since one can reinterpret the origin (the nucleus) as the true centre-of-mass and bring in the reduced mass of the electron without losing any symmetries of the atomic states. For molecules however this would be a highly non-trivial assumption since nuclear permutation symmetry is a feature of the generic molecule if the nuclei are quantum mechanical particles. However in more recent work, attention has changed to moving atoms and ions so that the nuclei are treated as quantum particles. This is important since, as noted above, a distinction between neutral and charged species becomes apparent. The earliest investigations required much smaller values for the coupling constant

λ

than the actual physical values for electrons and nuclei determined by the fine structure constant [57], but many of these restrictions have been removed in later calculations, for example [58]. The systematic analysis of the consequences of the quantum mechanical Hamiltonian (162) can be traced back at least as far as a pioneering investigation by Pauli and Fierz [59]; in the mathematical literature

H_{λ}

is commonly known as ‘the Pauli-Fierz Hamiltonian’.

Even with the restriction of Equation (167) to the non-relativistic regime there is still the problem of its behaviour as

| k | \to 0

which gives rise to the infrared divergence problem for a charged particle interacting with the quantised electromagnetic field. For the neutral atom/molecule this may be ameliorated by making a unitary transformation of

H_{λ}

with a generator used by Pauli and Fierz [57,58]; in the mathematical physics literature this transformation commonly bears their name. In atomic/molecular physics it is known as the electric dipole approximation to the PZW transformation (Section 6) but with the Coulomb gauge vector potential replaced by the operator form including the cut-off

𝒳 (k)

, and its spatial variation suppressed so there is no magnetic field,

Λ = e^{- i F / ℏ}, F = d \cdot A {(0)}_{𝒳} .

(169)

The cost of making such a transformation is an interaction term,

- d \cdot E^{⊥} {(0)}_{𝒳}

, that increases as

| d | \to \infty

. The Combes dilatation transformation [60] of both the particle coordinates and the photon momenta, described below, acts sufficiently to control this growth. Alternatively, one can argue that since one is interested in bound states in which the charges are exponentially localised this is sufficient to bound the dipole contribution. More recently a ‘generalised’ Pauli-Fierz transformation has been describe [56]; for charges {

e_{n}

} with position operators {

x_{n}

} this involves the following quantity as the generator to be used in Equation (169) in place of the dipole approximation

\bar{F} = \sum_{n} e_{n} \sum_{k, μ} \frac{1}{\sqrt{k}} (f {(x_{n})}_{k, μ}^{*} c_{k, μ} + f {(x_{n})}_{k, μ} c_{k, μ}^{+}),

(170)

where

\begin{matrix} f {(x_{n})}_{k, μ} = & \sqrt{\frac{ℏ}{2 ϵ_{0} Ω k c}} e^{- i k \cdot x_{n}} 𝒳_{a} (k) φ (\sqrt{k} \hat{ϵ} {(k)}_{μ} \cdot x_{n}), \\ φ^{'} (0) = & 1 . \end{matrix}

(171)

A simple form for

φ

which controls the long distance behaviour is

φ (r) = r

if

| r | \leq 1 / 2

and

| φ | = 1

if

| r | \geq 1

[61]. This reduces to Equation (169) if

φ

is taken as a linear function, and the exponential factors are neglected as required for the dipole approximation. As usual we write

{\bar{H}}_{λ} = e^{- i \bar{F}} H_{λ} e^{+ i \bar{F}}

(172)

and this is evaluated by expanding the exponentials; as with the PZW transformation,

\bar{F}

commutes with the field and particle ‘position’ variables, and produces new terms from the particle and field ‘momenta’.

It is useful to keep in mind a qualitative description of the spectrum of the QED Hamiltonian beginning with the reference Hamiltonian

H_{0}

. The spectrum of the particle Hamiltonian,

H_{charges}

, is described by the HVZ theorem [62,63,64]; there is a continuum corresponding to the half-axis

[Σ, \infty)

for some

Σ \leq 0

, and isolated discrete energy levels

E_{0}, E_{1}, \dots E_{i}

below the continuum, that is,

E_{0} \leq E_{1} \leq, \dots, < Σ

. The spectrum of the free electromagnetic field Hamiltonian (the zero-point energy has been dropped for this review) consists of a simple eigenvalue at 0, corresponding to the vacuum state,

Ψ_{0}

, and absolutely continuous spectrum on the half-axis

[0, \infty)

. The ‘eigenstates’ of

H_{0}

corresponding to the eigenvalues

{E_{i}}

are simple products of these independent states; what happens to them in the presence of interactions is of course a significant question in the quantum theory of radiation [57]. These facts about the unperturbed spectra of the particles and field mean that the reference Hamiltonian,

H_{0}

, has the same discrete spectrum as

H_{charges}

, that is {

E_{i}

}, and a continuous spectrum covering the half-axis

[E_{0}, \infty)

consisting of a union of branches

[E_{i}, \infty)

starting at the energy levels

E_{i}

and the branch

[Σ, \infty)

. Thus all the discrete energy levels of the atomic system including

E_{0}

become thresholds of continuous spectra; they are said to be ‘embedded’ eigenvalues. This is the mathematical reason for the difficulties in perturbation theory; non-relativistic QED, which is focused on the behaviour of the discrete states of atoms and molecules in the presence of electromagnetic radiation, requires the perturbation theory of continuous spectra.

The spectrum of the full Hamiltonian,

{\bar{H}}_{λ}

, is most usefully defined in terms of matrix elements of its resolvent; the discrete and continuous spectra are the poles and cuts respectively of

〈 ϕ | \frac{1}{{\bar{H}}_{λ} - z} | ψ 〉 for all ϕ, ψ \in H

(173)

where z is a complex variable. The structure of the resolvent can be exposed by using the idea of dilatation (or complex coordinate rotation [65]) transformations. Consider the family of transformed Hamiltonians defined by [56]

{\bar{H}}_{λ} (θ) = U (θ) {\bar{H}}_{λ} U {(θ)}^{- 1},

(174)

where

θ

is a real parameter, and

U (θ)

is chosen to transform the particle positions and photon momenta as

\begin{matrix} x_{n} & \to e^{θ} x_{n}, n = 1, \dots N \\ k & \to e^{- θ} k . \end{matrix}

(175)

The transformed Hamiltonian,

{\bar{H}}_{λ} (θ)

has an analytic continuation in the variable

θ

in a disc

D (0, θ)

about

θ = 0

in the complex

θ

plane. If

Φ_{θ} = U (θ) Φ

then for

z \in C^{+}

(that is

ℑ z > 0

)

〈 Φ | R (z) | Φ 〉 = {〈 Φ | U (θ)}^{- 1} R (θ, z) U (θ) | Φ 〉 = 〈 Φ_{θ} | R (θ) | Φ_{θ} 〉 .

(176)

The quantity

F (θ, z) = 〈 Φ (\bar{θ} | R (θ, z) | Φ (θ) 〉

(177)

has an analytic continuation [60] into a neighbourhood of

θ = 0

, and subject to certain technical requirements the same is true for the untransformed resolvent (the LHS of Equation (176)).

The real eigenvalues of

\bar{H} (θ)

give real poles of the RHS of Equation (176), and so they are the real eigenvalues of

{\bar{H}}_{λ}

; the point of the complex coordinate rotation transformation however is that it reveals new structure in the RHS of Equation (176), namely complex eigenvalues for

ℑ θ > 0

. These are the poles of the meromorphic continuation of the LHS of Equation (176) across the essential spectrum of

{\bar{H}}_{λ}

onto the second Riemann sheet, that is, into the lower complex half-plane. They are interpreted as the resonances of

{\bar{H}}_{λ}

and since the transformation (172) is unitary these are also the resonances of the original Hamiltonian,

H_{λ}

. Every eigenvalue of

H_{0}

apart from the ground state behaves in this way; thus all excited stationary states of the free atomic/molecular system become metastable states because of the interaction with the quantised electromagnetic field. Every resonance of the transformed QED Hamiltonian is attached to a branch of the essential spectrum; this occurs because the photon has zero mass, and leads to the problem of infrared divergences. In the Coulomb Hamiltonian case, that is, in the absence of radiation, the resonances are isolated. These spectral features are illustrated in Figure 1.

An essential idea used in the characterisation of the spectrum of

{\bar{H}}_{λ}

is to concentrate on a limited energy range. A similar idea is used in the mathematical analysis of the Born-Oppenheimer approximation (Section 9); it is based on the systematic reduction of the degrees of freedom of the Hilbert space to the energy range of interest using projection operator techniques. This is the precise formulation of the idea of using cut-offs to eliminate unwanted degrees of freedom. We sketch some of the ideas below; the formidable technical details can be found in the original literature references. The formal setting, which is essentially that familiar from Löwdin’s partitioning technique [66], is as follows [57]; suppose we have an Hamiltonian

H

acting on a Hilbert space

H

. A pair of projection operators

P

and

Q

are defined such that

P^{2} = P, P + Q = I

(178)

and used to construct the projected Hamiltonians

H_{P} = P H P, H_{Q} = Q H Q

(179)

which are operators on

P H

and

Q H

respectively. Now let

ρ (Ω)

denote the resolvent set of

Ω

, that is the set of complex numbers z such that

Ω - z I

has a bounded inverse. Then provided 0 lies in

ρ (H_{Q})

, the inverse operator

H_{Q}^{- 1}

exists on

Q H

, and is bounded.

A Feshbach map,

f_{P} (H)

, is defined on the reduced space

P H

by

f_{P} (H) = P H P - P H Q {(H_{Q})}^{- 1} Q H P

(180)

provided 0 belongs to

ρ (H_{Q})

. Then we have for example

1.: $ε$ belonging to the resolvent set $ρ (H)$ implies 0 belongs to $ρ (f_{P} (H - ε))$ .
2.: For an eigenstate $(E, Ψ)$ of $H$ we have

H Ψ = E Ψ ⟺ f_{P} (H - E) Φ, Φ = P Ψ .

(181)

The map is isospectral in the sense that it leads to an ‘effective’ operator,

f_{P}

, that in the energy range of interest has the same spectrum as the original operator.

We now focus on a particular discrete state of the unperturbed Hamiltonian for the charges with energy

E_{k}

obtained from the Schrödinger equation

H_{charges} | ϕ_{k} 〉 = E_{k} | ϕ_{k} 〉 .

(182)

and enquire about its fate in the presence of quantised radiation using a Feshbach map constructed as follows. We define a new field operator by the relation

ξ (H_{r a d} : ρ_{0}) = \sum_{μ} \int (c_{k, μ}^{+} c_{k, μ}) ℏ ω d^{3} k, ℏ ω < ρ_{0}

(183)

which describes photons with energies

< ρ_{0}

. The projection operator required for the Feshbach map is then

P = \sum_{m = 1}^{N} | ϕ_{k} 〉 〈 ϕ_{k} | \otimes ξ (H_{r a d} : ρ_{0})

(184)

which is combined with Equation (180) and the Hamiltonian

{\bar{H}}_{λ}

; the maximum photon energy

ρ_{0}

is related to the coupling constant

λ

. Iteration of Feshbach maps is like using a microscope to inspect tiny regions of the spectrum and gain ever finer information as the energy interval examined is reduced; in the limit of an infinite sequence of such maps one can in principle obtain the exact ground state of the Hamiltonian of interest and the precise location of its resonances [67,68].

The Feshbach map construction outlined above has a straightforward physical interpretation but suffers from a technical disadvantage for computation because the sharp cut-off in the photon energies implies that it is not differentiable. A significant improvement is apparent with the introduction of so-called ‘smooth’ Feshbach maps, which though lacking a straightforward interpretation in terms of a block diagonalisation of the Hamiltonian, have much nicer mathematical properties. In place of

P

and

Q

in Equation (178) one introduces a pair of operators

𝒳

and

\bar{𝒳}

with

𝒳^{2} + {\bar{𝒳}}^{2} = I

(185)

The Hamiltonian may be decomposed in the usual perturbation theory form

H = H_{0} + V

(186)

where

H_{0}

is independent of the coupling between the field and charges. One requires that

H_{0}

commutes with both

𝒳

and

\bar{𝒳}

and then the ‘smooth’ Feshbach map

F (H, H_{0})

can be defined in exactly the same way as in Equation (180) with

𝒳

and

\bar{𝒳}

replacing

P

and

Q

respectively [56,67,69,70].

The main results of detailed mathematical analysis of the iterated Feshbach map construction are summarised below. They refer to a neutral atomic or molecular system with linear momentum less than some critical value

{| P |}_{c}

[54,58]:

1.: There is a ground state of $H_{λ}$ derived from the ground state of $H_{0}$ ; it is exponentially localised in the coordinates of the charges. The existence of the ground state can be demonstrated for the physical coupling constant, $α$ .
2.: There are complex eigenvalues { $E {(λ)}_{k . m}, m = 1, \dots n_{k}$ } associated with each eigenvalue $E_{k}$ of $H_{0}$ with multiplicity $n_{k}$ ; they are independent of the angle $θ$ used in the dilatation described above. The energy $E_{0, 1}$ with the smallest real part is real, and is the ground state energy of $H_{λ}$ . Under certain further technical assumptions all the { $E {(λ)}_{k, m}, k \geq 1$ } for non-zero $λ$ can be shown to be complex quantities with negative imaginary parts, that is they are the complex resonance energies of ${\hat{H}}_{λ}$ .
3.: The radiative corrections are of the form

$E {(λ)}_{k, m} - E {(0)}_{k} \approx λ^{2} ϵ_{k, m} + {O (| λ |}^{2})$

(187)

where $ℜ ϵ_{k, m}$ is given by Bethe’s formula for the Lamb shift in the case of the hydrogen atom, and $ℑ ϵ_{k, m}$ is given by Fermi’s golden rule for the decay rate. This identification is valid when allowance is made for the effects of the ultraviolet cut-off $κ$ in Equation (168).

The demonstration that the non-relativistic QED Hamiltonian for a system of N electrons and M nuclei has a lowest energy eigenvalue

E_{0} > - \infty

confirms ‘stability of the first kind’. The demonstration of ‘stability’ of the second kind’ is also an important result since it is the guarantee that the energy of a n-body system is extensive, that is, proportional to the number of particles. What is required is the inequality

E_{0} > C (N + M)

(188)

for some constant

C \leq 0

that does not depend on N and M, but will generally involve the basic physical parameters of the system (mass, charge, Planck’s constant etc.);

C < 0

means that the binding energy per particle is positive.

To begin with consider the charges in the absence of radiation, that is the Coulomb Hamiltonian, which can be put in the form,

H^{C} = T_{N} + H_{0}

(189)

where

T_{N}

represents all the nuclear kinetic energy operators. Let

φ

be any electronic wavefunction with nuclear positions as parameters, and

Ψ

be any square integrable wavefunction of finite energy (The contribution of the overall centre-of-mass motion is assumed to be removed, otherwise the spectrum is purely continuous) for

H^{C}

. Lieb and Seiringer [7] show that

φ

satisfies a lower bound

〈 φ | H_{0} {| φ 〉}_{e l} \geq - C {〈 φ | φ 〉}_{e l}

(190)

where the expectation valuers involve solely integrations over the electronic variables. Since C does not depend on the nuclear variables we have

〈 Ψ | H_{0} {| Ψ 〉}_{e l} \geq - C {〈 Ψ | Ψ 〉}_{e l}

(191)

and so also

〈 Ψ | H_{0} | Ψ 〉 \geq - C 〈 Ψ | Ψ 〉

(192)

where both electronic and nuclear variables are integrated over. The nuclear kinetic energy operator

T_{N}

is non-negative and so

〈 Ψ | T_{N} + H_{0} | Ψ 〉 \geq - C 〈 Ψ | Ψ 〉

(193)

which proves stability of the second kind.

Now introduce the charge-radiation interactions; the Coulomb Hamiltonian is replaced by the Pauli Hamiltonian for properly fermionic electrons, so the

σ \cdot B

interaction is included, and

B

is the quantised magnetic field operator. The nuclei are initially considered to be fixed;

φ

will involve electronic variables, including spin, and the field variables are represented in the usual Fock space manner. With this new interpretation of

H_{0}

and the states we can essentially repeat the steps in the argument above for

H^{C}

, starting by replacing Equation (190) by

〈 φ | H_{0} {| φ 〉}_{e l, F} \geq - C^{'} {〈 φ | φ 〉}_{e l, F}

(194)

for some new constant

C^{'}

and the argument above can be continued with an appropriate state

Ψ

for the QED system. A similar lower bound, Theorem 11.1, is given in ref. [7].

If the spin-zero nuclei are taken as dynamical (quantum) particles then we must introduce a new state

Ψ

in place of

φ

to allow for the additional nuclear variables, and make the change:

T_{N} \to \frac{1}{2 M_{N}} {(P_{N} - e_{N} A (R_{N}))}^{2}

(195)

to include the nuclear kinetic momentum; the symbols stand for all the nuclei so a sum over them is understood. The contribution of Equation (195) to the expectation value is non-negative:

T_{N}

is as before, the term linear in the vector potential,

P_{N} \cdot A

, contributes nothing to the expectation value since

A

is off-diagonal in the Fock space number basis, while the

{| A |}^{2}

term is divergent and must be cut off at some maximum momentum, but anyway is non-negative. Thus one concludes that a lower bound on the energy satisfying (188) holds also in the case of electrons and spin-zero moving nuclei interacting with the quantised electromagnetic field at

T = 0

.

8. The S-Matrix and Gauge Invariance

Perturbation theory in the Dirac or interaction representation is based on the perturbation operator carrying the time dependence induced by the ‘unperturbed’ Hamiltonian,

H_{0}

,

V (t) = e^{i H_{0} t / ℏ} V e^{- i H_{0} t / ℏ},

(196)

and a time evolution operator,

U

, satisfying the operator differential equation [6]

\frac{\partial U (t)}{\partial t} = - \frac{i}{ℏ} V (t) U (t) .

(197)

We take the initial condition to be that the reference state of the unperturbed problem at initial time

t_{0}

,

Φ (t_{0})

is the same as the solution of the perturbed problem,

Ψ (t_{0})

. This initial condition is encoded in the integral equation

U (t, t_{0}) = 1 - \frac{i}{ℏ} \int_{t_{0}}^{t} V (τ) U (τ, t_{0}) d τ

(198)

which can be solved (at least formally) by iteration [34] yielding a perturbation series (the Dyson series) in powers of

V

.

The issue of gauge invariant calculation in the perturbation approach to QED can be formulated succinctly [71] by considering the dependence of the time development operator on the Green’s function inherited from the coupling terms in Equation (141). Consider two arbitrary gauges specified by

g^{1}

and

g^{2}

for which in general

U (t, t_{0}, [g^{1}]) \neq U (t, t_{0}, [g^{2}]) g^{1} \neq g^{2},

(199)

since

t, t_{0}

are arbitrary. Physical observables are obtained from squared matrix elements of

U

so a transition

Φ_{n} \to Φ_{k}

is a physical process if and only if

| 〈 Φ_{k} | U (t, t_{0}, [g^{1}]) | Φ_{n} {〉 |}^{2} = | 〈 Φ_{k} | U (t, t_{0}, [g^{2}]) | Φ_{n} {〉 |}^{2},

(200)

whatever

g^{1}

and

g^{2}

may be. The operator

U (+ \infty, - \infty)

is the S-matrix. It is of special importance for the calculation of physical quantities, particularly cross-sections for light scattering.

We choose the Coulomb gauge Hamiltonian as the reference case (

g^{1} = g^{‖}

) and

g^{2}

as arising from any Hamiltonian obtained from a PZW transformation with

g^{2}

determining the generator

F

(138). The first-order approximation for

U = U^{(1)}

is linear in the coupling constant

\begin{matrix} U {(t, t_{0}; [g])}^{(1)} & = 1 - \frac{i}{ℏ} \int_{t_{0}}^{t} V (t^{'}, [g]) d t^{'}, \\ = 1 - \frac{i}{ℏ} \int_{t_{0}}^{t} V (t^{'}, [0]) d t^{'} - {(\frac{i}{ℏ})}^{2} \int_{t_{0}}^{t} [H_{0}, F (t^{'})] d t^{'}, \\ = U {(t, t_{0}; [0])}^{(1)} - {(\frac{i}{ℏ})}^{2} \int_{t_{0}}^{t} [H_{0}, F (t^{'})] d t^{'} . \end{matrix}

(201)

In the energy representation defined by the reference Hamiltonian

H_{0} | Φ_{n} 〉 = E_{n} | Φ_{n} 〉,

(202)

this becomes the matrix equation,

\begin{matrix} 〈 Φ_{k} | U {(t, t_{0}; [g])}^{(1)} | Φ_{n} 〉 & = 〈 Φ_{k} | U {(t, t_{0}; [0])}^{(1)} | Φ_{n} 〉 \end{matrix}

(203)

\begin{matrix} - {(\frac{i}{ℏ})}^{2} (E_{k} - E_{n}) {(F)}_{k n} \int_{t_{0}}^{t} e^{i (E_{k} - E_{n}) t^{'} / ℏ} d t^{'} . \end{matrix}

(204)

This result implies definite restrictions on the kinds of questions that can be asked about the time evolution of an ‘atom’ in the presence of electromagnetic radiation. For example, suppose the state

Φ_{n}

describes an atom initially

(t_{0})

in its ground-state

| ψ_{0} 〉

, with the radiation field in a specified state

| i 〉

; the probability that the atom is in a state

| ψ_{p} 〉

while the field is in a state

| j 〉

at a later time t is determined by,

| 〈 ψ_{p}, j | U {(t, t_{0}; [g])}^{(1)} | i, ψ_{0} {〉 |}^{2} .

(205)

This will only be gauge invariant if either

E_{k} = E_{n}

or

〈 Φ_{k} | F | Φ_{n} 〉 = 0

since the integral never vanishes for

t \neq t_{0}

. Any state in the Hilbert space is a possible final state however, and the difficulty for time-dependent perturbation theory is that it does not generally restrict the final states to those that give gauge invariant amplitudes in Equation (204). The customary appeal in time-dependent perturbation theory to the time-energy uncertainty relation as the guarantee of approximate energy conservation is not sufficient to eliminate the gauge dependent contribution.

Although a question about the probability (205) may seem very natural it evidently may have no physically meaningful answer. Crucially there is an exceptional case. Gauge invariance may be ensured, irrespective of the matrix elements of

F

, through the matrix

{(F)}_{k n}

having a zero coefficient; this occurs in the asymptotic limit

t_{0} \to - \infty, t \to + \infty

because

lim_{\begin{matrix} t \to + \infty \\ t_{0} \to - \infty \end{matrix}} E_{k n} \int_{t_{0}}^{t} e^{i E_{k n} τ / ℏ} d τ \propto (E_{k} - E_{n}) δ (E_{k} - E_{n}) = 0 .

(206)

The first-order perturbation calculation just discussed can be substantially generalised [71]; the main conclusion is the same however. Probability amplitudes for transitions induced by electromagnetic radiation are gauge invariant provided that the initial and final states are stable, that is, stationary [45].

The gauge invariance of the S-matrix for QED is the expected result. The gauge invariance of the S-matrix in the full Lorentz invariant QED is a well established and fundamental result [1]. However we cannot use the Lorentz invariant theory to calculate S-matrix amplitudes for processes involving atoms and molecules, and then take the non-relativistic limit; instead we adopt the Hamiltonian (141) and start all over again with a non-covariant perturbation theory, We use the same methods as the ones actually used in practice for light scattering calculations. The quantisation of both particles and radiation is essential for the logical consistency of the theory since the S-matrix is a quantum mechanical probability amplitude for the combined system of charged particles and the electromagnetic field described by the Schrödinger equation for

H

(141).

The literature of specific perturbation theory calculations is very extensive; it includes absorption/emission of radiation, spectral lineshapes, the Kramers-Heisenberg formula for light scattering, nonlinear optical processes, intermolecular forces and atomic self-energies [15,44,46,72,73,74,75,76,77,78]. The majority of such calculations use only the leading terms of a multipole expansion of the interaction, usually just the electric dipole contribution (the long-wavelength approximation), and are based on either the Coulomb gauge interaction, or the multipolar Hamiltonian that arises from the PZW transformation. It is obviously of interest to establish the circumstances in which these two formulations yield the same observables, and also when different answers might be expected. A familiar example of the first case is afforded by the calculation of light scattering cross-sections. For example, the identity of the generalised Kramers-Heisenberg dispersion formula obtained from either of these gauge choices can be demonstrated by a direct transformation of the matrix elements of the two different forms of

V

without any multipole approximation [79]; this result can also be obtained by a more formal method involving the PZW transformation theory [80]. Such calculations have been limited to low order perturbation theory and just two specific gauges. On the other hand the two gauges give different results in, for example, line shape calculations based on time-dependent perturbation theory [10]—hardly surprising in view of the above discussion.

From the theoretical point of view it is very desirable to investigate the gauge invariance of the perturbation theory in a general way that goes beyond the low-order theory that is sufficient for most experiments, and is not tied to any particular gauge. One reason is to exclude possible gauge dependent contributions in higher orders of perturbation theory; these cannot be ruled out just on the grounds of the smallness of the coupling constant (the dimensionless fine structure constant

α \approx 1 / 137)

since terms involving

g

in perturbation theory have a polynomial dependence on it, and so can be arbitrarily large (

g

is potentially unbounded).

The idea of the argument in ref. [71] is to show that the difference between the T-matrices in two different gauges can be written as a commutator involving the reference Hamiltonian

H_{0}

and powers of

F

(138); the

(k n)

-matrix elements of the difference term are then always proportional to

(E_{k} - E_{n})

and so will be annihilated by the energy conservation Dirac delta function (cf. Equation (206)) provided the matrix element containing

g

does not have a pole at

E = E_{n}

. A proof by induction shows that this is true in every order of perturbation theory.

As we have seen the Hamiltonian (141) is related to the familiar Coulomb gauge Hamiltonian by the Power-Zienau-Woolley transformation with the operator

\begin{matrix} Λ_{p z w} = & exp (- \frac{i}{ℏ} \int P (x) \cdot A (x) d^{3} x) \\ = & e^{i F / ℏ} \end{matrix}

(207)

where

A

is the Coulomb gauge vector potential as usual. One can view the PZW transformation as a coherent state boson translation which, for any choice of polarisation field, creates a corresponding Fock space from the original Fock space of the Coulomb gauge theory. The resulting coherent state operators involve a mixture of the original particle and field variables, and the coupling constants {

e_{i}

}; they only make sense for the interacting system. The integration in Equation (207) has always been interpreted using the usual product for continuous functions. For point charges and using Equation (38) one finds that the transformed space and the original Fock space have orthogonal vacuum states

〈 0 | Λ_{p z w} | 0 〉 = 0

(208)

because in this limit [6,16]

\int P (x) \cdot A (x) d^{3} x \to \infty .

(209)

Thus the transformation is no longer unitary and the proof of the gauge invariance of the S-matrix fails.

Consideration must be given to the infinities found in non-relativistic quantum electrodynamics based on the Hamiltonian (141); they are due to the neglect of the true mathematical nature of the field operators (electromagnetic and matter polarisation) that it is formulated in terms of. Since the familiar Coulomb gauge form is simply a special case of Equation (141) this remark is quite general. Calculation treating the operators as ordinary continuous functions in the point-particle model leads to an infinite ‘electromagnetic mass’, and to problems for the energy

E_{P}

. Even without considering interactions there is the infinite zero-point energy of the free electromagnetic field. In mathematical terms the electromagnetic field operators are neither absolutely integrable, nor square integrable; consequently the conventional Fourier representations (145), (148), (150) are simply formal expressions. The field variables are however locally integrable on any compact subspace of

R^{3}

so may be viewed as tempered distributions in the space variable

x

(see Appendix A for a short description of Schwartz distributions). One must give up the idea that these fields are continuous vector-valued functions/operators, and reinterpret them as distributions [7]. This means ‘smearing’ the field variables with a function belonging to the Schwartz space,

S

; in the notation of Appendix A this is explicitly, for the vector potential

A \to T_{A}, s \to 〈 T_{A}; s 〉 = \int_{R^{3}} A (x) s (x) d x

(210)

as in Equation (A4), and similarly for the electric field,

E

, and the magnetic field,

B

.

At non-relativistic energies the electrons and nuclei appear to have no structure, and it is natural to describe them as ‘point-like’. Having said that there is something paradoxical about associating mass to entities that have no extension in space. This tension manifests itself in the appearance of the Dirac ‘delta function’ in the formalism, an object that needs to be handled with great care. The Dirac delta is a distribution, and since Equation (144) is an equality the LHS of the commutator must also involve distributions. Likewise with Equation (147). The electric polarisation field

P

(33) is also a distribution, and the vector potential does not belong to the Schwartz space so the usual interpretation of the PZW transformation operator does not respect the distributional nature of the polarisation field. Recognition that we are dealing with distributions on its own does not solve the problem of giving meaning to the non-linear terms in the Hamiltonian (141), nor indeed to the PZW transformation operator since multiplication of distributions may not be defined.

The Colombeau algebra described in Appendix A offers a means to address these foundational problems at the expense of an unfamiliar mathematical framework [81,82]. The ambiguity in the multiplication of the field distributions that appear in the non-linear terms in the Hamiltonian is solved by embedding them in a Colombeau algebra by convolution with a so-called mollifier

s_{ϵ} (x)

. So for example, a Colombeau representative,

A_{ϵ}

, of the vector potential is constructed by convolution with a mollifier,

s_{ϵ} (x)

, which is chosen as a test function, (a Schwartz function with compact support). The final result is that

A_{ϵ} (x)

is obtained from Equation (148) by the simple inclusion of a factor

\hat{s} (ϵ k)

A_{ϵ} (x) = \int (\hat{A} (k) \hat{s} (ϵ k) e^{i k \cdot x} + h . c .) d^{3} k

(211)

where

\hat{s}

is the Fourier transform of s. Furthermore,

\hat{s} (k)

is required to have the value 1 in a finite neighbourhood of

| k | = 0

; the {

\hat{s}

} functions are referred to as dampers [83] and are also used as test functions (in the sense of Schwartz).

\hat{A} (k) \hat{s} (ϵ k)

is absolutely integrable for

ϵ > 0

, and the Fourier integral (211) is defined properly. The general idea of this approach is that a representative

A_{ϵ}

replaces

A

in all calculations with

ϵ

non-zero until the end of the calculation. One has

lim_{ϵ \to 0} A_{ϵ} = A

(212)

so the usual theory is recovered in the limit

ϵ = 0

. The occurrence of divergences indicates a singular limit and one must keep

ϵ > 0

for such cases. The singular quantities do not arise in perturbation calculations involving only real photons, for example, the Kramers-Heisenberg formula for light scattering; they appear to be confined to the contributions of virtual photons in the sense of perturbation theory. Thus if both factors in the integrand of Equation (207) are considered as distributions they can be transferred to a Colombeau algebra such that the integral (207) becomes finite for

ϵ

non-zero; this is sufficient to regularise

Λ_{pzw}

and then the proof of gauge invariance goes through.

The work in ref. [83] is concerned with translating the formal calculations of a model quantum field theory - the original Heisenberg-Pauli (HP) quantum theory of a scalar boson field - into the Colombeau algebra to start a mathematically rigorous justification for what is done conventionally. The boson field operator and its conjugate are reinterpreted as distributions in the space variable

x

and then transformed into elements of the Colombeau algebra. The ‘free-field’ part of the HP Hamiltonian is closely related to the free-field part of the QED Hamiltonian (141) since both involve only quadratic combinations of the field operators. They can be written as Hamiltonian densities which, interpreted as distributions so that when integrated over all space (

R^{3}

) with a suitable ‘damper’ acting as a test function, give the Hamiltonian as the energy operator. The zero-point energy of the free HP Hamiltonian was shown to be finite, and one can reasonably expect the same result for the free quantised electromagnetic field since it is also quadratic in the field operators.

The self-interactions of charged particles can also be expected to be finite. The Colombeau construction is sufficient to render divergent integrals like in Equations (160) and (161) finite; such regularised integrals have to be understood as ‘generalised numbers’. They cannot be given a definite value since there is no unique choice of the mollifier s. Such contributions to the matrix element

a_{0}

have no dependence on any of the physical variables and so do not contribute to the equations of motion; they can therefore be dropped quite properly. The contribution to

b_{1}

can be removed by mass renormalisation with the replacement of the parameter m by the experimental mass of the charge, and the omission of the (now finite) integral.

9. The Coulomb Hamiltonian and Chemical Physics

In the perturbation theory approach the charge-field coupling terms are separated off from the full Hamiltonian (141), and the remainder is taken to be the ‘unperturbed’ Hamiltonian,

H_{0}

; it described the dynamics of the charges, and of the free-field. The free-field Hamiltonian was discussed in Section 7, and it remains to consider the Hamiltonian for the charges,

h = \sum_{n = 1}^{N} \frac{| p_{n} |^{2}}{2 m_{n}} + \frac{1}{2 ϵ_{0}} \int_{ℜ^{3}} {| P |}^{2} d x .

(213)

Some results for specific choices for the polarisation field

P

were discussed in Refs. [6,16] and, in particular, the second term in Equation (213) was examined. Although it was noted that the components of the vector

P

in the point particle model were distributions, the subsequent calculations were performed as though they were continuous functions. These calculations were entirely classical. The properties of

P

are determined (Section 3) by a vector quantity,

g

, which is a Green’s function or fundamental solution of the divergence equation. Writing the polarisation field in terms of the Green’s function,

E_{P}

can be put in the form

E_{P} = \frac{1}{2 ϵ_{0}} \sum_{i, j}^{N} e_{i} e_{j} \int g (x; X_{i}) \cdot g (x; X_{j}) d^{3} x .

(214)

This results in not only the Coulomb interaction between pairs of charges and the usual infinite ‘self-energies’ (with

g \equiv g^{‖}

) but also a variety of other divergent terms; with the line-integral form, for example, one can obtain

E_{P} \sim \frac{e^{2}}{4 π ϵ_{0}} δ^{2} (0) \int d l + \dots

(215)

where l is the arc length along the integration path, and

δ^{2} (0)

is the singular spatial delta function in two dimensions evaluated at the origin with dimension

L^{- 2}

. This is the leading term in the non-relativistic limit of a result obtained by string theory techniques [16,20]. What is surprising about Equation (215) is that the familiar Coulomb interaction found in the longitudinal polarisation field’s contribution has been precisely cancelled by an equal and opposite sign term coming from the squared transverse polarisation field contribution, leaving only a ‘contact’ type of interaction. Such a result seems to depend on exactly how the calculation is done; for example one may also obtain just the Coulomb interaction

1 / r

form but with the same divergent coefficient [84] as in Equation (215).

However once one recognises that the Green’s function is really a distribution

g

with which

g

is associated it becomes clear why the calculation of

E_{P}

in the usual manner is problematic. As noted in Appendix A the Schwartz ‘impossibility theorem’ shows that, in general, distributions cannot be multiplied unambiguously unlike the continuous functions we are used to; thus there is the obvious question: how should

E_{P}

(214) be understood? This observation was the initial motivation for an appeal to the Colombeau algebra [82]. In the special case of the Coulomb gauge,

g^{⊥} = 0

, the result is that

E_{P}

, is precisely the usual Coulomb energy of distinct pairs of charges for

r = | X_{i} - X_{j} | \neq 0

together with a finite self-energy contribution. For any

g^{⊥} \neq 0

, the integral in general yields a

1 / r

dependence modulated by a function of r that depends on the precise expression for

g^{⊥}

. However with the understanding that the PZW transformation operator is regularized in the same way, Section 8, the transformation is unitary and the physical picture cannot change. To show this explicitly one must isolate

E_{P}^{⊥}

and combine it with the contribution to the Hamiltonian of the interaction term

\int P \cdot {\tilde{E}}^{⊥} d^{3} x,

(216)

again treating both factors as distributions transferred into the Colombeau algebra framework (this calculation remains to be done).

The quantum theory of the atom is based on the Coulomb Hamiltonian for a specified number of electrons interacting with a given nucleus such that the overall system is electrically neutral. A molecule is composed of atoms and an obvious extension of the formalism offers a possible description of the molecule and its properties. The Coulomb Hamiltonian for the electrons and nuclei specified by a chemical formula of a chemical substance is modeled on the quantum theory of the atom through the inclusion of terms arising from the additional nuclei associated with the molecules of the chosen substance. Written out in full for a system of N electrons with position variables,

x_{i}^{e}

, and a set of A nuclei with position variables

x_{i}^{n}

it may be written, in the Schrödinger (position) representation, as

\begin{matrix} H (x^{n}, x^{e}) = - \frac{ℏ^{2}}{2} \sum_{k = 1}^{A} \frac{\nabla^{2} (x_{k}^{n})}{m_{k}} - \frac{ℏ^{2}}{2 m} \sum_{i = 1}^{N} \nabla^{2} (x_{i}^{e}) + \frac{e^{2}}{8 π ϵ_{0}} \sum_{i, j = 1}^{N}' \frac{1}{| x_{i}^{e} - x_{j}^{e} |} \\ - \frac{e^{2}}{4 π ϵ_{0}} \sum_{i = 1}^{A} \sum_{j = 1}^{N} \frac{Z_{i}}{| x_{j}^{e} - x_{i}^{n} |} + \frac{e^{2}}{8 π ϵ_{0}} \sum_{i, j = 1}^{A}' \frac{Z_{i} Z_{j}}{| x_{i}^{n} - x_{j}^{n} |} \end{matrix}

(217)

in which the position operators are simple time-independent multiplicative operators acting on functions of the coordinate variables (‘wavefunctions’). The primes on the second and last summations require the diagonal (

i = j

) terms to be omitted; they refer to the (finite) self-energy of each charge. It is assumed that the charge and mass parameters are the experimentally observed values for the particles. The Hamiltonian may be written symbolically in the form

\begin{matrix} H = & T_{N} + T_{e} + V_{Coul} \\ \equiv & T_{N} + H^{el} \end{matrix}

(218)

where

T_{N}

is the sum of the kinetic energy operators for the nuclei,

T_{e}

is the analogous sum for the electrons, and

V_{Coul}

is the Coulombic (electrostatic) energy operator for all pairs of charges.

‘Free space’ boundary conditions are assumed so the full Galilean symmetry group of an isolated system can be realised. The Hamiltonian (217) is the time-translation generator for that group. The other nine generators are the components of the vector operators describing space translations (the total momentum

P

), space rotations (the total angular momentum

J

), and the relationship between reference frames moving at different velocities (the ‘booster’

K

).The group generators can all be separated into centre-of-mass and internal contributions which are uncoupled, so that the dynamics of the centre-of-mass can be discussed quite separately from the internal (‘spectroscopic’) dynamics of the charges. There are no explicit spin interactions; spin enters indirectly through the permutation symmetry of identical particles, bosons or fermions. Additionally, the Hamiltonian

H

commutes with the operator for space inversions, so all non-degenerate states can be assigned definite parity.

In 1929 Dirac wrote famously [85]

The underlying physical laws necessary for the mathematical theory of a large part of physics and the whole of chemistry are thus completely known, and the difficulty is only that the exact application of these laws leads to equations much too complicated to be soluble. It therefore becomes desirable that approximate practical methods of applying quantum mechanics should be developed, which can lead to an explanation of the main features of complex atomic systems without too much computation.

In 1928 just before Dirac wrote this he’d been in Leipzig in the company of Debye and London, and Sidgwick and Hinshelwood. He might just have heard London speak of treating the nuclei as classical clamped particles in quantum mechanical calculations of the simplest molecules like H₂, and it is the clamped nuclei form of the Coulomb Hamiltonian that he was possibly thinking of when he wrote the famous quotation. Certainly this clamped-nuclei Hamiltonian was the relevant one for all of the above mentioned company.

There was scant evidence for his sweeping claim, “the whole of chemistry”, but even so the quotation still resonates in Physics today, for example [86],

At one stroke, everything makes sense, and you can calculate everything. Take one example: do you remember the periodic table, devised by Mendeleev, which lists all the possible elementary substances of which the universe is made, from hydrogen to uranium, and which was hung on so many classroom walls? Why are precisely these elements listed there, and why does the periodic table have this particular structure, with these periods, and with the elements having these specific properties ? The answer is that each element corresponds to one solution of the main equation of quantum mechanics. The whole of chemistry emerges from a single equation.

Of course no one doubts the importance of the periodic table as an organizing principle for the rational classification of the elements, but the claim that the “whole of chemistry” can be obtained from the solutions of the main equation of quantum mechanics has never been demonstrated. The devil is in the detail !

Clamping the nuclei in the Coulomb Hamiltonian and treating them as classical particles in order to calculate a potential energy surface (PES) on which the nuclei can move is nowadays usually called ‘making the Born-Oppenheimer approximation’. The clamped-nuclei Hamiltonian is the one used in almost all modern computational chemistry, and in the following we discuss its relationship to the Coulomb Hamiltonian. It is convenient to follow Born and Oppenheimer’s simplified notation, using

x, \frac{\partial}{\partial x}

to stand for the electronic positions and momenta, and X for the nuclear positions, and consider the Schrödinger equation for

H

in a position representation [87]

(H (x, X) - E) ψ (x, X) = 0 .

(219)

The argument can be summarized as follows. Born and Oppenheimer had two main ideas: firstly to regard

T_{N}

as a small perturbation of

H^{el}

with the quantity

κ = {(\frac{m}{M_{o}})}^{\frac{1}{4}}

(220)

where

M_{o}

is any nuclear mass,

m_{k}

, or their mean, chosen as the small parameter in a perturbation series. Rewriting the total Hamiltonian (218) with

κ

displayed explicitly one has

H = κ^{4} H_{1} + H_{0}

(221)

where the terms can be identified from Equation (218). So far, no approximation has been made.

The crucial second idea is the introduction of the clamped-nuclei Hamiltonian. They put Equation (221) into Equation (219) and commented

If one sets $κ = 0$ … one obtains a differential equation in the x alone, the X appearing as parameters:

[H_{o} (x, \frac{\partial}{\partial x}, X) - W] ψ = 0 .

“Sie stellt offenber die Bewegung der Elektronen bei festgehaltenen Kernen dar” - “evidently this represents the electronic motion for stationary nuclei”.

H_{o}

is now referred to as the ‘clamped-nuclei Hamiltonian’; it is not the same as

H_{0}

in Equation (221) which regards the position variables for the nuclei as quantum operators, whereas in the above quotation they have become simply classical parameters.

Consider the unperturbed electronic Hamiltonian

H_{o} (x, X_{f})

at a fixed nuclear configuration

X_{f}

. The Schrödinger equation for this

H_{o}

is

(H_{o} (x, X_{f}) - E^{o} {(X_{f})}_{m}) φ {(x, X_{f})}_{m} = 0 .

(222)

For every

X_{f}

,

H_{o}

is self-adjoint on the electronic Hilbert space

H (X_{f})

; thus its spectrum lies on the real energy axis. This Hamiltonian’s natural domain,

D_{o}

, is the set of square integrable electronic wavefunctions {

φ_{m}

} with square integrable first and second derivatives;

D_{o}

is independent of

X_{f}

. We may suppose the {

φ_{m}

} are orthonormalized independently of

X_{f}

\int d x φ {(x, X_{f})}_{n}^{*} φ {(x, X_{f})}_{m} = δ_{n m} .

(223)

The clamped-nuclei Hamiltonian can be analyzed with the HVZ theorem [62,63,64] and has both discrete and continuous parts to its spectrum

σ (X_{f}) \equiv σ (H_{o} (x, X_{f})) = [E^{o} {(X_{f})}_{0}, \dots E^{o} {(X_{f})}_{m}) ⋃ [Λ (X_{f}), \infty)

(224)

where the {

E^{o} {(X_{f})}_{k}

} are isolated eigenvalues of finite multiplicities. Their associated normalized eigenvectors form a complete orthonormal system for a subspace

H {(X_{f})}_{d}

of the electronic Hilbert space.

Λ (X_{f})

is the bottom of the essential spectrum marking the lowest continuum threshold. In the case of a diatomic molecule the electronic eigenvalues depend only on the internuclear separation r, and have the form of the familiar potential curves shown in Figure 2.

The continuous spectrum lies in the orthogonal complement,

H {(X_{f})}_{c} = H (X_{f}) - H {(X_{f})}_{d}

; its description requires the use of spectral projectors, rather than a set of ‘eigenvectors’.

Born and Oppenheimer used the set {

φ_{m}

} to calculate, by perturbation theory, approximate eigenvalues of the full molecular Hamiltonian

H

on the assumption that the nuclear motion is confined to a small vicinity of a special (equilibrium) configuration

X_{f}^{0}

. They obtained the energy levels of the low-lying states typical of small polyatomic molecules as an expansion in powers of

κ^{2}

E_{n v J} \approx V_{n}^{(0)} + κ^{2} E_{n v}^{(2)} + κ^{4} E_{n v J}^{(4)} + \dots

(225)

where

V_{n}^{(0)}

is the minimum value of the electronic energy which characterized the molecule at rest,

E_{n v}^{(2)}

is the energy of the nuclear vibrations, and

E_{n v J}^{(4)}

contains the rotational energy. The corresponding approximate wavefunctions are simple products of an electronic function

φ_{m}

and a nuclear wavefunction. This is known as the adiabatic approximation. In the original perturbation formulation the simple product form is valid through

κ^{4}

, but not for higher order terms.

About 25 years later Born gave a modified formulation of the ‘adiabatic approximation’. Making the assumption that the functions

E^{o} {(X)}_{m}

and

φ {(x, X)}_{m}

arising from Equation (222), which represent the energy and wavefunction of the electrons in the state m for a fixed nuclear configuration X, are known Born proposed to solve the wave equation (219) by using them in an expansion [88,89]

ψ (x, X) = \sum_{m} Φ {(X)}_{m} φ {(x, X)}_{m}

(226)

with coefficients {

Φ {(X)}_{m}

} that play the role of nuclear wavefunctions. Substituting this expansion into the full Schrödinger equation (219), multiplying the result by

φ {(x, X)}_{n}^{*}

and integrating over the electronic coordinates x leads to a system of coupled equations for the nuclear functions {

Φ

},

(T_{N} + E^{o} {(X)}_{n} - E) Φ {(X)}_{n} + \sum_{n n^{'}} C {(X, P)}_{n n^{'}} Φ {(X)}_{n^{'}} = 0

(227)

where the coupling coefficients {

C {(X, P)}_{{nn}^{'}}

} have a well-known form. In this formulation the adiabatic approximation consists of retaining only the diagonal terms in the coupling matrix

C (X, P)

, for then

ψ (x, X) \approx ψ {(x, X)}_{n}^{A D} = φ {(x, X)}_{n} Φ {(X)}_{n} .

(228)

Some comments seem pertinent. A perturbation expansion in powers of

κ

is a singular perturbation method because

κ

is a coefficient of differential operators of the highest order occurring in the original Schrödinger equation. It is now known that the energy level expansion (225) is an asymptotic series, of the character of the semiclassical WKB approximation [90]. An obvious problem, which Born was aware of, is the neglect of the overall centre-of-mass of the molecule because all solutions of the full Schrödinger equation lie in the continuum. However one can simply interpret their coordinates as referring to the internal Hamiltonian obtained by transformation to centre-of-mass and internal coordinates (see below). Much more important however is that the tacit assumption in Equation (226) that the expansion is over a ‘complete set of states’ is not mathematically correct. There is no known calculational algorithm for the spectral projectors required for the continuous part of the spectrum of the clamped-nuclei Hamiltonian. As they have unknown analytic properties one is unable to check (by functional analysis) the calculation, and something that cannot be checked cannot rationally be claimed to be ‘exact’.

The key idea in the Born-Oppenheimer approach is the decomposition of the Coulomb Hamiltonian into a part containing all contributions of the nuclear momenta, and a remainder, as in Equation (221). Let us reconsider their argument [3,4,5,91]. It is easily seen from Equation (217) that the Coulomb interaction is translation invariant. Thus the total momentum operator

P = \sum_{i}^{n} p_{i},

(229)

commutes with

H

, and

H

has purely continuous spectrum. Physically the centre-of-mass of the whole system, with position operator

R = \frac{1}{M_{T}} \sum_{i}^{n} m_{i} x_{i}, M_{T} = \sum_{i}^{n} m_{i}

(230)

behaves like a free particle. It is helpful to introduce

R

and its conjugate

P_{R}

, together with appropriate internal coordinates, into

H

to make explicit the separation of the centre-of-mass and the internal dynamics

H = H_{C M} + H^{'}

(231)

H

may then be written as a direct integral

H = \int_{R^{3}}^{\oplus} H (P) d P,

(232)

where

H (P) = H^{'} + \frac{{| P |}^{2}}{2 M_{T}}

(233)

is the Hamiltonian at fixed total momentum

P

. The internal Hamiltonian

H^{'}

is independent of the centre-of-mass variables and acts on

L^{2} (R^{3 (n - 1)})

; it is explicitly translation invariant.

A simple procedure to make the internal Hamiltonian explicit is to refer the particle coordinates to a point moving with the system, for example, the centre-of-mass itself, the centre-of-nuclear-mass or one of the moving particles. As a result of the transformation to internal variables the kinetic energy operators are no longer diagonal in the particle indices and certain choices of the moving point, for example the choice of a single nucleus, result in an operator in which the nuclear and electronic indices are mixed. However the choice of the centre-of-nuclear-mass as the point of origin avoids this mixing, so this is a practical choice since the implementation of the permutation symmetry of identical particles is then feasible, should an actual calculation be attempted.

There are

A - 1

translationally invariant coordinates

t_{i}^{n}

expressed entirely in terms of the original

x_{g}^{n}

that may be associated with the nuclei, and there are N translationally invariant coordinates for the electrons

t_{i}^{e}

which are simply the original electronic coordinates

x_{i}^{e}

referred to the centre-of-nuclear-mass. There are corresponding canonically conjugate internal momentum operators.

The total kinetic energy operator separates in the form

T = T_{CM} + T_{Nu} + T_{el} .

(234)

T_{CM}

is the kinetic energy for the centre-of-mass and, for example,

T_{el}

only involves electronic variables

T_{el} = \frac{1}{2 μ} \sum_{i = 1}^{N} {(π_{i}^{e})}^{2} + \frac{1}{2 M} \sum_{i j = 1}^{N}' (π_{i}^{e}) \cdot (π_{j}^{e})

(235)

with

M = \sum_{g = 1}^{A} m_{g}, \frac{1}{μ} = \frac{1}{m} + \frac{1}{M} .

(236)

After this unitary transformation the original Coulomb Hamiltonian operator for the molecule can be rewritten in the form

H (x^{e}, p^{e}, x^{n}, p^{n}) \to T_{C M} + T_{N u} + H^{el}

(237)

where

H^{el}

is composed of Equation (235) together with all the Coulomb interaction operators expressed in terms of the {

t^{e}, t^{n}

} position operators. The internal Hamiltonian is

H^{'} = T_{N u} + H^{el}

(238)

with Schrödinger equation

H^{'} | Ψ 〉 = E | Ψ 〉 .

(239)

The Hamiltonian

H^{el}

has the same invariance under the rotation-reflection group O(3) as does the full translationally invariant Hamiltonian

H^{'}

. In a position representation its Schrödinger equation is

H^{el} Ψ {(t^{e}, t^{n})}_{m} = E_{m} Ψ {(t^{e}, t^{n})}_{m}

(240)

where

m

is used to denote a set of quantum numbers

(J M p r k)

: J and M for the angular momentum state: p specifying the parity of the state: r specifying the permutationally allowed irreps within the group(s) of identical particles and k to specify a particular energy value if there are any discrete energy levels.

Let

b

be some eigenvalue of the

t^{n}

corresponding to choices {

x_{g} = a_{g}, g = 1, \dots A

} in the laboratory-fixed frame; then the {

a_{g}

} describe a classical nuclear geometry. The set, X, of all

b

is

R^{3 (A - 1)}

. We denote the Hamiltonian

H^{el}

evaluated at the nuclear position eigenvalue

b

as

H {(b, t^{e})}_{o} = H_{o}

for short; this

H_{o}

is very like the usual clamped-nuclei Hamiltonian but it is explicitly translationally invariant, and has an extra term, the second term in Equation (235) which is often called the Hughes-Eckart term, or the mass polarisation term. The Schrödinger equation for

H_{o}

is of the same form as Equation (222), with eigenvalues

E^{o} {(b)}_{k}

and corresponding eigenfunctions

φ {(t^{e}, b)}_{k}

, and with spectrum

σ (b)

analogous to Equation (224),

H_{o} φ {(b, t^{e})}_{k} = E^{o} {(b)}_{k} φ {(b, t^{e})}_{k} .

(241)

To every solution of Equation (241) there corresponds a ‘wavefunction’

Φ {(t^{e}, t^{n})}_{m} = φ {(b, t^{e})}_{m} δ (t^{n} - b)

(242)

in the

(t^{e}, t^{n})

position representation which is a formal solution of the Schrödinger equation for

H^{el}

. The energy,

E_{m} (b)

of the function (242) is independent of the orientation of the Figure 3 defined by the

b

, and is also unaltered by the parity operation

b \to - b

, and by permutations of the labelling of any identical nuclei.

Φ_{m}

however depends on the orientation of the body-fixed frame defined by the configuration

b

with respect to some space-fixed reference frame. Let the Euler angles relating these two frames be

Ω

so that

Φ {(b)}_{m} = Φ {(\bar{b}, Ω)}_{m}

in an obvious notation, so we have a continuous family of degenerate states. The dependence on orientation is eliminated by forming a continuous superposition through integration over the Euler angles with some weight function

c (Ω)

Ψ_{m} = \int d Ω^{'} c (Ω^{'}) Φ {(\bar{b}, Ω^{'})}_{m}

Similarly one may form superpositions of the space-inverted and permuted states in order to form a new basis that displays the corresponding symmetries that leave the energy eigenvalue unchanged.

As with the clamped-nuclei Hamiltonian,

H_{o}

is self-adjoint on an electronic Hilbert space

H (b)

, so we have a family of Hilbert spaces {

H (b)

} which are parameterized by the nuclear position vectors

b \in X

that are the ‘eigenspaces’ of the family of self-adjoint operators

H_{o}

. From them we can construct a big Hilbert space as a direct integral over all the

b

values

H = \int_{X}^{\oplus} H (b) d b

(243)

and this is the Hilbert space for

H^{el}

in Equation (237). As before, the nuclear kinetic energy operator is proportional to

κ^{4}

, so the internal Hamiltonian

H^{'}

is seen to be of the same form as Equation (221). There is however a fundamental difference between Equations (221) and (237) which may be seen as follows; with the centre-of-nuclear-mass chosen as the electronic origin

H^{el}

is independent of the nuclear momentum operators and so it commutes with the nuclear position operators

[H^{el}, t^{n}] = 0 .

(244)

Equation (243) leads directly to a fundamental result; since

H^{el}

commutes with all the {

t^{n}

}, it has the direct integral decomposition

H^{el} = \int_{X}^{\oplus} H {(b, t^{e})}_{o} d b .

(245)

The internal molecular Hamiltonian

H^{'}

in Equation (233) and the clamped-nuclei like operator

H_{o}

just defined can be shown to be self-adjoint (on their respective Hilbert spaces) by reference to the Kato-Rellich theorem because in both cases there are kinetic energy operators that dominate the (singular) Coulomb interaction. This argument cannot be made for

H^{el}

because it contains nuclear position operators in some Coulombic terms but there are no corresponding nuclear kinetic energy terms to dominate those Coulomb potentials. Reassuringly, the abstract direct integral operator (245) is indeed self-adjoint since the resolvent of the clamped-nuclei Hamiltonian is integrable. This is demonstrated in Theorem XIII.85 in the book by Reed and Simon [92]. It is in this direct integral form that the operator is used in mathematically rigorous accounts of the Born-Oppenheimer approximation in, for example, refs. [90,93,94].

The direct integral representation (245) implies at once that the spectrum of

H^{el}

lies on the real axis and is purely continuous

σ = σ (H^{e l}) = ⋃_{b} σ (b) \equiv [V_{0}, \infty)

(246)

where

V_{0}

is the minimum value of

E {(b)}_{0}

; in the diatomic molecule case this is the minimum value of the potential energy curve,

E_{0} (r)

.

H^{el}

has no normalizable eigenvectors [3,4,5]. The same conclusion about a purely continuous spectrum is demonstrated in a less formal way in Weinberg’s book [34]. Hence an expansion analogous to the one proposed by Born is not feasible.

The decomposition of the molecular Hamiltonian into a nuclear kinetic energy contribution, proportional to

κ^{4}

, and a remainder does not yield molecular potential energy surfaces. Allowing the nuclear masses to increase without limit in

H^{el}

does not produce an operator with a discrete spectrum since this would just cause the mass polarisation term to vanish and the effective electronic mass to become the rest mass. It is thus not possible to reduce the molecular Schrödinger equation to a system of coupled differential equations of classical type for nuclei moving on potential energy surfaces as suggested by Born. An extra choice of fixed nuclear positions must be made to give any discrete spectrum and normalizable

L^{2}

eigenfunctions. This choice, that is, the introduction of the clamped-nuclei Hamiltonian by hand, into the molecular theory is the essence of the ‘Born-Oppenheimer approximation’. If the molecular Hamiltonian

H

were classical, the removal of the nuclear kinetic energy terms would indeed leave a Hamiltonian representing the electronic motion for stationary nuclei, as claimed by Born and Oppenheimer. The argument is a subtle one, for subsequently, once the classical energy surface has emerged, the nuclei are treated as quantum particles; while the classical limit of a collection of fermions is a system of classical particles, the corresponding limit for bosons is a classical field theory [26]. In practice, indistinguishably of the nuclei is rarely carried through.

Functions of the type (242) have been used as the basis of a Rayleigh-Ritz calculation being, hopefully, well-adapted to the construction of useful trial functions. Several different lines have been developed; in the adiabatic model the trial function is written as the continuous linear superposition

\begin{matrix} Ψ {(t^{e}, t^{n})}_{m} = & \int F (b) φ {(b, t^{e})}_{m} δ (t^{n} - b) d b \\ = & F (t^{n}) φ {(t^{n}, t^{e})}_{m} \end{matrix}

(247)

where the square-integrable weight factor

F (t^{n})

may be determined by reducing Equaton (239) to an effective Schrödinger equation for the nuclei in which

F (t^{n})

appears as the eigenfunction [95].

One can replace the unnormalizable delta function in Equation (247) by a continuous function,

𝒳_{a} (t^{n}, b)

, where the only constraint required is that

𝒳_{a}

should be square integrable. One thereby arrives at trial wavefunctions

Ψ {(t^{e}, t^{n})}_{m}^{G C M} = \int F (b) φ {(b, t^{e})}_{m} 𝒳_{a} (t^{n}, b) d b

(248)

for some suitably chosen parameter

a > 0

. This is the basis of the molecular Generator Coordinate Method (GCM) which is a non-adiabatic formalism since the electronic and nuclear variables are no longer separable; as before the weight factor

F (b)

is determined by appeal to the Rayleigh-Ritz quotient, although part of its structure can be determined purely by symmetry arguments. In the GCM the effective Schrödinger equation for the weight function becomes an integral equation (the Hill-Wheeler equation) [96]. Again the trial function may be improved, in the sense of a variational calculation, by forming linear superpositions of the wavefunctions {

Ψ^{GCM}

}; this has been done for simple diatomic molecules for which a fairly complete GCM account has been developed [96,97]. Usually however the dependence on the nuclear variables {

t^{n}

} is not expressed through functions adapted to nuclear permutation symmetry, and the GCM weight functions are determined by molecular structure considerations.

The rigorous mathematical analysis of the original perturbation approach proposed by Born and Oppenheimer for a molecular Hamiltonian with Coulombic interactions was initiated by Combes and co-workers [93,94,98] with results for the diatomic molecule. Some properties of the operator

H^{el}

(245) seem to have been first discussed in this work. A perturbation expansion in powers of

κ

leads to a singular perturbation problem because

κ

is a coefficient of differential operators of the highest order in the problem; the resulting series expansion of the energy is an asymptotic series, closely related to the WKB approximation obtained by a semiclassical analysis of the effective Hamiltonian for the nuclear dynamics. This requires a more complete treatment than the adiabatic model using the partitioning technique to project the full Coulomb Hamiltonian,

H^{'}

, onto the adiabatic subspace. A comprehensive account can be found in the recent review by Jecko [91]. A normalized electronic eigenvector

{| φ (b)}_{j} 〉

is associated with a projection operator by the usual correspondence

P {(b)}_{j} {= | φ (b)}_{j} {〉 〈 φ (b)}_{j} | .

(249)

In view of our earlier discussion of the ‘big Hilbert space’

H

, we can form a direct integral over all nuclear positions

P_{j} = \int_{X}^{\oplus} P {(b)}_{j} d b

(250)

to yield a projection operator on the adiabatic subspace. If we want to include m electronic levels we can form a direct sum of the contributing {

P_{j}

}

P = ⨁_{j = 0}^{m} P_{j} .

(251)

This is an Hermitian projection operator and it, and its complement,

Q

, have the usual properties

P + Q = 1, P^{2} = P, Q^{2} = Q, P Q = Q P = 0 .

(252)

Using these projection operators the original molecular Schrödinger equation for the internal dynamics can be transformed into a pair of coupled equations

\begin{matrix} P H^{'} P | ψ 〉 + P H^{'} Q | 𝒳 〉 = & E 1 | ψ 〉 \end{matrix}

(253)

\begin{matrix} Q H^{'} P | ψ 〉 + Q H^{'} Q | 𝒳 〉 = & E 1 | 𝒳 〉 \end{matrix}

(254)

where

| ψ 〉 = P | Ψ 〉, | 𝒳 〉 = Q | Ψ 〉 .

(255)

Solving Equation (254) for

| 𝒳 〉

| 𝒳 〉 = \frac{1}{E 1 - Q H^{'} Q} Q H^{'} P | ψ 〉

(256)

and substituting in Equation (253) yields the usual Löwdin partitioned equation [66]

(P H^{'} P + P H^{'} Q \frac{1}{E 1 - Q H^{'} Q} Q H^{'} P) | ψ 〉 = E 1 | ψ 〉 .

(257)

Further progress depends crucially on establishing the properties of the energy dependent operator in Equation (257). A detailed consideration of the diatomic molecule case can be found in refs. [94,99]. The main result is that Equation (257) is a generalized version of the effective nuclear Schrödinger equation in the adiabatic model, so it contains the nuclear kinetic energy operators and an effective potential

V

. Thus a rigorous quantum mechanical calculation yields estimates of the energies of the bound states of the internal Hamiltonian of the isolated molecule model that retains the spirit of Born and Oppenheimer while overcoming the mathematical weaknesses of the earlier treatments. Its accuracy depends crucially on what one can make of the second term in Equation (257). Of course it is a spectroscopic account.

Computational quantum chemists started to investigate the Coulomb Hamiltonian without reference to the Born-Oppenheimer (clamped-nuclei) approach in the late 1960s, particularly after the use of Gaussian orbitals became common in electronic structure calculations. A brief summary with references to the early work can be found in ref. [100]. As with the GCM approach described earlier the computational load is considerable, and progress beyond simple diatomic molecules is still very limited [101,102,103]. As the distinguished spectroscopist Arthur Schawlow is reputed to have remarked “A diatomic molecule is a molecule with one atom too many”, a jocular comment that reflects the sheer complexity of the energy level spectrum of the simplest molecules [104].

The generic molecule has a molecular formula that allows for more than one sensible chemical structure; ‘sensible’ here means that the proposed structures have typical chemical bonds. The distinct species are isomers; one expects to be able to prepare such isomeric substances in the synthesis laboratory. Obviously diatomic molecules fall outside this description; although their spectroscopy was traditionally discussed in terms of potential energy curves, it can be treated within quantum mechanics based on the Coulomb Hamiltonian for the isolated molecule [96]. Of course for a diatomic molecule to have any chemistry one has to have other atoms/molecules available to interact with it, and then there is no longer an isolated diatomic molecule, and one has the possibility of isomerism. Here’s a typical example.

The cyanogen radical, CN, has two reaction pathways with the hydrogen atom yielding the isomers HCN (hydrogen cyanide) and its zwitterion, HNC (hydrogen isocyanide). It is well known to astronomers being abundant in the tails of comets, in the interstellar medium, and in some stellar atmospheres. Its characteristic spectrum gives information about the chemical composition in these environments. A typical recent spectroscopic study of the Helix planetary nebula reported the details of the emission spectrum at frequencies around 270, 179 and 90 GHz respectively [105], transitions attributed to HCN and its isomer HNC. They are easily distinguished by their microwave spectra, and the intensities of their emission lines give information about the gas temperature in that environment.

It was already remarked 20 years ago that it was not explained how the unique set of energy levels obtained from a non-Born-Oppenheimer calculation on the Coulomb Hamiltonian for (H+C+N) (treating all the particles as quantum mechanical) could account for the distinct spectra of the two isomers [106]. To the writer’s knowledge it has never been explained. That prospect gets ever more distant as the complexity of the isomer family grows. Some may be ‘polar’, some ‘non-polar’, and in quite simple families (by chemical standards) some may even be chiral, for example the isomers of C₃H₂D₂, all of which have been prepared by suitable syntheses [6].

Towards the end of his life, P.-O. Löwdin made an extended study of a quantum mechanical definition of a molecule; in one of his late papers he lamented [107]

The Coulombic Hamiltonian $H^{'}$ does not provide much obvious information or guidance, since there is [sic] no specific assignments of the electrons occurring in the systems to the atomic nuclei involved—hence there are no atoms, isomers, conformations etc. In particular one sees no molecular symmetry, and one may even wonder where it comes from. Still it is evident that all this information must be contained somehow in the Coulombic Hamiltonian.

Löwdin seems to be echoing the sentiment in the famous Dirac quotation. In the light of the analysis given here it does not seem evident that the Coulombic Hamiltonian on its own will give rise to the chemically interesting features Löwdin required of it, nor will they be approachable by regular perturbation theory (supposedly convergent) starting from its eigenstates. Fundamental modifications of the quantum theory of the Coulomb Hamiltonian for a generic molecule have to be made for a chemically significant account of dipole moments, functional groups and isomerism, optical activity and so on. In other words one should not expect useful contact between the quantum theory of an isolated molecule (which is what the eigenstates of the Coulombic Hamiltonian refer to) and a quantum account of individual molecules, as met in ordinary chemical situations. If the molecule is not isolated it must be interacting with something; that something is loosely referred to as the ‘environment’. It might be other molecules, the macroscopic substance the molecule finds itself in, or quantized electromagnetic radiation (blackbody radiation is all pervasive), or all of them. Moreover chemistry occurs at finite temperatures,

T > 0

, whereas the Isolated Molecule model is a

T = 0

account. The characteristic feature of discussions of individual molecules, is that the crucial idea of structure is put in by hand at the outset. This is a feature of many-body physics (condensed matter, nuclei, chemistry) and results in remarkably powerful and fruitful theoretical formalisms [108].

This year we celebrate the International Year of Quantum Science and Technology to mark the centenary of Heisenberg’s trip to Helgoland where he conceived the idea of basing quantum theory purely on observable quantities: atomic emission spectra might be described purely in terms of the frequency and intensity of the emitted radiation, combined somehow with an array of probabilities. The unobservable orbits of the Bohr-Rutherford ‘planetary model’ of the atom would henceforth be banished, and there would be no picture of what was happening in spectroscopy. This approach was eventually formalized in a fundamental postulate of quantum mechanics [109],

All observable physical quantities correspond to Hermitian operators. The only measurable values of a physical observable are the various eigenvalues of the corresponding operator.

There is no such operator for the orbits.

This year is also the 150

t h

anniversary of the publication by J.H. van ‘t Hoff of La Chimie dans l’Espace which marked the beginning of stereochemistry, an all pervasive concept in chemistry thereafter; the physicochemical properties of a chemical substance was to be rationalized in terms of the three-dimensional structure of the molecules associated with it [110]. The American physical chemist G.N. Lewis once wrote [111]

No generalization of science, even if we include those capable of exact mathematical statement, has ever achieved a greater success in assembling in a simple way a multitude of heterogeneous observations than this group of ideas we call structural theory.

Today no one doubts that molecular structure is a key concept in chemical science—perhaps the key concept. The important point is that the molecular structure hypothesis offers a representation of chemical phenomena that has enabled chemists to grasp fresh and significant relationships in their experimental findings. The unresolved paradox for a fundamental account of chemistry from the point of view of quantum mechanics is that, like Bohr’s orbits, there is no Hermitian operator for molecular structure which is not, according to the fundamental postulate just quoted, an ‘observable’.

Funding

This research received no external funding.

Conflicts of Interest

The author declares no conflicts of interest.

Appendix A. Generalised Functions

The Schwartz space,

S (R^{n})

, is the vector space of smooth, complex-valued functions on

R^{n}

that, together with all their derivatives, decrease at infinity faster than any polynomial; the (n-dimensional) function

f (x) = g (x) e^{- x^{2}}

, for any polynomial

g (x)

is an example. More precisely this means that such a function f has the property that for any partial derivative

\partial^{α}

and any integer m, there is a constant

C_{α, m} > 0

such that for all x

{(1 + | x |)}^{m} | (\partial^{α} f) (x) | \leq C_{α, m} .

(A1)

We denote a general element of S by

s = s (x)

; they are essential elements of the mathematical theory of ‘generalised functions’. From their definition it follows that Schwartz functions have well defined Fourier transforms {

\hat{s}

} which are also elements of

S

. The same is true for the sum, product and convolution of two Schwartz functions. The subset of elements, {

x \mapsto ϕ (x)

}, of

S

that are not only smooth but have compact support are usually referred to as ‘test functions’. This subset is also a vector space that is usually denoted

D

.

A distribution is a particular type of linear functional and can be thought of as a development of the classical idea of a function. Instead of f ‘acting on’ point values to give an outcome, a distribution ‘acts on’ elements of a set of functions [112]. A regular distribution is a continuous, linear functional on the set,

D

; it is associated with a locally integrable function f according to the pairing formula

ϕ \mapsto 〈 T_{f}; ϕ 〉 = \int_{R^{n}} f (x) ϕ (x) d x .

(A2)

In general the result of such an integration is a complex number. The collection of such functionals on

D

is a vector space denoted

D^{'}

. A simpler notation which maintains the distinction between a distribution and the function it is associated with is to write the distribution in a different typeface so that for example using fraktur

f \equiv T_{f} .

(A3)

A tempered distribution may be defined in the same way as Equation (A2) except that the integrand involves the elements {s} of the whole Schwartz space

S

,

s \mapsto 〈 T_{f}; s 〉 = \int_{R^{n}} f (x) s (x) d x .

(A4)

Such functionals belong to a vector space denoted

S^{'}

. The tempered distributions are important because they provide the basis for the modern account of the Fourier transform for functions that fall outside the classical definition. The Fourier transform of a tempered distribution

T

is another tempered distribution,

\hat{T}

, acting on s as

〈 \hat{T}, s 〉 = 〈 T, \hat{s} 〉

(A5)

where

\hat{s}

is the usual classical Fourier transform of s.

The framework expressed by Equation (A5) can be applied to give a meaning to the Fourier transform of the Coulomb potential which in the classical function framework is ill-defined. In

R^{3}

the Coulomb potential is locally integrable and bounded at infinity so we may view it, and its Fourier transform, as tempered distributions, specifically

〈 T_{V}, s 〉 = \int \frac{1}{| x |} s (x) d^{3} x

(A6)

From Equation (A5) we have

〈 \hat{T_{V}}, s 〉 = \int_{R^{3}} \frac{1}{| x |} \hat{s} (x) d x = lim_{R \to \infty} \int_{| x | < R} \frac{1}{| x |} \hat{s} (x) d x .

(A7)

Now for

R > 0

\begin{matrix} \int_{| x | < R} \frac{1}{| x |} \hat{s} (x) d x = & \int_{| x | < R} \frac{1}{| x |} \int e^{i x \cdot k} s (k) d^{3} k d^{3} x \\ = & \int d^{3} k s (k) \int_{| x | < R} \frac{1}{| x |} e^{i x \cdot k} d^{3} x \\ = & \int d^{3} k s (k) \frac{4 π}{{| k |}^{2}} (1 - cos (| k | R)) . \end{matrix}

(A8)

The integral involving the cosine factor in Equation (A8) vanishes for

R \to \infty

, as follows from the properties of s and an integration by parts. Hence

\int_{R^{3}} \hat{T_{V}} s (x) d x = \int_{R^{3}} \frac{4 π}{{| k |}^{2}} s (k) d k

(A9)

where the Fourier transform of the Coulomb potential is understood as the distribution associated with the function

k \mapsto 4 π / {| k |}^{2}

.

The convolution of a tempered distribution with a function

s \in S

is given by

s^{'} * 〈 T, s 〉 = 〈 T, {\tilde{s}}^{'} * s 〉

(A10)

where

{\tilde{s}}^{'} (x) = s^{'} (- x)

is the reflection of

s^{'}

about the origin. The distributional derivative is defined in similar fashion by passing the differentiation through to the Schwartz function; we set

{(T_{f})}^{'} = - 〈 T_{f}; s^{'} 〉 .

(A11)

If f is a differentiable function, the derivative of the distribution associated with it,

T_{f}

, is defined to be the distribution associated with the usual derivative of the function f. That is the RHS of Equation (A11) is simply

〈 T_{f^{'}}; s 〉

. This follows from an integration by parts and recognition that the boundary term vanishes because s belongs to the Schwartz space.

There are numerous distributions that have no associated function and cannot be represented as in Equation (A4); familiar examples are the Cauchy principal value and the Dirac delta function, which perhaps is the most well-known generalised function. The ‘delta function’,

δ (x)

was introduced by Dirac for handling the continuous spectrum in quantum mechanics with the definition [113]

\int_{R} δ (x) d x = 1, δ (x) = 0, x \neq 0

(A12)

but with no value defined at

x = 0

. That the conventional description of the Dirac delta function is problematic can be seen as follows; the definition (A12) is consistent with the criterion for a function to be integrable. However the associated tempered distribution

T_{δ}

defined by Equation (A4) is trivial,

T_{δ} = 0

, since an integral over an interval of zero length (the point 0) is zero whatever

s (x)

is; instead the precise definition of the delta function relies on its action on elements s of S, that is

〈 δ_{x_{0}}, s 〉 = s (x_{0});

(A13)

however by analogy with Equation (A4) one commonly writes

\int_{R} δ (x - x_{0}) s (x) d x = s (x_{0}) .

(A14)

This is a purely formal statement since there is no function

δ

satisfying Equation (A12) according to the classical definition of a function, and the ‘integral’ in Equation (A14) cannot be interpreted in the usual way as a Riemann or Lebesgue integral. To indicate that the ‘delta function’ is a distribution we write it as

δ

.

The Fourier transform of the delta distribution is simply a constant, the value of which depends on how the factor of

2 π

is shared between the transform and its inverse. The tempered distribution

T_{δ}

is the identity operation for convolution in the sense that

\begin{matrix} s^{'} * 〈 T_{δ}, s 〉 = & 〈 T_{δ}, s^{'} * s 〉 \\ = & \int_{R} s^{'} (y) s (y) d y \equiv 〈 T_{s^{'}}, s 〉 \end{matrix}

(A15)

Distributions can be given a concrete realisation in the following way which is an alternative view to the ‘abstract’ description above. For simplicity of exposition we restrict the discussion to distributions on

R

. Let

s (x) \in S

be a normalised function in the Schwartz space

\int s (x) d x = 1,

(A16)

and note that its translations,

s (z - x)

, also belong to the set of Schwartz functions,

S

. By dilation of s with a parameter

0 < ϵ < 1

, we obtain the scaled function

s_{ϵ} = \frac{1}{ϵ} s (\frac{x}{ϵ})

(A17)

with the same normalisation.

s_{ϵ}

can be viewed as a representation of the Dirac

δ

which has been ‘broadened out’ or mollified about

x = 0

. It has area 1 (normalisation (A16)), approximate height

1 / 2 ϵ

, and approximate width

2 ϵ

. A function such as Equation (A17) is called a mollifier.

Given a continuous function or distribution f we can construct ‘representatives’ of it as a sequence of functions that are smooth in the variable x using the convolution integral with

s_{ϵ}

,

F_{ϵ} (s, x) = \int_{R} \frac{1}{ϵ} s (\frac{z - x}{ϵ}) f (z) d z

(A18)

which is an obvious echo of the ‘sifting property’ of the Dirac delta. With a change of integration variable we have

F_{ϵ} (s, x) = \int_{R} s (y) f (x + ϵ y) d y .

(A19)

which is such that

f = lim_{ϵ \to 0} F_{ϵ} (s)

(A20)

irrespective of the particular Schwartz function s in the convolution. Such representatives are very convenient when derivatives of distributions are required, but as they stand there is a severe limitation, namely, representatives when multiplied together do not give a representative of a distribution.

As an example, consider the Dirac delta which is the

ϵ \to 0

limit of

s_{ϵ}

(A17). According to the above discussion its square should be available from the square of its representative, so taking

ξ \in S

we should have for a distribution

\begin{matrix} 〈 δ^{2}, ξ 〉 = & lim_{ϵ \to 0} \int_{R} ξ (x) \frac{1}{ϵ} s {(\frac{x}{ϵ})}^{2} d x \\ = & lim_{ϵ \to 0} \frac{ξ (0)}{ϵ} \int_{R} s {(z)}^{2} d z \end{matrix}

(A21)

which is evidently ∞. Thus

δ^{2}

does not belong to the space of distributions,

D^{'}

. This illustrates a quite general result: there is no multiplication on all of

D^{'}

giving a result that is in

D^{'}

.

As another example, take two functions,

f_{1}, f_{2}

, on some set

Ω \subset R^{n}

. When considered as distributions they are the linear forms

〈 T_{f_{1}}, s 〉 = \int_{Ω} f_{1} (x) s (x) d x, 〈 T_{f_{2}}, s 〉 = \int_{Ω} f_{2} (x) s (x) d x .

(A22)

Their product is then

s \mapsto \int_{Ω} f_{1} (x) s (x) d x \int_{Ω} f_{2} (x) s (x) d x,

(A23)

while the classical product of

f_{1}, f_{2}

interpreted as a distribution leads to

s \mapsto \int_{Ω} f_{1} (x) f_{2} (x) s (x) d x

(A24)

which in general is not the same as Equation (A23); thus the notion of ‘product’ is ambiguous. The idea of a distribution as a generalisation of the classical notion of a function has been very fruitful in analysis; however some non-linear problems require new mathematics that goes beyond the notion of a distribution as a linear form so that the problem that multiplication is not defined for distributions [114] can be overcome.

An essential mathematical notion that is required for this development is that of ‘embedding’ elements of a set into another set. Roughly speaking the relationship between two sets

X

and

Y

is an embedding if the map

f : X \to Y

has the properties:

For every $x_{1}, x_{2} \in X$ such that $x_{1} \neq x_{2}, f (x_{1}) \neq f (x_{2})$ , that is, different elements of $X$ correspond to distinct elements of $Y$ .
If some property holds for $x_{1}, x_{2}, \dots x_{n}$ the same property holds for $f (x_{1}), f (x_{2}) \dots f (x_{n})$ .

A subgroup in a larger group, the integers in relation to the rational numbers, the rationals in relation to real numbers are all examples of embeddings.

The aim is to define an embedding that gives a set,

G

, of generalised functions containing the distributions and ordinary functions such that the usual rules of differentiation apply, and multiplication is defined; such a set is called a differential algebra. The Schwartz impossibility theorem [114] is the demonstration that there is no differential algebra in which the ordinary product of continuous functions is equal to the corresponding product of generalised functions they are related to ref. [115]. The Schwartz theorem can be evaded if the factors

u, v

in the distributional product

(u * v)

satisfy certain regularity conditions. For example Hörmander gave a criterion in terms of the ‘wave front sets’ of the two distributions. Roughly speaking when the Fourier transform of a factor, u, around any point does not decay exponenetially in the direction of a particular wave vector (an element of its wave-front set), the Fourier transform of the other factor, v, must decay exponentially in the opposite direction of that wave vector [116]. This idea can be used to regularise the singular Feynman diagrams in the usual perturbation theory approach to Lorentz invariant QED and to carry through the renormalisation programme [1,117]. The key insight is the recognition that the Schwartz theorem did not apply to the smooth functions usually denoted by the set (

C^{\infty}

). Using the properties of the Schwartz space he demonstrated that the product of two smooth functions embedded in

G

coincides with their ordinary product (in

C^{\infty}

).

The details of the algebra are given in several monographs [81,118,119], and applications, developments and further literature can be found in refs. [83,115,120]. The first step in constructing an element of an algebra of generalised functions from f (function or distribution) is the definition of representatives as in Equation (A18); such representatives can be freely multiplied. Colombeau’s aim was to replace f by some

F_{ϵ}

(a generalised function) with an infinitesimal error for finite

ϵ

, and this requires further restrictions on the admissible functions {s}. Note however that there is not a single s to be considered; the construction is available to any element of the set of mollifiers. For ease of presentation the following is restricted to the one-variable case (

R

); it can be extended to many variables (

R^{n}

). If one puts

ϵ = 0

the conventional account in terms of functions and/or distributions is recovered.

1.: Let $F s_{ϵ}$ be the Fourier transform of Equation (A17)

$F s_{ϵ} : = {\hat{s}}_{ϵ} (k) \equiv \hat{s} (ϵ k) .$

(A25)

A mollifier s is said to be ‘suitable’ if the transform $\hat{s} (k) = 1$ in a finite neighbourhood of $k = 0$ and not just at the point 0. Such a transform has height 1, approximate area $2 / ϵ$ and approximate width $2 / ϵ$ . It can be viewed as a ‘cut-off’ for large k values that vanishes smoothly as $k \to \infty$ ; for this reason the Fourier transforms ${\hat{s}}$ are referred to as dampers. As with the mollifiers a product of dampers belongs to the set of dampers. A Fourier transform of an element of $S$ belongs to $S$ ; in the following we will require the further condition that the transform $\hat{s}$ be a test function, that is, has compact support ( $\hat{s} \in D \subset S)$ [83] (see below, Equation (A29)).
2.: Since $F_{ϵ} (s, x)$ is smooth it has a Taylor series expansion in powers of $ϵ$ with remainder

$\begin{matrix} F_{ϵ} (s, x) = & f (x) + \dots \frac{ϵ^{m}}{m!} f^{m} (x) \int_{R} y^{m} s (y) d y \\ + & O_{x} (ϵ^{m + 1}) \end{matrix}$

(A26)

where as usual $f^{m} (x)$ is the m^th derivative of $F_{ϵ}$ evaluated at x. It follows that

$\begin{matrix} F_{ϵ} (s, x) - f (x) = & \sum_{m = 1} \frac{ϵ^{m}}{m!} f^{m} (x) \int_{R} y^{m} s (y) d y \\ + & O_{z} (ϵ^{m + 1}) . \end{matrix}$

(A27)
3.: Colombeau showed that it is possible to construct a set of Schwartz functions {s} such that their first m moments vanish [118],

$\int_{R} y^{k} s (y) d y = 0, 1 \leq k \leq m .$

(A28)

This means that the sum term in Equation (A27) can be made to vanish and $F_{ϵ} (x) = f (x) + O_{x} (ϵ^{m + 1})$ . Hence the remainder can be made as small as we please for m large enough for any $ϵ$ .
4.: An equivalent formulation of the conditions (A28) can be expressed in terms of the Fourier transformation [115]; using the ‘hat’ notation for the transform of any function $s (x)$ in the Schwartz space S, we have the relations

$\begin{matrix} \hat{s} (0) = & \int_{R} s (x) d x \\ {(- i)}^{n} D^{n} \hat{s} (0) = & \int_{R} x^{n} s (x) d x \end{matrix}$

(A29)

where D stands for any derivative operator. Taking $s (x)$ with $\hat{s} (0) = 1$ , the conditions (A28) are satisfied for any m as large as we like. All the mollifiers can be assumed to belong to the set

$A_{\infty} = \{s (x) \in S, \hat{s} \in D, with \hat{s} (0) = 1\}$

(A30)

where ‘0’ implies a finite neighbourhood of zero, not just the point 0. This is the viewpoint adopted in ref. [83].
5.: Among the functionals $F_{ϵ} (s; x)$ we focus on two categories, $F_{ϵ} (s; x) \in E_{M}$ , called moderate, and $F_{ϵ} (s; x) \in N$ , called negligible. The precise distinction between these two categories can be found in the references to the Colombeau algebra cited previously. The important point is that the Colombeau algebra $G$ is the quotient $E_{M} / N$ . The moderate functionals have polynomial growth in $1 / ϵ$ as $ϵ \to 0$ , whereas the negligible ones decay faster than any power of $ϵ$ as $ϵ \to 0$ . Heuristically, the moderate functionals are the ones of interest, and the negligible ones effectively play the role of the generalised number ‘0’ in the algebra.The product of two moderate functionals is again moderate, whereas if the product contains at least one negligible functional the result is negligible. Two functionals are said to be ‘equivalent’ if their difference, $F - F^{'}$ , is negligible.

A generalised function is associated with a distribution

T_{f}

if it has a representative

F_{ϵ}

belonging to the class of moderate functionals such that

lim_{ϵ \to 0} F_{ϵ} = T_{f} \in D^{'} .

(A31)

Every distribution can be converted to a moderate family through the construction of a representative by convolution as above. However not every moderate family is the regularisation of a distribution. The square of the Dirac delta is not a distribution (A21), but its representative (A17) squared is moderate. Whereas there is only one Dirac

δ

distribution in

D^{'}

there is an infinity of Dirac

δ

like generalised functions in

G

, and likewise for any other distribution. Colombeau emphasised that ‘equivalent’ functionals in

G

are not necessarily equal, in the sense of the classical equality denoted by =, because they may differ by infinitesimal quantities; rather there is a weak equality for which he proposed the relational symbol ≈ [120].

The general idea then is that the functions {f} are transformed to {

F_{ϵ}

} as in Equation (A18) with mollifiers taken from Equation (A30), and the appropriate

F_{ϵ}

is used in all calculations with

ϵ

finite until the end of the calculation. If the model is linear the results will be the same as though one stayed within distribution theory which is recovered in the limit

ϵ \to 0

. In linear or nonlinear models which result in divergences as

ϵ \to 0

, the parameter

ϵ

can be kept finite, and the results will be ‘generalised numbers’ or ‘generalised functions’, genuinely new mathematical objects.

References

Weinberg, S. The Quantum Theory of Fields. Volume I: Foundations; Cambridge University Press: Cambridge UK, 1995. [Google Scholar] [CrossRef]
Woolley, R.G.; Sutcliffe, B.T.P.-O. Löwdin and the quantum mechanics of molecules. In Fundamental World of Quantum Chemistry: A Tribute to the Memory of Per-Olov Löwdin. Volume 1; Brändas, E.J., Kryachko, E.S., Eds.; Kluwer Academic: Springer Science+Business Media: Dordrecht, The Netherlands, 2003; pp. 21–65. [Google Scholar]
Sutcliffe, B.T.; Woolley, R.G. On the quantum theory of molecules. J. Chem. Phys. 2012, 137, 22A544. [Google Scholar] [CrossRef]
Sutcliffe, B.T.; Woolley, R.G. Comment on “On the quantum theory of molecules”. J. Chem. Phys. 2014, 140, 037101. [Google Scholar] [CrossRef]
Sutcliffe, B.T.; Woolley, R.G. The potential energy surface in molecular Quantum Mechanics. In Advances in Quantum Methods and Applications in Chemistry, Physics, and Biology; Hotokka, M., Brändas, E.J., Maruani, J., Delgado-Barrio, G., Eds.; Springer International Publishing Switzerland: Cham, Switzerland, 2013; pp. 3–40. [Google Scholar] [CrossRef]
Woolley, R.G. Foundations of Molecular Quantum Electrodynamics; Cambridge University Press: Cambridge, UK, 2022. [Google Scholar] [CrossRef]
Lieb, E.H.; Seiringer, R. The Stability of Matter in Quantum Mechanics; Cambridge University Press: Cambridge, UK, 2010. [Google Scholar] [CrossRef]
Andrews, D.L. Molecular quantum electrodynamics: Developments of principle and progress in applications. Physics 2025, 7, 49. [Google Scholar] [CrossRef]
Belinfante, F.J. Consequences of the postulate of a complete commuting set of observables in quantum electrodynamics. Phys. Rev. 1962, 128, 2832–2837. [Google Scholar] [CrossRef]
Power, E.A.; Zienau, S. Coulomb gauge in non-relativistic quantum electro-dynamics and the shape of spectral lines. Philos. Trans. R. Soc. A Math. Phys. Engin. Sci. 1959, 251, 427–454. [Google Scholar] [CrossRef]
Fiutak, J. The multipole expansion in quantum theory. Can. J. Phys. 1963, 41, 12–20. [Google Scholar] [CrossRef]
Atkins, P.W.; Woolley, R.G. The interaction of molecular multipoles with the electromagnetic field in the canonical formulation of non-covariant quantum electrodynamics. Proc. R. Soc. A Math. Phys. Engin. Sci. 1970, 319, 549–563. [Google Scholar] [CrossRef]
Woolley, R.G. On the hamiltonian theory of the molecule-electromagnetic field system. Mol. Phys. 1971, 22, 1013–1023. [Google Scholar] [CrossRef]
Belinfante, F.J. On the longitudinal and the transversal delta-function, with some applications. Physica 1946, 12, 1–16. [Google Scholar] [CrossRef]
Power, E.A. Introductory Quantum Electrodynamics; Longmans: London, UK, 1964; Available online: https://archive.org/details/introductoryquan0000unse_d5h5 (accessed on 4 November 2025).
Woolley, R.G. Power–Zienau–Woolley representations of nonrelativistic QED for atoms and molecules. Phys. Rev. Res. 2020, 2, 013206. [Google Scholar] [CrossRef]
Dirac, P.A.M. Gauge-invariant formulation of quantum electrodynamics. Can. J. Phys. 1955, 33, 650–660. [Google Scholar] [CrossRef]
Woolley, R.G. A reformulation of molecular quantum electrodynamics. J. Phys. B 1974, 7, 488–499. [Google Scholar] [CrossRef]
Mansfield, P.R.W. Faraday’s lines of force as strings: From Gauss’s law to the arrow of time. J. High Energy Phys. 2012, 2012, 149. [Google Scholar] [CrossRef]
Edwards, J.P.; Mansfield, P.R.W. Delta-function interactions for the bosonic and spinning strings and the generation of Abelian gauge theory. J. High Energy Phys. 2015, 2015, 127. [Google Scholar] [CrossRef]
Ostrogradsky, M. Mémoires sur les équations différentielles, relatives au problème des isopérimètres. Mém. Acad. Sci. St.-Petersburg 1850, 6, 385–517. Available online: https://archive.org/details/mmoiresdelacad6461850impe/page/384/mode/2up (accessed on 4 November 2025).
Woolley, R.G. The electrodynamics of atoms and molecules. Adv. Chem. Phys. 1975, 33, 153–233. [Google Scholar] [CrossRef]
Schiller, R.; Schwartz, M. Structure-independent electrodynamics in the electric-dipole approximation. Phys. Rev. 1962, 126, 1582–1588. [Google Scholar] [CrossRef]
Moniz, E.J.; Sharp, D.H. Absence of runaways and divergent self-mass in nonrelativistic quantum electrodynamics. Phys. Rev. D 1974, 10, 1133–1136. [Google Scholar] [CrossRef]
Schwarzschild, K. Zur Elektrodynamik. II. Die elementare elektrodynamische Kraft. Nachr. Gesell. Wissen. Göttingen Math.-Phys. Kl. 1903, 1903, 132–141. Available online: https://eudml.org/doc/58547 (accessed on 4 November 2025).
Heitler, W. The Quantum Theory of Radiation; Clarendon Press/Oxford University Press: Oxford, UK, 1936; Available online: https://archive.org/details/in.ernet.dli.2015.37198 (accessed on 4 November 2025).
Woolley, R.G. Classical electrodynamics. In Handbook of Molecular Physics and Quantum Chemistry, Volume 1: Foundations; Wilson, S., Bernath, P.F., McWeeny, R., Eds.; John Wiley & Sons: Chichester, UK, 2003; pp. 623–645. Available online: https://archive.org/details/handbookofmolecu0001unse_c8a7 (accessed on 4 November 2025).
Woolley, R.G. On non-relativistic electron theory. Ann. Inst. Henri Poincaré A Phys. Théor. 1975, 23, 365–378. Available online: https://www.numdam.org/item/AIHPA_1975__23_4_365_0/ (accessed on 4 November 2025).
Dirac, P.A.M. Generalized Hamiltonian dynamics. Can. J. Math. 1950, 2, 129–148. [Google Scholar] [CrossRef]
Dirac, P.A.M. The Hamiltonian form of field dynamics. Can. J. Math. 1951, 3, 1–23. [Google Scholar] [CrossRef]
Dirac, P.A.M. Les transformations de jauge en Électrodynamique. Ann. Inst. Henri Poincaré 1952, 13, 1–42. Available online: http://www.numdam.org/item?id=AIHP_1952__13_1_1_0 (accessed on 4 November 2025).
Dirac, P.A.M. Lectures on Quantum Mechanics; Dover Publications, Inc.: Mineola, NY, USA, 2001; Available online: https://archive.org/details/lecturesonquantu0000dira_h7c5 (accessed on 4 November 2025).
Rothe, H.J.; Rothe, K.D. Classical and Quantum Dynamics of Constrained Hamiltonian Systems; World Scientific Co. Pte. Ltd.: Singapore, 2010. [Google Scholar] [CrossRef]
Weinberg, S. Lectures on Quantum Mechanics; Cambridge University Press: Cambridge, UK, 2013. [Google Scholar] [CrossRef]
Woolley, R.G. Molecular quantum electrodynamics. Proc. R. Soc. Lond. A Math. Phys. Engin. Sci. 1971, 321, 557–572. [Google Scholar] [CrossRef]
Fermi, E. Quantum theory of radiation. Rev. Mod. Phys. 1932, 4, 87–132. [Google Scholar] [CrossRef]
Dirac, P.A.M. Lectures on Quantum Field Theory; Belfer Graduate School of Science, Yeshiva University: New York, NY, USA, 1966; p. 73ff. Available online: https://archive.org/details/lecturesonquantu0000dira (accessed on 4 November 2025).
Mandal, A.; Hunt, K.L.C. Gauge-invariant expectation values of the energy of a molecule in an electromagnetic field. J. Chem. Phys. 2016, 144, 044109. [Google Scholar] [CrossRef]
Valatin, J.G. Singularities of electron kernel functions in an external electromagnetic field. Proc. R. Soc. A Math Phys. Engin. Sci. 1954, 222, 93–108. [Google Scholar] [CrossRef]
Kramers, H.A. Non-relativistic quantum-electrodynamics and correspondence principle. In Proceeding of the Solvay Conference “Les Particules Elémentaires” (1948); Reprinted in Kramers, H.A. Collected Scientific Papers; North-Holland Publishing Co.: Amsterdam, The Netherlands, 1948; pp. 241–265. Available online: https://www.lorentz.leidenuniv.nl/IL-publications/Kramers.html (accessed on 4 November 2025).
Weyl, H. Gravitation und Elektrizität. Sitzungsber. Preuss. Akad. Wissen. Phys. Math. Kl. 1918, 26, 465–480, English translation: Gravitation and electricity. In O’Raifeartaigh, L. The Dawning of Gauge Theory; Princeton University Press: Princeton, NJ, USA, 1997; pp. 24–37. [Google Scholar] [CrossRef]
Schrödinger, E. Über eine bemerkenswerte Eigenschaft der Quantenbahnen eines einzelnen Elektrons. Z. d. Phys. 1922, 12, 13–23, English translation: On a remarkable property of the quantum-orbits of a single electron. In O’Raifeartaigh, L. The Dawning of Gauge Theory; Princeton University Press: Princeton, NJ, USA, 1997; pp. 77–86.. [Google Scholar] [CrossRef]
London, F. Quantenmechanische Deutung der Theorie von Weyl. Z. d. Phys. 1927, 42, 375–389, English translation: Quantum-mechanical interpretation of Weyl’s theory. In O’Raifeartaigh, L. The Dawning of Gauge Theory; Princeton University Press: Princeton, NJ, USA, 1997; pp. 94–106.. [Google Scholar] [CrossRef]
Loudon, R. The Quantum Theory of Light; Oxford University: New York, NY, USA, 2001; Available online: https://www.scribd.com/document/671415430/Quantum-Theory-of-Light (accessed on 4 November 2025).
Cohen-Tannoudji, C.; Dupont-Roc, C.; Grynberg, G. Photons and Atoms: Introduction to Quantum Electrodynamics; Wiley–VCH Verlag GmbH Co. KGaA: Weinheim, Germany, 2004. [Google Scholar] [CrossRef]
Barron, L.D. Molecular Light Scattering and Optical Activity; Cambridge University Press: Cambridge, New York, NY, USA, 2004. [Google Scholar] [CrossRef]
Babiker, M.; Power, E.A.; Thirunamachandran, T. Atomic field equations for maxwell fields interacting with non-Relativistic quantal sources. Proc. Roy. Soc. A Math. Phys. Engin. Sci. 1973, 332, 187–197. [Google Scholar] [CrossRef]
Babiker, M.; Power, E.A.; Thirunamachandran, T. On a generalization of the Power–Zienau–Woolley transformation in quantum electrodynamics and atomic field equations. Proc. R. Soc. A Math. Phys. Engin. Sci. 1974, 338, 235–249. [Google Scholar] [CrossRef]
Haydock, R. The recursive solution of the Schrödinger equation. Solid State Phys. 1980, 35, 215–294. [Google Scholar] [CrossRef]
Pettifor, D.G.; Weaire, D.L. (Eds.) The Recursion Method and Its Applications; Springer: Berlin/Heidelberg, Germany, 1985. [Google Scholar] [CrossRef]
Haydock, R.; Nex, C.M.M. Densities of states, moments, and maximally broken time-reversal symmetry. Phys. Rev. B 2006, 74, 205121. [Google Scholar] [CrossRef]
Spohn, H. Dynamics of Charged Particles and Their Radiation Field; Cambridge University Press: Cambridge, UK, 2004. [Google Scholar] [CrossRef]
Gustafson, S.; Sigal, I.M. Mathematical Concepts of Quantum Mechanics; Springer Nature Switzerland AG: Cham, Switzerland, 2020. [Google Scholar] [CrossRef]
Loss, M.; Miyao, T.; Spohn, H. Lowest energy states in nonrelativistic QED: Atoms and ions in motion. J. Funct. Anal. 2007, 243, 353–393. [Google Scholar] [CrossRef]
Hasler, D.; Herbst, I. Absence of ground states for a class of translation invariant models of non-relativistic QED. Commun. Math. Phys. 2008, 279, 769–787. [Google Scholar] [CrossRef]
Sigal, I.M. Ground state and resonances in the standard model of the non-relativistic QED. J. Stat. Phys. 2009, 134, 899–939. [Google Scholar] [CrossRef]
Bach, V.; Fröhlich, J.; Sigal, I.M. Quantum electrodynamics of confined nonrelativistic particles. Adv. Math. 1998, 137, 299–395. [Google Scholar] [CrossRef]
Lieb, E.H.; Loss, M. Existence of atoms and molecules in non-relativistic quantum electrodynamics. Adv. Theor. Math. Phys. 2003, 7, 667–710. [Google Scholar] [CrossRef]
Pauli, W.; Fierz, M. Zur Theorie der Emission langwelliger Lichtquanten. Nuovo Cim. 1938, 15, 167–188. [Google Scholar] [CrossRef]
Balslev, E.; Combes, J.-M. Spectral properties of many-body Schrödinger operators with dilatation-analytic interactions. Commun. Math. Phys. 1971, 22, 280–294. [Google Scholar] [CrossRef]
Faupin, J.; Sigal, I.M. On Rayleigh scattering in non-relativistic quantum electrodynamics. Commun. Math. Phys. 2014, 328, 1199–1254. [Google Scholar] [CrossRef]
Hunziker, W. The essential spectrum of relativistic multi-particle operators. Helv. Phys. Acta 1966, 39, 451–462. Available online: https://www.e-periodica.ch/digbib/view?pid=hpa-001%3A1966%3A39%3A%3A453 (accessed on 4 November 2025).
van Winter, C.; Brascamp, H.J. The N-body problem with spin–orbit or Coulomb interactions. Commun. Math. Phys. 1968, 11, 19–55. [Google Scholar]
Zhislin, G.M. A study of the spectrum of the Schrödinger operator for a system of several particles. Tr. Mosk. Mat. Obs. [Proc. Moscow Math. Soc.] 1960, 9, 81–120. Available online: https://www.mathnet.ru/eng/mmo/v9/p81 (accessed on 4 November 2025). (In Russian)
Reinhardt, W.P. Complex coordinates in the theory of atomic and molecular structure and dynamics. Ann. Rev. Phys. Chem. 1982, 33, 223–255. [Google Scholar] [CrossRef]
Löwdin, P.-O. The calculation of upper and lower bounds of energy eigenvalues in the perturbation theory by means of partitioning techniques. In Perturbation Theory and Its Application in Quantum Mechanics; Wilcox, C.H., Ed.; John Wiley & Sons, Inc.: New York, NY, USA, 1966; pp. 255–294. [Google Scholar]
Fröhlich, J.; Griesemer, M.; Sigal, I.M. Spectral renormalization group and local decay in the standard model of the non-relativistic quantum electrodynamics. Rev. Math. Phys. 2011, 23, 179–209. [Google Scholar] [CrossRef]
Ballesteros, M.; Faupin, M.J.; Fröhlich, J.; Schubnel, B. Quantum electrodynamics of atomic resonances. Commun. Math. Phys. 2015, 337, 633–680. [Google Scholar]
Bach, V.; Chen, T.; Fröhlich, J.; Sigal, I.M. Smooth Feshbach map and operator-theoretic renormalization group methods. J. Funct. Anal. 2003, 203, 44–92. [Google Scholar] [CrossRef]
Griesemer, M.; Hasler, D. On the smooth Feshbach–Schur map. J. Funct. Anal. 2008, 254, 2329–2335. [Google Scholar] [CrossRef]
Woolley, R.G. Gauge invariance of the S-matrix for atoms, molecules and electromagnetic radiation. Mol. Phys. 1998, 94, 409–416. [Google Scholar] [CrossRef]
Craig, D.P.; Thirunamachandran, T. Molecular Quantum Electrodynamics: An Introduction to Radiation–Molecule Interactions; Dover Publications, Inc.: Mineola, NY, USA, 1998. [Google Scholar]
Andrews, D.L.; Allcock, P. Optical Harmonics in Molecular Systems: Quantum Electrodynamical Theory; Wiley–VCH Verlag GmbH: Weinheim, Germany, 2002. [Google Scholar] [CrossRef]
Salam, A. Molecular Quantum Electrodynamics: Long-Range Intermolecular Interactions; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 2010. [Google Scholar] [CrossRef]
Atkins, P.W.; Barron, L.D. Quantum field theory of optical birefringence phenomena. I. Linear and nonlinear optical rotation. Proc. R. Soc. A Math. Phys. Engin. Sci. 1968, 304, 303–317. [Google Scholar]
Atkins, P.W.; Barron, L.D. Quantum field theory of optical birefringence phenomena. II. Birefringence induced by static and optical electric fields. Proc. R. Soc. A Math. Phys. Engin. Sci. 1968, 306, 119–134. [Google Scholar] [CrossRef]
Atkins, P.W.; Miller, M.H. Quantum field theory of optical birefringence phenomena. III. Birefringence induced by magnetic fields. Mol. Phys. 1968, 15, 491–502. [Google Scholar]
Atkins, P.W.; Miller, M.H. Quantum field theory of optical birefringence phenomena. IV. The inverse and optical Faraday effects. Mol. Phys. 1968, 15, 503–514. [Google Scholar] [CrossRef]
Healy, W.P. A generalization of the Kramers-Heisenberg dispersion formula. Phys. Rev. A 1977, 16, 1568–1574. [Google Scholar] [CrossRef]
Healy, W.P.; Woolley, R.G. On the derivation of the Kramers–Heisenberg dispersion formula from non-relativistic quantum electrodynamics. J. Phys. B 1978, 11, 1131–1136. [Google Scholar] [CrossRef]
Colombeau, J.F. New Generalised Functions and Multiplication of Distributions; North-Holland: Amsterdam, The Netherlands, 1984; Available online: https://www.sciencedirect.com/bookseries/north-holland-mathematics-studies/vol/84/suppl/C (accessed on 4 November 2025).
Woolley, R.G. Infinities in molecular quantum electrodynamics, and generalized functions. Phys. Rev. A 2024, 110, 012204. [Google Scholar] [CrossRef]
Colombeau, J.F.; Gsponer, A. The Heisenberg–Pauli canonical formalism of quantum field theory in the rigorous setting of nonlinear generalized functions (Part I). arXiv 2008, arXiv:0807.0289. [Google Scholar] [CrossRef]
Reinhardt, H.; Quandt, M.; Burgio, G. Temporal Wilson loop in the Hamiltonian approach in Coulomb gauge. Phys. Rev. D 2012, 85, 025001. [Google Scholar] [CrossRef]
Dirac, P.A.M. Quantum mechanics of many-electron systems. Proc. R. Soc. A Math. Phys. Engin. Sci. 1929, 123, 714–733. [Google Scholar] [CrossRef]
Rovelli, C. Seven Brief Lessons on Physics; Penguin Books: London, UK, 2014; p. 15. Available online: https://vialogue.wordpress.com/wp-content/uploads/2020/06/seven-brief-lessons-on-physics-carlo-rovelli_4482.pdf (accessed on 4 November 2025).
Born, M.; Oppenheimer, J.R. Zur Quantentheorie der Molekeln. Ann. d. Phys. 1927, 84, 457–484, English translation: On the quantum theory of molecules. In Quantum Chemistry: Classic Scientific Papers; Hettema, H., Ed.; pp. 1–24. [Google Scholar] [CrossRef]
Born, M. Kopplung der Elektronen- und Kernbewegung in Molekeln und Kristallen. Nachr. Akad. Wissen. Göttingen Math.-Phys. K1. 1951, 6, 1–3, English translation: Coupling of electron and nuclear motion in molecules and crystals. Available online: https://truhlar.chem.umn.edu/courses/chemistry-8565-chemical-reaction-dynamics (accessed on 4 November 2025).
Born, M.; Huang, K. Dynamical Theory of Crystal Lattices; Clarendon Press: Oxford, UK, 1954; Available online: https://archive.org/details/dynamical-theory-of-crystal-lattices (accessed on 4 November 2025).
Klein, M.; Martinez, A.; Seiler, R.; Wang, X.P. On the Born–Oppenheimer expansion for polyatomic molecules. Commun. Math. Phys. 1992, 143, 607–639. [Google Scholar] [CrossRef]
Jecko, T. On the mathematical treatment of the Born–Oppenheimer approximation. J. Math. Phys. 2014, 55, 053504. [Google Scholar] [CrossRef]
Reed, M.; Simon, B. Methods of Modern Mathematical Physics. IV: Analysis of Operators; Academic Press, Inc.: New York, USA, 1978; Available online: https://archive.org/details/methodsofmodernm0000reed_b5n6 (accessed on 4 November 2025).
Combes, J.-M. On the Born–Oppenheimer approximation. In Proceedings of the International Symposium on Mathematical Problems in Theoretical Physics, Kyoto University, Kyoto, Japan, 23–29 January 1975; Araki, H., Ed.; Springer: Berlin, Heidelberg, Germany, 1975; pp. 467–471. [Google Scholar] [CrossRef]
Combes, J.-M.; Seiler, R. Spectral properties of atomic and molecular systems. In Quantum Dynamics of Molecules. The New Experimental Challenge to Theorists; Woolley, R.G., Ed.; Plenum Press/Springer Science+Business Media: New York, NY, USA, 1980; pp. 435–482. [Google Scholar] [CrossRef]
Messiah, A. Quantum Mechanics; Dover Publications, Inc.: Mineola, New York, USA, 1999; Available online: https://archive.org/details/quantummechanics0000mess (accessed on 4 November 2025).
Lathouwers, L.; van Leuven, P. Generator coordinate theory of nuclear motion in molecules. Adv. Chem. Phys. 1982, 49, 115–189. [Google Scholar] [CrossRef]
Broeckhove, J.; Lathouwers, L.; van Leuven, P. The generator coordinate approximation for molecules: A review. J. Math. Chem. 1991, 6, 207–241. [Google Scholar] [CrossRef]
Combes, J.-M. The Born–Oppenheimer approximation. In The Schrödinger Equation; Thirring, W., Urban, P., Eds.; Springer: Vienna, Austria, 1977; pp. 139–159. [Google Scholar] [CrossRef]
Combes, J.-M.; Duclos, P.; Seiler, R. The Born–Oppenheimer approximation. In Rigorous Atomic and Molecular Physics; Velo, G., Wightman, A.S., Eds.; Plenum Press/Springer: New York, NY, USA, 1981; pp. 185–212. [Google Scholar] [CrossRef]
Sutcliffe, B.T.; Woolley, R.G. Molecular structure calculations without clamping the nuclei. Phys. Chem. Chem. Phys. 2005, 7, 3664–3676. [Google Scholar] [CrossRef]
Cafiero, M.; Bubin, S.; Adamowicz, L. Non Born–Oppenheimer calculations of atoms and molecules. Phys. Chem. Chem. Phys. 2003, 5, 1491–1501. [Google Scholar] [CrossRef]
Nasiri, S.; Bubin, S.; Adamowicz, L. Treating the motion of nuclei and electrons in atomic and molecular quantum mechanical calculations on an equal footing: Non–Born–Oppenheimer quantum chemistry. Adv. Quant. Chem. 2020, 81, 143–166. [Google Scholar] [CrossRef]
Lang, L.; Cezar, H.M.; Adamowicz, L.; Pedersen, T.B. Quantum definition of molecular structure. J. Amer. Chem. Soc. 2024, 146, 1760–1764. [Google Scholar] [CrossRef]
Schawlow, A. 1981. Available online: https://www.nobelprize.org/prizes/physics/1981/schawlow/lecture (accessed on 4 November 2025).
Bublitz, J.; Kastner, J.H.; Hily-Blant, P.; Forveille, T.; Santander-Garcia, M.; Alcolea, J.; Bujarrabal, V. Sampling molecular gas in the Helix planetary nebula: Variation in HNC/HCN with UV flux. Astron. Astrophys. 2022, 659, A197. [Google Scholar] [CrossRef]
Sutcliffe, B.T.; Woolley, R.G. Comment on “Molecular structure in non-Born–Oppenheimer quantum mechanics”. Chem. Phys. Lett. 2005, 408, 445–447. [Google Scholar]
Löwdin, P.-O. On the long way from the Coulombic Hamiltonian to the electronic structure of molecules. Pure Appl. Chem. 1989, 61, 2065–2074. [Google Scholar] [CrossRef]
Anderson, P.W. Basic Notions in Condensed Matter Physics; CRC Press/Taylor & Francis Group: Boca Raton, FL, 1984. [Google Scholar] [CrossRef]
Roman, P. Advanced Quantum Theory: An Outline of the Fundamental Ideas; Addison–Wesley Publishing Company: Reading, MA, USA, 1965; Available online: https://archive.org/details/advancedquantumt0000roma (accessed on 4 November 2025).
van’t Hoff, J.H. La Chimie dans l’Espace; P.M. Bazendijk: Rotterdam, The Netherlands, 1875. Available online: https://books.google.ch/books?id=5ho4AQAAIAAJ (accessed on 4 November 2025).
Lewis, G.N. Valence and the Structure of Atoms and Molecules; Chemical Catalog Company, Inc.: New York, NY, USA, 1923; Available online: https://www.scribd.com/document/311289340/Lewis-Valence-and-the-Structure-of-Atoms-and-Molecules-ACS (accessed on 4 November 2025).
Roman, P. Some Modern Mathematics for Physicists and Other Outsiders: An Introduction to Algebra, Topology, and Functional Analysis. Volume 2: Functional Analysis with Applications; Pergamon Press, Inc.: Elmsford, NY, USA, 1975; Available online: https://archive.org/details/somemodernmathem0002roma (accessed on 4 November 2025).
Dirac, P.A.M. The Principles of Quantum Mechanics; Clarendon Press/Oxford University Press: Oxford, UK, 1958; Available online: https://www.scribd.com/document/395899045/DIRAC-The-Principles-of-Quantum-Mechanics (accessed on 4 November 2025).
Schwartz, L. Sur l’impossibilité de la multiplication des distributions. Comp. Rend. Acad. Sci. 1954, 239, 847–848. Available online: http://sites.mathdoc.fr/OCLS/ (accessed on 4 November 2025).
Gsponer, A. The sequence of ideas in a re-discovery of the Colombeau algebra. arXiv 2008, arXiv:0807.0529. [Google Scholar] [CrossRef]
Hörmann, G.; Kunzinger, M. Microlocal properties of basic operations in Colombeau algebras. J. Math. Anal. Appl. 2001, 261, 254–270. [Google Scholar] [CrossRef]
Delamotte, B. A hint of renormalization. Am. J. Phys. 2004, 72, 170–184. [Google Scholar] [CrossRef]
Colombeau, J.F. Elementary Introduction to New Generalised Functions; North-Holland: Amsterdam, The Netherlands, 1985; Available online: https://www.sciencedirect.com/bookseries/north-holland-mathematics-studies/vol/113/suppl/C (accessed on 4 November 2025).
Colombeau, J.F. Multiplication of Distributions. A Tool in Mathematics, Numerical Engineering and Theoretical Physics; Springer: Berlin/Heidelberg, Germany, 1992. [Google Scholar] [CrossRef]
Colombeau, J.F. Generalized functions as a tool for nonsmooth nonlinear problems in mathematics and physics. arXiv 2006, arXiv:math-ph/0612077. [Google Scholar] [CrossRef]

Figure 1. The spectrum of the rotated Hamiltonian,

{\bar{H}}_{λ} (θ)

;

a_{0}, a_{1}

: discrete eigenvalues of

H_{0}

;

a_{0}

is the ground state, b: continuum embedded eigenvalues, c: thresholds, d: discrete, complex eigenvalue (resonance), e: complex threshold of

H_{λ} (θ)

.

Σ

is the start of the essential spectrum of the atomic system in the absence of the field; the solid line is the spectrum of the electromagnetic field.

Θ = ℑ θ

.

Figure 1. The spectrum of the rotated Hamiltonian,

{\bar{H}}_{λ} (θ)

;

a_{0}, a_{1}

: discrete eigenvalues of

H_{0}

;

a_{0}

is the ground state, b: continuum embedded eigenvalues, c: thresholds, d: discrete, complex eigenvalue (resonance), e: complex threshold of

H_{λ} (θ)

.

Σ

is the start of the essential spectrum of the atomic system in the absence of the field; the solid line is the spectrum of the electromagnetic field.

Θ = ℑ θ

.

Figure 2. Potential energy curves for a diatomic molecule.

Figure 3. Definition of translation invariant coordinates for a diatomic molecule.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Woolley, R.G. Non-Relativistic Quantum Electrodynamics and the Coulomb Interaction. Physics 2026, 8, 20. https://doi.org/10.3390/physics8010020

AMA Style

Woolley RG. Non-Relativistic Quantum Electrodynamics and the Coulomb Interaction. Physics. 2026; 8(1):20. https://doi.org/10.3390/physics8010020

Chicago/Turabian Style

Woolley, R. Guy. 2026. "Non-Relativistic Quantum Electrodynamics and the Coulomb Interaction" Physics 8, no. 1: 20. https://doi.org/10.3390/physics8010020

APA Style

Woolley, R. G. (2026). Non-Relativistic Quantum Electrodynamics and the Coulomb Interaction. Physics, 8(1), 20. https://doi.org/10.3390/physics8010020

Article Menu

Non-Relativistic Quantum Electrodynamics and the Coulomb Interaction

Abstract

1. Introduction

2. Classical Electromagnetism

3. The Electric Polarisation Field

4. Classical Electrodynamics in Lagrangian Form

5. Classical Electrodynamics in Hamiltonian Form

6. Quantisation

7. The Hamiltonian

8. The S-Matrix and Gauge Invariance

9. The Coulomb Hamiltonian and Chemical Physics

Funding

Conflicts of Interest

Appendix A. Generalised Functions

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI