8.1. Link to Local Quantum Field Theory: The Cluster Property
What we have discussed thus far very different from what is usually laid out in textbooks on relativistic quantum field theory, which consider local theories. The main idea in the subsequent presentations is to emphasize that locality is not essential in most cases, and that the fields themselves are irrelevant. We do not know what should be called ’fields’ in the approach we are discussing here, even though all of quantum field theory is present.
We want to begin by establishing a connection between what we are currently discussing and what is commonly referred to as local relativistic quantum field theory.
In the axiomatic approach to local theory, there are different systems of axioms, starting with Wightman’s axioms, where local fields which are generalized operator functions represent the main objects. This is not very convenient, as while these fields are local, they are generalized functions; if we integrate them, we obtain ordinary operators. Then they are no longer local, and in a sense are quasilocal, i.e., concentrated in certain domains. We do not discuss Wightman’s axioms further.
We now discuss the system of axioms belonging to Araki, Haag, and Kastler. This system considers fields concentrated in some open subset of Minkowski space. It is assumed that such fields form an algebra of operators acting in a Hilbert space; the algebra should be closed with respect to weak convergence, though this is not essential. These operators should act in the representation space of a unitary representation of the Poincaré group .
It is assumed that for each bounded domain (bounded open subset) of Minkowski space we have an algebra of operators acting in Hilbert space such that:
When the domain becomes larger (), the algebra becomes larger ();
The action of the Poincaré group on algebras agrees with the action on domains if ;
If the space–time interval between the points of domains and is space-like, then the operators belonging to the algebra commute with operators belonging to the algebra (roughly speaking, this means that we cannot have a causal relation between observables separated by a space-like interval);
The ground state of the energy operator is invariant with respect to the Poincaré group (with the Poincaré group representation, we can consider the energy operator (Hamiltonian) and momentum operators as infinitesimal generators of temporal and spatial translations, respectively);
The vector corresponding to the ground state is cyclic with respect to the union of all algebras .
This is the axiomatic relativistic local quantum field theory.
In the above, a particle is defined as an irreducible subrepresentation of the Poincaré group as represented in the space .
Let us return now to the definition of scattering in the algebraic approach. The consideration in Lecture 7 was based on axioms which are not easy to check. Here, we will impose requirements that are much easier to check. In particular, these requirements are fulfilled in relativistic local theory.
The starting point, as before, is an associative algebra with involution (a *-algebra). Space–time translations are automorphisms of this algebra.
Recall that non-normalized states correspond to positive linear functionals on the algebra and form a cone . We will work here with non-normalized states. A translation-invariant stationary state will always be denoted as . Excitations of states are elements of the pre-Hilbert space , which is constructed from using the GNS construction.
In the algebra with which we started there is no norm; however, because it is represented in the pre-Hilbert space and its completion (the Hilbert space ), we can consider a normed algebra consisting of operators . Moreover, we can work with the completion of the algebra with respect to this norm, though this is not necessary.
Now let us take an element
A of the algebra
which is represented by a bounded operator
in Hilbert space
. We can consider both temporal and spatial translations of this operator. The result will be denoted by
. Moreover, we can average such an operator with a smooth and fast decreasing function
:
It is possible to shift the operator
B in time and space:
One can differentiate under the sign of the integral. As the function
is assumed to be smooth, one can differentiate as many times as desired. Here, we always work with operators of the form (
60), and we call them smooth.
We now consider asymptotically commutative algebras. In other words, we require the commutator of a shifted operator with another operator to become small at large spatial shifts. This can be formalized in different ways. We do this here in such a way that it is instantly clear that our condition is satisfied in the axiomatics of Araki, Haag, and Kastler. Specifically, we require that the norm of the commutator of a shifted operator with respect to another operator corresponding to an element of the algebra decreases faster than any power of when . We will impose the same condition on where the dot denotes the time derivative. All operators are smooth.
In the axiomatics of Araki, Haag, and Kastler this is always fulfilled, as after a large spatial shift the space-time interval between the corresponding domains becomes space-like; therefore, in this case we can say that. starting from some point, the commutator we consider is equal to zero (and as such decreases faster than any power).
Another definition of asymptotic commutativity is the condition
where
is a polynomial and
n is arbitrary (strong asymptotic commutativity). This condition is satisfied in the Araki, Haag, and Kastler axiomatics if the mass spectrum is bounded from below by a positive number.
In addition to asymptotic commutativity, we want to impose the cluster property on the state
In its simplest form, this means that
where
is small for large
To formulate the cluster property in a more general form we require the notion of a correlation function, which is a generalization of the Wightman function from relativistic quantum field theory.
For this, we take some elements
and shift them in both space and time. By multiplying them, we obtain an element of the algebra, after which we apply
or, which is the same, we take the average (the expectation value) of this product in the state
. The result is
which is a correlation function.
It is useful to define the notion of a truncated correlation function
This is done somewhat formally using an inductive formula linking truncated correlation functions to regular correlation functions:
Here,
denotes the set of all partitions of the set
into subsets
s (denoted by
), the number of elements in the subset
is denoted by
, and
denotes the truncated correlation function with arguments
, where
. This formula expresses correlation functions in terms of truncated functions for all possible partitions of the set of indices.
When there are only two operators, the truncated correlation function has the form
As
is translation-invariant and stationary, both usual and truncated correlation functions depend only on the differences
. We can say that the cluster property is satisfied if the truncated correlation functions become small at
. Smallness can be understood in different ways; here, we mean the strongest condition, namely, that at fixed
the functions tend to zero faster than any power of the difference
. More precisely, we assume that
where
s is any natural number and
is a polynomial function of times
.
We can proceed to the momentum representation by applying the Fourier transform with respect to spatial variables. The invariance with respect to spatial translations leads to the appearance of the
-function of the sum of momenta
. It follows from the cluster property that the truncated correlation function in the momentum representation is a smooth function of the momenta multiplied by the
-function:
Note that the Fourier transform of a fast-decreasing function is smooth.
In relativistic quantum theory, the cluster property is satisfied if the particle masses are bounded from below by a positive number (the mass gap).
8.2. Green’s Functions: Connection to the Scattering Matrix
A correlation function is defined as , where In our definition of Green’s function, we replaced M with the chronological product, where the same factors are ordered by time in descending order. This is what is called the chronological product (it is not defined when some times coincide, though this is irrelevant to our considerations). We can say that the Green’s function is the average (expectation value) of the chronological product with respect to . Equivalently, we can say that we are taking the expectation value of this product with respect to the vector corresponding to in the GNS construction.
We obtain the function
which is called the Green’s function in the
-representation (i.e., in the coordinate representation).
As always, we can proceed to the momentum representation by taking the Fourier transform over . This is what is called the -representation (momentum and time). In addition, we can take the (inverse) Fourier transform with respect to the time variable, in which case the Green’s functions will be in the -representation, where the main variables are momenta and energies. We will need all of these representations.
Due to translational invariance, Green’s function in the -representation depends on the differences ; therefore, we have the factor in the -representation, which corresponds to the momentum conservation law. In the -representation we additionally have the factor , which corresponds to the energy conservation law.
Let us consider the poles of Green’s function in the
-representation. It should be noted that we always ignore
-functions when talking about the poles. In particular, when Green’s function includes only two operators, in the
-representation we have two momenta, two energies, and
-functions depending on the momenta and energies:
The function
depends on the variable
and the variable
. It is important to note that the poles of such two-point Green’s functions with respect to energy at a fixed momentum correspond to particles. These poles depend on the momentum, and the corresponding function
provides the dispersion law for particles (the dependence of energy on momentum). These well-known facts can be easily deduced from of the reasoning that we use below.
We will prove here that in order to find the scattering amplitudes one should consider the asymptotic behavior of Green’s function in the -representation when . This is the first and most basic observation. The other observation is that this asymptotic behavior in the -representation is governed by the poles in the -representation. More precisely, the residues in these poles describe the asymptotics. This is called “the on-shell value of the Green function”.
There is a well-known mathematical fact in that if the asymptotic behavior of a function at has the form , or put another way, if there is a limit , then the (inverse) Fourier transform has poles at the points with residues ; in other words, the limit corresponds to the residues in the poles and the exponents correspond to poles. The poles are slightly shifted in the complex plane from the real axis in either the up or down direction. This is an extremely important observation.
One can either look at the poles in the -representation or look at the asymptotics in the -representation. We show that the calculation of the scattering amplitudes is reduced to finding out the asymptotic behavior of the Green’s functions in the -representation. Turning to the -representation, we can say that the scattering amplitudes are expressed in terms of the on-shell values of the Green’s functions. This is the Lehmann–Simanzyk–Zimmermann (LSZ) formula.
Below, we will prove the LSZ formula under certain conditions. First, we assume that the theory has an interpretation in terms of particles. This means that the Møller matrices are unitary. Both and provide unitary equivalence between the free Hamiltonian in the asymptotic space and the Hamiltonian in the space obtained with the GNS procedure. Second, we assume that the conservation laws for energy and momentum guarantee the stability of particles. The second condition will be relaxed in the next lecture.
As we want to simplify the notation, we discuss the case in which there is only one type of particle. Recall that we previously considered a generalized function
corresponding to the state of a particle with a given momentum
and that this state is an eigenvector for both momentum and energy operators. The Hamiltonian acts on
as multiplication by the function
(dispersion law):
We must remember here that
is a generalized function, and does not really exist. In order for all of this to make exact mathematical sense, we should integrate it with some test function
to obtain a vector
Now, we want to make the assumption that the one-particle spectrum does not overlap with the multi-particle spectrum.
Let us formulate this assumption more precisely. We denote by the one-dimensional subspace containing the vector , by the smallest closed subspace of containing all vectors (one-particle space), and by the orthogonal complement of the direct sum (multiparticle space). A corresponding decomposition exists in asymptotic space. We assume that the joint spectra of the Hamiltonian and the momentum operator in these three spaces do not overlap.
The asymptotic Hamiltonian is free; it (and hence,
) has a spectrum completely determined by the function
. The energies of multiparticle excitations are simply the sums
, while the corresponding momenta are
. If we want to say that the one-particle spectrum does not overlap with the multi-particle spectrum, we must require the inequality
This means that particles with momentum
cannot decay into particles with momenta
. The conservation laws forbid decay.
Now, we will formulate the LSZ formula. To do this, we fix some elements
of the algebra
. Recall that we are working with smooth elements, though this is not especially important here. In addition, it is required that we obtain a non-zero vector by applying the operator
to the vector
(which in relativistic quantum theory is interpreted as the physical vacuum) and projecting to one-particle space. More precisely, we require the projection of the vector
to be a one-particle state of the form
where
is a function which does vanish anywhere. The projection of this vector onto the vector
must vanish.
Let us consider Green’s functions containing both the elements
and their adjoint elements
. We take the Green’s function in the
-representation:
Then, we proceed to the
-representation. It is convenient to change the sign of the variables
and
to
. We multiply the Green’s function in the
-representation by the expression
where we have introduced the notation
Then, we take the limit for and for .
Only the poles contribute to the limit; in other words, the calculation boils down to taking the residues of the poles.
We can carry out this procedure in two steps. First, we multiply the Green’s function by
and take the limit
for
and
for
.
At the end, we multiply by for and by if . In physics, this is called the renormalization of the wave function. In the case where these factors are not included we can talk about on-shell Green’s functions, while if they are included we say that we are considering normalized on-shell Green’s functions.
The basic statement in the approach of Lehmann, Simanzyk, and Zimmermann is that the normalized on-shell Green’s function provides the scattering amplitude. To prove this, we will first consider the case where the operators simply provide one-particle states i.e., there is no need to project. We call these good operators. At the end of the lecture, we will explain that the general case can be reduced to this particular case.
Thus far, we have considered the case in which there is only one type of particles. Let us now consider the case in which there are many types of particles, in other words, there are many functions
which are eigenvectorss for both momentum and energy:
which have different dispersion laws
provided by smooth functions. As always,
are generalized functions, i.e., we should integrate them with test functions to obtain vectors from
. We consider test functions from the space
of smooth fast-decreasing functions. To guarantee that time shifts are well-defined in the space
, we should assume that the functions
grow polynomially at most.
As already mentioned, we will work with good operators
(operators which are smooth and transform vectors
into one-particle states
). Now, we define the operator
depending on the function
as follows:
where the function
is obtained as the Fourier transform of the function
with respect to the momentum variable.
Similar operators were considered in
Section 7.2. They have the property that, when applying them to
, we obtain a
t-independent one-particle state
hence,
where the dot stands for the time derivative. (in
Section 7.2, the function
was equal to 1). In general, what was said in
Section 7.2 can be repeated here as well. The fact that the resulting state is independent of time is the result of a formal calculation. The calculations become quite simple if we introduce operators
If this operator is applied to , we obtain a one-particle state that does not depend on t.
Now, we are repeating the considerations of
Section 7.2, though the notations have changed because we do not want to work with elementary spaces; thus, we write the indices explicitly.
We introduce a vector
where it is assumed that the functions
have compact supports.
Now, as in
Section 7.2, we consider vectors
, which can be interpreted as velocities. We denote by
an open set containing all possible velocities
, where
belongs to the support of the function
. We require that all these sets do not overlap, and we will call the functions
non-overlapping. This means that all classical velocities are different; therefore, the wave packets are moving in different directions. Then, as explained in
Section 6.3, the corresponding wave functions almost do not overlap in the coordinate representation (i.e., the essential supports do not overlap).
Now, taking the limit
, we will prove that the vector
has a limit, which we denote by
The proof uses the same reasoning as in
Section 7.2. Again, we assume that
(i.e., all times coincide). In order to prove that there is a limit, we should prove that the derivative with respect to
t is a summable function. This condition is satisfied. By definition, the vector
is the result of repeatedly applying the operators
to
. When we differentiate this expression with respect to
t, we have a dot (denoting the time derivative) over one of the operators
. We can move the operator with the dot to the right using the asymptotic commutativity and (
39) (only if we work with non-overlapping functions). We obtain additional summands that are summable functions of
t. This operator with a dot applied to
gives zero due to (
61), meaning that there is a limit.
Because the limit exists, we can define Møller matrices. To do this, we introduce the asymptotic space
as a Fock representation of the operators
and define the Møller matrices
and
as operators defined on a subset of
and taking values in
, following the formula
where
is the Fock vacuum. This is the same formula as in the last lecture, except with the difference that we now have factors
(recall here that a good operator
acting on
provides
). The Møller matrices are defined on a dense subspace of the asymptotic Hilbert space
.
It can be proved, as we do now, that Møller matrices provide isometric embeddings of the asymptotic space
into the space
. The physical meaning of Møller matrices can be understood from the following formula (which was written in the last lecture with different notation):
This formula means that when we consider evolution in the space , the action of the evolution operator on the vector in the limit corresponds in the asymptotic space to the evolution governed by a free Hamiltonian. In other words, the evolution of the vector for large t corresponds to the evolution of a system of n distant particles with non-overlapping wave functions .
Our definition of can be ambiguous. For example, we can use different good operators, and it is not clear whether we obtain the same answer. However, we can prove that the answer does not depend on our choice. We can derive from the cluster property that are isometric operators, that is, they preserve the norm and the scalar product. Such operators cannot be multivalued, as if two vectors coincide, the distance between them is 0; hence, two coinciding vectors must go to coinciding vectors. At the same time, we can see that the vector does not change when the arguments and are permuted.
The main line of proof is as follows.
To define the Møller matrices, we used the vectors
specified by Formula (
62). Note that according to (
62) such a vector is obtained by repeatedly applying the operators
B to
. It is easy to see that the scalar product of two such vectors can be expressed in terms of correlation functions defined as the average values (expectation values) of products of the operators
B and
. The correlation functions are expressed in terms of truncated correlation functions, and in truncated correlation functions only two-point correlation functions can survive in the limit
if we require the cluster property. This remark relates the scalar product of two vectors of the form
to the scalar product in the asymptotic space.
This allows us to say that a Møller matrix is an isometric mapping.
Having introduced the notion of Møller matrix, we now introduce the notions of the
-operator and
-operator:
Again, we do not write an index here describing the types of particles.
The limit in (
63) exists on the set of all vectors of the form
provided that
is a non-overlapping family of functions. Interestingly, when the dimension of the space
, under certain conditions this limit exists without the non-overlapping condition. This is insignificant for our purposes, because the non-overlapping condition provides a limit on a dense subset, which is sufficient.
On the basis of what we have said above, we can explicitly write out how the operators we have defined act on the
-states:
These formulas can be seen as definitions of operators and . Roughly speaking, the operators add one function to and their associated operators destroy one of these functions.
If the operators
and
are unitary, we can say that the theory has an interpretation in terms of particles. In this case, as well as in the more general case when the image of
coincides with the image of
, we can define the scattering matrix (
S-matrix)
as a unitary operator in the asymptotic space
. The asymptotic space is a Fock space. It has a generalized basis
In this basis, the matrix elements of the unitary operator
S (the scattering amplitudes) can be expressed in in terms of
-operators and
-operators. These are the same matrix elements for which the squares provide the effective scattering cross-sections. We obtain the following formula:
This follows directly from the definition of
-operators and
-operators. In Formula (
63) and in the following, we omit the numerical coefficients
The above formula is proved only for the case when all the momentum values
are different. More precisely, we must assume that all vectors
are different. When the function
is strictly convex, it is sufficient to assume that
are different. While this is not an essential constraint, it is present. Formula (
63) should be understood in the sense of generalized functions. This means that the set of functions
with non-overlapping subsets
should be taken as test functions.
We now present the Formula (
63) in different ways:
Recalling that we defined the S-matrix (scattering matrix) taking limits
and using Formula (
63), we arrive at the following representation:
where
We can write a more general formula
where all
are different good operators and
.
A very important observation follows from the non-overlapping condition: in the limit , the order of factors is irrelevant both in the group with times tending to plus-infinity and in the group with times tending to minus-infinity. This means that for large times we can rearrange these operators. In particular, we can consider them to be ordered by time. This means that we can regard the expression under the sign of the limit as a Green’s function. This ends the proof of the statement that the matrix element of the scattering matrix can be expressed in terms of the asymptotic behavior of a Green’s function.
We can express the operators
in terms of
and obtain the following result:
We obtain the following formula for the matrix elements of the scattering matrix:
This formula tells us that, starting with good operators, we can express the scattering matrix in terms of Green’s functions in the -representation, or more precisely, in terms of their asymptotics at for and for . We have factors , which are exactly the same that were introduced in order to obtain normalized Green’s functions. The fact that we are considering asymptotics means that we are now taking on-shell Green’s functions in the energy representation. The fact that we have factors means that we have obtained normalized Green’s functions, which for our purposes is the end of the story.
We have provided a proof of the LSZ formula for good operators. From this, as we have said, one can draw the conclusion that the same is true for a much broader class of operators. We will explain this in a situation where there is only one type of particle.
In the approach of Lehmann, Simanzyk, and Zimmermann, the operators are almost arbitrary. It is only necessary that the projection of the vector on the one-particle states is nonzero and that the projection of this vector on the vector vanishes.
What is important to us is that in the definition of the on-shell Green’s function these operators can be replaced by smooth operators It is easy to confirm that this does not change the normalized on-shell Green’s functions. The proof is based on the remark that can be obtained as a convolution of with . This is the first observation; the second observation is that for an appropriate choice of one can consider as good operators. Specifically, we can take in such a way that the support of its Fourier transform does not intersect with the multiparticle spectrum and does not contain zero (we assume that the one-particle spectrum does not intersect the multiparticle spectrum). In this case, we automatically obtain a good operator.
Let us sketch the proof of this fact. We have already said that the operator is obtained from by convolution with . In the -representation, the convolution turns into multiplication via the Fourier transform of . If we consider the spectrum of the energy and momentum operators, multiplication by the function in the -representation kills all points of the spectrum where this function is equal to zero. The function we consider here kills the multi-particle spectrum, and we obtain a good operator.