1. Introduction
We discuss a quantization of the Yang–Mills theory with an internal symmetry
. When some matter fields are added, then the model can be considered as a candidate for a unified theory of all interactions. The conceptual origin of such an approach to unification came from the paper of Yang [
1]. Soon it was followed by Cho [
2], Freund [
3], MacDowell, and Mansouri [
4]. The non-compact internal gauge group also appears in models of supergravity [
5,
6]. The
model with
(
in [
7],
in [
8] for string-inspired higher-dimensional theories) could encompass gravity within the Georgi–Glashow [
9] or Pati–Salam [
10] extensions of the Standard Model (SM) based on
internal symmetry group. It is shown in [
7] that the Brout–Englert–Higgs symmetry-breaking mechanism will break
symmetry to
in agreement with the SM phenomenology.
is needed (according to [
8]) in order to obtain a correct content of Weyl–Majorana spinors in models resulting from higher-dimensional superstring theories. In such a model, the
connection is decomposed into the
Lorentz connection, the vierbein and the compact gauge fields, which are supposed (after symmetry breaking) to describe weak and strong interactions of SM. The vierbein is used to describe the metric. Then, both the vierbein and the
connection are dynamical variables. The decomposition seems ad hoc. There is a scheme where the metric appears as a result of
or
symmetry breaking by scalar fields [
11,
12] (see also [
13]). With the vierbein-connection decomposition, it has been noticed early in the study of the
model [
14] that the Yang–Mills Lagrangian can be decomposed into the (non-compact)
Yang–Mills part, the Einstein–Hilbert Lagrangian (in the Palatini form), the cosmological term, and the
(compact) Yang–Mills Lagrangian. Unfortunately, there is an arbitrariness in the coupling constants in this decomposition. Without a coupling to matter fields, the values of these constants (crucial for the phenomenology) cannot be fixed. In addition, the bare constants undergo renormalization (depending on the matter field content) in quantum theory. The project encounters many other problems in the composition of matter fields in the realization of SO(n-4) symmetry breaking for an agreement with the phenomenology of particle physics (for recent reviews, see [
8,
15]). Concerning the weak interactions, the crucial CP violation is described by a set of left and right Weyl fermions (mirror fermion problem [
16]). The problem of the composition of fermions in SM may be related to gravity, as discussed in [
17,
18,
19]. The formulation of classical and quantum gravity in Ashtekar variables [
20,
21,
22,
23,
24] uses (anti)self-dual connections which are not invariant under spatial reflections. Their coupling to fermions violates parity invariance (on the other hand, determinants of Weyl fermions give parity non-invariant Chern–Simons terms [
25,
26]). A calculation of the effective action from a coupling of the
connection to fermions can lead to new terms in the model of weak interactions.
The Yang–Mills theory with a non-compact symmetry group [
14,
19,
27,
28,
29,
30] is renormalizable, asymptotically free, and infrared-unstable. The infrared instability can be interpreted as a confinement of the
connection [
28], which could justify the classical limit of the Yang–Mills theory to its Einstein form. There are some difficult problems in the Yang–Mills interpretation of gravity. The non-compactness of the internal symmetry group has as a consequence that the energy is not bounded from below. This does not lead to difficulties in the Feynman integral approach to quantum Yang–Mills theory. However, the unitarity of the model is questionable (see a discussion of a possible solution of the problem in [
19]). An approach based on the Euclidean formulation does not answer this question. The analytic continuation of
to
allows us to quantize the model on the lattice in the Euclidean framework [
31], but the continuation to Minkowski space-time will lead to difficulties. In the canonical quantization, the negative energy could be interpreted as the gravitational energy, but this interpretation needs further investigation. There is still the question of the general coordinate invariance and the equivalence principle in gauge theories of gravity, which require an explanation. In three-dimensional gravity, the gauge invariance and the general coordinate invariance can be unified [
32].
In this paper, we concentrate on the gauge field self-interaction. In
Section 2, we show that if the Lagrangian is represented as a quadratic form plus the Chern–Simons divergence term; then, a Chern–Simons wave function
does not change in time; i.e., it is an eigenstate. As a consequence, the Schrödinger equation of the wave function
can be expressed as a diffusion equation for
with an (anti)self-dual drift. Then, in the leading order of the
ℏ expansion, the solution of the Schrödinger equation is expressed by the solution of the self-duality equation. The higher orders can be obtained from a solution of a stochastic self-duality equation. In
Section 3, we explain the method in the Abelian model. The Chern–Simons (CS) state is not invariant under the spatial reflection
. Such states may appear from an interaction with Weyl fermions in physical models of weak interactions, but we do not discuss such models (see [
17,
19]). We calculate correlation functions in CS states. The CS wave function is not normalizable [
33]. We show in Abelian gauge theories that a rigorous calculation of correlation functions with a regularized CS wave function has a limit that can be considered as an analytic continuation of formal Gaussian calculations [
34,
35,
36]. Such states are of interest in condensed matter physics in 2 + 1 dimensions (anyons [
37], charged vortices with fractional statistics [
38]). We expect that they may be relevant in models with gravitational and weak interactions [
19]. The Chern–Simons Lagrangian defines a topological field theory related to quantum gravity [
39].
Our main concern in this paper is the problem of the undue number of variables in the
Yang–Mills model (
Section 4) as compared with the Palatini form of the Einstein–Hilbert action. The
Yang–Mills Lagrangian contains both the vierbeins and the
connection as dynamical variables. It is suggested in [
27,
28,
29] that, owing to the infrared confinement, the connection being confined does not play any dynamical role. In the standard Palatini approach, the functional integration over the Lorentz connection leads to the Einstein–Hilbert action. The procedure is equivalent to the expression of the connection in terms of vierbeins on the basis of classical equations of motion. In
Section 5, we perform the functional integral in the
ℏ (loop) expansion over the connection in the
Yang–Mills model. As a result, we obtain the effective action, which consists of the classical action and the quantum one-loop correction. We show that if the square of the curvature of the Lorentz
connection is neglected, then until the first order in the expansion in
ℏ, we obtain the classical Einstein action (in Palatini form) plus the one-loop quantum correction of order
ℏ. The result shows that in the loop expansion, we can consistently control the Palatini (infrared) approximation and the ultraviolet behavior of the model.
In
Section 6 we discuss an expansion in
ℏ in the
Yang–Mills model by a stochastic perturbation of the self-duality equation. Then, the correlation functions of the Wilson loops can be calculated in regularized CS states in a perturbation expansion.
2. A Total Divergence Term in the Path Integral
The time evolution of the state
dependent on the variable
A describing the system is given by the path integral formula (integration is over paths
starting from
A, and
is the initial condition for the Schrödinger equation)
We assume that the Lagrangian can be expressed in the form
where
does not depend on
A. Then, the path integral (1) reads (we assume that
)
Let
We express the initial state
in the form
Then
If the Jacobian
, then we can change variables
so that
where
Q is a Gaussian variable. We can express this change of variables as
where
(the Brownian motion) has Gaussian distribution with the functional measure
If we choose
then
and Equation (
4) reads
Hence,
for
, where the factor
Z does not depend on
A. If the Jacobian
(as will be the case in our models), then the integral on the RHS of Equation (
6) is a time-independent constant
. It follows from Equation (
6) that
does not change over time.
The Formula (
3) can be considered either as a similarity transformation of states in quantum mechanics or as a result of a change of the canonical formalism resulting from the presence of an extra time derivative in the Lagrangian [
40,
41]. In both interpretations, the modified Schrödinger equation for the Lagrangian quadratic in velocity reads
where
is the quantum realization of the canonical momentum.
3. Quantization of an Electromagnetic Field on a Lorentzian Manifold
We consider first the Lagrangian of the electromagnetic field
defined on a Lorentzian manifold with the metric
(
g denotes the determinant of the metric). In Equation (
8),
where on the LHS we have the tensor density
and on the RHS we have the Levi-Civita antisymmetric symbol on the Minkowski space-time defined by
,
.
The last term in the Lagrangian (8) is of the form (2) as
The self-dual variable
can be chosen as the Gaussian variable
Q. As a consequence of Equation (
6), the Chern–Simons state
is an eigenstate. It does not change in time because the Jacobian
. The term
does not depend on the metric (it is a topological invariant, the Chern–Simons term).
We prove
, independently constructing in the canonical formalism the Hamiltonian
H for the quantum electromagnetic field on a globally hyperbolic manifold. Then, we can choose coordinates [
42] such that the metric takes the form
(we use Greek letters for space-time indices and Latin letters for spatial indices). Then, the canonical momentum in the
gauge is
The Hamiltonian is
with the canonical momentum
We prove that on a globally hyperbolic manifold in the metric (12),
. This condition requires
The general proof needs some strenuous calculations [
41]. It is simple in the case of the orthogonal isotropic metric
In this metric on the LHS of Equation (
16)
whereas on the RHSk we have
Hence, (18) is equal to (19).
Using the transformation (4), we obtain the Schrödinger equation for
(a version of Equation (
7)):
If the term
in the limit
is neglected, then the solution of Equation (
20) is expressed by the solution
(with the initial condition
) of the self-duality equation
Then,
(written for a time-independent metric).
The transformation to Gaussian variables
Q discussed in Equation (
5) can be expressed as a stochastic equation corresponding to the
state. In the time-independent isotropic orthogonal metric, it reads (see refs. [
43,
44,
45,
46] for a general framework)
where the covariance of the Brownian motion is
Then, Equation (
22) (after averaging over the Brownian motion) determines an exact solution of Equation (
20). The solution of the self-duality Equation (
21) gives the leading order behavior of
. The corrections can be calculated by means of an averaging over the Brownian motion.
The Chern–Simons wave function
is not square-integrable. We need
in Equation (
4) to decay sufficiently fast in order to make
square-integrable. It turns out that
may preserve some crucial properties of
after a multiplication by
(treated as a regularization). As an example, we consider QED on the Minkowski space-time with
where (,) denotes the
scalar product,
and
. We calculate the correlation functions (the integral over
is well-defined for
) at
in the state
. We obtain
Equation (
26) is singular at
. It makes sense at
. We may treat the RHS of Equation (
26) for
as an analytic continuation in the complex
plane of the integral on the LHS of Equation (
26). A definition of the Gaussian integral of a function non-integrable in the Riemann sense by means of an analytic continuation appears in the ordinary Feynman oscillatory integrals with the Chern–Simons Lagrangian [
34,
35,
36].
We calculate the loop expectation values in the regularized CS states for the loops
and
where
denotes the Gauss linking number. It follows from Equation (
22) that after the time evolution, we obtain the linking number up to the terms of order
.
4. Yang–Mills Gauge Theory on a Lorentzian Manifold
We consider an
-valued connection 1-form
where
with
are the generators of rotations in
planes of the
group,
,
,
, and
are the generators of
rotations. We have
where
is the Minkowski metric
in n + 1 dimensions.
We may introduce a dimensional parameter
rescaling
. This is equivalent to a rescaling of the
generators (
) in the commutators
so that
where
satisfy the standard commutation relations of the Lorentz group for the rotations in the
plane. When
, then
will contract to the generators of momenta
of the Poincare group. The curvature of the connection
can be defined by the commutator of covariant derivatives
where
are structural constants of the group
(after the rescaling (29)).
For notational simplicity, we restrict our discussion to
.
adds the compact group
, which is supposed to describe internal symmetries. We decompose
as
For the term
, we have
where
,
is the Minkowski metric and
is the torsion.
We consider as the non-Abelian generalization of Equation (
8) the Lagrangian of the
Yang–Mills field
defined on a Lorentzian manifold with the metric
. In Equation (
33),
as in Equation (
9), where on the LHS, we have the tensor density
,
.
is the dimensionless coupling constant for the Yang–Mills theory.
is the Killing form on the algebra
.
The representation of the Lagrangian (33) leads to the Chern–Simons term
Let us note that
does not depend on the metric (it is a topological invarian, the Chern–Simons term). It has the form
where
Specifically, in components in
,
where
are the structure constants of
. If
is interpreted as the tetrad (vierbein), then we can see that the Lagrangian (33) contains the Einstein–Hilbert Lagrangian (in the Palatini formalism) as well as the Yang–Mills part (the square of the
field strength), the cosmological term, and the square of the torsion. In fact, expanding the Lagrangian (33), we get
where
. Till now, in Equation (
33), we did not specify the manifold and the metric
on this manifold. The unification of gravity and gauge theory is based on a choice of the metric, identifying the
part of the
connection (28) with the vierbein [
7,
8,
14]. It can be seen from Equation (
36) that the choice
where
satisfy
leads in Equation (
36) to the appearance (as the second term) of the Einstein Lagrangian in the Palatini formalism, if we choose
, where
is the Planck mass. The term four-linear in
h in Equation (
36) is the cosmological term. The last term is a square of the torsion (32). As a consequence of the identification (37), the
symmetry is broken. The Hamiltonian depends on the Ansatz (37). The Chern–Simons wave function does not depend on the metric. As follows from
Section 2 and
Section 3, the insertion of the metric (37) does not change the conclusion that
does not evolve in time (it is an eigenfunction of the Hamiltonian). The identification of the connection
with the vierbein in the metric (37) has already been discussed by Townsend [
14] in classical theory. In quantum theory (as discussed in
Section 5), all four terms are separately renormalized. The bare Yang–Mills coupling
tends to zero at small distances (asymptotic freedom), but it can become large at large distances (confinement of the connection; see [
27,
28,
29,
30]). When discussing physical consequences of the choice of bare couplings and renormalization, we should consider the
gauge field model as immersed in a unified GUT model. In such a case, the renormalization is changed by the couplings to other fields (with a preservation of the asymptotic freedom). The cosmological constant
acquires a large (positive) contribution depending on the matter field content.
5. Functional Integration over the Lorentz Connection
We perform the functional integral (1) over
(assuming that
does not depend on
) in an expansion in
ℏ. We obtain as a result the factor
defining the effective action. We treat
as an external field and decompose
into the classical part
A and quantum fluctuations
as in the background field method. We write
where
has only the
components and
has only
components. We expand the Lagrangian (36) in
. In order to cancel (in the expansion) the terms proportional to
, we choose
in such a way that
This is a non-linear equation for
. We can solve it by iteration. In the lowest order, (linear in
), only
terms contribute to the last term in Equation (
40). Then, Equation (
40) reads (with the term quadratic in
A neglected)
When the first term (coming from
) is neglected, then we obtain the equation for zero torsion
This equation has the solution [
47]
where
,
and
It follows that by neglecting the derivative term (the first term in (41)) in the functional integration over
, we obtain in the leading order of the
ℏ expansion the Einstein gravity. This could also be seen as a consequence of neglecting the first (Yang–Mills) term in the Yang–Mills action (36) and taking into account only the second (Palatini) term. In an exact
quantum Yang–Mills theory at the one-loop level, we obtain in the functional integral the classical term
(the tree approximation) and subsequently the one-loop contributions
and the Fadeev–Popov determinant, whose form depends on the choice of gauge (see the calculations in [
48]). The determinants must be renormalized. It can be seen that at one loop, the
gauge theory is asymptotically free [
27,
28,
29,
30].
Specifically, we have
It follows that until the terms of order
ℏ,
where
is the Ricci tensor of the metric (37) defined by the tetrad
h and
The requirement of a vanishing of the
term leads to Equation (
40). There remains
The integration over
gives the determinant
. It is convenient to choose the
gauge leading to the factor
In particular, if
, then the
term in Equation (
46) is vanishing. The determinant of
can be represented by means of a Feynman–Kac formula as in [
48]. In the gauge (47), the Fadeev–Popov determinant is equal to the determinant of the Laplacian for vector fields
. We can conclude that for the renormalization of the determinant
, we need the terms
where
is the Ricci tensor for the metric (37) and
is the scalar curvature. In the effective action, there still will be the logarithm of the Faddev–Popov determinant whose renormalization requires the same counterterms as in Equation (
48) (with different constants). In Equation (
48), besides the square of the
curvature of the connection
appearing in the renormalization of the Yang–Mills theory on a flat manifold (the first term), we have the counterterms required for the renormalization of the determinant of the Laplacian
for the vector fields on a curved background. Then, the
gauge invariance of the model is broken as a consequence of the identification (37).
The one-loop expansion is easily generalized to
. We shall have the coupling of the
gauge field to the
Yang–Mills as in an
Yang–Mills theory. If the
algebra is denoted as
then we just make a change in Equation (
28) for the background field expansion
where
is the
gauge field. It is also possible to achieve a non-trivial extension of internal and space-time symmetries by constructing the Yang–Mills theory with the group
.
6. Chern–Simons States
The evolution Equation (
7) for
theory is a generalization of the one for the Hamiltonian (14). In the gauge
, Equation (
7) reads
In the quantized model,
or in components,
and
In the Hamiltonian evolution Equation (
49), we need
In components
and
It follows already from the change of variables (5) that in the leading order of
ℏ expansion of the solution of the Schrödinger equation with the initial condition
is
where
is the solution of the self-duality equation (with the initial condition
)
i.e.,
The self-duality equation in the gauge
can be expressed as
As the
part in the Schrödinger Equation (
49) is of a higher order in
ℏ, we confirm on the basis of the calculations (53)–(55) that the time evolution (56) generated by the classical flow
is the solution of the Schrödinger Equation (
49) in the leading order in
ℏ. In order to derive an exact solution or calculate higher orders in
ℏ, we need a stochastic equation (a generalization of Equation (
23)).
As mentioned in the introduction, the Chern–Simons states arise from the total derivative terms in the Lagrangian, which appear in Yang–Mills as well as in other gauge theories of gravity (see [
23,
24]). The wave functions
are not normalizable. One can regularize these states in order to make them normalizable while still preserving their exceptional properties [
40,
41]. After the removal of the regularization, the result is the same as a formal (Gaussian) functional integral. In this way, we can perform a perturbative calculation of the Wilson loop variables
We could consider the correlation functions in the
“ground state”
In the lowest order of calculations, we obtain
where
is the linking number of the curves
and
and the higher order terms involve expectation values of powers of
and higher orders of
ℏ. The result (63) is a consequence of the formula
and a similar formula for the tetrads
.
(35) is trilinear in
with the propagator
. By elementary power counting, a perturbation expansion for a theory with the Lagrangian
is renormalizable in three dimensions. This follows from the formula for the divergence index
[
45,
49] for theories with a propagator
as
where
d is the dimension,
n is the degree of the interaction polynomial,
E is the number of external lines, and
V is the number of vertices. In our Chern–Simons model,
. It follows that we can calculate the correlation functions in a perturbation theory in the Chern–Simons states.