The title of this essay is of course meant as a provocation, a provocation of the kind people engage in when they say such things as ER = EPR (denoting the conjectured correspondence [
1] between Einstein–Rosen bridges (ER) [
2] and Einstein–Podolsky–Rosen (EPR) entanglement [
3]). Such statements appear nonsensical but perhaps suggest a novel interpretation of the terms involved that could be quite revealing. To say that gravity is a Yang–Mills theory is not nonsense but actually false if by Yang–Mills theory one means the standard text book theories labeled by the structure constants
of a ‘color’ Lie algebra
and by gravity one means the Einstein–Hilbert action in the same dimension. Indeed, in the context of perturbative quantum field theory (QFT), Yang–Mills theories carry only cubic and quartic couplings and are renormalizable, while Einstein–Hilbert gravity is non-polynomial and non-renormalizable.
Rather, here, we suggest the definition of a broader class of ‘Yang–Mills-type’ theories based on the homotopy algebra formulation of field theories to be explained below [
4,
5,
6]. In this framework, the textbook Yang–Mills theory takes the form of a tensor product
of a homotopy algebra
that encodes the kinematics of Yang–Mills theory, with the color Lie algebra
[
7]. One may then define the larger class of Yang–Mills-type theories as the tensor product of
with more general ‘Lie-type’ algebras, namely homotopy Lie or
algebras. The kinematic algebra
itself carries, in a hidden way, such a Lie-type algebra up to obstructions that, however, cancel on a subspace of the tensor product
with a second copy
, as has been proved to the order relevant for quartic couplings [
8,
9,
10]. We may thus define the yet larger class of Yang–Mills-type theories as those given by the tensor product of
with an obstructed Lie-type algebra for which the obstructions are canceled on a non-trivial subspace. So defined, it is just a fact that there is a subspace of
defining a Yang–Mills-type theory that
is gravity. Importantly, this theory is
not Einstein–Hilbert gravity but rather double field theory, which includes the graviton, an antisymmetric tensor (B-field) and a scalar (dilaton) [
11,
12,
13].
Our construction is directly inspired by, and a gauge invariant off-shell generalization of, the ‘double copy’ technique of amplitudes [
14,
15,
16], known to the general public under the slogan ‘ Gravity = (Yang–Mills)
’. We think that the slogan ‘ Gravity = Yang–Mills ’ is more appropriate, a viewpoint that can perhaps be justified most succinctly as follows: Since the standard textbook presentation of Yang–Mills theory gives a Lagrangian in terms of structure constants
, we can think of Yang–Mills theory as a machine: a machine that takes as input structure constants
and produces as output a QFT. The framework of homotopy algebra allows us to reinvent Yang–Mills theory as a more powerful machine that accepts as input more general ‘Lie-type’ algebras. Feeding in a conventional Lie algebra, this machine produces standard Yang–Mills theory, but feeding in a second copy of the kinematic algebra itself produces double field theory.
Let us begin by explaining the relationship between homotopy algebras and field theories. A field theory is defined by introducing a set of fields, which here we denote collectively by
, by specifying field equations or an action and, possibly, by specifying gauge symmetries and their dual Noether/Bianchi identities. We will focus on perturbation theory, where the fields form a vector space (the sum of two fields is again a field), and so do the gauge parameters, etc. More precisely, the totality of these objects form a
graded vector space
X, which means that we assign, as a book-keeping device, an integer called degree to each object, depending, e.g., on whether it is a field or a gauge parameter. Such gradings are familiar from the BV-BRST formalism, where they are related to the ghost number [
17,
18].
One then defines various maps on this graded vector space in order to encode, for instance, kinetic terms, interaction vertices, and gauge transformations, which define the theory. Specifically, at least in perturbation theory, one can write the action as
i.e., as a sum of quadratic terms, cubic terms, and so on. We have written the terms of order
in fields using a universal inner product
and multilinear maps
with
n arguments. Furthermore, one assumes, for instance, that the field equations and gauge transformations with gauge parameters
, respectively, are given by
and
It must be emphasized that the
, when evaluated on different objects such as fields or gauge parameters, are a priori independent maps, distinguished by the degrees of their inputs. For instance,
, which is known as the differential, acts according to the following chain:
where the interpretation of each space is indicated in the second line. According to (
3),
acting on
is defined by the zeroth-order, inhomogeneous terms of the gauge transformations, while according to (
2),
acting on
is defined by the linear terms of the equations of motion. For instance, in Yang–Mills theory,
on fields is the second-order differential ‘Maxwell operator’, while
on gauge parameters is the first-order differential operator of the Abelian gauge transformation. Linearized gauge invariance requires
This relationship is summarized as
, which moreover must hold on the entire complex (
4), where it also encodes linearized Noether identities.
More generally, the
are determined so that the non-linear equations of motion, gauge transformations, and so on correctly describe the desired field theory. They must, of course, satisfy consistency conditions following from those of field theory, such as, for instance, gauge covariance of the field equations or closure of the gauge algebra. Thus, with any consistent (perturbative) field theory, we are given a graded vector space
X equipped with maps
obeying various consistency relations. This is what mathematicians call a
structure. So what is the general structure of field theory? This is the category of homotopy Lie algebras or
algebras. These generalizations of Lie algebras are defined as integer graded vector spaces
equipped with multilinear maps
of intrinsic degree
(i.e., the degree of the output is the sum of the input degrees plus 1), which are graded symmetric (e.g.,
, where in exponents,
x denotes the degree). This means that, depending on the input, the map may be symmetric, as when evaluated on fields, or antisymmetric, as when evaluated on gauge parameters. The
are subject to an infinite number of
relations. The first relation is
, and we display the next two relations:
These state, respectively, that the differential
obeys the Leibniz rule with respect to the 2-bracket
and that the Jacobi identity only needs to hold ‘up to homotopy’, with the failure being governed by the differential
and the ‘3-bracket’
. The
relations encode the consistency conditions of field theory, for instance, the gauge covariance of (
2) under (
3).
We now turn to the
algebra of Yang–Mills theory. The familiar action, written in terms of the non-Abelian field strength
, reads
where
denotes the flat space volume element in
D dimensions. Expanded in fields and using integrations by part, this becomes
where
denotes the d’Alembert operator. For the following applications, it is useful to pass to an equivalent formulation by introducing an auxiliary scalar:
with the ellipsis denoting the same cubic and quartic terms as in (
8). Integrating out
, one recovers (
8).
One could now determine the
algebra of Yang–Mills theory as sketched above, but it is more useful to immediately ‘strip off’ color and to write this algebra as a tensor product of the color Lie algebra
with another kind of homotopy algebra. The latter ‘kinematic’ algebra is defined on a vector space
that carries the same objects as the
algebra of Yang–Mills but without color indices, which by an abuse of language we denote by the same letters and names.
defines a chain complex (with degrees shifted by one)
where the differential
satisfies
. For instance, on gauge parameters and fields, respectively,
acts as
where we use the short-hand notation
. Thanks to the introduction of
, we have a
grading
and a map
b of intrinsic degree
, whose action is indicated in the diagram (
10). For instance,
, where the output is re-interpreted as a gauge parameter and hence degree shifted. In addition,
carries a graded symmetric 2-product
of degree zero and a 3-product
of degree
, which we display evaluated on the fields
where the external
index indicates the vector component in
.
A graded vector space
equipped with maps
and possibly higher maps, subject to certain symmetry properties, is called a
algebra (the homotopy version of a commutative associative algebra) provided that, in addition to
, the following relations hold:
The first relation is the Leibniz rule. The second relation states that
is associative ‘up to homotopy’, and in general, there may be infinitely many more relations. A
algebra is a special case of an
algebra where the
is graded symmetric, while the higher
for
are subject to graded Young–Tableaux-type symmetries.
The
algebra of Yang–Mills theory is now obtained from the
algebra
by tensoring with the color Lie algebra
[
7] (see [
19] for a review):
At the level of the vector space, this just means that the objects in (
10) are made
-valued by decorating them with color indices (and degree shifting by one):
where
are the generators of
. One obtains the fields, gauge parameters, etc., of Yang–Mills theory. The
encoding the
structure on (
14) are
where in this case all
for
are zero. The
relations on
follow from the
relations on
and the Jacobi identities for
. The resulting
and
encode the full Yang–Mills theory; in particular, using (
12), they reproduce the cubic and quartic interactions.
With the decomposition (
14) of homotopy algebras, we have separated Yang–Mills theory into its ‘kinematic’ and its ‘color’ parts, with the former being an ‘associative-type’ algebra and the latter a ‘Lie-type’ algebra. The idea inspired by the double copy procedure of amplitudes is to replace
in (
14) by another type of Lie algebra based on the kinematics of Yang–Mills theory (‘kinematic Lie algebra’) in order to obtain gravity. While
started its life as an associative-type algebra, it actually also admits a hidden Lie-type algebra (in a suitably generalized sense). Borrowing terminology from linguistics, we may refer to the former as the ‘surface structure’ and the latter as the ‘deep structure’.
In order to display this deep structure, we use the map
b defined in (
10), which is nilpotent,
, to define a new Lie-type bracket on
as the failure of
b to act via the Leibniz rule on
:
From this definition, it follows with
that
b obeys the Leibniz rule with
, and hence,
b can be thought of as a second differential of opposite degree to
. The deep structure on
is a generalization of a Batalin–Vilkovisky (BV) algebra. A BV algebra consists of a (graded) commutative and associative product together with a nilpotent operator that, however, does not act via the Leibniz rule on the product but rather is of ‘second order’ (similar to the BV Laplacian of the BV formalism). Defining then a 2-bracket as the failure of the differential to act via the Leibniz rule on the product, one obtains a Lie bracket satisfying the Jacobi identities and a compatibility condition with the product. The differential
b, the 2-product
, and the 2-bracket
above want to be a BV algebra but fail to be that because (i)
b is not second order with respect to
, and (ii)
is not associative. These failures suggest that there is a homotopy BV algebra (BV
algebra [
20]), but there is an additional failure due to the relation
where □ denotes the d’Alembert operator. This relation quickly follows with (
10) and (
11), and it means that
and
b are compatible only up to ‘□–failures’. This has various ramifications. For instance, the original differential
does not obey the Leibniz rule with respect to
:
where the ‘anomaly’ on the right-hand side follows from (
17), (
18), and □ being second-order. Similar □-failures appear in other relations. Formalizing these failures one can define a more general algebraic structure that is realized on the kinematic space
of Yang–Mills theory, which following Reiterer we denote by BV
[
21]. This includes as a subalgebra a
algebra and as a ‘subsector’ an
algebra that, however, is obstructed by □-failures. (An operator
b and an associated BV
algebra are also realized in self-dual Yang–Mills theory [
22] and 3D Chern–Simons theory [
8,
23,
24].)
We can now turn to the construction of gravity in the form of double field theory (DFT). Due to the □-failures, a general BV
algebra is not quite of ‘Lie-type’ and cannot be tensored with the kinematic algebra
to obtain a genuine
algebra of gravity. However, taking a second copy
of the kinematic algebra itself, these failures can be canceled on a subspace of the tensor product
. Denoting all objects of
with a bar, the full tensor product space
is a chain complex carrying two natural differentials of opposite degrees:
which both square to zero due to
. The □-failure relation (
18) now implies
We can eliminate the ‘-failure’ by going to a subspace with . To explain this point, we first note that since is a space of functions of coordinates x, and is a space of functions of coordinates , is a space of functions of doubled coordinates . (This is familiar from quantum mechanics: the tensor product of two one-particle Hilbert spaces of wave functions of one coordinate yields the two-particle Hilbert space of wave functions of two coordinates). We may then impose on functions and products of functions, which in DFT is known as the strong constraint and essentially equivalent to identifying coordinates x with coordinates . We will return to the ‘weakly constrained’ case momentarily.
We thus consider the subspace
where
restricts the spectrum appropriately. Together, both constraints in here are known as level-matching constraints. The space
is precisely the complex of DFT as derived from closed string field theory in [
11]. For instance, the fields in degree zero (with an overall degree shift by 2) are given by
where
encodes spin-2 and B-field fluctuations,
e and
are two ‘dilatons’, one of which is pure gauge, and
and
are auxiliary fields that can be integrated out. More generally,
encodes the gauge parameters and gauge-for-gauge parameters of DFT, etc., so that, for instance,
implies
, exhibiting the ‘double copy’ structure of linearized diffeomorphisms and B-field gauge transformations.
Turning to the non-linear structure, we have to define higher brackets on
. The 2-bracket can be written, in an input-free notation explained in [
8], as
where the second equality holds on the subspace
. This 2-bracket obeys the Leibniz relation, c.f. (
6), thanks to
anticommuting with
for
. The second
relation in (
6) involving the Jacobiator can be satisfied upon defining a suitable
, which is more involved but can be written entirely in terms of the BV
structures of Yang–Mills theory [
8]. With the above general
dictionary, this determines the complete gravity theory to quartic order, in particular, via (
1), the quartic couplings.
Let us emphasize two features of this construction of gravity as a Yang–Mills-like theory:
The gauge algebra of DFT, which is a duality covariant version of the diffeomorphism algebra of gravity, originates rather directly from the couplings of Yang–Mills theory.
4-graviton amplitudes can be computed with the and above and by construction exhibit the factorization into Yang–Mills amplitudes.
Above, we have essentially identified the two coordinates
x and
in order to obtain ‘
supergravity’ (Einstein gravity coupled to B-field and dilaton) as a strongly constrained DFT. It is, however, possible to obtain a weakly constrained DFT, at least if all dimensions are toroidal, in which the fields genuinely depend on
x and
, subject only to
(level-matching for string theory on tori). One uses that the total space
carries a BV
structure and performs an operation known as homotopy transfer (see, e.g., [
25,
26,
27,
28]) to the subspace
, together with an additional non-local shift of
[
29]. The resulting space realizes an
algebra of weakly constrained functions and hence defines a consistent field theory containing momentum and winding modes without other higher string modes. The momentum and winding modes are encoded in the doubled coordinate dependence of the massless fields. Hull and Zwiebach constructed such a theory to cubic order starting from string field theory in 2009 [
11], but since then, it has been an open problem to construct the theory to quartic and higher orders.
We close this essay with a tantalizing possibility. Suppose a weakly constrained DFT with doubled compact coordinates and standard non-compact coordinates can be constructed to all orders in fields. This theory is expected to have an improved UV behavior. While the strongly constrained theory must exhibit the usual UV divergencies of general relativity, a weakly constrained theory features infinite towers of additional massive states (that in particular are charged under diffeomorphisms along the non-compact dimensions). Since these states run in loops, this theory should have an improved UV behavior. Indeed, as shown by Sen [
30], a weakly constrained DFT can in principle be derived from the full closed string field theory upon integrating out all string modes that are not part of the DFT sector [
28]. The theory so constructed then inherits the UV finiteness of the full string theory [
30]. Therefore, constructing a weakly constrained DFT from scratch might lead, possibly upon including
corrections [
31,
32], to a consistent theory of quantum gravity. Perhaps there is a quantum theory of gravity much ‘smaller’ than the currently explored string theories, and perhaps this quantum gravity is secretly a Yang–Mills theory.
Note added: After submitting this essay to the arxiv, we were informed by Anton Zeitlin that his early papers [
7,
33] already contain BV
structures, which, remarkably, have even been suggested to relate to gravity along lines closely related to the above discussion [
34].