1. Introduction
Modified teleparallel gravity models are very popular nowadays, both for fundamental research in gravity and for applications in the field of cosmology. The dynamical properties of these models are not very clear yet, except for in the case of General Relativity-equivalent options, which are nothing but standard Einstein–Hilbert GR models rewritten in terms of a different action functional. The standard Einstein–Hilbert action is the only one available in metric geometry for equations of GR when keeping up with the full diffeomorphism invariance at the level of Lagrangian density. So, one might ask how these new approaches can be claimed to be more fundamental.
The way to address this issue would be to assume that the spacetime manifold is globally parallelisable or, even better, topologically trivial, and to introduce yet another parallel transport, but this time, favouring a flat one. The required Lagrangian can then be written in terms of the torsion [
1] and/or nonmetricity [
2,
3,
4] of the flat connection. Note that, regardless, we would still need a metric tensor to describe the observational aspects of gravity. The Lagrangian density can then be fine-tuned in such a way that the corresponding field equation is just the same as the Einstein equation for the metric tensor and with no restriction on the flat parallel transport whatsoever. In other words, it is a more involved reformulation of essentially the very same theory [
5].
In the philosophy of trinity [
4], people usually like to discuss two particular cases of teleparallel worlds: the metric one, i.e., with torsion but no nonmetricity, and the symmetric one, i.e., with nonmetricity but no torsion. In any case, a flat connection can be represented as a field of tangent space bases (or tetrads, Vielbeine, or frames…) composed of covariantly constant vectors [
6]. If the flat connection is metric-compatible, then such a tetrad field can be chosen with orthonormal values everywhere in spacetime, which is a common type of choice for most practitioners from other areas of gravity research. At the same time, if the flat connection is symmetric, then the basis can be viewed as a coordinate one for a special, preferred type of coordinates, which then play the role of the Cartesian ones [
7]. In other words, the GR-equivalent models do introduce some additional geometric structures on top of the usual metric when including no new predictions [
5].
For certain, it is much more interesting to try modifications of teleparallel equivalents of GR, even though there are many challenges related to this endeavour. Historically, there are two basic ways of doing so. Recent activity has been centred mostly around non-linear modifications of the GR-equivalent Lagrangian densities, similarly to the famous
gravity framework and using the
gravity framework as a starting point [
8]. Later, this approach was also generalised by adding boundary terms to the arguments of a non-linear function [
9], which actually only meaning mixing Riemannian and teleparallel objects inside the function. Another more classical approach [
10], which is much older and is recently receiving new attention, concerns the modification of the quadratic fine-tuned action without introducing more non-linearities into it.
It is not just an accident that the models mentioned above, the
gravity [
8] and new GR [
10] models, are torsion-based; this is the historical version of teleparallel theories dating back to Einstein himself. Symmetric teleparallelism was created much later [
2,
11]. Nowadays, it is widely used in modified gravity research. We cannot give a full reference list for this active field here. Let us simply mention, with a few examples, that there are many directions available: unveiling the geometric foundations [
7], studying gravitational waves in non-linear extensions of STEGR [
12]
1, discussing the so-called geometric trinity beyond the simplest cases [
14], investigating possible cosmological solutions [
15], or even investigating the phenomenology of these [
16]. This all means that we need a better understanding of the foundational aspects of this approach, starting from the simplest features of its weak gravity regime.
We would like to analyse the weak gravity limit of a quadratic symmetric teleparallel theory known as newer GR [
3], named after an analogous construction in the metric teleparallel approach, which was called new GR [
10]. Since, in symmetric teleparallelism, one can always say that partial derivatives of the metric tensor components are just their (flat) covariant derivatives written in the preferred coordinates, it is simply a quadratic theory of a symmetric rank-two tensor field in a Minkowski spacetime. We all know that GR is virtually the only available theory of a massless spin-two particle, which is formally acceptable for the standard approach to quantum field theories, at least at the linear level. However, in this paper, we will discuss the newer GR models from the viewpoint of purely classical equations and see what can be said about them considering the current linearised weak gravity regime.
The plan of our paper is as follows: In
Section 2, we give a brief introduction to the models at hand; then, we turn to the weak gravity limit in
Section 3. The impossibility of having a ghost-free model with all metric components being dynamical is discussed in
Section 4. After that, we present the weak gravity equations in
Section 5, and the tensor and vector sectors’ behaviour is immediately obvious, while the scalars are described in the two following sections.
Section 6 is devoted to a review of pathological models, while
Section 7 counts the degrees of freedom in potentially viable theories, with a particular focus on clarifying the situation with a unimodular-like extension of STEGR. In
Section 8, we conclude.
2. The Structure of Newer GR
We assume that the spacetime has a flat symmetric connection on top of the usual metric and its Levi-Civita connection, which is always there [
5]. Moreover, we will work in what is called a coincident gauge. Namely, for such a connection, it is always possible to find coordinates in which all connection coefficients vanish [
17]. In these coordinates, and for this flat connection, all covariant derivatives coincide with the partial ones. In particular, the simple derivative
defines the nonmetricity tensor
. And we also define two vectors
constructed from the nonmetricity tensor (
1) in the usual way. Operations of raising and lowering the indices are performed by using the metric, as usual.
The most general (parity-preserving) action of newer GR [
3] can be given as
in terms of the nonmetricity tensor (
1) and vectors (
2) defined above. The case of Symmetric Teleparallel Equivalent of General Relativity (STEGR) is
when the nonmetricity scalar
appears to be the usual
of STEGR. The convenience of discussing this limit is precisely the reason why we have rescaled the coefficients in the action (
3) compared to many other works.
We can now define the nonmetricity conjugate, also often called superpotential,
via
as a symmetric (
) tensor
which defines the quadratic form of the nonmetricity scalar.
One can check [
7], term by term, that it has a nice property of
and therefore the variation in the action (
3) works as
where the last two terms in the brackets are written separately from each other to ensure clarity regarding what has been carried out.
Note also [
7] that the expression inside the brackets above,
, is automatically
symmetric so that the equations of motion, in the form of
, can safely be written as
One can also easily transform the upper-indices Equation (
5) to the mixed-position form
which is the simplest one and the most convenient for cosmology; or rather, arguably, in the most familiar approach, this can also be written as
with all the indices down.
These are the relevant equations. Then comes the task of analysing the physical properties of the theories at hand. Unfortunately, if anything regarding modified teleparallel models is understood well, it is that they are problematic in many respects [
18,
19]. The Hamiltonian analysis of
gravity already appears to be a complicated topic [
20,
21,
22], with no proper discussion of the strong coupling issues in the Hamiltonian language as of yet. A much easier problem of new GR was studied from this point of view only very recently [
23,
24]. Many questions still remain open, up to the point that, when a naive count of degrees of freedom in type 7 new GR
2 results in a negative number of
, it is simply interpreted as no dynamical modes being present at all [
24], and with no more comments whatsoever. Symmetric teleparallel models are even less studied than that, and related Hamiltonian works even extend to claiming that the standard algorithm fails [
29].
As a first step to better understanding a theory of gravity, it is reasonable to analyse its weak gravity limit first, i.e., small perturbations around the trivial background. In models of
and
gravity, it is not very interesting since all the new aspects are in the strong coupling regime so that nothing but the standard GR can be seen. However, the new and newer GR theories are different in this respect, even though some strong coupling issues are still present [
26]. In our previous papers [
25,
26], the weak gravity limit of new GR has been studied. Now, we turn to the newer GR case, not restricting ourselves to the principle symbol only [
30]. One might also want to pay attention to works on primary Hamiltonian constraints in newer GR [
31] and in general quadratic teleparallel models [
32], as well as to very recent investigations of Hamiltonian metric teleparallel models [
33] and linear perturbations in symmetric teleparallel models [
34].
3. The Weak Gravity Limit
In the weak gravity limit, we consider small perturbations around the Minkowski metric in Cartesian coordinates,
and parametrise them in the usual cosmological perturbations-like manner [
35]:
or, in other words,
with the standard restrictions on the variables, namely
, in order to fully separate, in the linear order, scalars (
,
V,
,
), vectors (
,
), and tensors (
) from each other. A quadratic model is always somewhat simple. Below, we specify each term in the action to the quadratic limit around
and derive their contributions to the linearised equations.
Of course, we could simply use equations in any form (
5)–(
7). In any case, the linearised weak gravity equation in vacuum can be taken as
where the position of indices does not matter in the linear order. However, it is also instructive to derive the shape of Equation (
9) explicitly. Note that we do not substitute the parametrisations (
8) right into the action (
3); rather, we do it directly into Equation (
9) so that everything is safe in this respect, even though substitutions with spatial derivatives would only change the contents of equations beyond our approach to perturbation theory, in which we solve every equation of the form
simply as
.
We define the generalisation of the linearised Einstein tensor as
and calculate it as a sum of different terms’ contributions (
3). Note that, with our sign convention,
, the Einstein–Hilbert Lagrangian density would be equal to
, and therefore, the Einstein tensor would be given by
or, equivalently, by
, while
.
The first term (
3)
produces
and therefore,
in the linearised limit.
The second term (
3)
produces
and therefore,
in the linearised limit.
The third term (
3)
produces
and therefore,
in the linearised limit.
The fifth term (
3)
produces
and therefore,
in the linearised limit.
On the Term Which Was Neglected Above
Above, we have neglected the fourth term (
3). The reason is that it gives us nothing new in the linearised limit. Indeed, this term
can be integrated by parts to
and therefore, in the linearised limit, it coincides with subtracting the second term (
3).
This is the first sign of the strong coupling issues, at least in the generalised meaning [
36]. Let us consider a model with
Non-linearly, its equations of motion are of first order in derivatives only; however, they are non-trivial. At the same time, in the linear limit, it is an empty model, with everything being just pure gauge. One can see this at the level of Equation (
7) directly. Indeed, given the nonmetricity conjugate of the form
we obtain
and also see that the second derivatives cancel each other out in the full non-linear equations, too.
Note in passing that what was called type 1 newer GR,
in the recent paper [
37] modifies the STEGR equations in the lower derivative part only, and, what is more worrisome, only by non-linear terms around Minkowski. Therefore, it is prone to strong coupling issues. This would not be the case if the whole dynamical structure was the same in both. However, the STEGR action is the only one with full diffeomorphism invariance in it.
All in all, the linearised Equation (
10) takes the reduced form
and we define the new coefficient
to be used in all the formulae below.
4. Impossibility of a Fully Dynamical Ghost-Free Model
Given the flat connection in the foundations of the approach, it would be reasonable to look for a model with no gauge freedom when choosing it. This requires a serious modification of GR-equivalent models, and there are many doubts regarding whether such a modification is feasible in a stable and physical way. The constraint structure of a highly non-linear model can often exhibit bifurcations [
38], leading to the number of degrees of freedom being ill defined, often around physically interesting backgrounds.
One possible way to avoid such troubles is to ensure that the kinetic matrix is wholly non-degenerate and, therefore, all the variables are truly dynamical. Both new GR and newer GR are the most general models with the action functionals quadratic in velocities and, therefore, making the matrix non-degenerate is not difficult. However, another question is whether the result would be stable. Already, in new GR, there are various opinions regarding this [
19,
39]. We can take [
19] the metric teleparallel models as theories of four vector fields with kinetic parts in terms of
; therefore, one might hope to obtain a healthy model with all components, modulo diffeomorphisms, being dynamical.
The situation in newer GR is different. First of all, already in GR itself, the kinetic matrix is not positive definite, and the would-be ghost is killed by a constraint, which makes it non-dynamical. Therefore, making a stable, fully dynamical model necessarily requires modifying the action far away from the STEGR case. What we would like to show is that even this does not allow for a ghost-free fully dynamical version of newer GR. Of course, it should not be very surprising when we are trying to provide dynamics to all tensor field components in a Lorentzian setting.
One can immediately calculate the quadratic in the velocities part of the quadratic action as
By using integrations by parts and the Fourier representation of
for both
V and
, and also treating some collections of variables as one, for a vector of variables such as
one finds the matrix
Of course it is easy to use it in its non-degenerate form (even with
). However, the issue then concerns its positive definiteness. The necessary condition of
is obvious; thus, one has to study the most difficult cases of scalars (except for
V). We need to make the non-trivial lower right corner of the matrix positive definite. Again, if it was not for the mixed spatiotemporal metric components (
V and
), a possible case would be
and
. We aim to prove that making the whole matrix (
17) positive definite is not possible.
Sylvester’s criterion tells us that our matrix is positive definite if and only if its leading principle minors (i.e., left upper corners) have positive determinants. Since renumbering the elements does not change the quadratic form, every principle minor (i.e., complementary to any subset of diagonal elements) must have its determinant positive, too. In case of any doubt, we refer the reader to any textbook on matrix algebra. However, to prove the claimed impossibility, we only need the positive determinants as a necessary condition, and this is obvious.
Indeed, let us take a vector in a subspace of
as an example; then, the quadratic form is positive if the corresponding (diagonal) matrix element is positive. Therefore, we need
thus already moving far away from GR. If we take an arbitrary vector in the subspace of
and
, then the quadratic form is governed by the lower right corner of the matrix, and the corresponding determinant must also be positive, as a product of positive eigenvalues. Together with
, this reproduces
.
Let us finally look at what the
and
subspace needs. With a little bit of simple algebra, we find the corresponding determinant
to be negative due to the previous requirements. Note that it is precisely the positivity in the mixed spatiotemporal components that makes positivity in all the rest impossible.
Summarising the above, making a fully dynamical model ghost-free is not possible, let alone its full stability. At the same time, neglecting the questions of stability, one can easily find the condition for it to be fully dynamical. By adding three times the penultimate row to the last row of the kinetic matrix (
17), one obtains a matrix whose determinant is very easy to compute:
where we have taken into account that the first and the second entries of the matrix represent four and three variables, respectively, while the fifth power of
comes from the non-trivial part in the left lower corner. As long as the quantity (
18) is not equal to zero, all the modes are dynamical.
5. Field Equations in the Weak Gravity Limit
In order to analyse the models of newer GR and classify the possible numbers of degrees of freedom in them, we combine the contributions (
11)–(
14) in the gravity tensor (
15). Let us start from the tensor sector. Its only contribution to the linearised
tensor (
15) reads
Therefore, with the equation of
the
field is never constrained. It is dynamical as long as
and is a pure gauge otherwise.
In the vector sector, one can obtain (
15)
where
, as we have introduced above (
16). We immediately see that, even inside the vector sector only, the fully dynamical ghost freedom,
leads to gradient instability in both vector fields. Neglecting the stability issues, we can say again that the necessary condition for all vector modes being dynamical is
In the perturbative approach, we can write the equation of motion in vacuum as
The three special cases are very easy to see. In the case of GR, where
and
, Equations (
20) and (21) reduce to
, which equates to one constrained mode and one gauge freedom. In the non-GR case of
and
, one obtains
, with the same nature of modes, although differently distributed. Finally, the vector sector is fully empty, i.e., pure gauge, if
. As we see, unlike in the case of new GR, the vector sector of newer GR is extremely simple.
At the same time, we will find below that the scalar sector is somewhat more complicated. The
tensor components (
15) can be presented as
As always in such perturbation theory, we treat △ as a nonzero number, and we also solve the
and
parts of the spatial equation separately. Therefore, we put the vacuum equations into the following form:
where the order is the following: mixed spatiotemporal components, the nondiagonal part of spatial components, the temporal component, and the diagonal part of spatial components.
Obviously, the scalars are the challenging part of this analysis. Note, though, that the condition of , which is necessary for any hint at stability in non-scalars, makes the variable V represent one half of a dynamical degree of freedom at most.
7. Potentially Interesting Models
We turn to the case of
as in GR. The tensors are dynamical, while the vectors are half gauge and half constrained. Up to now, this is the same as in GR. Only the scalars can be different.
Since
has disappeared from Equation (
22), we immediately obtain a constraint of
Then, Equation (24) for
and
can be transformed by
thus reproducing the combination of these accelerations in another Equation (25). In other words, the matrix in front of accelerations becomes degenerate, and, at most, one combination of
and
appears to be dynamical, while the difference of the two Equations (24) and (25) tells us that
where we have relied on the assumption that
. Then, by substituting this
into another Equation (23), we obtain a constraint of
which makes the previously found constraint (
34) obsolete.
Altogether, we obtain the system of equations
and can take
as pure gauge; then, three initial data are needed to solve the system, one datum for
V and two data for
, so that it behaves as three halves constrained and three halves dynamical modes. This is true as long as the potentially dynamical combination of
and
is linearly independent of the combination
constrained by the first Equation (
35), i.e.,
which is nothing but the negation of the condition (
28) when
.
If this inequality is not satisfied, then both
and
are constrained, with only one half degree of freedom remaining in
V. An exception to this happens if the whole would-be dynamical equation just disappears. In other words, it might be that, on the constraint surface (
35), the other two Equations (36) and (37) just totally coincide with each other up to an overall factor. A straightforward, though somewhat uninteresting, game with quadratic equations shows that it happens if
only.
This is a very well-known result. In the case of
we obtain the linearised GR,
of two pure gauge and two constrained modes in the scalar sector.
7.1. The Case of a Unimodular-like Deformation of STEGR
The results of this section may come as a surprise. The case of breaking only the volume part of diffeomorphism invariance,
has been considered [
40] before
3, claiming only one new dynamical degree of freedom (compared to GR) and one global free parameter (like the cosmological constant in unimodular gravity). Namely, the model was shown to be equivalent to STEGR with a canonical scalar field of an exponential potential term with an arbitrary constant in front of it [
40].
In the perturbative analysis, we are not expected to see global freedoms; however, the local dynamical modes must be clear, while what we see is one half dynamical degree of freedom more than what was claimed before. More precisely, Equations (
35)–(37) can be reduced to
One gauge freedom should be taken in terms of either
or
, and the remaining system then requires three initial data.
What has happened? It is simply that the previous claim [
40] is wrong—not in the scalar–tensor representation, but in how the degrees of freedom count was carried out. Below, we confirm our result once more. Then, we show that their analysis is also correct. Finally, we explain where the extra half degree of freedom comes from.
To check our derivations, one might notice that the models of
obtain the linearised gravity tensor (
15)
where
is the usual Einstein tensor, and we have renormalised it all by the common value of the coefficients in the STEGR part. Therefore, we obtain the following equations:
which obviously reproduce our result (
39) when
.
At the same time, we do agree with the scalar–tensor representation [
40], too. Indeed, one can either directly use Equation (
7) substituting every
coming with
by
for the field
or (suppressing
) consider a Lagrangian system of
When we perform a variation of this action with respect to
, in both its explicit appearance and through the field
, we obtain
with the superscript
for Levi-Civita-related quantities. The Levi-Civita covariant divergence of the right-hand side must be zero. In the covariant approach, it follows from the equation of motion for the connection. In our approach, it is nevertheless a self-consistency requirement [
7] since we work in vacuum, and the
part satisfies it.
The scalar field
does not have its own equation of motion, for it was not an independent variable. However, by finding the covariant divergence, we see
from which we obtain
with an arbitrary constant
. Indeed, the field
behaves precisely as it was stated in the Ref. [
40] and contributes the correct amount of effective energy momentum to source the Einstein tensor.
What went wrong then? The count of degrees of freedom. If we take it as just the fully covariant GR with a dynamical scalar field possessing an arbitrary constant
in its Lagrangian, then it is indeed three dynamical modes, four constrained ones, and four pure gauges—the usual GR with a canonical scalar field. However, from the symmetric teleparallel perspective, there is also the flat connection, and the metric tensor is not a true tensor (unless we go for the covariant approach with more variables in it). The condition they have forgot to add is the requirement of
In the logic of GR, it is just a choice of coordinates. However, we are not free in choosing the coordinates any longer due to the preferred coordinates, the so-called coincident gauge ones. For a single GR solution, there might exist different flat structures on the manifold. As long as we take the flat connection as something objective and sensical, we must also take care of that. This is where the extra half degree of freedom comes from. Indeed, suppose we have found one solution; then, there exist other solutions related to the initial one by a coordinate change
which, due to the condition (
40), must satisfy
. Therefore, only three out of four former gauge freedoms are still pure gauge, while the fourth one becomes half-dynamical.
7.2. On Other Modifications
An interesting feature of the analysis above is that is does not show much difference in deviations from GR in terms of
or
. In particular, we always see a half-integer number of degrees of freedom, though the condition of non-degeneracy (
38) is more intricate in terms of varying
than that of varying
. It makes sense to also try studying the
-modified models, not only the
-modified ones, for the latter can be constructed in the line of unimodular gravity, therefore not requiring the symmetric teleparallel framework per se. At the same time, these two are the only parameters we might change in the linearised theory while keeping non-scalars potentially stable.
An interesting point to note is that, having taken a model of
one can solve (
35) for
and substitute it in the kinetic quadratic form, with the result
which is non-negative for
. This does not really guarantee anything about stability by itself, though it looks very nice. We leave these models for future analysis.
8. Conclusions
We have analysed the models of newer GR in the weak gravity limit. In this limit, they depend on only four free parameters out of five: , , , and . We cannot make the flat connection fully physical, as some parts of it must still be in pure gauge. This is because the first two parameters are immediately fixed to their GR values; even though for the presence of the usual tensors (gravitational waves), we then put , for, otherwise, there would be either ghosts or gradient instabilities in the vectors.
Of course, the ghosts themselves do not produce any instability at the level of linearised theory (as long as the gradient energy also has a reversed sign), and some may even (correctly) argue that ghosts might sometimes be safe [
42,
43]; see also some recent developments [
44,
45]. However, in the case of gravity, it seems plausible that practically any situation which does not locally provide us with a reasonably stable ground state must lead to catastrophic outcomes. Nevertheless, more research on tamed ghosts should be conducted.
As we see it for now, there are not so many options of generalising STEGR with any kind of guaranteed stability. Some people find interest in using the GR-equivalent teleparallel models for defining notions like conserved energy [
46]. In our opinion, this only includes artificially endowing gravity with properties which do not naturally belong there. Therefore, it would be good to try as much of the new structure beyond the GR-equivalent options as possible.
For the scalars, after the restrictions from tensors and vectors, there remains the freedom of choosing
and
. The former is a safe option. It gives us the standard Einstein equation and a canonical scalar, with one-half extra degree of freedom living in the flat connection, which has not been noticed before [
40]. However, it does not make much of the symmetric teleparalell framework physical, for it can be constructed in a unimodular approach to gravity. In our opinion, the case of modifying
is worthy of more study.