3.1. Specialness of the Gauss-Bonnet Case
The results of
Section 2.1 show that choosing
or any constant multiple of this leads to a 4D effective action in the form of a Horndeski theory. It turns out this is a special result in that other choices of the
coefficients do not lead to Horndeski theories. For example, consider the four-derivative terms in Equation (
9). Typically, terms with four derivatives do not correspond to second order equations of motion, but Horndeski’s theory tells us that specific combinations of such higher derivative interactions can cancel out each others’ pathological elements, leading to an overall stable theory. In the Gauss-Bonnet case, the specific terms in question correspond to the following terms in the Horndeski action:
This identification is possible because in Equation (
9), the choice
makes the coefficients of
and
equal up to a sign difference; in the Horndeski action these two terms must appear in this form in order to cancel out parts that do not lead to second order contributions to the equations of motion. We see that general choices of
do not generally lead to Horndeski-compatible theories in 4D. In fact, upon a more comprehensive study of the correspondence between the 4D effective action and Horndeski-allowed terms, one can prove that the Gauss-Bonnet combination is the only choice of
,
and
that will lead to a Horndeski theory. We present the proof of this uniqueness in
Appendix A.
It is interesting that the Gauss-Bonnet term’s specialness among combinations of quadratic curvature scalars manifests in this way. Please note that this means that even the higher-dimensional analogue of the well-studied Starobinsky or
theory of modified gravity
would reduce in 4D to a theory that is not Horndeski. As well as the fact that the structure of the
b field’s Lagrangian would have greater than second order equations of motion, a residual
term would persist in the 4D action, and could in principle be conformally transformed into a second scalar field that interacts non-trivially with
b. It is not immediately clear that this interaction would avoid pathology in
b’s dynamics, but as the full
-dimensional theory should be stable, one might hypothesise that its dimensionally reduced form is no different, though testing this explicitly is beyond the scope of the present work. Given the speciality of the Gauss-Bonnet combination, its physical relevance, and the relative simplicity of its 4D effective action compared to Starobinsky theory or another choice of
, we will henceforth focus on the special case of the action (
1) given by
3.2. Effective 4D Planck Mass and Cosmological Constant
We will discuss below the properties of effective potential for the field
b and whether
b can in principle stabilise at all. Assuming for now that a static solution
is reached and this persists until late-time cosmology, we can evaluate the effective cosmological constant and four dimensional effective Planck mass resulting from this. Directly broadcasting the 4D effective action (
12) into a form where the prefactor of the Ricci Scalar is
gives us the equation
Here we have introduced the notation
because as we have seen already in
Section 2, this combination appears very often in the four–dimensional effective theory. Explicitly using the form of
given in Equations (21)–(26), and evaluating at the point (
,
) we find that the observed (constant) Planck-mass today would be
Enforcing that this equality holds for the measured value of
, coming from say a weak-gravity experimental determination of Newton’s constant, constrains one parameter of the theory. It is helpful to rewrite the 4D effective action using this expression for
such that the
coupling takes the form
where in the first step we have added and subtracted
, and in the second step we have used the definition of
. The action now can be written in a form explicitly containing the 4D Planck Mass such as
where
is the same Horndeski Lagrangian specified by Equations (21)–(26), except
is replaced with
, defined by
where in the second line we have used (32).
Similarly, one can show that the 4D effective cosmological constant for a stabilised field is simply
and this would allow us to write the action in a form
where now the barred Horndeski Lagrangian
is again specified by Equations (21)–(26), this time with both
and
V replaced by barred variations given respectively by (34) and
The action (37) then has the property that the second term containing the Horndeski Lagrangian vanishes when , as all the parts not proportional to derivatives of b (i.e., X) have been absorbed into the definitions of the effective Planck mass and cosmological constant in the first term, and by construction.
3.3. Effective Potential and Static Solutions
We will now address the question under which circumstances the effective potential has at least one minimum. This is a necessary condition for the extra dimension, whose dynamics is described by the scalar–field b, to stabilise. Please note that this condition itself is not sufficient: the dynamics of the field b has to be such that it is driven towards the minima of the effective potential. We will study this in future work, in particular in the context of inflation and whether b itself can be an inflaton field.
In the limit of negligible motion of the field, the Friedmann and Klein-Gordon equations (whose full forms can be found e.g., in [
32]) can be approximated at zeroth order in
, and take the form
Using the equations of motion in this form allows us to search for static solutions of the system—those in which the field is frozen in place. These are of particular interest in the context of cases such as ours where the field represents the size of extra dimensions, as it implies their stabilisation, as well as the context of cosmology where static scalar fields in a potential behave as fluids with equation of state of . Particularly when thinking about stabilisation of the extra dimensions, we are also interested in static solutions where as this would imply extra dimensions of zero size, which has obvious difficulties in interpretation.
The presence of terms other than
in the Klein-Gordon equation implies that even in the static limit, the points at which the field can sustain a constant value will not be the minima of the bare potential
V but instead some effective potential due to a combination of
V and
. As we have neglected motion of the field in this, this calculation also makes no claims regarding whether the system can dynamically reach such a stable point
3, simply whether stable points exist. What we can say from such an analysis is whether such static solutions exist in the first place, and if so what the sufficient, necessary or preferable conditions for their existence are in terms of say the number of dimensions and the combination of model parameters such as
and
4. In this limit of negligible field motion, the static solutions we find will also be stable to zeroth order in
as long as they occur at minima of the effective potential (e.g., in the slice
. Their stability against small perturbations in the field velocity, however, is also dependent on the matter coupling, and cannot be fully addressed within the scope of the present work.
In conventional scalar-tensor theory, the equation of motion is just
, and so it is natural to define an effective potential gradient
to mimic this form. Using the Friedmann equation to eliminate
from this expression and using the specific forms of
V and
for our theory gives the effective potential gradient as
Finding solutions to to identify the existence of static solutions, using the effective potential as given by (43), then reduces to a problem of solving a polynomial equation of six terms with orders , , , , , and . This is, of course, not generally possible for either general d nor for any specified value of d. Below we will look at each choice of d as a special case and categorise the possible behaviours.
Before proceeding to this, however, we can also directly integrate the expression (42) or (43) with respect to
b to see what the effective potential
itself looks like, i.e., what corrections do the presence of a
term impose on the shape of the potential felt by the field in the limit of negligible motion. This procedure yields the expression,
where
is the hypergeometric function. Note the structure of this potential: the first three terms are as in the bare potential, with the difference that the coefficient of the second term is now 3 instead of 1. Furthermore, logarithmic and hypergeometric correction terms are generated. The logarithmic term in particular imposes a boundary on the values
b may take if the reality of the potential is enforced, that is
This is trivially satisfied if the sign of the combination of terms on the left hand side is negative, but when it is positive there will generally be some limit on the value of
b. As well as representing a divergence in the effective potential, this inequality also imposes essentially the positivity of
and that of
in the Friedman equation (assuming
V is also positive) as well as that of the effective square Planck mass, a topic that will be more explicitly discussed in
Section 3.2.
We would also like the stabilised system to admit a positive effective cosmological constant in four dimensions. Inspecting the Friedman equation or the effective action, one can see that the ratio of the signs of
V and
will determine the sign of the effective cosmological constant, and we will again see this more explicitly in
Section 3.2. As we previously discussed,
is constrained to be positive by enforcing positivity of the square Planck mass, and so
V must also be positive at the point
of the static solution for it to satisfy this. For now we will simply search for criteria the parameters must fulfil to permit a positive effective cosmological constant, rather than find bounds on how small the parameters must be to actually replicate the observed cosmological constant as our foremost interest here is to exclude models and numbers of dimensions on grounds of wider criteria rather than precise and comprehensive parameter searches.
3.3.1. Case of d = 1 or 2
In
the expressions derived in this work are largely simplified by the fact that a one-dimensional space has zero Ricci curvature. One result of this is that the effective potential is just the bare potential, which in the limit of
is simply
. Similarly, the expression for the effective potential in Equation (44) is invalid for the case
, as in this case
and therefore the effective potential is identical to the bare potential, which takes the form
. Neither of these cases of are of interest to us, as the
case has a linear effective potential which cannot stabilise, and the
case has a quadratic potential which is capable of stabilising but only at
. We therefore rule out these models. Please note that our findings do not contradict [
6], as our starting point with a simple cosmological constant in the bulk is different from the starting point in that paper.
3.3.2. Case of
Inspecting the expression for in Equation (43), we see that for , will always be a solution of the Klein-Gordon equation, implying a turning point of the potential necessarily exists at . This alone would not be a problem, if not for the fact that in models with one also finds that and that the effective cosmological constant at the point is also zero (as this is set by the bare potential). These two facts combined mean that if there exists a stable potential minimum point other than , then it is either not a global minimum, or it possesses a negative cosmological constant. This does not strictly rule out the case of for interesting applications; it is entirely possible that we have a metastable state represented by a non-global minimum in effective vacuum energy. However, as this is an additional complication, we are less inclined to explore this possibility in our initial survey of models. When we, in a future work, return to the question of dynamical stabilisation, such metastable points will be undesirable as we would not expect them to be reached from generic initial conditions.
An additional complication with the class of models is that the leading term in the polynomial is of order , that is, even for the case we would have to deal with a potential with up to nine possible turning points whose b values can not in general be analytically determined.
Given these observations, while we do not absolutely rule out models of , we initially choose to relegate them as more convoluted and complex than strictly necessary. We will see next that the remaining cases of d between 3 and 5 provide enough feasible behaviour that we need not delve into this possibility at this stage.
3.3.3. Case of
In
, the integer arguments of the hypergeometic functions in the effective potential allow us to write it in the simplified form
in which the hypergeometric functions can now be seen to simply change the prefactor of the logarithmic correction to the polynomial bare potential. Manipulation of Equation (43) additionally reveals that the turning points of the potential occur at
b values that are solutions of the cubic equation
As the logarithm in the effective potential imposes
,
will always be positive for viable field values, and so the sign of the effective cosmological constant is determined entirely by the bare potential, which in this case looks like
As there are an odd number of extra dimensions in this case, it works out that the potential and effective potential take these asymmetrical forms. Considering we wish to interpret
b as a scale factor of the extra dimensions, common sense suggests we should impose
to facilitate this.
5 We therefore ignore the possibility of negative
b. Whether the time-evolution of the system, taking matter couplings into account, obeys this is of course left to future work probing the dynamics of our model.
The simplest way to ensure a positive effective cosmological constant is possible, from Equation (48), would be to impose
and
. However, by Descartes’s Rule of Signs, we would require in this case that
as a necessary and sufficient condition for a positive-
b turning point to exist. The discriminant of the cubic equation can also be shown to be negative for this combination of signs, identifying this as the sole real root. Then, as
, and
is positive, the single turning point at positive-
b will always be a maximum rather than a minimum, making this case of no use. If we instead choose
to ensure that the first turning point at positive
b (should it exist at all) be a minimum, then
is also a necessary but not sufficient condition for the positivity of
V at positive
b values. That is, from (48) we see that
V will only be positive for
. Finally, from arguments again based on Descartes’s Rule, one can only guarantee the existence of a positive turning point in this situation when
, and that there will only be one turning point. The gradient of the effective potential at the point
is
which, for
is always positive. This implies that as there has been a change in sign of the effective potential’s gradient between
and
, the minimum occurs at a
b value in this interval where by (48) the effective cosmological constant will be negative. Conversely, if one imposes
then there will be either zero or two positive-
b turning points. In the case where two exist (excluding the 0-case as uninteresting), the first of these will still be a minimum but will fail to have positive cosmological constant for the same reason as the
case, and the second turning point will be a maximum.
To conclude, the case looks initially promising compared to the other possible numbers of dimensions we have looked at above, but upon a closer analysis, is unable to sustain both a positive-b minimum in the effective potential and a positive-valued effective cosmological constant at that minimum, making it unsuitable to produce a realistic model of our universe with stabilised extra dimensions.
3.3.4. Case of
Similarly to the previous case, the hypergeometric functions have integer arguments in
, and, therefore, the potential can once again be simplified to the form
As expected for an even-
d case, the potential only depends on powers of
and therefor is symmetric under
, and we do not need to worry about the physical meaning of the sign of
b. We can, as before, also find a polynomial from (43) whose solutions are the turning points of this potential, and find that in this case the polynomial in question is quintic. Despite this, however, it is mathematically simpler than the
case as one can see that
is always a solution, and the remaining four solutions are given by the quartic equation
which is really just a quadratic in
, rendering this simpler to analyse than even the
case. Casting this in the form
, the four solutions are
and
. The signs of
and
thus determine if there are 0, 2 or 4 turning points apart from the guaranteed one at
. By comparison of coefficients we find the relations
which suggests that there are two extrema if
and
have different signs (regardless of
), and none if
and
are of the same sign but
is different. In the case where all three parameters share a sign, there may either be four or zero extrema depending on the relative size of the parameters. These equations also imply constraints on the parameter. Multiplying the second of the equations above with
and using the first equation, one obtains a quadratic equation for
and the roots of that equation are real only if
Returning to the turning point at
, the second derivative of the effective potential at this point,
suggests that it will be a maximum if
is positive, and a minimum if
is negative. The former of these is preferable as a
minimum is phenomenologically undesirable, and it further guarantees that if any other extrema exist, at least the one closest to 0 will be a minimum instead. It is then sufficient to confirm the existence of other extrema as a condition for there to be a minimum. A positive Ricci curvature of the extra dimensions in also desirable in that the volume of the extra dimensions can then be thought of as a finite volume of a hypersphere.
Assuming positive
, existence of other turning points would require that at least one of
or
are positive by the above analysis. Another condition comes from the form of the bare potential (which, as
is positive for allowed field values, determines the sign of the effective cosmological constant),
which we desire to be positive. We can see that this will not generally be true but for particular parameter choices, it may be. Numerical investigation reveals that the most promising scenario is when both
and
are positive, as it is under these conditions that one can most readily construct models where a positive cosmological constant minimum is present.
Figure 1 shows a specific example of this with parameters
,
and
. The dashed (purple) line shows the potential as it appears in the Friedmann Equation (57), whereas the solid (green) line shows the effective potential (51). Please note that we consider
(as the scale factor is positive) and that the effective potential is not bounded from below.
Dynamically we expect
b to grow from small values and settle down at the minimum of the effective potential. We believe that the effective potential, derived under the assumption that the time–derivatives of the field
b are negligible, is a good approximation for the case that the field
b is slowly varying, but a detailed study of the full dynamics of
b is beyond the scope of the paper. Finally we note that the effective mass of the field
b, given by
evaluated at the minimum, is much larger than the expansion rate
H, given by Equation (40), as it should. For the values given in
Figure 1, we find
at the minimum.
3.3.5. Case of
In contrast to the previous cases of and , no such simple form for the effective potential exists for as the arguments of the hypergeometric function are now non-integer. The case also has turning points corresponding to the roots of a seventh-order polynomial, which cannot be found in general. While we have no physical reasons to particularly neglect the case of , it also offers no apparent advantages over and which permit much more minimalistic and simple analysis. Qualitatively, we might also expect it to behave more like than on parity grounds. While we note here that numerical studies of might unveil useful behaviour, we do not undertake that work here.
3.4. Inclusion of Matter
So far we have considered a higher dimensional theory of gravity containing quadratic curvature scalars which reduces to a scalar-tensor theory in 4D. Effects resulting from the presence of matter fields in addition to this have been neglected, and while they are beyond the main scope of this work, we will briefly consider them here. First, by directly adding a minimally-coupled (for simplicity) matter term dependent on both coordinates
x and
y to Equation (
1),
we immediately come across a problem in the dimensional reduction procedure in that the integrating-out of
y-dependent terms can no longer proceed fully. Instead we find
such that the mixed
x and
y dependencies of the matter Lagrangian prevent further simplification in the absence of an ansatz further specifying the nature of the matter such as
where
is the Dirac delta function. Proceeding with this ansatz, the procedure of dimensional reduction and identification with a Horndeski theory as in
Section 2.1 would then yield an action
where
is a constant given by
The 4D effective action then contains
x-dependent matter in the form of
fields with a non-minimal coupling to the Horndeski field
b. This implies that our scalar-tensor theory is written in the “Einstein” Frame where the energy momentum tensor associated with the
fields is not covariantly conserved due to explicit interaction with the scalar. That is, variation of the above action gives
where
is a re-defined energy-momentum tensor which, by total energy-momentum conservation, must obey
The bare energy-momentum tensor for
however is independent of
b and only interacts with it via rescaling by a factor of
. We would expect that
Combining this with the explicit covariant differentiation of
:
we would conclude that
. The equation of motion for
b then take the form
In a cosmological background, where the left-hand side produces the usual Klein-Gordon equation terms (whose particular forms are well documented for Horndeski theories), this means that the right-hand side of the equation of motion for
b would be not zero, but instead
while the energy density of the matter field would obey the usual fluid equation
and the rescaled matter energy density corresponding to the energy-momentum tensor
via covariant conservation would obey
where the second equality uses
to rewrite the interaction term also in terms of the
picture. What we see from these calculations is that the presence of matter would in general affect the dynamics of the extra-dimensional scale factor in this theory. As this effect is dependent on the model of the matter fields, including the possibility of more complicated effects arising from non-minimal coupling in the
-dimensional action, we will neglect its presence in the name of simplicity (the bare field theory already contains 6 free parameters) and model independence (not having results depend on a specific realisation of matter) but at the cost of generality. This means our results will only strictly be applicable to situations in which the matter term in Equation (67) is negligible. The inclusion of matter will, however, influence the location of any effective potential minima as well as potentially alter the dynamics of the system which determine whether such minima are in practice reached from physically useful initial conditions or not. We leave an in-depth look at these possibilities for the future, emphasising that the purpose of the present work is to assess the initial feasibility of this theory first in the simplest available context and in turn provide a starting point for a future work which does systematically consider matter interactions.
A further complication to inclusion of matter which is not as immediate as the dynamical effects, particularly if we were to consider a non-minimal matter coupling in the -dimensional action, is that one would have to take into account the point that the conformal rescaling of the metric G would alter the meaning of the function b. In particular, a constant b in one conformal frame does not necessarily mean a constant b in other frames, and one would have to be careful when defining things like “static” extra dimensions to specify in which frame they are static and what the physical meaning of this is. On the other hand, if one were to conformally transform a 4D effective action such as Equation (60) to remove the matter-scalar coupling, one would only have to conformally rescale the metric g to achieve this, not the full metric G, leaving unaltered from its original definition, so it may be possible to avoid such complications.