1. Introduction
In a recent work [
1,
2], we studied a simple
supergravity model having the property of relating the scales of supersymmetry (SUSY) breaking and inflation, partly motivated by string theory. Besides the gravity multiplet, the minimal field content consists of a chiral multiplet with a shift symmetry promoted to a gauged R-symmetry using a vector multiplet. In the string theory context, the chiral multiplet can be identified with the string dilaton (or an appropriate compactification modulus) and the shift symmetry associated to the gauge invariance of a two-index antisymmetric tensor field, dual to a (pseudo)scalar. The shift symmetry fixes the form of the superpotential and gauging allows for the presence of a Fayet-Iliopoulos (FI) term [
3,
4], leading to a supergravity action with two independent parameters, which can be tuned so that the scalar potential possesses a metastable de Sitter minimum with a tiny vacuum energy (essentially the relative strength between the F- and D-term contributions). A third parameter fixes the Vacuum Expectation Value (VEV) of the string dilaton at the desired (phenomenologically) weak coupling regime. An important consistency constraint of the model is anomaly cancellation, which has been studied in [
5] and implies the existence of additional charged fields under the gauged R-symmetry.
In a subsequent work [
6], we analysed a small variation of this model which is manifestly anomaly-free without additional charged fields and allows coupling in a straightforward way of a visible sector containing the minimal supersymmetric extension of the Standard Model (MSSM), and studied the mediation of supersymmetry breaking and its phenomenological consequences. This model has the necessary ingredients to be obtained as a remnant of moduli stabilisation within the framework of internal magnetic fluxes in type I string theory, turned on along the compact directions for several abelian factors of the gauge group. All geometric moduli can, in principle, be fixed in a supersymmetric way, while the shift symmetry is associated to the 4d axion and its gauging is a consequence of anomaly cancellation [
7,
8,
9].
We then made an attempt to connect the scale of inflation with the electroweak and supersymmetry breaking scales within the same effective field theory, that at the same time allows the existence of an infinitesimally small (tuneable) positive cosmological constant, describing the present dark energy of the universe. We thus addressed the question whether the same scalar potential can provide inflation with dilaton playing also the role of inflaton at an earlier stage of universe evolution [
10]. We showed that this is possible if one modifies the Kähler potential by a correction that plays no role around the minimum, but creates an appropriate plateau around the maximum. In general, the Kähler potential receives perturbative and non-perturbative corrections that vanish in the weak coupling limit. After analysing all such corrections, we find that only those that have the form of (Neveu-Schwarz) NS5-brane instantons can lead to an inflationary period compatible with cosmological observations. The scale of inflation turns out, then, to be of the order of low energy supersymmetry breaking, in the TeV region. On the other hand, the predicted tensor-to-scalar ratio is too small to be observed.
Inflationary models [
11,
12,
13] in supergravity
1 suffer, in general, from several problems, such as fine-tuning to satisfy the slow-roll conditions, large field initial conditions that break the validity of the effective field theory, and stabilisation of the (pseudo) scalar companion of the inflaton arising from the fact that the bosonic components of superfields are always even. The simplest argument to see the fine tuning of the potential is that a canonically normalised kinetic term of a complex scalar field
X corresponds to a quadratic Kähler potential
which brings one unit contribution to the slow-roll parameter
, arising from the
proportionality factor in the expression of the scalar potential
V. This problem can be avoided in models with no-scale structure, where cancellations arise naturally due to non-canonical kinetic terms leading to potentials with flat directions (at the classical level). However, such models require often trans-Planckian initial conditions that invalidate the effective supergravity description during inflation. A concrete example where all these problems appear is the Starobinsky model of inflation [
16], despite its phenomenological success.
All three problems above are solved when the inflaton is identified with the scalar component of the goldstino superfield
2, in the presence of a gauged R-symmetry [
21]. Indeed, the superpotential is, in that case, linear and the big contribution to
described above cancels exactly. Since inflation arises at a plateau around the maximum of the scalar potential (hill-top), no large field initial conditions are needed, while the pseudo-scalar companion of the inflaton is absorbed into the R-gauge field which becomes massive, leading the inflaton as a single scalar field present in the low-energy spectrum. This model therefore provides a minimal realisation of natural small-field inflation in supergravity, compatible with present observations, as we show below. Moreover, it allows the presence of a realistic minimum describing our present Universe with an infinitesimal positive vacuum energy, arising due to a cancellation between an F- and D-term contributions to the scalar potential, without affecting the properties of the inflationary plateau along the lines of Refs. [
1,
2,
10,
22].
In the above models, the D-term has a constant FI contribution but plays no role during inflation and can be neglected, while the pseudoscalar partner of the inflaton is absorbed by the
gauge field which becomes massive away from the origin. Recently, a new FI term was proposed [
23] that has three important properties: (1) It is manifestly gauge invariant already at the Lagrangian level; (2) it is associated to a
that should not gauge an R-symmetry; and (3) supersymmetry is broken by (at least) a D-auxiliary expectation value, and the extra bosonic part of the action is reduced in the unitary gauge to a constant FI contribution, leading to a positive shift of the scalar potential in the absence of matter fields. In the presence of matter fields, the FI contribution to the D-term acquires a special field dependence
that violates invariance under Kähler transformations.
In a recent work [
24], we studied the properties of the new FI term and explored its consequences on the class of inflation models we introduced in [
21].
3 We first showed that matter fields charged under the
gauge symmetry can consistently be added in the presence of the new FI term, as well as a non-trivial gauge kinetic function. We then observed that the new FI term is not invariant under Kähler transformations. On the other hand, a gauged R-symmetry in ordinary Kähler invariant supergravity can always be reduced to an ordinary (non-R)
by a Kähler transformation. By then going to such a frame, we find that the two FI contributions to the
D-term can coexist, leading to a novel contribution to the scalar potential.
The resulting D-term scalar potential provides an alternative realisation of inflation from supersymmetry breaking, driven by a D-instead of an F-term. The inflaton is still a superpartner of the goldstino, which is now a gaugino within a massive vector multiplet, where again the pseudoscalar partner is absorbed by the gauge field away from the origin. For a particular choice of inflaton charge, the scalar potential has a maximum at the origin where inflation occurs and a supersymmetric minimum at zero energy, in the limit of negligible F-term contribution (such as in the absence of superpotential). The slow roll conditions are automatically satisfied near the point where the new FI term cancels the charge of the inflaton, leading to higher than quadratic contributions due to its non trivial field dependence.
The Kähler potential can be canonical, modulo the Kähler transformation that takes it to the non R-symmetry frame. In the presence of a small superpotential, the inflation is practically unchanged and driven by the D-term, as before. However, the maximum is now slightly shifted away from the origin and the minimum has a small non-vanishing positive vacuum energy, where supersymmetry is broken by both F- and D-auxiliary expectation values of similar magnitude. The model predicts, in general, small primordial gravitational waves with a tensor-to-scalar ration r well below the observability limit. However, when higher order terms are included in the Kähler potential, one finds that r can increase to large values .
An interesting point on these models of inflation by supersymmetry breaking concerns the general problem of the fine-tuning of initial conditions (see, e.g., [
26]). Here, the initial point of inflation is special since it corresponds to a restoration of the R-symmetry.
In the following, we will present the main features of these models, where inflation occurs near the maximum of the scalar potential where R-symmetry is restored and supersymmetry breaking is driven predominantly either by an F-term or by a D-term. The first part describing the F-term is a short summary of an earlier review [
27] which was added for self consistency and convenience of the reader.
2. Conventions and Preliminaries
Throughout this paper, we use the conventions of [
28]. A supergravity theory is specified (up to Chern-Simons terms) by a Kähler potential
, a superpotential
W, and the gauge kinetic functions
. The chiral multiplets
are enumerated by the index
and the indices
indicate the different gauge groups. Classically, a supergravity theory is invariant under Kähler tranformations, viz.
where
is the inverse of the reduced Planck mass,
TeV. The gauge transformations of chiral multiplet scalars are given by holomorphic Killing vectors, that is,
, where
is the gauge parameter of the gauge group
A. The Kähler potential and superpotential need not be invariant under this gauge transformation, but can change by a Kähler transformation
provided that the gauge transformation of the superpotential satisfies
. One then has, from
where
and
labels the chiral multiplets.
The scalar potential is given by
where W appears with its Kähler covariant derivative
The moment maps
are given by
Here, we are concerned with theories having a gauged R-symmetry, for which
is given by an imaginary constant
. In this case,
is a Fayet-Iliopoulos [
3,
4] constant parameter.
We consider a class of inflationary models in supergravity containing a single chiral multiplet transforming under a gauged R-symmetry with a corresponding abelian vector multiplet [
21]. We assume that the chiral multiplet
(with scalar component
X) transforms as:
where
q is its charge and
is the gauge parameter.
The Kähler potential is therefore a function of
, while the superpotential is constrained to be of the form
:
where
X is a dimensionless field. For
, the gauge symmetry Equation (
7) becomes a gauged R-symmetry. The gauge kinetic function can have a constant contribution, as well as a contribution proportional to
The latter contribution proportional to
is not gauge invariant, and can be used as a Green-Schwarz counter term to cancel possible anomalies. It turns out, however, that the constant
is very small by anomaly cancellation conditions and does not change our results [
21]. We will therefore omit this term in our analysis below.
Before performing our analysis, a distinction should be made concerning the initial point where slow-roll inflation starts. The inflaton field (which will turn out to be , where ) can either have its initial value close to the symmetric point where , or at a generic point . The minimum of the potential, however, is always at a nonzero point . This is because, at , the negative contribution to the scalar potential vanishes and no cancellation between F-term and D-term is possible. The supersymmetry breaking scale is related to the cosmological constant as . We will therefore assume that inflation starts near , and the inflaton field rolls towards a minimum of the potential at .
5. The Scalar Potential in a Non R-Symmetry Frame
In this section, we work in the Kähler frame where the superpotential does not transform, and take into account the two types of FI terms which were discussed in the last section. For convenience, we repeat here the Kähler potential in Equation (
36) and restore the inverse reduced Planck mass
:
The superpotential and the gauge kinetic function are set to be constant
10:
After performing a change of the field variable
, where
and setting
, the full scalar potential
is a function of
. The F-term contribution to the scalar potential is given by
and the D-term contribution is
Note that we rescaled the second FI parameter by
. We consider the case with
, as we are interested in the role of the new FI-term in inflationary models driven by supersymmetry breaking. Moreover, the limit
is ill-defined [
23].
The first FI parameter
b was introduced as a free parameter. We now proceed to narrowing the value of
b by the following physical requirements. We first consider the behaviour of the potential around
,
Here we are interested in small-field inflation models, in which the inflation starts in the neighbourhood of a local maximum at
. In [
21], we considered models of this type with
(which were called Case 1 models), and found that the choice
is forced by the requirement that the potential takes a finite value at the local maximum
. Now, we will investigate the effect of the new FI parameter
on the choice of
b under the same requirement.
First, in order for to be finite, we need . We first consider the case . We next investigate the condition that the potential at has a local maximum. For clarity, we discuss below the cases of and separately. The case will be treated at the end of this section.
5.1. Case
In this case,
and the scalar potential is given by only the D-term contribution
. Let us first discuss the first derivative of the potential:
For
to be convergent, we need
(note that
). When
, we have
, which does not give an extremum as we chose
. On the other hand, when
, we have
. To narrow the allowed value of
b further, let us turn to the second derivative,
When , the second derivative diverges. When , the second derivative becomes , which gives a minimum.
We therefore conclude that to have a local maximum at
, we need to choose
, for which we have
the condition that
is a local maximum requires
.
Let us next discuss the global minimum of the potential with
and
. The first derivative of the potential without approximation reads
Since
for
and
, the extremum away from
is located at
satisfying the condition
Substituting this condition into the potential gives .
We conclude that, for
and
, the potential has a maximum at
, and a supersymmetric minimum at
. We postpone the analysis of inflation near the maximum of the potential to
Section 6, and the discussion of the uplifting of the minimum, in order to obtain a small but positive cosmological constant below. In the next subsection, we investigate the case
.
We finally comment on supersymmetry breaking in the scalar potential. Since the superpotential is zero, the SUSY breaking is measured by the D-term order parameter, namely the Killing potential associated with the gauged
, which is defined by
This enters the scalar potential as . So, at the local maximum and during inflation, is of order q and supersymmetry is broken. On the other hand, at the global minimum, supersymmetry is preserved and the potential vanishes.
5.2. Case
In this section, we take into account the effect of
; its first derivative reads:
For
to be convergent, we need
, for which
holds. For
, we have
, which does not give an extremum. For
, we have
. To narrow the allowed values of
b further, let us turn to the second derivative:
For , the second derivative diverges. For , the second derivative is positive , which gives a minimum (note that as well in this range).
We conclude that the potential cannot have a local maximum at
for any choice of
b. Nevertheless, as we will show below, the potential can have a local maximum in the neighbourhood of
if we choose
and
. For this choice, the derivatives of the potential have the following properties:
The extremisation condition around
becomes
Note that the extremum is in the neighbourhood of
, as long as we keep the
F-contribution to the scalar potential small by taking
, which guarantees the approximation-ignoring higher order terms in
. We now choose
, so that
for this extremum is positive. The second derivative at the extremum reads
as long as we ignore the higher order terms in
. By our choice
, the extremum is a local maximum, as desired.
Let us comment on the global minimum after turning on the F-term contribution. As long as we choose the parameters so that
, the change in the global minimum
is very small (of order
), because the extremisation condition depends only on the ratio
. So, the change in the value of the global minimum is of order
. The plot of this change is given in
Figure 2.
In the present case
, the order parameters of SUSY breaking are both the Killing potential
and the F-term contribution
, which read
where the F-term order parameter
is defined by
therefore, at the local maximum,
is of order
because
at that point is of order
. On the other hand, at the global minimum, both
and
are of order
, assuming that
at the minimum is of order
, which is true in our models below. This makes tuning of the vacuum energy between the F- and D-contribution, in principle, possible, along the lines of [
10,
21].
A comment must be made here on the action in the presence of non-vanishing
F and
. As mentioned above, the supersymmetry is broken both by the gauge sector and by the matter sector. The associated goldstino therefore consists of a linear combination of the
gaugino and the fermion in the matter chiral multiplet
X. In the unitary gauge the goldstino is set to zero, so the gaugino is not vanishing and the action does not simplify as in Ref. [
23]. This, however, only affects the part of the action with fermions, while the scalar potential does not change. This is why we nevertheless used the scalar potential (
42) and (
43).
Let us consider the case
, where only the new FI parameter
contributes to the potential. In this case, the condition for the local maximum of the scalar potential at
can be satisfied for
. When
F is set to zero, the scalar potential (
43) has a minimum at
. In order to have
, we can choose
. However, we find that this choice of parameter
does not allow slow-roll inflation near the maximum of the scalar potential. Similar to the previous model of
Section 3, it may be possible to achieve both the scalar potential satisfying slow-roll conditions and a small cosmological constant at the minimum, by adding correction terms to the Kähler potential and turning on a parameter
F. However, here, we will focus on the
case where, as we will see shortly, less parameters are required to satisfy the observational constraints.
6. Application in Inflation
We recall that in the models we described in
Section 3, the inflaton is identified with the sgoldstino, carrying a
charge under a gauged R-symmetry and inflation occurs around the maximum of the scalar potential, where the
symmetry is restored, with the inflaton rolling down towards the electroweak minimum. These models avoid the so-called
-problem in supergravity by taking a linear superpotential,
. In contrast, here we will consider models with two FI parameters
in the Kähler frame, where the
gauge symmetry is not an R-symmetry. If the new FI term
is zero, these models are Kähler equivalent to those with a linear superpotential (Case 1 models with
). The presence of non-vanishing
, however, breaks the Kähler invariance (as discussed above). Moreover, the FI parameter
b cannot be 1, but is forced to be
, according to the argument in
Section 5. So, the new models do not seem to avoid the
-problem. Nevertheless, we will show below that this is not the case, and the new models with
avoid the
-problem thanks to the other FI parameter
—which is chosen near the value at which the effective charge of
X vanishes between the two FI-terms. Inflation is again driven from supersymmetry breaking, but from a D-term rather than an F-term as we had before.
6.1. Example for Slow-Roll D-Term Inflation
In this section, we focus on the case where and derive the condition that leads to slow-roll inflation scenarios, where the start of inflation (or, horizon crossing) is near the maximum of the potential, at . We also assume that the scalar potential is D-term dominated by choosing , for which the model has only two parameters, namely q and . The parameter q controls the overall scale of the potential, and will be fixed by the amplitude of the CMB data. The only free parameter left over is , which can be tuned to satisfy the slow-roll condition.
In order to calculate the slow-roll parameters, we need to work with the canonically normalised field
defined by Equations (
20) and (
21). Since we assume inflation to start near
, the slow-roll parameters for small
can be expanded as
Note that
is negative when
. We can, therefore, tune the parameter
to avoid the
-problem. The observation is that at
, the effective charge of
X vanishes and thus the
-dependence in the D-term contribution (
43) becomes of quartic order.
For our present choice
, the potential and the slow-roll parameters become functions of
and the slow-roll parameters for small
read
Note that we obtain the same relation between
and
as in the model of inflation from supersymmetry breaking, driven by an F-term from a linear superpotential and
(see Equation (
22)). Thus, there is a possibility to have a flat plateau near the maximum that satisfies the slow-roll condition and at the same time a small cosmological constant at the minimum nearby.
The number of e-folds
N during inflation is determined by
where we choose
. Notice that the slow-roll parameters for small
satisfy the simple relation
by Equation (
61). Therefore, the number of e-folds between
and
(
) takes the following simple approximate form, exactly as in the previous model of F-term inflation described in
Section 3.
as long as the expansions in (
61) are valid in the region
. Here, we also used the approximation
, which holds in this approximation.
We can compare the theoretical predictions of our model to the observational data, via the power spectrum of scalar perturbations of the CMB, namely the amplitude
, tilt
, and the tensor-to-scalar ratio of primordial fluctuations
r. These are written in terms of the slow-roll parameters:
where all parameters are evaluated by the field value at horizon crossing
. From the relation of the spectral index above, one should have
, and thus Equation (
63) gives approximately the desired number of e-folds when the logarithm is of order one. Actually, using this formula, we can estimate the upper bound of the tensor-to-scalar ratio
r and the Hubble scale
, following the same argument given in
Section 3; that is, the upper bounds are given by computing the parameters
assuming that the expansions (
61) hold until the end of inflation. We then get the bound
where we used
,
, and
, which are consistent with our models. In the next subsection, we will present a model which gives a tensor-to-scalar ratio bigger than the upper bound above, by adding some perturbative corrections to the Kähler potential.
As an example, let us consider the case where
By choosing the initial condition
and
, we obtain the results
,
,
, and
, which are within the
-region of the Planck’15 data [
24].
As was shown in
Section 5.1, this model has a supersymmetric minimum with zero cosmological constant as
F was chosen to be zero. One possible way to generate a non-zero cosmological constant at the minimum is to turn on the superpotential
, as mentioned in
Section 5.2. In this case, the scale of the cosmological constant is of order
. It would be interesting to find an inflationary model which has a minimum at a tiny tuneable vacuum energy with a supersymmetry breaking scale consistent with low energy particle physics.
6.2. A Small Field Inflation Model from Supergravity with Observable Tensor-To-Scalar Ratio
While the results in the previous example agree with the current limits on
r set by Planck, supergravity models with higher
r are of particular interest. In this section we show that our model can attain large
r, at the price of introducing some additional terms in the Kähler potential. Let us consider the previous model with additional quadratic and cubic terms in
:
while the superpotential and the gauge kinetic function remain as in Equation (
41). We now assume that inflation is driven by the D-term, setting the parameter
. In terms of the field variable
, we obtain the scalar potential:
We, thus, have two more parameters
A and
B. This does not affect the arguments of the choices of
b in the previous sections, because these parameters appear in higher orders in
in the scalar potential. So, we consider the case
. The simple formula (
63) for the number of e-folds for small
also holds, even when
are turned on, because the new parameters appear at order
and higher. To obtain
, we can choose, for example,
By choosing the initial condition
and
, we obtain the results
,
,
, and
, which agree with the Planck’15 data, as shown in
Figure 3.
In summary, in contrast to the model in
Section 3, where the F-term contribution is dominant during inflation, here inflation is driven purely by a D-term. Moreover, a canonical Kähler potential (
40) together with two FI-parameters (
q and
) is enough to satisfy the Planck’15 constraints, and no higher-order correction to the Kähler potential is needed. However, to obtain a larger tensor-to-scalar ratio, we have to introduce perturbative corrections to the Kähler potential, up to cubic order in
(i.e., up to order
). This model provides a supersymmetric extension of the model [
47], which realises large
r at small field inflation, without referring to supersymmetry.