1. Introduction
Variational formulations for continuum fields represent a fertile subject of research which pertains to both their classical description as well as the formulation of related quantum theories. The issue becomes particularly intriguing in the case of the gravitational field because of some crucially-important open questions arising both in the framework of the so-called standard formulation of general relativity (SF-GR), i.e., the formulation of GR associated with the Einstein field equations (EFE), customarily named as the metric formulation of GR [
1,
2,
3], and corresponding theories of quantum gravity (QG). In this regard, it is generally agreed that any physical theory of gravity worthy of this name, either classical (in particular variational formulations of GR) or quantum, should satisfy the following three basic requirements:
It should fulfill the principle of general covariance (PGC) in arbitrary GR-frames that are related by means of local point transformations (see discussion below in
Section 2).
It should have the goal of determining the structure of space-time, based on the identification of a Riemannian differential manifold associated with it. In particular, this refers to the prescription of its local metric tensor, namely the so-called background metric field tensor, to be represented with respect to an arbitrary GR-frame.
It should determine the Hamiltonian structures, both classical and quantum ones, respectively, realized in the frameworks of SF-GR and QG, both associated with the Einstein field equations (EFE).
The crucial issue is how precisely these requirements can be satisfied and implemented. In this regard, however, further important features must be taken into account that pertain to the physically admissible tensor representations for Lagrangian and equivalent Hamiltonian theories of GR. These include the properties of objectivity, background independence and gauge symmetries.
This paper is the first part of a two-paper investigation, the present one being devoted to variational theories of classical gravity, while leaving to the second part the treatment of QG. The motivations for splitting the treatment of the two subject areas are several, including that:
To achieve an admissible candidate for a QG theory worth of this name, the constructions of a, possibly non-unique, Hamiltonian representation for GR, namely of EFE, is required. The crucial characteristics of the same representation that are consistent with the basic principles of GR will be investigated.
Besides PGC, the consequences of the principle of manifest covariance (PMC) [
4] and its relationship with the principle of objectivity must be addressed. The latter principle concerns the determination of the classical background space-time and the prescription of the appropriate variational treatments to be adopted for the gravitational field.
Furthermore, comparisons with literature are necessary both in GR and QG and must be discussed separately, given the different and peculiar properties of currently available theories.
Regarding specifically GR, a basic conceptual remark in GR refers to the issue of whether and under what conditions (i.e., if any) the validity of PGC in the same context implies or is equivalent to the validity of PMC. Here we wish to point out that if such a conjecture indeed applies, it means that it should always be possible to cast all physical laws, including all the corresponding relevant dynamical variables, in manifestly covariant form, i.e., in an equivalent tensor form. A related issue then concerns the prescription of the background metric tensor. In fact, two possible choices emerge from the literature. Accordingly, the background metric tensor can be either identified with a classical physical observable (i.e., a uniquely prescribed tensor field) or a dynamical variable. Nevertheless, the construction of the theory should not depend on its particular choice (background independence).
Some of the subjects indicated above, which also have much to do with possible alternative representations of the theory of classical and quantum gravity, have already been discussed elsewhere (see, in particular, Refs. [
5,
6]). For this reason, some of the main outcomes displayed in the same references are adopted as reference background material, particularly those regarding aspects of the Einstein–Hilbert (EH) variational theory of SF-GR, conditions of validity of the ADM Hamiltonian theory (Arnowitt, Deser and Misner [
7,
8]) and the
tensor manifestly-covariant Hamiltonian theory of classical gravity referred to as CCG-theory [
6,
9,
10].
In order to proceed consistently, the principles of covariance and manifest covariance in GR are first recalled in connection with the tensorial representations of physical laws, the establishment of their objective character and the definition of classical physical observables. The following related questions arise in this connection.
First question: is there a manifestly covariant Hamiltonian theory of GR which can be based on the EH-action principle? The issue concerns the construction of a Hamiltonian theory of GR, for which, historically, a famous starting point is provided by the EH theory for SF-GR, i.e., the EH-action principle for EFE. In particular, the first inquiry that is posed is whether such a theory can be set in manifestly covariant form, i.e., it is expressible in some suitable tensor form, so that it may identically satisfy the requirement of invariance in form with respect to arbitrary local coordinate (point) transformations. By analyzing in detail the precise nature of the functional setting required for the validity of the EH action principle, we intend to show that the answer to such a question is negative. In other words, in the functional setting required by the EH-action principle, there is no tensor nor tensor-density Hamiltonian approach for EFE to be based on the EH-action principle.
Second question: is there a non-trivial manifestly-covariant and
tensor Hamiltonian theory of GR? The issue is whether an alternative variational approach exists which, contrary to the EH case, is capable of leading to a unique
tensor Hamiltonian representation of GR and which is also manifestly covariant. The task must be reached consistent with the DeDonder–Weyl formalism of the continuum field dynamics variational formulation, in which Lagrangian coordinates and a corresponding phase-space Hamiltonian state are realized only in terms of
tensors [
11,
12,
13,
14,
15,
16,
17]. Such an approach is shown to exist and to coincide with CCG theory, which satisfies the physical requirements set by the principle of objectivity. The latter demands that it should be possible to identify physical observables with objective observables, namely that such a character must be unique and independent of a particular realization of the GR reference frame where the same observables are evaluated or measured. Necessarily, this notion can only be mathematically formulated after the identification of the metric tensor of the background space-time solution of the Einstein field equations and with respect to which the tensorial properties of a theory and physical observables are established. As shown below, the objectivity principle is therefore strictly related to the principles of general and manifest covariance, both satisfied by CCG theory. It is shown, in fact, that such a theory can be achieved thanks to the adoption of a suitably defined functional setting and the notion of the synchronous variational principle, while within the same approach, a non-trivial Hamiltonian representation of this type can be obtained for GR.
Third question: is there an extended-variable and manifestly covariant tensor Hamiltonian theory of GR? This question, which is closely related to the previous one, concerns the (still unsolved) issue of whether there may also exist independent extended-type variational approaches for EFE, namely based on an extended set of Lagrangian independent variables, to be suitably identified, and in terms of which a corresponding extended-variable manifestly covariant Hamiltonian approach can be achieved.
The first question indicated above may apply, in principle, to the majority of the literature’s variational approaches known so far. These include, in particular, the Dirac and ADM approaches, as well as the so-called covariant canonical gauge approach [
18]. In this regard, with the exception of CCG theory, almost all previous approaches share a unique feature of the EH-action principle, namely—as clarified below—that of being all based on the adoption of an asynchronous action principle. This means, in other words, that they adopt variational principles in which the relevant space-time volume element is considered a function of the variational space-time metric field tensor (see discussion below in
Section 2). The main differences characterizing CCG theory arise, however, because of: (a) the prescription of the functional setting for the variational tensor functions which are adopted, denoted here as
and not to be identified with metric tensors; (b) the synchronous type of action variational principle adopted; (c) the further requirement concerning the precise prescription of the gauge invariance property to be adopted in this context. In fact, a natural choice should follow by analogy with the corresponding well-known flat space-time gauge theories available for continuum fields. Nevertheless, it is well-known that such a property is not met by the majority of variational GR approaches to be found in the literature. As a consequence, for the appropriate treatment of the subject posed in this paper, the investigation must necessarily rely on the
tensor manifestly covariant Hamiltonian formulation of classical gravity (CCG-theory) established in Refs. [
5,
9,
10], which in turn is at the basis of the corresponding covariant quantum gravity theory (CQG-theory) [
19,
20].
In detail, the contents of the paper deal with a number of conceptual issues and related goals, which are discussed below according to the following scheme. In
Section 2, the possible non-unique realizations of the property of manifest covariance, i.e., invariance in form, implied by the general covariance principle (GCP) is addressed. It is shown that possible realizations of the notion of invariance in form can be in principle obtained either by
tensor or
tensor-density transformation laws. In particular, the principle of objectivity (PO) is formulated, and the existence of the space-time of the universe (background space-time) is pointed out. Such a space-time is shown to be characterized by prescribed geometric properties, i.e., a background metric tensor
and corresponding background Ricci tensor
.
In
Section 3, an objective prescription of the variational action principle for GR is achieved. It is shown that validity of PO necessarily requires the introduction of a suitable functional setting and the adoption of an appropriate synchronous variational principle for the fundamental action principle of GR. In such a setting, the variational tensor field
is not treated as a metric tensor by itself, being instead regarded as a tensor function prescribed with respect to the metric field tensor of the background space-time
. This property corresponds effectively to a realization of a variational principle of GR in terms of superabundant variables. In particular, it is shown that a unique prescription can be obtained for the invariance properties of the fundamental action functional, while validity of PO warrants that manifest covariance can only be realized in terms of a
scalar variational Lagrangian and
tensor canonical fields.
For this purpose, various different synchronous Lagrangian variational principles are presented. In particular, in this context, the topic will be recalled that concerns the construction of a manifestly covariant Lagrangian dynamical equation for the variational tensor field , which recovers exactly EFE under suitable prescription of the extremal functions. The resulting Lagrangian action principle, in particular, appears significant because it permits the construction of a corresponding manifestly covariant Hamiltonian treatment of SF-GR.
Furthermore, in
Section 4, the possible existence of an extended set of independent Lagrangian variables is investigated, which can generalize the customary identification in terms of the tensor
alone, but remain nevertheless associated only with the same EFE. It is proved that this task can be successfully met when the same Lagrangian variables are identified with the set of independent variational coordinates
, with
and
denoting, respectively, the variational tensor field and the variational Ricci tensor field. The corresponding Lagrangian formulation of EFE is referred to as the “metric-Ricci" Lagrangian variational theory. This result permits us to prove in turn in
Section 5 that a manifestly covariant formulation of the “metric-Ricci” Hamiltonian theory of GR can be equally achieved. In such a case, the canonical variables are identified with the independent Lagrangian coordinates, represented by the set of variational tensor fields
together with corresponding extended conjugate canonical momenta, namely realized in agreement with the DeDonder–Weyl formalism. As a final issue, a fundamental gauge property that must apply for consistency to any variational theory of continuum fields is proved to also hold for the variational treatment of GR only, provided one adopts synchronous variational principles, i.e., consistent with the principle of manifest covariance.
2. The General Covariance Principle
The General Covariance Principle (GCP), which is set at the basis of GR, represents a theoretical cornerstone common to all variational approaches to EFE. In theoretical physics (following Weinberg, 1972 [
21]), the notion of general covariance, also known as diffeomorphism covariance, is well-known. It consists of the realization of a suitable condition of invariance in form for the relevant physical laws, or more generally, the condition of (simple) covariance, with respect to suitable coordinate transformations. However, such coordinate transformations are not completely arbitrary since they must be intended as diffeomorphisms of the space-time in itself, i.e., local transformations which preserve the space-time structure. This requires, therefore, prescribing both the said space-time structure and stating explicitly the local nature of the same transformations for such a purpose. Thus, let us assume for definiteness that the space-time is represented by a Riemannian differential manifold of the type
, with
being the
dimensional real vector space
representing the space-time and
being a real and symmetric metric tensor which is parametrized with respect to a coordinate system (or GR-frame)
. Then, the same coordinate transformations, denoted as local point transformations (or briefly LPT), must preserve the structure of space-time, i.e., they must be realized by local and differentiable bijections of the form
with inverse
characterized by a non-singular Jacobian matrix
. Thus,
and
are arbitrary points belonging to the initial and transformed space-time structures
and
, respectively. By construction, the same space-time structure is preserved under the LPT-group (the group of local point transformations of the type (
1)), so that actually
, while the metric tensors
and
transform in each other in accordance with the appropriate tensor (i.e.,
tensor) transformation laws. For definiteness, denoting
as the same symmetric metric
tensor expressed in the transformed coordinates
, by construction, it follows that:
- (a)
The Riemann distance in the two space-times and is the same; namely, it is realized by means of a scalar, so that .
- (b)
The fields
and
are
tensors. Hence, their covariant components
and
are related via the corresponding covariant
tensor transformation laws. In tensor and symbolic form, the direct and inverse transformations
and
read, respectively,
where
and
are the direct and inverse Jacobian matrices and, for brevity, “•” denotes here a symbolic matrix product.
- (c)
The tensor fields
and
are metric tensors so that they are required to satisfy the orthogonality conditions
- (d)
Finally, the Ricci and Riemann tensors
and
, which are associated, respectively, with the two structures
and
, are transformed in each other in accordance with the covariance
tensor transformation laws indicated above by Equations (
4) and (
8).
Coming now to the specific realization of GCP, all covariant physical laws should hold in arbitrary GR-frames. Of course this does not mean that they should necessarily take the same functional form in all GR-frames (as corresponds to the notion of simple covariance). This means that an arbitrary non-
tensor smooth real function
represented with respect to
will generally transform with respect to the LPT-group (
1)–(
2) in such a way that its corresponding transformed function
represented with respect to
takes the form
, with
denoting a real and smooth function of
and
generally different from
.
2.1. The Principle of Manifest Covariance and Its Extension
In this section we define the precise meaning of manifest covariance and its possible extension. As we intend to show here, the two notions depend actually on the precise representation chosen for the field variables and physical observables in GR, to be all considered here as purely classical ones.
One possible realization of the notion of invariance in form/manifest covariance is provided by the so-called
principle of manifest covariance (PMC) first introduced in Ref. [
4]. According to such a principle, it should always be possible to cast all physical laws in
tensor form, i.e., in manifestly covariant form. It is obvious that PMC implies the validity of GCP. The proof is elementary. In fact, let us assume that the space-time is of the form
. Then, according to PMC, it should always be possible to cast all physical laws in
tensor form, i.e., to be expressed equivalently, just as the metric
tensor (see Equation (
3)), in covariant or counter-variant forms. Indeed, all tensor indexes are necessarily raised and lowered by its countervariant and covariant components
and
, respectively. Thus, for example, let us consider the case in which a given (arbitrary) set of physical laws is expressed by the covariant
tensor equations
Then, in terms of the local point transformation (
1)–(
2), it follows that, by assumption, the same observables
must transform according to the covariant
tensor transformation laws
As a consequence, the transformed
tensor equations become
and are therefore invariant in form, since they take exactly the same form as Equation (
7).
We stress, however, that in principle, the covariant
tensor transformation laws (
8) may not be the only possible realization consistent with the notion of invariance in form, i.e., equation of the type (
9). Thus, for example, introducing the determinant of the Jacobi matrix
, another possible realization can be achieved assuming that at least some of the observables
, identify
tensor-densities of order 0 to
n. This requires assuming for them a
covariant tensor-density transformation law, i.e., of the form
We also stress that in this case, the transformed equations recover again the form (
9) and can therefore also be considered as invariant in form. The conclusion drawn here is, therefore, that the notion of manifest covariance can be conveniently extended to include, besides
tensors,
tensor-densities as well. This notion will be referred to as extended manifest covariance. Conversely, we shall refer to the extended principle of manifest covariance (EPMC) as the one in which all physical laws are cast either in
tensor or in
tensor-density forms.
2.2. The Principle of Objectivity
From the discussion above, it might appear that, in the case of arbitrary classical physical dynamical variables and/or observables, there is no mandatory physical requirement for the occurrence of either
tensor or
tensor-density transformation properties (i.e., either Equations (
8) or (
10)). In fact, even if some of them are certainly
tensors, such as the variational metric tensor, it is not obvious nor necessary why all observables should be such.
One such example, in the context of GR, is represented by the structure of space-time (universe) , which also lays at the basis of PGC and PMC and is associated with the background metric tensor . In this regard, there are two possible viewpoints.
The first one is based on the
principle of objectivity and is called an
objective viewpoint. For definiteness, the principle can be stated here as follows: in GR, the classical background metric tensor
, which determines the structure of the space-time
, should be “objective in character”. In other words, it should be realized by a classical observable, namely a
tensor field that can actually be measured locally in terms of a classical measurement. Hence, it should always be possible to measure it by setting an arbitrary GR-frame and by means of a classical (ideal) measurement experiment. In particular, within the standard formulation of GR (SF-GR) [
6], the same background metric tensor should coincide with a particular solution of EFE (i.e., subject to appropriate boundary conditions). The same
tensor field, which can be locally measured by means of a classical (ideal) measurement experiment, determines the structure of the space-time of the universe (or background space-time).
We notice, however, that an alternate viewpoint is also still possible, i.e., in which the background metric tensor is considered variational, namely treated as a non-observable tensor field. Thus, in contrast, this will be referred to as a non-objective viewpoint. The two viewpoints actually lead to two different possible variational formulations of GR to be reviewed in the next Section. However, what we intend to point out in this paper is that only the first one (i.e., the objective viewpoint) actually permits us to achieve a manifestly covariant Hamiltonian representation for GR.
Before closing, it is worth summarizing the key consequences of the objective viewpoint:
Consequence #1—the validity of the Objectivity Principle implies, therefore, the existence of a classical background space-time, characterized by a prescribed background metric tensor .
Consequence #2— all observables depending on
must be regarded as objective observables as well. Equivalently, the geometry of the background space-time
depends uniquely on
. This implies, in particular, that the Riemann distance and the invariant
volume element of the background space-time should be unique functions of
, namely of the form
Similarly, the space-time Riemann and Ricci
tensors associated with the same background space-time, i.e.,
should be regarded as objective observables. As a consequence, the covariant derivative of
must vanish, since identically the metric compatibility conditions must hold, namely
where
, with
, and
as the standard connections (Christoffel symbols) and Ricci tensor expressed in terms of the background metric tensor
. All such observables determined in terms of the background metric tensor
therefore coincide with background or extremal tensor fields. In particular, this means that the Riemann and Ricci tensors are necessarily
tensors with respect to the background space-time
. Hence, the objectivity principle implies the existence of a background space-time, characterized by prescribed geometric properties, i.e., a background metric tensor
and corresponding background Ricci tensor
.
Consequence #3— In the following, we intend to prove that, once the objective viewpoint is taken, then necessarily all relevant dynamical variables and physical observables can always be identified with tensors. This will be referred to here as the universal tensor property of classical field variables.
4. Extended Lagrangian Variables: The Metric-Ricci Action Principle
Here we address the issue of completeness for the manifestly-covariant Lagrangian description of GR: namely whether there may exist additional independent Lagrangian variables, possibly associated with classical treatment of covariant gravity theory. The complete set of Lagrangian variables determined in this way will be called extended Lagrangian variables. Here, we claim that a possible variational formulation exists in which both the tensor field
(not a metric tensor) and the Ricci tensor
are considered variational and endowed with independent tensor variations
and
. A positive answer requires, however, that the Einstein field equations should remain unchanged, i.e., that the extremal values of
and
are mutually related via the standard connections. In order to develop a synchronous variational formulation of the type considered above, this involves adopting the varied tensor fields
, hereon referred to briefly as metric and Ricci variational fields, belonging to the
metric-Ricci functional setting (
A4) (see again
Appendix A), with the extremal tensor fields
to be properly prescribed. Here, the variations
and
are both required to vanish on the improper boundary of
. Notice that here, the tensor indexes of both variation fields
and
are raised and lowered by the countervariant and covariant components of the background metric tensor
, while
and
, i.e., the variations of the tensor fields
and
, are assumed to be linearly independent. In addition, the standard connections (Christoffel symbols)
are again considered functions of the background metric tensor
. It follows that the covariant derivatives of
defined in terms of the standard connection are generally non-vanishing (see Equations (
40) and (
41)). Then, a possible realization of the variational Lagrangian function also applicable in the case of non-vacuum fields is provided by the coupled
metric-Ricci (extended) variational 4-scalar Lagrangian field, i.e., a linear combination (via a suitable dimensionless and real coupling constant
) of two Lagrangians, respectively, of the
dependent Lagrangian defined above, namely
given by Equation (57), and suitably effective
dependent Lagrangian,
, namely
with
denoting a
scalar coupling coefficient, and with
being defined here as follows:
Notice that
and
denote now the effective kinetic and potential energies
Thus, in particular,
and
identify, respectively,
while the rest of the notations are standard. In the case of non-vanishing coupling coefficient
, the corresponding variational principle
which is assumed to hold for arbitrary independent tensor variations
, requires the following Frechet derivatives to vanish identically:
These equations represent the Euler–Lagrange equations associated with Equation (
65). Due to the linear independence of the synchronous variations
and
, these can be written in terms of the variational derivatives. Upon letting
then Equations (
66) and (
67) deliver the metric-Ricci Euler–Lagrange equations
The proof of the statement is an immediate consequence of the extended (synchronous) Lagrangian formalism here pointed out. In fact, from Equation (
59), we first notice that the variational field
is only carried by the Lagrangian
, while the variational field
is only contained in
, with the consequence that the two equations in (
69) are effectively decoupled. In addition, we have that identically, when evaluated for extremal fields, the new Ricci contribution is vanishing, namely
The explicit evaluation of the Euler–Lagrange equation for
is then exactly the same as the one outlined in
Section 3.3 and
Section 3.4. On the other hand, one can readily see that the mathematical structure of the two Lagrangians
and
is the same for their respective variational fields, so that both contain the same type of quadratic kinetic term and analogous potential contributions. Based on these considerations, thanks to the validity of the customary Lagrangian formalism for continuum fields, one can immediately reach the proof of the formal representation of the Euler–Lagrange equation for
(i.e., the second equation in (
69)).
We notice that by construction,
. If one requires additionally that
should hold identically as well, this means that
must actually depend on
, such as, for example, in the case in which the background metric tensor identifies the de Sitter solution. From the Einstein field Equation (
35), it follows, therefore, that the extremal stress-energy tensor
must satisfy the constraint condition
Such a requirement is, of course, trivially satisfied in a vacuum. However, subject to the requirement
, it may also be fulfilled in the case of gravitational external sources. In such a case, it follows that the Lagrangian Equation (
69) reduces identically to
Again, the proof of the first equation in (
73) is exactly similar to
Section 3.3. Instead, the second equation in (
73) represents the new theoretical goal of the formalism developed in the present section and deserves a separate proof. The latter can be obtained with straightforward algebra as follows. First, we notice that in validity of the condition (
71) on the extremal fields, the dynamical contribution generated by the kinetic term in the Lagrangian
necessarily vanishes identically. Therefore, in such a setting, only the potential contribution matters. From the second equation in (
69), we need only to evaluate the following contribution:
In detail, from Equation (
63) and the fact that necessarily
, one finds
When evaluated for the extremal field
these contributions yield the second equation in (
73).
The last two equations coincide identically with the non-vacuum Einstein field equations. The first implication from this result is that the original condition on the extremal Ricci tensor introduced in the synchronous approach is eliminated. In other words, the requirement (not a constraint)
originally set in the synchronous variational principle (see Equation (
49) above) is formally replaced by the new metric-Ricci variational principle and subject to the requirement (
68) on the extremal fields
. Second, the new classical physical requirement
arises. In the present context, it can be simply explained as a necessary requirement for the validity of the metric-Ricci variational principle. This feature suggests, however, the interesting role taken by the cosmological constant in this approach, which provides a new derivation of the classical Einstein equations based on the variational theory for the Ricci tensor, implying that
must be a foundational element of GR theory [
25]. In fact, the role of
is proved to be of high relevance in cosmology and astrophysics, as summarized in Refs. [
26,
27]. In addition, the request of a positive cosmological constant in the variational principle for the Einstein equations of GR is a condition in agreement with the result pointed out in Ref. [
4], where it was shown that
can be associated with the validity of the orthogonality condition
for the extremal metric tensor.
5. Manifestly Covariant Metric-Ricci Hamiltonian Approach
Given the previous results, we are now in position to attempt formulating a manifestly covariant metric-Ricci Hamiltonian approach in which the canonical variables coincide with the extended Lagrangian variables represented by the set of the metric-Ricci coordinates
, together with their conjugate momenta, to be suitably defined. Of course, to make sense, it is necessary to assume as before that the covariant derivatives of both the Lagrangian tensor variables
are non-vanishing. Again, however, a
tensor extended Hamiltonian approach, i.e., in which both the
tensor fields
are variational, cannot be based on the asynchronous EH-action principle (
30). The reason for the tensor field
is the same one indicated above, i.e., in the functional set
its covariant derivative is identically vanishing.
However, on the contrary, it is worth stressing that alternate manifestly covariant Hamiltonian approaches are possible. As pointed out in [
25], these approaches can be equivalently based on the asynchronous EH-action principle (
30) or the synchronous action principle (
49). In both cases, the
tensor field
is variational, while the tensor field
(and/or
) is considered prescribed. In such a case, in fact, the distinction between the two types of variational principles disappears altogether.
Coming back to the issue of joint variational formulations holding for the whole set of metric-Ricci coordinates
no difficulty is expected in case of a synchronous extended Hamiltonian action principle. This can be achieved by generalizing the corresponding extended Lagrangian principle given above (see Equation (
65)) to the whole extended set of the metric-Ricci coordinates
. The key assumption is that of considering all the variational fields as tensor functions with respect to the background metric field tensor
. The corresponding metric-Ricci Hamiltonian formulation is straightforward. Thus, following the guidelines adopted in
Section 3, upon denoting as
the canonical momenta conjugate to the Ricci tensor
, the functional setting is expected to be analogous to (
A4), i.e., with the varied fields
required to belong to the synchronous Hamiltonian metric-Ricci setting (
A5), while the corresponding extremal fields
are again to be suitably prescribed. We notice, in particular, that here, all variations
,
,
,
are required to vanish on the improper boundary of
(see
Appendix A, Item 5). Thus, in analogy with (
A10), the relevant
synchronous metric-Ricci Hamiltonian action functional will be taken of the form
where
denotes a
scalar function, to be suitably prescribed consistent with the variational Lagrangian
defined above (see Equation (
78)). Finally, here we notice that the second term in the integral, identifying the new exchange term, is carried by the new canonical momentum
and the covariant derivative of the countervariant components of the variational Ricci tensor, namely
, with
again being the covariant derivative evaluated with respect to standard connections depending on
. From the
metric-Ricci Hamiltonian action principle
the corresponding Hamilton equations follow:
These equations read explicitly
where again it is understood that the equations are evaluated for the appropriate prescribed extremal fields
. The proof of the statement can be obtained at once based on straightforward algebra, simply invoking the same calculations developed for the evaluation of the derivatives in the metric-Ricci Euler–Lagrange equations outlined above, recalling that by construction, the Hamiltonian equations for the variables
decouple from those holding for the set
. In particular, one has, for example, that
, and therefore, the calculation reduces to that reported in Equation (
74).
Indeed, the previous equations are equivalent to the corresponding metric-Ricci Euler–Lagrange Equation (
69). In particular, the metric-Ricci action principle and the corresponding Hamilton equations imply the validity of the Einstein field equations under prescriptions of the extremal fields
. This happens indeed, as expected, when one sets
and the generalized velocities
,
, together with the conjugate momenta
and
, vanish identically (see previous discussion in
Section 4). As a result, the second and fourth Hamilton equation recover at once the extremal Euler–Lagrange Equations (
73). In particular, the last two equations in (
80) become
where, provided
, the second equation recovers identically EFE (
35). As stated before, the first equation (83) also implies the validity of the constraint condition (
72), a requirement which can be satisfied in the case of vacuum and suitably prescribed external sources. In such cases, Equation (
80) provides, therefore, a manifestly covariant Hamiltonian representation for EFE (see Equation (
35)) in which all variational canonical fields
are considered independent
tensor fields.
5.1. Gauge Properties of the metric-Ricci Lagrangian and
Hamiltonian Action Principles
An interesting issue concerns the characteristic gauge properties of all the synchronous Lagrangian and Hamiltonian action principles given above, i.e., in particular, the metric-Ricci synchronous Lagrangian and Hamiltonian action principles (
A9) and (
65), together with its Hamiltonian counterpart (
A9). Here, we refer in particular to the additive gauge properties fulfilled by the variational metric-Ricci Lagrangian
(
59) and the modified variational metric-Ricci Lagrangian
(
78). The first property is achieved noting that a general gauge transformation is provided by
which holds for an arbitrary
scalar real field of the form
. Furthermore, if
is an arbitrary smooth
vector field, then it determines a gauge function. In fact, one can show that in the functional settings (
A4) and (
A5), the map
realizes a gauge transformation. This includes the particular cases in which
and
.
The interesting implication is, therefore, that a necessary condition for the invariance with respect to the previous gauge transformation properties (which are all characteristic of flat-space-time classical field theory) is the variational Lagrangian to be a
scalar. In turn this means that, under the same condition, the adoption of a synchronous variational principle for GR is required. The synchronous variational principle (
79), therefore, is characterized by unique characteristic gauge properties.
5.2. Discussion and Comparisons
An interesting comparison can be made with some of the literature approaches which have some apparent similarity with the present treatment. These include, in particular, approaches ‘
a la Palatini’, based typically on the adoption of an asynchronous variational principle as the original EH-action principle [
28]. However, unlike in the asynchronous functional setting
considered above, these involve dropping any a priori relationship between the metric tensor
and the connections
and considering them as independent geometric quantities [
29] while at the same time retaining the relationship between the same connections and the Ricci tensor. In practice, this means that in the resulting action functional, the metric tensor
and the connections now denoted
are treated as independent variational quantities, while the Riemann and/or the Ricci tensors are considered prescribed or expressed in terms of the same
. We stress that an analogous approach is developed in Ref. [
18], where the Riemann tensor, and hence the Ricci tensor as well, is assumed prescribed in terms of the same independent connections
. Such a setting is therefore substantially different from the metric-Ricci one considered in this Section, with the Ricci tensor to be considered variational and independent of the standard connections
. Most importantly, all ‘
a la Palatini’ approaches violate the principle of manifest covariance because the variational connections
are not
tensors. This is the primary motivation of our choice of considering them prescribed in terms of the background metric tensor.
For completeness, however, it is worth pointing out how, despite the feature indicated above (i.e., the intrinsic non-manifestly covariant feature carried by the varied non-tensor fields
) a manifestly covariant Hamiltonian representation can also be achieved in the contest of ‘
a la Palatini’ approaches of this type. This information, in fact, can be valuable by itself at least for comparison with the metric-Hamiltonian theory described above in
Section 2 and
Section 3. To begin with, one notices that, as a consequence of such a setting, the covariant derivatives of
and
are now defined in terms of variational connections
and therefore take the form
and are generally non-vanishing, while, in addition, the same covariant derivatives identify, up to an arbitrary multiplicative
scalar factor
, 3rd-order
tensor fields. The conjecture is whether they can actually be associated with canonical momenta of the form
. One can show that the prerequisite for this to happen is that the
scalar function
(where
still denotes here the standard covariant derivative, i.e., with standard connections defined with respect to
) results in a gauge function, namely it is such that the identity
is fulfilled. Such a constraint condition can indeed be satisfied by properly prescribing
and
to vanish on the improper boundary
. In fact, since the rhs of the previous equation does not depend on the choice of the connections (i.e., it holds for arbitrary connections), by introducing the auxiliary
tensor-density [
18]
, then the following identity holds:
in which the contributions carried by the variational connections
have been introduced. Let us invoke therefore, in analogy to the prescription for
adopted above (see Equation (
A1)), the
asynchronous Hamiltonian–Palatini functional setting (
A6) for the varied fields
which are defined with respect to corresponding properly defined extremal fields
.
This implies that if, in such a setting, one considers only variations with respect to the independent tensor functions
(see
Appendix A, Item 6), i.e., performed while also keeping constant the varied connections and at the same time letting
, an approach ’a la Palatini’ should admit an asynchronous Hamiltonian action principle of the form
with
and
denoting a
scalar Palatini functional and corresponding
scalar Hamiltonian function. Thus, introducing the corresponding
scalar-density Hamiltonian
, the resulting Hamilton equations, defined with respect the modified covariant derivatives (
86) and (
87), namely
are implied. Notice that, as expected, these equations are cast in
tensor-density form and cannot be cast in an equivalent
tensor form, with the exception of the first equation in (
91), which can also be written as
i.e., in a manifestly covariant
tensor form. However, it must be noted that, once
is set, the same equation reduces manifestly to
so that
is also implied. As a consequence, the second Hamilton equation in (
91) is expected to recover exactly EFE (i.e., Equation (
35)), which means that
should coincide with
. We finally notice that if the variation with respect to
is performed, one finds out that the constraint equation
must be fulfilled. This happens if the Ricci tensor is considered a prescribed tensor field which is independent of
and if the Palatini Hamiltonian
does not depend explicitly on the variational derivatives of the tensor fields
or
. A first solution is obviously
. Another one is that of requiring identically
, which implies in turn that
should again coincide with
.
We stress that Equations (
90) and (
91) appear formally analogous to the variational principles and Hamilton equations reported in Ref. [
18], being also expressed in terms of
tensor-density canonical momenta and a
scalar-density Hamiltonian. In particular, this means that all such variational principles are asynchronous, just as the action principle (
90) reported here. Comparison with the
tensor metric-Hamilton Equation (
A15) (reported above in
Section 3) is also instructive. In contrast with all approaches based on asynchronous variational principles, the latter appears, in fact, as the only possible realization of a manifestly-covariant Hamiltonian formulation of GR which is expressed in
tensor form and satisfies the same gauge properties that are characteristic of flat-space-time field theories.
6. Conclusions
In this paper, aspects of the variational theory of general relativity (GR) have been investigated in the context of the general covariance and manifest covariance principles for GR, which concern both the tensor and gauge representations of Lagrangian and Hamiltonian theories for the Einstein field equations.
After recalling the general covariance principle, the possible realization of the conditions of invariance, i.e., the properties of manifest covariance and covariance have been pointed out. In particular, manifest covariance has been shown to correspond either to tensor or tensor-density transformation laws for the relevant tensor fields. The conjecture concerning a possible universal tensor property has been investigated and proved to be realized by the objectivity principle. Its physical implications regarding the notion of background space-time and the identification of objective classical observables have been pointed out. In this reference, an interesting result is that customary non-4-tensor formulations of GR, i.e., represented in non-manifestly covariant form, can, in principle, be reduced to a purely manifestly covariant one, i.e., in which the Lagrangian field variables are represented by tensor fields.
As far as Lagrangian theories of the Einstein field equations are concerned, two possible formulations based either on asynchronous or synchronous variational action principles have been considered. However, their major difference is related to the existence of a possible corresponding manifestly covariant Hamiltonian realization of the Einstein–Hilbert action principle. The notable conclusion which is reached is that the explicit construction of a manifestly covariant Hamiltonian approach, in fact, can only be achieved by adopting a synchronous action principle which is cast in Hamiltonian form.
Subsequently, the issue of completeness for the Lagrangian manifestly covariant description of GR has been addressed, namely whether there may exist additional independent Lagrangian variables, possibly associated with classical treatment of covariant gravity theory. The complete set of Lagrangian variables thus determined, which also includes the variational Ricci tensor, besides the variational tensor , is referred to as extended Lagrangian variables. Such a set is shown to realize an ensemble of independent Lagrangian variational coordinates. For this purpose, a synchronous variational formulation of the Lagrangian action principle based on the extended variables is shown to exist and to recover exactly, under appropriate assumptions, the Einstein field equations. In terms of the same variational principle, the construction of a manifestly covariant extended Hamiltonian approach has been developed, in which the canonical variables coincide with the extended Lagrangian variables represented by the “metric-Ricci” set and by their conjugate momenta, to be suitably defined. Additionally, in this case an extended Hamiltonian approach has been achieved by means of a synchronous EH-action principle.
As a final topic, the issue of the prescription of the appropriate gauge properties has been addressed. As a result, it has been shown that for synchronous variational principles, the same gauge properties actually apply, which are characteristic of flat-space-time theories. This property appears notable especially because it is peculiar and unique to the synchronous variational principles considered here. Indeed, on the contrary, it is manifestly violated by all asynchronous variational principles, including the Einstein–Hilbert action variational principle.
The main conclusions of the paper can be summarized as follows:
The first one is that the asynchronous EH-action principle does not permit the construction of a manifestly covariant Hamiltonian approach for EFE. In fact, such a possibility is simply ruled out by the prescription of the functional setting for the same action principle, the reason being that the “generalized velocities”, i.e., the covariant derivatives of the variational metric tensors vanish.
On the contrary, a manifestly–covariant tensor Hamiltonian theory exists, which is based on the adoption of a synchronous action variational principle. As shown here, such a result can be achieved by means of suitably prescribed Lagrangian and Hamiltonian (synchronous) action principles.
As a third issue, we have shown that based on the same type of synchronous action principles, an extended-variable manifestly-covariant Hamiltonian variational formulation can also be reached, in which Lagrangian coordinates are represented by the set of the metric-Ricci variational fields . The resulting Euler–Lagrange equations have been obtained in manifestly covariant tensor form. In terms of the corresponding metric-Ricci action principle, the Einstein field equations have been recovered as particular solutions.
Fourth, all the synchronous variational principles considered here have been shown to fulfill the fundamental gauge property characteristic of all classical field theories in flat space-time. Namely, the scalar variational Lagrangian, which characterizes these theories, is gauge-invariant with respect to an arbitrary additive constant of the same variational Lagrangian.
Fifth, all synchronous variational principles obtained here are background independent in the sense that they hold for an arbitrary background metric field tensor , namely an arbitrary particular solution of the Einstein field equations.
Finally, further interesting results concern the establishment of alternate possible Hamiltonian formulations for the Einstein field equations.
The most significant one is undoubtedly the first one, which is based on the treatment of the Ricci tensor
as an independent variational tensor field (
Section 5) while keeping the tensor field
(together with
) as prescribed. This means that generally, two different Hamiltonian treatments can effectively coexist. Indeed, they can actually be coupled by means of a dimensionless and
scalar coupling coefficient (
) through their corresponding Lagrangian and Hamiltonian functions in which
and
are treated as independent variations and
and
are treated as constrained quantities. The significant feature in both cases is their unique manifestly covariant character, which affords its joint
tensor representation represented by a linear combination of the corresponding Lagrangian and Hamiltonian functions. This route is interesting in that it represents a possible non-trivial generalization of the manifestly covariant approach represented by CCG-theory (see, in particular, [
6,
25]).
However, for the sake of reference, a further variational approach is pointed out. This is here achieved in the framework of a ’la Palatini’ variational approach, in which—as usual—the connections (denoted as
) are treated as variational trial functions (
Section 5.2). Its characteristic property, as in the original EH-action principle, is due to being based on an asynchronous variational principle. For this reason, the approach appears analogous to the covariant canonical gauge approach earlier developed in Ref. [
18]. In fact, both the variational Hamiltonian and the canonical moment are found to be expressed, respectively, by means of a scalar density and a
tensor density. However, this approach (just as all Palatini approaches) is obviously not manifestly covariant since it is not expressible in fully manifestly
tensor form, a conclusion that follows due to the adoption of non-tensor Lagrangian variables, i.e., the same variational connections
.
These conclusions are interesting because they indicate that, based on the principles of general and manifest covariance, as well as the principle of objectivity, a conceptually sound route to a possible complete Hamiltonian representation of classical GR has actually been achieved, which is still based, as the original CCG-theory, on the adoption of synchronous variational principles. Nevertheless, its actual relevance for the explicit construction of a consistent theory of quantum gravity, i.e., satisfying a suitable set of axioms [
19], remains to be ascertained. Fundamental open questions in this regard include, in particular, the possible non-uniqueness of a manifestly covariant theory of quantum gravity achieved on this basis, the notion of background space-time in quantum gravity, the role of manifest covariance and the meaning of background independence, i.e., whether quantum gravity can also self-consistently prescribe its background metric tensor by means of a suitable dynamical equation. These issues will be discussed in the forthcoming Part 2 of the present investigation.