1. Introduction
Symmetries have acquired a central role in Physics. In Theoretical Physics discrete symmetries encode most of the intriguing structure of the Standard Model for particles, in Chemistry they encode for spectroscopic and physical properties of molecules.
Lie groups of transformations encode properties (often enhancing physical interpretation) of dynamical and Lagrangian systems. For example, theoretical Relativistic Cosmology is entirely based on the symmetry ansatz of homogeneity and isotropy (which of course can be later relaxed by using techniques of Perturbation Theory). Most applications of Quantum Gravity (either loopy or stringy) to Cosmology are entirely based on symmetries since the approach in full generality is still hindered by massive technical difficulties.
In the beginning of the 20th century, Emmy Noether discovered (see [
1]) a relation between (continuous families of) symmetries of Lagrangian systems and their first integrals,
i.e., physical quantities which remain constant during the evolution of the system and are often related to fundamental physical quantities such as energy, momentum, angular momentum,
etc.Symmetries of mechanical systems (together with their associated conserved quantities) are the basis of the definition of integrable systems that form a class of systems for which dynamics can be determined in general by integration. Generally speaking, knowing a conserved quantity of a dynamical system allows to reduce the dimension in which the system is defined. General integration techniques, such as Hamilton–Jacobi equation, rely in fact entirely on existence of conserved quantities.
Very interesting issues arise when one extends the notion of transformation to encompass transformations depending on velocities (and more generally on accelerations, etc.). These can be seen as transformations on the infinite jet prolongation of the configuration space where the dynamical system can be seen to be prolonged. Here infinite dimensionality enters strategically to require new techniques to extend the procedures which are standard in a finite dimensional arena.
Another interesting setting where symmetries play their role is the framework of Lagrangian field theories, which are the current basis for any approach to fundamental interactions in Physics. This corresponds, loosely speaking, to consider a dynamical system with a continuous infinity of degrees of freedom. In this context, (second) Noether theorem still holds true, though conservation laws needs to be interpreted correctly and differently from what is usually done in Mechanics (in view of the so-called first Noether theorem); if the system lives on a manifold M of dimension —which usually is identified with the physical spacetime, or, mathematically speaking, m is the number of independent variables of the equations—Noether theorem implies the existence of a –form on M called the Noether current. Such a current results to be closed along solutions and hence implies a continuity equation. The conserved quantities are thence defined as the integrals of such currents on a –volume in M. The continuity equation holding for the Noether current relates the changes of conserved quantities to the flows at the boundary of the region (something entering or escaping the region) and some residual at singularities. This setting is particularly suitable for physical interpretation; it was in fact developed, e.g., to define electric charges in Electromagnetism. This setting results to go far beyond Electromagnetism and to be the typical situations for (at least) all fundamental interactions.
In our approach the difference between first and second Noether theorem that is often drawn in literature is quite evanescent. In fact, on one hand in field theory the so-called global transformations cannot in general be defined in a gauge covariant way and only so-called local transformations are intrinsically defined, unless configuration bundle is trivialized (as it happens in Mechanics). In any event, both local and global transformations are generated by a vector field on the configuration bundle of the system. On the other hand, Mechanics can be considered as a particular case of field theory. In fact, in standard (Newtonian) Mechanics (since ) and Noether currents are functions; being closed they are constant along solutions and the situation introduced above is obtained as a special case.
Of course, in Mechanics one can also consider
local transformations (e.g., in relativistic point particle invariance with respect to reparametrizations is considered); in this case the Lagrangian is degenerate and constraints on momenta appear. Such constraints are known to be hidden in some (consequences) of the Lagrange equations though currently the most efficient tool to analyze the situation is based on canonical Hamiltonian formalism (see [
2,
3,
4,
5,
6,
7]). However, let us here only mention that these constraints are somehow trivial as long as conservation laws are considered since their associated currents vanish on-shell (once initial conditions obey constraints). The meaning of these conservation laws is very different depending if one is considering first of second Noether theorem but we shall discuss these differences below when discussing the hole argument. Hereafter, we are interested in conservation laws and hence we shall mainly consider general transformations and their relation to continuity equations.
In Field Theory the role of conservation laws, though certainly important as a support to the physical interpretation, appears to be weaker than in Mechanics since the mechanical geometrical picture is lost (we mean, the picture of first integrals each of which determines a level hypersurface of configuration space so that motion is constrained on the intersection of all these hypersurfaces and first integrals define a
reduced system living on it). Noether currents are
–forms and define no level hypersurface; on the other hand a Field Theory has infinitely many degrees of freedom and it is not obvious how to extend mechanical techniques to use conservation laws with the purpose of reducing the systems accordingly; however, though extra care is due to infinite dimensionality, such a reduction can be done; see [
8].
On the other hand, the history of Field Theory is strongly entangled with symmetries. All the theory of continuity equations was developed to account in Field Theory of charge conservation. When Einstein proposed equations to describe the gravitational field in General Relativity (GR) their form was obtained on the basis of conservation of energy—momentum tensor describing matter (as well as, independently, by Hilbert on the basis of his variational principle).
GR itself is a source of fantastic examples of the role of symmetries in Field Theories. One can prove general theorems to show that Noether currents are not only closed forms, but even exact forms along solutions. This introduces a superpotential for each Noether current and conserved quantities are obtained by surface integrals of the superpotential. Generalized Stokes theorem in this context establishes a bridge between conserved quantities and (co)homology of forms (and hence with topology and global structure of the spacetime manifold) which happens to be central in the physical interpretation of the model.
Similar situations can be recognized in other areas of Physics such as in Gauge Theories; the same structures of superpotential forms can be generically recognized in Gauge Theories as well as back in Mechanics when one wants to use (or needs to use, as it happens for the relativistic material points) homogeneous formalism (i.e., treating time and space on equal footing).
Hereafter we shall show some of the relations among symmetries and conservation laws in different areas, preferring a unifying languages that stress similarities.
2. Geometrical Setting
We shall here present notation for a geometrical setting for Lagrangian systems which encompasses both Field Theories and a suitable setting for Mechanics. We refer the reader to [
9] for further and deeper details.
Let M be an (orientable, connected, paracompact) manifold of dimension with local coordinates that will be eventually considered as independent variables in the Variational Calculus.
Let
be a bundle over
M with projection
and standard fiber
F; see [
9]. Let
be local coordinates on
F that will eventually represent the dependent variables of the variational principle representing the dynamics of the Lagrangian system under considerat ion.
A vector is vertical iff , i.e., is expressed in any fibered coordinate system as . The set of all vertical vectors at is denoted by ; the union is a sub-bundle of .
This setting is general enough to encode Field Theories and Mechanics; in particular in Field Theory
M is assumed to represent spacetime (hence usually
) while in Mechanics one has
(hence
) and base coordinates
usually reduce to
t. Configurations of the system are locally given by assigning the dependent variables as functions of the independent ones. In Field Theories this means
, while in Mechanics
locally represent curves in
F (which in this case is called the
configuration space of the system). Of course global structure of
C (or
F) encodes the global properties of the system; in particular in Field Theories one considers transformation rules of fields with respect to changes of fibered coordinates; these transformation rules encode in particular the geometrical character of fields,
i.e., for a Riemannian metric one has for example
where
denotes the anti-Jacobian of the coordinate change on spacetime, namely of
.
Moreover, these transformation rules encode also global properties, in the technical sense that once they have been fixed one can prove that there exists a unique bundle (modulo isomorphisms) having those functions as transition functions. This is quite satisfactory from the physical viewpoint since local descriptions given by local observers in terms of local fibered coordinates, together with transformation rules (which dictate how one can deduce the readings of an observer knowing the readings of any nearby observer) allow to uniquely describe the physical situation in an observer-independent and global way.
Global configurations are global sections of the bundle , i.e., maps such that . They are locally expressed by functions and transformation rules account for global properties. Usually only local sections exist (i.e., sections on open subsets ) and the existence of global sections of implies topological restrictions and/or obstructions.
Once the configuration bundle is given one can define a new bundle, canonically (i.e., functorially) associated to , which accounts for derivatives of dependent variables with respect to independent variables up to some finite order k. This new bundle is called the k-order prolongation of and it is denoted by . If are fibered coordinates on , then are fibered coordinates on , where stands for the first derivatives, stands for the second derivatives (and are accordingly assumed to be symmetric in the lower indices ), and so on.
Any is a bundle over any other (for any integer ). All prolongations of a bundle define a projective family and one can define its inverse limit, that is denoted by . (This is a bundle in a broader sense, since it is infinite dimensional).
A Lagrangian of order
k is a bundle map
where
is the bundle of
m-forms over
M. Equivalently, the Lagrangian can be seen as a horizontal
m-form over
. The
action is the functional defined as
for any
m-region
and any (local) configuration (
i.e., section)
.
Hamilton stationary action principle is in this framework a definition for
critical sections,
i.e., sections that are critical points of the action functional with respect to a canonical class of deformations. A
deformation is a vertical vector field
X on
. Let us denote by
its flow so that we can drag any configuration
along
X defining a 1-parameter family of configurations
. The
variation of the action along the deformation
X is defined as
Hamilton stationary action principle: a configuration
is critical iff for any compact
m-region
and deformation
X with
one has
Equivalently, one can also consider more general deformations only requiring that
X vanishes on the boundary
together with its derivatives up to order
. Critical sections can be shown to obey Euler-Lagrangian equations and physically represent allowed configurations (
i.e., configurations which satisfy field equations). This framework reduces locally to the usual Variational Calculus.
As we said, in Mechanics one has
and
. Being
a contractible manifold the bundle
is necessarily trivial,
i.e., diffeomorphic to a Cartesian product
; see [
10]. Nevertheless, this diffeomorphism is not canonical, but it depends on a
reference frame which is realized mathematically by a connection on the bundle
. A connection is a family of hyperplanes
such that
is a sub-bundle in
and at each point
(by the way, since the curvature of the connection is skew and in this case
, any connection on
is
flat). A connection in this case is represented by a distribution
of rank 1 on
C; being of rank 1 it is involutive and, by Fröbenious theorem, integrable. Hence one has defined a foliation in curves (trajectories,
i.e. unparametrized curves, to be precise) of
C. The leaves
are nowhere vertical and establish diffeomorphisms between any pair of fibers
and
; hence they induce a particular diffeomorphism
,
i.e., a global trivialization, in which
for some constant
. These sections represent
rest for the global observer associated to the trivialization
t defined above. One has thence two possible frameworks for Mechanics: one on
, that is a framework for a fixed observer (or reference frame); and another on
C, that is potentially independent of the observer. In both cases the framework is suitable for describing in particular any holonomic, possibly time-dependent, Lagrangian system.
Since the Lagrangian is used only to define the action (and then the action itself is used in Hamilton principle), in all Field Theories one could add to the Lagrangian terms which do not affect the value of the action functional. In fact there are forms, called contact forms, which vanish when computed along each configuration of the system; in fact, if a form is contact it factorizes terms such as , , and so on.
A
Poincaré–Cartan form for a Lagrangian system is a form
on
that differs from the Lagrangian
L by a contact form
. The action can be written also in terms of the Poincaré–Cartan form as
The contact term , which does not affect field equations, solutions and so on, can then be tuned to enhance the properties related to conservation laws and symmetries.
One requires (besides some other technical requirements; see [
9]) that for all vertical fields
of
the form
is contact. Here
denotes the usual duality between forms and vector fields. In the next Section we shall see how Poincaré–Cartan forms are adapted to conservation laws and symmetries. Below we shall see explicit examples for coordinate expressions of Poincaré–Cartan form.
The theory of Poincaré–Cartan form was fully developed in the late ’70s–early ’80s (see [
11,
12,
13,
14,
15]). In Mechanics one can prove that there exists a unique Poincaré–Cartan form at each order. In Field Theory, there is a unique Poincaré–Cartan form in theories of order 1, there is a canonical choice for theories of order 2, while for Field Theories of order
there is a Poincaré–Cartan form for each connection on the base manifold (one needs integration by parts to define it, and one needs
covariant integration by parts to control globality; different connections define then different Poincaré–Cartan forms when
). This non-trivial (and to some extent
unexpected) structure on uniqueness issue is very beautiful and inspiring, for example for issues related to Hamiltonian formalisms.
2.1. Field Equations
Most of Variational Calculus is in fact related (if not completely encoded) in how deformations defined on the configuration bundle prolong to higher order jet prolongations .
A (projectable) vector field (here projectable refers to the fact that the components are functions of the independent variables only, i.e., ) can be prolonged to jet bundles to define vector fields on each .
In general, let be the flow of X which projects onto the flow of the vector field on M. For any configuration one can drag it along the flow by defining ; these can be easily checked to be configurations as well. Accordingly, one can define prolonged flows together with their infinitesimal generators .
For example for
one has that
is a good (
i.e., global) vector field on
. Here
denotes the
total derivative operator for (local) functions on
.
If one restricts to
deformations (
i.e., vertical vector fields) prolongations are
Field equations of a (
k-order) Lagrangian
L are the differential equations which a configuration should obey in order to be critical. The (
k-order) Lagrangian form is locally expressed as
where
is the local canonical basis of
m-forms on
M induced by coordinates
(
i.e., local
volume). The deformation is in the form
.
By suitable integration by parts one can split the Lagrangian form in two parts, so that
where
is the local canonical basis of
-forms on
M induced by coordinates
. This splitting is encoded in global bundle morphisms
where
are dual to vertical vectors on
C, while
are dual to vertical vectors on
. Denoting the dualities by
one has local expressions
For a first order Lagrangian one has for instance
While the Euler-Lagrange morphism is directly related to field equations (and it is unique at all orders) the Poincaré–Cartan morphism , that is not unique in general because of the non-uniqueness of the Poincaré–Cartan form, is more properly related to conservation laws.
Globally one can characterize the morphisms
and
by the so-called
first variation formula, that once integrated gives (9)
where Div is the
divergence operator that corresponds, after evaluation along a configuration, to exterior differential on forms on
M.
Let us remark that in Field Theory most Lagrangians are highly degenerate, which means that they do not depend on all derivatives of fields but just on some (suitable) combinations of them. Usually, these combinations are chosen to define some geometrical object with simple transformation laws. In this case it is often convenient to proceed by covariant integration by parts, in order to control globality of each single term in the equations instead of controlling only the globality of the whole equation. Examples will be considered below.
Before turning our attention to symmetries let us mention that the role played by flows above can be replaced by more general objects. Let us consider a family of maps
that are locally expressed by
and represent a sort of field transformation depending on the derivatives of field (as it happens in general for supersymmetries, Backlund transformations,
etc.). Also in this more general case—in which the transformation can no longer be interpreted as a geometrical transformation on the manifold
C—one can still define an
infinitesimal generator
However, this object is no longer a vector field on
C (since its components are not local functions on
C). Objects like this are called
generalized vector fields and can be seen as sections
of the bundle
that is defined by pull-back as follows:
By the usual abuse of language the section
has local expression (15).
One can then extend the prolongation to generalized vector fields by formally using the same formulae (6) and (7). In the projective limit these families generate well-defined vector fields on , while they are, although in a not yet formalized way, generalized at each finite level .
We shall see below examples of such generalized vector fields when they represent symmetries and hence they define conservation laws. Let us stress that Emmy Noether was in fact the first to consider these examples just in her early studies aimed at showing a sort of inverse theorem; each conservation law can be generated by a suitable generalized symmetry.
3. Noether Theorem
Since field equations are mainly encoded in the geometry of jet prolongations (see [
16]) Noether theorem can be understood in terms of Lie derivatives (see [
17]).
Let
be a (projectable) vector field on the configuration bundle
that projects onto a vector field
on
M and let
be a configuration; let us define the
Lie derivative of
with respect to
to be
This is a (generalized) vertical vector field and it accounts for the change of the configuration when dragged along the flow of
. The same expression holds true when
is a generalized vector field by itself.
This Lie derivative is
natural in the sense that it preserves commutators,
i.e.,
Of course this is true when commutators are considered as the commutators of (possibly generalized) vector fields on
. It is essential to notice here that configurations can be in principle dragged along the flow of vector fields on
only. In general, there is no dragging in
along vector fields on the base
M, as one is instead “physically” used to expect in many physically relevant cases (e.g. in relativistic applications). Accordingly, there is no reason to expect that in general one could define Lie derivative of configurations in
along vector fields on
M, nor that this can be done in such a way that they are natural,
i.e., they preserve commutators.
There are however specific bundles in which one could naturally associate a vector field
on
to each vector field
on
M (as, e.g., natural bundles or, in a sense, gauge-natural bundles; see [
9].) For naturality one has that
One classical example of this situation is on tangent bundles
; on any tangent bundle one has in fact a tangent lift of vector fields which preserves commutators. We have to stress that the existence of such a natural lift is a property of the bundle
. When such a lift is defined the bundle
is in fact called
natural; one could prove that natural bundles are associated to (some finite higher order) frame bundle
on the base manifold; see [
18]. The tangent bundles are in fact always associated to the standard frame bundle
.
Only on natural bundles one can define dragging along
and the corresponding Lie derivative is defined as
which is in turn defined as above. Since the lift
is natural one can prove that this Lie derivative is also natural,
i.e.,
We shall present below various examples of this and similar situations. We wish to stress here that the naturality (17) of Lie derivatives on is essential for applications to conservation laws, while the naturality of the lift (18)—as well as the naturality (20) of the Lie derivatives on M—is not. These are “good news,” since not all Field Theories have a natural lift; examples will be presented in Gauge Theories and Spinor Theories, in which one could not define a gauge covariant lift that is natural while conservation laws are still perfectly defined.
Noether theorem asserts a correspondence between Lagrangian symmetries and conservation laws. A
Lagrangian symmetry is a (possibly generalized) vector field
on the configuration bundle
such that
for some bundle morphism
.
By using the first variation formula on the l.h.s. one can easily prove Noether theorem in the form
that can be recasted as
By direct inspection, the Noether current defined as is closed along critical configurations, which manifestly annihilate the r.h.s. Of course the difficulty in finding conservation laws is here replaced by the difficulties in finding Lagrangian symmetries . Examples will also be given below.
Symmetries can be described and treated directly also in terms of the Poincaré–Cartan form. Condition (21) can be written in an equivalent form as follows
for some
-form
;
i.e., a Lagrangian symmetry leaves the Poincaré–Cartan form invariant. This characterization of symmetries is beautiful and geometric, and this is a motivation to define Poincaré–Cartan form.
Noether theorem is then simply obtained in the usual form for Lie derivatives of forms, namely
The term
is nothing but field equations and the Noether current is given as
which, of course, is closed on-shell.
Conservation laws are thence expressed as (on-shell) closures of Noether currents. If one chooses coordinates
adapted to a spacelike foliation of
M, to mimic what one is used to do when choosing Cartesian coordinates in Minkowski space, the conservation law is expressed as
where we set
and
and · denotes derivative with respect to time
t. These kind of equations are called
continuity equations since
is a density and the quantity
(for any spacelike
-region
D in
M) is
conserved, in the sense that its variations are controlled by the flow of
j through the boundary of
D. Having said that, conservation laws
are nothing but the covariant form of continuity equations.
For a wide class of Field Theories (including Natural and Gauge-Natural Theories) one can also show that Noether currents are not only closed forms on-shell, but they are also
exact; see [
9]. One can in fact show, by defining an explicit and algorithmic procedure of covariant integration by parts, that in those cases the Noether current can be (globally) recasted as
The
-form
is called the
reduced current and it vanishes on-shell, while the
-form
is called the
superpotential. Examples will be presented below. Accordingly, the Noether current is written on-shell as the differential of the superpotential.
The corresponding conserved quantities are thence defined as surface integrals of the superpotential on -regions . This establishes a deep connection between conserved quantities and cohomology of the spacetime manifold M.
The conservation laws involving Noether current (i.e., ) hold on on-shell, i.e., along critical configurations, and therefore they are called weak conservation laws. When one defines the superpotential then an equivalent conservation law can be written as ; this holds true for each single configuration (also non-critical). For such a reason these conservation laws induced by superpotentials are also called strong conservation laws.
In the case of Mechanics one can use the augmented de-Rham sequence
and the Noether current is an element in
that is closed on-shell. In view of exactness at the relevant level, Noether currents in Mechanics are therefore
constant on-shell and they are called
first integrals.
Finally, let us remark that everything we said above applies also to Lagrangian symmetries ; equivalently, one has a flow of transformations that leaves the Poincaré–Cartan form invariant, namely the flow of , i.e., a 1-parameter group of symmetries. In many interesting cases, some of which will be discussed below, Field Theories have a symmetry group of dimension higher than 1.
Noether current, which depends on the symmetry generator, can also be considered as a map from the Lie algebra of (infinitesimal) symmetries to (on-shell) closed
-forms. Such a map is usually called
momentum map (see [
2]) and the naturality with respect to Lie derivation is the remnant of the group structure of symmetries and their preservation.
Let us stress that symmetries are required to form a group of transformations of configurations, not necessarily a group of transformations of spacetime. Sometimes a group of spacetime transformations naturally induces a group of transformations on configurations (e.g., when configurations are represented in terms of spacetime tensors or more generally by geometrical objects) but in general it does not. From this viewpoint, Lie derivatives of fields with respect to spacetime vector fields is an additional (and not necessary) structure; one somehow needs to require naturality (17) while the lift (19) and its naturality (18) is not always available (nor, in fact, necessary).
4. Applications to Mechanics
As examples in Mechanics we shall briefly review some standard examples with the aim of fixing notation.
The standard framework for first order Mechanics is based on a configuration manifold Q with points that correspond to system configurations; local coordinates are also known as Lagrangian coordinates for the system. The histories of the system are encoded by (parametrized) curves in the configuration space Q.
Equivalently, one can define, as discussed above, the configuration bundle ; by means of a reference frame (i.e., a connection on ) one can define a global trivialization . We shall denote fibered coordinates on by .
In this setting histories are in 1-to-1 correspondence with sections of the configuration bundle, that will be denoted by an abuse of language again by . Since we are restricting to the first order Mechanics, the Lagrangian should be given on the first jet prolongation . By means of the reference frame we have a special set of motions singled out in the configuration bundle; these denote “rest motions” and can be prolonged (as we saw for any motion) to first jet bundle. They define a global isomorphism since they define what one has to understand for “zero velocity” (of course depending on the reference frame).
Accordingly, we can use fibered coordinates
on
, where
denote the Lagrangian velocities. The dynamics is described by a Lagrangian
or equivalently by a Poincaré–Cartan form
where we set
and
for the relevant contact 1-form on
.
The Noether theorem presented in general above is generalized in order to encompass the standard one for:
- i)
a symmetry that is a vertical vector field on the configuration bundle, i.e., such that its tangent prolongation leaves the Lagrangian invariant ();
- ii)
a symmetry that is a (non-vertical) vector field on the configuration bundle, e.g., if the Lagrangian density is independent of time t then is in fact a symmetry;
- iii)
vector fields that leave essentially invariant the system (e.g., when in the notation introduced above one has );
- iv)
generalized vector fields to obtain first integrals that are not simply linear in the momenta, as for example the Runge-Lenz vector in Kepler’s motion.
4.1. Case (i)
One can always change fibered coordinates to new fibered coordinates in which . That is a symmetry iff the Lagrangian does not depend on , i.e., it is cyclic. In other words these cases correspond to all cases of ignorable coordinates, though is a symmetry in all coordinate systems, while is cyclic only when . Thence these cases are equivalent to a coordinate free notation for cyclic coordinates.
The corresponding first integral is
that in adapted coordinates corresponds in fact to the momentum
conjugated to the cyclic coordinate
.
For example, a particle in a constant gravitational field is described by the Lagrangian
in Cartesian coordinates
.
The coordinate x (y, respectively) is cyclic and the corresponding momentum (, respectively) is a first integral that corresponds to linear momentum.
The vector field
is a symmetry and it corresponds to the fact that in cylindrical coordinates
the angular coordinate
is cyclic. The corresponding first integral
corresponds to the
z-component of
angular momentum.
In cylindrical coordinates one has in fact and the corresponding first integral is given by .
4.2. Case (ii)
When the Lagrangian is independent of time
t (e.g., whenever holonomic constraints are imposed with no explicit time dependence) the vector field
is a symmetry and the corresponding first integral is given by
that corresponds to
mechanical energy.
In the case of particles in a constant gravitational field one has
4.3. Case (iii)
For the Lagrangian (34) the vector field
fails to leave the Lagrangian invariant; in fact one has Lie derivatives
and
. The l.h.s. of the covariance identity (21) reads then as
that fails to vanish though it is easily recognized to be the total derivative of a quantity
. This is for sure physically expected since, in view of gravitational field constancy, the system is unchanged if everything is translated up along the
z-axis. If this is physically trivial, one should notice that mathematically one needs to consider generalized symmetries even to encompass these simple examples.
Noether theorem hence applies and the corresponding first integral is
In this case we know the general solution of the equations of motion (
,
,
) and it is easy to show that the quantity
is in fact constant on-shell.
4.4. Case (iv)
For Kepler system
one can consider the following generalized vector fields
These are symmetries; in fact, the l.h.s. of the covariance identity for the first vector reads as
while for the second vector it reads as
The corresponding first integrals
(
) are
One can easily check that these are the two components of the vector field
which is called
Laplace vector or
Runge-Lenz vector. In other words the vector
is constant on-shell. This vector was known in Kepler problems (or, equivalently, in Coulomb electrostatics) to be related to the fact that perihelia are fixed (and when perturbations are introduced it relates to the precession of perihelia).
Notice that the vector is quadratic in the Lagrangian velocities. Since momenta are linear in the Lagrangian velocities this means that is quadratic in the momenta. Hence, it could not be obtained by ordinary Noether theorem which produces only first integrals linear in the momenta.
5. GR and Natural Theories
General Relativity (GR) is based on the principle of general covariance (together with the equivalence principle). General covariance is a symmetry requirement; one assumes that spacetime diffeomorphisms act on configurations (i.e., fields are geometrical objects, e.g., tensor fields) and all these transformations induced by spacetime diffeomorphisms are symmetries for the dynamics.
This assumption combined with Noether theorem has plenty of consequences. The configuration bundle is a
natural bundle, the Lie derivatives are defined with respect to spacetime vector fields that are all symmetries and they all generate conservation laws. Moreover, one can show (see [
9]) that Noether currents always admit superpotentials and conservation laws are always defined
à la Gauss by surface integrals.
Moreover, in such kind of theories the whole set of conservation laws are equivalent to the dynamics of the system. In fact, one has a
k-order Lagrangian
L which defines field equations
via first variation formula
First variation formula holds in particular for the Lie derivative along each spacetime vector field
and gives conservation laws
for the Noether currents
. Then one can define superpotentials U
The reduced current is always a combination of field equations. Thus if one knows Noether currents of a Natural Theory and conserved quantities by means of their superpotentials, then one can compute the reduced currents (
i.e., field equations) purely in terms of conservation laws.
For example, let us consider “standard GR”, which is defined as a second order theory on the configuration bundle
of Lorentian metrics on spacetime
M, with coordinates
. Dynamics is defined by the Hilbert Lagrangian
where
R is the scalar curvature of the metric
g and
denotes the square root of the absolute value of the determinant of the metric
.
The variation of this Lagrangian along the deformation
where
and
denotes the Levi-Civita connection of the metric
g,
i.e. its Christoffel symbols.
Let us stress that
each spacetime vector field
—and not only Killing vectors as sometimes erroneously claimed in the literature—generates conservation laws. By expanding the Lie derivatives in terms of the spacetime vectors field one obtains Noether current as
which by suitable covariant integration by parts can be recasted as
, where we set
The superpotential
is called
Komar potential in honor of Komar who first proposed it, though originally restricted to a timelike Killing vector
, while here
is instead
any spacetime vector field.
As we discussed above, one can deduce field equations
from Noether current
ε and Komar superpotential
by computing reduced current as
just noticing that this quantity has to vanish for all solutions and all symmetry generators
.
This situation, in fact, is completely general for any Natural Theory of any order and any matter coupling, since it is based on general covariance principle only. Any Natural Theory comes with a huge symmetry group, namely , which identifies intrinsically (i.e., in an observer independent way) a huge set of conservation laws and conserved quantities.
Problems start when one wants to identify some physically relevant quantity—e.g., the energy, the momentum, the angular momentum, …—within this intrinsic set of conservation laws. This is already a problem in Newtonian Mechanics (see [
19,
20]) where it should be clear from the very beginning that there is no intrinsic notion as
the energy of a system, in the sense that different observers (even inertial observers) do in fact measure different energies, momenta, … for the same system. This obvious circumstance is often undervalued (or even ignored) in current literature on Mechanics, with the consequence of generating a number of misunderstandings about conservation laws that reverberate and amplify in Field Theories. This would indeed be a trivial remark if it were not used sometimes in the literature to argue that conserved quantities in GR must have a non-covariant genesis.
Such arguments come down to (at least) two different main points:
- i)
covariant conservation laws are not conservation laws;
- ii)
covariant conserved quantities would not depend on the observer as physically expected and as pseudotensorial prescriptions do.
Both these points have a long history and can be somehow traced back to Einstein himself. However, after almost one century of investigations they can be today shown to be flat wrong at least in some sense.
5.1. Covariant Conservation Laws
Item
comes historically from the observation that covariant conservation laws
would not reduce to continuity equations (which is what one usually means for “conservation”) due to terms depending on the connection
used to define the covariant derivatives. This would be certainly the case if
were components of a vector field. Then
covariant conservation laws would be
conservation laws tout court only for those observers (here identified with coordinate systems) in which such terms vanish (
i.e., when
). Only in these cases covariant conservation laws are genuine conservation laws and they are intrinsically non-covariant since they break down general covariance.
This argument is true for all currents
ε except in one single case: the case in which
is a vector density of weight 1. In this case there are two terms depending on the connection and they cancel out. In other words, when
is a vector density of weight 1 the covariant divergence
is automatically identical to the
ordinary divergence,
i.e.,
and they always define true continuity equations for any observer.
Of course, Noether theorem in the form presented here dictates for Noether currents to be -forms, i.e., their components are in fact vector densities of weight 1. Hence it always produces authentic conservation laws which are at the same time covariant.
5.2. Observers
Item comes from the belief that covariant conservation laws would necessarily define conserved quantities independent of the observer. This is certainly true for the whole set of conservation quantities; it is defined covariantly and all observers agree on it.
However, each observer can then be asked to identify within the set of all conserved quantities which one represents a physically relevant quantity, e.g., the energy. It is not the notion of first integral to be observer dependent; a quantity either changes or not during the evolution of the system and all observers agree on it. But it is rather the energy that is related (in Newtonian Mechanics, but also in Special Relativity) to time translations; since in SR time depends on the observer, the notion of energy depends on the observer, accordingly.
In GR each observer (which is identified with a local coordinate system together with a protocol to synchronize clocks at different points, that foliates spacetime with leaves representing synchronous event sets; such foliations are known as ADM-foliations, see [
21]) comes with a timelike vector field
. The Noether quantity associated to such a timelike vector field (no request about it being Killing; see [
22]) is defined to be the
energy for the observer for which timeflow is given by ξ.
Since the prescription is covariant, any other observer agrees that this energy is the Noether quantity associated to that timeflow ; however, different observers might disagree on being the timeflow and consequently that such a conserved quantity has to be considered to be the energy of the system. On the contrary, they use their own timeflow and define their own energy. In this way, which is the only reasonable way to extend to GR what one is used to do in SR, despite the prescription is covariant still the energy is not and still it depends on the observer.
It is therefore an open issue whether we really need to know what is the energy or we could have a fundamentally good description of the physical world by just using the set of conserved quantities. In other words, if energy is just something we are used from Newtonian Physics, or rather one can produce a fundamental description of the physical world without singling out special conserved quantities to be given special meanings.
One could also ask whether it is useful to have covariant prescriptions for conserved quantities, since sooner or later covariance have to be broken in favor of observer-dependent quantities. Why one should not be satisfied with coordinate-depending prescriptions such as the ones based on pseudotensors? Let us counter-argue that on manifolds there is nothing defined as coordinate dependent integrals to define conserved quantities from pseudotensors. Pseudotensors are coordinate dependent quantities, often their genesis from geometric objects is not clear, they are known in some coordinate system but they are often unspecified in other coordinates. Moreover—and for obvious reasons—they lack of any coherent and (mathematically and physically) intrinsic meaning, if any may even exist!
We do not claim that pseudotensors should be completely forbidden in Physics, since they appear to be often “useful”; but we believe that for a reasonable, global and covariant interpretation their genesis from some geometric object must be always made explicit and, in any case, they have to be treated
cum grano salis. Pseudotensors should be defined starting from geometric objects by fixing coordinates in order to neglect some term. This scenario has two good features to be noticed: it implicitly defines the coordinate systems in which pseudotensors can be used and it defines its integrals by means of the integral of the original geometric object. Let us refer to [
22] for an example of this coherent strategy.
5.3. Extended Theories of Gravitation
As we said above, the structure of GR is not peculiar of the Hilbert Lagrangian (49); most of this structure is determined by the symmetry group, i.e., spacetime diffeomorphisms, and it is a feature of any generally covariant theory.
Let us here briefly consider the class of
Extended Theories of Gravitation (ETG); see [
23,
24,
25]. This class of theories is used in Cosmology and Astrophysics in order to model phenomena and observations that are usually related to dark matter and dark energy; [
26].
Let us first consider a Lorentzian metric and a connection as fundamental fields. The configuration bundle is the product of the bundle of Lorentzian metrics and the bundle of connections on M (here assumed for simplicity with no torsion). Here connections are in principle independent of the metric field.
Let us denote by
(
) the Riemann (Ricci) tensor of the connection
and by
the scalar curvature which depends on both the metric and the connection. These
curvatures are not to be confused with the curvatures
,
,
of the metric
g alone. This double affine structure (the one induced by
and the one induced by
g) opens a number of issues about physical interpretation; see [
27].
In the notation here introduced let us consider the Lagrangian
where
f is a
generic analytic function of the scalar curvature. For the specific case
this reduces to standard Hilbert-Einstein Gravity (49).
When varying, the metric and connection are deformed independently and there are field equations from variation of both fields:
The second equation is solved defining a new metric field conformal to the original one; the connection is then forced to be the Levi-Civita connection of the metric h, i.e., .
The trace (by ) of the first equation is called the master equation and (once f is considered a fixed function) it is algebraic in the scalar curvature R. In general the master equation allows to solve the scalar curvature as a function of the matter sources; here we are considering the vacuum case and R is constant (on M) and determined by the zeroes of the master equation (which are finite and generically simple). When this information is plugged into the first field equation one can show that the metric h obeys modified Einstein equations; the function f can be chosen so that the modified dynamics accounts for dark matter/dark energy phenomenology with no need to introduce exotic matter sources, which currently cannot be observed directly.
Again these models are generally covariant and any spacetime diffeomorphism is a symmetry of the theory; they are Natural Theories. Any spacetime vector field
is a symmetry generator and its Noether current is given by
where we used the master equation,
and the curvature tensors of the conformal metric
h are denoted by
,
and
.
This corresponds to define the reduced current and the superpotential as:
Notice once again that the reduced current
identically vanishes on-shell and it is equivalent to field equations.
ETG are also considered in the purely metric formalism, in which one considers the connection to be from the beginning, i.e., at the kinematical level. The configuration bundle is simply and the Lagrangian (57) (now considered as a function of the metric g alone) is second order. This leads to forth order equations with modified effects with respect to standard GR.
From this point of view standard GR is shown to be degenerate (it has second order field equations) with respect to the extensions considered here (which generically have forth order field equations). Let us remark how the structure of symmetry group and conservation laws is instead similar in standard GR and ETG. It does not depend in fact on the particular dynamics considered but just on the principle of general covariance.
6. Gauge Theories
Natural Theories are defined starting from a spacetime manifold M together with its diffeomorphisms that encode symmetries. It is well-known that this is suitable for some physical systems (specifically, Relativistic Theories) while there are more general physical systems (e.g., Gauge Theories) which need bigger symmetry groups. Gauge Theories are needed today to describe some fundamental interactions and together with GR they are enough for Fundamental Physics.
A Gauge Theory depends on a principal bundle P with a structure Lie group G over spacetime M. Principal automorphisms form a transformation group; this group projects onto , but is not in general realised as a subgroup of . Automorphisms are realised locally as changes of coordinates together with an element of G acting at all spacetime points (which in Physics is called a local action of G).
A Gauge-Natural Theory is a Field Theory in which
acts on configurations generating Noether symmetries of the dynamics. This request forces the configuration bundle to be a gauge-natural bundle (see [
9,
28,
29]); all fundamental theories of Physics can be cast in this form.
As in Natural Theories, also in Gauge-Natural Theories one can associate a Noether current to any right–invariant vector field on
P (
i.e., to each generator of principal automorphisms). Noether currents always allow superpotentials (see [
9]) and field equations can be obtained purely from conservation laws. Yang–Mills theories will be discussed below.
6.1. Yang–Mills Theory
Let us briefly present Yang–Mills theories as a paradigm for Gauge(-Natural) Theories. Also Dirac spinors share most of the characteristics of Gauge Theories and can in fact be formulated as a Gauge-Natural Theory; see [
9].
Let us consider a semisimple Lie group
G and denote by
the Cartan–Killing (ad-invariant) metric on its Lie algebra g. Let
P be a principal bundle with structure group
G and
be the bundle of principal connections on
P. Automorphisms act on connections so that
is a gauge-natural bundle. Once one fixes a
-orthonormal basis
in the Lie algebra g a connection is expressed by coefficients
; the curvature is defined by
, where
are “structure constants” of the group
G. An automorphism on
P is locally expressed as
for some local map
, · being the product in the group; it acts on the curvature as
where
is the anti-Jacobian of the coordinate change and
is the adjoint action of the group
G on its Lie algebra g.
The Yang–Mills Lagrangian
defines the dynamics and gauge transformations (
i.e., automorphisms in
P) are symmetries.
The first variation formula of the Lagrangian with respect to the deformation
defines the morphisms
where we set
and
. The tensor
is also known as the energy-momentum tensor of the Yang–Mills field. If the metric is freezed, then field equations are
,
i.e., they are the Yang–Mills equations. In general, the Yang–Mills Lagrangian is coupled to gravity and
acts as a source for the gravitational field.
Given a pointwise basis
of vertical right-invariant vector fields on
P, symmetry generators are in the form
. The corresponding Noether current is
where we set
for the vertical part of
with respect to the dynamical connection
.
The superpotential is given by
In the special case (i.e., Maxwell’s Electromagnetism) the group is commutative and the adjoint representation is trivial; the curvature is then a 2-form on spacetime and it is invariant with respect to (proper) gauge transformations (i.e., vertical automorphisms). In this case Yang–Mills theory reduces to standard Electromagnetism and the connection represents the quadripotential of electromagnetic field. Field equations reduce to Maxwell equations and gauge charges reduce to electric charges.
6.2. Hole Argument
There is another important crosspoint among GR, Gauge Theories and their symmetries. It is called the hole argument and it was first discovered in GR by Einstein, though it persists in Gauge Theories as well. It is originated when among symmetries of the system there are compact supported symmetries, i.e., symmetries that vanish (or became the identity) out of a compact sub-domain of spacetime M.
In GR one can easily define compact supported spacetime diffeomorphisms (and vector fields) on any paracompact spacetime manifold M. In Gauge Theories one can define compact supported automorphisms. Of course in Mechanics it is difficult to find examples of compact supported symmetries (e.g., there are no compact supported rotations in space). The only well-known exception is curve reparametrization in SR; there Physics is totally represented in terms of trajectories in spacetime (the so-called worldlines) and their parametrizations are introduced for technical reasons only, since they are not endowed with any physical meaning. Equations of motion should however induce equations on trajectories; hence they must be invariant with respect to general reparametrizations, which are thence required to be symmetries of the system. In this case, it is easy to produce compact support reparametrizations since the parameter lives in which is paracompact.
The hole argument is based on the fact that symmetries transform solutions into solutions (since they leave the action invariant). One can show in an elementary way that compact supported symmetries contradict directly and essentially Cauchy theorem (i.e., determinism). In fact, given a solution and acting on it by a compact supported symmetry one obtains a new solution . The two solutions and are identical everywhere except within the compact support D of . One has thence two different solutions which are exactly the same far in the past (in particular on a spacelike hypersurface where initial conditions can be imposed and which does not intersect ) while they differ at some point (in D).
This is a very general property of possible dynamics with symmetry groups that contain compact supported symmetries. Since in Physics one does not want to give up determinism too easily the only other possible way out is to assume that configurations which differ by compact supported symmetries (e.g., and above) represent in fact the same physical situation. One physical situation can be represented by many different mathematical representations, i.e., physical situations are not in 1-to-1 correspondence with configurations but with equivalence classes of configurations. In other words, physical situations are realized by some suitable quotient of configuration space.
Dynamics must therefore respect the quotient (that is equivalent to assume that a change of representative in equivalence class is a dynamical symmetry). On the other way, just because of the structure of the symmetry group, one cannot hope to observe any difference among different representatives of the same class of equivalence. In other words, symmetries not only constrain physically reasonable dynamics but they also impose strong constraints on what can be physically observed. Accordingly, in all cases that are relevant to fundamental interaction Physics, the dynamics is degenerate.
This is, to the best of our knowledge, the widest definition of what one could mean by Gauge Theories. A Gauge Theory could be defined as a Field Theory with a symmetry group that contains enough compact supported symmetries so to be able to act non-trivially on any compact hole in the base manifold M (being trivial outside the holes, as above). [Of course if M is compact then any symmetry is compactly supported; here we mean the support is small, i.e., disjoint from the Cauchy surface where initial conditions are set.]. Accordingly, its dynamics is degenerate and it induces a deterministic dynamics on the space of gauge classes of configurations that are identified with physical situations. Of course, one cannot obtain every feature of standard gauge theories under these very mild hypotheses. For example, one has to ensure the symmetry group to be infinite dimensional (or, equivalently, parametrized by functions) to allow reduction of the gauge degrees of freedom. However, these are the mildest conditions for which field equations are underdetermined and hole argument apply. If one wants to keep be compliant to the standard nomenclature these could be called Underdetermined Field Theories or Generalized Gauge Theories.
For this reason physical gravitational fields in GR are not described in terms of “metrics” but rather in terms of “geometries” (i.e., equivalence classes of metrics with respect to spacetime diffeomorphisms). In Gauge Theories Yang-Mills fields are identified with gauge classes of connections on P. In SR motions are unparametrized worldlines.
7. Frame-Affine Formalism for GR
In order to provide an example of concrete application of our formalism here introduced in action we shall here consider the application to the so called
Holst’s action principle (see [
30]) which is used as an equivalent formulation of GR suitable for developing LQG through the use of the Barbero–Immirzi connection (see [
31,
32,
33,
34] as well as references quoted therein).
Let
M be a
m-dimensional manifold (which will be required to allow global metrics of signature
, with
). Let us denote by
local coordinates on
M, which induce a basis
of tangent spaces; let
denote the
general frame bundle of
M and set
for fibered coordinates on
. We can define a right-invariant basis for vertical vectors on
The general frame bundle is natural (see [
28]), hence any spacetime vector field
defines a natural lift on
We stress that the lift vector field
is global whenever
is global.
A connection on
is denoted by
and it defines a lift
This lift does not in general preserve commutators, unless the connection is flat.
Let now
be a principal bundle over the manifold
M and let
be (overdetermined) fibered “coordinates” on the principal bundle
. We can define a right-invariant pointwise basis
for vertical vectors on
by setting
where
is the canonical diagonal matrix of signature
and square brackets denote skew-symmetrization over indices.
A connection on
is in the form
Also in this case the connection on
induces connections on any associated bundle and there defines covariant derivatives of sections.
A
frame is a bundle map
which preserves the right action,
i.e., such that
i.e.,
, where
R denotes the relevant canonical right actions defined on both principal bundles
and
and where
is the canonical group inclusion. We stress that on any
M which allows global metrics of signature
the bundle
can always be chosen so that there exist global frames; see [
35]. Locally the frame is represented by invertible matrices
and it defines a spacetime metric
which is called the
induced metric.
As for the Levi-Civita connection, a frame defines a connection on
(called the spin-connection of the frame) given by
where
denote Christoffel symbols of the induced metric. The spin-connection is compatible with the frame in the sense that
In general the (natural) lift
to
of a spacetime vector field
is not adapted to the image
and thence it does not define a vector field on
. With this notation the Kosmann lift of
is defined by
(see [
36]) where we set:
and where
while
denote the inverse frame matrix.
Let us stress that despite appearing so, the Kosmann lift (75) does not in fact depend on the connection, but just on the frame and its first derivatives. The same lift can be written as
where we set
since one can prove that
Another useful equivalent expression for the Kosmann lift is giving the vertical part of the lift with respect to the spin connection (see [
9], pp. 288–290), namely
This last expression is useful since it expresses a manifestly covariant quantity.
We have to stress that the Kosmann lift does not preserve commutators. In fact if one considers two spacetime vectors
and
and computes the Kosmann lift of the commutator
one can easily prove that
Thence only if one restricts to Killing vectors (
i.e.,
) one recovers that the lift preserves commutators.
Let us now consider tetrad-affine formulation of GR: the fundamental fields are a Lorentz connection and a vielbein . The connection defines the curvature form . Let us also set , and ; here denotes the inverse frame matrix of . The frame also defines a metric which in turn defines its Levi-Civita spacetime connection .
On a spacetime of dimension 4 let us consider the Lagrangian
By variation we obtain
Thus one obtains field equations in the form
The second field equation forces the connection to be the connection induced by the frame,
i.e.,
; then the first equation forces the induced metric to obey Einstein equations.
This Field Theory is dynamically equivalent to standard GR, in the sense that it obeys equivalent field equations. However, the theory is in fact richer in its physical interpretation, since the use of different variables and action principles generate larger symmetry and extra conservation laws. In fact, this theory has a bigger symmetry group being both generally covariant and Lorentz covariant.
Noether theorem implies then conservation of the current
along any Lorentz gauge generator
. The Lie derivative of a connection is given by
where we set
.
Hence one obtains
The first and second terms vanish on-shell; hence one obtains
Let us stress that this current depends only on the Lorentz generator
.
Here is the issue with physical interpretation: we have two equivalent formulations of Einstein GR. Noether currents in one case depend on spacetime vector fields, while in tetrad-affine formulation Noether currents depend on Lorentz generators that a priori have nothing to do with spacetime transformations. Let us stress of course that unless the spacetime is Minkowski, there is no class of (global) spacetime diffeomorphisms representing Lorentz transformations.
Considering the dynamical equivalence at the level of field equations and solution space, one would like this equivalence to be extended at the level of conservation laws. Moreover, some of the conserved quantities in standard GR are known to be related to physical quantities such as energy, momentum and angular momentum, while one would wish to be able to identify the corresponding quantities also in the second formulation. Kosmann lift is in fact essential to relate Lorentz generators to spacetime diffeomorphisms and the corresponding conservation laws.
The Noether current (85) can be restricted by setting
, so that one obtains
which corresponds to the standard conserved quantity associated to spacetime diffeomorphisms in GR written in terms of Komar superpotential. This (and only this) restores the equivalence between standard GR and tetrad-affine formulation at the level of conservation laws.
As a further example let us consider the covariant Lagrangian:
which is known as Holst’s Lagrangian.
By variations one obtains equations
The second equation still imposes
; this in turns implies
(first Bianchi identity) and hence Einstein equations. This shows how Holst’s Lagrangian provides an equivalent formulation of standard GR as well.
It is interesting to check if also in this case the equivalence is preserved also at level of conservation laws. The Noether current is
As in the previous case this can be recasted modulo terms vanishing on-shell as follows
Again this has nothing to do with spacetime symmetries and in general it would affect conserved quantities. When Kosmann lift is again inserted into these conservation laws one obtains
that vanishes being the divergence of a divergence. Hence once again the correspondence at the level of conservation laws is preserved when the Kosmann lift is used.