Abstract
We revisit the physical arguments that led to the definition of the stress–energy tensor T in the Lorentz–Finsler setting starting with classical relativity. Both the standard heuristic approach using fluids and the Lagrangian one are taken into account. In particular, we argue that the Finslerian breaking of Lorentz symmetry makes T an anisotropic 2-tensor (i.e., a tensor for each L-timelike direction), in contrast with the energy-momentum vectors defined on M. Such a tensor is compared with different ones obtained by using a Lagrangian approach. The notion of divergence is revised from a geometric viewpoint, and, then, the conservation laws of T for each observer field are revisited. We introduce a natural anisotropic Lie bracket derivation, which leads to a divergence obtained from the volume element and the non-linear connection associated with L alone. The computation of this divergence selects the Chern anisotropic connection, thus giving a geometric interpretation to previous choices in the literature.
1. Introduction
This article has a double aim in Lorentz–Finsler geometry. The first one is to revisit the physical grounds of the stress–energy tensor T, Section 3. The possible extensions of the relativistic T are discussed from the viewpoint of both fluids mechanics and Lagrangian systems. The second one is to revise geometrically the notion of divergence, Section 4, yielding consequences about the conservation of T, Section 5. With this aim, we introduce new notions of the Lie bracket and the derivative associated with a nonlinear connection and applicable to anisotropic tensors fields, which appear naturally in Finsler geometry.
Finslerian modifications of General Relativity aim to find a tensor T collecting the possible anisotropies in the distribution of energy, momentum, and stress, which will serve as a source for the (now Lorentz–Finsler) geometry of the spacetime [,,,,]. Some of these proposals may be waiting for experimental evidence, postponing then how the basic relativistic notions would be affected. However, such a discussion is relevant to understand the scope and implications of the introduced Finslerian elements. In a previous reference [], the fundamentals of observers in the Finslerian setting were extensively studied, including its compatibility with the Ehlers–Pirani–Schild approach. Now we focus on the stress–energy tensor T.
The difficulty to study such a T is apparent. Recall that, using the principle of equivalence, General Relativity is reduced infinitesimally into the Special one, which provides a background for interpretations. However, in the Lorentz–Finsler case, the infinitesimal model is changed into a Lorentz norm (instead of scalar product), implying a breaking of Lorentz invariance. This is a substantial issue in its own right which has been studied in the context of Very Special Relativity and others [,,,,]. As an additional difficulty, the infinitesimal model changes with the point1.
Two noticeable pre-requisites are the following: (a) only the value of the Lorentz–Finsler metric on causal directions is relevant [,] (this is briefly commented in the setup Section 2.3), and (b) there is a significant variety of possible extensions of the relativistic kinematic objects to the Finsler case, at least from the geometric viewpoint (see Appendix A). Taking into account these issues, the extension of the notion of the stress–energy tensor to the Finslerian setting is discussed in Section 3.
We start at the fluids approach. As a preliminary question, energy-momentum is discussed, in Section 3.1. We emphasize that, even though this is well-defined as a tangent vector in each tangent space , , different observers u, at p will use coordinates related by non-trivial linear transformations. Indeed, the latter will depend on both L and the chosen method to measure relative velocities. Moreover, when the stress–energy T is considered in Section 3.2, the arguments in Classical Mechanics and Relativity, which support its status as a tensor, hold only partially in the Lorentz–Finsler setting. Indeed, T acquires a nonlinear nature that is codified in an (observer-dependent) anisotropic tensor, rather than in a tensor on M.
The Lagrangian approach is discussed in Section 3.3. This approach has been developed recently by Hohmann, Pfeifer, and Voicu [,], who introduced an energy-momentum scalar function. Here, we discuss the analogies and differences of this function with the canonical relativistic stress–energy tensor and the 2-tensor T obtained from the fluids approach above. Relevant issues are the existence of different methods to obtain a 2-tensor starting at a scalar function, the recovery of this function from a matter Lagrangian, and the possibility to consider the Palatini Lagrangian as the background one (rather than Einstein–Hilbert-type Lagrangians used by the cited authors; recall that Palatini’s becomes especially meaningful in the Finslerian case []). The important case of kinetic gases is considered explicitly (Example 2).
Once the definition of T has been discussed, we focus on its conservation, Section 5, revisiting first the divergence theorem, Section 4. This is crucial in the Finslerian setting because, as discussed before, the Lagrangian approach above does not guarantee a conservation law as the relativistic .
Section 4 analyzes the divergence from a purely mathematical viewpoint. Now, L is regarded as pseudo-Finsler (the results will be useful not only in any indefinite signature but also in the classical positive definite case), and T will not be assumed to be symmetric a priori. Classically, the divergence of a vector field Z is defined with the derivation associated with the Lie bracket , applied to the volume element. In the Finslerian case, however, the Lie derivative and bracket do not make sense for arbitrary anisotropic vector fields. This difficulty was circumvented by Rund [], who redefined in such a way that a type of divergence theorem held. However, the Lie viewpoint is restored here.
Section 4.1. Once a nonlinear connection (seen as a horizontal distribution on A) is prescribed, we can define a Lie bracket and, then, a Lie derivative ( Definitions 1 and 2; Theorem 1 (C)). Noticeably, the former is expressible in terms of the infinitesimal flow of Z (Proposition 1).
Section 4.2. The divergence of Z is naturally defined by using this Lie bracket (Definition 3). For the computation of , however, one can use an anisotropic connection ∇ (this can be seen as a Finsler connection dropping its vertical part, see Section 2) and a priori Chern’s one is not especially privileged (Proposition 2).
Section 4.3. We give a general Finslerian version of the divergence theorem for any anisotropic vector field Z, emphasizing the role of the choice of an (admissible) vector field , which in the Lorentzian case can be interpreted as an observer field; this is expressed in terms of integration of forms in the spirit of Cartan’s formula (Theorem 2, Remark 5). We also explain how the boundary term can be expressed in different ways by using a normal either with respect to the pseudo-Riemannian metric or to the fundamental tensor, which were the choices of Rund [] and Minguzzi [] resp.
Section 5 gives some applications to conservation laws.
Section 5.1. First, we discuss the definition of divergence for the case of T. Our definition for vector fields was not biased to the Chern anisotropic connection, but this will be used for (Definition 4). The reason is that should behave under contraction in a similar way as in the isotropic case (namely, as in Formula (11)), which privileges Chern’s connection (Proposition 3).
Section 5.2. As an interlude about the appearance of Chern’s ∇, a comparison with the possible use of Berwald’s and previous approaches in the literature is done.
Section 5.3. A conservation law for the flow of is obtained (Corollary 2), stressing three hypotheses on the vanishing for V of elements related to the stress–energy T (), the anisotropic vector X (, generalizing the isotropic case) and a derivative of V. The latter hypothesis is genuinely Finslerian, and it means that some terms related to the nonlinear covariant derivative must vanish globally (V can always be chosen such that they vanish at some point). It is worth pointing out that our general formula for the integral of the divergence (36) recovers the classical interpretation of the divergence as an infinitesimal growth of the flow (now observer-dependent). So, is equivalent to the conservation of energy-momentum in the instantaneous rest-space of each observer—see Remark 10.
We finish by applying this general result to two examples. First, we apply it to Lorentz norms, showing that the conservation laws of Special Relativity still hold even though, now, the conserved quantity may be different for different observers. As a second example, we give natural conditions so that the flow of (whenever it exists as a Lebesgue integral, eventually equal to ) is equal in two Cauchy hypersurfaces of a globally hyperbolic Finsler spacetime. Indeed, we refine a previous result by Minguzzi [], who assumed that L was defined on the whole and was compactly supported. We show that a combination of Rund’s and Minguzzi’s methods to compute the boundary terms allows one to obtain appropriate decay rates (namely, the properly Finslerian hypothesis (49)), which ensure the conservation.
2. Preliminaries and Setup
First, let us set up some notation. In all the present text, M is a connected smooth () manifold of dimension . As in previous references [,], any coordinate chart of M naturally induces a chart of defined by the fact that
for , where is the canonical projection. We abbreviate
these are vector fields on . At any rate, we will express our results in coordinate-free and geometric terms.
2.1. Anisotropic Tensors
We shall employ the framework of anisotropic tensors, following [,,], as it is simpler than previous ones. An open subset with is fixed; the elements are called observers. We will denote by the space of (smooth) r-contravariant s-covariant A-anisotropic tensor fields () and by the full anisotropic tensor algebra. will be the space of functions on A. This time we will also put for the space of anisotropic vector fields and for the space of anisotropic s-forms (alternating anisotropic tensors, so that ). The space of classical tensor fields will be seen as a subspace of , formed by the isotropic elements, namely, those which depend only on the point and not on the observer at it. In particular, . There is a distinguished element of : the canonical (or Liouville) anisotropic vector field,
For an open set ; we will put for the set of (local) observer fields, that is, those such that for all . Given one of these and , their composition, denoted by , makes sense. Finally, for , there is also a canonical derivation : the vertical derivative along X,
2.2. Nonlinear and Anisotropic Connections
In this article, a nonlinear connection on is defined as a (horizontal) subbundle such that , where is the vertical subbundle. For other options and the rudiments, see []. Nonlinear connections are characterized by their nonlinear coefficients ,
and also by their nonlinear covariant derivative ,
for . They also provide (at least locally) a nonlinear parallel transport of observers along curves . Namely, a map defined as , being V the only vector field along such that and (see (Definition 12 in []) and the comment below).
An A-anisotropic connection is an operator satisfying the usual Koszul derivation properties—see [,,]. In a chart domain U, they are characterized by their Christoffel symbols ,
They can be seen as vertically trivial linear connections on the vector bundle (Theorem 3 in []). On the other hand, every anisotropic connection has an underlying nonlinear connection, the only one with nonlinear coefficients
As a consequence, they define the covariant derivative for any anisotropic tensor:
2.3. Lorentz–Finsler Metrics
From now on, we will always assume that A is conic ( for and ). We shall follow the definitions and conventions in [,]. In particular, a Finsler spacetime is a (connected) manifold M endowed with a (properly) Lorentz–Finsler metric . L is required to be smooth, positive homogeneous, and, when restricted to each (), its vertical Hessian g is non-degenerate with signature ; must be connected and salient, and its boundary in , which must be equal to , is a (strong) cone structure . In particular, at each point p, L is a Lorentz norm. By positive homogeneity, L is determined by its indicatrix .
Notice that the cone yields a natural notion of timelike, lightlike, and spacelike tangent vectors, but L is not defined on the latter. Indeed, we are not interested in the value of L on spacelike vectors by physical reasons, which are analyzed in []. Roughly, only particles (massive, massless) can be measured, and, so, experimental evidences only can affect and . Even though this also happens in classical relativity, the value of the Lorentz metric on the (future-directed) timelike vectors is enough to extend it to all the directions. Indeed, the anisotropies in Finsler spacetimes should be regarded as originated by the distribution of matter and energy in the causal directions rather than by (unobservable) spacelike anisotropies.
Even though it is the Lorentz–Finsler case that has a physical interpretation, in all other aspects, the theory carries on if L is just pseudo-Finsler, namely, positively 2-homogeneous with non-degenerate g on A. In fact, this is the context in which we will develop Section 4 and Section 5, as they are of a more mathematical character.
The Cartan tensor of L is
It is actually symmetric, so one can define the mean Cartan tensor as
for . L has also a canonically associated connection: the metric nonlinear connection, , of nonlinear coefficients
This is the underlying nonlinear connection of several anisotropic connections. One is the (Levi–Civita)–Chern ∇, the only symmetric anisotropic connection that parallelizes g. It is the horizontal part of Chern–Rund’s and Cartan’s classical connections, and it has Christoffel symbols
where the are those associated with (4). Another one is the Berwald . This is the horizontal part of Berwald’s and Hashiguchi’s classical connections, and it has Christoffel symbols
Here, are the components of a tensor metrically equivalent to the Landsberg tensor of L, which, among many other methods, can be defined as
for the of (4) (see (37 in [])). The Landsberg tensor is actually symmetric too, so one can define the mean Landsberg tensor of L as
3. Basic Interpretations on the Stress–Energy Tensor
Let us start with a discussion at each event of a Finsler spacetime . We can consider endowed with the Lorentz norm . In most of this section, the discussion relies essentially on the particular case when M is a real affine n-space with associated vector space V (which plays the role of in the general case) and L is a Lorentz–Finsler norm on V with indicatrix and cone included in V. Given , consider the corresponding fundamental tensors and and take orthonormal bases , , obtained extending u, . In a natural way, these bases live in , and they can be identified with bases in V itself. Assuming this, the change in coordinates between , is linear but not a Lorentz transformation, in general.
Extending the interpretations in relativity, is an event; the affine simplification includes the case of Very Special Relativity [,,]; can be regarded as an observer; the tangent space to the indicatrix (i.e., the subspace -orthogonal to u in ) becomes the rest-space of the observer u; and is an inertial reference frame for this observer. The Lorentz invariance breaking corresponds to the fact that the bases and are orthonormal for the different metrics , and, thus, the linear transformation between the coordinates of and (when regarded as elements of the same vector space ) is not a Lorentz one. If the affine simplification is dropped, such elements (observers and rest-spaces) must be regarded as instantaneous at .
It is worth emphasizing that, according to the viewpoint introduced in [] and discussed extensively in [], the space-like directions are not physically relevant for the Lorentz–Finsler metric. However, each (instantaneous) observer does have a restspace with a Euclidean scalar product. In the case of classical relativity, Lorentz-invariance permits natural identifications between these rest-spaces, and they become consistent with the value of the scalar product on space-like directions. Certainly, a Lorentz norm L could be extended outside these directions (maintaining the Lorentz signature for its fundamental tensor), but this can be done in many different ways, and no relation with the scalar products would hold.
The dropping of natural identifications associated with the Lorentz invariance implies that many notions that are unambiguously defined in classical relativity admit many different alternatives now. In the Appendix A, we analyze some of them for the relative velocity between observers as well as other kinematical concepts. This is taken into account in the following discussion about how the Finslerian setting affects the notion of the energy–momentum–stress tensor.
3.1. Particles and Dusts: Anisotropic Picture of Isotropic Elements
In principle, there is no reason to modify the classical relativistic interpretation of as the (energy-) momentum vector of a particle of (rest) mass moving in the observer’s direction . Moreover, if the particle moves in such a way that m is constant, it will be represented by a unit time-like curve such that will be its instantaneous momentum at each proper time . The (covariant) derivative would be the force F acting on the particle, which is necessarily -orthogonal to (i.e., the force lies in the instantaneous rest-space of the particle). Then, the relativistic conservation of the momentum in the absence of external forces would retain its natural meaning, namely, if the particle represented by splits into two and at some then .
The Appendix A suggests that the way how an observer u may measure the energy-momentum and conservation may be non-trivial. In particular, if one assumes that an observer u measures by using a -orthonormal basis in general, . Moreover, as we have already commented, the coordinates for other observer will not transform by means of Lorentz transformation. However, as the transformation of their coordinates is still linear, and both of them will write consistently in their coordinates.
Particles are also the basis to model dusts, which constitute the simplest class of relativistic fluids. A dust is represented by a number-flux vector field , where U represents the intrinsic velocity of the particle in the dust, i.e., a comoving observer, and n is the density of the dust for each momentaneously comoving reference frame. Comparing with the case of energy momentum, N is also an intrinsic object that lives at the tangent space of each point, and U gives the privileged observer who measures n. However, the measures of n by different observers involve different measures of the volume. As explained in the Appendix A, the length contraction may be fairly unrelated to the relative velocities of the observers. This implies a more complicated transformation of the coordinates by different observers. Anyway, the transformations between these coordinates would remain linear, and, so, they could still agree in the fact that they are measuring the same intrinsic vector field.
Summing up, in the case of both particles and dusts, one assumes that the physical property lives in V (or, more properly, in each tangent space of the affine space), and there is a privileged (comoving) observer u. The transformation of coordinates for another observer may be complicated, but, at the end, it is a linear transformation that can be determined by specifying the geometric quantities that are being measured as well as the geometry of . Thus, by using the coordinates measured by each observer one could construct and anisotropic vector field at each , which will fulfill some constraints, as the measurement by one of the observers (in particular, the privileged one) would determine the measurements by all the others.
3.2. Emergence of an Anisotropic Stress–Energy Tensor
The situation, however, is subtler for more general fluids, which are modeled classically by a 2-tensor on the underlying manifold.
Let us start by recalling the Newtonian and Lorentzian cases. In Classical Mechanics, one starts working in an orthonormal basis of Euclidean space to obtain the components of the Cauchy stress tensor, which give the flux of i-momentum (or force) across the j-surface in the background2. The laws of conservation of linear momentum and static equilibrium of forces imply that these components give truly a 2-tensor (linear in each variable), and the conservation of linear momentum implies that this tensor is symmetric.
In the relativistic setting, each observer will determine some symmetric components in its rest-space by essentially the same procedure as above. Additionally, it constructs , , and as the density energy and the energy flux across i-surface and i-momentum density, resp. The interpretation of these magnitudes completes the symmetry3 as well as the linearity in the 0-component. However, the bilinearity in the components has been only ensured for vectors in the rest-space of the observer. In relativity, one can claim Lorentz invariance in order to complete the reasons justifying that, finally, the components will transform as a tensor4.
Nevertheless, it is not clear in Lorentz–Finsler geometry why the transformation of the components from an observer u to a second one must be linear, taking into account that they apply to space-like coordinates in distinct Euclidean subspaces and no Lorentz-invariance is assumed. Indeed, the following simple academic example shows that this is not the case.
Example 1.
Assume that is an affine space with a Lorentz norm with domain A and consider the anisotropic tensor , where ℂ is the canonical (Liouville) vector field, and is a smooth function, which is extended as a 0-homogeneous function on A. Then, for each and , one has , , and . In this case, each is a symmetric 2-tensor, but the information on requires the knowledge of for all possible . Recall that this example holds even if is the Lorentz–Minkowski spacetime regarded as a Finsler spacetime (but no Lorentz-invariance is assumed for ).
Therefore, the following issues about T appear:
- (a)
- Observer dependence: even if we assume that the components measured by any observer u are bilinear, and, then, it is a standard tensor, the components measured by a second observer may transform by a linear map, which depends on as well as the experimental method of measuring (as in the case of the energy–momentum vector).
- (b)
- Nonlinearity: it is not clear even why such a linear transformation must exist, as bilinearity is only ensured in the direction of u and of its rest-space. Thus, the tensor measured by a single observer u would not be enough to grasp the physics of the fluid at each event , as in the example above.
- (c)
- Contribution of the anisotropies of : as an additional possibility, the local geometry of at u underlies the measurements of this observer and might provide a contribution for the stress–energy tensor itself.
Summing up, Lorentz–Finsler geometry leads to assume that the measurements by u are not enough to determine the state of the fluid, and the stress-energy tensor should be regarded as a non-isotropic tensor field, determined by the measurements of all the observers.
Formally, this means an anisotropic tensor (see [] for a summary of the formal approach), which can be expressed locally as
where for all (i.e., depends only on the direction of v). As a first approach, we can assume (recall Appendix A.5). Consistently, we will assume that there exists a Lorentz–Finsler metric L on M with indicatrix , and, so, indexes can be raised and lowered by using its fundamental tensor g. The fact that T has order 2 is important to establish classical analogies. However, other tensors might appear as more fundamental energy-momentum tensors, and, then, one would try to derive a semi-classical 2-tensor as in Section 3.3.
In principle, the intuitive relativistic interpretations would be transplanted directly to each v, whenever . That is, given two -unit vectors , the value of the 2-covariant stress–energy tensor perceived by the observer v (at ) is obtained as the flux of w-energy-momentum per unit of -volume orthogonal to u. More precisely, let be a small coordinate 3-cube in a hypersurface -orthogonal to u, and is the total flux of the energy-momentum of particles crossing (being positive from the side to the u side and negative the opposite direction), then the w-energy-momentum per unit of -volume is
where . As a Finslerian subtlety, recall that is only defined in and then in (i.e., it is trivially extended to in a coordinate-depending way), but the above limit depends only on the value of . Namely, if one considers two semi-Riemannian metrics g and in a neighborhood of p such that and are open subsets with p in the interior of for all and , then
In particular, we have the interpretations (recall signature ):
- is the energy density measured by ,being the measured energy.
- If w is -orthogonal to v and -unit, measures the flow of energy per unit of -volume in a surface -orthogonal to v and w (i.e., some small surface of area A flowing a lapse ), while measures the w-momentum density,
- If are -orthogonal to v and -unit, measures the flow of w-momentum per unit of -volume in a surface -orthogonal to v and z,
3.3. Lagrangian Viewpoint
In the Lagrangian approach for Special Relativity, the background spacetime is assumed to be endowed with a flat metric . So, the Lagrangian is constructed by using the prescribed and some matter fields . The stress–energy tensor coincides with the canonical energy–momentum tensor associated with the Lagrangian, in most cases (the exceptions include theories involving spin). This canonical tensor appears as the Noether current associated with the invariance by spacetime translations (i.e., when ), namely5
In principle, these interpretations would hold unaltered for the case of an affine space with a Lorentz norm, including the case of Very Special Relativity.
In General Relativity, however, the Lagrangian formulation introduces a background Lagrangian independent of matter fields (the Einstein–Hilbert one, eventually with a cosmological constant) and, then, a matter Lagrangian , which includes a constant of coupling with the background. Then, the safest way to define the stress–energy tensor is the canonical one obtained as the corresponding action term in the Euler–Lagrange equations6,
Any tensor obtained in this way will have some advantages to play the role of a stress–energy tensor, because it will be automatically symmetric (in contrast to (8)) and will have vanishing divergence.
In the Finslerian setting, the variational viewpoint has been systematically studied in a very recent study by Hohmann, Pfeifer, and Voicu []. Previously, the background Lagrangian closest to the Einstein–Hilbert functional in the Finslerian setting had been studied in [,]. Such a functional is obtained as the integral of the Ricci scalar function on the indicatrix of the Lorentz–Finsler metric7 L. Taking into account this background functional, they define the energy-momentum scalar function by taking the corresponding variational action term (Formula (84) in []),
Notice that, here, the functional coordinate for the Lagrangian is L, and, thus, an (anisotropic) function rather than a 2-tensor is obtained. However, starting at this function some tensors become useful (Formulas (88) and (91) in []), in particular a canonically associated (anisotropic Liouville) 2-tensor
as in Example 1. Notice that, essentially, the information of these tensors is codified in . Even though such a tensor is justified by the procedure of Gotay–Mardsen in [], some issues as the following ones might deserve interest for a further discussion:
- This is not the unique natural possibility to construct an anisotropic 2-tensor starting at . For example, an alternative would be the vertical Hessian,It is natural to wonder about the choice closer to the relativistic intuitions about the stress–energy tensor.
- Recently, the Palatini approach has also been studied for the Finslerian setting []. There, the dynamic variables are L and the components of an (independent) non-linear connection. Thus, a similar Lagrangian procedure would lead to a higher-order tensor. In the relativistic setting, this approach supports classical relativity, as it recovers both equations and (in the symmetric case) the Levi–Civita connection. However, the Palatini approach is no longer equivalent in the Finslerian case, as it yields non-equivalent connections, and it shows a variety of possibilities for the non-linear connections. So, it is natural to wonder about the most natural choice of a Lagrangian-based stress–energy tensor in this setting.
Finally, let us discuss an example analyzed from the Lagrangian viewpoint in [,] taking into account also the observers’ one in Section 3.2.
Example 2.
The gravitational field sourced by a kinetic gas has been deeply studied in [,]. In the relativistic setting, this is derived from the Einstein–Vlasov equations in terms of a 1-particle distribution function (1PDF) , which encodes how many gas particles at a given spacetime point x propagate on worldlines with a normalized 4-velocity . Specifically, the stress energy tensor is:
being the indicatrix (future-directed unit vectors of the Lorentz metric) and the volume element induced by the scalar product at each x. In [], they propose to derive the gravitational field of a kinetic gas directly from the 1PDF without averaging, i.e., taking into account the full information on the velocity distribution. This leads to consider the function , as an energy–momentum function, which plays the role of a stress–energy tensor (even though it is a scalar rather than a 2-tensor). Moreover, the original Lorentz metric is naturally allowed to be Lorentz–Finsler, which permits to obtain more general cosmological models (Section III in []).
Indeed, up to a coupling constant, ϕ is regarded directly as the matter source in the Finslerian Einstein–Hilbert equation (i.e., it is placed at the right-hand side of this equation, (Equation (7) in [])). It is worth pointing out:
- ϕ can be reobtained as a Lagrangian energy-momentum by inserting it directly as a term in the background Lagrangian (Equation (75) in []). However, the Lagrangian is not natural then, as ϕ is written in terms of (recall (Appendix 3, Section (a) in [])).
- As discussed above, such a function allows one to construct several tensors, in particular, the vertical Hessian (as in (10)), which also might play a role to compare with the relativistic .
Anyway, starting at the 1PDF ϕ, another Finslerian interpretation would be possible. In particular, one can define the energy–momentum distribution. Then, given an observer and a -unit vector, the w-energy–momentum might be defined as
In particular, when , this would be the energy perceived by v, and when w is unit and -orthogonal to v would be (minus) the momentum in the direction w(compare with the discussion at the end of Section 3.2). So, an alternative stress–energy tensor perceived by each observermight be defined as the anisotropic tensor:
where the integration in u is carried out with the volume form of , denoted by .
4. Divergence of Anisotropic Vector Fields
After studying the basic properties of the Finslerian stress–energy tensor T, our next aim is to analyze the meaning and significance of the infinitesimal conservation law . Along this and the next section, we will always consider an anisotropic tensor interpreted as an endomorphism of anisotropic vector fields. and will be defined on vectors and 1-forms by and resp., where is the inverse fundamental tensor, and is the transpose of T. They will have components and , and in principle we will not even assume that these are symmetric. We will be assuming that M is orientable and oriented. This is not restrictive: one could always reduce the theory to this case by pulling back all the objects (the fibered manifold included) to the oriented double cover of M (Chapter 15 in []).
Let us briefly recall the mathematically precise meaning of the conservation laws in classical General Relativity (g, T, and X isotropic). One has
with ∇ the Levi–Civita connection. The first contribution vanishes due to , and there are different situations in which the second one vanishes as well. For instance, if is antisymmetric, then
and if is symmetric and is antisymmetric (equiv., X is a Killing vector field), then also
Anyway, whenever , one can integrate (11) and apply the pseudo-Riemannian divergence theorem to get the integral conservation law
where is a domain of appropriate regularity, ı is the interior product operator, and is the metric volume form. In a sense that will be made more precise in §5, this is expressing that the total amount of X-momentum in a space region only changes along time as much as it flows across the spatial boundary of the region. In this way, there is no “creation” nor “destruction” of X-momentum in any space region.
Extending the infinitesimal or the integral conservation laws poses, first and foremost, the problem of appropriately defining the divergence of an anisotropic T. Observe that a priori it is not clear even how to define the divergence of a vector field Z, isotropic or not, as one could consider for different anisotropic connections ∇, mainly Chern’s and Berwald’s. An alternative is to seek for a more geometric, hence, unbiased, definition. For instance, the metric (anisotropic) volume form of L,
for positively oriented, is well-defined, and when (i.e., Z is isotropic), so is the Lie derivative
(see § 5 in []). So, by analogy with the classical case, one could think of for defining .
It turns out that the unbiased definition, including all , is achieved with a modification of this Lie derivative that we will regard as an extension of the classical Lie bracket. We devote the next subsection to the technical mathematical foundations of such an anisotropic Lie bracket, which needs of a nonlinear connection on to be well-defined. All the maps that will appear in Section 4.1 will be (anisotropic) tensor derivations in the sense of (Definition 2.6 in []), and their local nature will be apparent, so we will not explicitly discuss it. For example, the Lie derivative along is the only tensor derivation such that for and ,
4.1. Mathematical Formalism of the Anisotropic Lie Bracket
During this subsection, we fix an arbitrary nonlinear connection given by or by the nonlinear covariant derivative (keep in mind (1) and (2)), and also an anisotropic vector field .
For , it is very natural to consider the commutator of the horizontal lifts of Z and X:
We recall that is always vertical. Indeed, , where is the curvature tensor of the nonlinear connection (see [], where this curvature is regarded as an anisotropic tensor and the homogeneity of the connection is not really required). This means that the horizontal part of has coordinates , and this corresponds to a globally well-defined A-anisotropic vector field:
Definition 1.
is the anisotropic Lie bracket of Z and X with respect to the nonlinear connection.
Remark 1.
The word “anisotropic” could be omitted in the previous definition, in the sense that for , there is no other Lie bracket, isotropic or not, defined in general. Nonetheless, (17) makes apparent that when (i.e., when Z and X are isotropic), coincides with the standard Lie bracket regardless of the connection.
Lemma 1.
Given a nonlinear connection , , and anisotropic vector fields , it holds that
We also recall that the torsion of an A-anisotropic connection ∇ (18 in []), (Definition 5 in []) is the anisotropic tensor defined first on isotropic fields by and then extended by -bilinearity. Therefore, it can be regarded as and -bilinear map and it has coordinates
where the ’s are the Christoffel symbols of ∇8.
Theorem 1.
Let a nonlinear connection and an anisotropic vector field be fixed.
- (A)
- If ∇ is any A-anisotropic connection whose underlying nonlinear connection is , then for any ,(where is the torsion of ∇).
- (B)
- By imposing the Leibniz rule with respect to tensor products and the commutativity with contractions, the map extends unequivocally to an (anisotropic) tensor derivation given byfor and . In coordinates, ifthen
- (C)
- The mapis also a tensor derivation. When ,for all , where is the Lie derivative (16), regardless of the nonlinear connection.
- (D)
- Given and (), it holds that
Proof.
(A) It is straightforward to compute that the right-hand side of (21) is -multilinear. Moreover, the identity is trivial on isotropic vector fields , as in this case, which concludes.
(B) Given , for it follows from (17) that
Thus, in order to respect the Leibniz rule, the only possibility is to define
Now, given , in order to respect again the Leibniz rule and the commutativity with contractions, the only possibility is to define on every by
(26), (17), and (27) make apparent that is already local on functions, vector fields, and 1-forms, and they allow to compute
Finally, given , one is led to define by (22). Clearly, this indeed provides a tensor derivation and (23) follows from the evaluation of (22) at together with (26) and (28).
(C) is a tensor derivation for any , in particular for
(see (17)). Thus, the difference is again a derivation. As for the last assertion, where , we are going to use (Proposition 2.7 in []). For , we have
(recall Remark 1). For , we have
(see (26), (29), (1), and (16)). As and act the same on isotropic vector field and anisotropic functions, they are equal.
(D) Observe that for , the term vanishes in (19). Moreover, if and , then . Given a local reference frame , and taking into account the last two identities and the definitions of and , it follows that
As , (25) follows. □
Definition 2.
The tensor derivation defined in Theorem 1 (B) is the (anisotropic) Lie bracket with Z, while is the (anisotropic) Lie derivative along Z, both of them with respect to the connection.
Remark 2
(Anisotropic Lie bracket and Lie derivative). The derivation defined in Theorem 1 (C) would be the Lie derivative along Z with respect to . Analogously to the discussion of Remark 1, what makes this name consistent is (24): whenever the Lie derivative along Z was already defined, coincides with it. Even though the Lie bracket and the Lie derivative are equal in the classical regime, it is heuristically useful to regard as the anisotropic generalization of the former and as that of the latter, in order to distinguish them. It is actually , and not , which will be relevant for the definition of divergence. The reason is that the former, as we will see below, has a clear geometric interpretation in terms of flows, while the latter would just add the term to that interpretation. Moreover, Theorem 1 (D) actually corresponds to a Cartan formula for whose full development we postpone for a future work. Thus, can be regarded as an initial guess for the divergence of Z, but we will not employ from now on.
Let us observe that given a diffeomorphism that is the flow of an isotropic vector field Z, we can define the pullback of an anisotropic differential form as the anisotropic form given by , where is the -parallel transport of v along the integral curve of Z and .
Proposition 1.
If and , then
where is the (possibly local) flow of Z.
Proof.
Remark 3.
Even though, for convenience, we stated the previous geometrical interpretation for an s-form ω, it should be clear that it holds true for any r-contravariant s-covariant A-anisotropic tensor.
4.2. Lie Bracket Definition of Divergence
Finally, in this and the next subsections a pseudo-Finsler metric L defined on A is fixed again. In its presence, and in view of the Riemannian case and Proposition 1, the most natural way of defining the divergence of an anisotropic vector field Z is by . Here there is a canonical choice for : the metric nonlinear connection of L. The definition obtained this way is unbiased, in that one does not choose any anisotropic connection a priori. Notwithstanding, it will turn out to be most conveniently expressed in terms of the Chern connection.
Definition 3.
Remark 4.
Even though we will keep assuming it for simplicity, the hypothesis of M being orientable is not really needed for this definition. As in pseudo-Riemannian geometry, on small enough open sets it is always possible to choose an orientation, define with respect to it and put . The different definitions will be coherent because when the orientation changes, changes to and
In particular, when M is orientable, is independent of the orientation choice.
Proposition 2.
Let L be a fixed pseudo-Finsler metric defined on A, and let . If ∇ is any symmetric A-anisotropic connection such that its underlying nonlinear connection is the metric one and , then
or in coordinates,
This, in particular, is true for the (Levi-Civita)–Chern anisotropic connection of L, so one can take the Christoffel symbols to be those of (5).
Proof.
One expresses the Z-Lie bracket of the volume form in terms of the anisotropic connection, analogously to the isotropic case. From (15) and the fact that is a tensor derivation, we obtain
(26) and the fact that is the underlying nonlinear connection of ∇ give
(21) and
From these and ,
where the last equality is reasoned analogously as in the proof of (25).
For the Chern connection, it can be checked that by considering a parallel orthonormal basis with respect to a parallel observer V along the integral curves of any vector field. The coordinate expression of in this case concludes (33). □
4.3. Divergence Theorem and Boundary Term Representations
Our Lie bracket derivation allows us to obtain a statement of the Finslerian divergence theorem that subsumes both Rund’s (3.17 in []) and Minguzzi’s (Theorem 2 in []). This way, it does not need of computations in coordinates from the beginning nor of the “pullback metric” ( in our notation). Naturally, our statement does not include Shen’s (Theorem 2.4.2 in []), as this one is an independent generalization of the Riemannian theorem not dealing with anisotropic differential forms nor vector fields.
Lemma 2.
Proof.
Let , …, be a positively oriented -orthonormal basis for every for a certain . Then for all . This implies that
Moreover, as ,
Using this relation above, we conclude (35). □
In the present article, by a domain we understand a nonempty connected set that coincides with the closure of its interior D; then, its boundary is . Physically, it is very important to include examples in which different parts of have different causal characters, and this tipically leads to the boundary not being totally smooth. Hence, we will make a weaker regularity assumption that still allows one to apply Stokes’ theorem on . A subset of M has 0 m-dimensional measure if its intersection with any embedded m-dimensional submanifold is of 0 measure in the smooth manifold . Finally, the interior product of an s-form with a vector field X will be
Theorem 2.
Let L be a fixed pseudo-Finsler metric defined on A. If
- (i)
- is an anisotropic vector field,
- (ii)
- is an A-admissible field with open, and
- (iii)
- is a domain with smooth up to subset of 0 -dimensional measure on M and compact,
Proof.
Remark 5
(Riemannian and Finslerian unit normals). Let be the inclusion of a smooth open subset .
- (i)
- Even though we do not use the pseudo-Riemannian metric to derive Theorem 2, from our physical viewpoint it is natural to use it to re-express the boundary term. If Γ is non--lightlike, then for a -normal field and a transverse field X along i, the formis nonvanishing and independent of X. In particular,is independent of the scale of , which we will always assume to be -unitary and D-salient, socoincides with the hypersurface -volume form of Γ. Taking into account that vanishes wherever is tangent to Γ and that , (37) allows us to represent and the right-hand side of (36) asIn fact, this is how Rund’s divergence theorem follows from Theorem 2.
- (ii)
- There is another way that one can try to represent the boundary term. Namely, assume that there exists a smooth with and (in the Lorentz–Finsler case, it will necessarily be ). This is called a Finslerian unit normal along . Analogously as in (i), one can puthere, due to the possible orientation difference between both sides,In fact, this is how Minguzzi deduces his divergence theorem (Theorem 2 in []). Note, however, that he does it under the hypothesis of vanishing mean Cartan tensor (), which implies that is independent of V. As we do not require this, Theorem 2 is more general statement than Minguzzi’s.
- (iii)
- The Finslerian unit normal presents some issues in the general case, as we are not taking . In our physical interpretation, with L Lorentz–Finsler, A consists of timelike vectors, so asking for a Finslerian unit normal is only reasonable when Γ is L-spacelike, that is, for . In such a case, the strong concavity of the indicatrix guarantees the existence and uniqueness of ξ: one defines to be the unique vector such that and the indicatrix are tangent at .
- (iv)
- Of course, if L comes from a pseudo-Riemannian metric on M, then and .
- (v)
- It should be clear from this discussion that the form that one integrates on the right-hand side of (36) is always the same and that the only difference between Rund’s and Minguzzi’s divergence theorems is how each of them represents it. Notwithstanding, this is an important difference, for the boundary terms (38) and (39) could potentially have different physical interpretations.
5. Divergence of Anisotropic Tensor Fields
Our developments of the previous section will allow us to obtain integral Finslerian conservation laws for a tensor T with . We obtain one for each satisfying certain hypotheses. Physically, T can be interpreted as an anisotropic stress–energy tensor and V as an observer field. We will also revisit two of the main examples with a clearer physical interpretation: Special Relativity and the conservation of the “total energy of the universe.” In order to do all this, let us see how the Chern connection enters the Finslerian definition of .
5.1. Definition of Divergence with the Chern Connection
Proposition 2 motivates the most natural definition of divergence of . Namely, by analogy with the classical case, we shall require (11) to hold for any anisotropic vector field . This makes the Chern connection appear now: it is the only Finslerian connection ∇ for which one can assure that (32) holds independently of . We shall also explore the conditions under which the term vanishes in the general Finslerian setting.
Proposition 3.
Let L be a fixed pseudo-Finsler metric defined on A with metric nonlinear connection and Chern anisotropic connection ∇. Also, let be symmetric, , and .
(A) The following are equivalent.
- (Ai)
- is antisymmetric.
- (Aii)
- is anti-self-adjoint with respect to , that is, .
- (Aiii)
- .
(B) One has
where is the operator that contracts the contravariant index with the covariant one introduced by ∇.
(C) One has assuming any of the following conditions.
- (Ci)
- is antisymmetric.
- (Cii)
- is symmetric and .
Proof.
For (A), take . The antisymmetry of reads
which is exactly the anti-self-adjointness of with respect to . Besides, (26) and (21) together with for the Chern connection give
which shows that also is equivalent to the anti-self-adjointness.
For (B), all the computations in (11) hold formally the same in the general Finslerian case due to Proposition 2.
As for the vanishing of , it follows from (Ci) by the same computations as in (12). Indeed, the antisymmetry can be expressed as
It also follows from (Cii) by (13). Indeed, is equivalent to being anti-self-adjoint with respect to , and this can be expressed as
□
Remark 6
( and Finslerian Killing fields). In classical relativity (g, T, and X isotropic), the second condition in (C ii) above would read , and would be equivalent to X being a Killing vector field. In the general case, X being Killing can be defined by the conditions and (Section 5 in []), but (using Theorem 1 (C), the facts that and , and also (40))
This way, we see that neither of X being Killing or implies the other, and additionally we recover the characterization of (Proposition 6.1 (i) in []).
Definition 4.
Let L be a fixed pseudo-Finsler metric defined on A with (Levi-Civita–)Chern anisotropic connection ∇. For , its divergence with respect to L is defined as
where is the operator that contracts the contravariant index with the covariant one introduced by ∇. In coordinates,
for the Christoffel symbols of (5).
Remark 7
(Divergence vs. raising and lowering indices).
- (i)
- First and foremost, by construction, (11) indeed holds for any . At this point, it is important that the connection with which one defines is the Chern one.
- (ii)
- Thanks to the fact that the Chern connection parallelizes g, namely, and , the following hold:
- (iii)
- Regardless of this, in general we are not assuming the symmetry of or —we only did in Proposition 3 (Cii). Instead, at the beginning of §5 we fixed a convention for the order of the indices in and (for example, )—in the remainder of §4 and with said condition (Cii) only.
5.2. Chern vs. Berwald
One needs to keep in mind a discussion present in []. The metric connection is the underlying nonlinear connection of an infinite family of A-anisotropic connections ∇. One of them is the (Levi–Civita)–Chern connection of L, which is the horizontal part of Chern–Rund’s and Cartan’s classical connections and has Christoffel symbols (5). All the others are this one plus an anisotropic tensor with when viewed as an -bilinear map . In particular, for , one gets the Berwald anisotropic connection of L, which is the horizontal part of Berwald’s and Hasiguchi’s classical connections and has Christoffel symbols (6). We did not as a priory select any of these ∇’s.
In some of the previous literature [,,,], the Finslerian divergence of vector fields was chosen to be defined directly with the Chern connection. In [,], the quantity , with ∇ the Chern anisotropic connection, was referred to as the divergence of Z, though only after it had appeared in the divergence theorem. We have proven that the most natural definition leads to this characterization, hence clarifying why using Chern’s covariant derivative is not arbitrary. Moreover, we have seen that said derivative fulfills the natural requisite (11) and is compatible with the lowering and raising of indices; these are key properties when it comes to the stress–energy tensor T. Still, it is important to compare this with what happens when one uses the other most natural covariant derivative: Berwald’s.
Remark 8
(Divergence in terms of the Berwald connection). Let ∇ be the Chern anisotropic connection of L, with Christoffel symbols (5), and be the Berwald one, with symbols (6).
- (i)
- (ii)
- One sees that the vanishing of (or of the mean Cartan , see ((6.37) in [])) implies that the divergence of elements of coincides with the trace of their Berwald covariant derivative. However, (or even ) is not enough if one wants to obtain the same characterization for elements of .
Remark 9
(Sufficient conditions for and being Finslerian Killing). In Remark 13 one could see that together with is sufficient for X to be Killing. This condition does not privilege the Chern connection ∇ against the Berwald :
(see ((38) in []), where is what here we would denote ). However, when it comes to the stress–energy tensor, we have seen that the relevant condition is not this but rather . Proposition 3 (A) implies that is sufficient for , and this does privilege ∇ against .
5.3. Finslerian Conservation Laws and Main Examples
Compare the results here with the classical case (14) and also with [].
Corollary 1.
Let L be a fixed pseudo-Finsler metric defined on A. If
- (i)
- is an anisotropic vector field,
- (ii)
- is an A-admissible field with open,
- (iii)
- is an anisotropic 2-tensor, and
- (iv)
- is a domain with smooth up to subset of 0 -dimensional measure on M and compact,
Proof.
Just take in Theorem 2 and use part (B) or Proposition 3. □
Remark 10.
Observe that (44) allows for an interpretation of the divergence of T in terms of the flow in the boundary. Consider a sequence of domains such that their volumes go to zero when and consider an observer V such that is infinitesimally parallel at , namely, in and X such that . Then (44) and the mean value theorem imply that
In particular, can be interpreted as that the observer v measures conservation of energy in its restspace.
Corollary 2.
In the ambient of the previous corollary, assume:
- (i)
- .
- (ii)
- Any of the conditions (Ci) or (Cii) of Proposition 3 holds for .
- (iii)
- .
Then
Proof.
It follows from Corollary 1, taking into account that the hypotheses , , and imply that the three first integrals in (44) vanish. □
Remark 11.
(Sufficient conditions for the hypotheses (i), (ii), and (iii)).
- (i)
- Obviously, suffices, but we do not need to assume that the divergence vanishes for all observers.
- (ii)
- suffices. In fact, (Proposition 2.9 in []), so (Ci) of Proposition 3 holds for . Thus, assuming the other two hypotheses, we get
- (iii)
- Although the hypothesis may seem artificial as it stands, there are a number of natural situations in which it is guaranteed. First, in classical relativity (g, T, and X isotropic), because and ; the result is then independent of V. Second, when the observer field is parallel (), trivially. Third, when for some 1-form V and is 0-homogeneous, because of Euler’s theorem. Fourth, in the situation described in (Section 5.1 in []) (Z is our , s is our V, and I is our ).
Remark 12.
(Representations of (45)). One needs to keep in mind Remark 5. For a smooth part Γ of , one can use the (salient) Riemannian unit normal to represent
when Γ is non--lightlike, and the Finslerian unit normal to represent
when L is Lorentz–Finsler and Γ is L-spacelike. This makes it possible to have the very same conservation law (45) written in distinct ways, and in the examples below we will see that different expressions are preferable in different situations.
In the remainder of the section, we analyze the Finslerian conservation laws in two settings in which L is Lorentz–Finsler. In particular, g has signature , A determines a time orientation, on A, and is maximal with these properties. We also have regularity conditions at , and in fact one sees that Theorem 2 and Corollary 2 still hold when allowing that , and . Despite this, in both settings it will be necessary to take V as L-timelike, so the regularity at will not be used.
5.3.1. Example: Lorentz Norms on an Affine Space
In this example, we shall particularize Corollary 2 to the easiest Finslerian setting in which we can assure that its hypothesis (iii) holds. Namely, the structure of an affine space automatically provides an infinite number of parallel observer fields, with .
To be preicse, suppose that is an affine space equipped with a Lorentz norm on an open conic subset (a positive pseudo-Minkowski norm with Lorentzian signature in (Definition 2.11 in [])). Under the usual identifications, such a norm can be seen as a Lorentz–Finsler L on that is independent of the first factor. Consequently, its fundamental tensor is nothing more than a Lorentzian scalar product for each . The metric nonlinear connection of L coincides with the canonical connection of E, hence so do the Chern and Berwald anisotropic connections10. This is what implies that the parallel correspond exactly to the elements .
Let us introduce some notation. Given with , we can consider the Lorentzian scalar product and the orthogonal hyperplane . We get an isometry , where is equipped with (a Euclidean scalar product), with (a Lorentzian one) and E with . Let be a compact domain of with smooth up to a null -dimensional measure set, and let be its salient unit -normal. Then, for , the compact domain has the required smoothness to apply Corollary 2, its boundary is , and its salient -normal is given by
Remark 13.
For a identifiable with , we know that the hypothesis (iii) of Corollary 2 holds automatically. If (i) and (ii) hold too, then we get (45), for which we can use the representation (46). However, given the nature of the metric “nonlinear” and Chern “anisotropic” connections, it is easy to convince oneself that evaluating the result of anisotropic computations on this V is the same as first evaluating on V and then computing with isotropic tensors. For instance and . As a consequence, mathematically we get exactly the same conservation laws as if we just were in the Lorentzian affine space . Physically, though, different observers will measure different momenta.
Corollary 3.
Let parallely identifiable with an . If is such that and is such that is antisymmetric, or is symmetric and , then
where is identifiable with the volume form of on and coincides with the volume form of on .
Physically, even though Lorentz norms generalize Very Special Relativity [], the classical interpretations of Special Relativity are still valid; we list them for completeness: v is an instantaneous observer at an event , is its restspace and is the simultaneity hyperplane of v, namely, the “universe at an instant, say , as seen by v.” The affine space structure allows for a canonical propagation of v to all of the spacetime. Hence, if is a space region at , then is the “evolution of along the time interval as witnessed by v.” (47) expresses that the variation after some time of the total amount of -momentum in Ω is exactly equal to the amount of it that flowed across .
5.3.2. Example: Cauchy Hypersurfaces in a Finsler Spacetime
Here we present a construction which manifestly generalizes that of the previous example, again with straightforward physical interpretations, and we find an estimate that allows us to interpret (47) when is “at infinity”. We will take with open, and we recall that we will assume the hypotheses of Corollary 2.
Suppose that the Finsler spacetime is globally hyperbolic. By this, we mean that there is some (smooth, for simplicity) L-Cauchy hypersurface : every inextensible L-timelike curve (thus ) meets exactly once. Let us assume that there are two L-spacelike Cauchy hypersurfaces which do not intersect11. Then the results of [] can be automatically transplanted: there exists a foliation by spacelike Cauchy hypersurfaces such that and . Taking the Finslerian unit normal to each level produces an L-timelike field . We can take this to be our V, but we will not do so for the most part of this example.
Suppose also that is an exhaustion by compact domains of , namely and , such that is smooth a. e. For , let be the integral curve of V starting at p, which necessarily meets at a unique instant . Put
Remark 14.
By construction,
- (i)
- is again an exhaustion by compact domains of such that is smooth a. e.
- (ii)
- is a compact domain of U with smooth a. e. We do not really need to consider the union of all the ’s.
Next, for , we shall give the quantitative decay condition on (some components of) so that the integral
vanishes in the limit. The key fact for it will be that V is everywhere tangent to (this is composed of ’s). In particular, as V is -timelike, so must be .
Remark 15.
The presence of V allows us to define an auxiliar Riemannian metric on U with norm , which gives a very natural way of quantifying. Namely, if is an orthonormal basis for , then we prescribe it to be also -orthonormal; equivalently,
Then, by construction:
- (i)
- The volume form of coincides with that of , namely .
- (ii)
- The salient unit -normal to coincides with the corresponding -normal. We denote it by , as in Remark 12.
- (iii)
- The hypersurface volume form of with respect to coincides with the one computed with , namely with the inclusion. Hence we speak just of the hypersurface volume of , namely . As is -orthogonal to V, and hence -spacelike, we can use the representation
Thanks to (48) and the fact that , we intuitively see that if is proportional to V at infinity and the hypersurface volume does not grow too much, then the integral will be negligible. To be precise, we require that
where
Corollary 4.
In the above set-up, let , , and be such that the hypotheses of Corollary 2 hold on all the ’s, and put . If the decay condition (49) holds too, then
where is constructed from by intersecting the integral curves of V with .
Proof.
Corollary 2 can be applied on , as is always compact. This and the representation (48) give
Using the definition of (Remark 15) and the Cauchy–Schwarz inequality,
so if tends to 0, then so does the integral along in (51). □
Remark 16.
In Corollary 4, if one of the integrals of along or exists in the Lebesgue sense, then so does the other and (50) reads
Note that they could be , as we have not assumed, for instance, that is compactly supported in the union of all the ’s. Rather, we have assumed the decay condition (49) alone.
Remark 17.
(Sufficient conditions for (49)). As for ensuring the decay condition, there are two possible scenarios.
- (i)
- The hypersurface volume stays bounded. Then, it is enough for (49) that , and one could instead postulate the stronger condition that the maximum outside tends to 0, which is independent of the concrete compact exhaustion.
- (ii)
- grows without bound. In this case, one can just postulate that the decay of compensates the growth of , but this does depend on the compact exhaustion.
Notice that this is a purely Finslerian difficulty. Indeed, suppose that g, T, and X were isotropic and that was timelike. Then one could just set and then carry out all the construction. Corollary 2 would be independent of the observer field (and its hypothesis (iii) would hold trivially), and regardless of . This is how we get the following statement of the classical law.
Corollary 5.
In the above se-up, suppose that L comes from a Lorentzian metric on M. Let and be such that and is antisymmetric, or is symmetric and . If is timelike, then
where is constructed from by intersecting the integral curves of Z with .
Remark 18
(Conservation in terms of the Finslerian unit normal).
- (i)
- One could try to represent also the integrals of (50) in terms of , as in Section 5.3.1. However, according to Remark 12, that would require assuming that is non--lightlike, which is not very reasonable when all we know is that L-spacelike and L-Cauchy.
- (ii)
- On the other hand, in terms of the Finslerian unit normal ξ, (50) readswhen . The sign in front of the second integral is explained as follows (see Remark 5 (ii)). selects an orientation on each : the one for which is positive. However, in (50) already had an orientation and had : the -salient ones. Necessarily12, exactly one of these agrees with the -orientation: if lays in the future of and if it is the opposite. Notice that this, and hence (52), would fail if the Cauchy hypersurfaces crossed.
- (iii)
Summing up, in this example we have proven a Finslerian (observer-dependent) version of the classical law that the total amount of X-momentum in the universe is conserved (Corollary 4). Our formulation is asymptotic, so it is valid even for infinite total -momentum (Remark 16). We have recovered the classical law (Corollary 5), which always holds under hypotheses on T and X alone, while in the general Finslerian case nontrivial difficulties appear in the regime of big separation between the Cauchy hypersurfaces (high , Remark 17). Finally, we have expressed the law naturally in terms of the Finslerian unit normal (see (52)).
6. Conclusions
About the physical interpretation of T, Section 3:
- 1
- Heuristic interpretations from fluids, Section 3.1 and Section 3.2—Possible breakings of Lorentz-invariance lead to non-trivial transformations of coordinates between observers. Such transformations are still linear and permit a well-defined energy-momentum vector at each tangent space , Section 3.1.However, the stress–energy–momentum T must not be regarded as a tensor on each but as an anisotropic tensor. This depends intrinsically on each observer and may vary with u in a nonlinear way. Indeed, the breaking of Lorentz invariance does not permit to fully replicate the relativistic arguments leading to (isotropic) tensors on M, even though classical interpretations of the anisotropic T in terms of fluxes can be maintained, Section 3.2.
- 2
- Lagrangian viewpoint, Section 3.3. In principle, the interpretations of Special Relativity about the canonical energy–momentum tensor associated with the invariance by translations remain for Lorentz norms and, thus, in Very Special Relativity. In the case of Lorentz–Finsler metrics, some issues to be studied further appear:
- (a)
- The canonical stress–energy tensor in Relativity leads to different types of (anisotropic) tensors in the Finslerian setting (a scalar function on in the Einstein-Hilbert setting, higher order tensors in Palatini’s). Starting at such tensors, different alternatives to recover the heuristic physical interpretations in terms of a 2-tensor appear.
- (b)
- In the particularly interesting case of a kinetic gas [,], the 1-PDF becomes naturally the matter source for the Euler-Lagrange equation of the Finslerian Einstein-Hilbert functional. However, the variational derivation of is obtained by means of a possibly non-natural Lagrangian. This might be analyzed by sharpening the framework of variational completion for Finslerian Einstein equations [].
About the divergence theorem for anisotropic vector fields Z, Section 4:
- 1
- Section 4.1: For any Lorentz–Finsler metric L, there is a natural definition of anisotropic Lie bracket derivation along Z, which depends only on the nonlinear connection and admits an interpretation by using flows.
- 2
- Section 4.2: This bracket allows one to give a natural definition of which depends exclusively on and the volume form of L. This provides a geometric interpretation for the definition of divergence introduced by Rund [].
- 3
- A general divergence theorem is obtained (Theorem 2) so that Section 4.3:
- (a)
- It can be seen as a conservation law for Z measured by each observer field V, even if the conserved quantity depends on V.
- (b)
- The computation of the boundary term is intrinsically expressed in terms of forms. However, several metric elements can be used to re-express it, in particular the normal vector field for: (i) the pseudo-Riemannian metric (Rund), or (ii) the pseudo-Finsler metric L, when L is defined on the whole (Minguzzi).
About the conservation of the stress–energy T Section 5:
- 1
- Section 5.1 and Section 5.2: The computation of priviledges the Levi-Civita–Chern anisotropic connection, showing explicit equivalence with Rund’s approach.
- 2
- Corollaries 1 and 2: A vector field on M is preserved assuming that some natural elements vanish on V for T, X and .
- 3
- Section 5.3: Natural laws of conservation on Cauchy hypersurfaces under general conditions (including rates of decay for unbounded domains) can be obtained by a combination of the techniques (i) and (ii) in the item 3b above.
Author Contributions
Investigation, M.Á.J., M.S. and F.F.V. All authors have read and agreed to the published version of the manuscript.
Funding
MAJ was partially supported by the project PGC2018-097046-B-I00 funded by MCIN/ AEI /10.13039/501100011033/ FEDER “Una manera de hacer Europa” and Fundación Séneca project with reference 19901/GERM/15. This work is a result of the activity developed within the framework of the Programme in Support of Excellence Groups of the Región de Murcia, Spain, by Fundación Séneca, Science and Technology Agency of the Región de Murcia. MS and FFV were partially supported by the project PID2020-116126GB-I00 funded by MCIN/AEI/10.13039/501100011033, by the project PY20_01391 (PAIDI 2020) funded by Junta de Andalucía—FEDER and by the framework of IMAG-María de Maeztu grant CEX2020-001105-M funded by MCIN/AEI/10.13039/50110001103.
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A. Kinematics: Observers and Relative Velocities
Here, we discusss a series of different possibilities for the notion of relative velocity between two observers, each one with a well-defined geometric construction. This is done as an academic exercise, because we do not discuss experimental issues (compare with [,]). However, it is worth emphasizing that all the possibilities studied here are intrinsic to the geometry of a flat model and, thus to any Finsler spacetime.
Start at an affine space endowed with a Lorentz norm let be two distinct observers and consider the plane Span, which intersects transversally and inherits a Lorentz–Finsler norm with indicatrix . Recall that both tangent spaces and inherit naturally a Lorentz scalar product by restricting the fundamental tensors and , resp. Moreover, their (1-dimensional) restspaces , also inherit a positive definite metric. In what follows, only the geometry of will be relevant.
Appendix A.1. The Lorentz Metric g Π up to a Constant
Notice that is composed by two half-lines spanned by two -lightlike directions ; we will consider the orientation provided by the choice . One can determine a scalar product in (which is unique up to a positive constant), regarding both and as -lightlike in the same causal cone. It is easy to check that must be a strongly convex curve which converges asymptotically to the vector lines spanned by . This implies both will be timelike for and its restpace l will be -spacelike; we can assume also that the orientation in l is induced by the chosen .
Notice that by the fundamental inequality, but might be timelike or spacelike for (although as ). This possibility might be regarded as a possible measurement of the speed of light with respect to u by the observers in , namely, this velocity is in the orientation when is -spacelike and smaller than 1 when it is timelike. However, a priori it is not clear an operational way to carry out such a measurement. Moreover such a measurement might be regarded as something non-intrinsic to the speed of light but to the way of measuring it.
Nevertheless, as pointed out in (Section 6 in []), there are several effects which might lead to a measurement of different speeds of light in different directions. So, we will consider that each has its own speeds of light in each spacelike orientation . Indeed, given u and an orientation , the speed of light will be defined as the the supremum of the relative velocities between u and all the observers such that yields the orientation . Next, we will explain several possible meanings of these velocities. To avoid cluttering, next we will write , assuming that the appropriate choice in is done for each .
Appendix A.2. Simple Relative Velocity
As determines naturally a Lorentz metric on V, we can define the simple relative velocity of measured by u as the usual -relativistic velocity between u, normalized to , i.e.,
(the latter by the reversed fundamental inequality). Clearly, in general, but this does not seem a drawback in the Finslerian setting.
A support for the physical plausibility of this velocity is that one could expect that each observer u will work as in Special Relativity just choosing an orthonormal frame of . The possibility might seem ackward from a dynamical viewpoint (see below), but it seems harmless as far as only kinematics is being considered. In principle, the comparison between the measurements of the two observers would be geometrically possible by using the unique isometry of to which maps u into and is consistent with orientations induced from . What is more, this isometry can also be extended to a natural isometry from to , namely, regard as a Riemannian metric and use the parallel transport from u to along the segment of the curve from u to . However, the following fact might suggest to explore further possibilities.
Remark A1.
Assume that Σ is modified into the indicatrix of another Lorentz–Finsler norm so that (i) around u and (ii) but its restspace is different from . Then, the simple velocity would remain unaltered, i.e., .
Appendix A.3. Velocity as a Distance between Observers
Notice that can be regarded as a Riemannian manifold with the restriction of the fundamental tensor g and, then, can be regarded as a curve whose length can be computed. Then, the observers’ distance velocity is defined as:
Notice that this velocity is symmetric and it generalizes directly the one in Special Relativity providing a geometric interpretation for the addition of velocities. Recall that has been defined essentially as a distance in , where depends of each pair of observers, thus, one might have when . If one prefers to avoid such a possibility, it is enough to consider g-distance in the whole space of observers (observers’ space distance velocity), at least in the case that is regarded as independent of .
Remark A2.
In the case studied in Remark A1, one would have in general. However, the relative position of the restspaces l and does not play any special role.
Appendix A.4. Length-Contraction and Velocity
Consider a segment S of l with -length ℓ and the strip of V obtained by translating S in the direction of u. Let be the intersection of this strip with , which will be a new segment of -length . Let be the length-contraction parameter. In the relativistic case, and as . The former property does not hold for a general Lorentz norm but the latter does. So, whenever holds, we can define the length-contractive velocity of with respect tou as:
Again, this velocity is not symmetric. Because of the strong convexity of , a different observer will have a different restspace , but this does not imply a different length nor velocity . However, this velocity gives a comparison between restspaces which was absent in the previous two velocities.
Appendix A.5. Symmetric Lorentz Velocities in Π
Let us consider the Lorentzian scalar product en , uncfique up to a positive constant (which will be irrelevant for our purposes) introduced above. Recall that u and were timelike for and, moreover, both l and were spacelike. Now, we can define two velocities between u and : the simple Lorentz velocity,
and the length-contractive Lorentz velocity,
where, in the latter, n, are -timelike vectors orthogonal to l, , resp.
Clearly, both velocities are symmetric. Their appearance might be physically sound because the intrinsic Lorentz metric (up to a constant) can be regarded as an object available (or, at least, a compromise one) for all the observers, as it would depend directly on physical light rays.
Notes
| 1 | Berwald spaces [,] are an exception, as the parallel transport becomes an isometry between the Lorentz norms. Thus, in some sense, these spaces would admit a principle of equivalence with respect to a Lorentz-normed space (non-necessarily Lorentz–Minkowski). |
| 2 | In this section, and , but in the others they will run freely from 1 to n (= dim M). |
| 3 | The symmetry of T is dropped for the case of theories with high spin because of its contribution to angular momentum. |
| 4 | See for example (Section 4.5 in [], >Section 35 in []). |
| 5 | See for example [] (around formula (E.1.36)) or (Section 32 in []). |
| 6 | See for example, (Section E.1 in []), (Section 4.3 in []), (Sections 21.2 and 21.3 in []). |
| 7 | Some arguments which support strongly their choice are (see []): (a) the simplest analogous to the vacuum Einstein equation in the Finslerian approach Ricci (proposed by Rund [], and satisfied by Finsler pp-waves []) is not a variational equation; (b) the Ricci scalar functional yields an Euler–Lagrange equation, which agrees with Einstein’s in the vacuum Lorentz case, and (c) this Euler–Lagrange equation is the variational completion of the Finslerian Ricci . |
| 8 | This is not to be mistaken by the torsion of the nonlinear connection , which would have coordinates (even though this can be seen as a particular case of the torsion of some ∇ and hence it is also denoted by in []). |
| 9 | Here, is the operator that (metrically) contracts the first index of S with the one introduced by ∇, and is the operator that (naturally) contracts the first index of R with the one introduced by ∇. |
| 10 | For instance, it is clear that in affine coordinates the components of the metric spray vanish, so the geodesics are the straight lines of E. |
| 11 | The case when they interesect can be also conisdered by taking into account that, then, the open set is still globally hyperbolic and a Cauchy hypersurface of this open subset will be also Cauchy for M (and it will not intersect any of the previous ones). |
| 12 | Suppose, for instance, that lays in the future of : the ’s departing from reach points with . Take bases for and for such that and are -positive. Then and are both -positive ( and V always lie in the same half-space), the former is -negative (V is -entering at ) and the latter is -positive (V is -salient at ). |
References
- Hohmann, M.; Pfeifer, C.; Voicu, N. Relativistic kinetic gases as direct sources of gravity. Phys. Rev. D 2020, 101, 024062. [Google Scholar] [CrossRef]
- Hohmann, M.; Pfeifer, C.; Voicu, N. Cosmological Finsler Spacetimes. Universe 2020, 6, 65. [Google Scholar] [CrossRef]
- Kouretsis, A.P.; Stathakopoulos, M.; Stavrinos, P.C. The General Very Special Relativity in Finsler Cosmology. Phys. Rev. D 2009, 79, 104011. [Google Scholar] [CrossRef]
- Li, X.; Chang, Z. Exact solution of vacuum field equation in Finsler spacetime. Phys. Rev. D 2014, 90, 064049. [Google Scholar] [CrossRef]
- Stavrinos, P.; Vacaru, O.; Vacaru, S. Modified Einstein and Finsler Like Theories on Tangent Lorentz Bundles. Int. J. Mod. Phys. D 2014, 23, 1450094. [Google Scholar] [CrossRef]
- Bernal, A.N.; Javaloyes, M.A.; Sánchez, M. Foundations of Finsler Spacetimes from the Observers’ Viewpoint. Universe 2020, 6, 55. [Google Scholar] [CrossRef]
- Bogoslovsky, G. A special-relativistic theory of the locally anisotropic space-time. Il Nuovo C. B Ser. 1977, 40, 99–115. [Google Scholar] [CrossRef]
- Cohen, A.G.; Glashow, S.L. Very special relativity. Phys. Rev. Lett. 2006, 97, 021601. [Google Scholar] [CrossRef]
- Fuster, A.; Pabst, C. Finsler pp-waves. Phys. Rev. D 2016, 94, 104072. [Google Scholar] [CrossRef]
- Gibbons, G.W.; Gomis, J.; Pope, C.N. General very special relativity is Finsler geometry. Phys. Rev. D 2007, 76, 081701. [Google Scholar] [CrossRef]
- Kostelecký, V.A. Riemann-Finsler geometry and Lorentz-violating kinematics. Phys. Lett. B 2011, 701, 137–143. [Google Scholar] [CrossRef]
- Fuster, A.; Heefer, S.; Pfeifer, C.; Voicu, N. On the non metrizability of Berwald Finsler spacetimes. Universe 2020, 6, 64. [Google Scholar] [CrossRef]
- Fuster, A.; Pabst, C.; Pfeifer, C. Berwald spacetimes and very special relativity. Phys. Rev. D 2018, 98, 084062. [Google Scholar] [CrossRef]
- Javaloyes, M.A.; Sánchez, M. Finsler metrics and relativistic spacetimes. Int. J. Geom. Methods Mod. Phys. 2014, 11, 1460032. [Google Scholar] [CrossRef]
- Hohmann, M.; Pfeifer, C.; Voicu, N. Finsler gravity action from variational completion. Phys. Rev. D 2019, 100, 064035. [Google Scholar] [CrossRef]
- Hohmann, M.; Pfeifer, C.; Voicu, N. Finsler-based field theory—A mathematical foundation. arXiv 2021, arXiv:2106.14965v1. [Google Scholar]
- Javaloyes, M.A.; Sánchez, M.; Villasenor, F.F. The Einstein-Hilbert-Palatini formalism in pseudo-Finsler geometry. arXiv 2021, arXiv:2108.03197v2. [Google Scholar]
- Rund, H. A divergence theorem for Finsler metrics. Monatshefte Math. 1975, 79, 233–252. [Google Scholar] [CrossRef]
- Minguzzi, E. A divergence theorem for pseudo-Finsler spaces. Rep. Math. Phys. 2017, 80, 307–315. [Google Scholar] [CrossRef][Green Version]
- Javaloyes, M.A.; Sánchez, M.; Villasenor, F.F. Anisotropic connections and parallel transport in Finsler spacetimes. arXiv 2021, arXiv:2107.05986. [Google Scholar]
- Javaloyes, M.A. Anisotropic tensor calculus. Int. J. Geom. Methods Mod. Phys. 2019, 16, 1941001. [Google Scholar] [CrossRef]
- Javaloyes, M.A. Curvature computations in Finsler Geometry using a distinguished class of anisotropic connections. Mediterr. J. Math. 2020, 17, 123. [Google Scholar] [CrossRef]
- Javaloyes, M.A.; Sanchez, M. On the definition and examples of cones and Finsler spacetimes. Rev. Real Acad. Cienc. Exactas Fís. Nat. Ser. Matemáticas 2020, 114, 30. [Google Scholar] [CrossRef]
- Schutz, B.F. A First Course in General Relativity, 2nd ed.; Cambridge University Press: New York, NY, USA, 2009. [Google Scholar]
- Landau, L.D.; Lifshitz, E.E. The Classical Theory of Fields, 3rd ed.; Pergamon Press Ltd.: Headington Hill Hall, UK, 1971. [Google Scholar]
- Wald, R.M. General Relativity; University of Chicago Press: Chicago, IL, USA, 1984. [Google Scholar]
- Carroll, S.M. Spacetime and Geometry: An Introduction to General Relativity; Addison-Wesley: San Francisco, CA, USA, 2004. [Google Scholar]
- Misner, C.W.; Thorne, K.S.; Wheler, J.A. Gravitation; W. H. Freeman: San Francisco, CA, USA, 1973. [Google Scholar]
- Pfeifer, C.; Wohlfarth, M. Causal structure and electrodynamics on Finsler space-times. Phys. Rev. D 2020, 84, 044039. [Google Scholar] [CrossRef]
- Gotay, M.J.; Marsden, J.E. Stress-Energy-Momentum Tensors and the Belinfante-Rosenfeld Formula. Contemp. Math. 1992, 132, 367–392. [Google Scholar]
- Lee, J.M. Introduction to Smooth Manifolds; Springer: Berlin/Heidelberg, Germany, 2012. [Google Scholar]
- Shen, Z. Lectures on Finsler Geometry; World Scientific: Singapore, 2001. [Google Scholar]
- Herrera, J.; Javaloyes, M.A.; Piccione, P. On a monodromy theorem for sheaves of local fields and applications. Rev. Real Acad. Cienc. Exactas, Fís. Nat. Ser. Matemáticas 2017, 111, 999–1029. [Google Scholar] [CrossRef]
- Dragomir, S.; Larato, B. Harmonic functions on Finsler spaces. Instanbul Univ. Sci. Fac. J. Math. Phys. Astron. 1991, 48, 67–76. [Google Scholar]
- Mbatakou, J.S.; Todjihounde, L. Conformal change of Finsler-Ehresmann connections. Appl. Sci. 2014, 16, 32–47. [Google Scholar]
- Nibaruta, G.; Degla, S.; Todjihounde, L. Finslerian Ricci deformation and conformal metrics. J. Appl. Math. Phys. 2018, 6, 1522–1536. [Google Scholar] [CrossRef][Green Version]
- Nibaruta, G.; Nibirantiza, A.; Karimumuryango, M.; Ndayirukiye, D. Divergence lemma and Hopf’s theorem on Finslerian slit tangent bundle. Balk. J. Geom. Its Appl. 2020, 25, 93–103. [Google Scholar]
- Shen, Z. Differential Geometry of Spray and Finsler Spaces; Kluwer Academic Publishers: Dordrecht, The Netherlands, 2001. [Google Scholar]
- Bernal, A.N.; Sanchez, M. Further results on the smoothability of Cauchy hypersurfaces and Cauchy temporal functions. Lett. Math. Phys. 2006, 77, 183–197. [Google Scholar] [CrossRef]
- Lammërzahl, C.; Perlick, V. Finsler geometry as a model for relativistic gravity. Int. J. Geom. Methods Mod. Phys. 2018, 15 (Suppl. 1), 1850166. [Google Scholar] [CrossRef]
- Pfeifer, C. Finsler spacetime geometry in Physics. Int. J. Geom. Methods Mod. Phys. 2019, 16 (Suppl. 2), 1941004. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).