On the Significance of the Stress–Energy Tensor in Finsler Spacetimes

Miguel Ángel Javaloyes; Miguel Sánchez; Fidel F. Villaseñor

doi:10.3390/universe8020093

,

and

¹

Departamento de Matemáticas, Campus de Espinardo, Universidad de Murcia, 30100 Murcia, Spain

²

Departamento de Geometría y Topología, Facultad de Ciencias & IMAG, Campus de Fuentenueva s/n, Universidad de Granada, 18071 Granada, Spain

^*

Author to whom correspondence should be addressed.

Universe2022, 8(2), 93;https://doi.org/10.3390/universe8020093

This article belongs to the Special Issue Beyond Riemannian Geometry in Classical and Quantum Gravity

Version Notes

Order Reprints

Abstract

We revisit the physical arguments that led to the definition of the stress–energy tensor T in the Lorentz–Finsler setting

(M, L)

starting with classical relativity. Both the standard heuristic approach using fluids and the Lagrangian one are taken into account. In particular, we argue that the Finslerian breaking of Lorentz symmetry makes T an anisotropic 2-tensor (i.e., a tensor for each L-timelike direction), in contrast with the energy-momentum vectors defined on M. Such a tensor is compared with different ones obtained by using a Lagrangian approach. The notion of divergence is revised from a geometric viewpoint, and, then, the conservation laws of T for each observer field are revisited. We introduce a natural anisotropic Lie bracket derivation, which leads to a divergence obtained from the volume element and the non-linear connection associated with L alone. The computation of this divergence selects the Chern anisotropic connection, thus giving a geometric interpretation to previous choices in the literature.

Keywords:

divergence in Finsler manifolds; stress–energy tensor; Finsler spacetime; Lorentz symmetry breaking; very special relativity

1. Introduction

This article has a double aim in Lorentz–Finsler geometry. The first one is to revisit the physical grounds of the stress–energy tensor T, Section 3. The possible extensions of the relativistic T are discussed from the viewpoint of both fluids mechanics and Lagrangian systems. The second one is to revise geometrically the notion of divergence, Section 4, yielding consequences about the conservation of T, Section 5. With this aim, we introduce new notions of the Lie bracket and the derivative associated with a nonlinear connection and applicable to anisotropic tensors fields, which appear naturally in Finsler geometry.

Finslerian modifications of General Relativity aim to find a tensor T collecting the possible anisotropies in the distribution of energy, momentum, and stress, which will serve as a source for the (now Lorentz–Finsler) geometry of the spacetime [1,2,3,4,5]. Some of these proposals may be waiting for experimental evidence, postponing then how the basic relativistic notions would be affected. However, such a discussion is relevant to understand the scope and implications of the introduced Finslerian elements. In a previous reference [6], the fundamentals of observers in the Finslerian setting were extensively studied, including its compatibility with the Ehlers–Pirani–Schild approach. Now we focus on the stress–energy tensor T.

The difficulty to study such a T is apparent. Recall that, using the principle of equivalence, General Relativity is reduced infinitesimally into the Special one, which provides a background for interpretations. However, in the Lorentz–Finsler case, the infinitesimal model is changed into a Lorentz norm (instead of scalar product), implying a breaking of Lorentz invariance. This is a substantial issue in its own right which has been studied in the context of Very Special Relativity and others [7,8,9,10,11]. As an additional difficulty, the infinitesimal model changes with the point1.

Two noticeable pre-requisites are the following: (a) only the value of the Lorentz–Finsler metric on causal directions is relevant [6,14] (this is briefly commented in the setup Section 2.3), and (b) there is a significant variety of possible extensions of the relativistic kinematic objects to the Finsler case, at least from the geometric viewpoint (see Appendix A). Taking into account these issues, the extension of the notion of the stress–energy tensor to the Finslerian setting is discussed in Section 3.

We start at the fluids approach. As a preliminary question, energy-momentum is discussed, in Section 3.1. We emphasize that, even though this is well-defined as a tangent vector in each tangent space

T_{p} M

,

p \in M

, different observers u,

u^{'}

at p will use coordinates related by non-trivial linear transformations. Indeed, the latter will depend on both L and the chosen method to measure relative velocities. Moreover, when the stress–energy T is considered in Section 3.2, the arguments in Classical Mechanics and Relativity, which support its status as a tensor, hold only partially in the Lorentz–Finsler setting. Indeed, T acquires a nonlinear nature that is codified in an (observer-dependent) anisotropic tensor, rather than in a tensor on M.

The Lagrangian approach is discussed in Section 3.3. This approach has been developed recently by Hohmann, Pfeifer, and Voicu [15,16], who introduced an energy-momentum scalar function. Here, we discuss the analogies and differences of this function with the canonical relativistic stress–energy tensor

δ S_{m a t t e r} / δ g^{μ ν}

and the 2-tensor T obtained from the fluids approach above. Relevant issues are the existence of different methods to obtain a 2-tensor starting at a scalar function, the recovery of this function from a matter Lagrangian, and the possibility to consider the Palatini Lagrangian as the background one (rather than Einstein–Hilbert-type Lagrangians used by the cited authors; recall that Palatini’s becomes especially meaningful in the Finslerian case [17]). The important case of kinetic gases is considered explicitly (Example 2).

Once the definition of T has been discussed, we focus on its conservation, Section 5, revisiting first the divergence theorem, Section 4. This is crucial in the Finslerian setting because, as discussed before, the Lagrangian approach above does not guarantee a conservation law as the relativistic

div (G) = 0

.

Section 4 analyzes the divergence from a purely mathematical viewpoint. Now, L is regarded as pseudo-Finsler (the results will be useful not only in any indefinite signature but also in the classical positive definite case), and T will not be assumed to be symmetric a priori. Classically, the divergence of a vector field Z is defined with the derivation associated with the Lie bracket

[Z, X] = L_{Z} X

, applied to the volume element. In the Finslerian case, however, the Lie derivative and bracket do not make sense for arbitrary anisotropic vector fields. This difficulty was circumvented by Rund [18], who redefined

div (Z)

in such a way that a type of divergence theorem held. However, the Lie viewpoint is restored here.

Section 4.1. Once a nonlinear connection

H A

(seen as a horizontal distribution on A) is prescribed, we can define a Lie bracket

l_{Z}^{H} X

and, then, a Lie derivative

L_{Z}^{H} X

( Definitions 1 and 2; Theorem 1 (C)). Noticeably, the former

l_{Z}^{H}

is expressible in terms of the infinitesimal flow of Z (Proposition 1).

Section 4.2. The divergence of Z is naturally defined by using this Lie bracket (Definition 3). For the computation of

div (Z)

, however, one can use an anisotropic connection ∇ (this can be seen as a Finsler connection dropping its vertical part, see Section 2) and a priori Chern’s one is not especially privileged (Proposition 2).

Section 4.3. We give a general Finslerian version of the divergence theorem for any anisotropic vector field Z, emphasizing the role of the choice of an (admissible) vector field

V : M \to A

, which in the Lorentzian case can be interpreted as an observer field; this is expressed in terms of integration of forms in the spirit of Cartan’s formula (Theorem 2, Remark 5). We also explain how the boundary term can be expressed in different ways by using a normal either with respect to the pseudo-Riemannian metric

g_{V}

or to the fundamental tensor, which were the choices of Rund [18] and Minguzzi [19] resp.

Section 5 gives some applications to conservation laws.

Section 5.1. First, we discuss the definition of divergence for the case of T. Our definition for vector fields was not biased to the Chern anisotropic connection, but this will be used for

div (T)

(Definition 4). The reason is that

div (T)

should behave under contraction in a similar way as in the isotropic case (namely, as in Formula (11)), which privileges Chern’s connection (Proposition 3).

Section 5.2. As an interlude about the appearance of Chern’s ∇, a comparison with the possible use of Berwald’s and previous approaches in the literature is done.

Section 5.3. A conservation law for the flow of

T_{V} (X_{V})

is obtained (Corollary 2), stressing three hypotheses on the vanishing for V of elements related to the stress–energy T (

div (T) = 0

), the anisotropic vector X (

l_{X}^{H} g = 0

, generalizing the isotropic case) and a derivative of V. The latter hypothesis is genuinely Finslerian, and it means that some terms related to the nonlinear covariant derivative

D V

must vanish globally (V can always be chosen such that they vanish at some point). It is worth pointing out that our general formula for the integral of the divergence (36) recovers the classical interpretation of the divergence as an infinitesimal growth of the flow (now observer-dependent). So,

div (T) = 0

is equivalent to the conservation of energy-momentum in the instantaneous rest-space of each observer—see Remark 10.

We finish by applying this general result to two examples. First, we apply it to Lorentz norms, showing that the conservation laws of Special Relativity still hold even though, now, the conserved quantity may be different for different observers. As a second example, we give natural conditions so that the flow of

T_{V} (X_{V})

(whenever it exists as a Lebesgue integral, eventually equal to

\pm \infty

) is equal in two Cauchy hypersurfaces of a globally hyperbolic Finsler spacetime. Indeed, we refine a previous result by Minguzzi [19], who assumed that L was defined on the whole

T M

and

T_{V} (X_{V})

was compactly supported. We show that a combination of Rund’s and Minguzzi’s methods to compute the boundary terms allows one to obtain appropriate decay rates (namely, the properly Finslerian hypothesis (49)), which ensure the conservation.

2. Preliminaries and Setup

First, let us set up some notation. In all the present text, M is a connected smooth (

C^{\infty}

) manifold of dimension

n \geq 2

. As in previous references [17,20], any coordinate chart

(U, (x^{1}, \dots, x^{n}))

of M naturally induces a chart

(T U, (x^{1}, \dots, x^{n}, y^{1}, \dots, y^{n}))

of

T M

defined by the fact that

v = y^{i} (v) {\frac{\partial}{\partial x^{i}}|}_{π (v)}

for

v \in T U

, where

π : T M \to M

is the canonical projection. We abbreviate

\frac{\partial}{\partial x^{i}} = : \partial_{i}, \frac{\partial}{\partial y^{i}} = : {\dot{\partial}}_{i};

these are vector fields on

T U

. At any rate, we will express our results in coordinate-free and geometric terms.

2.1. Anisotropic Tensors

We shall employ the framework of anisotropic tensors, following [20,21,22], as it is simpler than previous ones. An open subset

A \subseteq T M

with

π (A) = M

is fixed; the elements

v \in A

are called observers. We will denote by

T_{s}^{r} (M_{A})

the space of (smooth) r-contravariant s-covariant A-anisotropic tensor fields (

r, s \in N \cup \{0\}

) and by

T (M_{A}) : = ⨁_{r, s} T_{s}^{r} (M_{A})

the full anisotropic tensor algebra.

F (A) = T_{0}^{0} (M_{A})

will be the space of functions on A. This time we will also put

X (M_{A}) : = T_{0}^{1} (M_{A})

for the space of anisotropic vector fields and

Ω_{s} (M_{A})

for the space of anisotropic s-forms (alternating anisotropic tensors, so that

Ω_{1} (M_{A}) : = T_{1}^{0} (M_{A})

). The space

T (M)

of classical tensor fields will be seen as a subspace of

T (M_{A})

, formed by the isotropic elements, namely, those which depend only on the point

p \in M

and not on the observer at it. In particular,

X (M) \subseteq X (M_{A})

. There is a distinguished element of

X (M_{A})

: the canonical (or Liouville) anisotropic vector field,

ℂ = y^{i} \partial_{i}, ℂ_{v} : = v .

For an open set

U \subseteq M

; we will put

X^{A} (U)

for the set of (local) observer fields, that is, those

V \in X (U)

such that

V_{p} \in A \cap T_{p} M

for all

p \in U

. Given one of these and

T \in T_{s}^{r} (M_{A})

, their composition, denoted by

T_{V} \in T_{s}^{r} (U)

, makes sense. Finally, for

X \in X (M_{A})

, there is also a canonical derivation

{\dot{\partial}}_{X} : T_{s}^{r} (M_{A}) \to T_{s}^{r} (M_{A})

: the vertical derivative along X,

{({\dot{\partial}}_{X} T)}_{v} : = lim_{t \to 0} \frac{T_{v + t X_{v}} - T_{v}}{t}, {({\dot{\partial}}_{X} T)}_{j_{1}, \dots, j_{s}}^{i_{1}, \dots, i_{r}} = X^{j_{s + 1}} {\dot{\partial}}_{j_{s + 1}} T_{j_{1}, \dots, j_{s}}^{i_{1}, \dots, i_{r}} .

2.2. Nonlinear and Anisotropic Connections

In this article, a nonlinear connection on

A \to M

is defined as a (horizontal) subbundle

H A \subseteq T A

such that

T A = H A \oplus V A

, where

V A : = {Ker (d π)|}_{A}

is the vertical subbundle. For other options and the rudiments, see [20]. Nonlinear connections are characterized by their nonlinear coefficients

N_{j}^{i}

,

H_{v} A = Span \{{δ_{i}|}_{v}\}, δ_{i} : = \frac{δ}{δ x^{i}} : = \frac{\partial}{\partial x^{i}} - N_{i}^{j} \frac{\partial}{\partial y^{j}},

(1)

and also by their nonlinear covariant derivative

D_{X} : X^{A} (U) \to X (U)

,

D_{X} V : = X^{j} (\frac{\partial V^{i}}{\partial x^{j}} + N_{j}^{i} (V)) \partial_{i},

(2)

for

X \in X (U)

. They also provide (at least locally) a nonlinear parallel transport of observers

v \in A \cap T_{γ (0)} M

along curves

γ : [0, t] \to M

. Namely, a map

P_{t} : A_{γ (0)} \to A_{γ (t)}

defined as

P_{t} (v) = V (t)

, being V the only vector field along

γ

such that

V (0) = v

and

D_{\dot{γ}} V = 0

(see (Definition 12 in [20]) and the comment below).

An A-anisotropic connection is an operator

\nabla : X (M) \times X (M) \to X (M_{A})

satisfying the usual Koszul derivation properties—see [17,21,22]. In a chart domain U, they are characterized by their Christoffel symbols

Γ_{j k}^{i} : A \cap T U \to R

,

\nabla_{\partial_{j}} \partial_{k} = : Γ_{j k}^{i} \partial_{i} .

They can be seen as vertically trivial linear connections on the vector bundle

V A \to A

(Theorem 3 in [20]). On the other hand, every anisotropic connection has an underlying nonlinear connection, the only one with nonlinear coefficients

N_{j}^{i} : = Γ_{j k}^{i} y^{k} .

As a consequence, they define the covariant derivative

\nabla : T_{s}^{r} (M_{A}) \to T_{s + 1}^{r} (M_{A})

for any anisotropic tensor:

\nabla_{j_{s + 1}} T_{j_{1}, \dots, j_{s}}^{i_{1}, \dots, i_{r}} = δ_{j_{s + 1}} T_{j_{1}, \dots, j_{s}}^{i_{1}, \dots, i_{r}} + \sum_{μ = 1}^{r} Γ_{j_{s + 1} k}^{i_{μ}} T_{j_{1}, \dots, j_{s}}^{i_{1}, \dots, k, \dots, i_{r}} - \sum_{ν = 1}^{s} Γ_{j_{s + 1} j_{μ}}^{k} T_{j_{1}, \dots, k, \dots, j_{s}}^{i_{1}, \dots, i_{r}} .

2.3. Lorentz–Finsler Metrics

From now on, we will always assume that A is conic (

λ v \in A

for

v \in A

and

λ \in (0, \infty)

). We shall follow the definitions and conventions in [20,23]. In particular, a Finsler spacetime

(M, L)

is a (connected) manifold M endowed with a (properly) Lorentz–Finsler metric

L : \bar{A} \subseteq T M ∖ 0 \to [0, \infty)

. L is required to be smooth, positive homogeneous, and, when restricted to each

A_{p} : = T_{p} M \cap A

(

p \in M

), its vertical Hessian g is non-degenerate with signature

(+, -, \dots, -)

;

A_{p}

must be connected and salient, and its boundary in

T M ∖ 0

, which must be equal to

L^{- 1} (0)

, is a (strong) cone structure

C

. In particular, at each point p, L is a Lorentz norm. By positive homogeneity, L is determined by its indicatrix

L^{- 1} (1)

.

Notice that the cone

C

yields a natural notion of timelike, lightlike, and spacelike tangent vectors, but L is not defined on the latter. Indeed, we are not interested in the value of L on spacelike vectors by physical reasons, which are analyzed in [6]. Roughly, only particles (massive, massless) can be measured, and, so, experimental evidences only can affect

Σ

and

C

. Even though this also happens in classical relativity, the value of the Lorentz metric on the (future-directed) timelike vectors is enough to extend it to all the directions. Indeed, the anisotropies in Finsler spacetimes should be regarded as originated by the distribution of matter and energy in the causal directions rather than by (unobservable) spacelike anisotropies.

Even though it is the Lorentz–Finsler case that has a physical interpretation, in all other aspects, the theory carries on if L is just pseudo-Finsler, namely, positively 2-homogeneous with non-degenerate g on A. In fact, this is the context in which we will develop Section 4 and Section 5, as they are of a more mathematical character.

The Cartan tensor of L is

C : = \frac{1}{2} \dot{\partial g}, C_{i j k} = \frac{1}{2} \frac{\partial g_{i j}}{\partial y^{k}} .

It is actually symmetric, so one can define the mean Cartan tensor as

C^{m} (X) : = {trace}_{g} \{C (X, -, -)\}, {(C^{m})}_{j} = g^{i k} C_{i j k} = : C_{j},

(3)

for

X \in X (M_{A})

. L has also a canonically associated connection: the metric nonlinear connection,

H A

, of nonlinear coefficients

N_{j}^{i} : = γ_{j k}^{i} y^{k} - C_{j k}^{i} γ_{a b}^{k} y^{a} y^{b}, γ_{j k}^{i} : = \frac{1}{2} g^{i c} (\frac{\partial g_{c j}}{\partial x^{k}} + \frac{\partial g_{c k}}{\partial x^{j}} - \frac{\partial g_{j k}}{\partial x^{c}}) .

(4)

This is the underlying nonlinear connection of several anisotropic connections. One is the (Levi–Civita)–Chern ∇, the only symmetric anisotropic connection that parallelizes g. It is the horizontal part of Chern–Rund’s and Cartan’s classical connections, and it has Christoffel symbols

Γ_{j k}^{i} : = \frac{1}{2} g^{i l} (\frac{δ g_{l j}}{δ x^{k}} + \frac{δ g_{l k}}{δ x^{j}} - \frac{δ g_{j k}}{δ x^{l}}),

(5)

where the

δ_{i}

are those associated with (4). Another one is the Berwald

\hat{\nabla}

. This is the horizontal part of Berwald’s and Hashiguchi’s classical connections, and it has Christoffel symbols

{\hat{Γ}}_{j k}^{i} : = \frac{1}{2} g^{i l} (\frac{δ g_{l j}}{δ x^{k}} + \frac{δ g_{l k}}{δ x^{j}} - \frac{δ g_{j k}}{δ x^{l}}) + {Lan}_{j k}^{i} .

(6)

Here,

{Lan}_{j k}^{i}

are the components of a tensor metrically equivalent to the Landsberg tensor of L, which, among many other methods, can be defined as

{Lan}_{i j k} : = \frac{1}{2} g_{l m} {\dot{\partial}}_{i} {\dot{\partial}}_{j} N_{k}^{l} y^{m}

for the

N_{k}^{l}

of (4) (see (37 in [21])). The Landsberg tensor is actually symmetric too, so one can define the mean Landsberg tensor of L as

{Lan}^{m} (X) : = {trace}_{g} \{Lan (X, -, -)\}, {({Lan}^{m})}_{j} = g^{i k} {Lan}_{i j k} = : {Lan}_{j} .

(7)

3. Basic Interpretations on the Stress–Energy Tensor $T$

Let us start with a discussion at each event

p \in M

of a Finsler spacetime

(M, L)

. We can consider

T_{p} M

endowed with the Lorentz norm

{L |}_{T_{p} M}

. In most of this section, the discussion relies essentially on the particular case when M is a real affine n-space with associated vector space V (which plays the role of

T_{p} M

in the general case) and L is a Lorentz–Finsler norm on V with indicatrix

Σ

and cone

C

included in V. Given

u, u^{'} \in Σ

, consider the corresponding fundamental tensors

g_{u}

and

g_{u^{'}}

and take orthonormal bases

B_{u}

,

B_{u^{'}}

, obtained extending u,

u^{'}

. In a natural way, these bases live in

T_{u} V, T_{u^{'}} V

, and they can be identified with bases in V itself. Assuming this, the change in coordinates between

B_{u}

,

B_{u^{'}}

is linear but not a Lorentz transformation, in general.

Extending the interpretations in relativity,

p \in M

is an event; the affine simplification includes the case of Very Special Relativity [7,8,10];

u \in Σ

can be regarded as an observer; the tangent space to the indicatrix

T_{u} Σ

(i.e., the subspace

g_{u}

-orthogonal to u in

T_{u} V \equiv V

) becomes the rest-space of the observer u; and

B_{u}

is an inertial reference frame for this observer. The Lorentz invariance breaking corresponds to the fact that the bases

B_{u}

and

B_{u^{'}}

are orthonormal for the different metrics

g_{u}, g_{u^{'}}

, and, thus, the linear transformation between the coordinates of

B_{u}

and

B_{u^{'}}

(when regarded as elements of the same vector space

T_{u} V \equiv V \equiv T_{u^{'}} V

) is not a Lorentz one. If the affine simplification is dropped, such elements (observers and rest-spaces) must be regarded as instantaneous at

p \in M

.

It is worth emphasizing that, according to the viewpoint introduced in [14] and discussed extensively in [6], the space-like directions are not physically relevant for the Lorentz–Finsler metric. However, each (instantaneous) observer does have a restspace with a Euclidean scalar product. In the case of classical relativity, Lorentz-invariance permits natural identifications between these rest-spaces, and they become consistent with the value of the scalar product on space-like directions. Certainly, a Lorentz norm L could be extended outside these directions (maintaining the Lorentz signature for its fundamental tensor), but this can be done in many different ways, and no relation with the scalar products

g_{u}, u \in Σ

would hold.

The dropping of natural identifications associated with the Lorentz invariance implies that many notions that are unambiguously defined in classical relativity admit many different alternatives now. In the Appendix A, we analyze some of them for the relative velocity between observers as well as other kinematical concepts. This is taken into account in the following discussion about how the Finslerian setting affects the notion of the energy–momentum–stress tensor.

3.1. Particles and Dusts: Anisotropic Picture of Isotropic Elements

In principle, there is no reason to modify the classical relativistic interpretation of

p = m u

as the (energy-) momentum vector of a particle of (rest) mass

m > 0

moving in the observer’s direction

u \in Σ

. Moreover, if the particle moves in such a way that m is constant, it will be represented by a unit time-like curve

γ (τ)

such that

p (τ) = m γ^{'} (τ)

will be its instantaneous momentum at each proper time

τ

. The (covariant) derivative

p^{'} = m γ^{″}

would be the force F acting on the particle, which is necessarily

g_{γ^{'}}

-orthogonal to

γ^{'}

(i.e., the force lies in the instantaneous rest-space of the particle). Then, the relativistic conservation of the momentum in the absence of external forces would retain its natural meaning, namely, if the particle represented by

(m, γ)

splits into two

(m_{1}, γ_{1})

and

(m_{2}, γ_{2})

at some

τ_{0}

then

m γ^{'} (τ_{0}) = m_{1} γ_{1}^{'} (τ_{0}) + m_{2} γ_{2}^{'} (τ_{0})

.

The Appendix A suggests that the way how an observer u may measure the energy-momentum and conservation may be non-trivial. In particular, if one assumes that an observer u measures

m γ^{'} \in T_{p} M

by using a

g_{u}

-orthonormal basis

B_{u}

in general,

g_{u} (m γ^{'}, m γ^{'}) \neq m^{2} (= L (m γ^{'}))

. Moreover, as we have already commented, the coordinates for other observer

u^{'}

will not transform by means of Lorentz transformation. However, as the transformation of their coordinates is still linear, and both of them will write consistently

m γ^{'} (τ_{0}) = m_{1} γ_{1}^{'} (τ_{0}) + m_{2} γ_{2}^{'} (τ_{0})

in their coordinates.

Particles are also the basis to model dusts, which constitute the simplest class of relativistic fluids. A dust is represented by a number-flux vector field

N = n U

, where U represents the intrinsic velocity of the particle in the dust, i.e., a comoving observer, and n is the density of the dust for each momentaneously comoving reference frame. Comparing with the case of energy momentum, N is also an intrinsic object that lives at the tangent space of each point, and U gives the privileged observer who measures n. However, the measures of n by different observers involve different measures of the volume. As explained in the Appendix A, the length contraction may be fairly unrelated to the relative velocities of the observers. This implies a more complicated transformation of the coordinates by different observers. Anyway, the transformations between these coordinates would remain linear, and, so, they could still agree in the fact that they are measuring the same intrinsic vector field.

Summing up, in the case of both particles and dusts, one assumes that the physical property lives in V (or, more properly, in each tangent space

T_{p} M

of the affine space), and there is a privileged (comoving) observer u. The transformation of coordinates for another observer

u^{'}

may be complicated, but, at the end, it is a linear transformation that can be determined by specifying the geometric quantities that are being measured as well as the geometry of

Σ

. Thus, by using the coordinates measured by each observer one could construct and anisotropic vector field at each

p \in M

, which will fulfill some constraints, as the measurement by one of the observers (in particular, the privileged one) would determine the measurements by all the others.

3.2. Emergence of an Anisotropic Stress–Energy Tensor

The situation, however, is subtler for more general fluids, which are modeled classically by a 2-tensor on the underlying manifold.

Let us start by recalling the Newtonian and Lorentzian cases. In Classical Mechanics, one starts working in an orthonormal basis of Euclidean space to obtain the components

T_{i j}

of the Cauchy stress tensor, which give the flux of i-momentum (or force) across the j-surface in the background2. The laws of conservation of linear momentum and static equilibrium of forces imply that these components give truly a 2-tensor (linear in each variable), and the conservation of linear momentum implies that this tensor is symmetric.

In the relativistic setting, each observer will determine some symmetric components

T^{i j}

in its rest-space by essentially the same procedure as above. Additionally, it constructs

T^{00}

,

T^{0 i}

, and

T^{i 0}

as the density energy and the energy flux across i-surface and i-momentum density, resp. The interpretation of these magnitudes completes the symmetry3

T^{0 i} = T^{i 0}

as well as the linearity in the 0-component. However, the bilinearity in the components

T^{μ ν}

has been only ensured for vectors in the rest-space of the observer. In relativity, one can claim Lorentz invariance in order to complete the reasons justifying that, finally, the components

T^{μ ν}

will transform as a tensor4.

Nevertheless, it is not clear in Lorentz–Finsler geometry why the transformation of the components

T_{i j}

from an observer u to a second one

u^{'}

must be linear, taking into account that they apply to space-like coordinates in distinct Euclidean subspaces and no Lorentz-invariance is assumed. Indeed, the following simple academic example shows that this is not the case.

Example 1.

Assume that

(M, L)

is an affine space with a Lorentz norm with domain A and consider the anisotropic tensor

T = ϕ ℂ \otimes ℂ

, where ℂ is the canonical (Liouville) vector field, and

ϕ : Σ \to R

is a smooth function, which is extended as a 0-homogeneous function on A. Then, for each

u \in Σ

and

w \in T_{u} Σ

, one has

T_{u} (u, u) = ϕ (u)

,

T_{u} (w, w) = 0

, and

T_{u} (u, w) = 0

. In this case, each

T_{u}

is a symmetric 2-tensor, but the information on

T

requires the knowledge of

ϕ (u)

for all possible

u \in Σ

. Recall that this example holds even if

(M, L)

is the Lorentz–Minkowski spacetime regarded as a Finsler spacetime (but no Lorentz-invariance is assumed for

T

).

Therefore, the following issues about T appear:

(a): Observer dependence: even if we assume that the components $T^{μ ν}$ measured by any observer u are bilinear, and, then, it is a standard tensor, the components measured by a second observer $u^{'}$ may transform by a linear map, which depends on $Σ$ as well as the experimental method of measuring (as in the case of the energy–momentum vector).
(b): Nonlinearity: it is not clear even why such a linear transformation must exist, as bilinearity is only ensured in the direction of u and of its rest-space. Thus, the tensor $T_{u}$ measured by a single observer u would not be enough to grasp the physics of the fluid at each event $p \in M$ , as in the example above.
(c): Contribution of the anisotropies of $Σ$ : as an additional possibility, the local geometry of $Σ$ at u underlies the measurements of this observer and might provide a contribution for the stress–energy tensor itself.

Summing up, Lorentz–Finsler geometry leads to assume that the measurements by u are not enough to determine the state of the fluid, and the stress-energy tensor should be regarded as a non-isotropic tensor field, determined by the measurements of all the observers.

Formally, this means an anisotropic tensor

T \in T_{0}^{2} (M_{A})

(see [20] for a summary of the formal approach), which can be expressed locally as

T_{v} = T^{μ ν} (v) {\partial_{μ}|}_{x} \otimes {\partial_{ν}|}_{x} \forall v \equiv (x, y) \equiv y^{μ} {\frac{\partial}{\partial x^{μ}}|}_{x} \in A \subset T M,

where

T^{μ ν} (λ v) = T^{μ ν} (v)

for all

λ > 0

(i.e.,

T_{v}

depends only on the direction of v). As a first approach, we can assume

T^{μ ν} = T^{ν μ}

(recall Appendix A.5). Consistently, we will assume that there exists a Lorentz–Finsler metric L on M with indicatrix

Σ \subset T M

, and, so, indexes can be raised and lowered by using its fundamental tensor g. The fact that T has order 2 is important to establish classical analogies. However, other tensors might appear as more fundamental energy-momentum tensors, and, then, one would try to derive a semi-classical 2-tensor as in Section 3.3.

In principle, the intuitive relativistic interpretations would be transplanted directly to each v, whenever

v \in Σ

. That is, given two

g_{v}

-unit vectors

u, w

, the value

T_{v} (u, w)

of the 2-covariant stress–energy tensor perceived by the observer v (at

x = π (v)

) is obtained as the flux of w-energy-momentum per unit of

g_{v}

-volume orthogonal to u. More precisely, let

B (u)

be a small coordinate 3-cube in a hypersurface

g_{v}

-orthogonal to u, and

P_{B}

is the total flux of the energy-momentum of particles crossing

B (u)

(being positive from the

- u

side to the u side and negative the opposite direction), then the w-energy-momentum per unit of

g_{v}

-volume is

ϵ T_{v} (u, w) : = lim_{V o l_{g_{v}} (B (u)) \to 0} \frac{g_{v} (P_{B}, w)}{V o l_{g_{v}} (B (u))} .

where

ϵ = g_{v} (w, w)

. As a Finslerian subtlety, recall that

g_{v}

is only defined in

T_{v} (T_{x} M)

and then in

T_{x} M

(i.e., it is trivially extended to

B (u)

in a coordinate-depending way), but the above limit depends only on the value of

g_{v}

. Namely, if one considers two semi-Riemannian metrics g and

\tilde{g}

in a neighborhood of p such that

g_{p} = {\tilde{g}}_{p}

and

B_{n}

are open subsets with p in the interior of

B_{m}

for all

n \in N

and

{lim}_{n \to + \infty} v o l_{g} (B_{m}) = 0

, then

lim_{m \to + \infty} \frac{v o l_{g} (B_{m})}{v o l_{\tilde{g}} (B_{m})} = 1 .

In particular, we have the interpretations (recall signature

(+, -, -, -)

):

$T_{v} (v, v)$ is the energy density measured by $v \in Σ$ ,

$T_{v} (v, v) : = lim_{V o l_{g_{v}} (B (v)) \to 0} \frac{g_{v} (P_{B}, v)}{V o l_{g_{v}} (B (v))} = lim_{V o l_{g_{v}} (B (v)) \to 0} \frac{E_{B}}{V o l_{g_{v}} (B (v))},$

being $E_{B} : = g_{v} (P_{B}, v)$ the measured energy.
If w is $g_{v}$ -orthogonal to v and $g_{v}$ -unit, $T_{v} (w, v)$ measures the flow of energy per unit of $g_{v}$ -volume in a surface $g_{v}$ -orthogonal to v and w (i.e., some small surface of area A flowing a lapse $Δ t$ ), while $T_{v} (v, u)$ measures the w-momentum density,

$T_{v} (w, v) : = lim_{V o l_{g_{v}} (B (w)) \to 0} \frac{g_{v} (P_{B}, v)}{V o l_{g_{v}} (B (w))} = lim_{V o l_{g_{v}} (A) \to 0} \frac{1}{A} \{lim_{Δ t \to 0} \frac{E_{B}}{Δ t}\} .$

$- T_{v} (v, w) : = lim_{V o l_{g_{v}} (B (v)) \to 0} \frac{g_{v} (P_{B}, w)}{V o l_{g_{v}} (B (v))} .$
If $z, w$ are $g_{v}$ -orthogonal to v and $g_{v}$ -unit, $T_{v} (z, w)$ measures the flow of w-momentum per unit of $g_{v}$ -volume in a surface $g_{v}$ -orthogonal to v and z,

$- T_{v} (z, w) : = lim_{V o l_{g_{v}} (B (z)) \to 0} \frac{g_{v} (P_{B}, w)}{V o l_{g_{v}} (B (z))} = lim_{V o l_{g_{v}} (A) \to 0} \frac{1}{A} \{lim_{Δ t \to 0} \frac{g_{v} (P_{B}, w)}{Δ t}\} .$

3.3. Lagrangian Viewpoint

In the Lagrangian approach for Special Relativity, the background spacetime is assumed to be endowed with a flat metric

η

. So, the Lagrangian

L

is constructed by using the prescribed

η

and some matter fields

ϕ_{α}

. The stress–energy tensor coincides with the canonical energy–momentum tensor associated with the Lagrangian, in most cases (the exceptions include theories involving spin). This canonical tensor appears as the Noether current associated with the invariance by spacetime translations (i.e., when

L (ϕ_{α}, \partial_{μ} ϕ_{α}, x^{μ}) \equiv L (ϕ_{α}, \partial_{μ} ϕ_{α})

), namely5

T^{μ ν} = \frac{\partial L}{\partial (\partial_{μ} ϕ_{α})} \partial^{ν} ϕ_{α} - η^{μ ν} L .

(8)

In principle, these interpretations would hold unaltered for the case of an affine space with a Lorentz norm, including the case of Very Special Relativity.

In General Relativity, however, the Lagrangian formulation introduces a background Lagrangian independent of matter fields (the Einstein–Hilbert one, eventually with a cosmological constant) and, then, a matter Lagrangian

L_{m a t t e r}

, which includes a constant of coupling with the background. Then, the safest way to define the stress–energy tensor is the canonical one obtained as the corresponding action term

δ S_{m a t t e r} / δ g^{μ ν}

in the Euler–Lagrange equations6,

T_{μ ν} = - 2 \frac{δ L_{m a t t e r}}{δ g^{μ ν}} + g_{μ ν} L_{m a t t e r} .

(9)

Any tensor obtained in this way will have some advantages to play the role of a stress–energy tensor, because it will be automatically symmetric (in contrast to (8)) and will have vanishing divergence.

In the Finslerian setting, the variational viewpoint has been systematically studied in a very recent study by Hohmann, Pfeifer, and Voicu [16]. Previously, the background Lagrangian closest to the Einstein–Hilbert functional in the Finslerian setting had been studied in [15,29]. Such a functional is obtained as the integral of the Ricci scalar function on the indicatrix of the Lorentz–Finsler metric7 L. Taking into account this background functional, they define the energy-momentum scalar function by taking the corresponding variational action term (Formula (84) in [16]),

T = - 2 \frac{L^{3}}{| g |} \frac{δ L_{m a t t e r}}{δ L} .

Notice that, here, the functional coordinate for the Lagrangian is L, and, thus, an (anisotropic) function rather than a 2-tensor is obtained. However, starting at this function some tensors become useful (Formulas (88) and (91) in [16]), in particular a canonically associated (anisotropic Liouville) 2-tensor

Θ_{ν}^{μ} = \frac{T}{L} ℂ^{μ} ℂ_{ν}

as in Example 1. Notice that, essentially, the information of these tensors is codified in

T

. Even though such a tensor is justified by the procedure of Gotay–Mardsen in [30], some issues as the following ones might deserve interest for a further discussion:

This is not the unique natural possibility to construct an anisotropic 2-tensor starting at $T$ . For example, an alternative would be the vertical Hessian,

$T_{μ ν} = {\dot{\partial}}_{μ, ν} (L T) \equiv \frac{\partial^{2} (L T)}{\partial y^{μ} \partial y^{ν}} .$

(10)

It is natural to wonder about the choice closer to the relativistic intuitions about the stress–energy tensor.
Recently, the Palatini approach has also been studied for the Finslerian setting [17]. There, the dynamic variables are L and the components of an (independent) non-linear connection. Thus, a similar Lagrangian procedure would lead to a higher-order tensor. In the relativistic setting, this approach supports classical relativity, as it recovers both equations and (in the symmetric case) the Levi–Civita connection. However, the Palatini approach is no longer equivalent in the Finslerian case, as it yields non-equivalent connections, and it shows a variety of possibilities for the non-linear connections. So, it is natural to wonder about the most natural choice of a Lagrangian-based stress–energy tensor in this setting.

Finally, let us discuss an example analyzed from the Lagrangian viewpoint in [1,16] taking into account also the observers’ one in Section 3.2.

Example 2.

The gravitational field sourced by a kinetic gas has been deeply studied in [1,16]. In the relativistic setting, this is derived from the Einstein–Vlasov equations in terms of a 1-particle distribution function (1PDF)

ϕ (x, \dot{x})

, which encodes how many gas particles at a given spacetime point x propagate on worldlines with a normalized 4-velocity

\dot{x}

. Specifically, the stress energy tensor is:

T^{μ ν} (x) = \int_{Σ_{x}} {\dot{x}}^{μ} {\dot{x}}^{ν} ϕ (x, \dot{x}) d {vol}_{g_{x}}, x \in M,

being

Σ_{x}

the indicatrix (future-directed unit vectors of the Lorentz metric) and

d {vol}_{g_{x}}

the volume element induced by the scalar product

g_{x}

at each x. In [1], they propose to derive the gravitational field of a kinetic gas directly from the 1PDF without averaging, i.e., taking into account the full information on the velocity distribution. This leads to consider the function

ϕ : Σ \to R

,

u \equiv (x, \dot{x}) \mapsto ϕ (u) \geq 0

as an energy–momentum function, which plays the role of a stress–energy tensor (even though it is a scalar rather than a 2-tensor). Moreover, the original Lorentz metric is naturally allowed to be Lorentz–Finsler, which permits to obtain more general cosmological models (Section III in [1]).

Indeed, up to a coupling constant, ϕ is regarded directly as the matter source in the Finslerian Einstein–Hilbert equation (i.e., it is placed at the right-hand side of this equation, (Equation (7) in [1])). It is worth pointing out:

ϕ can be reobtained as a Lagrangian energy-momentum by inserting it directly as a term in the background Lagrangian (Equation (75) in [16]). However, the Lagrangian is not natural then, as ϕ is written in terms of $x, \dot{x}$ (recall (Appendix 3, Section (a) in [16])).
As discussed above, such a function allows one to construct several tensors, in particular, the vertical Hessian $\partial^{2} ϕ / \partial {\dot{x}}^{μ} \partial {\dot{x}}^{ν}$ (as in (10)), which also might play a role to compare with the relativistic $T^{μ ν} (x)$ .

Anyway, starting at the 1PDF ϕ, another Finslerian interpretation would be possible. In particular, one can define the energy–momentum distribution

ϕ (u) u

. Then, given an observer

v \in Σ

and a

g_{v}

-unit vector, the w-energy–momentum might be defined as

g_{v} (u, w) ϕ (u) .

In particular, when

w = v

, this would be the energy perceived by v, and when w is unit and

g_{v}

-orthogonal to v would be (minus) the momentum in the direction w(compare with the discussion at the end of Section 3.2). So, an alternative stress–energy tensor perceived by each observer

v \in Σ

might be defined as the anisotropic tensor:

T_{v} (w, z) = \int_{Σ_{π (v)}} g_{v} (u, w) g_{v} (u, z) ϕ (u) d {vol}_{g_{v}},

where the integration in u is carried out with the volume form of

(Σ_{π (v)}, g_{v})

, denoted by

d {vol}_{g_{v}}

.

4. Divergence of Anisotropic Vector Fields

After studying the basic properties of the Finslerian stress–energy tensor T, our next aim is to analyze the meaning and significance of the infinitesimal conservation law

div (T) = 0

. Along this and the next section, we will always consider an anisotropic tensor

T \in T_{1}^{1} (M_{A})

interpreted as an endomorphism of anisotropic vector fields.

T^{♭} \in T_{2}^{0} (M_{A})

and

T^{♯} \in T_{2}^{0} (M_{A})

will be defined on vectors and 1-forms by

T^{♭} (X, Y) : = g (X, T (Y))

and

T^{♯} (θ, η) : = g^{*} (T^{*} (θ), η)

resp., where

g^{*}

is the inverse fundamental tensor, and

T^{*}

is the transpose of T. They will have components

{(T^{♭})}_{i j} = g_{i l} T_{j}^{l} = : T_{i j}

and

{(T^{♯})}^{i j} = T_{l}^{i} g^{l j} = : T^{i j}

, and in principle we will not even assume that these are symmetric. We will be assuming that M is orientable and oriented. This is not restrictive: one could always reduce the theory to this case by pulling back all the objects (the fibered manifold

A \to M

included) to the oriented double cover of M (Chapter 15 in [31]).

Let us briefly recall the mathematically precise meaning of the conservation laws in classical General Relativity (g, T, and X isotropic). One has

div (T (X)) = \nabla_{i} (T_{j}^{i} X^{j}) = \nabla_{i} T_{j}^{i} X^{j} + T_{j}^{i} \nabla_{i} X^{j} = div (T) (X) + trace (T (\nabla X))

(11)

with ∇ the Levi–Civita connection. The first contribution vanishes due to

div (T) = 0

, and there are different situations in which the second one vanishes as well. For instance, if

T^{♭} (-, \nabla_{-} X)

is antisymmetric, then

trace (T (\nabla X)) = T_{j}^{i} \nabla_{i} X^{j} = g^{i l} T_{l j} \nabla_{i} X^{j} = \frac{1}{2} g^{i l} (T_{l j} \nabla_{i} X^{j} + T_{i j} \nabla_{l} X^{j}) = 0,

(12)

and if

T^{♭}

is symmetric and

\nabla X^{♯}

is antisymmetric (equiv., X is a Killing vector field), then also

trace (T (\nabla X)) = g^{i l} T_{l j} \nabla_{i} X^{j} = \frac{1}{2} T_{l j} (g^{l i} \nabla_{i} X^{j} + g^{j i} \nabla_{i} X^{l}) = 0 .

(13)

Anyway, whenever

trace (T (\nabla X)) = 0

, one can integrate (11) and apply the pseudo-Riemannian divergence theorem to get the integral conservation law

\int_{\partial D} ı_{T (X)} (d Vol) = 0,

(14)

where

\bar{D}

is a domain of appropriate regularity, ı is the interior product operator, and

d Vol

is the metric volume form. In a sense that will be made more precise in §5, this is expressing that the total amount of X-momentum in a space region only changes along time as much as it flows across the spatial boundary of the region. In this way, there is no “creation” nor “destruction” of X-momentum in any space region.

Extending the infinitesimal or the integral conservation laws poses, first and foremost, the problem of appropriately defining the divergence of an anisotropic T. Observe that a priori it is not clear even how to define the divergence of a vector field Z, isotropic or not, as one could consider

trace (\nabla Z)

for different anisotropic connections ∇, mainly Chern’s and Berwald’s. An alternative is to seek for a more geometric, hence, unbiased, definition. For instance, the metric (anisotropic) volume form of L,

d Vol = \sqrt{|det g_{a b} (x, y)|} d x^{1} \land \dots \land d x^{n} \in Ω_{n} (M_{A})

(15)

for

(x^{1}, \dots, x^{n})

positively oriented, is well-defined, and when

Z \in X (M)

(i.e., Z is isotropic), so is the Lie derivative

L_{Z} : T (M_{A}) \to T (M_{A})

(see § 5 in [21]). So, by analogy with the classical case, one could think of

L_{Z} (d Vol)

for defining

div (Z)

.

It turns out that the unbiased definition, including all

Z \in X (M_{A})

, is achieved with a modification of this Lie derivative that we will regard as an extension of the classical Lie bracket. We devote the next subsection to the technical mathematical foundations of such an anisotropic Lie bracket, which needs of a nonlinear connection on

A \to M

to be well-defined. All the maps

T (M_{A}) \to T (M_{A})

that will appear in Section 4.1 will be (anisotropic) tensor derivations in the sense of (Definition 2.6 in [21]), and their local nature will be apparent, so we will not explicitly discuss it. For example, the Lie derivative along

Z \in X (M)

is the only tensor derivation such that for

X \in X (M)

and

f \in F (A)

,

L_{Z} X = [Z, X], L_{Z} f = Z^{c} (f) : = Z^{k} \frac{\partial f}{\partial x^{k}} + y^{k} \frac{\partial Z^{i}}{\partial x^{k}} \frac{\partial f}{\partial y^{i}} .

(16)

4.1. Mathematical Formalism of the Anisotropic Lie Bracket

During this subsection, we fix an arbitrary nonlinear connection given by

T A = H A \oplus V A

or by the nonlinear covariant derivative

D

(keep in mind (1) and (2)), and also an anisotropic vector field

Z \in X (M_{A})

.

For

X \in X (M_{A})

, it is very natural to consider the commutator of the horizontal lifts of Z and X:

[Z^{H}, X^{H}] = [Z^{j} δ_{j}, X^{k} δ_{k}] = (Z^{j} δ_{j} X^{i} - X^{j} δ_{j} Z^{i}) δ_{i} + Z^{j} X^{k} [δ_{j}, δ_{k}] \in X (A) .

We recall that

Z^{j} X^{k} [δ_{j}, δ_{k}]

is always vertical. Indeed,

[δ_{j}, δ_{k}] = R_{j k}^{i} {\dot{\partial}}_{i}

, where

R

is the curvature tensor of the nonlinear connection (see [17], where this curvature is regarded as an anisotropic tensor and the homogeneity of the connection is not really required). This means that the horizontal part of

[Z^{H}, X^{H}]

has coordinates

Z^{j} δ_{j} X^{i} - X^{j} δ_{j} Z^{i}

, and this corresponds to a globally well-defined A-anisotropic vector field:

l_{Z}^{H} X : = (Z^{j} δ_{j} X^{i} - X^{j} δ_{j} Z^{i}) \partial_{i} \in X (M_{A}) .

(17)

Definition 1.

l_{Z}^{H} X

is the anisotropic Lie bracket of Z and X with respect to the nonlinear connection

H A

.

Remark 1.

The word “anisotropic” could be omitted in the previous definition, in the sense that for

Z, X \in X (M_{A})

, there is no other Lie bracket, isotropic or not, defined in general. Nonetheless, (17) makes apparent that when

Z, X \in X (M)

(i.e., when Z and X are isotropic),

l_{Z}^{H} X

coincides with the standard Lie bracket

[Z, X]

regardless of the connection.

Lemma 1.

Given a nonlinear connection

H A

,

V \in X^{A} (U)

,

f \in F (A)

and anisotropic vector fields

X, Z \in X (M_{A})

, it holds that

Z^{H} (f) = Z (f (V)) - {\dot{\partial}}_{D_{Z} V} f,

(18)

{(l_{Z}^{H} X)}_{V} = [Z_{V}, X_{V}] - {({\dot{\partial}}_{D_{Z} V} X)}_{V} + {({\dot{\partial}}_{D_{X} V} Z)}_{V} .

(19)

Proof.

Observe that

\begin{matrix} Z (f (V)) - {\dot{\partial}}_{D_{Z} V} f & = Z^{i} (\frac{\partial f}{\partial x^{i}} (V) + \frac{\partial f}{\partial y^{j}} (V) \frac{\partial V^{j}}{\partial x^{i}}) \\ - \frac{\partial f}{\partial y^{j}} (V) Z^{k} (\frac{\partial V^{j}}{\partial x^{k}} - N_{k}^{j} (V)) \\ = Z^{i} (\frac{\partial f}{\partial x^{i}} (V) - \frac{\partial f}{\partial y^{j}} (V) N_{i}^{j} (V)) \\ = Z^{H} (f), \end{matrix}

which concludes (18). In particular,

δ_{i} f (V) = \partial_{i} (f (V)) - ({\dot{\partial}}_{D_{\partial_{i}} V} f) (V)

, and using this in (17), (19) follows. □

We also recall that the torsion of an A-anisotropic connection ∇ (18 in [21]), (Definition 5 in [20]) is the anisotropic tensor

Tor \in T_{2}^{1} (M_{A})

defined first on isotropic fields

Z, X \in X (M)

by

Tor (Z, X) = \nabla_{Z} X - \nabla_{X} Z - [Z, X]

and then extended by

F (A)

-bilinearity. Therefore, it can be regarded as and

F (A)

-bilinear map

Tor : X (M_{A}) \times X (M_{A}) \to X (M_{A})

and it has coordinates

{Tor}_{j k}^{i} = Γ_{j k}^{i} - Γ_{k j}^{i},

(20)

where the

Γ_{j k}^{i}

’s are the Christoffel symbols of ∇8.

Theorem 1.

Let a nonlinear connection

T A = H A \oplus V A

and an anisotropic vector field

Z \in X (M_{A})

be fixed.

(A): If ∇ is any A-anisotropic connection whose underlying nonlinear connection is $H A$ , then for any $X \in X (M_{A})$ ,

$Tor (Z, X) = \nabla_{Z} X - \nabla_{X} Z - l_{Z}^{H} X$

(21)

(where $Tor$ is the torsion of ∇).
(B): By imposing the Leibniz rule with respect to tensor products and the commutativity with contractions, the map $X \mapsto l_{Z}^{H} X$ extends unequivocally to an (anisotropic) tensor derivation $l_{Z}^{H} : T_{s}^{r} (M_{A}) \to T_{s}^{r} (M_{A})$ given by

$\begin{matrix} l_{Z}^{H} T (θ^{1}, \dots, θ^{r}, X_{1}, \dots, X_{s}) & = Z^{H} (T (θ^{1}, \dots, θ^{r}, X_{1}, \dots, X_{s})) \\ - \sum_{μ = 1}^{r} T (θ^{1}, \dots, l_{Z}^{H} θ^{μ}, \dots, θ^{r}, X_{1}, \dots, X_{s}) \\ - \sum_{ν = 1}^{s} T (θ^{1}, \dots, θ^{r}, X_{1}, \dots, l_{Z}^{H} X_{ν}, \dots, X_{s}) \end{matrix}$

(22)

for $θ^{μ} \in Ω_{1} (M)$ and $X_{ν} \in X (M)$ . In coordinates, if

$T = T_{j_{1}, \dots, j_{s}}^{i_{1}, \dots, i_{r}} (x, y) \partial_{i_{1}} \otimes \dots \otimes \partial_{i_{r}} \otimes d x^{j_{1}} \otimes \dots \otimes d x^{j_{s}},$

then

${(l_{Z}^{H} T)}_{j_{1}, \dots, j_{s}}^{i_{1}, \dots, i_{r}} = Z^{k} \frac{δ T_{j_{1}, \dots, j_{s}}^{i_{1}, \dots, i_{r}}}{δ x^{k}} - \sum_{μ = 1}^{r} \frac{δ Z^{i_{μ}}}{δ x^{k}} T_{j_{1}, \dots, j_{s}}^{i_{1}, \dots, k, \dots, i_{r}} + \sum_{ν = 1}^{s} \frac{δ Z^{k}}{δ x^{j_{ν}}} T_{j_{1}, \dots, k, \dots, j_{s}}^{i_{1}, \dots, i_{r}} .$

(23)
(C): The map

$L_{Z}^{H} : = l_{Z}^{H} - {\dot{\partial}}_{l_{Z}^{H} ℂ} : T (M_{A}) \to T (M_{A})$

is also a tensor derivation. When $Z \in X (M)$ ,

$L_{Z}^{H} T = L_{Z} T$

(24)

for all $T \in T (M_{A})$ , where $L_{Z}$ is the Lie derivative (16), regardless of the nonlinear connection.
(D): Given $V \in X^{A} (U)$ and $ω \in Ω_{n} (M_{A})$ ( $n = dim M$ ), it holds that

${(l_{Z}^{H} ω)}_{V} = L_{Z_{V}} (ω_{V}) - {\dot{\partial}}_{D_{Z} V} ω - trace ({\dot{\partial}}_{D V} Z) ω .$

(25)

Proof.

(A) It is straightforward to compute that the right-hand side of (21) is

F (A)

-multilinear. Moreover, the identity is trivial on isotropic vector fields

X, Z \in X (M)

, as

l_{Z}^{H} X = [X, Z]

in this case, which concludes.

(B) Given

f \in T_{0}^{0} (M_{A}) = F (A)

, for

X \in T_{0}^{1} (M_{A}) = X (M_{A})

it follows from (17) that

l_{Z}^{H} (f X) = Z^{H} (f) X + f l_{Z}^{H} X .

Thus, in order to respect the Leibniz rule, the only possibility is to define

l_{Z}^{H} f = Z^{H} (f) = Z^{k} \frac{δ f}{δ x^{k}} .

(26)

Now, given

θ \in T_{1}^{0} (M_{A}) = Ω_{1} (M_{A})

, in order to respect again the Leibniz rule and the commutativity with contractions, the only possibility is to define

l_{Z}^{H} θ

on every

X \in X (M_{A})

by

(l_{Z}^{H} θ) (X) = Z^{H} (θ (X)) - θ (l_{Z}^{H} X) = (Z^{k} \frac{δ θ_{j}}{δ x^{k}} + \frac{δ Z^{k}}{δ x^{j}} θ_{k}) X^{j} .

(27)

(26), (17), and (27) make apparent that

l_{Z}^{H}

is already local on functions, vector fields, and 1-forms, and they allow to compute

l_{Z}^{H} (\partial_{i}) = - \frac{δ Z^{k}}{δ x^{i}} \partial_{k}, l_{Z}^{H} (d x^{j}) = \frac{δ Z^{j}}{δ x^{k}} d x^{k} .

(28)

Finally, given

T \in T_{s}^{r} (M_{A})

, one is led to define

l_{Z}^{H} T

by (22). Clearly, this indeed provides a tensor derivation and (23) follows from the evaluation of (22) at

(d x^{i_{1}}, \dots, d x^{i_{r}},

\partial_{j_{1}}, \dots, \partial_{j_{s}})

together with (26) and (28).

(C)

{\dot{\partial}}_{X} : T (M_{A}) \to T (M_{A})

is a tensor derivation for any

X \in X (M_{A})

, in particular for

X = l_{Z}^{H} ℂ = (Z^{j} δ_{j} y^{i} - y^{j} δ_{j} Z^{i}) \partial_{i} = - (Z^{j} N_{j}^{i} + y^{j} δ_{j} Z^{i}) \partial_{i}

(29)

(see (17)). Thus, the difference

L_{Z}^{H} = l_{Z}^{H} - {\dot{\partial}}_{l_{Z}^{H} ℂ}

is again a derivation. As for the last assertion, where

Z \in X (M)

, we are going to use (Proposition 2.7 in [21]). For

X \in X (M)

, we have

L_{Z}^{H} X = l_{Z}^{H} X = [Z, X] = L_{Z} X

(30)

(recall Remark 1). For

f \in F (A)

, we have

\begin{matrix} L_{Z}^{H} f = l_{Z}^{H} f - {\dot{\partial}}_{l_{Z}^{H} ℂ} f & = Z^{j} δ_{j} f + (Z^{j} N_{j}^{i} + y^{j} δ_{j} Z^{i}) {\dot{\partial}}_{i} f \\ = Z^{j} (\partial_{j} f - N_{j}^{i} {\dot{\partial}}_{i} f) + (Z^{j} N_{j}^{i} + y^{j} δ_{j} Z^{i}) {\dot{\partial}}_{i} f \\ = Z^{j} \partial_{j} f + y^{j} δ_{j} Z^{i} {\dot{\partial}}_{i} f \\ = L_{Z} f \end{matrix}

(see (26), (29), (1), and (16)). As

L_{Z}^{H}

and

L_{Z}

act the same on isotropic vector field and anisotropic functions, they are equal.

(D) Observe that for

X \in X (M)

, the term

{\dot{\partial}}_{D_{Z} V} X

vanishes in (19). Moreover, if

Z \in X (M_{A})

and

f \in F (A)

, then

Z^{H} {(f)}_{V} = Z_{V} (f (V)) - ({\dot{\partial}}_{D_{Z} V} f) (V)

. Given a local reference frame

E_{1}, \dots, E_{n} \in X (U)

, and taking into account the last two identities and the definitions of

l^{H}

and

L

, it follows that

\begin{matrix} {(l_{Z}^{H} ω)}_{V} (E_{1}, \dots, E_{n}) - L_{Z_{V}} (ω_{V}) (E_{1}, \dots, E_{n}) & = - {\dot{\partial}}_{D_{Z} V} ω (E_{1}, \dots, E_{n}) \\ - \sum_{i = 1}^{n} ω (E_{1}, \dots, {\dot{\partial}}_{D_{E_{i}} V} Z, \dots, E_{n}) . \end{matrix}

As

ω (E_{1}, \dots, {\dot{\partial}}_{D_{E_{i}} V} Z, \dots, E_{n}) = E_{i}^{*} ({\dot{\partial}}_{D_{E_{i}} V} Z) ω_{V} (E_{1}, \dots, E_{n})

, (25) follows. □

Definition 2.

The tensor derivation

l_{Z}^{H} : T (M_{A}) \to T (M_{A})

defined in Theorem 1 (B) is the (anisotropic) Lie bracket with Z, while

L_{Z}^{H} : T (M_{A}) \to T (M_{A})

is the (anisotropic) Lie derivative along Z, both of them with respect to the connection

H A

.

Remark 2

(Anisotropic Lie bracket and Lie derivative). The derivation

L_{Z}^{H}

defined in Theorem 1 (C) would be the Lie derivative along Z with respect to

H A

. Analogously to the discussion of Remark 1, what makes this name consistent is (24): whenever the Lie derivative along Z was already defined,

L_{Z}^{H}

coincides with it. Even though the Lie bracket and the Lie derivative are equal in the classical regime, it is heuristically useful to regard

l^{H}

as the anisotropic generalization of the former and

L^{H}

as that of the latter, in order to distinguish them. It is actually

l^{H}

, and not

L

, which will be relevant for the definition of divergence. The reason is that the former, as we will see below, has a clear geometric interpretation in terms of flows, while the latter would just add the term

{\dot{\partial}}_{l_{Z}^{H} ℂ}

to that interpretation. Moreover, Theorem 1 (D) actually corresponds to a Cartan formula for

L_{Z}

whose full development we postpone for a future work. Thus,

L_{Z} (d Vol) = L_{Z}^{H} (d Vol)

can be regarded as an initial guess for the divergence of Z, but we will not employ

L^{H}

from now on.

Let us observe that given a diffeomorphism

ψ_{t} : M \to M

that is the flow of an isotropic vector field Z, we can define the pullback

ψ_{t}^{*} (ω)

of an anisotropic differential form

ω \in Ω_{s} (M_{A})

as the anisotropic form given by

ψ_{t}^{*} {(ω)}_{v} (u_{1}, \dots, u_{s}) : = ω_{P_{t} (v)} (d ψ_{t} (u_{1}), \dots, d ψ_{t} (u_{s}))

, where

P_{t} (v)

is the

H A

-parallel transport of v along the integral curve of Z and

u_{1}, \dots, u_{s} \in T_{π (v)} M

.

Proposition 1.

If

Z \in X (M)

and

ω \in Ω_{s} (M_{A})

, then

l_{Z}^{H} ω = lim_{t \to 0} \frac{ψ_{t}^{*} (ω) - ω}{t},

(31)

where

ψ_{t}

is the (possibly local) flow of Z.

Proof.

Observe that

ψ_{t}^{*} {(ω)}_{v}

can be obtained as

ψ_{t}^{*} (ω_{V})

with V an extension of v such that

D_{Z} V = 0

. Then (25) and the classical formula for the Lie derivative in terms of the flow imply (31). □

Remark 3.

Even though, for convenience, we stated the previous geometrical interpretation for an s-form ω, it should be clear that it holds true for any r-contravariant s-covariant A-anisotropic tensor.

4.2. Lie Bracket Definition of Divergence

Finally, in this and the next subsections a pseudo-Finsler metric L defined on A is fixed again. In its presence, and in view of the Riemannian case and Proposition 1, the most natural way of defining the divergence of an anisotropic vector field Z is by

l_{Z}^{H} (d Vol)

. Here there is a canonical choice for

H A

: the metric nonlinear connection of L. The definition obtained this way is unbiased, in that one does not choose any anisotropic connection a priori. Notwithstanding, it will turn out to be most conveniently expressed in terms of the Chern connection.

Definition 3.

For

Z \in X (M_{A})

, its divergence with respect to the pseudo-Finsler metric L is the anisotropic function

div (Z) \in F (A)

defined by

l_{Z}^{H} (d Vol) = : div (Z) d Vol,

where

H A

and

d Vol

are the metric nonlinear connection (4) and the metric volume form (15) of L, resp.

Remark 4.

Even though we will keep assuming it for simplicity, the hypothesis of M being orientable is not really needed for this definition. As in pseudo-Riemannian geometry, on small enough open sets

U \subseteq M

it is always possible to choose an orientation, define

d {Vol}_{U} \in Ω_{n} (M_{A})

with respect to it and put

{div (Z)|}_{A \cap T U} d {Vol}_{U} : = l_{Z}^{H} (d {Vol}_{U})

. The different definitions will be coherent because when the orientation changes,

d {Vol}_{U}

changes to

- d {Vol}_{U}

and

l_{Z}^{H} (- d {Vol}_{U}) = - l_{Z}^{H} (d {Vol}_{U}) = {- div (Z)|}_{A \cap T U} d {Vol}_{U} = div {(Z)}_{A \cap T U} (- d {Vol}_{U}) .

In particular, when M is orientable,

div (Z)

is independent of the orientation choice.

Proposition 2.

Let L be a fixed pseudo-Finsler metric defined on A, and let

Z \in X (M_{A})

. If ∇ is any symmetric A-anisotropic connection such that its underlying nonlinear connection is the metric one and

\nabla_{Z} (d Vol) = 0

, then

div (Z) = trace (\nabla Z),

(32)

or in coordinates,

div (Z) = \frac{δ Z^{i}}{δ x^{i}} + Γ_{i k}^{i} Z^{k}

(33)

This, in particular, is true for the (Levi-Civita)–Chern anisotropic connection of L, so one can take the Christoffel symbols to be those of (5).

Proof.

One expresses the Z-Lie bracket of the volume form in terms of the anisotropic connection, analogously to the isotropic case. From (15) and the fact that

l_{Z}^{H}

is a tensor derivation, we obtain

\begin{matrix} div (Z) \sqrt{|det g_{a b}|} & = div (Z) d Vol (\partial_{1}, \dots, \partial_{n}) \\ = l_{Z}^{H} (d Vol) (\partial_{1}, \dots, \partial_{n}) \\ = l_{Z}^{H} (d Vol (\partial_{1}, \dots, \partial_{n})) - \sum_{i = 1}^{n} d Vol (\partial_{1}, \dots, l_{Z}^{H} \partial_{i}, \dots, \partial_{n}) . \end{matrix}

(26) and the fact that

H A

is the underlying nonlinear connection of ∇ give

l_{Z}^{H} (d Vol (\partial_{1}, \dots, \partial_{n})) = Z^{H} (d Vol (\partial_{1}, \dots, \partial_{n})) = \nabla_{Z} (d Vol (\partial_{1}, \dots, \partial_{n})) .

(21) and

Tor = 0

d Vol (\partial_{1}, \dots, l_{Z}^{H} \partial_{i}, \dots, \partial_{n}) = d Vol (\partial_{1}, \dots, \nabla_{Z} \partial_{i}, \dots, \partial_{n}) - d Vol (\partial_{1}, \dots, \nabla_{\partial_{i}} Z, \dots, \partial_{n}) .

From these and

\nabla_{Z} (d Vol) = 0

,

\begin{matrix} div (Z) \sqrt{|det g_{a b}|} & = \nabla_{Z} (d Vol (\partial_{1}, \dots, \partial_{n})) - \sum_{i = 1}^{n} d Vol (\partial_{1}, \dots, \nabla_{Z} \partial_{i}, \dots, \partial_{n}) \\ + \sum_{i = 1}^{n} d Vol (\partial_{1}, \dots, \nabla_{\partial_{i}} Z, \dots, \partial_{n}) \\ = \nabla_{Z} (d Vol) (\partial_{1}, \dots, \partial_{n}) + \sum_{i = 1}^{n} d Vol (\partial_{1}, \dots, \nabla_{\partial_{i}} Z, \dots, \partial_{n}) \\ = \sum_{i = 1}^{n} d Vol (\partial_{1}, \dots, \nabla_{\partial_{i}} Z, \dots, \partial_{n}) \\ = trace (\nabla Z) \sqrt{|det g_{a b}|}, \end{matrix}

(34)

where the last equality is reasoned analogously as in the proof of (25).

For the Chern connection, it can be checked that

\nabla (d Vol) = 0

by considering a parallel orthonormal basis with respect to a parallel observer V along the integral curves of any vector field. The coordinate expression of

trace (\nabla Z)

in this case concludes (33). □

4.3. Divergence Theorem and Boundary Term Representations

Our Lie bracket derivation allows us to obtain a statement of the Finslerian divergence theorem that subsumes both Rund’s (3.17 in [18]) and Minguzzi’s (Theorem 2 in [19]). This way, it does not need of computations in coordinates from the beginning nor of the “pullback metric” (

g_{V}

in our notation). Naturally, our statement does not include Shen’s (Theorem 2.4.2 in [32]), as this one is an independent generalization of the Riemannian theorem not dealing with anisotropic differential forms nor vector fields.

Lemma 2.

For

X \in X (M_{A})

, the vertical derivative of

d Vol

is given by

{\dot{\partial}}_{X} (d Vol) = C^{m} (X) d Vol,

(35)

where

C^{m}

is the mean Cartan tensor of L (see (3)).

Proof.

Let

E_{1} (t)

, …,

E_{n} (t)

be a positively oriented

g_{v + t X}

-orthonormal basis for every

t \in [0, ε]

for a certain

ε > 0

. Then

d {Vol}_{v + t X} (E_{1} (t), \dots, E_{n} (t)) = 1

for all

t \in [0, ε]

. This implies that

{\dot{\partial}}_{X} {(d Vol)}_{v} (E_{1} (0), \dots, E_{n} (0)) + \sum_{i = 1}^{n} d {Vol}_{v} (E_{1} (0), \dots, {\dot{E}}_{i} (0), \dots, E_{n} (0)) = 0 .

Moreover, as

g_{v + t X} (E_{i} (t), E_{i} (t)) = \pm 1

,

2 C_{v} (E_{i} (0), E_{i} (0), X) + 2 g_{v} ({\dot{E}}_{i} (0), E_{i} (0)) = 0 .

Using this relation above, we conclude (35). □

In the present article, by a domain

\bar{D}

we understand a nonempty connected set that coincides with the closure of its interior D; then, its boundary is

\partial \bar{D} = \partial D

. Physically, it is very important to include examples in which different parts of

\partial D

have different causal characters, and this tipically leads to the boundary not being totally smooth. Hence, we will make a weaker regularity assumption that still allows one to apply Stokes’ theorem on

\bar{D}

. A subset of M has 0 m-dimensional measure if its intersection with any embedded m-dimensional submanifold

σ \subseteq M

is of 0 measure in the smooth manifold

σ

. Finally, the interior product of an s-form

ω

with a vector field X will be

ı_{X} ω : = ω (X, -, \dots, -) .

Theorem 2.

Let L be a fixed pseudo-Finsler metric defined on A. If

(i): $Z \in X (M_{A})$ is an anisotropic vector field,
(ii): $V \in X^{A} (U)$ is an A-admissible field with $U \subseteq M$ open, and
(iii): $\bar{D} \subseteq U$ is a domain with $\partial D$ smooth up to subset of 0 $(n - 1)$ -dimensional measure on M and $Supp (Z_{V}) \cap \bar{D}$ compact,

then

\begin{matrix} \int_{D} div {(Z)}_{V} d {Vol}_{V} + \int_{D} \{C^{m} (D_{Z} V) + trace ({\dot{\partial}}_{D V} Z)\} d {Vol}_{V} \\ = \int_{\partial D} ı_{Z_{V}} (d {Vol}_{V}), \end{matrix}

(36)

where

C^{m}

is the mean Cartan tensor, and

D V

is computed with the metric nonlinear connection (4).

Proof.

The idea is to apply Stokes’ theorem to

L_{Z_{V}} (d {Vol}_{V})

. However, taking into account (25) and Lemma 2, it follows that

L_{Z_{V}} (d {Vol}_{V}) = l_{Z}^{H} {(d Vol)}_{V} + \{C^{m} (D_{Z} V) + trace ({\dot{\partial}}_{D V} Z)\} d {Vol}_{V},

concluding (36). □

Remark 5

(Riemannian and Finslerian unit normals). Let

i : Γ ↪ M

be the inclusion of a smooth open subset

Γ \subseteq \partial D

.

(i): Even though we do not use the pseudo-Riemannian metric $g_{V}$ to derive Theorem 2, from our physical viewpoint it is natural to use it to re-express the boundary term. If Γ is non- $g_{V}$ -lightlike, then for a $g_{V}$ -normal field ${\hat{N}}_{V}$ and a transverse field X along i, the form

$d σ_{V} : = sgn (g_{V} ({\hat{N}}_{V}, {\hat{N}}_{V})) \frac{\sqrt{|g_{V} ({\hat{N}}_{V}, {\hat{N}}_{V})|}}{g_{V} ({\hat{N}}_{V}, X)} i^{*} (ı_{X} (d {Vol}_{V})) \in Ω_{n - 1} (Γ)$

(37)

is nonvanishing and independent of X. In particular,

$d σ_{V} = \frac{1}{\sqrt{|g_{V} ({\hat{N}}_{V}, {\hat{N}}_{V})|}} i^{*} (ı_{{\hat{N}}_{V}} (d {Vol}_{V}))$

is independent of the scale of ${\hat{N}}_{V}$ , which we will always assume to be $g_{V}$ -unitary and D-salient, so

$d σ_{V} = i^{*} (ı_{{\hat{N}}_{V}} (d {Vol}_{V}))$

coincides with the hypersurface $g_{V}$ -volume form of Γ. Taking into account that $i^{*} (ı_{Z_{V}} (d {Vol}_{V}))$ vanishes wherever $Z_{V}$ is tangent to Γ and that $g_{V} ({\hat{N}}_{V}, {\hat{N}}_{V}) = \pm 1$ , (37) allows us to represent and the right-hand side of (36) as

$\int_{Γ} ı_{Z_{V}} (d {Vol}_{V}) = \int_{Γ} g_{V} ({\hat{N}}_{V}, {\hat{N}}_{V}) g_{V} ({\hat{N}}_{V}, Z_{V}) d σ_{V} .$

(38)

In fact, this is how Rund’s divergence theorem follows from Theorem 2.
(ii): There is another way that one can try to represent the boundary term. Namely, assume that there exists a smooth $ξ : p \in Γ \to ξ_{p} \in A \cap T_{p} M$ with $T_{p} Γ = Ker g_{ξ_{p}} (ξ_{p}, -)$ and $L (ξ_{p}) = \pm 1$ (in the Lorentz–Finsler case, it will necessarily be $L (ξ) = 1$ ). This is called a Finslerian unit normal along $Γ$ . Analogously as in (i), one can put

$d Σ_{V}^{ξ} : = L (ξ) \frac{1}{g_{ξ} (ξ, X)} i^{*} (ı_{X} (d {Vol}_{V})) = i^{*} (ı_{ξ} (d {Vol}_{V})),$

$\int_{Γ} ı_{Z_{V}} (d {Vol}_{V}) = \int_{Γ} ϵ_{ξ} L (ξ) g_{ξ} (ξ, Z_{V}) d Σ_{V}^{ξ};$

(39)

here, due to the possible orientation difference between both sides,

$ϵ_{ξ} = \{\begin{matrix} 1, & where ξ is D - salient, \\ - 1 & where ξ is D - entering . \end{matrix}$

In fact, this is how Minguzzi deduces his divergence theorem (Theorem 2 in [19]). Note, however, that he does it under the hypothesis of vanishing mean Cartan tensor ( $C^{m} = 0$ ), which implies that $d Σ_{V}^{ξ}$ is independent of V. As we do not require this, Theorem 2 is more general statement than Minguzzi’s.
(iii): The Finslerian unit normal presents some issues in the general case, as we are not taking $A = T M ∖ 0$ . In our physical interpretation, with L Lorentz–Finsler, A consists of timelike vectors, so asking for a Finslerian unit normal is only reasonable when Γ is L-spacelike, that is, $T_{p} Γ \cap (A \cap \partial A) = \emptyset$ for $p \in Γ$ . In such a case, the strong concavity of the indicatrix $\{v \in A_{p} : L (v) = 1\}$ guarantees the existence and uniqueness of ξ: one defines $ξ_{p}$ to be the unique vector such that $T_{p} Γ + ξ_{p}$ and the indicatrix are tangent at $ξ_{p}$ .
(iv): Of course, if L comes from a pseudo-Riemannian metric on M, then $ξ = ϵ_{ξ} {\hat{N}}_{V} = ϵ_{ξ} \hat{N}$ and $d Σ_{V}^{ξ} = ϵ_{ξ} d σ_{V} = ϵ_{ξ} d σ$ .
(v): It should be clear from this discussion that the form that one integrates on the right-hand side of (36) is always the same and that the only difference between Rund’s and Minguzzi’s divergence theorems is how each of them represents it. Notwithstanding, this is an important difference, for the boundary terms (38) and (39) could potentially have different physical interpretations.

5. Divergence of Anisotropic Tensor Fields

Our developments of the previous section will allow us to obtain integral Finslerian conservation laws for a tensor T with

div (T) = 0

. We obtain one for each

V \in X^{A} (U)

satisfying certain hypotheses. Physically, T can be interpreted as an anisotropic stress–energy tensor and V as an observer field. We will also revisit two of the main examples with a clearer physical interpretation: Special Relativity and the conservation of the “total energy of the universe.” In order to do all this, let us see how the Chern connection enters the Finslerian definition of

div (T)

.

5.1. Definition of Divergence with the Chern Connection

Proposition 2 motivates the most natural definition of divergence of

T \in T_{1}^{1} (M_{A})

. Namely, by analogy with the classical case, we shall require (11) to hold for any anisotropic vector field

X \in X (M_{A})

. This makes the Chern connection appear now: it is the only Finslerian connection ∇ for which one can assure that (32) holds independently of

Z : = T (X)

. We shall also explore the conditions under which the term

trace (\nabla Z)

vanishes in the general Finslerian setting.

Proposition 3.

Let L be a fixed pseudo-Finsler metric defined on A with metric nonlinear connection

H A

and Chern anisotropic connection ∇. Also, let

S \in T_{2}^{0} (M_{A})

be symmetric,

v \in A

,

T \in T_{1}^{1} (M_{A})

and

X \in X (M_{A})

.

(A) The following are equivalent.

(Ai): $S_{v} (-, \nabla_{-}^{v} X)$ is antisymmetric.
(Aii): $\nabla^{v} X$ is anti-self-adjoint with respect to $S_{v}$ , that is, $S_{v} (\nabla_{-}^{v} X, -) = - S_{v} (-, \nabla_{-}^{v} X)$ .
(Aiii): ${(l_{X}^{H} S)}_{v} = \nabla_{X}^{v} S$ .

(B) One has

div (T (X)) - trace (T (\nabla X)) = C_{2}^{1} (\nabla T) (X),

where

C_{2}^{1}

is the operator that contracts the contravariant index with the covariant one introduced by ∇.

(C) One has

trace (T (\nabla X)) (v) = 0

assuming any of the following conditions.

(Ci): $T_{v}^{♭} (-, \nabla_{-}^{v} X)$ is antisymmetric.
(Cii): $T_{v}^{♭}$ is symmetric and ${(l_{X}^{H} g)}_{v} = 0$ .

Proof.

For (A), take

Y, W \in X (M)

. The antisymmetry of

S_{v} (-, \nabla_{-}^{v} X)

reads

S_{v} (\nabla_{Y}^{v} X, W) = S_{v} (W, \nabla_{Y}^{v} X) = - S_{v} (Y, \nabla_{W}^{v} X),

which is exactly the anti-self-adjointness of

\nabla^{v} X

with respect to

S_{v}

. Besides, (26) and (21) together with

Tor = 0

for the Chern connection give

\begin{matrix} l_{X}^{H} S (Y, W) \\ = X^{H} (S (Y, W)) - S (l_{X}^{H} Y, W) - S (Y, l_{X}^{H} W) \\ = X^{H} (S (Y, W)) - S (\nabla_{X} Y - \nabla_{Y} X, W) - S (Y, \nabla_{X} W - \nabla_{W} X) \\ = \nabla_{X} S (Y, W) + S (\nabla_{Y} X, W) + S (Y, \nabla_{W} X), \end{matrix}

(40)

which shows that

{(l_{X}^{H} S)}_{v} = \nabla_{X}^{v} S

also is equivalent to the anti-self-adjointness.

For (B), all the computations in (11) hold formally the same in the general Finslerian case due to Proposition 2.

As for the vanishing of

trace (T (\nabla X)) (v)

, it follows from (Ci) by the same computations as in (12). Indeed, the antisymmetry can be expressed as

T_{l j} (v) \nabla_{i} X^{j} (v) + T_{i j} (v) \nabla_{l} X^{j} (v) = 0 .

It also follows from (Cii) by (13). Indeed,

{(l_{X}^{H} g)}_{v} = 0

is equivalent to

\nabla^{v} X

being anti-self-adjoint with respect to

g_{v}

, and this can be expressed as

g^{l i} (v) \nabla_{i} X^{j} (v) + g^{j i} (v) \nabla_{i} X^{l} (v) = 0 .

□

Remark 6

(

l_{X}^{H} g

and Finslerian Killing fields). In classical relativity (g, T, and X isotropic), the second condition in (C ii) above would read

{(L_{X} g)}_{π (v)} = 0

, and

L_{X} g = 0

would be equivalent to X being a Killing vector field. In the general case, X being Killing can be defined by the conditions

X \in X (M)

and

L_{X} L = 0

(Section 5 in [21]), but (using Theorem 1 (C), the facts that

\dot{\partial} ℂ = Id

and

C (ℂ, -, -) = 0

, and also (40))

\begin{matrix} L_{X} L & = L_{X} (g (ℂ, ℂ)) \\ = L_{X} g (ℂ, ℂ) + 2 g (L_{X} ℂ, ℂ) \\ = (l_{X}^{H} g - {\dot{\partial}}_{l_{X}^{H} ℂ} g) (ℂ, ℂ) + 2 g (l_{X}^{H} ℂ - {\dot{\partial}}_{l_{X}^{H} ℂ} ℂ, ℂ) \\ = l_{X}^{H} g (ℂ, ℂ) - 2 C (ℂ, ℂ, l_{X}^{H} ℂ) + 2 g (l_{X}^{H} ℂ - l_{X}^{H} ℂ, ℂ) \\ = l_{X}^{H} g (ℂ, ℂ) \\ = \nabla g (ℂ, ℂ) + g (\nabla_{ℂ} X, ℂ) + g (ℂ, \nabla_{ℂ} X) \\ = 2 g (ℂ, \nabla_{ℂ} X) \end{matrix}

This way, we see that neither of X being Killing or

l_{X}^{H} g = 0

implies the other, and additionally we recover the characterization of (Proposition 6.1 (i) in [33]).

Definition 4.

Let L be a fixed pseudo-Finsler metric defined on A with (Levi-Civita–)Chern anisotropic connection ∇. For

T \in T_{1}^{1} (M_{A})

, its divergence with respect to L is defined as

div (T) : = C_{2}^{1} (\nabla T) \in T_{1}^{0} (M_{A}) = Ω_{1} (M_{A}),

where

C_{2}^{1}

is the operator that contracts the contravariant index with the covariant one introduced by ∇. In coordinates,

div {(T)}_{j} = \nabla_{i} T_{j}^{i} = δ_{i} T_{j}^{i} + Γ_{i k}^{i} T_{j}^{k} - Γ_{i j}^{k} T_{k}^{i}

(41)

for the Christoffel symbols of (5).

Remark 7

(Divergence vs. raising and lowering indices).

(i): First and foremost, by construction, (11) indeed holds for any $X \in X (M_{A})$ . At this point, it is important that the connection with which one defines $trace (\nabla X)$ is the Chern one.
(ii): Thanks to the fact that the Chern connection parallelizes g, namely, $\nabla_{k} g_{i j} = 0$ and $\nabla_{k} g^{i j} = 0$ , the following hold:

$g^{i k} \nabla_{k} T_{i j} = g^{i k} g_{i l} \nabla_{k} T_{j}^{l} = \nabla_{k} T_{j}^{k} = div {(T)}_{j},$

(42)

$\nabla_{i} T^{i j} = \nabla_{i} T_{l}^{i} g^{l j} = g^{j l} div {(T)}_{l} .$

(43)

This means that one could define the divergences of $S \in T_{2}^{0} (M_{A})$ and $R \in T_{0}^{2} (M_{A})$ straightforwardly,9 $div (S) = C_{1, 3} (\nabla S) \in T_{1}^{0} (M_{A}) = Ω_{1} (M_{A})$ and $div (R) = C_{1}^{1} (\nabla R) \in T_{0}^{1} (M_{A}) = X (M_{A})$ , and then (42) and (43) would read, respectively

$div (T^{♭}) = div (T),$

$div (T^{♯}) = div {(T)}^{♯} .$
(iii): Regardless of this, in general we are not assuming the symmetry of $T^{♭}$ or $T^{♯}$ —we only did in Proposition 3 (Cii). Instead, at the beginning of §5 we fixed a convention for the order of the indices in $T_{i j}$ and $T^{i j}$ (for example, $T^{♭} (X, Y) = g (X, T (Y)) \neq g (T (X), Y)$ )—in the remainder of §4 and with said condition (Cii) only.

5.2. Chern vs. Berwald

One needs to keep in mind a discussion present in [20]. The metric connection

H A

is the underlying nonlinear connection of an infinite family of A-anisotropic connections ∇. One of them is the (Levi–Civita)–Chern connection of L, which is the horizontal part of Chern–Rund’s and Cartan’s classical connections and has Christoffel symbols (5). All the others are this one plus an anisotropic tensor

Q \in T_{2}^{1} (M_{A})

with

Q (-, ℂ) = 0

when viewed as an

F (A)

-bilinear map

X (M_{A}) \times X (M_{A}) \to X (M_{A})

. In particular, for

Q = - {Lan}^{♯}

, one gets the Berwald anisotropic connection of L, which is the horizontal part of Berwald’s and Hasiguchi’s classical connections and has Christoffel symbols (6). We did not as a priory select any of these ∇’s.

In some of the previous literature [34,35,36,37], the Finslerian divergence of vector fields was chosen to be defined directly with the Chern connection. In [18,19], the quantity

trace (\nabla Z)

, with ∇ the Chern anisotropic connection, was referred to as the divergence of Z, though only after it had appeared in the divergence theorem. We have proven that the most natural definition leads to this characterization, hence clarifying why using Chern’s covariant derivative is not arbitrary. Moreover, we have seen that said derivative fulfills the natural requisite (11) and is compatible with the lowering and raising of indices; these are key properties when it comes to the stress–energy tensor T. Still, it is important to compare this with what happens when one uses the other most natural covariant derivative: Berwald’s.

Remark 8

(Divergence in terms of the Berwald connection). Let ∇ be the Chern anisotropic connection of L, with Christoffel symbols (5), and

\hat{\nabla}

be the Berwald one, with symbols (6).

(i): (33) and (41) read respectively

$div (Z) = {\hat{\nabla}}_{i} Z^{i} + {Lan}_{k} Z^{k} = trace (\hat{\nabla} Z) + {Lan}^{m} (Z),$

$\begin{matrix} div {(T)}_{j} & = {\hat{\nabla}}_{i} T_{j}^{i} + {Lan}_{k} T_{j}^{k} - {Lan}_{i j}^{k} T_{k}^{i} \\ = C_{2}^{1} {(\hat{\nabla} T)}_{j} + {Lan}^{m} {(T)}_{j} - C_{1}^{1} {({Lan}^{♯} (T (-), -))}_{j}, \end{matrix}$

where ${Lan}^{m}$ is the mean Landsberg tensor (see (7)) and the contraction operators have the obvious meanings. Moreover, for $X \in X (M_{A})$

$\begin{matrix} trace (T (\nabla X)) = T_{j}^{i} \nabla_{i} X^{j} & = T_{j}^{i} {\hat{\nabla}}_{i} X^{j} + T_{j}^{i} {Lan}_{i k}^{j} X^{k} \\ = trace (T (\hat{\nabla} X)) + trace ({Lan}^{♯} (T (-), X)), \end{matrix}$

which makes (11) consistent with the previous formulas.
(ii): One sees that the vanishing of ${Lan}^{m}$ (or of the mean Cartan $C^{m}$ , see ((6.37) in [38])) implies that the divergence of elements of $X (M_{A})$ coincides with the trace of their Berwald covariant derivative. However, ${Lan}^{m} = 0$ (or even $C^{m} = 0$ ) is not enough if one wants to obtain the same characterization for elements of $T_{1}^{1} (M_{A})$ .

Remark 9

(Sufficient conditions for

l_{X}^{H} g = 0

and being Finslerian Killing). In Remark 13 one could see that

X \in X (M)

together with

\nabla_{ℂ} X = 0

is sufficient for X to be Killing. This condition does not privilege the Chern connection ∇ against the Berwald

\hat{\nabla}

:

\nabla_{ℂ} X = {\hat{\nabla}}_{ℂ} X + {Lan}^{♯} (ℂ, X) = {\hat{\nabla}}_{ℂ} X

(see ((38) in [21]), where

L^{♭}

is what here we would denote

{Lan}^{♯}

). However, when it comes to the stress–energy tensor, we have seen that the relevant condition is not this but rather

l_{X}^{H} g = 0

. Proposition 3 (A) implies that

\nabla^{v} X = 0

is sufficient for

{(l_{X}^{H} g)}_{v} = 0

, and this does privilege ∇ against

\hat{\nabla}

.

5.3. Finslerian Conservation Laws and Main Examples

Compare the results here with the classical case (14) and also with [19].

Corollary 1.

Let L be a fixed pseudo-Finsler metric defined on A. If

(i): $X \in X (M_{A})$ is an anisotropic vector field,
(ii): $V \in X^{A} (U)$ is an A-admissible field with $U \subseteq M$ open,
(iii): $T \in T_{1}^{1} (M_{A})$ is an anisotropic 2-tensor, and
(iv): $\bar{D} \subseteq U$ is a domain with $\partial D$ smooth up to subset of 0 $(n - 1)$ -dimensional measure on M and $Supp (X_{V}) \cap \bar{D}$ compact,

then

\begin{matrix} \int_{D} div (T) (X) d {Vol}_{V} + \int_{D} trace {(T (\nabla X))}_{V} d {Vol}_{V} \\ + \int_{D} \{C^{m} (D_{T (X)} V) + trace ({\dot{\partial}}_{D V} T (X))\} d {Vol}_{V} = \int_{\partial D} ı_{T {(X)}_{V}} (d {Vol}_{V}), \end{matrix}

(44)

where

C^{m}

is the mean Cartan tensor, and

D V

is computed with the metric nonlinear connection (4).

Proof.

Just take

Z = T (X)

in Theorem 2 and use part (B) or Proposition 3. □

Remark 10.

Observe that (44) allows for an interpretation of the divergence of T in terms of the flow in the boundary. Consider a sequence of domains

D_{m}

such that their volumes go to zero when

m \to + \infty

and consider an observer V such that is infinitesimally parallel at

p \in M

, namely,

D V = 0

in

p \in M

and X such that

\nabla^{v} X = 0

. Then (44) and the mean value theorem imply that

div {(T)}_{v} (X) = lim_{m \to + \infty} \frac{1}{{Vol}_{V} (D_{m})} \int_{\partial D_{m}} ı_{T {(X)}_{V}} (d {Vol}_{V}) .

In particular,

div {(T)}_{v} = 0

can be interpreted as that the observer v measures conservation of energy in its restspace.

Corollary 2.

In the ambient of the previous corollary, assume:

(i): $div {(T)}_{V} = 0$ .
(ii): Any of the conditions (Ci) or (Cii) of Proposition 3 holds for $T_{V}^{♭}$ .
(iii): $C^{m} (D_{T (X)} V) + trace \{{\dot{\partial}}_{D V} (T (X))\} = 0$ .

Then

\int_{\partial D} ı_{T_{V} (X_{V})} (d {Vol}_{V}) = 0 .

(45)

Proof.

It follows from Corollary 1, taking into account that the hypotheses

(i)

,

(i i)

, and

(i i i)

imply that the three first integrals in (44) vanish. □

Remark 11.

(Sufficient conditions for the hypotheses (i), (ii), and (iii)).

(i): Obviously, $div (T) = 0$ suffices, but we do not need to assume that the divergence vanishes for all observers.
(ii): $X = ℂ$ suffices. In fact, $\nabla ℂ = 0$ (Proposition 2.9 in [17]), so (Ci) of Proposition 3 holds for $T_{V}^{♭}$ . Thus, assuming the other two hypotheses, we get

$\int_{\partial D} ı_{T_{V} (V)} (d {Vol}_{V}) = 0 .$
(iii): Although the hypothesis may seem artificial as it stands, there are a number of natural situations in which it is guaranteed. First, in classical relativity (g, T, and X isotropic), because $C^{m} = 0$ and $\dot{\partial} (T (X)) = 0$ ; the result is then independent of V. Second, when the observer field is parallel ( $D V = 0$ ), trivially. Third, when $D V = θ \otimes V$ for some 1-form V and $T (X)$ is 0-homogeneous, because of Euler’s theorem. Fourth, in the situation described in (Section 5.1 in [19]) (Z is our $T (X)$ , s is our V, and I is our $C^{m}$ ).

Remark 12.

(Representations of (45)). One needs to keep in mind Remark 5. For a smooth part Γ of

\partial D

, one can use the (salient) Riemannian unit normal to represent

\begin{matrix} \int_{Γ} ı_{T_{V} (X_{V})} (d {Vol}_{V}) & = \int_{Γ} g_{V} ({\hat{N}}_{V}, {\hat{N}}_{V}) g_{V} ({\hat{N}}_{V}, T_{V} (X_{V})) d σ_{V} \\ = \int_{Γ} g_{V} ({\hat{N}}_{V}, {\hat{N}}_{V}) T_{V}^{♭} ({\hat{N}}_{V}, X_{V}) d σ_{V} \end{matrix}

(46)

when Γ is non-

g_{V}

-lightlike, and the Finslerian unit normal to represent

\int_{Γ} ı_{T_{V} (X_{V})} (d {Vol}_{V}) = \int_{Γ} ϵ_{ξ} L (ξ) g_{ξ} (ξ, T_{V} (X_{V})) d Σ_{V}^{ξ}

when L is Lorentz–Finsler and Γ is L-spacelike. This makes it possible to have the very same conservation law (45) written in distinct ways, and in the examples below we will see that different expressions are preferable in different situations.

In the remainder of the section, we analyze the Finslerian conservation laws in two settings in which L is Lorentz–Finsler. In particular, g has signature

(+, -, \dots, -)

, A determines a time orientation,

L > 0

on A, and

(A, L)

is maximal with these properties. We also have regularity conditions at

\partial A

, and in fact one sees that Theorem 2 and Corollary 2 still hold when allowing that

Z, X \in X (M_{\bar{A}})

,

T \in T_{1}^{1} (M_{\bar{A}})

and

V \in X^{\bar{A}} (U)

. Despite this, in both settings it will be necessary to take V as L-timelike, so the regularity at

\partial A

will not be used.

5.3.1. Example: Lorentz Norms on an Affine Space

In this example, we shall particularize Corollary 2 to the easiest Finslerian setting in which we can assure that its hypothesis (iii) holds. Namely, the structure of an affine space automatically provides an infinite number of parallel observer fields,

V \in X^{A} (M)

with

D V = 0

.

To be preicse, suppose that

M = E

is an affine space equipped with a Lorentz norm on an open conic subset

A_{*} \subseteq \vec{E} ∖ 0

(a positive pseudo-Minkowski norm with Lorentzian signature in (Definition 2.11 in [23])). Under the usual identifications, such a norm can be seen as a Lorentz–Finsler L on

A \subseteq T E ∖ 0 \equiv E \times (\vec{E} ∖ 0)

that is independent of the first factor. Consequently, its fundamental tensor is nothing more than a Lorentzian scalar product

g_{v}

for each

v \in A_{*}

. The metric nonlinear connection of L coincides with the canonical connection of E, hence so do the Chern and Berwald anisotropic connections10. This is what implies that the parallel

V \in X^{A} (E)

correspond exactly to the elements

v \in A_{*}

.

Let us introduce some notation. Given

(p_{0}, v) \in A

with

L (v) = 1

, we can consider the Lorentzian scalar product

g_{v}

and the orthogonal hyperplane

R : = p_{0} + \vec{R} : = p_{0} + \{w \in \vec{E} : g_{v} (v, w) = 0\}

. We get an isometry

(t, p) \in ℝ \times R \mapsto p + t v \in E

, where

R

is equipped with

- {g_{v}|}_{R}

(a Euclidean scalar product),

ℝ \times R

with

d t^{2} + {g_{v}|}_{R}

(a Lorentzian one) and E with

g_{v}

. Let

\bar{Ω}

be a compact domain of

R

with

\partial Ω \subseteq R

smooth up to a null

(n - 2)

-dimensional measure set, and let

{\hat{n}}_{v}

be its salient unit

(- {g_{v}|}_{R})

-normal. Then, for

t_{0} < t_{1}

, the compact domain

\bar{D} \equiv [t_{0}, t_{1}] \times \bar{Ω} \subseteq E

has the required smoothness to apply Corollary 2, its boundary is

\partial D = \{t_{1}\} \times \bar{Ω} \cup [t_{0}, t_{1}] \times \partial Ω \cup \{t_{0}\} \times \bar{Ω}

, and its salient

g_{v}

-normal is given by

{{\hat{N}}_{v}|}_{\{t_{1}\} \times Ω} = v, {{\hat{N}}_{v}|}_{]t_{0}, t_{1}[\times \partial Ω} = {\hat{n}}_{v}, {{\hat{N}}_{v}|}_{\{t_{0}\} \times Ω} = - v;

g_{v} (- v, - v) = g_{v} (v, v) = L (v) = 1,

g_{v} ({\hat{n}}_{v}, {\hat{n}}_{v}) = - (- {g_{v}|}_{R}) ({\hat{n}}_{v}, {\hat{n}}_{v}) = - 1 .

Remark 13.

For a

V \in X^{A} (E)

identifiable with

v \in A_{*}

, we know that the hypothesis (iii) of Corollary 2 holds automatically. If (i) and (ii) hold too, then we get (45), for which we can use the representation (46). However, given the nature of the metric “nonlinear” and Chern “anisotropic” connections, it is easy to convince oneself that evaluating the result of anisotropic computations on this V is the same as first evaluating on V and then computing with isotropic tensors. For instance

div {(T)}_{V} = div (T_{V})

and

{(l_{X}^{H} g)}_{V} = L_{X_{V}} (g_{V})

. As a consequence, mathematically we get exactly the same conservation laws as if we just were in the Lorentzian affine space

(E, g_{v})

. Physically, though, different observers will measure different momenta.

Corollary 3.

Let

V \in X^{A} (E)

parallely identifiable with an

v \in A_{*}

. If

T \in T_{1}^{1} (E_{A})

is such that

div (T_{V}) = 0

and

X \in X (E_{A})

is such that

T_{V}^{♭} (-, \nabla_{-}^{V} X)

is antisymmetric, or

T_{V}^{♭}

is symmetric and

L_{X_{V}} (g_{V}) = 0

, then

\begin{matrix} 0 & = \int_{\{t_{1}\} \times Ω} T_{V}^{♭} (V, X_{V}) d σ_{V} - \int_{\{t_{0}\} \times Ω} T_{V}^{♭} (V, X_{V}) d σ_{V} \\ - \int_{]t_{0}, t_{1}[\times \partial Ω} T_{V}^{♭} ({\hat{n}}_{V}, X_{V}) d σ_{V}, \end{matrix}

(47)

where

d σ_{V}

is identifiable with the volume form of

- {g_{v}|}_{Ω}

on

\{t_{μ}\} \times Ω

and coincides with the volume form of

{g_{v}|}_{]t_{0}, t_{1}[\times \partial Ω}

on

]t_{0}, t_{1}[\times \partial Ω

.

Physically, even though Lorentz norms generalize Very Special Relativity [7], the classical interpretations of Special Relativity are still valid; we list them for completeness: v is an instantaneous observer at an event

p_{0}

,

\vec{R}

is its restspace and

R

is the simultaneity hyperplane of v, namely, the “universe at an instant, say

t = 0

, as seen by v.” The affine space structure allows for a canonical propagation of v to all of the spacetime. Hence, if

\bar{Ω}

is a space region at

t = 0

, then

\bar{D}

is the “evolution of

\bar{Ω}

along the time interval

[t_{0}, t_{1}]

as witnessed by v.” (47) expresses that the variation after some time of the total amount of

X_{v}

-momentum in Ω is exactly equal to the amount of it that flowed across

\partial Ω

.

5.3.2. Example: Cauchy Hypersurfaces in a Finsler Spacetime

Here we present a construction which manifestly generalizes that of the previous example, again with straightforward physical interpretations, and we find an estimate that allows us to interpret (47) when

\partial Ω

is “at infinity”. We will take

V \in X^{A} (U)

with

U \subseteq M

open, and we recall that we will assume the hypotheses of Corollary 2.

Suppose that the Finsler spacetime

(M, L)

is globally hyperbolic. By this, we mean that there is some (smooth, for simplicity) L-Cauchy hypersurface

S \subseteq M

: every inextensible L-timelike curve

γ : I \to M

(thus

\dot{γ} (t) \in A

) meets

S

exactly once. Let us assume that there are two L-spacelike Cauchy hypersurfaces

S_{0}, S_{1} \subseteq U

which do not intersect11. Then the results of [39] can be automatically transplanted: there exists a foliation by spacelike Cauchy hypersurfaces

M \equiv R \times S

such that

S_{0} \equiv \{t_{0}\} \times S

and

S_{1} \equiv \{t_{1}\} \times S

. Taking the Finslerian unit normal

ξ

to each level

\{t\} \times S

produces an L-timelike field

ξ \in X^{A} (M)

. We can take this

ξ

to be our V, but we will not do so for the most part of this example.

Suppose also that

\{\bar{Ω_{0, m}}\}

is an exhaustion by compact domains of

S_{0}

, namely

\bar{Ω_{0, m}} \subseteq Ω_{0, m + 1}

and

⋃_{m \in N} Ω_{0, m} = S_{0}

, such that

\partial Ω_{0, m} \subseteq S_{0}

is smooth a. e. For

p \in S_{0}

, let

γ_{p}

be the integral curve of V starting at p, which necessarily meets

S_{1}

at a unique instant

t_{p} \in R

. Put

Ω_{1, m} : = ⋃_{p \in Ω_{0, m}} γ_{p} (\{t_{p}\}) \subseteq S_{1}, Γ_{p} : = γ_{p} [min \{0, t_{p}\}, max \{0, t_{p}\}],

D_{m} : = ⋃_{p \in Ω_{0, m}} Γ_{p} \subseteq U, Γ_{m} : = ⋃_{p \in \partial Ω_{0, m}} Γ_{p} .

Remark 14.

By construction,

(i): $\{\bar{Ω_{1, m}}\}$ is again an exhaustion by compact domains of $S_{1}$ such that $\partial Ω_{1, m} = ⋃_{p \in \partial Ω_{0, m}} γ_{p} (\{t_{p}\}) \subseteq S_{1}$ is smooth a. e.
(ii): $\bar{D_{m}}$ is a compact domain of U with $\partial D_{m} = \bar{Ω_{1, m}} \cup Γ_{m} \cup \bar{Ω_{0, m}} \subseteq U$ smooth a. e. We do not really need to consider the union of all the $D_{m}$ ’s.

Next, for

Z \in X (M_{A})

, we shall give the quantitative decay condition on (some components of)

Z_{V}

so that the integral

\int_{Γ_{m}} ı_{Z_{V}} (d {Vol}_{V})

vanishes in the limit. The key fact for it will be that V is everywhere tangent to

Γ_{m}

(this is composed of

γ_{p}

’s). In particular, as V is

g_{V}

-timelike, so must be

Γ_{m}

.

Remark 15.

The presence of V allows us to define an auxiliar Riemannian metric

h_{V}

on U with norm

{∥-∥}_{V}

, which gives a very natural way of quantifying. Namely, if

\{e_{0} = V_{p} / F (V_{p}), e_{1}, \dots, e_{n}\}

is an orthonormal basis for

g_{V_{p}}

, then we prescribe it to be also

h_{V_{p}}

-orthonormal; equivalently,

h_{V_{p}} (u, w) = 2 g_{V_{p}} (u, \frac{V_{p}}{F (V_{p})}) g_{V_{p}} (w, \frac{V_{p}}{F (V_{p})}) - g_{V_{p}} (u, w) .

Then, by construction:

(i): The volume form of $h_{V}$ coincides with that of $g_{V}$ , namely $d {Vol}_{V}$ .
(ii): The salient unit $h_{V}$ -normal to $Γ_{m}$ coincides with the corresponding $g_{V}$ -normal. We denote it by ${\hat{N}}_{V}$ , as in Remark 12.
(iii): The hypersurface volume form of $Γ_{m}$ with respect to $h_{V}$ coincides with the one computed with $g_{V}$ , namely $d σ_{V} = i_{m}^{*} (ı_{{\hat{N}}_{V}} (d {Vol}_{V}))$ with $i_{m} : Γ_{m} ↪ U$ the inclusion. Hence we speak just of the hypersurface volume of $Γ_{m}$ , namely $σ_{V} (Γ_{m})$ . As ${\hat{N}}_{V}$ is $g_{V}$ -orthogonal to V, and hence $g_{V}$ -spacelike, we can use the representation

$\begin{matrix} \int_{Γ_{m}} ı_{Z_{V}} (d {Vol}_{V}) & = \int_{Γ_{m}} g_{V} ({\hat{N}}_{V}, {\hat{N}}_{V}) g_{V} ({\hat{N}}_{V}, Z_{V}) d σ_{V} \\ = - \int_{Γ_{m}} g_{V} ({\hat{N}}_{V}, Z_{V}) d σ_{V} . \end{matrix}$

(48)

Thanks to (48) and the fact that

g_{V} ({\hat{N}}_{V}, V) = 0

, we intuitively see that if

Z_{V}

is proportional to V at infinity and the hypersurface volume does not grow too much, then the integral will be negligible. To be precise, we require that

K_{m} σ_{V} (Γ_{m}) ⟶ 0 (m ⟶ \infty),

(49)

where

\begin{matrix} K_{m} : & = max_{Γ_{m}} {∥Z_{V} - g_{V} (Z_{V}, \frac{V}{F (V)}) \frac{V}{F (V)}∥}_{V} \\ = max_{Γ_{m}} \{\sqrt{g_{V} {(Z_{V}, \frac{V}{F (V)})}^{2} - g_{V} (Z_{V}, Z_{V})}\} . \end{matrix}

Corollary 4.

In the above set-up, let

T \in T_{1}^{1} (M_{A})

,

X \in X (M_{A})

, and

V \in X^{A} (U)

be such that the hypotheses of Corollary 2 hold on all the

D_{m}

’s, and put

Z : = T (X)

. If the decay condition (49) holds too, then

\int_{Ω_{1, m}} ı_{Z_{V}} (d {Vol}_{V}) + \int_{Ω_{0, m}} ı_{Z_{V}} (d {Vol}_{V}) ⟶ 0 (m ⟶ \infty),

(50)

where

Ω_{1, m}

is constructed from

Ω_{0, m}

by intersecting the integral curves of V with

S_{1}

.

Proof.

Corollary 2 can be applied on

\bar{D_{m}}

, as

Supp (Z_{V}) \cap \bar{D_{m}}

is always compact. This and the representation (48) give

0 = \int_{Ω_{1, m}} ı_{Z_{V}} (d {Vol}_{V}) + \int_{Ω_{0, m}} ı_{Z_{V}} (d {Vol}_{V}) - \int_{Γ_{m}} g_{V} ({\hat{N}}_{V}, Z_{V}) d σ_{V} .

(51)

Using the definition of

h_{V}

(Remark 15) and the Cauchy–Schwarz inequality,

\begin{matrix} 0 & \leq |\int_{Γ_{m}} - g_{V} ({\hat{N}}_{V}, Z_{V}) d σ_{V}| \\ \leq \int_{Γ_{m}} |g_{V} ({\hat{N}}_{V}, Z_{V})| d σ_{V} \\ = \int_{Γ_{m}} |g_{V} ({\hat{N}}_{V}, Z_{V} - g_{V} (Z_{V}, \frac{V}{F (V)}) \frac{V}{F (V)})| d σ_{V} \\ = \int_{Γ_{m}} |- h_{V} ({\hat{N}}_{V}, Z_{V} - g_{V} (Z_{V}, \frac{V}{F (V)}) \frac{V}{F (V)})| d σ_{V} \\ \leq \int_{Γ_{m}} {∥{\hat{N}}_{V}∥}_{V} {∥Z_{V} - g_{V} (Z_{V}, \frac{V}{F (V)}) \frac{V}{F (V)}∥}_{V} d σ_{V} \\ = \int_{Γ_{m}} {∥Z_{V} - g_{V} (Z_{V}, \frac{V}{F (V)}) \frac{V}{F (V)}∥}_{V} d σ_{V} \\ \leq \int_{Γ_{m}} K_{m} d σ_{V} \\ = K_{m} σ_{V} (Γ_{m}), \end{matrix}

so if

K_{m} σ_{V} (Γ_{m})

tends to 0, then so does the integral along

Γ_{m}

in (51). □

Remark 16.

In Corollary 4, if one of the integrals of

ı_{Z_{V}} (d {Vol}_{V})

along

S_{0}

or

S_{1}

exists in the Lebesgue sense, then so does the other and (50) reads

\int_{S_{1}} ı_{Z_{V}} (d {Vol}_{V}) + \int_{S_{0}} ı_{Z_{V}} (d {Vol}_{V}) = 0 .

Note that they could be

\pm \infty

, as we have not assumed, for instance, that

Z_{V}

is compactly supported in the union of all the

D_{m}

’s. Rather, we have assumed the decay condition (49) alone.

Remark 17.

(Sufficient conditions for (49)). As for ensuring the decay condition, there are two possible scenarios.

(i): The hypersurface volume $σ_{V} (Γ_{m})$ stays bounded. Then, it is enough for (49) that $K_{m} \to 0$ , and one could instead postulate the stronger condition that the maximum outside $D_{m}$ tends to 0, which is independent of the concrete compact exhaustion.
(ii): $σ_{V} (Γ_{m})$ grows without bound. In this case, one can just postulate that the decay of $K_{m}$ compensates the growth of $σ_{V} (Γ_{m})$ , but this does depend on the compact exhaustion.

Notice that this is a purely Finslerian difficulty. Indeed, suppose that g, T, and X were isotropic and that

Z = T (X)

was timelike. Then one could just set

V : = Z

and then carry out all the construction. Corollary 2 would be independent of the observer field (and its hypothesis (iii) would hold trivially), and

K_{m} = 0

regardless of

Γ_{m}

. This is how we get the following statement of the classical law.

Corollary 5.

In the above se-up, suppose that L comes from a Lorentzian metric on M. Let

T \in T_{1}^{1} (M)

and

X \in X (M)

be such that

div (T) = 0

and

T^{♭} (-, \nabla_{-} X)

is antisymmetric, or

T^{♭}

is symmetric and

L_{X} g = 0

. If

Z : = T (X)

is timelike, then

\int_{Ω_{1, m}} ı_{Z_{V}} (d {Vol}_{V}) + \int_{Ω_{0, m}} ı_{Z_{V}} (d {Vol}_{V}) ⟶ 0 (m ⟶ \infty),

where

Ω_{1, m}

is constructed from

Ω_{0, m}

by intersecting the integral curves of Z with

S_{1}

.

Remark 18

(Conservation in terms of the Finslerian unit normal).

(i): One could try to represent also the integrals of (50) in terms of $d σ_{V}$ , as in Section 5.3.1. However, according to Remark 12, that would require assuming that $S_{μ}$ is non- $g_{V}$ -lightlike, which is not very reasonable when all we know is that $S_{μ}$ L-spacelike and L-Cauchy.
(ii): On the other hand, in terms of the Finslerian unit normal ξ, (50) reads

$\int_{Ω_{1, m}} g_{ξ} (ξ, T_{V} (X_{V})) d Σ_{V}^{ξ} - \int_{Ω_{0, m}} g_{ξ} (ξ, T_{V} (X_{V})) d Σ_{V}^{ξ} ⟶ 0$

(52)

when $m \to \infty$ . The sign in front of the second integral is explained as follows (see Remark 5 (ii)). $d Σ_{V}^{ξ}$ selects an orientation on each $Ω_{μ, m}$ : the one for which $d {Vol}_{V} (ξ, -, \dots, -)$ is positive. However, in (50) $Ω_{1, m}$ already had an orientation $O_{1}$ and $Ω_{0, m}$ had $O_{0}$ : the $D_{m}$ -salient ones. Necessarily12, exactly one of these agrees with the $d Σ_{V}^{ξ}$ -orientation: $O_{1}$ if $S_{1}$ lays in the future of $S_{0}$ and $O_{0}$ if it is the opposite. Notice that this, and hence (52), would fail if the Cauchy hypersurfaces crossed.
(iii): In the case $V = ξ$ , (52) becomes

$\int_{Ω_{1, m}} T_{ξ}^{♭} (ξ, X_{ξ}) d Σ_{ξ} - \int_{Ω_{0, m}} T_{ξ}^{♭} (ξ, X_{ξ}) d Σ_{ξ} ⟶ 0,$

a conservation law in which all the terms are purely Finslerian.

Summing up, in this example we have proven a Finslerian (observer-dependent) version of the classical law that the total amount of X-momentum in the universe is conserved (Corollary 4). Our formulation is asymptotic, so it is valid even for infinite total

X_{V}

-momentum (Remark 16). We have recovered the classical law (Corollary 5), which always holds under hypotheses on T and X alone, while in the general Finslerian case nontrivial difficulties appear in the regime of big separation between the Cauchy hypersurfaces (high

σ (Γ_{m})

, Remark 17). Finally, we have expressed the law naturally in terms of the Finslerian unit normal (see (52)).

6. Conclusions

About the physical interpretation of T, Section 3:

1

Heuristic interpretations from fluids, Section 3.1 and Section 3.2—Possible breakings of Lorentz-invariance lead to non-trivial transformations of coordinates between observers. Such transformations are still linear and permit a well-defined energy-momentum vector at each tangent space

T_{p} M

, Section 3.1.

However, the stress–energy–momentum T must not be regarded as a tensor on each

T_{p} M

but as an anisotropic tensor. This depends intrinsically on each observer

u \in Σ

and may vary with u in a nonlinear way. Indeed, the breaking of Lorentz invariance does not permit to fully replicate the relativistic arguments leading to (isotropic) tensors on M, even though classical interpretations of the anisotropic T in terms of fluxes can be maintained, Section 3.2.

2

Lagrangian viewpoint, Section 3.3. In principle, the interpretations of Special Relativity about the canonical energy–momentum tensor associated with the invariance by translations remain for Lorentz norms and, thus, in Very Special Relativity. In the case of Lorentz–Finsler metrics, some issues to be studied further appear:

(a): The canonical stress–energy tensor in Relativity $δ S_{m a t t e r} / δ g^{μ ν}$ leads to different types of (anisotropic) tensors in the Finslerian setting (a scalar function $δ S_{m a t t e r} / δ L$ on $A \subseteq T M$ in the Einstein-Hilbert setting, higher order tensors in Palatini’s). Starting at such tensors, different alternatives to recover the heuristic physical interpretations in terms of a 2-tensor appear.
(b): In the particularly interesting case of a kinetic gas [1,16], the 1-PDF $ϕ$ becomes naturally the matter source for the Euler-Lagrange equation of the Finslerian Einstein-Hilbert functional. However, the variational derivation of $ϕ$ is obtained by means of a possibly non-natural Lagrangian. This might be analyzed by sharpening the framework of variational completion for Finslerian Einstein equations [15].

About the divergence theorem for anisotropic vector fields Z, Section 4:

1

Section 4.1: For any Lorentz–Finsler metric L, there is a natural definition of anisotropic Lie bracket derivation along Z, which depends only on the nonlinear connection

H A

and admits an interpretation by using flows.

2

Section 4.2: This bracket allows one to give a natural definition of

div (Z)

which depends exclusively on

H A

and the volume form of L. This provides a geometric interpretation for the definition of divergence introduced by Rund [18].

3

A general divergence theorem is obtained (Theorem 2) so that Section 4.3:

(a): It can be seen as a conservation law for Z measured by each observer field V, even if the conserved quantity depends on V.
(b): The computation of the boundary term is intrinsically expressed in terms of forms. However, several metric elements can be used to re-express it, in particular the normal vector field for: (i) the pseudo-Riemannian metric $g_{V}$ (Rund), or (ii) the pseudo-Finsler metric L, when L is defined on the whole $T M$ (Minguzzi).

About the conservation of the stress–energy T Section 5:

1: Section 5.1 and Section 5.2: The computation of $div (T)$ priviledges the Levi-Civita–Chern anisotropic connection, showing explicit equivalence with Rund’s approach.
2: Corollaries 1 and 2: A vector field $T {(X)}_{V}$ on M is preserved assuming that some natural elements vanish on V for T, X and $D V$ .
3: Section 5.3: Natural laws of conservation on Cauchy hypersurfaces under general conditions (including rates of decay for unbounded domains) can be obtained by a combination of the techniques (i) and (ii) in the item 3b above.

Author Contributions

Investigation, M.Á.J., M.S. and F.F.V. All authors have read and agreed to the published version of the manuscript.

Funding

MAJ was partially supported by the project PGC2018-097046-B-I00 funded by MCIN/ AEI /10.13039/501100011033/ FEDER “Una manera de hacer Europa” and Fundación Séneca project with reference 19901/GERM/15. This work is a result of the activity developed within the framework of the Programme in Support of Excellence Groups of the Región de Murcia, Spain, by Fundación Séneca, Science and Technology Agency of the Región de Murcia. MS and FFV were partially supported by the project PID2020-116126GB-I00 funded by MCIN/AEI/10.13039/501100011033, by the project PY20_01391 (PAIDI 2020) funded by Junta de Andalucía—FEDER and by the framework of IMAG-María de Maeztu grant CEX2020-001105-M funded by MCIN/AEI/10.13039/50110001103.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Kinematics: Observers and Relative Velocities

Here, we discusss a series of different possibilities for the notion of relative velocity between two observers, each one with a well-defined geometric construction. This is done as an academic exercise, because we do not discuss experimental issues (compare with [40,41]). However, it is worth emphasizing that all the possibilities studied here are intrinsic to the geometry of a flat model and, thus to any Finsler spacetime.

Start at an affine space endowed with a Lorentz norm let

u, u^{'} \in Σ

be two distinct observers and consider the plane

Π : =

Span

{u, u^{'}} \subset V

, which intersects transversally

C

and inherits a Lorentz–Finsler norm with indicatrix

Σ_{Π} : = Π \cap Σ

. Recall that both tangent spaces

T_{u} Π

and

T_{u^{'}} Π

inherit naturally a Lorentz scalar product by restricting the fundamental tensors

g_{u}

and

g_{u^{'}}

, resp. Moreover, their (1-dimensional) restspaces

l : = T_{u} Σ_{Π}

,

l^{'} : = T_{u^{'}} Σ_{Π}

also inherit a positive definite metric. In what follows, only the geometry of

Π

will be relevant.

Appendix A.1. The Lorentz Metric g Π up to a Constant

Notice that

Π \cap C_{p}

is composed by two half-lines spanned by two

C

-lightlike directions

w_{\pm}

; we will consider the orientation

Π

provided by the choice

(w_{+}, w_{-})

. One can determine a scalar product

g_{Π}

in

Π

(which is unique up to a positive constant), regarding both

w_{+}

and

w_{-}

as

g_{Π}

-lightlike in the same causal cone. It is easy to check that

Σ

must be a strongly convex curve which converges asymptotically to the vector lines spanned by

w_{\pm}

. This implies both

u \in Σ

will be timelike for

g_{Π}

and its restpace l will be

g_{Π}

-spacelike; we can assume also that the orientation

l_{+}

in l is induced by the chosen

w_{+}

.

Notice that

g_{u} (u, w_{\pm}) \geq 0

by the fundamental inequality, but

w_{\pm}

might be timelike or spacelike for

g_{u}

(although

g_{u} (u, w_{\pm}) \to 0

as

u \to w_{\pm}

). This possibility might be regarded as a possible measurement of the speed of light with respect to u by the observers in

Π

, namely, this velocity is in the orientation

l_{+}

when

w_{+}

is

g_{u}

-spacelike and smaller than 1 when it is timelike. However, a priori it is not clear an operational way to carry out such a measurement. Moreover such a measurement might be regarded as something non-intrinsic to the speed of light but to the way of measuring it.

Nevertheless, as pointed out in (Section 6 in [6]), there are several effects which might lead to a measurement of different speeds of light in different directions. So, we will consider that each

Π

has its own speeds of light

c_{Π}^{\pm}

in each spacelike orientation

l_{\pm}

. Indeed, given u and an orientation

l^{+}

, the speed of light

c_{Π}^{+}

will be defined as the the supremum of the relative velocities between u and all the observers

u^{'}

such that

u^{'} - u

yields the orientation

l^{+}

. Next, we will explain several possible meanings of these velocities. To avoid cluttering, next we will write

c_{Π}

, assuming that the appropriate choice in

c_{Π}^{\pm}

is done for each

u^{'}

.

Appendix A.2. Simple Relative Velocity

As

g_{u}

determines naturally a Lorentz metric on V, we can define the simple relative velocity

v_{u}^{s} (u^{'})

of

u^{'}

measured by u as the usual

g_{u}

-relativistic velocity between u,

u^{'}

normalized to

c_{Π}

, i.e.,

v_{u}^{s} (u^{'}) = c_{Π} tanh (θ) where cosh θ = - g_{u} (u, u^{'}) > 1,

(the latter by the reversed fundamental inequality). Clearly,

v_{u^{'}}^{s} (u) \neq v_{u}^{s} (u^{'})

in general, but this does not seem a drawback in the Finslerian setting.

A support for the physical plausibility of this velocity is that one could expect that each observer u will work as in Special Relativity just choosing an orthonormal frame of

g_{u}

. The possibility

g_{u} (v, v) \neq 1

might seem ackward from a dynamical viewpoint (see below), but it seems harmless as far as only kinematics is being considered. In principle, the comparison between the measurements of the two observers would be geometrically possible by using the unique isometry of

(T_{u} Π, g_{u})

to

(T_{u^{'}} Π, g_{u^{'}})

which maps u into

u^{'}

and is consistent with orientations induced from

Π

. What is more, this isometry can also be extended to a natural isometry from

(T_{u} V, g_{u})

to

(T_{u^{'}} V, g_{u^{'}})

, namely, regard

(Σ, g)

as a Riemannian metric and use the parallel transport from u to

u^{'}

along the segment of the curve

Π \cap Σ

from u to

u^{'}

. However, the following fact might suggest to explore further possibilities.

Remark A1.

Assume that Σ is modified into the indicatrix

\bar{Σ}

of another Lorentz–Finsler norm so that (i)

\bar{Σ} = Σ

around u and (ii)

u^{'} \in \bar{Σ}

but its

\bar{Σ}

restspace

{\bar{l}}^{'}

is different from

l^{'}

. Then, the simple velocity would remain unaltered, i.e.,

{\bar{v}}_{u}^{s} (u^{'}) = v_{u}^{s} (u^{'})

.

Appendix A.3. Velocity as a Distance between Observers

Notice that

Σ

can be regarded as a Riemannian manifold with the restriction of the fundamental tensor g and, then,

Σ \cap Π

can be regarded as a curve whose length can be computed. Then, the observers’ distance velocity is defined as:

v^{d} (u, u^{'}) = c_{Π} tanh ({length}_{g} {segment of Σ \cap Π from u to u^{'}}) .

Notice that this velocity is symmetric and it generalizes directly the one in Special Relativity providing a geometric interpretation for the addition of velocities. Recall that

v^{d} (u, u^{'})

has been defined essentially as a distance in

Σ \cap Π

, where

Π

depends of each pair of observers, thus, one might have

v^{d} (u, u^{'}) + v^{d} (u^{'}, u^{″}) < v^{d} (u, u^{″})

when

n > 2

. If one prefers to avoid such a possibility, it is enough to consider g-distance in the whole space of observers

Σ

(observers’ space distance velocity), at least in the case that

c_{Π}

is regarded as independent of

Π

.

Remark A2.

In the case studied in Remark A1, one would have

{\bar{v}}^{d} (u, u^{'}) \neq v^{d} (u, u^{'})

in general. However, the relative position of the restspaces l and

l^{'}

does not play any special role.

Appendix A.4. Length-Contraction and Velocity

Consider a segment S of l with

g_{u}

-length ℓ and the strip of V obtained by translating S in the direction of u. Let

S^{'}

be the intersection of this strip with

l^{'}

, which will be a new segment of

g_{u^{'}}

-length

ℓ^{'}

. Let

λ = ℓ^{'} / ℓ

be the length-contraction parameter. In the relativistic case,

λ < 1

and

λ \to 0

as

u^{'} \to C_{Π}

. The former property does not hold for a general Lorentz norm but the latter does. So, whenever

λ < 1

holds, we can define the length-contractive velocity

v_{u}^{c} (u^{'})

of

u^{'}

with respect tou as:

v_{u}^{c} (u^{'}) = c_{Π} \sqrt{1 - λ^{2}} .

Again, this velocity is not symmetric. Because of the strong convexity of

Σ

, a different observer

u^{'}

will have a different restspace

l^{'}

, but this does not imply a different length

ℓ^{'}

nor velocity

v_{u}^{c} (u^{'})

. However, this velocity gives a comparison between restspaces which was absent in the previous two velocities.

Appendix A.5. Symmetric Lorentz Velocities in Π

Let us consider the Lorentzian scalar product

g_{Π}

en

Π

, uncfique up to a positive constant (which will be irrelevant for our purposes) introduced above. Recall that u and

u^{'}

were timelike for

g_{Π}

and, moreover, both l and

l^{'}

were spacelike. Now, we can define two velocities between u and

u^{'}

: the simple Lorentz velocity,

v^{s} (u, u^{'}) = c_{Π} tanh (θ) where cosh θ = - \frac{g_{Π} (u, u^{'})}{\sqrt{g_{Π} (u, u) g_{Π} (u^{'}, u^{'})}},

and the length-contractive Lorentz velocity,

v^{c} (u, u^{'}) = c_{Π} tanh (θ) where cosh θ = - \frac{| g_{Π} (n, n^{'}) |}{\sqrt{g_{Π} (n, n) g_{Π} (n^{'}, n^{'})}},

where, in the latter, n,

n^{'}

are

g_{Π}

-timelike vectors orthogonal to l,

l^{'}

, resp.

Clearly, both velocities are symmetric. Their appearance might be physically sound because the intrinsic Lorentz metric

g_{Π}

(up to a constant) can be regarded as an object available (or, at least, a compromise one) for all the observers, as it would depend directly on physical light rays.

Notes

1	Berwald spaces [12,13] are an exception, as the parallel transport becomes an isometry between the Lorentz norms. Thus, in some sense, these spaces would admit a principle of equivalence with respect to a Lorentz-normed space (non-necessarily Lorentz–Minkowski).
2	In this section, $i, j = 1, 2, 3$ and $μ, ν = 0, 1, 2, 3$ , but in the others they will run freely from 1 to n (= dim M).
3	The symmetry of T is dropped for the case of theories with high spin because of its contribution to angular momentum.
4	See for example (Section 4.5 in [24], >Section 35 in [25]).
5	See for example [26] (around formula (E.1.36)) or (Section 32 in [25]).
6	See for example, (Section E.1 in [26]), (Section 4.3 in [27]), (Sections 21.2 and 21.3 in [28]).
7	Some arguments which support strongly their choice are (see [1]): (a) the simplest analogous to the vacuum Einstein equation in the Finslerian approach Ricci $= 0$ (proposed by Rund [18], and satisfied by Finsler pp-waves [9]) is not a variational equation; (b) the Ricci scalar functional yields an Euler–Lagrange equation, which agrees with Einstein’s in the vacuum Lorentz case, and (c) this Euler–Lagrange equation is the variational completion of the Finslerian Ricci $= 0$ .
8	This is not to be mistaken by the torsion of the nonlinear connection $H A$ , which would have coordinates $N_{j \cdot k}^{i} - N_{k \cdot j}^{i}$ (even though this can be seen as a particular case of the torsion of some ∇ and hence it is also denoted by $Tor$ in [17]).
9	Here, $C_{1, 3}$ is the operator that (metrically) contracts the first index of S with the one introduced by ∇, and $C_{1}^{1}$ is the operator that (naturally) contracts the first index of R with the one introduced by ∇.
10	For instance, it is clear that in affine coordinates the components of the metric spray vanish, so the geodesics are the straight lines of E.
11	The case when they interesect can be also conisdered by taking into account that, then, the open set $M ∖ J^{+} (S_{1} \cup S_{2})$ is still globally hyperbolic and a Cauchy hypersurface $S_{3}$ of this open subset will be also Cauchy for M (and it will not intersect any of the previous ones).
12	Suppose, for instance, that $S_{1}$ lays in the future of $S_{0}$ : the $γ_{p}$ ’s departing from $Ω_{0, m}$ reach points $γ_{p} (t_{p}) \in Ω_{1, m}$ with $t_{p} > 0$ . Take bases $(e_{1}, \dots, e_{n - 1})$ for $T_{p} Ω_{0, m}$ and $(e_{1}^{'}, \dots, e_{n - 1}^{'})$ for $T_{γ_{p} (t_{p})} Ω_{1, m}$ such that $(V_{p}, e_{1}, \dots, e_{n - 1})$ and $(V_{γ_{p} (t_{p})}, e_{1}^{'}, \dots, e_{n - 1}^{'})$ are $d Vol$ -positive. Then $(e_{1}, \dots, e_{n - 1})$ and $(e_{1}^{'}, \dots, e_{n - 1}^{'})$ are both $d Σ_{V}^{ξ}$ -positive ( $ξ$ and V always lie in the same half-space), the former is $O_{0}$ -negative (V is $D_{m}$ -entering at $S_{0}$ ) and the latter is $O_{1}$ -positive (V is $D_{m}$ -salient at $S_{1}$ ).

References

Hohmann, M.; Pfeifer, C.; Voicu, N. Relativistic kinetic gases as direct sources of gravity. Phys. Rev. D 2020, 101, 024062. [Google Scholar] [CrossRef]
Hohmann, M.; Pfeifer, C.; Voicu, N. Cosmological Finsler Spacetimes. Universe 2020, 6, 65. [Google Scholar] [CrossRef]
Kouretsis, A.P.; Stathakopoulos, M.; Stavrinos, P.C. The General Very Special Relativity in Finsler Cosmology. Phys. Rev. D 2009, 79, 104011. [Google Scholar] [CrossRef]
Li, X.; Chang, Z. Exact solution of vacuum field equation in Finsler spacetime. Phys. Rev. D 2014, 90, 064049. [Google Scholar] [CrossRef]
Stavrinos, P.; Vacaru, O.; Vacaru, S. Modified Einstein and Finsler Like Theories on Tangent Lorentz Bundles. Int. J. Mod. Phys. D 2014, 23, 1450094. [Google Scholar] [CrossRef]
Bernal, A.N.; Javaloyes, M.A.; Sánchez, M. Foundations of Finsler Spacetimes from the Observers’ Viewpoint. Universe 2020, 6, 55. [Google Scholar] [CrossRef]
Bogoslovsky, G. A special-relativistic theory of the locally anisotropic space-time. Il Nuovo C. B Ser. 1977, 40, 99–115. [Google Scholar] [CrossRef]
Cohen, A.G.; Glashow, S.L. Very special relativity. Phys. Rev. Lett. 2006, 97, 021601. [Google Scholar] [CrossRef]
Fuster, A.; Pabst, C. Finsler pp-waves. Phys. Rev. D 2016, 94, 104072. [Google Scholar] [CrossRef]
Gibbons, G.W.; Gomis, J.; Pope, C.N. General very special relativity is Finsler geometry. Phys. Rev. D 2007, 76, 081701. [Google Scholar] [CrossRef]
Kostelecký, V.A. Riemann-Finsler geometry and Lorentz-violating kinematics. Phys. Lett. B 2011, 701, 137–143. [Google Scholar] [CrossRef]
Fuster, A.; Heefer, S.; Pfeifer, C.; Voicu, N. On the non metrizability of Berwald Finsler spacetimes. Universe 2020, 6, 64. [Google Scholar] [CrossRef]
Fuster, A.; Pabst, C.; Pfeifer, C. Berwald spacetimes and very special relativity. Phys. Rev. D 2018, 98, 084062. [Google Scholar] [CrossRef]
Javaloyes, M.A.; Sánchez, M. Finsler metrics and relativistic spacetimes. Int. J. Geom. Methods Mod. Phys. 2014, 11, 1460032. [Google Scholar] [CrossRef]
Hohmann, M.; Pfeifer, C.; Voicu, N. Finsler gravity action from variational completion. Phys. Rev. D 2019, 100, 064035. [Google Scholar] [CrossRef]
Hohmann, M.; Pfeifer, C.; Voicu, N. Finsler-based field theory—A mathematical foundation. arXiv 2021, arXiv:2106.14965v1. [Google Scholar]
Javaloyes, M.A.; Sánchez, M.; Villasenor, F.F. The Einstein-Hilbert-Palatini formalism in pseudo-Finsler geometry. arXiv 2021, arXiv:2108.03197v2. [Google Scholar]
Rund, H. A divergence theorem for Finsler metrics. Monatshefte Math. 1975, 79, 233–252. [Google Scholar] [CrossRef]
Minguzzi, E. A divergence theorem for pseudo-Finsler spaces. Rep. Math. Phys. 2017, 80, 307–315. [Google Scholar] [CrossRef][Green Version]
Javaloyes, M.A.; Sánchez, M.; Villasenor, F.F. Anisotropic connections and parallel transport in Finsler spacetimes. arXiv 2021, arXiv:2107.05986. [Google Scholar]
Javaloyes, M.A. Anisotropic tensor calculus. Int. J. Geom. Methods Mod. Phys. 2019, 16, 1941001. [Google Scholar] [CrossRef]
Javaloyes, M.A. Curvature computations in Finsler Geometry using a distinguished class of anisotropic connections. Mediterr. J. Math. 2020, 17, 123. [Google Scholar] [CrossRef]
Javaloyes, M.A.; Sanchez, M. On the definition and examples of cones and Finsler spacetimes. Rev. Real Acad. Cienc. Exactas Fís. Nat. Ser. Matemáticas 2020, 114, 30. [Google Scholar] [CrossRef]
Schutz, B.F. A First Course in General Relativity, 2nd ed.; Cambridge University Press: New York, NY, USA, 2009. [Google Scholar]
Landau, L.D.; Lifshitz, E.E. The Classical Theory of Fields, 3rd ed.; Pergamon Press Ltd.: Headington Hill Hall, UK, 1971. [Google Scholar]
Wald, R.M. General Relativity; University of Chicago Press: Chicago, IL, USA, 1984. [Google Scholar]
Carroll, S.M. Spacetime and Geometry: An Introduction to General Relativity; Addison-Wesley: San Francisco, CA, USA, 2004. [Google Scholar]
Misner, C.W.; Thorne, K.S.; Wheler, J.A. Gravitation; W. H. Freeman: San Francisco, CA, USA, 1973. [Google Scholar]
Pfeifer, C.; Wohlfarth, M. Causal structure and electrodynamics on Finsler space-times. Phys. Rev. D 2020, 84, 044039. [Google Scholar] [CrossRef]
Gotay, M.J.; Marsden, J.E. Stress-Energy-Momentum Tensors and the Belinfante-Rosenfeld Formula. Contemp. Math. 1992, 132, 367–392. [Google Scholar]
Lee, J.M. Introduction to Smooth Manifolds; Springer: Berlin/Heidelberg, Germany, 2012. [Google Scholar]
Shen, Z. Lectures on Finsler Geometry; World Scientific: Singapore, 2001. [Google Scholar]
Herrera, J.; Javaloyes, M.A.; Piccione, P. On a monodromy theorem for sheaves of local fields and applications. Rev. Real Acad. Cienc. Exactas, Fís. Nat. Ser. Matemáticas 2017, 111, 999–1029. [Google Scholar] [CrossRef]
Dragomir, S.; Larato, B. Harmonic functions on Finsler spaces. Instanbul Univ. Sci. Fac. J. Math. Phys. Astron. 1991, 48, 67–76. [Google Scholar]
Mbatakou, J.S.; Todjihounde, L. Conformal change of Finsler-Ehresmann connections. Appl. Sci. 2014, 16, 32–47. [Google Scholar]
Nibaruta, G.; Degla, S.; Todjihounde, L. Finslerian Ricci deformation and conformal metrics. J. Appl. Math. Phys. 2018, 6, 1522–1536. [Google Scholar] [CrossRef][Green Version]
Nibaruta, G.; Nibirantiza, A.; Karimumuryango, M.; Ndayirukiye, D. Divergence lemma and Hopf’s theorem on Finslerian slit tangent bundle. Balk. J. Geom. Its Appl. 2020, 25, 93–103. [Google Scholar]
Shen, Z. Differential Geometry of Spray and Finsler Spaces; Kluwer Academic Publishers: Dordrecht, The Netherlands, 2001. [Google Scholar]
Bernal, A.N.; Sanchez, M. Further results on the smoothability of Cauchy hypersurfaces and Cauchy temporal functions. Lett. Math. Phys. 2006, 77, 183–197. [Google Scholar] [CrossRef]
Lammërzahl, C.; Perlick, V. Finsler geometry as a model for relativistic gravity. Int. J. Geom. Methods Mod. Phys. 2018, 15 (Suppl. 1), 1850166. [Google Scholar] [CrossRef]
Pfeifer, C. Finsler spacetime geometry in Physics. Int. J. Geom. Methods Mod. Phys. 2019, 16 (Suppl. 2), 1941004. [Google Scholar] [CrossRef]

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

On the Significance of the Stress–Energy Tensor in Finsler Spacetimes

Abstract

1. Introduction

2. Preliminaries and Setup

2.1. Anisotropic Tensors

2.2. Nonlinear and Anisotropic Connections

2.3. Lorentz–Finsler Metrics

3. Basic Interpretations on the Stress–Energy Tensor $T$

3.1. Particles and Dusts: Anisotropic Picture of Isotropic Elements

3.2. Emergence of an Anisotropic Stress–Energy Tensor

3.3. Lagrangian Viewpoint

4. Divergence of Anisotropic Vector Fields

4.1. Mathematical Formalism of the Anisotropic Lie Bracket

4.2. Lie Bracket Definition of Divergence

4.3. Divergence Theorem and Boundary Term Representations

5. Divergence of Anisotropic Tensor Fields

5.1. Definition of Divergence with the Chern Connection

5.2. Chern vs. Berwald

5.3. Finslerian Conservation Laws and Main Examples

5.3.1. Example: Lorentz Norms on an Affine Space

5.3.2. Example: Cauchy Hypersurfaces in a Finsler Spacetime

6. Conclusions

Author Contributions

Funding

Conflicts of Interest

Appendix A. Kinematics: Observers and Relative Velocities

Appendix A.1. The Lorentz Metric g Π up to a Constant

Appendix A.2. Simple Relative Velocity

Appendix A.3. Velocity as a Distance between Observers

Appendix A.4. Length-Contraction and Velocity

Appendix A.5. Symmetric Lorentz Velocities in Π

Notes

References

Article Metrics

Citations

Article Access Statistics

On the Significance of the Stress–Energy Tensor in Finsler Spacetimes

Abstract

1. Introduction

2. Preliminaries and Setup

2.1. Anisotropic Tensors

2.2. Nonlinear and Anisotropic Connections

2.3. Lorentz–Finsler Metrics

3. Basic Interpretations on the Stress–Energy Tensor T

3.1. Particles and Dusts: Anisotropic Picture of Isotropic Elements

3.2. Emergence of an Anisotropic Stress–Energy Tensor

3.3. Lagrangian Viewpoint

4. Divergence of Anisotropic Vector Fields

4.1. Mathematical Formalism of the Anisotropic Lie Bracket

4.2. Lie Bracket Definition of Divergence

4.3. Divergence Theorem and Boundary Term Representations

5. Divergence of Anisotropic Tensor Fields

5.1. Definition of Divergence with the Chern Connection

5.2. Chern vs. Berwald

5.3. Finslerian Conservation Laws and Main Examples

5.3.1. Example: Lorentz Norms on an Affine Space

5.3.2. Example: Cauchy Hypersurfaces in a Finsler Spacetime

6. Conclusions

Author Contributions

Funding

Conflicts of Interest

Appendix A. Kinematics: Observers and Relative Velocities

Appendix A.1. The Lorentz Metric g Π up to a Constant

Appendix A.2. Simple Relative Velocity

Appendix A.3. Velocity as a Distance between Observers

Appendix A.4. Length-Contraction and Velocity

Appendix A.5. Symmetric Lorentz Velocities in Π

Notes

References

Article Metrics

Citations

Article Access Statistics

3. Basic Interpretations on the Stress–Energy Tensor $T$