An Exterior Algebraic Derivation of the Euler–Lagrange Equations from the Principle of Stationary Action

Colombaro, Ivano; Font-Segura, Josep; Martinez, Alfonso

doi:10.3390/math9182178

by Ivano Colombaro *, Josep Font-Segura and Alfonso Martinez

Department of Information and Communication Technologies, Universitat Pompeu Fabra, 08018 Barcelona, Spain

* Author to whom correspondence should be addressed.

Mathematics 2021, 9(18), 2178; https://doi.org/10.3390/math9182178

Submission received: 4 August 2021 / Revised: 31 August 2021 / Accepted: 2 September 2021 / Published: 7 September 2021

(This article belongs to the Section E4: Mathematical Physics)


Abstract:

In this paper, we review two related aspects of field theory: the modeling of the fields by means of exterior algebra and calculus, and the derivation of the field dynamics, i.e., the Euler–Lagrange equations, by means of the stationary action principle. In contrast to the usual tensorial derivation of these equations for field theories, which gives separate equations for the field components, two related coordinate-free forms of the Euler–Lagrange equations are derived. These alternative forms of the equations, reminiscent of the formulae of vector calculus, are expressed in terms of vector derivatives of the Lagrangian density. The first form is valid for a generic Lagrangian density that only depends on the first-order derivatives of the field. The second form, expressed in exterior algebra notation, is specific to the case when the Lagrangian density is a function of the exterior and interior derivatives of the multivector field. As an application, a Lagrangian density for generalized electromagnetic multivector fields of arbitrary grade is postulated and shown to have, by taking the vector derivative of the Lagrangian density, the generalized Maxwell equations as Euler–Lagrange equations.

Keywords:

Euler–Lagrange equations; exterior algebra; exterior calculus; tensor calculus; action principle; Lagrangian; electromagnetism; Maxwell equations

MSC:

primary 37J05; secondary 15A75

1. Introduction

In classical mechanics, the action is a scalar quantity, with units of energy × time, that encodes the dynamical evolution of a given physical system; mathematically, the action is given by an integral functional of the trajectory or dynamical path (or an integral of the Lagrangian density for field theories) followed by the physical system over space-time. The principle of stationary action states that the actual dynamical path followed by the system, subject to some appropriate boundary constraints, possibly at infinity, corresponds to a stationary point of the action [1] (Ch. 19), [2] (Section 8). An application of the principle yields the Euler–Lagrange equations, which describe the dynamics of the system [3] (Section I.3), [4] (Section 3.1), [5] (Section 7.2). The historical development of the stationary-action principle, in essence a far-reaching generalization of Fermat’s principle that light follows the shortest-time path between two points, is described in detail in [6] (Section X).

This paper revisits the derivation of the Euler–Lagrange equations for field theories from the principle of stationary action from the point of view of exterior algebra and calculus. There exist several alternative mathematical representations for the fields, ranging from the original vector calculus by Gibbs [7] and Heaviside to geometric and Clifford algebras [8], where vectors are replaced by multivectors and operations such as the cross and the dot products are subsumed in the geometric product; a modern perspective on the use of geometric algebra in physics is given in [9]. Early in the 20th century, tensors such as the Faraday tensor in electromagnetism were quickly and almost universally adopted as the natural mathematical representation of fields in space-time [10] (pp. 135–144). In parallel, mathematicians such as Cartan generalized the fundamental theorems of vector calculus, i.e., those of Gauss, Green, and Stokes, by means of differential forms [11]. Later on, differential forms were used in Hamiltonian mechanics, e.g., to calculate trajectories as vector field integrals [12] (pp. 194–198).

Since differential forms may be seen as the circulation or flux over appropriate space-time regions of multivector fields, it may be preferable in some contexts to directly study the multivector fields. Therefore, we build our analysis on the exterior algebra originally developed by Grassmann [13], which has comparatively received little attention in the literature and leads to simple formulae that merge the simplicity and intuitiveness of standard vector calculus with the power of tensors and differential forms [14,15].

In Section 2, we provide the necessary background on exterior algebra and calculus, including the important notion of multivector-valued derivative with respect to a vector

v

. Then, we obtain in Section 3 two related coordinate-free forms of the Euler–Lagrange equations for the dynamics of a multivector field

a

of grade r as vector derivatives of the Lagrangian density

L

.

Our work is related to the geometric–algebraic multivectorial formulation of the Euler–Lagrange equations in [16] (Equations (4.7) and (4.8)). The first form in (39) is valid for a generic Lagrangian density that only depends on the first-order derivatives of the field, more specifically on the tensor derivative

\partial \otimes a

in (27), and is given by

\partial_{a} ℒ = \partial \times (\partial_{\partial \otimes a} ℒ),

(1)

as a function of the vector and matrix derivatives

\partial_{a} ℒ

and

\partial_{\partial \otimes a} ℒ

in (28) and (29), respectively. The

(k + n)

-dimensional differential operator ∂ is defined in (20); together with the matrix product × defined in (19), the operation in the right-hand side generalizes the concept of the divergence of a field. The second form (47), expressed in exterior algebra notation, is specific to the case when the Lagrangian density depends only on exterior (denoted by

\partial \land

; see (21)) and interior derivatives (denoted by

\partial ⨼

; see (22)) of the multivector field, and is given by

\partial_{a} ℒ = {(- 1)}^{r - 1} \partial ⨼ (\partial_{\partial \land a} ℒ) + {(- 1)}^{r} \partial \land (\partial_{\partial ⨼ a} ℒ),

(2)

where r is the grade of the multivector field

a

. A complementary analysis, which shows the invariance of the action to infinitesimal space-time translations in exterior algebra, was conducted in [17], where the stress–energy–momentum tensor is evaluated and discussed at length. We conclude the paper in Section 4 with an application of our analysis to a Lagrangian density for generalized electromagnetic multivector fields that leads, by directly taking the vector derivative of the Lagrangian density, to the generalized Maxwell equations for multivector fields of grade r [15]. We also provide a short discussion, of independent interest, of a dual form of the Maxwell equations where the exterior derivative is replaced by the interior derivative in the definition of the field from the potential.

2. Fundamentals of Exterior Algebra and Calculus: Notation, Definitions, and Operations

2.1. Multivector Fields

While our space-time has four space-time dimensions in relativistic terms, it will prove convenient to consider a generic flat space-time

R^{k + n}

with k temporal dimensions and n spatial dimensions, as this generality allows for a more natural description of the underlying algebraic structure of the equations and of their derivations. Points and position in space-time are denoted by

x

, with components

x_{i}

in the canonical basis

{e_{i}}_{i = 0}^{k + n - 1}

; by convention, the first k indices, i.e.,

i = 0, \dots, k - 1

, correspond to time components while the indices

i = k, \dots, k + n - 1

represent space components. We let space and time coordinates have the same units. Although we shall not make use of this fact, space-time vectors transform contravariantly under changes of coordinates.

In exterior algebra, one considers vector spaces whose basis elements

e_{I}

are indexed by lists

I = (i_{1}, \dots, i_{m})

drawn from

I_{m}

, the set of all ordered lists with m nonrepeated indices, with

m \in I = {0, 1, \dots, k + n}

. Later on, in (6), we express the basis elements

e_{I}

in terms of the vectorial canonical basis

e_{i}

, for an ordered list

i_{1}, \dots, i_{m}

. These vectors, which we identify with fields, live in the tangent space and transform covariantly under changes of coordinates [18] (Ch. 2), [19] (Ch. V). We refer to elements of these vector field spaces as multivector fields of grade m. While multivector fields do not cover all relevant physical models, e.g., spinor fields or the tensor field in general relativity, they do model a number of interesting cases; for instance, a scalar field is represented by multivectors of grade 0, the electric field, the electromagnetic vector potential and source current by multivectors of grade 1, and the electromagnetic field by a multivector of grade 2. A multivector field

a (x)

of grade m, possibly a function of the position

x

, with components

a_{I} (x)

in the canonical basis

{e_{I}}_{I \in I_{m}}

can be written as

a (x) = \sum_{I \in I_{m}} a_{I} (x) e_{I} .

(3)

We denote by

gr (a)

the operation that returns the grade of a vector

a

and by

| I |

the length of a list I. The dimension of the vector space of all grade m multivectors is

(\binom{k + n}{m})

, the number of lists in

I_{m}

.
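To make the indexing concrete, the following minimal Python sketch enumerates the ordered index lists in I_m and checks that their number matches the binomial coefficient stated above. The function name `index_lists` and the example values k = 1, n = 3 are illustrative assumptions, not part of the paper's notation.

```python
from itertools import combinations
from math import comb

def index_lists(dim, m):
    """All ordered lists of m nonrepeated indices drawn from {0, ..., dim - 1}."""
    return list(combinations(range(dim), m))

# Assumed example: one time dimension and three space dimensions.
k, n = 1, 3
dim = k + n
for m in range(dim + 1):
    lists = index_lists(dim, m)
    # The number of basis elements of grade m equals binom(k + n, m).
    assert len(lists) == comb(dim, m)
    print(m, len(lists), lists)
```

Each tuple returned by `index_lists` plays the role of one list I labeling a basis element e_I; the empty tuple labels the unit scalar.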

2.2. Operations on Index Lists

As the basis elements of multivector fields are indexed by lists I, it proves convenient to define some basic operations on such lists: permutations and their signatures, concatenations (mergers), and subtractions of lists.

First of all, if the list I is not ordered, let

σ (I)

denote the signature of the permutation sorting the elements of I in increasing order. If the permutation is even (resp. odd), the signature is

+ 1

(resp.

- 1

). If the list I contains repeated indices, its signature is 0.

More generally, for two index lists I and J with respective lengths

m = | I |

and

m^{'} = | J |

, let

(I, J) = {i_{1}, \dots, i_{m}, j_{1}, \dots, j_{m^{'}}}

be the concatenation of the two index lists I and J. We let

σ (I, J)

denote the signature of the permutation sorting the concatenated list of

| I | + | J |

indices, and let

I + J

, or

ε (I, J)

if the notation

I + J

is ambiguous in a given context, denote the sorted concatenated list, which we refer to as the merged list.

In general, we view the lists as ordered sets, and apply standard operations on sets to the lists. For instance, if I is contained in J, the list

J \ I

is the result of removing from J all the elements in I, while keeping the order. As another example, we denote by

I^{c}

the complement of I, namely the ordered sequence of indices not included in I. We denote the empty list by ∅; it holds that

σ (\emptyset, K) = σ (K, \emptyset) = 1

for an ordered list K, and that

e_{\emptyset} = 1

.
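The list operations of this section can be sketched in a few lines of Python. The helper names (`signature`, `merge`, `remove`, `complement`) are hypothetical and chosen only for this illustration; lists are represented as tuples of integers.

```python
def signature(seq):
    """Sign of the permutation sorting seq; 0 if seq contains repeated indices."""
    if len(set(seq)) != len(seq):
        return 0
    inversions = sum(
        1
        for p in range(len(seq))
        for q in range(p + 1, len(seq))
        if seq[p] > seq[q]
    )
    return -1 if inversions % 2 else 1

def merge(I, J):
    """Merged list I + J: the sorted concatenation of I and J."""
    return tuple(sorted(I + J))

def remove(J, I):
    """J \\ I: the elements of J not in I, keeping the order."""
    return tuple(j for j in J if j not in I)

def complement(I, dim):
    """I^c: the ordered indices of {0, ..., dim - 1} not included in I."""
    return tuple(i for i in range(dim) if i not in I)

I, J = (0, 2), (1,)
print(signature(I + J))        # signature of the concatenated list (0, 2, 1)
print(merge(I, J))             # merged list
print(remove((0, 1, 2), J))    # list subtraction
print(complement(I, 4))        # complement in a 4-dimensional space-time
print(signature(() + (1, 3)))  # empty list concatenated with an ordered list
```

Counting inversions is a simple way to obtain the parity of the sorting permutation; it is quadratic in the list length, which is adequate for the short lists that appear here.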

2.3. Operations on Multivectors

We next define several operations acting on multivectors; our presentation loosely follows [14] (Sections 2 and 3) and [15] (Section 2) and is close in spirit and form to vector calculus. Introductions to exterior algebra from the perspective and language of differential forms can be found in [18,19]. A geometric algebra perspective can be found in [9]. With no real loss of generality, we define the operations only for the canonical basis vectors, the operation acting on general multivectors being a mere extension by linearity of the former.

First, the dot product · of two arbitrary grade m basis vectors

e_{I}

and

e_{J}

is defined as

e_{I} \cdot e_{J} = Δ_{I J} = Δ_{i_{1} j_{1}} Δ_{i_{2} j_{2}} \dots Δ_{i_{m} j_{m}},

(4)

where I and J are the ordered lists

I = (i_{1}, i_{2}, \dots, i_{m})

and

J = (j_{1}, j_{2}, \dots, j_{m})

and

Δ_{i j} = 0

if

i \neq j

, and we let time unit vectors

e_{i}

have negative metric

Δ_{i i} = - 1

and space unit vectors

e_{i}

have positive metric

Δ_{i i} = + 1

. When

m = 0

, we interpret the dot product in (4) as 1 since

e_{\emptyset} = 1

.

The following operations to be defined are the interior and exterior products, which subsume and generalize the operations of gradient, curl, and divergence of vector calculus to multivector fields. These operations transform pairs of multivectors into a multivector of a different grade, introducing in the process some signs, i.e.,

\pm 1

. When these signs are related to the dot product in (4), we explicitly write the signs as quantities such as

Δ_{I J}

. Other sign contributions arise from the signatures of permutations ordering lists of indices. A common practice in the literature to deal with these signatures is to write factors such as

{(- 1)}^{| I | + | J |}

. However, it seems more convenient to explicitly keep track of the lists and write the permutation associated to this factor, e.g.,

σ (I, J)

, as clearer connections between different formulae can be established by harnessing the power of group theory for permutations.

Let two basis vectors

e_{I}

and

e_{J}

have grades

m = | I |

and

m^{'} = | J |

. As defined in Section 2.2, let

(I, J) = {i_{1}, \dots, i_{m}, j_{1}, \dots, j_{m^{'}}}

be the concatenation of the two index lists I and J, and let

σ (I, J)

denote the signature of the permutation sorting the elements of this concatenated list.

Then, the exterior product of

e_{I}

and

e_{J}

is defined as

e_{I} \land e_{J} = σ (I, J) e_{I + J} .

(5)

The exterior product is thus either zero or a multivector of grade

| I | + | J |

, since

σ (I, J) = 0

when the lists I and J have elements in common. The unit scalar (multivector of grade 0) is an identity of the exterior product, as

1 \land e_{I} = e_{I} \land 1 = e_{I}

. The exterior product provides a construction of the basis vector

e_{I}

, with I an ordered list

I = (i_{1}, \dots, i_{m})

, from the canonical basis vectors

e_{i}

, namely

e_{I} = e_{i_{1}} \land e_{i_{2}} \land \dots \land e_{i_{m}} .

(6)

When

I = \emptyset

, we adopt the usual convention that the right-hand side is 1.
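A minimal sketch of the exterior product (5) on basis elements follows, together with the construction (6) of e_I from the canonical basis vectors. A basis element is represented by its index tuple and a product by a pair (sign, index tuple); the names `wedge` and `basis_from_vectors` are assumptions made for this example.

```python
def signature(seq):
    """Sign of the permutation sorting seq; 0 if seq contains repeated indices."""
    if len(set(seq)) != len(seq):
        return 0
    inversions = sum(
        1
        for p in range(len(seq))
        for q in range(p + 1, len(seq))
        if seq[p] > seq[q]
    )
    return -1 if inversions % 2 else 1

def wedge(I, J):
    """Return (sign, K) such that e_I ^ e_J = sign * e_K, following (5)."""
    sign = signature(I + J)
    return (sign, tuple(sorted(I + J))) if sign else (0, ())

def basis_from_vectors(indices):
    """Build e_{i1} ^ e_{i2} ^ ... as in (6), returning (sign, index tuple)."""
    sign, K = 1, ()
    for i in indices:
        s, K = wedge(K, (i,))
        sign *= s
        if sign == 0:
            return 0, ()
    return sign, K

print(wedge((0,), (1,)))              # e_0 ^ e_1
print(wedge((1,), (0,)))              # e_1 ^ e_0 has the opposite sign
print(wedge((0,), (0, 2)))            # shared index: the product is zero
print(wedge((), (2,)))                # the unit scalar is an identity
print(basis_from_vectors((0, 2, 3)))  # ordered list: recovers e_I with sign +1
```

For an ordered list the construction returns sign +1, in agreement with (6); for an unordered list it returns the signature of the sorting permutation.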

We next define two generalizations of the dot product, the left and right interior products. Let

e_{I}

and

e_{J}

be two basis vectors of respective grades

| I |

and

| J |

. The left interior product, denoted by

⨼

, is defined as

e_{I} ⨼ e_{J} = \begin{cases} Δ_{I I} σ (J \ I, I) e_{J \ I}, & \text{if } I \subseteq J, \\ 0, & \text{otherwise}. \end{cases}

(7)

Although we might have overloaded the meaning of

σ (J \ I, I)

to be zero when

I ⊈ J

, we prefer to list the separate cases in (7). The vector

e_{J \ I}

has grade

| J | - | I |

and is indexed by the elements of J not in common with I. The use of the word left represents the fact that

e_{I}

acts from the left on

e_{J}

and removes the elements in I from J.

Analogously, the right interior product, denoted by

⌙

, of two basis vectors

e_{I}

and

e_{J}

is defined as

e_{J} ⌙ e_{I} = \begin{cases} Δ_{I I} σ (I, J \ I) e_{J \ I}, & \text{if } I \subseteq J, \\ 0, & \text{otherwise}. \end{cases}

(8)

As in the previous case, the use of the word right represents the fact that

e_{I}

acts from the right on

e_{J}

and removes the elements in I from J. The unit scalar (multivector of grade 0) acting from the left (resp. right) is an identity of the left (resp. right) interior product, as

1 ⨼ e_{I} = e_{I} ⌙ 1 = e_{I}

.

It proves instructive to evaluate the left and right interior products between two multivectors of the same grade, i.e., if

| I | = | J |

. From (7) and (8), and taking into account that

σ (\emptyset, K) = σ (K, \emptyset) = 1

for an ordered list K, and that

e_{\emptyset} = 1

, we see that

e_{I} ⨼ e_{J} = e_{J} ⌙ e_{I} = e_{I} \cdot e_{J}, if | I | = | J |,

(9)

supporting the idea that the interior products generalize the dot product. Both interior products are grade-lowering operations, as the interior product is either zero or a multivector of grade

| J | - | I |

.
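The two interior products (7) and (8) can be sketched on basis elements as follows. The sketch assumes that the first k indices are temporal, so that Δ_II is a product of factors -1 (time) and +1 (space); the function names and the choice k = 1 in the usage lines are illustrative assumptions.

```python
def signature(seq):
    """Sign of the permutation sorting seq; 0 if seq contains repeated indices."""
    if len(set(seq)) != len(seq):
        return 0
    inversions = sum(
        1
        for p in range(len(seq))
        for q in range(p + 1, len(seq))
        if seq[p] > seq[q]
    )
    return -1 if inversions % 2 else 1

def metric(I, k):
    """Delta_{II}: -1 for each time index (i < k), +1 for each space index."""
    sign = 1
    for i in I:
        sign *= -1 if i < k else 1
    return sign

def left_interior(I, J, k):
    """e_I acting from the left on e_J, as (coefficient, index tuple), following (7)."""
    if not set(I) <= set(J):
        return 0, ()
    rest = tuple(j for j in J if j not in I)
    return metric(I, k) * signature(rest + I), rest

def right_interior(J, I, k):
    """e_I acting from the right on e_J, as (coefficient, index tuple), following (8)."""
    if not set(I) <= set(J):
        return 0, ()
    rest = tuple(j for j in J if j not in I)
    return metric(I, k) * signature(I + rest), rest

k = 1  # assumed: index 0 is the single time index
print(left_interior((0,), (0, 1), k))    # removes index 0 from (0, 1) from the left
print(right_interior((0, 1), (0,), k))   # removes index 0 from (0, 1) from the right
print(left_interior((2,), (0, 1), k))    # I is not contained in J: zero
print(left_interior((0, 1), (0, 1), k))  # equal lists: reduces to the dot product (9)
```

In the equal-grade case the remaining list is empty and the coefficient is Δ_II for I = J and zero otherwise, which is the dot product of (4).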

Finally, we define the complement of a multivector. For a multivector

e_{I}

with grade m, its Grassmann or Hodge complement, denoted by

e_{I}^{H}

, is the unit

(k + n - m)

-vector

e_{I}^{H} = Δ_{I I} σ (I, I^{c}) e_{I^{c}},

(10)

where

I^{c}

is the complement of the list I and

σ (I, I^{c})

is the signature of the permutation sorting the elements of the concatenated list

(I, I^{c})

containing all space-time indices. In other words,

e_{I^{c}}

is the basis multivector of grade

k + n - m

whose indices are in the complement of I. In addition, we define the inverse complement transformation as

e_{I}^{H^{- 1}} = Δ_{I^{c} I^{c}} σ (I^{c}, I) e_{I^{c}} .

(11)

The interior products are not independent operations from the exterior product, as they can be expressed in terms of the latter, the Hodge complement and its inverse:

\begin{matrix} e_{I} ⨼ e_{J} = {(e_{I} \land e_{J}^{H})}^{H^{- 1}}, \end{matrix}

(12)

\begin{matrix} e_{J} ⌙ e_{I} = {(e_{J}^{H^{- 1}} \land e_{I})}^{H} . \end{matrix}

(13)

The vector calculus cross product between two vectors in

R^{3}

can be expressed in several alternative ways in terms of the interior and exterior products and the Hodge complement [14] (Equation (18)). This fact allows us to distinguish the various roles that the cross product takes in the Maxwell equations and lies at the origin of the generalized electromagnetism described by multivectors in generic flat space-time [15].
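The relations (12) and (13) between the interior products, the exterior product, and the Hodge complement can be checked exhaustively on basis elements for a small space-time. The sketch below implements (5), (7), (8), (10), and (11) directly; all function names are hypothetical, and the check is a brute-force verification on basis elements rather than a proof.

```python
from itertools import combinations

def signature(seq):
    """Sign of the permutation sorting seq; 0 if seq contains repeated indices."""
    if len(set(seq)) != len(seq):
        return 0
    inv = sum(1 for p in range(len(seq)) for q in range(p + 1, len(seq)) if seq[p] > seq[q])
    return -1 if inv % 2 else 1

def metric(I, k):
    """Delta_{II}: -1 for each time index (i < k), +1 for each space index."""
    sign = 1
    for i in I:
        sign *= -1 if i < k else 1
    return sign

def complement(I, dim):
    return tuple(i for i in range(dim) if i not in I)

def hodge(I, k, dim):
    """Hodge complement (10) as (coefficient, index tuple)."""
    Ic = complement(I, dim)
    return metric(I, k) * signature(I + Ic), Ic

def hodge_inv(I, k, dim):
    """Inverse complement (11) as (coefficient, index tuple)."""
    Ic = complement(I, dim)
    return metric(Ic, k) * signature(Ic + I), Ic

def wedge(I, J):
    sign = signature(I + J)
    return (sign, tuple(sorted(I + J))) if sign else (0, ())

def left_interior(I, J, k):
    if not set(I) <= set(J):
        return 0, ()
    rest = tuple(j for j in J if j not in I)
    return metric(I, k) * signature(rest + I), rest

def right_interior(J, I, k):
    if not set(I) <= set(J):
        return 0, ()
    rest = tuple(j for j in J if j not in I)
    return metric(I, k) * signature(I + rest), rest

def check_identities(k, n):
    """Check that (11) inverts (10) and that (12), (13) hold on all basis pairs."""
    dim = k + n
    lists = [c for m in range(dim + 1) for c in combinations(range(dim), m)]
    for I in lists:
        c1, Ic = hodge(I, k, dim)
        c2, back = hodge_inv(Ic, k, dim)
        if (c1 * c2, back) != (1, I):
            return False
        for J in lists:
            # Right-hand side of (12): (e_I ^ e_J^H)^{H^-1}
            c1, Jh = hodge(J, k, dim)
            c2, K = wedge(I, Jh)
            rhs = (c1 * c2 * hodge_inv(K, k, dim)[0], hodge_inv(K, k, dim)[1]) if c2 else (0, ())
            if rhs != left_interior(I, J, k):
                return False
            # Right-hand side of (13): (e_J^{H^-1} ^ e_I)^H
            c1, Jh = hodge_inv(J, k, dim)
            c2, K = wedge(Jh, I)
            rhs = (c1 * c2 * hodge(K, k, dim)[0], hodge(K, k, dim)[1]) if c2 else (0, ())
            if rhs != right_interior(J, I, k):
                return False
    return True

print(check_identities(1, 3))  # assumed example: (k, n) = (1, 3)
```

The exterior product in (12) is nonzero exactly when I has no index in common with the complement of J, that is, when I is contained in J, which matches the case distinction in (7).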

2.4. Matrix Vector Spaces

We do not need to consider general tensor fields but rather the matrix field (vector) space whose basis elements can be represented as

w_{I_{1}, I_{2}}

, where both

I_{1}

and

I_{2}

are ordered lists of nonrepeated

ℓ_{1}

and

ℓ_{2}

elements, respectively. We may identify these basis elements with the tensor product of two multivectors of grade

ℓ_{1}

and

ℓ_{2}

, namely

w_{I_{1}, I_{2}} = e_{I_{1}} \otimes e_{I_{2}} .

(14)

The dimension of the vector space spanned by these basis elements is

(\binom{k + n}{ℓ_{1}})

(\binom{k + n}{ℓ_{2}})

; the elements of this vector space can be identified with matrices

A

whose rows and columns are indexed by lists,

I_{1} \in I_{ℓ_{1}}

and

I_{2} \in I_{ℓ_{2}}

, respectively,

A = \sum_{I_{1} \in I_{ℓ_{1}}, I_{2} \in I_{ℓ_{2}}} A_{I_{1} I_{2}} w_{I_{1}, I_{2}} .

(15)

The transpose of a matrix element

w_{I_{1}, I_{2}}

, denoted as

w_{I_{1}, I_{2}}^{T}

, is defined as

w_{I_{2}, I_{1}}

. These matrices, the underlying vector space, and the operations that we describe next are fundamental in the study of changes of coordinates in space-time. However, consideration of these changes is beyond the scope of this paper. In any case, this short section provides a perspective on matrices from the point of view of exterior algebra, highlighting the connections between multivectors and matrices, and bypassing the standard introduction of tensor fields.

As we did with multivectors, we consider the dot product · of two arbitrary matrix basis elements

w_{I_{1}, I_{2}}

and

w_{J_{1}, J_{2}}

. This dot product is written

w_{I_{1}, I_{2}} \cdot w_{J_{1}, J_{2}} = Δ_{I_{1} J_{1}} Δ_{I_{2} J_{2}} .

(16)

The ordering within the pairs

(I_{1}, I_{2})

and

(J_{1}, J_{2})

is important in (16). When applied to two matrices, this dot product gives their Frobenius inner product; when a matrix is multiplied with itself, it gives the square of the Frobenius norm (also known as the Hilbert–Schmidt norm) [20].

We also define the matrix product × between two matrix basis elements

w_{I, J}

and

w_{K, L}

as

w_{I, J} \times w_{K, L} = w_{I, L} Δ_{J K},

(17)

an operation that coincides with the standard product of two matrices for matrices labeled by spatial indices. For square matrices

A

indexed by grade m multivectors, it is natural to define the matrix inverse (whenever the inverse exists), denoted as

A^{- 1}

, such that

A^{- 1} \times A = I_{m} = A \times A^{- 1}

, where the grade ℓ square identity matrix, denoted by

I_{ℓ}

, is given by

I_{ℓ} = \sum_{I \in I_{ℓ}} Δ_{I I} w_{I, I} .

(18)

Last, we define the matrix product × between a matrix

w_{I, J}

and a multivector

e_{K}

(or between a multivector

e_{K}

and the matrix

w_{J, I}

, i.e., the transpose of

w_{I, J}

) as

w_{I, J} \times e_{K} = e_{K} \times w_{J, I} = e_{I} Δ_{J K},

(19)

a generalization of the idea of multiplication of a row (or column) vector by a matrix.
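As a small illustration of the matrix product (17) and the identity matrix (18), the sketch below stores a matrix as a dictionary mapping pairs of index lists (I, J) to coefficients and checks that the identity of (18) leaves a matrix unchanged under the product ×. The dictionary representation, the function names, and the sample matrix are assumptions made for this example.

```python
from itertools import combinations

def metric(I, k):
    """Delta_{II}: -1 for each time index (i < k), +1 for each space index."""
    sign = 1
    for i in I:
        sign *= -1 if i < k else 1
    return sign

def delta(I, J, k):
    """Delta_{IJ}: Delta_{II} if the lists coincide, zero otherwise."""
    return metric(I, k) if I == J else 0

def mat_mul(A, B, k):
    """A x B for matrices stored as {(I, J): coefficient}, extending (17) by linearity."""
    out = {}
    for (I, J), a in A.items():
        for (K, L), b in B.items():
            d = delta(J, K, k)
            if d:
                out[(I, L)] = out.get((I, L), 0) + a * b * d
    return {key: value for key, value in out.items() if value != 0}

def identity(grade, k, dim):
    """Identity matrix (18): sum over I of Delta_{II} w_{I,I}."""
    return {(I, I): metric(I, k) for I in combinations(range(dim), grade)}

k, dim = 1, 3  # assumed example: one time and two space dimensions
A = {((0,), (1,)): 2, ((0,), (0,)): 3, ((2,), (1,)): -1}
Id = identity(1, k, dim)
print(mat_mul(Id, A, k) == A)
print(mat_mul(A, Id, k) == A)
```

The factor Δ_II in (18) compensates the factor Δ_JK produced by (17), which is why the identity carries a sign -1 on its temporal entries.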

2.5. Exterior and Matrix Calculus

In vector calculus, extensive use is made of the partial time derivative,

\partial_{t}

, and the nabla operator ∇ of partial space derivatives. In our case, we need their generalization to

(k, n)

space-time, the differential vector operator ∂, defined as

(- \partial_{0}, - \partial_{1}, \dots, - \partial_{k - 1}, \partial_{k}, \dots, \partial_{k + n - 1})

, that is,

\partial = \sum_{i \in I} Δ_{i i} e_{i} \partial_{i} .

(20)

As was done in [14] (Section 3), we define the exterior derivative,

\partial \land a

, of a given multivector field

a

of grade m as

\partial \land a = \sum_{i \in I, I \in I_{m} : i \notin I} Δ_{i i} σ (i, I) \partial_{i} a_{I} e_{i + I} .

(21)

The grade of the exterior derivative of

a

is

m + 1

, unless

m = k + n

, in which case the exterior derivative is zero. In addition, we define the interior derivative,

\partial ⨼ a

, of

a

as

\partial ⨼ a = \sum_{i \in I, I \in I_{m} : i \in I} σ (I \ i, i) \partial_{i} a_{I} e_{I \ i} .

(22)

The grade of the interior derivative of

a

is

m - 1

, unless

m = 0

, in which case the interior derivative is zero.

The formulae for the exterior and interior derivatives allow us to recover some standard formulae in vector calculus. For a scalar function

ϕ

, its gradient is given by its exterior derivative

\nabla ϕ = \partial \land ϕ

, while for a vector field

v

, its divergence

\nabla \cdot v

is given by its interior derivative

\nabla \cdot v = \partial ⨼ v

.
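As a quick check of (21) and (22), the following worked example writes both derivatives out for a 1-vector field in three-dimensional Euclidean space. It assumes k = 0 and n = 3, so that every Δ_ii = +1, and labels the components v_0, v_1, v_2 following the index convention of Section 2.1; these choices are made only for illustration.

```latex
% Assumed setting: k = 0, n = 3, v = v_0 e_0 + v_1 e_1 + v_2 e_2.
% Exterior derivative (21): sigma(i, j) = +1 if i < j and -1 if i > j.
\partial \land v = (\partial_0 v_1 - \partial_1 v_0) e_{01}
                 + (\partial_0 v_2 - \partial_2 v_0) e_{02}
                 + (\partial_1 v_2 - \partial_2 v_1) e_{12}
% Interior derivative (22): I = (i), so I \ i is the empty list and sigma = 1.
\partial ⨼ v = \partial_0 v_0 + \partial_1 v_1 + \partial_2 v_2
```

The interior derivative is the divergence of v, while the three components of the exterior derivative coincide, up to ordering and sign, with the components of the curl.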

Also, for a vector field

v

in

R^{3}

, taking into account [14] (Equation (18)), the curl can be variously expressed as

\nabla \times v = {(\nabla \land v)}^{H^{- 1}} = \nabla ⨼ v^{H^{- 1}} = \nabla ⨼ v^{H}

, thereby generalizing both the cross product and the curl to grade m vector fields in space-time algebras with different dimensions. Specific vector calculus formulae such as that for the divergence of a gradient or the curl of the curl of a vector can be seen as instances of general exterior calculus formulae such as [14] (Equation (38)) and [15] (Equation (35)),

\begin{matrix} \partial ⨼ (a \land b) = a (\partial \cdot b) - (\partial \cdot a) b, \end{matrix}

(23)

\begin{matrix} \partial \cdot (a ⨼ b) = (\partial \land a) \cdot b + {(- 1)}^{gr (a)} (\partial ⨼ b) \cdot a, \end{matrix}

(24)

where in (23),

a

and

b

are 1-vectors, while in (24),

a

and

b

are an

(s - 1)

-vector and an s-vector, respectively.

The exterior and interior derivatives satisfy the property

\partial \land (\partial \land a) = 0 = \partial ⨼ (\partial ⨼ a)

, for a general twice-differentiable multivector field

a

. These identities imply the well-known facts that the curl of the gradient and the divergence of the curl are zero.

The circulation

C (a, V^{ℓ})

and the flux

F (a, V^{ℓ})

of a multivector field

a

over an ℓ-dimensional space-time hypervolume

V^{ℓ}

are defined as integrals of interior products of the field with infinitesimal integration volumes:

\begin{matrix} C (a, V^{ℓ}) & = \int_{V^{ℓ}} d^{ℓ} x ⌙ a, \end{matrix}

(25)

\begin{matrix} F (a, V^{ℓ}) & = \int_{V^{ℓ}} d^{ℓ} x^{H^{- 1}} ⨼ a . \end{matrix}

(26)

As a specific example for (26), the flux of a field over an

(k + n)

-dimensional hypervolume is the volume integral of the field. For both of these operations, the interior product in the integrand is expressed as a differential form, which allows us to invoke the theory of differential forms to prove a Stokes theorem. This Stokes theorem relates the circulation (resp. the flux) of the field over the boundary of some hypervolume to the circulation (resp. flux) over the same hypervolume of the exterior (resp. interior) derivative of the multivector field.

We also define the tensor derivative,

\partial \otimes a

, of a given multivector field

a

of grade m as

\partial \otimes a = \sum_{i \in I, I \in I_{m}} Δ_{i i} \partial_{i} a_{I} w_{i, I},

(27)

where

w_{i, I}

is a matrix vector space basis element.

To conclude this section, we define a derivative operator with respect to an element of a vector space, e.g., a multivector field or a matrix. A relevant example of vector derivative operator is ∂, where the derivative is taken with respect to the position vector

x

. In general, the vector derivative operator with respect to a multivector field

a

of grade m (resp. matrix

A

of dimensions

ℓ_{1} \times ℓ_{2}

) is a multivector field (resp. matrix) denoted by

\partial_{a}

(resp.

\partial_{A}

) [21] and given by

\begin{matrix} \partial_{a} & = \frac{\partial}{\partial a} = \sum_{I \in I_{m}} Δ_{I I} e_{I} \frac{\partial}{\partial a_{I}}, \end{matrix}

(28)

\begin{matrix} \partial_{A} & = \frac{\partial}{\partial A} = \sum_{I \in I_{ℓ_{1}}, J \in I_{ℓ_{2}}} Δ_{I I} Δ_{J J} w_{I, J} \frac{\partial}{\partial A_{I, J}} . \end{matrix}

(29)

Specifically, we shall later need the exterior vector derivative of a scalar function

g (x)

, denoted by

\partial_{a} \land g (x)

or with some abuse of notation simply by

\partial_{a} g (x)

, and given by

\begin{matrix} \partial_{a} \land g (x) = \partial_{a} g (x) = \sum_{I \in I_{m}} Δ_{I I} e_{I} \frac{\partial g (x)}{\partial a_{I}}, \end{matrix}

(30)

and similarly for the matrix derivative. This exterior vector derivative is thus some form of generalized gradient. We shall need the derivative of a scalar function given by a quadratic form in the field and/or its interior or exterior derivatives. Let

a

and

b

represent two vectors of the same grade. Evaluation of the vector derivatives is straightforward and coincides with the infinitesimal calculus expressions [21]:

\begin{matrix} \partial_{a} (a \cdot a) = 2 a \end{matrix}

(31)

\begin{matrix} \partial_{a} (a \cdot b) = b . \end{matrix}

(32)
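The derivatives (31) and (32) can be checked numerically with central differences. In the sketch below a multivector of fixed grade is stored as a dictionary from index lists to components, the dot product is (4) extended by linearity, and the vector derivative follows (30); the helper names, the step size h, and the sample components are assumptions made for this illustration.

```python
def metric(I, k):
    """Delta_{II}: -1 for each time index (i < k), +1 for each space index."""
    sign = 1
    for i in I:
        sign *= -1 if i < k else 1
    return sign

def dot(a, b, k):
    """Dot product (4) extended by linearity; a and b share the same index lists."""
    return sum(metric(I, k) * a[I] * b[I] for I in a)

def vector_derivative(g, a, k, h=1e-6):
    """Central-difference estimate of (30): sum over I of Delta_II e_I dg/da_I."""
    result = {}
    for I in a:
        up, down = dict(a), dict(a)
        up[I] += h
        down[I] -= h
        result[I] = metric(I, k) * (g(up) - g(down)) / (2 * h)
    return result

k = 1  # assumed: index 0 is temporal, indices 1 and 2 are spatial
a = {(0,): 1.5, (1,): -2.0, (2,): 0.5}
b = {(0,): 0.25, (1,): 3.0, (2,): -1.0}

grad_aa = vector_derivative(lambda x: dot(x, x, k), a, k)
grad_ab = vector_derivative(lambda x: dot(x, b, k), a, k)

# Compare with the closed forms 2a of (31) and b of (32).
print(max(abs(grad_aa[I] - 2 * a[I]) for I in a))
print(max(abs(grad_ab[I] - b[I]) for I in a))
```

The factor Δ_II in the definition of the vector derivative cancels the factor Δ_II coming from the dot product, so the results carry no metric signs, as in ordinary calculus.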

3. Principle of Stationary Action: Derivation of the Euler–Lagrange Equations

3.1. General Case: Lagrangian Dependent on the Tensor Derivative

As we briefly reviewed in the Introduction, in classical mechanics, one defines the action

𝒮

as a scalar quantity, with units of energy × time, that encodes the dynamical evolution of a physical system. Mathematically, the action

𝒮

is an integral functional of the trajectory or path over space-time, or of the Lagrangian density

ℒ (x)

for field theories, followed by the physical system. The principle of stationary action states that the path actually followed by the system, e.g., the field dynamics, corresponds to a stationary point of the action [1] (Ch. 19), [2] (Section 8).

In general, the application of the principle of stationary action gives the Euler–Lagrange equations, which describe the dynamics of the system [3] (Section I.3), [4] (Section 3.1), [5] (Section 7.2). We start by reviewing how to obtain these equations in coordinate-free form with tensorial notation. Unlike the usual approach, which gives the dynamics for the individual components of the field, our coordinate-free derivation works directly with a twice-differentiable multivector field

a

of grade s and its tensor derivative

\partial \otimes a

in (27).

For a given region

R

that comprises the physical system under consideration, let the action

𝒮 (a)

be given by

𝒮 (a) = \int_{R} d^{k + n} x ℒ (a, \partial \otimes a) .

(33)

We assume that the region

R

is large enough to make the physical system closed, and that the fields decay fast enough over

R

so that the flux of the fields over the boundary of

R

is negligible. We note that the Lagrangian density

ℒ

is a real-valued function of the

(\binom{k + n}{s})

components of

a

and the

(k + n) (\binom{k + n}{s})

components of

\partial \otimes a

, and the Lagrangian density does not depend explicitly on the space-time components.

Let the field

a

be infinitesimally perturbed by an amount

a_{ε}

, possibly dependent on the space-time coordinates, so that the field is transformed as

a \to a + a_{ε}

and the tensor derivative is transformed as

\partial \otimes a \to \partial \otimes a + \partial \otimes a_{ε}

. We assume that

a_{ε}

is twice differentiable. We can expand the Lagrangian density in a first-order multivariate Taylor series, where the matrix of partial derivatives with respect to the

(k + n + 1) (\binom{k + n}{s})

variables is a block matrix having along the diagonal the vector derivatives

\partial_{a} ℒ

and

\partial_{\partial \otimes a} ℒ

of the density

ℒ

with respect to the field

a

and its tensorial derivative

\partial \otimes a

, defined in (28) and (29), respectively. Neglecting terms of second and higher order in the perturbation

a_{ε}

and grouping terms in the Taylor series yields

𝒮 (a + a_{ε}) = \int_{R} d^{k + n} x (ℒ + (\partial_{a} ℒ) \cdot a_{ε} + (\partial_{\partial \otimes a} ℒ) \cdot (\partial \otimes a_{ε})) .

(34)

We may thus evaluate the first-order change of action

δ 𝒮

as

δ 𝒮 = 𝒮 (a + a_{ε}) - 𝒮 (a) = \int_{R} d^{k + n} x ((\partial_{a} ℒ) \cdot a_{ε} + (\partial_{\partial \otimes a} ℒ) \cdot (\partial \otimes a_{ε})),

(35)

always neglecting all the contributions of order

{(a_{ε})}^{2}

or higher in the action change.

Next, we note the following Leibniz product rule, an equality between scalar quantities proved in Appendix A.1, for a multivector field

a

and a matrix field

B

with basis

w_{i, I}

, involving the product × defined in (19),

\partial \cdot (B \times a) = (\partial \times B) \cdot a + B \cdot (\partial \otimes a) .

(36)

Choosing

a = a_{ε}

and

B = \partial_{\partial \otimes a} ℒ

in (36), we can then rewrite Equation (35) as

δ 𝒮 = \int_{R} d^{k + n} x (\partial_{a} ℒ - \partial \times (\partial_{\partial \otimes a} ℒ)) \cdot a_{ε} + \int_{R} d^{k + n} x \partial \cdot ((\partial_{\partial \otimes a} ℒ) \times a_{ε}) .

(37)

We identify the second integrand with a flux (26) over an

(k + n)

-dimensional hypervolume

R

and use the Stokes theorem [14] (Section 3.5) to rewrite the flux of the interior derivative of a vector field as the flux of the field itself across the region boundary

\partial R

. The second integral in (37) then vanishes if we assume that the field

a

and its perturbation

a_{ε}

vanish sufficiently fast at infinity.

Under this assumption, if the change of action is zero for any perturbation of the field

a_{ε}

, the integrand in the first summand of (37) must be identically zero. Setting to zero the quantity between parentheses in the integrand yields the coordinate-free form of the Euler–Lagrange equations,

\begin{matrix} \partial_{a} ℒ & = \partial \times (\partial_{\partial \otimes a} ℒ), \end{matrix}

(38)

\begin{matrix} \frac{\partial ℒ}{\partial a} & = \partial \times (\frac{\partial ℒ}{\partial (\partial \otimes a)}) . \end{matrix}

(39)

Both expressions in (38) and (39) are equivalent since they only differ in the notation for the vector derivative. It is also possible to recover a component form of the Euler–Lagrange equations from (39) [3] (Section I.3), [4] (Section 3.1), [5] (Section 7.2). Explicitly writing out the definitions in (28) and (29), we obtain the standard formula for each

I \in I_{s}

,

\frac{\partial ℒ}{\partial a_{I}} = \sum_{i \in I} \partial_{i} (\frac{\partial ℒ}{\partial (\partial_{i} a_{I})}) .

(40)
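As an illustration of how (40) is applied, consider a real scalar field, i.e., a field of grade 0 with the single component φ. The Lagrangian density below and the constant μ are assumptions chosen for this example and do not appear in the derivation above; the sketch only shows the mechanics of the component formula.

```latex
% Assumed Lagrangian density for a scalar field \phi (grade 0, I = \emptyset):
ℒ = - \frac{1}{2} \sum_{i} Δ_{i i} (\partial_{i} \phi)^{2} - \frac{1}{2} \mu^{2} \phi^{2}
% Ingredients of (40):
\frac{\partial ℒ}{\partial \phi} = - \mu^{2} \phi, \qquad
\frac{\partial ℒ}{\partial (\partial_{i} \phi)} = - Δ_{i i} \partial_{i} \phi
% Substituting both into (40) and changing the overall sign:
\sum_{i} Δ_{i i} \partial_{i}^{2} \phi = \mu^{2} \phi
```

For k = 1, the left-hand side is the sum of the second space derivatives minus the second time derivative, so the result has the familiar form of the Klein–Gordon equation.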

In general, the use of coordinate-free expressions such as (39) is closer to the common practice of vector calculus and allows us to better identify the algebraic structure of the underlying equations, which is obscured when components are used. Moreover, expressions such as (39) are better suited to generalizations, or more properly, particularizations, to exterior calculus when the Lagrangian depends on the exterior and interior derivatives of the field, rather than the tensor derivative. This case is explored and analyzed in the next subsection.

3.2. Derivation of the Euler–Lagrange Equations in Exterior Algebraic Form

For electromagnetism, the Lagrangian density

ℒ (x)

is a function of the vector potential

A

, the bivector field

F

, and the source density vector

J

. Expressed in exterior calculus notation, the Lagrangian density is given by

\begin{matrix} ℒ (x) & = - \frac{1}{2} F \cdot F + J \cdot A . \end{matrix}

(41)

Remark 1.

If the field is represented by an antisymmetric tensor of rank 2, the factor before

F \cdot F

becomes

- \frac{1}{4}

to account for the repeated sum over pairs of indices [3] (Section I.5), [4] (Section 3.5), [2] (Section 27).
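The factor of two in Remark 1 can be illustrated with a short sketch. It assumes that the multivector dot product sums one term Δ_II F_I² per ordered index list I = (i, j) with i < j, where Δ_II is the product of the metric signs of the two indices, whereas the tensor contraction runs over all index pairs. The metric signs, the sample component values, and the variable names are hypothetical.

```python
import itertools

# Assumed metric signs for (k, n) = (1, 3): one time-like index (-1), three space-like (+1).
delta = [-1, 1, 1, 1]
dim = len(delta)

# Hypothetical bivector components F_I, one per ordered pair I = (i, j) with i < j.
pairs = list(itertools.combinations(range(dim), 2))
components = {pair: float(idx + 1) for idx, pair in enumerate(pairs)}

# Antisymmetric rank-2 tensor carrying the same information.
tensor = [[0.0] * dim for _ in range(dim)]
for (i, j), value in components.items():
    tensor[i][j] = value
    tensor[j][i] = -value

# Multivector dot product: each ordered index list contributes once.
multivector_dot = sum(delta[i] * delta[j] * v ** 2 for (i, j), v in components.items())

# Tensor contraction: every unordered pair is visited twice, as (i, j) and (j, i).
tensor_contraction = sum(
    delta[i] * delta[j] * tensor[i][j] ** 2
    for i in range(dim)
    for j in range(dim)
)

print(multivector_dot, tensor_contraction)
```

Under these assumptions the contraction is exactly twice the multivector dot product, so a prefactor of −1/2 in (41) corresponds to −1/4 in tensor notation.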

The Lagrangian density depends on the field through the potential

A

and its exterior derivative

F = \partial \land A

[15] (Section 3). Instead of using (39), which was derived under the assumption that the Lagrangian density depends explicitly only on the field and its tensor derivative, it is worth obtaining the Euler–Lagrange equations when the Lagrangian density is a function of a generic multivector field

a

of grade s, and its exterior and interior derivatives.

As in (33), for a given region

R

that comprises the physical system under consideration, and assumed to be large enough to make the physical system closed so that the fields decay fast enough over

R

and the flux of the fields over the boundary of

R

is arbitrarily small, the action

𝒮 (a)

is given by the integral

𝒮 (a) = \int_{R} d^{k + n} x ℒ (a, \partial \land a, \partial ⨼ a) .

(42)

Again, for an infinitesimal perturbation of the field

a_{ε}

, and neglecting all the contributions of order

{(a_{ε})}^{2}

or higher in the Taylor expansion of the Lagrangian density and the action, the first-order change in action

δ 𝒮

is given by

\begin{matrix} δ 𝒮 = \int_{R} d^{k + n} x ((\partial_{a} ℒ) \cdot a_{ε} + (\partial_{\partial \land a} ℒ) \cdot (\partial \land a_{ε}) + (\partial_{\partial ⨼ a} ℒ) \cdot (\partial ⨼ a_{ε})) . \end{matrix}

(43)

From (35) in [15], given a multivector

a

and a multivector

b

of grade

gr (a) + 1

, the Leibniz product rule in (24) holds.

Choosing

a = a_{ε}

and

b = \partial_{\partial \land a} ℒ

(resp.

a = \partial_{\partial ⨼ a} ℒ

and

b = a_{ε}

) in the second (resp. third) summand inside the integral, substituting these values in (24) and the result back into (43), we obtain

\begin{matrix} δ 𝒮 & = \int_{R} d^{k + n} x (\partial_{a} ℒ + {(- 1)}^{s + 1} \partial ⨼ (\partial_{\partial \land a} ℒ) - {(- 1)}^{s - 1} \partial \land (\partial_{\partial ⨼ a} ℒ)) \cdot a_{ε} \\ & + \int_{R} d^{k + n} x \partial \cdot (a_{ε} ⨼ (\partial_{\partial \land a} ℒ) + {(- 1)}^{s} (\partial_{\partial ⨼ a} ℒ) ⨼ a_{ε}) . \end{matrix}

(44)

In the second integrand, a flux over the

(k + n)

-dimensional region

R

, the Stokes theorem [14] (Section 3.5) allows us to rewrite the flux of the interior derivative as the flux across the region boundary

\partial R

. As both the field

a

and its perturbation

a_{ε}

vanish sufficiently fast at infinity, the first-order change in action is given by

δ 𝒮 = \int_{R} d^{k + n} x (\partial_{a} ℒ + {(- 1)}^{s + 1} \partial ⨼ (\partial_{\partial \land a} ℒ) - {(- 1)}^{s - 1} \partial \land (\partial_{\partial ⨼ a} ℒ)) \cdot a_{ε} ,

(45)

and the principle of stationary action, namely that the first-order change in action identically vanishes, leads to the coordinate-free form of the Euler–Lagrange equations, in one of the two equivalent forms:

\begin{matrix} \partial_{a} ℒ & = {(- 1)}^{s} \partial ⨼ (\partial_{\partial \land a} ℒ) - {(- 1)}^{s} \partial \land (\partial_{\partial ⨼ a} ℒ) \end{matrix}

(46)

\begin{matrix} \frac{\partial ℒ}{\partial a} & = {(- 1)}^{s} \partial ⨼ (\frac{\partial ℒ}{\partial (\partial \land a)}) - {(- 1)}^{s} \partial \land (\frac{\partial ℒ}{\partial (\partial ⨼ a)}) . \end{matrix}

(47)

It might appear that the tensorial and multivectorial expressions in (39) and (47) differ. If the Lagrangian density depends on the tensor derivative only through the interior and exterior derivatives, we verify in Appendix A.2 that both expressions are indeed identical and the following identity holds:

\partial \times (\frac{\partial ℒ}{\partial (\partial \otimes a)}) = {(- 1)}^{s} \partial ⨼ (\frac{\partial ℒ}{\partial (\partial \land a)}) - {(- 1)}^{s} \partial \land (\frac{\partial ℒ}{\partial (\partial ⨼ a)}) .

(48)

4. Application to Generalized Electromagnetism: Maxwell Equations

4.1. Generalized Maxwell Equations

As an application of the methods derived in the previous section, we study the generalized Maxwell equations [15] and their associated fields. For a given natural number r, the Maxwell field

F (x)

and the generalized source density

J (x)

are respectively characterized by multivector fields of grade r and

r - 1

at every point

x

of the flat

(k, n)

-space-time [15] (Section 3).

The potential field

A (x)

is a multivector field of grade

r - 1

such that

F = \partial \land A .

(49)

If we replace the potential

A

by a new field

A^{'} = A + \bar{A} + \partial \land G

, where

\bar{A}

is a constant

(r - 1)

-vector and

G

is an

(r - 2)

-vector gauge field, the homogeneous Maxwell Equation (51) is unchanged [15] (Section 3). For a given Maxwell field, there is therefore some unavoidable (gauge) ambiguity in the value of the vector potential.
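The invariance can be checked in one line. The sketch below assumes only the linearity of the exterior derivative and the standard fact that applying it twice gives zero:

```latex
\begin{aligned}
\partial \land A' &= \partial \land A + \partial \land \bar{A} + \partial \land (\partial \land G) \\
                  &= \partial \land A + 0 + 0 = F ,
\end{aligned}
```

since \bar{A} is constant and \partial \land (\partial \land G) = 0. The Maxwell field F, and hence \partial \land F, takes the same value for A and A'.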

Scalar fields are described by the potential upon setting

r = 1

in Minkowski space-time, namely

k = 1

and

n = 3

. For classical electromagnetism (

r = 2

,

k = 1

,

n = 3

), the bivector field is usually expressed as an antisymmetric tensor of rank 2; electrostatics and magnetostatics are recovered for

k = 0

,

n = 3

, by setting

r = 1

and

r = 2

, respectively. The generalized Maxwell equations for arbitrary values of r, k, and n are the following pair of coupled differential equations:

\begin{matrix} \partial ⨼ F = J, \end{matrix}

(50)

\begin{matrix} \partial \land F = 0 . \end{matrix}

(51)

The interior derivative in (50) and the exterior derivative in (51) are respectively defined in (22) and (21). As we stated in Section 2.5, the interior derivative lowers the grade by one, while the exterior derivative increases the grade by one; therefore, Equation (50) is an identity of

(r - 1)

-vectors while Equation (51) is an identity of

(r + 1)

-vectors.
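The grade bookkeeping above translates directly into a count of independent components, since a multivector of grade m in k + n dimensions has binom(k + n, m) components. The following sketch tabulates these counts; the function name and dictionary keys are hypothetical.

```python
from math import comb

def grade_dimensions(r, k, n):
    """Number of independent components of the potential, the field, and Equations (50) and (51)."""
    dim = k + n
    return {
        "A": comb(dim, r - 1),     # potential, grade r - 1
        "F": comb(dim, r),         # Maxwell field, grade r
        "eq50": comb(dim, r - 1),  # interior derivative lowers the grade by one
        "eq51": comb(dim, r + 1),  # exterior derivative raises the grade by one
    }

# Classical electromagnetism: r = 2, k = 1, n = 3.
classical = grade_dimensions(r=2, k=1, n=3)
for name, count in classical.items():
    print(name, count)
```

For classical electromagnetism this gives six field components and four component equations each for (50) and (51), in line with the usual count of eight scalar Maxwell equations.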

4.2. Lagrangian Density for Generalized Electromagnetism

For electromagnetism, the Lagrangian density

ℒ (x)

is a function of the potential

A

, the Maxwell field

F

, and the source density

J

. Expressed in exterior calculus notation, we postulate the generalized Lagrangian density to be

ℒ (x) = \frac{{(- 1)}^{r - 1}}{2} F \cdot F + J \cdot A .

(52)

For classical electromagnetism (

r = 2

,

k = 1

,

n = 3

), if the field is expressed as an antisymmetric tensor of rank 2 the factor before

F \cdot F

becomes

- \frac{1}{4}

, see Remark 1. In contrast, for electrostatics (

r = 1

,

k = 0

,

n = 3

), the Lagrangian density is given by

ℒ = \frac{1}{2} E \cdot E + ρ ϕ

, where

E

is the electric field,

ρ

the charge density, and

ϕ

is the opposite in sign of the usual electric potential, that is,

E = \partial \land ϕ = \nabla ϕ

[1] (Ch. 19).

While the Lagrangian in (52) leads to the generalized Maxwell Equations (50) and (51), as we shall see in the following section, it is not the most general Lagrangian associated with electromagnetism. Two terms can be added to it: the Proca term, which accounts for a hypothetical mass of the photon [4] (p. 107), [22] (Section 12.8), and a gauge-fixing term that appears in the context of quantization of the electromagnetic field [3] (Section II.7), [4] (Section 7.1). The resulting general Lagrangian density for electromagnetism is given by

ℒ (x) = \frac{{(- 1)}^{r - 1}}{2} F \cdot F + J \cdot A - \frac{1}{2} m^{2} A \cdot A + \frac{{(- 1)}^{r - 1}}{2 ξ} (\partial ⨼ A) \cdot (\partial ⨼ A),

(53)

where m is the hypothetical photon mass and

ξ

is a parameter that determines the so-called

R_{ξ}

gauge; for

ξ = 1

, we have the Feynman gauge, and in the limit

ξ \to 0

, we have the Landau gauge.

4.3. Euler–Lagrange Equations

For Lagrangian densities such as (52) or (53), which are essentially quadratic forms in the field and/or its interior or exterior derivatives, evaluation of the vector derivatives is straightforward, as the derivative has the same form as that obtained in infinitesimal calculus for the derivative of a polynomial; see (31) and (32). For the Lagrangian density in (52), evaluation of the derivatives in the Euler–Lagrange Equation (47) gives

\begin{matrix} \partial_{A} ℒ & = J, \end{matrix}

(54)

\begin{matrix} \partial_{\partial \land A} ℒ & = {(- 1)}^{r - 1} (\partial \land A), \end{matrix}

(55)

from which the Euler–Lagrange equations themselves (46), with

s = r - 1

, can be expressed as

J = \partial ⨼ (\partial \land A) = \partial ⨼ F,

(56)

namely the generalized nonhomogeneous Maxwell Equation (50) for arbitrary r, k, and n. The homogeneous Maxwell Equation (51) is also satisfied as a consequence of the definition of

F = \partial \land A

.
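For readers who want the intermediate steps behind (56), a short sketch follows. It substitutes (54) and (55) into (46) with s = r − 1 and uses the fact that the Lagrangian density (52) does not depend on the interior derivative of A, so that the corresponding vector derivative is zero:

```latex
\begin{aligned}
\partial_{A} ℒ &= (-1)^{s} \, \partial ⨼ (\partial_{\partial \land A} ℒ) - (-1)^{s} \, \partial \land (\partial_{\partial ⨼ A} ℒ) \\
J &= (-1)^{r-1} (-1)^{r-1} \, \partial ⨼ (\partial \land A) - 0 \\
  &= \partial ⨼ F .
\end{aligned}
```

The two sign factors (-1)^{r-1}, one from (46) and one from (55), multiply to +1 for every value of r.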

The exterior algebraic formulation of the Lagrangian and the Euler–Lagrange equations brings the advantage of allowing for a more direct derivation of the Maxwell equations, since evaluation of the vector derivatives mimics more closely the steps carried out in usual differential calculus to evaluate the derivatives.

The factor

{(- 1)}^{r - 1}

in the Lagrangian density is needed to compensate for the identical term

{(- 1)}^{r - 1}

that appears in the Euler–Lagrange Equation (47). An alternative way of writing the Lagrangian density, without this sign factor, would involve replacing one of the exterior derivatives

\partial \land A

by a right exterior derivative

A \land \partial

, where the partial derivative operator is understood to act from the right on the potential. In this case, the skew commutativity of the wedge product,

\partial \land A = {(- 1)}^{r - 1} (A \land \partial)

[14] (Section 2.2), cancels the sign in the Lagrangian density and results in a somewhat neater expression for it.

As for the Lagrangian density in (53), evaluation of the derivatives in the Euler–Lagrange Equation (47) gives

\begin{matrix} \partial_{A} ℒ = J - m^{2} A, \end{matrix}

(57)

\begin{matrix} \partial_{\partial \land A} ℒ = {(- 1)}^{r - 1} (\partial \land A), \end{matrix}

(58)

\begin{matrix} \partial_{\partial ⨼ A} ℒ = {(- 1)}^{r - 1} \frac{1}{ξ} (\partial ⨼ A), \end{matrix}

(59)

from which the Euler–Lagrange Equation (46) with the Proca and quantization

R_{ξ}

-gauge terms becomes

\begin{matrix} \partial ⨼ (\partial \land A) + m^{2} A = J + \frac{1}{ξ} \partial \land (\partial ⨼ A) . \end{matrix}

(60)
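The step from (57)–(59) to (60) can be spelled out as follows; this is a sketch that only substitutes the three vector derivatives into (46) with s = r − 1:

```latex
\begin{aligned}
J - m^{2} A &= (-1)^{r-1} (-1)^{r-1} \, \partial ⨼ (\partial \land A)
             - (-1)^{r-1} (-1)^{r-1} \, \frac{1}{ξ} \, \partial \land (\partial ⨼ A) \\
            &= \partial ⨼ (\partial \land A) - \frac{1}{ξ} \, \partial \land (\partial ⨼ A) ,
\end{aligned}
```

and moving the mass term to the left-hand side and the gauge-fixing term to the right-hand side gives (60).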

Using the relationship (34) in [15], we may rewrite (60) in an alternative form with a wave equation,

\begin{matrix} {(- 1)}^{r - 1} (\partial \cdot \partial) A + m^{2} A = J + (\frac{1}{ξ} - 1) \partial \land (\partial ⨼ A), \end{matrix}

(61)

which somewhat simplifies in the Feynman gauge, for which

ξ = 1

.

4.4. Dual Generalized Maxwell Equations

An interesting dual form of Maxwell equations is obtained by swapping the roles played by the interior and exterior derivatives in the Lagrangian density and the Maxwell equations themselves. Let the “potential”

\bar{A}

and “source density”

\bar{J}

be multivectors of grade s, and let us define a dual Maxwell field

\bar{F}

of grade

r = s - 1

by

\bar{F} = \partial ⨼ \bar{A}

. The Lagrangian density is now given by

\begin{matrix} ℒ (x) & = \frac{{(- 1)}^{r}}{2} (\partial ⨼ \bar{A}) \cdot (\partial ⨼ \bar{A}) + \bar{J} \cdot \bar{A} \end{matrix}

(62)

\begin{matrix} = \frac{1}{2} (\partial ⨼ \bar{A}) \cdot (\bar{A} ⌙ \partial) + \bar{J} \cdot \bar{A}, \end{matrix}

(63)

where we used the relationship between left and right interior derivatives

\partial ⨼ \bar{A} = {(- 1)}^{s + 1} (\bar{A} ⌙ \partial)

to write (63). Direct evaluation of the Euler–Lagrange Equation (47), with

r = s - 1

, gives

\bar{J} = \partial \land \bar{F} .

(64)
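As a sketch of the evaluation leading to (64): the Lagrangian density (62) depends on \bar{A} and on its interior derivative only, so the vector derivatives are \partial_{\bar{A}} ℒ = \bar{J}, \partial_{\partial \land \bar{A}} ℒ = 0, and \partial_{\partial ⨼ \bar{A}} ℒ = (-1)^{r} (\partial ⨼ \bar{A}). Substituting them into (46) for a field of grade s gives

```latex
\begin{aligned}
\bar{J} &= 0 - (-1)^{s} (-1)^{r} \, \partial \land (\partial ⨼ \bar{A}) \\
        &= - (-1)^{2 s - 1} \, \partial \land \bar{F} \\
        &= \partial \land \bar{F} ,
\end{aligned}
```

where r = s − 1 was used in the second line.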

This nonhomogeneous “Maxwell” equation is complemented by a homogeneous equation

\partial ⨼ \bar{F} = 0

, itself a consequence of the definition of

\bar{F}

as

\bar{F} = \partial ⨼ \bar{A}

.

As with the generalized Maxwell equations, the exterior algebraic formulation of the Lagrangian and the Euler–Lagrange equations allows for a more direct derivation of the dual Maxwell equations. An interesting question, which we do not dwell upon as it lies beyond the scope of this paper, is whether the physics of the dual Maxwell equations is different from the usual Maxwell equations, or simply involves a transformation of the fields, potential, and source density, with no new phenomena. Along this direction, and leaving the details as an exercise to the reader, it is relatively easy to verify that one obtains a wave equation relating

\bar{A}

and

\bar{J}

in a “Lorenz gauge” where

\partial \land \bar{A} = 0

. Solutions to this wave equation have several independent degrees of freedom or polarizations. The number of these polarizations is

(\binom{k + n - 2}{r - 1})

, as for the standard Maxwell equations [15] (Section 4.3); this number can be justified as the number of possible

(r + 1)

-vectors where two dimensions, one temporal and one spatial, are fixed, and the remaining

r + 1 - 2 = r - 1

indices have to be filled with

k + n - 2

possible values. We also have a “Lorentz force” density

f = \bar{J} ⌙ \bar{F}

such that a conservation law holds for the stress–energy-momentum tensor

T_{em}

of the field [15] (Appendix A.2), as for the usual Maxwell equations.
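The counting argument for the number of polarizations can be reproduced with a few lines of code. The sketch below compares the closed-form binomial coefficient with a direct enumeration of the ordered (r + 1)-index lists that contain one fixed temporal and one fixed spatial index. The index labelling (temporal indices first) and the function names are assumptions, and the enumeration requires k ≥ 1 and n ≥ 1.

```python
from itertools import combinations
from math import comb

def polarizations(r, k, n):
    """Closed-form count quoted in the text: binom(k + n - 2, r - 1)."""
    return comb(k + n - 2, r - 1)

def polarizations_by_enumeration(r, k, n):
    """Count ordered (r + 1)-index lists containing one fixed temporal and one fixed spatial index."""
    dim = k + n
    # Assumed labelling: indices 0..k-1 are temporal, k..k+n-1 are spatial.
    fixed_time, fixed_space = 0, k
    return sum(
        1
        for index_list in combinations(range(dim), r + 1)
        if fixed_time in index_list and fixed_space in index_list
    )

# Classical electromagnetism: r = 2, k = 1, n = 3.
print(polarizations(r=2, k=1, n=3), polarizations_by_enumeration(r=2, k=1, n=3))
```

For classical electromagnetism both counts equal two, the familiar number of polarizations of an electromagnetic wave.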

Author Contributions

Conceptualization, A.M.; methodology, I.C., J.F.-S. and A.M.; investigation, I.C. and A.M.; writing—original draft preparation, A.M.; writing—review and editing, I.C., J.F.-S. and A.M. All authors have read and agreed to the published version of the manuscript.

Funding

This work was funded in part by the Spanish Ministry of Science, Innovation and Universities under grants TEC2016-78434-C3-1-R and BES-2017-081360.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Acknowledgments

The authors acknowledge the anonymous reviewers for the constructive comments and suggestions that have helped to improve the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Appendix A.1. Proof of the Leibniz Product Rule in (36)

Let us consider a multivector field

a

of grade m and a matrix field

B

with basis

w_{i, I} = e_{i} \otimes e_{I}

, where

| I | = m

. Using the definitions of dot and matrix product (4) and (19), the first term of the right-hand side of (36) is evaluated as

\begin{matrix} (\partial \times B) \cdot a & = (\sum_{i, j \in I, J \in I_{m}} Δ_{i i} \partial_{i} b_{j, J} e_{i} \times (e_{j} \otimes e_{J})) \cdot (\sum_{I \in I_{m}} a_{I} e_{I}) \end{matrix}

(A1)

\begin{matrix} = (\sum_{i \in I, J \in I_{m}} \partial_{i} b_{i, J} e_{J}) \cdot (\sum_{I \in I_{m}} a_{I} e_{I}) \end{matrix}

(A2)

\begin{matrix} = \sum_{i \in I, I \in I_{m}} Δ_{I I} a_{I} \partial_{i} b_{i, I}, \end{matrix}

(A3)

where in (A1), we wrote the components of

\partial B

and

a

, in (A2), we computed the matrix product and removed the j index, and in (A3), we carried out the dot product and removed the J index.

In turn, the second term in the right-hand side of (36) can similarly be evaluated using the dot and times products in (16) and (19) as

\begin{matrix} B \cdot (\partial \otimes a) & = (\sum_{i \in I, I \in I_{m}} b_{i, I} e_{i} \otimes e_{I}) \cdot (\sum_{j \in I, J \in I_{m}} Δ_{j j} \partial_{j} a_{J} e_{j} \otimes e_{J}) \end{matrix}

(A4)

\begin{matrix} = \sum_{i \in I, I \in I_{m}} Δ_{I I} b_{i, I} \partial_{i} a_{I} . \end{matrix}

(A5)

Next, using again the definitions of dot and matrix product (4) and (17), the left-hand side of (36) becomes

\begin{matrix} \partial \cdot (B \times a) & = (\sum_{i \in I} Δ_{i i} e_{i} \frac{\partial}{\partial x_{i}}) \cdot (\sum_{I \in I_{m}} \sum_{j \in I, J \in I_{m}} a_{I} b_{j, J} (e_{j} \otimes e_{J}) \times e_{I}) \end{matrix}

(A6)

\begin{matrix} = (\sum_{i \in I} Δ_{i i} e_{i} \frac{\partial}{\partial x_{i}}) \cdot (\sum_{j \in I, I \in I_{m}} a_{I} b_{j, I} Δ_{I I} e_{j}) \end{matrix}

(A7)

\begin{matrix} = \sum_{i \in I, I \in I_{m}} Δ_{I I} \partial_{i} (a_{I} b_{i, I}) . \end{matrix}

(A8)

Summing (A3) and (A5) and applying the rule for the derivative of a product yields the desired (36).

Appendix A.2. Identity between Tensorial and Exterior Algebraic Euler–Lagrange Equations

Since both the exterior and interior derivatives are surjective linear functions of the tensor derivative, the respective vector derivatives of the Lagrangian density are related. Each component of the exterior and interior derivatives (21) and (22) is a scalar (affine) function of several distinct components of the tensor derivative. The Lagrangian density depends on the components of the tensor derivative only through these scalar functions. We thus need to compute the derivative of a function

ℒ (g_{1} (z), \dots, g_{ℓ} (z))

, where

z

stands for a vector with the

ℓ^{'} = (k + n) (\binom{k + n}{s})

components of the tensor derivative, and

(g_{1}, \dots, g_{ℓ})

are the (differentiable) functions that give the components of the exterior (resp. interior) derivative from the tensor derivative, where

ℓ = (\binom{k + n}{s + 1})

(resp.

ℓ = (\binom{k + n}{s - 1})

). By construction, a given

z_{k} = \partial_{j} a_{J}

appears only in one

g_{i} (z)

, the I component of either the exterior or the interior derivative. In the former case,

I = j + J

, in the latter case

I = J \ j

.
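The claim that each component of the tensor derivative feeds exactly one component of either the exterior or the interior derivative can be checked by enumeration. In the sketch below, index lists are modelled as sorted tuples and a pair (j, J) is routed to the exterior derivative when j is not in J and to the interior derivative when j is in J; the function name and data layout are hypothetical.

```python
import itertools
from math import comb

def classify_components(dim, s):
    """Route every tensor-derivative component (j, J) to the single component I that it feeds."""
    exterior, interior = {}, {}
    for j in range(dim):
        for J in itertools.combinations(range(dim), s):
            if j in J:
                # Interior derivative: I = J \ j, of grade s - 1.
                target = tuple(x for x in J if x != j)
                interior.setdefault(target, []).append((j, J))
            else:
                # Exterior derivative: I = j + J, of grade s + 1.
                target = tuple(sorted(J + (j,)))
                exterior.setdefault(target, []).append((j, J))
    return exterior, interior

dim, s = 4, 2
exterior, interior = classify_components(dim, s)
total_pairs = sum(len(v) for v in exterior.values()) + sum(len(v) for v in interior.values())
print(total_pairs, len(exterior), len(interior))
```

Every pair (j, J) lands in exactly one bucket, so the total equals (k + n) binom(k + n, s), while the numbers of buckets match binom(k + n, s + 1) and binom(k + n, s − 1).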

From the definition of partial derivative, and for any

i = 1, \dots, ℓ

, we have the relation

\frac{\partial ℒ}{\partial g_{i} (z)} = lim_{h \to 0} \frac{ℒ (g_{1} (z), \dots, g_{i} (z) + h, \dots, g_{ℓ} (z)) - ℒ (g_{1} (z), \dots, g_{ℓ} (z))}{h} .

(A9)

Then, assuming that

\frac{\partial g_{i} (z)}{\partial z_{k}} \neq 0

and defining

h_{i k}^{'} = \frac{h}{\partial g_{i} / \partial z_{k}}

, we can then write for every value of k such that the partial derivative

z_{k} = \partial_{j} a_{J}

appears in

g_{i} (z)

,

g_{i} (z) + h = g_{i} (z) + h_{i k}^{'} \frac{\partial g_{i} (z)}{\partial z_{k}} ≃ g_{i} (z_{1}, \dots, z_{k} + h_{i k}^{'}, \dots, z_{ℓ^{'}}),

(A10)

where we used the differentiability of the function

g_{i}

. Substituting (A10) back into (A9) yields

\begin{matrix} \frac{\partial ℒ}{\partial g_{i} (z)} = \frac{1}{\partial g_{i} / \partial z_{k}} lim_{h_{i k}^{'} \to 0} \frac{ℒ (g_{1} (z), \dots, g_{i} (z_{1}, \dots, z_{k} + h_{i k}^{'}, \dots, z_{ℓ^{'}}), \dots, g_{ℓ} (z)) - ℒ (g_{1} (z), \dots, g_{ℓ} (z))}{h_{i k}^{'}} . \end{matrix}

(A11)

Since the

z_{k}

component appears only in one of the functions

g_{i}

, the limit in (A11) is the partial derivative of the Lagrangian density with respect to the kth component of the tensor derivative, for any k, that is,

\frac{\partial ℒ}{\partial g_{i} (z)} = \frac{1}{\partial g_{i} / \partial z_{k}} \frac{\partial ℒ}{\partial z_{k}} .

(A12)

We now proceed to evaluate the vector derivative with respect to the exterior derivative. First, we have

\begin{matrix} \frac{\partial ℒ}{\partial (\partial \land a)} = \sum_{I \in I_{s + 1}} \frac{\partial ℒ}{\partial {(\partial \land a)}_{I}} Δ_{I I} e_{I}, \end{matrix}

(A13)

where the Ith component of the exterior derivative,

{(\partial \land a)}_{I}

is the equivalent of

g_{i} (z)

in (A12). The equivalent of k is any pair of j and

J \in I_{s}

such that

I = j + J

and the corresponding

z_{k}

is

\partial_{j} a_{J}

. The partial derivative

\partial g_{i} / \partial z_{k}

in (A12) is thus

Δ_{j j} σ (j, J)

, of value

\pm 1

, and we therefore have for any pair of j and

J \in I_{s}

such that

I = j + J

that

\begin{matrix} \frac{\partial ℒ}{\partial {(\partial \land a)}_{I}} = \frac{1}{Δ_{j j} σ (j, J)} \frac{\partial ℒ}{\partial (\partial_{j} a_{J})} = Δ_{j j} σ (j, J) \frac{\partial ℒ}{\partial (\partial_{j} a_{J})} . \end{matrix}

(A14)

Substituting (A14) back in (A13) yields

\begin{matrix} \frac{\partial ℒ}{\partial (\partial \land a)} = \sum_{I \in I_{s + 1}} \frac{\partial ℒ}{\partial (\partial_{j} a_{J})} Δ_{j j} Δ_{I I} σ (j, J) e_{I}, \end{matrix}

(A15)

where j and J are any pair such that

I = j + J

. Now, taking the interior derivative of (A15), we obtain for any

j, J

such that

I = j + J

,

\begin{matrix} \partial ⨼ (\frac{\partial ℒ}{\partial (\partial \land a)}) & = \sum_{i \in I} Δ_{i i} e_{i} \partial_{i} ⨼ (\sum_{I \in I_{s + 1}} \frac{\partial ℒ}{\partial (\partial_{j} a_{J})} Δ_{j j} Δ_{I I} σ (j, J) e_{I}) \end{matrix}

(A16)

\begin{matrix} = \sum_{i \in I, I \in I_{s + 1} : i \in I} Δ_{i i} Δ_{I I} \partial_{i} (\frac{\partial ℒ}{\partial (\partial_{i} a_{I \ i})}) σ (i, I \ i) σ (I \ i, i) e_{I \ i}, \end{matrix}

(A17)

where we have selected

j = i

and, therefore,

J = I \ i

. We also note that

σ (i, I \ i) σ (I \ i, i) = {(- 1)}^{s}

.
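The product of signatures can be verified by brute force. The sketch below assumes that σ(I, J) denotes the sign of the permutation that sorts the concatenation of the lists I and J into increasing order; this definition belongs to an earlier section of the paper and is restated here as an assumption. The function names are hypothetical.

```python
import itertools

def signature(left, right):
    """Sign of the permutation sorting the concatenation left + right (assumed definition of sigma)."""
    merged = list(left) + list(right)
    inversions = sum(
        1
        for p in range(len(merged))
        for q in range(p + 1, len(merged))
        if merged[p] > merged[q]
    )
    return -1 if inversions % 2 else 1

def check_product(dim, s):
    """Check sigma(i, I minus i) * sigma(I minus i, i) == (-1)**s for all ordered lists I of length s + 1."""
    for full in itertools.combinations(range(dim), s + 1):
        for i in full:
            rest = tuple(x for x in full if x != i)
            if signature((i,), rest) * signature(rest, (i,)) != (-1) ** s:
                return False
    return True

results = [check_product(dim=5, s=s) for s in range(5)]
print(results)
```

The reason is that moving i to the front costs one transposition per smaller element of the remaining list and moving it to the back costs one per larger element, for a total equal to the length of the remaining list.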

We now evaluate the vector derivative with respect to the interior derivative in an analogous manner,

\begin{matrix} \frac{\partial ℒ}{\partial (\partial ⨼ a)} = \sum_{I \in I_{s - 1}} \frac{\partial ℒ}{\partial (\partial_{j} a_{J})} Δ_{I I} σ (J \ j, j) e_{I}, \end{matrix}

(A18)

where j and J are any pair such that

I = J \ j

. Now, taking the exterior derivative of (A18), we obtain, for any

j, J

such that

I = J \ j

,

\begin{matrix} \partial \land (\frac{\partial ℒ}{\partial (\partial ⨼ a)}) & = \sum_{i \in I} Δ_{i i} e_{i} \partial_{i} \land (\sum_{I \in I_{s - 1}} \frac{\partial ℒ}{\partial (\partial_{j} a_{J})} Δ_{I I} σ (J \ j, j) e_{I}) \end{matrix}

(A19)

\begin{matrix} = \sum_{i \in I, I \in I_{s - 1} : i \notin I} Δ_{i i} Δ_{I I} \partial_{i} (\frac{\partial ℒ}{\partial (\partial_{i} a_{i + I})}) σ (I, i) σ (i, I) e_{i + I}, \end{matrix}

(A20)

where we have selected

j = i

and, therefore,

J = i + I

. We also note that

σ (I, i) σ (i, I) = {(- 1)}^{s - 1}

.

Putting (A17) and (A20), as well as the relationships between the product of permutation signatures, back into the right-hand side of (48) yields the expression

\begin{matrix} \sum_{i \in I, I \in I_{s + 1} : i \in I} Δ_{i i} Δ_{I I} \partial_{i} (\frac{\partial ℒ}{\partial (\partial_{i} a_{I \ i})}) e_{I \ i} + \sum_{i \in I, I \in I_{s - 1} : i \notin I} Δ_{i i} Δ_{I I} \partial_{i} (\frac{\partial ℒ}{\partial (\partial_{i} a_{i + I})}) e_{i + I} . \end{matrix}

(A21)

Since the basis elements are multivectors with s components, we may rewrite (A21) as

\sum_{i \in I, I \in I_{s} : i \notin I} Δ_{I I} \partial_{i} (\frac{\partial ℒ}{\partial (\partial_{i} a_{I})}) e_{I} + \sum_{i \in I, I \in I_{s} : i \in I} Δ_{I I} \partial_{i} (\frac{\partial ℒ}{\partial (\partial_{i} a_{I})}) e_{I} .

(A22)

The two summations in (A22) might be further combined in a single summation over

I \in I_{s}

. The resulting expression coincides with the left-hand side of (48), which can be expanded using the computation in (40) into

\sum_{i, I \in I_{s}} Δ_{I I} \partial_{i} (\frac{\partial ℒ}{\partial (\partial_{i} a_{I})}) e_{I} .

(A23)

References

1. Feynman, R.P.; Leighton, R.B.; Sands, M. The Feynman Lectures on Physics, Vol. II: Mainly Electromagnetism and Matter; Addison-Wesley: Boston, MA, USA, 1977.
2. Landau, L.D.; Lifshitz, E.M. The Classical Theory of Fields. In Course of Theoretical Physics, 4th ed.; Butterworth-Heinemann: Oxford, UK, 1987; Volume 2.
3. Zee, A. Quantum Field Theory in a Nutshell. In Nutshell Handbook; Princeton University Press: Princeton, NJ, USA, 2003.
4. Maggiore, M. A Modern Introduction to Quantum Field Theory. In Oxford Master Series in Statistical, Computational, and Theoretical Physics; Oxford University Press: Oxford, UK, 2005.
5. Weinberg, S. The Quantum Theory of Fields, Volume 1: Foundations; Cambridge University Press: Cambridge, UK, 2005.
6. Lanczos, C. The Variational Principles of Mechanics, 4th ed.; University of Toronto Press: Toronto, ON, Canada, 1970; 418p.
7. Gibbs, J.W.; Wilson, E.B. Vector Analysis. In New Haven; Yale University Press: New Haven, CT, USA, 1929.
8. Clifford, W.K. Mathematical Papers; Macmillan: London, UK, 1882.
9. Doran, C.; Lasenby, A. Geometric Algebra for Physicists; Cambridge University Press: Cambridge, UK, 2003.
10. Ricci, M.M.G.; Levi-Civita, T. Méthodes de calcul différentiel absolu et leurs applications. Math. Ann. 1900, 54, 125–201.
11. Cartan, E. Les Systemes Differentiels Exterieurs Et Leurs Applications Geometriques; Hermann & Cie: Paris, France, 1945.
12. Arnold, V.I.; Weinstein, A.; Vogtmann, K. Mathematical Methods of Classical Mechanics; Springer: Berlin, Germany, 1989.
13. Grassmann, H. Extension Theory. In Number 19 in History of Mathematics Sources; American Mathematical Society: Providence, RI, USA, 2000.
14. Colombaro, I.; Font-Segura, J.; Martinez, A. An Introduction to Space–Time Exterior Calculus. Mathematics 2019, 7, 564.
15. Colombaro, I.; Font-Segura, J.; Martinez, A. Generalized Maxwell equations for exterior-algebra multivectors in (k, n) space-time dimensions. Eur. Phys. J. Plus 2020, 135, 1–31.
16. Lasenby, A.; Doran, C.; Gull, S. A Multivector Derivative Approach to Lagrangian Field Theory. Found. Phys. 1993, 23, 1295–1327.
17. Martinez, A.; Font-Segura, J.; Colombaro, I. An Exterior-Algebraic Derivation of the Symmetric Stress-Energy-Momentum Tensor in Flat Space-Time. Eur. Phys. J. Plus 2021, 136, 1–28.
18. Lovelock, D.; Rund, H. Tensors, Differential Forms, and Variational Principles; Dover Publications: New York, NY, USA, 1989.
19. Flanders, H. Differential Forms with Applications to the Physical Sciences; Dover Publications: New York, NY, USA, 1989.
20. Horn, R.A.; Johnson, C.R. Matrix Analysis, 2nd ed.; Cambridge University Press: Cambridge, UK, 2013.
21. Dwyer, P.S.; Macphail, M.S. Symbolic Matrix Derivatives. Ann. Math. Statist. 1948, 19, 517–534.
22. Jackson, J.D. Classical Electrodynamics, 3rd ed.; John Wiley & Sons: Hoboken, NJ, USA, 1999.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

