Next Article in Journal
Genetic and Swarm Algorithms for Optimizing the Control of Building HVAC Systems Using Real Data: A Comparative Study
Previous Article in Journal
Uniform Dichotomy Concepts for Discrete-Time Skew Evolution Cocycles in Banach Spaces
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Exterior Algebraic Derivation of the Euler–Lagrange Equations from the Principle of Stationary Action

Department of Information and Communication Technologies, Universitat Pompeu Fabra, 08018 Barcelona, Spain
*
Author to whom correspondence should be addressed.
Mathematics 2021, 9(18), 2178; https://doi.org/10.3390/math9182178
Submission received: 4 August 2021 / Revised: 31 August 2021 / Accepted: 2 September 2021 / Published: 7 September 2021
(This article belongs to the Section Mathematical Physics)

Abstract

:
In this paper, we review two related aspects of field theory: the modeling of the fields by means of exterior algebra and calculus, and the derivation of the field dynamics, i.e., the Euler–Lagrange equations, by means of the stationary action principle. In contrast to the usual tensorial derivation of these equations for field theories, that gives separate equations for the field components, two related coordinate-free forms of the Euler–Lagrange equations are derived. These alternative forms of the equations, reminiscent of the formulae of vector calculus, are expressed in terms of vector derivatives of the Lagrangian density. The first form is valid for a generic Lagrangian density that only depends on the first-order derivatives of the field. The second form, expressed in exterior algebra notation, is specific to the case when the Lagrangian density is a function of the exterior and interior derivatives of the multivector field. As an application, a Lagrangian density for generalized electromagnetic multivector fields of arbitrary grade is postulated and shown to have, by taking the vector derivative of the Lagrangian density, the generalized Maxwell equations as Euler–Lagrange equations.

1. Introduction

In classical mechanics, the action is a scalar quantity, with units of energy × time, that encodes the dynamical evolution of a given physical system; mathematically, the action is given by an integral functional of the trajectory or dynamical path (or an integral of the Lagrangian density for field theories) followed by the physical system over space-time. The principle of stationary action states that the actual dynamical path followed by the system, subject to some appropriate boundary constraints, possibly at infinity, corresponds to a stationary point of the action [1] (Ch. 19), [2] (Section 8). An application of the principle yields the Euler–Lagrange equations, which describe the dynamics of the system [3] (Section I.3), [4] (Section 3.1), [5] (Section 7.2). The historical development of the stationary-action principle—in essence, a far-reaching generalization of Fermat’s principle—that states that light follows the shortest-time path between two points is described in detail in [6] (Section X).
This paper revisits the derivation of the Euler–Lagrange equations for field theories from the principle of stationary action from the point of view of exterior algebra and calculus. There exist several alternative mathematical representations for the fields, ranging from the original vector calculus by Gibbs [7] and Heaviside to geometric and Clifford algebras [8], where vectors are replaced by multivectors and operations such as the cross and the dot products subsumed in the geometric product; a modern perspective on the use of geometric algbra in physics is given in [9]. Early in the 20th century, tensors such as the Faraday tensor in electromagnetism were quickly and almost universally adopted as the natural mathematical representation of fields in space-time [10] (pp. 135–144). In parallel, mathematicians such as Cartan generalized the fundamental theorems of vector calculus i.e., Gauss, Green, and Stokes, by means of differential forms [11]. Later on, differential forms were used in Hamiltonian mechanics, e.g., to calculate trajectories as vector field integrals [12] (pp. 194–198).
Since differential forms may be seen as the circulation or flux over appropriate space-time regions of multivector fields, it may be preferable in some contexts to directly study the multivector fields. Therefore, we build our analysis on the exterior algebra originally developed by Grassmann [13], which has comparatively received little attention in the literature and leads to simple formulae that merge the simplicity and intuitiveness of standard vector calculus with the power of tensors and differential forms [14,15].
In Section 2, we provide the necessary background on exterior algebra and calculus, including the important notion of multivector-valued derivative with respect to a vector v . Then, we obtain in Section 3 two related coordinate-free forms of the Euler–Lagrange equations for the dynamics of a multivector field a of grade r as vector derivatives of the Lagrangian density L .
Our work is related to the geometric–algebraic multivectorial formulation of the Euler–Lagrange equations in [16] (Equations (4.7) and (4.8)). The first form in (39) is valid for a generic Lagrangian density that only depends on the first-order derivatives of the field, more specifically on the tensor derivative a in (27), and is given by
a = × a ,
as a function of the vector and matrix derivatives a and a in (28) and (29), respectively. The ( k + n ) -dimensional differential operator is defined in (20); together with the matrix product × defined in (19), the operation in the right-hand side generalizes the concept of the divergence of a field. The second form (47), expressed in exterior algebra notation, is specific to the case when the Lagrangian density depends only on exterior (denoted by ; see (21)) and interior derivatives (denoted by ; see (22)) of the multivector field, and is given by
a = ( 1 ) r 1 a + ( 1 ) r a ,
where r is the grade of the multivector field a . A complementary analysis, which shows the invariance of the action to infinitesimal space–time translations in exterior algebra, was conducted in [17], where the stress–energy–momentum tensor is evaluated and profusely discussed. We conclude the paper in Section 4 with an application of our analysis to a Lagrangian density for generalized electromagnetic multivector fields that leads, by directly taking the vector derivative of the Lagrangian density, to the generalized Maxwell equations for multivector fields of grade r [15]. We also provide a short discussion, of independent interest, of a dual form of Maxwell equations where the exterior derivative is replaced by the interior derivative in the definition of the field from the potential.

2. Fundamentals of Exterior Algebra and Calculus: Notation, Definitions, and Operations

2.1. Multivector Fields

While our space-time has four space-time dimensions in relativistic terms, it will prove convenient to consider a generic flat space-time R k + n with k temporal dimensions and n spatial dimensions, as this generality allows for a more natural description of the underlying algebraic structure of the equations and of their derivations. Points and position in space-time are denoted by x , with components x i in the canonical basis { e i } i = 0 k + n 1 ; by convention, the first k indices, i.e., i = 0 , , k 1 , correspond to time components while the indices i = k , , k + n 1 represent space components. We let space and time coordinates have the same units. Although we shall not make use of this fact, space-time vectors transform contravariantly under changes of coordinates.
In exterior algebra, one considers vector spaces whose basis elements e I are indexed by lists I = ( i 1 , , i m ) drawn from I m , the set of all ordered lists with m nonrepeated indices, with m I = { 0 , 1 , k + n }. Later on, in (6), we express the basis elements e I in terms of the vectorial canonical basis e i , for an ordered list i 1 , , i m . These vectors, which we identify with fields, live in the tangent space and transform covariantly under changes of coordinates [18] (Ch. 2), [19] (Ch. V). We refer to elements of these vector field spaces as multivector fields of grade m. While multivector fields do not cover all relevant physical models, e.g., spinor fields or the tensor field in general relativity, they do model a number of interesting cases; for instance, a scalar field is represented by multivectors of grade 0, the electric field, the electromagnetic vector potential and source current by multivectors of grade 1, and the electromagnetic field by a multivector of grade 2. A multivector field a ( x ) of grade m, possibly a function of the position x , with components a I ( x ) in the canonical basis { e I } I I m can be written as
a ( x ) = I I m a I ( x ) e I .
We denote by gr ( a ) the operation that returns the grade of a vector a and by | I | the length of a list I. The dimension of the vector space of all grade m multivectors is k + n m , the number of lists in I m .

2.2. Operations on Index Lists

As the basis elements of multivector fields are indexed by lists I, it proves convenient to define some basic operations on such lists: permutations and their signatures, concatenations (mergers), and subtractions of lists.
First of all, if the list I is not ordered, let σ ( I ) denote the signature of the permutation sorting the elements of I in increasing order. If the permutation is even (resp. odd), the signature is + 1 (resp. 1 ). If the list I contains repeated indices, its signature is 0.
More generally, for two index lists I and J with respective lenghts m = | I | and m = | J | , let ( I , J ) = { i 1 , , i m , j 1 , , j m } be the concatenation of the two index lists I and J. We let σ ( I , J ) denote the signature of the permutation sorting the concatenated list of | I | + | J | indices, and let I + J , or ε ( I , J ) if the notation I + J is ambiguous in a given context, denote the sorted concatenated list, which we refer to as merged list.
In general, we view the lists as ordered sets, and apply standard operations on sets to the lists. For instance, I is contained in J, the list J \ I is the result of removing from J all the elements in I, while keeping the order. As another example, we denote by I c the complement of I, namely the ordered sequence of indices not included in I. We denote the empty list by ∅; it holds that σ ( , K ) = σ ( K , ) = 1 for an ordered list K, and that e = 1 .

2.3. Operations on Multivectors

We next define several operations acting on multivectors; our presentation loosely follows [14] (Sections 2 and 3) and [15] (Section 2) and is close in spirit and form to vector calculus. Introductions to exterior algebra from the perspective and language of differential forms can be found in [18,19]. A geometric algebra perspective can be found in [9]. With no real loss of generality, we define the operations only for the canonical basis vectors, the operation acting on general multivectors being a mere extension by linearity of the former.
First, the dot product · of two arbitrary grade m basis vectors e I and e J is defined as
e I · e J = Δ I J = Δ i 1 j 1 Δ i 2 j 2 Δ i m j m ,
where I and J are the ordered lists I = ( i 1 , i 2 , , i m ) and J = ( j 1 , j 2 , , j m ) and Δ i j = 0 if i j , and we let time unit vectors e i have negative metric Δ i i = 1 and space unit vectors e i have positive metric Δ i i = + 1 . When m = 0 , we interpret the dot product in (4) as 1 since e = 1 .
The following operations to be defined are the interior and exterior products, which subsume and generalize the operations of gradient, curl, and divergence of vector calculus to multivector fields. These operations transform pairs of multivectors into a multivector of a different grade, introducing in the process some signs, i.e., ± 1 . When these signs are related to the dot product in (4), we explicitly write the signs as quantities such as Δ I J . Other sign contributions arise from the signatures of permutations ordering lists of indices. A common practice in the literature to deal with these signatures is to write factors such as ( 1 ) | I | + | J | . However, it seems more convenient to explicitly keep track of the lists and write the permutation associated to this factor, e.g., σ ( I , J ) , as clearer connections between different formulae can be established by harnessing the power of group theory for permutations.
Let two basis vectors e I and e J have grades m = | I | and m = | J | . As defined in Section 2.2, let ( I , J ) = { i 1 , , i m , j 1 , , j m } be the concatenation of the two index lists I and J, let σ ( I , J ) denote the signature of the permutation sorting the elements of this concatenated list.
Then, the exterior product of e I and e J is defined as
e I e J = σ ( I , J ) e I + J .
The exterior product is thus either zero or a multivector of grade | I | + | J | , since σ ( I , J ) = 0 when the lists I and J have elements in common. The unit scalar (multivector of grade 0) is an identity of the exterior product, as 1 e I = e I 1 = e I . The exterior product provides a construction of the basis vector e I , with I an ordered list I = ( i 1 , , i m ) , from the canonical basis vectors e i , namely
e I = e i 1 e i 2 e i m .
When I = , we adopt the usual convention that the right-hand side is 1.
We next define two generalizations of the dot product, the left and right interior products. Let e I and e J be two basis vectors of respective grades | I | and | J | . The left interior product, denoted by , is defined as
e I e J = Δ I I σ ( J \ I , I ) e J \ I , if I J , 0 , otherwise .
Although we might have overloaded the meaning of σ ( J \ I , I ) to be zero when I J , we prefer to list the separate cases in (7). The vector e J \ I has grade | J | | I | and is indexed by the elements of J not in common with I. The use of the word left represents the fact that e I acts from the left on e J and removes the elements in I from J.
Analogously, the right interior product, denoted by , of two basis vectors e I and e J is defined as
e J e I = Δ I I σ ( I , J \ I ) e J \ I , if I J , 0 , otherwise .
As in the previous case, the use of the word right represents the fact that e I acts from the right on e J and removes the elements in I from J. The unit scalar (multivector of grade 0) acting from the left (resp. right) is an identity of the left (resp. right) interior product, as 1 + e I = e I + 1 = e I .
It proves instructive to evaluate the left and right interior products between two multivectors of the same grade, i.e., if | I | = | J | . From (7) and (8), and taking into account that σ ( , K ) = σ ( K , ) = 1 for an ordered list K, and that e = 1 , we see that
e I e J = e J e I = e I · e J , if | I | = | J | ,
supporting the idea that the interior products generalize the dot product. Both interior products are grade-lowering operations, as the interior product is either zero or a multivector of grade | J | | I | .
Finally, we define the complement of a multivector. For a multivector e I with grade m, its Grassmann or Hodge complement, denoted by e I H , is the unit ( k + n m ) -vector
e I H = Δ I I σ ( I , I c ) e I c ,
where I c is the complement of the list I and σ ( I , I c ) is the signature of the permutation sorting the elements of the concatenated list ( I , I c ) containing all space-time indices. In other words, e I c is the basis multivector of grade k + n m whose indices are in the complement of I. In addition, we define the inverse complement transformation as
e I H 1 = Δ I c I c σ ( I c , I ) e I c .
The interior products are not independent operations from the exterior product, as they can be expressed in terms of the latter, the Hodge complement and its inverse:
e I e J = e I e J H H 1 ,
e J e I = e J H 1 e I H .
The vector calculus cross product between two vectors in R 3 can be expressed in several alternative ways in terms of the interior, and exterior products and Hodge dual [14] (Equation (18)). This fact allows us to distinguish various roles that the cross product takes in Maxwell equations and lies at the origin of generalized electromagnetism described by multivectors in generic flat space-time [15].

2.4. Matrix Vector Spaces

We do not need to consider general tensor fields but rather the matrix field (vector) space whose basis elements can be represented as w I 1 , I 2 , where both I 1 and I 2 are ordered lists of nonrepeated 1 and 2 elements, respectively. We may identify these basis elements with the tensor product of two multivectors of grade 1 and 2 , namely
w I 1 , I 2 = e I 1 e I 2 .
The dimension of the vector space spanned by these basis elements is k + n 1 k + n 2 ; the elements of this vector space can be identified with matrices A whose rows and columns are indexed by lists, I 1 I 1 and I 2 I 2 , respectively,
A = I 1 I 1 , I 2 I 2 A I 1 I 2 w I 1 , I 2 .
The transpose of a matrix element w I 1 , I 2 , denoted as w I 1 , I 2 T , is defined as w I 2 , I 1 . These matrices, the underlying vector space, and the operations that we describe next are fundamental in the study of changes of coordinates in space-time. However, consideration of these changes is beyond the scope of this paper. To any extent, this short section provides a perspective on matrices from the point of view of exterior algebra, highlighting the connections between multivectors and matrices, and bypassing the standard introduction of tensor fields.
As we did with multivectors, we consider the dot product · of two arbitrary matrix basis elements w I 1 , I 2 and w J 1 , J 2 . This dot product is written
w I 1 , I 2 · w J 1 , J 2 = Δ I 1 J 1 Δ I 2 J 2 .
The ordering within the pairs ( I 1 , I 2 ) and ( J 1 , J 2 ) is important in (16). This dot product, when applied to two matrices, is seen to give their Frobenius inner product, or equivalently, the square of the Frobenius norm (also known as the Hilbert–Schmidt norm) [20] when the product is of a matrix with itself.
We also define the matrix product × between two matrix basis elements w I , J and w K , L as
w I , J × w K , L = w I , L Δ J K ,
an operation that coincides with the standard product of two matrices for matrices labeled by spatial indices. For square matrices A indexed by grade m multivectors, it is natural to define the matrix inverse (whenever the inverse exists), denoted as A 1 , such that A 1 × A = I m = A × A 1 , where the grade square identity matrix, denoted by I , is given by
I = I I Δ I I w I , I .
Last, we define the matrix product × between a matrix w I , J and a multivector e K (or between a multivector e K and the matrix w J , I , i.e., the transpose of w I , J ) as
w I , J × e K = e K × w J , I = e I Δ J K ,
a generalization of the idea of multiplication of a row (or column) vector by a matrix.

2.5. Exterior and Matrix Calculus

In vector calculus, extensive use is made of the partial time derivative, t , and the nabla operator ∇ of partial space derivatives. In our case, we need the generalization to ( k , n ) space-time to the differential vector operator , defined as ( 0 , 1 , , k 1 , k , , k + n 1 ) , that is,
= i I Δ i i e i i .
As was done in [14] (Section 3), we define the exterior derivative, a , of a given multivector field a of grade m as
a = i I , I I m : i I Δ i i σ ( i , I ) i a I e i + I .
The grade of the exterior derivative of a is m + 1 , unless m = k + n , in which case the exterior derivative is zero. In addition, we define the interior derivative, a , of a as
a = i I , I I m : i I σ ( I \ i , i ) i a I e I \ i .
The grade of the interior derivative of a is m 1 , unless m = 0 , in which case the interior derivative is zero.
The formulae for the exterior and interior derivatives allow us to recover some standard formulae in vector calculus. For a scalar function ϕ , its gradient is given by its exterior derivative ϕ = ϕ , while for a vector field v , its divergence · v is given by its interior derivative · v = v .
Also, for a vector fields v in R 3 , taking into account [14] (Equation (18)), the curl can be variously expressed as × v = ( v ) H 1 = v H 1 = v H , thereby generalizing both the cross product and the curl to grade m vector fields in space-time algebras with different dimensions. Specific vector calculus formulae such as that for the divergence of a gradient or the curl of the curl of a vector can be seen as instances of general exterior calculus formulae such as [14] (Equation (38)) and [15] (Equation (35)),
( a b ) = a ( · b ) ( · a ) b ,
· ( a b ) = ( a ) · b + ( 1 ) gr ( a ) ( b ) · a ,
where in (23), a and b are 1-vectors, while in (24), a and b are ( s 1 ) -vector and s-vector, respectively.
The exterior and interior derivatives satisfy the property ( a ) = 0 = + ( a ) , for a general twice-differentiable multivector field a . These identities imply the well-known facts that the curl of the gradient and the divergence of the curl are zero.
The circulation C ( a , V ) and the flux F ( a , V ) of a multivector field a over an -dimensional space-time hypervolume V are defined as integrals of interior products of the field with infinitesimal integration volumes:
C ( a , V ) = V d x a ,
F ( a , V ) = V d x H 1 a .
As a specific example for (26), the flux of a field over an ( k + n ) -dimensional hypervolume is the volume integral of the field. For both of these operations, the interior product in the integrand is expressed as a differential form, which allows us to invoke the theory of differential forms to prove a Stokes theorem. This Stokes theorem relates the circulation (resp. the flux) of the field over the boundary of some hypervolume to the circulation (resp. flux) over the same hypervolume of the exterior (resp. interior) derivative of the multivector field.
We also define the tensor derivative of a , a , of a given vector field a of grade m as
a = i I , I I m Δ i i i a I w i , I ,
where w i , I is a matrix vector space basis element.
To conclude this section, we define a derivative operator with respect to an element of a vector space, e.g., a multivector field or a matrix. A relevant example of vector derivative operator is , where the derivative is taken with respect to the position vector x . In general, the vector derivative operator with respect to a multivector field a of grade m (resp. matrix A of dimensions 1 × 2 ) is a multivector field (resp. matrix) denoted by a (resp. A ) [21] and given by
a = a = I I m Δ I I e I a I ,
A = A = I I 1 , J I 2 Δ I I Δ J J w I , J A I , J .
Specifically, we shall later need the exterior vector derivative of a scalar function g ( x ) , denoted by a g ( x ) or with some abuse of notation simply by a g ( x ) , and given by
a g ( x ) = a g ( x ) = I I m Δ I I e I g ( x ) a I ,
and similarly for the matrix derivative. This exterior vector derivative is thus some form of generalized gradient. We shall need the derivative of a scalar function given by a quadratic form in the field and/or its interior or exterior derivatives. Let a and b represent two vectors of the same grade. Evaluation of the vector derivatives is straightforward and coincides with the infinitesimal calculus expressions [21]:
a ( a · a ) = 2 a
a ( a · b ) = b .

3. Principle of Stationary Action: Derivation of the Euler–Lagrange Equations

3.1. General Case: Lagrangian Dependent on the Tensor Derivative

As we briefly reviewed in the Introduction, in classical mechanics, one defines the action 𝒮 as a scalar quantity, with units of energy × time, that encodes the dynamical evolution of a physical system. Mathematically, the action 𝒮 is an integral functional of the trajectory or path over space-time, or of the Lagrangian density ( x ) for field theories, followed by the physical system. The principle of stationary action states the the path actually followed by the system, e.g., the field dynamics, corresponds to a stationary point of the action [1] (Ch. 19), [2] (Section 8).
In general, the application of the principle of stationary action gives the Euler–Lagrange equations, which describe the dynamics of the system [3] (Section I.3), [4] (Section 3.1), [5] (Section 7.2). We start by reviewing how to obtain these equations in coordinate-free form with tensorial notation. Differently from the usual approach, that gives the dynamics for the individual components of the field, our coordinate-free derivation directly works with some twice-differentiable multivector field.
a of grade s and its tensor derivative a in (27).
For a given region R that comprises the physical system under consideration, let the action 𝒮 ( a ) be given by
𝒮 ( a ) = R d k + n x ( a , a ) .
We assume that the region R is large enough to make the physical system closed, and that the fields decay fast enough over R so that the flux of the fields over the boundary of R is negligible. We note that the Lagrangian density is a real-valued function of the k + n s components of a and the ( k + n ) k + n s components of a , and the Lagrangian density does not depend explicitly on the space-time components.
Let the field a be infinitesimally perturbed by an amount a ε , possibly dependent on the space-time coordinates, so that the field is transformed as a a + a ε and the tensor derivative is transformed as a a + a ε . We assume that a ε is twice differentiable. We can expand the Lagrangian density in a first-order multivariate Taylor series, where the matrix of partial derivatives with respect to the ( k + n + 1 ) k + n s variables is a block matrix having along the diagonal the vector derivatives a and a of the density with respect to the field a and its tensorial derivative a , defined in (28) and (29), respectively. Neglecting terms of second and higher order in the perturbation a ε and grouping terms in the Taylor series yields
𝒮 ( a + a ε ) = R d k + n x + ( a ) · a ε + ( a ) · ( a ε ) .
We may thus evaluate the first-order change of action δ 𝒮 as
δ 𝒮 = 𝒮 ( a + a ε ) 𝒮 ( a ) = R d k + n x ( a ) · a ε + ( a ) · ( a ε ) ,
always neglecting all the contributions of order ( a ε ) 2 or higher in the action change.
Next, we note the following Leibniz product rule, an equality between scalar quantities proved in Appendix A.1, for a multivector field a and a matrix field B with basis w i , I , involving the product × defined in (19),
· ( B × a ) = ( × B ) · a + B · ( a ) .
Choosing a = a ε and B = a in (36), we can then rewrite Equation (35) as
δ 𝒮 = R d k + n x a × a · a ε + R d k + n x · a × a ε .
We identify the second integrand with a flux (26) over an ( k + n ) -dimensional hypervolume R and use the Stokes theorem [14] (Section 3.5) to rewrite the flux of the interior derivative of a vector field as the flux of the field itself across the region boundary R . The second integral in (37) then vanishes if we assume that the field a and its perturbation a ε vanish sufficiently fast at infinity.
Under this assumption, if the change of action is zero for any perturbation of the field a ε , the integrand in the first summand of (37) must be identically zero. Setting to zero the quantity between parentheses in the integrand yields the coordinate-free form of the Euler–Lagrangian equations,
a = × ( a ) ,
a = × ( a ) .
Both expressions in (38) and (39) are equivalent since they only differ in the notation for the vector derivative. It is also possible to recover a component form of the Euler–Lagrange equations from (39) [3] (Section I.3), [4] (Section 3.1), [5] (Section 7.2). Explicitly writing out the definitions in (28) and (29), we obtain the standard formula for each I I m ,
a I = i I i ( i a I ) .
In general, the use of coordinate-free expression as in (39) is closer to the common practice of vector calculus and allows us to better identify the algebraic structure of the underlying equations, which gets obscured when the components are used. Moreover, expressions as (39) are better suited to generalizations, or more properly, particularizations, to exterior calculus when the Lagrangian depends on the exterior and interior derivatives of the field, rather than the tensor derivative. This case is explored and analyzed in the next subsection.

3.2. Derivation of the Euler–Lagrange Equations in Exterior Algebraic Form

For electromagnetism, the Lagrangian density ( x ) is a function of the vector potential A , the bivector field F , and the source density vector J . Expressed in exterior calculus notation, the Lagrangian density is given by
( x ) = 1 2 F · F + J · A .
Remark 1.
If the field is represented by an antisymmetric tensor of rank 2, the factor before F · F becomes 1 4 to account for the repeated sum over pairs of indices [3] (Section I.5), [4] (Section 3.5), [2] (Section 27).
The Lagrangian density depends on the field through the potential A and its exterior derivative F = A [15] (Section 3). Instead of using (39), which was derived from the assumption that the Lagrangian density depends explicitly only the field and its tensor derivative, it is worth obtaining the Euler–Lagrange equations when the Lagrangian density is a function of a generic multivector field a of grade s, and its exterior and interior derivatives.
As in (33), for a given region R that comprises the physical system under consideration, and assumed to be large enough to make the physical system closed so that the fields decay fast enough over R and the flux of the fields over the boundary of R is arbitrarily small, the action 𝒮 ( a ) is given by the integral
𝒮 ( a ) = R d k + n x ( a , a , a ) .
Again, for an infinitesimal perturbation of the field a ε , and neglecting all the contributions of order ( a ε ) 2 or higher in the Taylor expansion of the Lagrangian density and the action, the first-order change in action δ 𝒮 is given by
δ 𝒮 = R d k + n x ( a ) · a ε + ( a ) · ( a ε ) + ( a ) · ( a ε ) .
From (35) in [15], given a vector a and a vector b of grade gr ( a ) + 1 , the Leibniz product rule in (24) holds.
Choosing a = a ε and b = a (resp. a = a and b = a ε ) in the second (resp. third) summand inside the integral, substituting these values in (24) and the result back into (43), we obtain
δ 𝒮 = R d k + n x a + ( 1 ) s + 1 ( a ) ( 1 ) s 1 ( + a ) · a ε + R d k + n x · a ε ( a ) + ( 1 ) s ( a ) a ε .
In the second integrand, a flux over the ( k + n ) -dimensional region R , the Stokes theorem [14] (Section 3.5) allows us to rewrite the flux of the interior derivative as the flux across the region boundary R . As both the field a and its perturbation a ε vanish sufficiently fast at infinity, the first-order change in action is given by
δ 𝒮 = R d k + n x a + ( 1 ) s + 1 ( a ) ( 1 ) s 1 ( a ) · a ε .
and the principle of stationary action, namely that the first-order change in action identically vanishes, leads to the coordinate-free form of the Euler–Lagrange equations, in one of the two equivalent forms:
a = ( 1 ) s ( a ) ( 1 ) s ( a )
a = ( 1 ) s ( a ) ( 1 ) s ( a ) .
It might appear that the tensorial and multivectorial expressions in (39) and (47) differ. If the Lagrangian density depends on the tensor derivative only through the interior and exterior derivatives, we verify in Appendix A.2 that both expressions are indeed identical and the following identity holds:
× ( a ) = ( 1 ) s ( a ) ( 1 ) s ( a ) .

4. Application to Generalized Electromagnetism: Maxwell Equations

4.1. Generalized Maxwell Equations

As application of the methods derived in the previous section, we study the generalized Maxwell equations [15] and their associated fields. For a given natural number r, the Maxwell field F ( x ) and the generalized source density J ( x ) are respectively characterized by multivector fields of grade r and r 1 at every point x of the flat ( k , n ) -space-time [15] (Section 3).
The potential field A ( x ) is a multivector field of grade r 1 such that
F = A .
If we replace the potential A by a new field A = A + A ¯ + G , where A ¯ is a constant ( r 1 ) -vector and G is an ( r 2 ) -vector gauge field, the homogenous Maxwell Equation (51) is unchanged [15] (Section 3). For a given Maxwell field, there is therefore some unavoidable (gauge) ambiguity on the value of the vector potential.
Scalar fields are given by the vector potential by setting r = 1 in Minkowski space-time, namely k = 1 and n = 3 . For classical electromagnetism ( r = 2 , k = 1 , n = 3 ), the bivector field is usually expressed as an antisymmetric tensor of rank 2; electrostatics and magnetostatics are recovered for k = 0 , n = 3 , by setting r = 1 and r = 2 , respectively. The generalized Maxwell equations for arbitrary values of r, k, and n are the following pair of coupled differential equations:
F = J ,
F = 0 .
The interior derivative in (50) and the exterior derivative in (51) are respectively defined in (22) and (21). As we stated in Section 2.5, the interior derivative lowers the grade by one, while the exterior derivative increases the grade by one; therefore, Equation (50) is an identity of ( r 1 ) -vectors while Equation (51) is an identity of ( r + 1 ) -vectors.

4.2. Lagrangian Density for Generalized Electromagnetism

For electromagnetism, the Lagrangian density ( x ) is a function of the potential A , the Maxwell field F , and the source density J . Expressed in exterior calculus notation, we postulate the generalized Lagrangian density to be
( x ) = ( 1 ) r 1 2 F · F + J · A .
For classical electromagnetism ( r = 2 , k = 1 , n = 3 ), if the field is expressed as an antisymmetric tensor of rank 2 the factor before F · F becomes 1 4 , see Remark 1. In contrast, for electrostatics ( r = 1 , k = 0 , n = 3 ), the Lagrangian density is given by = 1 2 E · E + ρ ϕ , where E is the electric field, ρ the charge density, and ϕ is the opposite in sign of the usual electric potential, that is, E = ϕ = ϕ [1] (Ch. 19).
While the Lagrangian in (52) leads to the generalized Maxwell Equations (50) and (51), as we shall see in the following section, it is not the most general Lagrangian associated to electromagnetism. Two terms that can be added to it respectively deal with the hypothetical mass of the photon, that is, the Proca term [4] (p. 107), [22] (Section 12.8), and a gauge-fixing term that appears in the context of quantization of the electromagnetic field [3] (Section II.7), [4] (Section 7.1). This general Lagrangian density for electromagnetism is now given by
( x ) = ( 1 ) r 1 2 F · F + J · A 1 2 m 2 A · A + ( 1 ) r 1 2 ξ ( A ) · ( A ) ,
where m is the hypothetical photon mass and ξ is a parameter that determines the so-called R ξ gauge; for ξ = 1 , we have the Feynman gauge, and in the limit ξ 0 , we have the Landau gauge.

4.3. Euler–Lagrange Equations

For Lagrangian densities such as (52) or (53), which are essentially quadratic forms in the field and/or its interior or exterior derivatives, evaluation of the vector derivatives is straightforward, as the derivative has the same form as that obtained in infinitesimal calculus for the derivative of a polynomial (31) and (32). For the Lagrangian density in (52), evaluation of the derivatives in the Euler–Lagrange Equation (47) give
A = J ,
A = ( 1 ) r 1 ( A ) ,
from which the Euler–Lagrange equations themselves (46), with s = r 1 , can be expressed as
J = ( A ) = F ,
namely the generalized nonhomogenous Maxwell Equation (50) for arbitrary r, k, and n. The homogeneous Maxwell Equation (51) is also satisfied as a consequence of the definition of F = A .
The exterior algebraic formulation of the Lagrangian and the Euler–Lagrange equations brings the advantage of allowing for a more direct derivation of the Maxwell equations, since evaluation of the vector derivatives mimicks more closely the steps carried out in usual differential calculus to evaluate the derivatives.
The factor ( 1 ) r 1 in the Lagrangian density is needed to compensate for the identical term ( 1 ) r 1 that appears in the Euler–Lagrange Equation (47). An alternative way of writing the Lagrangian density, without this sign factor, would involve replacing one of the exterior derivatives A by a right exterior derivative A , where the partial derivative operator is understood to act from the right on the potential. In this case, the skew commutativity of the wedge product, A = ( 1 ) r 1 ( A ) [14] (Section 2.2), cancels the sign in the Lagrangian density and results in a somewhat neater expression for it.
As for the Lagrangian density in (53), evaluation of the derivatives in the Euler–Lagrange Equation (47) give
A = J m 2 A ,
A = ( 1 ) r 1 ( A ) ,
A = ( 1 ) r 1 1 ξ ( A ) ,
from which the Euler–Lagrange Equation (46) with the Proca and quantization R ξ -gauge terms become
( A ) + m 2 A = J + 1 ξ ( A ) .
Using the relationship (34) in [15], we may rewrite (60) in an alternative form with a wave equation,
( 1 ) r 1 ( · ) A + m 2 A = J + 1 ξ 1 ( A ) ,
which somewhat simplifies in the Feynman gauge, for which ξ = 1 .

4.4. Dual Generalized Maxwell Equations

An interesting dual form of Maxwell equations is obtained by swapping the roles played by the interior and exterior derivatives in the Lagrangian density and the Maxwell equations themselves. Let the “potential” A ¯ and “source density” J ¯ be multivectors of grade s, and let us define a dual Maxwell field F ¯ of grade r = s 1 by F ¯ = A ¯ . The Lagrangian density is now given by
( x ) = ( 1 ) r 2 ( A ¯ ) · ( A ¯ ) + J ¯ · A ¯
= 1 2 ( A ¯ ) · ( A ¯ ) + J ¯ · A ¯ ,
where we used the relationship between left and right interior derivatives A ¯ = ( 1 ) s + 1 ( A ¯ ) to write (63). Direct evaluation of the Euler–Lagrange Equation (47), with r = s 1 , gives
J ¯ = F ¯ .
This nonhomogeneous “Maxwell” equation is complemented by a homogeneous equation F ¯ = 0 , itself a consequence of the definition of F ¯ as F ¯ = A ¯ .
As it happened with the generalized Maxwell equations, the exterior algebraic formulation of the Lagrangian and the Euler–Lagrange equations allows for a more direct derivation of the dual Maxwell equations. An interesting question, which we do not dwell upon as it lies beyond the scope of this paper, is whether the physics of the dual Maxwell equations is different from the usual Maxwell equations, or simply involves a transformation of the fields, potential, and source density, with no new phenomena. Along this direction, and leaving the details left as an exercise to the reader, it is relatively easy to verify that one obtains a wave equation relating A ¯ and J ¯ in a “Lorenz gauge” where A ¯ = 0 . Solutions to this wave equation have several independent degrees of freedom or polarizations. The number of these polarizations is k + n 2 r 1 , as for the standard Maxwell Equation [15] (Section 4.3); this number can be justified as the number of possible ( r + 1 ) -vectors where two dimensions, one temporal and one spatial are fixed, and the remaining r + 1 2 = r 1 indices have to be filled with k + n 2 possible values. We also have a “Lorentz force” density f = J ¯ F ¯ such that a conservation law holds for the stress–energy-momentum tensor T em of the field [15] (Appendix A.2), as for the usual Maxwell equations.

Author Contributions

Conceptualization, A.M.; methodology, I.C., J.F.-S. and A.M.; investigation, I.C. and A.M.; writing—original draft preparation, A.M.; writing—review and editing, I.C., J.F.-S. and A.M. All authors have read and agreed to the published version of the manuscript.

Funding

This work was funded in part by the Spanish Ministry of Science, Innovation and Universities under grants TEC2016-78434-C3-1-R and BES-2017-081360.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Acknowledgments

The authors acknowledge the anonymous reviewers for the constructive comments and suggestions that have helped to improve the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Appendix A.1. Proof of the Leibniz Product Rule in (36)

Let us consider a multivector field a of grade m and a matrix field B with basis w i , I = e i e I , where | I | = m . Using the definitions of dot and matrix product (4) and (19), the first term of the right-hand side of (36) is evaluated as
( × B ) · a = i , j I , J I m Δ i i i b j , J e i × ( e j e J ) · I I m a I e I
= i I , J I m i b i , J e J · I I m a I e I
= i I , I I m Δ I I a I i b i , I ,
where in (A1), we wrote the components of B and a , in (A2), we computed the matrix product and removed the j index, and in (A3), we carried out the dot product and removed the J index.
In turn, the second term in the right-hand side of (36) can similarly be evaluated using the dot and times products in (16) and (19) as
B · ( a ) = i I , I I m b i , I e i e I · j I , J I m Δ j j j a J e j e J
= i I , I I m Δ I I b i , I i a I .
Next, using again the definitions of dot and matrix product (4) and (17), the left-hand side of (36) becomes
· ( B × a ) = i I Δ i i e i x i · I I m j I , J I m a I b j , J ( e j e J ) × e I
= i I Δ i i e i x i · j I , I I m a I b j , I Δ I I e j
= i I , I I m Δ I I i a I b i , I .
Summing (A3) and (A5) and applying the rule for the derivative of a product yields the desired (36).

Appendix A.2. Identity between Tensorial and Exterior Algebraic Euler–Lagrange Equations

Since both the exterior and interior derivatives are surjective linear functions of the tensor derivative, the respective vector derivatives of the Lagrangian density are related. Each component of the exterior and interior derivatives (21) and (22) is a scalar (affine) function of several distinct components of the tensor derivative. The Lagrangian density depends on the components of the tensor derivative only through these scalar functions. We thus need to compute the derivative of a function g 1 ( z ) , , g ( z ) , where z stands for a vector with the = ( k + n ) k + n s components of the tensor derivative, and ( g 1 , , g ) are the (differentiable) functions that give the components of the exterior (resp. interior) derivative from the tensor derivative, where = k + n s + 1 (resp. = k + n s 1 ). By construction, a given z k = j a J appears only in one g i ( z ) , the I component of either the exterior or the interior derivative. In the former case, I = j + J , in the latter case I = J \ j .
From the definition of partial derivative, and for any i = 1 , , , we have the relation
g i ( z ) = lim h 0 g 1 ( z ) , , g i ( z ) + h , , g ( z ) g 1 ( z ) , , g ( z ) h .
Then, assuming that g i ( z ) z k 0 and defining h i k = h g i / z k , we can then write for every value of k such that the partial derivative z k = j a J appears g i ( z ) ,
g i ( z ) + h = g i ( z ) + h i k g i ( z ) z k g i ( z 1 , , z k + h i k , , z ) ,
where we used the differentiablility of the function g i . Substituting (A10) back into (A9) yields
g i ( z ) = 1 g i / z k lim h i k 0 g 1 ( z ) , , g i ( z 1 , , z k + h i k , , z ) , , g ( z ) g 1 ( z ) , , g ( z ) h i k .
Since the z k component appears only in one of the functions g i , the limit in (A11) is the partial derivative of the Lagrangian density with respect to the kth component of the tensor derivative, for any k, that is,
g i ( z ) = 1 g i / z k z k .
We now proceed to evaluate the vector derivative with respect to the exterior derivative. First, we have
( a ) = I I s + 1 ( a ) I Δ I I e I ,
where the Ith component of the exterior derivative, ( a ) I is the equivalent of g i ( z ) in (A12). The equivalent of k is any pair of j and J I s such that I = j + J and the corresponding z k is j a J . The partial derivative g i / z k in (A12) is thus Δ j j σ ( j , J ) , of value ± 1 , and we therefore have for any pair of i and J I s such that I = j + J that
( a ) I = 1 Δ j j σ ( j , J ) ( j a J ) = Δ j j σ ( j , J ) ( j a J ) .
Substituting (A14) back in (A13) yields
( a ) = I I s + 1 ( j a J ) Δ j j Δ I I σ ( j , J ) e I ,
where j and J are any pair such that I = j + J . Now, taking the interior derivative of (A15), we obtain for any j , J such that I = j + J ,
( a ) = i I Δ i i e i i I I s + 1 ( j a J ) Δ j j Δ I I σ ( j , J ) e I
= i I , I I s + 1 : i I Δ i i Δ I I i ( i a I \ i ) σ ( i , I \ i ) σ ( I \ i , i ) e I \ i ,
where we have selected j = i and, therefore, J = I \ i . We also note that σ ( i , I \ i ) σ ( I \ i , i ) = ( 1 ) s .
We now evaluate the vector derivative with respect to the interior derivative in an analogous manner,
( a ) = I I s 1 ( j a J ) Δ I I σ ( J \ j , j ) e I ,
where j and J are any pair such that I = J \ j . Now, taking the exterior derivative of (A18), we obtain, for any j , J such that I = J \ j ,
( a ) = i I Δ i i e i i I I s 1 ( j a J ) Δ I I σ ( J \ j , j ) e I
= i I , I I s 1 : i I Δ i i Δ I I i ( i a i + I ) σ ( I , i ) σ ( i , I ) e i + I ,
where we have selected j = i and, therefore, J = i + I . We also note that σ ( I , i ) σ ( i , I ) = ( 1 ) s 1 .
Putting (A17) and (A20), as well as the relationships between the product of permutation signatures, back into the right-hand side of (48) yields the expression
i I , I I s + 1 : i I Δ i i Δ I I i ( i a I \ i ) e I \ i + i I , I I s 1 : i I Δ i i Δ I I i ( i a i + I ) e i + I .
Since the basis elements are multivectors with s components, we may rewrite (A21) as
i I , I I s : i I Δ I I i ( i a I ) e I + i I , I I s : i I Δ I I i ( i a I ) e I .
The two summations in (A21) might be further combined in a single summation over I I s . The resulting expression coincides with the left-hand side of (48), which can be expanded using the computation in (40) into
i , I I s Δ I I i ( i a I ) e I .

References

  1. Feynman, R.P.; Leighton, R.B.; Sands, M. The Feynman Lectures on Physics, Vol. II: Mainly Electromagnetism and Matter; Addison-Wesley: Boston, MA, USA, 1977. [Google Scholar]
  2. Landau, L.D.; Lifshitz, E.M. The Classical Theory of Fields. In Course of Theoretical Physics, 4th ed.; Butterworth-Heinemann: Oxford, UK, 1987; Volume 2. [Google Scholar]
  3. Zee, A. Quantum Field Theory in a Nutshell. In Nutshell Handbook; Princeton University Press: Princeton, NJ, USA, 2003. [Google Scholar]
  4. Maggiore, M. A Modern Introduction to Quantum Field Theory. In Oxford Master Series in Statistical, Computational, and Theoretical Physics; Oxford Univeristy Press: Oxford, OH, USA, 2005. [Google Scholar]
  5. Weinberg, S. The Quantum Theory of Fields, Volume 1: Foundations; Cambridge University Press: Cambridge, UK, 2005. [Google Scholar]
  6. Lanczos, C. The Variational Principles of Mechanics, 4th ed.; University of Toronto Press: Toronto, ON, Canada, 1970; 418p. [Google Scholar]
  7. Gibbs, J.W.; Wilson, E.B. Vector Analysis. In New Haven; Yale University Press: New Haven, CT, USA, 1929. [Google Scholar]
  8. Clifford, W.K. Mathematical Papers; Macmillan: London, UK, 1882. [Google Scholar]
  9. Doran, C.; Lasenby, A. Geometric Algebra for Physicists; Cambridge University Press: Cambridge, UK, 2003. [Google Scholar]
  10. Ricci, M.M.G.; Levi-Civita, T. Méthodes de calcul différentiel absolu et leurs applications. Math. Ann. 1900, 54, 125–201. [Google Scholar] [CrossRef] [Green Version]
  11. Cartan, E. Les Systemes Differentiels Exterieurs Et Leurs Applications Geometriques; Hermann & Cie: Paris, France, 1945. [Google Scholar]
  12. Arnold, V.I.; Weinstein, A.; Vogtmann, K. Mathematical Methods of Classical Mechanics; Springer: Berlin, Germany, 1989. [Google Scholar]
  13. Grassmann, H. Extension Theory. In Number 19 in History of Mathematics Sources; American Mathematical Society: Providence, RI, USA, 2000. [Google Scholar]
  14. Colombaro, I.; Font-Segura, J.; Martinez, A. An Introduction to Space–Time Exterior Calculus. Mathematics 2019, 7, 564. [Google Scholar] [CrossRef] [Green Version]
  15. Colombaro, I.; Font-Segura, J.; Martinez, A. Generalized Maxwell equations for exterior-algebra multivectors in (k, n) space-time dimensions. Eur. Phys. J. Plus 2020, 135, 1–31. [Google Scholar] [CrossRef]
  16. Lasenby, A.; Doran, C.; Gull, S. A Multivector Derivative Approach to Lagrangian Field Theory. Found. Phys. 1993, 23, 1295–1327. [Google Scholar] [CrossRef]
  17. Martinez, A.; Font-Segura, J.; Colombaro, I. An Exterior-Algebraic Derivation of the Symmetric Stress-Energy-Momentum Tensor in Flat Space-Time. Eur. Phys. J. Plus 2021, 136, 1–28. [Google Scholar] [CrossRef]
  18. Lovelock, D.; Rund, H. Tensors, Differential Forms, and Variational Principles; Dover Publications: New York, NY, USA, 1989. [Google Scholar]
  19. Flanders, H. Differential Forms with Applications to the Physical Sciences; Dover Publications: New York, NY, USA, 1989. [Google Scholar]
  20. Horn, R.A.; Johnson, C.R. Matrix Analysis, 2nd ed.; Cambridge University Press: Cambridge, UK, 2013. [Google Scholar]
  21. Dwyer, P.S.; Macphail, M.S. Symbolic Matrix Derivatives. Ann. Math. Statist. 1948, 19, 517–534. [Google Scholar] [CrossRef]
  22. Jackson, J.D. Classical Electrodynamics, 3rd ed.; John Wiley & Sons: Hoboken, NJ, USA, 1999. [Google Scholar]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Colombaro, I.; Font-Segura, J.; Martinez, A. An Exterior Algebraic Derivation of the Euler–Lagrange Equations from the Principle of Stationary Action. Mathematics 2021, 9, 2178. https://doi.org/10.3390/math9182178

AMA Style

Colombaro I, Font-Segura J, Martinez A. An Exterior Algebraic Derivation of the Euler–Lagrange Equations from the Principle of Stationary Action. Mathematics. 2021; 9(18):2178. https://doi.org/10.3390/math9182178

Chicago/Turabian Style

Colombaro, Ivano, Josep Font-Segura, and Alfonso Martinez. 2021. "An Exterior Algebraic Derivation of the Euler–Lagrange Equations from the Principle of Stationary Action" Mathematics 9, no. 18: 2178. https://doi.org/10.3390/math9182178

APA Style

Colombaro, I., Font-Segura, J., & Martinez, A. (2021). An Exterior Algebraic Derivation of the Euler–Lagrange Equations from the Principle of Stationary Action. Mathematics, 9(18), 2178. https://doi.org/10.3390/math9182178

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop