As preliminaries, a few basic tools from variational calculus will be reviewed. Let be independent variables and be dependent variables, and let denote all k-th order partial derivatives of u with respect to x. Introduce an index notation for the components of x and u: , ; and , . In this notation, the components of are given by , , , with . Summation is assumed over each pair of repeated indices in any expression. The coordinate space is called the jet space associated with the variables . A differential function is a locally smooth function of finitely many variables in J. Total derivatives with respect to x applied to differential functions are denoted .
The necessary tools that will now be introduced are the Fréchet derivative and its adjoint derivative, the Euler operator and its product rule, and the Helmholtz conditions.
This linearization can be viewed as a local directional derivative in jet space, corresponding to the action of a generator in characteristic form, , where is a set of m arbitrary differential functions.
2.1. Conservation Laws and Symmetries
Consider an
N-th-order system of
DEs
The space of solutions of the system will be denoted . When the number of independent variables x is , each DE is an ordinary differential equation (ODE), whereas when the number of independent variables x is , each DE is a partial differential equation (PDE). The number, m, of dependent variables u need not be the same as the number, M, of DEs in the system.
A
local infinitesimal symmetry [
24,
31,
32] of a given DE system (
14) is a generator
whose prolongation leaves invariant the DE system
which holds on the whole solution space
of the system. (In this determining equation, the notation
means that the given DE system, as well as its differential consequences, are to be used). The differential functions
and
in the symmetry generator are called the
symmetry characteristic functions. When acting on the solution space
, an infinitesimal symmetry generator can be formally exponentiated to produce a one-parameter group of transformations
, with parameter
ϵ, where the infinitesimal transformation is given by
for all solutions
of the DE system.
Two infinitesimal symmetries are equivalent if they have the same action (
17) on the solution space
of a given DE system. An infinitesimal symmetry is thereby called
trivial if it leaves all solutions
unchanged. This occurs iff its characteristic functions satisfy the relation
The corresponding generator (
15) of a trivial symmetry is thus given by
which has the prolongation
. Conversely, any generator of this form on the solution space
represents a trivial symmetry. Thus, any two generators that differ by a trivial symmetry are equivalent. The differential order of an infinitesimal symmetry is defined to be the smallest differential order among all equivalent generators.
Any symmetry generator is equivalent to a generator given by
under which
u is infinitesimally transformed while
x is invariant, due to the relation
This generator (
20) defines the
characteristic form for the infinitesimal symmetry. The symmetry invariance (
16) of the DE system can then be expressed by
holding on the whole solution space
of the given system. Note that the action of
is the same as a Fréchet derivative (
1), and hence, an equivalent, modern formulation [
24,
29,
31] of this invariance (
22) is given by the
symmetry determining equation(Recall, the notation means that the given DE system, as well as its differential consequences, are to be used in these determining equations.)
In jet space
J, a group of transformations
with a non-trivial generator
in general will not act in a closed form on
and derivatives
up to a finite order, except [
24,
31] for point transformations acting on
and contact transformations acting on
. Moreover, a contact transformation is a prolonged point transformation when the number of dependent variables is
[
24,
31]. A
point symmetry is defined as a symmetry transformation group on
, whose generator is given by characteristic functions of the form
corresponding to the infinitesimal point transformation
Likewise, a
contact symmetry is defined as a symmetry transformation group on
whose generator corresponds to an infinitesimal transformation that preserves the contact relations
. The set of all admitted point symmetries and contact symmetries for a given DE system comprises its group of
Lie symmetries. The corresponding generators of this group comprise a Lie algebra [
24,
31,
32].
A
local conservation law of a given DE system (
14) is a divergence equation
which holds on the whole solution space
of the system, where
is the
conserved current vector. In the case when one of the independent variables represents a time coordinate and the remaining
independent variables represent space coordinates, namely
, then
is a
conserved density and
is a
spatial flux vector, while the conservation law has the form of a local continuity equation
. (Similarly to the symmetry determining equation, the notation
here means that the given DE system, as well as its differential consequences, are to be used).
A conservation law (
26) is
locally trivial if
holds for some differential antisymmetric tensor function
on
, since any total curl is identically divergence free,
due to the commutativity of total derivatives. Two conservation laws are said to be
locally equivalent if, on the solution space
, their conserved currents differ by a locally trivial current (
28). The
differential order of a conservation law is defined to be the smallest differential order among all locally equivalent conserved currents. (Sometimes a local conservation law is itself defined as the equivalence class of locally equivalent conserved currents).
For a given DE system (
14), the set of all non-trivial local conservation laws (up to local equivalence) forms a vector space on which the local symmetries of the system have a natural action [
24,
31,
33]. In particular, the infinitesimal action of a symmetry (
15) on a conserved current (
27) is given by [
24]
When the symmetry is expressed in characteristic form (
20), its action has the simple form
The conserved currents
and
are locally equivalent,
with
which follows from the relation (
21).
A DE system is
variational if it arises as the Euler–Lagrange equations of a local Lagrangian. This requires that the number of equations in the system is the same as the number of dependent variables,
, and that the differential order
N of the system is even, in which case the system is given by
where the Lagrangian is a differential function
The necessary and sufficient conditions [
24,
29,
31,
32] for a given DE system (
14) to be variational consist of the Helmholtz conditions (
13), which are given by
where
is a set of arbitrary differential functions. Note that these conditions (
35) are required to hold identically in jet space
J (and not just on the solution space
of the DE system).
2.2. Ibragimov’s Conservation Law Formula
The starting point is the well-known observation [
24] that any
N-th-order system of
DEs (
14) can be embedded into a larger system by appending an “adjoint variable” for each DE in the system, where this set of
variables
is taken to satisfy the adjoint of the linearization of the original DE system. Specifically, the enlarged DE system is given by
for
and
, in Ibragimov’s notation. This system (
36)–(
37) comprises the Euler–Lagrange equations of the Lagrangian function
since, clearly,
through the product rule (
9).
All solutions
of the original DE system (
36) give rise to solutions of the Euler–Lagrange system (
39) by letting
be any solution (for instance
) of the DEs (
37). Conversely, all solutions
of the Euler–Lagrange system (
39) yield solutions of the original DE system (
36) by projecting out
.
This embedding relationship can be used to show that every symmetry of the original DE system (
36) can be extended to a variational symmetry of the Euler–Lagrange system (
39). The proof is simplest when the symmetries are formulated in characteristic form (
20).
Let
be any local symmetry generator (in characteristic form) admitted by the DE system (
36). Under some mild regularity conditions [
29] on the form of these DEs, the symmetry determining Equation (
23) implies that the characteristic functions
satisfy
where
is some linear differential operator whose coefficients
are differential functions that are non-singular on solution space
of the DE system (
14). Now, consider the action of this symmetry generator (
40) on the Lagrangian (
38). From the operator relation (
41) followed by integration by parts, the symmetry action is given by
where
is the adjoint of the operator (
42), with the non-singular coefficients
Although the Lagrangian is not preserved, the expression (
43) for the symmetry action shows that if the symmetry is extended to act on
v via
then, under this extended symmetry, the Lagrangian will be invariant up to a total divergence,
This completes the proof. A useful remark is that the vector
in the total divergence (
47) is a linear expression in terms of
(and total derivatives of
), and hence, this vector vanishes whenever
is a solution of the DE system (
36). Consequently,
is a trivial current for the Euler–Lagrange system (
39).
Some minor remarks are that the proof given by Ibragimov [
9] does not take advantage of the simplicity of working with symmetries in characteristic form and also glosses over the need for some regularity conditions on the DE system so that the symmetry operator relation (
41) will hold. Moreover, that proof is stated only for DE systems in which the number of equations is the same as the number of dependent variables,
.
Now, since the extended symmetry (
46) is variational, Noether’s theorem can be applied to obtain a corresponding conservation law for the Euler–Lagrange system (
39), without the need for any additional conditions. The formula in Noether’s theorem comes from applying the variational identity (
5) to the Lagrangian (
38), which yields
for any generator
The total divergence term
is given by the formula (
8) derived using the Euler operator (
6). This yields
When this variational identity (
48) is combined with the action (
47) of the variational symmetry (
46) on the Lagrangian, the following Noether relation is obtained:
where
is expression (
37). Since
,
and
vanish when
is any solution of the Euler–Lagrange system (
39), the Noether relation (
51) yields a local conservation law
where
denotes the solution space of the system (
39) (including its differential consequences). This conservation law is locally equivalent to the conservation law formula underlying Ibragimov’s work [
9,
12], which is given by
where
since
. Strangely, nowhere does Ibragimov (or subsequent authors) point out that the term
in the conserved current trivially vanishes on all solutions
of the Euler–Lagrange system!
Hence, the following result has been established.
Proposition 1. Any DE system (
36)
can be embedded into a larger Euler–Lagrange system (
39)
such that every symmetry (
40)
of the original system can be extended to a variational symmetry (
46)
of the Euler–Lagrange system. Noether’s theorem then yields a conservation law (
52)
for all solutions of the Euler–Lagrange system (
39).
A side remark is that the locally equivalent conservation law (
53) also can be derived from Noether’s theorem if the extended symmetry (
46) is expressed in canonical form
as obtained from relations (
20)–(
21). In particular, the corresponding form of the variational identity (
48) becomes
where
and
, while the action of the symmetry (
54) on the Lagrangian is given by
where
vanishes whenever
is a solution of the DE system (
36). Hence, the Noether relation obtained from combining Equations (
55) and (
56) yields the conserved current
modulo the locally trivial current
. If the original symmetry (
40) being used is a point symmetry, then this trivial current
can be shown to vanish identically, which is the situation considered in Ibragimov’s papers [
9,
12] and in nearly all subsequent applications in the literature.
2.3. “Nonlinear Self-Adjointness”
The conservation law (
52) holds for all solutions
of the Euler–Lagrange system (
39). It seems natural to restrict this to solutions of the original DE system (
36) for
by putting
. However, the resulting conserved current is trivial,
, because
L is a linear expression in terms of
v. Consequently, some other way must be sought to project the solution space
of the Euler–Lagrange system onto the solution space
of the original DE system (
36).
Ibragimov’s first paper [
9] proposes to put
, which is clearly a significant restriction on the form of the original DE system (
36). In particular, this requires that
hold identically, where the DE system is assumed to have the same number of equations as the number of dependent variables,
, which allows the indices
to be identified. He calls such a DE system
“strictly self-adjoint”. This definition is motivated by the case of a linear DE system, since linearity implies that
and
are identities, whereby a linear DE system with
is “strictly self-adjoint” iff it satisfies
, which is the condition for the self-adjointness of a linear system. However, for nonlinear DE systems, the definition of “strictly self-adjoint” conflicts with the standard of definition [
6,
24] in variational calculus that a general DE system
is self-adjoint iff its associated Fréchet derivative operator is self-adjoint,
, which requires
.
Ibragimov subsequently [
10] proposed to have
, which he called “quasi-self-adjointness”. A more general proposal
was then introduced first in Reference [
14] and shortly later appears in Ibragimov’s next paper [
12], with the condition that
must hold for some coefficients
, again with
. This condition is called “weak self-adjointness” in Reference [
14] and “nonlinear self-adjointness” in Reference [
12]. Ibragimov also mentions an extension of this definition to
, but does not pursue it. Later, he applies this definition in Reference [
13] to a specific PDE, where
is extended to be a linear differential operator. However, unlike in the previous papers, no conservation laws are found from using this extension. A subsequent paper [
15] then uses this extension, which is called “nonlinear self-adjointness through a differential substitution”, to obtain conservation laws for several similar PDEs. Finally, the same definition is stated more generally in Reference [
17] for DE systems with
:
where the coefficients
are differential functions.
These developments lead to the following conservation law theorem, which is a generalization of Ibragimov’s main theorem [
9,
12] to arbitrary DE systems (not restricted by
), combined with the use of a differential substitution [
12,
15,
17].
Theorem 1. Suppose a system of DEs (
14)
satisfiesfor some differential functions and , ,…, that are non-singular on the solution space of the DE system, where is the adjoint linearization (
2)
of the system. Then, any local symmetryadmitted by the DE system yields a local conservation law (
26)
given in an explicit form by the conserved current (
50)
with and . An important remark is that all of the functions
,
must be non-singular on
, as otherwise, the condition (
58) can be satisfied in a trivial way. This point is not mentioned in any of the previous work [
9,
12,
14,
15,
17].
The “nonlinear self-adjointness” condition (
58) turns out to have a simple connection to the determining equations for symmetries. This connection is somewhat obscured by the unfortunate use of non-standard definitions and non-standard notation in References [
9,
12]. Nevertheless, it is straightforward to show that Equation (
58) is precisely the adjoint of the determining Equation (
23) for symmetries formulated as an operator Equation (
41).
2.4. Adjoint-Symmetries and a Formula for Generating Conservation Laws
For any given DE system (
14), the adjoint of the symmetry determining Equation (
23) is given by
for a set of differential functions
. (Similarly to the symmetry determining equation, the notation
here means that the given DE system, as well as its differential consequences, are to be used). These differential functions are called an
adjoint-symmetry [
4], in analogy to the characteristic functions of a symmetry (
40), and so, Equation (
60) is called the
adjoint-symmetry determining equation. As shown in Reference [
29], this analogy has a concrete geometrical meaning in the case when a DE system is an evolutionary system
with
and
, where
t is a time coordinate and
,
, are space coordinates. In this case,
can be viewed as the coefficients of a one-form or a covector
, in analogy to
being the coefficients of a vector
. The condition for
to be a symmetry can be formulated as
where
denotes the Lie derivative [
24,
29] with respect to the time evolution vector
. Then, the condition for
to be an adjoint-symmetry is equivalent to
. (Note the awkwardness in the index positions here comes from Ibragimov’s choice of index placement
for a DE system with
. A better notation would be
and
when
, which is used in References [
4,
26,
27,
31].)
In the case when a DE system is variational (
33), the symmetry determining equation is self-adjoint, since
. Then, the adjoint-symmetry determining Equation (
60) reduces to the symmetry determining Equation (
23), with
, where the indices
can be identified, due to
. Consequently, adjoint-symmetries of any variational DE system are the same as symmetries.
Other aspects of adjoint-symmetries and their connection to symmetries are discussed in Reference [
34].
Now, under some mild regularity conditions [
29] on the form of a general DE system (
14), the adjoint-symmetry determining Equation (
60) implies that the functions
satisfy
where
is some linear differential operator whose coefficients
are differential functions that are non-singular on the solution space
of the DE system (
14). In Ibragimov’s notation
, the adjoint-symmetry Equation (
61) coincides with the “nonlinear self-adjointness” condition (
58) in Theorem 1, where the operator on the right-hand side of Equation (
58) is precisely the adjoint-symmetry operator (
62).
Therefore, the following equivalence has been established.
Proposition 2. For a general DE system (
14),
the condition (
58)
of “nonlinear self-adjointness” coincides with the condition of existence of an adjoint-symmetry (
60).
When a DE system is variational (
33),
these conditions reduce to the condition of the existence of a symmetry. One remark is that the formulation of “nonlinear self-adjointness” given here is more general than what appears in References [
12,
15,
17] since those formulations assume that the DE system has the same number of equations as the number of dependent variables,
. Another remark is that the meaning of “nonlinear self-adjointness” shown here in the case of variational DE systems has not previously appeared in the literature.
Example: Consider the class of semilinear wave equations
for
, with a nonlinearity coefficient
, damping coefficients
and a mass-type coefficient
. In Reference [
20], the conditions under which a slightly more general family of wave equations is “nonlinearly self-adjoint” (
58) are stated for
. These results will be generalized here by considering
. A first observation is that this class of wave equations admits an equivalence transformation
, with
, which can be used to put
by
where
. (Equivalence transformations were not considered in Reference [
20], and so, their results are considerably more complicated than is necessary). This transformation gives
In Ibragimov’s notation, the condition of “nonlinear self-adjointness” with
is given by
where
For comparison, the determining Equation (
23) for local symmetries
(in characteristic form) is given by
Its adjoint is obtained by multiplying by
and integrating by parts, which yields
. After the
terms are expanded out, this gives the determining Equation (
60) for local adjoint-symmetries
which coincides with the “nonlinear self-adjointness” condition (
65) extended to differential substitutions [
12,
14,
16] given by
. All adjoint-symmetries of lowest-order form
can be found in a straightforward way. After
is substituted into the determining Equation (
67) and
is eliminated through the wave Equation (
63), the determining equation splits with respect to the variables
and
, yielding a linear overdetermined system of four equations (after some simplifications):
It is straightforward to derive and solve this determining system by Maple. Hereafter, the conditions
will be imposed, which corresponds to studying wave equation (
63) whose lower-order terms are nonlinear and homogeneous. The general solution of the determining system (
68)–(
70) then comprises three distinct cases (as obtained using the Maple package ’rifsimp’), after merging. This leads to the following complete classification of solution cases shown in
Table 1. The table is organized by listing each solution
Q and the conditions on
for which it exists. (From these conditions, a classification of maximal linear spaces of multipliers can be easily derived). Note that if the transformation
is inverted, then
Q transforms to
. (Also note that, under the restriction
considered in Reference [
20], the classification reduces to just the first case with
and
).
The Fréchet derivative operator in the symmetry determining Equation (
23) and the adjoint of this operator in the adjoint-symmetry determining Equation (
60) are related by the integration-by-parts formula (
3). For a general DE system (
14), this formula is given by
where the vector
is given by the explicit expression (
4) with
,
, and
. As shown in References [
2,
3,
4], this vector
will be a conserved current
whenever the differential functions
and
respectively satisfy the symmetry and adjoint-symmetry determining equations. Moreover, it is straightforward to see
which follows from relation (
7), where
is the Noether conserved current (
50) and
L is the Lagrangian (
38). Alternatively, the equality (
74) can be derived indirectly by applying formula (
72) to the variational identity (
48) with
, giving
which implies
holds (up to the possible addition of a total curl).
When the relation (
74) is combined with Propositions 1 and 2, the following main result is obtained.
Theorem 2. For any DE system (
14)
admitting an adjoint-symmetry (
60)
(namely, a “nonlinearly self-adjoint system” in the general sense), the conserved current (
50)
derived from applying Noether’s theorem to the extended Euler–Lagrange system (
39)
using any given symmetry (
46)
is equivalent to the conserved current obtained using the adjoint-symmetry/symmetry formula (
72).
This theorem shows that the “nonlinear self-adjointness” method based on Ibragimov’s theorem as developed in papers [
9,
12,
14,
15,
17] for DE systems with
is just a special case of the adjoint-symmetry/symmetry formula (
72) introduced for general DE systems in prior papers References [
2,
3,
4], which were never cited. Moreover, the adjoint-symmetry/symmetry formula (
72) has the advantage that there is no need to extend the given DE system by artificially adjoining variables to get a Euler–Lagrange system.
Another major advantage of the adjoint-symmetry/symmetry formula is that it can be used to show how the resulting local conservation laws are, in general, not necessarily non-trivial and comprise only a subset of all of the non-trivial local conservation laws admitted by a given DE system. In particular, in many applications of Theorem 1, it is found that some non-trivial symmetries, particularly translation symmetries, only yield trivial conservation laws [
17,
18,
19], and that some local conservation laws are not produced even when all admitted symmetries are used. These observations turn out to have a simple explanation through the equivalence of Theorem 1 and the adjoint-symmetry/symmetry formula (
72), as explained in the next section.
Example: For the semilinear wave Equation (
63), the extended Euler–Lagrange system in Ibragimov’s notation consists of
where
is defined by the adjoint-symmetry Equation (
67) with
, and where the Lagrangian (
38) is simply
in terms of the variables
u and
v. Consider any point symmetry of the wave Equation (
76) for
u, given by a generator
Its equivalent characteristic form is
, with
satisfying the symmetry determining Equation (
66) on the space of solutions
of the wave Equation (
76). Every point symmetry can be extended to a variational symmetry (
54) admitted by the Euler–Lagrange system, which is given by the generator
where
is the adjoint of the operator
defined by relation (
41) for the point symmetry holding off of the solution space of the wave Equation (
76). In particular,
can be obtained by a straightforward computation of
, where the terms in
are simplified by using the equations
,
and
that arise from splitting the determining Equation (
66). This yields
and thus
Hence, the variational symmetry is simply
which is a point symmetry.
The action of this variational symmetry on the Lagrangian
is given by
since
. This symmetry action then can be combined with the variational identity (
55) to get the Noether relation
using
and
, where
are obtained from formula (
50). This yields a conservation law
on the solution space
of the Euler–Lagrange system
,
. Since
, this conservation law is locally equivalent to the conservation law (
52) which is given by
Moreover, from the identity (
72) relating the symmetry Equation (
66) and the adjoint-symmetry Equation (
67), the conserved current
in the conservation law (
86) is the same as the conserved current
in the adjoint-symmetry/symmetry formula
where
In Reference [
20], the conservation law formula (
85) is used to obtain a single local conservation law for a special case of the wave Equation (
63) given by
and
, corresponding to
after an equivalence transformation
is made. The formula is applied to the adjoint-symmetry
and the point symmetry
with characteristic
, which respectively correspond to
and
with
. The likely reason why the obvious translation symmetries
and
were not considered in Reference [
20] is that these symmetries lead to locally trivial conservation laws when
is used.
To illustrate the situation, consider the translation symmetries
admitted by the wave Equation (
63) for arbitrary
,
,
. The characteristic functions of these two symmetries are, respectively,
and
. Local conservation laws can be obtained by applying formula (
85), or its simpler equivalent version (
86), with
being the adjoint-symmetries classified in
Table 1. The resulting conserved currents
, modulo locally trivial currents, are shown in
Table 2.
Notice that for
the conserved currents
obtained from the two translation symmetries vanish. This implies that Ibragimov’s theorem (
85) yields just trivial conserved currents
for some cases of the wave Equation (
63) when a non-trivial conserved current exists. A full explanation of why this occurs will be given in the next section.