1. Introduction
In this paper, we consider perturbations of the Toda lattice with two different kinds of boundary conditions, namely periodic and Dirichlet (fixed ends) boundary conditions. The periodic case is the structurally fundamental one, whereas the Dirichlet case is more imporant for applications; in particular, the famous numerical experiments of Fermi, Pasta, and Ulam with the family of “FPU chains” (see [
1]), of which the Toda lattice is a special case, were performed with Dirichlet boundary conditions.
Among the entire family of FPU chains, the Toda lattice has especially strong integrability properties, which makes it possible to obtain results not only in a neighborhood of the equilibrium point, but on the entire phase. This makes it possible to rigorously prove certain stability properties also far away from the equilibrium, i.e., for high energies of the system.
The Toda lattice however should not only be considered as a member of the family of FPU chains; on the contrary, it has been investigated mostly independently from the FPU framework, since it is, as a recent review article [
2] states, “a wonderful case study in mechanics and symplectic geometry” with ramifications ranging from representation theory [
3] to numerical analysis [
4]. Moreover, the Toda lattice is a model with has numerous applications in the physical sciences, ranging from solid state physics [
5] and quantum field theory [
6] to DNA transcription [
7], to mention just a few. Some review articles on the history of the research on the Toda lattice have been published very recently [
8,
9,
10].
On the other hand, the research on general FPU chains also has its own rich history since the numerical experiments of Fermi, Pasta, and Ulam; since the present paper does not discuss general FPU chains, we refer to the literature for further references, e.g., the overview article [
11].
The goal of this paper is the proof of a Nekhoroshev-type result for the Dirichlet Toda lattice. Even though we have already obtained a result of this type for the periodic lattice, we consider the result for the lattice with Dirichlet boundary conditions to be of additional interest, since, as just mentioned, most applications of this type of chains have Dirichlet boundary conditions. Besides these two types of boundary conditions, there also exists a rich literature on the infinite Toda lattice, i.e., the lattice with an infinite number of particles without periodicity. For results on the long-time asymptotics of this kind of Toda lattices, using in particular tools from inverse scattering transform and Riemann-Hilbert theory see e.g., [
12] and the references therein. The inverse scattering transform as a tool to investigate the dynamics of nonlinear systems was originally developed for the Korteveg-de Vries (KdV) equation [
13].
Finally, we consider this result to be an interesting case study for the Nekhoroshev theorem, a major perturbation theory result for integrable systems. It is well-known that the Toda lattice with both types of boundary conditions is an integrable system (for the periodic case see [
14,
15,
16], for the Dirichlet case see [
15]). The Toda lattice is one of a small number of physical examples outside of the realm of celestial mechanics, where the prerequisites of the Nekhoroshev theorem have actually been checked. A better-known stability result for integrable systems is the KAM theorem (see e.g., [
17]), whose applicability to this system we have already shown for both types of boundary conditions [
18,
19]; however, the Nekhoroshev theorem has the advantage of being free from probabilistic elements–recall that the stability statements of the KAM theorem only hold for a majority of initial conditions.
We accomplish our task by embedding the Dirichlet lattice into a periodic lattice of a higher dimension, and the idea is to use the results on the periodic lattice already obtained. However, we cannot directly apply this previous work to the Dirichlet case, since the image of the phase space of the Dirichlet lattice under this embedding is contained in a subset of the periodic phase space not covered by the previous result. Therefore, we have to adapt some of the previous results to the new circumstances. For most of the auxiliary facts needed for our final result, they can be obtained in a similar way as before, and we do not reformulate everything explicitly in the present paper and sometimes refer to the work already done.
2. Results
To formally state our results, we first need to introduce the mathematical model of the Toda lattice with various types of boundary conditions. In the periodic case, the lattice with
N particles (
) is given by the Hamiltonian
where
is a positive parameter,
, and the boundary conditions
In the Dirichlet case, the lattice with
particles (
) is given by the Hamiltonian
where
again is a positive parameter,
, and the boundary conditions
We will show in
Section 3 how the aforementioned embedding of the model (
3) into the model (
1) can be accomplished explicitly (for a suitable
N depending on the chosen
).
The main result of this paper is the following:
Theorem 1. There exists an open and dense subset of the phase space of the Toda lattice with Dirichlet boundary conditions on which Nekhoroshev’s theorem applies to sufficiently small perturbations of the Dirichlet Toda Hamiltonian.
We describe the general set-up and give the precise formulation of Nekhoroshev’s theorem in
Appendix A, thereby closely following [
20]; see also the references given in [
21]. Nekhoroshev’s original work can be found in [
22,
23] . Its main prerequisites are the construction of action variables on the phase space and the convexity of the Hamiltonian with respect to these action variables. We accomplish the tasks of constructing action variables and proving the convexity of the Hamiltonian in Theorems 2 and 3, respectively. We proved the following Theorem in [
18]:
Theorem 2. For any fixed and , the Dirichlet Toda lattice admits a Birkhoff normal form. More precisely, there are (globally defined) canonical coordinates so that , when expressed in these coordinates, is a function of the action variables () alone, .
Technically, the main result of this paper states that the Hamiltonian is a convex function of the actions variables .
Theorem 3. In the open quadrant , the Hamiltonian introduced in Theorem 2 is a strictly convex function of the action variables . Precisely, for any compact subset and any compact interval , there exists , such thatfor any , and any . The convextiy statement (
5) shows that the prerequisites of Nekhoroshev’s general theorem (see Theorem A1 in
Appendix A) are satisfied in the present case. Therefore Theorem 3 implies that Nekhoroshev’s Theorem holds for the Dirichlet Toda lattice on
, an open and dense subset of
by Theorem 2. This then proves our main result, Theorem 1.
To prove Theorem 3, we use the Birkhoff normal form of the Dirichlet Hamiltonian
near
, also established in [
18].
Theorem 4. Near , the function introduced in Theorem 2 admits an expansion of the formwith . In particular, the Hessian of at is given by As an immediate consequence of (
7), we obtain
Corollary 1. Near , ist strictly convex for any .
Outside of
, we argue differently. As mentioned above, we embed the phase space of the Dirichlet lattice into the phase space of the periodic lattice and use an analogous convexity result for the periodic lattice via pullback to the Dirichlet lattice. The image of this embedding is the fixed point set of a certain symmetry map
S of the periodic lattice. This fixed point set is a submanifold of the entire phase space of the periodic lattice which is invariant under the evolution induced by the Hamiltonian (
1); in this way, this embedding allows us to obtain results on the Dirichlet lattice by exploiting the properties of the symmetry map
S.
Related work: As already mentioned, similar results have been obtained for the periodic lattice, see [
21], and for other type of perturbation theories, namely the KAM theorem instead of the Nekhoroshev theorem, see [
18]. The technique of expressing symmetry properties of a system in terms of action-angle variables and Birkhoff coordinates has developed for the defocusing nonlinear Schrödinger equation, see [
24]. The technique of embedding the phase space of a lattice with Dirichlet boundary conditions into the phase space of the corresponding lattice with periodic boundary conditions has been used in the case of arbitrary FPU chains, see [
25,
26]. On perturbations of the Toda lattice in general, there have been published many papers, see e.g., [
27,
28,
29], usually however applying other techniques than the theorems of KAM and Nekhoroshev which are the main theoretical tool of our approach. The entire work was originally motivated by the pioneering work on a KAM-type result for the KdV equation [
30].
Outline of the paper: In
Section 3 we describe suitable coordinates for both types of lattices and then the aforementioned embedding of the Dirichlet lattice into the periodic lattice of a higher dimension. In
Section 4, we describe some spectral quantities associated with the Toda lattice and their behavior under the symmetry map needed for the embedding described before. This symmetry map also induces a special Riemann surface needed in the sequel, see
Section 5; this allows then us to prove Theorem 3. In the appendices, certain lengthy calculations are presented, in addition to some proofs which are very similar to analogous proofs in our previous work [
21].
3. Coordinates and Symmetries
Here we first describe suitable coordinates for the Dirichlet lattice and then an embedding of the Dirichlet lattice wih particles into the periodic lattice with particles.
Coordinates for the Dirichlet lattice As a first step in the Dirichlet case, following Flaschka [
14], we introduce the (noncanonical) coordinates
The boundary conditions (
4) imply that
and
; we will identify related Casimir functions below.
In these coordinates the Hamiltonian
, given by (
3), takes the simple form
and the equations of motion are, in terms of the
’s and
’s defined by (
8),
We study the system of Equation (
10) on the
-dimensional phase space
This system is Hamiltonian with respect to a suitable nonstandard and degenerate Poisson structure
described in [
18]. The degeneracy can be described by the two Casimir functions
Let
denote the level set of
for
. Please note that we are mainly interested in the case
, i.e., the set
, because the assumption
contradicts the boundary conditions (
4). We have included the case of general
’s in order to have an even-dimensional phase space. In the sequel, we write
.
In [
18] we constructed global Birkhoff coordinates for the Dirichlet lattice. Precisely, we constructed a map
where
and
are the Casimir functions introduced by (
12). The crucial point of the map (
13) is that the coordinates
are
global Birkhoff coordinates for the Dirichlet Toda lattice, i.e., the transformed Toda Hamiltonian
is a function of the actions
and
alone.
Coordinates for the periodic lattice In the following, we describe an analogous set-up for the periodic lattice. Although the set-up looks similar to what we just introduced for the Dirichlet lattice, it is not identical to the Dirichlet case, which makes it necessary to be introduced explicitly.
The main tool for the investigation of the periodic lattice are the (noncanonical) coordinates
introduced by Flaschka [
14], analogously to (
8),
In these coordinates the Hamiltonian
, analogously to (
9), takes the simple form
and the equations of motion are, analogously to (
10),
Please note that for any , and , which means that the sequences and can be identified with the vectors and , respectively.
In [
31] we studied the normal form of the system of Equation (
16) on the phase space
and the model space
. Precisely, we constructed a map
where
,
are Casimir functions associated with the Poisson structure of the Flaschka coordinates (
14). The crucial point of the Birkhoff map (
18) is that the coordinates
are global Birkhoff coordinates for the periodic Toda lattice. Hence the Toda Hamiltonian, when expressed in these coordinates, takes the form
where the term
is a real analytic function of the action variables
,
, and where
,
are the values of the Casimirs
,
.
Note that on an open dense subset
of the phase space
, the coordinates
of the Birkhoff map (
18) are given in terms of action and angle variables
by
Symmetries of the periodic lattice Let
be endowed with the canonical symplectic structure and consider the linear maps
given by
note that
T is the standard shift operator. As already discussed by Rink [
32] for arbitrary FPU chains, the maps
T and
S satisfy the relations
Id and
. Moreover,
T and
S are symplectic maps leaving the Hamiltonian
, given by (
1), invariant. The group
(a representation of the
N-th dihedral group
) is the symmetry group of
. In the sequel, we are mainly interested in the symmetry map
S.
Denote by Fix
the fixed point set of the map
S introduced above. Then Fix
is the subset of all elements
in
satisfying
In particular, if N is even, on Fix we have and .
Note that on Fix, both the center of mass coordinate and its momentum are identically zero. Hence Fix.
On the level of the Flaschka variables
introduced in (
14), the maps
T and
S introduced in (
21) and (22) are given by
, with
(analogously for
) and
Similarly to Fix
defined above, we denote by Fix
the subset of all elements
satisfying
with the indices in (
24)–(
26) understood mod
N. In the sequel, we will omit the tilde and write
T and
S for the operators
and
on
.
We can also express the symmetry transformations
T and
S in terms of the Cartesian coordinates
given by the Birkhoff map (
18), or more suitably, in terms of the associated complex coordinates
, defined for
by
We denote by
the linear subspace of
consisting of such vectors
, and define the map
Like the map
, this new map
is a linear involution. In fact, the two maps
S and
are conjugate to each other under the Birkhoff map
given by (
18)—see [
18] for a proof of the following theorem:
Theorem 5. In terms of the complex variables defined by (27) and the Birkhoff map Φ given by (18) the map S is given by . More precisely, Embedding of the Dirichlet into the periodic lattice We now embed the phase space
of the Dirichlet lattice, with
particles, see (
11), into the phase space
of the periodic lattice with
particles, see (
17), by the map
where
The image of the map
is a subset of Fix
, as the following shows (see [
18]):
Lemma 1. Let .
- (i)
; in particular, Fix for any . Moreover, is a canonical map from to .
- (ii)
The Hamiltonians H and of the Toda lattice in Flaschka variables with periodic and Dirichlet boundary conditions, respectively, given by (15) and (9), satisfy .
To pull back the embedding
to the complex variables
defined by (
27), we consider the space
endowed with the canonical symplectic structure induced from
. Now consider the embedding
Note that
i.e.,
is a parametrization of Fix
. In other words, by (i), for any
,
is a parametrization of
. Together with the Birkhoff maps of the periodic and the Dirichlet lattice, this leaves us with the following commutative diagram of maps:
4. Spectral Quantities and Riemann Surfaces
It is known (cf. e.g., [
33]) that the periodic system (
16) can be expressed in the Lax pair form
, where
is the periodic Jacobi matrix defined by
and a skew-symmetric matrix
B. Hence the flow of
is isospectral.
Discriminant: We denote by
the
discriminant of the difference equation
associated with the matrix
L, defined by
where
and
are two standard fundamental solutions of (
36). In the sequel, we will often write
for
. Additionally, we consider the symmetric
-Jacobi matrix
whose
real eigenvalues
(with multiplicities) are the zeroes of the function
,
(see [
34] for comments on this expansion). When listed in increasing order and with their algebraic multiplicities, they fulfill the following relations (cf. [
35])
In [
18] we proved the following results on the behaviour of the
’s and the discriminant
under the symmetry
S introduced in (22):
Lemma 2. Let . Then for any , Corollary 2. For any Fix,
- (i)
for any .
- (ii)
for any .
- (iii)
and , if N is even.
Corollary 3. For any ,and hence, with ,as well as Riemann surface : Denote by
the Riemann surface obtained as the compactification of the affine curve
for
by (
38). Note that
is a two-sheeted curve with the ramification points
, identified with
in the sequel, and that
and
are spectral invariants; the Riemann surface
is obtained from
by adding two (unramified) points at infinity, namely
and
, one on each of the two sheets, i.e.,
Strictly speaking,
is a Riemann surface only in the case that the spectrum of
is simple, i.e., if the estimates in (
39) are strict. We showed in [
34] that
iff
, therefore the spectrum of
is simple iff
, where
If the spectrum of
is
not simple,
can be transformed into a Riemann surface by doubling the multiple eigenvalues—see e.g.,
Section 2 of [
36]. We will discuss this case in detail below, see
Section 5, since it is of great importance for the main task of this paper.
Canonical sheet and canonical root: For
the canonical sheet of
is given by the set of points
in
, where the special
c-root
is defined on
(with
and
) and determined by imposing the sign condition
As a consequence we have for any
Cycles on : We introduce the projection onto the -plane, i.e., and its extension to a map , where .
We now introduce the cycles and on . Denote by the cycles on the canonical sheet of (see above) in such a way that is a counterclockwise oriented closed curve in which contains in its interior the two ramification points and , whereas all other ramification points are located outside of .
Moreover, denote by pairwise disjoint cycles on defined in a way such that for any , the intersection indices with the cycles defined above with respect to the orientation on , induced by the complex structure, are . In order to be more precise, we choose the cycles in a way that (i) the projection of is a smooth and convex counterclockwise oriented curve in and that (ii) those points of whose projection by onto the -plane have a negative imaginary part are located on the canonical sheet of .
Abelian differentials: Let
and
. Then there exists a uniquely given polynomial
of degree at most
such that for any
Using the definition of the cycles
given above, we can rewrite (
50) as
The coefficients of the polynomials
can be computed explicitly, see e.g., Appendix A of [
37]. In [
18] we proved
Lemma 3. Let . Then for any real λ, Note that the Formula (63) remains valid if the assumption is weakened, as long as the -functions appearing in (63) are well-defined. We will return to this point in the next section.
On the surface
, we consider the differentials
We proved in [
21] the following lemma on
and
:
Lemma 4. The Abelian differentials and on satisfy the following properties:
- (i)
and are holomorphic differentials on except at the points f and where in the standard charts, the ’s admit an expansion of the following form - (ii)
and fulfill the normalization condtions - (iii)
When expressed in the local coordinate λ, on each of the two sheets, has an asymptotic expansion as (λ real) of the following form
On , and take the formwhere are polynomials in λ of the form and . Note that and do not depend on the parameter α. Conversely, (i) and (ii) uniquely determine and . Remark 1. As discussed in [21], the differentials and exist for any strictly increasing sequence of ’s, i.e., , even if these ’s are not the spectrum of some matrix . In this more general case, the associated Riemann surface, similarly to (44)–(46), is defined as the compactification ofi.e., The differentials and are then simply defined by the conditions (55) and (56) instead of the Formulas (53) and (54). For any
, introduce the
-periods of the differentials
and
,
In [
21], we proved the following results on the
’s, the
’s and
, the coefficient in the expansion (
57); recall from (
19) that
is the Toda Hamiltonian expressed in the Birkhoff coordinates given by (
18),
:
Proposition 1. For any and any ,where is the Toda frequency . 5. Constructions on the Fixed Point Set
We now assume that
is even, i.e.,
for some
. In addition to
, we consider the smaller set (analogously to
)
It follows from Corollary (2) (ii) that in the case
the spectrum of
is
not simple, hence
, as defined by (
44) and (
46), is
not a Riemann surface in this case, but rather a “punctured” surface (a neighborhood of the point
is a bouquet of two open disks glued together at one common point, hence the surface does not have a manifold strucutre at this point—see [
36]). We can overcome this difficulty by the following construction.
Instead of
as defined by (
45), we consider
and the surface
Note that unlike in the definition of
, see (
45),
. For
, the surface
is isomorphic to the regular Riemann surface
obtained from the punctured surface
by doubling the point
, as shown in [
36] for the analogous case of finite gap potentials of the KdV equation. In the sequel, we will thus consider
instead of
. Note that the genus of
equals the genus of
minus one.
The cycles
and
, defined in
Section 4 for
, are defined on
as well, as long as
. In the case
, the curve on
corresponding to
does not contain any ramification point in its interior, and the curve on
corresponding to
is no longer closed . On
, we thus consider the
curves
and
.
On
, the functions
, defined by (
50), exist as well, as long as
, since there existence only depends on the cycles
. Lemma 3 now takes the form
Lemma 5. Let . Then for any real λ, The proof of Lemma 5 works completely along the lines of the proof of Lemma 3, which we have given in [
18] and which is based on Lemma 2 (which does not assume
), and we then use
. We will use (63) in
Appendix B in the proof of Lemma 6 below.
Moreover, we again consider the differentials
and
defined by (
53) and (54), now on
; for clarity, we momentarily denote these differentials on the new surface
by
They have the same properties as described by Lemma 4 for the original case
, except that the identities (
56) are only valid for
and
has to be replaced by
. That these properties continue to hold in this new case
follows from the fact that the surfaces
and
differ only in the fact that the point
has been doubled in the construction of
, and the curves
for
do not touch this doubled point.
Remark 1 also holds in this case: The Riemann surface
and the differentials
and
exist for any sequence of
’s with
, even if this sequence is
not the spectrum of
for some
. Namely,
is in such a case defined by, similarly to (
58), as the compactification of the affine curve
i.e., as
The differentials
and
are then
defined by the properties (
55) and (
56), where it is understood that (
56) only holds for
. For clarity, we restate this definition in
Appendix C, see Lemma A1. If the sequence
corresponds to some
, one shows that the differentials defined by (
64) and (65) have the properties (
55) and (
56), and in the general case the existence and uniqueness of differentials with these properties is established by general Riemann surface theory. All these facts can be checked exactly as in the case discussed in the previous section, and in our previous work [
21].
For
, we consider as in the original case (
59) the
-periods of
and
, with the integrals now taken on the new surface
, defined by (
67),
On these
’s and
’s in the case
, we have the following lemma, which we prove in
Appendix B.
Lemma 6. Let . Then for any ,for any . Note that we only claim the statements (
69) and (70) to be true in the case of a Riemann surface
originating from a sequence of
’s being the spectrum of
for some
and not in the case of an arbitrary surface
; but we only need the formulas in the former (more special) case, see the proof of Theorem 6.
We now define the extended period map
on the set
as follows
where
is the coefficient in the asymptotic expansion (
57). In the construction (
71),
is extended to
. To this auxiliary
, the associated surface
is constructed via (
66) and (
67), on which the differentials
and
are given, and then the quantities
and
by (
68). It is straightforward to see that
is a smooth map with values in
.
Next we define the map
where
is the first part of the spectrum of the matrix
for some
which is determined by the Birkhoff map
(cf. (
18)),
Note that (
73) ensures that
and therefore
. It can be seen analogously as in the case
explained in [
21] that
is a smooth embedding.
Proposition 2. For any , Proof. Recall from Proposition 1 that in the case
, we have
and
by definition. In our case
, these formulas continue to hold for the
’s and
’s, as long as
, and their proof is completely analogous to the proof of the former case discussed in our previous work [
21]. We will explicitly show the case of the
’s in the proof of Lemma 6 in
Appendix B. From the Formula (
75) and the definition (
71) of the map
, the claimed statement (
74) immediately follows. □
The composition
is therefore given by
The differential
is a
-matrix of the form
where
denotes the
-matrix with all entries 0. We now need the following version of Krichever’s theorem [
21,
38] on the map
. Note that the following theorem is only concerned with the behaviour of
on Im
, the image of the map
as defined by (
72).
Theorem 6. Restricted to , the map is a local embedding, i.e., the differentialof at λ is a linear embedding. Proof. Let
, and assume that Theorem 6 does not hold, i.e., that
is not a local embedding. Then there exists a smooth 1-parameter family
with
and
for any
, so that
but
We extend the family
with
components to a family
with
components by
By this construction, it is ensured that the family
obtained in this way is the full spectrum of a matrix
for any
, since any element
is by definition the first part of the spectrum of a matrix
for some
. In the sequel, by
we mean the
-component family (
80) whose first
components are given by (
77) and whose remaining
components by (
81).
Then, by Lemma 6, (
79) remains satisfied, but for the
component vectors
and analogously for the
’s. i.e., we have
and analogously for the
’s. Moreover, we have
, since (as we showed in [
21]),
, and as we discussed in [
18],
. By the conditions (
81) however, the last sum equals zero. Alltogether, we obtain
Lemma 7. From the estimates (82) it follows that The proof of Theorem 6 is completed once we have proved Lemma 7, since the conclusion (
83) obviously contradicts the assumption (
78). The proof of Lemma 7 is contained in
Appendix C; it completely follows the lines of a similar proof in our previous work [
21]. □
Since both
and
are embeddings, the same holds for the composition
. Hence the rank of the differential
, given by (
76), has to be maximal, i.e.,
. Therefore, by the structure of the matrix (
76), the rank of the
-matrix
has to be
. We have thus proved the following result on the Hamiltonian of the
N-particle periodic lattice with respect to the first
action variables in the case
:
Proposition 3. Let . Then the matrixis regular. We now prove our main result, Theorem 3. The main ingredients are Proposition 3 and the embedding of the
-particle Dirichlet lattice into the
-particle periodic lattice described in
Section 3.
Proof of Theorem 3. By Proposition 3, the
-matrix (
84) is regular for any
. Taking the pullback with respect to the embedding
given by (
32), we have from the commutative diagram (34) the identity
and it therefore follows from Proposition 3 that the matrix
is regular as well (note that the property of being a nondegenerate matrix is invariant under coordinate transformations). In other words, the frequency map
is nondegenerate on all of
. Together with the convexity of
at
, an immediate consequence of Corollary 1, this implies that
is convex on all of
. This proves Theorem 3. □