1. Hamilton–Jacobi–Bellman Equations
Hamilton–Jacobi–Bellman (HJB) equations are a fundamental tool of Optimal Control theory, more precisely, of “Dynamical Programming”, and were created in the 1950s by R. Bellman and collaborators for the needs of aerospace engineering. Although problems of classical calculus of variations can be solved using it, the impact of the HJB equations never stopped extending far beyond their original motivations. In stochastic Optimal Control [
1], they also allow control of Markovian diffusion processes in the form of nonlinear partial differential equations of second-order (in space) for a scalar field
S on
:
where
H is called a second-order (SO) Hamiltonian, analogous to the Hamilton–Jacobi equation of classical mechanics. In Equation (
1), the presence of a Hessian operator in
H is due to the infinitesimal generator of underlying diffusion processes as a consequence of Itô’s correction. On the other hand, HJB equations have become essential in recent developments of the mathematics of, for instance, deep learning [
2] and geometric studies of hydrodynamical interpretation of quantum mechanics [
3].
Here, we are not going to consider difficulties associated with the fact that solutions of HJB (the “value functions”) are generally too irregular to be interpreted in a classical sense, or on those resulting from the practical need to solve very high-dimensional versions of such PDEs. Instead, we shall summarize a recent work answering the following natural questions about Equation (
1):
If Equation (
1) is a kind of deformation of the classical Hamilton–Jacobi equation, what are the relevant stochastic Lagrangian and Hamiltonian mechanics? Additionally, what are the latent geometrical structures?
Our guide to achieve these goals is a program of stochastic deformation of classical mechanics founded on an old idea of E. Schrödinger (often called these days, “Schrödinger’s problem” [
4,
5]). In substance, this is a statistical physics analogue of quantum mechanics, regarded as a stochastic deformation of classical Optimal Transport. The associated solution processes are called Bernstein’s reciprocal processes [
6,
7] and enjoy a special version of time-reversibility despite the fact that they are generally inhomogeneous. This aspect of the theory will not be elaborated here.
Instead of traditional tools of stochastic analysis on manifolds, founded by Itô, Malliavin, etc., we shall adapt a less familiar approach, due to L. Schwartz and P.-A. Meyer, called stochastic (or second-order) differential geometry [
8]. This way to deform classical geometric structures into others, compatible with the stochastic nature of Brownian randomness, can be regarded as a probabilistic counterpart of the quantization procedure.
2. Second-Order Differential Geometry
The first question to ask about Equation (
1) is: In what sense is the “Hamiltonian”, say
H, a natural deformation of a Hamiltonian
in classical mechanics? The first step [
8] is to define second-order versions of tangent and cotangent spaces of a smooth manifold
M.
A
second-order (SO) tangent vector A at a given point
by
for coefficients
,
such that
forms a symmetric
-tensor and the expression on the right-hand side is invariant under changes of coordinates. In general,
is not a vector, which can be seen from changes of coordinates. The
second-order tangent space to
M at
q is the set of all SO tangent vectors at
q. The second-order tangent bundle is then
. Clearly,
as a subbundle. A smooth field of second-order tangent vectors, i.e, a smooth section of
, is called a second-order vector field.
According to Schwartz and Meyer, any geometric statement for such a second-order tangent vector has a probabilistic content. To see this, we consider the following Itô stochastic differential equations (SDEs) on
:
Its associated generator is given by
, which is a typical example of SO vector fields. In general, the generators of diffusion processes are called a second-order elliptic vector field due to the positive semi-definiteness of the coefficients of second-order derivatives. For a diffusion process
X, defined on a probability space
and adapted to a
nondecreasing filtration
, the coefficients of its generator can be characterized by
This last relation encapsulates the second-order statistical information about all trajectories
. The pair
is a process taking values in
and called the
mean derivatives of
X. When
X has differentiable trajectories,
reduces to a classical time derivative and
to 0. For the Itô SDE (
3) on
, its mean derivatives are given by
and
. However, for a general diffusion
X valued on a manifold
M,
does not transform as a vector field, which can be verified through applying Itô’s formula for changes of coordinates. In order to overcome this problem, we equip
M with a linear connection ∇ and use it to compensate as a correction term resulting from Itô’s formula. That is, we define the following ∇-dependent mean derivative, in terms of Christoffel’s symbols,
Then, does transform as a vector field.
Inspired by mean derivatives, we denote the canonical coordinates on
by
and define their action on
A in (
2) as follows:
The objects dual to SO tangent vectors are
second-order cotangent vectors, whose general form is:
where
forms a covector and
is symmetric in
. The pairing of the above
with SO tangent vector
A in (
2) is given by
The SO cotangent bundle, i.e., the set of all SO cotangent vectors on
M, is represented by
. The canonical coordinates on it are denoted by
and are defined, when acting on
in (
5), as follows:
There are two basic examples of second-order forms, say,
and
, where
f and
g are given smooth functions on
M. They are defined as follows: for
,
where
d is the classical exterior differential; the operator
is called a second-order differential; the dot operator · is called a symmetric product; and
is usually called a “carré du champ” operator. By construction, the restriction of any SO form to
, the classical cotangent fibre bundle, is a classical form.
3. Stochastic Hamiltonian Mechanics
In classical mechanics, the canonical symplectic structure on the cotangent bundle plays a substantial role in Hamiltonian mechanics. The symplectic 1-form is given by
, also known as Poincaré’s relative integral invariant [
9]. Now, the second-order version of the Poincaré 1-form [
10] is given, according to (
5) and (
6), by
as a second-order form on the phase space
. Analogous to the classical symplectic 2-form
, one obtains the second-order version involving an extra set of coordinates
:
Associated with a SO Hamiltonian function
, the SO Hamiltonian vector field
on
is defined by
namely, in local coordinates,
where the coefficients
are smooth functions satisfying
The stochastic Hamilton equations associated with
are given, in local coordinates, by
The solution is of the form for X as an M-valued process and as a time-dependent SO form.
Notice that the last three equations describe fundamental second-order additions to deterministic Hamiltonian equations. However, the mean derivatives
D are the only regularization needed in the first two equations. Qualitatively, since
p and
o are functions of
, the last two equations can be simplified by applying
to the second and fourth equations, assuming that (the distribution of)
has full support for all
t. It follows that
the second equality being the Maxwell relations for thermodynamics [
11]. We refer to (
8) as an integrability condition of (
7).
Similar to classical mechanics, when the SO Hamiltonian
H depends explicitly on time, one extends the phase space to be
and endows it with the second-order analogue of the Poincaré-Cartan form:
Canonical transformations are changes of coordinates in the extended phase space
from
to
that leave the stochastic Hamilton Equation (
7) invariant, or equivalently, leave the canonical form (
9) invariant up to an exact second-order differential:
where
is the new SO Hamiltonian after transformation, and
is the total differential of first-order in time and second-order in space. This implies that the generating function of the canonical transformation
satisfies
The Hamilton–Jacobi–Bellman (HJB) equation can be introduced by formally letting the new Hamiltonian
K vanish. In this case, we write the generating function
G as
S. Using (
11), we can write the HJB equation, as the general version of (
1) on manifolds:
4. Stochastic Hamiltonian and Lagrangian Mechanics on Riemannian Manifolds
If
is a Riemannian manifold with Levi–Civita connection ∇, one can produce a class of SO Hamiltonians
by deforming a classical one
in a canonical way; that is,
where
ℏ is a positive constant (our deformation parameter). Then, system (
7) reduces to the following stochastic Hamilton equations on
:
subject to
, where
is the damped mean covariant derivative with respect to
X, and
is the Laplace-de Rham operator on forms.
Such a Hamiltonian formulation can also been transformed into a Lagrangian formulation by the Legendre transform. Recall that the Legendre transform is a change of variables
given by
. If the Legendre transform is a diffeomorphism (in which case,
H is called hyperregular), a Lagrangian function can be produced from
H; that is,
In this way, the stochastic Hamilton equations (
13) are equivalent to the stochastic Euler–Lagrange equation,
which results from the stochastic Hamilton’s stationary-action principle
for the following action functional:
where the variation
is taken over all diffusions
X on
M over time interval
, satisfying
, and with given endpoint marginal distributions
and
.
Consider a time-dependent Hamiltonian
. On Riemannian manifolds, canonical transformations of the last section can also be reduced to the tangent bundle. First, we observe that by the Legendre transform (
14), the definition (
4) of
, and the integrability condition (
8), the action functional (
15) can be rewritten as
where
denotes the Stratonovich stochastic differential. Now, we make a change of coordinates on
from
to
and denote the SO Hamiltonian by
and its classical part by
.
As in the previous section, the general condition for a transformation to be canonical is to preserve the form of a stochastic Hamilton system (
13). This is equivalent to preserving the form of the stochastic stationary++action principle of (
15). It follows that
Since the underlying process
X has zero variation at the endpoints, both equalities will be satisfied if the integrands are related by the following SDE:
In contrast with classical theory of canonical transformations and also (
10), which are described by equations for forms, Equation (
16) is understood as a stochastic differential equation. However, as in classical theory [
9], here we can also have all four types of generating functions for (
16) that are related to each other through classical Legendre transforms. Indeed, canonical transformations here are processed on cotangent bundles, which means they are a special case of (
10) where the canonical transformations on SO cotangent bundles are induced by classical ones. We take the type-one generating function
. Using Itô’s formula,
, and vanishing the coefficients of every (stochastic) differential,
,
, and
in (
16), we get
which partially recovers (
11). By requiring the new Hamiltonian
to be identically zero and writing
as
S, the last equation turns into the following Hamilton–Jacobi–Bellman equations:
where
are regarded as coordinates on the product manifold of the two manifolds before and after transformation and are equipped with the direct-sum Riemannian metric and Levi–Civita connection. Clearly, Equation (
17) can be interpreted as
ℏ-deformation of the classical Hamilton–Jacobi equation:
Type-two generating functions are also useful. Let
. In the same way as type one, we can get
As an example, we consider the Hamiltonian
. We take
. Then
,
and
. Thus, the new Hamiltonian is
. To make
be the standard form
, we only need to assume that
S and
V solve the following HJB equation:
A key observation is that
is a
-martingale but
is not. A stochastic Noether’s theorem in [
10] shows that such a martingale is always associated with symmetry of an HJB equation.
5. Relations with Stochastic Deformation and Schrödinger’s Problem
The last observation had already been made long ago in the research program of stochastic deformation (cf. [
12] and references therein) from a completely different perspective, namely the analogy between Schrödinger’s problem and quantum mechanics.
We are going to specialize our analysis to the HJB Equation (
12) on Euclidean space
with the SO Hamiltonian given by the
ℏ-deformation of the classical Hamiltonian
:
namely, Equation (
19) for a given final boundary condition
, where
is a bounded (for simplicity) scalar potential. Notice the opposite sign of the potential with respect to classical Hamiltonians of such elementary systems. This is expected when well-defined measures are associated with (
18), as in the (“Euclidean”) quantization procedure. The left-hand side of the second equation of (
7) means
. Let us introduce a positive solution
of the retrograde (or backward) heat equation:
with nonnegative final boundary condition
. Now define
, solving HJB Equation (
19). If
, take ∇ of Equation (
19) and use the integrability condition (
8). The result agrees with our second equation of (
7). Therefore, the first and third ones characterize a Bernstein’s reciprocal diffusion
X:
On the other hand, the Lagrangian associated with
is
. The Benamou–Brenier formula for Schrödinger’s problem, from the Optimal Transport perspective [
4], shows that minimizing the action functional (
15) is equivalent to minimizing the following
relative entropy:
over all probability measures
on the path space
, such that
are the initial and final time marginal distributions of
, i.e.,
and
. Here,
, called the reference measure, is the distribution of a reversible diffusion (in Kolmogorov’s sense) with generator
.
As explained in [
12], the quantum “expectation” in state
of the Hamiltonian operator
, the quantization of
in Equation (
18), is
where
is in the domain
of
dense in
, and
is interpreted as a (Born) probability density. Now consider
solving the retrograde heat Equation (
21), i.e., after a change of the variable
to the Schrödinger equation of
. Using
, the random variable playing the role of
should be (minus of):
namely, our
in (
20). This is why Schwartz–Meyer second-order differential geometry can be regarded as a kind of (Euclidean) quantization method. We have only summarized here the forward geometric part of our construction. The role of
is played by positive solutions
of a Cauchy problem for the usual heat equation (with the same
, which is self-adjoint). Then, any well-defined expectation with respect to Bernstein’s reciprocal diffusion
X is computed using the fundamental aspect of Schrödinger’s analogy:
Associated with
, there is a dual formulation of our results involving a
nonincreasing filtration
. In particular, there is another (Cauchy) problem of HJB, adjoint to Equation (
19):
In classical mechanics, it is known (but often forgotten) that the coexistence of two adjoint Hamilton–Jacobi equations, in a given Hamiltonian system, is closely related with the regularity of the trajectories. Our two adjoint HJB equations play the same role for the trajectories
of Bernstein’s reciprocal processes solving Schrödinger’s problem, cf. [
12].
The analogy of the
complex conjugate is Schrödinger’s version of time-reversal involved in (
22). Consequently, although typically time inhomogeneous, the resulting diffusions are invariant under this time-reversal.
The SO Poincaré-Cartan form allows formulation of a global stochastic Euler–Lagrange equation compatible with our Hamiltonian ones and then a global Noether’s theorem, which is a more general perspective than Schrödinger’s original problem [
10]. All these results are founded on HJB equations, which are regarded as SO deformations of classical Hamilton–Jacobi equations.