Mean Field Game with Delay: A Toy Model

Fouque, Jean-Pierre; Zhang, Zhaoyu

doi:10.3390/risks6030090

Open AccessArticle

Mean Field Game with Delay: A Toy Model

by

Jean-Pierre Fouque

^*,†

and

Zhaoyu Zhang

Department of Statistics & Applied Probability, University of California, Santa Barbara, CA 93106-3110, USA

^*

Author to whom correspondence should be addressed.

^†

Work supported by NSF grants DMS-1409434 and DMS-1814091.

Risks 2018, 6(3), 90; https://doi.org/10.3390/risks6030090

Submission received: 12 July 2018 / Revised: 17 August 2018 / Accepted: 20 August 2018 / Published: 1 September 2018

(This article belongs to the Special Issue Systemic Risk in Finance and Insurance)

Download Versions Notes

Abstract

:

We study a toy model of linear-quadratic mean field game with delay. We “lift” the delayed dynamic into an infinite dimensional space, and recast the mean field game system which is made of a forward Kolmogorov equation and a backward Hamilton-Jacobi-Bellman equation. We identify the corresponding master equation. A solution to this master equation is computed, and we show that it provides an approximation to a Nash equilibrium of the finite player game.

Keywords:

inter-bank borrowing and lending; stochastic game with delay; Nash equilibrium; master equation

MSC:

91A15; 91G80; 60G99

1. Introduction

A linear quadratic stochastic game model of inter-bank borrowing and lending was proposed in (Carmona et al. 2015). In this model, each individual bank tries to minimize its costs by controlling its rate of borrowing or lending to a central bank with no obligation to pay back its loan. The finding is that, in equilibrium, the central bank acts as a clearing house providing liquidity, and hence stability is enhanced. This model was extended in (Carmona et al. 2018), where a delay in the controls was introduced. The financial motivation is that banks are responsible for the past borrowing or lending, and need to make a repayment after a fixed time (the delay). In this model, the dynamics of the log-monetary reserves of the banks are described by stochastic delayed differential equations (SDDE). A closed-loop Nash equilibrium is identified by formulating the original SDDE in an infinite dimensional space formed by the state and the past of the control, and by solving the corresponding infinite dimensional Hamilton-Jacobi-Bellman (HJB) equation. For general stochastic equations and control theory in infinite dimension, we refer to (Bensoussan et al. 2007; Fabbri et al. 2017; Da Prato and Zabczyk 2008).

In this paper, we study the mean field game (MFG) corresponding to the model proposed in (Carmona et al. 2018) as the number of banks goes to infinity. We identify the mean field game system, which is a system of coupled partial differential equations (PDEs). The forward Kolmogorov equation describes the dynamics of the joint law of current state and past control, and the backward HJB equation describes the evolution of the value function. Recently, J.-M. Lasry and P.-L. Lions introduced the concept of “master equation” which contains all the information about the MFG. The well-posedness of this master equation in presence of a common noise and convergence of the N-player system is analyzed in (Cardaliaguet et al. 2015) by a PDE approach. A probabilistic approach is proposed in (Carmona and Delarue 2014; Chassagneux et al. 2014). See also the two-volume book (Carmona and Delarue 2018) for a complete account of this approach.

In this paper, the master equation for our delayed mean field game is derived, a solution is given explicitly, and we show that it is the limit of the closed-loop Nash equilibrium of the N-player game system as

N \to \infty

.

The paper is organized as follows. In Section 2, we briefly review the stochastic game model with delay presented in (Carmona et al. 2018). Then, in Section 3, we construct the corresponding mean field game system. In Section 4, we define derivatives with respect to probability measures in the space

P (H)

where

H

is the Hilbert space defined at the beginning of Section 2.2. In addition, we derive the master equation, and exhibit an explicit solution. Furthermore, in Section 5, we show that this solution of the master equation is an approximation of order

1 / N

to the solution of the finite-player Nash system. Lastly, in Section 6, we compare the solution of the Nash system, the solution of the mean field game system, and the solution to the master equation.

2. A Differential Game with Delay

2.1. The Model

Let

(X_{t}^{i}, i = 1, \dots, N)

represents the log-monetary reserves of the N banks at time t. At each time t, bank i controls its rate of borrowing or lending

α_{t}^{i}

, and it also needs to make a repayment after a fixed time

τ

such that

0 \leq τ \leq T

, at a rate denoted by

α_{t - τ}^{i}

. The dynamic of log-monetary reserves for each bank is given by

d X_{t}^{i} = (α_{t}^{i} - α_{t - τ}^{i}) d t + σ d W_{t}^{i},

(1)

with deterministic initial conditions

X_{0}^{i} = ξ^{i}, and α_{s}^{i} = ϕ^{i} (s) for s \in [- τ, 0],

(2)

where

W_{t}^{i}, i = 1, \dots, N

are independent standard Brownian motions, and banks have the same volatility

σ > 0

.

Bank i interacts with other banks by choosing its own strategy in order to minimize its cost functional

J^{i} (α^{i}, α^{- i})

, which involves the average of log-monetary reserves of all the other banks. The notation

α^{- i}

is a

(N - 1)

tuple of the

α^{j}

with

j \neq i

and

j \in {1, \dots, N}

, which represents all other banks’ control except bank i. The cost functional for bank

i \in {1, \dots, N}

is given by:

J^{i} (α^{i}, α^{- i}) = E [\int_{0}^{T} f_{i} (X_{t}, α_{t}^{i}) d t + g_{i} (X_{T})],

(3)

where the running and terminal cost functions f and g are:

\begin{matrix} f_{i} (x, α^{i}) = \frac{1}{2} {(α^{i})}^{2} + \frac{ϵ}{2} {(\bar{x} - x^{i})}^{2}, with \bar{x} : = \frac{1}{N} \sum_{k = 1}^{N} x^{k}, and ϵ > 0, \\ g_{i} (x) = \frac{c}{2} {(\bar{x} - x^{i})}^{2}, c \geq 0 . \end{matrix}

(4)

2.2. Construction of a Nash Equilibrium

In order to apply the dynamic programming principle to identify a closed-loop Nash equilibrium, we have to enlarge the state space by including the path of past controls, which lie in

H : = L^{2} ([- τ, 0]; R)

, the Hilbert space of square integrable real functions defined on

[- τ, 0]

, and write an infinite dimensional representation for our system. This evolution equation approach was initiated in (Vinter and Kwong 1981) under a deterministic control setting, and later was generalized in (Gozzi and Marinelli 2006) to a stochastic control problem.

Given

z \in R \times H

,

z_{0} \in R

, and

z_{1} \in H

will denote the two components of the product space

R \times H

. The inner product on

R \times H

will be denoted by

〈 \cdot, \cdot 〉

, and it is defined by

〈 z, \tilde{z} 〉 = z_{0} {\tilde{z}}_{0} + \int_{- τ}^{0} z_{1} (s) {\tilde{z}}_{1} (s) d s .

(5)

Therefore, the new state is denoted by

Z_{t}^{i} = (Z_{0, t}^{i}, Z_{1, t}^{i} (s)), s \in [- τ, 0]

, which corresponds to

(X_{t}^{i}, α_{t - τ - s}^{i})

in the notation of the original system (1).

Bank i tries to minimize its cost functional

J^{i} (α^{i}, α^{- i})

defined by

J^{i} (t, z, α^{i}, α^{- i}) = E [\int_{t}^{T} f_{i} (Z_{0, s}, α_{s}^{i}) d s + g_{i} (Z_{0, T}) | Z_{t} = z] .

(6)

After all other players

j \neq i

have chosen their optimal strategies which minimize their cost functionals, player i’s value function

V^{i} (t, z)

is defined by

V^{i} (t, z) = inf_{α^{i}} J^{i} (t, z, α^{i}, α^{- i}) .

By dynamic programming principle, the value function

V^{i} (t, z)

must satisfy the following infinite dimensional HJB equation (see Fabbri et al. 2017 Chapter 2 for details):

\partial_{t} V^{i} (t, z) + \frac{1}{2} T r (G^{*} G \partial_{z z} V^{i} (t, z)) + \sum_{k = 1}^{N} 〈 A z^{k}, \partial_{z^{k}} V^{i} (t, z) 〉 + {inf}_{α^{i}} [\sum_{k = 1}^{N} 〈 B α^{k}, \partial_{z^{k}} V^{i} (t, z) 〉 + f_{i} (z_{0}, α^{i})] = 0,

(7)

with terminal condition

V^{i} (T, z) = \frac{c}{2} {({\bar{z}}_{0} - z_{0}^{i})}^{2}

, where the operator

A : D (A) \subset R \times H \to R \times H

is defined as

A : (z_{0}, z_{1} (s)) \to (z_{1} (0), - \frac{d z_{1} (s)}{d s}) a . e ., s \in [- τ, 0],

and its domain is

D (A) = {(z_{0}, z_{1} (\cdot)) \in R \times H : z_{1} (\cdot) \in W^{1, 2} ([- τ, 0]; R), z_{1} (- τ) = 0}

.

The adjoint of A is

A^{*} : D (A^{*}) \subset R \times H \to R \times H

and is defined by

A^{*} : (z_{0}, z_{1} (s)) \to (0, \frac{d z_{1} (s)}{d s}) a . e ., s \in [- τ, 0],

with domain

D (A^{*}) = {(z_{0}, z_{1} (\cdot)) \in R \times H : z_{1} (\cdot) \in W^{1, 2} ([- τ, 0]; R), z_{0} = z_{1} (0)}

.

The operator

B : R \to R \times H

is defined by

B : u \to (u, - δ_{- τ} (s) u), s \in [- τ, 0],

where

δ_{- τ} (\cdot)

is the Dirac measure at

- τ

.

The adjoint of B is

B^{*} : R \times H \to R

given by

B^{*} : (z_{0}, z_{1} (s)) \to z_{0} - z_{1} (- τ) .

The operator

G : R^{N} \to R^{N} \times H^{N}

is defined by

G : z_{0} \to (σ z_{0}, 0) .

The infinite dimensional representation of the original system (1) is given by

\begin{matrix} d Z_{t}^{i} = (A Z_{t}^{i} + B α_{t}^{i}) d t + G d W_{t}, 0 \leq t \leq T, \\ Z_{0}^{i} = (ξ^{i}, ϕ^{i} (s)) \in H . \end{matrix}

(8)

By minimizing the Hamiltonian in (7), the infimum can be computed, so that the optimal control is attained at

{\hat{α}}^{i} = - 〈 B, \partial_{z^{i}} V^{i} 〉 = - (\partial_{z_{0}^{i}} V^{i} - [\partial_{z_{1}^{i}} V^{i}] (- τ)) .

(9)

Assuming that each player follows its own optimal strategy

{({\hat{α}}^{i})}_{1 \leq i \leq N}

, which forms a Nash equilibrium, the corresponding value function follows the HJB equation

\partial_{t} V^{i} + \frac{1}{2} T r (G^{*} G \partial_{z z} V^{i}) + \sum_{k = 1}^{N} 〈 A z^{k}, \partial_{z^{k}} V^{i} 〉 - \sum_{k \neq i} (B^{*} \partial_{z^{k}} V^{i}) \cdot (B^{*} \partial_{z^{k}} V^{k}) - \frac{1}{2} {(B^{*} \partial_{z^{i}} V^{i})}^{2} + \frac{ϵ}{2} {({\bar{z}}_{0} - z_{0}^{i})}^{2} = 0 .

(10)

After applying the definitions of the operators

A, B

and Q, the HJB equation for player i becomes:

\begin{matrix} \partial_{t} V^{i} + \sum_{k = 1}^{N} \frac{1}{2} σ^{2} \partial_{z_{0}^{k} z_{0}^{k}} V^{i} + \sum_{k = 1}^{N} \int_{- τ}^{0} z_{1}^{k} \frac{d}{d s} (\partial_{z_{1}^{k}} V^{i}) d s \\ - \sum_{k \neq i}^{N} (\partial_{z_{0}^{k}} V^{k} - [\partial_{z_{1}^{k}} V^{k}] (- τ)) (\partial_{z_{0}^{k}} V^{i} - [\partial_{z_{1}^{k}} V^{i}] (- τ)) \\ - \frac{1}{2} {(\partial_{z_{0}^{i}} V^{i} - [\partial_{z_{1}^{i}} V^{i}] (- τ))}^{2} + \frac{ϵ}{2} {({\bar{z}}_{0} - z_{0}^{i})}^{2} = 0 . \end{matrix}

(11)

As shown in (Carmona et al. 2018), a solution of the system (11) can be found in the form

\begin{matrix} V^{i} (t, z) = E_{0} (t) {({\bar{z}}_{0} - z_{0}^{i})}^{2} - 2 ({\bar{z}}_{0} - z_{0}^{i}) \int_{- τ}^{0} E_{1} (t, - τ - s) ({\bar{z}}_{1} - z_{1}^{i}) d s \\ + \int_{- τ}^{0} \int_{- τ}^{0} E_{2} (t, - τ - s, - τ - r) ({\bar{z}}_{1} - z_{1}^{i}) ({\bar{z}}_{1} - z_{1}^{i}) d s d r + E_{3} (t), \end{matrix}

(12)

for some deterministic functions

E_{0} (t)

,

E_{1} (t, s)

,

E_{2} (t, s, r)

, and

E_{3} (t)

satisfying the following PDEs

\begin{matrix} \frac{d E_{0} (t)}{d t} + 2 (\frac{1}{N^{2}} - 1) {(E_{0} (t) + E_{1} (t, 0))}^{2} + \frac{ϵ}{2} = 0, \\ \frac{\partial E_{1} (t, s)}{\partial t} - \frac{\partial E_{1} (t, s)}{\partial s} + 2 (\frac{1}{N^{2}} - 1) (E_{0} (t) + E_{1} (t, 0)) (E_{1} (t, s) + E_{2} (t, s, 0)) = 0, \\ \frac{\partial E_{2} (t, s, r)}{\partial t} - \frac{\partial E_{2} (t, s, r)}{s} - \frac{\partial E_{2} (t, s, r)}{r} \\ + 2 (\frac{1}{N^{2}} - 1) (E_{1} (t, s) + E_{2} (t, s, 0)) (E_{1} (t, r) + E_{2} (t, r, 0)) = 0, \\ \frac{d E_{3} (t)}{d t} + (1 - \frac{1}{N}) σ^{2} E_{0} (t) = 0, \end{matrix}

(13)

with boundary conditions:

\forall t \in [0, T]

and

\forall s, r \in [- τ, 0]

,

\begin{matrix} E_{0} (T) = \frac{c}{2}, E_{1} (T, s) = 0, E_{2} (T, s, r) = 0, E_{2} (t, s, r) = E_{2} (t, r, s), \\ E_{1} (t, - τ) = - E_{0} (t), E_{2} (t, s, - τ) = - E_{1} (t, s), E_{3} (T) = 0 . \end{matrix}

(14)

This set of PDEs (13) with boundary conditions (14) admits a unique solution as shown in (Vinter and Kwong 1981), and the optimal strategies take the integral form

{\hat{α}}_{t}^{i} = 2 (1 - \frac{1}{N}) [(E_{1} (t, 0) + E_{0} (t)) ({\bar{z}}_{0} - z_{0}^{i}) - \int_{- τ}^{0} (E_{2} (t, - τ - s, 0) + E_{1} (t, - τ - s)) ({\bar{z}}_{1} - z_{1}^{i}) d s] .

(15)

3. The Mean Field Game System

The mean field game theory describes the structure of a game with infinite many indistinguishable players. All players are rational, i.e., each player tries to minimize their cost against the mass of other players. This assumption implies that the running cost and terminal cost in (4) only depend on i-th player’s state

z_{0}^{i}

and the empirical distribution of

{(z_{0}^{j})}_{j \neq i}

. Denoting this empirical distribution by

μ_{0}^{i} = \frac{1}{N - 1} \sum_{j \neq i} δ_{z_{0}^{j}},

these costs, as in (4), can be re-written as

\begin{matrix} f_{i} (z_{0}, α^{i}) & = \frac{1}{2} {(α^{i})}^{2} + \frac{ϵ}{2} {({\bar{z}}_{0} - z_{0}^{i})}^{2} \\ = \frac{1}{2} {(α^{i})}^{2} + \frac{ϵ}{2} {(1 - \frac{1}{N})}^{2} {(\int_{R} y_{0} d μ_{0}^{i} (y_{0}) - z_{0}^{i})}^{2} : = f (z_{0}^{i}, μ_{0}^{i}, α^{i}), \\ g_{i} (z_{0}) & = \frac{c}{2} {(1 - \frac{1}{N})}^{2} {(\int_{R} y_{0} d μ_{0}^{i} (y_{0}) - z_{0}^{i})}^{2} : = g (z_{0}^{i}, μ_{0}^{i}) . \end{matrix}

(16)

As the number N of players goes to ∞, the joint empirical distribution of the states and past controls

Z_{t}^{j} = (Z_{0, t}^{j}, Z_{1, t}^{j})

ν_{t}^{i} : = \frac{1}{N - 1} \sum_{j \neq i} δ_{(Z_{0, t}^{j}, Z_{1, t}^{j})},

with marginals

μ_{0, t}^{i} = \frac{1}{N - 1} \sum_{j \neq i} δ_{Z_{0, t}^{j}}, μ_{1, t}^{i} = \frac{1}{N - 1} \sum_{j \neq i} δ_{Z_{1, t}^{j}},

converges to a deterministic limit denoted by

ν (t)

(with marginals denoted by

μ_{0} (t)

and

μ_{1} (t)

). Here, we assume that, at time 0,

ν_{0}^{i}

satisfies the LLN (for instance with i.i.d.

Z_{0}^{j}

), and that the propagation of chaos property holds. A full justification of this property would involve generalizing the result in Section 2.1 of (Carmona and Delarue 2014) to an infinite dimensional setting in order to take into account the past of the controls. This is highly technical but intuitively sound. A complete proof is beyond the scope of this paper.

In the limit, a single representative player tries to minimize his cost functional, and, dropping the index i, his value function is defined as

V (t, z) = inf_{{(α_{s})}_{t \leq s \leq T}} E [\int_{t}^{T} f (s, Z_{0, s}, μ_{0} (s), α_{s}) d s + g (Z_{0, T}, μ_{0} (T)) | Z_{t} = z],

(17)

subject to

d Z_{t} = (A Z_{t} + B α_{t}) d t + G d W_{t} .

(18)

The HJB equation for the value function

V (t, z)

reads

\partial_{t} V + \frac{1}{2} T r (G^{*} G \partial_{z z} V) + 〈 A Z, \partial_{z} V 〉 + {inf}_{α} \{〈 B α, \partial_{z} V 〉 + \frac{1}{2} α^{2} + \frac{ϵ}{2} {(\int_{R} y_{0} d μ_{0} (y_{0}) - z_{0})}^{2}\} = 0,

(19)

with terminal condition

V (T, z) = \frac{c}{2} {(\int_{R} y_{0} d μ_{0} (y_{0}) - z_{0})}^{2} .

Then, we minimize in

α

to get

{\hat{α}}_{t} = - (\partial_{z_{0}} V - [\partial_{z_{1}} V] (- τ)) .

(20)

After plugging it into (19), our backward HJB equation reads:

\begin{matrix} \partial_{t} V + \frac{1}{2} σ^{2} \partial_{z_{0} z_{0}} V + \int_{- τ}^{0} z_{1} \frac{d}{d s} (\partial_{z_{1}} V) d s - \frac{1}{2} {(\partial_{z_{0}} V - [\partial_{z_{1}} V] (- τ))}^{2} \\ + \frac{ϵ}{2} {(\int_{R} y_{0} d μ_{0} (y_{0}) - z_{0})}^{2} = 0, \\ V (T, z) = \frac{c}{2} {(\int_{R} y_{0} d μ_{0} (y_{0}) - z_{0})}^{2} . \end{matrix}

(21)

Next, since we “lift” the original non-Markovian optimization problem into a infinite dimensional Markovian control problem, we are able to characterize the corresponding generator for (18), which is denoted by

L_{t}

,

L_{t} φ (z) = 〈 (A Z + B {\hat{α}}_{t}), \partial_{z} φ 〉 + \frac{1}{2} T r (G^{*} G \partial_{z z} φ),

(22)

where

φ

is a smooth function and the time dependency comes from

{\hat{α}}_{t}

given by (20). The derivation of the adjoint

L_{t}^{*}

of

L_{t}

is given in Appendix A. Consequently, the forward Kolmogorov equation for the distribution

ν (t)

reads

\begin{matrix} \partial_{t} ν & = \int_{- τ}^{0} \partial_{z_{1}} (\frac{d}{d s} z_{1} ν) d s - \int_{- τ}^{0} \partial_{z_{1}} (z_{1} ν) (δ_{0} (s) - δ_{- τ} (s)) d s + \partial_{z_{0}} {(\partial_{z_{0}} V - [\partial_{z_{1}} V] (- τ)) ν} \\ - \int_{- τ}^{0} \partial_{z_{1}} {(\partial_{z_{0}} V - [\partial_{z_{1}} V] (- τ)) ν} δ_{- τ} (s) d s + \frac{1}{2} σ^{2} \partial_{z_{0} z_{0}} ν, \\ ν (0) & = P (ξ, ϕ {(s)}_{s \in [- τ, 0]}) . \end{matrix}

(23)

Combining (21) with (23), we obtain the mean field game system. To solve this, We make the following ansatz for the value function

\begin{matrix} V (t, z) = E_{0} (t) {(m_{0} - z_{0})}^{2} - 2 (m_{0} - z_{0}) \int_{- τ}^{0} E_{1} (t, - τ - s) (m_{1} - z_{1}) d s \\ + \int_{- τ}^{0} \int_{- τ}^{0} E_{2} (t, - τ - s, - τ - r) (m_{1} - z_{1}) (m_{1} - z_{1}) d s d r + E_{3} (t) . \end{matrix}

(24)

where we denote the mean of state

m_{0} : = \int_{R} z_{0} d μ_{0} (z_{0})

, and the mean of past control

m_{1} : = \int_{H} z_{1} d μ_{1} (z_{1})

. Plugging (24) into (23), multiplying both sides of (23) by

z_{0}

, and integrating over

R \times H

, we have

\begin{matrix} \int_{R \times H} z_{0} \partial_{t} ν d z = \int_{R \times H} z_{0} \int_{- τ}^{0} \partial_{z_{1}} (\frac{d}{d s} z_{1} ν) d s d z - \int_{R \times H} z_{0} \int_{- τ}^{0} \partial_{z_{1}} (z_{1} ν) (δ_{0} (s) - δ_{- τ} (s)) d s d z \\ + \int_{R \times H} z_{0} \partial_{z_{0}} {(\partial_{z_{0}} V - [\partial_{z_{1}} V] (- τ)) ν} d z - \int_{R \times H} z_{0} \int_{- τ}^{0} \partial_{z_{1}} {(\partial_{z_{0}} V - [\partial_{z_{1}} V] (- τ)) ν} δ_{- τ} (s) d s d z \\ + \int_{R \times H} z_{0} \frac{1}{2} σ^{2} \partial_{z_{0} z_{0}} ν d z . \end{matrix}

(25)

After integration by parts, we obtain

\partial_{t} m_{0} = \int_{R \times H} \{\partial_{z_{0}} V - [\partial_{z_{1}} V] (- τ)\} ν d z = 0,

(26)

as can be seen directly using (24).

Similarly, plugging (24) to (23), multiplying both sides of (23) by

z_{1}

, and integrating over

R \times H

, we get

\begin{matrix} \int_{R \times H} z_{1} \partial_{t} ν d z = \int_{R \times H} z_{1} \int_{- τ}^{0} \partial_{z_{1}} (\frac{d}{d s} z_{1} ν) d s d z - \int_{R \times H} z_{1} \int_{- τ}^{0} \partial_{z_{1}} (z_{1} ν) (δ_{0} (s) - δ_{- τ} (s)) d s d z \\ + \int_{R \times H} z_{1} \partial_{z_{0}} {(\partial_{z_{0}} V - [\partial_{z_{1}} V] (- τ)) ν} d z - \int_{R \times H} z_{1} \int_{- τ}^{0} \partial_{z_{1}} {(\partial_{z_{0}} V - [\partial_{z_{1}} V] (- τ)) ν} δ_{- τ} (s) d s d z \\ + \int_{R \times H} z_{1} \frac{1}{2} σ \partial_{z_{0} z_{0}}^{2} ν d z . \end{matrix}

(27)

By integration by parts, we deduce

\begin{matrix} \partial_{t} m_{1} & = - \int_{R \times H} \int_{- τ}^{0} \frac{d}{d s} z_{1} ν d s d z + \int_{R \times H} \int_{- τ}^{0} z_{1} ν (δ_{0} (s) - δ_{- τ} (s)) d s d z \\ + \int_{R \times H} \{\partial_{z_{0}} V - [\partial_{z_{1}} V] (- τ)\} ν d z \\ = 0 . \end{matrix}

(28)

Now we are ready to verify the ansatz (24). We first compute the derivative of the ansatz,

\begin{matrix} \partial_{t} V & = \frac{d E_{0} (t)}{d t} {(m_{0} - z_{0})}^{2} - 2 (m_{0} - z_{0}) \int_{- τ}^{0} \frac{\partial E_{1} (t, - τ - s)}{\partial t} (m_{1} - z_{1}) d s \\ + \int_{- τ}^{0} \int_{- τ}^{0} \frac{\partial E_{2} (t, - τ - s, - τ - r)}{\partial t} (m_{1} - z_{1}) (m_{1} - z_{1}) d s d r + \frac{d E_{3} (t)}{d t}, \\ \partial_{z_{0}} V & = - 2 E_{0} (t) (m_{0} - z_{0}) + 2 \int_{- τ}^{0} E_{1} (t, - τ - s) (m_{1} - z_{1}) d s, \\ \partial_{z_{1}} V & = 2 E_{1} (t, - τ - s) (m_{0} - z_{0}) - 2 \int_{- τ}^{0} E_{2} (t, - τ - s, - τ - r) (m_{1} - z_{1}) d r, \\ \partial_{z_{0} z_{0}} V & = 2 E_{0} (t) . \end{matrix}

(29)

Then, we plug the ansatz (24) into (7), and by collecting

{(m_{0} - z_{0})}^{2}

terms,

(m_{0} - z_{0}) (m_{1} - z_{1})

terms,

{(m_{1} - z_{1})}^{2}

terms, and constant terms, we obtain the following system of PDEs:

\begin{matrix} \frac{d E_{0} (t)}{d t} - 2 {(E_{0} (t) + E_{1} (t, 0))}^{2} + \frac{ϵ}{2} = 0, \\ \frac{\partial E_{1} (t, s)}{\partial t} - \frac{\partial E_{1} (t, s)}{\partial s} - 2 (E_{0} (t) + E_{1} (t, 0)) (E_{1} (t, s) + E_{2} (t, s, 0)) = 0, \\ \frac{\partial E_{2} (t, s, r)}{\partial t} - \frac{\partial E_{2} (t, s, r)}{s} - \frac{\partial E_{2} (t, s, r)}{r} - 2 (E_{1} (t, s) + E_{2} (t, s, 0)) (E_{1} (t, r) + E_{2} (t, r, 0)) = 0, \\ \frac{d E_{3} (t)}{d t} + σ^{2} E_{0} (t) = 0, \end{matrix}

(30)

with boundary conditions

\begin{matrix} E_{0} (T) = \frac{c}{2}, E_{1} (T, s) = 0, E_{2} (T, s, r) = 0, E_{2} (t, s, r) = E_{2} (t, r, s), \\ E_{1} (t, - τ) = - E_{0} (t), E_{2} (t, s, - τ) = - E_{1} (t, s), E_{3} (T) = 0 . \end{matrix}

(31)

As for (13)–(14), the system (30)–(31) admits a unique solution.

4. The Master Equation

4.1. Derivatives

The master equation for this delayed game lies in an infinite dimensional space, and it requires a notion of derivatives in the space of measures in

P (H)

.

The set

P (H)

of probability measure on

H

is endowed with Monge-Kantorovich distance

d_{M K} (μ_{1}, μ_{1}^{'}) = sup \{{∥\int_{H} f (z) d (μ_{1} - μ_{1}^{'}) (z)∥}_{H} : f \in L i p_{1} (H)\},

(32)

where

L i p (H)

is the collection of real-valued Lipschitz functions on

H

with Lipschitz constant 1.

Definition 1.

We say that

F : P (H) \to H

is

C^{1}

if there exists an operator

\frac{δ F}{δ ν} : P (H) \times H \to H

such that for any

μ_{1}

and

μ_{1}^{'} \in P (H)

lim_{ϵ \to 0^{+}} \frac{F (μ_{1} + ϵ (μ_{1}^{'} - μ_{1})) - F (μ_{1})}{ϵ} = \int_{H} \frac{δ F}{δ μ_{1}} (μ_{1}, y_{1}) d (μ_{1}^{'} - μ_{1}) (y_{1}) .

(33)

Definition 2.

If

\frac{δ F}{δ μ_{1}} (μ_{1}, y_{1})

is of class

C^{1}

with respect to

y_{1}

, the marginal derivative

D_{μ_{1}} F : P (H) \times H \to H

is defined in the sense of Fréchet derivative:

D_{μ_{1}} F (μ_{1}, y_{1}) : = D_{y_{1}} \frac{δ F}{δ μ_{1}} (μ_{1}, y_{1}) .

(34)

Remark 1.

Usually we will encounter a map

U : P (H) \to R

. In this case, U can be expressed in a form of composition

\tilde{U} \circ F

, where

\tilde{U} : H \to R

, and

F : P (H) \to H

, i.e.,

U = (\tilde{U} \circ F) (μ_{1})

.

If

\frac{δ F}{δ μ_{1}}

is

C^{1}

with respect to

y_{1}

, and

\tilde{U}

is Fréchet differentiable, then

\frac{δ U}{δ μ_{1}} : P (H) \times H \to H

, and

D_{μ_{1}} U : P (H) \times H \to H

are defined by

\frac{δ U}{δ μ_{1}} (μ_{1}, y_{1}) : = (D_{F} \tilde{U}) (\frac{δ F}{δ μ_{1}}), a n d D_{μ_{1}} U (μ_{1}, y_{1}) : = (D_{F} \tilde{U}) (D_{μ_{1}} F) .

(35)

Example 1.

Suppose

U (μ_{1}) = \int_{- τ}^{0} \int_{H} g (x_{1} (s)) d μ_{1} (x_{1}) d s

, where

g : H \to H

is Fréchet differentiable. Then

U (μ_{1})

can be written as

\tilde{U} [F (μ_{1})] (s)

, where

\tilde{U} [F] = \int_{- τ}^{0} F (s) d s

, and

F (μ_{1}) = \int_{H} g (x_{1} (s)) d μ_{1} (x_{1})

. Then

F (μ_{1} + ϵ (μ_{1}^{'} - μ_{1})) = \int_{H} g (x_{1} (s)) d (μ_{1} + ϵ (μ_{1}^{'} - μ_{1})) .

So

\frac{F (μ_{1} + ϵ (μ_{1}^{'} - μ_{1})) - F (μ_{1})}{ϵ} = \int_{H} g (x_{1} (s)) d (μ_{1}^{'} - μ_{1}) .

Then

\frac{δ F}{δ μ_{1}} (μ_{1}, y_{1}) = g (y_{1}), a n d D_{μ_{1}} F (μ_{1}, y_{1}) = D_{y_{1}} g (y_{1}) .

Since

D_{F} \tilde{U} [F] = 1

, we have

\frac{δ U}{δ μ_{1}} (μ_{1}, y_{1}) = g (y_{1}) a n d D_{μ_{1}} U (μ_{1}, y_{1}) = D_{y_{1}} g (y_{1}) .

4.2. The Master Equation

Theorem 1.

For any

(t_{0}, ν_{0}) \in [0, T] \times P (R \times H)

, we define

U (t_{0}, \cdot, ν_{0}) : = V (t_{0}, \cdot),

(36)

where

(V, ν)

is a classical solution to the system of forward-backward Equations (21) and (23), with initial condition

ν (t_{0}) = ν_{0}

, and terminal condition

V (T, z) = \frac{c}{2} {(\int_{R} y_{0} d μ_{0} (y_{0}) - z_{0})}^{2}

, respectively. Then U must satisfy the following master equation

\begin{matrix} \partial_{t} U (t, z_{0}, z_{1}, ν) + \frac{1}{2} σ^{2} \partial_{z_{0} z_{0}} U (t, z_{0}, z_{1}, ν) + \frac{1}{2} σ^{2} \int_{R} \partial_{y_{0}} D_{μ_{0}} U (t, z_{0}, z_{1}, ν, y_{0}) d μ_{0} (y_{0}) \\ + \int_{- τ}^{0} z_{1} \frac{d}{d s} \partial_{z_{1}} U (t, z_{0}, z_{1}, ν) d s + \int_{- τ}^{0} \int_{H} y_{1} \frac{d}{d s} [D_{μ_{1}} U (t, z_{0}, z_{1}, ν, y_{1})] (s) d μ_{1} (y_{1}) d s \\ - \int_{R \times H} (\partial_{y_{0}} U (t, y_{0}, y_{1}, ν) - [\partial_{y_{1}} U (t, y_{0}, y_{1}, ν)] (- τ)) D_{μ_{0}} U (t, z_{0}, z_{1}, ν, y_{0}) d ν (y) \\ + \int_{R \times H} (\partial_{y_{0}} U (t, y_{0}, y_{1}, ν) - [\partial_{y_{1}} U (t, y_{0}, y_{1}, ν)] (- τ)) [D_{μ_{1}} U (t, z_{0}, z_{1}, ν, y_{1})] (- τ) d ν (y) \\ - \frac{1}{2} {(\partial_{z_{0}} U (t, z_{0}, z_{1}, ν) - [\partial_{z_{1}} U (t, z_{0}, z_{1}, ν)] (- τ))}^{2} + \frac{ϵ}{2} {(\int_{R} y_{0} d μ_{0} (y_{0}) - z_{0})}^{2} = 0, \end{matrix}

(37)

where

μ_{0}

and

μ_{1}

are the marginal law for

Z_{0}

and

Z_{1}

respectively.

Proof.

For any

h \in [0, T - t_{0}]

,

V (t_{0} + h, \cdot) = U (t_{0} + h, \cdot, ν (t_{0} + h))

. Then

\begin{matrix} \partial_{t} V (t_{0}, z) \\ = & \partial_{t} U (t_{0}, z, ν_{0}) + \int_{R \times H} \frac{δ U}{δ ν} (t_{0}, z, ν, y) \partial_{t} ν (t_{0}, y) d y \\ = & \partial_{t} U (t_{0}, z, ν_{0}) + \int_{R \times H} \frac{δ U}{δ ν} (t_{0}, z, ν, y) (\int_{- τ}^{0} \partial_{y_{1}} (\frac{d}{d s} y_{1} ν) d s - \int_{- τ}^{0} \partial_{y_{1}} (y_{1} ν) (δ_{0} (s) - δ_{- τ} (s)) d s \\ + \partial_{y_{0}} \{(\partial_{y_{0}} U - [\partial_{y_{1}} U] (- τ)) ν\} - \int_{- τ}^{0} \partial_{y_{1}} \{(\partial_{y_{0}} U - [\partial_{y_{1}} U] (- τ)) ν\} δ_{- τ} (s) d s + \frac{1}{2} σ^{2} \partial_{y_{0} y_{0}} ν) d y \\ = & \partial_{t} U (t_{0}, z, ν_{0}) - \int_{- τ}^{0} \int_{R \times H} D_{μ_{1}} U (t_{0}, z, ν, y) \frac{d}{d s} y_{1} ν d y d s \\ + \int_{- τ}^{0} \int_{R \times H} D_{μ_{1}} U y_{1} ν (δ_{0} (s) - δ_{- τ} (s)) d y d s - \int_{R \times H} D_{μ_{0}} U (\partial_{y_{0}} U - [\partial_{y_{1}} U] (- τ)) ν d y \\ + \int_{- τ}^{0} \int_{R \times H} D_{μ_{1}} U (\partial_{y_{0}} U - [\partial_{y_{1}} U] (- τ) ν) δ_{- τ} (s) d y d s + \int_{R \times H} \frac{1}{2} σ^{2} \partial_{y_{0}} D_{μ_{0}} U ν d y \\ = & \partial_{t} U (t_{0}, z, ν_{0}) + \int_{- τ}^{0} \int_{R \times H} y_{1} \frac{d}{d s} D_{μ_{1}} U ν d y d s - \int_{R \times H} D_{μ_{0}} U (\partial_{y_{0}} U - [\partial_{y_{1}} U] (- τ)) ν d y \\ + \int_{R \times H} [D_{μ_{1}} U] (- τ) ((\partial_{y_{0}} U - [\partial_{y_{1}} U] (- τ)) ν d y + \frac{1}{2} σ^{2} \int_{R \times H} \partial_{y_{0}} D_{μ_{0}} U ν d y . \end{matrix}

(38)

On the other hand, V satisfies the HJB Equation (7).

\begin{matrix} \partial_{t} V \\ = & - \frac{1}{2} σ^{2} \partial_{z_{0} z_{0}} V - \int_{- τ}^{0} z_{1} \frac{d}{d s} (\partial_{z_{1}} V) d s + \frac{1}{2} {(\partial_{z_{0}} V - [\partial_{z_{1}} V] (- τ))}^{2} - \frac{ϵ}{2} {(\int_{R} y_{0} d μ_{0} (y_{0}) - z_{0})}^{2} \\ = & - \frac{1}{2} σ^{2} \partial_{z_{0} z_{0}} U - \int_{- τ}^{0} z_{1} \frac{d}{d s} (\partial_{z_{1}} U) d s + \frac{1}{2} {(\partial_{z_{0}} U - [\partial_{z_{1}} U] (- τ))}^{2} - \frac{ϵ}{2} {(\int_{R} y_{0} d μ_{0} (y_{0}) - z_{0})}^{2} . \end{matrix}

(39)

Therefore, subtracting (39) from (38), we have shown that U satisfies the master Equation (37). □

4.3. Explicit Solution of the Master Equation

It turns out that this master Equation (37) can be solved explicitly by making the following ansatz, and we also define

m_{0} : = \int_{R} y_{0} d μ_{0} (y_{0})

and

m_{1} : = \int_{H} y_{1} d μ_{1} (y_{1})

for convenience.

\begin{matrix} U (t, z_{0}, z_{1}, ν) = E_{0} (t) {(m_{0} - z_{0})}^{2} - 2 (m_{0} - z_{0}) \int_{- τ}^{0} E_{1} (t, - τ - s) (m_{1} - z_{1}) d s \\ + \int_{- τ}^{0} \int_{- τ}^{0} E_{2} (t, - τ - s, - τ - r) (m_{1} - z_{1}) (m_{1} - z_{1}) d s d r + E_{3} (t) . \end{matrix}

(40)

Then, we compute the partial derivatives needed in (37) explicitly, we have

\begin{matrix} \partial_{t} U = \frac{d E_{0} (t)}{d t} {(m_{0} - z_{0})}^{2} - 2 (m_{0} - z_{0}) \int_{- τ}^{0} \frac{\partial E_{1} (t, - τ - s)}{\partial t} (m_{1} - z_{1}) d s \\ + \int_{- τ}^{0} \int_{- τ}^{0} \frac{\partial E_{2} (t, - τ - s, - τ - r)}{\partial t} (m_{1} - z_{1}) (m_{1} - z_{1}) d s d r + \frac{d E_{3} (t)}{d t}, \\ \partial_{z_{0}} U = - 2 E_{0} (t) (m_{0} - z_{0}) + 2 \int_{- τ}^{0} E_{1} (t, - τ - s) (m_{1} - z_{1}) d s, \\ \partial_{z_{1}} U = 2 E_{1} (t, - τ - s) (m_{0} - z_{0}) - 2 \int_{- τ}^{0} E_{2} (t, - τ - s, - τ - r) (m_{1} - z_{1}) d r, \\ D_{μ_{0}} U = 2 E_{0} (t) (m_{0} - z_{0}) - 2 \int_{- τ}^{0} E_{1} (t, - τ - s) (m_{1} - z_{1}) d s, \\ D_{μ_{1}} U = - 2 E_{1} (t, - τ - s) (m_{0} - z_{0}) + 2 \int_{- τ}^{0} E_{2} (t, - τ - s, - τ - r) (m_{1} - z_{1}) d r, \\ \partial_{z_{0} z_{0}} U = 2 E_{0} (t), \end{matrix}

(41)

and plug those into our master Equation (37). We have

\begin{matrix} \frac{d E_{0} (t)}{d t} {(m_{0} - z_{0})}^{2} - 2 (m_{0} - z_{0}) \int_{- τ}^{0} \frac{\partial E_{1} (t, - τ - s)}{\partial t} (m_{1} - z_{1}) d s \\ + \int_{- τ}^{0} \int_{- τ}^{0} \frac{\partial E_{2} (t, - τ - s, - τ - r)}{\partial t} (m_{1} - z_{1}) (m_{1} - z_{1}) d s d r + \frac{d E_{3} (t)}{d t} + \frac{1}{2} σ^{2} (2 E_{0} (t)) \\ - \int_{- τ}^{0} (m_{1} - z_{1}) (2 \frac{\partial E_{1} (t, - τ - s)}{\partial s} (m_{0} - z_{0}) - 2 \int_{- τ}^{0} \frac{\partial E_{2} (t, - τ - s, - τ - r)}{\partial s} (m_{1} - z_{1}) d r) d s \\ - 2 {((E_{0} (t) + E_{1} (t, 0)) (m_{0} - z_{0}) - \int_{- τ}^{0} (E_{1} (t, - τ - s) + E_{2} (t, - τ - s, 0)) (m_{1} - z_{1}) d s)}^{2} \\ + \frac{ϵ}{2} {(m_{0} - z_{0})}^{2} = 0 . \end{matrix}

Collecting

{(m_{0} - z_{0})}^{2}

terms,

(m_{0} - z_{0}) (m_{1} - z_{1})

terms,

{(m_{1} - z_{1})}^{2}

terms, and constant terms, we obtain that the function

E_{i}, i = 0, \dots, 3

, satisfy the system of PDEs (30) with boundary conditions (31).

5. Convergence of the Nash System

From the previous section, we have seen that our master equation is well posed, and we obtained an explicit solution. Furthermore, it also describes the limit of Nash equilibria of the N-player games as

N \to \infty

. In this section, generalizing to the case with delay the results of (Cardaliaguet et al. 2015) (see also Kolokoltsov et al. 2014), we show that the solution of the Nash system (11) converges to the solution of the master Equation (37) as number of players

N \to + \infty

, with a

1 / N

Cesaro convergence rate.

In Section 4, we find that (40) is a solution to the master Equation (37). We set

u^{i} (t, z_{0}, z_{1}) : = U (t, z_{0}^{i}, z_{1}^{i}, ν^{i})

, where

ν^{i} = \frac{1}{N - 1} \sum_{k \neq i} δ_{(z_{0}^{k}, z_{1}^{k})}

, denotes the joint empirical measure of

z_{0}

and

z_{1}

. The empirical measure of

z_{0}

is given by

μ_{0}^{i} = \frac{1}{N - 1} \sum_{k \neq i} δ_{z_{0}^{k}}

, and the empirical measure of

z_{1}

is given by

μ_{1}^{i} = \frac{1}{N - 1} \sum_{k \neq i} δ_{z_{1}^{k}}

. Note that, by direct computation, for

k \neq i

, and any

N \geq 2

,

\begin{matrix} \partial_{z_{0}^{k}} u^{i} (t, z_{0}, z_{1}) = \frac{1}{N - 1} D_{μ_{0}^{i}} U (t, z_{0}^{i}, z_{1}^{i}, ν^{i}, z_{0}^{k}), \\ \partial_{z_{1}^{k}} u^{i} (t, z_{0}, z_{1}) = \frac{1}{N - 1} D_{μ_{1}^{i}} U (t, z_{0}^{i}, z_{1}^{i}, ν^{i}, z_{1}^{k}), \\ \partial_{z_{0}^{k} z_{0}^{k}} u^{i} (t, z_{0}, z_{1}) = \frac{1}{N - 1} \partial_{z_{0}^{k}} [D_{μ_{0}^{i}} U] (t, z_{0}^{i}, z_{1}^{i}, ν^{i}, z_{0}^{k}) + \frac{1}{{(N - 1)}^{2}} D_{μ_{0}^{i} μ_{0}^{i}} U (t, z_{0}^{i}, z_{1}^{i}, ν^{i}, z_{0}^{k}, z_{0}^{k}) . \end{matrix}

(42)

Proposition 1.

For any

i \in {1, \dots, N}

,

u^{i} (t, z_{0}, z_{1})

satisfies

\begin{matrix} \partial_{t} u^{i} + \sum_{k = 1}^{N} \frac{1}{2} σ^{2} \partial_{z_{0}^{k} z_{0}^{k}} u^{i} + \sum_{k = 1}^{N} \int_{- τ}^{0} z_{1}^{k} \frac{d}{d s} (\partial_{z_{1}^{k}} u^{i}) d s - \sum_{k \neq i}^{N} (\partial_{z_{0}^{k}} u^{k} - [\partial_{z_{1}^{k}} u^{k}] (- τ)) (\partial_{z_{0}^{k}} u^{i} - [\partial_{z_{1}^{k}} u^{i}] (- τ)) \\ - \frac{1}{2} {(\partial_{z_{0}^{i}} u^{i} - [\partial_{z_{1}^{i}} u^{i}] (- τ))}^{2} + \frac{ϵ}{2} {({\bar{z}}_{0} - z_{0}^{i})}^{2} + e^{i} (t, z) = 0, \end{matrix}

(43)

where

∥ e^{i} (t, z) ∥ < \frac{C}{N}

, with terminal condition

u^{i} (T, z) = \frac{c}{2} {({\bar{z}}_{0} - z_{0}^{i})}^{2}

.

This shows that

{(u^{i})}_{i \in {1, \dots, N}}

is “almost” a solution to the Nash system (11).

Proof.

We compute each term in the above equation in terms of U using the relationship (42), and we use the fact that U is a solution to the master equation.

$\begin{matrix} \sum_{k = 1}^{N} \frac{1}{2} σ^{2} \partial_{z_{0}^{k} z_{0}^{k}} u^{i} (t, z_{0}, z_{1}) \\ = & \frac{1}{2} σ^{2} \partial_{z_{0}^{i} z_{0}^{i}} u^{i} (t, z_{0}, z_{1}) + \sum_{k \neq i} \frac{1}{2} σ^{2} \partial_{z_{0}^{k} z_{0}^{k}} u^{i} (t, z_{0}, z_{1}) \\ = & \frac{1}{2} σ^{2} \partial_{z_{0}^{i} z_{0}^{i}} U (t, z_{0}^{i}, z_{1}^{i}, ν^{i}) + \frac{1}{2} σ^{2} \sum_{k \neq i} \frac{1}{N - 1} \partial_{z_{0}^{k}} [D_{μ_{0}^{i}} U] (t, z_{0}^{i}, z_{1}^{i}, ν^{i}, z_{0}^{k}) \\ + \frac{1}{2} σ^{2} \sum_{k \neq i} \frac{1}{{(N - 1)}^{2}} D_{μ_{0}^{i} μ_{0}^{i}} U (t, z_{0}^{i}, z_{1}^{i}, ν^{i}, z_{0}^{k}, z_{0}^{k}) \\ = & \frac{1}{2} σ^{2} \partial_{z_{0}^{i} z_{0}^{i}} U (t, z_{0}^{i}, z_{1}^{i}, ν^{i}) + \frac{1}{2} σ^{2} \int_{R} \partial_{y_{0}} [D_{μ_{0}^{i}} U] (t, z_{0}^{i}, z_{1}^{i}, ν^{i}, y_{0}) d μ_{0}^{i} (y_{0}) \\ + \frac{1}{2} σ^{2} \frac{1}{N - 1} \int_{R} D_{μ_{0}^{i} μ_{0}^{i}} U (t, z_{0}^{i}, z_{1}^{i}, ν^{i}, y_{0}, y_{0}) d μ_{0}^{i} (y_{0}) . \end{matrix}$
$\begin{matrix} \sum_{k = 1}^{N} \int_{- τ}^{0} z_{1}^{k} \frac{d}{d s} (\partial_{z_{1}^{k}} u^{i}) d s \\ = & \int_{- τ}^{0} z_{1}^{i} \frac{d}{d s} (\partial_{z_{1}^{i}} u^{i}) d s + \sum_{k \neq i} \int_{- τ}^{0} z_{1}^{k} \frac{d}{d s} (\partial_{z_{1}^{k}} u^{i}) d s \\ = & \int_{- τ}^{0} z_{1}^{i} \frac{d}{d s} (\partial_{z_{1}^{i}} U) d s + \sum_{k \neq i} \int_{- τ}^{0} z_{1}^{k} \frac{d}{d s} [\frac{1}{N - 1} D_{μ_{1}^{i}} U (t, z_{0}^{i}, z_{1}^{i}, ν^{i}, z_{1}^{k})] d s \\ = & \int_{- τ}^{0} z_{1}^{i} \frac{d}{d s} (\partial_{z_{1}^{i}} U) d s + \int_{- τ}^{0} \int_{H} y_{1} \frac{d}{d s} [D_{μ_{1}^{i}} U (t, z_{0}^{i}, z_{1}^{i}, ν^{i}, y_{1})] d μ_{1}^{i} (y_{1}) d s . \end{matrix}$
From the solution (40) of the master equation, $\partial_{z} U$ is Lipschitz with respect to the measures. Namely,

$\begin{matrix} | \partial_{z_{0}} U (t, z^{k}, ν^{i}) - \partial_{z_{0}} U (t, z^{k}, ν^{k}) | \leq C_{1} (d_{M K} (μ_{0}^{i}, μ_{0}^{k}) + d_{M K} (μ_{1}^{i}, μ_{1}^{k})) \leq \frac{C_{1}}{N - 1}, \\ ∥ \partial_{z_{1}} U (t, z^{k}, ν^{i}) - \partial_{z_{1}} U (t, z^{k}, ν^{k}) ∥_{H} \leq C_{2} (d_{M K} (μ_{0}^{i}, μ_{0}^{k}) + d_{M K} (μ_{1}^{i}, μ_{1}^{k})) \leq \frac{C_{2}}{N - 1} . \end{matrix}$

(44)

Thus,

$\begin{matrix} \sum_{k \neq i} (\partial_{z_{0}^{k}} u^{k} - [\partial_{z_{1}^{k}} u^{k}] (- τ)) (\partial_{z_{0}^{k}} u^{i} - [\partial_{z_{1}^{k}} u^{i}] (- τ)) \\ = & \sum_{k \neq i} \partial_{z_{0}^{k}} U (t, z_{0}^{k}, z_{1}^{k}, ν^{k}) (\frac{1}{N - 1} D_{μ_{0}^{i}} U (t, z_{0}^{i}, z_{1}^{k}, ν^{i}, z_{0}^{k}) - \frac{1}{N - 1} [D_{μ_{1}^{i}} U (t, z_{0}^{i}, z_{1}^{i}, ν^{i}, z_{1}^{k})] (- τ)) \\ - \sum_{k \neq i} [\partial_{z_{1}^{k}} U (t, z_{0}^{k}, z_{1}^{k}, ν^{k})] (- τ) (\frac{1}{N - 1} D_{μ_{0}^{i}} U (t, z_{0}^{i}, z_{1}^{k}, ν^{i}, z_{0}^{k}) - \frac{1}{N - 1} [D_{μ_{1}^{i}} U (t, z_{0}^{i}, z_{1}^{i}, ν^{i}, z_{1}^{k})] (- τ)) \\ = & \sum_{k \neq i} \partial_{z_{0}^{k}} U (t, z_{0}^{k}, z_{1}^{k}, ν^{i}) (\frac{1}{N - 1} D_{μ_{0}^{i}} U (t, z_{0}^{i}, z_{1}^{k}, ν^{i}, z_{0}^{k}) - \frac{1}{N - 1} [D_{μ_{1}^{i}} U (t, z_{0}^{i}, z_{1}^{i}, ν^{i}, z_{1}^{k})] (- τ)) \\ - \sum_{k \neq i} [\partial_{z_{1}^{k}} U (t, z_{0}^{k}, z_{1}^{k}, ν^{i})] (- τ) (\frac{1}{N - 1} D_{μ_{0}^{i}} U (t, z_{0}^{i}, z_{1}^{k}, ν^{i}, z_{0}^{k}) - \frac{1}{N - 1} [D_{μ_{1}^{i}} U (t, z_{0}^{i}, z_{1}^{i}, ν^{i}, z_{1}^{k})] (- τ)) \\ + O (\frac{1}{N}) \\ = & \int_{- τ}^{0} \int_{R \times H} (\partial_{y_{0}} U - \partial_{y_{1}} U) (t, y_{0}, y_{1}, ν^{i}) \cdot (D_{μ_{0}^{i}} U - D_{μ_{1}^{i}} U) (t, z_{0}^{i}, z_{1}^{i}, ν^{i}, y_{0}, y_{1}) d ν (y_{0}, y_{1}) δ_{- τ} (s) d s \\ + O (\frac{1}{N}) . \end{matrix}$

Then,

$\begin{matrix} \partial_{t} u^{i} + \sum_{k = 1}^{N} \frac{1}{2} σ^{2} \partial_{z_{0}^{k} z_{0}^{k}} u^{i} + \sum_{k = 1}^{N} \int_{- τ}^{0} z_{1}^{k} \frac{d}{d s} (\partial_{z_{1}^{k}} u^{i}) d s \\ - \sum_{k \neq i}^{N} (\partial_{z_{0}^{k}} u^{k} - [\partial_{z_{1}^{k}} u^{k}] (- τ)) (\partial_{z_{0}^{k}} u^{i} - [\partial_{z_{1}^{k}} u^{i}] (- τ)) - \frac{1}{2} {(\partial_{z_{0}^{i}} u^{i} - [\partial_{z_{1}^{i}} u^{i}] (- τ))}^{2} + \frac{ϵ}{2} {({\bar{z}}_{0} - z_{0}^{i})}^{2} \\ = & \partial_{t} U + \frac{1}{2} σ^{2} \partial_{z_{0}^{i} z_{0}^{i}} U (t, z_{0}^{i}, z_{1}^{i}, ν^{i}) + \frac{1}{2} σ^{2} \int_{R} \partial_{y_{0}} [D_{μ_{0}^{i}} U] (t, z_{0}^{i}, z_{1}^{i}, ν^{i}, y_{0}) d μ_{0}^{i} (y_{0}) \\ + \int_{- τ}^{0} z_{1}^{i} \frac{d}{d s} (\partial_{z_{1}^{i}} U) d s + \int_{- τ}^{0} \int_{H} y_{1} \frac{d}{d s} [D_{μ_{1}^{i}} U (t, z_{0}^{i}, z_{1}^{i}, ν^{i}, y_{1})] d μ_{1}^{i} (y_{1}) d s \\ - \int_{R \times H} (\partial_{y_{0}} U - [\partial_{y_{1}} U] (- τ)) (t, y_{0}, y_{1}, ν^{i}) \cdot (D_{μ_{0}^{i}} U - [D_{μ_{1}^{i}} U] (- τ)) (t, z_{0}^{i}, z_{1}^{i}, ν^{i}, y_{0}, y_{1}) d ν^{i} (y_{0}, y_{1}) \\ - \frac{1}{2} {(\partial_{z_{0}^{i}} U - [\partial_{z_{1}^{i}} U] (- τ))}^{2} + \frac{ϵ}{2} {(\int y_{0} d μ_{0}^{i} (y_{0}) - z_{0}^{i})}^{2} \\ + O (\frac{1}{N}) + \frac{1}{2} σ^{2} \frac{1}{N - 1} \int_{R} D_{μ_{0}^{i} μ_{0}^{i}} U (t, z_{0}^{i}, z_{1}^{i}, ν^{i}, y_{0}, y_{0}) d μ_{0}^{i} (y_{0}) \\ = & O (\frac{1}{N}) . \end{matrix}$

□

Theorem 2.

Let

V^{i}

be the solution to the HJB Equation (11) of the N-player system, where

N \geq 1

fixed, and U be the solution to the master Equation (37). Fix any

(t_{0}, ν_{0}) \in [0, T] \times P (R \times H)

. Then for any

z \in R^{N}

, let

ν^{i} = \frac{1}{N - 1} \sum_{j \neq i}^{N} δ_{(z_{0}^{j}, z_{1}^{j})}

, we have

\frac{1}{N} \sum_{i = 1}^{N} | V^{i} (t_{0}, z) - U (t_{0}, z^{i}, ν^{i}) | \leq C N^{- 1} .

(45)

Proof.

We first apply Ito’s formula to

{(V^{i})}_{i \in {1, \dots, N}}

, and use the fact that

V^{i}

satisfies the HJB Equation (11) for the Nash system.

\begin{matrix} d V^{i} (t, Z_{t}) \\ = & \partial_{t} V^{i} d t + \partial_{z} V^{i} d Z_{t} + \frac{1}{2} T r (\partial_{z z} V^{i} d {[Z, Z]}_{t}) \\ = & \partial_{t} V^{i} d t + 〈 A Z, \partial_{z} V^{i} 〉 d t + 〈 B {\hat{α}}^{i}, \partial_{z} V^{i} 〉 d t + 〈 \partial_{z} V^{i}, G 〉 d W_{t} + \frac{1}{2} T r (G^{*} G \partial_{z z} V^{i}) d t \\ = & \partial_{t} V^{i} d t + \sum_{k = 1}^{N} \int_{- τ}^{0} z_{1}^{k} \frac{d}{d s} (\partial_{z_{1}^{k}} V^{i}) d s d t - \sum_{k = 1}^{N} (\partial_{z_{0}^{k}} V^{i} - [\partial_{z_{1}^{k}} V^{i}] (- τ)) (\partial_{z_{0}^{k}} V^{k} - [\partial_{z_{1}^{k}} V^{k}] (- τ)) d t \\ + \sum_{k = 1}^{N} \frac{1}{2} σ^{2} \partial_{z_{0}^{k} z_{0}^{k}} V^{i} d t + \sum_{k = 1}^{N} σ \partial_{z_{0}^{k}} V^{i} d W_{t}^{k} \\ = & [- \frac{1}{2} {(\partial_{z_{0}^{i}} V^{i} - [\partial_{z_{1}^{i}} V^{i}] (- τ))}^{2} - \frac{ϵ}{2} {({\bar{Z}}_{0} - Z_{0}^{i})}^{2}] d t + \sum_{k = 1}^{N} σ \partial_{z_{0}^{k}} V^{i} d W_{t}^{k} . \end{matrix}

(46)

Then, we apply Ito’s formula to

u^{i} (t, Z_{t})

, and use the fact that u satisfies (43)

\begin{matrix} d u^{i} (t, Z) \\ = & \partial_{t} u^{i} d t + \partial_{z} u^{i} d Z_{t} + \frac{1}{2} T r (\partial_{z z} u^{i} d {[Z, Z]}_{t}) \\ = & \partial_{t} u^{i} d t + 〈 A Z, \partial_{z} u^{i} 〉 d t + 〈 B {\hat{α}}^{i}, \partial_{z} u^{i} 〉 d t + 〈 \partial_{z} u^{i}, G 〉 d t + \frac{1}{2} T r (G^{*} G \partial_{z z} u^{i}) d t \\ = & \partial_{t} u^{i} d t + \sum_{k = 1}^{N} \int_{- τ}^{0} z_{1}^{k} \frac{d}{d s} (\partial_{z_{1}^{k}} u^{i}) d s d t - \sum_{k = 1}^{N} (\partial_{z_{0}^{k}} u^{i} - [\partial_{z_{1}^{k}} u^{i}] (- τ)) (\partial_{z_{0}^{k}} V^{k} - [\partial_{z_{1}^{k}} V^{k}] (- τ)) d t \\ + \sum_{k = 1}^{N} \frac{1}{2} σ^{2} \partial_{z_{0}^{k} z_{0}^{k}} u^{i} d t + \sum_{k = 1}^{N} σ \partial_{z_{0}^{k}} u^{i} d W_{t}^{k} \\ = & \sum_{k = 1}^{N} (\partial_{z_{0}^{k}} u^{k} - [\partial_{z_{1}^{k}} u^{k}] (- τ)) (\partial_{z_{0}^{k}} u^{i} - [\partial_{z_{1}^{k}} u^{i}] (- τ)) d t \\ - \sum_{k = 1}^{N} (\partial_{z_{0}^{k}} u^{i} - [\partial_{z_{1}^{k}} u^{i}] (- τ)) (\partial_{z_{0}^{k}} V^{k} - [\partial_{z_{1}^{k}} V^{k}] (- τ)) d t - \frac{1}{2} {(\partial_{z_{0}^{i}} u^{i} - [\partial_{z_{1}^{i}} u^{i}] (- τ))}^{2} d t \\ - \frac{ϵ}{2} {({\bar{Z}}_{0} - Z_{0}^{i})}^{2} d t - e^{i} d t + \sum_{k = 1}^{N} σ \partial_{z_{0}^{k}} u^{i} d W_{t}^{k} . \end{matrix}

(47)

Substracting (46) from (47), taking the square and applying Ito’s formula again, we obtain

\begin{matrix} d {[u^{i} (t, Z_{t}) - V^{i} (t, Z_{t})]}^{2} \\ = & 2 [u^{i} (t, Z_{t}) - V^{i} (t, Z_{t})] (d u^{i} (t, Z_{t}) - d V^{i} (t, Z_{t})) + d {[u^{i} - V^{i}, u^{i} - V^{i}]}_{t} \\ = & - 2 (u^{i} - V^{i}) (\frac{1}{2} {(\partial_{z_{0}^{i}} u^{i} - [\partial_{z_{1}^{i}} u^{i}] (- τ))}^{2} - \frac{1}{2} {(\partial_{z_{0}^{i}} V^{i} - [\partial_{z_{1}^{i}} V^{i}] (- τ))}^{2}) d t - 2 (u^{i} - V^{i}) e^{i} d t \\ - 2 (u^{i} - V^{i}) (\sum_{k = 1}^{N} (\partial_{z_{0}^{k}} u^{i} - [\partial_{z_{1}^{k}} u^{i}] (- τ)) ((\partial_{z_{0}^{k}} V^{k} - \partial_{z_{0}^{k}} u^{k}) - ([\partial_{z_{1}^{k}} V^{k}] (- τ) - [\partial_{z_{1}^{k}} u^{k}] (- τ)))) d t \\ + \sum_{k = 1}^{N} σ^{2} {| \partial_{z_{0}^{k}} u^{i} - \partial_{z_{0}^{k}} V^{i} |}^{2} d t + \sum_{k = 1}^{N} σ (\partial_{z_{0}^{k}} u^{i} - \partial_{z_{0}^{k}} V^{i}) d W_{t}^{k} . \end{matrix}

(48)

Recall that

\partial_{z_{0}^{k}} u^{i} (t, z_{0}, z_{1}) = \frac{1}{N - 1} D_{μ_{0}^{i}} U (t, z_{0}^{i}, z_{1}^{i}, ν^{i}, z_{0}^{k})

is bounded by

\frac{C}{N}

for

k \neq i

, and

e^{i}

is bounded by

\frac{C}{N}

. Let

{(Ξ^{i})}_{i \in {1, \dots, N}}

be a family of independent random variable with common law

ν_{0}

. By integrating (48) from t to T, and taking expectation conditional on Ξ, we have

\begin{matrix} E^{Ξ} [| u_{t}^{i} - V_{t}^{i} |^{2}] + σ^{2} \sum_{k = 1}^{N} E^{Ξ} [\int_{t}^{T} {| \partial_{z_{0}^{k}} u_{s}^{i} - \partial_{z_{0}^{k}} V_{s}^{i} |}^{2} d s] \\ + C E^{Ξ} [\int_{t}^{T} | u_{s}^{i} - V_{s}^{i} | \cdot | [\partial_{z_{1}^{i}} u_{s}^{i}] (- τ) - [\partial_{z_{1}^{i}} V_{s}^{i}] (- τ) |] d s \\ + \frac{C}{N} \sum_{k = 1, k \neq i}^{N} E^{Ξ} [\int_{t}^{T} | u_{s}^{i} - V_{s}^{i} | \cdot | [\partial_{z_{1}^{k}} u_{s}^{k}] (- τ) - [\partial_{z_{1}^{k}} V_{s}^{k}] (- τ) |] d s \\ \leq E^{Ξ} [| u_{T}^{i} - V_{T}^{i} |^{2}] + C E^{Ξ} [\int_{t}^{T} | u_{s}^{i} - V_{s}^{i} | \cdot | \partial_{z_{0}^{i}} u_{s}^{i} - \partial_{z_{0}^{i}} V_{s}^{i} |] d s \\ + \frac{C}{N} \sum_{k = 1, k \neq i}^{N} E^{Ξ} [\int_{t}^{T} | u_{s}^{i} - V_{s}^{i} | \cdot | \partial_{z_{0}^{k}} u_{s}^{k} - \partial_{z_{0}^{k}} V_{s}^{k} |] d s \\ + \frac{C}{N} \int_{t}^{T} E^{Ξ} [| u_{s}^{i} - V_{s}^{i} |] d s . \end{matrix}

(49)

By the fact that

u_{T}^{i} = V_{T}^{i}

, and using Young’s inequality, we have

\begin{matrix} E^{Ξ} [| u_{t}^{i} - V_{t}^{i} |^{2}] + E^{Ξ} [\int_{t}^{T} {| \partial_{z_{0}^{i}} u_{s}^{i} - \partial_{z_{0}^{i}} V_{s}^{i} |}^{2} d s] \\ \leq & 0 + \frac{C}{2 ϵ_{1}} E^{Ξ} [\int_{t}^{T} {| u_{s}^{i} - V_{s}^{i} |}^{2} d s] + \frac{C ϵ_{1}}{2} E^{Ξ} [\int_{t}^{T} {| \partial_{z_{0}^{i}} u_{s}^{i} - \partial_{z_{0}^{i}} V_{s}^{i} |}^{2} d s] + \frac{C}{2 N ϵ_{2}} \sum_{k = 1}^{N} E^{Ξ} [\int_{t}^{T} {| u_{s}^{i} - V_{s}^{i} |}^{2} d s] \\ + \frac{C ϵ_{2}}{2 N} \sum_{k = 1}^{N} E^{Ξ} [\int_{t}^{T} {| \partial_{z_{0}^{k}} u_{s}^{k} - \partial_{z_{0}^{k}} V_{s}^{k} |}^{2} d s] + \frac{C}{2 N ϵ_{3}} E^{Ξ} [\int_{t}^{T} {| u_{s}^{i} - V_{s}^{i} |}^{2} d s] + \frac{C ϵ_{3}}{2 N} \int_{t}^{T} 1 d s \\ \leq & \frac{C}{N^{2}} + C E^{Ξ} [\int_{t}^{T} {| u_{s}^{i} - V_{s}^{i} |}^{2} d s] + \frac{C}{2 N} \sum_{k = 1}^{N} E^{Ξ} [\int_{t}^{T} {| \partial_{z_{0}^{k}} u_{s}^{k} - \partial_{z_{0}^{k}} V_{s}^{k} |}^{2} d s] . \end{matrix}

(50)

Taking average on both sides, we have

\begin{matrix} \frac{1}{N} \sum_{i = 1}^{N} E^{Ξ} [| u_{t}^{i} - V_{t}^{i} |^{2}] + \frac{1}{N} \sum_{i = 1}^{N} E^{Ξ} [\int_{t}^{T} {| \partial_{z_{0}^{i}} u_{s}^{i} - \partial_{z_{0}^{i}} V_{s}^{i} |}^{2} d s] \\ \leq & \frac{C}{N^{2}} + \frac{1}{N} \sum_{i = 1}^{N} C E^{Ξ} [\int_{t}^{T} {| u_{s}^{i} - V_{s}^{i} |}^{2} d s] + \frac{C}{2 N} \sum_{k = 1}^{N} E^{Ξ} [\int_{t}^{T} {| \partial_{z_{0}^{k}} u_{s}^{k} - \partial_{z_{0}^{k}} V_{s}^{k} |}^{2} d s] \\ \Rightarrow & \frac{1}{N} \sum_{i = 1}^{N} E^{Ξ} [| u_{t}^{i} - V_{t}^{i} |^{2}] \leq \frac{C}{N^{2}} + C E^{Ξ} [\frac{1}{N} \sum_{i = 1}^{N} \int_{t}^{T} {| u_{s}^{i} - V_{s}^{i} |}^{2} d s] . \end{matrix}

(51)

By Gronwall’s inequality and taking supremum over

[0, T]

, we have

sup_{t \in [0, T]} [\frac{1}{N} \sum_{i = 1}^{N} E^{Ξ} {| u_{t}^{i} - V_{t}^{i} |}^{2}] \leq \frac{C}{N^{2}},

(52)

which implies

\frac{1}{N} \sum_{i = 1}^{N} | u^{i} (t_{0}, Ξ) - V^{i} (t_{0}, Ξ) | \leq \frac{C}{N} .

(53)

Choosing Ξ uniformly distributed in

{(R \times H)}^{N}

, then by continuity of

u^{i}

and

V^{i}

, and the fact that

u^{i} (t, Z)

is defined by

U (t, Z_{0}^{i}, Z_{1}^{i}, ν^{i})

, we have, for any

z \in {(R \times H)}^{N}

,

\frac{1}{N} \sum_{i = 1}^{N} | U (t_{0}, z^{i}, ν_{0}^{i}) - V^{i} (t_{0}, z) | \leq \frac{C}{N} .

(54)

□

6. Conclusions

The mean field game system acts as a characteristic of the master equation. The master equation contains all the information in the mean field game system, and it turns the forward-backward PDE into a single equation. The solution to the mean field game system is a pair

(V, ν)

, that is the value function and the joint law of current state and past law. The solution to the master equation is a function of

(t, z, ν)

.

Since our model is linear quadratic, we are able to solve both the mean field game system and the master equation as shown in Section 3 and Section 4, however, the techniques are not the same. The technique for solving the mean field game is that we first make an ansatz for the solution of the HJB equation. Then plugging this ansatz into the Fokker-Planck Equation (23), we find that the means of state and past control are constant. Hence, the ansatz (24) can be verified. On the other hand, a notion of derivative with respect to measure is needed in order to solve the master equation. Again, we make an ansatz (40), which has a similar form as (24) but is a function of

(t, z, ν)

, and we verify that it satisfies the master equation.

The sets of PDEs (30) with boundary conditions (31) are the same for the two problems. This is due to the fact that our model is linear-quadratic and the means of states and past controls are constants.

Last but not the least, the Nash equilibrium of the corresponding N-player game is presented in Section 2. The value function (12) looks similar to the value function (24) in the mean field game system and the solution (40) to the master equation. As

N \to \infty

, the set of PDEs (13) becomes the same as (30). This implies that the solution to the mean filed game appears to be the limit of the Nash system, but generally, the convergence has been known in very few specific situations. Additionally, the solution to the master equation is also a limit to the Nash system, as shown in Section 5.

To summarize, we have extended the notion of master equation in the context of our toy model with delay, and we have shown that, as in the case without delay, this master equation provides an approximation to the corresponding finite-player game with delay. A general form of such a result, not necessarily for linear-quadratic games, is part of our ongoing research.

Author Contributions

The two authors contributed equally to all aspects of the content of this article.

Funding

This research was funded by NSF grants DMS-1409434 and DMS-1814091.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

SDDE	Stochastic Delayed Differential Equation
HJB	Hamilton-Jacobi-Bellman
MFG	Mean Field Game
PDE	Partial Differential Equation
LLN	Law of Large Numbers
NSF	National Science Foundation

Appendix A. Adjoint Operator

Let φ be a smooth test function defined on

R \times H

. In the following computation, we use the notation

〈 φ, ν (t) 〉 = \int_{R \times H} φ (z) d ν (t, z) .

If the test function φ is of the form

φ (z) = \int_{- τ}^{0} ψ (z_{0}, z_{1} (s)) d s

for a smooth function ψ defined on

R^{2}

, then

〈 φ, ν (t) 〉 = \int_{- τ}^{0} \int_{R \times R} ψ (z_{0}, z_{1} (s)) ν (t, z_{0}, z_{1} (s)) d z_{0} d z_{1} (s) d s,

where

ν (t, z_{0}, z_{1} (s))

is understood as a two-dimensional density. By abuse of notation, we also use

〈 φ, ν (t) 〉 = \int_{R \times H} φ (z) ν (t, z) d z = \int_{- τ}^{0} \int_{R \times R} ψ (z) ν (t, z) d z d s .

Then, we have

\begin{matrix} 〈 L_{t} φ, ν (t) 〉 \\ = & \int_{- τ}^{0} \int_{R \times R} z_{1} \frac{d \partial_{z_{1}} φ (z)}{d s} ν (t, z) d z d s + \int_{R \times H} - (\partial_{z_{0}} V - [\partial_{z_{1}} V] (- τ)) \partial_{z_{0}} φ (z) ν (t, z) d z \\ - \int_{- τ}^{0} \int_{R \times R} - (\partial_{z_{0}} V - [\partial_{z_{1}} V] (- τ)) \partial_{z_{1}} φ (z) δ_{- τ} (s) ν (t, z) d z d s \\ + \int_{R \times H} \frac{1}{2} σ^{2} \partial_{z_{0} z_{0}} φ (z) ν (t, z) d z \\ = & - \int_{- τ}^{0} \int_{R \times R} \frac{d z_{1}}{d s} \partial_{z_{1}} φ (z) ν (t, z) d z d s + \int_{- τ}^{0} \int_{R \times R} z_{1} \partial_{z_{1}} φ (z) ν (t, z) (δ_{0} (s) - δ_{- τ} (s)) d z d s \\ + \int_{R \times H} \partial_{z_{0}} \{(\partial_{z_{0}} V - [\partial_{z_{1}} V] (- τ)) ν (t, z)\} φ (z) d z \\ - \int_{- τ}^{0} \int_{R \times R} \partial_{z_{1}} \{(\partial_{z_{0}} V - [\partial_{z_{1}} V] (- τ)) ν (t, z)\} δ_{- τ} (s) φ (z) d z d s \\ + \int_{R \times H} \frac{1}{2} σ^{2} \partial_{z_{0} z_{0}} ν (t, z) φ (z) d z \\ = & \int_{- τ}^{0} \int_{R \times R} \partial_{z_{1}} (\frac{d z_{1}}{d s} ν (t, z)) φ (z) d z d s - \int_{- τ}^{0} \int_{R \times R} \partial_{z_{1}} (z_{1} ν (t, z)) φ (z) (δ_{0} (s) - δ_{- τ} (s)) d z d s \\ + \int_{R \times H} \partial_{z_{0}} \{(\partial_{z_{0}} V - [\partial_{z_{1}} V] (- τ)) ν (t, z)\} φ (z) d z \\ - \int_{- τ}^{0} \int_{R \times R} \partial_{z_{1}} \{(\partial_{z_{0}} V - [\partial_{z_{1}} V] (- τ)) ν (t, z)\} δ_{- τ} (s) φ (z) d z d s \\ + \int_{R \times H} \frac{1}{2} σ^{2} \partial_{z_{0} z_{0}} ν (t, z) φ (z) d z \\ = & 〈 φ, L_{t}^{*} ν (t) 〉 . \end{matrix}

References

Bensoussan, Alain, Giuseppe Da Prato, Michel C. Delfour, and Sanjoy Mitter. 2007. Representation and Control of Infinite Dimensional Systems. Basel: Birkhauser. [Google Scholar]
Cardaliaguet, Pierre, Francois Delarue, Jean-Michel Lasry, and Pierre-Louis Lions. 2015. The master equation and the convergence problem in mean field games. arXiv, arXiv:1509.02505. [Google Scholar]
Carmona, Rene, and Francois Delarue. 2014. The Master Equation for Large Population Equilibriums. arXiv, arXiv:1404.4694. [Google Scholar]
Carmona, Rene, and Francois Delarue. 2018. Probabilistic Theory of Mean Field Games with Applications I & II. Berlin/Heidelberg: Springer International Publishing. [Google Scholar]
Carmona, Rene, Jean-Pierre Fouque, Seyyed Mostafa Mousavi, and Li-Hsien Sun. 2018. Systemic risk and stochastic games with delay. Journal of Optimization and Applications (JOTA), 1–34. [Google Scholar] [CrossRef]
Carmona, Rene, Jean-Pierre Fouque, and Li-Hsien Sun. 2015. Mean field games and systemic risk. Communications in Mathematical Sciences 13: 911–33. [Google Scholar] [CrossRef]
Chassagneux, Jean-Francois, Dan Crisan, and Francois Delarue. 2014. A Probabilistic approach to classical solutions of the master equation for large population equilibria. arXiv, arXiv:1411.3009. [Google Scholar]
Da Prato, Guiseppe, and Jerzy Zabczyk. 2008. Stochastic Equations in Infinite Dimensions. Cambridge: Cambridge University Press. [Google Scholar]
Fabbri, Giorgio, Fausto Gozzi, and Andrzej Swiech. 2017. Stochastic Optimal Control in Infinite Dimension. Berlin/Heidelberg: Springer International Publishing. [Google Scholar]
Gozzi, Fausto, and Carlo Marinelli. 2006. Stochastic optimal control of delay equations arising in advertising models. Stochastic PDEs and Applications VII 245: 133–48. [Google Scholar]
Kolokoltsov, Vassili, Marianna Troeva, and Wei Yang. 2014. On the rate of convergence for the mean-field approximation of controlled diffusions with large number of players. Dynamic Games and Applications 4: 208–30. [Google Scholar] [CrossRef]
Vinter, Richard B., and Raymond H. Kwong. 1981. The infinite time quadratic control problem for linear systems with state and control delays: An evolution equation approach. SIAM Journal on Control and Optimization 19: 139–53. [Google Scholar] [CrossRef]

© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Fouque, J.-P.; Zhang, Z. Mean Field Game with Delay: A Toy Model. Risks 2018, 6, 90. https://doi.org/10.3390/risks6030090

AMA Style

Fouque J-P, Zhang Z. Mean Field Game with Delay: A Toy Model. Risks. 2018; 6(3):90. https://doi.org/10.3390/risks6030090

Chicago/Turabian Style

Fouque, Jean-Pierre, and Zhaoyu Zhang. 2018. "Mean Field Game with Delay: A Toy Model" Risks 6, no. 3: 90. https://doi.org/10.3390/risks6030090

APA Style

Fouque, J.-P., & Zhang, Z. (2018). Mean Field Game with Delay: A Toy Model. Risks, 6(3), 90. https://doi.org/10.3390/risks6030090

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Mean Field Game with Delay: A Toy Model

Abstract

1. Introduction

2. A Differential Game with Delay

2.1. The Model

2.2. Construction of a Nash Equilibrium

3. The Mean Field Game System

4. The Master Equation

4.1. Derivatives

4.2. The Master Equation

4.3. Explicit Solution of the Master Equation

5. Convergence of the Nash System

6. Conclusions

Author Contributions

Funding

Conflicts of Interest

Abbreviations

Appendix A. Adjoint Operator

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI