Quantum Mean-Field Games with the Observations of Counting Type

Vassili N. Kolokoltsov

doi:10.3390/g12010007

¹

Department of Statistics, University of Warwick, Coventry CV4 7AL, UK

²

Higher School of Economics, 109028 Moscow, Russia

Games2021, 12(1), 7;https://doi.org/10.3390/g12010007

Version Notes

Order Reprints

Abstract

Quantum games and mean-field games (MFG) represent two important new branches of game theory. In a recent paper the author developed quantum MFGs merging these two branches. These quantum MFGs were based on the theory of continuous quantum observations and filtering of diffusive type. In the present paper we develop the analogous quantum MFG theory based on continuous quantum observations and filtering of counting type. However, proving existence and uniqueness of the solutions for resulting limiting forward-backward system based on jump-type processes on manifolds seems to be more complicated than for diffusions. In this paper we only prove that if a solution exists, then it gives an

ϵ

-Nash equilibrium for the corresponding N-player quantum game. The existence of solutions is suggested as an interesting open problem.

Keywords:

quantum dynamic law of large numbers; quantum filtering; observation of counting type; Belavkin equation; nonlinear stochastic Schrödinger equation; quantum interacting particles; quantum control; quantum mean field games; mean field games of jump type on manifolds

MSC:

91A15; 81Q05; 81Q93; 82C22; 93E11; 93E20

1. Introduction

In [1], two recently developed branches of game theory, quantum games and mean field games (MFGs), were merged, creating quantum MFGs. MFGs represent a very popular recent development in game theory. It was initiated in [2,3]. For recent developments one can consult monographs [4,5,6,7] and numerous references therein. Quantum games were initiated by Meyer [8], Eisert, Wilkens and Lewenstein [9], and Marinatto and Weber [10], and were dealt with afterwards in numerous publications, see, e.g., surveys [11,12], and Chapter 13 of textbook [13].

Using approaches from [9,10], one can transform any game to a new quantum version. This transformation modifies in a systematic way all properties of the games: equilibria, their stability, etc. For instance, stability of the equilibria of the transformed Replicator Dynamics for two-player two-action games was analyzed in [14]. ESS (evolutionary stable strategies) for the transformed Rock-Paper-Scissors game was analyzed in [15], and for 3 player games in [16]. The transformations of the simplest cooperative games were analyzed in [17]. In [18] the EWL (Eisert, Wilkens and Lewenstein) protocol was applied to the Battle of Sexes, in [19] to the general prisoner’s dilemma and in [20] to the three player quantum Prisoner’s dilemma. Peculiar behavior and remarkable phase transitions were found. The extension of EWL protocol for games with continuous strategy space was suggested in [21].

For application of related quantum concepts (including quantum probability) to cognitive sciences we refer to [22,23] and references therein.

The main accent in all these developments was made on stationary or repeated games, see, e.g., [24,25] for the latter, and [26] for their interpretation in economics. Not only for games, but generally for quantum control the main stream of quantum control research is based on open loop controls, with a rare appearance of a feedback control, see, e.g., [27] and [28].

The present paper initiates the study of the truly dynamic theory with observations of counting type and with the strategies chosen by players in real time. Since direct continuous observations are known to destroy quantum evolutions (so-called quantum Zeno paradox) the necessary new ingredient for quantum dynamic games must be the theory of non-direct observations and the corresponding quantum filtering. This theory is usually performed in two forms: diffusive (or homodyne) type and counting type. In paper [1] the author developed quantum MFGs based on diffusive type filtering. In the present paper quantum MFGs are built for counting type quantum observations and filtering.

As a part of the construction we show that the limiting behavior of mean field interacting controlled quantum particles (or N-player quantum game) can be described by certain classical MFG forward-backward system of jump-type equations on manifolds, the forward part being given by a new kind of nonlinear jump-type stochastic Schrödinger equations. One of the objectives of the paper is to draw the attention of game theorists to this type of games and this type of forward-backward systems, which were not studied before, and no results even on the existence of solutions are available. These objects are fully classical, but represent the limit of quantum games.

The main result states that any solution to this forward-backward system represents an approximate

N^{- 1 / 4}

-Nash equilibrium for the initial N-player dynamic quantum game.

The content of the paper is as follows. In the next section we recall the basic theory of quantum continuous measurement and filtering. In Section 3, as a warm-up, we discuss briefly an example of a two-player quantum dynamic game on a qubit with observation and feedback control of counting type. In Section 4 the new nonlinear equations are introduced for the case of controlled counting detection and the convergence of N-particle observed quantum evolutions to the decoupled system of these equations is obtained, together with explicit rates of convergence. In Section 5 the MFG limits for quantum N-player games are introduced and it is proven that solutions for the limiting MFG equations specify

ϵ

-Nash equilibria for N-player quantum game, with

ϵ

of order

N^{- 1 / 4}

. The limiting MFGs can be also looked at as classical MFGs, though complex-valued and evolving in infinite-dimensional manifolds. In the final section we state the problem of existence of the solutions, even in the simplest case of the control problem on a qubit.

2. Quantum Filtering of Counting Type

The general theory of quantum non-demolition observation, filtering and resulting feedback control was built essentially in papers [29,30,31]. For alternative simplified derivations of the main filtering equations given below (by-passing the heavy theory of quantum filtering) we refer to [32,33,34,35,36] and references therein. For the technical side of organising feedback quantum control in real time, see, e.g., [37,38,39].

We shall describe briefly the main result of this theory.

The non-demolition measurement of quantum systems can be organised in two versions: photon counting and homodyne detection. As was stressed above, here we shall deal only with counting measurements. In this case the main equation of quantum filtering takes the form

d γ_{t} = - i [H, γ_{t}] d t + \sum_{j} (- \frac{1}{2} {L_{j}^{*} L_{j}, γ_{t}} + tr (L_{j} γ_{t} L_{j}^{*}) γ_{t}) d t + \sum_{j} (\frac{L_{j} γ_{t} L_{j}^{*}}{tr (L_{j} γ_{t} L_{j}^{*})} - γ_{t}) d N_{t}^{j},

(1)

in terms of the density matrices

γ_{t}

, where H is the Hamiltonian of the free (not observed) motion of a quantum system, the operators

{L_{j}}

define the coupling of the system with the measurement devices, and the (counting) observed Poisson processes

N_{t}^{j}

are independent and have the position dependent intensities

tr (L_{j}^{*} L_{j} γ_{t})

, so that the compensated processes

M_{t}^{j} = N_{t}^{j} - \int_{0}^{t} tr (L_{j}^{*} L_{j} γ_{s}) d s

are martingales. By

{A, B}

we denote the anticommutator of two operators:

{A, B} = A B + B A

. In terms of the compensated processes

M_{t}^{j}

Equation (1) rewrites as

d γ_{t} = - i [H, γ_{t}] d t + \sum_{j} (L_{j} γ L_{j}^{*} - \frac{1}{2} {L_{j}^{*} L_{j}, γ_{t}}) d t + \sum_{j} (\frac{L_{j} γ_{t} L_{j}^{*}}{tr (L_{j} γ_{t} L_{j}^{*})} - γ_{t}) d M_{t}^{j} .

(2)

In this paper we shall deal only with the simplest case when the operators L are unitary. In this case

d M_{t}^{j} = d N_{t}^{j} - d t

and Equations (1) and (2) become linear and take the form

d γ_{t} = - i [H, γ_{t}] d t + \sum_{j} (L_{j} γ_{t} L_{j}^{*} - γ_{t}) d N_{t}^{j} = - i [H, γ_{t}] d t + \sum_{j} (L_{j} γ_{t} L_{j}^{*} - γ_{t}) (d M_{t}^{j} + d t) .

(3)

This dynamics preserves the set of pure states. Namely, if

ϕ

satisfies the equation

d ϕ_{t} = - i H ϕ_{t} d t + \sum_{j} (L_{j} - 1) ϕ_{t} d N_{t}^{j} = - i H ϕ_{t} d t + \sum_{j} (L_{j} - 1) ϕ_{t} (d M_{t}^{j} + d t),

(4)

then

γ_{t} = ϕ_{t} \otimes {\bar{ϕ}}_{t}

satisfies Equation (3).

The theory of quantum filtering reduces the analysis of quantum dynamic control and games to the controlled version of evolutions (1). Two types of control can be naturally considered (see [40]). The players can control the Hamiltonian H, say, by applying appropriate electric or magnetic fields to the atom, or the coupling operators

L_{j}

. Thus (3) extends to the equation

\begin{matrix} d γ_{t} & = - i [H + u \hat{H}, γ_{t}] d t + \sum_{j} (L_{j} (v) γ_{t} L_{j}^{*} (v) - γ_{t}) d N_{t}^{j} \\ = - i [H + u \hat{H}, γ_{t}] d t + \sum_{j} (L_{j} (v) γ_{t} L_{j}^{*} (v) - γ_{t}) (d M_{t}^{j} + d t), \end{matrix}

(5)

with some self-adjoint

\hat{H}

, control u and a family of unitary operators

L (v)

depending on a control parameter v.

It is seen from Equation (5) that its evolution preserves traces of matrices. One can also show that these evolutions preserve positivity of matrices

γ

(see, e.g., [36]).

3. Example of a Quantum Dynamic Two-Player Game

Let us stress again that the whole physics of quantum dynamic games with a feedback control of a finite number of players is incorporated into the stochastic filtering Equation (1), so that the quantum dynamic games are reduced to the stochastic games with jumps governed by this equation with operators H and L that may depend on control. As a warm-up before the mean-field setting let us consider the simple example of a zero-sum quantum dynamic two-player game on a qubit, where a complete analytic solution can be found.

Working with a qubit means that the Hilbert space of the quantum system is two dimensional. Let L be fixed and the Hamiltonian be the sum of two parts, controlled by the first and the second player respectively. Stochastic filtering Equation (1) simplifies to the equation (omitting index t)

d ψ = - i H ψ d t + (L ψ - ψ) d N_{t}, H = u H_{1} + v H_{2},

(6)

u, v

being control parameters of players I and II. Assume

u \in [- U, U]

,

v \in [- V, V]

with some positive

U, V

. Moreover,

ψ

has only two coordinates:

ψ = (ψ_{0}, ψ_{1})

. Using Ito’s rule

d N_{t} d N_{t} = d N_{t}

we find the equation for

ψ_{0}^{- 1}

:

d ψ_{0}^{- 1} = \frac{i}{ψ_{0}^{2}} {(H ψ)}_{0} d t - \frac{{(L ψ)}_{0} - ψ_{0}}{ψ_{0} {(L ψ)}_{0}} d N_{t} .

Consequently, again by Ito’s rule, we find the equation for

w = ψ_{1} / ψ_{0}

:

d w = i [w {(H W)}_{0} - {(H W)}_{1}] d t + [{(L W)}_{1} - w {(L W)}_{0}] d N_{t},

(7)

where

W = (w_{0}, w_{1}) = (1, w)

.

Let us choose the simplest possible L:

L = σ_{3}

—the third Pauli matrix (diagonal with diagonal elements 1 and

- 1

). Then Equation (7) simplifies to

d w = i [w {(\hat{H} W)}_{0} - {(\hat{H} W)}_{1}] d t - 2 w d N_{t} .

(8)

The payoffs in quantum setting are given by certain operators, that is, they have the form

P (t, W; u (.), v (.)) = \int_{t}^{T} (ψ_{s}, J ψ_{s}) d s + (ψ_{T}, F ψ_{T}),

(9)

where J and F are some self-adjoint operators. They may depend on the control parameters, but we shall look for the case when they do not. In terms of w this payoff rewrites as

P (t, W; u (.), v (.)) = \int_{t}^{T} \frac{(W_{s}, J W_{s})}{1 + | w_{s} |^{2}} d s + \frac{(W_{T}, F W_{T})}{1 + | w_{T} |^{2}} .

(10)

Thus the zero-sum quantum dynamic two-player game (with a feedback control) with a fixed horizon T in this setting is the stochastic dynamic game with the state space

C

, with the evolution described by the jump-type stochastic Equation (8) and payoff (10). The aim of the first player is to maximise the expectation of (10) using an appropriate feedback strategies

u (.) = u (t, W_{t})

. The second player tries to minimise it using an appropriate feedback strategies

v (.) = v (t, W_{t})

.

The remarkable feature of this game is that the possible jumps are only of type

w \to - w

. Consequently, in the coordinates

r = \sqrt{x^{2} + y^{2}}

and

ξ = y / x

(where

w = x + i y

), the dynamics is deterministic. Therefore, if the operators J and F of current and terminal payoffs are invariant under the transformation

w \to - w

, the game can be reduced to a deterministic differential game. This game is still very complicated.

Let us consider now the most trivial example of commuting operators

H_{1}

and

H_{2}

controlled by two players. To be concrete, let us chose

H_{1}

diagonal with diagonal elements 1 and 0, and

H_{2}

diagonal with elements 0 and 1. Then Equation (8) becomes linear in w:

d w = i [(u - v) w] d t - 2 w d N_{t},

(11)

and then the modulus

ρ = {| w |}^{2}

becomes the integral of motion:

d (| w |^{2}) = 0

. Choosing

ρ = 1

for definiteness we get the equation for the angle

ϕ

on the circle

ρ = 1

:

d cos ϕ = (u - v) sin ϕ d t - 2 cos ϕ d N_{t} .

(12)

If J and F are invariant under the transformation

w \to - w

, we can identify points when

cos ϕ

differ only by a sign (so that possible jumps

cos ϕ \to - cos ϕ

become irrelevant), and the evolution on a circle, given by the set

ϕ \in [- π / 2, π / 2]

with identified endpoints, becomes deterministic:

\frac{d}{d t} cos ϕ = (u - v) sin ϕ ⟺ \dot{ϕ} = - (u - v),

(13)

that is a simple rotation. Choosing

F = 0

and the simplest nontrivial J with zero diagonal elements and real numbers j as non-diagonal terms. The payoff (10) for

ρ = | w | = 1

simplifies to

P (t, W; u (.), v (.)) = j \int_{t}^{T} cos ϕ_{s} d s .

(14)

The HJB-Isaacs equation takes the form

\frac{\partial S}{\partial t} + max_{u} (- u \frac{\partial S}{\partial ϕ}) + min_{v} (v \frac{\partial S}{\partial ϕ}) + j cos ϕ = 0 .

Assuming for definiteness that

U > V

, so that the first player has an edge in this game, the equation rewrites as

\frac{\partial S}{\partial t} + (U - V) |\frac{\partial S}{\partial ϕ}| + j cos ϕ = 0 .

(15)

This is HJB of a pure maximisation problem. It can be solved via the method of viscosity solutions. For instance, let us find a stationary solution describing the average winning of the first player per unit of time in a long lasting game. For this one searches for a solution to (15) in the form

S = λ (T - t) + S_{0} (ϕ)

with a constant

λ

. Then

S_{0} (ϕ)

(obviously defined up to a constant multiplier, so that we can set

S_{0} (0) = 0

) satisfies the equation

- λ + (U - V) |\frac{\partial S_{0}}{\partial ϕ}| + j cos ϕ = 0 .

(16)

To guess the right solution one can derive from the meaning of this equation that

S_{0}

must be an even function of

ϕ

with maximum at

ϕ = 0

, decreasing on

[0, π / 2]

. Hence

(\partial S_{0} / \partial ϕ) (0) = 0

and thus

λ = j

and Equation (16) on

[0, π / 2]

becomes

- (U - V) \frac{\partial S_{0}}{\partial ϕ} = j (1 - cos ϕ),

(17)

so that

S_{0} (ϕ) = - \frac{j}{U - V} (ϕ - sin ϕ) .

This function (considered as periodically continued with period

π

to the whole line) is smooth outside points

(2 k + 1) π / 2

, where it has convex kinks. Hence this is really the viscosity solution to (16) confirming that our educated guess above was correct and that

λ = j

is the income per unit of time to the first player for a long lasting game.

Another example for the case of quantum control (without games) was given in [28].

4. Controlled Limiting Stochastic Equation

Let X be a Borel space with a fixed Borel measure that we denote

d x

.

For a linear operator O in

L^{2} (X)

we shall denote by

O_{j}

the operator in

L^{2} (X^{N})

that acts on functions

f (x_{1}, \dots, x_{N})

as O acting on the variable

x_{j}

. For a linear operator A in

L^{2} (X^{2})

we shall denote by

A_{i j}

the operator in

L^{2} (X^{N})

that acts as A on the variables

x_{i}, x_{j}

.

Let H and

\hat{H}

be two self-adjoint operators in

L^{2} (X)

and A a self-adjoint integral operator in

L^{2} (X^{2})

with the kernel

A (x, y; x^{'}, y^{'})

that acts on the functions of two variables as

A ψ (x, y) = \int_{X^{2}} A (x, y; x^{'}, y^{'}) ψ (x^{'}, y^{'}) d x^{'} d y^{'} .

It is assumed that A is symmetric in the sense that it takes symmetric functions

ψ (x, y)

(symmetric with respect to permutation of x and y) to symmetric functions.

Let us consider the quantum evolution of N particles driven by the interaction Hamiltonian

H_{u} (N) f (x_{1}, \dots, x_{N}) = \sum_{j = 1}^{N} (H_{j} + u_{j} (t, Γ_{N}^{(j)}) {\hat{H}}_{j}) f (x_{1}, \dots, x_{N}) + \frac{1}{N} \sum_{i < j \leq N} A_{i j} f (x_{1}, \dots, x_{N}) .

(18)

Here continuous functions

u_{j} (t, γ)

describe the controls of jth agent, who is supposed to have access to the jth subsystem, namely to the partial trace

Γ_{N, t}^{(j)}

(with respect to all other variables but j) of the state

Γ_{N, t}

. All

u_{j}

are taken from a bounded interval

[- U, U]

.

In order to be able to carry out a feedback control we assume further that this quantum system is observed via coupling with the collection of (possibly controlled) identical one-particle unitary families

L (v)

. That is, we consider the filtering Equation (3) of the type

d Ψ_{N, t} = - i H_{u} (N) Ψ_{N, t} d t + \sum_{j = 1}^{N} (L_{j} (v_{j} (t, Γ_{N}^{(j)})) - 1) Ψ_{N, t} d N_{t}^{j} .

(19)

The corresponding density matrix

Γ_{N, t} = Ψ_{N, t} \otimes \bar{Ψ_{N, t}}

satisfies the equation of type (5):

d Γ_{N, t} = - i [H_{u} (N), Γ_{N, t}] d t + \sum_{j} (L_{j} (v_{j} (t, Γ_{N}^{(j)})) Γ_{N, t} L_{j}^{*} (v_{j} (t, Γ_{N}^{(j)})) - Γ_{N, t}) d N_{t}^{j} .

(20)

The main ingredient in the construction of quantum MFG theory is the quantum law of large numbers that states that as

N \to \infty

, the limiting evolution of each particle (precise conditions are given in the theorem below) is described by the nonlinear stochastic equation

d ψ_{j, t} = - i [H + u (t, γ_{j, t}) \hat{H} + A^{{\bar{η}}_{t}}] ψ_{j, t} d t + (L (v_{j} (t, γ_{j, t})) ψ_{j, t} - ψ_{j, t}) d N_{t}^{j},

(21)

where

A^{{\bar{η}}_{t}}

is the integral operator in

L^{2} (X)

with the integral kernel

A^{{\bar{η}}_{t}} (x; y) = \int_{X^{2}} A (x, y; x^{'}, y^{'}) \bar{η_{t} (y, y^{'})} d y d y^{'}

and

η_{t} (y, z) = E (ψ_{j, t} (y) {\bar{ψ}}_{j, t} (z)) .

The equation for the corresponding density matrix

γ_{j, t} = ψ_{j, t} \otimes {\bar{ψ}}_{j, t}

writes down as

\begin{matrix} d γ_{j, t} & = - i [H + u (t, γ_{j, t}) \hat{H}, γ_{j, t}] d t - i [A^{{\bar{η}}_{t}}, γ_{j, t}] d t \\ + (L (v_{j} (t, γ_{j, t})) γ_{j, t} L^{*} - γ_{j, t}) d N_{t}^{j}, \\ η_{t} (y, z) & = E (ψ_{j, t} (y) {\bar{ψ}}_{j, t} (z)) = E γ_{j, t} (y, z) . \end{matrix}

(22)

For the analysis of the limiting behavior we use an approach from [41,42], where the main measures of the deviation of the solutions

Ψ_{N, t}

to N-particle systems from the product of the solutions

ψ_{t}

to the Hartree equations are the following positive numbers from the interval

[0, 1]

:

α_{N} (t) = 1 - (ψ_{t}, Γ_{N, t} ψ_{t}) .

In the present stochastic case, these quantities depend not just on the number of particles in the product, but on the concrete choice of these particles. The proper stochastic analog of the quantity

α_{N} (t)

is the collection of random variables

α_{N, j} (t) = 1 - (ψ_{j, t}, Γ_{N, t} ψ_{j, t}) = 1 - tr (γ_{j, t} Γ_{N, t}) = 1 - tr (γ_{j, t} Γ_{N, t}^{(j)}),

(23)

where the latter equation holds by the definition of the partial trace. Here

γ_{j, t}

is identified with the operator in

L^{2} (X^{N})

acting on the jth variable and

Γ_{N, t}^{(j)}

denotes the partial trace of

Γ_{N, t}

with respect to all variables except for the jth.

Since evolutions (20) preserve the set of operators with the unit trace, (23) rewrites as

α_{N, j} (t) = tr ((1 - γ_{j, t}) Γ_{N, t}) = tr ((1 - γ_{j, t}) Γ_{N, t}^{(j)}) .

(24)

Assuming that all controls

u_{j}

and

v_{j}

are given by identical feedback functions

u (t, γ)

,

v (t, γ)

and that the initial conditions for Equation (19) is the tensor product of i.i.d. random vectors, the expectations

E α_{N} (t) = E α_{N, j} (t)

are well defined (they do not depend on a particular choice of particles).

Expressions

α_{N, j}

can be linked with the traces by the following inequalities, due to Knowles and Pickl:

α_{N, j} (t) \leq tr | Γ_{N, t}^{(j)} - γ_{j, t} | \leq 2 \sqrt{2 α_{N, j} (t)},

(25)

see Lemma 2.3 from [42].

Theorem 1.

Let

H, \hat{H}

be self-adjoint operators in

L^{2} (X)

, with

\hat{H}

bounded, and

L (v)

be a family of unitary operators depending Lipschitz continuously on v:

∥ L (v_{1}) - L (v_{2}) ∥ \leq ϰ_{L} | v_{1} - v_{2} | .

(26)

Let A be a symmetric self-adjoint integral operator A in

L^{2} (X^{2})

with a Hilbert-Schmidt kernel, that is a kernel

A (x, y; x^{'}, y^{'})

such that

{∥ A ∥}_{H S}^{2} = \int_{X^{4}} {| A (x, y; x^{'}, y^{'}) |}^{2} d x d y d x^{'} d y^{'} < \infty,

(27)

A (x, y; x^{'} y^{'}) = A (y, x; y^{'}, x^{'}), A (x, y; x^{'}, y^{'}) = \bar{A (x^{'}, y^{'}; x, y)} .

(28)

Let the functions

u (t, γ)

and

v (t, γ)

with values in bounded intervals

[- U, U]

and

[- V, V]

respectively be Lipschitz in the sense that

| u (t, γ) - u (t, \tilde{γ}) | \leq ϰ tr | γ - \tilde{γ} |, | v (t, γ) - v (t, \tilde{γ}) | \leq ϰ tr | γ - \tilde{γ} | .

(29)

Let

ψ_{j, t}

be solutions to Equation (21) with i.i.d. initial conditions

ψ_{j, 0}

,

∥ ψ_{j, 0} ∥ = 1

. Let

Ψ_{N, t}

be the solution to the N-particle Equation (19) with

H_{u} (N)

given by (18) and with the initial condition

Ψ_{N, 0} (x_{1}, \dots, x_{N}) = \prod ψ_{j, 0} (x_{j}) .

Then

E α_{N} (t) \leq {(exp {(7 ∥ A ∥}_{H S} + 12 ϰ (∥ \hat{H} ∥ + ϰ_{L} + ϰ_{L}^{2} ϰ)) t} - 1) \frac{1}{\sqrt{N}} .

(30)

Proof.

By Ito’s product rule for counting processes,

d α_{N, j} (t) = - tr (d Γ_{N, t} γ_{j, t}) - tr (Γ_{N, t} d γ_{j, t}) - tr (d Γ_{N, t} d γ_{j, t}),

(31)

with the Ito product rule being

d N_{t}^{j} d N_{t}^{i} = δ_{i}^{j} d N_{t}^{j}

.

Let us denote by I and II the parts of the differential

d α_{N, j} (t)

that contain

L_{j}

and, respectively, not.

Starting with II we obtain, denoting

A_{j}^{{\bar{η}}_{t}}

the operator

A^{{\bar{η}}_{t}}

acting on the jth variable, that

\begin{matrix} I I & = i tr ([H_{j} + u_{j} (t, γ_{j, t}) {\hat{H}}_{j} + A_{j}^{{\bar{η}}_{t}}, γ_{j, t}] Γ_{N, t}) d t + i tr (γ_{j, t} [H (N), Γ_{N, t}]) d t \\ = i tr ([H_{j} + u_{j} (t, γ_{j, t}) {\hat{H}}_{j} + A_{j}^{{\bar{η}}_{t}}, γ_{j, t}] Γ_{N, t}) d t + i tr ([γ_{j, t}, H (N)] Γ_{N, t}) d t \\ = - i tr ([H_{j} + u_{j} (t, γ_{j, t}) {\hat{H}}_{j} + A_{j}^{{\bar{η}}_{t}}, q_{j, t}] Γ_{N, t}) d t + i tr ([H (N), q_{j, t}] Γ_{N, t}) d t \\ = i tr ([H (N) - H_{j} - u_{j} (t, γ_{j, t}) {\hat{H}}_{j} - A_{j}^{{\bar{η}}_{t}}, q_{j, t}] Γ_{N, t}) d t = I I_{1} + I I_{2}, \end{matrix}

with

I I_{1} = i tr ([\frac{1}{N} \sum_{m \neq j} A_{m j} - A_{j}^{{\bar{η}}_{t}}, q_{j, t}] Γ_{N, t}) d t

and

I I_{2} = i tr ([(u_{j} (t, Γ_{N, t}^{(j)}) - u_{j} (t, γ_{j, t})) {\hat{H}}_{j}, q_{j, t}] Γ_{N, t}) d t = B_{j, t} d t,

with

| B_{j, t} | \leq 2 | u_{j} (t, Γ_{N, t}^{(j)}) - u_{j} (t, γ_{j, t}) | ∥ \hat{H} ∥ ∥ q_{j, t} Ψ_{N, t} ∥

\leq 2 ϰ tr | Γ_{N, t}^{(j)} - γ_{j, t} | ∥ \hat{H} ∥ \sqrt{α_{N, j} (t)}

\leq 4 ϰ \sqrt{2 α_{N, j} (t)} ∥ \hat{H} ∥ \sqrt{α_{N, j} (t)} = 4 \sqrt{2} ϰ ∥ \hat{H} ∥ α_{N, j} (t),

where for the last inequality we used (25).

The term

I I_{1}

was dealt with in [1] (proof of Theorems 3.1) yielding the estimate

E | I I_{1} {| \leq 7 ∥ A ∥}_{H S} (E α_{N} (t) + \frac{1}{\sqrt{N}}) .

(32)

Let us turn to I. We have

\begin{matrix} I & = - tr [(L (v_{j} (t, γ_{j, t})) γ_{j, t} L^{*} (v_{j} (t, γ_{j, t})) - γ_{j, t}) Γ_{N, t}] d N_{t}^{j}, \\ - tr \sum_{k} [(L_{k} (v_{k} (t, Γ_{N, t}^{(k)})) Γ_{N, t} L_{k}^{*} (v_{k} (t, Γ_{N, t}^{(k)})) - Γ_{N, t}) γ_{j, t}] d N_{t}^{k} \\ - tr [(L_{j} (v_{j} (t, Γ_{N, t}^{(j)})) Γ_{N, t} L_{j}^{*} (v_{j} (t, Γ_{N, t}^{(j)})) - Γ_{N, t}) (L_{j} (v_{j} (t, γ_{j, t})) γ_{j, t} L_{j}^{*} (v_{j} (t, γ_{j, t})) - γ_{j, t})] d N_{t}^{j} . \end{matrix}

Since

γ_{j, t}

and

L_{k}

with

k \neq j

commute, it follows that all terms with

k \neq j

cancel. Taking into account other cancelation (arising from the unitarity of

L_{j}

) we obtain

I = tr [Γ_{N, t} γ_{j, t} - L_{j} (v_{j} (t, Γ_{N, t}^{(j)})) Γ_{N, t} L_{j}^{*} (v_{j} (t, Γ_{N, t}^{(j)})) L_{j} (v_{j} (t, γ_{j, t})) γ_{j, t} L_{j}^{*} (v_{j} (t, γ_{j, t}))] d N_{t}^{j} .

If

L_{j}

would be constant, this expression would vanish. In the present controlled version, some work is required. First of all, writing

γ_{j, t} = 1 - q_{j, t}

we obtain

I = C_{j, t} d N_{t}^{j} = C_{j, t} (d M_{t}^{j} + d t)

with

C_{j, t} = tr [Γ_{N, t} q_{j, t} - L_{j} (v_{j} (t, Γ_{N, t}^{(j)})) Γ_{N, t} L_{j}^{*} (v_{j} (t, Γ_{N, t}^{(j)})) L_{j} (v_{j} (t, γ_{j, t})) q_{j, t} L_{j}^{*} (v_{j} (t, γ_{j, t}))] .

To make the calculations more transparent, let us omit indices at

v, γ, q, Γ

. Thus

\begin{matrix} C_{j, t} & = tr [Γ q - L (v (t, Γ_{N, t}^{(j)})) Γ L^{*} (v (t, Γ_{N, t}^{(j)})) L (v (t, γ)) q L^{*} (v (t, γ))] \\ = tr [L (v (t, γ)) Γ L^{*} (v (t, γ)) L (v (t, γ)) q L^{*} (v (t, γ))] \\ - tr [L (v (t, Γ_{N, t}^{(j)})) Γ L^{*} (v (t, Γ_{N, t}^{(j)})) L (v (t, γ)) q L^{*} (v (t, γ))] \\ = tr [(L (v (t, γ)) Γ L^{*} (v (t, γ)) - L (v (t, Γ_{N, t}^{(j)})) Γ L^{*} (v (t, Γ_{N, t}^{(j)}))) L (v (t, γ)) q L^{*} (v (t, γ))] \\ = C_{j, t}^{1} + C_{j, t}^{2}, \end{matrix}

where

\begin{matrix} C_{j, t}^{1} & = tr [(L (v (t, γ)) - L (v (t, Γ_{N, t}^{(j)}))) Γ L^{*} (v (t, γ)) L (v (t, γ)) q L^{*} (v (t, γ))] \\ = tr [(L (v (t, γ)) - L (v (t, Γ_{N, t}^{(j)}))) Γ q L^{*} (v (t, γ))] \\ = tr [Γ q L^{*} (v (t, γ)) (L (v (t, γ)) - L (v (t, Γ_{N, t}^{(j)})))], \\ C_{j, t}^{2} & = tr [L (v (t, Γ_{N, t}^{(j)}) Γ (L^{*} (v (t, γ)) - L^{*} (v (t, Γ_{N, t}^{(j)})) L (v (t, γ)) q L^{*} (v (t, γ))] . \end{matrix}

We can now estimate

C_{j, t}^{1}

as

I I_{2}

above yielding

\begin{matrix} | C_{j, t}^{1} | & \leq | (Ψ_{N, t} q, L^{*} (v (t, γ)) (L (v (t, γ)) - L (v (t, Γ_{N, t}^{(j)}))) Ψ_{N, t}) | \\ \leq ∥ q Ψ_{N, t} ∥ ∥ L (v (t, γ)) - L (v (t, Γ_{N, t}^{(j)})) ∥ \leq \sqrt{α_{N, j} (t)} ϰ_{L} ϰ tr | γ - Γ_{N, t}^{(j)} | \\ \leq 2 \sqrt{2} ϰ_{L} ϰ α_{N, j} (t) . \end{matrix}

With

C_{j, t}^{2}

yet another add-and-subtract manipulation is required. Namely,

C_{j, t}^{2} = tr [(L^{*} (v (t, γ)) - L^{*} (v (t, Γ_{N, t}^{(j)})) L (v (t, γ)) q L^{*} (v (t, γ)) L (v (t, Γ_{N, t}^{(j)}) Γ] = C_{j, t}^{21} + C_{j, t}^{22}

with

\begin{matrix} C_{j, t}^{21} & = tr [(L^{*} (v (t, γ)) - L^{*} (v (t, Γ_{N, t}^{(j)})) L (v (t, γ)) q L^{*} (v (t, γ)) L (v (t, γ) Γ] \\ = tr [(L^{*} (v (t, γ)) - L^{*} (v (t, Γ_{N, t}^{(j)})) L (v (t, γ)) q Γ], \\ C_{j, t}^{22} & = tr [(L^{*} (v (t, γ)) - L^{*} (v (t, Γ_{N, t}^{(j)})) L (v (t, γ)) q L^{*} (v (t, γ)) (L (v (t, Γ_{N, t}^{(j)}) - L (v (t, γ))) Γ] . \end{matrix}

The first term is estimated as above yielding

| C_{j, t}^{21} | \leq 2 \sqrt{2} ϰ_{L} ϰ α_{N, j} (t) .

And the second one is estimated as

| C_{j, t}^{22} | \leq ϰ_{L}^{2} {| v (t, Γ_{N, t}^{(j)}) - v (t, γ) |}^{2} \leq 8 ϰ_{L}^{2} ϰ^{2} α_{N, j} (t) .

Thus

d α_{N, j} (t) = I I_{1} + (B_{j, t} + C_{j, t}) d t + C_{j, t} d M_{t}^{j} .

Therefore, since

M_{t}^{j}

is a martingale and its differential does not contribute to the expectation, it follows that

d E α_{N} (t) \leq {7 ∥ A ∥}_{H S} (E α_{N} (t) + \frac{1}{\sqrt{N}}) d t + (4 \sqrt{2} ϰ ∥ \hat{H} ∥ + 8 \sqrt{2} ϰ_{L} ϰ + 8 ϰ_{L}^{2} ϰ^{2}) E α_{N} (t) d t .

Applying Gronwall’s inequality yields (30). □

5. Quantum MFG

Let us consider the quantum dynamic game of N players, where the dynamics of the density matrix

Γ_{N, t}

is given by the controlled dynamics of type (20):

\begin{matrix} d Γ_{N, t} & = - i \sum_{j} [H_{j} + u_{j} (t, Γ_{N, t}^{(j)}) {\hat{H}}_{j}, Γ_{N, t}] - \frac{i}{N} \sum_{l < j \leq N} [A_{l j}, Γ_{N, t}] \\ + \sum_{j} (L_{j} (v_{j} (t, Γ_{N, t}^{(j)})) Γ_{N, t} L_{j}^{*} (v_{j} (t, Γ_{N, t}^{(j)})) - Γ_{N, t}) d N_{t}^{j} . \end{matrix}

(33)

Assume as above that controls

u_{j}

and

v_{j}

of each jth player can be chosen from some bounded closed intervals

[- U, U]

and

[- V, V]

respectively, that the initial matrix is the product of iid states,

Γ_{N, 0} (x_{1}, \dots, x_{n}; y_{1}, \dots, y_{N}) = \prod_{j = 1}^{N} ψ_{j} (x_{j}) \bar{ψ_{j} (y_{j})},

and that the payoff of each player on the interval

[t, T]

is given by the expression

P_{j} (t, W; u (.)) = \int_{t}^{T} (tr (J_{j} Γ_{N, s}) - \frac{c}{2} u_{j}^{2} (s)) d s + tr (F_{j} Γ_{N, T}),

(34)

where J and F are some operators in

L^{2} (X)

expressing the current and the terminal costs of the agent,

J_{j}

and

F_{j}

denote their actions on the jth variable, constants

c \geq 0

measure the cost of applying control u.

Remark 1.

(i) We choose the simplest payoff function. Of course more general dependence on u, v is possible. As long as payoff is convex in u and v the results below are still valid. (ii) Also everything remains in force if only H or only L is controlled, that is either u or v is not present in all formulas.

Notice that by the property of the partial trace, the payoff (34) rewrites as

P_{j} (t, W; u (.)) = \int_{t}^{T} (tr (J_{j} Γ_{N, s}^{(j)}) - \frac{c}{2} u_{j}^{2} (s)) d s + tr (F_{j} Γ_{N, T}^{(j)}),

(35)

so that it really depends explicitly only on the individual partial traces

Γ_{N, t}^{(j)}

, which can be considered as quantum analogs of the positions of classical particles.

Let us stress again that, after all equations arising from physics are written, our quantum dynamic N-player game can be formulated in fully classical terms. Namely, the goal of each jth player is to maximise the expectation of payoff (35) under the evolution (33) depending on all controls

u = (u_{j})

. The information available to the jth player is the ‘position’ of jth player, which is the partial trace

Γ_{N, t}^{(j)}

, and thus the actions of jth player are chosen among the feedback strategies

u_{j}

that are measurable functions

u_{j} (t, Γ_{N, t}^{(j)})

. An additional technical assumption that we are using in the analysis below is that the class of feedback strategies is reduced to Lipschitz continuous functions of partial traces. Therefore both the information setting and technical assumptions are slightly different from the simpler setting of two-player game of Section 3, where players were assumed to define their strategies on the basis of the whole state (not a partial trace). The restriction to partial traces is necessary to uncouple the dynamics in the limit of

N \to \infty

.

The limiting evolution of each player can be expected to be described by the equations

\begin{matrix} d γ_{j, t} & = - i [H + u_{j} (t, γ_{j, t}) \hat{H}, γ_{j, t}] d t - i [A^{\bar{η_{t}}}, γ_{j, t}] d t \\ + (L (v_{j} (t, γ_{j, t})) γ_{j, t} L^{*} (v_{j} (t, γ_{j, t})) - γ_{j, t}) d N_{t}^{j}, \end{matrix}

(36)

with

η_{t} (x, y) = lim_{N \to \infty} \frac{1}{N} \sum_{j = 1}^{N} γ_{j, t} (x, y),

and with payoffs given by

P_{j} (t, W; u (.)) = \int_{t}^{T} (tr (J γ_{j, s}) - \frac{c}{2} u_{j}^{2} (s)) d s + tr (F γ_{j, T}) .

(37)

For pure states

γ_{j, t} = ψ_{j, t} \otimes {\bar{ψ}}_{j, t}

this payoff turns to

P_{j} (t, W; u (.)) = \int_{t}^{T} ((ψ_{j, t}, J ψ_{j, t}) - \frac{c}{2} u_{j}^{2} (s)) d s + (ψ_{j, T}, F ψ_{j, T}) .

(38)

Let us say that the pair of functions

{(u, v)}_{t}^{M F G} (γ) = {(u, v)}^{M F G} (t, γ)

with

t \in [0, T]

and

γ

from the set of density matrices in

L^{2} (X)

, and

η_{t}^{M F G} (x, y)

with

x, y \in X

,

t \in [0, T]

, solve the limiting MFG problem if (i)

{(u, v)}_{t} (γ)

is an optimal feedback strategy for the stochastic control problem (36), (37) under the fixed function

η_{t} = η_{t}^{M F G}

and (ii)

η_{t}^{M F G}

arises from the solution of (36) under fixed

{(u, v)}_{t} = {(u, v)}_{t}^{M F G}

.

Theorem 2.

Let the conditions on

H, L, A

from Theorem 1 hold. Assume that the pair

{(u, v)}_{t}^{M F G} (γ)

and

η_{t}^{M F G} (x, y)

solves the limiting MFG problem and moreover

u_{t}^{M F G}

is Lipschitz in the sense of inequality (29). Then the strategies

{(u, v)}_{j} (t, Γ_{N_{t}}) = {(u, v)}_{t}^{M F G} (Γ_{N, t}^{(j)}),

form a symmetric ϵ-Nash equilibrium for the N-agent quantum game described by (33) and (34), where strategies of all players are sought among measurable controls

(u, v) (t, γ)

that depend Lipschitz in γ in the sense of inequality (29), with

ϵ = C (T) N^{- 1 / 4}

,

C (T)

depending on

{∥ A ∥}_{H S}

,

∥ \hat{H} ∥

, ϰ,

ϰ_{L}

.

Proof.

Assume that all players, except for one of them, say the first one, are playing according to the MFG strategy

{(u, v)}^{M F G} (t, Γ_{N, t}^{(j)})

,

j > 1

, and the first player is following some other strategy

(\tilde{u}, \tilde{v}) (t, Γ_{N, t}^{(1)})

. By the law of large numbers (which is not affected by a single deviation), all

η_{t}^{j}

are equal and are given by the formula

η_{t} = E γ_{j, t}

for all

j > 1

. Moreover,

E α_{N, j} (t) = E α_{N} (t)

are the same for all

j > 1

.

Following the proof of Theorem 1 we obtain

{\dot{α}}_{N, j} (t) = I + I I_{1} + I I_{2}

(39)

with the same

I, I I_{1}, I I_{2}

, as in the proof of Theorem 1, though

(u, v)

being

{(u, v)}^{M F G} (t, Γ_{N, t}^{(j)})

,

j > 1

, and

(\tilde{u}, \tilde{v}) (t, Γ_{N, t}^{(1)})

for

j = 1

. Looking first at

j > 1

we note that up to an additive correction of magnitude not exceeding

{4 ∥ A ∥}_{H S} / N

expression

I I_{1}

can be substituted by the expression

i tr ([\frac{1}{N} \sum_{m \neq j, 1} A_{m j} - A_{j}^{{\bar{η}}_{t}}, q_{j, t}] γ_{N, t}),

which is then dealt with exactly as in the proof of Theorem 1 (with

N - 1

instead of N) yielding the same estimate (30) (with a corrected multiplier) for

E α_{N} (t) = E α_{N, j} (t)

,

j > 1

, that is

E α_{N} (t) \leq {(exp {(7 ∥ A ∥}_{H S} + 12 ϰ (∥ \hat{H} ∥ + ϰ_{L} + ϰ_{L}^{2} ϰ)) t} - 1) \frac{1}{\sqrt{N}} {(1 + 4 ∥ A ∥}_{H S}) .

(40)

The same estimate is obtained for

E α_{N, 1} (t)

(even without the correcting term

{4 ∥ A ∥}_{H S}

) yielding

E α_{N, j} (t) \leq C (T) N^{- 1 / 2}

for all j and a constant

C (T)

depending on

{∥ A ∥}_{H S}, ϰ, ϰ_{L}, ∥ \hat{H} ∥

.

We can now compare the expected payoffs (35) received by the players in the N-player quantum game with the expected payoff (37) received in the limiting game. For each jth player the difference is bounded by

E \int_{t}^{T} | tr (J (Γ_{N, s}^{(j)} - γ_{j, s})) | d s + E | tr (F (Γ_{N, T}^{(j)} - γ_{j, T})) | .

Since,

| tr (J (Γ_{N, s}^{(j)} - γ_{j, s})) | \leq ∥ J ∥ tr | Γ_{N, s}^{(j)} - γ_{j, s} |,

and by (25),

tr | Γ_{N, s}^{(j)} - γ_{j, s} | \leq 2 \sqrt{2 α_{N, j} (s)},

it follows that the expectation of the difference of the payoffs is bounded by

2 \sqrt{2} (∥ J ∥ T + ∥ F ∥) sup_{t} E \sqrt{α_{N, j} (t)}

\leq 2 \sqrt{2} (∥ J ∥ T + ∥ F ∥) sup_{t} \sqrt{E α_{N, j} (t)} \leq (∥ J ∥ T + ∥ F ∥) C (T) N^{- 1 / 4},

with a constant

C (T)

depending on

{∥ A ∥}_{H S}, ϰ, ϰ_{L}, ∥ \hat{H} ∥

.

But by the assumption of the Theorem,

{(u, v)}_{t}^{M F G}

is the optimal choice for the limiting optimization problem. Hence the claim of the theorem follows. □

6. Discussion

The problem of proving existence or uniqueness for the solution of the limiting MFG on manifold seems to be nontrivial. We suggest it as an interesting open problem.

Let us give a bit more detail for the simplest case of two-dimensional Hilbert space (a qubit), as in Section 3.

When there is no control v (that is, operator L is constant) and there is no free (uncontrolled) part of the Hamiltonian, the limiting Equation (21) simplify to the equation (omitting indices j and t for simplicity)

d ψ = - i [u \hat{H} + A^{\bar{η}}] ψ d t + (L ψ - ψ) d N_{t} .

(41)

Moreover,

ψ

has only two coordinates:

ψ = (ψ_{0}, ψ_{1})

. Using Ito’s rule as in Section 3, we find the equation for

w = ψ_{1} / ψ_{0}

:

d w = i [u (w {(\hat{H} W)}_{0} - {(\hat{H} W)}_{1}) + w {(A^{\bar{η}} W)}_{0} - {(A^{\bar{η}} W)}_{1}] d t + ({(L W)}_{1} - w {(L W)}_{0}) d N_{t},

(42)

where

W = (w_{0}, w_{1}) = (1, w)

.

The most common interaction operator between qubits is the operator describing the possible exchange of photons,

A = a_{1}^{*} a_{2} + a_{2}^{*} a_{1}

, with the annihilation operators

a_{1}

and

a_{2}

of the two atoms. This interaction is given by the tensor

A (j, k; m, n)

such that

A (1, 0; 0, 1) = A (0, 1; 1, 0) = 1

with other elements vanishing. Hence

A_{10}^{η} = η_{01}

,

A_{01}^{η} = η_{10}

, with other elements vanishing. Let us take also the simplest possible L:

L = σ_{3}

—the third Pauli matrix. Then Equation (42) simplifies to

d w = i [u (w {(\hat{H} W)}_{0} - {(\hat{H} W)}_{1}) + {\bar{η}}_{10} w^{2} - {\bar{η}}_{01}] d t - 2 w d N_{t} .

(43)

If

\hat{H}

is diagonal with diagonal elements

h_{0}, h_{1}

, this turns to

d w = i [u w (h_{0} - h_{1}) + {\bar{η}}_{10} w^{2} - {\bar{η}}_{01}] d t - 2 w d N_{t} .

(44)

In this simplest case, choosing

h_{0} - h_{1} = 1

and

c = 0

in payoff (38), we obtain the HJB equation for the individual control in the form

\begin{matrix} \frac{\partial S}{\partial t} + max_{u} (u x \frac{\partial S}{\partial y} - u y \frac{\partial S}{\partial x}) + \frac{(W, J W)}{1 + {| w |}^{2}} \\ + \frac{\partial S}{\partial y} R e ({\bar{η}}_{10} w^{2} - {\bar{η}}_{01})] - \frac{\partial S}{\partial x} I m ({\bar{η}}_{10} w^{2} - {\bar{η}}_{01})] + (S (- x, - y) - S (x, y)) = 0 . \end{matrix}

(45)

Already this equation on the complex plane

C

, describing optimal control for the individual quantum feedback control in a qubit, is quite nonstandard. And to deal with the corresponding forward-backward system one needs not only its well-posedness in a certain generalized sense, but some continuous dependence on parameters. May be some method from [43] or [28] can be used to get insight into this problem.

As a future research direction it is worth mentioning the general development of the theory of the limiting classical mean-field games, which are mean-field games on infinite dimensional curvilinear manifolds based on Markov processes with jumps, highly fascinating and nontrivial objects. Of course usual questions of classical mean-field games on the connection between stationary and time dependent solutions are fully open here, as well as the theory of the corresponding master equation. On the other hand, quantum dynamic games of finite number of players (touched upon in Section 3) lead to new nonlinear functional-differential equations on manifolds of Hamilton-Jacobi or Isaacs type, which are also worthy of proper analysis.

Funding

The author gratefully acknowledges the funding by the Russian Academic Excellence project ‘5–100’.

Conflicts of Interest

The author declares no conflict of interest.

References

Kolokoltsov, V.N. Quantum Mean Field Games. arXiv 2020, arXiv:2005.02350. [Google Scholar]
Huang, M.; Malhamé, R.; Caines, P. Large population stochastic dynamic games: Closed-loop Mckean-Vlasov systems and the Nash certainty equivalence principle. Commun. Inf. Syst. 2006, 6, 221–252. [Google Scholar]
Lasry, J.-M.; Lions, P.-L. Jeux à champ moyen, I. Le cas stationnaire. C. R. Math. Acad. Sci. Paris 2006, 343, 619–625. [Google Scholar] [CrossRef]
Bensoussan, A.; Frehse, J.; Yam, P. Mean Field Games and Mean Field Type Control Theory; Springer: Berlin/Heidelberg, Germany, 2013. [Google Scholar]
Carmona, R.; Delarue, F. Probabilistic Theory of Mean Field Games with Applications, V. I, II. Probability Theory and Stochastic Modelling; Springer: Berlin/Heidelberg, Germany, 2018; Volume 83, p. 84. [Google Scholar]
Gomes, D.A.; Pimentel, E.A.; Voskanyan, V. Regularity Theory for Mean-Field Game Systems; Springer: Berlin/Heidelberg, Germany, 2016. [Google Scholar]
Kolokoltsov, V.N.; Malafeyev, O.A. Many Agent Games in Socio-Economic Systems: Corruption, Inspection, Coalition Building, Network Growth, Security; Springer Series in Operations Research and Financial Engineering; Springer Nature: Berlin/Heidelberg, Germany, 2019. [Google Scholar]
Meyer, D.A. Quantum strategies. Phys. Rev. Lett. 1999, 82, 1052–1055. [Google Scholar] [CrossRef]
Eisert, J.; Wilkens, M.; Lewenstein, M. Quantum Games and Quantum Strategies. Phys. Rev. Lett. 1999, 83, 3077–3080. [Google Scholar] [CrossRef]
Marinatto, L.; Weber, T. A quantum approach to static games of complete information. Phys. Lett. A 2000, 272, 291–303. [Google Scholar] [CrossRef]
Khan, F.S.; Solmeyer, N.; Balu, R.; Humble, T. Quantum games: A review of the history, current state, and interpretation. Quantum Inf. Process. 2018, 17, 309. [Google Scholar] [CrossRef]
Guo, H.; Zhang, J.; Koehler, G.J. A survey of quantum games. Decis. Support Syst. 2008, 46, 318–332. [Google Scholar] [CrossRef]
Kolokoltsov, V.N.; Malafeyev, O.A. Understanding Game Theory, 2nd ed.; World Scientific: Singapore, 2019. [Google Scholar]
Iqbal, A.; Toor, A.H. Equilibria of Replicator Dynamics in Quantum Games. arXiv 2001, arXiv:quant-ph/0106135. [Google Scholar]
Iqbal, A.; Toor, A.H. Quantum mechanics gives stability to a Nash equilibrium. Phys. Rev. A 2002, 65, 022306. [Google Scholar] [CrossRef]
Iqbal, A.; Toor, A.H. Darwinism in quantum systems? Phys. Lett. A 2002, 294, 261–270. [Google Scholar] [CrossRef]
Iqbal, A.; Toor, A.H. Quantum cooperative games. Phys. Lett. A 2002, 293, 103–108. [Google Scholar] [CrossRef]
Du, J.; Li, H.; Xu, X.; Shi, M.; Zhou, X.; Han, R. Nash equilibrium in the Quantum Battle of the Sexes Game. arXiv 2001, arXiv:quant-ph/0010050v3. [Google Scholar]
Du, J.; Li, H.; Xu, X.; Zhou, X.; Han, R. Phase-transition-like Behavior of Quantum Games. arXiv 2003, arXiv:quant-ph/0111138v4. [Google Scholar]
Du, J.; Li, H.; Xu, X.; Zhou, X.; Han, R. Entanglement Enhanced Multiplayer Quantum Games. Phys. Lett. A 2002, 302, 229–233. [Google Scholar] [CrossRef]
Li, H.; Du, J.; Massar, S. Continuous variable quantum games. Phys. Lett. A 2002, 306, 73–78. [Google Scholar] [CrossRef]
Pothos, E.; Busemeyer, J. Can quantum probability provide a new direction for cognitive modeling? Behav. Brain Sci. 2013, 36, 255–274. [Google Scholar] [CrossRef]
Khrennikov, A.Y. Ubiquitous Quantum Structure: From Psychology to Finance; Springer: Berlin/Heidelberg, Germany, 2010. [Google Scholar]
Aoki, S.; Ikeda, K. Repeated Quantum Games and Strategic Efficiency. Available online: https://arxiv.org/abs/2005.05588 (accessed on 6 January 2021).
Ikeda, K. Foundation of quantum optimal transport and applications. Quantum Inf. Process. 2020, 19, 25. [Google Scholar] [CrossRef]
Aoki, S.; Ikeda, K. Theory of Quantum Games and Quantum Economic Behavior. Available online: https://arxiv.org/abs/2010.14098 (accessed on 6 January 2021).
Bouten, L.; Handel, R.V. On the separation principle of quantum control. arXiv 2006, arXiv:math-ph/0511021v2. [Google Scholar]
Kolokoltsov, V.N. The stochastic Bellman equation as a nonlinear equation in Maslov spaces. Perturbation theory. Dokl. Akad. Nauk 1992, 323, 223–228. [Google Scholar]
Belavkin, V.P. Nondemolition measurement and control in quantum dynamical systems. In Information Complexity and Control in Quantum Physics; Diner, S., Lochak, G., Eds.; CISM Courses and Lectures; Springer: Vienna, Austria, 1987; Volume 294, pp. 331–336. [Google Scholar]
Belavkin, V.P. Nondemolition stochastic calculus in Fock space and nonlinear filtering and control in quantum systems. In Proceedings of the XXIV Karpacz Winter School Stochastic Methods in Mathematics and Physics, Karpacz, Poland, 13–27 January 1988; Guelerak, R., Karwowski, W., Eds.; World Scientific: Singapore, 1988; pp. 310–324. [Google Scholar]
Belavkin, V.P. Quantum stochastic calculus and quantum nonlinear filtering. J. Multivar. Anal. 1992, 42, 171–201. [Google Scholar] [CrossRef]
Belavkin, V.P.; Kolokoltsov, V.N. Stochastic evolution as interaction representation of a boundary value problem for Dirac type equation. Infin. Dimens. Anal. Probab. Relat. Fields 2002, 5, 61–92. [Google Scholar] [CrossRef]
Pellegrini, C. Poisson and Diffusion Approximation of Stochastic Schrödinger Equations with Control. Ann. Henri Poincaré 2009, 10, 995–1025. [Google Scholar] [CrossRef]
Barchielli, A.; Belavkin, V.P. Measurements contunuous in time and a posteriori states in quantum mechanics. J. Phys. A Math. Gen. 1991, 24, 1495–1514. [Google Scholar] [CrossRef]
Holevo, A.S. Statistical Inference for quantum processes. In Quanum Aspects of Optical Communications; Springer: Berlin, Germany, 1991; Volume 378, pp. 127–137. [Google Scholar]
Kolokoltsov, V.N. Continuous time random walks modeling of quantum measurement and fractional equations of quantum stochastic filtering and control. arXiv 2020, arXiv:2008.07355. [Google Scholar]
Armen, M.A.; Au, J.K.; Stockton, J.K.; Doherty, A.C.; Mabuchi, H. Adaptive homodyne measurement of optical phase. Phys. Rev. Lett. 2002, 89, 133602. [Google Scholar] [CrossRef]
Bushev, P.; Rotter, D.; Wilson, A.; Dubin, F.; Becher, C.; Eschner, J.; Blatt, R.; Steixner, V.; Rabl, P.; Zoller, P. Feedback cooling of a singe trapped ion. Phys. Rev. Lett. 2006, 96, 043003. [Google Scholar] [CrossRef]
Wiseman, H.M.; Milburn, G.J. Quantum Measurement and Control; Cambridge Univesity Press: Cambridge, UK, 2010. [Google Scholar]
Bouten, L.; Handel, R.V.; James, M. An introduction to quantum filtering. SIAM J. Control Optim. 2007, 46, 2199–2241. [Google Scholar] [CrossRef]
Pickl, P. A simple derivation of mean-field limits for quantum systems. Lett. Math. Phys. 2011, 97, 151–164. [Google Scholar] [CrossRef]
Knowles, A.; Pickl, P. Mean-field dynamics: Singular potentials and rate of convergence. Commun. Math. Phys. 2010, 298, 101–138. [Google Scholar] [CrossRef]
Averboukh, Y. Viability analysis of the first-order mean field games. ESAIM Control Optim. Calc. Var. 2020, 26, 33. [Google Scholar] [CrossRef]

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Quantum Mean-Field Games with the Observations of Counting Type

Abstract

1. Introduction

2. Quantum Filtering of Counting Type

3. Example of a Quantum Dynamic Two-Player Game

4. Controlled Limiting Stochastic Equation

5. Quantum MFG

6. Discussion

Funding

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics