Population Games, Stable Games, and Passivity

Fox, Michael J.; Shamma, Jeff S.

doi:10.3390/g4040561

Open AccessArticle

Population Games, Stable Games, and Passivity

by

Michael J. Fox

and

Jeff S. Shamma

^*

School of Electrical and Computer Engineering, Georgia Institute of Technology, 777 Atlantic Drive NW, Atlanta, GA 30332, USA

^*

Author to whom correspondence should be addressed.

Games 2013, 4(4), 561-583; https://doi.org/10.3390/g4040561

Submission received: 4 April 2013 / Revised: 3 September 2013 / Accepted: 26 September 2013 / Published: 7 October 2013

(This article belongs to the Special Issue Advances in Evolutionary Game Theory and Applications)

Download

Browse Figures

Versions Notes

Abstract

:

The class of “stable games”, introduced by Hofbauer and Sandholm in 2009, has the attractive property of admitting global convergence to equilibria under many evolutionary dynamics. We show that stable games can be identified as a special case of the feedback-system-theoretic notion of a “passive” dynamical system. Motivated by this observation, we develop a notion of passivity for evolutionary dynamics that complements the definition of the class of stable games. Since interconnections of passive dynamical systems exhibit stable behavior, we can make conclusions about passive evolutionary dynamics coupled with stable games. We show how established evolutionary dynamics qualify as passive dynamical systems. Moreover, we exploit the flexibility of the definition of passive dynamical systems to analyze generalizations of stable games and evolutionary dynamics that include forecasting heuristics as well as certain games with memory.

Keywords:

Population games; evolutionary games; passivity theory

1. Introduction

Evolutionary game theory (e.g., [1]), along with the related topic of learning in games, explores the dynamics of interacting players or populations. The framework entails introducing an “evolutionary dynamic” (or behavioral rule) and proceeding to analyze the resulting trajectories that a game induces. One of the goals of evolutionary game theory is to understand how rational solution concepts, such as Nash equilibrium, can emerge and be selected through simplistic evolutionary interactions. Oftentimes, an evolutionary dynamic for a specific game need not induce any stable equilibrium points. Indeed, different evolutionary dynamics can produce outcomes ranging from chaos to convergence (cf., [2] and [3]). Furthermore, games have been constructed that can be shown to never exhibit stable Nash equilibria under only very mild conditions on the dynamics themselves [4]. In contrast to such specific examples, researchers also have sought to identify broad classes of games for which correspondingly broad classes of dynamics converge to equilibrium, one example of which is potential games [5]. For a broader discussion of these and related issues, see [6,7,8].

We focus here on the recently proposed class of stable games [9]—a generalization of a number of earlier classes of games ranging from concave potential games to symmetric normal form games with an interior evolutionarily stable strategy (ESS). The appealing property of stable games is that their Nash equilibria comprise a convex set that many dynamics are guaranteed to reach [9].

In this paper, we show that stable games can be identified as a special case of the feedback-system-theoretic notion of passive input–output systems. Passivity is an abstraction of energy conservation and dissipation in mechanical and electrical systems [10] that has become a standard tool in the design and analysis of nonlinear feedback systems [10,11,12,13]. It provides conditions under which particular system interconnections will be stable.

The connection to evolutionary games is that an evolutionary dynamic can be viewed as a dynamical system in feedback with a game. Accordingly, after we identify stable games as passive systems, we are guaranteed that play by any admissible passive evolutionary dynamic will result in stability and convergence—provided that one indeed can define an analogous notion of passivity for evolutionary dynamics. It turns out that the various dynamics that guarantee global convergence in stable games do indeed satisfy a natural notion of passivity. In particular, passivity of an evolutionary dynamic can be interpreted as long run correlation between the time derivative of payoffs and the direction of motion (see Equation (54)). While passivity techniques have been used in analysis of game theoretic learning dynamics employed in certain specific engineering models [14,15], the notion of passivity capturing a class of dynamics or games is novel.

The contributions of this paper are summarized as follows:

We show that stable games can be formulated as dynamical systems exhibiting an appropriate form of passivity. We develop a complementary notion of passivity for learning dynamics, resulting in sufficient conditions for stability.
We show that learning dynamics known to exhibit convergence in stable games do indeed satisfy a relevant passivity condition.
The passivity conditions we introduce also apply to more general forms of games and dynamics. We provide stability results for certain games with memory as well as for learning dynamics that employ forecasting heuristics.
Finally, we extend our methods to games and dynamics that depend on finite histories of strategy change and proceed to analyze learning techniques that can achieve convergence in certain games with lags or time delays.

In pursuing a broad class of evolutionary dynamics that converges for the class of stable games, the analysis in [9] considers a particular evolutionary dynamic called “excess payoff/target” (EPT) dynamics that generalizes various other evolutionary dynamics. Certain EPT dynamics exhibit a property known as “positive correlation” (cf., its variant, termed virtual positive correlation [16]). In particular, there is positive correlation between between the instantaneous payoffs and the instantaneous direction of motion. It turns out that positive correlation alone need not imply convergence for stable games [9]. This realization motivated the introduction of additional properties such as “integrability” or the stronger “separability” specific to EPT dynamics. We will see that, rather than positive correlation, the aforementioned long run correlations provide the essential characteristic for convergence under stable games. Furthermore, these results suggest an interpretation of passive learning that complements the interpretation of stable games as strategic environments exhibiting self-defeating externalities.

An immediate benefit of our characterization, beyond providing a unifying framework to assess stability, is the novel generalizations it enables. Evolutionary game theory has historically placed particular emphasis on the study of memoryless games, in which payoffs are a function of current population levels rather than histories of population levels. Likewise, evolutionary game theory has historically only considered evolutionary dynamics of restricted dimension, in particular with order equal to the total number of strategies across all players. While our definitions include such settings, they are not restricted to them. Dynamic learning schemes that utilize additional, auxiliary states in reckoning strategy changes also can be analyzed using passivity. In particular, we identify games that preserve the convergence properties of passive learning dynamics when they are combined with prevalent forecasting heuristics like smoothing and trend following. Alternatively, certain dynamic games, that is, strategic environments where payoffs can depend on the entire action trajectory, can be shown to exhibit passivity.

Lastly, we probe the limits of the class of passive learning dynamics by suggesting an evolutionary dynamic in which players attempt to update strategies in a contrarian manner. Specifically, they discount payoffs to actions that have seen a rise in popularity over a defined lookback period. This scheme leads to an infinite-dimensional system. We find that this predisposition has no consequences for convergence of passive dynamics in stable games and all other passive strategic environments.

The remainder of this paper is organized as follows. Section 2 presents background material on population games and stable games. Section 3 presents a brief tutorial on passivity analysis for feedback systems. Section 4 contains the main results that define the notion of passivity for evolutionary dynamics and establish stability when coupled with a stable game. This section goes on to present various generalizations. Finally, Section 5 contains concluding remarks.

This paper expands and develops on the work reported by the authors in [17].

2. Background

2.1. Population Games

We begin with a description of single population games. The main results herein are extendable to multi-population games considered in [9] in a straightforward manner, and so the restriction to single populations simplifies notation.

A single population has a set of available strategies

S = \{1, 2, . . ., n\}

. The set of strategy distributions is

X = {x \in R_{+}^{n} : \sum_{i \in S} x_{i} = 1}

. Since strategies lie in the simplex, admissible changes in strategy are restricted to the tangent space

T X = {z \in R^{n} : \sum_{i \in S} z_{i} = 0}

.

The payoff function

F : X \to R^{n}

is a continuous map associating each strategy distribution with a payoff vector so that

F_{i} : X \to R^{n}

is the payoff to strategy

i \in S

.

A state

x \in X

is a Nash equilibrium, denoted

x \in NE (F)

, if each strategy in the support of x receives the maximum payoff available to the population, i.e.,

\begin{matrix} x \in NE (F) \Leftrightarrow [x_{i} > 0 \Rightarrow F_{i} (x) \geq F_{j} (x)], \forall i, j \in S . \end{matrix}

(1)

2.2. Stable Games

We say that

F : X \to R^{n}

is a stable game if

\begin{matrix} {(y - x)}^{T} (F (y) - F (x)) \leq 0 \forall x, y \in X . \end{matrix}

(2)

For a detailed discussion with examples of stable games, see [9]. Many evolutionary dynamics are well-behaved when restricted to the stable games. The above definition has the following equivalence when F is continuously differentiable (i.e.,

C^{1}

).

Theorem 2.1 [9] Suppose the population game F is continuously differentiable. Then F is a stable game if and only if the Jacobian matrix

D F (x)

is negative semidefinite with respect to

T X

for all

x \in X

, i.e.,

z^{T} D F (x) z \leq 0

for all

z \in T X

and

x \in X

.

If

D F (x)

is negative definite for all

x \in X

, then we say that F is strictly stable.

Reference [9] gives the definition of stable games the interpretation of self-defeating externalities. That is, the payoff improvements to strategies being adopted are dominated by the payoff improvements to strategies being abandoned. This is easy to see by letting

z = e_{j} - e_{i} \in T X

, the difference between two unit vectors, and noting that (by definition)

z^{T} D F (x) z \leq 0

.

Many games are known to be stable games including zero sum games, symmetric multi/network zero sum games, and concave potential games. For a thorough discussion see [9].

3. A Primer on Passivity in Feedback Systems

This section presents a brief tutorial on passivity analysis for feedback systems. For further discussion, see [10,11,12,13].

3.1. Circuit Origins

The term “passivity” is derived from circuit theory. Figure 1a illustrates a circuit network with two terminals. Let

v (t)

denote the time-varying voltage across the terminals and

i (t)

denote the corresponding time-varying current into the network. The relationship between

v (\cdot)

and

i (\cdot)

is determined by the specifics of the network. Let

E (t)

denote the energy stored in the network. The instantaneous power delivered to the network at time t is

v (t) \cdot i (t)

. If the network is passive, it does not have any internal “active” or “power generating” components. Accordingly, over any time interval

[t_{0}, t]

,

\begin{matrix} E (t) \leq E (t_{0}) + \int_{t_{0}}^{t} v (τ) i (τ) d τ . \end{matrix}

(3)

Figure 1. (a) One-port circuit network. (b) Interconnection of two circuit networks.

That is, the change in stored energy is less than the supplied energy (“less than” because energy may have dissipated, e.g., through resistors).

Figure 1b illustrates an interconnection of two networks. If both networks are passive, i.e., have no power generation, then one would expect that the stored energy, which can be transferred from network to network, does not increase overall.

Mathematically, passivity of the networks individually means

\begin{matrix} E_{1} (t) \leq E_{1} (t_{0}) + \int_{t_{0}}^{t} v_{1} (τ) i_{1} (τ) d τ \end{matrix}

(4)

\begin{matrix} E_{2} (t) \leq E_{2} (t_{0}) + \int_{t_{0}}^{t} v_{2} (τ) i_{2} (τ) d τ \end{matrix}

(5)

where

v_{j} (\cdot)

,

i_{j} (\cdot)

, and

E_{j} (\cdot)

,

j \in \{1, 2\}

, are the voltage, current, and energy of the two networks. Furthermore, the interconnection implies that

\begin{matrix} v_{1} (t) = v_{2} (t) and i_{1} (t) = - i_{2} (t) . \end{matrix}

(6)

Combining these relations with the above passivity conditions implies that

\begin{matrix} E_{1} (t) + E_{2} (t) \leq E_{1} (t_{0}) + E_{2} (t_{0}), \end{matrix}

(7)

as expected. Passivity analysis extends these ideas from circuits to feedback interconnections of dynamical systems.

3.2. Input–Output Operators and Feedback Interconnections

A dynamical system can be viewed as a (nonlinear) operator defined on function spaces. Let

L_{2}

denote the usual Hilbert space of square integrable functions1 mapping

[0, \infty) : = R_{+}

to

R^{n}

with inner product

〈 \cdot, \cdot 〉

and norm

∥\cdot∥

, i.e., for

f, g \in L_{2}

,

\begin{matrix} 〈 f, g 〉 = \int_{0}^{\infty} f {(t)}^{T} g (t) d t, \end{matrix}

(8)

\begin{matrix} ∥f∥ = {(\int_{0}^{\infty} f {(t)}^{T} f (t) d t)}^{1 / 2} = {〈 f, f 〉}^{1 / 2} . \end{matrix}

(9)

Let

L_{2, e}

denote the “extended” space of functions that are square integrable over finite intervals, i.e.,

\begin{matrix} L_{2, e} = \{f : R_{+} \to R^{n} : \int_{0}^{T} f {(t)}^{T} f (t) d t < \infty for all T \in R_{+}\} . \end{matrix}

(10)

For

U, Y \subset L_{2, e}

subsets of functions, an input–output operator is a mapping

S : U \to Y

.

Figure 2. (a) Feedback interconnection with negative feedback. (b) Feedback interconnection with positive feedback.

Figure 2a represents a feedback interconnection of two input–output operators,

S_{1} : U \to Y

and

S_{2} : Y \to U

. The feedback interconnection makes the “output” of

S_{1}

becomes the “input” of

S_{2}

. Likewise, the negative output of

S_{2}

becomes the input of

S_{1}

after summation with the external

r \in U

. Traditionally in control theory, feedback interconnections are with negative feedback as in Figure 2a rather than positive feedback as in Figure 2b. However, we will use positive feedback interconnections for games in the following analysis .

Figure 2a graphically represents the equations

\begin{matrix} u_{1} & = & r - y_{2} \\ y_{1} & = & S_{1} u_{1} \\ y_{2} & = & S_{2} u_{2} \\ u_{2} & = & y_{1}, \end{matrix}

(11)

which imply

\begin{matrix} u_{1} = r - S_{2} S_{1} u_{1} . \end{matrix}

(12)

We assume that for any

r \in U

, there exists a solution

u_{1} \in U

. Given this solution,

u_{1}

, one can go on to construct

y_{1}

,

u_{2}

, and

y_{2}

accordingly.

3.3. Passivity with input–output Operators

We now define passivity for input–output operators and present a basic theorem on the stability of feedback interconnections of passive operators. First, for

T \in R_{+}

, define the truncated inner-product

\begin{matrix} {〈 f, g 〉}_{T} = \int_{0}^{T} f {(t)}^{T} g (t) d t \end{matrix}

(13)

and truncated norm

\begin{matrix} {∥f∥}_{T} = {〈 f, f 〉}_{T}^{1 / 2} . \end{matrix}

(14)

The input–output operator

S : U \to Y

is passive if there exists a constant α such that

\begin{matrix} {〈 S u, u 〉}_{T} \geq α, for all u \in U, T \in R_{+}, \end{matrix}

(15)

and input strictly passive if there exist

β > 0

and α such that

\begin{matrix} {〈 S u, u 〉}_{T} \geq α + β {〈 u, u 〉}_{T}, for all u \in U, T \in R_{+} . \end{matrix}

(16)

Theorem 3.1 Consider the feedback interconnection of

S_{1}

and

S_{2}

defined by Equation (11). Assume

S_{1}

is passive and

S_{2}

is input strictly passive. Then

r \in L_{2}

implies

y_{1} \in L_{2}

.

In words, Theorem 3.1 can be interpreted as the stability of a feedback interconnection between a passive and strictly passive system. The proof is straightforward. Using Equation (11),

\begin{matrix} {〈 r, y_{1} 〉}_{T} = {〈 u_{1} + y_{2}, y_{1} 〉}_{T} = {〈 u_{1}, S_{1} u_{1} 〉}_{T} + {〈 S_{2} u_{2}, u_{2} 〉}_{T} . \end{matrix}

(17)

Therefore

\begin{matrix} {〈 r, y_{1} 〉}_{T} \geq α_{1} + α_{2} + β_{2} {〈 u_{2}, u_{2} 〉}_{T} = α_{1} + α_{2} + β_{2} {〈 y_{1}, y_{1} 〉}_{T}, \end{matrix}

(18)

where

α_{1}

,

α_{2}

, and

β_{2}

are the passivity constants associated with

S_{1}

and

S_{2}

. By the Schwarz inequality,

\begin{matrix} {∥r∥}_{T} {∥y_{1}∥}_{T} \geq α_{1} + α_{2} + β_{2} {∥y_{1}∥}_{T}^{2} . \end{matrix}

(19)

By assumption,

r \in L_{2}

, and so

{∥r∥}_{T}

is uniformly bounded over

T \geq 0

. Since

β_{2} > 0

, it follows that

{∥y_{1}∥}_{T}

is also uniformly bounded over

T \geq 0

.

3.4. Passivity with State Space Systems

The previous discussion was in terms of general input–output operators and illustrates that passivity methods need not be restricted to dynamical systems described by differential equations. We now present specialized results when the input–output operators are, in fact, represented by a system of differential equations. Consider the state space system

\begin{matrix} \dot{x} & = & f (x, u), x (0) = x_{0}, \\ y & = & g (x, u), \end{matrix}

(20)

where

u (t) \in U \subset R^{m}

,

y (t) \in R^{m}

, and

x_{0} \in M \subset R^{n}

. Let us assume that for some classes of functions,

U

and

Y

, and for any initial condition in

M

, there exists a solution for all

u \in U

resulting in

y \in Y

and

x (t) \in M

for all

t \geq 0

(i.e.,

M

is invariant). These equations define a family of input–output operators (indexed by the initial condition,

x_{0}

), mapping

u (\cdot)

to

y (\cdot)

. Accordingly, one can apply input–output operator notions of passivity for each initial condition.

For state space systems, however, the traditional approach is to define passivity irrespective of initial conditions as follows. A state-space system is passive if there exists a storage function

L : M \to R_{+}

such that for all

x_{0} \in M

,

u \in U

, and

t \in R_{+}

,

\begin{matrix} L (x (t)) \leq L (x_{0}) + \int_{0}^{t} u {(τ)}^{T} y (τ) d τ . \end{matrix}

(21)

The parallels with the discussion of circuits are that the “input” is identified as the voltage and “output” is identified as the current (or vice versa), which in turns motivate the terminology of a “storage” function.

We will focus on

C^{1}

storage functions, in which case the above definition may be written as

\begin{matrix} \dot{L} : = D L (x) f (x, u) \leq u^{T} y = u^{T} g (x, u) . \end{matrix}

(22)

for all

x \in M

and

u \in U

. Inequalities such as (22) are referred to as “dissipation inequalities”.

We now consider a feedback interconnection of two state space systems as in Figure 2a, in which each system,

S_{i}

, is represented by state space equations

\begin{matrix} {\dot{x}}_{i} & = & f_{i} (x, u) x_{i} (0) = x_{i 0} \in M_{i}, i \in \{1, 2\} \\ y_{i} & = & g_{i} (x_{i}, u_{i}), i \in \{1, 2\} \\ u_{1} & = & r - y_{2} \\ u_{2} & = & y_{1} \end{matrix}

(23)

Again, we assume the existence of solutions.

Theorem 3.2 Consider the feedback interconnection of

S_{1}

and

S_{2}

defined by Equation (23). Assume each

S_{i}

is passive with storage function

L_{i}

. Then for all

x_{1} (0) \in M_{1}

,

x_{2} (0) \in M_{2}

, and

t \in R_{+}

,

\begin{matrix} L_{1} (x_{1} (t)) + L_{2} (x_{2} (t)) \leq L_{1} (x_{1} (0)) + L_{2} (x_{2} (0)) + \int_{0}^{t} r {(τ)}^{T} y_{1} (τ) d τ . \end{matrix}

(24)

An interpretation is that the closed-loop system mapping r to

y_{1}

is passive with storage function

L_{1} + L_{2}

. The derivation is an immediate consequence of the definitions.

In case

r = 0

, Theorem 3.2 implies that

L_{1} (x_{1} (t)) + L_{2} (x_{2} (t))

is non-increasing along solutions of state space (23). Compare to the previous discussion on circuits, in which energy was non-increasing for interconnected circuits. This monotonicity can have stability implications depending on the underlying specifics. For example, if both

L_{1}

and

L_{2}

are positive definite with

L_{i} (0) = 0

, and if the passivity inequalities of (22) are strict for

x_{i} \neq 0

, then the origin is locally asymptotically stable. These details are omitted here. The main point for now is that, as before, there are stability implications associated with the feedback interconnection of two passive systems.

4. Main Results

4.1. Preview

Traditionally, we think of games as memoryless mappings from strategy

x \in X

to payoff

F (x)

. In a dynamic setting, we can extend this viewpoint to a mapping of strategy trajectories

x (\cdot)

to payoff trajectories

π (\cdot) = F (x (\cdot))

. Likewise, evolutionary dynamics (see below) can be viewed as mappings from payoff trajectories to strategy trajectories. Accordingly, we can view evolutionary games as a feedback interconnection. In terms of the positive feedback digram Figure 2b, we will set

r = 0

and associate

S_{1}

with an evolutionary dynamic process and

S_{2}

with a game. Accordingly,

u_{1} = y_{2}

represent payoff trajectories and

y_{1} = u_{2}

represent strategy trajectories.

Here we find the first departure from standard passivity analysis. Namely, we are dealing with positive feedback as in Figure 2b instead of the traditional negative feedback of Figure 2a, and so we will need to make suitable (simple) modifications to the analysis.

There is a more significant departure from standard passivity analysis that is at the heart of associating stable games with passive systems. As mentioned above, let us think of a passive game as an input–output operator mapping strategy trajectories to payoff trajectories. Then for

C^{1}

games,

\begin{matrix} \dot{π} (t) = D F (x (t)) \dot{x} (t) . \end{matrix}

(25)

By the definition of stable games,

\begin{matrix} \dot{x} {(t)}^{T} \dot{π} (t) = \dot{x} {(t)}^{T} D F (x (t)) \dot{x} (t) \leq 0 \end{matrix}

(26)

pointwise in time. Consequently,

\begin{matrix} \int_{0}^{T} \dot{x} {(t)}^{T} \dot{π} (t) d t = {〈 \dot{π}, \dot{x} 〉}_{T} \leq 0 . \end{matrix}

(27)

While this inequality resembles a passivity condition, we find two differences. First, the direction of the inequality has changed. This reversal suits the shift to positive feedback interconnections and will result in a definition of “anti-passivity”.

Second, and more significantly, the inner product involves the derivatives of the input and output. The approach taken in the precursor paper [17] was to represent a

C^{1}

game as an input–output system through these derivatives as follows:

\begin{matrix} \dot{x} = u \end{matrix}

(28)

\begin{matrix} y = \dot{π} = D F (x) u . \end{matrix}

(29)

Here, we see that the “input” u is actually

\dot{x}

, and the “output” y is

\dot{π}

. Using such associations enables the application of standard passivity theory at the cost of associating time derivatives as inputs and outputs.

In this paper, we take an alternative and more direct (but essentially equivalent) approach. We will continue to view evolutionary dynamics as mappings from payoff trajectories (not their derivatives) to strategy trajectories. However, we will introduce a notion of “differential” or simply δ-passivity (see also [18]) which mimics standard passivity, but uses derivatives of inputs and outputs as the integrand for passivity conditions.

Our notion of differential passivity can be interpreted as a local version of “incremental passivity”, defined in [11] as

\begin{matrix} {〈 S u - S u^{'}, u - u^{'} 〉}_{T} \geq 0, \end{matrix}

(30)

which closely resembles the original definition of Equation (2).

4.2. δ-Passivity and δ-Anti-Passivity

We begin with definitions for input–output operators. Let

S : U \to Y

be an input–output operator. We assume both

U

and

Y

are subsets of locally Lipschitz functions over

[0, \infty)

. This assumption implies a requisite differentiability (for almost all

t \in R_{+}

) as well as certain required boundedness properties as well.

The input–output operator

S : U \to Y

is

δ-passive if there exists a constant α such that

$\begin{matrix} {〈 \dot{(S u)}, \dot{u} 〉}_{T} \geq α, for all u \in U, T \in R_{+} . \end{matrix}$

(31)
input strictly δ-passive if there exist $β > 0$ and α such that

$\begin{matrix} {〈 \dot{(S u)}, \dot{u} 〉}_{T} \geq α + β {〈 \dot{u}, \dot{u} 〉}_{T}, for all u \in U, T \in R_{+} . \end{matrix}$

(32)
δ-anti-passive if $- S$ is passive.
input strictly δ-anti-passive if $- S$ is input strictly δ-passive.

We now present a stability theorem for δ-passivity analogous to Theorem 3.1 for the positive feedback interconnection Figure 2b, where

S_{1} : U \to Y

and

S_{2} : Y \to U

. The feedback equations are:

\begin{matrix} u_{1} & = & r + y_{2} \\ y_{1} & = & S_{1} u_{1} \\ y_{2} & = & S_{2} u_{2} \\ u_{2} & = & y_{1} . \end{matrix}

(33)

We assume existence of solutions and that r is such that

u_{1} \in U

(which is satisfied, in particular, for

r = 0

).

Theorem 4.1 Consider the feedback interconnection of

S_{1}

and

S_{2}

defined by Equation (33). Assume

S_{1}

is δ-passive and

S_{2}

is input strictly δ-anti-passive. Then

\dot{r} \in L_{2}

implies

{\dot{y}}_{1} \in L_{2}

.

The proof parallels that of Theorem 3.1 and is omitted.

We continue with a definition of δ-passivity for state space systems as in Equation (20). We assume that

g (\cdot, \cdot)

is

C^{1}

and input functions in

U

are locally Lipschitz continuous. We assume further that functions in

U

are

U

valued, with

U \subset R^{m}

, and denote

T U

to be the associated tangent space. Throughout this paper, we will have

T U

as either

R^{m}

or

T X

.

A state space system is δ-passive if there exists a

C^{1}

storage function

L : M \times R^{m} \to R_{+}

such that for all

x \in M

,

u \in U

, and

\dot{u} \in T U

,

\begin{matrix} \nabla_{x} L (x, u) f (x, u) + \nabla_{u} L (x, u) \dot{u} \leq {\dot{u}}^{T} (\nabla_{x} g (x, u) f (x, u) + \nabla_{u} g (x, u) \dot{u}), \end{matrix}

(34)

or more succinctly,

\begin{matrix} \dot{L} \leq {\dot{u}}^{T} \dot{y} . \end{matrix}

(35)

It is δ-anti-passive if

\begin{matrix} \dot{L} \leq - {\dot{u}}^{T} \dot{y} . \end{matrix}

(36)

In case the state space system has no state, as in

y = g (u)

, then we take

L = 0

in the above definitions.

We now state a stability theorem that is tailored for the forthcoming discussion. We are concerned with a special case of positive feedback depicted in Figure 2b. Each system,

S_{i}

, is represented by state space equations, as in

\begin{matrix} {\dot{x}}_{i} & = & f_{i} (x, u) x_{i} (0) = x_{i 0} \in M_{i}, i \in \{1, 2\} \\ y_{1} & = & g_{1} (x_{1}), y_{2} = g_{2} (x_{2}, u_{2}), i \in \{1, 2\} \\ u_{1} & = & y_{2} \\ u_{2} & = & y_{1} . \end{matrix}

(37)

In particular, we have set

r = 0

and eliminated dependence of

y_{1}

on

u_{1}

to avoid any pathological issues with algebraic loops. Again, we assume the existence of solutions.

Theorem 4.2 Consider the feedback interconnection of

S_{1}

and

S_{2}

defined by Equation (37). Assume

S_{1}

is δ-passive and

S_{2}

is δ-anti-passive, with storage functions

L_{1}

and

L_{2}

, respectively.

For all $x_{1} (0) \in M_{1}$ , $x_{2} (0) \in M_{2}$ , and for $t \in R_{+}$ ,

$\begin{matrix} {\dot{L}}_{1} + {\dot{L}}_{2} \leq 0 . \end{matrix}$

(38)
Furthermore, if the level set

$\begin{matrix} \{(x_{1}, x_{2}) : L_{1} (x_{1}, g_{2} (x_{2}, g_{1} (x_{1}))) + L_{2} (x_{2}, g_{1} (x_{1})) \leq ℓ\} \end{matrix}$

(39)

with

$\begin{matrix} ℓ = L_{1} (x_{1} (0), g_{2} (x_{2} (0), g_{1} (x_{1} (0)))) + L_{2} (x_{2} (0), g_{1} (x_{1} (0))) \end{matrix}$

(40)

is compact, and

$\begin{matrix} {\dot{L}}_{1} + {\dot{L}}_{2} \leq - ψ (\dot{h}), \end{matrix}$

(41)

for some positive definite $ψ (\cdot)$ and $C^{1}$ function $h : M_{1} \times M_{2} \to R^{k}$ , then ${lim}_{t \to \infty} \dot{h} (t) = 0 .$

Proof. Statement 1 is a direct consequence of the definitions of δ-passive and δ-anti-passive. Expanding the definition of

L_{1}

and

L_{2}

, we see that

\begin{matrix} L_{1} (x_{1}, u_{1}) + L_{2} (x_{2}, u_{2}) = L_{1} (x_{1}, g_{2} (x_{2}, g_{1} (x_{1}))) + L_{2} (x_{2}, g_{1} (x_{1})) \end{matrix}

(42)

is non-increasing. Statement 2 assumes some “strictness” in passivity expressed through

ψ (\dot{h})

. The conclusion follows from an application of LaSalle’s invariance theorem. ☐

We conclude this section with a discussion relating passivity and δ-passivity. Starting from an original state space system:

\begin{matrix} \dot{x} & = & f (x, u), x (0) = x_{0} \\ y & = & g (x, u), \end{matrix}

(43)

we can construct an extended system defined by

\begin{matrix} \dot{u} & = & u^{'}, u (0) = u_{0} \\ \dot{x} & = & f (x, u), x (0) = x_{0} \\ y^{'} & = & \nabla_{x} g (x, u) f (x, u) + \nabla_{u} g (x, u) u^{'} . \end{matrix}

(44)

The extended system has as an “input”

u^{'}

which equals

\dot{u}

of the original system. Similarly, the extended system has as an “output”

y^{'}

which equals

\dot{y}

of the original. Also, note that the state of the expanded system,

x^{'}

is

(x, u)

.

The condition for a storage function,

L (x, u)

, for δ-passivity for the original system is

\begin{matrix} \dot{L} \leq {\dot{u}}^{T} \dot{y} \end{matrix}

(45)

which is expanded as

\begin{matrix} \nabla_{x} L (x, u) f (x, u) + \nabla_{u} L (x, u) \dot{u} \leq {\dot{u}}^{T} (\nabla_{x} g (x, u) f (x, u) + \nabla_{u} g (x, u) \dot{u}) . \end{matrix}

(46)

The condition for a storage function,

L^{'}

, for standard passivity of the extended system is

\begin{matrix} \dot{L^{'}} \leq {u^{'}}^{T} y^{'} . \end{matrix}

(47)

Since the state of the extended system is

(x, u)

, the resulting expansion is

\begin{matrix} \nabla_{x} L^{'} (x, u) f (x, u) + \nabla_{u} L^{'} (x, u) u^{'} \leq {u^{'}}^{T} (\nabla_{x} g (x, u) f (x, u) + \nabla_{u} g (x, u) u^{'}) . \end{matrix}

(48)

Identifying

u^{'}

with

\dot{u}

we see that δ-passivity for the original system corresponds to standard passivity for the extended system.

This correspondence was used in the precursor paper [17] by defining an extended system for both the evolutionary dynamic and population. That approach resulted in a duplication of states in the two extended systems. The present approach, by defining δ-passivity, avoids such difficulties and is more direct.

4.3. Application to Stable Games and Evolutionary Dynamics

4.3.1. Stable Games

We now state formally the motivating connection between stable games and passivity. Let

X

denote locally Lipschitz X-valued functions over

R_{+}

, and

P

denote locally Lipschitz

R^{n}

-valued functions over

R_{+}

.

Theorem 4.3 A

C^{1}

stable game mapping

X

to

P

is δ-anti-passive, i.e.,

\begin{matrix} {〈 \dot{(F (x))}, \dot{x} 〉}_{T} \leq 0, f o r a l l x (\cdot) \in X, T \in R_{+} . \end{matrix}

(49)

Furthermore, if F is strictly stable, then the mapping is input strictly δ-anti-passive, i.e., for some

β > 0

,

\begin{matrix} {〈 \dot{(F (x))}, \dot{x} 〉}_{T} \leq - β {〈 \dot{x}, \dot{x} 〉}_{T}, f o r a l l x (\cdot) \in X, T \in R_{+} . \end{matrix}

(50)

Proof. By definition,

\begin{matrix} {〈 \dot{(F (x))}, \dot{x} 〉}_{T} = \int_{0}^{T} \dot{x} {(τ)}^{T} D F (x (τ)) \dot{x} (τ) d τ . \end{matrix}

(51)

δ-anti-passivity follows from negative definiteness of the integrand. Furthermore, if F is strictly stable, there exists a

β > 0

such that

\begin{matrix} \dot{x} {(τ)}^{T} D F (x (τ)) \dot{x} (τ) < - β \dot{x} {(τ)}^{T} \dot{x} (τ), \end{matrix}

(52)

which implies the desired result. ☐

In identifying stable games as δ-anti-passive systems, we see that evolutionary dynamics that are δ-passive are the complement of stable games in that the stability Theorems 4.1–4.2 are applicable for any δ-passive dynamic. Inspecting the definition for δ-passivity, we will require that for some α,

\begin{matrix} \int_{0}^{T} \dot{x} {(t)}^{T} \dot{p} (t) d t \geq α . \end{matrix}

(53)

Since this equation holds for all

T \geq 0

, it implies a long run correlation between the flow of population state with the flow of payoffs, namely,

\begin{matrix} \underset{T \to \infty}{lim inf} \frac{1}{T} \int_{0}^{T} \dot{x} {(t)}^{T} \dot{p} (t) d t \geq 0 . \end{matrix}

(54)

4.3.2. Passive Evolutionary Dynamics

We now examine evolutionary dynamics from the perspective of δ-passivity. A general form for evolutionary dynamics is

\begin{matrix} \dot{x} = V (x, F (x)) \end{matrix}

(55)

which describes the evolution of the strategy state,

x \in X

, for the population game F. From a feedback interconnection perspective, an evolutionary dynamic describes how strategy trajectories evolve in response to payoff trajectories. Accordingly, we will remove any explicit game description and write

\begin{matrix} \dot{x} = V (x, p) . \end{matrix}

(56)

In terms of previous discussions, the payoff vector p is an “input” and the strategy x is the output. Again, in establishing that an evolutionary dynamic is δ-passive, we do not assert that

p = F (x)

. Rather,

p (\cdot)

is drawn from a class of trajectories. Since stable games are δ-anti-passive as mappings from

X

to

P

, we are interested in conditions for an evolutionary dynamic to be δ-passive as a mapping from

P

to

X

. Specializing the definition of δ-passivity for state space systems to the current setup, we seek to find a storage function

L : X \times R^{n} \to R_{+}

such that

\begin{matrix} \nabla_{x} L (x, p) V (x, p) + \nabla_{p} L (x, p) \dot{p} \leq {\dot{p}}^{T} V (x, p) \end{matrix}

(57)

for all admissible

\dot{p}

. Once the above equality is established, then one can employ Theorems 4.1 and 4.2 to make conclusions about stability.

We will focus specifically on so-called excess payoff target (EPT) dynamics [9,19]. These dynamics form a class of evolutionary dynamics that contain several well studied cases.

First, define the excess payoff function

ξ : X \times R^{n} \to R^{n}

by

\begin{matrix} ξ (x, p) = p - (x^{T} p) \cdot 1, \end{matrix}

(58)

where

1

is a vector of ones. EPT dynamics take the form

\begin{matrix} \dot{x} = V_{EPT} (x, p) : = τ (ξ (x, p)) - (1^{T} τ (ξ (x, p))) \cdot x, \end{matrix}

(59)

where

τ : R^{n} \to R_{+}^{n}

is called the revision protocol (see [9] for a thorough discussion.). Following [9], we make the following assumptions:

positive correlation: $V_{EPT} (x, p) \neq 0 \Rightarrow p^{T} V_{EPT} (x, p) > 0$ .
integrability: $τ = \nabla γ$ for some $C^{1}$ function $γ : R^{n} \to R$ .

Finally, let

P_{ρ}

denote the subset

\begin{matrix} \{p \in P : |p (t)| \leq ρ, for all t \in R_{+}\}, \end{matrix}

(60)

where

|\cdot|

denotes the euclidean norm on

R^{n}

.

Theorem 4.4 For any

ρ > 0

, EPT dynamics are δ-passive as a mapping from

P_{ρ}

to

X

with storage function

γ (ξ (x, p)) + C

for some constant C.

Proof. Following [9], take as a candidate storage function,

\begin{matrix} L (x, p) = γ (ξ (x, p)) . \end{matrix}

(61)

From integrability of τ,

\begin{matrix} \nabla_{x} L (x, p) \dot{x} + \nabla_{p} L (x, p) \dot{p} & = & τ {(ξ (x, p))}^{T} \nabla_{x} ξ (x, p) \dot{x} + τ {(ξ (x, p))}^{T} \nabla_{p} ξ (x, p) \dot{p} \\ = & τ {(ξ (x, p))}^{T} (- 1 \cdot p^{T} V_{EPT} (x, p)) + τ {(ξ (x, p))}^{T} ((I - 1 \cdot x^{T}) \dot{p}) \\ = & - (1^{T} τ (ξ (x, p))) (p^{T} V_{E P T} (x, p)) + (τ {(ξ (x, p))}^{T} - ((τ {(ξ (x, p))}^{T} 1) \cdot x^{T}) \dot{p} \\ = & - (1^{T} τ (ξ (x, p))) (p^{T} V_{E P T} (x, p)) + {\dot{p}}^{T} V_{EPT} (x, p) \\ \leq & {\dot{p}}^{T} V_{EPT} (x, p) \\ = & {\dot{p}}^{T} \dot{x}, \end{matrix}

(62)

where the last inequality is due to non-negativity of τ and positive correlation.

The remainder of the proof resolves a technicality that defines storage functions to be positive. Set

\begin{matrix} C = min \{γ (ξ (x, p)) : x \in X, |p| \leq ρ\} . \end{matrix}

(63)

Then

γ (ξ (x, p)) + C

is non-negative for all

x \in X

and

|p| \leq ρ

. ☐

The proof closely follows the stability proof in [9] that establishes

γ (ξ (x, F (x)))

as a Lyapunov function. This resemblance should not be surprising, in light of Theorem 4.2, which establishes that the sum of the storage functions in a feedback interconnection is non-increasing.

An important difference here is that the proof does not specify the origins of the payoff trajectory. That is, we do not presume that

p (t) = F (x (t))

for some stable game. Accordingly, the established δ-passivity will have stability implications for “generalized” stable games (see forthcoming sections).

The above proposition also establishes that EPT dynamics are δ-passive as an input–output operator. In particular, since p is the “input” and x is the “output”,

\begin{matrix} γ (ξ (x (T), p (T))) - γ (ξ (x (0), p (0))) \leq {〈 \dot{x}, \dot{p} 〉}_{T} . \end{matrix}

(64)

One can construct a suitable passivity constant α by maximizing over

x (0)

,

x (T)

,

p (0)

, and

p (t)

, as in the construction of C.

It also is possible to establish δ-passivity for other learning dynamics considered in [9]. As in the case with EPT dynamics, the proofs parallel previous stability proofs in [9], but with new interpretations with broader implications. The only subtlety is that, as before, we do not associate

p = F (x)

in the process of establishing passivity.

We illustrate this argument for so-called impartial pairwise comparison dynamics, which are not of the EPT form [9]. First, define Lipschitz continuous switch rates

\begin{matrix} ϕ_{j} : R \to R_{+} \end{matrix}

(65)

with the property that

\begin{matrix} ϕ_{j} (δ) > 0 \Leftrightarrow δ > 0 . \end{matrix}

(66)

Impartial pairwise comparison dynamics are defined as

\begin{matrix} {\dot{x}}_{i} = \sum_{j = 1}^{n} x_{j} ϕ_{i} (p_{i} - p_{j}) - \sum_{j = 1}^{n} x_{i} ϕ_{j} (p_{j} - p_{i}) . \end{matrix}

(67)

The interpretation from [9] is that the flow from strategy i to j depends on the relative payoffs,

p_{i}

and

p_{j}

. Furthermore, impartiality means that the flow rate,

ϕ_{j} (\cdot)

, only depends on the destination (and not origin) strategy.

Theorem 4.5 Impartial pairwise comparison dynamics defined by Equation (67) are δ-passive as a mapping from

P

to

X

.

Proof. Following [9], take as a candidate storage function

\begin{matrix} L (x, p) = \sum_{i = 1}^{n} x_{i} (\sum_{j = 1}^{n} \int_{0}^{p_{j} - p_{i}} ϕ_{j} (s) d s) . \end{matrix}

(68)

The derivative is

\begin{matrix} \dot{L} = \sum_{i = 1}^{n} ({\dot{x}}_{i} (\sum_{j = 1}^{n} \int_{0}^{p_{j} - p_{i}} ϕ_{j} (s) d s) + x_{i} (\sum_{j = 1}^{n} ϕ_{j} (p_{j} - p_{i}) ({\dot{p}}_{j} - {\dot{p}}_{i}))) . \end{matrix}

(69)

Note that we do not take

\dot{p} = \nabla F (x) \dot{x}

.

Arguments in [9] establish that the summation of the first term satisfies

\begin{matrix} \sum_{i = 1}^{n} {\dot{x}}_{i} (\sum_{j = 1}^{n} \int_{0}^{p_{j} - p_{i}} ϕ_{j} (s) d s) \leq 0 . \end{matrix}

(70)

Rearranging the summation of the second term,

\begin{matrix} \sum_{i = 1}^{n} x_{i} (\sum_{j = 1}^{n} ϕ_{j} (p_{j} - p_{i}) ({\dot{p}}_{j} - {\dot{p}}_{i})) \end{matrix}

\begin{matrix} = & (\sum_{j = 1}^{n} {\dot{p}}_{j} \sum_{i = 1}^{n} x_{i} ϕ_{j} (p_{j} - p_{i})) - (\sum_{i = 1}^{n} {\dot{p}}_{i} \sum_{j = 1}^{n} x_{i} ϕ_{j} (p_{j} - p_{i})) \end{matrix}

(71)

\begin{matrix} = & \sum_{i = 1}^{n} {\dot{p}}_{i} (\sum_{j = 1}^{n} x_{j} ϕ_{i} (p_{i} - p_{j}) - \sum_{j = 1}^{n} x_{i} ϕ_{j} (p_{j} - p_{i})) \end{matrix}

(72)

\begin{matrix} = & \sum_{i = 1}^{n} {\dot{p}}_{i} {\dot{x}}_{i} . \end{matrix}

(73)

Therefore,

\begin{matrix} \dot{L} \leq {\dot{p}}^{T} \dot{x}, \end{matrix}

(74)

as desired. ☐

4.3.3. Dynamically Modified Payoffs

We now illustrate how passivity methods can be used for the analysis of evolutionary dynamics with auxiliary states. In this section, we consider evolutionary dynamics acting on dynamically modified payoffs. These modifications can be interpreted in two ways: (i) dynamic modification as part of an evolutionary process coupled with a static game, or (ii) a game with dynamic dependencies coupled with a standard evolutionary dynamic. Figure 3 illustrates these two perspectives. In either case, the interconnection, and hence analysis, remains the same.

A consequence of dynamic modifications is the introduction of auxiliary states other than the strategy states. This setting is a departure from much of the literature on evolutionary games, which, almost exclusively, considers evolutionary dynamics whose dimension equals the number of strategies. Likewise, game payoffs typically are static functions of strategies.

Throughout this section, the stable games we consider are affine functions the state, i.e.,

\begin{matrix} p = A x + b . \end{matrix}

(75)

where A is symmetric negative definite.

Smoothed payoff modification: Payoffs are subject to an exponentially weighted moving average. Given a payoff stream

p (t)

, the smoothed payoffs are

\begin{matrix} \tilde{p} (t) = e^{- λ t} p (0) + \int_{0}^{t} e^{- λ (t - τ)} p (τ) d τ . \end{matrix}

(76)

Figure 3. (a) Static game with dynamically modified evolution. (b) Standard evolution with dynamically modified payoffs.

An effect of the averaging is to smooth out short term fluctuations in order to isolate longer term trends.

In state space form, the mapping from strategies to modified payoffs is described by

\begin{matrix} \dot{\tilde{p}} = λ (\underset{p}{\underset{︸}{A x + b}} - \tilde{p}) . \end{matrix}

(77)

Theorem 4.6 The state space system Equation (77) is δ-anti-passive as a mapping from

X

to

P

.

Proof. Take as a candidate storage function

\begin{matrix} L (\tilde{p}, x) = - \frac{λ}{2} {(A x + b - \tilde{p})}^{T} A^{- 1} (A x + b - \tilde{p}) . \end{matrix}

(78)

Then

\begin{matrix} \dot{L} & = & \nabla_{\tilde{p}} L (\tilde{p}, x) \dot{\tilde{p}} + \nabla_{x} L (\tilde{p}, x) \dot{x} \end{matrix}

(79)

\begin{matrix} = & - λ {(A x + b - \tilde{p})}^{T} A^{- 1} (A \dot{x} - \dot{\tilde{p}}) \end{matrix}

(80)

\begin{matrix} = & - {\dot{\tilde{p}}}^{T} \dot{x} + {\dot{\tilde{p}}}^{T} A^{- 1} \dot{\tilde{p}} \end{matrix}

(81)

\begin{matrix} \leq & - {\dot{\tilde{p}}}^{T} \dot{x}, \end{matrix}

(82)

where the last inequality is due to the negative definiteness of A. ☐

An implication of Theorem 4.6 is that Theorems 4.1 and 4.2 are now applicable for any δ-passive evolutionary dynamic coupled with smoothed payoffs of an affine stable game.

Anticipatory payoff modification: In anticipatory payoff modification, payoff streams are used to construct myopic forecasts of payoffs. Evolution (or learning) then acts on these myopic forecasts rather than the instantaneous payoffs. The concept is inspired by classical methods in feedback control as well as the psychological tendency to extrapolate from past trends. Anticipatory learning was utilized in [3,20,21], where it was shown how anticipatory learning can alter the convergence to both mixed and pure equilibria.

The state space equations for anticipatory payoff modification are

\begin{matrix} \dot{q} & = & λ (A x + b - q) \end{matrix}

\begin{matrix} \tilde{p} & = & (A x + b) + k λ (A x + b - q) . \end{matrix}

(83)

Here, the modified payoff is a combination of the original payoff and an estimate of its derivative, i.e.,

\begin{matrix} \tilde{p} \approx p + k {\dot{p}}^{est} . \end{matrix}

(84)

The specific estimate of

\dot{p}

here is

\dot{q}

(see the discussion in [20]), which can be constructed from payoff measurements. The scalar k reflects the weighting on the derivative estimate.

Theorem 4.7 The state space system Equation (83) is δ-anti-passive as a mapping from

X

to

P

.

Proof. Take as a candidate storage function

\begin{matrix} L (q, x) = - \frac{λ}{2} {(A x + b - q)}^{T} A^{- 1} (A x + b - q) . \end{matrix}

(85)

Then

\begin{matrix} \dot{L} = - λ {(A x + b - q)}^{T} A^{- 1} (A \dot{x} - \dot{q}) . \end{matrix}

(86)

By definition,

\begin{matrix} \dot{\tilde{p}} = A \dot{x} + k λ (A \dot{x} - \dot{q}) . \end{matrix}

(87)

Therefore,

\begin{matrix} \dot{L} & = & - λ {(A x + b - q)}^{T} A^{- 1} (\dot{\tilde{p}} - A \dot{x}) \frac{1}{k λ} \end{matrix}

(88)

\begin{matrix} = & - {\dot{q}}^{T} A^{- 1} (\dot{\tilde{p}} - A \dot{x}) \frac{1}{k λ} . \end{matrix}

(89)

Using the above equation for

\dot{\tilde{p}}

results in

\begin{matrix} \dot{L} & = & {(\frac{1}{k λ} (\dot{\tilde{p}} - A \dot{x}) - A \dot{x})}^{T} A^{- 1} (\dot{\tilde{p}} - A \dot{x}) \frac{1}{k λ} \end{matrix}

(90)

\begin{matrix} = & \frac{1}{{(k λ)}^{2}} {(\dot{\tilde{p}} - A \dot{x})}^{T} A^{- 1} (\dot{\tilde{p}} - A \dot{x}) + \frac{1}{k λ} {\dot{x}}^{T} A \dot{x} - \frac{1}{k λ} {\dot{x}}^{T} \dot{\tilde{p}} \end{matrix}

(91)

\begin{matrix} \leq & - \frac{1}{k λ} {\dot{x}}^{T} \dot{\tilde{p}}, \end{matrix}

(92)

where the last inequality is due to the negative definiteness of A. We see that rescaling

L_{new} (q, x) = k λ L (q, x)

leads to the desired result. ☐

The proof of Theorem 4.7 reveals that the associated dissipation inequality is satisfied strictly because of the two terms involving A. In particular, the anticipatory payoff modification of Equation (83) defines an input strictly δ-anti-passive system. The following representative theorem is then an immediate consequence of Theorem 4.1.

Theorem 4.8 In the positive feedback interconnection defined by Equation (33) (as in Figure 2b), let

S_{1}

be any δ-passive evolutionary dynamic mapping payoff trajectories

p (\cdot) \in P

to state trajectories

x (\cdot) \in X

, and let

S_{2}

be anticipatory payoff modification defined by Equation (83). Then

\dot{x} (\cdot) \in L_{2}

.

In case the evolutionary dynamic has a state space description (e.g., EPT), one can use Theorem 4.2 to conclude

{lim}_{t \to \infty} \dot{x} (t) = 0

.

Implicit in the above discussion is that convergence of

\dot{x}

has implications about convergence to Nash equilibrium. Any such conclusions are specific to the underlying evolutionary dynamic.

4.3.4. Contrarian Effect Payoffs

In this section, we illustrate the use of passivity methods in the presence of lags or time delays. In this model, players perceive advantages in avoiding strategies that have seen net increase in recent usage. In particular, for a fixed lag

ℓ > 0

, payoffs are given by “contrarian effect” payoffs, defined by

\begin{matrix} \tilde{p} (t) = F (x (t)) - Λ (x (t) - x (t - ℓ)), \end{matrix}

(93)

where

Λ > 0

is a diagonal scaling matrix. We assume that strategies are initialized by some Lipschitz continuous

x_{0} : [- ℓ, 0] \to X

, so that for

t - ℓ \leq 0

,

\begin{matrix} x (t - ℓ) : = x_{0} (t - ℓ) . \end{matrix}

(94)

As intended, an increase in the usage of a strategy diminishes the perceived payoff derived from that strategy. While such a contrarian effect defined here may seem simplistic, our main interest is to illustrate the analysis of delays using passivity methods.

In this section, we will deal exclusively with the input–output operator formulation of passivity. We begin by establishing the following variant of δ-anti-passivity of contrarian effect payoffs. First, define

X_{x_{o}}

to be the restriction of

X

to functions with

x (0) = x_{o}

.

Proposition 4.1 Let F be a strictly stable game with strict passivity constant,

β_{2} > 0

, i.e.,

\begin{matrix} {\dot{z}}^{T} D F (x) \dot{z} \leq - β_{2} {\dot{z}}^{T} \dot{z}, \forall z \in T X, x \in X . \end{matrix}

(95)

Let

x_{0} : [- ℓ, 0] \to X

be Lipschitz continuous. Let

S : X_{x_{0} (0)} \to P

be the (contrarian effect payoff) input–output operator defined by Equation (93). Then

\begin{matrix} - {〈 \dot{\tilde{p}}, \dot{x} 〉}_{T} \geq β_{2} {〈 \dot{x}, \dot{x} 〉}_{T} - α_{2}^{'} {∥\dot{x}∥}_{ℓ}, \end{matrix}

(96)

with

α_{2}^{'} = {(\int_{- ℓ}^{0} {\dot{x}}_{0} {(t)}^{T} Λ^{2} {\dot{x}}_{0} (t) d t)}^{1 / 2}

.

Proof. See appendix.

Proposition 4.1 states that contrarian effect payoffs satisfy a version of passivity that deviates slightly from the usual definition of δ-anti-passivity, since the associated passivity lower bound (associated with α) is

- α_{2}^{'} {∥\dot{x}∥}_{ℓ}

and depends on the input signal, but in a bounded manner.

The following theorem now follows from arguments similar to those for Theorem 4.1.

Theorem 4.9 In the positive feedback interconnection defined by Equation (33) (as in Figure 2b), let

S_{1}

be any δ-passive evolutionary dynamic mapping payoff trajectories

p (\cdot) \in P

to state trajectories

x (\cdot) \in X

, and let

S_{2}

be contrarian effect payoffs defined by Equation (93) with F strictly stable. Then

\dot{x} (\cdot) \in L_{2}

.

Proof. Let

α_{1}

be the passivity constant of the δ-passive evolutionary dynamic, so that

\begin{matrix} {〈 \dot{\tilde{p}}, \dot{x} 〉}_{T} \geq α_{1} . \end{matrix}

(97)

By Proposition 4.1, contrarian effect payoffs satisfy

\begin{matrix} - {〈 \dot{\tilde{p}}, \dot{x} 〉}_{T} \geq β_{2} {〈 \dot{x}, \dot{x} 〉}_{T} - α_{2}^{'} {∥\dot{x}∥}_{ℓ} . \end{matrix}

(98)

Summing these inequalities leads to

\begin{matrix} - α_{1} + α_{2}^{'} {∥\dot{x}∥}_{ℓ} \geq β_{2} {〈 \dot{x}, \dot{x} 〉}_{T}, \end{matrix}

(99)

which then implies that

\dot{x} (\cdot) \in L_{2}

, as desired. ☐

5. Concluding Remarks

This paper has proposed passivity theory as a unifying and extending framework for the study of evolutionary games. In particular, the passivity condition property of long run correlated payoff flows and population flows in Equation (54) appears to be a natural complement to the class of stable games. The methods are applicable to generalizations of stable games with dynamic payoffs or evolutionary dynamics that include auxiliary states. We believe that there is significant potential in bringing in related methods of feedback control theory, such as generalized dissipativity, multipliers, and loop transformations, to complement the more traditional analytical approaches to evolutionary games. A lingering question is to understand the extent to which passive evolutionary dynamics and stable games are complementary. We conjecture that if an evolutionary dynamic is not passive then one can construct a (generalized) stable game that results in instability. Stated differently, an evolutionary dynamic results in stability for all generalized stable games if and only if it is passive.

Acknowledgments

Research supported by ONR project #N00014-09-1-0751. We thank William H. Sandholm, Georgios Piliouras, Nikhil Chopra, Jason Marden, Georgios Chasparis, Ozan Candogan, and Georgios Kotsalis for helpful discussions.

A. Proof of Proposition 4.1

Contrarian effect payoffs in Equation (93) can be written as the sum of two terms,

\begin{matrix} \tilde{p} (t) = {\tilde{p}}_{SG} (t) - {\tilde{p}}_{CE} (t), \end{matrix}

(100)

i.e., the “stable game” portion,

\begin{matrix} {\tilde{p}}_{SG} (t) = F (x (t)) \end{matrix}

(101)

and the “contrarian effect” portion,

\begin{matrix} {\tilde{p}}_{CE} (t) = Λ (x (t) - x (t - ℓ)) . \end{matrix}

(102)

Accordingly,

\begin{matrix} - {〈 \dot{\tilde{p}}, \dot{x} 〉}_{T} & = & - {〈 {\dot{\tilde{p}}}_{SG}, \dot{x} 〉}_{T} + {〈 {\dot{\tilde{p}}}_{CE}, \dot{x} 〉}_{T} \end{matrix}

(103)

\begin{matrix} = & - \int_{0}^{T} \dot{x} {(τ)}^{T} D F (x (τ)) \dot{x} (τ) d τ + {〈 {\dot{\tilde{p}}}_{CE}, \dot{x} 〉}_{T} \end{matrix}

(104)

\begin{matrix} \geq & β_{2} {〈 \dot{x}, \dot{x} 〉}_{T} + {〈 {\dot{\tilde{p}}}_{CE}, \dot{x} 〉}_{T} . \end{matrix}

(105)

It then remains to show that

\begin{matrix} {〈 {\dot{\tilde{p}}}_{CE}, \dot{x} 〉}_{T} \geq - α_{2}^{'} {∥\dot{x}∥}_{ℓ} . \end{matrix}

(106)

We will need the following lemma, whose proof relies on standard arguments from systems theory (e.g., [11]).

Lemma A.1 For any

v \in L_{2}

and

ℓ \geq 0

,

\begin{matrix} \int_{0}^{\infty} (v (t) - v (t - ℓ)) v (t) d t \geq 0, \end{matrix}

(107)

where

v (t - ℓ) : = 0

for

t - ℓ < 0

.

Proof. Define the extensions

\bar{v} : (- \infty, \infty) \to R^{n}

\begin{matrix} \bar{v} (t) = \{\begin{matrix} 0 & t < 0; \\ v (t) & t \geq 0 . \end{matrix} \end{matrix}

(108)

Let

\hat{v}

be the Fourier transform of

\bar{v}

, i.e.,

\begin{matrix} \hat{v} (j ω) : = \int_{- \infty}^{\infty} \bar{v} (t) e^{- j ω t} d t . \end{matrix}

(109)

Likewise, let

\begin{matrix} \bar{w} (t) = \bar{v} (t) - \bar{v} (t - ℓ) \end{matrix}

(110)

and let

\hat{w}

be the Fourier transform of

\bar{w}

. Parseval’s Theorem states that

\begin{matrix} \int_{- \infty}^{\infty} \bar{w} {(t)}^{T} \bar{v} (t) d t = \frac{1}{2 π} \int_{- \infty}^{\infty} \hat{w} {(j ω)}^{*} \hat{v} (j ω) d ω, \end{matrix}

(111)

where superscript “*” denotes complex conjugate transform. Using that

\begin{matrix} \hat{w} (j ω) = \hat{v} (j ω) - e^{- j ω ℓ} \hat{v} (j ω) \end{matrix}

(112)

results in

\begin{matrix} \int_{- \infty}^{\infty} \bar{w} {(t)}^{T} \bar{v} (t) d t & = & \frac{1}{2 π} \int_{- \infty}^{\infty} \hat{v} {(j ω)}^{*} (I - e^{j ω ℓ} I) \hat{v} (j ω) d ω, \end{matrix}

(113)

\begin{matrix} = & \frac{1}{2 π} \int_{- \infty}^{\infty} \hat{v} {(j ω)}^{*} (I - \frac{1}{2} (e^{j ω ℓ} + e^{- j ω ℓ}) I) \hat{v} (j ω) d ω, \end{matrix}

(114)

\begin{matrix} = & \frac{1}{2 π} \int_{- \infty}^{\infty} (1 - cos (ω ℓ)) \hat{v} {(j ω)}^{*} \hat{v} (j ω) d ω \end{matrix}

(115)

\begin{matrix} \geq & 0, \end{matrix}

(116)

as desired. ☐

Lemma A.1 almost provides the desired conclusion, except that

\begin{matrix} {〈 {\dot{\tilde{p}}}_{CE}, \dot{x} 〉}_{T} = {〈 Λ^{1 / 2} (\dot{x} (t) - \dot{x} (t - ℓ)), Λ^{1 / 2} \dot{x} 〉}_{T} \end{matrix}

(117)

involves terms due to the initialization

x_{0} (\cdot)

over the interval

[- ℓ, 0]

. These terms can be bounded as follows. For any

T > 0

, define

\begin{matrix} v (t) = \{\begin{matrix} 0, & t < 0; \\ \dot{x} (t), & 0 \leq t \leq T; \\ 0 & t \geq T . \end{matrix} \end{matrix}

(118)

and

\begin{matrix} v_{0} (t) = \{\begin{matrix} 0, & t < - ℓ; \\ {\dot{x}}_{0} (t), & - ℓ \leq t \leq 0; \\ 0, & t \geq 0 . \end{matrix} \end{matrix}

(119)

and set

\begin{matrix} w (t) = Λ (v (t) - v (t - ℓ) - v_{0} (t - ℓ)) \end{matrix}

(120)

for

t \in (- \infty, \infty)

. Then

\begin{matrix} {〈 {\dot{\tilde{p}}}_{CE}, \dot{x} 〉}_{T} & = & {〈 w, v 〉}_{T} \\ = & \int_{0}^{T} w {(t)}^{T} v (t) d t \\ = & \int_{0}^{T} {(v (t) - v (t - ℓ))}^{T} Λ v (t) d t - \int_{0}^{T} v_{0} {(t - ℓ)}^{T} Λ v (t) d t \\ = & \int_{- \infty}^{T} {(v (t) - v (t - ℓ))}^{T} Λ v (t) d t - \int_{0}^{T} v_{0} {(t - ℓ)}^{T} Λ v (t) d t, (since v (t) = 0 for t < 0) \\ = & \int_{- \infty}^{\infty} {(v (t) - v (t - ℓ))}^{T} Λ v (t) d t - \int_{0}^{T} v_{0} {(t - ℓ)}^{T} Λ v (t) d t, (since v (t) = 0 for t > T) \end{matrix}

(121)

Using Lemma A.1, the first term above is positive, and the second term is bounded from below via

\begin{matrix} - \int_{0}^{T} v_{0} {(t - ℓ)}^{T} Λ v (t) \geq - {∥v∥}_{ℓ} α_{2}^{'} . \end{matrix}

(122)

References

Sandholm, W.H. Population Games and Evolutionary Dynamics (Economic Learning and Social Evolution); The MIT Press: Cambridge, MA, USA, 2011. [Google Scholar]
Sato, Y.; Akiyama, E.; Farmer, J.D. Chaos in learning a simple two-person game. Proc. Natl. Acad. Sci. U.S.A. 2002, 99, 4748–4751. [Google Scholar] [CrossRef] [PubMed]
Arslan, G.; Shamma, J.S. Anticipatory Learning in General Evolutionary Games. In Proceedings of the 45th IEEE Conference on Decision and Control, San Diego, CA, USA, 13–15 December, 2006; pp. 6289–6294.
Hart, S.; Colell, A.M. Uncoupled dynamics do not lead to Nash equilibrium. The American Economic Review 2003, 93, 1830–1836. [Google Scholar] [CrossRef]
Monderer, D.; Shapley, L. Potential games. Games Econ. Behav. 1996, 14, 124–143. [Google Scholar] [CrossRef]
Hart, S. Adaptive Heuristics. Econometrica 2005, 73, 1401–1430. [Google Scholar] [CrossRef]
Young, H.P. Strategic Learning and its Limits; Oxford University Press: Oxford, UK, 2005. [Google Scholar]
Fudenberg, D.; Levine, D. Learning and equilibrium. Annu. Rev. Econom. 2009, 1, 385–420. [Google Scholar] [CrossRef]
Sandholm, W.H.; Hofauer, J. Stable games and their dynamics. J. Econ. Theory 2009, 144, 1665–1693. [Google Scholar]
Willems, J.C. Dissipative dynamical systems - Part I, Part II. Arch. Ration. Mech. Anal. 1972, 45, 321–393. [Google Scholar] [CrossRef]
Desoer, C.; Vidyasagar, M. Feedback Systems: Input-Output Properties; Academic Press, Inc.: New York, NY, USA, 1975. [Google Scholar]
Khalil, H. Nonlinear Systems, 3rd ed.; Prentice Hall: Upper Saddle River, New Jersey, 2002. [Google Scholar]
van der Schaft, A. L2-Gain and Passivity Techniques in Nonlinear Control; Springer Verlag: Berlin/Heidelberg, Germany, 2000. [Google Scholar]
Ramrez-Llanos, E.; Quijano, N. A population dynamics approach for the water distribution problem. Int. J. Control 2010, 83, 1947–1964. [Google Scholar] [CrossRef]
Fan, X.; Alpcan, T.; Arcak, M.; Wen, T.J.; Basar, T. A passivity approach to game-theoretic CDMA power control. Automatica 2006, 42, 1837–1847. [Google Scholar] [CrossRef]
Hofbauer, J.; Sandholm, W.H. Evolution in games with randomly disturbed payoffs. J. Econ. Theory 2007, 132, 47–69. [Google Scholar] [CrossRef]
Fox, M.J.; Shamma, J.S. Population Games, Stable Games, and Passivity. In Proceedings of the 51st IEEE Conference on Decision and Control, Maui, Hawaii, 10–13 December, 2012; pp. 7445–7450.
Wang, H. Differential-Passivity based Controlled Synchronization of Networked Robots with Additive Disturbances. In Proceedings of the 31st Chinese Control Conference (CCC‘2), Hefei, China, 25–27 July, 2012; pp. 5838–5843.
Sandholm, W. Excess payoff dynamics and other well-behaved evolutionary dynamics. J. Econ. Theory 2005, 124, 149–170. [Google Scholar] [CrossRef]
Shamma, J.S.; Arslan, G. Dynamic fictitious play, dynamic gradient play, and distributed convergence to Nash equilibria. IEEE Trans. Automat. Contr. 2005, 50, 312–327. [Google Scholar] [CrossRef]
Chasparis, G.; Shamma, J. Distributed dynamic reinforcement of efficient outcomes in multiagent coordination and network formation. Dynamic Games and Applications 2012, 2, 18–50. [Google Scholar] [CrossRef]

^1.For notational simplicity, we suppress the dimension n in $L_{2}$ .

© 2013 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).

Share and Cite

MDPI and ACS Style

Fox, M.J.; Shamma, J.S. Population Games, Stable Games, and Passivity. Games 2013, 4, 561-583. https://doi.org/10.3390/g4040561

AMA Style

Fox MJ, Shamma JS. Population Games, Stable Games, and Passivity. Games. 2013; 4(4):561-583. https://doi.org/10.3390/g4040561

Chicago/Turabian Style

Fox, Michael J., and Jeff S. Shamma. 2013. "Population Games, Stable Games, and Passivity" Games 4, no. 4: 561-583. https://doi.org/10.3390/g4040561

Article Menu

Population Games, Stable Games, and Passivity

Abstract

1. Introduction

2. Background

2.1. Population Games

2.2. Stable Games

3. A Primer on Passivity in Feedback Systems

3.1. Circuit Origins

3.2. Input–Output Operators and Feedback Interconnections

3.3. Passivity with input–output Operators

3.4. Passivity with State Space Systems

4. Main Results

4.1. Preview

4.2. δ-Passivity and δ-Anti-Passivity

4.3. Application to Stable Games and Evolutionary Dynamics

4.3.1. Stable Games

4.3.2. Passive Evolutionary Dynamics

4.3.3. Dynamically Modified Payoffs

4.3.4. Contrarian Effect Payoffs

5. Concluding Remarks

Acknowledgments

A. Proof of Proposition 4.1

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI