Partial Order Games

Zahoransky, Valeria; Gutierrez, Julian; Harrenstein, Paul; Wooldridge, Michael

doi:10.3390/g13010002

Open AccessEditor’s ChoiceArticle

Partial Order Games

¹

Fachbereich Informatik, Université du Luxembourg, Maison du Nombre, 6 Avenue de la Fonte, L-4364 Esch-sur-Alzette, Luxembourg

²

Faculty of Information Technology, Monash University, Woodside Building for Technology and Design, 20 Exhibition Walk, Clayton, VIC 3168, Australia

³

Department of Computer Science, University of Oxford, Wolfson Building, Parks Road, Oxford OX1 3QD, UK

^*

Author to whom correspondence should be addressed.

Games 2022, 13(1), 2; https://doi.org/10.3390/g13010002

Submission received: 19 November 2021 / Revised: 9 December 2021 / Accepted: 15 December 2021 / Published: 21 December 2021

(This article belongs to the Section Algorithmic and Computational Game Theory)

Download

Browse Figures

Versions Notes

Abstract

We introduce a non-cooperative game model in which players’ decision nodes are partially ordered by a dependence relation, which directly captures informational dependencies in the game. In saying that a decision node v is dependent on decision nodes

v_{1}, \dots, v_{k}

, we mean that the information available to a strategy making a choice at v is precisely the choices that were made at

v_{1}, \dots, v_{k}

. Although partial order games are no more expressive than extensive form games of imperfect information (we show that any partial order game can be reduced to a strategically equivalent extensive form game of imperfect information, though possibly at the cost of an exponential blowup in the size of the game), they provide a more natural and compact representation for many strategic settings of interest. After introducing the game model, we investigate the relationship to extensive form games of imperfect information, the problem of computing Nash equilibria, and conditions that enable backwards induction in this new model.

Keywords:

game theory; non-cooperative games; Nash equilibrium; backwards induction; computational complexity

1. Introduction

The two most important game models in non-cooperative game theory are normal form games and extensive form games. These games are distinguished by the information that players have about the strategies of other players. In normal form games, players must select and commit to strategies without any information relating to the strategies of others. In contrast, in extensive form games, players make moves alternately over time and, during play, their strategy may be informed by moves made previously. Variations of extensive form games—e.g., of imperfect information or recall—make it possible to capture the information available to a player when called upon to make a move.

In this article, we introduce partial order games. The key distinguishing feature of partial order games is that they are equipped with a dependence relation, which explicitly captures the informational dependencies between decision nodes. In saying that a decision node v is dependent on decision nodes

v_{1}, \dots, v_{k}

, we mean that the information available to a strategy making a choice at v is precisely the choices that were made at

v_{1}, \dots, v_{k}

. Observe that the informational dependencies captured by a dependence relation give rise to a partial temporal order over choices: in this case, the choices for

v_{1}, \dots, v_{k}

must strictly precede the choice for v. However, the temporal ordering induced in this way is indeed partial: it may be that two choices

v^{'}

and

v^{″}

are independent, in which case one can say nothing about their temporal order.

In a technical sense, partial order games are no more expressive than extensive form games of imperfect information: one of our key results is to show that every partial order game can be transformed into a strategically equivalent extensive form game of imperfect information. However, for some settings, partial order games have significant advantages over their extensive form representation:

First, informational dependencies in partial order games are explicitly captured, while in extensive form games, they are left implicit in the information sets of the game. As a consequence, some settings are much more transparently and naturally represented using partial order games, compared to the extensive form.
Second, partial order games can be exponentially more compact than their extensive form. From a purely practical point of view, this means that some scenarios with a compact partial order representation cannot be handled (by people or by a computer) in their extensive form.

We analyse partial order games by means of Nash equilibrium—arguably game theory’s most prominent non-cooperative solution concept—and the solutions given by a natural backwards induction procedure for partial order games defined on the dependence relation. Here, we give special attention to the computational complexity of calculating such solutions in partial order games. The remainder of this article is structured as follows.

We begin by briefly recalling some necessary concepts from graph theory and game theory, and then introduce the formal framework of partial order games.
As we are interested in the computational properties of partial order games, we introduce a compact representation for strategies and utility functions in partial order games, based on Boolean circuits, which enables us to investigate questions about their computational complexity.
In Section 4, we study the relationship of our game model to other game models: partial order Boolean games [1], Multi-Agent Influence Diagrams (MAIDs) [2,3], and extensive form games of imperfect information. Our main result in this section is to present a technique for translating partial order games into strategically equivalent extensive form games, although this translation comes at the cost of an exponential blowup in the size of the game. This leads us into a discussion of the use of partial order games as a compact representation of extensive form games.
In Section 5, we investigate the problem of computing Nash equilibria in partial order games. For example, we show that checking whether a game has any pure strategy Nash equilibria is NEXPTIME-complete.
In Section 6, we study backwards induction in partial order games. As partial order games are inherently games of imperfect information, it follows that backwards induction does not always work for partial order games. We thus investigate cases where forms of backward induction work for partial order games, and present a condition on games that we call fit for backwards induction, which is sufficient to allow backwards induction.
We conclude with a brief discussion and pointers for future work.

2. Preliminary Definitions

We begin by recalling some concepts from graph theory and game theory, which we use in the remainder of the paper.

2.1. Directed Acyclic Graphs and Trees

The games in this paper are defined on directed acyclic graphs (DAGs). Formally, a directed acyclic graph is a pair

(V, E)

, where

V = {v_{1}, \dots, v_{| V |}}

is a finite set of vertices or nodes and

E \subseteq V \times V

a set of directed edges (or arcs) on V. We assume E to be acyclic (and thus also irreflexive). We say that vertex u is a parent of vertex v, and v a child of u, if

(u, v) \in E

. We occasionally also use infix notation and write

u E v

for

(u, v) \in E

. The depth of a vertex v, denoted by

depth (v)

, is defined recursively such that if v has no parents,

depth (v) = 0

, and otherwise,

depth (v)

equals the maximum depth of v’s parents plus 1. Formally,

depth (v) = \{\begin{matrix} 0 & if v has no parents, \\ max_{(u, v) \in E} depth (u) + 1 & otherwise . \end{matrix}

We say that vertex v is reachable from vertex u if

u = v

, or if there is some vertex w such that w is reachable from u and

(w, v) \in E

, that is if

u E^{*} v

, where

E^{*}

is the reflexive and transitive closure of E.

A topological sorting is a permutation

τ = (u_{1}, \dots, u_{| V |})

of the vertices

v_{1}, \dots, v_{| V |}

in V such that

u_{i} E u_{j}

implies

i < j

. Every DAG

(V, E)

admits at least one topological sorting of its vertices; this well-known fact will be important and useful in our study of partial order games.

A tree is a directed acyclic graph in which there is a unique vertex with no parents (the root of the tree, often denoted by

v_{0}

), and where every non-root vertex has a single parent. If a tree is finite, then, as it is acyclic, some vertices will have no children: we refer to these as the leaves of the tree. Observe that the root of the tree is the unique vertex of depth 0.

2.2. Normal-Form Games

We use normal-form games as the basis of our game-theoretic analysis of partial order games. Normal-form games are defined by a set of players, the strategies the players have at their disposal, and the preferences the players have over the outcomes that players choosing their strategies give rise to. Formally, a normal-form game is given by a tuple

(N, {(S_{i})}_{i \in N}, {(u_{i})}_{i \in N})

, where

N = {1, \dots, | N |}

is the set of players,

S_{i}

is the set of (pure) strategies available to player i, and

u_{i} : S_{1} \times \dots \times S_{| N |} \to R

is a utility function for each player i [4]. We refer to tuples

s = (s_{1}, \dots, s_{| N |})

in

S_{1} \times \dots \times S_{| N |}

as strategy profiles.

Nash equilibrium is the most important solution concept in non-cooperative game theory; it captures the idea of a joint course of action that is stable in the sense that no player has an incentive to deviate unilaterally from it. Furthermore, in this paper, it is one of the main analytic tools by means of which we evaluate partial order games. Formally, a strategy profile

s = (s_{1}, \dots, s_{| N |})

is a Nash equilibrium if, for all players i and strategies

s_{i}^{'} \in S_{i}

, we have

u_{i} (s) \geq u_{i} (s_{- i}, s_{i}^{'}),

where

(s_{- i}, s_{i}^{'}) = (s_{1}, \dots, s_{i - 1}, s_{i}^{'}, s_{i + 1}, \dots, s_{| N |})

.

We say that two normal-form games

(N, {(S_{i})}_{i \in N}, {(u_{i})}_{i \in N})

and

(N^{'}, {(S_{i}^{'})}_{i \in N^{'}}, {(u_{i}^{'})}_{i \in N})

are strategically isomorphic if there are bijections

ϕ : N \to N^{'}

and

ψ_{i} : S_{i} \to S_{f (i)}^{'}

such that, for all players i in N and all strategy profiles

s = (s_{1}, \dots, s_{n})

and

s^{'} = (s_{1}^{'}, \dots, s_{n}^{'})

we have

u_{i} (s) \leq u_{i} (s^{'}) if and only if u_{ϕ (i)}^{'} (ψ (s)) \leq u_{ϕ (i)}^{'} (ψ (s^{'}))

where

ψ (s) = (ψ_{i} (s_{1}), \dots, ψ_{n} (s_{n}))

and

ψ (s^{'}) = (ψ_{i} (s_{1}^{'}), \dots, ψ_{n} (s_{n}^{'}))

.

Given a normal-form game, two strategies

s_{i}, s_{i}^{'}

in

S_{i}

for a player i are said to be equivalent, in symbols

s_{i} \sim s_{i}^{'}

, if

u_{j} (s_{- i}^{″}, s_{i}) = u_{j} (s_{- i}^{″}, s_{i}^{'})

for all profiles

s_{- i}^{″} \in S_{1} \times \dots \times S_{i - 1} \times S_{i + 1} \times \dots \times S_{| N |}

and all players j. Let

{[s_{i}]}_{\sim}

be the equivalence class of

s_{i}

under the equivalence relation ∼, and denote

{[s]}_{\sim} = ({[s_{1}]}_{\sim}, \dots, {[s_{| N |}]}_{\sim})

. Following the work in [5], we define the reduced normal-form of a normal-form game

(N, {(S_{i})}_{i \in N}, {(u_{i})}_{i \in N})

as the game

(N, {(S_{i}^{'})}_{i \in N}, {(u_{i}^{'})}_{i \in N})

where for each player i,

S_{i}^{'} = {{[s_{i}]}_{\sim} : s_{i} \in S_{i}}

and

u_{i}^{'} ({[s]}_{\sim}) = u_{i} (s)

for all strategy profiles

s = (s_{1}, \dots, s_{| N |})

. Two games are then said to be strategically equivalent if they induce strategically isomorphic reduced normal-forms.

2.3. Extensive-Form Games

An extensive-form game (of imperfect information) is based on a directed tree

(V, E)

and is played by a set of players N choosing from a set of actions A, starting from the root of the tree. Within the set V of vertices, we distinguish between action or decision nodes, which have children, and leaf nodes, which have no children. The set of action nodes we denote by D and the set of leaf nodes by L. With every action node v in D we associate a unique player

i_{v}

, who is then said to be active at v, and an action set

A_{v} \subseteq A

such that every outgoing edge

(v, u)

is associated with a unique action in

A_{v}

. Let

V_{i}

denote the set of action nodes associated with player i, that is,

V_{i} = {v \in V : i_{v} = i}

, and let

A_{i} = {a \in A_{v} : v \in V_{i}}

.

Every vertex v is identified with a unique sequence of actions, called a history, which leads from the root

v_{0}

to v. Histories associated with leaf nodes are referred to as terminal histories. We will denote the history associated with vertex v by

h_{v}

. We can also think of vertices v as histories: vertex v is the sequence

h_{v} = (a_{1}, \dots, a_{depth (v)})

of actions for which there is a path

v_{0} = w_{0} \dots, w_{depth (v)} = v

such that

(w_{i}, w_{i + 1})

is associated with action

a_{i + 1}

for every

0 \leq i < depth (v)

. The root

v_{0}

is thus the empty sequence

()

. Thus, we also find that

v E w

if and only if

w = v a

for some

a \in A_{v}

.

For every player i, we have a partitioning

I_{i} = {I_{1}, \dots, I_{| I_{i} |}}

of their decision nodes

V_{i}

into (non-empty) information sets. We write

[v]

for the information set vertex v belongs to. Here, it is understood that

A_{u} = A_{v}

, whenever u and v are in the same information set I. We then also write

A_{I}

for

A_{u}

and

A_{v}

. Information sets may be singletons. If all information sets for all players are singletons, we say the game is of perfect information and do not distinguish between information set

{v}

and the node v itself, if no confusion is likely.

Finally, each player i’s preferences are represented by a utility value

u_{h}^{i} \in R

at each Version December 9, 2021 submitted to Games terminal history/leaf

h

. See Figure 1 for an example.

A strategy for player i in an extensive-form game is a function

s_{i} : I_{i} \to A_{i}

such that

s_{i} (I) \in A_{I}

for all

I \in I_{i}

. The set of strategies available to player i we then denote by

S_{i}

. A strategy profile is then a sequence

s = (s_{1}, \dots, s_{| N |})

of strategies, one for each player. Let

D = {v_{1}, \dots, v_{| D |}}

. Then, every strategy profile

s = (s_{1}, \dots, s_{| N |})

defines an action profile

a^{s} = (a_{v_{1}}, \dots, a_{v_{| D |}})

such that

a_{v} = s_{i} ([v])

and where i is a player who is active at v. Observe that a strategy profile

s = (s_{1}, \dots, s_{| N |})

also defines a unique path

w_{0}, \dots, w_{k}

from the root

w_{0}

to a leaf

w_{k}

with history

h^{s} = (a_{1}, \dots, a_{k})

defined such that, for all

0 \leq i < k

, edge

(w_{i}, w_{i + i})

is associated with action

a_{i} = s_{i} ([w_{i}])

, where i is the player who is active at

w_{i}

. Finally, we have for each player i a utility function

u_{i} : S_{1} \times \dots \times S_{| N |} \to R

associating each strategy profile

s = (s_{1}, \dots, s_{| N |})

with a real value such that

u_{i} (s) = u_{i}^{h_{s}}

.

The players N of an extensive-form game, together with their strategies

S_{i}

and their utilities

u_{i}

, induce a normal-form game

(N, {(S_{i})}_{i \in N}, {(u_{i})}_{i \in N})

. Note that there may be multiple strategy profiles giving rise to the same terminal history, that is, it may well be that

h^{s} = h^{s^{'}}

, even though

s \neq s^{'}

. Therefore, the size of this normal-form game may be exponential in the size of the underlying extensive-form game, if represented naively. This is because there are

\prod_{I \in I} | A_{I} {| = O (| A |}^{| V |})

strategy profiles, which are implicit in the definition of an extensive-form game: see Figure 2 for an illustration.

2.4. Boolean Circuits

In computer science, Boolean circuits are a well-established model for computing Boolean functions (cf., e.g., [6,7]). In this paper, we will make extensive use of them to concisely represent strategies and utility functions. We will here briefly review their definition, largely following Jukna’s exposition [7].

A k-ary Boolean function is a function

β : {0, 1}^{k} \to {0, 1}

, where we allow

k = 0

. The base of a Boolean circuit is given by a set

B = {β_{1}, \dots, β_{k}}

of Boolean functions. In this paper, we will restrict attention to the set of classical Boolean functions

B = {\neg, \land, \lor}

(“not”, “or”, and “and”, respectively), which is known to be functionally complete (i.e., sufficient to define any Boolean function).

Formally, a Boolean circuit (or straight line program) on n variables

x_{1}, \dots, x_{n}

over base B is given by a sequence of

t \geq n

gates

γ_{1}, \dots, γ_{t}

. The first n gates are given by the variables, that is,

γ_{1} = x_{1}, \dots, γ_{n} = x_{n}

, and are also referred to as input gates. Another subset of gates is singled out as the set of output gates. Boolean formula over variables

x_{1}, \dots, x_{n}

are thus represented by a Boolean circuit with only one output gate. Each subsequent gate

γ_{i}

is the application of a k-ary Boolean function

β_{i}

in the base B to k previous gates, that is,

γ_{i} = β_{i} (γ_{j_{1}}, \dots, γ_{j_{k}})

, where

j_{1}, \dots, j_{k} < i

. The variables take values in

{0, 1}

. Given values

α_{1}, \dots, α_{n}

in

{0, 1}

for the variables

x_{1}, \dots, x_{n}

, one can inductively associate each gate with a value

α (γ_{i})

such that

α (x_{i}) = α_{i}

, if

1 \leq i \leq n

, and

α (β_{i} (γ_{j_{1}}, \dots, γ_{j_{k}})) = β_{i} (α (γ_{j_{1}}), \dots, α (γ_{j_{k}})),

if

n < i \leq t

. A Boolean circuit

C

given by

γ_{1}, \dots, γ_{t}

on n variables

x_{1}, \dots, x_{n}

and with m output gates

γ_{j_{1}}, \dots, γ_{j_{m}}

then computes the function

ϕ_{C} : {0, 1}^{n} \to {0, 1}^{m}

that, on values

α = (b_{1}, \dots, b_{n})

for the input gates, yields

ϕ_{C} (α) = (α (γ_{j_{1}}), \dots, α (γ_{j_{m}}))

.

A Boolean circuit is commonly depicted as a directed acyclic graph

(V, E)

, with the gates as vertices, that is,

V = {γ_{1}, \dots, γ_{n}}

, and

(γ_{i}, γ_{j}) \in E

whenever

γ_{i} = β_{i} (γ_{j_{1}}, \dots, γ_{j_{k}})

and

γ_{j} = γ_{j_{ℓ}}

for some

1 \leq ℓ \leq n

. For examples of Boolean circuits, also see Figure 3 and Figure 4, below.

The size of a Boolean circuit given by gates

γ_{1}, \dots, γ_{t}

on variables

x_{1}, \dots, x_{n}

is given by the number

t - n

of its gates minus the input gates. It is a well-established fact that the problem of computing

ϕ_{C} (α)

for given values

α \in {0, 1}^{n}

for

C

’s variables, which is also known as the Circuit Value Problem or Circuit Evaluation Problem, is complete for

P

under uniform

{AC}_{0}

-reductions, and therefore can be computed in polynomial time (see in [8], p. 59).

3. Partial Order Games

We now introduce the framework of partial order games. The basic idea is that, as in extensive form games, the game contains a number of decision nodes, which are partioned among the players. However, play in the game is not defined by a game tree. Instead, partial order games have a binary dependence relation over decision nodes. This dependence defines the information available to a player when it makes a choice. If a decision node v for player i is dependent on decision nodes

v_{1}, \dots, v_{k}

, then this means that the information available to a strategy when making a choice at v is precisely the choices that were made at decision nodes

v_{1}, \dots, v_{k}

. To play a partial order game, a player must choose a selection of strategies (we call them vertex strategies), one for each of their decision nodes. A strategy for a decision node v must select a choice for that decision node taking as input the choices that were made for the decision nodes

v_{1}, \dots, v_{k}

upon which it is dependent. In this way, we have an explicit representation of the information available when making a choice, in contrast to the use of information sets in extensive form games. Utilities in partial order games are not associated with individual nodes (as in extensive form games), but derive from the total profile of actions that were made.

Our usual way of thinking about extensive form games is as players alternating to make moves, working their way down the game tree to a leaf node; upon reaching a leaf node, the game is over. Thinking about games in this way naturally gives rise to a temporal order over choices in the game: a choice is made for the root node first, and then successive nodes in the game tree. Partial order games are more abstract than this. They induce only a partial temporal ordering over decision nodes: while the choices for

v_{1}, \dots, v_{k}

must strictly precede the choice for v if v is dependent on

v_{1}, \dots, v_{k}

, it may well be that decision nodes are independent: if nodes v and

v^{'}

are in disconnected components of the dependence graph, then we can say nothing about their temporal order.

Let us consider an example.

Example 1.

Consider the three-person partial order game depicted in Figure 3, with the directed acyclic graph it is based on given on the left. To the right of each vertex v, we have indicated the player

i_{v}

and the actions they have at their disposal at v. For instance, at vertex

v_{2}

, player 2 can choose among the actions c and d. If all players make a choice at their respective vertices, an action profile results, which then is associated with a utility value. For instance, if player 1 chooses a at

v_{1}

and h at

v_{4}

, player 2 chooses c at

v_{2}

, and player 3 chooses f at

v_{3}

, the action profile

(a, c, f, h)

(also denoted by

a c f h

) results, yielding utility values of 1 to player 1, and 2 to players 2 and 3. The table on the right summarises the utilities the players get under the different action profiles that may be played. Note that in this game there are

2^{4}

possible action profiles.

The crucial feature of partial order games, however, is that the players can make their choices at a given vertex v depending on the choices the other players make at the parents of v. For instance, player 2 could adopt the strategy to play c at

v_{2}

, if player 1 chooses a at

v_{1}

, and plays d otherwise. Accordingly, player 2 has

2^{2} = 4

strategies at their disposal at

v_{2}

. As

v_{1}

does not have any parents, player 1 has only two strategies at their disposal at

v_{1}

, but they have

2^{2} = 4

strategies at

v_{4}

. Meanwhile, player 3 has

2^{4} = 16

strategies at

v_{3}

. A strategy profile specifies a strategy for each player at each of their vertices. Accordingly, in this game there are

2 \times 2^{2} \times 2^{4} \times 2^{2} = 2^{9} = 512

possible strategy profiles.

The number of strategy profiles clearly outnumber the number of action profiles. Still, every strategy profile induces a unique action profile. This relationship, however, is not generally injective, as multiple strategy profiles may induce the same action profile. Assume, for instance, that the players adopt the following strategies:

player 1 chooses a at $v_{1}$ ;
player 2 chooses c if player 1 plays a at v, and d, otherwise;
player 3 chooses e if player 1 chooses a at $v_{1}$ and player 2 d at $v_{2}$ , and f, otherwise; and
player 4 chooses h independently of whether player 2 chooses c or d at $v_{2}$ .

If this strategy profile is played, it can readily be seen that action profile

a c f h

results, which we have already seen gives utilities 1, 2, and 2 to the players 1, 2, and 3, respectively.

We now proceed to our formal definition of partial order games, which enables the application of Nash equilibrium as well as the formulation of a natural backwards induction procedure later on. As before, let

N = {1, \dots, n}

be a finite set of players. Then a partial order game on a directed acyclic graph

(V, E)

—the game’s dependency graph—associates each vertex v in V with a unique player in N, denoted by

i_{v}

, and a non-empty set of actions, denoted by

A_{v}

.

One player may be associated with multiple vertices, that is, it may very well be that

i_{u} = i_{v}

even if

v \neq u

. We let

V_{i}

denote the set of vertices associated with player i. We also say that player i is active at any vertex v in

V_{i}

. If E is also connected—that is, if

v E w

or

w E v

for all distinct v and w in V—we also refer to any partial order game based on it as a total-order game. As E is assumed to be acyclic, it follows that E is transitive in total order games. On the other end of the spectrum, we have partial order games on dependency graphs

(V, E)

with

E = Ø

. This class of games we will also refer to as empty-order games.

Given a subset of

W = {w_{1}, \dots, w_{k}}

of vertices, an action profile for W is a tuple

a_{W} = (a_{w_{1}}, \dots, a_{w_{k}})

specifying one action

a_{w_{j}}

for each vertex

w_{j}

. In our examples, we occasionally omit parentheses and commas, and write

a_{w_{1}} \dots a_{w_{k}}

for

(a_{w_{1}}, \dots, a_{w_{k}})

. The set of action profiles for W we denote by

A_{W}

. For the set

V = {v_{1}, \dots, v_{| V |}}

of all vertices, we write write

a = (a_{v_{1}}, \dots, a_{v_{| V |}})

for

a_{V}

and refer to

a

simply as an action profile. We denote the set of action profiles by

A = A_{v_{1}} \times \dots \times A_{v_{| V |}}

. For a set

V_{i} = {w_{1}, \dots, w_{k}}

of vertices at which a player i is active, we also write

a_{i} = (a_{w_{1}}, \dots, a_{w_{k}})

for

a_{V_{i}}

and refer to

a_{i}

as an action profile for player i. We let

A_{i}

denote the set of actions profiles for a player i. For action profiles

a_{W} = (a_{w_{1}}, \dots, a_{w_{k}})

and

b_{U} = (b_{u_{1}}, \dots, b_{u_{m}})

for disjoint sets W and U, we denote by

(a_{W}, b_{U})

the action profile

(a_{w_{1}}, \dots, a_{w_{k}}, b_{u_{1}}, \dots, b_{u_{m}})

for

W \cup U

. If

U = V \ W

we also write

(a_{W}, a_{- W})

for

(a_{W}, a_{U})

, and if

U = V \ V_{i}

, we also write

(a_{i}, a_{- i})

. Given a topological sorting

τ = (v_{1}, \dots, v_{| V |})

, we define a $τ$ -history as a sequence of actions

h = (a_{v_{1}}, \dots, a_{v_{k}})

in

A_{v_{1}} \times \dots \times A_{v_{_{k}}}

for some

1 \leq k \leq | V |

. We stipulate the empty sequence of actions, denoted by

()

, to be a

τ

-history as well. For an action profile

a = (a_{v_{1}}, \dots, a_{v_{| V |}})

, we say that

h = (b_{v_{1}}, \dots, b_{v_{k}})

is a history of $a$ if

(b_{v_{1}}, \dots, b_{v_{k}}) = (a_{v_{1}}, \dots, a_{v_{k}})

.

At each vertex v, player

i_{v}

can make their choice of action dependent on the actions chosen at the parents of v. A (conditional) strategy at a vertex v, or vertex strategy, is therefore a function

f_{v} : A_{y_{1}} \times \dots \times A_{y_{k}} \to A_{v},

where

y_{1}, \dots, y_{k}

are the parents of v. We say that

f_{v}

is unconditional if it is constant, that is, if

f_{v} (a_{1}, \dots, a_{k}) = f_{v} (b_{1}, \dots, b_{k})

for all action profiles

a = (a_{1}, \dots, a_{k})

and

b = (b_{1}, \dots, b_{k})

. In this case, for

a \in A_{v}

, we denote by

f_{v}^{a}

the unconditional strategy at vertex v that maps every action profile of v’s parents to action a. We will sometimes identify unconditional strategies

f_{v}

with the action

f_{v} (a_{1}, \dots, a_{k})

it specifies for each action profile

a = (a_{1}, \dots, a_{k})

and write

a = (a_{v_{1}}, \dots, a_{v_{| V |}})

for the profile of unconditional strategies

f^{a} = (f_{v_{1}}^{a_{v_{1}}}, \dots, f_{v_{| V |}}^{a_{v_{| V |}}})

. Note that if vertex v has no parents, then

A_{u_{1}} \times \dots \times A_{u_{k}} = {()}

, where

()

is the empty tuple. Accordingly,

f_{v}

determines a single choice among

A_{v}

, namely,

f_{v} (())

in this case. By

F_{v}

we denote the set of conditional strategies available at a vertex v.

By a strategy profile we then understand a profile

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

consisting of one conditional strategy for each vertex. For

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

a strategy profile and

W = {w_{1}, \dots, w_{k}}

a subset of vertices in V, we let

f_{W} = (f_{w_{1}}, \dots, f_{w_{k}})

denote the partial profile that is like

f

but restricted to

W \subseteq V

. Let

f^{1} = (f_{v_{1}}^{1}, \dots, f_{v_{| V |}}^{1})

through

f^{k} = (f_{v_{1}}^{k}, \dots, f_{v_{| V |}}^{k})

be k strategy profiles and

{X_{1}, \dots, X_{k}}

be a k-partition of the vertex set V. Then,

(f_{X_{1}}^{1}, \dots, f_{X_{k}}^{k})

denotes the strategy profile

g = (g_{v_{1}}, \dots, g_{v_{| V |}})

such that, for every vertex v in V, we have

g_{v} = f_{v}^{j}

, if

v \in X_{j}

.

A strategy for a player i is a profile

f_{V_{i}} = (f_{v_{1}}, \dots, f_{v_{k}})

of vertex-strategies, where

V_{i} = {v_{1}, \dots, v_{k}}

, is the set of vertices at which player i is active. We generally denote

f_{V_{i}}

by

f_{i}

, and the set of strategies available to player i by

F_{i}

. For strategy profiles

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

and

g = (g_{v_{1}}, \dots, g_{v_{| V |}})

and a player i, we also write

(f_{- i}, g_{i})

for the strategy profile

(f_{V \ V_{i}}, g_{V_{i}})

.

For every vertex v, a strategy profile

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

defines an action

a_{v}^{f}

, the evaluation of $f$ at v, recursively as follows:

\begin{matrix} a_{v}^{f} = \{\begin{matrix} f_{v} (()) & if depth (v) = 0, \\ f_{v} (a_{u_{1}}^{f}, \dots, a_{u_{k}}^{f}) & if depth (v) > 0 and u_{1}, \dots, u_{k} are the parents of v . \end{matrix} \end{matrix}

Observe that

a_{v}^{f}

is well defined because in a directed acyclic graph vertices with depth 0 exist and the parents of each vertex are of lower depth than the vertex itself. A strategy profile

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

can thus be seen to induce the action profile

a^{f} = (a_{v_{1}}^{f}, \dots, a_{v_{| V |}}^{f})

. We also say that strategy profile

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

sustains action profile

a = (a_{v_{1}}, \dots, a_{v_{| V |}})

if

a = a^{f}

. We occasionally denote by

(a_{W}, f_{U})

the action profile

a^{g}

where

g

is the strategy profile

(f_{W}^{a}, f_{U})

which specifies unconditional strategies for W. In other words, we have

(a_{v_{1}}, \dots, a_{v_{k}}, f_{v_{k + 1}}, \dots, f_{v_{| V |}})

denote the action profile that results if on

v_{1}

through

v_{v_{k}}

actions

a_{v_{1}}, \dots, a_{v_{k}}

are played, and at vertices

a_{v_{k + 1}}

through

a_{v_{| V |}}

the strategies

f_{v_{k + 1}}

through

f_{v_{| V |}}

.

Note the difference between actions and action profiles on the one hand, and strategies and strategy profiles on the other. Every strategy profile induces a unique action profile, whereas the same action profile may be induced by different strategy profiles. In an important sense, the action profiles are the outcomes of the game.

Therefore, we take the set of action profiles of a partial order game as its set of outcomes, over which the players’ preferences are defined. Each player’s preferences over the action profiles are given by a real-valued utility function

u_{i} : A \to R

, where we assume that player istrictly prefers action profile

a = (a_{v_{1}}, \dots, a_{v_{| V |}})

to action profile

b = (b_{v_{1}}, \dots, b_{v_{|} V |})

whenever

u_{i} (a) > u_{i} (b)

. We extend utility functions

u_{i}

to strategy profiles

f

, and write

u_{i} (f)

for

u_{i} (a^{f})

. To fix concepts and notation, we consider Example 1 once more.

Example 1 (cont’d).

Consider the game in Figure 3. At vertices

v_{1}

,

v_{2}

, and

v_{4}

, we have the following strategies:

\begin{matrix} F_{v_{1}} () & F_{v_{2}} a b & F_{v_{4}} c d \\ f_{v_{1}}^{1} a & f_{v_{2}}^{1} c c & f_{v_{4}}^{1} g g \\ f_{v_{1}}^{2} b & f_{v_{2}}^{2} c d & f_{v_{4}}^{2} g h \\ f_{v_{2}}^{3} d c & f_{v_{4}}^{3} h g \\ f_{v_{2}}^{4} d d & f_{v_{4}}^{4} h h \end{matrix}

At vertex

v_{3}

, moreover, we have

2^{4}

strategies, illustrating the exponential blowup that results from strategies being represented explicitly.

\begin{matrix} F_{3} a c a d b c b d & a c a d b c b d & a c a d b c b d & a c a d b c b d \\ f_{v_{3}}^{1} e e e e & f_{v_{3}}^{5} e f e e & f_{v_{3}}^{9} f e e e & f_{v_{3}}^{13} f f e e \\ f_{v_{3}}^{2} e e e f & f_{v_{3}}^{6} e f e f & f_{v_{3}}^{10} f e e f & f_{v_{3}}^{14} f f e f \\ f_{v_{3}}^{3} e e f e & f_{v_{3}}^{7} e f f e & f_{v_{3}}^{11} f e f e & f_{v_{3}}^{15} f f f e \\ f_{v_{3}}^{4} e e f f & f_{v_{3}}^{8} e f f f & f_{v_{3}}^{12} f e f f & f_{v_{3}}^{16} f f f f \end{matrix}

The (conditional) strategy profile

f = (f_{v_{1}}^{1}, f_{v_{2}}^{3}, f_{v_{3}}^{8}, f_{v_{4}}^{2})

, then, for instance, yields the evaluation

a^{f} = (a_{v_{1}}^{f}, a_{v_{2}}^{f}, a_{v_{3}}^{f}, a_{v_{4}}^{f}) = (a, d, f, h)

as

\begin{matrix} a_{v_{1}}^{f} & = f_{v_{1}}^{1} (()) = a & a_{v_{2}}^{f} & = f_{v_{2}}^{3} (a_{v_{1}}^{f}) = f_{v_{2}}^{3} (a) = d \\ a_{v_{3}}^{f} & = f_{v_{3}}^{8} (a_{v_{1}}^{f}, a_{v_{2}}^{f}) = f_{v_{3}}^{8} (a, d) = f & a_{v_{4}}^{f} & = f_{v_{4}}^{2} (a_{v_{2}}^{f}) = f_{v_{4}}^{2} (d) = h \end{matrix}

Accordingly, the utilities on this profile

f = (f_{v_{1}}^{1}, f_{v_{2}}^{3}, f_{v_{3}}^{8}, f_{v_{4}}^{2})

for these three players are therefore determined by the action profile

a d f h

. Hence,

u_{1} (f) = 0

,

u_{2} (f) = 0

, and

u_{3} (f) = 2

.

With the players N, their conditional strategies

F_{i}

, and their preferences

u_{i} : F \to R

over strategy profiles, a partial order game on a directed acyclic graph

(V, E)

can thus be seen to define immediately a game in normal-form

(N, {(F_{i})}_{i \in N}, {(u_{i})}_{i \in N})

(cf., Section 2.2, above). Accordingly, partial order games are directly amenable to game-theoretical analysis using the usual solution concepts, in particular, Nash equilibrium.

Conversely, every normal-form game

(N, {(S_{i})}_{i \in N}, {(u_{i})}_{i \in N})

can be seen as a partial order game on the same set N of players. Its dependency graph is then given by

(N, Ø)

, associating with vertex

i \in N

, player

i \in N

and strategy set

S_{i}

. Identifying each unconditional strategy

f_{i}

with

f_{i} (()) = s_{i}

in

S_{i}

, we adopt the utility functions

u_{i}

unaltered for the utility functions of the partial order game.

3.1. Concise Representations for Strategies, Profiles, and Utilities

The transformation from partial order games to normal-form games comes at the cost of an exponential blowup. Observe that in the definition of partial order games, the sets of action profiles and strategy profiles are defined implicitly. Given a partial order game on a directed acyclic graph

(V, E)

with a set of players N and actions

A = A_{v_{1}} \cup \dots \cup A_{v_{| V |}}

, the number of action profiles

a = (a_{v_{1}}, \dots, a_{v_{| V |}})

is bounded by

O (| A |^{| V |})

, and so is the number of unconditional strategy profiles

f^{a} = (f_{v_{1}}^{a_{v_{1}}}, \dots, f_{v_{| V |}}^{a_{v_{| V |}}})

. By contrast, the number of both conditional strategies and the number of conditional strategy profiles are bounded by

O (| A |^{(| A |^{| V |})})

. It is also worth observing that the size of both a conditional strategy f and a profile

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

of conditional strategies is bounded by

O (| A |^{| V |})

.

Even though the dependency graph allows a concise representation of the strategies and strategy profiles in a partial order game, they remain large objects. It is therefore desirable to have concise representations of action profiles, utility functions, strategies, and strategy profiles. To this end, we take recourse to Boolean circuits, a formalism well established in theoretical computer science (see Section 2.4). We first show how action profiles, utility functions, and strategy profiles can be represented concisely by Boolean circuits. We then prove two lemmas that will be useful for our later complexity proofs in Section 5 and Section 7.

When representing partial order games concisely, we assume that for distinct vertices v and w the action sets

A_{v}

and

A_{w}

are disjoint. Having assumed

A_{v}

to be non-empty for each vertex v, it thus follows that

| V | \leq | A |

. For ease of presentation, we also assume that

| A_{v} |

is an integer power of 2 for each vertex v. In this section, we assume that for each vertex v, the elements of set

A_{v}

are enumerated as

a_{0}, \dots, a_{| A_{v} | - 1}

, thus associating each action

a_{k}

with a unique local index k at v. Let

bin (k)

denote the binary representation of the local index k, using exactly

{log}_{2} | A_{v} |

digits. Thus, if

| A_{v} | = 16

, we have

bin (3) = 0011

.

Let v be a vertex with parents

u_{1}, \dots, u_{k}

. We then represent a strategy

f_{v} : A_{u_{1}} \times \dots \times A_{u_{k}} \to A_{v}

by a Boolean circuit

C_{f_{v}}

with

{log}_{2} | A_{u_{1}} | + \dots + {log}_{2} | A_{u_{k}} |

input gates and

{log}_{2} | A_{v} |

output gates. For an action profile

a = (a_{u_{1}}, \dots, a_{u_{k}})

in

A_{u_{1}} \times \dots \times A_{u_{k}}

and

a_{v}

an action in

A_{v}

, we then have that

f_{v} (a_{u_{1}}, \dots, a_{u_{k}}) = a_{v}

if and only if on input

bin (j_{1}) \dots bin (j_{k})

the circuit

C_{f_{v}}

evaluates to outputs

bin (i)

, where

j_{1}, \dots, j_{k}

and i, are the local indices of

a_{v_{1}}, \dots, a_{v_{k}}

, and

a_{v}

at their respective vertices. As

C_{f_{v}}

can be seen as combining

{log}_{2} | A_{v} |

Boolean functions in

{log}_{2} | A_{u_{1}} | + \dots + {log}_{2} | A_{u_{k}} |

variables and

{log}_{2} | A_{u_{1}} | + \dots + {log}_{2} | A_{u_{k}} | \leq | A |

, we may assume that the circuit

C_{f_{v}}

is of size at most exponential in

| A |

. More precisely, we may assume that the size of

C_{f_{v}}

is

Θ (\frac{2^{| A |}}{| A |})

[9,10].

Example 2.

For an example, see Figure 4, which depicts the Boolean circuit

C_{f_{v_{3}}^{14}}

for strategy

f_{v_{3}}^{14} : A_{v_{1}} \times A_{v_{2}} \to A_{v_{3}}

in Example 1, given by

f_{v_{3}}^{14} (a) = \{\begin{matrix} e & if a = b c, \\ f & otherwise . \end{matrix}

Recall that

A_{v_{1}} = {a, b}

,

A_{v_{2}} = {c, d}

, and

A_{v_{3}} = {e, f}

. Let the local indices of a, c, and e be given by 0, and those of b, d, and f by 1. As

| A_{v_{1}} | = | A_{v_{2}} | = | A_{v_{3}} | = 2

, any action profile

a \in A_{v_{1}} \times A_{v_{2}}

can be represented by two Boolean variables

x_{0}

and

x_{1}

, where the value of

x_{0}

represents the choice for a or b at

v_{1}

, the value of

x_{1}

the choice for c or d at

v_{2}

. For instance, the action profile

a d \in A_{v_{1}} \times A_{v_{2}}

can thus be represented by setting

x_{0}

to 0 and

x_{1}

to 1. For these values the circuit

C_{f_{v_{3}}^{14}}

evaluates to 1, which corresponds to

f_{v_{3}}^{14} (a d) = f

and as depicted in Figure 4.

Observe that an unconditional strategy

f_{v}^{a}

at a vertex v with parents

u_{1}, \dots, u_{k}

, which maps every profile

(a_{u_{1}}, \dots, a_{u_{k}})

invariably to action a in

A_{v}

, is represented by a Boolean circuit

C_{f_{v}^{a}}

of size at most polynomial in

| A |

. The circuit

C_{f_{v}^{a}}

will still have

\sum_{1 \leq m \leq k} {log}_{2} | A_{u_{m}} |

inputs and

{log}_{2} | A_{v} |

outputs, but will involve only one ⊥-gate and one ⊤-gate. Let j be the local index of a at v and

bin (j) = b_{1}, \dots, b_{{log}_{2} | A_{v} |}

. Then, for

1 \leq ℓ \leq {log}_{2} | A_{v} |

, connect the ⊥-gate with output

y_{ℓ}

, if

b_{ℓ} = 0

, and the ⊤-gate with output

y_{ℓ}

, if

b_{ℓ} = 1

.

Similarly, a rational-valued utility function

u_{i} : A_{v_{1}} \times \dots \times A_{v_{| V |}} \to Q

can be represented by a Boolean circuit

C_{u_{i}}

. This circuit will have

\sum_{v \in V} {log}_{2} | A_{v} |

inputs and outputs

y_{1}, \dots, y_{k}, z_{1}, \dots, z_{m}

. Moreover, for

(a_{v_{1}}, \dots, a_{v_{| V |}})

an action profile in

A_{v_{1}} \times \dots \times A_{v_{| V |}}

with local indices

j_{1}, \dots, j_{| V |}

at their respective vertices, we have

u_{i} (a_{v_{1}}, \dots, a_{v_{| V |}}) = \frac{n}{d + 1}

if and only if on input

bin (j_{1}), \dots, bin (j_{| V |})

the circuit

C_{u_{i}}

yields

bin (n)

for outputs

y_{1}, \dots, y_{k}

and

bin (d)

for outputs

z_{1}, \dots, z_{m}

(Observe that if

u_{i} (a_{v_{1}}, \dots, a_{v_{| V |}})

is a positive integer, the outputs

z_{1}, \dots, z_{m}

can be dispensed with, as

\sum_{x \in Ø} x = 0

and

d = \sum_{1 \leq j \leq m} z_{j} \cdot 2^{j}

). Again, as

| A_{u_{1}} | + \dots + | A_{u_{k}} | \leq | A |

, we may assume that any such circuit

C_{u_{i}}

will be of size at most exponential in

| A |

.

Example 3.

Figure 5 illustrates how a utility function

u_{i} : A_{v_{1}} \times A_{v_{2}} \times A_{v_{3}} \times A_{v_{4}} \to R

for a player i in the game of Example 1 is represented by a Boolean circuit. The utility function

u_{i}

is depicted on the right. We assume that the local indices of a, c, e, and g be given by 0, and those of b, d, f, and h by 1. As

| A_{v_{1}} | = | A_{v_{2}} | = | A_{v_{3}} | = | A_{v_{4}} | = 2

, an action profile

a

in

A_{v_{1}} \times A_{v_{2}} \times A_{v_{3}} \times A_{v_{4}}

is thus given by means of an assignment to four Boolean variables

x_{0}

,

x_{1}

,

x_{2}

, and

x_{4}

, representing the choices at the vertices

v_{1}

,

v_{2}

,

v_{3}

, and

v_{4}

, respectively. For instance, action profile

a c f h

is represented by setting variables

x_{0}

and

x_{1}

to 0, and variables

x_{2}

and

x_{3}

to 1, which can thus also be denoted by the binary string 0011. Evaluating the circuit for these values, we find that

y_{1}

is set to 0 and

y_{0}

to 1. This corresponds to

u_{i} (a c f h) = 1

and

bin (1) = 01

. Observe that the Boolean circuit can be evaluated in time polynomial in its size and that it is exponentially smaller than the explicit tabelling of the utility function on the right.

We conclude this section by showing two useful lemmas. The first establishes that, even if a conditional strategy profile

f = (f_{v_{1}}, \dots, s_{v_{| V |}})

is represented as a Boolean circuit, the action profile

a^{f} = (a_{v_{1}}, \dots, a_{v_{| V |}})

that

f

gives rise to, can be computed in polynomial time.

Lemma 1.

Let

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

be a conditional strategy profile represented by Boolean circuits

C_{f_{v_{1}}}, \dots, C_{f_{v_{| V |}}}

. Then, the action profile

a^{f} = (a_{v_{1}}^{f}, \dots, a_{v_{| V |}}^{f})

can be computed in polynomial time.

Proof Sketch.

Proceeding inductively, find for every vertex v the action

a_{v}^{f}

as follows. For vertices v of depth 0, the Boolean circuit

C_{f_{v}}

should give the local index in binary of action

a_{v}^{f}

at v, and therewith,

a_{v}^{f}

immediately. For a vertex v of a strictly positive depth with parents

u_{1}, \dots, u_{k}

, we may assume that the local indices in binary

bin (j_{u_{1}}), \dots, bin (j_{u_{k}})

of

a_{u_{1}}^{f}, \dots, a_{u_{k}}^{f}

can be computed in polynomial time. As the evaluation problem for Boolean circuits is solvable in polynomial time, the circuit

C_{f_{v}}

for inputs

bin (j_{u_{1}}), \dots, bin (j_{u_{k}})

can also be evaluated in polynomial time, providing us with the local index of

f_{v} (a_{u_{1}}^{f}, \dots, a_{u_{k}}^{f})

at v in binary, which gives us the result. □

Lemma 2.

Let

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

be a conditional strategy profile represented by Boolean circuits

C_{f_{v_{1}}}, \dots, C_{f_{v_{| V |}}}

, and

u_{i} : A_{v_{1}} \times \dots \times A_{v_{| V |}} \to Q

be a utility function represented by a Boolean circuit

C_{u_{i}}

. Then,

u_{i} (f)

can be computed in polynomial time.

Proof Sketch.

Thanks to Lemma 1, we can compute the action profile

a^{f} = (a_{v_{1}}^{f}, \dots, a_{v_{| V |}}^{f})

in polynomial time. Let

j_{1}, \dots, j_{| V |}

be the local indices of

a_{v_{1}}^{f}, \dots, a_{v_{| V |}}^{f}

at their respective vertices. Using

bin (j_{1}), \dots, bin (j_{| V |})

as inputs for the circuit

C_{u_{i}}

, we obtain the binary encoding of

u_{i} (a_{v_{1}}^{f}, \dots, a_{v_{| V |}}^{f})

. As the evaluation problem for Boolean circuits is solvable in polynomial time, this suffices for the result. □

4. Related Game-Theoretic Models

In this section, we explore the interrelations between partial order games and four related game-theoretic models: partial order Boolean games, Multi-Agent Influence Diagrams (MAIDs), concurrent games as event structures, and extensive-form games.

4.1. Boolean Games and Partial Order Boolean Games

A special class of partial order games, referred to as partial order Boolean games, was introduced in [1]. Partial order Boolean games extend Boolean games [11,12,13,14,15], where for a set

Φ

of propositional variables, each player i exercises unique control over the truth-values assigned to a subset

Φ_{i}

of variables and aims to satisfy a goal formula

γ_{i}

over

Φ

. Partial order Boolean games are Boolean games enriched with a dependency graph on

Φ

. To avoid confusion, we will write

v_{p}

for the propositional variable p when it occurs in the role of a vertex in the dependency graph. The player i associated with a vertex

v_{p}

coincides with the player controlling p in the Boolean game, and i can make their choice of truth-value for p depend on the values assigned to the propositional variables

q_{1}, \dots, q_{k}

if

v_{q_{1}}, \dots, v_{q_{k}}

are the parents of

v_{p}

in the dependency graph. In a Boolean game, such dependencies do not exist. Thus, Boolean games can be seen as a special class of partial order Boolean games. The class where the dependency graph does not have any edges but can be expressed by

(Φ, Ø)

.

Clearly, every partial order Boolean game

(N, Φ, Φ_{1}, \dots, Φ_{n}, γ_{1}, \dots, γ_{n}, D)

, where

D = (Φ, E)

is a dependency graph on

Φ

, defines a partial order game with the same set of players N and the same dependency graph, where player i is associated with vertex

v_{p}

if and only if

p \in Φ_{i}

. At vertex

v_{p}

, the player i associated with

v_{p}

has two actions at their disposal: setting p to true (p) and setting p to false (

\bar{p}

). We will assume that the local index of action p at vertex

v_{p}

is 1 and that that of action

\bar{p}

at

v_{p}

is 0.

In a partial order Boolean game, an individual strategy at vertex

v_{p}

, controlled by i, is represented by a so-called choice equation of the form

p : φ (q_{1}, \dots, q_{k})

, where

v_{q_{1}}, \dots, v_{q_{k}}

are the parents of

v_{p}

and

φ (q_{1}, \dots, q_{k})

is a propositional logic formula over

q_{1}, \dots, q_{k}

, with the interpretation that i sets p to true if

φ (q_{1}, \dots, q_{k})

evaluates to true given the decisions made for

q_{1}, \dots, q_{k}

at,

v_{q_{1}}, \dots, v_{q_{k}}

, and to false otherwise. Thus a choice equation

p : φ (q_{1}, \dots, q_{k})

defines an individual strategy

f_{v_{p}} : {q_{1}, {\bar{q}}_{1}} \times \dots \times {q_{k}, {\bar{q}}_{k}} \to {p, \bar{p}}

at vertex

v_{p}

. This strategy, moreover, can be represented by a Boolean circuit

C_{f_{v_{p}}}

for the Boolean function

β_{φ} : {0, 1}^{k} \to {0, 1}

expressed by the formula

φ

. We may assume that the circuit

C_{f_{v_{p}}}

is of size at most polynomial in the size of the choice equation

p : φ (q_{1}, \dots, q_{k})

and that it has exactly k inputs and one output. We may furthermore assume that the circuit

C_{f_{v_{p}}}

can be obtained in time polynomial in the size of

φ

from

p : φ (q_{1}, \dots, q_{k})

.

Similar considerations apply to the representation of the players’ utility functions. In partial order Boolean games, the preferences of each player i are represented by a goal formula

γ_{i}

in the propositional language over

Φ

. Every action profile

a = (a_{v_{p_{1}}}, \dots, a_{v_{p_{| Φ |}}})

of a Boolean partial order game defines a valuation

α_{a} : Φ \to {0, 1}

and yields player i utility 1 if

α_{a}

satisfies

γ_{i}

, and utility 0 otherwise. The Boolean function

β_{γ_{i}} : {0, 1}^{| Φ |} \to {0, 1}

expressed by

γ_{i}

can be represented by a Boolean circuit

C_{γ_{i}}

with at most

| Φ |

inputs and exactly one output. As at every vertex

v_{p}

, the assigned player has exactly two actions at their disposal and utility values are integers, we find that

C_{γ_{i}}

is exactly of the form suggested for the representation of players’ utilities in partial order games in the previous section. Moreover,

C_{γ_{i}}

will be of size at most exponential in

| Φ |

and of size polynomial in

γ_{i}

. We may furthermore assume that the circuit

C_{f_{γ_{i}}}

can be obtained from

γ_{i}

in polynomial time.

4.2. Multi-Agent Influence Diagrams (MAIDs)

Multi-agent Influence Diagrams (MAIDS) were proposed in [2,3] and later extended from a game-theoretic perspective in [16]. MAIDs trace their origins to Bayesian Networks (cf., [17]) and influence diagrams, a decision theoretic extension of Bayesian networks proposed by Howard and Matheson [18].

Like partial order games, MAIDs involve a finite set

A = {a_{1}, \dots, a_{n}}

of agents and are defined in directed acyclic graphs, where three different types of vertices are distinguished: a set of chance nodes

X

, a set of decision nodes

D_{a}

for each agent a in

A

, and a set of utility nodes

U_{a}

also for each agent a. In line with the Bayesian network literature, every vertex is taken to be a random variable X with possible values in a finite domain

dom (X)

. It is generally assumed that utility nodes cannot be parents of other nodes, and that their domain is a finite set of real numbers. (Hammond et al. [16] relax this condition, and also allow utility nodes to have outgoing edges.)

The values any variable in a MAID can take may depend on the values assumed by that variable’s parents in the DAG. This is very much in line with how a choice of action at a node may depend on the decisions taken at parent nodes in partial order games. In the MAIDs framework, the parents of a decision node X are denoted by

Pa (X)

and an instantiation, commonly denoted by

pa

, for

Pa (X) = {Y_{1}, \dots Y_{k}}

defines a value from the domain of each of these variables, that is,

pa \in dom (Y_{1}) \times \dots \times dom (Y_{k})

. A conditional probabilistic distributions (CPD) for a decision or utility node X now assigns a probability

P (X | pa)

for each instantiation

pa

of

Pa (X)

, and a decision rule defines a conditional probabilistic distribution for one particular decision variable. Sometimes it is required that CPDs for utility nodes be deterministic, arguing that all of the stochasticity should be subsumed into chance variables. This convention, however, is not universally adopted in the literature on MAIDS. We refer to the work in [19] for a good in-depth study. On this basis, a strategy

δ_{a}

for an agent a defines a conditional probability distribution for each decision variable D in

D_{a}

. A strategy profile is then a tuple

(δ_{a_{1}}, \dots, δ_{a_{n}})

of strategies, one for each agent. Given a strategy profile

(δ_{a_{1}}, \dots, δ_{a_{n}})

, the MAID reduces to a Bayesian network, and as such defines a joint probability distribution

P_{δ_{a_{1}}, \dots, δ_{a_{n}}}

over all of its variables. Accordingly, every agent a can be assigned a expected utility defined as follows:

E_{a} (δ_{a_{1}}, \dots, δ_{a_{n}}) = \sum_{U \in U_{a}} \sum_{x \in dom (U)} P_{δ_{a_{1}}, \dots, δ_{a_{n}}} (U = x) \cdot x .

With players, their strategies, and their utilities being defined, a MAID defines a strategic-form game, and is amenable to analysis by the usual game-theoretic solution concepts.

It is not hard to see how every partial order game can be seen as a MAID without chance variables, with ‘hidden’ utility nodes, and only allowing for deterministic decision rules. Formally, let

N = {1, \dots, n}

be the set of players;

V_{1}, \dots, V_{n}

be the sets of nodes assigned to the players 1 through n, respectively;

(V, E)

be a directed acyclic graph with

V = V_{1} \cup \dots \cup V_{n}

; and

u_{1}, \dots, u_{n}

be utility functions for the players 1 through n. Then, the partial order game defined from this can be seen as a MAID with set of agents

A = N

and decision variables

V_{1}, \dots, V_{n}

such that

dom (V_{v}) = A_{v}

for all vertices v in V. There are no chance variables, and one utility variable

U_{i}

for each player i that is a child of all decision nodes. The MAID’s partial order is then given by

(V^{'}, E^{'})

where

V^{'} = V \cup {U_{1}, \dots, U_{n}}

and

E^{'} = E \cup V \times {U_{1}, \dots, U_{n}}

. Thus, the instantiations of the parents of each utility variable, the set of instantiations of all decision variables, corresponds to the set of all action profiles of the partial order game. Each of these instantiations

(a_{v_{1}}, . . . a_{v_{| V |}})

, in turn, corresponds to a deterministic strategy profile

(δ_{1}, . . ., δ_{n})

in the MAID, which allows us to complete the model by setting

E_{i} ((δ_{1}, . . ., δ_{n})) = u_{i} ((a_{v_{1}}, . . . a_{v_{| V |}}))

.

4.3. True Concurrency and Games as Event Structures

Event structures [20] are the so-called “true concurrency” analogue of trees: just as transition systems unfold to trees, so some models of true concurrency, such as Petri nets and asynchronous transition systems, unfold to event structures [21]. Similarly, in the same way that sequential games can be represented by trees, certain concurrent games can be represented by event structures—where plays in this much more general setting determine partial instead of total orders of moves; cf., see in [22,23].

An event structure is a triple

(E, \leq, Con)

, consisting of a set E of events that are partially ordered by ≤, the causal dependency relation, and a nonempty consistency relation Con over finite subsets of E, which satisfy four conditions:

1.: ${e^{'} : e^{'} \leq e is finite for all e \in E}$ ,
2.: ${e} \in Con for all e \in E$ ,
3.: $Y \subseteq X \in Con implies Y \in Con$ , and
4.: $X \in Con and e \leq e^{'} \in X implies X \cup {e} \in Con$ .

The states of an event structures E are called configurations, denoted by

C (E)

, and consist of those subsets

x \subseteq E

which are both

consistent: $\forall X \subseteq x . X is finite implies X \in Con$ ; and
down-closed: $\forall e, e^{'} . e^{'} \leq e \in x implies e^{'} \in x$ .

Configurations can be finite or infinite. Concurrency, then, is naturally modelled as follows: two events, say e and

e^{'}

, which are both consistent and incomparable with respect to causal dependency are seen as concurrent, and assumed to be independent in that they can happen in parallel. In a game-theoretic context, it also means that such two events can be played or executed in parallel.

In the context of concurrent games as event structures, we consider only two players, named Player (the system) and Opponent (the environment), who own disjoint sets of events which they can play (execute). They do so asynchronously in an attempt to reach a state in their “winning set” of configurations. In these games, players are allowed to execute an event, say e, only if all events on which e causally depends have been executed. In this setting, events do not have a Boolean or otherwise interpretation; they are simply available actions that a player can execute—and informally are intended to represent observable events in a computer system. The Nash equilibrium of concurrent games as event structures have been studied in the past, and fully characterised in [23] for two-player general-sum games with players’ goals given by Borel sets of winning configurations. However, their main application is in the field of formal semantics for programming languages and logical systems; cf, see in [22,24,25].

4.4. Extensive-Form Games

Games in extensive-form are the canonical game-theoretic model to account for the strategic interactions that result if decisions are made in a prescribed order. Their use in the game-theoretic literature is ubiquitous. In this section, we show that partial order games constitute a concisely represented class of extensive-form games.

4.4.1. Partial Order Games as Extensive Games of Imperfect Information

Given a topological sorting of the decision nodes, there is a natural transformation of partial order games to extensive form games of imperfect information. Let the DAG of the partial order game be given by

(V, E)

, where

V = {v_{1}, \dots, v_{| V |}}

, and the utility function for each player i by

u_{i} : A_{v_{1}} \times \dots \times A_{v_{| V |}} \to R

. First, assume a topological order

τ

to linearise the DAG, which, for ease of presentation, we assume to be

τ = (v_{1}, \dots, v_{| V |})

. Figure 6 illustrates this construction for the partial order game in Figure 3 under topological order

τ

.

The extensive-form game of imperfect information representing the partial order game is then defined for the same set N of players. We define the game tree

(V^{'}, E^{'})

such that

V^{'}

is the set of all prefixes of terminal histories in

A_{v_{1}} \times \dots \times A_{v_{| V |}}

, with

()

being the root node. The player active at each history/vertex

h_{v_{k}} = (a_{v_{1}}, \dots, a_{v_{k}})

in

V^{'}

, we identify with the player that is active at

v_{k + 1}

in the partial order game. At each of these vertices/histories this player has

A_{v_{k + 1}}

as the set of actions to chose from. In particular,

i_{v_{1}}

is the player active at

()

and has

A_{v_{1}}

as action set. Accordingly, all prefixes/histories of the same length are assigned to the same player and each of them have the same action set. For

v, w \in V^{'}

, we then have

v E^{'} w

if and only if

w = v a

for some

a \in A_{v}

, as expected.

Every leaf/terminal history

h = (a_{v_{1}}, \dots, a_{v_{| V |}})

and player i we associate with the utility value

u_{i}^{h} = (a_{v_{1}}, \dots, a_{v_{| V |}})

, where

u_{i}

denotes player i’s utility function in the partial order game. Note that in our construction, every terminal history corresponds with a full action profile.

The definition of the information sets is crucial. For every internal vertex/history

h = (a_{v_{1}}, \dots, a_{v_{k}})

, we formally define these such that

[(a_{v_{1}}, \dots, a_{v_{k}})] = {(a_{v_{1}}^{'}, \dots, a_{v_{k}}^{'}) : a_{y}^{'} = a_{y} for all parents y of v_{k + 1}} .

Intuitively, a player can only distinguish a vertex/history

h = (a_{v_{1}}, \dots, a_{v_{k}})

where it is active from those vertices/histories of the same length that differ on the choice of action in at least one if its parents in the partial order game. All other vertices/histories of the same length belong to the same information set. For an example, consider Figure 6, where vertices

v_{4}^{2} = a c f

and

v_{4}^{5} = b c e

are in the same information set for player 1. This is because, in the underlying partial order game (depicted in Figure 3), vertex

v_{2}

is the only parent of

v_{4}

and the histories

a c f

and

b c e

both specify action c for

v_{2}

. Finally, we define the set of player i’s information sets as

I_{i} = {[v^{'}] : v^{'} \in V^{'} and i is active at v^{'}} .

To see that the extensive-form game defined through this transformation is strategically equivalent to the original partial order game, first consider an arbitrary vertex

v_{j}

along with its parents Y in the partial order game. Together with

v_{j}

itself, each profile

a_{Y}

in

A_{Y}

defines a unique information set for i defined as

I (v_{j}, a_{Y}) = {a^{'} \in A_{v_{1}} \times \dots \times A_{v_{j - 1}} : a_{Y} = a_{Y}^{'}} .

Let

θ

be the function that maps each pair

(v_{j}, a_{Y})

to information set

I (v_{j}, a_{Y})

.

Observe that defined thus,

θ

is both injective and surjective. We now find that we can extend

θ

to a function

\hat{θ}

that maps each conditional strategy

f_{i} = (f_{v_{i_{1}}}, \dots, f_{v_{i_{k}}})

in the partial order game to a strategy

{\hat{f}}_{i} : I_{i} \to A_{i}

for player i in the extensive-form game. To do so, define

{\hat{f}}_{i}

such that for each each information set

I \in I_{i}

with

θ^{- 1} (I) = (v_{j}, a_{Y})

{\hat{f}}_{i} (I) = f_{v_{j}} (a_{Y}) .

Observe that the extended function

\hat{θ}

is also bijective. Moreover, some reflection reveals that the full action profile

a^{f} = (a_{v_{1}}^{f}, \dots, a_{v_{| V |}}^{f})

determined by conditional strategy profile

f = (f_{1}, \dots, f_{| N |})

in the partial order game will be identical to the one determined by the strategy profile

\hat{f} = ({\hat{f}}_{1}, \dots, {\hat{f}}_{| N |})

in the extensive-form game. Having defined the players’ utilities in the extensive-form game as we did, we may conclude that the partial order game is strategically equivalent to the extensive game of imperfect information. Formally:

Proposition 1.

A partial order game and the extensive-form game obtained from it on basis of a given topological sorting, as described in this section, are strategically equivalent.

This result means that, with respect to game theoretic analysis, partial order games are in a precise technical sense (i.e., with respect to strategic equivalence), no more expressive than extensive form games: any scenario we can model with a partial order game can also be modelled with an extensive form game. However, this does not mean that partial order games are redundant. The translation from partial order game to extensive form game comes at the expense of a blow-up in the size of the game: if

| A_{v} | \geq 2

for every vertex v, then

V^{'} = Ω (2^{| V |})

. The practical upshot of this is that there are situations that we can capture using the partial order model that would be infeasible to capture with the extensive form. In addition, we argue that the partial order representation can in some cases be much more comprehensible than the extensive form: compare the partial order game in Figure 3 with its extensive form representation in Figure 6. Understanding what is going on (and in particular, the informational dependencies present) in the latter representation requires much more work than in the former.

Note that, for total-order games, the construction yields (finite) extensive-form games of perfect information (see Figure 7 and Figure 8 below). This can easily be appreciated by recalling that a total order game allows for only one topological order, say

(v_{1}, \dots, v_{V_{| V |}})

. Thus, for every

1 \leq j \leq | V |

we have that the parents of

v_{j}

are all vertices in

Y = {v_{1}, \dots, v_{j - 1}}

. However, then, obviously,

I (v_{j}, a_{Y}) = {a^{'} \in A_{v_{1}} \times \dots \times A_{v_{j - 1}} : a_{Y} = a_{Y}^{'}}

will be a singleton for every action profile

a_{Y}

, and the resulting game one of perfect information. As finite extensive-form games of perfect information can be solved by backwards induction (cf. e.g., in [4], Chapter 7), it follows as a corollary that total-order games always admit Nash equilibria in general.

Proposition 2.

Total-order games always have at least one Nash equilibrium.

We conclude this section with a remark about topological sortings. The DAG of a partial order game may not have a unique topological sorting over its nodes and, under the transformation described above, different topological sortings may lead to different extensive-form games. This is illustrated by Figure 6 and Figure 9, which both depict extensive-form games obtained from the partial order game in our first example in Figure 3. These extensive-form games we obtain using our transformation for the topological sortings

(v_{1}, v_{2}, v_{3}, v_{4})

and

(v_{1}, v_{2}, v_{4}, v_{3})

, respectively, and are clearly distinct. Yet, they are strategically equivalent. A formal proof that any two extensive-form games that are obtained using our transformations for different topological orders will be strategically equivalent is beyond the scope of this paper. It suffices to say that it is due to the so-called Interchange of Moves principle, which is one of the four Thompson transformations [26] and informally captures the idea that in extensive-form games of imperfect information “the order of play is immaterial if one player does not have any information about the other player’s action when making his choice” (see in [5], page 224).

4.4.2. Partial Order Games as a Concise Representation of Extensive-Form Games

As we noted above, the transformation of partial order games to extensive-form games presented in the previous section may give rise to an exponential blow-up. Reasoning conversely, this raises the claim that partial order games could be seen as providing a concise representation of extensive form games. In this section, we argue that if the utilities in partial order games are represented by Boolean circuits, then this claim is supported.

In Section 2.3, we recalled the well-known fact that transforming an extensive-form game to a normal-form game leads to an exponential blow-up. One reason for this is that, in extensive-form games, a player’s utilities are represented by an association of utility values to leaf nodes/terminal histories. One leaf node, however, may be reached by playing different strategy profiles, or even different action profiles, whereas in normal-form games utilities are specified for each strategy/action profile separately.

In partial order games, the players’ utilities are also modelled as an association of action profiles (not strategy profiles) and utility values. We find, however, that an exponential blow-up can be avoided when the players’ utility functions are represented by Boolean circuits.

The main idea behind this can be explained by means of an example. Consider again the extensive form in Figure 1, where each action has already been labelled with its local index in binary. For each internal vertex/non-terminal history v with action set

A_{v}

, we introduce

⌈ {log}_{2} | A_{v} | ⌉

Boolean variables

x_{1}^{v}, \dots, x_{⌈ {log}_{2} | A_{v} | ⌉}^{v}

. In the example, we introduced variables x for

v_{0}

, y and

y^{'}

for

v_{1}

, z for

v_{2}

, and

x^{'}

for

v_{3}

. A truth-value assignment

α

to these variables for all vertices then defines an action profile

a^{α} = (a_{v_{1}}^{α}, \dots, a_{v_{| V |}}^{α})

such that

a_{j}^{α}

is the action in

A_{v}

with numerical index

num (α (x_{1}), \dots, α (x_{⌈ {log}_{2} | A_{v} | ⌉}))

, for each vertex v. In our example, for instance, the assignment

α^{*}

that sets x and

y^{'}

to false and y, z, and

x^{'}

to true, corresponds to the action profile

(a_{0}, b_{10}, c_{1}, d_{1})

. With all utilities in our example being either 0 or 1, we can now associate a Boolean formula

φ_{u_{i}}

that characterises each player’s utility function, in the sense that assignment

α

satisfies

φ_{u_{i}}

if and only if

u_{i} (a^{α}) = 1

. In our example, such a formula

φ_{u_{1}}

representing player 1’s binary utility function could then for instance be obtained by first defining recursively a formula

φ_{v}

for each vertex v as follows. First, for each leaf node/terminal history

h

, we set

φ_{h}

=

⊤

, if

u_{i}^{h}

= 1, and

φ_{h}

=

⊥

, if ui=0

u_{i}^{h}

= 0. In our example, we thus have, for instance,

φ_{a_{0} b_{01}} = ⊥

and

φ_{a_{1} c_{1} d_{1}} = ⊤

. For every internal vertex/non-terminal history v, furthermore, we then set

φ_{v} = ⋁_{a \in A_{v}} (χ_{a} \land φ_{v a}),

where

χ_{a} = {\tilde{x}}_{1}^{v} \land \dots \land {\tilde{x}}_{⌈ {log}_{2} | A_{v} | ⌉}^{c}

characterises the local index k of a, that is

{\tilde{x}}_{j}^{v} = x_{j}^{v}

, if

b_{j} = 1

, and

{\tilde{x}}_{j}^{v} = \neg {\tilde{x}}_{j}^{c}

if

b_{j} = 0

, where

bin (k) = b_{1} \dots b_{⌈ {log}_{2} | A_{v} | ⌉}

. Then, set

φ_{i} = φ_{()}

. Thus, in our example we get, subsequently,

\begin{matrix} φ_{v_{3}} & = ({\bar{x}}^{'} \land ⊥) \lor (x^{'} \land ⊤) \\ φ_{v_{2}} & = (\bar{z} \land ⊥) \lor (z \land φ_{v_{3}}) \\ φ_{v_{1}} & = (\bar{y} {\bar{y}}^{'} \land ⊤) \lor (\bar{y} y^{'} \land ⊥) \lor (y {\bar{y}}^{'} \land ⊤) \lor (y y^{'} \land ⊥) \\ φ_{v_{0}} & = (\bar{x} \land φ_{v_{1}}) \lor (x \land φ_{v_{2}}) \end{matrix}

Finally, we obtain

φ_{u_{1}} = φ_{v_{0}}

. in our example, we now find, for instance, that assignment

α^{*}

does not satisfy

φ_{u_{i}}

and that

u_{i} (a^{α^{*}}) = 0

.

This procedure is general and can be applied to every extensive-form game. The crucial thing to observe is that each variable x occurs at most

2^{⌈ {log}_{2} | A_{v} | ⌉} = O (| A_{v} |)

times in

φ_{u_{i}}

. Therefore, the size of

φ_{u_{i}}

is still polynomial in the size of game and, thus, there is a polynomial-sized Boolean circuit

C_{u_{i}}

representing the Boolean function

β_{φ_{u_{i}}}

. Finally, any rational-valued utility function

u_{i} : A_{v_{1}} \times \dots \times A_{v_{| V |}} \to Q

can be represented by a linear combination of such circuits, and we may conclude that

u_{i}

can be represented by a Boolean circuit whose size is polynomial in the size of the extensive-form game, giving us the following lemma.

Lemma 3.

Let

u_{i} : S_{1} \times \dots \times S_{| N |} \to Q

be the utility function of a player i in an extensive-form game. Then,

u_{i}

can be represented by a Boolean circuit that is of a size polynomial in the size of the extensive-form game.

Recall that an extensive-form game and a partial order game are said to be strategically equivalent if they give rise to strategically equivalent normal-form games. We now find that, provided that utilities may be represented by Boolean circuits, every extensive-form game can be represented by a strategically equivalent partial order game without giving rise to more than an at most a polynomial blowup.

Proposition 3.

For every extensive-form game, there is a strategically equivalent partial order game whose size is at most polynomially larger than that of the extensive-form game.

Sketch of Proof.

Let an extensive form for players in a set N be based on a tree

(V, E)

. Let

I

be the information sets of this game. We construct a partial order game on the trivial DAG

(I, Ø)

, which is obviously polynomial in the size of the extensive-form game. We associate with each information set I in

I

the same player and the same action set

A_{I}

as in the extensive-form game. For every player i with

I_{i} = {I_{1}, \dots, I_{k}}

, we can define a bijection that maps every strategy

s_{i} : I_{i} \to A_{i}

for i in the extensive-form game to a strategy

{\hat{f}}_{i} = ({\hat{f}}_{I_{1}}, \dots, {\hat{f}}_{I_{k}})

for player i in the partial order game such that

{\hat{f}}_{I_{j}}

is the strategy that maps the empty sequence

()

to action

s_{i} (I_{j})

. By virtue of Lemma 3 we may assume the utility function of each player in the extensive-form game can be represented by a polynomially sized Boolean circuit. Accordingly, the the size of the whole partial order game constructed thus is polynomial in the size of the extensive-form game. Conclude by observing that the extensive-form game and the partial order game induce strategically equivalent normal-form games, as desired. □

Proposition 3 shows that every extensive-form game of perfect-information can be represented by a partial order game at the cost of an at most polynomial blow-up. In some cases, moreover, we find that partial order games may be exponentially smaller than any extensive-form game of perfect information that is strategically equivalent. Consider the following extensive-form game with n players, ordered from 1 to n. Every player has two actions, 0 and 1, at their disposal, but can make their choice dependent on the players that occur before them in the ordering. Each strategy profile

s

gives rise to a unique terminal history

h^{s} = (b_{1}, \dots, b_{n})

of actions in

{0, 1}

, where each player i plays

b_{i}

. Accordingly, there are exactly

{| {0, 1} |}^{n} = 2^{n}

terminal histories or leaves in this extensive-form game. Let the preferences for each player i at each of these terminal histories

h = (b_{1}, \dots, b_{n})

be given by the utility function such that

u_{i}^{h} = num (b_{1}, \dots, b_{n})

, where

num (b_{1}, \dots, b_{n})

is the numerical value of the sequence

b_{1}, \dots, b_{n})

conceived as an integer in binary. For instance,

num (01101) = 13

. Then, for each player, every terminal history or leaf yields a unique utility value. See Figure 7 for an illustration of this game for n = 5. To represent this game, an extensive-form game needs to have at least 2ⁿ leaf nodes to account for all the different payoffs a player may get, and thus is of size at least exponential in the number of players.

By contrast, one can represent this game as a partial order game on a transitive graph

(V, E)

, where

V = {v_{1}, \dots, v_{n}}

and

v_{i} E v_{j}

if and only if i < j. Let player i be assigned to vertex v_i and

A_{v_{i}} = {0, 1}

for all

1 \leq i \leq n

. For each player i, the utility function

u_{i} (b_{1}, \dots, b_{n}) = num (b_{1}, \dots, b_{n})

can clearly be represented by a Boolean circuit with V input variables, and as many output variables, with the mth input gate immediately leading to the mth output gate with no intermediate logic gates. See Figure 8 for an illustration of the partial order game representing the extensive-form game in Figure 7. Therefore, we have the following result establishing that partial order games can be seen as presenting a concise representation of extensive-form games. Moreover, this observation still holds if attention is restricted to games of perfect information.

Proposition 4.

There exist partial order games for which every strategically equivalent representation as an extensive-form game of imperfect information is at least exponentially as large.

5. Nash Equilibria

We saw in Section 2.2 how every partial order game defines a normal-form game. Accordingly, partial order games are amenable to game-theoretic analysis using the standard non-cooperative solution concepts that are available for normal-form games. In this section, we consider several complexity problems surrounding Nash equilibrium in partial order games.

We have the following lemma that will be useful for proving the complexity results in this section. It states that, if a player has a profitable deviation to a conditional strategy, then they also have a profitable deviation to an unconditional strategy. In other words, if a player has a best response, then they also have an unconditional best response (This is reminiscent of mixed strategies for (finite) normal-form games, where, due to the linearity of expected utility, a player having a best response implies their having a pure best response).

Lemma 4.

Let

f = (f_{1}, \dots, f_{n})

be a profile of conditional strategies. Then, if there is some strategy

g_{i}

for player i with

u_{i} (f_{- i}, g_{i}) > u_{i} (f)

, there also is an unconditional strategy

g_{i}^{'}

such that

u_{i} (f_{- i}, g_{i}^{'}) > u_{i} (f)

. Therefore,

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

is a Nash equilibrium if and only if

u_{i} (f) \geq u_{i} (f_{- i}, g_{i}^{'})

for all players i and unconditional strategies

g^{'}

for i.

Proof.

Assume that there is some strategy

g_{i}

for player i with

u_{i} (f_{- i}, g_{i}) > u_{i} (f)

. Let

v_{i_{1}}, \dots, v_{i_{k}}

be the vertices assigned to player i. Moreover, let

a^{(f_{- i}, g_{i})} = (a_{v_{1}}^{*}, \dots, a_{v_{| V |}}^{*})

. Now define

g_{i}^{'} = (g_{v_{i_{1}}}^{'}, \dots, g_{v_{i_{k}}}^{'})

such that for every

1 \leq j \leq k

, and every profile

(a_{u_{1}}, \dots, a_{u_{ℓ}})

, where

u_{1}, \dots, u_{ℓ}

are the parents of

v_{i_{j}}

,

g_{v_{i_{j}}}^{'} (a_{u_{1}}, \dots, a_{u_{ℓ}}) = g_{v_{i_{j}}} (a_{u_{1}}^{*}, \dots, a_{u_{ℓ}}^{*}) .

Thus,

g_{i}^{'}

is clearly an unconditional strategy for player i. Let

a^{(f_{- i}, g_{i}^{'})} = (a_{v_{1}}^{* *}, \dots, a_{v_{| V |}}^{* *})

. By a straightforward induction on

depth (w)

it can then easily be shown that

a_{w}^{* *} = a_{w}^{*}

for every vertex w. And thus,

u_{i} (f_{- i}, g_{i}^{'}) > u_{i} (f)

, giving us the result. □

The key point about this lemma is that unconditional strategies are small, in the sense that unconditional strategies can be represented by Boolean circuits whose size is polynomial in the set of actions (see Section 3.1, above). Thus, when we are considering whether a player has a beneficial deviation, we can without loss of generality restrict our attention to small strategies. This has implications for the complexity of the decision problems we consider.

The dependency graph provides a concise representation of the sets of conditional strategies available to the players. A single strategy

f_{v}

for the player i playing at vertex v, however, has to take into account all profiles

(a_{y_{1}}, \dots, a_{y_{k}})

in

A_{y_{1}} \times \dots \times A_{y_{k}}

, where

y_{1}, \dots, y_{k}

are the parents of v. The number of these profiles tends to be exponential in k, the number of parents, as in all non-trivial cases generally

| A_{v} | \geq 2

. Similarly, a naive representation of the player’s utilities for the different action profiles tends to be exponential in the number of actions available to the players in the game. However, as the utilities of players are represented by Boolean circuits, there is a straightforward polynomial transformation of Boolean games and Boolean partial order games to general partial order games. This enables us to leverage hardness results for Boolean and partial order Boolean games to obtain hardness results for general partial order games.

We first consider the decision problem of determining whether a given strategy profile is a Nash equilibrium for a given partial order game. Here, we assume that the utility function

u_{i}

of each player is represented by a Boolean circuit

C_{u_{i}}

and the strategy profile

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

is represented by a sequence of

| V |

Boolean circuits

C_{f_{v_{1}}}, \dots, C_{f_{v_{| V |}}}

.

is-nash
Given:	Partial order game G and strategy profile $f = (f_{v_{1}}, \dots, f_{v_{\| V \|}})$
Problem:	Is $f$ a Nash equilibrium of G?

We find that is-nash is intractable for partial order games.

Theorem 1.

is-nashis

coNP

-complete. The problem remains

coNP

-hard for empty-order games and total-order games.

Proof.

For membership in

coNP

, let partial order game G and conditional strategy profile

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

be given. We can guess a player i and a strategy

g_{i} = (g_{v_{1}}, \dots, g_{v_{m}})

, where

v_{1}, \dots, v_{m}

are the vertices assigned to i. By virtue of Lemma 4, we may assume that strategies

g_{v_{1}}, \dots, g_{v_{m}}

are all unconditional strategies. As we saw in Section 3.1, the Boolean circuits

C_{g_{v_{1}}}, \dots, C_{g_{v_{m}}}

can therefore each be assumed to be of size polynomial in

| A |

. Now, Lemma 2 allows us to find in polynomial time the utilities

u_{i} (f_{- i}, g_{i})

and

u_{i} (f_{- i}, f_{i})

. Then,

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

is not a Nash equilibrium if and only if

u_{i} (f_{- i}, g_{i}) > u_{i} (f_{- i}, f_{i})

, and so we may conclude that is-nash is in

coNP

.

For

coNP

-hardness, we reduce from is-nash for Boolean partial order games, which is known to be

coNP

-hard (see in [1], Proposition 1). Thus, let an instance of is-nash for Boolean partial order games be given by a Boolean game

(N, Φ, Φ_{1}, \dots, Φ_{| N |}, γ_{1}, \dots, γ_{| N |})

, a dependency graph

(Φ, E)

, and a strategy profile

f = (f_{1}, \dots, f_{| N |})

, where for each player i strategy

f_{i}

is given by a sequence of choice equations

p_{1} : φ_{1}

through

p_{m} : φ_{m}

, where

Φ_{i} = {p_{1}, \dots, p_{m}}

. In Section 4.1, we argued how this Boolean partial order game coincides with a partial order game with a slightly different representation of strategies and utilities. A similar remark concerns the strategy profile

f = (f_{1}, \dots, f_{| N |})

. As we found that the transformation of the representation of the game as a Boolean partial order game to the representation of the same game as a partial order game can be effected in polynomial time, we obtain our result. Moreover, because Boolean games are a special type of empty-order games, the problem remains hard for empty-order games as well.

To see that is nash also remains

coNP

-hard for total-order games, we adapt the proof for the is nash problem for Boolean games as presented in (Wooldridge et al. [15], Proposition 1). We reduce from the complement of satisfiability, the problem of determining if a given Boolean formula

φ

is satisfiable. To this end, let

φ

a Boolean formula over propositional variables

p_{1}, \dots, p_{m}

. We construct a total-order Boolean game with one player i controlling

p_{1}, \dots, p_{m}

along with an additional variable

p_{0}

. The dependency graph

(V, E)

is then given by

V = {v_{p_{0}}, \dots, v_{p_{m}}}

with relation E such that

p_{j} E p_{k}

if and only if

j < k

. Player i has as goal

γ_{i} = φ \land p_{0}

. Now, consider the strategy profile

f^{0} = (f_{v_{p_{0}}}^{0}, f_{v_{p_{1}}}^{0}, \dots, f_{v_{p_{m}}}^{0})

that sets all variables to false, that is,

f^{0}

is given by choice equations

p_{j} : ⊥

for all

0 \leq j \leq m

. Now observe that, as we are dealing with a one-player game, there is a natural surjection that maps each of player i’s strategies

f

to valuation

α_{a^{f}} : {p_{0}, \dots, p_{m}} \to {0, 1}

. It can then easily be appreciated that

f^{0}

is a Nash equilibrium in the game constructed if and only if

φ

is satisfiable, as desired. □

Another canonical problem is more general, in that it asks whether a partial order game has any Nash equilibria at all, as opposed to whether or not a specific strategy profile is a Nash equilibrium.

non-emptiness
Given:	Partial order game G
Problem:	Does G have a Nash equilibrium?

In view of Proposition 2, non-emptiness is vacuous for total-order games, as in this class of games Nash equilibria are guaranteed to exist. The problem is considerably more difficult, namely

NEXPTIME

-complete, for general partial order games. It also seems a fair conclusion to draw from this contrast that increase in computational complexity arises from the structure of the dependency graph.

Theorem 2.

non-emptinessis

NEXPTIME

-complete.

Proof.

A

NEXPTIME

algorithm to decide non-emptiness can be designed along the following lines. First, guess a strategy profile

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

. Given that strategies

f_{v}

are given by a Boolean circuit

C_{f_{v}}

, this can be achieved in time not more than exponential in

| A |

, the size of the set of actions. Second, to decide whether

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

is a Nash equilibrium, by virtue of Lemma 4, it suffices to check, for all players i and all unconditional strategies

g_{i} = (g_{v_{1}}, \dots, g_{v_{| V_{i} |}})

for i, that

u_{i} (f_{- i}, f_{i}) \geq u_{i} (f_{- i}, g_{i})

. For player i, there are

| A_{v_{1}} \times \dots \times A_{v_{| V_{i} |}} |

unconditional strategies which is upper bounded by

{| A |}^{| V |}

. Moreover, as we have seen in Section 3.1, each of these unconditional strategies can be represented by a sequence of Boolean circuits of size polynomial in

| A |

. Furthermore, the action profiles

a^{(f_{- i}, f_{i})}

and

a^{(f_{- i}, g_{i})}

can be computed in time exponential in

| A |

, by virtue of Lemma 1 and each circuit involved being at most exponential in the size of

| A |

. Finally, on the basis of Lemma 2, we can check in time polynomial in

| A |

the utilities for player i on action profiles

a^{(f_{- i}, f_{i})}

and

a^{(f_{- i}, g_{i})}

, and therefore also whether

u_{i} (f_{- i}, f_{i}) \geq u_{i} (f_{- i}, g_{i})

. Altogether, the algorithm runs in non-deterministic exponential time.

A proof of

NEXPTIME

-hardness of non-emptiness can be achieved by a reduction from the non-emptiness problem for Boolean partial order games. We rely here on the same direct reduction as in the proof of Theorem 1, above.

NEXPTIME

-hardness for Boolean partial order games was established by [1], giving us the result. □

If we restrict attention to empty-order games, however, non-emptiness has considerably lower computational costs, even though the problem still remains

Σ_{2}^{p}

-hard. Key to this result is the observation that for empty-order games all strategies are unconditional, and, thus, they can be represented by Boolean circuits of polynomial size (see Section 3.1).

Theorem 3.

For empty-order games,non-emptinessis

Σ_{2}^{p}

-complete.

Proof.

To see that non-emptiness is in

Σ_{2}^{p}

, recall that

Σ_{2}^{p} = {NP}^{coNP}

, that is, the set of problems that can be solved in polynomial time on a non-deterministic Turing machine with a

coNP

-oracle. Furthermore, recall that strategy profiles in empty-order games are unconditional and are of the form

f_{v} : {()} \to A_{v}

. Moreover, they can be represented by Boolean circuits of polynomial size (see Section 3.1, above). Accordingly, given an empty-order game, we can guess an unconditional strategy profile

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

, and consult the

coNP

-oracle to check whether

f

is a Nash equilibrium. Theorem 1 guarantees that the latter is feasible.

For

Σ_{2}^{p}

-hardness, recall that Boolean games constitute a subclass of empty-order partial order games.

Σ_{2}^{p}

-hardness then follows immediately from non-emptiness being

Σ_{2}^{p}

-hard for Boolean games (see in [13], Proposition 5). □

Recall that in partial order games, strategy profiles and action profiles are essentially different objects. As a natural counterpart to the is nash problem, we therefore now consider the decision problem whether a given action profile is sustained by a Nash equilibrium in a partial order game.

is nash actions
Given:	Partial order game G and action profile $a = (a_{v_{1}}, \dots, a_{v_{\| V \|}})$
Problem:	Is $a$ sustained by a Nash equilibrium?

In sharp contrast to

coNP

-completeness of is nash, we find that is nash actions is

NEXPTIME

-complete. From the perspective of computational complexity, is nash actions appears to be more kindred to the non-emptiness problem for partial order games. In this connection, it is worth observing that the proof of Bradfield et al. regarding the

NEXPTIME

-completeness of non-emptiness for Boolean partial order games in [1] relied on a reduction from dependency quantifier boolean formula game, which is defined as follows.

An instance of dqbfg is a tuple

〈 φ, X_{1}, X_{2}, Y_{1}, Y_{2} 〉

, where

φ

is a Boolean formula

φ

and

{X_{1}, X_{2}, Y_{1}, Y_{2}}

a partition of the propositional variables over which

φ

is defined. dqbfg then concerns the following game with three players—B (‘Black’),

W_{1}

(‘White 1’), and

W_{2}

(‘White 2’)—where B forms one team and

W_{1}

and

W_{2}

another team W. Player B chooses an assignment for the variables in

X_{1} \cup X_{2} = {x_{1}^{1}, \dots, x_{| X_{1} |}^{1}} \cup {x_{1}^{2}, \dots, x_{| X_{2} |}^{2}}

, player

W_{1}

for those in

Y_{1} = {y_{1}^{1}, \dots, y_{| Y_{1} |}^{1}}

, and player

W_{2}

for those in

Y_{2} = {y_{1}^{2}, \dots, y_{| Y_{2} |}^{2}}

. Player B chooses first, then

W_{1}

and

W_{2}

choose, on the understanding that

W_{1}

can only see the assignment B chooses for

W_{1}

and

W_{2}

only the assignment B chooses for

Y_{2}

. Team B aims to make

φ

true, whereas team W’s goal is to make

φ

false. If the overall assignment for

X_{1} \cup X_{2} \cup Y_{1} \cup Y_{2}

satisfies

φ

, then B wins, otherwise W. A positive instance of dqbfg is when team W has a winning strategy. dependency quantifier boolean formula game was shown to be

NEXPTIME

-complete by Hearn and Demaine in [27].

Theorem 4.

is nash actionsis

NEXPTIME

-complete.

Proof.

For membership in

NEXPTIME

, let

a = (a_{v_{1}}, \dots, a_{v_{| V |}})

be the action profile that is given as input. Recall from the proof of Theorem 2 that we can guess a strategy profile

f

and check whether it is a Nash equilibrium in exponential time, provided that

f

is represented by a Boolean circuit. By Lemma 1 we know we can additionally compute

a^{f}

in polynomial time and check whether

a^{f} = a

. As the latter can also be achieved in polynomial time, we obtain our result.

For

NEXPTIME

-hardness, we reduce from dependency quantifier boolean formula game (dqbfg). Given an instance

〈 φ, X_{1}, X_{2}, Y_{1}, Y_{2} 〉

of dqbfg we construct a partial order Boolean game with the same three players, with B controlling the variables in

X_{1} \cup X_{2} \cup {x_{0}}

, player

W_{1}

those in

Y_{1}

, and player

W_{2}

those in

Y_{2}

, where

x_{0}

is a “fresh” variable not in

X_{1} \cup X_{2} \cup Y_{1} \cup Y_{2}

. Let the players’ goals be given by

\begin{matrix} γ_{B} & = x_{0} \land φ & γ_{W_{1}} = γ_{W_{2}} & = ⊥ \end{matrix}

The dependency graph

(X_{1} \cup X_{2} \cup Y_{1} \cup Y_{2} \cup {x_{0}}, E)

is defined such that

(i): $x_{1} E y_{1}$ , for all $x_{1} \in X_{1}$ and $y_{1} \in Y_{1}$ ,
(ii): $x_{2} E y_{2}$ , for all $x_{2} \in X_{2}$ and $y_{2} \in Y_{2}$ ,
(iii): $x_{0} E z,$ for all $z \in X_{1} \cup X_{2} \cup Y_{1} \cup Y_{2}$ .

Now, consider the action profile

\bar{a} = ({\bar{x}}_{0}, {\bar{x}}_{1}^{1}, \dots {\bar{x}}_{| X_{1} |}^{1}, {\bar{x}}_{1}^{2} \dots, {\bar{x}}_{| X_{2} |}^{2}, {\bar{y}}_{1}^{2}, \dots {\bar{y}}_{| Y_{1} |}^{1}, {\bar{y}}_{1}^{2} \dots, {\bar{y}}_{| Y_{2} |}^{2}),

which sets all variables, including

x_{0}

, to false. Note that none of the three players win if

\bar{a}

is played. We are now in a position to prove that

〈 φ, X_{1}, X_{2}, Y_{1}, Y_{2} 〉

is a positive instance of dqbfg if and only if

\bar{a}

is sustained by a Nash equilibrium in the partial order Boolean game constructed.

First assume that

〈 φ, X_{1}, X_{2}, Y_{1}, Y_{2} 〉

is positive instance of dqbfg. Then, the white team has a joint winning strategy in the original dqbfg-game given by Boolean functions

β_{1}

and

β_{2}

on the variables

X_{1}

and

X_{2}

, respectively. Then, define strategies

f_{W_{1}}^{*}

and

f_{W_{2}}^{*}

for

W_{1}

and

W_{2}

, respectively, in the Boolean partial order game, that are given by the following choice equations for

i = 1, 2

and

j = 1, \dots, | Y_{i} |

:

y_{j}^{i} : (x_{0} \to φ_{β_{i}} (x_{1}^{i}, \dots, x_{| X_{i} |}^{i})) \land ({\bar{x}}_{0} \to ⊥)

As the dependency graph

(V, E)

respects the information dependencies of the dqbfg-instance, observe that

f_{W_{1}}

and

f_{W_{2}}

together embody a winning strategy to render

φ

false, if

x_{0}

is played. Let B’s unconditional strategy

f_{B}^{*}

be defined by the choice equations of the form

x_{j}^{i} : ⊥

, setting

x_{j}^{i}

to false for all for

i = 1, 2

and

j = 1, \dots, | X_{i} |

. Observe that, defined thus,

a^{f^{*}} = \bar{a}

. Also note that

f^{*} = (f_{B}^{*}, f_{W_{1}}^{*}, f_{W_{2}}^{*})

is a Nash equilibrium. As

x_{0}

will be set to false, all players lose when

f^{*}

is played, but none has an incentive to deviate either. Player B could only hope to win by setting at least

x_{0}

to true. If so, however,

W_{1}

and

W_{2}

are playing a winning strategy against B, dashing all the latter’s hopes to win after all. Players

W_{1}

and

W_{2}

will lose, no matter which strategy they adopt. As a consequence, they do not want to deviate either, and we may conclude that

f^{*}

is a Nash equilibrium sustaining action profile

\bar{a}

.

Finally, assume that

〈 φ, X_{1}, X_{2}, Y_{1}, Y_{2} 〉

is a negative instance of dqbfg. Then, team W does not have a winning strategy, meaning that for every strategy

f_{W} = (f_{W_{1}}, f_{W_{2}})

, player B has a best response

f_{B}^{'}

such that B wins under

f = (f_{W_{1}}, f_{W_{2}}, f_{B}^{'})

. Now, consider the partial order Boolean game and an arbitrary strategy profile

\bar{f}

with

a^{\bar{f}} = \bar{a}

. Some reflection reveals that under these circumstances, player B has a strategy

f_{B}^{'}

at their disposal setting

x_{0}

to true and incorporating a winning response to

{\bar{f}}_{W} = ({\bar{f}}_{W_{1}}, {\bar{f}}_{W_{2}})

. Therefore, player B has an incentive to unilaterally deviate from

\bar{f}

and play

f_{B}^{'}

instead. We may conclude that

\bar{f}

is not a Nash equilibrium, as desired. □

For the two extremal classes of empty-order games and total order games, is nash actions are less computationally demanding. More precisely, the problem is

coNP

-complete for empty-order games and

PSPACE

-complete for total-order games. The proof of the former statement is relatively straightforward, when one realises that in empty-order games all strategies are unconditional and that, consequently, there is a natural bijection between action profiles and strategy profiles.

Theorem 5.

For empty-order games,is nash actionsis

coNP

-complete.

Proof.

First recall that, in empty-order games, there is a natural bijection between action profiles and strategy profiles, mapping each action profile

a = (a_{v_{1}}, \dots, a_{v_{| V |}})

to strategy profile

f^{a} = (f_{v_{1}}^{a_{v_{1}}}, \dots, f_{v_{| V |}}^{a_{v_{| V |}}})

, where

f_{v}^{a_{v}} (()) = a_{v}

. Thus, all strategy profiles of an empty-order game are of this form. Moreover, for arbitrary action profiles

a = (a_{v_{1}}, \dots, a_{v_{| V |}})

and strategy profiles

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

, we have that

\begin{matrix} a = a^{f} & if and only if f_{v}^{a_{v}} = a_{v}, for all v \in V \\ if and only if f = f^{a} \end{matrix}

Moreover, recall that unconditional strategies are represented by polynomial-sized Boolean circuits. Theorem 1 established that is nash is

coNP

-complete for empty-order games. Altogether, it suffices to show that, for all action profiles

a = (a_{v_{1}}, \dots, a_{v_{| V |}})

of a empty-order game,

a is sustained by a Nash equilibrium if and only if f^{a} is a Nash equilibrium .

The “if”-direction is immediate. If

f^{a}

is a Nash equilibrium, then, by observing that

a^{f^{a}} = a

, action profile

a

is sustained by some Nash equilibrium, namely

f^{a}

. For the “only if”-direction, assume there is some strategy profile

f^{*}

with

a = a^{f^{*}}

. By our earlier observation we find that

f^{*} = f^{a}

. Hence,

f^{a}

is a Nash equilibrium, as desired. □

The proof of

PSPACE

-completeness of is nash actions for total-order games is considerably more involved, and we defer it to the very end of Section 7.2, where the necessary proof elements are in place. Still, we conclude this section with its statement.

Theorem 6.

For total-order games,is nash actionsis PSPACE-complete.

6. Backwards Induction

Backwards induction is the most fundamental technique for the analysis of extensive form games. The basic backward induction algorithm for extensive form games of perfect information runs in time polynomial in the size of the game tree, and computes Nash equilibrium strategy profiles (which are guaranteed to exist in games of perfect information). It is therefore very natural to ask whether approaches based on backward induction might work for partial order games. However, backward induction is not applicable in extensive form games of imperfect information, and as partial order games correspond to imperfect information games, it follows that the technique is not always applicable. This section introduces a backward induction procedure to find pure Nash equilibria in partial order games. Games in which the procedure is well defined—in the sense that the procedure produces at least one strategy profile—we denote as being fit for backwards induction. We relate this latter concept to an informational notion we refer to as scrutability. For games that are fit for backwards induction, we prove that our backwards induction procedure is guaranteed to produce a Nash equilibrium outcome.

6.1. Backwards Induction

Due to their acyclic nature, it would seem that a natural concept of backward induction procedure can straightforwardly be defined for partial order games. Our aim is thus to define a strategy profile

f^{*} = (f_{v_{1}}^{*}, \dots, f_{v_{| V |}}^{*})

that is obtained in the following fashion. One starts with a vertex v of maximal depth with set of parents

Y = {y_{1}, \dots, y_{k}}

, and one inspects the possible actions—rather than their strategies—the players active at the parents of v can play. Let

a_{Y} = (a_{y_{1}}, \dots, a_{y_{k}})

be any such an action profile, suppose i is the player active at v. Then, the action

a_{v}^{*} = f_{v}^{*} (a_{y_{1}}, \dots, a_{y_{k}})

selected by strategy

f_{v}^{*}

from

A_{v}

should maximise i’s utility against

a_{Y}

and all possible choices of action

b_{X}

at the vertices in

X = V \ (Y \cup {v})

, provided any such action

a_{v}^{*}

exists. Subsequently, with the strategies for the vertices of greatest depth thus fixed, the strategies in

f^{*}

should recursively find optimal strategies at vertices of lesser depth in a similar fashion.

In the case of partial order games, it is not strictly necessary to proceed recursively on the depth of the vertices. We can, instead, use any topological sorting

τ = (v_{τ_{1}}, \dots, v_{τ_{| V |}})

of the vertices that respects the graph and, starting with the vertex

v_{τ_{j}}

with maximal index

τ_{j} = | V |

, we iterate through

τ

until we reach the vertex with minimal topological index. In this section, we will develop this more general concept of backward induction for partial order games that employs a topological order.

Our backwards induction procedure defines recursively a strategy profile

f^{*} = (f_{v_{1}}^{*}, \dots, f_{v_{| V |}}^{*})

for each vertex v relative to a topological sorting

τ = (v_{τ_{1}}, \dots, v_{τ_{| V |}})

, and is formally defined as follows. Let v be a vertex with topological index

τ_{j}

and let i be the player to move at v. Let

Y = {y_{1}, \dots, y_{k}}

be the set of parents of v,

Z = {z_{1}, \dots, z_{m}}

be the set of vertices with a topological index strictly greater than

τ_{j}

, and

X = {x_{1}, \dots, x_{ℓ}}

be given by

V \ (Y \cup Z \cup {v})

. For every profile

a_{Y} = (a_{y_{1}}, \dots, a_{y_{k}})

, we define

f_{v}^{*} (a_{y_{1}}, \dots, a_{y_{k}})

as an action

a_{v}^{*}

in

A_{v}

such that, for all profiles

b_{X} = (b_{x_{1}}, \dots, b_{x_{ℓ}})

,

a_{v}^{*} \in \underset{a_{v} \in A_{v}}{arg max} u_{i} (b_{X}, a_{Y}, a_{v}, f_{Z}^{*}),

(1)

where the basis of this recursion is provided by the case where

Z = Ø

(Here we exploit the notation

b_{X}, a_{Y}, a_{v}, f_{Z}^{*}

to refer to the action profile that results if after

τ

-history

(b_{X}, a_{Y}, a_{v}

the strategy profile

f^{*}

is played over the vertices in Z. Formally,

b_{X}, a_{Y}, a_{v}, f_{Z}^{*}

is the action profile

a^{g}

for strategy profile

g = (f_{X}^{b_{X}}, f_{Y}^{a_{Y}}, f_{v}^{a_{v}}, f_{Z}^{*})

, where

(f_{X}^{b_{X}}, f_{Y}^{a_{Y}}, f_{v}^{a_{v}})

is a profile of unconditional strategies (also see page 9, above)). To illustrate this definition, we have the following example.

Example 4.

Consider the utility function for the game depicted in Figure 10, and fix a topological sorting

(v_{1}, v_{2}, v_{3}, v_{4})

. From inspection, it can easily be appreciated that the utility to player 1 is only dependent on their own choices at vertices

v_{1}

and

v_{4}

, and player 2’s at vertex

v_{2}

. At this moment, we can apply the backwards induction procedure as follows. Inspecting their utility function, we find that player 3 chooses as follows at vertex

v_{3}

:

\begin{matrix} f_{v_{3}} (a c) & = f & f_{v_{3}} (a d) & = e & f_{v_{3}} (b c) & = f & f_{v_{3}} (b d) & = f \end{matrix}

At vertex

v_{4}

, player 1, can set their strategy such that

f_{v_{4}} (c) = g

and

f_{v_{4}} (d) = h

, because they are indifferent to any choice of actions by the other players. Player 2 can have their strategy at

v_{2}

depend on player 1’s choice at

v_{1}

. If player 1 chooses a at

v_{1}

, then by playing c at

v_{2}

, action profile

a c f g

will result and player 2 will obtain utility 3. Whereas, by playing d, action profile

a d e h

will ensue, with a utility of 1 for player 2. Therefore,

f_{v_{2}} (a) = c

. A similar reasoning yields that, if player 1 chooses b at

v_{1}

, by playing c at

v_{2}

, action profile

(b, c, f, g)

results with utility 0 for player 2, whereas, by playing d, action profile

(b, d, f, h)

ensues with utility 2 for player 2. Accordingly,

f_{v_{2}} (b) = d

. Finally, by playing a at

v_{1}

, player 1 obtains utility 3, as action profile

(a, c, f, g)

would be played. Observe that by playing b at

v_{1}

, player 1 obtains utility 2, as in that case action profile

(b, d, f, h)

would result. Accordingly,

f_{v_{1}} (()) = a

. We thus obtain the strategy profile

f = (f_{v_{1}}^{1}, f_{v_{2}}^{2}, f_{v_{3}}^{12}, f_{v_{4}}^{2})

yielding action profile

(a, c, f, g)

as the backwards induction solution of this game.

Some reflection reveals that on total-order games the procedure defined thus mimics standard backwards induction on the corresponding extensive-form games of perfect information as described in Section 4.4. It is important to note, however, that, on general partial order games, our backwards induction procedure does not always yield a strategy profile

f^{*}

. This is because the outcome of a partial order game—and therefore also the players’ utilities as well as the possibility of finding a utility maximising action—need not be fully determined by the actions chosen at a given vertex v, at v’s parent vertices, and at the vertices with a larger topological index than v.

The outcome may also depend on the actions chosen at vertices of an equal or lesser depth that are not parents of the respective vertex. If so, it may happen that, at some stage of the procedure and at some vertex, no action can be singled out as an unequivocal optimal choice based on the choices at the parent nodes and those at the vertices with a greater topological index alone. This would cause the procedure to stall. More formally, this happens if for some vertex v, there are two different profiles

b_{X} = (b_{w_{1}}, \dots, b_{w_{ℓ}})

and

b_{X}^{'} = (b_{w_{1}}^{'}, \dots, b_{w_{ℓ}}^{'})

such that

{arg max}_{a_{v} \in A_{v}} u_{i} (b_{X}, a_{Y}, a_{v}, f_{Z}^{*})

and

{arg max}_{a_{v} \in A_{v}} u_{i} (b_{X}^{'}, a_{Y}, a_{v}, f_{Z}^{*})

are disjoint. The following example illustrates this point.

Example 5.

Consider again the game of Example 1, depicted in Figure 3. Let us first consider the strategies that player 3 can choose at vertex

v_{3}

. This player has to specify an action from among e and f, for each of the possible action profiles player 1 and 2 can choose at vertices

v_{1}

and

v_{2}

, respectively, that is, for the action profiles

a c

,

a d

,

b c

, and

b d

. Inspecting player 3’s utilities, player 3 should arguably choose

f_{v_{3}} (a c) = f

, as no matter what action player 1 chooses at

v_{4}

, player 3 is better off playing f than playing e. However, how to choose a value for

f_{v_{3}} (a d)

is not so obvious, as the optimal choice depends on whether player 1 chooses g or h at vertex

v_{4}

; this is something player 3 may not let their strategy depend upon as not

v_{3} E v_{4}

.

To formalise this issue, we say that a partial order game is fit for backwards induction under a topological sorting

τ = (v_{τ_{1}}, \dots, v_{τ_{| V |}})

if, for every vertex v, and every action profile

a = (a_{v_{1}}, \dots, a_{v_{| V |}})

,

⋂_{b_{X} \in A_{X}} \underset{a_{v} \in A_{v}}{arg max} u_{i} (b_{X}, a_{Y}, a_{v}, a_{Z}) \neq Ø,

where Y is the set of parents of v, Z the set of vertices with a topological index that is not smaller than that of v, and

X = V \ (Y \cup Z \cup {v})

. This ensures that, in Equation (1), the optimal action

a_{v}^{*}

is defined for every vertex v. At this point, it is worth observing that for total-order games, the set X will be empty and that they allow for only one topological sorting. It follows that total-order games vacuously satisfy the fit-for-backwards-induction condition under every topological sorting.

We are now in a position to show that our backwards induction procedure always yields a Nash equilibrium in games that are fit for backwards induction.

Theorem 7.

Let G be a partial order game that is fit for backwards induction under a topological order τ. Then, every strategy profile

f^{*} = (f_{v_{1}}^{*}, \dots, f_{v_{| V |}}^{*})

generated by the backwards induction procedure under τ is a Nash equilibrium.

Proof.

Let

f^{*} = (f_{v_{1}}^{*}, \dots, f_{v_{| V |}}^{*})

be a strategy profile that our backwards induction procedure yields for topological order

τ = (v_{τ_{1}}, \dots, v_{τ_{| V |}})

. Without loss of generality, we may assume that

τ_{j} = j

for all

1 \leq j \leq | V |

, that is,

v_{1}, \dots, v_{| V |}

are already topologically ordered. Assume for contradiction that

f^{*}

is not a Nash equilibrium, that is, there is some strategy

g_{i}

for some player i such that

u_{i} (f_{- i}^{*}, g_{i}) > u_{i} (f_{- i}^{*}, f_{i}^{*})

. Let

g = (f_{- i}^{*}, g_{i})

.

Now, there is at least one vertex

v_{j}

with

1 \leq j \leq | V |

that is controlled by player i with parents

Y = {y_{1}, \dots, y_{k}}

for which there is an action profile

x = (x_{v_{1}}, \dots, x_{v_{j - 1}})

in

A_{v_{1}} \times \dots \times A_{v_{j - 1}}

such that:

u_{i} (x_{v_{1}}, \dots, x_{v_{j - 1}}, a_{v_{j}}, f_{v_{j + 1}}^{*}, \dots, f_{v_{| V |}}^{*}) < u_{i} (x_{v_{1}}, \dots, x_{v_{j - 1}}, b_{v_{j}}, g_{v_{j + 1}}, \dots, g_{v_{| V |}}),

(2)

where

a_{v_{j}} = f_{v_{j}}^{*} (x_{y_{1}}, \dots, x_{y_{k}})

and

b_{v_{j}} = g_{v_{j}} (x_{y_{1}}, \dots, x_{y_{k}})

(To see this, let

a^{f^{*}} = (a_{v_{1}}, \dots, a_{v_{| V |}})

and

a^{g} = (b_{v_{1}}, \dots, b_{v_{| V |}})

. Then observe that there must be a smallest index

1 \leq j \leq | V |

such that

v_{j} \in V_{i}

with

a_{v_{j}} \neq b_{v_{j}}

and

a_{v_{m}} = b_{v_{m}}

for all

1 \leq m < j

, which provides us with the desired

v_{j}

). Now we may assume that j is the largest index with

v_{j} \in V_{i}

for which such an action profile, or

τ

-history,

x = (x_{v_{1}}, \dots, x_{v_{j - 1}})

can be found. Therefore,

u_{i} (x_{v_{1}}, \dots, x_{v_{j - 1}}, b_{v_{j}}, g_{v_{j + 1}}, \dots, g_{v_{| V |}}) \leq u_{i} (x_{v_{1}}, \dots, x_{v_{j - 1}}, b_{v_{j}}, f_{v_{j + 1}}^{*}, \dots, f_{v_{| V |}}^{*}) .

(3)

Combining inequalities (2) and (3), we obtain that

u_{i} (x_{v_{1}}, \dots, x_{v_{j - 1}}, a_{v_{j}}, f_{v_{j + 1}}^{*}, \dots, f_{v_{| V |}}^{*}) < u_{i} (x_{v_{1}}, \dots, x_{v_{j - 1}}, b_{v_{j}}, f_{v_{j + 1}}^{*}, \dots, f_{v_{| V |}}^{*}) .

Having assumed

(v_{1}, \dots, v_{| V |})

to be ordered topologically, the set

Y = {y_{1}, \dots, y_{k}}

of parents of

v_{j}

is a subset of

{v_{1}, \dots, v_{j - 1}}

. Let further

X = {v_{1}, \dots, v_{j - 1}} \ Y

and

Z = {v_{j + 1}, \dots, v_{| V |}}

. Recall that

a_{v_{j}} = f_{v_{j}}^{*} (x_{y_{1}}, \dots, x_{y_{k}})

. We then find that, for profile

x_{Y}

, there is some profile, namely

x_{X}

, such that

f_{v_{j}}^{*} (x_{y_{1}}, \dots, x_{y_{k}}) \notin \underset{a_{v_{j}}^{'} \in A_{v_{j}}}{arg max} u_{i} (x_{X}, x_{Y}, a_{v_{j}}^{'}, f_{Z}^{*}) .

It follows that our backwards induction procedure does not provide us with

f_{v_{j}}^{*}

, which contradicts our initial assumption regarding the profile

f^{*} = (f_{v_{1}}^{*}, \dots, f_{v_{| V |}}^{*})

. □

We also have the following lemma, which is a version of the one-stage-deviation property for partial order games, and which is an almost immediate consequence of the definition of our backwards induction procedure (also see in [28] (pages 108–110) and Osborne and Rubinstein [5] (page 98)). This property will prove to be of great help proving the computational complexity results in Section 7. Intuitively it says that to establish that a strategy profile

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

is a backwards induction solution, it suffices to look at deviations by a player from

f

at a single vertex

v_{k}

given a single history

h = (a_{v_{1}}, \dots, a_{v_{k}})

. The property is perhaps best understood by considering its contrapositive: strategy profile

f

is not a backwards induction solution if there is a single player who has an incentive to deviate from

f

at a single vertex

v_{k}

given a single history

h = (a_{v_{1}}, \dots, a_{v_{k}})

. Note that we here assume that the game in question is fit for backwards induction.

Lemma 5 (One-stage-deviation property).

Let the vertices of a partial order game be sorted according to a topological sorting

τ = (v_{1}, \dots, v_{| V |})

. Then, for games that are fit for backwards induction, profile

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

is a backwards induction solution for τ if and only if for every τ-history

(a_{v_{1}}, \dots, a_{v_{k - 1}})

with

1 \leq k \leq | V |

we have for all

b_{v_{k}} \in A_{v_{k}}

:

u_{i} (a_{v_{1}}, \dots, a_{v_{k - 1}}, f_{v_{k}}, f_{v_{k + 1}}, \dots, f_{v_{| V |}}) \geq u_{i} (a_{v_{1}}, \dots, a_{v_{k - 1}}, b_{v_{k}}, f_{v_{k + 1}}, \dots, f_{v_{| V |}}),

where i is the player assigned to

v_{k}

.

Proof.

First assume that

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

is a backwards induction solution, and consider an arbitrary

τ

-history

(a_{v_{1}}, \dots, a_{v_{k - 1}})

and equally arbitrary

b_{v_{k}}

in

A_{v_{k}}

. Let Y be the parents of

v_{k}

and

X = {v_{1}, \dots, v_{k - 1}} \ Y

. Then,

u_{i} (b_{X}, a_{Y}, f_{v_{k}} (a_{Y}), f_{v_{k + 1}}, \dots, f_{v_{| V |}}) \geq u_{i} (b_{X}, a_{Y}, b_{v_{k}}, f_{v_{k + 1}}, \dots, f_{v_{| V |}}),

(4)

for all profiles

b_{X}

. Now observe that inequality (4) holds for in particular for

b_{X} = a_{X}

.

For the “if”-direction assume for contraposition that

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

is not a backwards induction solution for

τ

. Then, there is a vertex

v_{k}

with minimal topological index, parents Y, and

X = {v_{1}, \dots, a_{k - 1}} \ Y

such that for some profile

a_{Y}

, we have for every

a_{v_{k}}

in

A_{v_{k}}

that there is some profile

b_{X}

such that

u_{i} (b_{X}, a_{Y}, a_{v_{k}}, f_{v_{k + 1}}, \dots, f_{v_{| V |}}) < u_{i} (b_{X}, a_{Y}, b_{v_{k}}, f_{v_{k + 1}}, \dots, f_{v_{| V |}}) .

(5)

At this point, consider any

a_{v_{k}}

in

A_{v_{k}}

and let

b_{X}

be the accompanying profile for which Equation (5) holds. We can conclude the proof by considering the

τ

-history

(b_{X}, a_{Y})

. □

6.2. Games of Adequate Information

In this section, we propose the class of adequate information games, which are guaranteed to be fit for backwards induction, and, thus, always have a Nash equilibrium. As an auxiliary concept we first introduce the notion of scrutability, an informational criterion specifying on which choices of which players a player can have its decision depend when choosing an action at a specific vertex.

6.2.1. Scrutability

We introduce scrutability as a technical term to reason about the strategic information that is available to the players at each stage of the backwards induction process. Intuitively, let v and w be a vertices at which, respectively, players i and j are active. Then, assuming that the game is fit for backwards induction, a vertex w is scrutable to v if j’s choices of action at w can make a difference as to the optimality of i’s choice of action at v when deciding on a backwards induction strategy at v. Thus, all vertices reachable from v are scrutable, as their backwards induction choices will already have been made. Moreover, the player active at v can have their choice of action depend on the choices of action at the parent nodes of v.

Formally, we say that a vertex w is scrutable to vertex v if either

(i): w is reachable from v, or
(ii): $w E v$ , that is w is one of v’s parents.

We refer to a vertex that is not scrutable to v as being inscrutable to v. Recall the reachability relation is reflexive, and, thus, vertex v is always scrutable to v itself. Observe that the relations of scrutability and inscrutability are not necessarily symmetric. For instance, if a vertex v is reachable from w, but w is not a parent of v, then v is scrutable to w, but not vice versa. If v is inscrutable to w, and w is also inscrutable to v, we say that v and w are independent.

We differentiate between a vertex being scrutable to a vertex and a vertex being scrutable to a player. For

V_{i} = {v_{1}, \dots, v_{m}}

player i’s vertices, we denote by

I_{v}

the set of vertices that are inscrutable to v, and by

S_{v}

the set of vertices that are scrutable to v. Furthermore, let

I_{i} = I_{v_{1}} \cup \dots \cup I_{v_{m}}

and

S_{i} = V \ I_{i}

denote the sets of vertices that are inscrutable and scrutable to player i, respectively. That is, if a vertex v is inscrutable to a player i, if v is inscrutable to at least one vertex at which i is active. Vertex v is scrutable to player i, if it is not inscrutable to i, that is, if v is scrutable to all vertices at which i is active. It may be worth observing that in total-order games all vertices will be scrutable from any other vertex.

Furthermore, we introduce a notion of scrutability that is relative to a topological sorting of the vertices. Therefore, let

τ = (v_{τ_{1}}, \dots, v_{τ_{| V |}})

be a topological sorting. Then, a vertex w with topological index

τ_{w}

is scrutable under $τ$ to vertex v with topological index

τ_{v}

, if either

(i): $τ_{w} \geq τ_{v}$ , or
(ii): $w E v$ , that is w is one of v’s parents.

Any vertex that is not scrutable under

τ

is referred to as being inscrutable under $τ$ . For

V_{i} = {v_{1}, \dots, v_{m}}

and topological sorting

τ = (v_{τ_{1}} \dots, v_{τ_{| V |}})

, we furthermore denote by

I_{v}^{τ}

the set of vertices that are inscrutable to v under

τ

, and by

S_{v}^{τ}

the set of vertices that are scrutable to v under

τ

.

Moreover, we define

I_{i}^{τ} = I_{v_{1}}^{τ} \cup \dots \cup I_{v_{m}}^{τ}

and

S_{i}^{τ} = V \ I_{i}^{τ}

. Note that the set of vertices scrutable to a player i need not include the set

V_{i}

of vertices where i is active.

Given a topological sorting

τ

and a vertex v, by Y, we will generally adhere to the following notational conventions. The parents of v we generally denote by Y, the set of vertices inscrutable to v by X, and the set of all vertices that are not parents of v but still scrutable to v by Z. The following example illustrates these concepts.

Example 6.

Consider the game depicted in Figure 10. Then, we have the following sets of scrutable and inscrutable vertices.

\begin{matrix} S_{v_{1}} & = {v_{1}, v_{2}, v_{3}, v_{4}} & I_{v_{1}} & = Ø & S_{1} & = {v_{2}, v_{4}} & I_{1} & = {v_{1}, v_{3}} \\ S_{v_{2}} & = {v_{1}, v_{2}, v_{3}, v_{4}} & I_{v_{2}} & = Ø & S_{2} & = {v_{1}, v_{2}, v_{3}, v_{4}} & I_{2} & = Ø \\ S_{v_{3}} & = {v_{1}, v_{2}, v_{3}} & I_{v_{3}} & = {v_{4}} & S_{3} & = {v_{1}, v_{2}, v_{3}} & I_{3} & = {v_{4}} \\ S_{v_{4}} & = {v_{2}, v_{4}} & I_{v_{4}} & = {v_{1}, v_{3}} \end{matrix}

For the topological sorting

τ = (v_{1}, v_{2}, v_{3}, v_{4})

, we get

\begin{matrix} S_{v_{1}}^{τ} & = {v_{1}, v_{2}, v_{3}, v_{4}} & I_{v_{1}}^{τ} & = Ø & S_{1}^{τ} & = {v_{2}, v_{4}} & I_{1}^{τ} & = {v_{1}, v_{3}} \\ S_{v_{2}}^{τ} & = {v_{1}, v_{2}, v_{3}, v_{4}} & I_{v_{2}}^{τ} & = Ø & S_{2}^{τ} & = {v_{1}, v_{2}, v_{3}, v_{4}} & I_{2}^{τ} & = Ø \\ S_{v_{3}}^{τ} & = {v_{1}, v_{2}, v_{3}, v_{4}} & I_{v_{3}}^{τ} & = Ø & S_{3}^{τ} & = {v_{1}, v_{2}, v_{3}, v_{4}} & I_{3}^{τ} & = Ø \\ S_{v_{4}}^{τ} & = {v_{2}, v_{4}} & I_{v_{4}}^{τ} & = {v_{1}, v_{3}} \end{matrix}

By contrast, under topological sorting

τ^{'} = (v_{1}, v_{2}, v_{4}, v_{3})

, we obtain

\begin{matrix} S_{v_{1}}^{τ^{'}} & = {v_{1}, v_{2}, v_{3}, v_{4}} & I_{v_{1}}^{τ^{'}} & = Ø & S_{1}^{τ^{'}} & = {v_{2}, v_{3}, v_{4}} & I_{1}^{τ^{'}} & = {v_{1}} \\ S_{v_{2}}^{τ^{'}} & = {v_{1}, v_{2}, v_{3}, v_{4}} & I_{v_{2}}^{τ^{'}} & = Ø & S_{2}^{τ^{'}} & = {v_{1}, v_{2}, v_{3}, v_{4}} & I_{2}^{τ^{'}} & = Ø \\ S_{v_{3}}^{τ^{'}} & = {v_{1}, v_{2}, v_{3}} & I_{v_{3}}^{τ^{'}} & = {v_{4}} & S_{3}^{τ^{'}} & = {v_{1}, v_{2}, v_{3}} & I_{3}^{τ^{'}} & = {v_{4}} \\ S_{v_{4}}^{τ^{'}} & = {v_{2}, v_{3}, v_{4}} & I_{v_{4}}^{τ^{'}} & = {v_{1}} \end{matrix}

For another example, consider Figure 11 along with topological order

τ = (v_{1}, v_{2}, v_{3}, v_{4})

. We then obtain for the vertices the following sets of scrutable and inscrutable vertices for vertices as well as for players:

\begin{matrix} S_{v_{1}}^{τ} & = {v_{1}, v_{2}, v_{3}, v_{4}} & I_{v_{1}}^{τ} & = Ø & S_{1}^{τ} & = {v_{1}, v_{2}, v_{3}, v_{4}} & I_{1}^{τ} & = Ø \\ S_{v_{2}}^{τ} & = {v_{2}, v_{3}, v_{4}} & I_{v_{2}}^{τ} & = {v_{1}} & S_{2}^{τ} & = {v_{2}, v_{4}} & I_{2}^{τ} & = {v_{1}, v_{3}} \\ S_{v_{3}}^{τ} & = {v_{1}, v_{2}, v_{3}, v_{4}} & I_{v_{3}}^{τ} & = Ø \\ S_{v_{4}}^{τ} & = {v_{1}, v_{2}, v_{4}} & I_{v_{4}}^{τ} & = {v_{3}} \end{matrix}

Observe that w being reachable from v implies

τ_{w} \geq τ_{v}

, in other words,

S_{v} \subseteq S_{v}^{τ}

for every topological order

τ

. Accordingly, plain scrutability is stronger than scrutability under a topological sorting. The following lemma lays down the precise formal interrelationship between scrutability and scrutability under a topological order.

Lemma 6.

Let i be a player of a partial order game and v a vertex controlled by i. Then, for all vertices w,

(i): w is scrutable to v if and only if w is scrutable to v under all topological orders τ,
(ii): w is inscrutable to v if and only if w is inscrutable to v under some topological order τ,
(iii): w is scrutable to i if and only if w is scrutable to i under all topological orders τ,
(iv): w is inscrutable to i if and only if w is inscrutable to i under some topological order τ.

Proof.

We prove part (i) first. Then, parts (ii) through (iv) follow by straightforward set-theoretic reasoning.

(i): For the “only if”-direction, first assume that w is scrutable to v and consider an arbitrary topological sorting $τ$ with topological indices $τ_{i}$ and $τ_{j}$ for v and w, respectively. Then, either $w E v$ or w is reachable from v, that is, $v E^{*} w$ . In the former case, we have immediately that w is scrutable to v under $τ$ . In the latter case, observe that then $τ_{i} < τ_{j}$ by definition of a topological sorting. It then also follows that w is scrutable to v under $τ$ .
For the “if”-direction, assume for contraposition that w is not scrutable to v. Then, neither $w E v$ nor w being reachable from v hold. In case v is reachable from w, let $τ$ be any topological order with $τ_{i}$ and $τ_{j}$ the topological indices for v and w. Then, $τ_{j} < τ_{i}$ , and it follows that w is not scrutable under $τ$ . In case v is not reachable from w, let $(V, E)$ be the DAG underlying the partial order game. Then, consider the edge set $E^{'} = E \cup {(w, v)}$ . The graph $(V, E^{'})$ must be acyclic because if $(V, E^{'})$ contained a directed cycle, there would be a simple path from v to w not involving the edge $(w, v)$ . Therefore, w would also be reachable from v in $(V, E)$ , a contradiction. Therefore, $(V, E^{'})$ is acyclic and has a topological sorting $τ$ . Observe that $τ$ is also a topological order for $(V, E)$ . Let $τ_{i}$ and $τ_{j}$ be the topologic al indices for v and w, respectively. As $w E^{'} v$ , we have $τ_{j} < τ_{i}$ and again we may conclude that w is not scrutable to v under $τ$ .
(ii): This part follows immediately from part (i), by observing the following equivalences:

$\begin{matrix} w is inscrutable to v \\ if and only if w is not scrutable to v \\ if and only if w is not scrutable to v under some topological order τ \\ if and only if w is inscrutable to v under some topological order τ \end{matrix}$
(ii): Let $v_{1}, \dots, v_{m}$ be the vertices assigned to player i, and let $τ_{1}, \dots, τ_{k}$ be the topological sortings for the game. From (ii) we obtain that $I_{v_{j}} = I_{v_{j}}^{τ_{1}} \cup \dots \cup I_{v_{j}}^{τ_{k}}$ for every $1 \leq j \leq m$ .
(iv): Therefore,

$\begin{matrix} I_{i} = ⋃_{1 \leq j \leq m} I_{v_{j}} = ⋃_{1 \leq j \leq m} I_{v_{j}}^{τ_{1}} \cup \dots \cup I_{v_{j}}^{τ_{k}} = ⋃_{1 \leq ℓ \leq k} I_{v_{1}}^{τ_{ℓ}} \cup \dots \cup I_{v_{m}}^{τ_{ℓ}} = ⋃_{1 \leq ℓ \leq k} I_{i}^{τ_{ℓ}}, \end{matrix}$

and it follows that w is inscrutable to i if and only if w is inscrutable to i under some topological sorting. Part (iii) then is an immediate consequence of de Morgan’s law:

$S_{i} = V \ I_{i} = V \ (I_{i}^{τ_{1}} \cup \dots \cup I_{i}^{τ_{k}}) = V \ I_{i}^{τ_{1}} \cap \dots \cap V \ I_{i}^{τ_{k}} = S_{v_{1}}^{τ_{1}} \cap \dots \cap S_{i}^{τ_{k}} = ⋂_{1 \leq ℓ \leq k} S_{i}^{τ_{ℓ}} .$

Therefore, w is scrutable to i if and only if i is scrutable to i under all topological sortings.

This concludes the proof. □

6.2.2. Games of Adequate Information

In this section, we define a class of partial order games, which we refer to as games of adequate information. We prove that games of adequate information are fit for backward induction, and thus, on account of Theorem 7, allow for Nash equilibria. For this, we require that each player at each vertex assigned to them is in a position to determine their optimal choices if they are given only the choices at their parents and those at the nodes with a greater topological index.

Formally, we introduce the concept of a utility function being of adequate information and a variant of the notion parametrised by a topological order

τ

. These concepts only differ in the vertices that are considered to be scrutable and inscrutable. Let i be a player with utility function

u_{i} : A_{v_{1}} \times \dots \times A_{v_{| V |}} \to R

. Without loss of generality, we assume that

V_{i} = {v_{1}, \dots, v_{m}}

. Let S be the set of vertices scrutable to i (under

τ

) and I the set of vertices inscrutable to i (under

τ

). Then, player i’s utility function

u_{i} : A_{V} \to R

is said to be of adequate information (under $τ$ ) or adequately informed (under $τ$ ), if there is a function

u_{i}^{0} : A_{S} \to R

such that for all profiles

a_{S}, a_{S}^{'} \in A_{S}

and

b_{I} \in A_{I}

u_{i} (a_{S}, b_{I}) \geq u_{i} (a_{S}^{'}, b_{I}) if and only if u_{i}^{0} (a_{S}) \geq u_{i}^{0} (a_{S}^{'}) .

If this is the case, we say that player i’s utility function

u_{i} : A \to R

(ordinally) reduces to

u_{i}^{0} : A_{S_{i}} \to R

.

A game of adequate information under $τ$ is, then, a partial order game in which each player’s utility function is of adequate information under

τ

. We also define a concept of adequate information that is irrespective of the topological sorting

τ

in the straightforward manner. Thus, a game of adequate information simpliciter is a partial order game in which each player’s utility function is of adequate information. Observe that total-order games constitute a subclass of adequate-information games, since such games do not allow for inscrutable vertices. To illustrate, we have the following example:

Example 7(Game of adequate information).

Consider the game depicted in Figure 10 again. Under topological sorting

τ = (v_{1}, v_{2}, v_{3}, v_{4})

, player 1 has scrutables

S_{1}^{τ} = {v_{2}, v_{4}}

and inscrutables

I_{1}^{τ} = {v_{1}, v_{3}}

. For player 1, observe that for all

a \in A_{v_{2}} \times A_{v_{4}}

and all

b, b^{'} \in A_{v_{1}} \times A_{v_{3}}

we have that

u_{1} (a, b) = u_{1} (a, b^{'})

. For instance,

u_{1} (c g, a e) = u_{1} (c g, a f) = 3

and

u_{1} (d g, a f) = u_{1} (d g, b f) = 0

. Accordingly, define

u_{1}^{0} : A_{v_{2}} \times A_{v_{4}} \to R

such that

\begin{matrix} u_{1}^{0} (c g) & = 3 & u_{1}^{0} (c h) & = 1 & u_{1}^{0} (d g) & = 0 & u_{1}^{0} (d h) & = 2, \end{matrix}

and find that

u_{1}

ordinally reduced to

u_{1}^{0}

and, hence, that player 1 is adequately informed in this game. For players 2 and 3, we saw that

S_{2}^{τ} = S_{3}^{τ} = {v_{1}, v_{2}, v_{3}, v_{4}}

and

I_{2}^{τ} = I_{3}^{τ} = Ø

. As some reflection reveals,

u_{2}

and

u_{3}

thus vacuously reduce to themselves. We therefore immediately see that their utility functions are of adequate information under τ as well.

Finally, consider once more the game depicted in Figure 11 along with the natural topological order

τ = (v_{1}, v_{2}, v_{3}, v_{4})

. Under τ, this game is of adequate information. As

I_{1}^{τ} = Ø

, this is easy to see for player 1. For player 2 we have

S_{2}^{τ} = {v_{2}, v_{4}}

and

I_{2}^{τ} = {v_{1}, v_{3}}

. Now, we can define

u_{2}^{0}

such that

\begin{matrix} u_{2}^{0} (c g) & = 1 & u_{2}^{0} (c h) & = 4 & u_{2}^{0} (d g) & = 8 & u_{2}^{0} (d h) & = 2 \end{matrix}

It then only requires an easy check to see that

u_{2}

ordinally reduces to

u_{2}^{0}

. For instance,

u_{2}^{0} (c g) < u_{2}^{0} (d g)

and also

\begin{matrix} u_{2} (c g, a e) & = 2 < 9 = u_{2} (d g, a e) & u_{2} (c g, a f) & = 3 \frac{1}{2} < 7 = u_{2} (d g, a f) \\ u_{2} (c g, b e) & = 2 < 7 = u_{2} (d g, b e) & u_{2} (c g, b f) & = 2 < 4 = u_{2} (d g, b f) \end{matrix}

By contrast, under topological order

τ^{″} = (v_{2}, v_{1}, v_{4}, v_{3})

, player 1’s utility function is not of adequate information. To see this, first observe that

S_{1}^{τ^{''}} = {v_{1}, v_{3}, v_{4}}

and

I_{1}^{τ^{''}} = {v_{2}}

. Now,

u_{1} (a e g, c) = 2 < 9 = u_{1} (a e h, c)

, whereas

u_{1} (a e g, d) = 5 > 3 = u_{1} (a e h, d)

. Had there been a function

u_{1}^{0} : A_{S_{1}^{τ^{''}}} \to R

such that

u_{1}

ordinally reduces to

u_{1}^{0}

, then both

u_{1}^{0} (a e g) < u_{1}^{0} (a e g)

and

u_{1}^{0} (a e g) > u_{1}^{0} (a e g)

, a contradiction.

The following result establishes that games of adequate information (under a topological sorting

τ

) are fit for backwards induction (under

τ

).

Theorem 8.

Let

τ = (v_{τ_{1}}, \dots, v_{τ_{| V |}})

be a topological sorting. Then,

(i): games of adequate information under τ are guaranteed to have a Nash equilibrium, which can be found using our backward induction procedure under τ, and
(ii): games of adequate information simpliciter are guaranteed to have a Nash equilibrium, which can be found using our backward induction procedure underanytopological sorting.

Proof.

Let

τ = (v_{τ_{1}}, \dots, v_{τ_{| V |}})

be an arbitrary topological sorting. For part (i), due to Theorem 7, it suffices to prove that games of adequate information under

τ

are fit for backward induction under

τ

. To this end, consider an arbitrary game of adequate information under

τ

and consider an arbitrary vertex v. Let i be the player assigned to v. Furthermore, let

I_{i}^{τ} = {w_{1}, \dots, w_{k}}

denote the set of vertices that are inscrutable to i under

τ

and

S_{i}^{τ} = {w_{k + 1}, \dots, w_{| V |}}

denote the set of vertices that are scrutable to i under

τ

. We let Y denote the set of parents of v, and Z the set of vertices with a strictly greater topological index than v. Then, set

X = V \ (Y \cup Z \cup {v})

and let

X = {x_{1}, \dots, x_{| X |}]}

. Observe that X is the set of vertices that are inscrutable to v under

τ

, and thus all vertices in X are inscrutable to i under

τ

, that is,

X \subseteq I_{i}^{τ}

.

By assumption, player i’s utility function

u_{i} : A \to R

is adequately informed under

τ

. In particular, we may assume that

u_{i}

ordinally reduces to a function

u_{i}^{0} : A_{S_{i}^{τ}} \to R

.

Now, consider an arbitrary action profile

a = (a_{1}, \dots, a_{| V |})

. Having assumed

A

to be finite, there must be an action

a_{v}^{*}

in

A_{v}

with

a_{v}^{*} \in \underset{a_{v}^{'} \in A_{v}}{arg max} u_{i} (a_{X}, a_{Y}, a_{v}^{'}, a_{Z}) .

Let

a^{*} = (a_{X}, a_{Y}, a_{v}^{*}, a_{Z})

.

Consider an arbitrary

a_{v}^{″}

in

A_{v}

, and let

a^{″} = (a_{X}, a_{Y}, a_{v}^{''}, a_{Z})

. Then,

u_{i} (a^{*}) \geq u_{i} (a^{''}) .

Having assumed that the game is of adequate information we may assume that there is a function

u_{i}^{0} : A_{S_{i}^{τ}} \to R

such that

u_{i}^{0} (a_{S_{i}^{τ}}^{*}) \geq u_{i}^{0} (a_{S_{i}^{τ}}^{''}) .

(∗)

Consider an arbitrary profile

b

in

A

. Then,

u_{i} (b_{I_{i}^{τ}}, a_{S_{i}^{τ}}^{*}) \geq u_{i} (b_{I_{i}^{τ}}, a_{S_{i}^{τ}}^{''})

. Now, either v is scrutable to i or v is inscrutable to i. If the latter,

I_{i}^{τ} = X \cup {v}

. Then,

u_{i} (b_{X}, a_{Y}, a_{v}^{*}, a_{Z}) = u_{i} (b_{X}, a_{Y}, a_{v}^{''}, a_{Z})

. With

a_{v}^{″}

having been chosen arbitrarily,

a_{v}^{*} \in {arg max}_{a^{'} \in A_{v}} u_{i} (b_{X}, a_{Y}, a_{v}^{'}, a_{Z})

. If the former,

I_{i}^{τ} = X

, and we find that

u_{i} (b_{X}, a_{Y}, a_{v}^{*}, a_{Z}) \geq u_{i} (b_{X}, a_{Y}, a_{v}^{''}, a_{Z})

. Again, we may conclude that

a_{v}^{*} \in {arg max}_{a^{'} \in A_{v}} u_{i} (b_{X}, a_{Y}, a_{v}^{'}, a_{Z})

. In either case, and with

b

having been chosen arbitrarily,

a_{v}^{*} \in ⋂_{b_{X} \in A_{X}} {arg max}_{a_{v} \in A_{v}} u_{i} (b_{X}, a_{Y}, a_{v}, a_{Z})

, which proves the case.

Part (ii) follows by an argument that runs along analogous lines as that for part (i), reasoning about the vertices that are inscrutable to i simpliciter rather than under a specific topological sorting

τ

. In this context, observe that, by virtue of Lemma 6, the set

I_{i}^{τ}

of vertices inscrutable to i under a topological sorting

τ

constitutes invariably a subset of the set

I_{i}

of vertices inscrutable to i simpliciter. □

7. Computing Backwards Induction Solutions for Partial Order Games

In this section, we explore a number of natural computational problems surrounding our backwards induction procedure for partial order games and games of adequate information. We first conduct an asymptotic analysis of our backwards induction procedure. We find that the procedure runs in exponential time in the general case, but that a considerable speed up is possible if the in-degree of the vertices or the number of their parents is bounded appropriately. Subsequently, we investigate the computational complexity of deciding whether a partial order game is fit for backwards induction, that is if the precondition for our procedure to produce a Nash equilibrium holds. In a similar vein, we also address the closely related question of whether a given partial order game is a game of adequate information. We find that both these problems are

coNP

-complete. Finally, we come to consider the problems of deciding whether a given strategy profile is a backwards induction solution and whether a given action profile is sustained by a backwards induction solution. We show that the latter problem is

NEXPTIME

-complete for the general case, but that it is

PSPACE

-complete for total-order games.

7.1. Asymptotic Complexity of Backwards Induction

Due to the compactness of the partial order game representation, our backwards induction procedure runs in time exponential in the number

| V |

of vertices. Let

v_{1}, \dots, v_{| V |}

be ordered according to the topological sorting

τ

under which backwards induction is conducted. Throughout this section, as we did before, for each vertex v, by Y we denote the parents of v, by Z the vertices with a greater topological index than v, and let

X = V \ (Y \cup Z \cup {v})

. Then, for each vertex

v_{j}

, where

1 \leq j \leq | V |

, with parents Y and

τ

-inscrutable vertices X we have to find, an action

a_{v_{j}}^{*}

in

A_{v_{j}}

such that, for i the active player at

v_{j}

,

a^{*} \in ⋂_{b_{X} \in A_{X}} \underset{a_{v_{j}} \in A_{v_{j}}}{arg max} u_{i} (b_{X}, a_{Y}, a_{v_{j}}, f_{Z}^{*}),

and abort if no such action exists. This requires for each profile

a_{Y}

in

A_{Y}

the computation of

| A_{v_{j}} | \cdot \prod_{u \in X} | A_{u} |

utility values

u_{i} (a_{X}, a_{Y}, a_{v_{j}}, a_{Z}^{*})

. Letting

\bar{A} = {max}_{v \in V} A_{v}

, this makes for a total number of utility computations of

\sum_{j = 1}^{| V |} \prod_{k = 1}^{j} | A_{k} | \leq \sum_{j = 1}^{| V |} \prod_{k = 1}^{j} | \bar{A} | = \sum_{j = 1}^{| V |} | \bar{A} |^{j} = \sum_{j = 0}^{| V |} | \bar{A} |^{j} - 1 = \frac{| \bar{A} |^{| V | + 1} - 1}{| \bar{A} | - 1} - 1 = O (| \bar{A} |^{| V |}) .

Assuming that the players’ utilities are represented by a Boolean circuit, each of these can be achieved in time polynomial in

| A |

. Following the definition, we can therefore compute a backwards induction solution in

O (| A |^{| V |})

time.

It is not hard to see that, if we restrict ourselves to partial order games of adequate information in which all vertices have at most

{log}_{2} (| A |)

parents, a backwards induction profile can be computed in time pseudo-polynomial in

| A |

. If, in addition to their having at most

{log}_{2} (| A |)

parents, every vertex v of a game of adequate information has at most two actions, that is,

| A_{v} | \leq 2

, then a backwards induction profile can be computed in time polynomial in

| A |

. Summarising, we have the following proposition.

Proposition 5.

Let

Y_{v}

denote the set of parents of a vertex v. For a partial order game of adequate information, a backwards induction profile can then be computed in time:

(i): $O (| A |^{| V |})$ , in the general case,
(ii): $O (| A |^{{log}_{2} (| A |)})$ , if $| Y_{v} | \leq {log}_{2} (| A |)$ for all vertices v,
(iii): $O (| A |)$ , if $| Y_{v} | \leq {log}_{2} (| A |)$ and $| A_{v} | \leq 2$ for all vertices v.

7.2. Computational Complexity of Backwards Induction

In this section, we investigate the natural computation problems of deciding whether a backwards induction solution exists in a given partial order game for a given topological order. Furthermore, we analyse how hard it is to decide whether a given strategy profile incorporates a backwards induction solution. Finally, recalling that action profiles and strategy profiles are essentially different objects in the setting of partial order games, we explore how computational complex it is to decide whether a given action profile is sustained by a backwards induction solution, that is, whether, for a given action profile

a

, there is a backwards induction solution

f

such that

a = a^{f}

. Throughout this section, we assume that strategies and utilities are concisely represented as Boolean circuits as outlined in Section 3.1.

First, we consider the following computational problem concerning the conditions under which a partial order game is fit for backwards induction. Formally:

fit for backwards induction
Given:	Partial order game G and a topological sorting $τ$ of its vertices
Problem:	Is G fit for backwards induction under $τ$ ?

For total-order games, fit for backwards induction is vacuous, as they are all fit for backwards induction. For general partial order games, however, we have the following.

Theorem 9.

fit for backwards inductionis

coNP

-complete.

Proof.

To see that fit for backwards induction is in

coNP

, let

τ

be a topological sorting of the game’s vertices. A certificate for a counterexample is then given by

(i): a vertex v, with $A_{v} = {a_{1}, \dots, a_{k}}$ , and X, Y, and Z the set of inscrutable vertices with a lower topological index, the set of v’s parents, and the vertices with a higher topological index than v, respectively;
(ii): a profile $a_{Y}$ in $A_{Y}$ ;
(iii): a profile $a_{Z}$ in $A_{Z}$ ;
(iv): a tuple $(a_{1}^{'}, \dots, a_{k}^{'})$ of k actions $a_{j}^{'}$ in $A_{k}$ ; and
(v): a tuple $(b_{X}^{1}, \dots, b_{X}^{k})$ of k profiles $b_{X}^{j}$ in $A_{X}$ .

It is easy to verify that such a certificate is still of polynomial size. Observe that, if

u_{i} (b_{X}^{j}, a_{Y}, a_{j}^{'}, a_{Z}) > u_{i} (b_{X}^{j}, a_{Y}, a_{j}, a_{Z})

holds for some

1 \leq j \leq k

, we may conclude that

a_{j} \notin ⋂_{b_{X} \in A_{X}} \underset{a_{v} \in A_{v}}{arg max} u_{i} (b_{X}, a_{Y}, a_{v}, a_{Z}) .

Therefore, such a certificate is a counterexample to the game being fit for backwards induction under

τ

if and only if

u_{i} (b_{X}^{j}, a_{Y}, a_{j}^{'}, a_{Z}) > u_{i} (b_{X}^{j}, a_{Y}, a_{j}, a_{Z})

for every

1 \leq j \leq k

. Observe that determining the sets X, Y, and Z can be achieved in polynomial time. Having assumed, moreover, that the utility function

u_{i}

is part of the input and represented by a Boolean circuits and evaluation problem for Boolean circuits is solvable in polynomial time, we find that this is a property that can be checked in polynomial time.

For

coNP

-hardness, we reduce the complement of satisfiability. Let

φ

be a propositional formula in the propositional variables

p_{1}, \dots, p_{m}

. Let q and r be two additional, fresh, propositional variables. We construct a partial order Boolean game with three players, player 1, player 2, and player 3 on a DAG

(V, E)

with

V = {v_{q}, v_{r}, v_{p_{1}}, \dots, v_{p_{m}}}

and E such that

(v, u) \in E

whenever

v = v_{q}

and

u = v_{p_{j}}

for some

1 \leq j \leq m

, or

v = v_{p_{j}}

for some

1 \leq j \leq m

and

u = v_{r}

. Furthermore, see Figure 12 for an illustration of the construction of such a DAG in the case

φ

is defined over five propositional variables. Let

v_{q}

be assigned to player 1, the vertices

v_{p_{1}}, \dots, v_{p_{m}}

to player 2, and

v_{r}

to player 3. Furthermore, assume that

γ_{1} = γ_{2} = ⊤

and

γ_{3} = φ \to (q \leftrightarrow r)

. Consider an arbitrary topological sorting

τ

of the vertices.

First, assume that

φ

is not satisfiable. Then, all players are indifferent between all outcomes, and it can easily be seen that the game is fit for backwards induction under

τ

.

Finally, assume that assignment

α : {p_{1}, \dots, p_{m}} \to {0, 1}

satisfies

φ

. Observe that

v_{q}

is inscrutable to

v_{r}

and has a lower topological rank under

τ

. Moreover,

v_{p_{1}}, \dots, v_{p_{m}}

are the parents of

v_{r}

. Now, let

a_{v_{p_{1}}}, \dots, a_{v_{p_{m}}}

be such that

a_{p_{j}} = p_{j}

if

α (p_{j}) = 1

, and

a_{p_{j}} = {\bar{p}}_{j}

if

α (p_{j}) = 0

. Furthermore, let

b_{v_{q}} = q

and

b_{v_{q}}^{'} = \bar{q}

, and

a_{v_{r}} = r

and

a_{v_{r}}^{'} = \bar{r}

. Then,

u_{3} (q, a_{v_{p_{1}}}, \dots, a_{v_{p_{m}}}, r) > u_{3} (q, a_{v_{p_{1}}}, \dots, a_{v_{p_{m}}}, \bar{r})

but

u_{3} (\bar{q}, a_{v_{p_{1}}}, \dots, a_{v_{p_{m}}}, r) < u_{3} (\bar{q}, a_{v_{p_{1}}}, \dots, a_{v_{p_{m}}}, \bar{r}) .

Accordingly, we may conclude that the game is not fit for backwards induction. □

The closely related decision problem of whether a given partial order game is of adequate information, can be formulated as follows.

adequate information
Given:	Partial order game G and topological sorting $τ$
Problem:	Is G a game of adequate information under $τ$ ?

Again, note that adequate information is vacuous for total-order games. For general partial order games, we find that the problem of adequate information is not easier than fit for backwards induction. The similarity of the two problems—and so also the closeness of the concepts of fitness of backwards induction and adequate information—is revealed by the closeness of their respective proofs of

coNP

-completeness.

Theorem 10.

adequate informationis

coNP

-complete.

Proof.

For membership in

coNP

, let a certificate for a counterexample consist of a player i, a pair of profiles

a_{S}

and

a_{S}^{'}

in

A_{S}

, and a pair of profiles

b_{I}

and

b_{I}^{'}

in

A_{I}

, where S is the set of vertices that are scrutable to i under

τ

and I the set of vertices that are inscrutable to i under

τ

. Observe that such a certificate is of polynomial size and that the sets S and I can be computed in polynomial time. Then, such a certificate

(a_{S}, a_{S}^{'}, b_{I}, b_{I}^{'})

defines a counterexample against the game being of adequate information whenever it is not the case that

u_{i} (a_{S}, b_{I}) \leq u_{i} (a_{S}^{'}, b_{I})

if and only if

u_{i} (a_{S}, b_{I}^{'}) \leq u_{i} (a_{S}^{'}, b_{I}^{'})

. Having assumed that the utility function

u_{i}

is part of the input and represented by a Boolean circuit and evaluation problem for Boolean circuits is solvable in polynomial time, we find that this is a property that can be checked in polynomial time (see Lemma 2).

The proof of

coNP

-hardness is by a reduction from the complement of satisfiability, and runs along analogous lines as the proof of

coNP

-hardness of fit for backwards induction. Thus, given a propositional formula

φ

in the variables

p_{1}, \dots, p_{k}

, construct the partial order Boolean game as in the proof of Theorem 9.

Then, if

φ

is not satisfiable, we find that for all three players, each action profile yields a utility of 1. It thus immediately follows that the game is of adequate information, as we can set, for instance,

u_{i}^{0} (a_{S}) = 1

for all players i with set S of scrutable vertices under

τ

.

For the opposite direction, assume that assignment

α : {p_{1}, \dots, p_{m}} \to {0, 1}

witnesses the satisfiability of

φ

. Observe that

v_{q}

is inscrutable to player 3 under

τ

, whereas

v_{q}, v_{p_{1}}, \dots, v_{p_{m}}

are scrutable to player 3. Now let

a_{v_{p_{1}}}, \dots, a_{v_{p_{m}}}

be such that

a_{p_{j}} = p_{j}

if

α (p_{j}) = 1

, and

a_{p_{j}} = {\bar{p}}_{j}

if

α (p_{j}) = 0

. Then,

u_{3} (q, a_{v_{p_{1}}}, \dots, a_{v_{p_{m}}}, r) > u_{3} (q, a_{v_{p_{1}}}, \dots, a_{v_{p_{m}}}, \bar{r})

but

u_{3} (\bar{q}, a_{v_{p_{1}}}, \dots, a_{v_{p_{m}}}, r) < u_{3} (\bar{q}, a_{v_{p_{1}}}, \dots, a_{v_{p_{m}}}, \bar{r}) .

Assuming that the game be of adequate information, there then should be a utility function

u_{3}^{0} : A_{S} \to R

such that from the former it would follow that

u_{3}^{0} (a_{v_{p_{1}}}, \dots, a_{v_{p_{m}}}, r) > u_{3}^{0} (a_{v_{p_{1}}}, \dots, a_{v_{p_{m}}}, \bar{r})

, whereas from the latter

u_{3}^{0} (a_{v_{p_{1}}}, \dots, a_{v_{p_{m}}}, r) < u_{3}^{0} (a_{v_{p_{1}}}, \dots, a_{v_{p_{m}}}, \bar{r})

, a contradiction. Accordingly, we may conclude that the game is not of adequate information. □

Using Theorems 7 and 8, we know that games that are fit for backwards induction, like games of adequate information, have Nash equilibria. Thus, Theorems 9 and 10 could be seen to prove

coNP

-completeness of weak versions of the non-emptiness problem for backwards induction. This still leaves the question of whether a given strategy profile is a backwards induction solution and of whether a given action profile is sustained by a backwards induction solution, which can be seen as variants of the non-emptiness problem for backwards induction. The first decision problem can be formulated formally as follows.

is backward induction
Given:	Partial order game G that is fit for backwards induction under topological sorting $τ$ and strategy profile $f = (f_{v_{1}}, \dots, f_{v_{\| V \|}})$
Problem:	Is $f$ a backwards induction solution of G under $τ$ ?

We find that is backward induction is

coNP

-complete. For the membership part, recall that Lemma 5 showed that our backwards induction procedure has the one-stage-deviation property. Let

τ

be a topological sorted and assume

(v_{1}, \dots, v_{| V |})

be sorted according to

τ

. The contrapositive of the one-stage-deviation property says that, if a profile

f

is not a backwards induction solution, there is one player, who, after some

τ

-history, wants to unilaterally deviate and play another action than the one prescribed by

f

, but otherwise has no incentive to change their strategy. This prepares the ground for the following result.

Theorem 11.

is backward inductionis

coNP

-complete. The problem remains

coNP

-hard for total-order games.

Proof.

For membership in

coNP

, one can design a non-deterministic algorithm along the following lines. Assume

v_{v_{1}}, \dots, v_{v_{| V |}}

is ordered according to

τ

. Guess a vertex

v_{k}

assigned to player i along with action profiles

a = (a_{X}, a_{Y}, a_{v_{k}}, a_{Z})

and

a^{'} = (a_{X}, a_{Y}, a_{v_{k}}^{'}, a_{Z}^{'})

(thus,

a_{X}^{'} = a_{X}

and

a_{Y}^{'} = a_{Y}

). Let Y denote the parents of

v_{k}

, Z the vertices with a greater topological index than

v_{k}

, and

X = V \ (Y \cup Z \cup {v_{k}})

. Then, check whether

(i): the actions $a_{v_{k}}, a_{v_{k + 1}} \dots, a_{v_{| V |}}$ are in accordance with $f = (f_{v_{1}}, \dots, f_{v_{| V |}})$ , that is, for every $k \leq m \leq | V |$ , we have $a_{v_{m}} = f_{v_{m}} (a_{Y_{m}})$ , where $Y_{m}$ is the set of parents of $v_{m}$ ;
(ii): the actions $a_{v_{k + 1}}^{'}, \dots, a_{v_{| V |}}^{'}$ are in accordance with $f = (f_{v_{1}}, \dots, f_{v_{| V |}})$ , that is, for every $k < m \leq | V |$ , we have $a_{v_{m}}^{'} = f_{v_{m}} (a_{Y_{m}}^{'})$ , where $Y_{m}$ is the set of parents of $v_{m}$ ; and
(iii): $u_{i} (a^{'}) > u_{i} (a)$ .

In virtue of Lemmas 1 and 2, each of these checks can be performed in polynomial time in the size of the input. Moreover, if all of these checks are positive, we have found a counterexample against

f

being a backwards induction solution.

For

coNP

-hardness, we reduce the complement of satisfiability. Given an Boolean formula

φ

on propositional variables

p_{1}, \dots, p_{k}

, we construct a partial order Boolean game

G_{φ}

with one player i with goal formula

γ_{i} = φ \land p_{0}

, where

p_{0}

is a “fresh” variable distinct from any

p_{1}, \dots, p_{k}

. Let the vertices

v_{p_{0}}, \dots, v_{p_{k}}

be ordered accordingly, that is,

p_{ℓ} E p_{m}

if and only if

ℓ < m

. Thus,

G_{φ}

is a totally-ordered Boolean game, and, consequently, also fit for backwards induction. Now, consider the strategy profile

f^{0} = (f_{v_{p_{0}}}^{0}, \dots, f_{v_{p_{k}}}^{0})

where each

f_{v_{p_{j}}}^{0}

is represented by the choice equation

p_{j} : ⊥

, that is,

f^{0}

unconditionally sets every propositional variable to false. Note that

f^{0}

yields player i a utility of 0, that is

u_{i} (f^{0}) = 0

.

Now, if

φ

is unsatisfiable, so is i’s goal

γ_{i}

, and all strategy profiles yield i the same payoff. Therefore,

f^{0}

is vacuously a backwards induction solution. If, on the other hand,

φ

is satisfiable, say in virtue of assignment

α

, let

f^{α} = (f_{v_{p_{0}}}, \dots, v_{p_{k}})

be the unconditional strategy profile represented by the choice equations

p_{0} = ⊤

and, for each

1 \leq m \leq k

,

p_{m} : ⊤

, if

α (p_{m}) = 1

, and

p_{m} : ⊥

, if

α (p_{m}) = 0

. It can then easily be verified that player i has

u_{i} (f^{α}) = 1

. Accordingly, player i has an incentive to deviate from

f^{0}

to

f^{α}

, indicating that

f^{0}

is not a Nash equilibrium. By Theorem 7, it then follows that

f^{0}

is not a backwards induction solution either. □

Finally, we come to consider the natural counter-part of the is nash actions problem for the backwards induction procedure, which can formally be stated as follows.

is backward induction actions
Given:	Partial order game G, topological sorting $τ$ , and action profile $a = (a_{v_{1}}, \dots, a_{v_{\| V \|}})$
Problem:	Is $a$ sustained by a backwards induction solution under $τ$ in G?

We find that backwards induction actions is

NEXPTIME

-complete for general partial order games. The hardness part of this result follows again from a reduction from dependency quantifier boolean formula game (dqbfg).

Theorem 12.

For partial order games,backward induction actionsis

NEXPTIME

-complete.

Proof.

A

NEXPTIME

algorithm to decide backward induction actions can be designed along the following lines. Given a partial order game G, a topological-ordering

τ = (v_{1}, \dots, v_{| V |})

of its vertices, and an action profile

a = (a_{v_{1}}, \dots, a_{v_{| V |}})

, first, guess a strategy profile

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

. Given that strategies

f_{v}

are given by a Boolean circuit

C_{f_{v}}

, this can be achieved in time not more than exponential in

| A |

, the size of the set of actions. Second, check whether

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

is a backwards induction solution. The inductive definition of a backwards induction solution suggests a procedure for how this can be achieved in exponential time. Finally, check whether action profile

a^{f}

as induced by strategy profile

f

coincides with the action profile

a

as it is given in the input. This can be achieved in time polynomial in the size of the Boolean circuit representing

f

. Altogether, the algorithm runs in non-deterministic exponential time.

For hardness, we reduce from dependency quantifier boolean formula game (dqfg) using a similar construction of a Boolean partial order game as in the proof of Theorem 4. Given an instance

〈 φ, X_{1}, X_{2}, Y_{1}, Y_{2} 〉

of dqfg with

X_{1} = {x_{1}^{1}, \dots, x_{| X_{1} |}^{1}}

,

X_{2} = {x_{1}^{2}, \dots, x_{| X_{2} |}^{2}}

,

Y_{1} = {y_{1}^{1}, \dots, y_{| Y_{1} |}^{1}}

, and

Y_{2} = {y_{1}^{2}, \dots, y_{| Y_{2} |}^{2}}

. Let

x_{0}

be a ‘fresh’ variable

x_{0}

not in

X_{1} \cup X_{2} \cup Y_{1} \cup Y_{2}

. We define a Boolean partial order game on dependency graph

(V, E)

such that

V = {x_{0}} \cup X_{1} \cup X_{2} \cup Y_{1} \cup Y_{2}

and where E is defined as follows:

(i): $x_{0} E z$ for all $z \in X_{1} \cup X_{2} \cup Y_{1} \cup Y_{2}$ ,
(ii): $x_{k}^{i} E x_{m}^{j}$ for all $x_{k}^{i} \in X_{i}$ and $x_{m}^{j} \in X_{j}$ with $i = j$ and $k < m$ , or $i \leq j$ ,
(iii): $x_{1} E x_{2}$ , for all $x_{1} \in X_{1}$ and $x_{2} \in X_{2}$ ,
(iv): $x_{1} E y_{1}$ , for all $x_{1} \in X_{1}$ and $y_{1} \in Y_{2}$ ,
(v): $x_{2} E y_{2}$ , for all $x_{2} \in X_{2}$ and $y_{2} \in Y_{2}$ .

Observe that under this definition on

X_{1} \cup X_{2}

, the relation E is complete and transitive. We extend E to a topological sorting

τ

such that, for instance,

τ = (x_{0}, x_{1}^{1}, \dots, x_{| X_{1} |}^{1}, x_{1}^{2}, \dots, x_{| X_{2} |}^{2}, y_{1}^{1}, \dots, y_{| Y_{1} |}^{1}, y_{1}^{2}, \dots, y_{| Y_{2} |}^{2}) .

The players of the Boolean game are the same as in the instance of dqbfg, where B controls

{x_{0}} \cup X_{1} \cup X_{2}

,

W_{1}

the variables in

Y_{1}

, and

W_{2}

those in

Y_{2}

. Their goals are given by

\begin{matrix} γ_{B} & = x_{0} \land φ & γ_{W_{1}} & = γ_{W_{2}} = ⊥ . \end{matrix}

Observe that all vertices are scrutable to player B under

τ

and that the white players have the same utility for every outcome. It therefore follows that the game constructed is of adequate information and hence fit for backwards induction.

We are now in a position to show that the profile

\bar{a} = ({\bar{x}}_{0}, {\bar{x}}_{1}^{1}, \dots {\bar{x}}_{| X_{1} |}^{1}, {\bar{x}}_{1}^{2} \dots, {\bar{x}}_{| X_{2} |}^{2}, {\bar{y}}_{1}^{2}, \dots {\bar{y}}_{| Y_{1} |}^{1}, {\bar{y}}_{1}^{2} \dots, {\bar{y}}_{| Y_{2} |}^{2}),

which sets all variables, including

x_{0}

, to false, is sustained by a Nash equilibrium in

G^{'}

if and only if

〈 φ, X_{1}, X_{2}, Y_{1}, Y_{2} 〉

is a postive instance of dqbfg.

First, assume

〈 φ, X_{1}, X_{2}, Y_{1}, Y_{2} 〉

is a postive instance of dqbfg. Then, the white team has a joint winning strategy in the original dqbfg-game given by Boolean functions

β_{1}

and

β_{2}

on the variables

X_{1}

and

X_{2}

, respectively. Next, define strategies

f_{W_{1}}^{*}

and

f_{W_{2}}^{*}

for

W_{1}

and

W_{2}

in the Boolean partial order game that are given by the following choice equations for

i = 1, 2

and

j = 1, \dots, | Y_{i} |

,

y_{j}^{i} : (x_{0} \to φ_{β_{i}} (x_{1}^{i}, \dots, x_{| X_{i} |}^{i})) \land ({\bar{x}}_{0} \to ⊥)

As the dependency graph

(V, E)

respects the information dependencies of the dqbfg-instance, observe that

f_{W_{1}}

and

f_{W_{2}}

together embody a winning strategy to render

φ

false if

x_{0}

is played. Let B’s unconditional strategy

f_{B}^{*}

be defined by the choice equations of the form

x_{j}^{i} : ⊥

, setting

x_{j}^{i}

to false for all for

i = 1, 2

and

j = 1, \dots, | X_{i} |

. Observe that for the strategy profile

f^{*} = (f_{B}^{*}, f_{W_{1}}^{*}, f_{W_{2}}^{*})

, we have that

a^{f^{*}} = \bar{a}

, as desired. As

W_{1}

and

W_{2}

will not have their goal achieved under any action profile, their strategies trivially conform with the backwards induction procedure. Observe furthermore that B does not get their goal satisfied no matter which strategy plays against

f_{W_{1}}^{*}

and

f_{W_{2}}^{*}

. If B sets

x_{0}

to false, their goal is not satisfied in a self-defeating fashion. By setting

x_{0}

to true, B will induce

W_{1}

and

W_{2}

to play a winning strategy rendering

\neg φ

false and therewith

γ_{B}

as well. This in particular holds for any backwards induction strategies B has at their disposal, and it follows that

f^{*}

is a backwards induction solution sustaining

\bar{a}

.

For the opposite direction, assume that

〈 φ, X_{1}, X_{2}, Y_{1}, Y_{2} 〉

is a negative instance of dqbfg. Analogously to the proof of Theorem 4, it can then be shown that the constructed partial order Boolean game does not have a Nash equilibrium sustaining

\bar{a}

. In virtue of the contrapositive of Theorem 7, there is no backwards induction solution sustaining

\bar{a}

, giving us the result. □

The computational complexity of backward induction actions reduces considerably when we restrict attention to total-order games. Still, the problem is

PSPACE

-complete for this class of games. To prove this result, we first introduce some auxiliary notation and terminology.

Recall that if the dependency graph

(V, E)

of a total-order game is such that

v_{0} E \dots E v_{| V |}

, then

τ = (v_{0}, \dots, v_{| V |})

is the only topological sorting of the vertices. Moreover, the parents of each vertex

v_{k}

are all vertices with a lower topological index. The set

{v_{1}, \dots, v_{k - 1}}

of parents of vertex

v_{k}

we will denote by

Y_{k}

, on the understanding that

Y_{k} = Ø

, if

k = 0

. Similarly, we will denote by

Z_{k} = {v_{k + 1}, \dots, v_{| V |}}

the set of vertices with a higher topological index than

v_{k}

, here on the understanding that

Z_{k} = Ø

, if

k = | V |

.

For

a = (a_{v_{1}}, \dots, a_{v_{| V |}})

an action profile and

h = (a_{v_{1}}, \dots, a_{v_{k}})

a

τ

-history of

a

, we now introduce the auxiliary concept of

a

being sustained by a backwards induction solution at history h of a. This enables us to reason recursively about action profiles being sustained by backwards induction solutions without having to make explicit reference to specific underlying strategy profiles. Formally, for action profile

a = (a_{v_{1}}, \dots, a_{v_{| V |}})

a n d

τ-history

h = (a_{v_{1}}, \dots, a_{v_{k}})

of a we say that a is sustained (by backwards induction) at h, if either h = a, or

h = (a_{v_{1}}, \dots, a_{v_{k}})

for some 0 < k < |V| and the following two conditions hold:

(a.i): action profile a is sustained by backwards induction at history $(a_{v_{1}}, \dots, a_{v_{k}}, a_{v_{k + 1}})$
(a.ii): for every $b_{v_{k + 1}} \in A_{v_{k + 1}}$ , there is an action profile $a^{'} = (a_{v_{1}}, \dots, a_{v_{k}}, b_{v_{k + 1}}, b_{v_{k + 2}} \dots, b_{v_{| V |}})$ that is sustained by backwards induction at history $(a_{v_{1}}, \dots, a_{v_{k}}, b_{v_{k + 1}})$ and which is such that $u_{i} (a) \geq u_{i} (a^{'})$ , where i is the player active at v_k+1.

We now have the following lemma.

Lemma 7.

For total-order games, an action profile

a = (a_{v_{1}}, \dots, a_{v_{| V |}})

is sustained by a backwards induction solution if and only if

a

is sustained by backwards induction at all histories of

a

. Moreover,

a

is sustained by a backwards induction solution only if

a

is sustained by backwards induction at the empty history

() .

.

Proof.

As we are dealing with total-order games, first observe that, for each vertex v, the set X of vertices that are neither a parent of v nor have a greater topological index than v is empty. Thus, Condition (1) in the definition of a backwards induction solution

f^{*}

for partial order games reduces to

f_{v_{k + 1}}^{*} (a_{v_{1}}, \dots, a_{v_{k}}) \in \underset{b_{v_{k + 1}} \in A_{v_{k + 1}}}{arg max} u_{i} (a_{v_{1}}, \dots, a_{v_{k}}, b_{v_{k + 1}}, f_{v_{k + 2}}, \dots, f_{v_{| V |}}),

(BI″)

for all

0 \leq k < | V |

, and where i is the player active at

v_{k + 1}

.

As an auxiliary concept, we define a strategy profile

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

to sustain action profile $a = (a_{v_{1}}, \dots, a_{v_{| V |}})$ (by backwards induction) at history $h = (a_{v_{1}}, \dots, a_{v_{k}})$ , if either

a = h

, or both

(f.i): f sustains a at $(a_{v_{1}}, \dots, a_{v_{k}}, a_{v_{k + 1}})$ ,
(f.ii): $f_{v_{k + 1}} (a_{v_{1}}, \dots, a_{v_{k}}) = a_{v_{k + 1}}$ , and
(f.iii): for all $b_{v_{k + 1}} \in A_{v_{k + 1}}$ with $b_{v_{k + 1}} \neq a_{v_{k + 1}}$ we have that f sustains action profile $(a_{v_{1}}, \dots, a_{v_{k}}, b_{v_{k + 1}}, f_{v_{k + 2}}, \dots, f_{v_{| V |}})$ at history $(a_{v_{1}}, \dots, a_{v_{k}}, b_{v_{k + 1}})$ and,

u_{i} (a_{v_{1}}, \dots, a_{v_{k}}, a_{v_{k + 1}}, a_{v_{k + 2}} \dots, a_{v_{| V |}}) \geq u_{i} (a_{v_{1}}, \dots, a_{v_{k}}, b_{v_{k + 1}}, f_{v_{k + 1}}, \dots, f_{v_{| V |}}),

where i is the player active at

v_{k + 1}

.

The we split the proof up in three parts, where we prove, respectively, the following statements (I), (II), and (III), below. In conclusion then put them together again, to prove the lemma.

(I): for all $0 \leq k \leq | V |$ and all action profiles $a = (a_{v_{1}}, \dots, a_{v_{| V |}})$ , $a$ is sustained at $(a_{v_{1}}, \dots, a_{v_{k}})$ if and only if some strategy profile $f = (f_{v_{1}}, \dots, f_{v_{| V |}})$ sustains $a$ at $(a_{v_{1}}, \dots, a_{v_{k}})$ .
(II): strategy profile $f = (f_{v_{1}}, \dots, f_{v_{| V |}})$ sustains action profile $a = (a_{v_{1}}, \dots, a_{v_{| V |}})$ at history $()$ if and only if $a^{f} = a$ and for every $0 \leq k \leq | V |$ and all action profiles $x = (x_{v_{1}}, \dots, x_{v_{k}}, f_{v_{k + 1}}, \dots, f_{v_{| V |}})$ , strategy profile $f$ sustains $x$ at $(x_{v_{1}}, \dots, x_{v_{k}})$ .
(III): for every $0 \leq k \leq | V |$ and every action profile $x^{'} = (x_{v_{1}}, \dots, x_{v_{k}}, f_{v_{k + 1}}, \dots, f_{v_{| V |}})$ , strategy profile $f$ sustains $x$ at history $h = (x_{v_{1}}, \dots, x_{v_{k}})$ if and only if strategy profile $f = (f_{v_{1}}, \dots, f_{v_{| V |}})$ is a backwards induction solution.

Part (I) is by induction on

| V | - k

. For the basis, where

k = | V |

, consider an arbitrary action profile

a = (a_{v_{1}}, \dots, a_{v_{| V |}})

and arbitrary history

h = (a_{v_{1}}, \dots, a_{v_{k}})

. In this case, we find that

h = a

and immediately that

a

is sustained at

h

. Moereover, every strategy profile sustains

a

at

h

in this case.

For the induction step, consider an arbitrary

1 \leq k < | V |

, and equally arbitrary action profile

a = (a_{v_{1}}, \dots, a_{v_{| V |}})

and history

h = (a_{v_{1}}, \dots, a_{v_{k}})

.

First, assume that

a

is sustained at

(a_{v_{1}}, \dots, v_{k})

, and let i be the player active at

v_{k + 1}

. Then,

a

is also sustained at

(a_{v_{1}}, \dots, v_{k + 1})

. By the induction hypothesis, there is a strategy profile

f

that sustains

a

at

(a_{v_{1}}, \dots, v_{k + 1})

. Moreover, as

k < | V |

, for each

b_{v_{k} + 1} \in A_{v_{k}}

with

b_{v_{k + 1}} \neq a_{v_{k + 1}}

, there is an action profile

a^{'} = (a_{v_{1}}, \dots, a_{v_{k 1}}, b_{v_{k + 1}}, b_{v_{k + 2}} \dots, b_{v_{| V |}})

that is sustained at history

(a_{v_{1}}, \dots, a_{v_{k - 1}}, b_{v_{k}}, b_{v_{k + 1}})

such that

u_{i} (a) \geq u_{i} (a^{'})

. Let

A_{v_{k + 1}} = {b_{v_{k + 1}}^{1}, \dots, b_{v_{k + 1}}^{| A_{v_{k + 1}} |}}

. By the induction hypothesis, for each

1 \leq j \leq | A_{v_{k + 1}} |

, there is a strategy profile

g^{j} = (g_{v_{1}}^{j}, \dots, g_{v_{| V |}}^{j})

sustaining action profile

(a_{v_{1}}, \dots, a_{v_{k}}, b_{v_{k + 1}}^{j}, g_{v_{k + 2}}^{j} \dots, g_{v_{| V |}}^{j})

at history

(a_{v_{1}}, \dots, a_{v_{k}}, b_{v_{k + 1}}^{j})

, and which is such that

u_{i} (a) \geq u_{i} (a_{v_{1}}, \dots, a_{v_{k}}, b_{v_{k + 1}}^{j}, g_{v_{k + 1}}^{j}, \dots, g_{v_{| V |}}^{j}) .

Now, define the strategy profile

f^{*} = (f_{v_{1}}^{*}, \dots, f_{v_{| V |}}^{*})

such that for

1 \leq k \leq | V |

and every history

x = (x_{v_{1}}, \dots, x_{m})

, where

0 \leq m \leq | V |

,

f_{v_{m}}^{*} (x) = \{\begin{matrix} f_{v_{m}}^{j} (x) & if x_{v_{k + 1}} = a_{v_{k + 1}}^{j} and m > k, \\ g_{v_{m}}^{j} (x) & if x_{v_{k + 1}} = b_{v_{k + 1}}^{j} and m > k, \\ a_{v_{m}} & otherwise . \end{matrix}

It is now easy to check that, by construction, strategy profile

f^{*}

sustains

a

at history

(a_{v_{1}}, \dots, a_{v_{k}})

, as desired. In particular, observe that

f_{v_{k + 1}}^{*} (a_{v_{1}}, \dots, a_{v_{k}}) = a_{v_{k + 1}}

.

For the opposite direction, assume that strategy profile

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

sustains action profile

a = (a_{v_{1}}, \dots, a_{v_{| V |}})

at history

(a_{v_{1}}, \dots, a_{v_{k}})

, where

k < | V |

. Then, by definition, strategy profile

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

also sustains action profile

a = (a_{v_{1}}, \dots, a_{v_{| V |}})

at history

(a_{v_{1}}, \dots, a_{v_{k = 1}})

. By the induction hypothesis, we then immediately obtain that

a

is sustained at history

(a_{v_{1}}, \dots, a_{v_{k + 1}})

. Now, consider an arbitrary

b_{v_{k + 1}} \in A_{v_{k + 1}}

with

a_{v_{k + 1}} \neq b_{v_{k + 1}}

, along with history

(a_{v_{1}}, \dots, a_{v_{k}}, b_{v_{k + 1}})

. Then,

f

sustains action profile

(a_{v_{1}}, \dots, a_{v_{k}}, b_{v_{k + 1}}, f_{v_{k + 2}}, \dots, f_{v_{| V |}})

at history

(a_{v_{1}}, \dots, a_{v_{k}}, b_{v_{k + 1}})

. We have, moreover, that

u_{i} (a) \geq u_{i} ((a_{v_{1}}, \dots, a_{v_{k}}, b_{v_{k + 1}}, f_{v_{k + 2}}, \dots, f_{v_{| V |}}))

. By the induction hypothesis, we find that action profile

(a_{v_{1}}, \dots, a_{v_{k}}, b_{v_{k + 1}}, f_{v_{k + 2}}, \dots, f_{v_{| V |}})

is sustained at history

(a_{v_{1}}, \dots, a_{v_{k}}, b_{v_{k + 1}})

, which proves the case.

For part (II), first assume that

a^{f} = a

and that for every

0 \leq k \leq | V |

, strategy profile

f

sustains action profile

(x_{v_{1}}, \dots, x_{v_{k}}, f_{v_{k + 1}}, \dots, f_{v_{| V |}})

at history

(x_{v_{1}}, \dots, x_{v_{k}})

. The latter then holds in particular for

k = 0

, that is, for history

()

. Accordingly,

f

sustains action profile

(f_{v_{1}}, \dots, f_{v_{| V |}}) = a^{f}

at history

()

. Our initial assumption that

a^{f}

then gives the result for this direction.

For the opposite direction, assume that strategy profile

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

sustains action profile

a = (a_{v_{1}}, \dots, a_{v_{| V |}})

at history

()

. First, we find that by repeatedly applying condition (f.ii) gives us

a^{f} = a

. Now, consider an arbitrary

0 \leq k \leq | V |

and action profile

x = (x_{v_{1}}, \dots, x_{v_{k}}, f_{v_{k + 1}}, \dots, f_{v_{| V |}})

. We prove by induction on k that

f

sustains

(x_{v_{1}}, \dots, x_{v_{k}}, f_{v_{k + 1}}, \dots, f_{v_{| V |}})

at history

h = (x_{v_{1}}, \dots, x_{v_{k}})

.

First assume that

k = 0

, that is,

h = ()

. In that case,

x = (f_{v_{1}}, \dots, f_{v_{| V |}}) = a^{f}

. We have already seen that

a^{f} = a

, and, thus, the base case immediately follows from the assumption that

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

sustains action profile

a = (a_{v_{1}}, \dots, a_{v_{| V |}})

at history

()

.

For the induction step, let

k > 0

. In case

k = | V |

we have

h = a

and we are done immediately. Otherwise, by the induction hypothesis, we may assume that

f

sustains action profile

(x_{v_{1}}, \dots, x_{v_{k - 1}}, f_{v_{k}}, \dots, f_{v_{| V |}})

at history

(x_{v_{1}}, \dots, x_{v_{k - 1}})

. If

f_{v_{k}} (x_{v_{1}}, \dots, x_{v_{k - 1}}) = x_{k}

, then it follows from condition (f.i) that

f

sustains

(x_{v_{1}}, \dots, x_{v_{k}}, f_{v_{k + 1}}, \dots, f_{v_{| V |}})

at

(x_{v_{1}}, \dots, x_{v_{k}})

. If

f_{v_{k}} (x_{v_{1}}, \dots, x_{v_{k - 1}}) \neq x_{k}

, then still

f_{v_{k}} (x_{v_{1}}, \dots, x_{v_{k - 1}}) \in A_{v_{k}}

. Now, it follows from condition (f.iii) that

f

sustains

(x_{v_{1}}, \dots, x_{v_{k}}, f_{v_{k + 1}}, \dots, f_{v_{| V |}})

at

(x_{v_{1}}, \dots, x_{v_{k}})

.

For part (III), first assume, for every

0 \leq k \leq | V |

and action profile

x = (x_{v_{1}}, \dots, x_{v_{k}}, f_{v_{k + 1}},

\dots, f_{v_{| V |}})

, that strategy profile

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

sustains

x

at history

h = (x_{v_{1}}, \dots, x_{v_{k}})

. Now, consider an arbitrary history

(z_{1}, \dots, z_{k})

for some

1 \leq k < | V |

. Then, by assumption,

f

sustains action profile

(z_{v_{1}}, \dots, z_{v_{k}}, f_{v_{k + 1}}, \dots, f_{v_{| V |}})

at history

(z_{v_{1}}, \dots, z_{v_{k}})

. As

k < | V |

, we find that

(z_{1}, \dots, z_{k}) \neq (z_{1}, \dots, z_{k}, f_{v_{k + 1}}, \dots, f_{v_{| V |}})

. Therefore, by condition (f.iii), we then find that

f_{v_{k + 1}} (z_{v_{1}}, \dots, z_{v_{k}}) \in \underset{b_{v_{k + 1}} \in A_{v_{k + 1}}}{arg max} u_{i} (z_{v_{1}}, \dots, z_{v_{k}}, b_{v_{k + 1}}, f_{v_{k + 2}}, \dots, f_{v_{| V |}}),

where i is the player active at

v_{k + 1}

. It thus follows that

f

is a backward induction solution.

For the opposite direction, assume that

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

is a backwards induction solution. Now, consider an arbitrary

0 \leq k \leq | V |

. We prove by induction on

| V | - k

, that, for every action profile

x = (x_{v_{1}}, \dots, x_{v_{k}}, f_{v_{k + 1}}, \dots, f_{v_{| V |}})

, strategy profile

f

sustains

x

at history

h = (x_{v_{1}}, \dots, x_{v_{k}})

.

For the basis assume let

k = | V |

. Consider an arbitrary action profile

x = (x_{v_{1}}, \dots, x_{v_{k}})

along with history

h = (x_{v_{1}}, \dots, x_{v_{k}})

. Obviously,

h = x

, and, thus, immediately that

f

sustains

x

at history h.

For the induction step, let

k < | V |

. Again, consider an arbitrary action profile

h = (x_{v_{1}}, \dots, x_{v_{k}}, f_{v_{k + 1}}, \dots, f_{v_{| V |}})

along with history

h = (x_{v_{1}} x \dots, x_{v_{k}})

. By the induction hypothesis, we find, for all action profiles

y = (y_{v_{1}}, \dots, y_{v_{k + 1}}, f_{v_{k + 1}}, \dots, f_{v_{| V |}})

, that

f

sustains

y

at history

(y_{v_{1}}, \dots, y_{v_{k + 1}})

. This holds in particular for histories

(x_{v_{1}}, \dots, x_{v_{k}}, b_{v_{k + 1}})

where

b_{v_{k + 1}} \in A_{v_{k + 1}}

. For

b_{v_{k + 1}} = f_{v_{k + 1}} (x_{v_{1}}, \dots, f_{v_{k}})

, we obtain condition (f.i). Condition (f.ii) is trivially satisfied. Finally, having assumed that

f

is a backwards induction solution, we find that,

u_{i} (x_{v_{1}}, \dots, x_{v_{k}}, f_{v_{k + 1}}, f_{v_{k + 2}}, \dots, f_{v_{| V |}}) \geq u_{i} (x_{v_{1}}, \dots, x_{v_{k}}, b_{v_{k + 1}}, f_{v_{k + 2}}, \dots, f_{v_{| V |}}),

for all

b_{v_{k + 1}} \in V_{v_{k + 1}}

with

b_{v_{k + 1}} \neq f_{v_{k + 1}} (x_{v_{1}}, \dots, f_{v_{k}})

and i the player active at

v_{k + 1}

. We may therefore conclude that

f

sustains

x = (x_{v_{1}}, \dots, x_{v_{k}}, f_{v_{k + 1}}, \dots, f_{v_{| V |}})

at

(x_{v_{1}}, \dots, x_{v_{k}})

, proving part (III).

With (I) through (III) in place, we conclude the proof as follows. Consider an arbitrary action profile

a = (a_{v_{1}}, \dots, a_{v_{| V |}})

. First, assume, for all

0 \leq k \leq | V |

, that

a

is sustained by backwards induction at

(a_{v_{1}}, \dots, a_{v_{k}})

. Then by (I), for all

0 \leq k \leq | V |

, there is some strategy profile

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

that sustains

a

by backwards induction at

(a_{v_{1}}, \dots, a_{v_{k}})

. For the particular case wherein

k = 0

, we find that this strategy profile

f

that sustains

a

by backwards induction at

()

. By repeated application of (f.ii), moreover, it can easily be established that

a^{f} = a

. Part (II) then yields that for every

0 \leq k \leq | V |

and all action profiles

x = (x_{v_{1}}, \dots, x_{v_{k}}, f_{v_{k + 1}}, \dots, f_{v_{| V |}})

, strategy profile

f

sustains

a

at history

(x_{v_{1}}, \dots, x_{v_{k}})

. Finally, by virtue of (III), we may conclude that

f

is a backwards induction solution that sustains

a

.

For the opposite direction, assume that

f

is a backwards induction solution sustaining

a

. Therefore,

a^{f} = a

, and, accordingly,

(a_{v_{1}}, \dots, a_{v_{k}}, f_{v_{k + 1}}, \dots, f_{v_{| V |}}) = (a_{v_{1}}, \dots, a_{v_{| V |}})

for all

0 \leq k \leq | V |

. Part (III) yields furthermore that for every

0 \leq k \leq | V |

, strategy profile

f

sustains action profile

a^{'} = (a_{v_{1}}, \dots, a_{v_{k}}, f_{v_{k + 1}}, \dots, f_{v_{| V |}})

at history

(a_{v_{1}}, \dots, a_{v_{k}})

. It now follows, for every

0 \leq k \leq | V |

, that

f

sustains

a

at history

(a_{v_{1}}, \dots, a_{v_{k}})

. With part (I), we may now conclude that, for all

0 \leq k \leq | V |

, action profile

a

is sustained by backwards induction at

(a_{v_{1}}, \dots, a_{v_{k}})

, as desired.

Finally, observe that the second part of the lemma also follows from the above argument. □

The intuition behind Lemma is that, for a strategy profile

f

sustaining an action profile

a = (a_{v_{1}}, \dots, a_{v_{| V |}})

to be a backward induction solution, it does not suffice just to induce

a

, in the sense that

a^{f} = a

. It should, in addition, incorporate a threat strategy at every vertex

v_{k}

that succeeds in deterring the player active at

v_{k}

to play any action other than

a_{v_{k}}

, that is, the action prescribed by

a

. The lemma, moreover, shows that a strategy profile that achieve this can be constructed from the strategy profiles that achieve this at each history.

We introduce the following auxiliary decision problem, solving which is an important subroutine in our proof of

PSPACE

-membership of backwards induction actions for total-order games.

backward induction guarantee
Given:	Total-order game G, action profiles $a = (a_{v_{1}}, \dots, a_{v_{\| V \|}})$ and history $h = (a_{v_{1}}, \dots, a_{v_{k}})$ , player i, and $r \in R$ .
Problem:	Is $a$ sustained by backwards induction at h such that $u_{i} (a) \leq r$

We find that backward induction guarantee is in

PSPACE

for total-order games.

Lemma 8.

For total-order games,backward induction guaranteeis in

PSPACE

, that is, the problem can be solved in space polynomial in the size of the game.

Proof.

Let a total-order game be given together with an action profile

a = (a_{v_{1}}, \dots, a_{v_{| V |}})

. We prove by induction on

| V | - k

, that for all histories

h = (a_{v_{1}}, \dots, a_{v_{k}})

of

a

, all players i, and all values

r \in R

the problem backward induction guarantee can be solved in polynomial space.

For the basis, assume

k = | V |

. Then,

h = a

, and consider an arbitrary player i and arbitrary value

r \in R

. In this case, it suffices to compute

u_{i} (a)

and check whether

u_{i} (a) \leq r

. By Lemma 2, this achieved in polynomial time, and hence in polynomial space as well.

For the induction step, let

k < | V |

. Consider an arbitrary player j and an arbitrary value

r \in R

. Let, furthermore, i be the player active at

a_{v_{k}}

. Now, we first check whether

a

is sustained by backward induction at history

h_{v_{k + 1}} = (a_{v_{1}}, \dots, a_{v_{k + 1}})

. By virtue of the induction hypothesis, this can be achieved in polynomial space by computing backward induction guarantee for

a

, history

h_{v_{k + 1}}

, player i, and value

r = u_{i} (a)

. If this check is negative, we can conclude immediately and output “no”.

Otherwise, we deploy two binary counters with, respectively,

{log}_{2} | A (x) |

and

{log}_{2} | H | = O (| V | {log}_{2} | A |)

digits to run through all actions

b_{v_{k + 1}}

in

A_{v_{k}}

and all action profiles

(a_{v_{1}}, \dots, a_{v_{k}}, b_{v_{k + 1}}, b_{k + 1}^{'} \dots, b_{v_{| V |}}^{'}),

respectively. We then subsequently check for each

b_{v_{k + 1}}

in

A_{v_{k + 1}}

whether some action profile

b = (a_{v_{1}}, \dots, a_{v_{k}}, b_{v_{k + 1}}, b_{k + 1}^{'} \dots, b_{v_{| V |}}^{'})

is sustained by backwards induction at

(a_{v_{1}}, \dots, a_{v_{k}},

b_{v_{k + 1}})

with

u_{i} (b) \leq u_{i} (a)

. Then,

a

is sustained by backwards induction a history

h = (a_{v_{1}}, \dots, a_{v_{k + 1}})

if and only if for each

b_{v_{k + 1}}

in

A_{v_{k + 1}}

at least one of these checks is positive. Moreover, as some reflection reveals, all of these checks are independent and, by the induction hypothesis, we may assume that all of them can be achieved in polynomial space, we may conclude that the subsequent execution of these checks can be performed in polynomial space as well, giving us the result. □

With Lemma 8 in place, we are now in a position to now demonstrate the last main result of this section.

Theorem 13.

For total-order games,backwards induction actionsis

PSPACE

-complete.

Proof.

For membership in

PSPACE

, let a total order game with

v_{1} E \dots E v_{| V |}

and an action profile

a = (a_{v_{1}} \dots, a_{v_{| V |}})

be given. Observe that the topological order

τ = (v_{1}, \dots, v_{| V |})

is fixed by E. By virtue of Lemma, it suffices to check whether

a

is sustained by backwards induction at the empty history

()

for value

k = u_{i} (a)

, where i is the player active at

v_{1}

. Lemma 8 ensures that this can be achieved in space polynomial in the input.

For

PSPACE

-hardness, we reduce from qbf, which is known to be

PSPACE

-hard. Let

Φ = Q_{1} x_{1}, \dots, Q_{| X |} x_{| X |} φ (x_{1}, \dots, x_{k})

be a fully quantified propositional formula over

X = {x_{1}, \dots, x_{| X |}}

, where each

Q_{i}

is either ∃ or ∀. We let

X_{\forall}

and

X_{\exists}

denote the sets of universally quantified variables and existentially quantified variables in

Φ

, respectively. Furthermore, for each variable

x_{i}

with

1 \leq i \leq | X |

, we let

B_{i}

denote the set of Boolean functions on the variables

x_{1}, \dots, x_{i - 1}

. With each profile

β = (β_{1}, \dots, β_{| X |})

in

B_{1} \times \dots \times B_{| X |}

, we associate an a valuation

α_{β} : X \to {1, 0}

such that

\begin{matrix} α_{β} (x_{1}) & = β_{1} & α_{β} (x_{i + 1}) & = β_{i + 1} (α_{β} (x_{1}), \dots, α_{β} (x_{i - 1})) . \end{matrix}

We construct a total-order Boolean game

G^{Φ}

with two players: player 1 and player 2. The propositional variables our game is based on are given by

x_{1}, \dots, x_{| X |}

together with an additional ‘fresh’ variable

x_{0}

. The players’ preferences are captured by

\begin{matrix} γ_{1} & = x_{0} \land \neg φ & γ_{2} & = x_{0} \land φ . \end{matrix}

We assign control over variable

x_{i}

to player 1, if

Q_{i} = \forall

, and to player 2, if

Q_{i} = \exists

. Moreover, the auxiliary variable

x_{0}

is assigned to the control of player 1. The dependency graph

({x_{0}, \dots, x_{k}}, E)

is defined such that

x_{i} E x_{j}

if and only if

i < j

. The game

G^{Φ}

is a total-order game and, thus, fit for backwards induction. Backwards induction solutions are therefore bound to exist. We now demonstrate that

Φ = Q_{1} x_{1}, \dots, Q_{| X |} x_{| X |} φ

evaluates to true if and only if the action profile

\bar{x} = ({\bar{x}}_{0}, {\bar{x}}_{1}, \dots, {\bar{x}}_{| X |})

is sustained by a backwards induction solution in

G^{Φ}

.

First assume for the “only if”-direction, that

Φ

holds. Then, there is a profile of Boolean functions

β_{i}

for the variables in

X_{\exists}

, such that for all profiles Boolean functions

β_{j}

for the variables

x_{j} \in X_{\forall}

, such that the assignment

α_{β_{1}, \dots, β_{| V |}} : X \to {0, 1}

satisfies

φ

. On this basis we define a strategy profile

f^{*} = (f_{1}^{*}, f_{2}^{*})

, where

f_{1}^{*}

and

f_{2}^{*}

are given by the following choice equations for each

x_{i} \in X_{\forall} \cup {x_{0}}

(controlled by player 1) and each

x_{j} \in X_{\exists}

(controlled by player 2):

\begin{matrix} x_{i} & : ⊥ & x_{j} & : x_{0} \land φ_{β_{j}} (x_{1}, \dots, x_{j - 1}) . \end{matrix}

Here,

φ_{β_{j}}

denotes the Boolean formula representing Boolean function

β_{j}

. Note, furthermore, that

a^{f^{*}} = \bar{x}

.

First observe that for all action profiles

a = (a_{x_{0}}, \dots, a_{x_{| X |}})

with

a_{x_{0}} = {\bar{x}}_{0}

, the valuation

α_{a}

will satisfy neither

γ_{1}

nor

γ_{2}

. Some reflection, thus reveals that

f^{*}

sustains

\bar{x}

at all histories

h = (a_{x_{0}}, \dots, a_{x_{k}})

with

a_{v_{x_{0}}} = {\bar{x}}_{0}

, particular at history

a_{v_{x_{0}}}

. Now observe that

f_{2}

incorporates a winning strategy for player 2 in the “subgame” that results if player 1 sets

x_{0}

to true. Therefore,

φ

will be satisfied by the valuations induced by the Nash equilibria in which

x_{0}

is set to true—if any such Nash equilibria exist. As, by Theorem 7, every backwards induction solution is a Nash equilibrium, it follows that every action profile

a = (a_{x_{0}}, \dots, a_{x_{| X |}})

with

a_{x_{0}} = x_{0}

that is sustained by backwards induction will render

φ

true, and, hence, player 1’s goal

γ_{1}

false. Now let

f^{'} = (f_{1}^{'}, f_{2}^{'})

be a strategy profile that sustains some action profile

a^{'} = (a_{x_{0}}^{'}, \dots, a_{x_{| X |}}^{'})

with

a_{x_{0}}^{'} = x_{0}

by backwards induction at history

h' = x_{0}

. Observe that we may assume the existence of such a strategy profile. At this juncture, consider the strategy profile

f^{* *} = (f_{x_{0}}^{* *}, \dots, f_{x_{| X |}}^{* *})

defined such that for all histories

h = (a_{x_{0}}, \dots, a_{x_{k - 1}})

with

0 \leq k < | X |

and

i \in {1, 2}

,

f_{i}^{* *} (h) = \{\begin{matrix} f_{1}^{*} (h) & if a_{x_{0}} = {\bar{x}}_{0}, \\ f_{2}^{'} (h) & otherwise . \end{matrix}

Using the auxiliary formal apparatus introduced in the proof of Lemma 7, we find that

f^{* *}

sustains action profile

\bar{x}

at

{\bar{x}}_{0}

and action profile

(x_{0}, f_{x_{1}}^{* *}, \dots, f_{| V |}^{* *})

at

x_{0}

. Moreover,

u_{1} (\bar{x}) \geq u_{1} (x_{0}, f_{x_{1}}^{* *}, \dots, f_{x_{| X |}}^{* *})

. As player 1 controls

x_{0}

, we may therefore conclude that

f^{* *}

sustains

\bar{x}

at

()

. By Lemma, we may now conclude that

\bar{x}

is sustained by a backwards induction solution, as desired.

For the “if”-direction, assume that

Φ

does not hold. In this case, it can be seen that player 2 does not have a winning strategy in the “subgame” that results if

x_{0}

is set to true. Now, consider an arbitrary strategy profile

\bar{f} = ({\bar{f}}_{1}, {\bar{f}}_{2})

that sustains

\bar{x}

. Then, player 1’s goal is not satisfied if

\bar{f}

is played. Moreover, as player 2 does not have a winning strategy in the “subgame” after

x_{0}

is set to true, player 1 has some strategy

f_{i}^{*}

with which they set

x_{0}

to true and such that

(f_{1}^{*}, {\bar{f}}_{2})

guarantees

γ_{1}

to be satisfied. Accordingly, player 1 will want to deviate to

f_{1}^{*}

and strategy profile

\bar{f}

is not a Nash equilibrium. By Theorem 7, strategy profile

\bar{f}

is not a backwards induction solution either. Having chosen

\bar{f}

arbitrarily, we get the result. □

With this result in place, we can now present the proof of Theorem 6, which stated

PSPACE

-completeness of is nash actions for total-order games. We show that, given an action profile

a = (a_{v_{1}}, \dots, a_{v_{| V |}})

, a given total-order game G can be transformed into a total-order game

G^{a}

in which

a

is sustained by a backwards induction solution if and only if

a

is sustained by a Nash equilibrium in the original game G. We then leverage Theorem 13 to prove

PSPACE

-membership of is nash actions.

Formally, let G be a partial order game defined on dependency graph

(V, E)

, and let

a = (a_{v_{1}}, \dots, a_{v_{| V |}})

be an action profile. Then, we define

G^{a}

as exactly the same game as G, be it with the following binary utility functions

u_{i}^{a} : A \to {0, 1}

. For a given action profile

b = (b_{v_{1}}, \dots, b_{v_{| V |}})

, we say that player j is the first to deviate from $a$ (to $b$ ) if, for some

1 \leq k \leq | V |

, player j is active at

v_{k}

,

b_{v_{m}} = a_{v_{m}}

for all

m < k

, and

b_{v_{k}} \neq a_{v_{k}}

. For each player i, we set

u_{i}^{a} (a) = 1

and, for every action profile

b \neq a

where player j is the first player deviate from

a

to

b

, we define

u_{i}^{a} (b) = \{\begin{matrix} 0 & if i = j and u_{j} (b) \leq u_{j} (a), \\ 2 & if i = j and u_{j} (b) > u_{j} (a), \\ 2 & if i \neq j and u_{j} (b) \leq u_{j} (a), \\ 0 & if i \neq j and u_{j} (b) > u_{j} (a) . \end{matrix}

We now have the following two lemmas, the first of which shows that the game

G^{a}

preserves the Nash equilibria of G.

Lemma 9.

Let

a = (a_{v_{1}}, \dots, a_{v_{| V |}})

be an action profile of a total-order game G. Then,

a

is sustained by a Nash equilibrium in G if and only if

a

is sustained by a Nash equilibrium in

G^{a}

.

Proof.

First assume that

f = (f_{v_{1}}, \dots, f_{v_{| V |}})

be a Nash equilibrium sustaining

a

in G. Therefore,

a^{f} = a

. For contradiction, also assume that

f

does not do so in

G^{a}

. Then, there is some player i and some strategy

g_{i}

, such that

u_{i}^{a} (a) < u_{i}^{a} (a^{(f_{- i}, g_{i})})

. Obviously, player i is the first to deviate from

a

to

a^{(f_{- i}, g_{i})}

. Thus,

u_{i}^{a} (a) = 1

and

u_{i}^{a} (a^{(f_{- i}, g_{i})}) = 2

, and, by definition of

u^{a}

, we find that

u_{i} (a^{f}) < u_{i} (a^{(f_{- i}, g_{i})})

. It therefore follows that

f

is not a Nash equilibrium of G, a contradiction.

Now assume that

f

is a Nash equilibrium sustaining

a

in

G^{a}

. Therefore,

a^{f} = a

. It suffices to show that

f

is also a Nash equilibrium in G. To this end, consider an arbitrary player i and strategy

g_{i}

for i. If

a^{(f_{- i}, g_{i})} = a^{f}

, we are done immediately. Otherwise, i is the first player to deviate from

a^{f}

to

a^{(f_{- i}, g_{i})}

. Because

f

is a Nash equilibrium in

G^{a}

, we find that

u_{i}^{a} (a^{f}) \geq u_{i}^{a} (a^{(f_{- i}, g_{i})})

. As

a^{f} = a

, we have

u_{i}^{a} (a^{f}) = 1

. Observe that it now follows that

u_{i}^{a} (a^{(f_{- i}, g_{i})}) = 0

. Accordingly,

u_{i} (a^{(f_{- i}, g_{i})}) \leq u_{i} (a)

, and thus

u_{i} (a^{(f_{- i}, g_{i})}) \leq u_{i} (a^{f})

. We may conclude that

f

is a Nash equilibrium in G, as desired. □

For total-order game

G^{a}

, we can furthermore show that if there is a Nash equilibrium that sustains action profile

a

, then there is also a backwards induction that does the same. The converse of this statement is immediate, as every backwards induction solution is also a Nash equilibrium (Theorem 7).

Lemma 10.

Let

a

be an action profile of a total-order game G. Then,

a

is sustained by a Nash equilibrium in

G^{a}

if and only if

a

is sustained by a backwards induction solution in

G^{a}

.

Proof.

By virtue of Theorem 7, the “if”-direction of the lemma is immediate. For the opposite direction, assume that

a

is sustained by a Nash equilibrium

f^{*}

. For contradiction, also assume that

a

is not sustained by any backwards induction solution. As we are dealing with total-form games, we may assume the existence of backwards induction solutions. These we may also assume to be Nash equilibria in virtue of Theorem 7. Now, let

g = (g_{v_{1}}, \dots, g_{v_{| V |}})

be any backwards induction solution and let

a^{g} = b = (b_{v_{1}}, \dots, b_{v_{| V |}})

. Furthermore, let

1 \leq k \leq | V |

be the greatest index such that

g_{v_{k}} (a_{v_{1}}, \dots, a_{v_{k - 1}}) \neq f_{v_{k}} (a_{v_{1}}, \dots, a_{v_{k - 1}})

and assume i to be the player active at

v_{k}

. We may also assume that

f

is the backwards induction solution for which there are the fewest histories

h = (x_{v_{1}}, \dots, x_{v_{ℓ}})

such that some player

j \neq i

is active at

v_{ℓ}

and

f_{v_{ℓ}} (\neq g_{v_{ℓ}} (h)

. Furthermore, observe that, by choice of k, we have that

a = (a_{v_{1}}, \dots, a_{v_{k}}, g_{v_{k + 1}}, \dots, g_{v_{| V |}})

. Having assumed that

g

is a backwards induction solution, we find moreover that

1 = u_{i} (a) = u_{i} (a^{f}) < u_{i} (a^{g}) = u_{i} (b) = 2 .

Now, observe that

a^{g} \neq a^{(f_{- i}, g_{i})}

; otherwise,

f

would not be a Nash equilibrium. Accordingly, there must be some player

j \neq i

and some vertex

v_{k^{'}}

with

k < k^{'} \leq | V |

and at which player j is active such that

f_{v_{k + 1}} (a_{v_{1}}, \dots, a_{v_{k - 1}}, b_{v_{k}}, \dots, b_{k^{'} - 1}) \neq g_{v_{k + 1}} (a_{v_{1}}, \dots, a_{v_{k - 1}}, b_{v_{k}}, \dots, b_{k^{'} - 1}) .

Having assumed that

g

is a backwards induction solution, we find that

\begin{matrix} u_{j} (a_{v_{1}}, \dots, a_{v_{k - 1}}, b_{v_{k}}, \dots, b_{v_{k^{'} - 1}}, a_{v_{k^{'}}}^{'}, g_{v_{k^{'} + 1}}, \dots, g_{v_{| V |}}) & \leq u_{j} (a_{v_{1}}, \dots, a_{v_{k - 1}}, b_{v_{k}}, \dots, b_{v_{k - 1}}, b_{v_{k^{'}}}, g_{v_{k^{'} + 1}}, \dots, g_{v_{| V |}}) \\ = u_{j} (b) . \end{matrix}

where

a_{v_{k^{'}}}^{'} = f_{v_{k^{'}}} (a_{v_{1}}, \dots, a_{v_{k - 1}}, b_{v_{k}}, \dots, b_{v_{k^{'} - 1}})

. Because

a_{v_{k}} \neq b_{v_{k}}

and

u_{i} (b) = 2

, we have

u_{j} (b) = 0

. It now follows that

u_{j} (a_{v_{1}}, \dots, a_{v_{k - 1}}, b_{v_{k}}, \dots, b_{v_{k^{'} - 1}}, a_{v_{k^{'}}}^{'}, g_{v_{k^{'} + 1}}, \dots, g_{v_{| V |}}) = 0

as well. Accordingly,

\begin{matrix} u_{j} (a_{v_{1}}, \dots, a_{v_{k - 1}}, b_{v_{k}}, \dots, b_{v_{k^{'} - 1}}, a_{v_{k^{'}}}^{'}, g_{v_{k^{'} + 1}}, \dots, g_{v_{| V |}}) & \geq u_{j} (a_{v_{1}}, \dots, a_{v_{k - 1}}, b_{v_{k}}, \dots, b_{v_{k - 1}}, b_{v_{k^{'}}}, g_{v_{k^{'} + 1}}, \dots, g_{v_{| V |}}) . \end{matrix}

At this juncture, consider the strategy profile

g^{'} = (g_{v_{1}}^{'}, \dots, g_{v_{| V |}}^{'})

defined such that for all vertices

v_{ℓ}

and all histories

h = (x_{v_{1}}, \dots, x_{v_{ℓ - 1}})

,

g_{v}^{'} (h) = \{\begin{matrix} a_{v_{k^{'}}}^{'} & if ℓ = k^{'} and h = (a_{v_{1}}, \dots, a_{v_{k - 1}}, b_{v_{k}}, \dots, b_{v_{k^{'} - 1}}), \\ g_{v}^{'} (h) & otherwise . \end{matrix}

The key observation to make at this point is that

g^{'}

is a backwards induction solution just as well as

g

is. As

a_{v_{k^{'}}}^{'} = f_{v_{k^{'}}} (a_{v_{1}}, \dots, a_{v_{k - 1}}, b_{v_{k}}, \dots, b_{v_{k^{'} - 1}})

, however, we find that for

g^{'}

the number of histories

h = (x_{v_{1}}, \dots, x_{v_{ℓ}})

such that there some player

j \neq i

that is active at

v_{ℓ}

and

f_{v_{ℓ}} (h) \neq g_{v_{ℓ}}^{'} (h)

is one fewer than the same number of such histories for

g

. This contradicts our minimality assumption regarding the latter. This concludes the proof. □

We are now finally in a position to prove Theorem 6, and establish that is nash actions is

PSPACE

-complete for total-order games.

Proof of Theorem 6.

For

PSPACE

-membership, first observe that, given total-order game G and action profile

a = (a_{v_{1}}, \dots, a_{v_{| V |}})

, we can construct

G^{a}

in polynomial time. To see this, note for each player i, we have to transform the circuit

C_{u_{i}}

computing i’s utility function

u_{i}

in G to a circuit

C_{u_{i}^{a}}

. This can be achieved by adding a polynomial number of gates to each

C_{u_{i}}

. We then check whether

a

is sustained by a backwards induction solution in

G^{a}

. Theorem 13 guarantees that this can be achieved in polynomial space. Lemma 9 then yields the result.

To prove

PSPACE

-hardness, we adapt the proof of Theorem 13 and reduce from

q b f

. Given an instance

Φ = Q_{1} x_{1}, \dots, Q_{| X |} x_{| X |} φ (x_{1}, \dots, x_{k})

of qbf, we construct the game

G^{Φ}

as in Theorem 13. We demonstrate that

Φ = Q_{1} x_{1}, \dots, Q_{| X |} x_{| X |} φ

evaluates to true if and only if the action profile

\bar{x} = ({\bar{x}}_{0}, {\bar{x}}_{1}, \dots, {\bar{x}}_{| X |})

is sustained by a Nash equilibrium.

First assume that

Φ

holds. Then, there is a profile of Boolean functions

β_{i}

for the variables in

X_{\exists}

, such that for all profiles of Boolean functions

β_{j}

for the variables

x_{j} \in X_{\forall}

, such that the assignment

α_{β_{1}, \dots, β_{| V |}} : X \to {0, 1}

satisfies

φ

. As in the proof of Theorem 13, we consider profile

f^{*} = (f_{1}^{*}, f_{2}^{*})

, where

f_{1}^{*}

and

f_{2}^{*}

are given by the following choice equations for each

x_{i} \in X_{\forall} \cup {x_{0}}

(controlled by player 1) and each

x_{j} \in X_{\exists}

(controlled by player 2):

\begin{matrix} x_{i} & : ⊥ & x_{j} & : x_{0} \land φ_{β_{j}} (x_{1}, \dots, x_{j - 1}) . \end{matrix}

Note that

a^{f^{*}} = \bar{x}

, and so it suffices to show that

f^{*}

is a Nash equilibrium. Recall that

f_{2}

incorporates a winning strategy for player 2 in the “subgame” that results if player 1 sets

x_{0}

to true. It follows that for each of player 1’s strategies

g_{1}

, the assignment

α_{a^{(g_{1}, f_{2}^{*})}}

associated with action profile

a^{(g_{1}, f_{2}^{*})}

either satisfies

φ

or does not satisfy

x_{0}

. In either case, player 1’s goal is not satisfied, and hence does not want to deviate from

f^{*}

. Player 2 does not want to deviate either, as, with player 1 playing

f_{1}^{*}

, variable

x_{0}

will be set to false and player 2’s goal

γ_{2}

will not be satisfied no matter which strategy player 2 chooses. We may conclude that

f^{*}

is a Nash equilibrium, as desired.

Finally, assume that

Φ

does not hold. In that case, player 2 does not have a winning strategy in the “subgame” that results if

x_{0}

is set to true. Now consider an arbitrary strategy profile

\bar{f} = ({\bar{f}}_{1}, {\bar{f}}_{2})

that sustains

\bar{x}

, and observe that

\bar{f}

does not lead to player 1’s goal being satisfied. As player 2 does not have a winning strategy when

x_{0}

is set to true, there must be a strategy

f_{1}^{*}

for player 1 such that

(f_{1}^{*}, {\bar{f}}_{2})

leads to player 1’s goal

γ_{1}

becoming true. Accordingly, player 1 would like to deviate to

f_{1}^{*}

, and

\bar{f}

is not a Nash equilibrium. We may therefore conclude that

\bar{x}

is not sustained by a Nash equilibrium, as desired. □

8. Conclusions

Game theorists have developed a huge range of game models since the field was initiated nearly a century ago. Of these, the most important and prominent are normal form games and extensive form games. These models differ in the assumptions they make about the information available to players and their strategies. In this article, we have introduced a game model in which informational dependencies between decision nodes in a game are captured in what we call a dependence relation. This dependence relation explicitly states what information is available when making a choice at a decision node. Although our partial order games are strictly no more expressive than extensive form games of imperfect information, they can be exponentially more compact, and we believe they are much more transparent with respect to the information flow in a game.

The compact representation of non-cooperative settings as partial order games naturally evokes various questions concerning the complexity of computing solution concepts in this model. We have focussed on Nash equilibrium and backwards induction. We explored six decision problems surrounding these two solution concepts—non-emptiness, is nash, is nash actions, fit for backwards induction, is backwards induction, and is backwards induction actions—for partial-games as well as for the two extremal subclasses of empty-order games and total-order games. Our results are summarised in Table 1 and Table 2. For each of these problems, we find that computational costs are highest for general partial order games, rather than for the two extremal classes. This suggests that the structure of the dependence relation is a major source of complexity.

Furthermore, note the contrast in complexity between is nash and is backwards induction, on the one hand, and is nash actions and is backwards induction actions on the other. Other than in normal-form games, and arguably similar to extensive-form games, partial order games evince a clear distinction between strategy profiles and action profiles, with the former being much (exponentially) ‘larger’ objects than the latter, and of which there can also be exponentially more. Thus, is nash and is nash actions are similar problems, which basically only differ in that the former has a strategy profile as part of its input and the latter an action profile. We propose that the difference in size and number between strategy profiles and action profiles accounts for much of the contrast in computational cost between is nash and is nash actions. A similar remark applies to fit for backwards induction and is backwards induction actions.

For future work, it would be useful to develop software tools to support reasoning in partial order games, and from a theoretical perspective, it would also be interesting to consider restrictions on dependence graph structures that might lead to more tractable decision problems. Finally, of course, it would be valuable to look in more detail at applications of our model.

Author Contributions

Conceptualization, J.G., P.H., M.W., and V.Z.; methodology, J.G., P.H., M.W., and V.Z.; formal analysis, P.H., M.W., and V.Z.; investigation, P.H. and V.Z.; writing—original draft preparation, P.H. and V.Z.; writing—review and editing, all authors; supervision, M.W.; project administration, M.W.; funding acquisition, M.W. All authors have read and agreed to the published version of the manuscript.

Funding

Wooldridge and Harrenstein were supported by the UKRI under a Turing AI World Leading Researcher Fellowship (EP/W002949/1) awarded to Wooldridge.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors would like to thank James Fox, Lewis Hammond, and Thomas Steeples for discussions on this work, in particular on the relationship between partial order games and MAIDs.

Conflicts of Interest

The authors declare no conflict of interest.

References

Bradfield, J.; Gutierrez, J.; Wooldridge, M. Partial-order Boolean games: Informational independence in a logic-based model of strategic interaction. Synthese 2016, 193, 781–811. [Google Scholar] [CrossRef]
Koller, D.; Milch, B. Multi-agent influence diagrams for representing and solving games. In Proceedings of the 17th International Joint Conference on Artificial Intelligence (IJCAI), Seattle, WA, USA, 4–10 August 2001; pp. 1027–1034. [Google Scholar]
Koller, D.; Milch, B. Multi-agent influence diagrams for representing and solving games. Games Econ. Behav. 2003, 45, 181–221. [Google Scholar] [CrossRef]
Maschler, M.; Solan, E.; Zamir, S. Game Theory; Cambridge University Press: Cambridge, UK, 2013. [Google Scholar]
Osborne, M.J.; Rubinstein, A. A Course in Game Theory; MIT Press: Cambridge, MA, USA, 1994. [Google Scholar]
Vollmer, H. Introduction to Circuit Complexity: A Uniform Approach; Texts in Theoretical Computer Science; Springer: Berlin/Heidelberg, Germany, 1999. [Google Scholar]
Jukna, S. Boolean Function Complexity: Advances and Frontiers; Algorithms and Combinatorics; Springer: Berlin/Heidelberg, Germany, 2012; Volume 27. [Google Scholar]
Greenlaw, R.; Hoover, H.J.; Ruzzo, W.L. Limits to Parallel Computation: P-Completeness Theory; Oxford University Press: Oxford, UK, 1995. [Google Scholar]
Muller, D.E. Complexity in electronic switching circuits. IRE Trans. Electron. Comput. 1956, 1, 15–19. [Google Scholar] [CrossRef]
Lupanov, O.B. A method of circuit synthesis. Izvesitya VUZ Radiofiz 1958, 1, 120–140. (In Russian) [Google Scholar]
Harrenstein, P.; van der Hoek, W.; Meyer, J.J.; Witteveen, C. Boolean Games. In Proceeding of the Eighth Conference on Theoretical Aspects of Rationality and Knowledge (TARK VIII); Certosa Di Pontignano, Italy, 8–10 July 2001, van Benthem, J., Ed.; Morgan Kaufmann: Burlington, MA, USA, 2001; pp. 287–298. [Google Scholar]
Harrenstein, P. Logic in Conflict. Ph.D. Thesis, Utrecht University, Utrecht, The Netherlands, 2004. [Google Scholar]
Bonzon, E.; Lagasquie, M.; Lang, J.; Zanuttini, B. Boolean games revisited. In Proceedings of the ECAI 2006, Riva del Garda, Italy, 28 August–1 September 2006; Volume 141. Available online: https://www.researchgate.net/publication/48445144_Boolean_Games_Revisited (accessed on 17 December 2021).
Bonzon, E.; Lagasquie-Schiex, M.C.; Lang, J.; Zanuttini, B. Compact preference representation and Boolean games. Auton. Agents Multi-Agent Syst. 2009, 18, 1. [Google Scholar] [CrossRef][Green Version]
Wooldridge, M.; Endriss, U.; Kraus, S.; Lang, J. Incentive engineering for Boolean games. Artif. Intell. 2013, 195, 418–439. [Google Scholar] [CrossRef]
Hammond, L.; Fox, J.; Everitt, T.; Abate, A.; Wooldridge, M. Equilibrium Refinements for Multi-Agent Influence Diagrams: Theory and Practice. In Proceedings of the 20th International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS’21), Online, 3–7 May 2021; Endriss, U., Nowé, A., Dignum, F., Lomuscio, A., Eds.; Available online: https://arxiv.org/pdf/2102.05008.pdf (accessed on 17 December 2021).
Pearl, J. Probabilistic Reasoning in Intelligent Systems; Morgan Kaufmann: Burlington, MA, USA, 1988. [Google Scholar]
Howard, R.A.; Matheson, J.E. Influence Diagrams. In Readings on the Principles and Applications of Decision Analysis; Strategic Decisions Group: Menlo Park, CA, USA, 1994; Volume 2, pp. 721–762. [Google Scholar]
Kjaerulff, U.B.; Madsen, A.L. Bayesian Networks and Influence Diagrams; Springer Science+ Business Media 200: Berlin/Heidelberg, Germany, 2008. [Google Scholar]
Nielsen, M.; Plotkin, G.D.; Winskel, G. Petri Nets, Event Structures and Domains, Part I. Theor. Comput. Sci. 1981, 13, 85–108. [Google Scholar] [CrossRef]
Sassone, V.; Nielsen, M.; Winskel, G. A Classification of Models for Concurrency. In Proceedings of the CONCUR ’93, 4th International Conference on Concurrency Theory, Hildesheim, Germany, 23–26 August 1993; Best, E., Ed.; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 1993; Volume 715, pp. 82–96. [Google Scholar]
Clairambault, P.; Gutierrez, J.; Winskel, G. The Winning Ways of Concurrent Games. In Proceedings of the IEEE Symposium on Logic in Computer Science, Dubrovnik, Croatia, 25–28 June 2012; pp. 235–244. [Google Scholar] [CrossRef][Green Version]
Gutierrez, J.; Wooldridge, M. Equilibria of concurrent games on event structures. In Proceedings of the CSL-LICS, Vienna, Austria, 14–18 July; ACM: New York, NY, USA, 2014; pp. 46:1–46:10. [Google Scholar]
Alcolei, A.; Clairambault, P.; Hyland, M.; Winskel, G. The True Concurrency of Herbrand’s Theorem. In Proceedings of the CSL, Birmingham, UK, 4–8 September 2018; Ghica, D.R., Jung, A., Eds.; Schloss Dagstuhl: Wadern, Germany, 2018; Volume 119, pp. 5:1–5:22. [Google Scholar]
Castellan, S.; Clairambault, P.; Winskel, G. Thin Games with Symmetry and Concurrent Hyland-Ong Games. Log. Methods Comput. Sci. 2019, 15, 18:1–18:85. [Google Scholar] [CrossRef]
Thompson, F.B. Equivalence of Games in Extensive Form; Research Memorandum RM-759; Rand Corporation: Santa Monica, CA, USA, 1952. [Google Scholar]
Hearn, R.A.; Demaine, E.D. Games, Puzzles and Computation; A.K. Peters: Natick, MA, USA, 2009. [Google Scholar]
Fudenberg, D.; Tirole, J. Game Theory; The MIT Press: Cambridge, MA, USA, 1991. [Google Scholar]

Figure 1. Extensive-form game of perfect information. The players are depicted besides the nodes at which they are active. Vertices are identified with histories, for instance, the root.

v_{0}

with

()

vertex

v_{3}

with

a_{1} c_{1}

and the leaf node/terminal history labelled Games 13 00002 i001

with

a_{0} b_{00}

omitting parentheses and commas for better readability.

Figure 1. Extensive-form game of perfect information. The players are depicted besides the nodes at which they are active. Vertices are identified with histories, for instance, the root.

v_{0}

with

()

vertex

v_{3}

with

a_{1} c_{1}

and the leaf node/terminal history labelled Games 13 00002 i001

with

a_{0} b_{00}

omitting parentheses and commas for better readability.

Figure 2. The strategic form game associated with the extensive-form game in Figure 1.

Figure 3. A simple example of a partial order game. On the left, the DAG, where each vertex v is labelled

i : x, y

, where i is the player active at the respective vertex and x and y are the actions available to i at v. On the right, the representation of the utility function which associates a numeric value with each action profile.

Figure 3. A simple example of a partial order game. On the left, the DAG, where each vertex v is labelled

i : x, y

, where i is the player active at the respective vertex and x and y are the actions available to i at v. On the right, the representation of the utility function which associates a numeric value with each action profile.

Figure 4. The Boolean circuit

C_{f_{v_{3}}^{14}}

representing strategy

f_{v_{3}}^{14} : A_{v_{1}} \times A_{v_{2}} \to A_{v_{3}}

of Example 1. The value of

x_{0}

represents the choice for a or b at

v_{1}

, the value of

x_{1}

the choice for c or d at

v_{2}

, and the value of

y_{0}

the choice for e or f at

v_{3}

. The local indices of a, c, and e are given by 0, and those of b, d, and f by 1. The instantiation on the right thus represents that

f_{v_{3}}^{14} (a d) = f

.

Figure 4. The Boolean circuit

C_{f_{v_{3}}^{14}}

representing strategy

f_{v_{3}}^{14} : A_{v_{1}} \times A_{v_{2}} \to A_{v_{3}}

of Example 1. The value of

x_{0}

represents the choice for a or b at

v_{1}

, the value of

x_{1}

the choice for c or d at

v_{2}

, and the value of

y_{0}

the choice for e or f at

v_{3}

. The local indices of a, c, and e are given by 0, and those of b, d, and f by 1. The instantiation on the right thus represents that

f_{v_{3}}^{14} (a d) = f

.

Figure 5. The Boolean circuit on the left represents the utility function

u_{i}

on the right for a player i over the action profiles of the game in Example 1. The binary representations of action profiles and numerical values are depicted in parentheses.

Figure 5. The Boolean circuit on the left represents the utility function

u_{i}

on the right for a player i over the action profiles of the game in Example 1. The binary representations of action profiles and numerical values are depicted in parentheses.

Figure 6. Extensive-form game of imperfect information representing the partial order game in Figure 3 assuming topological order

(v_{1}, v_{2}, v_{3}, v_{4})

. Here, for instance,

v_{1}^{1} = ()

,

v_{3}^{2} = a d

, and

v_{4}^{4} = a d f

. The dashed lines connecting vertices indicate the information sets.

Figure 6. Extensive-form game of imperfect information representing the partial order game in Figure 3 assuming topological order

(v_{1}, v_{2}, v_{3}, v_{4})

. Here, for instance,

v_{1}^{1} = ()

,

v_{3}^{2} = a d

, and

v_{4}^{4} = a d f

. The dashed lines connecting vertices indicate the information sets.

Figure 7. Extensive-form game of perfect information illustrating Proposition 4. for every

0 \leq n \leq 31

, we have

n

denote the vector

{(n, n, n, n, n)}^{T}

.

Figure 7. Extensive-form game of perfect information illustrating Proposition 4. for every

0 \leq n \leq 31

, we have

n

denote the vector

{(n, n, n, n, n)}^{T}

.

Figure 8. The DAG for the partial order game with five players illustrating Proposition 4.

Figure 9. Extensive-form game of imperfect information representing the partial order game in Figure 5 assuming topological ordering

(v_{1}, v_{2}, v_{4}, v_{3})

. The dashed lines connecting vertices indicate the information sets.

Figure 9. Extensive-form game of imperfect information representing the partial order game in Figure 5 assuming topological ordering

(v_{1}, v_{2}, v_{4}, v_{3})

. The dashed lines connecting vertices indicate the information sets.

Figure 10. The partial order game from Example 1 with a slightly different utility function.

Figure 11. Another example of a partial order game.

Figure 12. Construction of the DAG for the coNP-hardness proof for fit for backwards induction in the case of the formula φ being defined over five variables p₁ through p₅.

Table 1. Overview of complexity results surrounding Nash equilibrium in partial order games.

	NON-EMPTINESS	IS-NASH	IS-NASH ACTIONS
empty-order games	$Σ_{2}^{p}$ -compl.	$coNP$ -compl.	$coNP$ -compl.
partial order games	$NEXPTIME$ -compl.	$coNP$ -compl.	$NEXPTIME$ -compl.
total-order games	constant time	$coNP$ -compl.	$PSPACE$ -compl.

Table 2. Overview of complexity results surrounding backwards induction in partial order games.

	FIT FOR B-I	IS B-I	IS B-I ACTIONS
partial order games	$coNP$ -compl.	$coNP$ -compl.	$NEXPTIME$ -compl.
total-order games	constant time	$coNP$ -compl.	$PSPACE$ -compl.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zahoransky, V.; Gutierrez, J.; Harrenstein, P.; Wooldridge, M. Partial Order Games. Games 2022, 13, 2. https://doi.org/10.3390/g13010002

AMA Style

Zahoransky V, Gutierrez J, Harrenstein P, Wooldridge M. Partial Order Games. Games. 2022; 13(1):2. https://doi.org/10.3390/g13010002

Chicago/Turabian Style

Zahoransky, Valeria, Julian Gutierrez, Paul Harrenstein, and Michael Wooldridge. 2022. "Partial Order Games" Games 13, no. 1: 2. https://doi.org/10.3390/g13010002

APA Style

Zahoransky, V., Gutierrez, J., Harrenstein, P., & Wooldridge, M. (2022). Partial Order Games. Games, 13(1), 2. https://doi.org/10.3390/g13010002

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Partial Order Games

Abstract

1. Introduction

2. Preliminary Definitions

2.1. Directed Acyclic Graphs and Trees

2.2. Normal-Form Games

2.3. Extensive-Form Games

2.4. Boolean Circuits

3. Partial Order Games

3.1. Concise Representations for Strategies, Profiles, and Utilities

4. Related Game-Theoretic Models

4.1. Boolean Games and Partial Order Boolean Games

4.2. Multi-Agent Influence Diagrams (MAIDs)

4.3. True Concurrency and Games as Event Structures

4.4. Extensive-Form Games

4.4.1. Partial Order Games as Extensive Games of Imperfect Information

4.4.2. Partial Order Games as a Concise Representation of Extensive-Form Games

5. Nash Equilibria

6. Backwards Induction

6.1. Backwards Induction

6.2. Games of Adequate Information

6.2.1. Scrutability

6.2.2. Games of Adequate Information

7. Computing Backwards Induction Solutions for Partial Order Games

7.1. Asymptotic Complexity of Backwards Induction

7.2. Computational Complexity of Backwards Induction

8. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI