Next Article in Journal
The Impact of Discretionary Measurement Criteria on Investors’ Judgement and Decisions
Previous Article in Journal
Self-Enforcing Collective Counterterror Retaliation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Partial Order Games

by
Valeria Zahoransky
1,
Julian Gutierrez
2,
Paul Harrenstein
3 and
Michael Wooldridge
3,*
1
Fachbereich Informatik, Université du Luxembourg, Maison du Nombre, 6 Avenue de la Fonte, L-4364 Esch-sur-Alzette, Luxembourg
2
Faculty of Information Technology, Monash University, Woodside Building for Technology and Design, 20 Exhibition Walk, Clayton, VIC 3168, Australia
3
Department of Computer Science, University of Oxford, Wolfson Building, Parks Road, Oxford OX1 3QD, UK
*
Author to whom correspondence should be addressed.
Games 2022, 13(1), 2; https://doi.org/10.3390/g13010002
Submission received: 19 November 2021 / Revised: 9 December 2021 / Accepted: 15 December 2021 / Published: 21 December 2021
(This article belongs to the Section Algorithmic and Computational Game Theory)

Abstract

:
We introduce a non-cooperative game model in which players’ decision nodes are partially ordered by a dependence relation, which directly captures informational dependencies in the game. In saying that a decision node v is dependent on decision nodes v 1 , , v k , we mean that the information available to a strategy making a choice at v is precisely the choices that were made at v 1 , , v k . Although partial order games are no more expressive than extensive form games of imperfect information (we show that any partial order game can be reduced to a strategically equivalent extensive form game of imperfect information, though possibly at the cost of an exponential blowup in the size of the game), they provide a more natural and compact representation for many strategic settings of interest. After introducing the game model, we investigate the relationship to extensive form games of imperfect information, the problem of computing Nash equilibria, and conditions that enable backwards induction in this new model.

1. Introduction

The two most important game models in non-cooperative game theory are normal form games and extensive form games. These games are distinguished by the information that players have about the strategies of other players. In normal form games, players must select and commit to strategies without any information relating to the strategies of others. In contrast, in extensive form games, players make moves alternately over time and, during play, their strategy may be informed by moves made previously. Variations of extensive form games—e.g., of imperfect information or recall—make it possible to capture the information available to a player when called upon to make a move.
In this article, we introduce partial order games. The key distinguishing feature of partial order games is that they are equipped with a dependence relation, which explicitly captures the informational dependencies between decision nodes. In saying that a decision node v is dependent on decision nodes v 1 , , v k , we mean that the information available to a strategy making a choice at v is precisely the choices that were made at v 1 , , v k . Observe that the informational dependencies captured by a dependence relation give rise to a partial temporal order over choices: in this case, the choices for v 1 , , v k must strictly precede the choice for v. However, the temporal ordering induced in this way is indeed partial: it may be that two choices v and v are independent, in which case one can say nothing about their temporal order.
In a technical sense, partial order games are no more expressive than extensive form games of imperfect information: one of our key results is to show that every partial order game can be transformed into a strategically equivalent extensive form game of imperfect information. However, for some settings, partial order games have significant advantages over their extensive form representation:
  • First, informational dependencies in partial order games are explicitly captured, while in extensive form games, they are left implicit in the information sets of the game. As a consequence, some settings are much more transparently and naturally represented using partial order games, compared to the extensive form.
  • Second, partial order games can be exponentially more compact than their extensive form. From a purely practical point of view, this means that some scenarios with a compact partial order representation cannot be handled (by people or by a computer) in their extensive form.
We analyse partial order games by means of Nash equilibrium—arguably game theory’s most prominent non-cooperative solution concept—and the solutions given by a natural backwards induction procedure for partial order games defined on the dependence relation. Here, we give special attention to the computational complexity of calculating such solutions in partial order games. The remainder of this article is structured as follows.
  • We begin by briefly recalling some necessary concepts from graph theory and game theory, and then introduce the formal framework of partial order games.
  • As we are interested in the computational properties of partial order games, we introduce a compact representation for strategies and utility functions in partial order games, based on Boolean circuits, which enables us to investigate questions about their computational complexity.
  • In Section 4, we study the relationship of our game model to other game models: partial order Boolean games [1], Multi-Agent Influence Diagrams (MAIDs) [2,3], and extensive form games of imperfect information. Our main result in this section is to present a technique for translating partial order games into strategically equivalent extensive form games, although this translation comes at the cost of an exponential blowup in the size of the game. This leads us into a discussion of the use of partial order games as a compact representation of extensive form games.
  • In Section 5, we investigate the problem of computing Nash equilibria in partial order games. For example, we show that checking whether a game has any pure strategy Nash equilibria is NEXPTIME-complete.
  • In Section 6, we study backwards induction in partial order games. As partial order games are inherently games of imperfect information, it follows that backwards induction does not always work for partial order games. We thus investigate cases where forms of backward induction work for partial order games, and present a condition on games that we call fit for backwards induction, which is sufficient to allow backwards induction.
  • We conclude with a brief discussion and pointers for future work.

2. Preliminary Definitions

We begin by recalling some concepts from graph theory and game theory, which we use in the remainder of the paper.

2.1. Directed Acyclic Graphs and Trees

The games in this paper are defined on directed acyclic graphs (DAGs). Formally, a directed acyclic graph is a pair ( V , E ) , where V = { v 1 , , v | V | } is a finite set of vertices or nodes and E V × V a set of directed edges (or arcs) on V. We assume E to be acyclic (and thus also irreflexive). We say that vertex u is a parent of vertex v, and v a child of u, if ( u , v ) E . We occasionally also use infix notation and write u E v for ( u , v ) E . The depth of a vertex v, denoted by depth ( v ) , is defined recursively such that if v has no parents, depth ( v ) = 0 , and otherwise, depth ( v ) equals the maximum depth of v’s parents plus 1. Formally,
depth ( v ) = 0 if v has no parents , max ( u , v ) E depth ( u ) + 1 otherwise .
We say that vertex v is reachable from vertex u if u = v , or if there is some vertex w such that w is reachable from u and ( w , v ) E , that is if u   E * v , where E * is the reflexive and transitive closure of E.
A topological sorting is a permutation τ = ( u 1 , , u | V | ) of the vertices v 1 , , v | V | in V such that u i   E   u j implies i < j . Every DAG ( V , E ) admits at least one topological sorting of its vertices; this well-known fact will be important and useful in our study of partial order games.
A tree is a directed acyclic graph in which there is a unique vertex with no parents (the root of the tree, often denoted by v 0 ), and where every non-root vertex has a single parent. If a tree is finite, then, as it is acyclic, some vertices will have no children: we refer to these as the leaves of the tree. Observe that the root of the tree is the unique vertex of depth 0.

2.2. Normal-Form Games

We use normal-form games as the basis of our game-theoretic analysis of partial order games. Normal-form games are defined by a set of players, the strategies the players have at their disposal, and the preferences the players have over the outcomes that players choosing their strategies give rise to. Formally, a normal-form game is given by a tuple ( N , ( S i ) i N , ( u i ) i N ) , where N = { 1 , , | N | } is the set of players, S i is the set of (pure) strategies available to player i, and u i : S 1 × × S | N | R is a utility function for each player i [4]. We refer to tuples s = ( s 1 , , s | N | ) in S 1 × × S | N | as strategy profiles.
Nash equilibrium is the most important solution concept in non-cooperative game theory; it captures the idea of a joint course of action that is stable in the sense that no player has an incentive to deviate unilaterally from it. Furthermore, in this paper, it is one of the main analytic tools by means of which we evaluate partial order games. Formally, a strategy profile s = ( s 1 , , s | N | ) is a Nash equilibrium if, for all players i and strategies s i S i , we have
u i ( s ) u i ( s i , s i ) ,
where ( s i , s i ) = ( s 1 , , s i 1 , s i , s i + 1 , , s | N | ) .
We say that two normal-form games ( N , ( S i ) i N , ( u i ) i N ) and ( N , ( S i ) i N , ( u i ) i N ) are strategically isomorphic if there are bijections ϕ : N N and ψ i : S i S f ( i ) such that, for all players i in N and all strategy profiles s = ( s 1 , , s n ) and s = ( s 1 , , s n ) we have
u i ( s ) u i ( s ) if and only if u ϕ ( i ) ( ψ ( s ) ) u ϕ ( i ) ( ψ ( s ) )
where ψ ( s ) = ( ψ i ( s 1 ) , , ψ n ( s n ) ) and ψ ( s ) = ( ψ i ( s 1 ) , , ψ n ( s n ) ) .
Given a normal-form game, two strategies s i , s i in S i for a player i are said to be equivalent, in symbols s i s i , if u j ( s i , s i ) = u j ( s i , s i ) for all profiles s i S 1 × × S i 1 × S i + 1 × × S | N | and all players j. Let [ s i ] be the equivalence class of s i under the equivalence relation ∼, and denote [ s ] = ( [ s 1 ] , , [ s | N | ] ) . Following the work in [5], we define the reduced normal-form of a normal-form game ( N , ( S i ) i N , ( u i ) i N ) as the game ( N , ( S i ) i N , ( u i ) i N ) where for each player i, S i = { [ s i ] : s i S i } and u i ( [ s ] ) = u i ( s ) for all strategy profiles s = ( s 1 , , s | N | ) . Two games are then said to be strategically equivalent if they induce strategically isomorphic reduced normal-forms.

2.3. Extensive-Form Games

An extensive-form game (of imperfect information) is based on a directed tree ( V , E ) and is played by a set of players N choosing from a set of actions A, starting from the root of the tree. Within the set V of vertices, we distinguish between action or decision nodes, which have children, and leaf nodes, which have no children. The set of action nodes we denote by D and the set of leaf nodes by L. With every action node v in D we associate a unique player i v , who is then said to be active at v, and an action set A v A such that every outgoing edge ( v , u ) is associated with a unique action in A v . Let V i denote the set of action nodes associated with player i, that is, V i = { v V : i v = i } , and let A i = { a A v : v V i } .
Every vertex v is identified with a unique sequence of actions, called a history, which leads from the root v 0 to v. Histories associated with leaf nodes are referred to as terminal histories. We will denote the history associated with vertex v by h v . We can also think of vertices v as histories: vertex v is the sequence h v = ( a 1 , , a depth ( v ) ) of actions for which there is a path v 0 = w 0 , w depth ( v ) = v such that ( w i , w i + 1 ) is associated with action a i + 1 for every 0 i < depth ( v ) . The root v 0 is thus the empty sequence ( ) . Thus, we also find that v E w if and only if w = v a for some a A v .
For every player i, we have a partitioning I i = { I 1 , , I | I i | } of their decision nodes V i into (non-empty) information sets. We write [ v ] for the information set vertex v belongs to. Here, it is understood that A u = A v , whenever u and v are in the same information set I. We then also write A I for A u and A v . Information sets may be singletons. If all information sets for all players are singletons, we say the game is of perfect information and do not distinguish between information set { v } and the node v itself, if no confusion is likely.
Finally, each player i’s preferences are represented by a utility value u h i R at each Version December 9, 2021 submitted to Games terminal history/leaf h . See Figure 1 for an example.
A strategy for player i in an extensive-form game is a function s i : I i A i such that s i ( I ) A I for all I I i . The set of strategies available to player i we then denote by S i . A strategy profile is then a sequence s = ( s 1 , , s | N | ) of strategies, one for each player. Let D = { v 1 , , v | D | } . Then, every strategy profile s = ( s 1 , , s | N | ) defines an action profile  a s = ( a v 1 , , a v | D | ) such that a v = s i ( [ v ] ) and where i is a player who is active at v. Observe that a strategy profile s = ( s 1 , , s | N | ) also defines a unique path w 0 , , w k from the root w 0 to a leaf w k with history h s = ( a 1 , , a k ) defined such that, for all 0 i < k , edge ( w i , w i + i ) is associated with action a i = s i ( [ w i ] ) , where i is the player who is active at w i . Finally, we have for each player i a utility function u i : S 1 × × S | N | R associating each strategy profile s = ( s 1 , , s | N | ) with a real value such that u i ( s ) = u i h s .
The players N of an extensive-form game, together with their strategies S i and their utilities u i , induce a normal-form game ( N , ( S i ) i N , ( u i ) i N ) . Note that there may be multiple strategy profiles giving rise to the same terminal history, that is, it may well be that h s = h s , even though s s . Therefore, the size of this normal-form game may be exponential in the size of the underlying extensive-form game, if represented naively. This is because there are I I | A I | = O ( | A | | V | ) strategy profiles, which are implicit in the definition of an extensive-form game: see Figure 2 for an illustration.

2.4. Boolean Circuits

In computer science, Boolean circuits are a well-established model for computing Boolean functions (cf., e.g., [6,7]). In this paper, we will make extensive use of them to concisely represent strategies and utility functions. We will here briefly review their definition, largely following Jukna’s exposition [7].
A k-ary Boolean function is a function β : { 0 , 1 } k { 0 , 1 } , where we allow k = 0 . The base of a Boolean circuit is given by a set B = { β 1 , , β k } of Boolean functions. In this paper, we will restrict attention to the set of classical Boolean functions B = { ¬ , , } (“not”, “or”, and “and”, respectively), which is known to be functionally complete (i.e., sufficient to define any Boolean function).
Formally, a Boolean circuit (or straight line program) on n variables x 1 , , x n over base B is given by a sequence of t n gates γ 1 , , γ t . The first n gates are given by the variables, that is, γ 1 = x 1 , , γ n = x n , and are also referred to as input gates. Another subset of gates is singled out as the set of output gates. Boolean formula over variables x 1 , , x n are thus represented by a Boolean circuit with only one output gate. Each subsequent gate γ i is the application of a k-ary Boolean function β i in the base B to k previous gates, that is, γ i = β i ( γ j 1 , , γ j k ) , where j 1 , , j k < i . The variables take values in { 0 , 1 } . Given values α 1 , , α n in { 0 , 1 } for the variables x 1 , , x n , one can inductively associate each gate with a value α ( γ i ) such that α ( x i ) = α i , if 1 i n , and
α ( β i ( γ j 1 , , γ j k ) ) =   β i ( α ( γ j 1 ) , , α ( γ j k ) ) ,
if n < i t . A Boolean circuit C given by γ 1 , , γ t on n variables x 1 , , x n and with m output gates γ j 1 , , γ j m then computes the function ϕ C : { 0 , 1 } n { 0 , 1 } m that, on values α = ( b 1 , , b n ) for the input gates, yields ϕ C ( α ) = ( α ( γ j 1 ) , , α ( γ j m ) ) .
A Boolean circuit is commonly depicted as a directed acyclic graph ( V , E ) , with the gates as vertices, that is, V = { γ 1 , , γ n } , and ( γ i , γ j ) E whenever γ i = β i ( γ j 1 , , γ j k ) and γ j = γ j for some 1 n . For examples of Boolean circuits, also see Figure 3 and Figure 4, below.
The size of a Boolean circuit given by gates γ 1 , , γ t on variables x 1 , , x n is given by the number t n of its gates minus the input gates. It is a well-established fact that the problem of computing ϕ C ( α ) for given values α { 0 , 1 } n for C ’s variables, which is also known as the Circuit Value Problem or Circuit Evaluation Problem, is complete for P under uniform AC 0 -reductions, and therefore can be computed in polynomial time (see in [8], p. 59).

3. Partial Order Games

We now introduce the framework of partial order games. The basic idea is that, as in extensive form games, the game contains a number of decision nodes, which are partioned among the players. However, play in the game is not defined by a game tree. Instead, partial order games have a binary dependence relation over decision nodes. This dependence defines the information available to a player when it makes a choice. If a decision node v for player i is dependent on decision nodes v 1 , , v k , then this means that the information available to a strategy when making a choice at v is precisely the choices that were made at decision nodes v 1 , , v k . To play a partial order game, a player must choose a selection of strategies (we call them vertex strategies), one for each of their decision nodes. A strategy for a decision node v must select a choice for that decision node taking as input the choices that were made for the decision nodes v 1 , , v k upon which it is dependent. In this way, we have an explicit representation of the information available when making a choice, in contrast to the use of information sets in extensive form games. Utilities in partial order games are not associated with individual nodes (as in extensive form games), but derive from the total profile of actions that were made.
Our usual way of thinking about extensive form games is as players alternating to make moves, working their way down the game tree to a leaf node; upon reaching a leaf node, the game is over. Thinking about games in this way naturally gives rise to a temporal order over choices in the game: a choice is made for the root node first, and then successive nodes in the game tree. Partial order games are more abstract than this. They induce only a partial temporal ordering over decision nodes: while the choices for v 1 , , v k must strictly precede the choice for v if v is dependent on v 1 , , v k , it may well be that decision nodes are independent: if nodes v and v are in disconnected components of the dependence graph, then we can say nothing about their temporal order.
Let us consider an example.
Example 1.
Consider the three-person partial order game depicted in Figure 3, with the directed acyclic graph it is based on given on the left. To the right of each vertex v, we have indicated the player i v and the actions they have at their disposal at v. For instance, at vertex v 2 , player 2 can choose among the actions c and d. If all players make a choice at their respective vertices, an action profile results, which then is associated with a utility value. For instance, if player 1 chooses a at v 1 and h at v 4 , player 2 chooses c at v 2 , and player 3 chooses f at v 3 , the action profile ( a , c , f , h ) (also denoted by a c f h ) results, yielding utility values of 1 to player 1, and 2 to players 2 and 3. The table on the right summarises the utilities the players get under the different action profiles that may be played. Note that in this game there are 2 4 possible action profiles.
The crucial feature of partial order games, however, is that the players can make their choices at a given vertex v depending on the choices the other players make at the parents of v. For instance, player 2 could adopt the strategy to play c at v 2 , if player 1 chooses a at v 1 , and plays d otherwise. Accordingly, player 2 has 2 2 = 4 strategies at their disposal at v 2 . As v 1 does not have any parents, player 1 has only two strategies at their disposal at v 1 , but they have 2 2 = 4 strategies at v 4 . Meanwhile, player 3 has 2 4 = 16 strategies at v 3 . A strategy profile specifies a strategy for each player at each of their vertices. Accordingly, in this game there are 2 × 2 2 × 2 4 × 2 2 = 2 9 = 512 possible strategy profiles.
The number of strategy profiles clearly outnumber the number of action profiles. Still, every strategy profile induces a unique action profile. This relationship, however, is not generally injective, as multiple strategy profiles may induce the same action profile. Assume, for instance, that the players adopt the following strategies:
  • player 1 chooses a at v 1 ;
  • player 2 chooses c if player 1 plays a at v, and d, otherwise;
  • player 3 chooses e if player 1 chooses a at v 1 and player 2 d at v 2 , and f, otherwise; and
  • player 4 chooses h independently of whether player 2 chooses c or d at v 2 .
If this strategy profile is played, it can readily be seen that action profile a c f h results, which we have already seen gives utilities 1, 2, and 2 to the players 1, 2, and 3, respectively.
We now proceed to our formal definition of partial order games, which enables the application of Nash equilibrium as well as the formulation of a natural backwards induction procedure later on. As before, let N = { 1 , , n } be a finite set of players. Then a partial order game on a directed acyclic graph ( V , E ) —the game’s dependency graph—associates each vertex v in V with a unique player in N, denoted by i v , and a non-empty set of actions, denoted by A v .
One player may be associated with multiple vertices, that is, it may very well be that i u = i v even if v u . We let V i denote the set of vertices associated with player i. We also say that player i is active at any vertex v in V i . If E is also connected—that is, if v E w or w E v for all distinct v and w in V—we also refer to any partial order game based on it as a total-order game. As E is assumed to be acyclic, it follows that E is transitive in total order games. On the other end of the spectrum, we have partial order games on dependency graphs ( V , E ) with E = Ø . This class of games we will also refer to as empty-order games.
Given a subset of W = { w 1 , , w k } of vertices, an action profile for W is a tuple a W = ( a w 1 , , a w k ) specifying one action a w j for each vertex w j . In our examples, we occasionally omit parentheses and commas, and write a w 1 a w k for ( a w 1 , , a w k ) . The set of action profiles for W we denote by A W . For the set V = { v 1 , , v | V | } of all vertices, we write write a = ( a v 1 , , a v | V | ) for a V and refer to a simply as an action profile. We denote the set of action profiles by A = A v 1 × × A v | V | . For a set V i = { w 1 , , w k } of vertices at which a player i is active, we also write a i = ( a w 1 , , a w k ) for a V i and refer to a i as an action profile for player i. We let A i denote the set of actions profiles for a player i. For action profiles a W = ( a w 1 , , a w k ) and b U = ( b u 1 , , b u m ) for disjoint sets W and U, we denote by ( a W , b U ) the action profile ( a w 1 , , a w k , b u 1 , , b u m ) for W U . If U = V \ W we also write ( a W , a W ) for ( a W , a U ) , and if U = V \ V i , we also write ( a i , a i ) . Given a topological sorting τ = ( v 1 , , v | V | ) , we define a τ -history as a sequence of actions h = ( a v 1 , , a v k ) in A v 1 × × A v k for some 1 k | V | . We stipulate the empty sequence of actions, denoted by ( ) , to be a τ -history as well. For an action profile a = ( a v 1 , , a v | V | ) , we say that h = ( b v 1 , , b v k ) is a history of a if ( b v 1 , , b v k ) = ( a v 1 , , a v k ) .
At each vertex v, player i v can make their choice of action dependent on the actions chosen at the parents of v. A (conditional) strategy at a vertex v, or vertex strategy, is therefore a function
f v : A y 1 × × A y k A v ,
where y 1 , , y k are the parents of v. We say that f v is unconditional if it is constant, that is, if f v ( a 1 , , a k ) = f v ( b 1 , , b k ) for all action profiles a = ( a 1 , , a k ) and b = ( b 1 , , b k ) . In this case, for a A v , we denote by f v a the unconditional strategy at vertex v that maps every action profile of v’s parents to action a. We will sometimes identify unconditional strategies f v with the action f v ( a 1 , , a k ) it specifies for each action profile a = ( a 1 , , a k ) and write a = ( a v 1 , , a v | V | ) for the profile of unconditional strategies f a = ( f v 1 a v 1 , , f v | V | a v | V | ) . Note that if vertex v has no parents, then A u 1 × × A u k = { ( ) } , where ( ) is the empty tuple. Accordingly, f v determines a single choice among A v , namely, f v ( ( ) ) in this case. By F v we denote the set of conditional strategies available at a vertex v.
By a strategy profile we then understand a profile f = ( f v 1 , , f v | V | ) consisting of one conditional strategy for each vertex. For f = ( f v 1 , , f v | V | ) a strategy profile and W = { w 1 , , w k } a subset of vertices in V, we let f W = ( f w 1 , , f w k ) denote the partial profile that is like f but restricted to W V . Let f 1 = ( f v 1 1 , , f v | V | 1 ) through f k = ( f v 1 k , , f v | V | k ) be k strategy profiles and { X 1 , , X k } be a k-partition of the vertex set V. Then, ( f X 1 1 , , f X k k ) denotes the strategy profile g = ( g v 1 , , g v | V | ) such that, for every vertex v in V, we have g v = f v j , if v X j .
A strategy for a player i is a profile f V i = ( f v 1 , , f v k ) of vertex-strategies, where V i = { v 1 , , v k } , is the set of vertices at which player i is active. We generally denote f V i by f i , and the set of strategies available to player i by F i . For strategy profiles f = ( f v 1 , , f v | V | ) and g = ( g v 1 , , g v | V | ) and a player i, we also write ( f i , g i ) for the strategy profile ( f V \ V i , g V i ) .
For every vertex v, a strategy profile f = ( f v 1 , , f v | V | ) defines an action a v f , the evaluation of f at v, recursively as follows:
a v f = f v ( ( ) ) if depth ( v ) = 0 , f v ( a u 1 f , , a u k f ) if depth ( v ) > 0 and u 1 , , u k are the parents of v .
Observe that a v f is well defined because in a directed acyclic graph vertices with depth 0 exist and the parents of each vertex are of lower depth than the vertex itself. A strategy profile f = ( f v 1 , , f v | V | ) can thus be seen to induce the action profile a f = ( a v 1 f , , a v | V | f ) . We also say that strategy profile f = ( f v 1 , , f v | V | ) sustains action profile a = ( a v 1 , , a v | V | ) if a = a f . We occasionally denote by ( a W , f U ) the action profile a g where g is the strategy profile ( f W a , f U ) which specifies unconditional strategies for W. In other words, we have ( a v 1 , , a v k , f v k + 1 , , f v | V | ) denote the action profile that results if on v 1 through v v k actions a v 1 , , a v k are played, and at vertices a v k + 1 through a v | V | the strategies f v k + 1 through f v | V | .
Note the difference between actions and action profiles on the one hand, and strategies and strategy profiles on the other. Every strategy profile induces a unique action profile, whereas the same action profile may be induced by different strategy profiles. In an important sense, the action profiles are the outcomes of the game.
Therefore, we take the set of action profiles of a partial order game as its set of outcomes, over which the players’ preferences are defined. Each player’s preferences over the action profiles are given by a real-valued utility function  u i : A R , where we assume that player istrictly prefers action profile a = ( a v 1 , , a v | V | ) to action profile b = ( b v 1 , , b v | V | ) whenever u i ( a ) > u i ( b ) . We extend utility functions u i to strategy profiles f , and write u i ( f ) for u i ( a f ) . To fix concepts and notation, we consider Example 1 once more.
Example 1 (cont’d).
Consider the game in Figure 3. At vertices v 1 , v 2 , and v 4 , we have the following strategies:
F v 1 ( ) F v 2 a b F v 4 c d f v 1 1 a f v 2 1 c c f v 4 1 g g f v 1 2 b f v 2 2 c d f v 4 2 g h f v 2 3 d c f v 4 3 h g f v 2 4 d d f v 4 4 h h
At vertex v 3 , moreover, we have 2 4 strategies, illustrating the exponential blowup that results from strategies being represented explicitly.
F 3 a c a d b c b d a c a d b c b d a c a d b c b d a c a d b c b d f v 3 1   e   e     e     e f v 3 5   e   f     e     e f v 3 9   f   e     e     e f v 3 13   f   f     e     e f v 3 2   e   e     e     f f v 3 6   e   f     e     f f v 3 10   f   e     e     f f v 3 14   f   f     e     f f v 3 3   e   e     f     e f v 3 7   e   f     f     e f v 3 11   f   e     f     e f v 3 15   f   f     f     e f v 3 4   e   e     f     f f v 3 8   e   f     f     f f v 3 12   f   e     f     f f v 3 16   f   f     f     f
The (conditional) strategy profile f = ( f v 1 1 , f v 2 3 , f v 3 8 , f v 4 2 ) , then, for instance, yields the evaluation a f = ( a v 1 f , a v 2 f , a v 3 f , a v 4 f ) = ( a , d , f , h ) as
a v 1 f = f v 1 1 ( ( ) ) = a a v 2 f = f v 2 3 ( a v 1 f ) = f v 2 3 ( a ) = d a v 3 f = f v 3 8 ( a v 1 f , a v 2 f ) = f v 3 8 ( a , d ) = f a v 4 f = f v 4 2 ( a v 2 f ) = f v 4 2 ( d ) = h
Accordingly, the utilities on this profile f = ( f v 1 1 , f v 2 3 , f v 3 8 , f v 4 2 ) for these three players are therefore determined by the action profile a d f h . Hence, u 1 ( f ) = 0 , u 2 ( f ) = 0 , and u 3 ( f ) = 2 .
With the players N, their conditional strategies F i , and their preferences u i : F R over strategy profiles, a partial order game on a directed acyclic graph ( V , E ) can thus be seen to define immediately a game in normal-form ( N , ( F i ) i N , ( u i ) i N ) (cf., Section 2.2, above). Accordingly, partial order games are directly amenable to game-theoretical analysis using the usual solution concepts, in particular, Nash equilibrium.
Conversely, every normal-form game ( N , ( S i ) i N , ( u i ) i N ) can be seen as a partial order game on the same set N of players. Its dependency graph is then given by ( N , Ø ) , associating with vertex i N , player i N and strategy set S i . Identifying each unconditional strategy f i with f i ( ( ) ) = s i in S i , we adopt the utility functions u i unaltered for the utility functions of the partial order game.

3.1. Concise Representations for Strategies, Profiles, and Utilities

The transformation from partial order games to normal-form games comes at the cost of an exponential blowup. Observe that in the definition of partial order games, the sets of action profiles and strategy profiles are defined implicitly. Given a partial order game on a directed acyclic graph ( V , E ) with a set of players N and actions A = A v 1 A v | V | , the number of action profiles a = ( a v 1 , , a v | V | ) is bounded by O ( | A | | V | ) , and so is the number of unconditional strategy profiles f a = ( f v 1 a v 1 , , f v | V | a v | V | ) . By contrast, the number of both conditional strategies and the number of conditional strategy profiles are bounded by O ( | A | ( | A | | V | ) ) . It is also worth observing that the size of both a conditional strategy f and a profile f = ( f v 1 , , f v | V | ) of conditional strategies is bounded by O ( | A | | V | ) .
Even though the dependency graph allows a concise representation of the strategies and strategy profiles in a partial order game, they remain large objects. It is therefore desirable to have concise representations of action profiles, utility functions, strategies, and strategy profiles. To this end, we take recourse to Boolean circuits, a formalism well established in theoretical computer science (see Section 2.4). We first show how action profiles, utility functions, and strategy profiles can be represented concisely by Boolean circuits. We then prove two lemmas that will be useful for our later complexity proofs in Section 5 and Section 7.
When representing partial order games concisely, we assume that for distinct vertices v and w the action sets A v and A w are disjoint. Having assumed A v to be non-empty for each vertex v, it thus follows that | V | | A | . For ease of presentation, we also assume that | A v | is an integer power of 2 for each vertex v. In this section, we assume that for each vertex v, the elements of set A v are enumerated as a 0 , , a | A v | 1 , thus associating each action a k with a unique local index k at v. Let bin ( k ) denote the binary representation of the local index k, using exactly  log 2 | A v | digits. Thus, if | A v | = 16 , we have bin ( 3 ) = 0011 .
Let v be a vertex with parents u 1 , , u k . We then represent a strategy f v : A u 1 × × A u k A v by a Boolean circuit C f v with log 2 | A u 1 | + + log 2 | A u k | input gates and log 2 | A v | output gates. For an action profile a = ( a u 1 , , a u k ) in A u 1 × × A u k and a v an action in A v , we then have that f v ( a u 1 , , a u k ) = a v if and only if on input bin ( j 1 ) bin ( j k ) the circuit C f v evaluates to outputs bin ( i ) , where j 1 , , j k and i, are the local indices of a v 1 , , a v k , and a v at their respective vertices. As C f v can be seen as combining log 2 | A v | Boolean functions in log 2 | A u 1 | + + log 2 | A u k | variables and log 2 | A u 1 | + + log 2 | A u k | | A | , we may assume that the circuit C f v is of size at most exponential in | A | . More precisely, we may assume that the size of C f v is Θ ( 2 | A | | A | ) [9,10].
Example 2.
For an example, see Figure 4, which depicts the Boolean circuit C f v 3 14 for strategy f v 3 14 : A v 1 × A v 2 A v 3 in Example 1, given by
f v 3 14 ( a ) = e if a = b c , f otherwise .
Recall that A v 1 = { a , b } , A v 2 = { c , d } , and A v 3 = { e , f } . Let the local indices of a, c, and e be given by 0, and those of b, d, and f by 1. As | A v 1 |   =   | A v 2 |   =   | A v 3 |   =   2 , any action profile a A v 1 × A v 2 can be represented by two Boolean variables x 0 and x 1 , where the value of x 0 represents the choice for a or b at v 1 , the value of x 1 the choice for c or d at v 2 . For instance, the action profile a d A v 1 × A v 2 can thus be represented by setting x 0 to 0 and x 1 to 1. For these values the circuit C f v 3 14 evaluates to 1, which corresponds to f v 3 14 ( a d ) = f and as depicted in Figure 4.
Observe that an unconditional strategy f v a at a vertex v with parents u 1 , , u k , which maps every profile ( a u 1 , , a u k ) invariably to action a in A v , is represented by a Boolean circuit C f v a of size at most polynomial in | A | . The circuit C f v a will still have 1 m k log 2 | A u m | inputs and log 2 | A v | outputs, but will involve only one -gate and one -gate. Let j be the local index of a at v and bin ( j ) = b 1 , , b log 2 | A v | . Then, for 1 log 2 | A v | , connect the -gate with output y , if b = 0 , and the -gate with output y , if b = 1 .
Similarly, a rational-valued utility function u i : A v 1 × × A v | V | Q can be represented by a Boolean circuit C u i . This circuit will have v V log 2 | A v | inputs and outputs y 1 , , y k , z 1 , , z m . Moreover, for ( a v 1 , , a v | V | ) an action profile in A v 1 × × A v | V | with local indices j 1 , , j | V | at their respective vertices, we have u i ( a v 1 , , a v | V | ) = n d + 1 if and only if on input bin ( j 1 ) , , bin ( j | V | ) the circuit C u i yields bin ( n ) for outputs y 1 , , y k and bin ( d ) for outputs z 1 , , z m (Observe that if u i ( a v 1 , , a v | V | ) is a positive integer, the outputs z 1 , , z m can be dispensed with, as x Ø x = 0 and d = 1 j m z j · 2 j ). Again, as | A u 1 | + + | A u k | | A | , we may assume that any such circuit C u i will be of size at most exponential in | A | .
Example 3.
Figure 5 illustrates how a utility function u i : A v 1 × A v 2 × A v 3 × A v 4 R for a player i in the game of Example 1 is represented by a Boolean circuit. The utility function u i is depicted on the right. We assume that the local indices of a, c, e, and g be given by 0, and those of b, d, f, and h by 1. As | A v 1 |   =   | A v 2 |   =   | A v 3 |   =   | A v 4 |   =   2 , an action profile a in A v 1 × A v 2 × A v 3 × A v 4 is thus given by means of an assignment to four Boolean variables x 0 , x 1 , x 2 , and x 4 , representing the choices at the vertices v 1 , v 2 , v 3 , and v 4 , respectively. For instance, action profile a c f h is represented by setting variables x 0 and x 1 to 0, and variables x 2 and x 3 to 1, which can thus also be denoted by the binary string 0011. Evaluating the circuit for these values, we find that y 1 is set to 0 and y 0 to 1. This corresponds to u i ( a c f h ) = 1 and bin ( 1 ) = 01 . Observe that the Boolean circuit can be evaluated in time polynomial in its size and that it is exponentially smaller than the explicit tabelling of the utility function on the right.
We conclude this section by showing two useful lemmas. The first establishes that, even if a conditional strategy profile f = ( f v 1 , , s v | V | ) is represented as a Boolean circuit, the action profile a f = ( a v 1 , , a v | V | ) that f gives rise to, can be computed in polynomial time.
Lemma 1.
Let f = ( f v 1 , , f v | V | ) be a conditional strategy profile represented by Boolean circuits C f v 1 , , C f v | V | . Then, the action profile a f = ( a v 1 f , , a v | V | f ) can be computed in polynomial time.
Proof Sketch.
Proceeding inductively, find for every vertex v the action a v f as follows. For vertices v of depth 0, the Boolean circuit C f v should give the local index in binary of action a v f at v, and therewith, a v f immediately. For a vertex v of a strictly positive depth with parents u 1 , , u k , we may assume that the local indices in binary bin ( j u 1 ) , , bin ( j u k ) of a u 1 f , , a u k f can be computed in polynomial time. As the evaluation problem for Boolean circuits is solvable in polynomial time, the circuit C f v for inputs bin ( j u 1 ) , , bin ( j u k ) can also be evaluated in polynomial time, providing us with the local index of f v ( a u 1 f , , a u k f ) at v in binary, which gives us the result. □
Lemma 2.
Let f = ( f v 1 , , f v | V | ) be a conditional strategy profile represented by Boolean circuits C f v 1 , , C f v | V | , and u i : A v 1 × × A v | V | Q be a utility function represented by a Boolean circuit C u i . Then, u i ( f ) can be computed in polynomial time.
Proof Sketch.
Thanks to Lemma 1, we can compute the action profile a f = ( a v 1 f , , a v | V | f ) in polynomial time. Let j 1 , , j | V | be the local indices of a v 1 f , , a v | V | f at their respective vertices. Using bin ( j 1 ) , , bin ( j | V | ) as inputs for the circuit C u i , we obtain the binary encoding of u i ( a v 1 f , , a v | V | f ) . As the evaluation problem for Boolean circuits is solvable in polynomial time, this suffices for the result. □

4. Related Game-Theoretic Models

In this section, we explore the interrelations between partial order games and four related game-theoretic models: partial order Boolean games, Multi-Agent Influence Diagrams (MAIDs), concurrent games as event structures, and extensive-form games.

4.1. Boolean Games and Partial Order Boolean Games

A special class of partial order games, referred to as partial order Boolean games, was introduced in [1]. Partial order Boolean games extend Boolean games [11,12,13,14,15], where for a set Φ of propositional variables, each player i exercises unique control over the truth-values assigned to a subset Φ i of variables and aims to satisfy a goal formula γ i over Φ . Partial order Boolean games are Boolean games enriched with a dependency graph on Φ . To avoid confusion, we will write v p for the propositional variable p when it occurs in the role of a vertex in the dependency graph. The player i associated with a vertex v p coincides with the player controlling p in the Boolean game, and i can make their choice of truth-value for p depend on the values assigned to the propositional variables q 1 , , q k if v q 1 , , v q k are the parents of v p in the dependency graph. In a Boolean game, such dependencies do not exist. Thus, Boolean games can be seen as a special class of partial order Boolean games. The class where the dependency graph does not have any edges but can be expressed by ( Φ , Ø ) .
Clearly, every partial order Boolean game ( N , Φ , Φ 1 , , Φ n , γ 1 , , γ n , D ) , where D = ( Φ , E ) is a dependency graph on Φ , defines a partial order game with the same set of players N and the same dependency graph, where player i is associated with vertex v p if and only if p Φ i . At vertex v p , the player i associated with v p has two actions at their disposal: setting p to true (p) and setting p to false ( p ¯ ). We will assume that the local index of action p at vertex v p is 1 and that that of action p ¯ at v p is 0.
In a partial order Boolean game, an individual strategy at vertex v p , controlled by i, is represented by a so-called choice equation of the form p : φ ( q 1 , , q k ) , where v q 1 , , v q k are the parents of v p and φ ( q 1 , , q k ) is a propositional logic formula over q 1 , , q k , with the interpretation that i sets p to true if φ ( q 1 , , q k ) evaluates to true given the decisions made for q 1 , , q k at, v q 1 , , v q k , and to false otherwise. Thus a choice equation p : φ ( q 1 , , q k ) defines an individual strategy f v p : { q 1 , q ¯ 1 } × × { q k , q ¯ k } { p , p ¯ } at vertex v p . This strategy, moreover, can be represented by a Boolean circuit C f v p for the Boolean function β φ : { 0 , 1 } k { 0 , 1 } expressed by the formula φ . We may assume that the circuit C f v p is of size at most polynomial in the size of the choice equation p : φ ( q 1 , , q k ) and that it has exactly k inputs and one output. We may furthermore assume that the circuit C f v p can be obtained in time polynomial in the size of φ from p : φ ( q 1 , , q k ) .
Similar considerations apply to the representation of the players’ utility functions. In partial order Boolean games, the preferences of each player i are represented by a goal formula γ i in the propositional language over Φ . Every action profile a = ( a v p 1 , , a v p | Φ | ) of a Boolean partial order game defines a valuation α a : Φ { 0 , 1 } and yields player i utility 1 if α a satisfies γ i , and utility 0 otherwise. The Boolean function β γ i : { 0 , 1 } | Φ | { 0 , 1 } expressed by γ i can be represented by a Boolean circuit C γ i with at most | Φ | inputs and exactly one output. As at every vertex v p , the assigned player has exactly two actions at their disposal and utility values are integers, we find that C γ i is exactly of the form suggested for the representation of players’ utilities in partial order games in the previous section. Moreover, C γ i will be of size at most exponential in | Φ | and of size polynomial in γ i . We may furthermore assume that the circuit C f γ i can be obtained from γ i in polynomial time.

4.2. Multi-Agent Influence Diagrams (MAIDs)

Multi-agent Influence Diagrams (MAIDS) were proposed in [2,3] and later extended from a game-theoretic perspective in [16]. MAIDs trace their origins to Bayesian Networks (cf., [17]) and influence diagrams, a decision theoretic extension of Bayesian networks proposed by Howard and Matheson [18].
Like partial order games, MAIDs involve a finite set A = { a 1 , , a n } of agents and are defined in directed acyclic graphs, where three different types of vertices are distinguished: a set of chance nodes  X , a set of decision nodes  D a for each agent a in A , and a set of utility nodes  U a also for each agent a. In line with the Bayesian network literature, every vertex is taken to be a random variable X with possible values in a finite domain dom ( X ) . It is generally assumed that utility nodes cannot be parents of other nodes, and that their domain is a finite set of real numbers. (Hammond et al. [16] relax this condition, and also allow utility nodes to have outgoing edges.)
The values any variable in a MAID can take may depend on the values assumed by that variable’s parents in the DAG. This is very much in line with how a choice of action at a node may depend on the decisions taken at parent nodes in partial order games. In the MAIDs framework, the parents of a decision node X are denoted by Pa ( X ) and an instantiation, commonly denoted by pa , for Pa ( X ) = { Y 1 , Y k } defines a value from the domain of each of these variables, that is, pa dom ( Y 1 ) × × dom ( Y k ) . A conditional probabilistic distributions (CPD) for a decision or utility node X now assigns a probability P ( X | pa ) for each instantiation pa of Pa ( X ) , and a decision rule defines a conditional probabilistic distribution for one particular decision variable. Sometimes it is required that CPDs for utility nodes be deterministic, arguing that all of the stochasticity should be subsumed into chance variables. This convention, however, is not universally adopted in the literature on MAIDS. We refer to the work in [19] for a good in-depth study. On this basis, a strategy  δ a for an agent a defines a conditional probability distribution for each decision variable D in D a . A strategy profile is then a tuple ( δ a 1 , , δ a n ) of strategies, one for each agent. Given a strategy profile ( δ a 1 , , δ a n ) , the MAID reduces to a Bayesian network, and as such defines a joint probability distribution P δ a 1 , , δ a n over all of its variables. Accordingly, every agent a can be assigned a expected utility defined as follows:
E a ( δ a 1 , , δ a n ) = U U a x dom ( U ) P δ a 1 , , δ a n ( U = x ) · x .
With players, their strategies, and their utilities being defined, a MAID defines a strategic-form game, and is amenable to analysis by the usual game-theoretic solution concepts.
It is not hard to see how every partial order game can be seen as a MAID without chance variables, with ‘hidden’ utility nodes, and only allowing for deterministic decision rules. Formally, let N = { 1 , , n } be the set of players; V 1 , , V n be the sets of nodes assigned to the players 1 through n, respectively; ( V , E ) be a directed acyclic graph with V = V 1 V n ; and u 1 , , u n be utility functions for the players 1 through n. Then, the partial order game defined from this can be seen as a MAID with set of agents A = N and decision variables V 1 , , V n such that dom ( V v ) = A v for all vertices v in V. There are no chance variables, and one utility variable U i for each player i that is a child of all decision nodes. The MAID’s partial order is then given by ( V , E ) where V = V { U 1 , , U n } and E = E V × { U 1 , , U n } . Thus, the instantiations of the parents of each utility variable, the set of instantiations of all decision variables, corresponds to the set of all action profiles of the partial order game. Each of these instantiations ( a v 1 , . . . a v | V | ) , in turn, corresponds to a deterministic strategy profile ( δ 1 , . . . , δ n ) in the MAID, which allows us to complete the model by setting E i ( ( δ 1 , . . . , δ n ) ) = u i ( ( a v 1 , . . . a v | V | ) ) .

4.3. True Concurrency and Games as Event Structures

Event structures [20] are the so-called “true concurrency” analogue of trees: just as transition systems unfold to trees, so some models of true concurrency, such as Petri nets and asynchronous transition systems, unfold to event structures [21]. Similarly, in the same way that sequential games can be represented by trees, certain concurrent games can be represented by event structures—where plays in this much more general setting determine partial instead of total orders of moves; cf., see in [22,23].
An event structure is a triple ( E , , Con ) , consisting of a set E of events that are partially ordered by ≤, the causal dependency relation, and a nonempty consistency relation Con over finite subsets of E, which satisfy four conditions:
1.
{ e : e e is finite for all e E } ,
2.
{ e } Con for all e E ,
3.
Y X Con implies Y Con , and
4.
X Con and e e X implies X { e } Con .
The states of an event structures E are called configurations, denoted by C ( E ) , and consist of those subsets x E which are both
  • consistent: X x . X is finite implies X Con ; and
  • down-closed: e , e . e e x implies e x .
Configurations can be finite or infinite. Concurrency, then, is naturally modelled as follows: two events, say e and e , which are both consistent and incomparable with respect to causal dependency are seen as concurrent, and assumed to be independent in that they can happen in parallel. In a game-theoretic context, it also means that such two events can be played or executed in parallel.
In the context of concurrent games as event structures, we consider only two players, named Player (the system) and Opponent (the environment), who own disjoint sets of events which they can play (execute). They do so asynchronously in an attempt to reach a state in their “winning set” of configurations. In these games, players are allowed to execute an event, say e, only if all events on which e causally depends have been executed. In this setting, events do not have a Boolean or otherwise interpretation; they are simply available actions that a player can execute—and informally are intended to represent observable events in a computer system. The Nash equilibrium of concurrent games as event structures have been studied in the past, and fully characterised in [23] for two-player general-sum games with players’ goals given by Borel sets of winning configurations. However, their main application is in the field of formal semantics for programming languages and logical systems; cf, see in [22,24,25].

4.4. Extensive-Form Games

Games in extensive-form are the canonical game-theoretic model to account for the strategic interactions that result if decisions are made in a prescribed order. Their use in the game-theoretic literature is ubiquitous. In this section, we show that partial order games constitute a concisely represented class of extensive-form games.

4.4.1. Partial Order Games as Extensive Games of Imperfect Information

Given a topological sorting of the decision nodes, there is a natural transformation of partial order games to extensive form games of imperfect information. Let the DAG of the partial order game be given by ( V , E ) , where V = { v 1 , , v | V | } , and the utility function for each player i by u i : A v 1 × × A v | V | R . First, assume a topological order τ to linearise the DAG, which, for ease of presentation, we assume to be τ = ( v 1 , , v | V | ) . Figure 6 illustrates this construction for the partial order game in Figure 3 under topological order τ .
The extensive-form game of imperfect information representing the partial order game is then defined for the same set N of players. We define the game tree ( V , E ) such that V is the set of all prefixes of terminal histories in A v 1 × × A v | V | , with ( ) being the root node. The player active at each history/vertex h v k = ( a v 1 , , a v k ) in V , we identify with the player that is active at v k + 1 in the partial order game. At each of these vertices/histories this player has A v k + 1 as the set of actions to chose from. In particular, i v 1 is the player active at ( ) and has A v 1 as action set. Accordingly, all prefixes/histories of the same length are assigned to the same player and each of them have the same action set. For v , w V , we then have v E w if and only if w = v a for some a A v , as expected.
Every leaf/terminal history h = ( a v 1 , , a v | V | ) and player i we associate with the utility value u i h = ( a v 1 , , a v | V | ) , where u i denotes player i’s utility function in the partial order game. Note that in our construction, every terminal history corresponds with a full action profile.
The definition of the information sets is crucial. For every internal vertex/history h = ( a v 1 , , a v k ) , we formally define these such that
[ ( a v 1 , , a v k ) ] = { ( a v 1 , , a v k ) : a y = a y for all parents y of v k + 1 } .
Intuitively, a player can only distinguish a vertex/history h = ( a v 1 , , a v k ) where it is active from those vertices/histories of the same length that differ on the choice of action in at least one if its parents in the partial order game. All other vertices/histories of the same length belong to the same information set. For an example, consider Figure 6, where vertices v 4 2 = a c f and v 4 5 = b c e are in the same information set for player 1. This is because, in the underlying partial order game (depicted in Figure 3), vertex v 2 is the only parent of v 4 and the histories a c f and b c e both specify action c for v 2 . Finally, we define the set of player i’s information sets as
I i = { [ v ] : v V and i is active at v } .
To see that the extensive-form game defined through this transformation is strategically equivalent to the original partial order game, first consider an arbitrary vertex v j along with its parents Y in the partial order game. Together with v j itself, each profile a Y in A Y defines a unique information set for i defined as
I ( v j , a Y ) = { a A v 1 × × A v j 1 : a Y = a Y } .
Let θ be the function that maps each pair ( v j , a Y ) to information set I ( v j , a Y ) .
Observe that defined thus, θ is both injective and surjective. We now find that we can extend θ to a function θ ^ that maps each conditional strategy f i = ( f v i 1 , , f v i k ) in the partial order game to a strategy f ^ i : I i A i for player i in the extensive-form game. To do so, define f ^ i such that for each each information set I I i with θ 1 ( I ) = ( v j , a Y )
f ^ i ( I ) = f v j ( a Y ) .
Observe that the extended function θ ^ is also bijective. Moreover, some reflection reveals that the full action profile a f = ( a v 1 f , , a v | V | f ) determined by conditional strategy profile f = ( f 1 , , f | N | ) in the partial order game will be identical to the one determined by the strategy profile f ^ = ( f ^ 1 , , f ^ | N | ) in the extensive-form game. Having defined the players’ utilities in the extensive-form game as we did, we may conclude that the partial order game is strategically equivalent to the extensive game of imperfect information. Formally:
Proposition 1.
A partial order game and the extensive-form game obtained from it on basis of a given topological sorting, as described in this section, are strategically equivalent.
This result means that, with respect to game theoretic analysis, partial order games are in a precise technical sense (i.e., with respect to strategic equivalence), no more expressive than extensive form games: any scenario we can model with a partial order game can also be modelled with an extensive form game. However, this does not mean that partial order games are redundant. The translation from partial order game to extensive form game comes at the expense of a blow-up in the size of the game: if | A v | 2 for every vertex v, then V = Ω ( 2 | V | ) . The practical upshot of this is that there are situations that we can capture using the partial order model that would be infeasible to capture with the extensive form. In addition, we argue that the partial order representation can in some cases be much more comprehensible than the extensive form: compare the partial order game in Figure 3 with its extensive form representation in Figure 6. Understanding what is going on (and in particular, the informational dependencies present) in the latter representation requires much more work than in the former.
Note that, for total-order games, the construction yields (finite) extensive-form games of perfect information (see Figure 7 and Figure 8 below). This can easily be appreciated by recalling that a total order game allows for only one topological order, say ( v 1 , , v V | V | ) . Thus, for every 1 j | V | we have that the parents of v j are all vertices in Y = { v 1 , , v j 1 } . However, then, obviously, I ( v j , a Y ) = { a A v 1 × × A v j 1 : a Y = a Y } will be a singleton for every action profile a Y , and the resulting game one of perfect information. As finite extensive-form games of perfect information can be solved by backwards induction (cf. e.g., in [4], Chapter 7), it follows as a corollary that total-order games always admit Nash equilibria in general.
Proposition 2.
Total-order games always have at least one Nash equilibrium.
We conclude this section with a remark about topological sortings. The DAG of a partial order game may not have a unique topological sorting over its nodes and, under the transformation described above, different topological sortings may lead to different extensive-form games. This is illustrated by Figure 6 and Figure 9, which both depict extensive-form games obtained from the partial order game in our first example in Figure 3. These extensive-form games we obtain using our transformation for the topological sortings ( v 1 , v 2 , v 3 , v 4 ) and ( v 1 , v 2 , v 4 , v 3 ) , respectively, and are clearly distinct. Yet, they are strategically equivalent. A formal proof that any two extensive-form games that are obtained using our transformations for different topological orders will be strategically equivalent is beyond the scope of this paper. It suffices to say that it is due to the so-called Interchange of Moves principle, which is one of the four Thompson transformations [26] and informally captures the idea that in extensive-form games of imperfect information “the order of play is immaterial if one player does not have any information about the other player’s action when making his choice” (see in [5], page 224).

4.4.2. Partial Order Games as a Concise Representation of Extensive-Form Games

As we noted above, the transformation of partial order games to extensive-form games presented in the previous section may give rise to an exponential blow-up. Reasoning conversely, this raises the claim that partial order games could be seen as providing a concise representation of extensive form games. In this section, we argue that if the utilities in partial order games are represented by Boolean circuits, then this claim is supported.
In Section 2.3, we recalled the well-known fact that transforming an extensive-form game to a normal-form game leads to an exponential blow-up. One reason for this is that, in extensive-form games, a player’s utilities are represented by an association of utility values to leaf nodes/terminal histories. One leaf node, however, may be reached by playing different strategy profiles, or even different action profiles, whereas in normal-form games utilities are specified for each strategy/action profile separately.
In partial order games, the players’ utilities are also modelled as an association of action profiles (not strategy profiles) and utility values. We find, however, that an exponential blow-up can be avoided when the players’ utility functions are represented by Boolean circuits.
The main idea behind this can be explained by means of an example. Consider again the extensive form in Figure 1, where each action has already been labelled with its local index in binary. For each internal vertex/non-terminal history v with action set A v , we introduce log 2 | A v | Boolean variables x 1 v , , x log 2 | A v | v . In the example, we introduced variables x for v 0 , y and y for v 1 , z for v 2 , and x for v 3 . A truth-value assignment α to these variables for all vertices then defines an action profile a α = ( a v 1 α , , a v | V | α ) such that a j α is the action in A v with numerical index num ( α ( x 1 ) , , α ( x log 2 | A v | ) ) , for each vertex v. In our example, for instance, the assignment α * that sets x and y to false and y, z, and x to true, corresponds to the action profile ( a 0 , b 10 , c 1 , d 1 ) . With all utilities in our example being either 0 or 1, we can now associate a Boolean formula φ u i that characterises each player’s utility function, in the sense that assignment α satisfies φ u i if and only if u i ( a α ) = 1 . In our example, such a formula φ u 1 representing player 1’s binary utility function could then for instance be obtained by first defining recursively a formula φ v for each vertex v as follows. First, for each leaf node/terminal history h , we set φ h = , if u i h = 1, and φ h = , if ui=0 u i h = 0. In our example, we thus have, for instance, φ a 0 b 01 = and φ a 1 c 1 d 1 = . For every internal vertex/non-terminal history v, furthermore, we then set
φ v = a A v ( χ a φ v a ) ,
where χ a = x ~ 1 v x ~ log 2 | A v | c characterises the local index k of a, that is x ~ j v = x j v , if b j = 1 , and x ~ j v = ¬ x ~ j c if b j = 0 , where bin ( k ) = b 1 b log 2 | A v | . Then, set φ i = φ ( ) . Thus, in our example we get, subsequently,
φ v 3 = ( x ¯ ) ( x ) φ v 2 = ( z ¯ ) ( z φ v 3 ) φ v 1 = ( y ¯ y ¯ ) ( y ¯ y ) ( y y ¯ ) ( y y ) φ v 0 = ( x ¯ φ v 1 ) ( x φ v 2 )
Finally, we obtain φ u 1 = φ v 0 . in our example, we now find, for instance, that assignment α * does not satisfy φ u i and that u i ( a α * ) = 0 .
This procedure is general and can be applied to every extensive-form game. The crucial thing to observe is that each variable x occurs at most 2 log 2 | A v | = O ( | A v | ) times in φ u i . Therefore, the size of φ u i is still polynomial in the size of game and, thus, there is a polynomial-sized Boolean circuit C u i representing the Boolean function β φ u i . Finally, any rational-valued utility function u i : A v 1 × × A v | V | Q can be represented by a linear combination of such circuits, and we may conclude that u i can be represented by a Boolean circuit whose size is polynomial in the size of the extensive-form game, giving us the following lemma.
Lemma 3.
Let u i : S 1 × × S | N | Q be the utility function of a player i in an extensive-form game. Then, u i can be represented by a Boolean circuit that is of a size polynomial in the size of the extensive-form game.
Recall that an extensive-form game and a partial order game are said to be strategically equivalent if they give rise to strategically equivalent normal-form games. We now find that, provided that utilities may be represented by Boolean circuits, every extensive-form game can be represented by a strategically equivalent partial order game without giving rise to more than an at most a polynomial blowup.
Proposition 3.
For every extensive-form game, there is a strategically equivalent partial order game whose size is at most polynomially larger than that of the extensive-form game.
Sketch of Proof.
Let an extensive form for players in a set N be based on a tree ( V , E ) . Let I be the information sets of this game. We construct a partial order game on the trivial DAG ( I , Ø ) , which is obviously polynomial in the size of the extensive-form game. We associate with each information set I in I the same player and the same action set A I as in the extensive-form game. For every player i with I i = { I 1 , , I k } , we can define a bijection that maps every strategy s i : I i A i for i in the extensive-form game to a strategy f ^ i = ( f ^ I 1 , , f ^ I k ) for player i in the partial order game such that f ^ I j is the strategy that maps the empty sequence ( ) to action s i ( I j ) . By virtue of Lemma 3 we may assume the utility function of each player in the extensive-form game can be represented by a polynomially sized Boolean circuit. Accordingly, the the size of the whole partial order game constructed thus is polynomial in the size of the extensive-form game. Conclude by observing that the extensive-form game and the partial order game induce strategically equivalent normal-form games, as desired. □
Proposition 3 shows that every extensive-form game of perfect-information can be represented by a partial order game at the cost of an at most polynomial blow-up. In some cases, moreover, we find that partial order games may be exponentially smaller than any extensive-form game of perfect information that is strategically equivalent. Consider the following extensive-form game with n players, ordered from 1 to n. Every player has two actions, 0 and 1, at their disposal, but can make their choice dependent on the players that occur before them in the ordering. Each strategy profile s gives rise to a unique terminal history h s = ( b 1 , , b n ) of actions in { 0 , 1 } , where each player i plays b i . Accordingly, there are exactly | { 0 , 1 } | n = 2 n terminal histories or leaves in this extensive-form game. Let the preferences for each player i at each of these terminal histories h = ( b 1 , , b n ) be given by the utility function such that u i h = num ( b 1 , , b n ) , where num ( b 1 , , b n ) is the numerical value of the sequence b 1 , , b n ) conceived as an integer in binary. For instance, num ( 01101 ) = 13 . Then, for each player, every terminal history or leaf yields a unique utility value. See Figure 7 for an illustration of this game for n = 5. To represent this game, an extensive-form game needs to have at least 2n leaf nodes to account for all the different payoffs a player may get, and thus is of size at least exponential in the number of players.
By contrast, one can represent this game as a partial order game on a transitive graph ( V , E ) , where V = { v 1 , , v n } and v i E v j if and only if i < j. Let player i be assigned to vertex vi and A v i = { 0 , 1 } for all 1 i n . For each player i, the utility function u i ( b 1 , , b n ) = num ( b 1 , , b n ) can clearly be represented by a Boolean circuit with V input variables, and as many output variables, with the mth input gate immediately leading to the mth output gate with no intermediate logic gates. See Figure 8 for an illustration of the partial order game representing the extensive-form game in Figure 7. Therefore, we have the following result establishing that partial order games can be seen as presenting a concise representation of extensive-form games. Moreover, this observation still holds if attention is restricted to games of perfect information.
Proposition 4.
There exist partial order games for which every strategically equivalent representation as an extensive-form game of imperfect information is at least exponentially as large.

5. Nash Equilibria

We saw in Section 2.2 how every partial order game defines a normal-form game. Accordingly, partial order games are amenable to game-theoretic analysis using the standard non-cooperative solution concepts that are available for normal-form games. In this section, we consider several complexity problems surrounding Nash equilibrium in partial order games.
We have the following lemma that will be useful for proving the complexity results in this section. It states that, if a player has a profitable deviation to a conditional strategy, then they also have a profitable deviation to an unconditional strategy. In other words, if a player has a best response, then they also have an unconditional best response (This is reminiscent of mixed strategies for (finite) normal-form games, where, due to the linearity of expected utility, a player having a best response implies their having a pure best response).
Lemma 4.
Let f = ( f 1 , , f n ) be a profile of conditional strategies. Then, if there is some strategy g i for player i with u i ( f i , g i ) > u i ( f ) , there also is an unconditional strategy g i such that u i ( f i , g i ) > u i ( f ) . Therefore, f = ( f v 1 , , f v | V | ) is a Nash equilibrium if and only if u i ( f ) u i ( f i , g i ) for all players i and unconditional strategies g for i.
Proof. 
Assume that there is some strategy g i for player i with u i ( f i , g i ) > u i ( f ) . Let v i 1 , , v i k be the vertices assigned to player i. Moreover, let a ( f i , g i ) = ( a v 1 * , , a v | V | * ) . Now define g i = ( g v i 1 , , g v i k ) such that for every 1 j k , and every profile ( a u 1 , , a u ) , where u 1 , , u are the parents of v i j ,
g v i j ( a u 1 , , a u ) = g v i j ( a u 1 * , , a u * ) .
Thus, g i is clearly an unconditional strategy for player i. Let a ( f i , g i ) = ( a v 1 * * , , a v | V | * * ) . By a straightforward induction on depth ( w ) it can then easily be shown that a w * * = a w * for every vertex w. And thus, u i ( f i , g i ) > u i ( f ) , giving us the result. □
The key point about this lemma is that unconditional strategies are small, in the sense that unconditional strategies can be represented by Boolean circuits whose size is polynomial in the set of actions (see Section 3.1, above). Thus, when we are considering whether a player has a beneficial deviation, we can without loss of generality restrict our attention to small strategies. This has implications for the complexity of the decision problems we consider.
The dependency graph provides a concise representation of the sets of conditional strategies available to the players. A single strategy f v for the player i playing at vertex v, however, has to take into account all profiles ( a y 1 , , a y k ) in A y 1 × × A y k , where y 1 , , y k are the parents of v. The number of these profiles tends to be exponential in k, the number of parents, as in all non-trivial cases generally | A v | 2 . Similarly, a naive representation of the player’s utilities for the different action profiles tends to be exponential in the number of actions available to the players in the game. However, as the utilities of players are represented by Boolean circuits, there is a straightforward polynomial transformation of Boolean games and Boolean partial order games to general partial order games. This enables us to leverage hardness results for Boolean and partial order Boolean games to obtain hardness results for general partial order games.
We first consider the decision problem of determining whether a given strategy profile is a Nash equilibrium for a given partial order game. Here, we assume that the utility function u i of each player is represented by a Boolean circuit C u i and the strategy profile f = ( f v 1 , , f v | V | ) is represented by a sequence of | V | Boolean circuits C f v 1 , , C f v | V | .
is-nash
Given: Partial order game G and strategy profile f = ( f v 1 , , f v | V | )
Problem: Is  f a Nash equilibrium of G?
We find that is-nash is intractable for partial order games.
Theorem 1.
is-nashis coNP -complete. The problem remains coNP -hard for empty-order games and total-order games.
Proof. 
For membership in coNP , let partial order game G and conditional strategy profile f = ( f v 1 , , f v | V | ) be given. We can guess a player i and a strategy g i = ( g v 1 , , g v m ) , where v 1 , , v m are the vertices assigned to i. By virtue of Lemma 4, we may assume that strategies g v 1 , , g v m are all unconditional strategies. As we saw in Section 3.1, the Boolean circuits C g v 1 , , C g v m can therefore each be assumed to be of size polynomial in | A | . Now, Lemma 2 allows us to find in polynomial time the utilities u i ( f i , g i ) and u i ( f i , f i ) . Then, f = ( f v 1 , , f v | V | ) is not a Nash equilibrium if and only if u i ( f i , g i ) > u i ( f i , f i ) , and so we may conclude that is-nash is in coNP .
For coNP -hardness, we reduce from is-nash for Boolean partial order games, which is known to be coNP -hard (see in [1], Proposition 1). Thus, let an instance of is-nash for Boolean partial order games be given by a Boolean game ( N , Φ , Φ 1 , , Φ | N | , γ 1 , , γ | N | ) , a dependency graph ( Φ , E ) , and a strategy profile f = ( f 1 , , f | N | ) , where for each player i strategy f i is given by a sequence of choice equations p 1 : φ 1 through p m : φ m , where Φ i = { p 1 , , p m } . In Section 4.1, we argued how this Boolean partial order game coincides with a partial order game with a slightly different representation of strategies and utilities. A similar remark concerns the strategy profile f = ( f 1 , , f | N | ) . As we found that the transformation of the representation of the game as a Boolean partial order game to the representation of the same game as a partial order game can be effected in polynomial time, we obtain our result. Moreover, because Boolean games are a special type of empty-order games, the problem remains hard for empty-order games as well.
To see that is nash also remains coNP -hard for total-order games, we adapt the proof for the is nash problem for Boolean games as presented in (Wooldridge et al. [15], Proposition 1). We reduce from the complement of satisfiability, the problem of determining if a given Boolean formula φ is satisfiable. To this end, let φ a Boolean formula over propositional variables p 1 , , p m . We construct a total-order Boolean game with one player i controlling p 1 , , p m along with an additional variable p 0 . The dependency graph ( V , E ) is then given by V = { v p 0 , , v p m } with relation E such that p j E p k if and only if j < k . Player i has as goal γ i = φ p 0 . Now, consider the strategy profile f 0 = ( f v p 0 0 , f v p 1 0 , , f v p m 0 ) that sets all variables to false, that is, f 0 is given by choice equations p j : for all 0 j m . Now observe that, as we are dealing with a one-player game, there is a natural surjection that maps each of player i’s strategies f to valuation α a f : { p 0 , , p m } { 0 , 1 } . It can then easily be appreciated that f 0 is a Nash equilibrium in the game constructed if and only if φ is satisfiable, as desired. □
Another canonical problem is more general, in that it asks whether a partial order game has any Nash equilibria at all, as opposed to whether or not a specific strategy profile is a Nash equilibrium.
non-emptiness
Given: Partial order game G
Problem: Does G have a Nash equilibrium?
In view of Proposition 2, non-emptiness is vacuous for total-order games, as in this class of games Nash equilibria are guaranteed to exist. The problem is considerably more difficult, namely NEXPTIME -complete, for general partial order games. It also seems a fair conclusion to draw from this contrast that increase in computational complexity arises from the structure of the dependency graph.
Theorem 2.
non-emptinessis NEXPTIME -complete.
Proof. 
A NEXPTIME algorithm to decide non-emptiness can be designed along the following lines. First, guess a strategy profile f = ( f v 1 , , f v | V | ) . Given that strategies f v are given by a Boolean circuit C f v , this can be achieved in time not more than exponential in | A | , the size of the set of actions. Second, to decide whether f = ( f v 1 , , f v | V | ) is a Nash equilibrium, by virtue of Lemma 4, it suffices to check, for all players i and all unconditional strategies g i = ( g v 1 , , g v | V i | ) for i, that u i ( f i , f i ) u i ( f i , g i ) . For player i, there are | A v 1 × × A v | V i | | unconditional strategies which is upper bounded by | A | | V | . Moreover, as we have seen in Section 3.1, each of these unconditional strategies can be represented by a sequence of Boolean circuits of size polynomial in | A | . Furthermore, the action profiles a ( f i , f i ) and a ( f i , g i ) can be computed in time exponential in | A | , by virtue of Lemma 1 and each circuit involved being at most exponential in the size of | A | . Finally, on the basis of Lemma 2, we can check in time polynomial in | A | the utilities for player i on action profiles a ( f i , f i ) and a ( f i , g i ) , and therefore also whether u i ( f i , f i ) u i ( f i , g i ) . Altogether, the algorithm runs in non-deterministic exponential time.
A proof of NEXPTIME -hardness of non-emptiness can be achieved by a reduction from the non-emptiness problem for Boolean partial order games. We rely here on the same direct reduction as in the proof of Theorem 1, above. NEXPTIME -hardness for Boolean partial order games was established by [1], giving us the result. □
If we restrict attention to empty-order games, however, non-emptiness has considerably lower computational costs, even though the problem still remains Σ 2 p -hard. Key to this result is the observation that for empty-order games all strategies are unconditional, and, thus, they can be represented by Boolean circuits of polynomial size (see Section 3.1).
Theorem 3.
For empty-order games,non-emptinessis Σ 2 p -complete.
Proof. 
To see that non-emptiness is in Σ 2 p , recall that Σ 2 p = NP coNP , that is, the set of problems that can be solved in polynomial time on a non-deterministic Turing machine with a coNP -oracle. Furthermore, recall that strategy profiles in empty-order games are unconditional and are of the form f v : { ( ) } A v . Moreover, they can be represented by Boolean circuits of polynomial size (see Section 3.1, above). Accordingly, given an empty-order game, we can guess an unconditional strategy profile f = ( f v 1 , , f v | V | ) , and consult the coNP -oracle to check whether f is a Nash equilibrium. Theorem 1 guarantees that the latter is feasible.
For Σ 2 p -hardness, recall that Boolean games constitute a subclass of empty-order partial order games. Σ 2 p -hardness then follows immediately from non-emptiness being Σ 2 p -hard for Boolean games (see in [13], Proposition 5). □
Recall that in partial order games, strategy profiles and action profiles are essentially different objects. As a natural counterpart to the is nash problem, we therefore now consider the decision problem whether a given action profile is sustained by a Nash equilibrium in a partial order game.
is nash actions
Given: Partial order game G and action profile a = ( a v 1 , , a v | V | )
Problem: Is a sustained by a Nash equilibrium?
In sharp contrast to coNP -completeness of is nash, we find that is nash actions is NEXPTIME -complete. From the perspective of computational complexity, is nash actions appears to be more kindred to the non-emptiness problem for partial order games. In this connection, it is worth observing that the proof of Bradfield et al. regarding the NEXPTIME -completeness of non-emptiness for Boolean partial order games in [1] relied on a reduction from dependency quantifier boolean formula game, which is defined as follows.
An instance of dqbfg is a tuple φ , X 1 , X 2 , Y 1 , Y 2 , where φ is a Boolean formula φ and { X 1 , X 2 , Y 1 , Y 2 } a partition of the propositional variables over which φ is defined. dqbfg then concerns the following game with three players—B (‘Black’), W 1 (‘White 1’), and W 2 (‘White 2’)—where B forms one team and W 1 and W 2 another team W. Player B chooses an assignment for the variables in X 1 X 2 = { x 1 1 , , x | X 1 | 1 } { x 1 2 , , x | X 2 | 2 } , player W 1 for those in Y 1 = { y 1 1 , , y | Y 1 | 1 } , and player W 2 for those in Y 2 = { y 1 2 , , y | Y 2 | 2 } . Player B chooses first, then W 1 and W 2 choose, on the understanding that W 1 can only see the assignment B chooses for W 1 and W 2 only the assignment B chooses for Y 2 . Team B aims to make φ true, whereas team W’s goal is to make φ false. If the overall assignment for X 1 X 2 Y 1 Y 2 satisfies φ , then B wins, otherwise W. A positive instance of dqbfg is when team W has a winning strategy. dependency quantifier boolean formula game was shown to be NEXPTIME -complete by Hearn and Demaine in [27].
Theorem 4.
is nash actionsis NEXPTIME -complete.
Proof. 
For membership in NEXPTIME , let a = ( a v 1 , , a v | V | ) be the action profile that is given as input. Recall from the proof of Theorem 2 that we can guess a strategy profile f and check whether it is a Nash equilibrium in exponential time, provided that f is represented by a Boolean circuit. By Lemma 1 we know we can additionally compute a f in polynomial time and check whether a f = a . As the latter can also be achieved in polynomial time, we obtain our result.
For NEXPTIME -hardness, we reduce from dependency quantifier boolean formula game (dqbfg). Given an instance φ , X 1 , X 2 , Y 1 , Y 2 of dqbfg we construct a partial order Boolean game with the same three players, with B controlling the variables in X 1 X 2 { x 0 } , player W 1 those in Y 1 , and player W 2 those in Y 2 , where x 0 is a “fresh” variable not in X 1 X 2 Y 1 Y 2 . Let the players’ goals be given by
γ B = x 0 φ γ W 1 = γ W 2 =
The dependency graph ( X 1 X 2 Y 1 Y 2 { x 0 } , E ) is defined such that
(i)
x 1   E   y 1 , for all x 1 X 1 and y 1 Y 1 ,
(ii)
x 2   E   y 2 , for all x 2 X 2 and y 2 Y 2 ,
(iii)
x 0   E   z , for all z X 1 X 2 Y 1 Y 2 .
Now, consider the action profile
a ¯ = ( x ¯ 0 , x ¯ 1 1 , x ¯ | X 1 | 1 , x ¯ 1 2 , x ¯ | X 2 | 2 , y ¯ 1 2 , y ¯ | Y 1 | 1 , y ¯ 1 2 , y ¯ | Y 2 | 2 ) ,
which sets all variables, including x 0 , to false. Note that none of the three players win if a ¯ is played. We are now in a position to prove that φ , X 1 , X 2 , Y 1 , Y 2 is a positive instance of dqbfg if and only if a ¯ is sustained by a Nash equilibrium in the partial order Boolean game constructed.
First assume that φ , X 1 , X 2 , Y 1 , Y 2 is positive instance of dqbfg. Then, the white team has a joint winning strategy in the original dqbfg-game given by Boolean functions β 1 and β 2 on the variables X 1 and X 2 , respectively. Then, define strategies f W 1 * and f W 2 * for W 1 and W 2 , respectively, in the Boolean partial order game, that are given by the following choice equations for i = 1 , 2 and j = 1 , , | Y i | :
y j i : ( x 0 φ β i ( x 1 i , , x | X i | i ) ) ( x ¯ 0 )
As the dependency graph ( V , E ) respects the information dependencies of the dqbfg-instance, observe that f W 1 and f W 2 together embody a winning strategy to render φ false, if x 0 is played. Let B’s unconditional strategy f B * be defined by the choice equations of the form x j i : , setting x j i to false for all for i = 1 , 2 and j = 1 , , | X i | . Observe that, defined thus, a f * = a ¯ . Also note that f * = ( f B * , f W 1 * , f W 2 * ) is a Nash equilibrium. As x 0 will be set to false, all players lose when f * is played, but none has an incentive to deviate either. Player B could only hope to win by setting at least x 0 to true. If so, however, W 1 and W 2 are playing a winning strategy against B, dashing all the latter’s hopes to win after all. Players W 1 and W 2 will lose, no matter which strategy they adopt. As a consequence, they do not want to deviate either, and we may conclude that f * is a Nash equilibrium sustaining action profile a ¯ .
Finally, assume that φ , X 1 , X 2 , Y 1 , Y 2 is a negative instance of dqbfg. Then, team W does not have a winning strategy, meaning that for every strategy f W = ( f W 1 , f W 2 ) , player B has a best response f B such that B wins under f = ( f W 1 , f W 2 , f B ) . Now, consider the partial order Boolean game and an arbitrary strategy profile f ¯ with a f ¯ = a ¯ . Some reflection reveals that under these circumstances, player B has a strategy f B at their disposal setting x 0 to true and incorporating a winning response to f ¯ W = ( f ¯ W 1 , f ¯ W 2 ) . Therefore, player B has an incentive to unilaterally deviate from f ¯ and play f B instead. We may conclude that f ¯ is not a Nash equilibrium, as desired. □
For the two extremal classes of empty-order games and total order games, is nash actions are less computationally demanding. More precisely, the problem is coNP -complete for empty-order games and PSPACE -complete for total-order games. The proof of the former statement is relatively straightforward, when one realises that in empty-order games all strategies are unconditional and that, consequently, there is a natural bijection between action profiles and strategy profiles.
Theorem 5.
For empty-order games,is nash actionsis coNP -complete.
Proof. 
First recall that, in empty-order games, there is a natural bijection between action profiles and strategy profiles, mapping each action profile a = ( a v 1 , , a v | V | ) to strategy profile f a = ( f v 1 a v 1 , , f v | V | a v | V | ) , where f v a v ( ( ) ) = a v . Thus, all strategy profiles of an empty-order game are of this form. Moreover, for arbitrary action profiles a = ( a v 1 , , a v | V | ) and strategy profiles f = ( f v 1 , , f v | V | ) , we have that
a = a f if and only if f v a v = a v , for all v V if and only if f = f a
Moreover, recall that unconditional strategies are represented by polynomial-sized Boolean circuits. Theorem 1 established that is nash is coNP -complete for empty-order games. Altogether, it suffices to show that, for all action profiles a = ( a v 1 , , a v | V | ) of a empty-order game,
a is sustained by a Nash equilibrium if and only if f a is a Nash equilibrium .
The “if”-direction is immediate. If f a is a Nash equilibrium, then, by observing that a f a = a , action profile a is sustained by some Nash equilibrium, namely f a . For the “only if”-direction, assume there is some strategy profile f * with a = a f * . By our earlier observation we find that f * = f a . Hence, f a is a Nash equilibrium, as desired. □
The proof of PSPACE -completeness of is nash actions for total-order games is considerably more involved, and we defer it to the very end of Section 7.2, where the necessary proof elements are in place. Still, we conclude this section with its statement.
Theorem 6.
For total-order games,is nash actionsis PSPACE-complete.

6. Backwards Induction

Backwards induction is the most fundamental technique for the analysis of extensive form games. The basic backward induction algorithm for extensive form games of perfect information runs in time polynomial in the size of the game tree, and computes Nash equilibrium strategy profiles (which are guaranteed to exist in games of perfect information). It is therefore very natural to ask whether approaches based on backward induction might work for partial order games. However, backward induction is not applicable in extensive form games of imperfect information, and as partial order games correspond to imperfect information games, it follows that the technique is not always applicable. This section introduces a backward induction procedure to find pure Nash equilibria in partial order games. Games in which the procedure is well defined—in the sense that the procedure produces at least one strategy profile—we denote as being fit for backwards induction. We relate this latter concept to an informational notion we refer to as scrutability. For games that are fit for backwards induction, we prove that our backwards induction procedure is guaranteed to produce a Nash equilibrium outcome.

6.1. Backwards Induction

Due to their acyclic nature, it would seem that a natural concept of backward induction procedure can straightforwardly be defined for partial order games. Our aim is thus to define a strategy profile f * = ( f v 1 * , , f v | V | * ) that is obtained in the following fashion. One starts with a vertex v of maximal depth with set of parents Y = { y 1 , , y k } , and one inspects the possible actions—rather than their strategies—the players active at the parents of v can play. Let a Y = ( a y 1 , , a y k ) be any such an action profile, suppose i is the player active at v. Then, the action a v * = f v * ( a y 1 , , a y k ) selected by strategy f v * from A v should maximise i’s utility against a Y and all possible choices of action b X at the vertices in X = V \ ( Y { v } ) , provided any such action a v * exists. Subsequently, with the strategies for the vertices of greatest depth thus fixed, the strategies in f * should recursively find optimal strategies at vertices of lesser depth in a similar fashion.
In the case of partial order games, it is not strictly necessary to proceed recursively on the depth of the vertices. We can, instead, use any topological sorting τ = ( v τ 1 , , v τ | V | ) of the vertices that respects the graph and, starting with the vertex v τ j with maximal index τ j = | V | , we iterate through τ until we reach the vertex with minimal topological index. In this section, we will develop this more general concept of backward induction for partial order games that employs a topological order.
Our backwards induction procedure defines recursively a strategy profile f * = ( f v 1 * , , f v | V | * ) for each vertex v relative to a topological sorting τ = ( v τ 1 , , v τ | V | ) , and is formally defined as follows. Let v be a vertex with topological index τ j and let i be the player to move at v. Let Y = { y 1 , , y k } be the set of parents of v, Z = { z 1 , , z m } be the set of vertices with a topological index strictly greater than τ j , and X = { x 1 , , x } be given by V \ ( Y Z { v } ) . For every profile a Y = ( a y 1 , , a y k ) , we define f v * ( a y 1 , , a y k ) as an action a v * in A v such that, for all profiles b X = ( b x 1 , , b x ) ,
a v * arg max a v A v u i ( b X , a Y , a v , f Z * ) ,
where the basis of this recursion is provided by the case where Z = Ø (Here we exploit the notation b X , a Y , a v , f Z * to refer to the action profile that results if after τ -history ( b X , a Y , a v the strategy profile f * is played over the vertices in Z. Formally, b X , a Y , a v , f Z * is the action profile a g for strategy profile g = ( f X b X , f Y a Y , f v a v , f Z * ) , where ( f X b X , f Y a Y , f v a v ) is a profile of unconditional strategies (also see page 9, above)). To illustrate this definition, we have the following example.
Example 4.
Consider the utility function for the game depicted in Figure 10, and fix a topological sorting ( v 1 , v 2 , v 3 , v 4 ) . From inspection, it can easily be appreciated that the utility to player 1 is only dependent on their own choices at vertices v 1 and v 4 , and player 2’s at vertex v 2 . At this moment, we can apply the backwards induction procedure as follows. Inspecting their utility function, we find that player 3 chooses as follows at vertex v 3 :
f v 3 ( a c ) = f f v 3 ( a d ) = e f v 3 ( b c ) = f f v 3 ( b d ) = f
At vertex v 4 , player 1, can set their strategy such that f v 4 ( c ) = g and f v 4 ( d ) = h , because they are indifferent to any choice of actions by the other players. Player 2 can have their strategy at v 2 depend on player 1’s choice at v 1 . If player 1 chooses a at v 1 , then by playing c at v 2 , action profile a c f g will result and player 2 will obtain utility 3. Whereas, by playing d, action profile a d e h will ensue, with a utility of 1 for player 2. Therefore, f v 2 ( a ) = c . A similar reasoning yields that, if player 1 chooses b at v 1 , by playing c at v 2 , action profile ( b , c , f , g ) results with utility 0 for player 2, whereas, by playing d, action profile ( b , d , f , h ) ensues with utility 2 for player 2. Accordingly, f v 2 ( b ) = d . Finally, by playing a at v 1 , player 1 obtains utility 3, as action profile ( a , c , f , g ) would be played. Observe that by playing b at v 1 , player 1 obtains utility 2, as in that case action profile ( b , d , f , h ) would result. Accordingly, f v 1 ( ( ) ) = a . We thus obtain the strategy profile f = ( f v 1 1 , f v 2 2 , f v 3 12 , f v 4 2 ) yielding action profile ( a , c , f , g ) as the backwards induction solution of this game.
Some reflection reveals that on total-order games the procedure defined thus mimics standard backwards induction on the corresponding extensive-form games of perfect information as described in Section 4.4. It is important to note, however, that, on general partial order games, our backwards induction procedure does not always yield a strategy profile f * . This is because the outcome of a partial order game—and therefore also the players’ utilities as well as the possibility of finding a utility maximising action—need not be fully determined by the actions chosen at a given vertex v, at v’s parent vertices, and at the vertices with a larger topological index than v.
The outcome may also depend on the actions chosen at vertices of an equal or lesser depth that are not parents of the respective vertex. If so, it may happen that, at some stage of the procedure and at some vertex, no action can be singled out as an unequivocal optimal choice based on the choices at the parent nodes and those at the vertices with a greater topological index alone. This would cause the procedure to stall. More formally, this happens if for some vertex v, there are two different profiles b X = ( b w 1 , , b w ) and b X = ( b w 1 , , b w ) such that arg max a v A v u i ( b X , a Y , a v , f Z * ) and arg max a v A v u i ( b X , a Y , a v , f Z * ) are disjoint. The following example illustrates this point.
Example 5.
Consider again the game of Example 1, depicted in Figure 3. Let us first consider the strategies that player 3 can choose at vertex v 3 . This player has to specify an action from among e and f, for each of the possible action profiles player 1 and 2 can choose at vertices v 1 and v 2 , respectively, that is, for the action profiles a c , a d , b c , and b d . Inspecting player 3’s utilities, player 3 should arguably choose f v 3 ( a c ) = f , as no matter what action player 1 chooses at v 4 , player 3 is better off playing f than playing e. However, how to choose a value for f v 3 ( a d ) is not so obvious, as the optimal choice depends on whether player 1 chooses g or h at vertex v 4 ; this is something player 3 may not let their strategy depend upon as not v 3 E v 4 .
To formalise this issue, we say that a partial order game is fit for backwards induction under a topological sorting τ = ( v τ 1 , , v τ | V | ) if, for every vertex v, and every action profile a = ( a v 1 , , a v | V | ) ,
b X A X arg max a v A v u i ( b X , a Y , a v , a Z ) Ø ,
where Y is the set of parents of v, Z the set of vertices with a topological index that is not smaller than that of v, and X = V \ ( Y Z { v } ) . This ensures that, in Equation (1), the optimal action a v * is defined for every vertex v. At this point, it is worth observing that for total-order games, the set X will be empty and that they allow for only one topological sorting. It follows that total-order games vacuously satisfy the fit-for-backwards-induction condition under every topological sorting.
We are now in a position to show that our backwards induction procedure always yields a Nash equilibrium in games that are fit for backwards induction.
Theorem 7.
Let G be a partial order game that is fit for backwards induction under a topological order τ. Then, every strategy profile f * = ( f v 1 * , , f v | V | * ) generated by the backwards induction procedure under τ is a Nash equilibrium.
Proof. 
Let f * = ( f v 1 * , , f v | V | * ) be a strategy profile that our backwards induction procedure yields for topological order τ = ( v τ 1 , , v τ | V | ) . Without loss of generality, we may assume that τ j = j for all 1 j | V | , that is, v 1 , , v | V | are already topologically ordered. Assume for contradiction that f * is not a Nash equilibrium, that is, there is some strategy g i for some player i such that u i ( f i * , g i ) > u i ( f i * , f i * ) . Let g = ( f i * , g i ) .
Now, there is at least one vertex v j with 1 j | V | that is controlled by player i with parents Y = { y 1 , , y k } for which there is an action profile x = ( x v 1 , , x v j 1 ) in A v 1 × × A v j 1 such that:
u i ( x v 1 , , x v j 1 , a v j , f v j + 1 * , , f v | V | * ) < u i ( x v 1 , , x v j 1 , b v j , g v j + 1 , , g v | V | ) ,
where a v j = f v j * ( x y 1 , , x y k ) and b v j = g v j ( x y 1 , , x y k ) (To see this, let a f * = ( a v 1 , , a v | V | ) and a g = ( b v 1 , , b v | V | ) . Then observe that there must be a smallest index 1 j | V | such that v j V i with a v j b v j and a v m = b v m for all 1 m < j , which provides us with the desired v j ). Now we may assume that j is the largest index with v j V i for which such an action profile, or τ -history, x = ( x v 1 , , x v j 1 ) can be found. Therefore,
u i ( x v 1 , , x v j 1 , b v j , g v j + 1 , , g v | V | ) u i ( x v 1 , , x v j 1 , b v j , f v j + 1 * , , f v | V | * ) .
Combining inequalities (2) and (3), we obtain that
u i ( x v 1 , , x v j 1 , a v j , f v j + 1 * , , f v | V | * ) < u i ( x v 1 , , x v j 1 , b v j , f v j + 1 * , , f v | V | * ) .
Having assumed ( v 1 , , v | V | ) to be ordered topologically, the set Y = { y 1 , , y k } of parents of v j is a subset of { v 1 , , v j 1 } . Let further X = { v 1 , , v j 1 } \ Y and Z = { v j + 1 , , v | V | } . Recall that a v j = f v j * ( x y 1 , , x y k ) . We then find that, for profile x Y , there is some profile, namely x X , such that
f v j * ( x y 1 , , x y k ) arg max a v j A v j u i ( x X , x Y , a v j , f Z * ) .
It follows that our backwards induction procedure does not provide us with f v j * , which contradicts our initial assumption regarding the profile f * = ( f v 1 * , , f v | V | * ) . □
We also have the following lemma, which is a version of the one-stage-deviation property for partial order games, and which is an almost immediate consequence of the definition of our backwards induction procedure (also see in [28] (pages 108–110) and Osborne and Rubinstein [5] (page 98)). This property will prove to be of great help proving the computational complexity results in Section 7. Intuitively it says that to establish that a strategy profile f = ( f v 1 , , f v | V | ) is a backwards induction solution, it suffices to look at deviations by a player from f at a single vertex v k given a single history h = ( a v 1 , , a v k ) . The property is perhaps best understood by considering its contrapositive: strategy profile f is not a backwards induction solution if there is a single player who has an incentive to deviate from f at a single vertex v k given a single history h = ( a v 1 , , a v k ) . Note that we here assume that the game in question is fit for backwards induction.
Lemma 5 (One-stage-deviation property).
Let the vertices of a partial order game be sorted according to a topological sorting τ = ( v 1 , , v | V | ) . Then, for games that are fit for backwards induction, profile f = ( f v 1 , , f v | V | ) is a backwards induction solution for τ if and only if for every τ-history ( a v 1 , , a v k 1 ) with 1 k | V | we have for all b v k A v k :
u i ( a v 1 , , a v k 1 , f v k , f v k + 1 , , f v | V | ) u i ( a v 1 , , a v k 1 , b v k , f v k + 1 , , f v | V | ) ,
where i is the player assigned to v k .
Proof. 
First assume that f = ( f v 1 , , f v | V | ) is a backwards induction solution, and consider an arbitrary τ -history ( a v 1 , , a v k 1 ) and equally arbitrary b v k in A v k . Let Y be the parents of v k and X = { v 1 , , v k 1 } \ Y . Then,
u i ( b X , a Y , f v k ( a Y ) , f v k + 1 , , f v | V | ) u i ( b X , a Y , b v k , f v k + 1 , , f v | V | ) ,
for all profiles b X . Now observe that inequality (4) holds for in particular for b X = a X .
For the “if”-direction assume for contraposition that f = ( f v 1 , , f v | V | ) is not a backwards induction solution for τ . Then, there is a vertex v k with minimal topological index, parents Y, and X = { v 1 , , a k 1 } \ Y such that for some profile a Y , we have for every a v k in A v k that there is some profile b X such that
u i ( b X , a Y , a v k , f v k + 1 , , f v | V | ) < u i ( b X , a Y , b v k , f v k + 1 , , f v | V | ) .
At this point, consider any a v k in A v k and let b X be the accompanying profile for which Equation (5) holds. We can conclude the proof by considering the τ -history ( b X , a Y ) . □

6.2. Games of Adequate Information

In this section, we propose the class of adequate information games, which are guaranteed to be fit for backwards induction, and, thus, always have a Nash equilibrium. As an auxiliary concept we first introduce the notion of scrutability, an informational criterion specifying on which choices of which players a player can have its decision depend when choosing an action at a specific vertex.

6.2.1. Scrutability

We introduce scrutability as a technical term to reason about the strategic information that is available to the players at each stage of the backwards induction process. Intuitively, let v and w be a vertices at which, respectively, players i and j are active. Then, assuming that the game is fit for backwards induction, a vertex w is scrutable to v if j’s choices of action at w can make a difference as to the optimality of i’s choice of action at v when deciding on a backwards induction strategy at v. Thus, all vertices reachable from v are scrutable, as their backwards induction choices will already have been made. Moreover, the player active at v can have their choice of action depend on the choices of action at the parent nodes of v.
Formally, we say that a vertex w is scrutable to vertex v if either
(i)
w is reachable from v, or
(ii)
w E v , that is w is one of v’s parents.
We refer to a vertex that is not scrutable to v as being inscrutable to v. Recall the reachability relation is reflexive, and, thus, vertex v is always scrutable to v itself. Observe that the relations of scrutability and inscrutability are not necessarily symmetric. For instance, if a vertex v is reachable from w, but w is not a parent of v, then v is scrutable to w, but not vice versa. If v is inscrutable to w, and w is also inscrutable to v, we say that v and w are independent.
We differentiate between a vertex being scrutable to a vertex and a vertex being scrutable to a player. For V i = { v 1 , , v m } player i’s vertices, we denote by I v the set of vertices that are inscrutable to v, and by S v the set of vertices that are scrutable to v. Furthermore, let I i = I v 1 I v m and S i = V \ I i denote the sets of vertices that are inscrutable and scrutable to player i, respectively. That is, if a vertex v is inscrutable to a player i, if v is inscrutable to at least one vertex at which i is active. Vertex v is scrutable to player i, if it is not inscrutable to i, that is, if v is scrutable to all vertices at which i is active. It may be worth observing that in total-order games all vertices will be scrutable from any other vertex.
Furthermore, we introduce a notion of scrutability that is relative to a topological sorting of the vertices. Therefore, let τ = ( v τ 1 , , v τ | V | ) be a topological sorting. Then, a vertex w with topological index τ w is scrutable under τ to vertex v with topological index τ v , if either
(i)
τ w τ v , or
(ii)
w E v , that is w is one of v’s parents.
Any vertex that is not scrutable under τ is referred to as being inscrutable under τ . For V i = { v 1 , , v m } and topological sorting τ = ( v τ 1 , v τ | V | ) , we furthermore denote by I v τ the set of vertices that are inscrutable to v under τ , and by S v τ the set of vertices that are scrutable to v under τ .
Moreover, we define I i τ = I v 1 τ I v m τ and S i τ = V \ I i τ . Note that the set of vertices scrutable to a player i need not include the set V i of vertices where i is active.
Given a topological sorting τ and a vertex v, by Y, we will generally adhere to the following notational conventions. The parents of v we generally denote by Y, the set of vertices inscrutable to v by X, and the set of all vertices that are not parents of v but still scrutable to v by Z. The following example illustrates these concepts.
Example 6.
Consider the game depicted in Figure 10. Then, we have the following sets of scrutable and inscrutable vertices.
S v 1 = { v 1 , v 2 , v 3 , v 4 } I v 1 = Ø S 1 = { v 2 , v 4 } I 1 = { v 1 , v 3 } S v 2 = { v 1 , v 2 , v 3 , v 4 } I v 2 = Ø S 2 = { v 1 , v 2 , v 3 , v 4 } I 2 = Ø S v 3 = { v 1 , v 2 , v 3 } I v 3 = { v 4 } S 3 = { v 1 , v 2 , v 3 } I 3 = { v 4 } S v 4 = { v 2 , v 4 } I v 4 = { v 1 , v 3 }
For the topological sorting τ = ( v 1 , v 2 , v 3 , v 4 ) , we get
S v 1 τ = { v 1 , v 2 , v 3 , v 4 } I v 1 τ = Ø S 1 τ = { v 2 , v 4 } I 1 τ = { v 1 , v 3 } S v 2 τ = { v 1 , v 2 , v 3 , v 4 } I v 2 τ = Ø S 2 τ = { v 1 , v 2 , v 3 , v 4 } I 2 τ = Ø S v 3 τ = { v 1 , v 2 , v 3 , v 4 } I v 3 τ = Ø S 3 τ = { v 1 , v 2 , v 3 , v 4 } I 3 τ = Ø S v 4 τ = { v 2 , v 4 } I v 4 τ = { v 1 , v 3 }
By contrast, under topological sorting τ = ( v 1 , v 2 , v 4 , v 3 ) , we obtain
S v 1 τ = { v 1 , v 2 , v 3 , v 4 } I v 1 τ = Ø S 1 τ = { v 2 , v 3 , v 4 } I 1 τ = { v 1 } S v 2 τ = { v 1 , v 2 , v 3 , v 4 } I v 2 τ = Ø S 2 τ = { v 1 , v 2 , v 3 , v 4 } I 2 τ = Ø S v 3 τ = { v 1 , v 2 , v 3 } I v 3 τ = { v 4 } S 3 τ = { v 1 , v 2 , v 3 } I 3 τ = { v 4 } S v 4 τ = { v 2 , v 3 , v 4 } I v 4 τ = { v 1 }
For another example, consider Figure 11 along with topological order τ = ( v 1 , v 2 , v 3 , v 4 ) . We then obtain for the vertices the following sets of scrutable and inscrutable vertices for vertices as well as for players:
S v 1 τ = { v 1 , v 2 , v 3 , v 4 } I v 1 τ = Ø S 1 τ = { v 1 , v 2 , v 3 , v 4 } I 1 τ = Ø S v 2 τ = { v 2 , v 3 , v 4 } I v 2 τ = { v 1 } S 2 τ = { v 2 , v 4 } I 2 τ = { v 1 , v 3 } S v 3 τ = { v 1 , v 2 , v 3 , v 4 } I v 3 τ = Ø S v 4 τ = { v 1 , v 2 , v 4 } I v 4 τ = { v 3 }
Observe that w being reachable from v implies τ w τ v , in other words, S v S v τ for every topological order τ . Accordingly, plain scrutability is stronger than scrutability under a topological sorting. The following lemma lays down the precise formal interrelationship between scrutability and scrutability under a topological order.
Lemma 6.
Let i be a player of a partial order game and v a vertex controlled by i. Then, for all vertices w,
(i) 
w is scrutable to v if and only if w is scrutable to v under all topological orders τ,
(ii) 
w is inscrutable to v if and only if w is inscrutable to v under some topological order τ,
(iii) 
w is scrutable to i if and only if w is scrutable to i under all topological orders τ,
(iv) 
w is inscrutable to i if and only if w is inscrutable to i under some topological order τ.
Proof. 
We prove part (i) first. Then, parts (ii) through (iv) follow by straightforward set-theoretic reasoning.
(i)
For the “only if”-direction, first assume that w is scrutable to v and consider an arbitrary topological sorting τ with topological indices τ i and τ j for v and w, respectively. Then, either w E v or w is reachable from v, that is, v E * w . In the former case, we have immediately that w is scrutable to v under τ . In the latter case, observe that then τ i < τ j by definition of a topological sorting. It then also follows that w is scrutable to v under τ .
For the “if”-direction, assume for contraposition that w is not scrutable to v. Then, neither w E v nor w being reachable from v hold. In case v is reachable from w, let τ be any topological order with τ i and τ j the topological indices for v and w. Then, τ j < τ i , and it follows that w is not scrutable under τ . In case v is not reachable from w, let ( V , E ) be the DAG underlying the partial order game. Then, consider the edge set E = E { ( w , v ) } . The graph ( V , E ) must be acyclic because if ( V , E ) contained a directed cycle, there would be a simple path from v to w not involving the edge ( w , v ) . Therefore, w would also be reachable from v in ( V , E ) , a contradiction. Therefore, ( V , E ) is acyclic and has a topological sorting τ . Observe that τ is also a topological order for ( V , E ) . Let τ i and τ j be the topologic al indices for v and w, respectively. As w E v , we have τ j < τ i and again we may conclude that w is not scrutable to v under τ .
(ii)
This part follows immediately from part (i), by observing the following equivalences:
w is inscrutable to v if and only if w is not scrutable to v if and only if w is not scrutable to v under some topological order τ if and only if w is inscrutable to v under some topological order τ
(ii)
Let v 1 , , v m be the vertices assigned to player i, and let τ 1 , , τ k be the topological sortings for the game. From (ii) we obtain that I v j = I v j τ 1 I v j τ k for every 1 j m .
(iv)
Therefore,
I i = 1 j m I v j = 1 j m I v j τ 1 I v j τ k = 1 k I v 1 τ I v m τ = 1 k I i τ ,
and it follows that w is inscrutable to i if and only if w is inscrutable to i under some topological sorting. Part (iii) then is an immediate consequence of de Morgan’s law:
S i = V \ I i = V \ ( I i τ 1 I i τ k ) = V \ I i τ 1 V \ I i τ k = S v 1 τ 1 S i τ k = 1 k S i τ .
Therefore, w is scrutable to i if and only if i is scrutable to i under all topological sortings.
This concludes the proof. □

6.2.2. Games of Adequate Information

In this section, we define a class of partial order games, which we refer to as games of adequate information. We prove that games of adequate information are fit for backward induction, and thus, on account of Theorem 7, allow for Nash equilibria. For this, we require that each player at each vertex assigned to them is in a position to determine their optimal choices if they are given only the choices at their parents and those at the nodes with a greater topological index.
Formally, we introduce the concept of a utility function being of adequate information and a variant of the notion parametrised by a topological order τ . These concepts only differ in the vertices that are considered to be scrutable and inscrutable. Let i be a player with utility function u i : A v 1 × × A v | V | R . Without loss of generality, we assume that V i = { v 1 , , v m } . Let S be the set of vertices scrutable to i (under τ ) and I the set of vertices inscrutable to i (under τ ). Then, player i’s utility function u i : A V R is said to be of adequate information (under τ ) or adequately informed (under τ ), if there is a function u i 0 : A S R such that for all profiles a S , a S A S and b I A I
u i ( a S , b I ) u i ( a S , b I ) if and only if u i 0 ( a S ) u i 0 ( a S ) .
If this is the case, we say that player i’s utility function u i : A R (ordinally) reduces to  u i 0 : A S i R .
A game of adequate information under τ is, then, a partial order game in which each player’s utility function is of adequate information under τ . We also define a concept of adequate information that is irrespective of the topological sorting τ in the straightforward manner. Thus, a game of adequate information simpliciter is a partial order game in which each player’s utility function is of adequate information. Observe that total-order games constitute a subclass of adequate-information games, since such games do not allow for inscrutable vertices. To illustrate, we have the following example:
Example 7(Game of adequate information).
Consider the game depicted in Figure 10 again. Under topological sorting τ = ( v 1 , v 2 , v 3 , v 4 ) , player 1 has scrutables S 1 τ = { v 2 , v 4 } and inscrutables I 1 τ = { v 1 , v 3 } . For player 1, observe that for all a A v 2 × A v 4 and all b , b A v 1 × A v 3 we have that u 1 ( a , b ) = u 1 ( a , b ) . For instance, u 1 ( c g , a e ) = u 1 ( c g , a f ) = 3 and u 1 ( d g , a f ) = u 1 ( d g , b f ) = 0 . Accordingly, define u 1 0 : A v 2 × A v 4 R such that
u 1 0 ( c g ) = 3 u 1 0 ( c h ) = 1 u 1 0 ( d g ) = 0 u 1 0 ( d h ) = 2 ,
and find that u 1 ordinally reduced to u 1 0 and, hence, that player 1 is adequately informed in this game. For players 2 and 3, we saw that S 2 τ = S 3 τ = { v 1 , v 2 , v 3 , v 4 } and I 2 τ = I 3 τ = Ø . As some reflection reveals, u 2 and u 3 thus vacuously reduce to themselves. We therefore immediately see that their utility functions are of adequate information under τ as well.
Finally, consider once more the game depicted in Figure 11 along with the natural topological order τ = ( v 1 , v 2 , v 3 , v 4 ) . Under τ, this game is of adequate information. As I 1 τ = Ø , this is easy to see for player 1. For player 2 we have S 2 τ = { v 2 , v 4 } and I 2 τ = { v 1 , v 3 } . Now, we can define u 2 0 such that
u 2 0 ( c g ) = 1 u 2 0 ( c h ) = 4 u 2 0 ( d g ) = 8 u 2 0 ( d h ) = 2
It then only requires an easy check to see that u 2 ordinally reduces to u 2 0 . For instance, u 2 0 ( c g ) < u 2 0 ( d g ) and also
u 2 ( c g , a e ) = 2 < 9 = u 2 ( d g , a e ) u 2 ( c g , a f ) = 3 1 2 < 7 = u 2 ( d g , a f ) u 2 ( c g , b e ) = 2 < 7 = u 2 ( d g , b e ) u 2 ( c g , b f ) =   2   <   4 = u 2 ( d g , b f )
By contrast, under topological order τ = ( v 2 , v 1 , v 4 , v 3 ) , player 1’s utility function is not of adequate information. To see this, first observe that S 1 τ = { v 1 , v 3 , v 4 } and I 1 τ = { v 2 } . Now, u 1 ( a e g , c ) = 2 < 9 = u 1 ( a e h , c ) , whereas u 1 ( a e g , d ) = 5 > 3 = u 1 ( a e h , d ) . Had there been a function u 1 0 : A S 1 τ R such that u 1 ordinally reduces to u 1 0 , then both u 1 0 ( a e g ) < u 1 0 ( a e g ) and u 1 0 ( a e g ) > u 1 0 ( a e g ) , a contradiction.
The following result establishes that games of adequate information (under a topological sorting τ ) are fit for backwards induction (under τ ).
Theorem 8.
Let τ = ( v τ 1 , , v τ | V | ) be a topological sorting. Then,
(i) 
games of adequate information under τ are guaranteed to have a Nash equilibrium, which can be found using our backward induction procedure under τ, and
(ii) 
games of adequate information simpliciter are guaranteed to have a Nash equilibrium, which can be found using our backward induction procedure underanytopological sorting.
Proof. 
Let τ = ( v τ 1 , , v τ | V | ) be an arbitrary topological sorting. For part (i), due to Theorem 7, it suffices to prove that games of adequate information under τ are fit for backward induction under τ . To this end, consider an arbitrary game of adequate information under τ and consider an arbitrary vertex v. Let i be the player assigned to v. Furthermore, let I i τ = { w 1 , , w k } denote the set of vertices that are inscrutable to i under τ and S i τ = { w k + 1 , , w | V | } denote the set of vertices that are scrutable to i under τ . We let Y denote the set of parents of v, and Z the set of vertices with a strictly greater topological index than v. Then, set X = V \ ( Y Z { v } ) and let X = { x 1 , , x | X | ] } . Observe that X is the set of vertices that are inscrutable to v under τ , and thus all vertices in X are inscrutable to i under τ , that is, X I i τ .
By assumption, player i’s utility function u i : A R is adequately informed under τ . In particular, we may assume that u i ordinally reduces to a function u i 0 : A S i τ R .
Now, consider an arbitrary action profile a = ( a 1 , , a | V | ) . Having assumed A to be finite, there must be an action a v * in A v with
a v * arg max a v A v u i ( a X , a Y , a v , a Z ) .
Let a * = ( a X , a Y , a v * , a Z ) .
Consider an arbitrary a v in A v , and let a = ( a X , a Y , a v , a Z ) . Then, u i ( a * ) u i ( a ) . Having assumed that the game is of adequate information we may assume that there is a function u i 0 : A S i τ R such that
u i 0 ( a S i τ * ) u i 0 ( a S i τ ) .
Consider an arbitrary profile b in A . Then, u i ( b I i τ , a S i τ * ) u i ( b I i τ , a S i τ ) . Now, either v is scrutable to i or v is inscrutable to i. If the latter, I i τ = X { v } . Then, u i ( b X , a Y , a v * , a Z ) = u i ( b X , a Y , a v , a Z ) . With a v having been chosen arbitrarily, a v * arg max a A v u i ( b X , a Y , a v , a Z ) . If the former, I i τ = X , and we find that u i ( b X , a Y , a v * , a Z ) u i ( b X , a Y , a v , a Z ) . Again, we may conclude that a v * arg max a A v u i ( b X , a Y , a v , a Z ) . In either case, and with b having been chosen arbitrarily, a v * b X A X arg max a v A v u i ( b X , a Y , a v , a Z ) , which proves the case.
Part (ii) follows by an argument that runs along analogous lines as that for part (i), reasoning about the vertices that are inscrutable to i simpliciter rather than under a specific topological sorting τ . In this context, observe that, by virtue of Lemma 6, the set I i τ of vertices inscrutable to i under a topological sorting τ constitutes invariably a subset of the set I i of vertices inscrutable to i simpliciter. □

7. Computing Backwards Induction Solutions for Partial Order Games

In this section, we explore a number of natural computational problems surrounding our backwards induction procedure for partial order games and games of adequate information. We first conduct an asymptotic analysis of our backwards induction procedure. We find that the procedure runs in exponential time in the general case, but that a considerable speed up is possible if the in-degree of the vertices or the number of their parents is bounded appropriately. Subsequently, we investigate the computational complexity of deciding whether a partial order game is fit for backwards induction, that is if the precondition for our procedure to produce a Nash equilibrium holds. In a similar vein, we also address the closely related question of whether a given partial order game is a game of adequate information. We find that both these problems are coNP -complete. Finally, we come to consider the problems of deciding whether a given strategy profile is a backwards induction solution and whether a given action profile is sustained by a backwards induction solution. We show that the latter problem is NEXPTIME -complete for the general case, but that it is PSPACE -complete for total-order games.

7.1. Asymptotic Complexity of Backwards Induction

Due to the compactness of the partial order game representation, our backwards induction procedure runs in time exponential in the number | V | of vertices. Let v 1 , , v | V | be ordered according to the topological sorting τ under which backwards induction is conducted. Throughout this section, as we did before, for each vertex v, by Y we denote the parents of v, by Z the vertices with a greater topological index than v, and let X = V \ ( Y Z { v } ) . Then, for each vertex v j , where 1 j | V | , with parents Y and τ -inscrutable vertices X we have to find, an action a v j * in A v j such that, for i the active player at v j ,
a * b X A X arg max a v j A v j u i ( b X , a Y , a v j , f Z * ) ,
and abort if no such action exists. This requires for each profile a Y in A Y the computation of | A v j | · u X | A u | utility values u i ( a X , a Y , a v j , a Z * ) . Letting A ¯ = max v V A v , this makes for a total number of utility computations of
j = 1 | V | k = 1 j | A k | j = 1 | V | k = 1 j | A ¯ | = j = 1 | V | | A ¯ | j = j = 0 | V | | A ¯ | j 1 = | A ¯ | | V | + 1 1 | A ¯ | 1 1 = O ( | A ¯ | | V | ) .
Assuming that the players’ utilities are represented by a Boolean circuit, each of these can be achieved in time polynomial in | A | . Following the definition, we can therefore compute a backwards induction solution in O ( | A | | V | ) time.
It is not hard to see that, if we restrict ourselves to partial order games of adequate information in which all vertices have at most log 2 ( | A | ) parents, a backwards induction profile can be computed in time pseudo-polynomial in | A | . If, in addition to their having at most log 2 ( | A | ) parents, every vertex v of a game of adequate information has at most two actions, that is, | A v | 2 , then a backwards induction profile can be computed in time polynomial in | A | . Summarising, we have the following proposition.
Proposition 5.
Let Y v denote the set of parents of a vertex v. For a partial order game of adequate information, a backwards induction profile can then be computed in time:
(i) 
O ( | A | | V | ) , in the general case,
(ii) 
O ( | A | log 2 ( | A | ) ) , if | Y v | log 2 ( | A | ) for all vertices v,
(iii) 
O ( | A | ) , if | Y v | log 2 ( | A | ) and | A v | 2 for all vertices v.

7.2. Computational Complexity of Backwards Induction

In this section, we investigate the natural computation problems of deciding whether a backwards induction solution exists in a given partial order game for a given topological order. Furthermore, we analyse how hard it is to decide whether a given strategy profile incorporates a backwards induction solution. Finally, recalling that action profiles and strategy profiles are essentially different objects in the setting of partial order games, we explore how computational complex it is to decide whether a given action profile is sustained by a backwards induction solution, that is, whether, for a given action profile a , there is a backwards induction solution f such that a = a f . Throughout this section, we assume that strategies and utilities are concisely represented as Boolean circuits as outlined in Section 3.1.
First, we consider the following computational problem concerning the conditions under which a partial order game is fit for backwards induction. Formally:
fit for backwards induction
Given: Partial order game G and a topological sorting τ of its vertices
Problem: Is G fit for backwards induction under  τ ?
For total-order games, fit for backwards induction is vacuous, as they are all fit for backwards induction. For general partial order games, however, we have the following.
Theorem 9.
fit for backwards inductionis coNP -complete.
Proof. 
To see that fit for backwards induction is in coNP , let τ be a topological sorting of the game’s vertices. A certificate for a counterexample is then given by
(i)
a vertex v, with A v = { a 1 , , a k } , and X, Y, and Z the set of inscrutable vertices with a lower topological index, the set of v’s parents, and the vertices with a higher topological index than v, respectively;
(ii)
a profile a Y in A Y ;
(iii)
a profile a Z in A Z ;
(iv)
a tuple ( a 1 , , a k ) of k actions a j in A k ; and
(v)
a tuple ( b X 1 , , b X k ) of k profiles b X j in A X .
It is easy to verify that such a certificate is still of polynomial size. Observe that, if u i ( b X j , a Y , a j , a Z ) > u i ( b X j , a Y , a j , a Z ) holds for some 1 j k , we may conclude that
a j b X A X arg max a v A v u i ( b X , a Y , a v , a Z ) .
Therefore, such a certificate is a counterexample to the game being fit for backwards induction under τ if and only if u i ( b X j , a Y , a j , a Z ) > u i ( b X j , a Y , a j , a Z ) for every 1 j k . Observe that determining the sets X, Y, and Z can be achieved in polynomial time. Having assumed, moreover, that the utility function u i is part of the input and represented by a Boolean circuits and evaluation problem for Boolean circuits is solvable in polynomial time, we find that this is a property that can be checked in polynomial time.
For coNP -hardness, we reduce the complement of satisfiability. Let φ be a propositional formula in the propositional variables p 1 , , p m . Let q and r be two additional, fresh, propositional variables. We construct a partial order Boolean game with three players, player 1, player 2, and player 3 on a DAG ( V , E ) with V = { v q , v r , v p 1 , , v p m } and E such that ( v , u ) E whenever v = v q and u = v p j for some 1 j m , or v = v p j for some 1 j m and u = v r . Furthermore, see Figure 12 for an illustration of the construction of such a DAG in the case φ is defined over five propositional variables. Let v q be assigned to player 1, the vertices v p 1 , , v p m to player 2, and v r to player 3. Furthermore, assume that γ 1 = γ 2 = and γ 3 = φ ( q r ) . Consider an arbitrary topological sorting τ of the vertices.
First, assume that φ is not satisfiable. Then, all players are indifferent between all outcomes, and it can easily be seen that the game is fit for backwards induction under τ .
Finally, assume that assignment α : { p 1 , , p m } { 0 , 1 } satisfies φ . Observe that v q is inscrutable to v r and has a lower topological rank under τ . Moreover, v p 1 , , v p m are the parents of v r . Now, let a v p 1 , , a v p m be such that a p j = p j if α ( p j ) = 1 , and a p j = p ¯ j if α ( p j ) = 0 . Furthermore, let b v q = q and b v q = q ¯ , and a v r = r and a v r = r ¯ . Then,
u 3 ( q , a v p 1 , , a v p m , r ) > u 3 ( q , a v p 1 , , a v p m , r ¯ )
but
u 3 ( q ¯ , a v p 1 , , a v p m , r ) < u 3 ( q ¯ , a v p 1 , , a v p m , r ¯ ) .
Accordingly, we may conclude that the game is not fit for backwards induction. □
The closely related decision problem of whether a given partial order game is of adequate information, can be formulated as follows.
adequate information
Given: Partial order game G and topological sorting  τ
Problem: Is G a game of adequate information under  τ ?
Again, note that adequate information is vacuous for total-order games. For general partial order games, we find that the problem of adequate information is not easier than fit for backwards induction. The similarity of the two problems—and so also the closeness of the concepts of fitness of backwards induction and adequate information—is revealed by the closeness of their respective proofs of coNP -completeness.
Theorem 10.
adequate informationis coNP -complete.
Proof. 
For membership in coNP , let a certificate for a counterexample consist of a player i, a pair of profiles a S and a S in A S , and a pair of profiles b I and b I in A I , where S is the set of vertices that are scrutable to i under τ and I the set of vertices that are inscrutable to i under τ . Observe that such a certificate is of polynomial size and that the sets S and I can be computed in polynomial time. Then, such a certificate ( a S , a S , b I , b I ) defines a counterexample against the game being of adequate information whenever it is not the case that u i ( a S , b I ) u i ( a S , b I ) if and only if u i ( a S , b I ) u i ( a S , b I ) . Having assumed that the utility function u i is part of the input and represented by a Boolean circuit and evaluation problem for Boolean circuits is solvable in polynomial time, we find that this is a property that can be checked in polynomial time (see Lemma 2).
The proof of coNP -hardness is by a reduction from the complement of satisfiability, and runs along analogous lines as the proof of coNP -hardness of fit for backwards induction. Thus, given a propositional formula φ in the variables p 1 , , p k , construct the partial order Boolean game as in the proof of Theorem 9.
Then, if φ is not satisfiable, we find that for all three players, each action profile yields a utility of 1. It thus immediately follows that the game is of adequate information, as we can set, for instance, u i 0 ( a S ) = 1 for all players i with set S of scrutable vertices under τ .
For the opposite direction, assume that assignment α : { p 1 , , p m } { 0 , 1 } witnesses the satisfiability of φ . Observe that v q is inscrutable to player 3 under τ , whereas v q , v p 1 , , v p m are scrutable to player 3. Now let a v p 1 , , a v p m be such that a p j = p j if α ( p j ) = 1 , and a p j = p ¯ j if α ( p j ) = 0 . Then,
u 3 ( q , a v p 1 , , a v p m , r ) > u 3 ( q , a v p 1 , , a v p m , r ¯ )
but
u 3 ( q ¯ , a v p 1 , , a v p m , r ) < u 3 ( q ¯ , a v p 1 , , a v p m , r ¯ ) .
Assuming that the game be of adequate information, there then should be a utility function u 3 0 : A S R such that from the former it would follow that u 3 0 ( a v p 1 , , a v p m , r ) > u 3 0 ( a v p 1 , , a v p m , r ¯ ) , whereas from the latter u 3 0 ( a v p 1 , , a v p m , r ) < u 3 0 ( a v p 1 , , a v p m , r ¯ ) , a contradiction. Accordingly, we may conclude that the game is not of adequate information. □
Using Theorems 7 and 8, we know that games that are fit for backwards induction, like games of adequate information, have Nash equilibria. Thus, Theorems 9 and 10 could be seen to prove coNP -completeness of weak versions of the non-emptiness problem for backwards induction. This still leaves the question of whether a given strategy profile is a backwards induction solution and of whether a given action profile is sustained by a backwards induction solution, which can be seen as variants of the non-emptiness problem for backwards induction. The first decision problem can be formulated formally as follows.
is backward induction
Given: Partial order game G that is fit for backwards induction under topological sorting  τ and strategy profile f = ( f v 1 , , f v | V | )
Problem: Is  f a backwards induction solution of G under  τ ?
We find that is backward induction is coNP -complete. For the membership part, recall that Lemma 5 showed that our backwards induction procedure has the one-stage-deviation property. Let τ be a topological sorted and assume ( v 1 , , v | V | ) be sorted according to τ . The contrapositive of the one-stage-deviation property says that, if a profile f is not a backwards induction solution, there is one player, who, after some τ -history, wants to unilaterally deviate and play another action than the one prescribed by f , but otherwise has no incentive to change their strategy. This prepares the ground for the following result.
Theorem 11.
is backward inductionis coNP -complete. The problem remains coNP -hard for total-order games.
Proof. 
For membership in coNP , one can design a non-deterministic algorithm along the following lines. Assume v v 1 , , v v | V | is ordered according to τ . Guess a vertex v k assigned to player i along with action profiles a = ( a X , a Y , a v k , a Z ) and a = ( a X , a Y , a v k , a Z ) (thus, a X = a X and a Y = a Y ). Let Y denote the parents of v k , Z the vertices with a greater topological index than v k , and X = V \ ( Y Z { v k } ) . Then, check whether
(i)
the actions a v k , a v k + 1 , a v | V | are in accordance with f = ( f v 1 , , f v | V | ) , that is, for every k m | V | , we have a v m = f v m ( a Y m ) , where Y m is the set of parents of v m ;
(ii)
the actions a v k + 1 , , a v | V | are in accordance with f = ( f v 1 , , f v | V | ) , that is, for every k < m | V | , we have a v m = f v m ( a Y m ) , where Y m is the set of parents of v m ; and
(iii)
u i ( a ) > u i ( a ) .
In virtue of Lemmas 1 and 2, each of these checks can be performed in polynomial time in the size of the input. Moreover, if all of these checks are positive, we have found a counterexample against f being a backwards induction solution.
For coNP -hardness, we reduce the complement of satisfiability. Given an Boolean formula φ on propositional variables p 1 , , p k , we construct a partial order Boolean game G φ with one player i with goal formula γ i = φ p 0 , where p 0 is a “fresh” variable distinct from any p 1 , , p k . Let the vertices v p 0 , , v p k be ordered accordingly, that is, p E p m if and only if < m . Thus, G φ is a totally-ordered Boolean game, and, consequently, also fit for backwards induction. Now, consider the strategy profile f 0 = ( f v p 0 0 , , f v p k 0 ) where each f v p j 0 is represented by the choice equation p j : , that is, f 0 unconditionally sets every propositional variable to false. Note that f 0 yields player i a utility of 0, that is u i ( f 0 ) = 0 .
Now, if φ is unsatisfiable, so is i’s goal γ i , and all strategy profiles yield i the same payoff. Therefore, f 0 is vacuously a backwards induction solution. If, on the other hand, φ is satisfiable, say in virtue of assignment α , let f α = ( f v p 0 , , v p k ) be the unconditional strategy profile represented by the choice equations p 0 = and, for each 1 m k , p m : , if α ( p m ) = 1 , and p m : , if α ( p m ) = 0 . It can then easily be verified that player i has u i ( f α ) = 1 . Accordingly, player i has an incentive to deviate from f 0 to f α , indicating that f 0 is not a Nash equilibrium. By Theorem 7, it then follows that f 0 is not a backwards induction solution either. □
Finally, we come to consider the natural counter-part of the is nash actions problem for the backwards induction procedure, which can formally be stated as follows.
is backward induction actions
Given: Partial order game G, topological sorting  τ , and action profile a = ( a v 1 , , a v | V | )
Problem: Is  a sustained by a backwards induction solution under  τ in G?
We find that backwards induction actions is NEXPTIME -complete for general partial order games. The hardness part of this result follows again from a reduction from dependency quantifier boolean formula game (dqbfg).
Theorem 12.
For partial order games,backward induction actionsis NEXPTIME -complete.
Proof. 
A NEXPTIME algorithm to decide backward induction actions can be designed along the following lines. Given a partial order game G, a topological-ordering τ = ( v 1 , , v | V | ) of its vertices, and an action profile a = ( a v 1 , , a v | V | ) , first, guess a strategy profile f = ( f v 1 , , f v | V | ) . Given that strategies f v are given by a Boolean circuit C f v , this can be achieved in time not more than exponential in | A | , the size of the set of actions. Second, check whether f = ( f v 1 , , f v | V | ) is a backwards induction solution. The inductive definition of a backwards induction solution suggests a procedure for how this can be achieved in exponential time. Finally, check whether action profile a f as induced by strategy profile f coincides with the action profile a as it is given in the input. This can be achieved in time polynomial in the size of the Boolean circuit representing f . Altogether, the algorithm runs in non-deterministic exponential time.
For hardness, we reduce from dependency quantifier boolean formula game (dqfg) using a similar construction of a Boolean partial order game as in the proof of Theorem 4. Given an instance φ , X 1 , X 2 , Y 1 , Y 2 of dqfg with X 1 = { x 1 1 , , x | X 1 | 1 } , X 2 = { x 1 2 , , x | X 2 | 2 } , Y 1 = { y 1 1 , , y | Y 1 | 1 } , and Y 2 = { y 1 2 , , y | Y 2 | 2 } . Let x 0 be a ‘fresh’ variable x 0 not in X 1 X 2 Y 1 Y 2 . We define a Boolean partial order game on dependency graph ( V , E ) such that V = { x 0 } X 1 X 2 Y 1 Y 2 and where E is defined as follows:
(i)
x 0 E z    for all z X 1 X 2 Y 1 Y 2 ,
(ii)
x k i E x m j  for all x k i X i and x m j X j with i = j and k < m , or i j ,
(iii)
x 1 E x 2 , for all x 1 X 1 and x 2 X 2 ,
(iv)
x 1 E y 1 , for all x 1 X 1 and y 1 Y 2 ,
(v)
x 2 E y 2 , for all x 2 X 2 and y 2 Y 2 .
Observe that under this definition on X 1 X 2 , the relation E is complete and transitive. We extend E to a topological sorting τ such that, for instance,
τ = ( x 0 , x 1 1 , , x | X 1 | 1 , x 1 2 , , x | X 2 | 2 , y 1 1 , , y | Y 1 | 1 , y 1 2 , , y | Y 2 | 2 ) .
The players of the Boolean game are the same as in the instance of dqbfg, where B controls { x 0 } X 1 X 2 , W 1 the variables in Y 1 , and W 2 those in Y 2 . Their goals are given by
γ B = x 0 φ γ W 1 = γ W 2 = .
Observe that all vertices are scrutable to player B under τ and that the white players have the same utility for every outcome. It therefore follows that the game constructed is of adequate information and hence fit for backwards induction.
We are now in a position to show that the profile
a ¯ = ( x ¯ 0 , x ¯ 1 1 , x ¯ | X 1 | 1 , x ¯ 1 2 , x ¯ | X 2 | 2 , y ¯ 1 2 , y ¯ | Y 1 | 1 , y ¯ 1 2 , y ¯ | Y 2 | 2 ) ,
which sets all variables, including x 0 , to false, is sustained by a Nash equilibrium in G if and only if φ , X 1 , X 2 , Y 1 , Y 2 is a postive instance of dqbfg.
First, assume φ , X 1 , X 2 , Y 1 , Y 2 is a postive instance of dqbfg. Then, the white team has a joint winning strategy in the original dqbfg-game given by Boolean functions β 1 and β 2 on the variables X 1 and X 2 , respectively. Next, define strategies f W 1 * and f W 2 * for W 1 and W 2 in the Boolean partial order game that are given by the following choice equations for i = 1 , 2 and j = 1 , , | Y i | ,
y j i : ( x 0 φ β i ( x 1 i , , x | X i | i ) ) ( x ¯ 0 )
As the dependency graph ( V , E ) respects the information dependencies of the dqbfg-instance, observe that f W 1 and f W 2 together embody a winning strategy to render φ false if x 0 is played. Let B’s unconditional strategy f B * be defined by the choice equations of the form x j i : , setting x j i to false for all for i = 1 , 2 and j = 1 , , | X i | . Observe that for the strategy profile f * = ( f B * , f W 1 * , f W 2 * ) , we have that a f * = a ¯ , as desired. As W 1 and W 2 will not have their goal achieved under any action profile, their strategies trivially conform with the backwards induction procedure. Observe furthermore that B does not get their goal satisfied no matter which strategy plays against f W 1 * and f W 2 * . If B sets x 0 to false, their goal is not satisfied in a self-defeating fashion. By setting x 0 to true, B will induce W 1 and W 2 to play a winning strategy rendering ¬ φ false and therewith γ B as well. This in particular holds for any backwards induction strategies B has at their disposal, and it follows that f * is a backwards induction solution sustaining a ¯ .
For the opposite direction, assume that φ , X 1 , X 2 , Y 1 , Y 2 is a negative instance of dqbfg. Analogously to the proof of Theorem 4, it can then be shown that the constructed partial order Boolean game does not have a Nash equilibrium sustaining a ¯ . In virtue of the contrapositive of Theorem 7, there is no backwards induction solution sustaining a ¯ , giving us the result. □
The computational complexity of backward induction actions reduces considerably when we restrict attention to total-order games. Still, the problem is PSPACE -complete for this class of games. To prove this result, we first introduce some auxiliary notation and terminology.
Recall that if the dependency graph ( V , E ) of a total-order game is such that v 0 E E v | V | , then τ = ( v 0 , , v | V | ) is the only topological sorting of the vertices. Moreover, the parents of each vertex v k are all vertices with a lower topological index. The set { v 1 , , v k 1 } of parents of vertex v k we will denote by Y k , on the understanding that Y k = Ø , if k = 0 . Similarly, we will denote by Z k = { v k + 1 , , v | V | } the set of vertices with a higher topological index than v k , here on the understanding that Z k = Ø , if k = | V | .
For a = ( a v 1 , , a v | V | ) an action profile and h = ( a v 1 , , a v k ) a τ -history of a , we now introduce the auxiliary concept of a being sustained by a backwards induction solution at history h of a. This enables us to reason recursively about action profiles being sustained by backwards induction solutions without having to make explicit reference to specific underlying strategy profiles. Formally, for action profile a =   ( a v 1 , , a v | V | ) a n d τ-history h =   ( a v 1 , , a v k ) of a we say that a is sustained (by backwards induction) at h, if either h = a, or h =   ( a v 1 , , a v k ) for some 0 < k < |V| and the following two conditions hold:
(a.i)
action profile a is sustained by backwards induction at history ( a v 1 , , a v k , a v k + 1 )
(a.ii)
for every b v k + 1 A v k + 1 , there is an action profile a = ( a v 1 , , a v k , b v k + 1 , b v k + 2 , b v | V | ) that is sustained by backwards induction at history ( a v 1 , , a v k , b v k + 1 ) and which is such that u i ( a ) u i ( a ) , where i is the player active at vk+1.
We now have the following lemma.
Lemma 7.
For total-order games, an action profile a = ( a v 1 , , a v | V | ) is sustained by a backwards induction solution if and only if a is sustained by backwards induction at all histories of a . Moreover, a is sustained by a backwards induction solution only if a is sustained by backwards induction at the empty history ( ) . .
Proof. 
As we are dealing with total-order games, first observe that, for each vertex v, the set X of vertices that are neither a parent of v nor have a greater topological index than v is empty. Thus, Condition (1) in the definition of a backwards induction solution f * for partial order games reduces to
f v k + 1 * ( a v 1 , , a v k ) arg max b v k + 1 A v k + 1 u i ( a v 1 , , a v k , b v k + 1 , f v k + 2 , , f v | V | ) ,
for all 0 k < | V | , and where i is the player active at v k + 1 .
As an auxiliary concept, we define a strategy profile f = ( f v 1 , , f v | V | )  to sustain action profile a = ( a v 1 , , a v | V | ) (by backwards induction) at history h = ( a v 1 , , a v k ) , if either a = h , or both
(f.i)
f sustains a at ( a v 1 , , a v k , a v k + 1 ) ,
(f.ii)
f v k + 1 ( a v 1 , , a v k ) = a v k + 1 , and
(f.iii)
for all b v k + 1 A v k + 1 with b v k + 1 a v k + 1 we have that f sustains action profile ( a v 1 , , a v k , b v k + 1 , f v k + 2 , , f v | V | ) at history ( a v 1 , , a v k , b v k + 1 ) and,
u i ( a v 1 , , a v k , a v k + 1 , a v k + 2 , a v | V | ) u i ( a v 1 , , a v k , b v k + 1 , f v k + 1 , , f v | V | ) ,
where i is the player active at v k + 1 .
The we split the proof up in three parts, where we prove, respectively, the following statements (I), (II), and (III), below. In conclusion then put them together again, to prove the lemma.
(I)
for all 0 k | V | and all action profiles a = ( a v 1 , , a v | V | ) , a is sustained at ( a v 1 , , a v k ) if and only if some strategy profile f = ( f v 1 , , f v | V | ) sustains a at ( a v 1 , , a v k ) .
(II)
strategy profile f = ( f v 1 , , f v | V | ) sustains action profile a = ( a v 1 , , a v | V | ) at history ( ) if and only if a f = a and for every 0 k | V | and all action profiles x = ( x v 1 , , x v k , f v k + 1 , , f v | V | ) , strategy profile f sustains x at ( x v 1 , , x v k ) .
(III)
for every 0 k | V | and every action profile x = ( x v 1 , , x v k , f v k + 1 , , f v | V | ) , strategy profile f sustains x at history h = ( x v 1 , , x v k ) if and only if strategy profile f = ( f v 1 , , f v | V | ) is a backwards induction solution.
Part (I) is by induction on | V | k . For the basis, where k = | V | , consider an arbitrary action profile a = ( a v 1 , , a v | V | ) and arbitrary history h = ( a v 1 , , a v k ) . In this case, we find that h = a and immediately that a is sustained at h . Moereover, every strategy profile sustains a at h in this case.
For the induction step, consider an arbitrary 1 k < | V | , and equally arbitrary action profile a = ( a v 1 , , a v | V | ) and history h = ( a v 1 , , a v k ) .
First, assume that a is sustained at ( a v 1 , , v k ) , and let i be the player active at v k + 1 . Then, a is also sustained at ( a v 1 , , v k + 1 ) . By the induction hypothesis, there is a strategy profile f that sustains a at ( a v 1 , , v k + 1 ) . Moreover, as k < | V | , for each b v k + 1 A v k with b v k + 1 a v k + 1 , there is an action profile a = ( a v 1 , , a v k 1 , b v k + 1 , b v k + 2 , b v | V | ) that is sustained at history ( a v 1 , , a v k 1 , b v k , b v k + 1 ) such that u i ( a ) u i ( a ) . Let A v k + 1 = { b v k + 1 1 , , b v k + 1 | A v k + 1 | } . By the induction hypothesis, for each 1 j | A v k + 1 | , there is a strategy profile g j = ( g v 1 j , , g v | V | j ) sustaining action profile ( a v 1 , , a v k , b v k + 1 j , g v k + 2 j , g v | V | j ) at history ( a v 1 , , a v k , b v k + 1 j ) , and which is such that
u i ( a ) u i ( a v 1 , , a v k , b v k + 1 j , g v k + 1 j , , g v | V | j ) .
Now, define the strategy profile f * = ( f v 1 * , , f v | V | * ) such that for 1 k | V | and every history x = ( x v 1 , , x m ) , where 0 m | V | ,
f v m * ( x ) = f v m j ( x ) if x v k + 1 = a v k + 1 j and m > k , g v m j ( x ) if x v k + 1 = b v k + 1 j and m > k , a v m otherwise .
It is now easy to check that, by construction, strategy profile f * sustains a at history ( a v 1 , , a v k ) , as desired. In particular, observe that f v k + 1 * ( a v 1 , , a v k ) = a v k + 1 .
For the opposite direction, assume that strategy profile f = ( f v 1 , , f v | V | ) sustains action profile a = ( a v 1 , , a v | V | ) at history ( a v 1 , , a v k ) , where k < | V | . Then, by definition, strategy profile f = ( f v 1 , , f v | V | ) also sustains action profile a = ( a v 1 , , a v | V | ) at history ( a v 1 , , a v k = 1 ) . By the induction hypothesis, we then immediately obtain that a is sustained at history ( a v 1 , , a v k + 1 ) . Now, consider an arbitrary b v k + 1 A v k + 1 with a v k + 1 b v k + 1 , along with history ( a v 1 , , a v k , b v k + 1 ) . Then, f sustains action profile ( a v 1 , , a v k , b v k + 1 , f v k + 2 , , f v | V | ) at history ( a v 1 , , a v k , b v k + 1 ) . We have, moreover, that u i ( a ) u i ( ( a v 1 , , a v k , b v k + 1 , f v k + 2 , , f v | V | ) ) . By the induction hypothesis, we find that action profile ( a v 1 , , a v k , b v k + 1 , f v k + 2 , , f v | V | ) is sustained at history ( a v 1 , , a v k , b v k + 1 ) , which proves the case.
For part (II), first assume that a f = a and that for every 0 k | V | , strategy profile f sustains action profile ( x v 1 , , x v k , f v k + 1 , , f v | V | ) at history ( x v 1 , , x v k ) . The latter then holds in particular for k = 0 , that is, for history ( ) . Accordingly, f sustains action profile ( f v 1 , , f v | V | ) = a f at history ( ) . Our initial assumption that a f then gives the result for this direction.
For the opposite direction, assume that strategy profile f = ( f v 1 , , f v | V | ) sustains action profile a = ( a v 1 , , a v | V | ) at history ( ) . First, we find that by repeatedly applying condition (f.ii) gives us a f = a . Now, consider an arbitrary 0 k | V | and action profile x = ( x v 1 , , x v k , f v k + 1 , , f v | V | ) . We prove by induction on k that f sustains ( x v 1 , , x v k , f v k + 1 , , f v | V | ) at history h = ( x v 1 , , x v k ) .
First assume that k = 0 , that is, h = ( ) . In that case, x = ( f v 1 , , f v | V | ) = a f . We have already seen that a f = a , and, thus, the base case immediately follows from the assumption that f = ( f v 1 , , f v | V | ) sustains action profile a = ( a v 1 , , a v | V | ) at history ( ) .
For the induction step, let k > 0 . In case k = | V | we have h = a and we are done immediately. Otherwise, by the induction hypothesis, we may assume that f sustains action profile ( x v 1 , , x v k 1 , f v k , , f v | V | ) at history ( x v 1 , , x v k 1 ) . If f v k ( x v 1 , , x v k 1 ) = x k , then it follows from condition (f.i) that f sustains ( x v 1 , , x v k , f v k + 1 , , f v | V | ) at ( x v 1 , , x v k ) . If f v k ( x v 1 , , x v k 1 ) x k , then still f v k ( x v 1 , , x v k 1 ) A v k . Now, it follows from condition (f.iii) that f sustains ( x v 1 , , x v k , f v k + 1 , , f v | V | ) at ( x v 1 , , x v k ) .
For part (III), first assume, for every 0 k | V | and action profile x = ( x v 1 , , x v k , f v k + 1 , , f v | V | ) , that strategy profile f = ( f v 1 , , f v | V | ) sustains x at history h = ( x v 1 , , x v k ) . Now, consider an arbitrary history ( z 1 , , z k ) for some 1 k < | V | . Then, by assumption, f sustains action profile ( z v 1 , , z v k , f v k + 1 , , f v | V | ) at history ( z v 1 , , z v k ) . As k < | V | , we find that ( z 1 , , z k ) ( z 1 , , z k , f v k + 1 , , f v | V | ) . Therefore, by condition (f.iii), we then find that
f v k + 1 ( z v 1 , , z v k ) arg max b v k + 1 A v k + 1 u i ( z v 1 , , z v k , b v k + 1 , f v k + 2 , , f v | V | ) ,
where i is the player active at v k + 1 . It thus follows that f is a backward induction solution.
For the opposite direction, assume that f = ( f v 1 , , f v | V | ) is a backwards induction solution. Now, consider an arbitrary 0 k | V | . We prove by induction on | V | k , that, for every action profile x = ( x v 1 , , x v k , f v k + 1 , , f v | V | ) , strategy profile f sustains x at history h = ( x v 1 , , x v k ) .
For the basis assume let k = | V | . Consider an arbitrary action profile x = ( x v 1 , , x v k ) along with history h = ( x v 1 , , x v k ) . Obviously, h = x , and, thus, immediately that f sustains x at history h.
For the induction step, let k < | V | . Again, consider an arbitrary action profile h = ( x v 1 , , x v k , f v k + 1 , , f v | V | ) along with history h = ( x v 1 x , x v k ) . By the induction hypothesis, we find, for all action profiles y = ( y v 1 , , y v k + 1 , f v k + 1 , , f v | V | ) , that f sustains y at history ( y v 1 , , y v k + 1 ) . This holds in particular for histories ( x v 1 , , x v k , b v k + 1 ) where b v k + 1 A v k + 1 . For b v k + 1 = f v k + 1 ( x v 1 , , f v k ) , we obtain condition (f.i). Condition (f.ii) is trivially satisfied. Finally, having assumed that f is a backwards induction solution, we find that,
u i ( x v 1 , , x v k , f v k + 1 , f v k + 2 , , f v | V | ) u i ( x v 1 , , x v k , b v k + 1 , f v k + 2 , , f v | V | ) ,
for all b v k + 1 V v k + 1 with b v k + 1 f v k + 1 ( x v 1 , , f v k ) and i the player active at v k + 1 . We may therefore conclude that f sustains x = ( x v 1 , , x v k , f v k + 1 , , f v | V | ) at ( x v 1 , , x v k ) , proving part (III).
With (I) through (III) in place, we conclude the proof as follows. Consider an arbitrary action profile a = ( a v 1 , , a v | V | ) . First, assume, for all 0 k | V | , that a is sustained by backwards induction at ( a v 1 , , a v k ) . Then by (I), for all 0 k | V | , there is some strategy profile f = ( f v 1 , , f v | V | ) that sustains a by backwards induction at ( a v 1 , , a v k ) . For the particular case wherein k = 0 , we find that this strategy profile f that sustains a by backwards induction at ( ) . By repeated application of (f.ii), moreover, it can easily be established that a f = a . Part (II) then yields that for every 0 k | V | and all action profiles x = ( x v 1 , , x v k , f v k + 1 , , f v | V | ) , strategy profile f sustains a at history ( x v 1 , , x v k ) . Finally, by virtue of (III), we may conclude that f is a backwards induction solution that sustains a .
For the opposite direction, assume that f is a backwards induction solution sustaining a . Therefore, a f = a , and, accordingly, ( a v 1 , , a v k , f v k + 1 , , f v | V | ) = ( a v 1 , , a v | V | ) for all 0 k | V | . Part (III) yields furthermore that for every 0 k | V | , strategy profile f sustains action profile a = ( a v 1 , , a v k , f v k + 1 , , f v | V | ) at history ( a v 1 , , a v k ) . It now follows, for every 0 k | V | , that f sustains a at history ( a v 1 , , a v k ) . With part (I), we may now conclude that, for all 0 k | V | , action profile a is sustained by backwards induction at ( a v 1 , , a v k ) , as desired.
Finally, observe that the second part of the lemma also follows from the above argument. □
The intuition behind Lemma is that, for a strategy profile f sustaining an action profile a = ( a v 1 , , a v | V | ) to be a backward induction solution, it does not suffice just to induce a , in the sense that a f = a . It should, in addition, incorporate a threat strategy at every vertex v k that succeeds in deterring the player active at v k to play any action other than a v k , that is, the action prescribed by a . The lemma, moreover, shows that a strategy profile that achieve this can be constructed from the strategy profiles that achieve this at each history.
We introduce the following auxiliary decision problem, solving which is an important subroutine in our proof of PSPACE -membership of backwards induction actions for total-order games.
backward induction guarantee
Given:Total-order game G, action profiles  a = ( a v 1 , , a v | V | ) and history  h = ( a v 1 , , a v k ) , player i, and  r R .
Problem:Is  a sustained by backwards induction at  h such that u i ( a ) r
We find that backward induction guarantee is in PSPACE for total-order games.
Lemma 8.
For total-order games,backward induction guaranteeis in PSPACE , that is, the problem can be solved in space polynomial in the size of the game.
Proof. 
Let a total-order game be given together with an action profile a = ( a v 1 , , a v | V | ) . We prove by induction on | V | k , that for all histories h = ( a v 1 , , a v k ) of a , all players i, and all values r R the problem backward induction guarantee can be solved in polynomial space.
For the basis, assume k = | V | . Then, h = a , and consider an arbitrary player i and arbitrary value r R . In this case, it suffices to compute u i ( a ) and check whether u i ( a ) r . By Lemma 2, this achieved in polynomial time, and hence in polynomial space as well.
For the induction step, let k < | V | . Consider an arbitrary player j and an arbitrary value r R . Let, furthermore, i be the player active at a v k . Now, we first check whether a is sustained by backward induction at history h v k + 1 = ( a v 1 , , a v k + 1 ) . By virtue of the induction hypothesis, this can be achieved in polynomial space by computing backward induction guarantee for a , history h v k + 1 , player i, and value r = u i ( a ) . If this check is negative, we can conclude immediately and output “no”.
Otherwise, we deploy two binary counters with, respectively, log 2 | A ( x ) | and log 2 | H | = O ( | V | log 2 | A | ) digits to run through all actions b v k + 1 in A v k and all action profiles
( a v 1 , , a v k , b v k + 1 , b k + 1 , b v | V | ) ,
respectively. We then subsequently check for each b v k + 1 in A v k + 1 whether some action profile b = ( a v 1 , , a v k , b v k + 1 , b k + 1 , b v | V | ) is sustained by backwards induction at ( a v 1 , , a v k , b v k + 1 ) with u i ( b ) u i ( a ) . Then, a is sustained by backwards induction a history h = ( a v 1 , , a v k + 1 ) if and only if for each b v k + 1 in A v k + 1 at least one of these checks is positive. Moreover, as some reflection reveals, all of these checks are independent and, by the induction hypothesis, we may assume that all of them can be achieved in polynomial space, we may conclude that the subsequent execution of these checks can be performed in polynomial space as well, giving us the result. □
With Lemma 8 in place, we are now in a position to now demonstrate the last main result of this section.
Theorem 13.
For total-order games,backwards induction actionsis PSPACE -complete.
Proof. 
For membership in PSPACE , let a total order game with v 1 E E v | V | and an action profile a = ( a v 1 , a v | V | ) be given. Observe that the topological order τ = ( v 1 , , v | V | ) is fixed by E. By virtue of Lemma, it suffices to check whether a is sustained by backwards induction at the empty history ( ) for value k = u i ( a ) , where i is the player active at v 1 . Lemma 8 ensures that this can be achieved in space polynomial in the input.
For PSPACE -hardness, we reduce from qbf, which is known to be PSPACE -hard. Let Φ = Q 1 x 1 , , Q | X | x | X | φ ( x 1 , , x k ) be a fully quantified propositional formula over X = { x 1 , , x | X | } , where each Q i is either ∃ or ∀. We let X and X denote the sets of universally quantified variables and existentially quantified variables in Φ , respectively. Furthermore, for each variable x i with 1 i | X | , we let B i denote the set of Boolean functions on the variables x 1 , , x i 1 . With each profile β = ( β 1 , , β | X | ) in B 1 × × B | X | , we associate an a valuation α β : X { 1 , 0 } such that
α β ( x 1 ) = β 1 α β ( x i + 1 ) = β i + 1 ( α β ( x 1 ) , , α β ( x i 1 ) ) .
We construct a total-order Boolean game G Φ with two players: player 1 and player 2. The propositional variables our game is based on are given by x 1 , , x | X | together with an additional ‘fresh’ variable x 0 . The players’ preferences are captured by
γ 1 = x 0 ¬ φ γ 2 = x 0 φ .
We assign control over variable x i to player 1, if Q i = , and to player 2, if Q i = . Moreover, the auxiliary variable x 0 is assigned to the control of player 1. The dependency graph ( { x 0 , , x k } , E ) is defined such that x i E x j if and only if i < j . The game G Φ is a total-order game and, thus, fit for backwards induction. Backwards induction solutions are therefore bound to exist. We now demonstrate that Φ = Q 1 x 1 , , Q | X | x | X | φ evaluates to true if and only if the action profile x ¯ = ( x ¯ 0 , x ¯ 1 , , x ¯ | X | ) is sustained by a backwards induction solution in G Φ .
First assume for the “only if”-direction, that Φ holds. Then, there is a profile of Boolean functions β i for the variables in X , such that for all profiles Boolean functions β j for the variables x j X , such that the assignment α β 1 , , β | V | : X { 0 , 1 } satisfies φ . On this basis we define a strategy profile f * = ( f 1 * , f 2 * ) , where f 1 * and f 2 * are given by the following choice equations for each x i X { x 0 } (controlled by player 1) and each x j X (controlled by player 2):
x i : x j : x 0 φ β j ( x 1 , , x j 1 ) .
Here, φ β j denotes the Boolean formula representing Boolean function β j . Note, furthermore, that a f * = x ¯ .
First observe that for all action profiles a = ( a x 0 , , a x | X | ) with a x 0 = x ¯ 0 , the valuation α a will satisfy neither γ 1 nor γ 2 . Some reflection, thus reveals that f * sustains x ¯ at all histories h = ( a x 0 , , a x k ) with a v x 0 = x ¯ 0 , particular at history a v x 0 . Now observe that f 2 incorporates a winning strategy for player 2 in the “subgame” that results if player 1 sets x 0 to true. Therefore, φ will be satisfied by the valuations induced by the Nash equilibria in which x 0 is set to true—if any such Nash equilibria exist. As, by Theorem 7, every backwards induction solution is a Nash equilibrium, it follows that every action profile a = ( a x 0 , , a x | X | ) with a x 0 = x 0 that is sustained by backwards induction will render φ true, and, hence, player 1’s goal γ 1 false. Now let f = ( f 1 , f 2 ) be a strategy profile that sustains some action profile a = ( a x 0 , , a x | X | ) with a x 0 = x 0 by backwards induction at history h = x 0 . Observe that we may assume the existence of such a strategy profile. At this juncture, consider the strategy profile f * * = ( f x 0 * * , , f x | X | * * ) defined such that for all histories h = ( a x 0 , , a x k 1 ) with 0 k < | X | and i { 1 , 2 } ,
f i * * ( h ) = f 1 * ( h ) if a x 0 = x ¯ 0 , f 2 ( h ) otherwise .
Using the auxiliary formal apparatus introduced in the proof of Lemma 7, we find that f * * sustains action profile x ¯ at x ¯ 0 and action profile ( x 0 , f x 1 * * , , f | V | * * ) at x 0 . Moreover, u 1 ( x ¯ ) u 1 ( x 0 , f x 1 * * , , f x | X | * * ) . As player 1 controls x 0 , we may therefore conclude that f * * sustains x ¯ at ( ) . By Lemma, we may now conclude that x ¯ is sustained by a backwards induction solution, as desired.
For the “if”-direction, assume that Φ does not hold. In this case, it can be seen that player 2 does not have a winning strategy in the “subgame” that results if x 0 is set to true. Now, consider an arbitrary strategy profile f ¯ = ( f ¯ 1 , f ¯ 2 ) that sustains x ¯ . Then, player 1’s goal is not satisfied if f ¯ is played. Moreover, as player 2 does not have a winning strategy in the “subgame” after x 0 is set to true, player 1 has some strategy f i * with which they set x 0 to true and such that ( f 1 * , f ¯ 2 ) guarantees γ 1 to be satisfied. Accordingly, player 1 will want to deviate to f 1 * and strategy profile f ¯ is not a Nash equilibrium. By Theorem 7, strategy profile f ¯ is not a backwards induction solution either. Having chosen f ¯ arbitrarily, we get the result. □
With this result in place, we can now present the proof of Theorem 6, which stated PSPACE -completeness of is nash actions for total-order games. We show that, given an action profile  a = ( a v 1 , , a v | V | ) , a given total-order game G can be transformed into a total-order game  G a in which  a is sustained by a backwards induction solution if and only if  a is sustained by a Nash equilibrium in the original game G. We then leverage Theorem 13 to prove PSPACE -membership of is nash actions.
Formally, let G be a partial order game defined on dependency graph  ( V , E ) , and let  a = ( a v 1 , , a v | V | ) be an action profile. Then, we define G a as exactly the same game as G, be it with the following binary utility functions u i a : A { 0 , 1 } . For a given action profile  b = ( b v 1 , , b v | V | ) , we say that player j is the first to deviate from  a (to  b ) if, for some 1 k | V | , player j is active at  v k , b v m = a v m for all m < k , and b v k a v k . For each player i, we set u i a ( a ) = 1 and, for every action profile  b a where player j is the first player deviate from  a to  b , we define
u i a ( b ) = 0 if i = j and u j ( b ) u j ( a ) , 2 if i = j and u j ( b ) > u j ( a ) , 2 if i j and u j ( b ) u j ( a ) , 0 if i j and u j ( b ) > u j ( a ) .
We now have the following two lemmas, the first of which shows that the game  G a preserves the Nash equilibria of G.
Lemma 9.
Let a = ( a v 1 , , a v | V | ) be an action profile of a total-order game G. Then, a is sustained by a Nash equilibrium in G if and only if a is sustained by a Nash equilibrium in G a .
Proof. 
First assume that f = ( f v 1 , , f v | V | ) be a Nash equilibrium sustaining a in G. Therefore, a f = a . For contradiction, also assume that f does not do so in G a . Then, there is some player i and some strategy g i , such that u i a ( a ) < u i a ( a ( f i , g i ) ) . Obviously, player i is the first to deviate from a to a ( f i , g i ) . Thus, u i a ( a ) = 1 and u i a ( a ( f i , g i ) ) = 2 , and, by definition of u a , we find that u i ( a f ) < u i ( a ( f i , g i ) ) . It therefore follows that f is not a Nash equilibrium of G, a contradiction.
Now assume that f is a Nash equilibrium sustaining a in G a . Therefore, a f = a . It suffices to show that f is also a Nash equilibrium in G. To this end, consider an arbitrary player i and strategy g i for i. If a ( f i , g i ) = a f , we are done immediately. Otherwise, i is the first player to deviate from a f to a ( f i , g i ) . Because f is a Nash equilibrium in G a , we find that u i a ( a f ) u i a ( a ( f i , g i ) ) . As a f = a , we have u i a ( a f ) = 1 . Observe that it now follows that u i a ( a ( f i , g i ) ) = 0 . Accordingly, u i ( a ( f i , g i ) ) u i ( a ) , and thus u i ( a ( f i , g i ) ) u i ( a f ) . We may conclude that f is a Nash equilibrium in G, as desired. □
For total-order game  G a , we can furthermore show that if there is a Nash equilibrium that sustains action profile  a , then there is also a backwards induction that does the same. The converse of this statement is immediate, as every backwards induction solution is also a Nash equilibrium (Theorem 7).
Lemma 10.
Let a be an action profile of a total-order game G. Then, a is sustained by a Nash equilibrium in G a if and only if a is sustained by a backwards induction solution in G a .
Proof. 
By virtue of Theorem 7, the “if”-direction of the lemma is immediate. For the opposite direction, assume that a is sustained by a Nash equilibrium f * . For contradiction, also assume that a is not sustained by any backwards induction solution. As we are dealing with total-form games, we may assume the existence of backwards induction solutions. These we may also assume to be Nash equilibria in virtue of Theorem 7. Now, let g = ( g v 1 , , g v | V | ) be any backwards induction solution and let a g = b = ( b v 1 , , b v | V | ) . Furthermore, let 1 k | V | be the greatest index such that g v k ( a v 1 , , a v k 1 ) f v k ( a v 1 , , a v k 1 ) and assume i to be the player active at v k . We may also assume that f is the backwards induction solution for which there are the fewest histories h = ( x v 1 , , x v ) such that some player j i is active at v and f v ( g v ( h ) . Furthermore, observe that, by choice of k, we have that a = ( a v 1 , , a v k , g v k + 1 , , g v | V | ) . Having assumed that g is a backwards induction solution, we find moreover that
1 = u i ( a ) = u i ( a f ) < u i ( a g ) = u i ( b ) = 2 .
Now, observe that a g a ( f i , g i ) ; otherwise, f would not be a Nash equilibrium. Accordingly, there must be some player j i and some vertex v k with k < k | V | and at which player j is active such that
f v k + 1 ( a v 1 , , a v k 1 , b v k , , b k 1 ) g v k + 1 ( a v 1 , , a v k 1 , b v k , , b k 1 ) .
Having assumed that g is a backwards induction solution, we find that
u j ( a v 1 , , a v k 1 , b v k , , b v k 1 , a v k , g v k + 1 , , g v | V | ) u j ( a v 1 , , a v k 1 , b v k , , b v k 1 , b v k , g v k + 1 , , g v | V | ) = u j ( b ) .
where a v k = f v k ( a v 1 , , a v k 1 , b v k , , b v k 1 ) . Because a v k b v k and u i ( b ) = 2 , we have u j ( b ) = 0 . It now follows that u j ( a v 1 , , a v k 1 , b v k , , b v k 1 , a v k , g v k + 1 , , g v | V | ) = 0 as well. Accordingly,
u j ( a v 1 , , a v k 1 , b v k , , b v k 1 , a v k , g v k + 1 , , g v | V | ) u j ( a v 1 , , a v k 1 , b v k , , b v k 1 , b v k , g v k + 1 , , g v | V | ) .
At this juncture, consider the strategy profile g = ( g v 1 , , g v | V | ) defined such that for all vertices v and all histories h = ( x v 1 , , x v 1 ) ,
g v ( h ) = a v k if = k and h = ( a v 1 , , a v k 1 , b v k , , b v k 1 ) , g v ( h ) otherwise .
The key observation to make at this point is that g is a backwards induction solution just as well as g is. As a v k = f v k ( a v 1 , , a v k 1 , b v k , , b v k 1 ) , however, we find that for g the number of histories h = ( x v 1 , , x v ) such that there some player j i that is active at v and f v ( h ) g v ( h ) is one fewer than the same number of such histories for g . This contradicts our minimality assumption regarding the latter. This concludes the proof. □
We are now finally in a position to prove Theorem 6, and establish that is nash actions is PSPACE -complete for total-order games.
Proof of Theorem 6.
For PSPACE -membership, first observe that, given total-order game G and action profile a = ( a v 1 , , a v | V | ) , we can construct G a in polynomial time. To see this, note for each player i, we have to transform the circuit C u i computing i’s utility function u i in G to a circuit C u i a . This can be achieved by adding a polynomial number of gates to each C u i . We then check whether a is sustained by a backwards induction solution in G a . Theorem 13 guarantees that this can be achieved in polynomial space. Lemma 9 then yields the result.
To prove PSPACE -hardness, we adapt the proof of Theorem 13 and reduce from q b f . Given an instance Φ = Q 1 x 1 , , Q | X | x | X | φ ( x 1 , , x k ) of qbf, we construct the game G Φ as in Theorem 13. We demonstrate that Φ = Q 1 x 1 , , Q | X | x | X | φ evaluates to true if and only if the action profile x ¯ = ( x ¯ 0 , x ¯ 1 , , x ¯ | X | ) is sustained by a Nash equilibrium.
First assume that Φ holds. Then, there is a profile of Boolean functions β i for the variables in X , such that for all profiles of Boolean functions β j for the variables x j X , such that the assignment α β 1 , , β | V | : X { 0 , 1 } satisfies φ . As in the proof of Theorem 13, we consider profile f * = ( f 1 * , f 2 * ) , where f 1 * and f 2 * are given by the following choice equations for each x i X { x 0 } (controlled by player 1) and each x j X (controlled by player 2):
x i : x j : x 0 φ β j ( x 1 , , x j 1 ) .
Note that a f * = x ¯ , and so it suffices to show that f * is a Nash equilibrium. Recall that f 2 incorporates a winning strategy for player 2 in the “subgame” that results if player 1 sets x 0 to true. It follows that for each of player 1’s strategies g 1 , the assignment α a ( g 1 , f 2 * ) associated with action profile a ( g 1 , f 2 * ) either satisfies φ or does not satisfy x 0 . In either case, player 1’s goal is not satisfied, and hence does not want to deviate from f * . Player 2 does not want to deviate either, as, with player 1 playing f 1 * , variable x 0 will be set to false and player 2’s goal γ 2 will not be satisfied no matter which strategy player 2 chooses. We may conclude that f * is a Nash equilibrium, as desired.
Finally, assume that Φ does not hold. In that case, player 2 does not have a winning strategy in the “subgame” that results if x 0 is set to true. Now consider an arbitrary strategy profile f ¯ = ( f ¯ 1 , f ¯ 2 ) that sustains x ¯ , and observe that f ¯ does not lead to player 1’s goal being satisfied. As player 2 does not have a winning strategy when x 0 is set to true, there must be a strategy f 1 * for player 1 such that ( f 1 * , f ¯ 2 ) leads to player 1’s goal γ 1 becoming true. Accordingly, player 1 would like to deviate to f 1 * , and f ¯ is not a Nash equilibrium. We may therefore conclude that x ¯ is not sustained by a Nash equilibrium, as desired. □

8. Conclusions

Game theorists have developed a huge range of game models since the field was initiated nearly a century ago. Of these, the most important and prominent are normal form games and extensive form games. These models differ in the assumptions they make about the information available to players and their strategies. In this article, we have introduced a game model in which informational dependencies between decision nodes in a game are captured in what we call a dependence relation. This dependence relation explicitly states what information is available when making a choice at a decision node. Although our partial order games are strictly no more expressive than extensive form games of imperfect information, they can be exponentially more compact, and we believe they are much more transparent with respect to the information flow in a game.
The compact representation of non-cooperative settings as partial order games naturally evokes various questions concerning the complexity of computing solution concepts in this model. We have focussed on Nash equilibrium and backwards induction. We explored six decision problems surrounding these two solution concepts—non-emptiness, is nash, is nash actions, fit for backwards induction, is backwards induction, and is backwards induction actions—for partial-games as well as for the two extremal subclasses of empty-order games and total-order games. Our results are summarised in Table 1 and Table 2. For each of these problems, we find that computational costs are highest for general partial order games, rather than for the two extremal classes. This suggests that the structure of the dependence relation is a major source of complexity.
Furthermore, note the contrast in complexity between is nash and is backwards induction, on the one hand, and is nash actions and is backwards induction actions on the other. Other than in normal-form games, and arguably similar to extensive-form games, partial order games evince a clear distinction between strategy profiles and action profiles, with the former being much (exponentially) ‘larger’ objects than the latter, and of which there can also be exponentially more. Thus, is nash and is nash actions are similar problems, which basically only differ in that the former has a strategy profile as part of its input and the latter an action profile. We propose that the difference in size and number between strategy profiles and action profiles accounts for much of the contrast in computational cost between is nash and is nash actions. A similar remark applies to fit for backwards induction and is backwards induction actions.
For future work, it would be useful to develop software tools to support reasoning in partial order games, and from a theoretical perspective, it would also be interesting to consider restrictions on dependence graph structures that might lead to more tractable decision problems. Finally, of course, it would be valuable to look in more detail at applications of our model.

Author Contributions

Conceptualization, J.G., P.H., M.W., and V.Z.; methodology, J.G., P.H., M.W., and V.Z.; formal analysis, P.H., M.W., and V.Z.; investigation, P.H. and V.Z.; writing—original draft preparation, P.H. and V.Z.; writing—review and editing, all authors; supervision, M.W.; project administration, M.W.; funding acquisition, M.W. All authors have read and agreed to the published version of the manuscript.

Funding

Wooldridge and Harrenstein were supported by the UKRI under a Turing AI World Leading Researcher Fellowship (EP/W002949/1) awarded to Wooldridge.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors would like to thank James Fox, Lewis Hammond, and Thomas Steeples for discussions on this work, in particular on the relationship between partial order games and MAIDs.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Bradfield, J.; Gutierrez, J.; Wooldridge, M. Partial-order Boolean games: Informational independence in a logic-based model of strategic interaction. Synthese 2016, 193, 781–811. [Google Scholar] [CrossRef] [Green Version]
  2. Koller, D.; Milch, B. Multi-agent influence diagrams for representing and solving games. In Proceedings of the 17th International Joint Conference on Artificial Intelligence (IJCAI), Seattle, WA, USA, 4–10 August 2001; pp. 1027–1034. [Google Scholar]
  3. Koller, D.; Milch, B. Multi-agent influence diagrams for representing and solving games. Games Econ. Behav. 2003, 45, 181–221. [Google Scholar] [CrossRef]
  4. Maschler, M.; Solan, E.; Zamir, S. Game Theory; Cambridge University Press: Cambridge, UK, 2013. [Google Scholar]
  5. Osborne, M.J.; Rubinstein, A. A Course in Game Theory; MIT Press: Cambridge, MA, USA, 1994. [Google Scholar]
  6. Vollmer, H. Introduction to Circuit Complexity: A Uniform Approach; Texts in Theoretical Computer Science; Springer: Berlin/Heidelberg, Germany, 1999. [Google Scholar]
  7. Jukna, S. Boolean Function Complexity: Advances and Frontiers; Algorithms and Combinatorics; Springer: Berlin/Heidelberg, Germany, 2012; Volume 27. [Google Scholar]
  8. Greenlaw, R.; Hoover, H.J.; Ruzzo, W.L. Limits to Parallel Computation: P-Completeness Theory; Oxford University Press: Oxford, UK, 1995. [Google Scholar]
  9. Muller, D.E. Complexity in electronic switching circuits. IRE Trans. Electron. Comput. 1956, 1, 15–19. [Google Scholar] [CrossRef]
  10. Lupanov, O.B. A method of circuit synthesis. Izvesitya VUZ Radiofiz 1958, 1, 120–140. (In Russian) [Google Scholar]
  11. Harrenstein, P.; van der Hoek, W.; Meyer, J.J.; Witteveen, C. Boolean Games. In Proceeding of the Eighth Conference on Theoretical Aspects of Rationality and Knowledge (TARK VIII); Certosa Di Pontignano, Italy, 8–10 July 2001, van Benthem, J., Ed.; Morgan Kaufmann: Burlington, MA, USA, 2001; pp. 287–298. [Google Scholar]
  12. Harrenstein, P. Logic in Conflict. Ph.D. Thesis, Utrecht University, Utrecht, The Netherlands, 2004. [Google Scholar]
  13. Bonzon, E.; Lagasquie, M.; Lang, J.; Zanuttini, B. Boolean games revisited. In Proceedings of the ECAI 2006, Riva del Garda, Italy, 28 August–1 September 2006; Volume 141. Available online: https://www.researchgate.net/publication/48445144_Boolean_Games_Revisited (accessed on 17 December 2021).
  14. Bonzon, E.; Lagasquie-Schiex, M.C.; Lang, J.; Zanuttini, B. Compact preference representation and Boolean games. Auton. Agents Multi-Agent Syst. 2009, 18, 1. [Google Scholar] [CrossRef] [Green Version]
  15. Wooldridge, M.; Endriss, U.; Kraus, S.; Lang, J. Incentive engineering for Boolean games. Artif. Intell. 2013, 195, 418–439. [Google Scholar] [CrossRef]
  16. Hammond, L.; Fox, J.; Everitt, T.; Abate, A.; Wooldridge, M. Equilibrium Refinements for Multi-Agent Influence Diagrams: Theory and Practice. In Proceedings of the 20th International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS’21), Online, 3–7 May 2021; Endriss, U., Nowé, A., Dignum, F., Lomuscio, A., Eds.; Available online: https://arxiv.org/pdf/2102.05008.pdf (accessed on 17 December 2021).
  17. Pearl, J. Probabilistic Reasoning in Intelligent Systems; Morgan Kaufmann: Burlington, MA, USA, 1988. [Google Scholar]
  18. Howard, R.A.; Matheson, J.E. Influence Diagrams. In Readings on the Principles and Applications of Decision Analysis; Strategic Decisions Group: Menlo Park, CA, USA, 1994; Volume 2, pp. 721–762. [Google Scholar]
  19. Kjaerulff, U.B.; Madsen, A.L. Bayesian Networks and Influence Diagrams; Springer Science+ Business Media 200: Berlin/Heidelberg, Germany, 2008. [Google Scholar]
  20. Nielsen, M.; Plotkin, G.D.; Winskel, G. Petri Nets, Event Structures and Domains, Part I. Theor. Comput. Sci. 1981, 13, 85–108. [Google Scholar] [CrossRef] [Green Version]
  21. Sassone, V.; Nielsen, M.; Winskel, G. A Classification of Models for Concurrency. In Proceedings of the CONCUR ’93, 4th International Conference on Concurrency Theory, Hildesheim, Germany, 23–26 August 1993; Best, E., Ed.; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 1993; Volume 715, pp. 82–96. [Google Scholar]
  22. Clairambault, P.; Gutierrez, J.; Winskel, G. The Winning Ways of Concurrent Games. In Proceedings of the IEEE Symposium on Logic in Computer Science, Dubrovnik, Croatia, 25–28 June 2012; pp. 235–244. [Google Scholar] [CrossRef] [Green Version]
  23. Gutierrez, J.; Wooldridge, M. Equilibria of concurrent games on event structures. In Proceedings of the CSL-LICS, Vienna, Austria, 14–18 July; ACM: New York, NY, USA, 2014; pp. 46:1–46:10. [Google Scholar]
  24. Alcolei, A.; Clairambault, P.; Hyland, M.; Winskel, G. The True Concurrency of Herbrand’s Theorem. In Proceedings of the CSL, Birmingham, UK, 4–8 September 2018; Ghica, D.R., Jung, A., Eds.; Schloss Dagstuhl: Wadern, Germany, 2018; Volume 119, pp. 5:1–5:22. [Google Scholar]
  25. Castellan, S.; Clairambault, P.; Winskel, G. Thin Games with Symmetry and Concurrent Hyland-Ong Games. Log. Methods Comput. Sci. 2019, 15, 18:1–18:85. [Google Scholar] [CrossRef]
  26. Thompson, F.B. Equivalence of Games in Extensive Form; Research Memorandum RM-759; Rand Corporation: Santa Monica, CA, USA, 1952. [Google Scholar]
  27. Hearn, R.A.; Demaine, E.D. Games, Puzzles and Computation; A.K. Peters: Natick, MA, USA, 2009. [Google Scholar]
  28. Fudenberg, D.; Tirole, J. Game Theory; The MIT Press: Cambridge, MA, USA, 1991. [Google Scholar]
Figure 1. Extensive-form game of perfect information. The players are depicted besides the nodes at which they are active. Vertices are identified with histories, for instance, the root. v 0 with (   ) vertex v 3 with a 1 c 1 and the leaf node/terminal history labelled Games 13 00002 i001 with a 0 b 00 omitting parentheses and commas for better readability.
Figure 1. Extensive-form game of perfect information. The players are depicted besides the nodes at which they are active. Vertices are identified with histories, for instance, the root. v 0 with (   ) vertex v 3 with a 1 c 1 and the leaf node/terminal history labelled Games 13 00002 i001 with a 0 b 00 omitting parentheses and commas for better readability.
Games 13 00002 g001
Figure 2. The strategic form game associated with the extensive-form game in Figure 1.
Figure 2. The strategic form game associated with the extensive-form game in Figure 1.
Games 13 00002 g002
Figure 3. A simple example of a partial order game. On the left, the DAG, where each vertex v is labelled i : x , y , where i is the player active at the respective vertex and x and y are the actions available to i at v. On the right, the representation of the utility function which associates a numeric value with each action profile.
Figure 3. A simple example of a partial order game. On the left, the DAG, where each vertex v is labelled i : x , y , where i is the player active at the respective vertex and x and y are the actions available to i at v. On the right, the representation of the utility function which associates a numeric value with each action profile.
Games 13 00002 g003
Figure 4. The Boolean circuit C f v 3 14 representing strategy f v 3 14 : A v 1 × A v 2 A v 3 of Example 1. The value of x 0 represents the choice for a or b at v 1 , the value of x 1 the choice for c or d at v 2 , and the value of y 0 the choice for e or f at v 3 . The local indices of a, c, and e are given by 0, and those of b, d, and f by 1. The instantiation on the right thus represents that f v 3 14 ( a d ) = f .
Figure 4. The Boolean circuit C f v 3 14 representing strategy f v 3 14 : A v 1 × A v 2 A v 3 of Example 1. The value of x 0 represents the choice for a or b at v 1 , the value of x 1 the choice for c or d at v 2 , and the value of y 0 the choice for e or f at v 3 . The local indices of a, c, and e are given by 0, and those of b, d, and f by 1. The instantiation on the right thus represents that f v 3 14 ( a d ) = f .
Games 13 00002 g004
Figure 5. The Boolean circuit on the left represents the utility function u i on the right for a player i over the action profiles of the game in Example 1. The binary representations of action profiles and numerical values are depicted in parentheses.
Figure 5. The Boolean circuit on the left represents the utility function u i on the right for a player i over the action profiles of the game in Example 1. The binary representations of action profiles and numerical values are depicted in parentheses.
Games 13 00002 g005
Figure 6. Extensive-form game of imperfect information representing the partial order game in Figure 3 assuming topological order ( v 1 , v 2 , v 3 , v 4 ) . Here, for instance, v 1 1 = ( ) , v 3 2 = a d , and v 4 4 = a d f . The dashed lines connecting vertices indicate the information sets.
Figure 6. Extensive-form game of imperfect information representing the partial order game in Figure 3 assuming topological order ( v 1 , v 2 , v 3 , v 4 ) . Here, for instance, v 1 1 = ( ) , v 3 2 = a d , and v 4 4 = a d f . The dashed lines connecting vertices indicate the information sets.
Games 13 00002 g006
Figure 7. Extensive-form game of perfect information illustrating Proposition 4. for every 0 n 31 , we have n denote the vector ( n , n , n , n , n ) T .
Figure 7. Extensive-form game of perfect information illustrating Proposition 4. for every 0 n 31 , we have n denote the vector ( n , n , n , n , n ) T .
Games 13 00002 g007
Figure 8. The DAG for the partial order game with five players illustrating Proposition 4.
Figure 8. The DAG for the partial order game with five players illustrating Proposition 4.
Games 13 00002 g008
Figure 9. Extensive-form game of imperfect information representing the partial order game in Figure 5 assuming topological ordering ( v 1 , v 2 , v 4 , v 3 ) . The dashed lines connecting vertices indicate the information sets.
Figure 9. Extensive-form game of imperfect information representing the partial order game in Figure 5 assuming topological ordering ( v 1 , v 2 , v 4 , v 3 ) . The dashed lines connecting vertices indicate the information sets.
Games 13 00002 g009
Figure 10. The partial order game from Example 1 with a slightly different utility function.
Figure 10. The partial order game from Example 1 with a slightly different utility function.
Games 13 00002 g010
Figure 11. Another example of a partial order game.
Figure 11. Another example of a partial order game.
Games 13 00002 g011
Figure 12. Construction of the DAG for the coNP-hardness proof for fit for backwards induction in the case of the formula φ being defined over five variables p1 through p5.
Figure 12. Construction of the DAG for the coNP-hardness proof for fit for backwards induction in the case of the formula φ being defined over five variables p1 through p5.
Games 13 00002 g012
Table 1. Overview of complexity results surrounding Nash equilibrium in partial order games.
Table 1. Overview of complexity results surrounding Nash equilibrium in partial order games.
NON-EMPTINESSIS-NASHIS-NASH ACTIONS
empty-order games Σ 2 p -compl. coNP -compl. coNP -compl.
partial order games NEXPTIME -compl. coNP -compl. NEXPTIME -compl.
total-order gamesconstant time coNP -compl. PSPACE -compl.
Table 2. Overview of complexity results surrounding backwards induction in partial order games.
Table 2. Overview of complexity results surrounding backwards induction in partial order games.
FIT FOR B-IIS B-IIS B-I ACTIONS
partial order games coNP -compl. coNP -compl. NEXPTIME -compl.
total-order gamesconstant time coNP -compl. PSPACE -compl.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Zahoransky, V.; Gutierrez, J.; Harrenstein, P.; Wooldridge, M. Partial Order Games. Games 2022, 13, 2. https://doi.org/10.3390/g13010002

AMA Style

Zahoransky V, Gutierrez J, Harrenstein P, Wooldridge M. Partial Order Games. Games. 2022; 13(1):2. https://doi.org/10.3390/g13010002

Chicago/Turabian Style

Zahoransky, Valeria, Julian Gutierrez, Paul Harrenstein, and Michael Wooldridge. 2022. "Partial Order Games" Games 13, no. 1: 2. https://doi.org/10.3390/g13010002

APA Style

Zahoransky, V., Gutierrez, J., Harrenstein, P., & Wooldridge, M. (2022). Partial Order Games. Games, 13(1), 2. https://doi.org/10.3390/g13010002

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop