You are currently viewing a new version of our website. To view the old version click .
Games
  • Editor’s Choice
  • Article
  • Open Access

21 December 2021

Partial Order Games

,
,
and
1
Fachbereich Informatik, Université du Luxembourg, Maison du Nombre, 6 Avenue de la Fonte, L-4364 Esch-sur-Alzette, Luxembourg
2
Faculty of Information Technology, Monash University, Woodside Building for Technology and Design, 20 Exhibition Walk, Clayton, VIC 3168, Australia
3
Department of Computer Science, University of Oxford, Wolfson Building, Parks Road, Oxford OX1 3QD, UK
*
Author to whom correspondence should be addressed.
This article belongs to the Section Algorithmic and Computational Game Theory

Abstract

We introduce a non-cooperative game model in which players’ decision nodes are partially ordered by a dependence relation, which directly captures informational dependencies in the game. In saying that a decision node v is dependent on decision nodes v 1 , , v k , we mean that the information available to a strategy making a choice at v is precisely the choices that were made at v 1 , , v k . Although partial order games are no more expressive than extensive form games of imperfect information (we show that any partial order game can be reduced to a strategically equivalent extensive form game of imperfect information, though possibly at the cost of an exponential blowup in the size of the game), they provide a more natural and compact representation for many strategic settings of interest. After introducing the game model, we investigate the relationship to extensive form games of imperfect information, the problem of computing Nash equilibria, and conditions that enable backwards induction in this new model.

1. Introduction

The two most important game models in non-cooperative game theory are normal form games and extensive form games. These games are distinguished by the information that players have about the strategies of other players. In normal form games, players must select and commit to strategies without any information relating to the strategies of others. In contrast, in extensive form games, players make moves alternately over time and, during play, their strategy may be informed by moves made previously. Variations of extensive form games—e.g., of imperfect information or recall—make it possible to capture the information available to a player when called upon to make a move.
In this article, we introduce partial order games. The key distinguishing feature of partial order games is that they are equipped with a dependence relation, which explicitly captures the informational dependencies between decision nodes. In saying that a decision node v is dependent on decision nodes v 1 , , v k , we mean that the information available to a strategy making a choice at v is precisely the choices that were made at v 1 , , v k . Observe that the informational dependencies captured by a dependence relation give rise to a partial temporal order over choices: in this case, the choices for v 1 , , v k must strictly precede the choice for v. However, the temporal ordering induced in this way is indeed partial: it may be that two choices v and v are independent, in which case one can say nothing about their temporal order.
In a technical sense, partial order games are no more expressive than extensive form games of imperfect information: one of our key results is to show that every partial order game can be transformed into a strategically equivalent extensive form game of imperfect information. However, for some settings, partial order games have significant advantages over their extensive form representation:
  • First, informational dependencies in partial order games are explicitly captured, while in extensive form games, they are left implicit in the information sets of the game. As a consequence, some settings are much more transparently and naturally represented using partial order games, compared to the extensive form.
  • Second, partial order games can be exponentially more compact than their extensive form. From a purely practical point of view, this means that some scenarios with a compact partial order representation cannot be handled (by people or by a computer) in their extensive form.
We analyse partial order games by means of Nash equilibrium—arguably game theory’s most prominent non-cooperative solution concept—and the solutions given by a natural backwards induction procedure for partial order games defined on the dependence relation. Here, we give special attention to the computational complexity of calculating such solutions in partial order games. The remainder of this article is structured as follows.
  • We begin by briefly recalling some necessary concepts from graph theory and game theory, and then introduce the formal framework of partial order games.
  • As we are interested in the computational properties of partial order games, we introduce a compact representation for strategies and utility functions in partial order games, based on Boolean circuits, which enables us to investigate questions about their computational complexity.
  • In Section 4, we study the relationship of our game model to other game models: partial order Boolean games [1], Multi-Agent Influence Diagrams (MAIDs) [2,3], and extensive form games of imperfect information. Our main result in this section is to present a technique for translating partial order games into strategically equivalent extensive form games, although this translation comes at the cost of an exponential blowup in the size of the game. This leads us into a discussion of the use of partial order games as a compact representation of extensive form games.
  • In Section 5, we investigate the problem of computing Nash equilibria in partial order games. For example, we show that checking whether a game has any pure strategy Nash equilibria is NEXPTIME-complete.
  • In Section 6, we study backwards induction in partial order games. As partial order games are inherently games of imperfect information, it follows that backwards induction does not always work for partial order games. We thus investigate cases where forms of backward induction work for partial order games, and present a condition on games that we call fit for backwards induction, which is sufficient to allow backwards induction.
  • We conclude with a brief discussion and pointers for future work.

2. Preliminary Definitions

We begin by recalling some concepts from graph theory and game theory, which we use in the remainder of the paper.

2.1. Directed Acyclic Graphs and Trees

The games in this paper are defined on directed acyclic graphs (DAGs). Formally, a directed acyclic graph is a pair ( V , E ) , where V = { v 1 , , v | V | } is a finite set of vertices or nodes and E V × V a set of directed edges (or arcs) on V. We assume E to be acyclic (and thus also irreflexive). We say that vertex u is a parent of vertex v, and v a child of u, if ( u , v ) E . We occasionally also use infix notation and write u E v for ( u , v ) E . The depth of a vertex v, denoted by depth ( v ) , is defined recursively such that if v has no parents, depth ( v ) = 0 , and otherwise, depth ( v ) equals the maximum depth of v’s parents plus 1. Formally,
depth ( v ) = 0 if v has no parents , max ( u , v ) E depth ( u ) + 1 otherwise .
We say that vertex v is reachable from vertex u if u = v , or if there is some vertex w such that w is reachable from u and ( w , v ) E , that is if u   E * v , where E * is the reflexive and transitive closure of E.
A topological sorting is a permutation τ = ( u 1 , , u | V | ) of the vertices v 1 , , v | V | in V such that u i   E   u j implies i < j . Every DAG ( V , E ) admits at least one topological sorting of its vertices; this well-known fact will be important and useful in our study of partial order games.
A tree is a directed acyclic graph in which there is a unique vertex with no parents (the root of the tree, often denoted by v 0 ), and where every non-root vertex has a single parent. If a tree is finite, then, as it is acyclic, some vertices will have no children: we refer to these as the leaves of the tree. Observe that the root of the tree is the unique vertex of depth 0.

2.2. Normal-Form Games

We use normal-form games as the basis of our game-theoretic analysis of partial order games. Normal-form games are defined by a set of players, the strategies the players have at their disposal, and the preferences the players have over the outcomes that players choosing their strategies give rise to. Formally, a normal-form game is given by a tuple ( N , ( S i ) i N , ( u i ) i N ) , where N = { 1 , , | N | } is the set of players, S i is the set of (pure) strategies available to player i, and u i : S 1 × × S | N | R is a utility function for each player i [4]. We refer to tuples s = ( s 1 , , s | N | ) in S 1 × × S | N | as strategy profiles.
Nash equilibrium is the most important solution concept in non-cooperative game theory; it captures the idea of a joint course of action that is stable in the sense that no player has an incentive to deviate unilaterally from it. Furthermore, in this paper, it is one of the main analytic tools by means of which we evaluate partial order games. Formally, a strategy profile s = ( s 1 , , s | N | ) is a Nash equilibrium if, for all players i and strategies s i S i , we have
u i ( s ) u i ( s i , s i ) ,
where ( s i , s i ) = ( s 1 , , s i 1 , s i , s i + 1 , , s | N | ) .
We say that two normal-form games ( N , ( S i ) i N , ( u i ) i N ) and ( N , ( S i ) i N , ( u i ) i N ) are strategically isomorphic if there are bijections ϕ : N N and ψ i : S i S f ( i ) such that, for all players i in N and all strategy profiles s = ( s 1 , , s n ) and s = ( s 1 , , s n ) we have
u i ( s ) u i ( s ) if and only if u ϕ ( i ) ( ψ ( s ) ) u ϕ ( i ) ( ψ ( s ) )
where ψ ( s ) = ( ψ i ( s 1 ) , , ψ n ( s n ) ) and ψ ( s ) = ( ψ i ( s 1 ) , , ψ n ( s n ) ) .
Given a normal-form game, two strategies s i , s i in S i for a player i are said to be equivalent, in symbols s i s i , if u j ( s i , s i ) = u j ( s i , s i ) for all profiles s i S 1 × × S i 1 × S i + 1 × × S | N | and all players j. Let [ s i ] be the equivalence class of s i under the equivalence relation ∼, and denote [ s ] = ( [ s 1 ] , , [ s | N | ] ) . Following the work in [5], we define the reduced normal-form of a normal-form game ( N , ( S i ) i N , ( u i ) i N ) as the game ( N , ( S i ) i N , ( u i ) i N ) where for each player i, S i = { [ s i ] : s i S i } and u i ( [ s ] ) = u i ( s ) for all strategy profiles s = ( s 1 , , s | N | ) . Two games are then said to be strategically equivalent if they induce strategically isomorphic reduced normal-forms.

2.3. Extensive-Form Games

An extensive-form game (of imperfect information) is based on a directed tree ( V , E ) and is played by a set of players N choosing from a set of actions A, starting from the root of the tree. Within the set V of vertices, we distinguish between action or decision nodes, which have children, and leaf nodes, which have no children. The set of action nodes we denote by D and the set of leaf nodes by L. With every action node v in D we associate a unique player i v , who is then said to be active at v, and an action set A v A such that every outgoing edge ( v , u ) is associated with a unique action in A v . Let V i denote the set of action nodes associated with player i, that is, V i = { v V : i v = i } , and let A i = { a A v : v V i } .
Every vertex v is identified with a unique sequence of actions, called a history, which leads from the root v 0 to v. Histories associated with leaf nodes are referred to as terminal histories. We will denote the history associated with vertex v by h v . We can also think of vertices v as histories: vertex v is the sequence h v = ( a 1 , , a depth ( v ) ) of actions for which there is a path v 0 = w 0 , w depth ( v ) = v such that ( w i , w i + 1 ) is associated with action a i + 1 for every 0 i < depth ( v ) . The root v 0 is thus the empty sequence ( ) . Thus, we also find that v E w if and only if w = v a for some a A v .
For every player i, we have a partitioning I i = { I 1 , , I | I i | } of their decision nodes V i into (non-empty) information sets. We write [ v ] for the information set vertex v belongs to. Here, it is understood that A u = A v , whenever u and v are in the same information set I. We then also write A I for A u and A v . Information sets may be singletons. If all information sets for all players are singletons, we say the game is of perfect information and do not distinguish between information set { v } and the node v itself, if no confusion is likely.
Finally, each player i’s preferences are represented by a utility value u h i R at each Version December 9, 2021 submitted to Games terminal history/leaf h . See Figure 1 for an example.
Figure 1. Extensive-form game of perfect information. The players are depicted besides the nodes at which they are active. Vertices are identified with histories, for instance, the root. v 0 with (   ) vertex v 3 with a 1 c 1 and the leaf node/terminal history labelled with a 0 b 00 omitting parentheses and commas for better readability.
A strategy for player i in an extensive-form game is a function s i : I i A i such that s i ( I ) A I for all I I i . The set of strategies available to player i we then denote by S i . A strategy profile is then a sequence s = ( s 1 , , s | N | ) of strategies, one for each player. Let D = { v 1 , , v | D | } . Then, every strategy profile s = ( s 1 , , s | N | ) defines an action profile  a s = ( a v 1 , , a v | D | ) such that a v = s i ( [ v ] ) and where i is a player who is active at v. Observe that a strategy profile s = ( s 1 , , s | N | ) also defines a unique path w 0 , , w k from the root w 0 to a leaf w k with history h s = ( a 1 , , a k ) defined such that, for all 0 i < k , edge ( w i , w i + i ) is associated with action a i = s i ( [ w i ] ) , where i is the player who is active at w i . Finally, we have for each player i a utility function u i : S 1 × × S | N | R associating each strategy profile s = ( s 1 , , s | N | ) with a real value such that u i ( s ) = u i h s .
The players N of an extensive-form game, together with their strategies S i and their utilities u i , induce a normal-form game ( N , ( S i ) i N , ( u i ) i N ) . Note that there may be multiple strategy profiles giving rise to the same terminal history, that is, it may well be that h s = h s , even though s s . Therefore, the size of this normal-form game may be exponential in the size of the underlying extensive-form game, if represented naively. This is because there are I I | A I | = O ( | A | | V | ) strategy profiles, which are implicit in the definition of an extensive-form game: see Figure 2 for an illustration.
Figure 2. The strategic form game associated with the extensive-form game in Figure 1.

2.4. Boolean Circuits

In computer science, Boolean circuits are a well-established model for computing Boolean functions (cf., e.g., [6,7]). In this paper, we will make extensive use of them to concisely represent strategies and utility functions. We will here briefly review their definition, largely following Jukna’s exposition [7].
A k-ary Boolean function is a function β : { 0 , 1 } k { 0 , 1 } , where we allow k = 0 . The base of a Boolean circuit is given by a set B = { β 1 , , β k } of Boolean functions. In this paper, we will restrict attention to the set of classical Boolean functions B = { ¬ , , } (“not”, “or”, and “and”, respectively), which is known to be functionally complete (i.e., sufficient to define any Boolean function).
Formally, a Boolean circuit (or straight line program) on n variables x 1 , , x n over base B is given by a sequence of t n gates γ 1 , , γ t . The first n gates are given by the variables, that is, γ 1 = x 1 , , γ n = x n , and are also referred to as input gates. Another subset of gates is singled out as the set of output gates. Boolean formula over variables x 1 , , x n are thus represented by a Boolean circuit with only one output gate. Each subsequent gate γ i is the application of a k-ary Boolean function β i in the base B to k previous gates, that is, γ i = β i ( γ j 1 , , γ j k ) , where j 1 , , j k < i . The variables take values in { 0 , 1 } . Given values α 1 , , α n in { 0 , 1 } for the variables x 1 , , x n , one can inductively associate each gate with a value α ( γ i ) such that α ( x i ) = α i , if 1 i n , and
α ( β i ( γ j 1 , , γ j k ) ) =   β i ( α ( γ j 1 ) , , α ( γ j k ) ) ,
if n < i t . A Boolean circuit C given by γ 1 , , γ t on n variables x 1 , , x n and with m output gates γ j 1 , , γ j m then computes the function ϕ C : { 0 , 1 } n { 0 , 1 } m that, on values α = ( b 1 , , b n ) for the input gates, yields ϕ C ( α ) = ( α ( γ j 1 ) , , α ( γ j m ) ) .
A Boolean circuit is commonly depicted as a directed acyclic graph ( V , E ) , with the gates as vertices, that is, V = { γ 1 , , γ n } , and ( γ i , γ j ) E whenever γ i = β i ( γ j 1 , , γ j k ) and γ j = γ j for some 1 n . For examples of Boolean circuits, also see Figure 3 and Figure 4, below.
Figure 3. A simple example of a partial order game. On the left, the DAG, where each vertex v is labelled i : x , y , where i is the player active at the respective vertex and x and y are the actions available to i at v. On the right, the representation of the utility function which associates a numeric value with each action profile.
Figure 4. The Boolean circuit C f v 3 14 representing strategy f v 3 14 : A v 1 × A v 2 A v 3 of Example 1. The value of x 0 represents the choice for a or b at v 1 , the value of x 1 the choice for c or d at v 2 , and the value of y 0 the choice for e or f at v 3 . The local indices of a, c, and e are given by 0, and those of b, d, and f by 1. The instantiation on the right thus represents that f v 3 14 ( a d ) = f .
The size of a Boolean circuit given by gates γ 1 , , γ t on variables x 1 , , x n is given by the number t n of its gates minus the input gates. It is a well-established fact that the problem of computing ϕ C ( α ) for given values α { 0 , 1 } n for C ’s variables, which is also known as the Circuit Value Problem or Circuit Evaluation Problem, is complete for P under uniform AC 0 -reductions, and therefore can be computed in polynomial time (see in [8], p. 59).

3. Partial Order Games

We now introduce the framework of partial order games. The basic idea is that, as in extensive form games, the game contains a number of decision nodes, which are partioned among the players. However, play in the game is not defined by a game tree. Instead, partial order games have a binary dependence relation over decision nodes. This dependence defines the information available to a player when it makes a choice. If a decision node v for player i is dependent on decision nodes v 1 , , v k , then this means that the information available to a strategy when making a choice at v is precisely the choices that were made at decision nodes v 1 , , v k . To play a partial order game, a player must choose a selection of strategies (we call them vertex strategies), one for each of their decision nodes. A strategy for a decision node v must select a choice for that decision node taking as input the choices that were made for the decision nodes v 1 , , v k upon which it is dependent. In this way, we have an explicit representation of the information available when making a choice, in contrast to the use of information sets in extensive form games. Utilities in partial order games are not associated with individual nodes (as in extensive form games), but derive from the total profile of actions that were made.
Our usual way of thinking about extensive form games is as players alternating to make moves, working their way down the game tree to a leaf node; upon reaching a leaf node, the game is over. Thinking about games in this way naturally gives rise to a temporal order over choices in the game: a choice is made for the root node first, and then successive nodes in the game tree. Partial order games are more abstract than this. They induce only a partial temporal ordering over decision nodes: while the choices for v 1 , , v k must strictly precede the choice for v if v is dependent on v 1 , , v k , it may well be that decision nodes are independent: if nodes v and v are in disconnected components of the dependence graph, then we can say nothing about their temporal order.
Let us consider an example.
Example 1.
Consider the three-person partial order game depicted in Figure 3, with the directed acyclic graph it is based on given on the left. To the right of each vertex v, we have indicated the player i v and the actions they have at their disposal at v. For instance, at vertex v 2 , player 2 can choose among the actions c and d. If all players make a choice at their respective vertices, an action profile results, which then is associated with a utility value. For instance, if player 1 chooses a at v 1 and h at v 4 , player 2 chooses c at v 2 , and player 3 chooses f at v 3 , the action profile ( a , c , f , h ) (also denoted by a c f h ) results, yielding utility values of 1 to player 1, and 2 to players 2 and 3. The table on the right summarises the utilities the players get under the different action profiles that may be played. Note that in this game there are 2 4 possible action profiles.
The crucial feature of partial order games, however, is that the players can make their choices at a given vertex v depending on the choices the other players make at the parents of v. For instance, player 2 could adopt the strategy to play c at v 2 , if player 1 chooses a at v 1 , and plays d otherwise. Accordingly, player 2 has 2 2 = 4 strategies at their disposal at v 2 . As v 1 does not have any parents, player 1 has only two strategies at their disposal at v 1 , but they have 2 2 = 4 strategies at v 4 . Meanwhile, player 3 has 2 4 = 16 strategies at v 3 . A strategy profile specifies a strategy for each player at each of their vertices. Accordingly, in this game there are 2 × 2 2 × 2 4 × 2 2 = 2 9 = 512 possible strategy profiles.
The number of strategy profiles clearly outnumber the number of action profiles. Still, every strategy profile induces a unique action profile. This relationship, however, is not generally injective, as multiple strategy profiles may induce the same action profile. Assume, for instance, that the players adopt the following strategies:
  • player 1 chooses a at v 1 ;
  • player 2 chooses c if player 1 plays a at v, and d, otherwise;
  • player 3 chooses e if player 1 chooses a at v 1 and player 2 d at v 2 , and f, otherwise; and
  • player 4 chooses h independently of whether player 2 chooses c or d at v 2 .
If this strategy profile is played, it can readily be seen that action profile a c f h results, which we have already seen gives utilities 1, 2, and 2 to the players 1, 2, and 3, respectively.
We now proceed to our formal definition of partial order games, which enables the application of Nash equilibrium as well as the formulation of a natural backwards induction procedure later on. As before, let N = { 1 , , n } be a finite set of players. Then a partial order game on a directed acyclic graph ( V , E ) —the game’s dependency graph—associates each vertex v in V with a unique player in N, denoted by i v , and a non-empty set of actions, denoted by A v .
One player may be associated with multiple vertices, that is, it may very well be that i u = i v even if v u . We let V i denote the set of vertices associated with player i. We also say that player i is active at any vertex v in V i . If E is also connected—that is, if v E w or w E v for all distinct v and w in V—we also refer to any partial order game based on it as a total-order game. As E is assumed to be acyclic, it follows that E is transitive in total order games. On the other end of the spectrum, we have partial order games on dependency graphs ( V , E ) with E = Ø . This class of games we will also refer to as empty-order games.
Given a subset of W = { w 1 , , w k } of vertices, an action profile for W is a tuple a W = ( a w 1 , , a w k ) specifying one action a w j for each vertex w j . In our examples, we occasionally omit parentheses and commas, and write a w 1 a w k for ( a w 1 , , a w k ) . The set of action profiles for W we denote by A W . For the set V = { v 1 , , v | V | } of all vertices, we write write a = ( a v 1 , , a v | V | ) for a V and refer to a simply as an action profile. We denote the set of action profiles by A = A v 1 × × A v | V | . For a set V i = { w 1 , , w k } of vertices at which a player i is active, we also write a i = ( a w 1 , , a w k ) for a V i and refer to a i as an action profile for player i. We let A i denote the set of actions profiles for a player i. For action profiles a W = ( a w 1 , , a w k ) and b U = ( b u 1 , , b u m ) for disjoint sets W and U, we denote by ( a W , b U ) the action profile ( a w 1 , , a w k , b u 1 , , b u m ) for W U . If U = V \ W we also write ( a W , a W ) for ( a W , a U ) , and if U = V \ V i , we also write ( a i , a i ) . Given a topological sorting τ = ( v 1 , , v | V | ) , we define a τ -history as a sequence of actions h = ( a v 1 , , a v k ) in A v 1 × × A v k for some 1 k | V | . We stipulate the empty sequence of actions, denoted by ( ) , to be a τ -history as well. For an action profile a = ( a v 1 , , a v | V | ) , we say that h = ( b v 1 , , b v k ) is a history of a if ( b v 1 , , b v k ) = ( a v 1 , , a v k ) .
At each vertex v, player i v can make their choice of action dependent on the actions chosen at the parents of v. A (conditional) strategy at a vertex v, or vertex strategy, is therefore a function
f v : A y 1 × × A y k A v ,
where y 1 , , y k are the parents of v. We say that f v is unconditional if it is constant, that is, if f v ( a 1 , , a k ) = f v ( b 1 , , b k ) for all action profiles a = ( a 1 , , a k ) and b = ( b 1 , , b k ) . In this case, for a A v , we denote by f v a the unconditional strategy at vertex v that maps every action profile of v’s parents to action a. We will sometimes identify unconditional strategies f v with the action f v ( a 1 , , a k ) it specifies for each action profile a = ( a 1 , , a k ) and write a = ( a v 1 , , a v | V | ) for the profile of unconditional strategies f a = ( f v 1 a v 1 , , f v | V | a v | V | ) . Note that if vertex v has no parents, then A u 1 × × A u k = { ( ) } , where ( ) is the empty tuple. Accordingly, f v determines a single choice among A v , namely, f v ( ( ) ) in this case. By F v we denote the set of conditional strategies available at a vertex v.
By a strategy profile we then understand a profile f = ( f v 1 , , f v | V | ) consisting of one conditional strategy for each vertex. For f = ( f v 1 , , f v | V | ) a strategy profile and W = { w 1 , , w k } a subset of vertices in V, we let f W = ( f w 1 , , f w k ) denote the partial profile that is like f but restricted to W V . Let f 1 = ( f v 1 1 , , f v | V | 1 ) through f k = ( f v 1 k , , f v | V | k ) be k strategy profiles and { X 1 , , X k } be a k-partition of the vertex set V. Then, ( f X 1 1 , , f X k k ) denotes the strategy profile g = ( g v 1 , , g v | V | ) such that, for every vertex v in V, we have g v = f v j , if v X j .
A strategy for a player i is a profile f V i = ( f v 1 , , f v k ) of vertex-strategies, where V i = { v 1 , , v k } , is the set of vertices at which player i is active. We generally denote f V i by f i , and the set of strategies available to player i by F i . For strategy profiles f = ( f v 1 , , f v | V | ) and g = ( g v 1 , , g v | V | ) and a player i, we also write ( f i , g i ) for the strategy profile ( f V \ V i , g V i ) .
For every vertex v, a strategy profile f = ( f v 1 , , f v | V | ) defines an action a v f , the evaluation of f at v, recursively as follows:
a v f = f v ( ( ) ) if depth ( v ) = 0 , f v ( a u 1 f , , a u k f ) if depth ( v ) > 0 and u 1 , , u k are the parents of v .
Observe that a v f is well defined because in a directed acyclic graph vertices with depth 0 exist and the parents of each vertex are of lower depth than the vertex itself. A strategy profile f = ( f v 1 , , f v | V | ) can thus be seen to induce the action profile a f = ( a v 1 f , , a v | V | f ) . We also say that strategy profile f = ( f v 1 , , f v | V | ) sustains action profile a = ( a v 1 , , a v | V | ) if a = a f . We occasionally denote by ( a W , f U ) the action profile a g where g is the strategy profile ( f W a , f U ) which specifies unconditional strategies for W. In other words, we have ( a v 1 , , a v k , f v k + 1 , , f v | V | ) denote the action profile that results if on v 1 through v v k actions a v 1 , , a v k are played, and at vertices a v k + 1 through a v | V | the strategies f v k + 1 through f v | V | .
Note the difference between actions and action profiles on the one hand, and strategies and strategy profiles on the other. Every strategy profile induces a unique action profile, whereas the same action profile may be induced by different strategy profiles. In an important sense, the action profiles are the outcomes of the game.
Therefore, we take the set of action profiles of a partial order game as its set of outcomes, over which the players’ preferences are defined. Each player’s preferences over the action profiles are given by a real-valued utility function  u i : A R , where we assume that player istrictly prefers action profile a = ( a v 1 , , a v | V | ) to action profile b = ( b v 1 , , b v | V | ) whenever u i ( a ) > u i ( b ) . We extend utility functions u i to strategy profiles f , and write u i ( f ) for u i ( a f ) . To fix concepts and notation, we consider Example 1 once more.
Example 1 (cont’d).
Consider the game in Figure 3. At vertices v 1 , v 2 , and v 4 , we have the following strategies:
F v 1 ( ) F v 2 a b F v 4 c d f v 1 1 a f v 2 1 c c f v 4 1 g g f v 1 2 b f v 2 2 c d f v 4 2 g h f v 2 3 d c f v 4 3 h g f v 2 4 d d f v 4 4 h h
At vertex v 3 , moreover, we have 2 4 strategies, illustrating the exponential blowup that results from strategies being represented explicitly.
F 3 a c a d b c b d a c a d b c b d a c a d b c b d a c a d b c b d f v 3 1   e   e     e     e f v 3 5   e   f     e     e f v 3 9   f   e     e     e f v 3 13   f   f     e     e f v 3 2   e   e     e     f f v 3 6   e   f     e     f f v 3 10   f   e     e     f f v 3 14   f   f     e     f f v 3 3   e   e     f     e f v 3 7   e   f     f     e f v 3 11   f   e     f     e f v 3 15   f   f     f     e f v 3 4   e   e     f     f f v 3 8   e   f     f     f f v 3 12   f   e     f     f f v 3 16   f   f     f     f
The (conditional) strategy profile f = ( f v 1 1 , f v 2 3 , f v 3 8 , f v 4 2 ) , then, for instance, yields the evaluation a f = ( a v 1 f , a v 2 f , a v 3 f , a v 4 f ) = ( a , d , f , h ) as
a v 1 f = f v 1 1 ( ( ) ) = a a v 2 f = f v 2 3 ( a v 1 f ) = f v 2 3 ( a ) = d a v 3 f = f v 3 8 ( a v 1 f , a v 2 f ) = f v 3 8 ( a , d ) = f a v 4 f = f v 4 2 ( a v 2 f ) = f v 4 2 ( d ) = h
Accordingly, the utilities on this profile f = ( f v 1 1 , f v 2 3 , f v 3 8 , f v 4 2 ) for these three players are therefore determined by the action profile a d f h . Hence, u 1 ( f ) = 0 , u 2 ( f ) = 0 , and u 3 ( f ) = 2 .
With the players N, their conditional strategies F i , and their preferences u i : F R over strategy profiles, a partial order game on a directed acyclic graph ( V , E ) can thus be seen to define immediately a game in normal-form ( N , ( F i ) i N , ( u i ) i N ) (cf., Section 2.2, above). Accordingly, partial order games are directly amenable to game-theoretical analysis using the usual solution concepts, in particular, Nash equilibrium.
Conversely, every normal-form game ( N , ( S i ) i N , ( u i ) i N ) can be seen as a partial order game on the same set N of players. Its dependency graph is then given by ( N , Ø ) , associating with vertex i N , player i N and strategy set S i . Identifying each unconditional strategy f i with f i ( ( ) ) = s i in S i , we adopt the utility functions u i unaltered for the utility functions of the partial order game.

3.1. Concise Representations for Strategies, Profiles, and Utilities

The transformation from partial order games to normal-form games comes at the cost of an exponential blowup. Observe that in the definition of partial order games, the sets of action profiles and strategy profiles are defined implicitly. Given a partial order game on a directed acyclic graph ( V , E ) with a set of players N and actions A = A v 1 A v | V | , the number of action profiles a = ( a v 1 , , a v | V | ) is bounded by O ( | A | | V | ) , and so is the number of unconditional strategy profiles f a = ( f v 1 a v 1 , , f v | V | a v | V | ) . By contrast, the number of both conditional strategies and the number of conditional strategy profiles are bounded by O ( | A | ( | A | | V | ) ) . It is also worth observing that the size of both a conditional strategy f and a profile f = ( f v 1 , , f v | V | ) of conditional strategies is bounded by O ( | A | | V | ) .
Even though the dependency graph allows a concise representation of the strategies and strategy profiles in a partial order game, they remain large objects. It is therefore desirable to have concise representations of action profiles, utility functions, strategies, and strategy profiles. To this end, we take recourse to Boolean circuits, a formalism well established in theoretical computer science (see Section 2.4). We first show how action profiles, utility functions, and strategy profiles can be represented concisely by Boolean circuits. We then prove two lemmas that will be useful for our later complexity proofs in Section 5 and Section 7.
When representing partial order games concisely, we assume that for distinct vertices v and w the action sets A v and A w are disjoint. Having assumed A v to be non-empty for each vertex v, it thus follows that | V | | A | . For ease of presentation, we also assume that | A v | is an integer power of 2 for each vertex v. In this section, we assume that for each vertex v, the elements of set A v are enumerated as a 0 , , a | A v | 1 , thus associating each action a k with a unique local index k at v. Let bin ( k ) denote the binary representation of the local index k, using exactly  log 2 | A v | digits. Thus, if | A v | = 16 , we have bin ( 3 ) = 0011 .
Let v be a vertex with parents u 1 , , u k . We then represent a strategy f v : A u 1 × × A u k A v by a Boolean circuit C f v with log 2 | A u 1 | + + log 2 | A u k | input gates and log 2 | A v | output gates. For an action profile a = ( a u 1 , , a u k ) in A u 1 × × A u k and a v an action in A v , we then have that f v ( a u 1 , , a u k ) = a v if and only if on input bin ( j 1 ) bin ( j k ) the circuit C f v evaluates to outputs bin ( i ) , where j 1 , , j k and i, are the local indices of a v 1 , , a v k , and a v at their respective vertices. As C f v can be seen as combining log 2 | A v | Boolean functions in log 2 | A u 1 | + + log 2 | A u k | variables and log 2 | A u 1 | + + log 2 | A u k | | A | , we may assume that the circuit C f v is of size at most exponential in | A | . More precisely, we may assume that the size of C f v is Θ ( 2 | A | | A | ) [9,10].
Example 2.
For an example, see Figure 4, which depicts the Boolean circuit C f v 3 14 for strategy f v 3 14 : A v 1 × A v 2 A v 3 in Example 1, given by
f v 3 14 ( a ) = e if a = b c , f otherwise .
Recall that A v 1 = { a , b } , A v 2 = { c , d } , and A v 3 = { e , f } . Let the local indices of a, c, and e be given by 0, and those of b, d, and f by 1. As | A v 1 |   =   | A v 2 |   =   | A v 3 |   =   2 , any action profile a A v 1 × A v 2 can be represented by two Boolean variables x 0 and x 1 , where the value of x 0 represents the choice for a or b at v 1 , the value of x 1 the choice for c or d at v 2 . For instance, the action profile a d A v 1 × A v 2 can thus be represented by setting x 0 to 0 and x 1 to 1. For these values the circuit C f v 3 14 evaluates to 1, which corresponds to f v 3 14 ( a d ) = f and as depicted in Figure 4.
Observe that an unconditional strategy f v a at a vertex v with parents u 1 , , u k , which maps every profile ( a u 1 , , a u k ) invariably to action a in A v , is represented by a Boolean circuit C f v a of size at most polynomial in | A | . The circuit C f v a will still have 1 m k log 2 | A u m | inputs and log 2 | A v | outputs, but will involve only one -gate and one -gate. Let j be the local index of a at v and bin ( j ) = b 1 , , b log 2 | A v | . Then, for 1 log 2 | A v | , connect the -gate with output y , if b = 0 , and the -gate with output y , if b = 1 .
Similarly, a rational-valued utility function u i : A v 1 × × A v | V | Q can be represented by a Boolean circuit C u i . This circuit will have v V log 2 | A v | inputs and outputs y 1 , , y k , z 1 , , z m . Moreover, for ( a v 1 , , a v | V | ) an action profile in A v 1 × × A v | V | with local indices j 1 , , j | V | at their respective vertices, we have u i ( a v 1 , , a v | V | ) = n d + 1 if and only if on input bin ( j 1 ) , , bin ( j | V | ) the circuit C u i yields bin ( n ) for outputs y 1 , , y k and bin ( d ) for outputs z 1 , , z m (Observe that if u i ( a v 1 , , a v | V | ) is a positive integer, the outputs z 1 , , z m can be dispensed with, as x Ø x = 0 and d = 1 j m z j · 2 j ). Again, as | A u 1 | + + | A u k | | A | , we may assume that any such circuit C u i will be of size at most exponential in | A | .
Example 3.
Figure 5 illustrates how a utility function u i : A v 1 × A v 2 × A v 3 × A v 4 R for a player i in the game of Example 1 is represented by a Boolean circuit. The utility function u i is depicted on the right. We assume that the local indices of a, c, e, and g be given by 0, and those of b, d, f, and h by 1. As | A v 1 |   =   | A v 2 |   =   | A v 3 |   =   | A v 4 |   =   2 , an action profile a in A v 1 × A v 2 × A v 3 × A v 4 is thus given by means of an assignment to four Boolean variables x 0 , x 1 , x 2 , and x 4 , representing the choices at the vertices v 1 , v 2 , v 3 , and v 4 , respectively. For instance, action profile a c f h is represented by setting variables x 0 and x 1 to 0, and variables x 2 and x 3 to 1, which can thus also be denoted by the binary string 0011. Evaluating the circuit for these values, we find that y 1 is set to 0 and y 0 to 1. This corresponds to u i ( a c f h ) = 1 and bin ( 1 ) = 01 . Observe that the Boolean circuit can be evaluated in time polynomial in its size and that it is exponentially smaller than the explicit tabelling of the utility function on the right.
Figure 5. The Boolean circuit on the left represents the utility function u i on the right for a player i over the action profiles of the game in Example 1. The binary representations of action profiles and numerical values are depicted in parentheses.
We conclude this section by showing two useful lemmas. The first establishes that, even if a conditional strategy profile f = ( f v 1 , , s v | V | ) is represented as a Boolean circuit, the action profile a f = ( a v 1 , , a v | V | ) that f gives rise to, can be computed in polynomial time.
Lemma 1.
Let f = ( f v 1 , , f v | V | ) be a conditional strategy profile represented by Boolean circuits C f v 1 , , C f v | V | . Then, the action profile a f = ( a v 1 f , , a v | V | f ) can be computed in polynomial time.
Proof Sketch.
Proceeding inductively, find for every vertex v the action a v f as follows. For vertices v of depth 0, the Boolean circuit C f v should give the local index in binary of action a v f at v, and therewith, a v f immediately. For a vertex v of a strictly positive depth with parents u 1 , , u k , we may assume that the local indices in binary bin ( j u 1 ) , , bin ( j u k ) of a u 1 f , , a u k f can be computed in polynomial time. As the evaluation problem for Boolean circuits is solvable in polynomial time, the circuit C f v for inputs bin ( j u 1 ) , , bin ( j u k ) can also be evaluated in polynomial time, providing us with the local index of f v ( a u 1 f , , a u k f ) at v in binary, which gives us the result. □
Lemma 2.
Let f = ( f v 1 , , f v | V | ) be a conditional strategy profile represented by Boolean circuits C f v 1 , , C f v | V | , and u i : A v 1 × × A v | V | Q be a utility function represented by a Boolean circuit C u i . Then, u i ( f ) can be computed in polynomial time.
Proof Sketch.
Thanks to Lemma 1, we can compute the action profile a f = ( a v 1 f , , a v | V | f ) in polynomial time. Let j 1 , , j | V | be the local indices of a v 1 f , , a v | V | f at their respective vertices. Using bin ( j 1 ) , , bin ( j | V | ) as inputs for the circuit C u i , we obtain the binary encoding of u i ( a v 1 f , , a v | V | f ) . As the evaluation problem for Boolean circuits is solvable in polynomial time, this suffices for the result. □

5. Nash Equilibria

We saw in Section 2.2 how every partial order game defines a normal-form game. Accordingly, partial order games are amenable to game-theoretic analysis using the standard non-cooperative solution concepts that are available for normal-form games. In this section, we consider several complexity problems surrounding Nash equilibrium in partial order games.
We have the following lemma that will be useful for proving the complexity results in this section. It states that, if a player has a profitable deviation to a conditional strategy, then they also have a profitable deviation to an unconditional strategy. In other words, if a player has a best response, then they also have an unconditional best response (This is reminiscent of mixed strategies for (finite) normal-form games, where, due to the linearity of expected utility, a player having a best response implies their having a pure best response).
Lemma 4.
Let f = ( f 1 , , f n ) be a profile of conditional strategies. Then, if there is some strategy g i for player i with u i ( f i , g i ) > u i ( f ) , there also is an unconditional strategy g i such that u i ( f i , g i ) > u i ( f ) . Therefore, f = ( f v 1 , , f v | V | ) is a Nash equilibrium if and only if u i ( f ) u i ( f i , g i ) for all players i and unconditional strategies g for i.
Proof. 
Assume that there is some strategy g i for player i with u i ( f i , g i ) > u i ( f ) . Let v i 1 , , v i k be the vertices assigned to player i. Moreover, let a ( f i , g i ) = ( a v 1 * , , a v | V | * ) . Now define g i = ( g v i 1 , , g v i k ) such that for every 1 j k , and every profile ( a u 1 , , a u ) , where u 1 , , u are the parents of v i j ,
g v i j ( a u 1 , , a u ) = g v i j ( a u 1 * , , a u * ) .
Thus, g i is clearly an unconditional strategy for player i. Let a ( f i , g i ) = ( a v 1 * * , , a v | V | * * ) . By a straightforward induction on depth ( w ) it can then easily be shown that a w * * = a w * for every vertex w. And thus, u i ( f i , g i ) > u i ( f ) , giving us the result. □
The key point about this lemma is that unconditional strategies are small, in the sense that unconditional strategies can be represented by Boolean circuits whose size is polynomial in the set of actions (see Section 3.1, above). Thus, when we are considering whether a player has a beneficial deviation, we can without loss of generality restrict our attention to small strategies. This has implications for the complexity of the decision problems we consider.
The dependency graph provides a concise representation of the sets of conditional strategies available to the players. A single strategy f v for the player i playing at vertex v, however, has to take into account all profiles ( a y 1 , , a y k ) in A y 1 × × A y k , where y 1 , , y k are the parents of v. The number of these profiles tends to be exponential in k, the number of parents, as in all non-trivial cases generally | A v | 2 . Similarly, a naive representation of the player’s utilities for the different action profiles tends to be exponential in the number of actions available to the players in the game. However, as the utilities of players are represented by Boolean circuits, there is a straightforward polynomial transformation of Boolean games and Boolean partial order games to general partial order games. This enables us to leverage hardness results for Boolean and partial order Boolean games to obtain hardness results for general partial order games.
We first consider the decision problem of determining whether a given strategy profile is a Nash equilibrium for a given partial order game. Here, we assume that the utility function u i of each player is represented by a Boolean circuit C u i and the strategy profile f = ( f v 1 , , f v | V | ) is represented by a sequence of | V | Boolean circuits C f v 1 , , C f v | V | .
is-nash
Given: Partial order game G and strategy profile f = ( f v 1 , , f v | V | )
Problem: Is  f a Nash equilibrium of G?
We find that is-nash is intractable for partial order games.
Theorem 1.
is-nashis coNP -complete. The problem remains coNP -hard for empty-order games and total-order games.
Proof. 
For membership in coNP , let partial order game G and conditional strategy profile f = ( f v 1 , , f v | V | ) be given. We can guess a player i and a strategy g i = ( g v 1 , , g v m ) , where v 1 , , v m are the vertices assigned to i. By virtue of Lemma 4, we may assume that strategies g v 1 , , g v m are all unconditional strategies. As we saw in Section 3.1, the Boolean circuits C g v 1 , , C g v m can therefore each be assumed to be of size polynomial in | A | . Now, Lemma 2 allows us to find in polynomial time the utilities u i ( f i , g i ) and u i ( f i , f i ) . Then, f = ( f v 1 , , f v | V | ) is not a Nash equilibrium if and only if u i ( f i , g i ) > u i ( f i , f i ) , and so we may conclude that is-nash is in coNP .
For coNP -hardness, we reduce from is-nash for Boolean partial order games, which is known to be coNP -hard (see in [1], Proposition 1). Thus, let an instance of is-nash for Boolean partial order games be given by a Boolean game ( N , Φ , Φ 1 , , Φ | N | , γ 1 , , γ | N | ) , a dependency graph ( Φ , E ) , and a strategy profile f = ( f 1 , , f | N | ) , where for each player i strategy f i is given by a sequence of choice equations p 1 : φ 1 through p m : φ m , where Φ i = { p 1 , , p m } . In Section 4.1, we argued how this Boolean partial order game coincides with a partial order game with a slightly different representation of strategies and utilities. A similar remark concerns the strategy profile f = ( f 1 , , f | N | ) . As we found that the transformation of the representation of the game as a Boolean partial order game to the representation of the same game as a partial order game can be effected in polynomial time, we obtain our result. Moreover, because Boolean games are a special type of empty-order games, the problem remains hard for empty-order games as well.
To see that is nash also remains coNP -hard for total-order games, we adapt the proof for the is nash problem for Boolean games as presented in (Wooldridge et al. [15], Proposition 1). We reduce from the complement of satisfiability, the problem of determining if a given Boolean formula φ is satisfiable. To this end, let φ a Boolean formula over propositional variables p 1 , , p m . We construct a total-order Boolean game with one player i controlling p 1 , , p m along with an additional variable p 0 . The dependency graph ( V , E ) is then given by V = { v p 0 , , v p m } with relation E such that p j E p k if and only if j < k . Player i has as goal γ i = φ p 0 . Now, consider the strategy profile f 0 = ( f v p 0 0 , f v p 1 0 , , f v p m 0 ) that sets all variables to false, that is, f 0 is given by choice equations p j : for all 0 j m . Now observe that, as we are dealing with a one-player game, there is a natural surjection that maps each of player i’s strategies f to valuation α a f : { p 0 , , p m } { 0 , 1 } . It can then easily be appreciated that f 0 is a Nash equilibrium in the game constructed if and only if φ is satisfiable, as desired. □
Another canonical problem is more general, in that it asks whether a partial order game has any Nash equilibria at all, as opposed to whether or not a specific strategy profile is a Nash equilibrium.
non-emptiness
Given: Partial order game G
Problem: Does G have a Nash equilibrium?
In view of Proposition 2, non-emptiness is vacuous for total-order games, as in this class of games Nash equilibria are guaranteed to exist. The problem is considerably more difficult, namely NEXPTIME -complete, for general partial order games. It also seems a fair conclusion to draw from this contrast that increase in computational complexity arises from the structure of the dependency graph.
Theorem 2.
non-emptinessis NEXPTIME -complete.
Proof. 
A NEXPTIME algorithm to decide non-emptiness can be designed along the following lines. First, guess a strategy profile f = ( f v 1 , , f v | V | ) . Given that strategies f v are given by a Boolean circuit C f v , this can be achieved in time not more than exponential in | A | , the size of the set of actions. Second, to decide whether f = ( f v 1 , , f v | V | ) is a Nash equilibrium, by virtue of Lemma 4, it suffices to check, for all players i and all unconditional strategies g i = ( g v 1 , , g v | V i | ) for i, that u i ( f i , f i ) u i ( f i , g i ) . For player i, there are | A v 1 × × A v | V i | | unconditional strategies which is upper bounded by | A | | V | . Moreover, as we have seen in Section 3.1, each of these unconditional strategies can be represented by a sequence of Boolean circuits of size polynomial in | A | . Furthermore, the action profiles a ( f i , f i ) and a ( f i , g i ) can be computed in time exponential in | A | , by virtue of Lemma 1 and each circuit involved being at most exponential in the size of | A | . Finally, on the basis of Lemma 2, we can check in time polynomial in | A | the utilities for player i on action profiles a ( f i , f i ) and a ( f i , g i ) , and therefore also whether u i ( f i , f i ) u i ( f i , g i ) . Altogether, the algorithm runs in non-deterministic exponential time.
A proof of NEXPTIME -hardness of non-emptiness can be achieved by a reduction from the non-emptiness problem for Boolean partial order games. We rely here on the same direct reduction as in the proof of Theorem 1, above. NEXPTIME -hardness for Boolean partial order games was established by [1], giving us the result. □
If we restrict attention to empty-order games, however, non-emptiness has considerably lower computational costs, even though the problem still remains Σ 2 p -hard. Key to this result is the observation that for empty-order games all strategies are unconditional, and, thus, they can be represented by Boolean circuits of polynomial size (see Section 3.1).
Theorem 3.
For empty-order games,non-emptinessis Σ 2 p -complete.
Proof. 
To see that non-emptiness is in Σ 2 p , recall that Σ 2 p = NP coNP , that is, the set of problems that can be solved in polynomial time on a non-deterministic Turing machine with a coNP -oracle. Furthermore, recall that strategy profiles in empty-order games are unconditional and are of the form f v : { ( ) } A v . Moreover, they can be represented by Boolean circuits of polynomial size (see Section 3.1, above). Accordingly, given an empty-order game, we can guess an unconditional strategy profile f = ( f v 1 , , f v | V | ) , and consult the coNP -oracle to check whether f is a Nash equilibrium. Theorem 1 guarantees that the latter is feasible.
For Σ 2 p -hardness, recall that Boolean games constitute a subclass of empty-order partial order games. Σ 2 p -hardness then follows immediately from non-emptiness being Σ 2 p -hard for Boolean games (see in [13], Proposition 5). □
Recall that in partial order games, strategy profiles and action profiles are essentially different objects. As a natural counterpart to the is nash problem, we therefore now consider the decision problem whether a given action profile is sustained by a Nash equilibrium in a partial order game.
is nash actions
Given: Partial order game G and action profile a = ( a v 1 , , a v | V | )
Problem: Is a sustained by a Nash equilibrium?
In sharp contrast to coNP -completeness of is nash, we find that is nash actions is NEXPTIME -complete. From the perspective of computational complexity, is nash actions appears to be more kindred to the non-emptiness problem for partial order games. In this connection, it is worth observing that the proof of Bradfield et al. regarding the NEXPTIME -completeness of non-emptiness for Boolean partial order games in [1] relied on a reduction from dependency quantifier boolean formula game, which is defined as follows.
An instance of dqbfg is a tuple φ , X 1 , X 2 , Y 1 , Y 2 , where φ is a Boolean formula φ and { X 1 , X 2 , Y 1 , Y 2 } a partition of the propositional variables over which φ is defined. dqbfg then concerns the following game with three players—B (‘Black’), W 1 (‘White 1’), and W 2 (‘White 2’)—where B forms one team and W 1 and W 2 another team W. Player B chooses an assignment for the variables in X 1 X 2 = { x 1 1 , , x | X 1 | 1 } { x 1 2 , , x | X 2 | 2 } , player W 1 for those in Y 1 = { y 1 1 , , y | Y 1 | 1 } , and player W 2 for those in Y 2 = { y 1 2 , , y | Y 2 | 2 } . Player B chooses first, then W 1 and W 2 choose, on the understanding that W 1 can only see the assignment B chooses for W 1 and W 2 only the assignment B chooses for Y 2 . Team B aims to make φ true, whereas team W’s goal is to make φ false. If the overall assignment for X 1 X 2 Y 1 Y 2 satisfies φ , then B wins, otherwise W. A positive instance of dqbfg is when team W has a winning strategy. dependency quantifier boolean formula game was shown to be NEXPTIME -complete by Hearn and Demaine in [27].
Theorem 4.
is nash actionsis NEXPTIME -complete.
Proof. 
For membership in NEXPTIME , let a = ( a v 1 , , a v | V | ) be the action profile that is given as input. Recall from the proof of Theorem 2 that we can guess a strategy profile f and check whether it is a Nash equilibrium in exponential time, provided that f is represented by a Boolean circuit. By Lemma 1 we know we can additionally compute a f in polynomial time and check whether a f = a . As the latter can also be achieved in polynomial time, we obtain our result.
For NEXPTIME -hardness, we reduce from dependency quantifier boolean formula game (dqbfg). Given an instance φ , X 1 , X 2 , Y 1 , Y 2 of dqbfg we construct a partial order Boolean game with the same three players, with B controlling the variables in X 1 X 2 { x 0 } , player W 1 those in Y 1 , and player W 2 those in Y 2 , where x 0 is a “fresh” variable not in X 1 X 2 Y 1 Y 2 . Let the players’ goals be given by
γ B = x 0 φ γ W 1 = γ W 2 =
The dependency graph ( X 1 X 2 Y 1 Y 2 { x 0 } , E ) is defined such that
(i)
x 1   E   y 1 , for all x 1 X 1 and y 1 Y 1 ,
(ii)
x 2   E   y 2 , for all x 2 X 2 and y 2 Y 2 ,
(iii)
x 0   E   z , for all z X 1 X 2 Y 1 Y 2 .
Now, consider the action profile
a ¯ = ( x ¯ 0 , x ¯ 1 1 , x ¯ | X 1 | 1 , x ¯ 1 2 , x ¯ | X 2 | 2 , y ¯ 1 2 , y ¯ | Y 1 | 1 , y ¯ 1 2 , y ¯ | Y 2 | 2 ) ,
which sets all variables, including x 0 , to false. Note that none of the three players win if a ¯ is played. We are now in a position to prove that φ , X 1 , X 2 , Y 1 , Y 2 is a positive instance of dqbfg if and only if a ¯ is sustained by a Nash equilibrium in the partial order Boolean game constructed.
First assume that φ , X 1 , X 2 , Y 1 , Y 2 is positive instance of dqbfg. Then, the white team has a joint winning strategy in the original dqbfg-game given by Boolean functions β 1 and β 2 on the variables X 1 and X 2 , respectively. Then, define strategies f W 1 * and f W 2 * for W 1 and W 2 , respectively, in the Boolean partial order game, that are given by the following choice equations for i = 1 , 2 and j = 1 , , | Y i | :
y j i : ( x 0 φ β i ( x 1 i , , x | X i | i ) ) ( x ¯ 0 )
As the dependency graph ( V , E ) respects the information dependencies of the dqbfg-instance, observe that f W 1 and f W 2 together embody a winning strategy to render φ false, if x 0 is played. Let B’s unconditional strategy f B * be defined by the choice equations of the form x j i : , setting x j i to false for all for i = 1 , 2 and j = 1 , , | X i | . Observe that, defined thus, a f * = a ¯ . Also note that f * = ( f B * , f W 1 * , f W 2 * ) is a Nash equilibrium. As x 0 will be set to false, all players lose when f * is played, but none has an incentive to deviate either. Player B could only hope to win by setting at least x 0 to true. If so, however, W 1 and W 2 are playing a winning strategy against B, dashing all the latter’s hopes to win after all. Players W 1 and W 2 will lose, no matter which strategy they adopt. As a consequence, they do not want to deviate either, and we may conclude that f * is a Nash equilibrium sustaining action profile a ¯ .
Finally, assume that φ , X 1 , X 2 , Y 1 , Y 2 is a negative instance of dqbfg. Then, team W does not have a winning strategy, meaning that for every strategy f W = ( f W 1 , f W 2 ) , player B has a best response f B such that B wins under f = ( f W 1 , f W 2 , f B ) . Now, consider the partial order Boolean game and an arbitrary strategy profile f ¯ with a f ¯ = a ¯ . Some reflection reveals that under these circumstances, player B has a strategy f B at their disposal setting x 0 to true and incorporating a winning response to f ¯ W = ( f ¯ W 1 , f ¯ W 2 ) . Therefore, player B has an incentive to unilaterally deviate from f ¯ and play f B instead. We may conclude that f ¯ is not a Nash equilibrium, as desired. □
For the two extremal classes of empty-order games and total order games, is nash actions are less computationally demanding. More precisely, the problem is coNP -complete for empty-order games and PSPACE -complete for total-order games. The proof of the former statement is relatively straightforward, when one realises that in empty-order games all strategies are unconditional and that, consequently, there is a natural bijection between action profiles and strategy profiles.
Theorem 5.
For empty-order games,is nash actionsis coNP -complete.
Proof. 
First recall that, in empty-order games, there is a natural bijection between action profiles and strategy profiles, mapping each action profile a = ( a v 1 , , a v | V | ) to strategy profile f a = ( f v 1 a v 1 , , f v | V | a v | V | ) , where f v a v ( ( ) ) = a v . Thus, all strategy profiles of an empty-order game are of this form. Moreover, for arbitrary action profiles a = ( a v 1 , , a v | V | ) and strategy profiles f = ( f v 1 , , f v | V | ) , we have that
a = a f if and only if f v a v = a v , for all v V if and only if f = f a
Moreover, recall that unconditional strategies are represented by polynomial-sized Boolean circuits. Theorem 1 established that is nash is coNP -complete for empty-order games. Altogether, it suffices to show that, for all action profiles a = ( a v 1 , , a v | V | ) of a empty-order game,
a is sustained by a Nash equilibrium if and only if f a is a Nash equilibrium .
The “if”-direction is immediate. If f a is a Nash equilibrium, then, by observing that a f a = a , action profile a is sustained by some Nash equilibrium, namely f a . For the “only if”-direction, assume there is some strategy profile f * with a = a f * . By our earlier observation we find that f * = f a . Hence, f a is a Nash equilibrium, as desired. □
The proof of PSPACE -completeness of is nash actions for total-order games is considerably more involved, and we defer it to the very end of Section 7.2, where the necessary proof elements are in place. Still, we conclude this section with its statement.
Theorem 6.
For total-order games,is nash actionsis PSPACE-complete.

6. Backwards Induction

Backwards induction is the most fundamental technique for the analysis of extensive form games. The basic backward induction algorithm for extensive form games of perfect information runs in time polynomial in the size of the game tree, and computes Nash equilibrium strategy profiles (which are guaranteed to exist in games of perfect information). It is therefore very natural to ask whether approaches based on backward induction might work for partial order games. However, backward induction is not applicable in extensive form games of imperfect information, and as partial order games correspond to imperfect information games, it follows that the technique is not always applicable. This section introduces a backward induction procedure to find pure Nash equilibria in partial order games. Games in which the procedure is well defined—in the sense that the procedure produces at least one strategy profile—we denote as being fit for backwards induction. We relate this latter concept to an informational notion we refer to as scrutability. For games that are fit for backwards induction, we prove that our backwards induction procedure is guaranteed to produce a Nash equilibrium outcome.

6.1. Backwards Induction

Due to their acyclic nature, it would seem that a natural concept of backward induction procedure can straightforwardly be defined for partial order games. Our aim is thus to define a strategy profile f * = ( f v 1 * , , f v | V | * ) that is obtained in the following fashion. One starts with a vertex v of maximal depth with set of parents Y = { y 1 , , y k } , and one inspects the possible actions—rather than their strategies—the players active at the parents of v can play. Let a Y = ( a y 1 , , a y k ) be any such an action profile, suppose i is the player active at v. Then, the action a v * = f v * ( a y 1 , , a y k ) selected by strategy f v * from A v should maximise i’s utility against a Y and all possible choices of action b X at the vertices in X = V \ ( Y { v } ) , provided any such action a v * exists. Subsequently, with the strategies for the vertices of greatest depth thus fixed, the strategies in f * should recursively find optimal strategies at vertices of lesser depth in a similar fashion.
In the case of partial order games, it is not strictly necessary to proceed recursively on the depth of the vertices. We can, instead, use any topological sorting τ = ( v τ 1 , , v τ | V | ) of the vertices that respects the graph and, starting with the vertex v τ j with maximal index τ j = | V | , we iterate through τ until we reach the vertex with minimal topological index. In this section, we will develop this more general concept of backward induction for partial order games that employs a topological order.
Our backwards induction procedure defines recursively a strategy profile f * = ( f v 1 * , , f v | V | * ) for each vertex v relative to a topological sorting τ = ( v τ 1 , , v τ | V | ) , and is formally defined as follows. Let v be a vertex with topological index τ j and let i be the player to move at v. Let Y = { y 1 , , y k } be the set of parents of v, Z = { z 1 , , z m } be the set of vertices with a topological index strictly greater than τ j , and X = { x 1 , , x } be given by V \ ( Y Z { v } ) . For every profile a Y = ( a y 1 , , a y k ) , we define f v * ( a y 1 , , a y k ) as an action a v * in A v such that, for all profiles b X = ( b x 1 , , b x ) ,
a v * arg max a v A v u i ( b X , a Y , a v , f Z * ) ,
where the basis of this recursion is provided by the case where Z = Ø (Here we exploit the notation b X , a Y , a v , f Z * to refer to the action profile that results if after τ -history ( b X , a Y , a v the strategy profile f * is played over the vertices in Z. Formally, b X , a Y , a v , f Z * is the action profile a g for strategy profile g = ( f X b X , f Y a Y , f v a v , f Z * ) , where ( f X b X , f Y a Y , f v a v ) is a profile of unconditional strategies (also see page 9, above)). To illustrate this definition, we have the following example.
Example 4.
Consider the utility function for the game depicted in Figure 10, and fix a topological sorting ( v 1 , v 2 , v 3 , v 4 ) . From inspection, it can easily be appreciated that the utility to player 1 is only dependent on their own choices at vertices v 1 and v 4 , and player 2’s at vertex v 2 . At this moment, we can apply the backwards induction procedure as follows. Inspecting their utility function, we find that player 3 chooses as follows at vertex v 3 :
f v 3 ( a c ) = f f v 3 ( a d ) = e f v 3 ( b c ) = f f v 3 ( b d ) = f
Figure 10. The partial order game from Example 1 with a slightly different utility function.
At vertex v 4 , player 1, can set their strategy such that f v 4 ( c ) = g and f v 4 ( d ) = h , because they are indifferent to any choice of actions by the other players. Player 2 can have their strategy at v 2 depend on player 1’s choice at v 1 . If player 1 chooses a at v 1 , then by playing c at v 2 , action profile a c f g will result and player 2 will obtain utility 3. Whereas, by playing d, action profile a d e h will ensue, with a utility of 1 for player 2. Therefore, f v 2 ( a ) = c . A similar reasoning yields that, if player 1 chooses b at v 1 , by playing c at v 2 , action profile ( b , c , f , g ) results with utility 0 for player 2, whereas, by playing d, action profile ( b , d , f , h ) ensues with utility 2 for player 2. Accordingly, f v 2 ( b ) = d . Finally, by playing a at v 1 , player 1 obtains utility 3, as action profile ( a , c , f , g ) would be played. Observe that by playing b at v 1 , player 1 obtains utility 2, as in that case action profile ( b , d , f , h ) would result. Accordingly, f v 1 ( ( ) ) = a . We thus obtain the strategy profile f = ( f v 1 1 , f v 2 2 , f v 3 12 , f v 4 2 ) yielding action profile ( a , c , f , g ) as the backwards induction solution of this game.
Some reflection reveals that on total-order games the procedure defined thus mimics standard backwards induction on the corresponding extensive-form games of perfect information as described in Section 4.4. It is important to note, however, that, on general partial order games, our backwards induction procedure does not always yield a strategy profile f * . This is because the outcome of a partial order game—and therefore also the players’ utilities as well as the possibility of finding a utility maximising action—need not be fully determined by the actions chosen at a given vertex v, at v’s parent vertices, and at the vertices with a larger topological index than v.
The outcome may also depend on the actions chosen at vertices of an equal or lesser depth that are not parents of the respective vertex. If so, it may happen that, at some stage of the procedure and at some vertex, no action can be singled out as an unequivocal optimal choice based on the choices at the parent nodes and those at the vertices with a greater topological index alone. This would cause the procedure to stall. More formally, this happens if for some vertex v, there are two different profiles b X = ( b w 1 , , b w ) and b X = ( b w 1 , , b w ) such that arg max a v A v u i ( b X , a Y , a v , f Z * ) and arg max a v A v u i ( b X , a Y , a v , f Z * ) are disjoint. The following example illustrates this point.
Example 5.
Consider again the game of Example 1, depicted in Figure 3. Let us first consider the strategies that player 3 can choose at vertex v 3 . This player has to specify an action from among e and f, for each of the possible action profiles player 1 and 2 can choose at vertices v 1 and v 2 , respectively, that is, for the action profiles a c , a d , b c , and b d . Inspecting player 3’s utilities, player 3 should arguably choose f v 3 ( a c ) = f , as no matter what action player 1 chooses at v 4 , player 3 is better off playing f than playing e. However, how to choose a value for f v 3 ( a d ) is not so obvious, as the optimal choice depends on whether player 1 chooses g or h at vertex v 4 ; this is something player 3 may not let their strategy depend upon as not v 3 E v 4 .
To formalise this issue, we say that a partial order game is fit for backwards induction under a topological sorting τ = ( v τ 1 , , v τ | V | ) if, for every vertex v, and every action profile a = ( a v 1 , , a v | V | ) ,
b X A X arg max a v A v u i ( b X , a Y , a v , a Z ) Ø ,
where Y is the set of parents of v, Z the set of vertices with a topological index that is not smaller than that of v, and X = V \ ( Y Z { v } ) . This ensures that, in Equation (1), the optimal action a v * is defined for every vertex v. At this point, it is worth observing that for total-order games, the set X will be empty and that they allow for only one topological sorting. It follows that total-order games vacuously satisfy the fit-for-backwards-induction condition under every topological sorting.
We are now in a position to show that our backwards induction procedure always yields a Nash equilibrium in games that are fit for backwards induction.
Theorem 7.
Let G be a partial order game that is fit for backwards induction under a topological order τ. Then, every strategy profile f * = ( f v 1 * , , f v | V | * ) generated by the backwards induction procedure under τ is a Nash equilibrium.
Proof. 
Let f * = ( f v 1 * , , f v | V | * ) be a strategy profile that our backwards induction procedure yields for topological order τ = ( v τ 1 , , v τ | V | ) . Without loss of generality, we may assume that τ j = j for all 1 j | V | , that is, v 1 , , v | V | are already topologically ordered. Assume for contradiction that f * is not a Nash equilibrium, that is, there is some strategy g i for some player i such that u i ( f i * , g i ) > u i ( f i * , f i * ) . Let g = ( f i * , g i ) .
Now, there is at least one vertex v j with 1 j | V | that is controlled by player i with parents Y = { y 1 , , y k } for which there is an action profile x = ( x v 1 , , x v j 1 ) in A v 1 × × A v j 1 such that:
u i ( x v 1 , , x v j 1 , a v j , f v j + 1 * , , f v | V | * ) < u i ( x v 1 , , x v j 1 , b v j , g v j + 1 , , g v | V | ) ,
where a v j = f v j * ( x y 1 , , x y k ) and b v j = g v j ( x y 1 , , x y k ) (To see this, let a f * = ( a v 1 , , a v | V | ) and a g = ( b v 1 , , b v | V | ) . Then observe that there must be a smallest index 1 j | V | such that v j V i with a v j b v j and a v m = b v m for all 1 m < j , which provides us with the desired v j ). Now we may assume that j is the largest index with v j V i for which such an action profile, or τ -history, x = ( x v 1 , , x v j 1 ) can be found. Therefore,
u i ( x v 1 , , x v j 1 , b v j , g v j + 1 , , g v | V | ) u i ( x v 1 , , x v j 1 , b v j , f v j + 1 * , , f v | V | * ) .
Combining inequalities (2) and (3), we obtain that
u i ( x v 1 , , x v j 1 , a v j , f v j + 1 * , , f v | V | * ) < u i ( x v 1 , , x v j 1 , b v j , f v j + 1 * , , f v | V | * ) .
Having assumed ( v 1 , , v | V | ) to be ordered topologically, the set Y = { y 1 , , y k } of parents of v j is a subset of { v 1 , , v j 1 } . Let further X = { v 1 , , v j 1 } \ Y and Z = { v j + 1 , , v | V | } . Recall that a v j = f v j * ( x y 1 , , x y k ) . We then find that, for profile x Y , there is some profile, namely x X , such that
f v j * ( x y 1 , , x y k ) arg max a v j A v j u i ( x X , x Y , a v j , f Z * ) .
It follows that our backwards induction procedure does not provide us with f v j * , which contradicts our initial assumption regarding the profile f * = ( f v 1 * , , f v | V | * ) . □
We also have the following lemma, which is a version of the one-stage-deviation property for partial order games, and which is an almost immediate consequence of the definition of our backwards induction procedure (also see in [28] (pages 108–110) and Osborne and Rubinstein [5] (page 98)). This property will prove to be of great help proving the computational complexity results in Section 7. Intuitively it says that to establish that a strategy profile f = ( f v 1 , , f v | V | ) is a backwards induction solution, it suffices to look at deviations by a player from f at a single vertex v k given a single history h = ( a v 1 , , a v k ) . The property is perhaps best understood by considering its contrapositive: strategy profile f is not a backwards induction solution if there is a single player who has an incentive to deviate from f at a single vertex v k given a single history h = ( a v 1 , , a v k ) . Note that we here assume that the game in question is fit for backwards induction.
Lemma 5 (One-stage-deviation property).
Let the vertices of a partial order game be sorted according to a topological sorting τ = ( v 1 , , v | V | ) . Then, for games that are fit for backwards induction, profile f = ( f v 1 , , f v | V | ) is a backwards induction solution for τ if and only if for every τ-history ( a v 1 , , a v k 1 ) with 1 k | V | we have for all b v k A v k :
u i ( a v 1 , , a v k 1 , f v k , f v k + 1 , , f v | V | ) u i ( a v 1 , , a v k 1 , b v k , f v k + 1 , , f v | V | ) ,
where i is the player assigned to v k .
Proof. 
First assume that f = ( f v 1 , , f v | V | ) is a backwards induction solution, and consider an arbitrary τ -history ( a v 1 , , a v k 1 ) and equally arbitrary b v k in A v k . Let Y be the parents of v k and X = { v 1 , , v k 1 } \ Y . Then,
u i ( b X , a Y , f v k ( a Y ) , f v k + 1 , , f v | V | ) u i ( b X , a Y , b v k , f v k + 1 , , f v | V | ) ,
for all profiles b X . Now observe that inequality (4) holds for in particular for b X = a X .
For the “if”-direction assume for contraposition that f = ( f v 1 , , f v | V | ) is not a backwards induction solution for τ . Then, there is a vertex v k with minimal topological index, parents Y, and X = { v 1 , , a k 1 } \ Y such that for some profile a Y , we have for every a v k in A v k that there is some profile b X such that
u i ( b X , a Y , a v k , f v k + 1 , , f v | V | ) < u i ( b X , a Y , b v k , f v k + 1 , , f v | V | ) .
At this point, consider any a v k in A v k and let b X be the accompanying profile for which Equation (5) holds. We can conclude the proof by considering the τ -history ( b X , a Y ) . □

6.2. Games of Adequate Information

In this section, we propose the class of adequate information games, which are guaranteed to be fit for backwards induction, and, thus, always have a Nash equilibrium. As an auxiliary concept we first introduce the notion of scrutability, an informational criterion specifying on which choices of which players a player can have its decision depend when choosing an action at a specific vertex.

6.2.1. Scrutability

We introduce scrutability as a technical term to reason about the strategic information that is available to the players at each stage of the backwards induction process. Intuitively, let v and w be a vertices at which, respectively, players i and j are active. Then, assuming that the game is fit for backwards induction, a vertex w is scrutable to v if j’s choices of action at w can make a difference as to the optimality of i’s choice of action at v when deciding on a backwards induction strategy at v. Thus, all vertices reachable from v are scrutable, as their backwards induction choices will already have been made. Moreover, the player active at v can have their choice of action depend on the choices of action at the parent nodes of v.
Formally, we say that a vertex w is scrutable to vertex v if either
(i)
w is reachable from v, or
(ii)
w E v , that is w is one of v’s parents.
We refer to a vertex that is not scrutable to v as being inscrutable to v. Recall the reachability relation is reflexive, and, thus, vertex v is always scrutable to v itself. Observe that the relations of scrutability and inscrutability are not necessarily symmetric. For instance, if a vertex v is reachable from w, but w is not a parent of v, then v is scrutable to w, but not vice versa. If v is inscrutable to w, and w is also inscrutable to v, we say that v and w are independent.
We differentiate between a vertex being scrutable to a vertex and a vertex being scrutable to a player. For V i = { v 1 , , v m } player i’s vertices, we denote by I v the set of vertices that are inscrutable to v, and by S v the set of vertices that are scrutable to v. Furthermore, let I i = I v 1 I v m and S i = V \ I i denote the sets of vertices that are inscrutable and scrutable to player i, respectively. That is, if a vertex v is inscrutable to a player i, if v is inscrutable to at least one vertex at which i is active. Vertex v is scrutable to player i, if it is not inscrutable to i, that is, if v is scrutable to all vertices at which i is active. It may be worth observing that in total-order games all vertices will be scrutable from any other vertex.
Furthermore, we introduce a notion of scrutability that is relative to a topological sorting of the vertices. Therefore, let τ = ( v τ 1 , , v τ | V | ) be a topological sorting. Then, a vertex w with topological index τ w is scrutable under τ to vertex v with topological index τ v , if either
(i)
τ w τ v , or
(ii)
w E v , that is w is one of v’s parents.
Any vertex that is not scrutable under τ is referred to as being inscrutable under τ . For V i = { v 1 , , v m } and topological sorting τ = ( v τ 1 , v τ | V | ) , we furthermore denote by I v τ the set of vertices that are inscrutable to v under τ , and by S v τ the set of vertices that are scrutable to v under τ .
Moreover, we define I i τ = I v 1 τ I v m τ and S i τ = V \ I i τ . Note that the set of vertices scrutable to a player i need not include the set V i of vertices where i is active.
Given a topological sorting τ and a vertex v, by Y, we will generally adhere to the following notational conventions. The parents of v we generally denote by Y, the set of vertices inscrutable to v by X, and the set of all vertices that are not parents of v but still scrutable to v by Z. The following example illustrates these concepts.
Example 6.
Consider the game depicted in Figure 10. Then, we have the following sets of scrutable and inscrutable vertices.
S v 1 = { v 1 , v 2 , v 3 , v 4 } I v 1 = Ø S 1 = { v 2 , v 4 } I 1 = { v 1 , v 3 } S v 2 = { v 1 , v 2 , v 3 , v 4 } I v 2 = Ø S 2 = { v 1 , v 2 , v 3 , v 4 } I 2 = Ø S v 3 = { v 1 , v 2 , v 3 } I v 3 = { v 4 } S 3 = { v 1 , v 2 , v 3 } I 3 = { v 4 } S v 4 = { v 2 , v 4 } I v 4 = { v 1 , v 3 }
For the topological sorting τ = ( v 1 , v 2 , v 3 , v 4 ) , we get
S v 1 τ = { v 1 , v 2 , v 3 , v 4 } I v 1 τ = Ø S 1 τ = { v 2 , v 4 } I 1 τ = { v 1 , v 3 } S v 2 τ = { v 1 , v 2 , v 3 , v 4 } I v 2 τ = Ø S 2 τ = { v 1 , v 2 , v 3 , v 4 } I 2 τ = Ø S v 3 τ = { v 1 , v 2 , v 3 , v 4 } I v 3 τ = Ø S 3 τ = { v 1 , v 2 , v 3 , v 4 } I 3 τ = Ø S v 4 τ = { v 2 , v 4 } I v 4 τ = { v 1 , v 3 }
By contrast, under topological sorting τ = ( v 1 , v 2 , v 4 , v 3 ) , we obtain
S v 1 τ = { v 1 , v 2 , v 3 , v 4 } I v 1 τ = Ø S 1 τ = { v 2 , v 3 , v 4 } I 1 τ = { v 1 } S v 2 τ = { v 1 , v 2 , v 3 , v 4 } I v 2 τ = Ø S 2 τ = { v 1 , v 2 , v 3 , v 4 } I 2 τ = Ø S v 3 τ = { v 1 , v 2 , v 3 } I v 3 τ = { v 4 } S 3 τ = { v 1 , v 2 , v 3 } I 3 τ = { v 4 } S v 4 τ = { v 2 , v 3 , v 4 } I v 4 τ = { v 1 }
For another example, consider Figure 11 along with topological order τ = ( v 1 , v 2 , v 3 , v 4 ) . We then obtain for the vertices the following sets of scrutable and inscrutable vertices for vertices as well as for players:
S v 1 τ = { v 1 , v 2 , v 3 , v 4 } I v 1 τ = Ø S 1 τ = { v 1 , v 2 , v 3 , v 4 } I 1 τ = Ø S v 2 τ = { v 2 , v 3 , v 4 } I v 2 τ = { v 1 } S 2 τ = { v 2 , v 4 } I 2 τ = { v 1 , v 3 } S v 3 τ = { v 1 , v 2 , v 3 , v 4 } I v 3 τ = Ø S v 4 τ = { v 1 , v 2 , v 4 } I v 4 τ = { v 3 }
Figure 11. Another example of a partial order game.
Observe that w being reachable from v implies τ w τ v , in other words, S v S v τ for every topological order τ . Accordingly, plain scrutability is stronger than scrutability under a topological sorting. The following lemma lays down the precise formal interrelationship between scrutability and scrutability under a topological order.
Lemma 6.
Let i be a player of a partial order game and v a vertex controlled by i. Then, for all vertices w,
(i) 
w is scrutable to v if and only if w is scrutable to v under all topological orders τ,
(ii) 
w is inscrutable to v if and only if w is inscrutable to v under some topological order τ,
(iii) 
w is scrutable to i if and only if w is scrutable to i under all topological orders τ,
(iv) 
w is inscrutable to i if and only if w is inscrutable to i under some topological order τ.
Proof. 
We prove part (i) first. Then, parts (ii) through (iv) follow by straightforward set-theoretic reasoning.
(i)
For the “only if”-direction, first assume that w is scrutable to v and consider an arbitrary topological sorting τ with topological indices τ i and τ j for v and w, respectively. Then, either w E v or w is reachable from v, that is, v E * w . In the former case, we have immediately that w is scrutable to v under τ . In the latter case, observe that then τ i < τ j by definition of a topological sorting. It then also follows that w is scrutable to v under τ .
For the “if”-direction, assume for contraposition that w is not scrutable to v. Then, neither w E v nor w being reachable from v hold. In case v is reachable from w, let τ be any topological order with τ i and τ j the topological indices for v and w. Then, τ j < τ i , and it follows that w is not scrutable under τ . In case v is not reachable from w, let ( V , E ) be the DAG underlying the partial order game. Then, consider the edge set E = E { ( w , v ) } . The graph ( V , E ) must be acyclic because if ( V , E ) contained a directed cycle, there would be a simple path from v to w not involving the edge ( w , v ) . Therefore, w would also be reachable from v in ( V , E ) , a contradiction. Therefore, ( V , E ) is acyclic and has a topological sorting τ . Observe that τ is also a topological order for ( V , E ) . Let τ i and τ j be the topologic al indices for v and w, respectively. As w E v , we have τ j < τ i and again we may conclude that w is not scrutable to v under τ .
(ii)
This part follows immediately from part (i), by observing the following equivalences:
w is inscrutable to v if and only if w is not scrutable to v if and only if w is not scrutable to v under some topological order τ if and only if w is inscrutable to v under some topological order τ
(ii)
Let v 1 , , v m be the vertices assigned to player i, and let τ 1 , , τ k be the topological sortings for the game. From (ii) we obtain that I v j = I v j τ 1 I v j τ k for every 1 j m .
(iv)
Therefore,
I i = 1 j m I v j = 1 j m I v j τ 1 I v j τ k = 1 k I v 1 τ I v m τ = 1 k I i τ ,
and it follows that w is inscrutable to i if and only if w is inscrutable to i under some topological sorting. Part (iii) then is an immediate consequence of de Morgan’s law:
S i = V \ I i = V \ ( I i τ 1 I i τ k ) = V \ I i τ 1 V \ I i τ k = S v 1 τ 1 S i τ k = 1 k S i τ .
Therefore, w is scrutable to i if and only if i is scrutable to i under all topological sortings.
This concludes the proof. □

6.2.2. Games of Adequate Information

In this section, we define a class of partial order games, which we refer to as games of adequate information. We prove that games of adequate information are fit for backward induction, and thus, on account of Theorem 7, allow for Nash equilibria. For this, we require that each player at each vertex assigned to them is in a position to determine their optimal choices if they are given only the choices at their parents and those at the nodes with a greater topological index.
Formally, we introduce the concept of a utility function being of adequate information and a variant of the notion parametrised by a topological order τ . These concepts only differ in the vertices that are considered to be scrutable and inscrutable. Let i be a player with utility function u i : A v 1 × × A v | V | R . Without loss of generality, we assume that V i = { v 1 , , v m } . Let S be the set of vertices scrutable to i (under τ ) and I the set of vertices inscrutable to i (under τ ). Then, player i’s utility function u i : A V R is said to be of adequate information (under τ ) or adequately informed (under τ ), if there is a function u i 0 : A S R such that for all profiles a S , a S A S and b I A I
u i ( a S , b I ) u i ( a S , b I ) if and only if u i 0 ( a S ) u i 0 ( a S ) .
If this is the case, we say that player i’s utility function u i : A R (ordinally) reduces to  u i 0 : A S i R .
A game of adequate information under τ is, then, a partial order game in which each player’s utility function is of adequate information under τ . We also define a concept of adequate information that is irrespective of the topological sorting τ in the straightforward manner. Thus, a game of adequate information simpliciter is a partial order game in which each player’s utility function is of adequate information. Observe that total-order games constitute a subclass of adequate-information games, since such games do not allow for inscrutable vertices. To illustrate, we have the following example:
Example 7(Game of adequate information).
Consider the game depicted in Figure 10 again. Under topological sorting τ = ( v 1 , v 2 , v 3 , v 4 ) , player 1 has scrutables S 1 τ = { v 2 , v 4 } and inscrutables I 1 τ = { v 1 , v 3 } . For player 1, observe that for all a A v 2 × A v 4 and all b , b A v 1 × A v 3 we have that u 1 ( a , b ) = u 1 ( a , b ) . For instance, u 1 ( c g , a e ) = u 1 ( c g , a f ) = 3 and u 1 ( d g , a f ) = u 1 ( d g , b f ) = 0 . Accordingly, define u 1 0 : A v 2 × A v 4 R such that
u 1 0 ( c g ) = 3 u 1 0 ( c h ) = 1 u 1 0 ( d g ) = 0 u 1 0 ( d h ) = 2 ,
and find that u 1 ordinally reduced to u 1 0 and, hence, that player 1 is adequately informed in this game. For players 2 and 3, we saw that S 2 τ = S 3 τ = { v 1 , v 2 , v 3 , v 4 } and I 2 τ = I 3 τ = Ø . As some reflection reveals, u 2 and u 3 thus vacuously reduce to themselves. We therefore immediately see that their utility functions are of adequate information under τ as well.
Finally, consider once more the game depicted in Figure 11 along with the natural topological order τ = ( v 1 , v 2 , v 3 , v 4 ) . Under τ, this game is of adequate information. As I 1 τ = Ø , this is easy to see for player 1. For player 2 we have S 2 τ = { v 2 , v 4 } and I 2 τ = { v 1 , v 3 } . Now, we can define u 2 0 such that
u 2 0 ( c g ) = 1 u 2 0 ( c h ) = 4 u 2 0 ( d g ) = 8 u 2 0 ( d h ) = 2
It then only requires an easy check to see that u 2 ordinally reduces to u 2 0 . For instance, u 2 0 ( c g ) < u 2 0 ( d g ) and also
u 2 ( c g , a e ) = 2 < 9 = u 2 ( d g , a e ) u 2 ( c g , a f ) = 3 1 2 < 7 = u 2 ( d g , a f ) u 2 ( c g , b e ) = 2 < 7 = u 2 ( d g , b e ) u 2 ( c g , b f ) =   2   <   4 = u 2 ( d g , b f )
By contrast, under topological order τ = ( v 2 , v 1 , v 4 , v 3 ) , player 1’s utility function is not of adequate information. To see this, first observe that S 1 τ = { v 1 , v 3 , v 4 } and I 1 τ = { v 2 } . Now, u 1 ( a e g , c ) = 2 < 9 = u 1 ( a e h , c ) , whereas u 1 ( a e g , d ) = 5 > 3 = u 1 ( a e h , d ) . Had there been a function u 1 0 : A S 1 τ R such that u 1 ordinally reduces to u 1 0 , then both u 1 0 ( a e g ) < u 1 0 ( a e g ) and u 1 0 ( a e g ) > u 1 0 ( a e g ) , a contradiction.
The following result establishes that games of adequate information (under a topological sorting τ ) are fit for backwards induction (under τ ).
Theorem 8.
Let τ = ( v τ 1 , , v τ | V | ) be a topological sorting. Then,
(i) 
games of adequate information under τ are guaranteed to have a Nash equilibrium, which can be found using our backward induction procedure under τ, and
(ii) 
games of adequate information simpliciter are guaranteed to have a Nash equilibrium, which can be found using our backward induction procedure underanytopological sorting.
Proof. 
Let τ = ( v τ 1 , , v τ | V | ) be an arbitrary topological sorting. For part (i), due to Theorem 7, it suffices to prove that games of adequate information under τ are fit for backward induction under τ . To this end, consider an arbitrary game of adequate information under τ and consider an arbitrary vertex v. Let i be the player assigned to v. Furthermore, let I i τ = { w 1 , , w k } denote the set of vertices that are inscrutable to i under τ and S i τ = { w k + 1 , , w | V | } denote the set of vertices that are scrutable to i under τ . We let Y denote the set of parents of v, and Z the set of vertices with a strictly greater topological index than v. Then, set X = V \ ( Y Z { v } ) and let X = { x 1 , , x | X | ] } . Observe that X is the set of vertices that are inscrutable to v under τ , and thus all vertices in X are inscrutable to i under τ , that is, X I i τ .
By assumption, player i’s utility function u i : A R is adequately informed under τ . In particular, we may assume that u i ordinally reduces to a function u i 0 : A S i τ R .
Now, consider an arbitrary action profile a = ( a 1 , , a | V | ) . Having assumed A to be finite, there must be an action a v * in A v with
a v * arg max a v A v u i ( a X , a Y , a v , a Z ) .
Let a * = ( a X , a Y , a v * , a Z ) .
Consider an arbitrary a v in A v , and let a = ( a X , a Y , a v , a Z ) . Then, u i ( a * ) u i ( a ) . Having assumed that the game is of adequate information we may assume that there is a function u i 0 : A S i τ R such that
u i 0 ( a S i τ * ) u i 0 ( a S i τ ) .
Consider an arbitrary profile b in A . Then, u i ( b I i τ , a S i τ * ) u i ( b I i τ , a S i τ ) . Now, either v is scrutable to i or v is inscrutable to i. If the latter, I i τ = X { v } . Then, u i ( b X , a Y , a v * , a Z ) = u i ( b X , a Y , a v , a Z ) . With a v having been chosen arbitrarily, a v * arg max a A v u i ( b X , a Y , a v , a Z ) . If the former, I i τ = X , and we find that u i ( b X , a Y , a v * , a Z ) u i ( b X , a Y , a v , a Z ) . Again, we may conclude that a v * arg max a A v u i ( b X , a Y , a v , a Z ) . In either case, and with b having been chosen arbitrarily, a v * b X A X arg max a v A v u i ( b X , a Y , a v , a Z ) , which proves the case.
Part (ii) follows by an argument that runs along analogous lines as that for part (i), reasoning about the vertices that are inscrutable to i simpliciter rather than under a specific topological sorting τ . In this context, observe that, by virtue of Lemma 6, the set I i τ of vertices inscrutable to i under a topological sorting τ constitutes invariably a subset of the set I i of vertices inscrutable to i simpliciter. □

7. Computing Backwards Induction Solutions for Partial Order Games

In this section, we explore a number of natural computational problems surrounding our backwards induction procedure for partial order games and games of adequate information. We first conduct an asymptotic analysis of our backwards induction procedure. We find that the procedure runs in exponential time in the general case, but that a considerable speed up is possible if the in-degree of the vertices or the number of their parents is bounded appropriately. Subsequently, we investigate the computational complexity of deciding whether a partial order game is fit for backwards induction, that is if the precondition for our procedure to produce a Nash equilibrium holds. In a similar vein, we also address the closely related question of whether a given partial order game is a game of adequate information. We find that both these problems are coNP -complete. Finally, we come to consider the problems of deciding whether a given strategy profile is a backwards induction solution and whether a given action profile is sustained by a backwards induction solution. We show that the latter problem is NEXPTIME -complete for the general case, but that it is PSPACE -complete for total-order games.

7.1. Asymptotic Complexity of Backwards Induction

Due to the compactness of the partial order game representation, our backwards induction procedure runs in time exponential in the number | V | of vertices. Let v 1 , , v | V | be ordered according to the topological sorting τ under which backwards induction is conducted. Throughout this section, as we did before, for each vertex v, by Y we denote the parents of v, by Z the vertices with a greater topological index than v, and let X = V \ ( Y Z { v } ) . Then, for each vertex v j , where 1 j | V | , with parents Y and τ -inscrutable vertices X we have to find, an action a v j * in A v j such that, for i the active player at v j ,
a * b X A X arg max a v j A v j u i ( b X , a Y , a v j , f Z * ) ,
and abort if no such action exists. This requires for each profile a Y in A Y the computation of | A v j | · u X | A u | utility values u i ( a X , a Y , a v j , a Z * ) . Letting A ¯ = max v V A v , this makes for a total number of utility computations of
j = 1 | V | k = 1 j | A k | j = 1 | V | k = 1 j | A ¯ | = j = 1 | V | | A ¯ | j = j = 0 | V | | A ¯ | j 1 = | A ¯ | | V | + 1 1 | A ¯ | 1 1 = O ( | A ¯ | | V | ) .
Assuming that the players’ utilities are represented by a Boolean circuit, each of these can be achieved in time polynomial in | A | . Following the definition, we can therefore compute a backwards induction solution in O ( | A | | V | ) time.
It is not hard to see that, if we restrict ourselves to partial order games of adequate information in which all vertices have at most log 2 ( | A | ) parents, a backwards induction profile can be computed in time pseudo-polynomial in | A | . If, in addition to their having at most log 2 ( | A | ) parents, every vertex v of a game of adequate information has at most two actions, that is, | A v | 2 , then a backwards induction profile can be computed in time polynomial in | A | . Summarising, we have the following proposition.
Proposition 5.
Let Y v denote the set of parents of a vertex v. For a partial order game of adequate information, a backwards induction profile can then be computed in time:
(i) 
O ( | A | | V | ) , in the general case,
(ii) 
O ( | A | log 2 ( | A | ) ) , if | Y v | log 2 ( | A | ) for all vertices v,
(iii) 
O ( | A | ) , if | Y v | log 2 ( | A | ) and | A v | 2 for all vertices v.

7.2. Computational Complexity of Backwards Induction

In this section, we investigate the natural computation problems of deciding whether a backwards induction solution exists in a given partial order game for a given topological order. Furthermore, we analyse how hard it is to decide whether a given strategy profile incorporates a backwards induction solution. Finally, recalling that action profiles and strategy profiles are essentially different objects in the setting of partial order games, we explore how computational complex it is to decide whether a given action profile is sustained by a backwards induction solution, that is, whether, for a given action profile a , there is a backwards induction solution f such that a = a f . Throughout this section, we assume that strategies and utilities are concisely represented as Boolean circuits as outlined in Section 3.1.
First, we consider the following computational problem concerning the conditions under which a partial order game is fit for backwards induction. Formally:
fit for backwards induction
Given: Partial order game G and a topological sorting τ of its vertices
Problem: Is G fit for backwards induction under  τ ?
For total-order games, fit for backwards induction is vacuous, as they are all fit for backwards induction. For general partial order games, however, we have the following.
Theorem 9.
fit for backwards inductionis coNP -complete.
Proof. 
To see that fit for backwards induction is in coNP , let τ be a topological sorting of the game’s vertices. A certificate for a counterexample is then given by
(i)
a vertex v, with A v = { a 1 , , a k } , and X, Y, and Z the set of inscrutable vertices with a lower topological index, the set of v’s parents, and the vertices with a higher topological index than v, respectively;
(ii)
a profile a Y in A Y ;
(iii)
a profile a Z in A Z ;
(iv)
a tuple ( a 1 , , a k ) of k actions a j in A k ; and
(v)
a tuple ( b X 1 , , b X k ) of k profiles b X j in A X .
It is easy to verify that such a certificate is still of polynomial size. Observe that, if u i ( b X j , a Y , a j , a Z ) > u i ( b X j , a Y , a j , a Z ) holds for some 1 j k , we may conclude that
a j b X A X arg max a v A v u i ( b X , a Y , a v , a Z ) .
Therefore, such a certificate is a counterexample to the game being fit for backwards induction under τ if and only if u i ( b X j , a Y , a j , a Z ) > u i ( b X j , a Y , a j , a Z ) for every 1 j k . Observe that determining the sets X, Y, and Z can be achieved in polynomial time. Having assumed, moreover, that the utility function u i is part of the input and represented by a Boolean circuits and evaluation problem for Boolean circuits is solvable in polynomial time, we find that this is a property that can be checked in polynomial time.
For coNP -hardness, we reduce the complement of satisfiability. Let φ be a propositional formula in the propositional variables p 1 , , p m . Let q and r be two additional, fresh, propositional variables. We construct a partial order Boolean game with three players, player 1, player 2, and player 3 on a DAG ( V , E ) with V = { v q , v r , v p 1 , , v p m } and E such that ( v , u ) E whenever v = v q and u = v p j for some 1 j m , or v = v p j for some 1 j m and u = v r . Furthermore, see Figure 12 for an illustration of the construction of such a DAG in the case φ is defined over five propositional variables. Let v q be assigned to player 1, the vertices v p 1 , , v p m to player 2, and v r to player 3. Furthermore, assume that γ 1 = γ 2 = and γ 3 = φ ( q r ) . Consider an arbitrary topological sorting τ of the vertices.
Figure 12. Construction of the DAG for the coNP-hardness proof for fit for backwards induction in the case of the formula φ being defined over five variables p1 through p5.
First, assume that φ is not satisfiable. Then, all players are indifferent between all outcomes, and it can easily be seen that the game is fit for backwards induction under τ .
Finally, assume that assignment α : { p 1 , , p m } { 0 , 1 } satisfies φ . Observe that v q is inscrutable to v r and has a lower topological rank under τ . Moreover, v p 1 , , v p m are the parents of v r . Now, let a v p 1 , , a v p m be such that a p j = p j if α ( p j ) = 1 , and a p j = p ¯ j if α ( p j ) = 0 . Furthermore, let b v q = q and b v q = q ¯ , and a v r = r and a v r = r ¯ . Then,
u 3 ( q , a v p 1 , , a v p m , r ) > u 3 ( q , a v p 1 , , a v p m , r ¯ )
but
u 3 ( q ¯ , a v p 1 , , a v p m , r ) < u 3 ( q ¯ , a v p 1 , , a v p m , r ¯ ) .
Accordingly, we may conclude that the game is not fit for backwards induction. □
The closely related decision problem of whether a given partial order game is of adequate information, can be formulated as follows.
adequate information
Given: Partial order game G and topological sorting  τ
Problem: Is G a game of adequate information under  τ ?
Again, note that adequate information is vacuous for total-order games. For general partial order games, we find that the problem of adequate information is not easier than fit for backwards induction. The similarity of the two problems—and so also the closeness of the concepts of fitness of backwards induction and adequate information—is revealed by the closeness of their respective proofs of coNP -completeness.
Theorem 10.
adequate informationis coNP -complete.
Proof. 
For membership in coNP , let a certificate for a counterexample consist of a player i, a pair of profiles a S and a S in A S , and a pair of profiles b I and b I in A I , where S is the set of vertices that are scrutable to i under τ and I the set of vertices that are inscrutable to i under τ . Observe that such a certificate is of polynomial size and that the sets S and I can be computed in polynomial time. Then, such a certificate ( a S , a S , b I , b I ) defines a counterexample against the game being of adequate information whenever it is not the case that u i ( a S , b I ) u i ( a S , b I ) if and only if u i ( a S , b I ) u i ( a S , b I ) . Having assumed that the utility function u i is part of the input and represented by a Boolean circuit and evaluation problem for Boolean circuits is solvable in polynomial time, we find that this is a property that can be checked in polynomial time (see Lemma 2).
The proof of coNP -hardness is by a reduction from the complement of satisfiability, and runs along analogous lines as the proof of coNP -hardness of fit for backwards induction. Thus, given a propositional formula φ in the variables p 1 , , p k , construct the partial order Boolean game as in the proof of Theorem 9.
Then, if φ is not satisfiable, we find that for all three players, each action profile yields a utility of 1. It thus immediately follows that the game is of adequate information, as we can set, for instance, u i 0 ( a S ) = 1 for all players i with set S of scrutable vertices under τ .
For the opposite direction, assume that assignment α : { p 1 , , p m } { 0 , 1 } witnesses the satisfiability of φ . Observe that v q is inscrutable to player 3 under τ , whereas v q , v p 1 , , v p m are scrutable to player 3. Now let a v p 1 , , a v p m be such that a p j = p j if α ( p j ) = 1 , and a p j = p ¯ j if α ( p j ) = 0 . Then,
u 3 ( q , a v p 1 , , a v p m , r ) > u 3 ( q , a v p 1 , , a v p m , r ¯ )
but
u 3 ( q ¯ , a v p 1 , , a v p m , r ) < u 3 ( q ¯ , a v p 1 , , a v p m , r ¯ ) .
Assuming that the game be of adequate information, there then should be a utility function u 3 0 : A S R such that from the former it would follow that u 3 0 ( a v p 1 , , a v p m , r ) > u 3 0 ( a v p 1 , , a v p m , r ¯ ) , whereas from the latter u 3 0 ( a v p 1 , , a v p m , r ) < u 3 0 ( a v p 1 , , a v p m , r ¯ ) , a contradiction. Accordingly, we may conclude that the game is not of adequate information. □
Using Theorems 7 and 8, we know that games that are fit for backwards induction, like games of adequate information, have Nash equilibria. Thus, Theorems 9 and 10 could be seen to prove coNP -completeness of weak versions of the non-emptiness problem for backwards induction. This still leaves the question of whether a given strategy profile is a backwards induction solution and of whether a given action profile is sustained by a backwards induction solution, which can be seen as variants of the non-emptiness problem for backwards induction. The first decision problem can be formulated formally as follows.
is backward induction
Given: Partial order game G that is fit for backwards induction under topological sorting  τ and strategy profile f = ( f v 1 , , f v | V | )
Problem: Is  f a backwards induction solution of G under  τ ?
We find that is backward induction is coNP -complete. For the membership part, recall that Lemma 5 showed that our backwards induction procedure has the one-stage-deviation property. Let τ be a topological sorted and assume ( v 1 , , v | V | ) be sorted according to τ . The contrapositive of the one-stage-deviation property says that, if a profile f is not a backwards induction solution, there is one player, who, after some τ -history, wants to unilaterally deviate and play another action than the one prescribed by f , but otherwise has no incentive to change their strategy. This prepares the ground for the following result.
Theorem 11.
is backward inductionis coNP -complete. The problem remains coNP -hard for total-order games.
Proof. 
For membership in coNP , one can design a non-deterministic algorithm along the following lines. Assume v v 1 , , v v | V | is ordered according to τ . Guess a vertex v k assigned to player i along with action profiles a = ( a X , a Y , a v k , a Z ) and a = ( a X , a Y , a v k , a Z ) (thus, a X = a X and a Y = a Y ). Let Y denote the parents of v k , Z the vertices with a greater topological index than v k , and X = V \ ( Y Z { v k } ) . Then, check whether
(i)
the actions a v k , a v k + 1 , a v | V | are in accordance with f = ( f v 1 , , f v | V | ) , that is, for every k m | V | , we have a v m = f v m ( a Y m ) , where Y m is the set of parents of v m ;
(ii)
the actions a v k + 1 , , a v | V | are in accordance with f = ( f v 1 , , f v | V | ) , that is, for every k < m | V | , we have a v m = f v m ( a Y m ) , where Y m is the set of parents of v m ; and
(iii)
u i ( a ) > u i ( a ) .
In virtue of Lemmas 1 and 2, each of these checks can be performed in polynomial time in the size of the input. Moreover, if all of these checks are positive, we have found a counterexample against f being a backwards induction solution.
For coNP -hardness, we reduce the complement of satisfiability. Given an Boolean formula φ on propositional variables p 1 , , p k , we construct a partial order Boolean game G φ with one player i with goal formula γ i = φ p 0 , where p 0 is a “fresh” variable distinct from any p 1 , , p k . Let the vertices v p 0 , , v p k be ordered accordingly, that is, p E p m if and only if < m . Thus, G φ is a totally-ordered Boolean game, and, consequently, also fit for backwards induction. Now, consider the strategy profile f 0 = ( f v p 0 0 , , f v p k 0 ) where each f v p j 0 is represented by the choice equation p j : , that is, f 0 unconditionally sets every propositional variable to false. Note that f 0 yields player i a utility of 0, that is u i ( f 0 ) = 0 .
Now, if φ is unsatisfiable, so is i’s goal γ i , and all strategy profiles yield i the same payoff. Therefore, f 0 is vacuously a backwards induction solution. If, on the other hand, φ is satisfiable, say in virtue of assignment α , let f α = ( f v p 0 , , v p k ) be the unconditional strategy profile represented by the choice equations p 0 = and, for each 1 m k , p m : , if α ( p m ) = 1 , and p m : , if α ( p m ) = 0 . It can then easily be verified that player i has u i ( f α ) = 1 . Accordingly, player i has an incentive to deviate from f 0 to f α , indicating that f 0 is not a Nash equilibrium. By Theorem 7, it then follows that f 0 is not a backwards induction solution either. □
Finally, we come to consider the natural counter-part of the is nash actions problem for the backwards induction procedure, which can formally be stated as follows.
is backward induction actions
Given: Partial order game G, topological sorting  τ , and action profile a = ( a v 1 , , a v | V | )
Problem: Is  a sustained by a backwards induction solution under  τ in G?
We find that backwards induction actions is NEXPTIME -complete for general partial order games. The hardness part of this result follows again from a reduction from dependency quantifier boolean formula game (dqbfg).
Theorem 12.
For partial order games,backward induction actionsis NEXPTIME -complete.
Proof. 
A NEXPTIME algorithm to decide backward induction actions can be designed along the following lines. Given a partial order game G, a topological-ordering τ = ( v 1 , , v | V | ) of its vertices, and an action profile a = ( a v 1 , , a v | V | ) , first, guess a strategy profile f = ( f v 1 , , f v | V | ) . Given that strategies f v are given by a Boolean circuit C f v , this can be achieved in time not more than exponential in | A | , the size of the set of actions. Second, check whether f = ( f v 1 , , f v | V | ) is a backwards induction solution. The inductive definition of a backwards induction solution suggests a procedure for how this can be achieved in exponential time. Finally, check whether action profile a f as induced by strategy profile f coincides with the action profile a as it is given in the input. This can be achieved in time polynomial in the size of the Boolean circuit representing f . Altogether, the algorithm runs in non-deterministic exponential time.
For hardness, we reduce from dependency quantifier boolean formula game (dqfg) using a similar construction of a Boolean partial order game as in the proof of Theorem 4. Given an instance φ , X 1 , X 2 , Y 1 , Y 2 of dqfg with X 1 = { x 1 1 , , x | X 1 | 1 } , X 2 = { x 1 2 , , x | X 2 | 2 } , Y 1 = { y 1 1 , , y | Y 1 | 1 } , and Y 2 = { y 1 2 , , y | Y 2 | 2 } . Let x 0 be a ‘fresh’ variable x 0 not in X 1 X 2 Y 1 Y 2 . We define a Boolean partial order game on dependency graph ( V , E ) such that V = { x 0 } X 1 X 2 Y 1 Y 2 and where E is defined as follows:
(i)
x 0 E z    for all z X 1 X 2 Y 1 Y 2 ,
(ii)
x k i E x m j  for all x k i X i and x m j X j with i = j and k < m , or i j ,
(iii)
x 1 E x 2 , for all x 1 X 1 and x 2 X 2 ,
(iv)
x 1 E y 1 , for all x 1 X 1 and y 1 Y 2 ,
(v)
x 2 E y 2 , for all x 2 X 2 and y 2 Y 2 .
Observe that under this definition on X 1 X 2 , the relation E is complete and transitive. We extend E to a topological sorting τ such that, for instance,
τ = ( x 0 , x 1 1 , , x | X 1 | 1 , x 1 2 , , x | X 2 | 2 , y 1 1 , , y | Y 1 | 1 , y 1 2 , , y | Y 2 | 2 ) .
The players of the Boolean game are the same as in the instance of dqbfg, where B controls { x 0 } X 1 X 2 , W 1 the variables in Y 1 , and W 2 those in Y 2 . Their goals are given by
γ B = x 0 φ γ W 1 = γ W 2 = .
Observe that all vertices are scrutable to player B under τ and that the white players have the same utility for every outcome. It therefore follows that the game constructed is of adequate information and hence fit for backwards induction.
We are now in a position to show that the profile
a ¯ = ( x ¯ 0 , x ¯ 1 1 , x ¯ | X 1 | 1 , x ¯ 1 2 , x ¯ | X 2 | 2 , y ¯ 1 2 , y ¯ | Y 1 | 1 , y ¯ 1 2 , y ¯ | Y 2 | 2 ) ,
which sets all variables, including x 0 , to false, is sustained by a Nash equilibrium in G if and only if φ , X 1 , X 2 , Y 1 , Y 2 is a postive instance of dqbfg.
First, assume φ , X 1 , X 2 , Y 1 , Y 2 is a postive instance of dqbfg. Then, the white team has a joint winning strategy in the original dqbfg-game given by Boolean functions β 1 and β 2 on the variables X 1 and X 2 , respectively. Next, define strategies f W 1 * and f W 2 * for W 1 and W 2 in the Boolean partial order game that are given by the following choice equations for i = 1 , 2 and j = 1 , , | Y i | ,
y j i : ( x 0 φ β i ( x 1 i , , x | X i | i ) ) ( x ¯ 0 )
As the dependency graph ( V , E ) respects the information dependencies of the dqbfg-instance, observe that f W 1 and f W 2 together embody a winning strategy to render φ false if x 0 is played. Let B’s unconditional strategy f B * be defined by the choice equations of the form x j i : , setting x j i to false for all for i = 1 , 2 and j = 1 , , | X i | . Observe that for the strategy profile f * = ( f B * , f W 1 * , f W 2 * ) , we have that a f * = a ¯ , as desired. As W 1 and W 2 will not have their goal achieved under any action profile, their strategies trivially conform with the backwards induction procedure. Observe furthermore that B does not get their goal satisfied no matter which strategy plays against f W 1 * and f W 2 * . If B sets x 0 to false, their goal is not satisfied in a self-defeating fashion. By setting x 0 to true, B will induce W 1 and W 2 to play a winning strategy rendering ¬ φ false and therewith γ B as well. This in particular holds for any backwards induction strategies B has at their disposal, and it follows that f * is a backwards induction solution sustaining a ¯ .
For the opposite direction, assume that φ , X 1 , X 2 , Y 1 , Y 2 is a negative instance of dqbfg. Analogously to the proof of Theorem 4, it can then be shown that the constructed partial order Boolean game does not have a Nash equilibrium sustaining a ¯ . In virtue of the contrapositive of Theorem 7, there is no backwards induction solution sustaining a ¯ , giving us the result. □
The computational complexity of backward induction actions reduces considerably when we restrict attention to total-order games. Still, the problem is PSPACE -complete for this class of games. To prove this result, we first introduce some auxiliary notation and terminology.
Recall that if the dependency graph ( V , E ) of a total-order game is such that v 0 E E v | V | , then τ = ( v 0 , , v | V | ) is the only topological sorting of the vertices. Moreover, the parents of each vertex v k are all vertices with a lower topological index. The set { v 1 , , v k 1 } of parents of vertex v k we will denote by Y k , on the understanding that Y k = Ø , if k = 0 . Similarly, we will denote by Z k = { v k + 1 , , v | V | } the set of vertices with a higher topological index than v k , here on the understanding that Z k = Ø , if k = | V | .
For a = ( a v 1 , , a v | V | ) an action profile and h = ( a v 1 , , a v k ) a τ -history of a , we now introduce the auxiliary concept of a being sustained by a backwards induction solution at history h of a. This enables us to reason recursively about action profiles being sustained by backwards induction solutions without having to make explicit reference to specific underlying strategy profiles. Formally, for action profile a =   ( a v 1 , , a v | V | ) a n d τ-history h =   ( a v 1 , , a v k ) of a we say that a is sustained (by backwards induction) at h, if either h = a, or h =   ( a v 1 , , a v k ) for some 0 < k < |V| and the following two conditions hold:
(a.i)
action profile a is sustained by backwards induction at history ( a v 1 , , a v k , a v k + 1 )
(a.ii)
for every b v k + 1 A v k + 1 , there is an action profile a = ( a v 1 , , a v k , b v k + 1 , b v k + 2 , b v | V | ) that is sustained by backwards induction at history ( a v 1 , , a v k , b v k + 1 ) and which is such that u i ( a ) u i ( a ) , where i is the player active at vk+1.
We now have the following lemma.
Lemma 7.
For total-order games, an action profile a = ( a v 1 , , a v | V | ) is sustained by a backwards induction solution if and only if a is sustained by backwards induction at all histories of a . Moreover, a is sustained by a backwards induction solution only if a is sustained by backwards induction at the empty history ( ) . .
Proof. 
As we are dealing with total-order games, first observe that, for each vertex v, the set X of vertices that are neither a parent of v nor have a greater topological index than v is empty. Thus, Condition (1) in the definition of a backwards induction solution f * for partial order games reduces to
f v k + 1 * ( a v 1 , , a v k ) arg max b v k + 1 A v k + 1 u i ( a v 1 , , a v k , b v k + 1 , f v k + 2 , , f v | V | ) ,
for all 0 k < | V | , and where i is the player active at v k + 1 .
As an auxiliary concept, we define a strategy profile f = ( f v 1 , , f v | V | )  to sustain action profile a = ( a v 1 , , a v | V | ) (by backwards induction) at history h = ( a v 1 , , a v k ) , if either a = h , or both
(f.i)
f sustains a at ( a v 1 , , a v k , a v k + 1 ) ,
(f.ii)
f v k + 1 ( a v 1 , , a v k ) = a v k + 1 , and
(f.iii)
for all b v k + 1 A v k + 1 with b v k + 1 a v k + 1 we have that f sustains action profile ( a v 1 , , a v k , b v k + 1 , f v k + 2 , , f v | V | ) at history ( a v 1 , , a v k , b v k + 1 ) and,
u i ( a v 1 , , a v k , a v k + 1 , a v k + 2 , a v | V | ) u i ( a v 1 , , a v k , b v k + 1 , f v k + 1 , , f v | V | ) ,
where i is the player active at v k + 1 .
The we split the proof up in three parts, where we prove, respectively, the following statements (I), (II), and (III), below. In conclusion then put them together again, to prove the lemma.
(I)
for all 0 k | V | and all action profiles a = ( a v 1 , , a v | V | ) , a is sustained at ( a v 1 , , a v k ) if and only if some strategy profile f = ( f v 1 , , f v | V | ) sustains a at ( a v 1 , , a v k ) .
(II)
strategy profile f = ( f v 1 , , f v | V | ) sustains action profile a = ( a v 1 , , a v | V | ) at history ( ) if and only if a f = a and for every 0 k | V | and all action profiles x = ( x v 1 , , x v k , f v k + 1 , , f v | V | ) , strategy profile f sustains x at ( x v 1 , , x v k ) .
(III)
for every 0 k | V | and every action profile x = ( x v 1 , , x v k , f v k + 1 , , f v | V | ) , strategy profile f sustains x at history h = ( x v 1 , , x v k ) if and only if strategy profile f = ( f v 1 , , f v | V | ) is a backwards induction solution.
Part (I) is by induction on | V | k . For the basis, where k = | V | , consider an arbitrary action profile a = ( a v 1 , , a v | V | ) and arbitrary history h = ( a v 1 , , a v k ) . In this case, we find that h = a and immediately that a is sustained at h . Moereover, every strategy profile sustains a at h in this case.
For the induction step, consider an arbitrary 1 k < | V | , and equally arbitrary action profile a = ( a v 1 , , a v | V | ) and history h = ( a v 1 , , a v k ) .
First, assume that a is sustained at ( a v 1 , , v k ) , and let i be the player active at v k + 1 . Then, a is also sustained at ( a v 1 , , v k + 1 ) . By the induction hypothesis, there is a strategy profile f that sustains a at ( a v 1 , , v k + 1 ) . Moreover, as k < | V | , for each b v k + 1 A v k with b v k + 1 a v k + 1 , there is an action profile a = ( a v 1 , , a v k 1 , b v k + 1 , b v k + 2 , b v | V | ) that is sustained at history ( a v 1 , , a v k 1 , b v k , b v k + 1 ) such that u i ( a ) u i ( a ) . Let A v k + 1 = { b v k + 1 1 , , b v k + 1 | A v k + 1 | } . By the induction hypothesis, for each 1 j | A v k + 1 | , there is a strategy profile g j = ( g v 1 j , , g v | V | j ) sustaining action profile ( a v 1 , , a v k , b v k + 1 j , g v k + 2 j , g v | V | j ) at history ( a v 1 , , a v k , b v k + 1 j ) , and which is such that
u i ( a ) u i ( a v 1 , , a v k , b v k + 1 j , g v k + 1 j , , g v | V | j ) .
Now, define the strategy profile f * = ( f v 1 * , , f v | V | * ) such that for 1 k | V | and every history x = ( x v 1 , , x m ) , where 0 m | V | ,
f v m * ( x ) = f v m j ( x ) if x v k + 1 = a v k + 1 j and m > k , g v m j ( x ) if x v k + 1 = b v k + 1 j and m > k , a v m otherwise .
It is now easy to check that, by construction, strategy profile f * sustains a at history ( a v 1 , , a v k ) , as desired. In particular, observe that f v k + 1 * ( a v 1 , , a v k ) = a v k + 1 .
For the opposite direction, assume that strategy profile f = ( f v 1 , , f v | V | ) sustains action profile a = ( a v 1 , , a v | V | ) at history ( a v 1 , , a v k ) , where k < | V | . Then, by definition, strategy profile f = ( f v 1 , , f v | V | ) also sustains action profile a = ( a v 1 , , a v | V | ) at history ( a v 1 , , a v k = 1 ) . By the induction hypothesis, we then immediately obtain that a is sustained at history ( a v 1 , , a v k + 1 ) . Now, consider an arbitrary b v k + 1 A v k + 1 with a v k + 1 b v k + 1 , along with history ( a v 1 , , a v k , b v k + 1 ) . Then, f sustains action profile ( a v 1 , , a v k , b v k + 1 , f v k + 2 , , f v | V | ) at history ( a v 1 , , a v k , b v k + 1 ) . We have, moreover, that u i ( a ) u i ( ( a v 1 , , a v k , b v k + 1 , f v k + 2 , , f v | V | ) ) . By the induction hypothesis, we find that action profile ( a v 1 , , a v k , b v k + 1 , f v k + 2 , , f v | V | ) is sustained at history ( a v 1 , , a v k , b v k + 1 ) , which proves the case.
For part (II), first assume that a f = a and that for every 0 k | V | , strategy profile f sustains action profile ( x v 1 , , x v k , f v k + 1 , , f v | V | ) at history ( x v 1 , , x v k ) . The latter then holds in particular for k = 0 , that is, for history ( ) . Accordingly, f sustains action profile ( f v 1 , , f v | V | ) = a f at history ( ) . Our initial assumption that a f then gives the result for this direction.
For the opposite direction, assume that strategy profile f = ( f v 1 , , f v | V | ) sustains action profile a = ( a v 1 , , a v | V | ) at history ( ) . First, we find that by repeatedly applying condition (f.ii) gives us a f = a . Now, consider an arbitrary 0 k | V | and action profile x = ( x v 1 , , x v k , f v k + 1 , , f v | V | ) . We prove by induction on k that f sustains ( x v 1 , , x v k , f v k + 1 , , f v | V | ) at history h = ( x v 1 , , x v k ) .
First assume that k = 0 , that is, h = ( ) . In that case, x = ( f v 1 , , f v | V | ) = a f . We have already seen that a f = a , and, thus, the base case immediately follows from the assumption that f = ( f v 1 , , f v | V | ) sustains action profile a = ( a v 1 , , a v | V | ) at history ( ) .
For the induction step, let k > 0 . In case k = | V | we have h = a and we are done immediately. Otherwise, by the induction hypothesis, we may assume that f sustains action profile ( x v 1 , , x v k 1 , f v k , , f v | V | ) at history ( x v 1 , , x v k 1 ) . If f v k ( x v 1 , , x v k 1 ) = x k , then it follows from condition (f.i) that f sustains ( x v 1 , , x v k , f v k + 1 , , f v | V | ) at ( x v 1 , , x v k ) . If f v k ( x v 1 , , x v k 1 ) x k , then still f v k ( x v 1 , , x v k 1 ) A v k . Now, it follows from condition (f.iii) that f sustains ( x v 1 , , x v k , f v k + 1 , , f v | V | ) at ( x v 1 , , x v k ) .
For part (III), first assume, for every 0 k | V | and action profile x = ( x v 1 , , x v k , f v k + 1 , , f v | V | ) , that strategy profile f = ( f v 1 , , f v | V | ) sustains x at history h = ( x v 1 , , x v k ) . Now, consider an arbitrary history ( z 1 , , z k ) for some 1 k < | V | . Then, by assumption, f sustains action profile ( z v 1 , , z v k , f v k + 1 , , f v | V | ) at history ( z v 1 , , z v k ) . As k < | V | , we find that ( z 1 , , z k ) ( z 1 , , z k , f v k + 1 , , f v | V | ) . Therefore, by condition (f.iii), we then find that
f v k + 1 ( z v 1 , , z v k ) arg max b v k + 1 A v k + 1 u i ( z v 1 , , z v k , b v k + 1 , f v k + 2 , , f v | V | ) ,
where i is the player active at v k + 1 . It thus follows that f is a backward induction solution.
For the opposite direction, assume that f = ( f v 1 , , f v | V | ) is a backwards induction solution. Now, consider an arbitrary 0 k | V | . We prove by induction on | V | k , that, for every action profile x = ( x v 1 , , x v k , f v k + 1 , , f v | V | ) , strategy profile f sustains x at history h = ( x v 1 , , x v k ) .
For the basis assume let k = | V | . Consider an arbitrary action profile x = ( x v 1 , , x v k ) along with history h = ( x v 1 , , x v k ) . Obviously, h = x , and, thus, immediately that f sustains x at history h.
For the induction step, let k < | V | . Again, consider an arbitrary action profile h = ( x v 1 , , x v k , f v k + 1 , , f v | V | ) along with history h = ( x v 1 x , x v k ) . By the induction hypothesis, we find, for all action profiles y = ( y v 1 , , y v k + 1 , f v k + 1 , , f v | V | ) , that f sustains y at history ( y v 1 , , y v k + 1 ) . This holds in particular for histories ( x v 1 , , x v k , b v k + 1 ) where b v k + 1 A v k + 1 . For b v k + 1 = f v k + 1 ( x v 1 , , f v k ) , we obtain condition (f.i). Condition (f.ii) is trivially satisfied. Finally, having assumed that f is a backwards induction solution, we find that,
u i ( x v 1 , , x v k , f v k + 1 , f v k + 2 , , f v | V | ) u i ( x v 1 , , x v k , b v k + 1 , f v k + 2 , , f v | V | ) ,
for all b v k + 1 V v k + 1 with b v k + 1 f v k + 1 ( x v 1 , , f v k ) and i the player active at v k + 1 . We may therefore conclude that f sustains x = ( x v 1 , , x v k , f v k + 1 , , f v | V | ) at ( x v 1 , , x v k ) , proving part (III).
With (I) through (III) in place, we conclude the proof as follows. Consider an arbitrary action profile a = ( a v 1 , , a v | V | ) . First, assume, for all 0 k | V | , that a is sustained by backwards induction at ( a v 1 , , a v k ) . Then by (I), for all 0 k | V | , there is some strategy profile f = ( f v 1 , , f v | V | ) that sustains a by backwards induction at ( a v 1 , , a v k ) . For the particular case wherein k = 0 , we find that this strategy profile f that sustains a by backwards induction at ( ) . By repeated application of (f.ii), moreover, it can easily be established that a f = a . Part (II) then yields that for every 0 k | V | and all action profiles x = ( x v 1 , , x v k , f v k + 1 , , f v | V | ) , strategy profile f sustains a at history ( x v 1 , , x v k ) . Finally, by virtue of (III), we may conclude that f is a backwards induction solution that sustains a .
For the opposite direction, assume that f is a backwards induction solution sustaining a . Therefore, a f = a , and, accordingly, ( a v 1 , , a v k , f v k + 1 , , f v | V | ) = ( a v 1 , , a v | V | ) for all 0 k | V | . Part (III) yields furthermore that for every 0 k | V | , strategy profile f sustains action profile a = ( a v 1 , , a v k , f v k + 1 , , f v | V | ) at history ( a v 1 , , a v k ) . It now follows, for every 0 k | V | , that f sustains a at history ( a v 1 , , a v k ) . With part (I), we may now conclude that, for all 0 k | V | , action profile a is sustained by backwards induction at ( a v 1 , , a v k ) , as desired.
Finally, observe that the second part of the lemma also follows from the above argument. □
The intuition behind Lemma is that, for a strategy profile f sustaining an action profile a = ( a v 1 , , a v | V | ) to be a backward induction solution, it does not suffice just to induce a , in the sense that a f = a . It should, in addition, incorporate a threat strategy at every vertex v k that succeeds in deterring the player active at v k to play any action other than a v k , that is, the action prescribed by a . The lemma, moreover, shows that a strategy profile that achieve this can be constructed from the strategy profiles that achieve this at each history.
We introduce the following auxiliary decision problem, solving which is an important subroutine in our proof of PSPACE -membership of backwards induction actions for total-order games.
backward induction guarantee
Given:Total-order game G, action profiles  a = ( a v 1 , , a v | V | ) and history  h = ( a v 1 , , a v k ) , player i, and  r R .
Problem:Is  a sustained by backwards induction at  h such that u i ( a ) r
We find that backward induction guarantee is in PSPACE for total-order games.
Lemma 8.
For total-order games,backward induction guaranteeis in PSPACE , that is, the problem can be solved in space polynomial in the size of the game.
Proof. 
Let a total-order game be given together with an action profile a = ( a v 1 , , a v | V | ) . We prove by induction on | V | k , that for all histories h = ( a v 1 , , a v k ) of a , all players i, and all values r R the problem backward induction guarantee can be solved in polynomial space.
For the basis, assume k = | V | . Then, h = a , and consider an arbitrary player i and arbitrary value r R . In this case, it suffices to compute u i ( a ) and check whether u i ( a ) r . By Lemma 2, this achieved in polynomial time, and hence in polynomial space as well.
For the induction step, let k < | V | . Consider an arbitrary player j and an arbitrary value r R . Let, furthermore, i be the player active at a v k . Now, we first check whether a is sustained by backward induction at history h v k + 1 = ( a v 1 , , a v k + 1 ) . By virtue of the induction hypothesis, this can be achieved in polynomial space by computing backward induction guarantee for a , history h v k + 1 , player i, and value r = u i ( a ) . If this check is negative, we can conclude immediately and output “no”.
Otherwise, we deploy two binary counters with, respectively, log 2 | A ( x ) | and log 2 | H | = O ( | V | log 2 | A | ) digits to run through all actions b v k + 1 in A v k and all action profiles
( a v 1 , , a v k , b v k + 1 , b k + 1 , b v | V | ) ,
respectively. We then subsequently check for each b v k + 1 in A v k + 1 whether some action profile b = ( a v 1 , , a v k , b v k + 1 , b k + 1 , b v | V | ) is sustained by backwards induction at ( a v 1 , , a v k , b v k + 1 ) with u i ( b ) u i ( a ) . Then, a is sustained by backwards induction a history h = ( a v 1 , , a v k + 1 ) if and only if for each b v k + 1 in A v k + 1 at least one of these checks is positive. Moreover, as some reflection reveals, all of these checks are independent and, by the induction hypothesis, we may assume that all of them can be achieved in polynomial space, we may conclude that the subsequent execution of these checks can be performed in polynomial space as well, giving us the result. □
With Lemma 8 in place, we are now in a position to now demonstrate the last main result of this section.
Theorem 13.
For total-order games,backwards induction actionsis PSPACE -complete.
Proof. 
For membership in PSPACE , let a total order game with v 1 E E v | V | and an action profile a = ( a v 1 , a v | V | ) be given. Observe that the topological order τ = ( v 1 , , v | V | ) is fixed by E. By virtue of Lemma, it suffices to check whether a is sustained by backwards induction at the empty history ( ) for value k = u i ( a ) , where i is the player active at v 1 . Lemma 8 ensures that this can be achieved in space polynomial in the input.
For PSPACE -hardness, we reduce from qbf, which is known to be PSPACE -hard. Let Φ = Q 1 x 1 , , Q | X | x | X | φ ( x 1 , , x k ) be a fully quantified propositional formula over X = { x 1 , , x | X | } , where each Q i is either ∃ or ∀. We let X and X denote the sets of universally quantified variables and existentially quantified variables in Φ , respectively. Furthermore, for each variable x i with 1 i | X | , we let B i denote the set of Boolean functions on the variables x 1 , , x i 1 . With each profile β = ( β 1 , , β | X | ) in B 1 × × B | X | , we associate an a valuation α β : X { 1 , 0 } such that
α β ( x 1 ) = β 1 α β ( x i + 1 ) = β i + 1 ( α β ( x 1 ) , , α β ( x i 1 ) ) .
We construct a total-order Boolean game G Φ with two players: player 1 and player 2. The propositional variables our game is based on are given by x 1 , , x | X | together with an additional ‘fresh’ variable x 0 . The players’ preferences are captured by
γ 1 = x 0 ¬ φ γ 2 = x 0 φ .
We assign control over variable x i to player 1, if Q i = , and to player 2, if Q i = . Moreover, the auxiliary variable x 0 is assigned to the control of player 1. The dependency graph ( { x 0 , , x k } , E ) is defined such that x i E x j if and only if i < j . The game G Φ is a total-order game and, thus, fit for backwards induction. Backwards induction solutions are therefore bound to exist. We now demonstrate that Φ = Q 1 x 1 , , Q | X | x | X | φ evaluates to true if and only if the action profile x ¯ = ( x ¯ 0 , x ¯ 1 , , x ¯ | X | ) is sustained by a backwards induction solution in G Φ .
First assume for the “only if”-direction, that Φ holds. Then, there is a profile of Boolean functions β i for the variables in X , such that for all profiles Boolean functions β j for the variables x j X , such that the assignment α β 1 , , β | V | : X { 0 , 1 } satisfies φ . On this basis we define a strategy profile f * = ( f 1 * , f 2 * ) , where f 1 * and f 2 * are given by the following choice equations for each x i X { x 0 } (controlled by player 1) and each x j X (controlled by player 2):
x i : x j : x 0 φ β j ( x 1 , , x j 1 ) .
Here, φ β j denotes the Boolean formula representing Boolean function β j . Note, furthermore, that a f * = x ¯ .
First observe that for all action profiles a = ( a x 0 , , a x | X | ) with a x 0 = x ¯ 0 , the valuation α a will satisfy neither γ 1 nor γ 2 . Some reflection, thus reveals that f * sustains x ¯ at all histories h = ( a x 0 , , a x k ) with a v x 0 = x ¯ 0 , particular at history a v x 0 . Now observe that f 2 incorporates a winning strategy for player 2 in the “subgame” that results if player 1 sets x 0 to true. Therefore, φ will be satisfied by the valuations induced by the Nash equilibria in which x 0 is set to true—if any such Nash equilibria exist. As, by Theorem 7, every backwards induction solution is a Nash equilibrium, it follows that every action profile a = ( a x 0 , , a x | X | ) with a x 0 = x 0 that is sustained by backwards induction will render φ true, and, hence, player 1’s goal γ 1 false. Now let f = ( f 1 , f 2 ) be a strategy profile that sustains some action profile a = ( a x 0 , , a x | X | ) with a x 0 = x 0 by backwards induction at history h = x 0 . Observe that we may assume the existence of such a strategy profile. At this juncture, consider the strategy profile f * * = ( f x 0 * * , , f x | X | * * ) defined such that for all histories h = ( a x 0 , , a x k 1 ) with 0 k < | X | and i { 1 , 2 } ,
f i * * ( h ) = f 1 * ( h ) if a x 0 = x ¯ 0 , f 2 ( h ) otherwise .
Using the auxiliary formal apparatus introduced in the proof of Lemma 7, we find that f * * sustains action profile x ¯ at x ¯ 0 and action profile ( x 0 , f x 1 * * , , f | V | * * ) at x 0 . Moreover, u 1 ( x ¯ ) u 1 ( x 0 , f x 1 * * , , f x | X | * * ) . As player 1 controls x 0 , we may therefore conclude that f * * sustains x ¯ at ( ) . By Lemma, we may now conclude that x ¯ is sustained by a backwards induction solution, as desired.
For the “if”-direction, assume that Φ does not hold. In this case, it can be seen that player 2 does not have a winning strategy in the “subgame” that results if x 0 is set to true. Now, consider an arbitrary strategy profile f ¯ = ( f ¯ 1 , f ¯ 2 ) that sustains x ¯ . Then, player 1’s goal is not satisfied if f ¯ is played. Moreover, as player 2 does not have a winning strategy in the “subgame” after x 0 is set to true, player 1 has some strategy f i * with which they set x 0 to true and such that ( f 1 * , f ¯ 2 ) guarantees γ 1 to be satisfied. Accordingly, player 1 will want to deviate to f 1 * and strategy profile f ¯ is not a Nash equilibrium. By Theorem 7, strategy profile f ¯ is not a backwards induction solution either. Having chosen f ¯ arbitrarily, we get the result. □
With this result in place, we can now present the proof of Theorem 6, which stated PSPACE -completeness of is nash actions for total-order games. We show that, given an action profile  a = ( a v 1 , , a v | V | ) , a given total-order game G can be transformed into a total-order game  G a in which  a is sustained by a backwards induction solution if and only if  a is sustained by a Nash equilibrium in the original game G. We then leverage Theorem 13 to prove PSPACE -membership of is nash actions.
Formally, let G be a partial order game defined on dependency graph  ( V , E ) , and let  a = ( a v 1 , , a v | V | ) be an action profile. Then, we define G a as exactly the same game as G, be it with the following binary utility functions u i a : A { 0 , 1 } . For a given action profile  b = ( b v 1 , , b v | V | ) , we say that player j is the first to deviate from  a (to  b ) if, for some 1 k | V | , player j is active at  v k , b v m = a v m for all m < k , and b v k a v k . For each player i, we set u i a ( a ) = 1 and, for every action profile  b a where player j is the first player deviate from  a to  b , we define
u i a ( b ) = 0 if i = j and u j ( b ) u j ( a ) , 2 if i = j and u j ( b ) > u j ( a ) , 2 if i j and u j ( b ) u j ( a ) , 0 if i j and u j ( b ) > u j ( a ) .
We now have the following two lemmas, the first of which shows that the game  G a preserves the Nash equilibria of G.
Lemma 9.
Let a = ( a v 1 , , a v | V | ) be an action profile of a total-order game G. Then, a is sustained by a Nash equilibrium in G if and only if a is sustained by a Nash equilibrium in G a .
Proof. 
First assume that f = ( f v 1 , , f v | V | ) be a Nash equilibrium sustaining a in G. Therefore, a f = a . For contradiction, also assume that f does not do so in G a . Then, there is some player i and some strategy g i , such that u i a ( a ) < u i a ( a ( f i , g i ) ) . Obviously, player i is the first to deviate from a to a ( f i , g i ) . Thus, u i a ( a ) = 1 and u i a ( a ( f i , g i ) ) = 2 , and, by definition of u a , we find that u i ( a f ) < u i ( a ( f i , g i ) ) . It therefore follows that f is not a Nash equilibrium of G, a contradiction.
Now assume that f is a Nash equilibrium sustaining a in G a . Therefore, a f = a . It suffices to show that f is also a Nash equilibrium in G. To this end, consider an arbitrary player i and strategy g i for i. If a ( f i , g i ) = a f , we are done immediately. Otherwise, i is the first player to deviate from a f to a ( f i , g i ) . Because f is a Nash equilibrium in G a , we find that u i a ( a f ) u i a ( a ( f i , g i ) ) . As a f = a , we have u i a ( a f ) = 1 . Observe that it now follows that u i a ( a ( f i , g i ) ) = 0 . Accordingly, u i ( a ( f i , g i ) ) u i ( a ) , and thus u i ( a ( f i , g i ) ) u i ( a f ) . We may conclude that f is a Nash equilibrium in G, as desired. □
For total-order game  G a , we can furthermore show that if there is a Nash equilibrium that sustains action profile  a , then there is also a backwards induction that does the same. The converse of this statement is immediate, as every backwards induction solution is also a Nash equilibrium (Theorem 7).
Lemma 10.
Let a be an action profile of a total-order game G. Then, a is sustained by a Nash equilibrium in G a if and only if a is sustained by a backwards induction solution in G a .
Proof. 
By virtue of Theorem 7, the “if”-direction of the lemma is immediate. For the opposite direction, assume that a is sustained by a Nash equilibrium f * . For contradiction, also assume that a is not sustained by any backwards induction solution. As we are dealing with total-form games, we may assume the existence of backwards induction solutions. These we may also assume to be Nash equilibria in virtue of Theorem 7. Now, let g = ( g v 1 , , g v | V | ) be any backwards induction solution and let a g = b = ( b v 1 , , b v | V | ) . Furthermore, let 1 k | V | be the greatest index such that g v k ( a v 1 , , a v k 1 ) f v k ( a v 1 , , a v k 1 ) and assume i to be the player active at v k . We may also assume that f is the backwards induction solution for which there are the fewest histories h = ( x v 1 , , x v ) such that some player j i is active at v and f v ( g v ( h ) . Furthermore, observe that, by choice of k, we have that a = ( a v 1 , , a v k , g v k + 1 , , g v | V | ) . Having assumed that g is a backwards induction solution, we find moreover that
1 = u i ( a ) = u i ( a f ) < u i ( a g ) = u i ( b ) = 2 .
Now, observe that a g a ( f i , g i ) ; otherwise, f would not be a Nash equilibrium. Accordingly, there must be some player j i and some vertex v k with k < k | V | and at which player j is active such that
f v k + 1 ( a v 1 , , a v k 1 , b v k , , b k 1 ) g v k + 1 ( a v 1 , , a v k 1 , b v k , , b k 1 ) .
Having assumed that g is a backwards induction solution, we find that
u j ( a v 1 , , a v k 1 , b v k , , b v k 1 , a v k , g v k + 1 , , g v | V | ) u j ( a v 1 , , a v k 1 , b v k , , b v k 1 , b v k , g v k + 1 , , g v | V | ) = u j ( b ) .
where a v k = f v k ( a v 1 , , a v k 1 , b v k , , b v k 1 ) . Because a v k b v k and u i ( b ) = 2 , we have u j ( b ) = 0 . It now follows that u j ( a v 1 , , a v k 1 , b v k , , b v k 1 , a v k , g v k + 1 , , g v | V | ) = 0 as well. Accordingly,
u j ( a v 1 , , a v k 1 , b v k , , b v k 1 , a v k , g v k + 1 , , g v | V | ) u j ( a v 1 , , a v k 1 , b v k , , b v k 1 , b v k , g v k + 1 , , g v | V | ) .
At this juncture, consider the strategy profile g = ( g v 1 , , g v | V | ) defined such that for all vertices v and all histories h = ( x v 1 , , x v 1 ) ,
g v ( h ) = a v k if = k and h = ( a v 1 , , a v k 1 , b v k , , b v k 1 ) , g v ( h ) otherwise .
The key observation to make at this point is that g is a backwards induction solution just as well as g is. As a v k = f v k ( a v 1 , , a v k 1 , b v k , , b v k 1 ) , however, we find that for g the number of histories h = ( x v 1 , , x v ) such that there some player j i that is active at v and f v ( h ) g v ( h ) is one fewer than the same number of such histories for g . This contradicts our minimality assumption regarding the latter. This concludes the proof. □
We are now finally in a position to prove Theorem 6, and establish that is nash actions is PSPACE -complete for total-order games.
Proof of Theorem 6.
For PSPACE -membership, first observe that, given total-order game G and action profile a = ( a v 1 , , a v | V | ) , we can construct G a in polynomial time. To see this, note for each player i, we have to transform the circuit C u i computing i’s utility function u i in G to a circuit C u i a . This can be achieved by adding a polynomial number of gates to each C u i . We then check whether a is sustained by a backwards induction solution in G a . Theorem 13 guarantees that this can be achieved in polynomial space. Lemma 9 then yields the result.
To prove PSPACE -hardness, we adapt the proof of Theorem 13 and reduce from q b f . Given an instance Φ = Q 1 x 1 , , Q | X | x | X | φ ( x 1 , , x k ) of qbf, we construct the game G Φ as in Theorem 13. We demonstrate that Φ = Q 1 x 1 , , Q | X | x | X | φ evaluates to true if and only if the action profile x ¯ = ( x ¯ 0 , x ¯ 1 , , x ¯ | X | ) is sustained by a Nash equilibrium.
First assume that Φ holds. Then, there is a profile of Boolean functions β i for the variables in X , such that for all profiles of Boolean functions β j for the variables x j X , such that the assignment α β 1 , , β | V | : X { 0 , 1 } satisfies φ . As in the proof of Theorem 13, we consider profile f * = ( f 1 * , f 2 * ) , where f 1 * and f 2 * are given by the following choice equations for each x i X { x 0 } (controlled by player 1) and each x j X (controlled by player 2):
x i : x j : x 0 φ β j ( x 1 , , x j 1 ) .
Note that a f * = x ¯ , and so it suffices to show that f * is a Nash equilibrium. Recall that f 2 incorporates a winning strategy for player 2 in the “subgame” that results if player 1 sets x 0 to true. It follows that for each of player 1’s strategies g 1 , the assignment α a ( g 1 , f 2 * ) associated with action profile a ( g 1 , f 2 * ) either satisfies φ or does not satisfy x 0 . In either case, player 1’s goal is not satisfied, and hence does not want to deviate from f * . Player 2 does not want to deviate either, as, with player 1 playing f 1 * , variable x 0 will be set to false and player 2’s goal γ 2 will not be satisfied no matter which strategy player 2 chooses. We may conclude that f * is a Nash equilibrium, as desired.
Finally, assume that Φ does not hold. In that case, player 2 does not have a winning strategy in the “subgame” that results if x 0 is set to true. Now consider an arbitrary strategy profile f ¯ = ( f ¯ 1 , f ¯ 2 ) that sustains x ¯ , and observe that f ¯ does not lead to player 1’s goal being satisfied. As player 2 does not have a winning strategy when x 0 is set to true, there must be a strategy f 1 * for player 1 such that ( f 1 * , f ¯ 2 ) leads to player 1’s goal γ 1 becoming true. Accordingly, player 1 would like to deviate to f 1 * , and f ¯ is not a Nash equilibrium. We may therefore conclude that x ¯ is not sustained by a Nash equilibrium, as desired. □

8. Conclusions

Game theorists have developed a huge range of game models since the field was initiated nearly a century ago. Of these, the most important and prominent are normal form games and extensive form games. These models differ in the assumptions they make about the information available to players and their strategies. In this article, we have introduced a game model in which informational dependencies between decision nodes in a game are captured in what we call a dependence relation. This dependence relation explicitly states what information is available when making a choice at a decision node. Although our partial order games are strictly no more expressive than extensive form games of imperfect information, they can be exponentially more compact, and we believe they are much more transparent with respect to the information flow in a game.
The compact representation of non-cooperative settings as partial order games naturally evokes various questions concerning the complexity of computing solution concepts in this model. We have focussed on Nash equilibrium and backwards induction. We explored six decision problems surrounding these two solution concepts—non-emptiness, is nash, is nash actions, fit for backwards induction, is backwards induction, and is backwards induction actions—for partial-games as well as for the two extremal subclasses of empty-order games and total-order games. Our results are summarised in Table 1 and Table 2. For each of these problems, we find that computational costs are highest for general partial order games, rather than for the two extremal classes. This suggests that the structure of the dependence relation is a major source of complexity.
Table 1. Overview of complexity results surrounding Nash equilibrium in partial order games.
Table 2. Overview of complexity results surrounding backwards induction in partial order games.
Furthermore, note the contrast in complexity between is nash and is backwards induction, on the one hand, and is nash actions and is backwards induction actions on the other. Other than in normal-form games, and arguably similar to extensive-form games, partial order games evince a clear distinction between strategy profiles and action profiles, with the former being much (exponentially) ‘larger’ objects than the latter, and of which there can also be exponentially more. Thus, is nash and is nash actions are similar problems, which basically only differ in that the former has a strategy profile as part of its input and the latter an action profile. We propose that the difference in size and number between strategy profiles and action profiles accounts for much of the contrast in computational cost between is nash and is nash actions. A similar remark applies to fit for backwards induction and is backwards induction actions.
For future work, it would be useful to develop software tools to support reasoning in partial order games, and from a theoretical perspective, it would also be interesting to consider restrictions on dependence graph structures that might lead to more tractable decision problems. Finally, of course, it would be valuable to look in more detail at applications of our model.

Author Contributions

Conceptualization, J.G., P.H., M.W., and V.Z.; methodology, J.G., P.H., M.W., and V.Z.; formal analysis, P.H., M.W., and V.Z.; investigation, P.H. and V.Z.; writing—original draft preparation, P.H. and V.Z.; writing—review and editing, all authors; supervision, M.W.; project administration, M.W.; funding acquisition, M.W. All authors have read and agreed to the published version of the manuscript.

Funding

Wooldridge and Harrenstein were supported by the UKRI under a Turing AI World Leading Researcher Fellowship (EP/W002949/1) awarded to Wooldridge.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors would like to thank James Fox, Lewis Hammond, and Thomas Steeples for discussions on this work, in particular on the relationship between partial order games and MAIDs.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Bradfield, J.; Gutierrez, J.; Wooldridge, M. Partial-order Boolean games: Informational independence in a logic-based model of strategic interaction. Synthese 2016, 193, 781–811. [Google Scholar] [CrossRef]
  2. Koller, D.; Milch, B. Multi-agent influence diagrams for representing and solving games. In Proceedings of the 17th International Joint Conference on Artificial Intelligence (IJCAI), Seattle, WA, USA, 4–10 August 2001; pp. 1027–1034. [Google Scholar]
  3. Koller, D.; Milch, B. Multi-agent influence diagrams for representing and solving games. Games Econ. Behav. 2003, 45, 181–221. [Google Scholar] [CrossRef]
  4. Maschler, M.; Solan, E.; Zamir, S. Game Theory; Cambridge University Press: Cambridge, UK, 2013. [Google Scholar]
  5. Osborne, M.J.; Rubinstein, A. A Course in Game Theory; MIT Press: Cambridge, MA, USA, 1994. [Google Scholar]
  6. Vollmer, H. Introduction to Circuit Complexity: A Uniform Approach; Texts in Theoretical Computer Science; Springer: Berlin/Heidelberg, Germany, 1999. [Google Scholar]
  7. Jukna, S. Boolean Function Complexity: Advances and Frontiers; Algorithms and Combinatorics; Springer: Berlin/Heidelberg, Germany, 2012; Volume 27. [Google Scholar]
  8. Greenlaw, R.; Hoover, H.J.; Ruzzo, W.L. Limits to Parallel Computation: P-Completeness Theory; Oxford University Press: Oxford, UK, 1995. [Google Scholar]
  9. Muller, D.E. Complexity in electronic switching circuits. IRE Trans. Electron. Comput. 1956, 1, 15–19. [Google Scholar] [CrossRef]
  10. Lupanov, O.B. A method of circuit synthesis. Izvesitya VUZ Radiofiz 1958, 1, 120–140. (In Russian) [Google Scholar]
  11. Harrenstein, P.; van der Hoek, W.; Meyer, J.J.; Witteveen, C. Boolean Games. In Proceeding of the Eighth Conference on Theoretical Aspects of Rationality and Knowledge (TARK VIII); Certosa Di Pontignano, Italy, 8–10 July 2001, van Benthem, J., Ed.; Morgan Kaufmann: Burlington, MA, USA, 2001; pp. 287–298. [Google Scholar]
  12. Harrenstein, P. Logic in Conflict. Ph.D. Thesis, Utrecht University, Utrecht, The Netherlands, 2004. [Google Scholar]
  13. Bonzon, E.; Lagasquie, M.; Lang, J.; Zanuttini, B. Boolean games revisited. In Proceedings of the ECAI 2006, Riva del Garda, Italy, 28 August–1 September 2006; Volume 141. Available online: https://www.researchgate.net/publication/48445144_Boolean_Games_Revisited (accessed on 17 December 2021).
  14. Bonzon, E.; Lagasquie-Schiex, M.C.; Lang, J.; Zanuttini, B. Compact preference representation and Boolean games. Auton. Agents Multi-Agent Syst. 2009, 18, 1. [Google Scholar] [CrossRef][Green Version]
  15. Wooldridge, M.; Endriss, U.; Kraus, S.; Lang, J. Incentive engineering for Boolean games. Artif. Intell. 2013, 195, 418–439. [Google Scholar] [CrossRef]
  16. Hammond, L.; Fox, J.; Everitt, T.; Abate, A.; Wooldridge, M. Equilibrium Refinements for Multi-Agent Influence Diagrams: Theory and Practice. In Proceedings of the 20th International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS’21), Online, 3–7 May 2021; Endriss, U., Nowé, A., Dignum, F., Lomuscio, A., Eds.; Available online: https://arxiv.org/pdf/2102.05008.pdf (accessed on 17 December 2021).
  17. Pearl, J. Probabilistic Reasoning in Intelligent Systems; Morgan Kaufmann: Burlington, MA, USA, 1988. [Google Scholar]
  18. Howard, R.A.; Matheson, J.E. Influence Diagrams. In Readings on the Principles and Applications of Decision Analysis; Strategic Decisions Group: Menlo Park, CA, USA, 1994; Volume 2, pp. 721–762. [Google Scholar]
  19. Kjaerulff, U.B.; Madsen, A.L. Bayesian Networks and Influence Diagrams; Springer Science+ Business Media 200: Berlin/Heidelberg, Germany, 2008. [Google Scholar]
  20. Nielsen, M.; Plotkin, G.D.; Winskel, G. Petri Nets, Event Structures and Domains, Part I. Theor. Comput. Sci. 1981, 13, 85–108. [Google Scholar] [CrossRef]
  21. Sassone, V.; Nielsen, M.; Winskel, G. A Classification of Models for Concurrency. In Proceedings of the CONCUR ’93, 4th International Conference on Concurrency Theory, Hildesheim, Germany, 23–26 August 1993; Best, E., Ed.; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 1993; Volume 715, pp. 82–96. [Google Scholar]
  22. Clairambault, P.; Gutierrez, J.; Winskel, G. The Winning Ways of Concurrent Games. In Proceedings of the IEEE Symposium on Logic in Computer Science, Dubrovnik, Croatia, 25–28 June 2012; pp. 235–244. [Google Scholar] [CrossRef][Green Version]
  23. Gutierrez, J.; Wooldridge, M. Equilibria of concurrent games on event structures. In Proceedings of the CSL-LICS, Vienna, Austria, 14–18 July; ACM: New York, NY, USA, 2014; pp. 46:1–46:10. [Google Scholar]
  24. Alcolei, A.; Clairambault, P.; Hyland, M.; Winskel, G. The True Concurrency of Herbrand’s Theorem. In Proceedings of the CSL, Birmingham, UK, 4–8 September 2018; Ghica, D.R., Jung, A., Eds.; Schloss Dagstuhl: Wadern, Germany, 2018; Volume 119, pp. 5:1–5:22. [Google Scholar]
  25. Castellan, S.; Clairambault, P.; Winskel, G. Thin Games with Symmetry and Concurrent Hyland-Ong Games. Log. Methods Comput. Sci. 2019, 15, 18:1–18:85. [Google Scholar] [CrossRef]
  26. Thompson, F.B. Equivalence of Games in Extensive Form; Research Memorandum RM-759; Rand Corporation: Santa Monica, CA, USA, 1952. [Google Scholar]
  27. Hearn, R.A.; Demaine, E.D. Games, Puzzles and Computation; A.K. Peters: Natick, MA, USA, 2009. [Google Scholar]
  28. Fudenberg, D.; Tirole, J. Game Theory; The MIT Press: Cambridge, MA, USA, 1991. [Google Scholar]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.