Open Access
This article is

- freely available
- re-usable

*Games*
**2015**,
*6*(4),
637-676;
https://doi.org/10.3390/g6040637

Article

Evidential Equilibria: Heuristics and Biases in Static Games of Complete Information

Department of Economics, University Road, University of Leicester, Leicester LE1 7RH, UK

^{*}

Author to whom correspondence should be addressed.

Academic Editors:
Andrew M. Colman
and
Briony D. Pulford

Received: 6 August 2015 / Accepted: 5 November 2015 / Published: 16 November 2015

## Abstract

**:**

Standard equilibrium concepts in game theory find it difficult to explain the empirical evidence from a large number of static games, including the prisoners’ dilemma game, the hawk-dove game, voting games, public goods games and oligopoly games. Under uncertainty about what others will do in one-shot games, evidence suggests that people often use evidential reasoning (ER), i.e., they assign diagnostic significance to their own actions in forming beliefs about the actions of other like-minded players. This is best viewed as a heuristic or bias relative to the standard approach. We provide a formal theoretical framework that incorporates ER into static games by proposing evidential games and the relevant solution concept: evidential equilibrium (EE). We derive the relation between a Nash equilibrium and an EE. We illustrate these concepts in the context of the prisoners’ dilemma game.

Keywords:

evidential reasoning; game theory; cognitive bias; prisoners’ dilemma game; oligopoly games; conservative heuristics; radical heuristics; decision makingJEL classifications:

D03 (behavioural microeconomics: underlying principles); C7 (game theory and bargaining theory)## 1. Introduction

A considerable body of evidence shows that the predictions of the standard equilibrium concepts in game theory are not borne out by a significant fraction of experimental subjects. See Camerer [1] for a book-length treatment. For prisoners’ dilemma games, see Lewis [2], Howard [3], Rapoport [4], Shafir and Tversky [5], Cooper et al. [6], Croson [7], Li and Taplin [8], Acevedo and Krueger [9], Busemeyer [10], Zhong et al. [11], Histrova and Grinberg [12] and Khadjavi and Lange [13]. For oligopoly games, see Fouraker and Siegel [14], Huck et al. [15], Bosch-Domènech and Vriend [16] and Duersch et al. [17]. For public goods games, see Dawes and Thaler [18], Fehr and Gächter [19] and Gächter and Thöni [20]. For voting games, see Quattrone and Tversky [21], Grafstein [22], Forsythe et al. [23], Rassenti et al. [24], Krueger and Acevedo [25], Koudenburg [26], Requate and Waichman [27] and Delavande and Manski [28]. For p-beauty contests, see Moulin [29], Nagel [30] and de Sousa et al. [31]. For auctions, see Ivanov [32]. For the hawk-dove game, see Rubinstein and Salant [33]. For the give-some game, see Krueger [34]. For further results on these, and other games, that present a challenge for both mainstream and behavioural game theory, see Lucas et al. [35].

In this paper, we are interested in static games of complete information. Examples include the prisoners’ dilemma, the voting game, the public goods game, the hawk-dove game and oligopoly games.

Let us briefly note the nature of the violations of the standard equilibrium concepts in some static games of interest. A more detailed treatment is given in the main body of the paper. More than half the subjects in the prisoners’ dilemma game play the dominated action “cooperate”. Voters vote in elections when it is clear to them that they will not be pivotal. Under traditional preferences, if there is a cost to voting, the act of voting is dominated by not-voting. The dominant action in public goods games is to free-ride. Yet, we can elicit near first-best levels of contributions with like-minded players.1

The evidence from the publications cited above suggests the following stylized facts that any reasonable theory of static games may aspire to explain. We consider the empirical evidence behind these claims in more detail in the main body of the paper.

- S1.
- A significant fraction of players behave in a manner that is consistent with the predictions of classical game theory. For instance, many players defect in a prisoners’ dilemma game; many people abstain from voting; and many people do not contribute at all in public goods games.
- S2.
- An even larger fraction of players violates the predictions of classical game theory, and they often seem to behave in an apparently non-strategic manner. For instance, the action “cooperate” in a prisoner’s dilemma game is a dominated action, but these actions by players, jointly, lead to higher payoffs. These findings are fairly robust even in the other static games that we do not consider in this paper.
- S3.
- In environments where players think that they are playing with other like-minded players, evidence shows that they impute diagnostic significance to their own actions when forming beliefs about the actions of others. For instance, those who cooperate (respectively defect) in prisoners’ dilemma games think that the vast majority of other players will also cooperate (respectively defect). Similarly, despite publicly-available information on election polls, those who vote Democrat (respectively Republican) in the U.S. Presidential elections believe that a significant majority of other voters will also vote Democrat (respectively Republican).

Each of the static games that we consider has an exceedingly simple structure. Hence, we believe that the anomalies relative to the predictions of classical game theory are less likely to arise from mistakenly playing the incorrect action. For this reason, our focus is not on behavioural alternatives, such as quantal response equilibrium (QRE) in which players play a noisy best response, but otherwise have consistent beliefs.

The main focus of our paper is on S3, which also allows us to shed light on S1 and S2. There is always considerable uncertainty about what others will do in one-shot games. Think of being in an experiment where you are playing a prisoner’s dilemma game, or a voting game, or a public goods game. How do you infer what the other players are likely to play? Considerable evidence, which we shall review later, suggests that people often use evidential reasoning (ER)2, i.e., they assign diagnostic significance to their own actions in forming beliefs about the actions of other like-minded players. We stress that a player using ER does not believe that his or her actions influence the actions of other players3. ER merely influences a player’s own belief about which unobserved action other players are likely to take.

ER is best viewed as a heuristic or bias. In the early 1970s, Kahneman and Tversky proposed the heuristics and biases approach. A substantial literature developed subsequently that identified a rich range of heuristics and generated evidence that they are used by human subjects. See, for instance, Kahneman et al. [36] and Kahneman [37]. ER, like these heuristics, is fast and frugal in the use/processing of information and in cognitive requirements. As with all heuristics, players who use ER may find ex post that their initial beliefs were incorrect. ER can lead to the violation of some of the principles of classical decision theory, for example Savage’s sure thing principle (Savage [38]). However, these violations have been well documented. For example, the sure thing principle is violated in the Ellsberg paradox (Ellsberg [39]; which, however, is a non-game theoretic situation).

Evidence supports the interpretation of evidential reasoning as a heuristic. People who use evidential reasoning are not aware of using it despite their behaviour being obviously consistent with evidential reasoning. Evidential reasoning appears to arise as an automatic, rather than a deliberate effort or intention, e.g., it does not require awareness. Evidence supporting this view comes from experiments that show that evidential reasoning was not hampered by cognitive load or time required to complete an action; see Krueger [40]. Furthermore, other evidence, also reported in Krueger [40], suggests that considerable cognitive effort is required to suspend evidential reasoning. The evidence from Acevedo and Krueger [9] indicates that evidential reasoning applies to human-human interaction, but not to human-non-human interaction. Players using evidential reasoning do not believe that their actions cause the action of others; it merely informs their belief about the actions taken by others. Another feature of evidential reasoning is that individuals continue to behave in a self-interested manner.

It might be useful to distinguish between to types of heuristics. The first involves no violation of the standard assumptions of game theory. They propose extra conditions that are consistent with the standard assumptions of game theory, but whose aim is to reduce the multiplicity of Nash equilibria. Examples include payoff dominance and risk dominance (Harsanyi and Selten [41]). Note that in some cases, these heuristics are in conflict (in some games, payoff-dominance could select one equilibrium, but risk-dominance could select another). In fact, the whole program of refinements of Nash (van Damme [42]) may be viewed in this light. We may call these conservative heuristics. The second type, which we may call radical heuristics, involve relaxation of some of the standard assumptions of game theory. Examples include Stackelberg reasoning (Colman and Bacharach [43]; Colman et al. [44]). Evidential reasoning is firmly in the group of radical heuristics.

Section 2 gives a formal treatment of evidential reasoning and proposes several concepts that we will find useful in the rest of the paper. An evidential game is simply a game where players use evidential reasoning. An evidential equilibrium is one where each player chooses to optimize given his beliefs about the behaviour of the other players (inferred from his own behaviour in accordance with evidential reasoning). A consistent evidential equilibrium is an evidential equilibrium where beliefs turn out to be correct. Our formulation of evidential reasoning yields causal reasoning, the mode of reasoning assumed in the traditional framework in economics (and, indeed, generally) as a special case. If players use causal reasoning, then a consistent evidential equilibrium corresponds to a Nash equilibrium in the ordinary sense. We introduce the concept of a social projection function (SPF) and give formal definitions of like-mindedness, ingroup and outgroup.

Section 3 argues that evidential reasoning is a useful heuristic rather than a valid method of inference. Section 4, Section 5 and Section 6 show that evidential reasoning can answer the following questions: Why do people voluntarily contribute to public goods? Why do people vote? Why is there so much cooperation in the prisoners’ dilemma game? Section 7 examines the uniqueness of outcomes under evidential reasoning in the context of the Nash demand game. Oligopoly games are considered in Section 8. In Section 9, we argue that causal reasoning cannot adequately explain cooperation in the prisoners’ dilemma game. Section 10 concludes.

## 2. Evidential Equilibrium in Static Games of Complete Information

#### 2.1. Elements of Standard Game Theory4

Consider the following standard description of a static game of complete information, $\left\{N,\mathbf{A},\mathit{\pi}\right\}$. $N=\left\{1,2,...,n\right\}$ is the set of players. ${A}_{i}\subseteq \mathbb{R}$ is the set of actions open to player i. We denote a typical member of ${A}_{i}$ by ${a}_{i}$. $\mathbf{A}={\times}_{i=1}^{n}{A}_{i}$ gives all possible action profiles of the players. ${\mathbf{A}}_{-i}\subseteq {\mathbb{R}}^{n-1}$ is the set of vectors of actions open to the other players. Denote by ${S}_{i}$ the set of probability distributions over the set of actions ${A}_{i}$. We denote a typical element of ${S}_{i}$ by ${s}_{i}$ and call it a strategy. ${s}_{i}\left({a}_{i}\right)$ is the probability with which player i plays ${a}_{i}\in {A}_{i}$, so ${s}_{i}\left({a}_{i}\right)\ge 0$ and ${\sum}_{{a}_{i}\in {A}_{i}}{s}_{i}\left({a}_{i}\right)=1$. In particular, if ${s}_{i}\left({a}_{i}\right)=1$ (hence, ${s}_{i}\left({a}_{i}^{\prime}\right)=0$ for ${a}_{i}^{\prime}\ne {a}_{i}$), then we call s a pure strategy, and we identify it with the action ${a}_{i}$.

A profile of strategies of all players is denoted by

**s**$=\left({s}_{1},{s}_{2},\dots {s}_{n}\right)\in $**S**, where**S**$={\times}_{i=1}^{n}{S}_{i}$ is the set of all possible profiles of strategies. A particular profile of strategies of other players is denoted by**s**${}_{-i}=\left({s}_{1},\dots ,{s}_{i-1},{s}_{i+1},\dots ,{s}_{n}\right)\in $**S**${}_{-i}={\times}_{j\in N-\left\{i\right\}}{S}_{j}$. The payoff to player i is a mapping ${\pi}_{i}:$**S**$\to \mathbb{R}$. Let**π**be the vector of payoffs to all players. Given a strategy profile,**s**$=\left({s}_{i},{\mathbf{s}}_{-i}\right)\in $**S**, the payoff to player i is ${\pi}_{i}\left({s}_{i},{\mathbf{s}}_{-i}\right)\in \mathbb{R}$. The structure of the game, $\left\{N,\mathbf{A},\mathit{\pi}\right\}$, is common knowledge among the players. In an experimental setup, common knowledge can be achieved by a public announcement of $\left\{N,\mathbf{A},\mathit{\pi}\right\}$. This is the sense in which this is a game of complete information. However, when each player, i, chooses his strategy, ${s}_{i}$, he does not know the strategies,**s**${}_{-i}$, that have been, or will be, chosen by the other players. This is the sense in which this is a static game.**Definition 1.**: ${s}_{i}^{*}\in {S}_{i}$ is a dominant strategy for player $i\in N$ if ${\pi}_{i}\left({s}_{i}^{*},{\mathbf{s}}_{-i}\right)\ge {\pi}_{i}\left({s}_{i},{\mathbf{s}}_{-i}\right)$ for each ${s}_{i}\in {S}_{i}$ and each

**s**${}_{-i}\in $

**S**${}_{-i}$. If ${\pi}_{i}\left({s}_{i}^{*},{\mathbf{s}}_{-i}\right)>{\pi}_{i}\left({s}_{i},{\mathbf{s}}_{-i}\right)$ for each ${s}_{i}\in {S}_{i}-\left\{{s}_{i}^{*}\right\}$ and each

**s**${}_{-i}\in $

**S**${}_{-i}$, then ${s}_{i}^{*}$ is a strictly dominant strategy for player i.

**Definition 2.**(Nash [46,47]): A strategy profile

**s**${}^{*}=\left({s}_{1}^{*},{s}_{2}^{*},\dots ,{s}_{n}^{*}\right)\in $

**S**is a Nash equilibrium in the game $\Gamma =\left\{N,\mathbf{A},\mathit{\pi}\right\}$ if ${s}_{i}^{*}$ maximizes ${\pi}_{i}\left({s}_{i},{\mathbf{s}}_{-i}^{*}\right)$ with respect to ${s}_{i}$, given

**s**${}_{-i}^{*}$, for each $i\in N$, i.e.,

$${\pi}_{i}\left({s}_{i}^{*},{\mathbf{s}}_{-i}^{*}\right)\ge {\pi}_{i}\left({s}_{i},{\mathbf{s}}_{-i}^{*}\right)\phantom{\rule{4.pt}{0ex}}\text{for}\phantom{\rule{4.pt}{0ex}}\text{all}\phantom{\rule{4.pt}{0ex}}{s}_{i}\in {S}_{i}$$

Note that there is no role for beliefs about the strategies of others in the game $\left\{N,\mathbf{A},\mathit{\pi}\right\}$, nor in the definition of a Nash equilibrium (Definition 2). Hence, we augment the game $\left\{N,\mathbf{A},\mathit{\pi}\right\}$ with a profile of “social projection functions”, $\mathbf{P}$, that specify the beliefs of players; this is undertaken in Section 2.2, below.

#### 2.2. Social Projection Functions

We would like to define a function that captures the beliefs that a player has about the strategies of the other players, conditional on his own strategy. We will call such a function a social projection function.5

**Definition 3.**(Social projection functions (SPF)): A social projection function for player i is a mapping ${\mathbf{P}}_{i}$: ${S}_{i}\to $

**S**${}_{-i}$ that assigns to each strategy, ${s}_{i}\in {S}_{i}$, for player i, an $n-1$ vector of strategies for the other players. We write ${P}_{ij}\left({a}_{j}|{s}_{i}\right)$ for the subjective belief of player i that player j plays ${a}_{j}\in {A}_{j}$, conditional on player i playing ${s}_{i}\in {S}_{i}$. Hence, ${P}_{ij}\left({a}_{j}|{s}_{i}\right)\ge 0$ and ${\sum}_{{a}_{j}\in {A}_{j}}{P}_{ij}\left({a}_{j}|{s}_{i}\right)=1$. We may write ${\mathbf{P}}_{i}\left({s}_{i}\right)=$

**s**${}_{-i}^{e}\left({s}_{i}\right)$ to indicate that ${\mathbf{P}}_{i}\left({s}_{i}\right)$ is the $\left(n-1\right)$ vector of strategies that player i anticipates that the other players will follow if player i adopts the strategy ${s}_{i}$.

We now define causal reasoning, the mode of reasoning assumed in classical game theory. Then, we define evidential reasoning.

**Definition 4.**(Causal reasoning): We say that player i uses causal reasoning if ${P}_{ij}\left({a}_{j}|{s}_{i}\right)$ is independent of ${s}_{i}$ for each ${a}_{j}\in {A}_{j}$ and each $j\ne i$, i.e., if ${\mathbf{P}}_{i}\left({s}_{i}\right)={\mathbf{P}}_{i}\left({s}_{i}^{\prime}\right)$ for all ${s}_{i},{s}_{i}^{\prime}\in {S}_{i}$.

**Definition 5.**(Evidential reasoning) We say that player i uses evidential reasoning if it is not necessarily the case that ${\mathbf{P}}_{i}\left({s}_{i}\right)={\mathbf{P}}_{i}\left({s}_{i}^{\prime}\right)$ for all ${s}_{i},{s}_{i}^{\prime}\in {S}_{i}$.

Remark 1: (a) In a static game of complete information, players are uncertain of the actions taken by others. Under evidential reasoning, player i resolves this uncertainty by assigning diagnostic significance to his or her own choice of strategy, ${s}_{i}$, in inferring the strategies of the other players,

**s**${}_{-i}$, using his or her social projection function, ${\mathbf{P}}_{i}$. For this reason, Definition 5 allows for ${\mathbf{P}}_{i}\left(.|{s}_{i}\right)$ to change as ${s}_{i}$ changes. However, it is crucially important to realize that there is no causal connection between ${s}_{i}$ and**s**${}_{-i}$. The choice of ${s}_{i}$ by player i merely influences that player’s belief about the strategies,**s**${}_{-i}$, of the other players. In particular, players who use evidential reasoning know that their own actions have no causal effects in altering the actions of others when they change their own actions.(b) An SPF (Definition 3) specifies the beliefs of a player for all possible actions of others, including out-of-equilibrium actions. The beliefs of a player need not turn out to be fulfilled in equilibrium. In this respect, ER is similar to other disequilibrium-in-belief models, such as the level-k model (Example 3, below).

(c) If player i uses causal reasoning as in classical game theory (see Definition 4), then he or she assigns no diagnostic significance to his or her own strategy, ${s}_{i}$, in inferring the strategies,

**s**${}_{-i}$, followed by the other players. Thus, under causal reasoning, ${\mathbf{P}}_{i}\left({s}_{i}\right)$ remains fixed as ${s}_{i}$ changes. From Definitions 4 and 5, causal reasoning is a special case of evidential reasoning.In a dynamic game (under causal reasoning), if (say) Player 1 moves first, choosing the strategy ${s}_{1}$, followed by Player 2, who chooses strategy ${s}_{2}$, having observed a realization of ${s}_{1}$, then ${s}_{2}$ may very well depend on ${s}_{1}$. When choosing ${s}_{1}$, Player 1 will take into account the influence of his choice on the future behaviour of Player 2. This should not be confused with evidential reasoning.

Many different types of social projection functions (SPFs) are possible. There are possibly various degrees of like-mindedness. However, a particularly salient SPF is one where a player believes that a like-minded player will play a strategy that is identical to his own. We call this the identity social projection function. This SPF seems important when choices are low dimensional and players play symmetric games. Examples include cooperate or defect in a prisoners’ dilemma game, vote Democrats or Republicans in U.S. Presidential elections, coordinate or fail to coordinate in coordination games or play hawk or dove in the hawk-dove game.

**Definition 6.**(Identity social projection function): Let $M\subseteq N$ be a subset of players. Suppose that all players in M have the same action set, i.e., ${A}_{i}={A}_{j}=A$ for all $i,j\in M$. Let ${\mathbf{P}}_{i}$ be the social projection function for player $i\in M$. Recall that ${P}_{ij}\left(a|{s}_{i}\right)$ is the probability that player i assigns to player j playing action a when the strategy of player i is given by ${s}_{i}$. If ${P}_{ij}\left(a|{s}_{i}\right)={s}_{i}\left(a\right)$ for all $a\in A$ and all $j\in M-\left\{i\right\}$, then we say that ${\mathbf{P}}_{i}$ is an identity social projection function on M. If $M=N$, then we say that ${\mathbf{P}}_{i}$ is an identity social projection function.

Example 1 (self-similarity in the hawk-dove game): Consider the following game between two players. If both choose hawk (H), then each gets zero. If both choose dove (D), then each gets two. If one chooses H (the hawk) and the other chooses D (the dove), then the hawk gets three and the dove gets one. These payoffs are summarized by Table 1, where the row player plays D with probability $p\in \left[0,1\right]$, while the column player plays D with probability $q\in \left[0,1\right]$.

$D\left(q\right)$ | $H\left(1-q\right)$ | |
---|---|---|

$D\left(p\right)$ | $2,2$ | $1,3$ |

$H\left(1-p\right)$ | $3,1$ | $0,0$ |

Under causal reasoning (Definition 4), the row player should use a constant social projection function. However, the experimental results reported by Rubinstein and Salant [33] show that the higher the probability, p, with which the row player plays D, the higher the probability, q, she or he thinks the column player will play D. This can be formalized by the row player adopting the identity social projection function (Definition 6):

$${P}_{12}\left(D|p\right)=p$$

#### 2.3. Ingroups, Outgroups and Evidential Reasoning

Players need not impute diagnostic significance to their actions when others are perceived not to be like-minded; the next definition formalizes this idea.

**Definition 7.**(Ingroups and outgroups): Suppose that players use evidential reasoning.

(a) Player i regards player j ($j\ne i$) as an outgroup member if ${P}_{ij}\left({a}_{j}|{s}_{i}\right)$ is independent of ${s}_{i}$, i.e., if ${P}_{ij}\left({a}_{j}|{s}_{i}\right)={P}_{ij}\left({a}_{j}|{s}_{i}^{\prime}\right)$ for all ${s}_{i},{s}_{i}^{\prime}\in {S}_{i}$ and all ${a}_{j}\in {A}_{j}$. Otherwise, player i regards player j ($j\ne i$) as an ingroup member.

(b) Let $M\subset N$ be a non-empty set of players. If every player in M regards every other player in M as an ingroup member, then M is an ingroup.

(c) Let $L\subset N$ and $M\subset N$ be disjoint non-empty sets of players. Suppose every player in L regards every player in M as an outgroup member. Then, we say that M is an outgroup relative to L.

Remark 2: Player i plays action ${a}_{i}\in {A}_{i}$ with probability ${s}_{i}\left({a}_{i}\right)$ and believes that player j will play action ${a}_{j}\in {A}_{j}$ with probability ${P}_{ij}\left({a}_{j}|{s}_{i}\right)$ (the latter is conditional on ${s}_{i}$). Hence, player i believes that the joint probability of ${a}_{i}$ and ${a}_{j}$ being played is ${P}_{ij}\left({a}_{i},{a}_{j}|{s}_{i}\right)={s}_{i}\left({a}_{i}\right){P}_{ij}\left({a}_{j}|{s}_{i}\right)$. Suppose that player i regards player j as an outgroup member. Then (and only then), ${P}_{ij}\left({a}_{j}|{s}_{i}\right)$ is independent of ${s}_{i}\in {S}_{i}$. Hence, in this case, ${P}_{ij}\left({a}_{i},{a}_{j}|{s}_{i}\right)={s}_{i}\left({a}_{i}\right){P}_{ij}\left({a}_{j}|{s}_{i}\right)$. Thus, if player i regards player j as an outgroup member, then player i believes that the probability with which he or she (player i) plays ${a}_{i}\in {A}_{i}$ is independent of the probability that he or she believes j will play ${a}_{j}\in {A}_{j}$. In particular, if player i uses causal reasoning, then he or she regards all others as outgroup members, and hence, he or she believes that his or her actions are independent of the actions of all other players.

**Definition 8.**(Perfect ingroups): Let $M\subseteq N$ be a subset of players. Suppose that all players in M have the same action set, i.e., ${A}_{i}={A}_{j}=A$ for all $i,j\in M$. Let ${\mathbf{P}}_{i}$ be the social projection function for player $i\in M$. If ${\mathbf{P}}_{i}$ is an identity social projection function on M, for each player $i\in M$, then M is a perfect ingroup.

**Definition 9.**(Evidential game): Consider the static game of complete information, $\left\{N,\mathbf{A},\mathit{\pi}\right\}$. Let $\mathbf{P}=\left({\mathbf{P}}_{1},{\mathbf{P}}_{2},\dots ,{\mathbf{P}}_{n}\right)$ be a profile of social projection functions, where ${\mathbf{P}}_{i}$ is the social projection function of player $i\in N$ (Definition 3). Then, we denote the game augmented with the vector of social projection functions, $\mathbf{P}$, by $\Gamma =\left\{N,\mathbf{A},\mathit{\pi},\mathbf{P}\right\}$, and we call it an evidential game. We say that players in such a game use evidential reasoning.

**Definition 10.**If each ${\mathbf{P}}_{i}\left({s}_{i}\right)$ is independent of ${s}_{i}$, then we say that Γ is a causal game.

Remark 3: (a)From Definition 9, a causal game is a special case of an evidential game.

(b) Suppose ${\mathbf{P}}_{i}\left({s}_{i}\right)$ is independent of ${s}_{i}$, for each player, i, so that $\Gamma =\left\{N,\mathbf{A},\mathit{\pi},\mathbf{P}\right\}$ is a causal game. Γ is still richer than the static game of complete information, $\left\{N,\mathbf{A},\mathit{\pi}\right\}$, because Γ incorporates players’ beliefs about other players’ actions, as given by $\mathbf{P}$.

Example 2 (Matching pennies): Consider the matching pennies game.

H | T | |

H | −1, 1 | 1, −1 |

T | 1, −1 | −1, 1 |

The set of players is $N=\left\{1,2\right\}$. The action sets are ${A}_{1}={A}_{2}=\left\{H,T\right\}$. Player 1, the row player, plays H and T with respective probabilities p, $1-p$. Player 2, the column player, plays H and T with respective probabilities q, $1-q$. The sets of possible strategies are ${S}_{1}=\left\{p:0\le p\le 1\right\}$ for Player 1 and ${S}_{2}=\left\{q:0\le q\le 1\right\}$ for Player 2. If Player 1 plays H (say) and Player 2 plays T, then the payoff is one to Player 1 and $-1$ to Player 2. For any profile of strategies $\left(p,q\right)$, $p,q\in \left[0,1\right]$, the payoff functions of the players are ${\pi}_{1}\left(p,q\right)=-\left(1-2p\right)\left(1-2q\right)$ and ${\pi}_{2}\left(p,q\right)=\left(1-2p\right)\left(1-2q\right)$. The following are examples of social projection functions:

$$\left\{\begin{array}{c}{P}_{12}\left(H\mid p\right)=p\text{,}\phantom{\rule{4.pt}{0ex}}\text{for}\phantom{\rule{4.pt}{0ex}}\text{all}\phantom{\rule{4.pt}{0ex}}p\in \left[0,1\right]\phantom{\rule{4.pt}{0ex}}\\ {P}_{21}\left(H\mid q\right)=0.5\text{,}\phantom{\rule{4.pt}{0ex}}\text{for}\phantom{\rule{4.pt}{0ex}}\text{all}\phantom{\rule{4.pt}{0ex}}q\in \left[0,1\right]\end{array}\right.$$

According to Equation (2), Player 1, who uses evidential reasoning, believes that if he or she (Player 1) plays H with probability p, then so will Player 2 for any $p\in \left[0,1\right]$. Hence, Player 1 has an identity social projection function. It is critical to note that these are the “beliefs” of Player 1. There is no presumption that these beliefs will turn out to be justified ex post. Player 2, who uses causal reasoning, believes that Player 1 will play H with probability $0.5$, whatever strategy, q, Player 2 chooses. Hence, Player 1 regards Player 2 as an ingroup member, but Player 2 regards Player 1 as an outgroup member, so $N=\left\{1,2\right\}$ fails to be an ingroup. On the other hand, if both players had identity social projection functions, then $N=\left\{1,2\right\}$ would be an ingroup (in fact, a perfect ingroup). By contrast, if in Equation (2) we had, say, ${P}_{12}\left(H\mid p\right)=0.3$ for all $p\in \left[0,1\right]$, then both players would exhibit causal reasoning; and this example would become a causal game.

Example 3 (An application of the level-k model to the p-beauty contest6): A large number of contestants are asked to choose an integer between zero and 100, inclusive. It is announced that the winner is the one who gives an answer closest to $\frac{2}{3}$ the average of the choices of all of the other players. A Level 0 player simply makes a random uniform choice. Thus, if player j is of Level 0, then the action of player j is ${a}_{j}=50$. A Level 1 player assumes all other players are level-0 and, therefore, assumes all other players will choose 50. Thus, if player i is Level 1, his social projection function will be ${P}_{ij}\left({a}_{j}|{s}_{i}\right)=1$ for ${a}_{j}=50$ or ${P}_{ij}\left({a}_{j}|{s}_{i}\right)=0$ for ${a}_{j}\ne 50$. Thus, the optimal choice for player i, if i is Level 1, is ${a}_{i}=\frac{100}{3}\simeq 33.\phantom{\rule{0.166667em}{0ex}}333$. A Level 2 player assumes all other players are Level 1 and, therefore, assumes all other players will choose $\frac{100}{3}$. Thus, if player i is Level 2, his or her social projection function will be ${P}_{ij}\left({a}_{j}|{s}_{i}\right)=1$ for ${a}_{j}=\frac{100}{3}$ or ${P}_{ij}\left({a}_{j}|{s}_{i}\right)=0$ for ${a}_{j}\ne \frac{100}{3}$. Thus, the optimal choice for player i, if i is Level 2, is ${a}_{i}=\frac{200}{9}\simeq 22.\phantom{\rule{0.166667em}{0ex}}222$, and so on. Note that all players use causal reasoning, since, for each player, ${P}_{ij}\left({a}_{j}|{s}_{i}\right)$ is independent of ${s}_{i}$.

Intuitively, an “ingroup” is a group of players each of whom believes that the others are like-minded and, hence, would behave in a similar, but not necessarily identical, manner. The literature has typically assumed that players do not use their own actions as diagnostic of the actions of “outgroup” players; see, for instance, Robbins and Krueger [48] and Krueger [40]; and our definitions reflect this (see, in particular, Definition 7).

However, recent evidence suggests a more nuanced view that is also consistent with our definitions. Koudenburg et al. [26] show that voters project their own preference for a political party to non-voters even when they are informed about the poll results for non-voters. Thus, voters may regard non-voters as ingroup members, though in a strict sense, only the set of voters may be thought to form an ingroup (Definition 7).

Riketta and Sacramento [49] cite several references to show that members of an ingroup assign beliefs about other members even when they could have no possible information about those members. They find that an ingroup member may have a harmonious (or cooperative) relation with other members. On the other hand, they also find that an ingroup member may be in competition (or conflict) with other members. In the latter case, an ingroup member may believe that the actions of others are in contrast to his own actions (the contrast effect). The following is an illustration.

Example 4 (the contrast effect): Consider the matching pennies game of Example 2. Instead of Equation (2), consider the following social projection functions:

$$\left\{\begin{array}{c}{P}_{12}\left(H\mid p\right)=1-p\text{,}\phantom{\rule{4.pt}{0ex}}\text{for}\phantom{\rule{4.pt}{0ex}}\text{all}\phantom{\rule{4.pt}{0ex}}p\in \left[0,1\right]\phantom{\rule{4.pt}{0ex}}\\ {P}_{21}\left(H\mid q\right)=1-q\text{,}\phantom{\rule{4.pt}{0ex}}\text{for}\phantom{\rule{4.pt}{0ex}}\text{all}\phantom{\rule{4.pt}{0ex}}q\in \left[0,1\right]\end{array}\right.$$

According to Equation (3), Player 1 believes that if he or she (Player 1) plays H with (say) probability one, then Player 2 will play H with probability zero; and similarly for Player 2. Both players use evidential reasoning, but exhibit the contrast effect. Each player regards the other as an ingroup member. Hence, $N=\left\{1,2\right\}$ is an ingroup. However, N fails to be a perfect ingroup because players are not using their identity social projection functions.

#### 2.4. Equilibria

In this section, we propose an appropriate equilibrium concept for static evidential games of complete information. We call this an evidential equilibrium.

**Definition 11.**(Optimal strategies): An optimal strategy for player i, ${s}_{i}^{*}\in {S}_{i}$, in the evidential game $\Gamma =\left\{N,\mathbf{A},\mathit{\pi},\mathbf{P}\right\}$ (Definition 9), is one that maximizes the payoff function, ${\pi}_{i}\left({s}_{i},{\mathbf{P}}_{i}\left({s}_{i}\right)\right)$ of player i.

**Definition 12.**(Evidential equilibria): The strategy profile

**s**${}^{*}=\left({s}_{1}^{*},{s}_{2}^{*},\dots ,{s}_{n}^{*}\right)\in $

**S**is an evidential equilibrium of the evidential game $\Gamma =\left\{N,\mathbf{A},\mathit{\pi},\mathbf{P}\right\}$ if ${s}_{i}^{*}$ is an optimal strategy for each $i\in N$ (Definition 11).

Definitions 11 and 12 identify an important feature of an evidential equilibrium. In static games of complete information, under uncertainty about what others will do, evidential reasoning converts an essentially strategic situation into a non-strategic problem. This is made possible by treating the social projection function as an essential part of the game. This appears to be consistent with the evidence (see Section 6.3 and Section 9.5). There are no higher order beliefs. Players do not think about strategically exploiting the SPF of the other player. Indeed, the game $\mathbf{\Gamma}=\left\{N,\mathbf{A},\mathit{\pi},\mathbf{P}\right\}$ does not involve any assumptions about the mutual or common knowledge of $\mathbf{P}$. Requiring ${\pi}_{i}$ and ${\mathbf{P}}_{i}$ to be continuous functions on compact mixed strategy spaces guarantees that an equilibrium exists in the non-strategic problem.

Note that Definition 12 only requires that a strategy for a player be optimal given his beliefs. However, of course, beliefs may turn out to be wrong, ex post. Ultimately, the choice among models in all science is guided by the evidence. The evidence reviewed above (and below) shows that in static games, beliefs about others often turn out to be incorrect. Even in experiments, where successive rounds of play lead to an improvement in the accuracy of beliefs, one may be interested in explaining the behaviour in early rounds of play where beliefs do not turn out to be correct. Often such behaviour mimics real-life situations in which decision makers do not get repeated or frequent opportunities to make their decisions. Nevertheless, it is of interest to consider the special case where beliefs turn out to be correct, at least in equilibrium. This is the subject of the next two definitions.

**Definition 13.**(Mutually consistent strategies): A strategy profile

**s**${}^{*}=\left({s}_{1}^{*},{s}_{2}^{*},\dots ,{s}_{n}^{*}\right)\in $

**S**of the evidential game $\Gamma =\left\{N,\mathbf{A},\mathit{\pi},\mathbf{P}\right\}$ (Definition 9) is a mutually consistent vector of strategies if ${\mathbf{P}}_{i}\left({s}_{i}^{*}\right)=$

**s**${}_{-i}^{*}$ for all $i\in N$, i.e., if ${P}_{ij}\left({a}_{j}|{s}_{i}^{*}\right)={s}_{j}^{*}\left({a}_{j}\right)$, for all $i,j\in N$, $i\ne j$, and all ${a}_{j}\in {A}_{j}$.

In other words, a strategy profile

**s**${}^{*}=\left({s}_{1}^{*},{s}_{2}^{*},\dots ,{s}_{n}^{*}\right)$ is a mutually consistent vector of strategies if, for all players $i,j\in N$, $i\ne j$, and all actions, ${a}_{j}$, open to player j, the probability ${P}_{ij}\left({a}_{j}|{s}_{i}^{*}\right)$ that player i believes that player j will play action ${a}_{j}$ (given ${s}_{i}^{*}$) is equal to the actual probability ${s}_{j}^{*}\left({a}_{j}\right)$ with which player j plays ${a}_{j}$.**Definition 14.**(Consistent evidential equilibria): A consistent evidential equilibrium of the evidential game $\Gamma =\left\{N,\mathbf{A},\pi ,\mathbf{P}\right\}$ is an evidential equilibrium,

**s**${}^{*}\in $

**S**, which is also a mutually consistent vector of strategies (Definitions 12 and 13).

#### 2.5. Nash Equilibria and Consistent Evidential Equilibria

As one might expect, there is a natural correspondence between Nash equilibria and consistent evidential equilibria. This is formally stated and established by the following proposition.

**Proposition 1.**: (a) Let

**s**${}^{*}\in $

**S**be a Nash equilibrium in the static game of complete information, $\left\{N,\mathbf{A},\mathit{\pi}\right\}$. Consider the (constant) social projection functions: ${\mathbf{P}}_{i}\left({s}_{i}\right)=$

**s**${}_{-i}^{*}$, $i\in N$. Then,

**s**${}^{*}$ is a consistent evidential equilibrium in the evidential game $\Gamma =\left\{N,\mathbf{A},\mathit{\pi},\mathbf{P}\right\}$. Furthermore, Γ is a causal game.

(b) Let

**s**${}^{*}\in $**S**be an evidential equilibrium in the evidential game $\Gamma =\left\{N,\mathbf{A},\mathit{\pi},\mathbf{P}\right\}$, where $\mathbf{P}$ is the profile of constant social projection functions ${\mathbf{P}}_{i}\left({s}_{i}\right)=$**s**${}_{-i}^{*}$, $i\in N$ (hence,**s**${}^{*}$ is a consistent evidential equilibrium and Γ is a causal game). Then,**s**${}^{*}$ is a Nash equilibrium in the static game of complete information $\left\{N,\mathbf{A},\mathit{\pi}\right\}$.Proof of Proposition 1: (a) Let

**s**${}^{*}\in $**S**be a Nash equilibrium in the static game of complete information, $\left\{N,\mathbf{A},\mathit{\pi}\right\}$. Consider the social projection functions: ${\mathbf{P}}_{i}\left({s}_{i}\right)=$**s**${}_{-i}^{*}$, $i\in N$. Since**s**${}^{*}$ is a Nash equilibrium (Definition 2), it follows that ${s}_{i}^{*}$ maximizes ${\pi}_{i}\left({s}_{i},{\mathbf{s}}_{-i}^{*}\right)$ with respect to ${s}_{i}$, given**s**${}_{-i}^{*}$, for each $i\in N$. Since, by construction, ${\mathbf{P}}_{i}\left(.|{s}_{i}\right)=$**s**${}_{-i}^{*}$, $i\in N$, it follows that ${s}_{i}^{*}$ maximizes ${\pi}_{i}\left({s}_{i},{\mathbf{P}}_{i}\left({s}_{i}\right)\right)$ with respect to ${s}_{i}$, for each $i\in N$. Hence,**s**${}^{*}$ is an evidential equilibrium (Definitions 11 and 12) in the evidential game $\Gamma =\left\{N,\mathbf{A},\pi ,\mathbf{P}\right\}$. Furthermore, since, by construction, ${\mathbf{P}}_{i}\left({s}_{i}\right)=$**s**${}_{-i}^{*}$, $i\in N$, it follows that**s**${}^{*}$ is a consistent evidential equilibrium (Definitions 13 and 14). Since $\mathbf{P}$ is a profile of constant social projection functions, it follows that $\Gamma =\left\{N,\mathbf{A},\mathit{\pi},\mathbf{P}\right\}$ is a causal game (Definition 9).(b) Let

**s**${}^{*}\in $**S**be an evidential equilibrium in the evidential game $\Gamma =\left\{N,\mathbf{A},\mathit{\pi},\mathbf{P}\right\}$, where $\mathbf{P}$ is the profile of constant social projection functions ${\mathbf{P}}_{i}\left({s}_{i}\right)=$**s**${}_{-i}^{*}$, $i\in N$. Then, ${s}_{i}^{*}$ maximizes ${\pi}_{i}\left({s}_{i},{\mathbf{P}}_{i}\left({s}_{i}\right)\right)$ with respect to ${s}_{i}$, for each $i\in N$ (Definitions 11 and 12). However, ${\mathbf{P}}_{i}\left({s}_{i}\right)=$**s**${}_{-i}^{*}$, $i\in N$; hence, ${s}_{i}^{*}$ maximizes ${\pi}_{i}\left({s}_{i},{\mathbf{s}}_{-i}^{*}\right)$ with respect to ${s}_{i}$, for each $i\in N$. Hence,**s**${}^{*}$ is a Nash equilibrium in the static game of complete information $\left\{N,\mathbf{A},\mathit{\pi}\right\}$ (Definition 2). ■.## 3. Calvinism and the Development of Capitalism

Consider the following example from Quattrone and Tversky [21].7 According to the Calvinist doctrine of predestination, those who are to be saved have been chosen by God at the beginning of time, and nothing that one can do will lead to salvation unless one has been chosen. Although one cannot increase the chance of salvation by good works, one can produce diagnostic evidence of having been chosen by engaging in acts of piety, devotion to duty, hard work and self denial. According to Max Weber, this is exactly how millions of people responded to the Calvinist doctrine and why capitalism developed more quickly in Protestant rather than Catholic countries; see Weber [50], McClelland [51] and Nozick [52].

#### 3.1. A Normal Form

Consider the following simultaneous move game between two players: God and man. God decides whether to save man ($Yes$) or not to save man ($No$). Man decides whether to do good works ($GW$) or not ($NGW$). When making his choice, man does not know whether he has been saved or not. Being saved is better than not being saved. On the other hand, doing good works is onerous and/or costly. Therefore, if man knew he has been saved, then he would prefer not to do good works. Likewise, if man knew he had not been saved, then he would again prefer not to do good works. Thus, by Savage’s sure thing principle (Savage, [38]), man should prefer not to do good works even if he did not know whether he has been saved or not. Table 2 is a possible payoff matrix for this game, where God is the row player and man is the column player. The numbers in the cells are the payoffs to man.

$\mathit{GW}$ | $\mathit{NGW}$ | |

$Yes$ | 2 | 3 |

$No$ | 0 | 1 |

Clearly, not doing good works ($NGW$) is dominant for man over doing good works ($GW$), whether man knows he has been saved ($Yes$) or not ($No$). More explicitly, let $p=$ the probability that God has saved man $\left(Yes\right)$, and let $q=$ the probability that man does good works $\left(GW\right)$. Then, the expected payoff to man is:
which is, clearly, maximized when $q=0$, i.e., when man does not do good works; whatever probability, p, man assigns to having been saved by God.

$$\pi \left(q,p\right)=p\left(2q+3\left(1-q\right)\right)+\left(1-p\right)\left(1-q\right)=1+2p-q$$

#### 3.2. A Social Projection Function

Consider, for example, the social projection function:

$${P}_{21}\left(Yes|q\right)=q$$

Recall that a strategy for man is fully specified by the probability, $q\in \left[0,1\right]$, with which he does good works ($GW$). Likewise, a strategy for God is fully specified by the probability, $p\in \left[0,1\right]$, with which God saves man ($Yes$). Thus, Equation (5) says that man believes that if he does good works with probability q, then that indicates to him or her that he or she has been saved by God with probability $p=q$. Substituting $p=q$ in Equation (4), we get:
which is, clearly, maximized when $q=1$, i.e., when man does do good works.

$$\pi \left(q,q\right)=1+q$$

#### 3.3. Is Evidential Reasoning Rational or Is It a Useful Heuristic?

If man knew that he had been saved by God, then his optimal action would be not to do good works, because the latter is onerous and nothing he can do can influence the decision that God has already made. Likewise, if man knew that he had not been saved by God, then, again, his optimal action would be not to do good works. Here, evidential reasoning has no role to play.

However, the situation is quite different if man did not know whether he had been saved by God or not. Here, evidential reasoning plays a role by assuring man that if he does good works, then this indicates to him that he has been saved. Note that evidential reasoning is not a valid mode of inference. For if it is best not to do good words, if you are saved and also if you are not saved, then, according to classical logic, it is best not to do good works. This can be formally proven using the tautology $\left(P\Rightarrow Q\right)\Rightarrow \left[\left(not\phantom{\rule{4.pt}{0ex}}P\Rightarrow Q\right)\Rightarrow Q\right]$. Therefore, it is best to view evidential reasoning as a heuristic or bias. Our reading of the evidence suggests that evidential reasoning can explain people’s irrational, but systematic behaviour in static games. For a further discussion, see Colman and Pulford [53] and Colman et al. [44].

## 4. Why Do People Voluntarily Contribute to a Public Good?

The results from public goods games are now well established; see, for instance, Dawes and Thaler [18] and Fehr and Gächter [19]. What accounts for the high rates of cooperation in the initial round of a public goods game? Under evidential reasoning, many players may take their own desire to contribute in the first round as diagnostic of the probability with which other players will contribute; hence, they contribute. For the dynamic version of the public goods game, we conjecture that evidential reasoning in conjunction with negative reciprocity gives a better description of the evidence from public goods games.

Gächter and Thöni [20] investigate whether cooperation in public goods games with voluntary contribution is higher among like-minded people. In order to separate the subjects into like-minded people, they initially run a single-round of the public goods experiment. The subjects are then grouped by the amount of contributions they made in this round. For instance, the top 3 contributors are grouped into a separate group (the TOP group) as having the greatest inclination to contribute. The public good game is then played separately in each group. Over the next 10 rounds, contributions are much higher and free riding much lower in the TOP group, which achieves nearly the first best level in several rounds. Even the endgame effect, i.e., the sharp drop in contributions in the last round, is most pronounced among the bottom group. Evidential reasoning would predict that when high contributors have greater confidence that they are grouped with like-minded people, they contribute more; hence, the pattern of these results is consistent with evidential reasoning.

## 5. Why Do People Vote and How Do They Form Beliefs?

Under causal reasoning, any one voter is most unlikely to be pivotal, so nobody should vote. However, then why do so many people vote? This is the voting paradox.

The explanation of voting when a voter uses evidential reasoning is as follows. “If I do not vote for my preferred party, then probably like-minded people will not vote, and my preferred party will lose to the other party. On the other hand, if I decide to vote then, probably, other like-minded people will also make a similar decision and my party has a better chance of winning. So I vote if I wish my party to win, otherwise I do not.” In each case, the binary voting decision (vote or not vote) has diagnostic significance in forming beliefs about whether other like-minded people will vote, although it is critical to note that one’s action to vote does not cause others to vote. Other possible explanations for voting, for instance, that people vote out of a sense of civic duty, cannot explain several kinds of strategic voting and the variation in voter turnout when an election is believed to be close; see Krueger and Acevedo [25].

As Krueger and Acevedo [25], p. 468, put it: “Compared with a Republican who abstains, for example, a Republican who votes can be more confident that other Republicans vote in large numbers”. Quattrone and Tversky [21], Grafstein [22] and Koudenburg et al. [26] show that experimental evidence is strongly supportive of this view.

Delavande and Manski [28] argue that state and national poll information in the U.S. is readily available public knowledge. On the other hand, private knowledge in elections is likely to be very limited. Hence, all individuals should form similar estimates of the winning probabilities of the political parties. In contrast to these expectations, they find strong support for evidential reasoning. Voters who vote assign too high a probability to their preferred party of winning the elections. Their findings are invariant with respect to males/females, whites/non-whites, educated/non-educated, etc. Hence, and consistent with earlier work, there is a strong possibility that evidential reasoning is hard-wired in humans.

Consider Table 3, which reports survey evidence from successive U.S. Presidential elections that is supportive of the evidential reasoning explanation. Voters who intend to vote Democrats typically assign high probabilities to the Democrat candidate winning. By contrast, voters who intend to vote Republican assign high probabilities to a Republican win. Thus, voters seem to take their own actions as diagnostic of what other like-minded people will do. These findings show that, in some circumstances at least, people behave as if their actions were causal, even when they are merely diagnostic or evidential.

Year | Presidential Candidates | % of Democrat Voters Expecting Democrat Win | % of Republican Voters Expecting Republican Win |
---|---|---|---|

1988 | Dukakis vs. Bush | 51.7 | 94.2 |

1984 | Mondale vs. Reagan | 28.8 | 99.9 |

1980 | Carter vs. Reagan | 87.0 | 80.4 |

1976 | Carter vs. Ford | 84.2 | 80.4 |

1972 | McGovern vs. Nixon | 24.7 | 99.6 |

1968 | Humphrey vs. Nixon | 62.5 | 95.4 |

1964 | Johnson vs. Goldwater | 98.6 | 30.5 |

1960 | Kennedy vs. Nixon | 78.4 | 84.2 |

1956 | Stevenson vs. Eisenhower | 54.6 | 97.6 |

1952 | Stevenson vs. Eisenhower | 81.4 | 85.9 |

Of course, not everyone votes in elections. A possible explanation is that not all voters subscribe to evidential reasoning. The evidence appears to be consistent with a mix of voters; some follow evidential reasoning, while others follow the classical mode of reasoning, i.e., causal reasoning. Voters who do not vote appear to follow causal reasoning, while those who do vote appear to follow evidential reasoning. Estimating the respective fractions of voters who follow evidential and causal reasoning is an interesting and open empirical question, but one that lies outside the scope of our paper. Similar comments also apply to the other experimental games that we consider in which some people cooperate while others do not.

## 6. Explaining the Prisoners’ Dilemma under Evidential Reasoning

We have two players; hence, $N=\left\{1,2\right\}$. Each player has two actions: cooperate (C) or defect (D). Hence, ${A}_{1}={A}_{2}=\left\{C,D\right\}$, $\mathbf{A}=\left\{C,D\right\}\times \left\{C,D\right\}=\left\{\left(C,C\right),\left(C,D\right),\left(D,C\right),\left(D,D\right)\right\}$.

Therefore, if Player 1 chooses, say, C and Player 2 chooses D, then Player 1 gets zero and Player 2 gets 10.

#### 6.1. The Prisoners’ Dilemma under Causal Reasoning

Each player has a strictly dominant action, D (Definition 1); thus, the unique Nash equilibrium of this game is $\left(D,D\right)$ (Definition 2). By contrast, the empirical evidence, reviewed in Section 6.3 below, shows that 50% or more of the outcomes involve the play $\left(C,C\right)$.

#### 6.2. The Prisoners’ Dilemma under Evidential Reasoning

A strategy for player i is entirely determined by the probability ${p}_{i}$ with which he or she plays C. His or her expected payoff, ${\pi}_{i}\left({p}_{i},{p}_{j}\right)$, can be found from Table 4:

$${\pi}_{i}\left({p}_{i},{p}_{j}\right)=4+6{p}_{j}-4{p}_{i}+2{p}_{i}{p}_{j},i\ne j\phantom{\rule{-12.0pt}{0ex}}$$

C | D | |

C | $8,8$ | $0,10$ |

D | $10,0$ | $4,4$ |

We consider the following four cases.

Case 1: Both players use evidential reasoning (Definition 5).

Consider the social projection function (Definition 3) for player i:

$${P}_{ij}\left(C|{p}_{i}\right)={p}_{i},i\ne j$$

Remark 4: The social projection Function (8) should not be interpreted as saying that by playing C with probability ${p}_{i}$, player i can induce player $j\ne i$ to play C with probability ${p}_{i}$; indeed, there is no such causal link. Player i does not know what action the other player will take or has taken. Rather, Equation (8) is a heuristic device. Player i may reason as follows “I would like to cooperate with probability ${p}_{i}$. Since player j is like-minded, I believe he or she will also choose cooperate with probability ${p}_{i}$, just like me”. None of the players attempts to strategically exploit the SPF of the other players. Indeed, there is no requirement in an evidential game that there even be mutual knowledge of the social projection functions.

From Equation (8), we see that both players use evidential reasoning (Definition 5), so $\Gamma =\left\{N,\mathbf{A},\mathit{\pi},\mathbf{P}\right\}$ is an evidential game (Definition 9). In particular, each player uses his identity social projection function (Definition 6). Together, both players form an ingroup (Definition 7). In fact, N forms a perfect ingroup (Definition 8).

**Proposition 2.**: For the prisoners’ dilemma game, Table 4, $\left(C,C\right)$ is a consistent evidential equilibrium under the identity social projection Function Equation (8).

Proof of Proposition 2: Substituting from Equation (8) into Equation (7), we get:

$${\pi}_{i}\left({p}_{i},{P}_{ij}\left(C|{p}_{i}\right)\right)=4+2{p}_{i}+2{p}_{i}^{2},{p}_{i}\in \left[0,1\right],i\ne j$$

From Equation (9), we see that ${\pi}_{i}\left({p}_{i},{P}_{ij}\left(C|{p}_{i}\right)\right)$ is maximized when ${p}_{i}=1$. It follows that C is the unique optimal choice for player $i\in N$ (Definitions 11). Hence, $\left(C,C\right)$ is the unique evidential equilibrium of this game (Definition 12). Each player expects the other to play C, which turns out to be correct, ex post. Therefore, $\left(C,C\right)$ is a mutually consistent vector of strategies (Definition 13). Hence, $\left(C,C\right)$ is a consistent evidential equilibrium (Definition 14). ■.

In contrast, $\left(C,C\right)$ is not the Nash equilibrium of the game (Definition 2). Indeed, $\left(C,C\right)$ requires each player to play a strictly dominated strategies. However, $\left(C,C\right)$ is Pareto optimal. Note that under evidential reasoning, one does not need repeated game arguments to justify cooperation in the static prisoners’ dilemma game. Moreover, this is consistent with the play of the cooperative strategy by a majority of the players (see Section 6.3 below). This suggests that a majority of the players may be using evidential reasoning.

Case 2: Player 1 uses evidential reasoning (Definition 5), but Player 2 uses causal reasoning (Definition 4).

In this case, the SPF for each player is given by:

$$\begin{array}{ccc}\hfill {P}_{12}\left(C|{p}_{1}\right)& =& {p}_{1}\text{,}\phantom{\rule{4.pt}{0ex}}\text{for}\phantom{\rule{4.pt}{0ex}}\text{all}\phantom{\rule{4.pt}{0ex}}{p}_{1}\in \left[0,1\right]\hfill \end{array}$$

$$\begin{array}{ccc}\hfill {P}_{21}\left(C|{p}_{2}\right)& =& 1\text{,}\phantom{\rule{4.pt}{0ex}}\text{for}\phantom{\rule{4.pt}{0ex}}\text{all}\phantom{\rule{4.pt}{0ex}}{p}_{2}\in \left[0,1\right]\hfill \end{array}$$

From Equation (10), we see that Player 1 uses evidential reasoning (Definition 5) and, in particular, his or her identity social projection function (Definition 6), as in Case 1 above. On the other hand, from Equation (11), we see that Player 2 uses causal reasoning (Definition 4) and, in particular, mistakenly assumes that Player 1 will always cooperate. This is an evidential game. The unique evidential equilibrium (Definition 12) is $\left(C,D\right)$. It is an evidential equilibrium because each player’s chosen action is optimal, given his beliefs, which are captured by his social projection function. It is not a consistent evidential equilibrium (Definition 14) because the belief of Player 1 turns out to be mistaken in equilibrium (${P}_{12}\left(C|C\right)=1$, but Player 2 plays D instead). By contrast, the belief of Player 2 that Player 1 plays C turns out to be correct in equilibrium.

Case 3: Both players use causal reasoning (Definition 4), but beliefs turn out to be wrong ex post.

In this case, the SPF for each player is given by:
$$\begin{array}{ccc}\hfill {P}_{21}\left(C|{p}_{2}\right)& =& 1\text{,}\phantom{\rule{4.pt}{0ex}}\text{for}\phantom{\rule{4.pt}{0ex}}\text{all}\phantom{\rule{4.pt}{0ex}}{p}_{2}\in \left[0,1\right]\hfill \end{array}$$

$$\begin{array}{ccc}\hfill {P}_{12}\left(C|{p}_{1}\right)& =& 1\text{,}\phantom{\rule{4.pt}{0ex}}\text{for}\phantom{\rule{4.pt}{0ex}}\text{all}\phantom{\rule{4.pt}{0ex}}{p}_{1}\in \left[0,1\right]\hfill \end{array}$$

Both players use causal reasoning (Definition 4), so this is a causal game (Definition 10). Given these social projection functions, the unique payoff maximizing strategy for each player is to play D (Definition 11). Hence, $\left(D,D\right)$ is the unique evidential equilibrium (Definition 12). It is also, of course, the unique Nash equilibrium of this game. However, $\left(D,D\right)$ is not a mutually consistent vector of strategies (Definition 13) because each player expects his opponent to play C in response to D, but the opponent’s response is D. Hence, $\left(D,D\right)$ is not a consistent evidential equilibrium (Definition 14).

Case 4: Both players use causal reasoning (Definition 4), and beliefs turn out to be correct ex post.

In this case, the SPF for each player is given by:

$$\begin{array}{ccc}\hfill {P}_{12}\left(C|{p}_{1}\right)& =& 0\text{,}\phantom{\rule{4.pt}{0ex}}\text{for}\phantom{\rule{4.pt}{0ex}}\text{all}\phantom{\rule{4.pt}{0ex}}{p}_{1}\in \left[0,1\right]\hfill \end{array}$$

$$\begin{array}{ccc}\hfill {P}_{21}\left(C|{p}_{2}\right)& =& 0\text{,}\phantom{\rule{4.pt}{0ex}}\text{for}\phantom{\rule{4.pt}{0ex}}\text{all}\phantom{\rule{4.pt}{0ex}}{p}_{2}\in \left[0,1\right]\hfill \end{array}$$

Both players use causal reasoning (Definition 4), so this is a causal game (Definition 10). Given his social projection function, playing D is the unique optimal strategy for Player 1 (Definition 11) and similarly for Player 2. Hence, $\left(D,D\right)$ is the unique evidential equilibrium (Definition 12). Furthermore, $\left(D,D\right)$ is a mutually consistent vector of strategies (Definition 13) because each player expects his or her rival to play D, and in fact, his or her rival does play D. Hence, $\left(D,D\right)$ is a consistent evidential equilibrium (Definition 14). The unique Nash equilibrium of this game is, of course, $\left(D,D\right)$. Hence, this case illustrates Proposition 1a, namely a Nash equilibrium of the game $\left\{N,\mathbf{A},\mathit{\pi}\right\}$ is also a consistent evidential equilibrium (Definition 14) of the game $\left\{N,\mathbf{A},\mathit{\pi},\mathbf{P}\right\}$ with a suitable choice of social projection functions, $\mathbf{P}$.

Remark 5: When players are randomly matched to play the one-shot prisoners’ dilemma game, the weight of the evidence, reviewed in the next subsection, indicates a cooperation rate of at least 50%. To our minds, the only satisfactory explanation is provided by evidential reasoning with players using their identity social projection function (Definition 6). However, substantial numbers also defect, and this can be explained (as usual) by causal reasoning. Thus, the evidence can best be explained by a mixture of players, some of whom use causal reasoning, and the others use evidential reasoning.

#### 6.3. Evidence of Cooperation in the Prisoners’ Dilemma Game

In the static prisoners’ dilemma game, defection (D) is a strictly dominant strategy; recall Table 4. Hence, a player using causal reasoning should defect. However, experimental evidence indicates high cooperation rates. Rapoport [4] finds cooperation rates of 50% in the prisoners dilemma game. Zhong et al. [11] show that the cooperation rates in prisoners’ dilemma studies go up to 60% when positive labels are used (such as a “cooperative game”, rather than a “prisoners’ dilemma game”). When purely generic labels are used (such as C and D), then the cooperation rates are about 50%. Khadjavi and Lange [13] find that, while the cooperation rates among students playing the static prisoners’ dilemma game is 37%, the cooperation rate among prison inmates is 56%.

Lewis [2] used evidential reasoning to explain the unexpected levels of cooperation in the one-shot prisoners’ dilemma game. Mutual cooperation is better than mutual defection. If players use evidential reasoning, they may take their own preference for mutual cooperation as diagnostic evidence that their rival also has a preference for mutual cooperation, in which case both players are more likely to cooperate. These views are borne out by the evidence. Cooperators believe that the probability of other players cooperating is between $0.6$ and $0.7$. Similarly, players who defect believe that other players will defect with probabilities between $0.6$ to $0.7$; see Krueger [40].

Like-mindedness is compatible with both outcomes, C and D, and we do observe both outcomes in the PD game. Those who play C (respectively D) also believe that a disproportionately large share of the other players will play C (respectively D). However, why then do we observe so much cooperation in the static prisoners’ dilemma game? According to Gintis [54], page 145, humans have evolved the desire to cooperate with other humans. However, this cannot be the only cause, for it cannot explain why the rate of defection increases in the PD game when players know that their rivals have cooperated (see Section 9.5, below).

Rapoport [55], pp. 139–141, argued that each player takes his own belief that rational players deserve the cooperative outcome as evidence that similarly rational players will also cooperate. This is similar to evidential reasoning. Howard [3] tests the assertion by Rapoport [55], pp. 139–141, by running a contest between two computer programs. One computer program is designed to play the dominant strategy, defect. Another computer program, called the MIRROR program, is able to recognize if it is playing another MIRROR program, in which case it also cooperates; otherwise, it plays defect. There are five copies each of the programs that plays a tournament, and not surprisingly, the MIRROR program achieves higher payoffs. In effect, what the MIRROR program is doing is replicating the notion that people would cooperate with other like-minded people. In the conclusion, Howard [3], p. 212, gives an argument that is identical in spirit to the evidential reasoning argument: “If all players use the self-recognition program listed in the Appendix, and play cooperatively only if they recognize their opponents as their twins, then every game will be played cooperatively.”

In contrast to the standard explanations (Section 9, below), the explanation of cooperation based on evidential reasoning appears to be quite plausible. The fact that a sizeable fraction of the experimental subjects also defect suggests that the results are best accounted for by a mixture in the population of people who use evidential reasoning and causal reasoning.

## 7. The Nash Demand Game

#### 7.1. Non-Uniqueness of Outcomes in the Nash Demand Game under Causal Reasoning and under Evidential Reasoning

Consider the Nash demand game (Nash, [56,57]). Two players share a cake of size one. Player 1 demands $x\in \left[0,1\right]$ and Player 2 simultaneously demands $y\in \left[0,1\right]$. If the demands are feasible, i.e., if $x+y\le 1$, then each player receives what she or he demanded. However, if the demands are not feasible, i.e., if $x+y>1$, then each player gets zero. Any pair, $\left(x,y\right)$, such that $x+y=1$ is a Nash non-cooperative equilibrium. Thus, the Nash non-cooperative equilibrium concept does not pin down a unique solution in the Nash demand game.

The following theorem shows that a similar problem occurs with evidential reasoning.8

**Proposition 3.**: If we allow arbitrary social projection functions, then any outcome, $\left(a,b\right)$, such that $a+b=1$, $a>0$ and $b>0$, is an outcome of a consistent evidential equilibrium for the Nash demand game for suitably-chosen social projection functions.

Proof of Proposition 3: Let $a+b=1$, $a>0$, $b>0$. We shall construct social projection functions under which $\left(a,b\right)$ is a consistent evidential equilibrium. Consider the following social projection functions. Let ${\lambda}_{12}>0$ and ${\lambda}_{21}>0$, where ${\lambda}_{12}{\lambda}_{21}=1$. The restriction ${\lambda}_{12}{\lambda}_{21}=1$ ensures that the shares demanded by both players sum up to one; see below. For Player 1, set ${P}_{12}\left(y|x\right)=1\iff y={\lambda}_{12}x$, i.e., if Player 1 makes the demand x, then she expects Player 2 to make the demand $y={\lambda}_{12}x$. For Player 2, set ${P}_{21}\left(x|y\right)=1\iff x={\lambda}_{21}y$, i.e., if Player 2 makes the demand y, then she expects Player 1 to make the demand $x={\lambda}_{21}y$. Thus, Player 1 maximizes x subject to $x+{\lambda}_{12}x\le 1$. The unique solution to this maximization problem is $x=\frac{1}{1+{\lambda}_{12}}$. Similarly, the unique solution to Player 2’s maximization problem, maximize y subject to ${\lambda}_{21}y+y\le 1$, is $y=\frac{1}{1+{\lambda}_{21}}$. It is straightforward to check that $\left(\frac{1}{1+{\lambda}_{12}},\frac{1}{1+{\lambda}_{21}}\right)$ is a consistent evidential equilibrium for the chosen social projection functions. Finally, choosing ${\lambda}_{12}=\frac{1-a}{a}$ and ${\lambda}_{21}=\frac{1-b}{b}$ gives the outcome $\left(a,b\right)$. ■.

**Corollary 4.**: $\left(\frac{1}{2},\frac{1}{2}\right)$ is the unique consistent evidential equilibrium for the Nash demand game under the identity social projection functions: ${P}_{12}\left(y|x\right)=1\iff y=x$ and ${P}_{21}\left(x|y\right)=1\iff x=y$.

Proof of Corollary 4: Take ${\lambda}_{12}={\lambda}_{21}=1$ in the proof of Proposition 3. ■.

Corollary 4 shows that a particularly salient social projection function, the identity function, leads to an equal division of the pie. This might be of empirical interest, particularly when studying norms of equal division.

To overcome the non-uniqueness problem highlighted by Proposition 3, we need criteria to select social projection functions. One such criterion is to select the identity social projection function for symmetric evidential games. The identity social projection function appears plausible, maybe even compelling, for symmetric evidential games. However, what further criteria would help? One possibility is to appeal to the Nash bargaining axioms (Nash [56,57]; Osborne and Rubinstein [58]).9 These axioms are introduced in Section 7.2, below, followed by application to the Nash demand game under evidential reasoning, in the subsequent Section 7.3.

#### 7.2. Nash’s Axioms and Nash’s Theorem

First, some definitions.

**Definition 15.**: A bargaining problem is a pair $\u2329\mathbf{B},\mathbf{d}\u232a$, where $\mathbf{B}\subset {\mathbb{R}}^{2}$ is compact and convex and $\mathbf{d}\in \mathbf{B}$. We require that, for some $\mathbf{b}\in \mathbf{B}$, ${d}_{1}<{b}_{1}$ and ${d}_{2}<{b}_{2}$.

The bargaining problem $\u2329\mathbf{B},\mathbf{d}\u232a$ may be given the following interpretation. $\mathbf{B}$ is the set of possible payoffs resulting from agreement. $\mathbf{d}$ is the pair of payoffs to the players if they fail to agree.

**Definition 16.**: Let

**ß**be the set of all bargaining problems. A bargaining solution is a mapping, $\mathbf{F}:$

**ß**$\to {\mathbb{R}}^{2}$, where $\mathbf{F}\u2329\mathbf{B},\mathbf{d}\u232a\in \mathbf{B}$.

**Definition 17.**: ${\mathbf{b}}^{\prime}\in \mathbf{B}$ is Pareto optimal if, for all $\mathbf{b}\in \mathbf{B}$, ${b}_{1}\le {b}_{1}^{\prime}$ and ${b}_{2}\le {b}_{2}^{\prime}$.

**Definition 18.**: A bargaining problem, $\u2329\mathbf{B},\mathbf{d}\u232a$, is symmetric if ${d}_{1}={d}_{2}$ and if $\left({b}_{1},{b}_{2}\right)\in \mathbf{B}\iff \left({b}_{2},{b}_{1}\right)\in \mathbf{B}$.

**Definition 19.**: We say that the bargaining solution, $\mathbf{F}$, satisfies the independence of irrelevant alternatives if for all bargaining problems, $\u2329\mathbf{B},\mathbf{d}\u232a$ and $\u2329{\mathbf{B}}^{\prime},\mathbf{d}\u232a$, such that $\mathbf{B}\subset {\mathbf{B}}^{\prime}$ and $\mathbf{F}\u2329{\mathbf{B}}^{\prime},\mathbf{d}\u232a\in \mathbf{B}$, we have $\mathbf{F}\u2329\mathbf{B},\mathbf{d}\u232a=\mathbf{F}\u2329{\mathbf{B}}^{\prime},\mathbf{d}\u232a$.

**Definition 20.**: Let $\u2329\mathbf{B},\mathbf{d}\u232a,\u2329{\mathbf{B}}^{\prime},{\mathbf{d}}^{\prime}\u232a\in $

**ß**, ${\alpha}_{1},{\alpha}_{2},{\beta}_{1},{\beta}_{2}\in \mathbb{R},{\beta}_{1}>0,{\beta}_{2}>0$. Let $\mathbf{f}:\mathbf{B}\stackrel{onto}{\to}{\mathbf{B}}^{\prime},{f}_{1}\left({b}_{1}\right)={\alpha}_{1}+{\beta}_{1}{b}_{1},{f}_{2}\left({b}_{2}\right)={\alpha}_{2}+{\beta}_{2}{b}_{2}$. Then, we say that $\mathbf{f}$ is a positive affine transformation of $\u2329\mathbf{B},\mathbf{d}\u232a$ onto $\u2329{\mathbf{B}}^{\prime},{\mathbf{d}}^{\prime}\u232a$.

#### 7.2.1. The Nash Axioms

Let

**ß**be the set of all bargaining problems, $\mathbf{F}$ a bargaining solution and $\u2329\mathbf{B},\mathbf{d}\u232a$ a bargaining problem. We introduce the following axioms.**Pareto:**- $\mathbf{F}\u2329\mathbf{B},\mathbf{d}\u232a$ is Pareto optimal.
**Symmetry:**- If $\u2329\mathbf{B},\mathbf{d}\u232a$ is symmetric, then ${F}_{1}\u2329\mathbf{B},\mathbf{d}\u232a={F}_{2}\u2329\mathbf{B},\mathbf{d}\u232a$.
**Independence:**- $\mathbf{F}$ satisfies the independence of irrelevant alternatives.
**Invariance:**- If $\mathbf{f}\left(\mathbf{b}\right)=\left({\alpha}_{1}+{\beta}_{1}{b}_{1},{\alpha}_{2}+{\beta}_{2}{b}_{2}\right)$ is a positive affine transformation of $\u2329\mathbf{B},\mathbf{d}\u232a$ onto $\u2329{\mathbf{B}}^{\prime},{\mathbf{d}}^{\prime}\u232a$, then ${F}_{1}\u2329{\mathbf{B}}^{\prime},{\mathbf{d}}^{\prime}\u232a={\alpha}_{1}+{\beta}_{1}{F}_{1}\u2329\mathbf{B},\mathbf{d}\u232a,{F}_{2}\u2329{\mathbf{B}}^{\prime},{\mathbf{d}}^{\prime}\u232a={\alpha}_{2}+{\beta}_{2}{F}_{2}\u2329\mathbf{B},\mathbf{d}\u232a$.

#### 7.2.2. The Nash Theorem

**Proposition 5.**: There is a unique bargaining solution, ${\mathbf{F}}^{*}:$

**ß**$\to {\mathbb{R}}^{2}$, satisfying the axioms: Pareto, symmetry, independence and invariance. It is given by:

$${\mathbf{F}}^{*}\u2329\mathbf{B},\mathbf{d}\u232a=\underset{\left({d}_{1},{d}_{2}\right)\le \left({b}_{1},{b}_{2}\right)\in \mathbf{B}}{argmax}\left({b}_{1}-{d}_{1}\right)\left({b}_{2}-{d}_{2}\right)$$

#### 7.3. Application to the Nash Demand Game under Evidential Reasoning

We now illustrate, by an example, how the Nash axioms, Pareto, symmetry, independence and invariance, can reduce the choice among social projection functions sufficiently so as to generate a unique outcome for the Nash demand game.

Example 5 (The Nash demand game under evidential reasoning) Assume that the utility of Player 1 is ${\pi}_{1}\left(x\right)={x}^{\alpha}$ and that of Player 2 is ${\pi}_{2}\left(y\right)={y}^{\beta}$, where $\alpha >0,\beta >0$. Consider the following social projection functions. Let ${\lambda}_{12}>0$ and ${\lambda}_{21}>0$, where ${\lambda}_{12}{\lambda}_{21}=1$. For Player 1, set ${P}_{12}\left(y|x\right)=1\iff y={\lambda}_{12}x$, i.e., if Player 1 makes the demand x, then he or she expects Player 2 to make the demand $y={\lambda}_{12}x$. For Player 2, set ${P}_{21}\left(x|y\right)=1\iff x={\lambda}_{21}y$, i.e., if Player 2 makes the demand y, then he or she expects Player 1 to make the demand $x={\lambda}_{21}y$. Thus, Player 1 maximizes ${x}^{\alpha}$ subject to $x+{\lambda}_{12}x\le 1$. The unique solution to this maximization problem is $x=\frac{1}{1+{\lambda}_{12}}$. Similarly, the unique solution to Player 2’s maximization problem, maximize ${y}^{\beta}$ subject to ${\lambda}_{21}y+y\le 1$, is $y=\frac{1}{1+{\lambda}_{21}}$. It is straightforward to check that $\left(\frac{1}{1+{\lambda}_{12}},\frac{1}{1+{\lambda}_{21}}\right)$ is a consistent evidential equilibrium for the chosen social projection functions.

To make progress, we need to select ${\lambda}_{12},{\lambda}_{21}$. If $\alpha =\beta $, then the game is symmetric, and ${\lambda}_{12}={\lambda}_{21}$ (and hence, ${\lambda}_{12}={\lambda}_{21}=1$) appear compelling. This give the plausible outcome $x=y=\frac{1}{2}$. However, if $\alpha \ne \beta $,then the game is not symmetric, and it is not clear, a priori, how to choose ${\lambda}_{12},{\lambda}_{21}$. If we invoke, in addition to symmetry, the other Nash axioms, Pareto, independence and invariance, then, by Proposition 5, we must get the unique outcome that is determined by maximizing the Nash product ${\pi}_{1}\left(x\right){\pi}_{2}\left(y\right)={x}^{\alpha}{y}^{\beta}$ (here, ${d}_{1}={\pi}_{1}\left(0\right)=0$ and ${d}_{2}={\pi}_{2}\left(0\right)=0$). We then show that this unique outcome can be supported by the appropriate choice of ${\lambda}_{12},{\lambda}_{21}$.

It is straightforward to show that the problem: choose x and y, so as to maximize ${x}^{\alpha}{y}^{\beta}$ subject to $x+y\le 1$, having the unique solution $x=\frac{\alpha}{\alpha +\beta},y=\frac{\beta}{\alpha +\beta}$. This, in turn, determines the unique values ${\lambda}_{12}=\frac{\beta}{\alpha},{\lambda}_{21}=\frac{\alpha}{\beta}$. Thus, social projection functions compatible with the Nash axioms are ${P}_{12}\left(y|x\right)=1\iff y=\frac{\beta}{\alpha}x$, and ${P}_{21}\left(x|y\right)=1\iff x=\frac{\alpha}{\beta}y$. Note that these social projection functions are not unique (because they were chosen to be linear). However, the outcome $x=\frac{\alpha}{\alpha +\beta},y=\frac{\beta}{\alpha +\beta}$ is unique, whatever social projection functions are chosen, provided they satisfy the Nash axioms.

## 8. Oligopoly Games

Consider a market for a single homogeneous good. The total industrial output, Q, is produced by a fixed number of firms, n. Let ${q}_{i}$ be the output of firm i, then $Q={\sum}_{i=1}^{n}{q}_{i}$. All consumers are price takers. The unit price, $P\left(Q\right)$, is given by:

$$P\left(Q\right)=A-aQ\text{,}\phantom{\rule{4.pt}{0ex}}a>0\text{,}\phantom{\rule{4.pt}{0ex}}A>0$$

There are zero fixed costs, and the marginal cost of firm i is a constant, ${c}_{i}$, $i=1,2,...,n$, where:

$$0\le {c}_{1}\le {c}_{2}\le ...\le {c}_{n}<A$$

Hence, the profit of firm i is ${\pi}_{i}=\left(P-{c}_{i}\right){q}_{i}$, $i=1,2,...,n$, which, using Equation (16), can be written as:
where ${\mathbf{q}}_{-i}$ is the vector of outputs of firms other than firm i. Maximizing ${\pi}_{i}\left({q}_{i},{\mathbf{q}}_{-i}\right)$ with respect to ${q}_{i}$, given ${\mathbf{q}}_{-i}$, leads to firm i’s reaction function:

$${\pi}_{i}\left({q}_{i},{\mathbf{q}}_{-i}\right)=\left(A-{c}_{i}-a{\sum}_{j\ne i}{q}_{j}\right){q}_{i}-a{q}_{i}^{2}\text{,}\phantom{\rule{4.pt}{0ex}}i=1,2,...,n$$

$${q}_{i}^{*}\left({\mathbf{q}}_{-i}\right)=\frac{A-{c}_{i}}{2a}-\frac{1}{2}{\sum}_{j\ne i}{q}_{j}\text{,}\phantom{\rule{4.pt}{0ex}}i=1,2,...,n$$

#### 8.1. Causal Reasoning

The next proposition summarizes the results under causal reasoning on the part of all firms.

**Proposition 6.**: (a) Under perfect competition, $P={c}_{1}$, $Q=\frac{A-{c}_{1}}{a}$.

(b) The monopoly outcome is given by $P=\frac{A+{c}_{1}}{2}$, $Q=\frac{A-{c}_{1}}{2a}$.

(c) The Cournot output level of any firm is:

$${q}_{i}^{C}=\frac{A+{\sum}_{j\ne i}{c}_{j}-n{c}_{i}}{\left(n+1\right)a},i=1,2,...,n$$

(d) In a Stackelberg leader-follower model where Firm 1 is the leader while Firm 2 is the follower, the equilibrium output levels of the leader and the follower, respectively, are:

$$\begin{array}{ccc}\hfill {q}_{1}^{L}& =& \frac{A+{c}_{2}-2{c}_{1}}{2a}\phantom{\rule{4.pt}{0ex}}\mathit{(leader)}\hfill \end{array}$$

$$\begin{array}{ccc}\hfill {q}_{2}^{F}& =& \frac{A+2{c}_{1}-3{c}_{2}}{4a}\phantom{\rule{4.pt}{0ex}}\mathit{(follower)}\hfill \end{array}$$

Proof of Proposition 6:

(a) Under perfect competition, each firm produces at the minimum cost, ${c}_{1}$, and the price is set equal to ${c}_{1}$; Equation (16) then gives $Q=\frac{A-{c}_{1}}{a}$.

(b) Suppose we have a single firm (the monopolist) that produces at minimum cost, ${c}_{1}$. Setting ${\sum}_{j\ne i}{q}_{j}=0$ and ${q}_{i}^{*}\left({\mathbf{q}}_{-i}\right)=Q$ (there are no other firms) in Equation (19) gives $Q=\frac{A-{c}_{1}}{2a}$; Equation (16) then gives $P=\frac{A+{c}_{1}}{2}$.

(c) In a Cournot equilibrium, each firm chooses its out, ${q}_{i}^{C}$, so as to maximize its profit Equation (18), given the outputs, ${q}_{j}^{C}$, of the other firms. Set ${q}_{i}^{*}\left({\mathbf{q}}_{-i}\right)={q}_{i}^{C}$ and ${q}_{j}={q}_{j}^{C}$ in Equation (19) and solve the resulting system of simultaneous linear equations to get Equation (20).

(d) Consider a duopoly with Firm 1 acting as leader and Firm 2 acting as follower. The follower chooses its output, ${q}_{2}^{F}$, to maximize its profit given the output level, ${q}_{1}^{L}$, of the leader; Equation (19) then gives ${q}_{2}^{F}\left({q}_{1}^{L}\right)=\frac{A-{c}_{2}}{2a}-\frac{1}{2}{q}_{1}^{L}$. Substitute this into the profit function of the leader (from (Equation (18)) to get ${\pi}_{1}^{L}\left({q}_{1}^{L},{q}_{2}^{F}\left({q}_{1}^{L}\right)\right)=\left[A-{c}_{1}-a\left(\frac{A-{c}_{2}}{2a}-\frac{1}{2}{q}_{1}^{L}\right)\right]{q}_{1}^{L}-a{\left({q}_{1}^{L}\right)}^{2}$. Maximize this with respect to ${q}_{1}^{L}$ to get ${q}_{1}^{L}=\frac{A+{c}_{2}-2{c}_{1}}{2a}$, and hence, ${q}_{2}^{F}=\frac{A+2{c}_{1}-3{c}_{2}}{4a}$. ■.

Remark 6: Note that the perfectly competitive, monopoly and Cournot games are all single-stage games, i.e., one-shot games. The firms choose their actions simultaneously (Parts a, b and c of Proposition 6). However, the Stackelberg leader-follower model is a two-stage game: The leader moves first choosing its output level correctly anticipating the reaction of the follower. The follower then moves having observed the output level of the leader.

#### 8.2. Evidential Reasoning

Consider the consequences of evidential reasoning for the producers. All consumers are causal reasoners, i.e., each consumer regards every other consumer and every firm as an outgroup member (recall Definition 7). We also assume that each firm regards each consumer as an outgroup member. Thus, if Cis the set of consumers and F is the set of firms, then each is an outgroup relative to the other (Definition 7c). This also allows us to continue to assume that the market demand curve is given by Equation (16). If we allowed consumers to use evidential reasoning, then a single consumer could reason as follows “If I cut my demand, then probably each like-minded consumer would also cut his or her demand. The aggregate result would be a reduction in price for all of us”. Consumers would then be able to collude. The consequence would be that we would no longer have an oligopoly model (as classically defined), but a bargaining model. While this is very interesting, it lies beyond the scope of this paper and, in fact, deserves a paper on its own.

We now describe an evidential equilibrium, ${\mathbf{q}}^{*}$, with the following properties. Suppose that firm i is considering a deviation, ${q}_{i}$, from ${q}_{i}^{*}$. Firm i reasons as follows. “If I am tempted to deviate by an amount ${q}_{i}-{q}_{i}^{*}$ and if I believe that my rival, firm j, $j\ne i$, is like-minded, then the rival is probably also tempted to deviate by an amount ${q}_{j}-{q}_{j}^{*}={\lambda}_{ij}\left({q}_{i}-{q}_{i}^{*}\right)$”; the interpretation of ${\lambda}_{ij}$ is given below in more detail. We formalize such reasoning by the following social projection function (Definition 3):

$${P}_{ij}\left({q}_{j}|{q}_{i}\right)=1\iff {q}_{j}-{q}_{j}^{*}={\lambda}_{ij}\left({q}_{i}-{q}_{i}^{*}\right)\text{,}\phantom{\rule{4.pt}{0ex}}j\ne i$$

The social projection specified in Equation (23) is quite general. It nests several subcases, as we show below. The generality of Equation (23) should not be taken to mean that the predictive content of the evidential reasoning model of oligopoly is empty. Rather, as in prisoners’ dilemma games, individuals display a wide variation in choices when they are asked to play the oligopoly game (see Section 8.4 below). Variations in the parameter ${\lambda}_{ij}$ in Equation (23) offer a parsimonious way of capturing this heterogeneity by varying the degree of like-mindedness.

In particular, perfect like-mindedness, ${\lambda}_{ij}=1$, gives rise to the identity social projection function (Definition 6); each firm believes that the other will deviate from ${q}^{*}$ by an identical distance. The other extreme arises when no like-mindedness is perceived by firms, as in models of causal reasoning. This corresponds to ${\lambda}_{ij}=0$. Intermediate cases of like-mindedness correspond to values $0<{\lambda}_{ij}<1$ and to ${\lambda}_{ij}<0$. The distribution of values of ${\lambda}_{ij}$ in any population is ultimately an empirical question that cannot be answered in a theoretical model.

The next proposition gives the solution under evidential reasoning.

**Proposition 7.**: (a) Given the social projection Functions (23), the unique evidential equilibrium (Definition 12), ${\mathbf{q}}^{*}$, is characterized by the following set of simultaneous linear algebraic equations:

$$\left[\begin{array}{cccc}2+\sum _{j\ne 1}{\lambda}_{1j}& 1& ...& 1\\ 1& 2+\sum _{j\ne 2}{\lambda}_{2j}& ...& 1\\ ...& ...& ...& ...\\ 1& 1& ...& 2+\sum _{j\ne n}{\lambda}_{nj}\end{array}\right]\left[\begin{array}{c}{q}_{1}^{*}\\ {q}_{2}^{*}\\ ...\\ {q}_{n}^{*}\end{array}\right]=\left[\begin{array}{c}\frac{A-{c}_{1}}{a}\\ \frac{A-{c}_{2}}{a}\\ ...\\ \frac{A-{c}_{n}}{a}\end{array}\right]$$

(b) Furthermore, ${\mathbf{q}}^{*}$ is a mutually consistent vector of strategies (Definition 13) and, hence, a consistent evidential equilibrium.

(c) Conversely, given any vector of outputs, ${\mathbf{q}}^{*}$, satisfying ${q}_{i}^{*}>0$ and ${\sum}_{i=1}^{n}{q}_{i}^{*}\le \frac{A-{c}_{1}}{a}$, there exits a profile of social projection of the form Equation (23), such that ${\mathbf{q}}^{*}$ is a consistent evidential equilibrium. In particular,

$$\begin{array}{ccc}\hfill {\lambda}_{ij}& =& {\lambda}_{i}\text{,}\phantom{\rule{4.pt}{0ex}}i,j=1,2,...,n\text{,}\phantom{\rule{4.pt}{0ex}}j\ne i\text{,}\phantom{\rule{4.pt}{0ex}}\mathit{where}\hfill \end{array}$$

$$\begin{array}{ccc}\hfill {\lambda}_{i}& =& \frac{A-{c}_{i}}{\left(n-1\right)a{q}_{i}^{*}}-\frac{2}{n-1}-\frac{1}{\left(n-1\right){q}_{i}^{*}}\sum _{j\ne i}{q}_{j}^{*}\text{,}\phantom{\rule{4.pt}{0ex}}i=1,2,...,n\hfill \end{array}$$

Proof of Proposition 7: (a) Substituting ${q}_{j}$ from Equation (23) into Equation (18) gives:
which, after simplification, gives:

$${\pi}_{i}\left({q}_{i},{\mathbf{P}}_{i}\left(.|{q}_{i}\right)\right)=\left\{A-{c}_{i}-a\sum _{j\ne i}\left[{q}_{j}^{*}+{\lambda}_{ij}\left({q}_{i}-{q}_{i}^{*}\right)\right]\right\}{q}_{i}-a{q}_{i}^{2}\text{,}\phantom{\rule{4.pt}{0ex}}i=1,2,...,n$$

$${\pi}_{i}\left({q}_{i},{\mathbf{P}}_{i}\left(.|{q}_{i}\right)\right)=\left(A-{c}_{i}+a{q}_{i}^{*}\sum _{j\ne i}{\lambda}_{ij}-a\sum _{j\ne i}{q}_{j}^{*}\right){q}_{i}-a\left(1+\sum _{j\ne i}{\lambda}_{ij}\right){q}_{i}^{2}\text{,}\phantom{\rule{4.pt}{0ex}}i=1,2,...,n$$

Equation (27) shows how a player who uses the heuristic of evidential reasoning translates an essentially strategic problem into a decision theoretic problem. Maximizing Equation (27) with respect to ${q}_{i}$ gives the optimal (pure) strategy for firm i (Definition 11), given his social projection Function (23):

$${q}_{i}=\frac{A-{c}_{i}+a{q}_{i}^{*}{\sum}_{j\ne i}{\lambda}_{ij}-a{\sum}_{j\ne i}{q}_{j}^{*}}{2a\left(1+{\sum}_{j\ne i}{\lambda}_{ij}\right)}\text{,}\phantom{\rule{4.pt}{0ex}}i=1,2,...,n$$

Setting ${q}_{i}={q}_{i}^{*}$, $i=1,2,...,n$ and simplifying gives the following set of simultaneous linear algebraic equations,
which can be written in the matrix form Equation (24).

$$\left(2+\sum _{j\ne i}{\lambda}_{ij}\right){q}_{i}^{*}+\sum _{j\ne i}{q}_{j}^{*}=\frac{A-{c}_{i}}{a}\text{,}\phantom{\rule{4.pt}{0ex}}i=1,2,...,n$$

(b) From Equation (23), we see that ${P}_{ij}\left({q}_{j}|{q}_{i}^{*}\right)=1\iff {q}_{j}={q}_{j}^{*}$. In effect, when firm i produces the output ${q}_{i}^{*}$, it believes that firm j will produce ${q}_{j}^{*}$. Ex post, firm i finds that firm j indeed did produce an output level ${q}_{j}^{*}$, thus vindicating its ex ante belief. Hence, ${\mathbf{q}}^{*}$ is a mutually consistent vector of strategies and, hence, a consistent evidential equilibrium.

(c) Rewrite Equation (29) in the form:

$$\sum _{j\ne i}{\lambda}_{ij}=\frac{A-{c}_{i}}{a{q}_{i}^{*}}-2-\frac{1}{{q}_{i}^{*}}\sum _{j\ne i}{q}_{j}^{*}\text{,}\phantom{\rule{4.pt}{0ex}}i=1,2,...,n$$

Equation (30) has many solutions, for example Equation (25). ■

We now show how one may obtain the market outcomes under causal reasoning (Proposition 6) also under evidential reasoning by choosing suitable values for ${\lambda}_{ij}$, $j\ne i$, in Equation (25) of Proposition 7.

**Corollary 8.**: (a) Setting ${c}_{1}={c}_{2}=...={c}_{n}$ and ${\lambda}_{ij}=-\frac{1}{n-1}$, for all $i,j$ and $i\ne j$, gives the perfectly competitive output levels, ${\sum}_{i=1}^{n}{q}_{i}^{*}={Q}^{*}=\frac{A-{c}_{1}}{a}$ (Proposition 6a). Here, each firm regards every other firm as an ingroup member and the set of firms forms an ingroup (Definition 7). We may call this a competitive ingroup and the resulting social projection functions competitive social projection functions. This is in line with the ideas considered in Section 2.3 and, in particular, is an illustration of the contrast effect.

(b) Setting ${c}_{1}={c}_{2}=...={c}_{n}$ and ${\lambda}_{ij}=1$, for all $i,j$ and $i\ne j$, gives ${q}_{i}^{*}=\frac{A-c}{2na}$, $i=1,2,...,n$. Hence, the total output equals the monopoly level ${\sum}_{i=1}^{n}{q}_{i}^{*}={Q}^{*}=\frac{A-{c}_{1}}{2a}$ (Proposition 6b). In this case, the social projection functions for the producers are identity social projection functions on the set of all producers (Definition 6). Thus, each firm believes that the other firms are like-minded.

(c) Setting ${\lambda}_{ij}=0$, for all $i,j$ and $i\ne j$, gives the Cournot output levels ${q}_{i}^{*}=\frac{A+{\sum}_{j\ne i}{c}_{j}-n{c}_{i}}{\left(n+1\right)a}$, $i=1,2,...,n$ (Proposition 6c). Here, each firm regards the other as an outgroup member (Definition 7); hence, every firm uses causal reasoning (Definition 4).

(d) Setting ${\lambda}_{12}=-\frac{1}{2}$ and ${\lambda}_{21}=0$ gives, respectively, the leader output and the follower output in the leader-follower Stackelberg duopoly (Proposition 6d). The social projection functions for the follower (Firm 2) is constant; hence, the follower uses causal reasoning (Definition 4), ${\lambda}_{21}=0$. The follower regards the leader as an outgroup member. The leader regards the follower as an ingroup member (Definition 7), ${\lambda}_{12}=-\frac{1}{2}$. We may say that the leader behaves competitively towards the follower (the contrast effect, recall Section 2.3).

Remark 7: Recall that the Stackelberg game in Proposition 6d under causal reasoning is a two-stage game, where the leader moves first, correctly anticipating the reaction of the follower. The follower then chooses its output having observed the output of the leader. By contrast, the version under evidential reasoning (Corollary 8d) is a single-stage game. The empirical evidence that we present below does show that in single-stage duopoly games, players often choose outputs similar to the Stackelberg output levels. This appears to be confirmation of evidential reasoning in oligopoly games.

#### 8.3. The Nash Bargaining Solution for Oligopoly Games

Here, we investigate the consequences of treating the oligopoly problem as a Nash bargaining problem (see Section 7, above, for the two-player case and Osborne and Rubinstein [58], p.23, for the n-player case). We show that the oligopoly Models (16)–(18) have a unique Nash bargaining solution. We show that this solution specifies equal output levels for the firms. However, due to different costs, the firms’ profits are different. We give social projection functions that implement this Nash bargaining solution.

**Proposition 9.**: For the oligopoly Models (16)–(18), the unique Nash bargaining solution is given by:

$${q}_{i}^{*}=\frac{1}{{\sum}_{k=1}^{n}\frac{a}{A-{c}_{k}-a{Q}^{*}}},i=1,2,...,n$$

$$\frac{n}{{Q}^{*}}=\sum _{k=1}^{n}\frac{1}{\frac{A-{c}_{k}}{a}-{Q}^{*}}$$

Social projection functions that implement this solution are:
where:

$${P}_{ij}\left({q}_{j}|{q}_{i}\right)=1\iff {q}_{j}-{q}_{j}^{*}={\lambda}_{i}\left({q}_{i}-{q}_{i}^{*}\right)\text{,}\phantom{\rule{4.pt}{0ex}}j\ne i$$

$${\lambda}_{i}=\frac{A-{c}_{i}}{\left(n-1\right)a{q}_{i}^{*}}-\frac{2}{n-1}-\frac{1}{\left(n-1\right){q}_{i}^{*}}\sum _{j\ne i}{q}_{j}^{*}\text{,}\phantom{\rule{4.pt}{0ex}}i=1,2,...,n$$

Proof of Proposition 9: A firm can guarantee itself a zero profit by exiting the market. Therefore, we take the disagreement point to be $\mathbf{0}$. Maximizing the Nash product:
gives the first order conditions:
which can be re-written as:
where:

$$\Pi ={\times}_{i=1}^{n}{\pi}_{i}\left({q}_{i},{\mathbf{q}}_{-i}\right)={\times}_{i=1}^{n}\left[\left(A-{c}_{i}-a{\sum}_{j\ne i}{q}_{j}\right){q}_{i}-a{q}_{i}^{2}\right]$$

$$\frac{A-{c}_{i}-a{\sum}_{j\ne i}{q}_{j}-2a{q}_{i}}{\left(A-{c}_{i}-a{\sum}_{j\ne i}{q}_{j}\right){q}_{i}-a{q}_{i}^{2}}=a\sum _{k\ne i}^{n}\frac{{q}_{k}}{\left(A-{c}_{k}-a{\sum}_{j\ne k}{q}_{j}\right){q}_{k}-a{q}_{k}^{2}}\text{,}\phantom{\rule{4.pt}{0ex}}i=1,2,...,n$$

$$1={q}_{i}\sum _{k=1}^{n}\frac{a}{A-{c}_{k}-aQ}\text{,}\phantom{\rule{4.pt}{0ex}}i=1,2,...,n$$

$$Q=\sum _{j=1}^{n}{q}_{j}$$

Summing Equation (31) from one to n, using Equation (32), then rearranging, we get:

$$\frac{n}{Q}=\sum _{k=1}^{n}\frac{1}{\frac{A-{c}_{k}}{a}-Q}$$

We shall argue that Equation (33) has a unique solution,

$${Q}^{*}\in \left(0,\frac{A-{c}_{n}}{a}\right)$$

Consider the functions $f\left(Q\right)=\frac{n}{Q}$, $g\left(Q\right)={\sum}_{k=1}^{n}\frac{1}{\frac{A-{c}_{k}}{a}-Q}$, $0<Q<\frac{A-{c}_{n}}{a}$. Note that f and g are continuous with $\underset{Q\to 0}{lim}f\left(Q\right)=\infty >{\sum}_{k=1}^{n}\frac{a}{A-{c}_{k}}=\underset{Q\to 0}{lim}g\left(Q\right)$ and $\underset{Q\to \frac{A-{c}_{n}}{a}}{lim}f\left(Q\right)=\frac{na}{A-{c}_{n}}<\infty =\underset{Q\to \frac{A-{c}_{n}}{a}}{lim}g\left(Q\right)$. Hence, $f\left({Q}^{*}\right)=g\left({Q}^{*}\right)$, for some ${Q}^{*}\in \left(0,\frac{A-{c}_{n}}{a}\right)$. Since f is strictly decreasing and g is strictly increasing, it follows that ${Q}^{*}$ is unique. From Equation (31), it follows that the unique Nash bargaining solution for the oligopoly Models (16)–(18) is given by:

$${q}_{i}^{*}=\frac{1}{{\sum}_{k=1}^{n}\frac{a}{A-{c}_{k}-a{Q}^{*}}},i=1,2,...,n$$

From Equation (23) and Proposition 7c, it follows that social projection functions that implement Equation (35) are:
$${P}_{ij}\left({q}_{j}|{q}_{i}\right)=1\iff {q}_{j}-{q}_{j}^{*}={\lambda}_{i}\left({q}_{i}-{q}_{i}^{*}\right)\text{,}\phantom{\rule{4.pt}{0ex}}j\ne i$$
where:

$${\lambda}_{i}=\frac{A-{c}_{i}}{\left(n-1\right)a{q}_{i}^{*}}-\frac{2}{n-1}-\frac{1}{\left(n-1\right){q}_{i}^{*}}\sum _{j\ne i}{q}_{j}^{*}\text{,}\phantom{\rule{4.pt}{0ex}}i=1,2,...,n\text{.}\phantom{\rule{4.pt}{0ex}}\phantom{\rule{4.pt}{0ex}}\u25a0$$

#### 8.4. Empirical Evidence from Oligopoly Games

In our theoretical model, we showed how alternative values of λ in Equation (23) were able to produce, in a static game, the output levels under alternative market forms, such as perfect competition, monopoly, Cournot and the Stackelberg leader-follower game. We now argue that these results are consistent with the empirical findings.

The early experiments on Cournot markets were conducted by Fouraker and Siegel [14]. They, and many later experiments, gave a profit table (PT) to the subjects. The PT listed the outputs of each firm on the two margins while individual cells of the table contained the corresponding profits of both firms. The PT was often supplemented by a profit calculator (PC), which allowed each experimental subject in their role as a firm to calculate the profit for a given pair of quantities chosen by both firms. In recent years, several experiments also gave subjects a best response option (BRO), which tells them their profit maximizing quantity for any quantity chosen by the other player.

The extra information provided (PT, PC, BRO) arguably primes subjects to follow an optimization-based solution. Requate and Waichman [27] find that there is substantially more collusion (corresponding to $\lambda =1$ in Equation (23)) in PT and PC treatments as compared to BRO. They find, in a static duopoly experiment, that the collusive outcome is reached at least once in the 20 rounds, in 62%, 78% and 29% of the markets, respectively, in the PT, PC and BRO treatments. The theoretical outcome of the Cournot–Nash equilibrium is, therefore, not confirmed in many cases.

Several papers report the Cournot–Nash equilibrium under random matching of opponents while finding that there is greater collusion under fixed matching of players.10 For instance, Huck et al. [15] consider symmetric firms and linear demand curves. Output choices vary between 3 and 15, and the Cournot–Nash outcome is for each firm to produce an output of eight. Both firms are known to the experimental subjects to be symmetric, so they must know that the solution lies on the diagonal of a relatively small matrix. Profits of each firm in the PT drop off sharply for output levels equal to or higher than 10 or less than an output of three. This leaves only seven levels of output to choose from: 4, 5, 6, 7, 8, 9. The authors report data for Round 9 of play under random matching, as the most supportive of their hypothesis (see Table 5 in their paper); these results are given in Table 5.

Output Level | 6 | 7 | 8 | 9 | Greater Than 10 |
---|---|---|---|---|---|

% of subjects choosing | 12 | 21:5 | 35:5 | 14:5 | 14 |

The mean quantity is close to the Cournot–Nash output level of eight. However, there is substantial variation in the output levels, and about 65% of individuals do not choose the Cournot output level.

Rassenti et al. [24] use an asymmetric Cournot game in which firms have different marginal costs that are private information. Firms do not even know the underlying probability distribution of types. Hence, there is true uncertainty, an area where evidential reasoning would seem to have the most bite. The game is played over 75 rounds to allow for substantial learning possibilities. The main finding is that while total output is above, but close to, the Cournot–Nash solution, the individual levels of output chosen by the firms are quite different from the Cournot–Nash solution. The results, in this sense, are similar to those in Huck et al. [15]; however, the authors take this as a refutation rather than a confirmation of the Cournot–Nash equilibrium.

Bosch-Domènech and Vriend [16] report results from the last two rounds of a 22-round duopoly experiment in which, in each round, two firms simultaneously choose outputs. They find that the output levels are widely distributed over a range that includes the monopoly output level and the perfectly competitive level. Table 6 summarizes information that is extracted from their paper.

Monopoly | Cournot | Walrasian | Others | |
---|---|---|---|---|

Easy | 38.89 | 33.33 | 0 | 27.78 |

Hard | 11.11 | 16.67 | 11.11 | 61.11 |

Hardest | 8.33 | 14.28 | 16.67 | 60.72 |

The three treatments, easy, hard and hardest, differ in terms of the time within which firms had to choose their outputs and the level of information provided. For instance, in the easy treatment, a PT is provided, but not in the other treatments. In the hardest treatment, firms are not even told of the exact functional form of the linear demand curve (only that it is downward sloping), whereas in the easy treatment, firms know the exact demand curve. The cells in Table 6 report the approximate percentage of the standard market outcomes in each treatment. The Cournot output level is not particularly salient relative to the others. Further, the wide distribution of output levels, even in Rounds 21 and 22 of the experiment, suggest that a flexible social projection function, as in Equation (23), is consistent with the evidence.

Waichman et al. [59] find that pre-play communication increases the degree of collusion in the Cournot game. Between 91% and 100% of the markets achieve collusion in at least one round of the experiment when pre-play communication is allowed. This also seems consistent with evidential reasoning. Pre-play communication may increase the players’ beliefs that they are dealing with like-minded players, hence facilitating the use of evidential reasoning.

Duersch et al. [17] document systematic departures from a Cournot–Nash equilibrium. They consider a linear-demand, linear-cost Cournot game with the Cournot–Nash quantity, ${q}_{i}^{*}=36$. Computers play one of several well-known strategies, including best response against human subjects who are not aware of the computers’ strategy over 40 rounds. Again, by creating uncertainty about what others will do, this situation is quite relevant to the domain of evidential reasoning.

Mean quantities chosen by computers (34.39) are always lower than mean quantities chosen by humans (47.95). Human subjects choose quantities that are much greater than the Cournot–Nash levels and, in some cases, approach the Stackelberg leader output of 54. In particular, when computers are programmed to play a best response with some small noise, in three different treatments, subjects choose the output levels 51.99, 48.67 and 49.18, while computers choose 32.05, 35.02 and 31.67. Thus, human subjects show systematic (upward) departures from the Cournot–Nash level, even approaching the Stackelberg levels.

Thus, one observes a wide and rich range of behaviours that are often collusive and range all the way up to the choice of quantities in the Stackelberg case. The results are consistent with a range of social projection functions used (corresponding to alternative values of λ in Equation (23)) by experimental subjects who use evidential reasoning. The collusive outcome is played by a significant percentage of experimental subjects, which is consistent with the identity social projection function.

## 9. Can Causal Reasoning Explain Cooperation in the Prisoners’ Dilemma Game?

Here, we examine several theories to see if they can explain cooperation in the one-shot prisoners’ dilemma game under causal reasoning.11

First, in Section 9.1, we show that cooperation in the prisoners’ dilemma game cannot be explained by reputation, correlated equilibria, level-kmodels, Stackelberg reasoning or evolutionary stable equilibria.

Although the prisoners’ dilemma game is presented to experimental subjects as a non-cooperative game played once against an anonymous opponent, subjects may in fact interpret it as some form of cooperative game. We outline three such theories in Section 9.2, Section 9.3 and Section 9.4. In Section 9.5, we argue that the empirical evidence rejects these theories.

Before starting, however, the reader might find the following example useful.

Example 6 (High-low game)12: Consider the following game between two players. If both choose high (H), then each gets two. If both choose low (L), then each gets one. If one chooses H (high) and the other chooses L (low), then each gets zero. These payoffs are summarized by Table 7.

H | L | |

H | $2,2$ | $0,0$ |

L | $0,0$ | $1,1$ |

This game has two pure-strategy Nash equilibria: $\left(H,H\right)$ and $\left(L,L\right)$. The problem is: which Nash equilibrium will be played? We use this example to illustrate several heuristics that solve this problem Schelling [60] argued that, because of its high payoff, $\left(H,H\right)$ is salient. Therefore, it becomes a focal point and is chosen by both players. Harsanyi and Selten [41] argued that $\left(H,H\right)$ payoff dominates $\left(L,L\right)$ and, hence, will be selected by both players. These are examples of what we called conservative heuristics in the Introduction. They involve no violation or relaxation of any of the standard assumptions of game theory, but they do add extra conditions that cannot be derived from standard game theory, but are consistent with them.

As an example of what we called a radical heuristic, consider team reasoning (Sugden, [61]; Bacharach, [62,63]). Here, the players replace individual rationality (each seeking to maximise his or her own payoff, as in standard game theory) with collective rationality (each aims to maximise the payoff to the team). Since $\left(H,H\right)$ gives the highest payoff to the team, it chosen.

Another example of a radical heuristic is Stackelberg reasoning (Colman and Bacharach [43]; Colman et al. [44]). Note that high-low is a static game of complete, but imperfect information. In particular, when a player makes his or her move, he or she does not know the move that his or her rival has taken or will take. Both players are aware of this. A player solves this problem by carrying out the following thought experiment. She or he says to herself or himself “suppose this were a game of perfect information where I move first and my rival moves second, after having observed my move. What move would she or he take? From the payoff matrix, it is obvious to me that her or his best reply to me choosing H is for her or him to also choose H; and the best reply for her or him to me choosing L is for her or him to also choose L. Since $\left(H,H\right)$ is better for me than $\left(L,L\right)$, I will choose H”. The other player also reasons in the same way. The outcome is $\left(H,H\right)$. It is important to note that there is no breakdown of causality, nor do the players believe so. In particular, Player 1 does not believe that by choosing H (or L) she or he causes Player 2 to choose H (or L).

We now illustrate evidential reasoning. Player 1 (say) reasons as follows. “$\left(H,H\right)$ is clearly better for both of us. If only I could meet with Player 2, I would put to her or him the proposal that we both play H. I am sure she or he would agree. However, player 2 is probably thinking the same. However, actually, we do not have to meet, because we both know that if we were to meet, then we would agree on $\left(H,H\right)$”. Let $p,q$ be the probabilities with which Players 1 and 2 play H, respectively. The players’ reasoning can be formalized using the identity social projection functions:

$${P}_{12}\left(H|p\right)=p,{P}_{21}\left(H|q\right)=q$$

The resulting expected payoff to Player 1 is

$${u}_{1}\left(p\right)={\pi}_{1}\left(p,{P}_{12}\left(H|p\right)\right)=2{p}^{2}+{\left(1-p\right)}^{2}=3{\left(p-\frac{1}{3}\right)}^{2}+\frac{2}{3}$$

Since ${\left(p-\frac{1}{3}\right)}^{2}$ attains a unique global maximum on $\left[0,1\right]$ at $p=1$, it follows that the same holds for ${u}_{1}\left(p\right)$. Hence, Player 1 chooses H with probability one, and similarly, Player 2 chooses H with probability one. $\left(H,H\right)$ is a consistent evidential equilibrium (Definition 14). We stress, again, that there is no breakdown of causality, nor do the players believe so. In particular, Player 1 does not believe that by choosing H, she or he causes Player 2 to choose H. She or he merely takes her or his preference for H as evidence that Player 2 will also choose H.

#### 9.1. Reputation, Correlated Equilibria, Level-K Models, Stackelberg Reasoning and Evolutionary Stable Equilibria

In the finitely-repeated prisoners’ dilemma game under incomplete information, a player may believe, possibly mistakenly, that his or her rival will respond to cooperation with cooperation and defection with defection (Kreps et al. [64]). Under these conditions, a player may wish to play cooperate until late in the game, so as to establish a reputation for cooperation. However, because defect is a dominant strategy, such a player will defect in the final period. This contradicts the observation of significant amounts of cooperation in the final period (Cooper et al. [6]). In particular, reputation (on its own) is unable to explain cooperation in the static (one-shot) prisoners’ dilemma game.

Since “cooperate” is a dominated strategy, it is not rationalizable and cannot be supported in a correlated equilibrium either. In Level-k models, any player with level k, $k\ge 1$, will never play a dominated strategy. Cooperation in a prisoners’ dilemma game cannot be explained in evolutionary games either. The reason is that the set of evolutionary stable equilibria is a subset of the set of Nash equilibria of the game.

Stackelberg reasoning cannot explain cooperation in the prisoners’ dilemma either. To see this, consider the payoff matrix in Table 4. According to Stackelberg reasoning, if Player 1, say, chooses C, then D is the best response for Player 2, giving Player 1 the payoff zero; and if Player 1 chooses D, then, again, D is the best response for Player 2, giving Player 1 the payoff $4>0$. Hence, according to Stackelberg reasoning, Player 1 will always choose D; and similarly for Player 2. Therefore, cooperation will never be the outcome of the prisoners’ dilemma game under Stackelberg reasoning.

#### 9.2. Team Reasoning

Several elements are key to team reasoning (Bacharach, [63]):

- The team agrees on a common objective.
- The task for each member of the team is agreed upon by all members.
- The way the surplus is divided among the members of the team is agreed upon by all members.

The team is consolidated if all members carry out their tasks. However, serious failure of a significant number of members is liable to cause the team to breakup.

As an illustration, consider the two-player game whose payoff matrix is given by Table 8, below. This, clearly, has the structure of a prisoners’ dilemma game.

C | D | |

C | $5,5$ | $2,6$ |

D | $6,2$ | $4,4$ |

The interpretation of the payoffs in Table 8 is as follows. If a player works on her or his own, she or he gets the payoff of four. If both work as a team, then the joint payoff is 10, which is shared equally to give each a payoff of five. However, if one player defects (shirks), she or he gets six. This is because she or he gets four for herself or himself from working on her or his own. The other player (who has not defected) generates a payoff of four for the team by working on her or his own. By the sharing rule, this payoff is shared equally, giving the defector a total payoff of six and the non-defector a payoff of two. D is the strictly dominant strategy for each player. Hence, this game has the unique Nash equilibrium, $\left(D,D\right)$, which gives each player the payoff four. However, if both players internalize the objective of the team, to maximise joint payoff, then each gets five.

#### 9.3. Other-Regarding Preferences

Consider, for example, other-regarding preferences in the model of Fehr and Schmidt [65,66]. Suppose that we have n players with incomes: ${y}_{1}\le {y}_{2}\le ...\le {y}_{n}$. We concentrate on the linear version, which has had considerable empirical support. The Fehr–Schmidt utility function of an individual with income ${y}_{j}\in \mathbf{Y}$ is given by:

$$U\left({y}_{j}\right)={y}_{j}-\frac{\beta}{n-1}\sum _{i=1}^{j-1}\left({y}_{j}-{y}_{i}\right)-\frac{\alpha}{n-1}\sum _{k=j+1}^{n}\left({y}_{k}-{y}_{j}\right),\alpha \ge 0,0\le \beta <1$$

Individual j cares for his own payoff, ${y}_{j}$, as under selfish preferences. However, he or she also suffers disutility from being ahead of others (altruism) and from being behind others (envy). $\beta \ge 0$ and $\alpha \ge 0$ are sometimes known as the parameters of, respectively, advantageous and disadvantageous inequity. When $\alpha =\beta =0$, we have purely selfish preferences. Evidence indicates that disadvantageous inequity is more important than advantageous inequity ($\alpha >\beta $), and one never benefits by throwing away one’s own income ($\beta <1$). Let us apply FSpreferences, with $\beta =0.3$, to the prisoners’ dilemma game $\left(n=2\right)$, whose payoff matrix is given by Table 4, above. It can then be easily seen that C is a strict best reply to C, and D is a strict best reply to D. Hence, other-regarding preferences can, potentially, explain cooperation in the prisoners’ dilemma game. We return to this in Section 9.5, below.

#### 9.4. Altruism

Cooper et al. [6] give an explanation of cooperation in the prisoners’ dilemma game based on altruism. They consider three types of players. Egoists who always defect; dominant strategy altruists who always cooperate; and best response altruists who respond to cooperate with cooperate and to defect with defect.

Because there are three types of players and because when a player takes his or her action, he or she does not know the type of his or her opponent, this is a (static) game of incomplete information. The relevant solution concept (under causal reasoning) is the Bayesian Nash equilibrium. Each type of each player chooses his or her strategy so as to maximize his or her expected payoff, given the strategies of all of the types of all of the other players. The joint probability of all types is common knowledge. Each type of each player uses this, and his or her knowledge of his or her own type, to update his or her belief about the types of his or her rivals using Bayes’ Law. See, for example, Fudenberg and Tirole [45], Part III.

To rationalize the behaviour of these types, Cooper et al. [6] assume that player i enjoys an extra amount of utility (warm glow), ${\delta}_{i}\ge 0$, from playing cooperate (irrespective of the action played by his or her rival). If ${\delta}_{i}$ is sufficiently small, then player i behaves as an egoist, i.e., defect is a dominant action for him or her. If ${\delta}_{i}$ is sufficiently large, then player i behaves as a dominant strategy altruist, i.e., cooperate is a dominant strategy for him or her. Finally, for ${\delta}_{i}$ in the intermediate range, player i behaves as a best response altruist, i.e., for him or her, cooperate is the best response to cooperate and defect is the best response to defect. As an illustration, assume that, in addition to the payoffs in Table 4, player i enjoys warm glow, ${\delta}_{i}$, from cooperation. The modified payoff matrix is now given by Table 9:

C | D | |

C | $8+{\delta}_{1},8+{\delta}_{2}$ | ${\delta}_{1},10$ |

D | $10,{\delta}_{2}$ | $4,4$ |

The following is easy to check.

- If ${\delta}_{i}<2$, then defect is a strictly dominant action for player i. In this case, player i behaves as an egoist.
- If ${\delta}_{i}>4$, then cooperate is a strictly dominant action for player i. In this case, player i behaves as a dominant strategy altruist.
- If $2<{\delta}_{i}<4$ (e.g., ${\delta}_{i}=3$), then cooperate is a strict best response for player i to player j, $j\ne i$, playing cooperate. Defect is a strict best response for player i to player j, $j\ne i$, playing defect. Player i behaves as a best response altruist.

Reviewing the evidence, including the evidence from their own experiments, Cooper et al. [6] conclude that behaviour in the prisoners’ dilemma game can best be explained if players are either egoists (always defect) or best response altruists (respond to cooperate with cooperate and to defect with defect). To illustrate this, take ${\delta}_{i}=0$ if player i is an egoist and ${\delta}_{i}=3$ if player i is a best response altruist. Consider the game between two best response altruists. Putting ${\delta}_{1}={\delta}_{2}=3$, the payoff Table 9 becomes:

$C\left(q\right)$ | $D\left(1-q\right)$ | |
---|---|---|

$C\left(p\right)$ | $11,11$ | $3,10$ |

$D\left(1-p\right)$ | $10,3$ | $4,4$ |

Let $a\in \left(0,1\right)$ be the probability of a player being the best response altruist. We have four cases to consider:

1. Suppose Player 1 (a best response altruist) plays C.

1.1 Player 1 meets an egoist with probability $1-a$. An egoist always defects, giving Player 1 the payoff three.

1.2 Player 1 meets a best response altruist with probability a. If Player 2 (a best response altruist) plays C (probability q), then the payoff to Player 1 is 11. If Player 2 plays D (probability $1-q$), then the payoff to Player 1 is 3.

2. Suppose Player 1 plays D.

2.1 If Player 1 meets an egoist (probability $1-a$), who always defects, then the payoff to player 1 will be four.

2.2 Suppose Player 1 meets a best response altruist (probability a). If Player 2 (a best response altruist) plays C (probability q), then the payoff to Player 1 is 10. If Player 2 plays D (probability $1-q$), then the payoff of Player 1 is four.

Hence, the expected utility of Player 1 is,
which simplifies to:
from which we get:
and hence:

$${U}_{1}\left(p,q\right)=p\left\{3\left(1-a\right)+a\left[11q+3\left(1-q\right)\right]\right\}+\left(1-p\right)\left\{4\left(1-a\right)+a\left[10q+4\left(1-q\right)\right]\right\}$$

$${U}_{1}\left(p,q\right)=4+6aq-p+2apq$$

$$\frac{\partial {U}_{1}\left(p,q\right)}{\partial p}=2aq-1$$

$$\begin{array}{ccc}\hfill q& >& \frac{1}{2a}\Rightarrow \frac{\partial {U}_{1}\left(p,q\right)}{\partial p}>0\hfill \\ \hfill q& <& \frac{1}{2a}\Rightarrow \frac{\partial {U}_{1}\left(p,q\right)}{\partial p}<0\hfill \end{array}$$

To simplify the discussion, let us concentrate on the pure strategy Bayesian Nash equilibria in a population with a fraction, $a\in \left(0,1\right)$, of best response altruists and a fraction, $1-a$, of egoists.

If $a<\frac{1}{2}$, then we have one pure strategy Bayesian Nash equilibrium, where all players defect.

If $a\ge \frac{1}{2}$, then we have two pure strategy Bayesian Nash equilibria. In one equilibrium (as before), all payers defect. In the second equilibrium, all best response altruists cooperate and all egoists defect.

Hence, for the particular payoffs in Table 10, cooperation can be sustained as a Bayesian Nash equilibrium in the one-shot prisoners’ dilemma game if the percentage of best response altruists is ≥50%.

See Cooper et al. [6] for a more extensive analysis, including mixed strategies and repeated prisoners’ dilemma games.

#### 9.5. Can Team Reasoning, Altruism or Other-Regarding Preferences Explain Cooperation in the Prisoners’ Dilemma Game?

Shafir and Tversky [5] presented experimental subjects with the usual one-shot prisoners’ dilemma game. They then considered the following two variants.

1. A player whose competitor had defected was informed of this and offered the chance to revise her or his decision.

2. A player whose competitor had cooperated was informed of this and offered the chance to revise her or his decision.

Shafir and Tversky [5] found that when a player did not know what the opponent had chosen, then the player defected in 63% of games. If the player was informed that the opponent had defected (Case 1, above) then in 97% of the games, the player defected. Thus, defection increased. on the other hand, if the player was informed that the opponent cooperated (Case 2, above), then in 84% of the games, the player defected. Thus, defection, again, increased. These results were replicated and extended by Croson [7], Li and Taplin [8], Busemeyer et al. [10] and Histrova and Grinberg [12].

We shall argue that these results are consistent with evidential reasoning, but not with team reasoning, altruism or other regarding preferences.

First, consider Case 1, above. Here, all four theories predict an increase in defection following a player being told that her or his rival has defected, in line with the evidence. However, the reasons are different. Once a player has been told the move of her or his rival uncertainty is resolved, she or he no longer needs recourse to evidential reasoning. Since she or he continues to play selfishly and since defect is a strictly dominant strategy, she or he defects. An observation of defection is liable to destroy team spirit (Section 9.2). Hence, team reasoning is consistent with the observation of an increase in defection. According to other-regarding preferences, as in Section 9.3 above, a player who originally chose D will continue to choose D after she or he has been told that her or his rival had defected. However, a player who had originally chosen C will change to D after being told that her or his rival had defected. Hence, the other-regarding preferences model will predict an increase in defection, in line with the evidence. Finally, consider altruism, as in Section 9.4 above. The egoists will not change their behaviour; they will continue to defect. Likewise, the dominant strategy altruists will not change their behaviour; they will continue to cooperate. The best response altruists who chose to defect will continue to choose defect after being informed that their rival has defected. However, each best response altruist who had chosen cooperation will now switch to defection. Hence, the amount of defection will increase in line with the evidence.

Second, consider Case 2 above. Here, evidential reasoning predicts an increase in defection, in line with the evidence. The reason is that uncertainty is resolved once a player has been told the move of her or his rival. She or he no longer needs recourse to evidential reasoning. If she or he had defected, she or he will choose defect again. If she or he had cooperated, she or he will now defect. However, the other three theories predict a decline in defection, contrary to the evidence. For team-reasoning, the team spirit is strengthened once a player is told that her or his rival has cooperated. Hence, a player who had cooperated continues to cooperate. A team member who had initially chosen to defect may now change to cooperate. According other-regarding preferences, as in Section 9.3, a player who originally chose C will continue to choose C after she or he has been told that her or his rival had cooperated. However, a player who had originally chosen D will change to C after being told that her or his rival had cooperated. Hence, the other-regarding preferences model will predict a reduction in defection, contrary to the evidence. Finally, consider altruism, as in Section 9.4. The egoists will not change their behaviour; they will continue to defect. Likewise, the dominant strategy altruists will not change their behaviour; they will continue to cooperate. The best response altruists who chose to cooperate will continue to cooperate. However, each best response altruist who had chosen defect will now find it in their best interest to cooperate. Hence, the extent of defection will decrease, contrary to the evidence.

## 10. Conclusions

A large number of experimental subjects do not play a Nash equilibrium in well-known games, such as prisoners’ dilemma, the hawk-dove game, voting games, public goods games and oligopoly games. It would seem that this constitutes strong grounds for game theory to be open to alternative equilibrium concepts in static games. Aumann and Brandenburger [67] gave epistemic conditions under which the play of a game would result in a Nash equilibrium. A violation of Nash equilibrium is also a violation of the epistemic conditions that imply it.

In static game players are uncertain about which actions the other players will take (or have taken). A great deal of evidence suggests that in resolving uncertainty about what other players will do (or have done), players assign diagnostic significance to their own actions. Such reasoning is described as evidential reasoning. Players using evidential reasoning do not believe that their actions cause the action of others; it merely informs their belief about the actions taken by others. However, players who use evidential reasoning can violate standard rationality assumptions, such as Savage’s sure thing principle. Thus, evidential reasoning is best viewed as a heuristic rather than a sound rational principle.

The aim of our paper is to explore the significance of evidential reasoning for the class of static games of complete information. We define evidential games in which some players use evidential reasoning. We also propose the relevant solution concepts for such games: evidential equilibrium and consistent evidential equilibrium.

The evidence shows that the cooperative outcome in the prisoners’ dilemma game occurs more than 50% of the time, despite cooperation being strictly dominated by defection. Other-regarding preferences and altruism under causal reasoning can both explain this. However, neither team-reasoning, other-regarding preferences nor altruism can explain why the amount of defection increases when players are told that their rivals have, in fact, cooperated. We find that the evidence can best be explained by a mixture of players, some who use evidential reasoning and others who use causal reasoning.

Our proposal of an identity social projection function appears adequate for symmetric games. However, our general notion of a social projection functions appears too general (Proposition 3). We illustrated how the Nash bargaining axioms restrict social projections functions sufficiently to get unique outcomes in the Nash demand game (Example 5) and oligopoly games (Proposition 9).

Our framework can naturally be extended to incomplete information games and dynamic games, but we lack a body of evidence that could underpin such extensions. Hence, we leave such developments for future research as more evidence accumulates.

## Acknowledgments

We are very grateful to Subir Bose, Andrew Colman, Vincent Crawford, Herbert Gintis, Briony Pulford, Ludovic Renou, Chris Wallace, two anonymous referees and the editor for their comments on the paper. We remain responsible for all errors and omissions.

## Author Contributions

Both authors contributed equally to this article.

## Conflicts of Interest

The authors declare no conflict of interest.

## References

- Camerer, C. Behavioral Game Theory: Experiments in Strategic Interaction; Princeton University Press: Princeton, NJ, USA, 2003. [Google Scholar]
- Lewis, D. Prisoners’ dilemma is a Newcomb problem. Philos. Public Aff.
**1979**, 8, 235–240. [Google Scholar] - Howard, J. Cooperation in the Prisoner’s Dilemma. Theory Decis.
**1988**, 24, 203–213. [Google Scholar] [CrossRef] - Rapoport, A. Experiments with N-Person Social Traps I: Prisoners’ Dilemma, Weak Prisoners’ Dilemma, Volunteers’ Dilemma, and Largest Number. J. Confl. Resolut.
**1988**, 32, 457–472. [Google Scholar] [CrossRef] - Shafir, E.; Tversky, A. Thinking through uncertainty: Nonconsequential reasoning and choice. Cogn. Psychol.
**1992**, 24, 449–474. [Google Scholar] [CrossRef] - Cooper, R.; DeJong, D.; Forsythe, R.; Ross, T. Cooperation without Reputation: Experimental Evidence from Prisoner’s Dilemma Games. Games Econ. Behav.
**1996**, 12, 187–218. [Google Scholar] [CrossRef] - Croson, R. The disjunction effect and reason-based choice in games. Organ. Behav. Hum. Decis. Process.
**1999**, 80, 118–133. [Google Scholar] [CrossRef] [PubMed] - Li, S.; Taplin, J. Examining whether there is a disjunction effect in prisoner’s dilemma games. Chin. J. Psychol.
**2002**, 44, 25–46. [Google Scholar] - Acevedo, M.; Krueger, J. Evidential reasoning in the prisoners’ dilemma game. Am. J. Psychol.
**2005**, 118, 431–457. [Google Scholar] [PubMed] - Busemeyer, J.R.; Matthews, M.; Wang, Z. A quantum information processing explanation of disjunction effects. In Proceedings of the 29th Annual Conference of the Cognitive Science Society and the 5th International Conference of CognitiveScience, Vancouver, BC, Canada, 19–26 July 2006; Sun, R., Myake, N., Eds.; Lawrence Erlbaum Associates, Inc.: Mahwah, NJ, USA, 2006; pp. 131–135. [Google Scholar]
- Zhong, C.; Loewenstein, J.; Murnighan, J. Speaking the Same Language: The Cooperative Effects of Labeling in the Prisoner’s Dilemma. J. Confl. Resolut.
**2007**, 51, 436–456. [Google Scholar] [CrossRef] - Histrova, E.; Grinberg, M. Disjunction effect in prisoner’s dilemma: Evidence from an eye-tracking study. In Proceedings of the 30th Annual Conference of the Cognitive Science Society, Washington, DC, USA, 23–26 July 2008; Love, B., McCrae, K., Sloutsky, V., Eds.; Cognitive Science Society: Austin, TX, USA, 2008. [Google Scholar]
- Khadjavi, M.; Lange, A. Prisoners and their Dilemma. J. Econ. Behav. Organ.
**2013**, 92, 163–175. [Google Scholar] [CrossRef] - Fouraker, L.; Siegel, S. Bargaining Behavior; McGraw-Hill: New York, NY, USA, 1963. [Google Scholar]
- Huck, S.; Normann, H.; Oechssler, J. Learning in Cournot oligopoly—An experiment. Econ. J.
**1999**, 109, 80–95. [Google Scholar] [CrossRef] - Bosch-Domènech, A.; Vriend, N. Imitation of successful behaviour in Cournot markets. Econ. J.
**2003**, 113, 495–524. [Google Scholar] [CrossRef] - Duersch, P.; Kolb, A.; Oechssler, J.; Schipper, B. Rage against the machines: How subjects play against learning algorithms. Econ. Theory
**2010**, 43, 407–430. [Google Scholar] [CrossRef] - Dawes, R.; Thaler, R. Anomalies: Cooperation. J. Econ. Perspect.
**1988**, 2, 187–197. [Google Scholar] [CrossRef] - Fehr, E.; Gächter, S. Cooperation and punishment in public goods experiments. Am. Econ. Rev.
**2000**, 90, 980–994. [Google Scholar] [CrossRef] - Gächter, S.; Thöni, C. Social Learning and Voluntary Cooperation Among Like-Minded People. J. Eur. Econ. Assoc.
**2005**, 3, 303–314. [Google Scholar] [CrossRef] - Quattrone, G.; Tversky, A. Causal versus diagnostic contingencies: On self deception and the voter’s illusion. J. Personal. Soc. Psychol.
**1984**, 46, 237–248. [Google Scholar] [CrossRef] - Grafstein, R. An evidential decision theory of voter turnout. Am. J. Political Sci.
**1991**, 35, 989–1010. [Google Scholar] [CrossRef] - Forsythe, R.; Nelson, G.; Neumann, R.; Wright, J. Anatomy of an Experimental Political Stock Market. Am. Econ. Rev.
**1992**, 82, 1142–1161. [Google Scholar] - Rassenti, S.; Reynolds, S.; Smith, V.; Szidarovszky, F. Adaptation and Convergence of Behavior in Repeated Experimental Cournot Games. J. Econ. Behav. Organ.
**2000**, 41, 117–146. [Google Scholar] [CrossRef] - Krueger, J.; Acevedo, M. A game-theoretic view of voting. J. Soc. Issues
**2008**, 64, 467–485. [Google Scholar] [CrossRef] - Koudenburg, N.; Postmes, T.; Gordijn, E. If they were to vote they would vote for us. Psychol. Sci.
**2011**, 22, 1506–1510. [Google Scholar] [CrossRef] [PubMed] - Requate, T.; Waichman, I. A profit table or a profit calculator? A note on the design of Cournot oligopoly experiments. Exp. Econ.
**2011**, 14, 36–46. [Google Scholar] [CrossRef] - Delavande, A.; Manski, C. Candidate preferences and expectations of election outcomes. Proc. Natl. Acad. Sci. USA
**2012**, 109, 3711–3715. [Google Scholar] [CrossRef] [PubMed] - Moulin, H. Game Theory for the Social Sciences, 2nd ed.; NYU Press: New York, NY, USA, 1986. [Google Scholar]
- Nagel, R. Unraveling in guessing games: An experimental study. Am. Econ. Rev.
**1995**, 85, 1313–1326. [Google Scholar] - De Sousa, J.; Hollard, G.; Terracol, A. Do non-strategic players really exist? In Evidence from Experimental Games Involving Step Reasoning; Mimeo, Paris University: Paris, France, 2012. [Google Scholar]
- Ivanov, A.; Levin, D.; Niederle, M. Can Relaxation of Beliefs Rationalize the Winner’s Curse? An Experimental Study. Econometrica
**2010**, 78, 1435–1452. [Google Scholar] [CrossRef] - Rubinstein, A.; Salant, Y. Isn’t Everyone Like Me? On the Presence of Self-Similarity in Strategic Interactions. Available online: http://arielrubinstein.tau.ac.il/papers/selfsimilarity.pdf (accessed on 21 July 2015).
- Krueger, J. Social Projection as a source of Cooperation. Curr. Dir. Psychol. Sci.
**2013**, 22, 289–294. [Google Scholar] [CrossRef] - Lucas, G.; McCubbins, M.; Turner, M. Can We Build Behavioral Game Theory? Available online: http://dx.doi.org/10.2139/ssrn.2278029 (accessed on 12 June 2013).
- Kahneman, D.; Slovic, P.; Tversky, A. (Eds.) Judgement Under Uncertainty: Heuristics and Biases; Cambridge University Press: Cambridge, UK, 1982.
- Kahneman, D. Maps of Bounded Rationality: Psychology for Behavioral Economics. Am. Econ. Rev.
**2003**, 93, 1449–1475. [Google Scholar] [CrossRef] - Savage, L. The Foundations of Statistics; John Wiley and Sons: Hoboken, NJ, USA, 1954. [Google Scholar]
- Ellsberg, D. Risk, ambiguity and the Savage axioms. Q. J. Econ.
**1961**, 75, 643–669. [Google Scholar] [CrossRef] - Krueger, J. From social projection to social behaviour. Eur. Rev. Soc. Psychol.
**2007**, 18, 1–35. [Google Scholar] [CrossRef] - Harsanyi, J.; Selten, R. A General Theory of Equilibrium Selection in Games; MIT Press: Cambridge, MA, USA, 1988. [Google Scholar]
- Van Damme, E. Stability and Perfection of Nash Equilibria; Springer-Verlag: Berlin, Heidelberg, Germany, 1991. [Google Scholar]
- Colman, A.; Bacharach, M. Payoff dominance and the Stackelberg Heuristic. Theory Decis.
**1997**, 43, 1–19. [Google Scholar] [CrossRef] - Colman, A.; Pulford, B.; Lawrence, C. Explaining strategic coordination: Cognitive hierarchy theory, strong Stackelberg reasoning, and team reasoning. Decision
**2014**, 1, 35–58. [Google Scholar] [CrossRef] - Fudenberg, D.; Tirole, J. Game Theory; The MIT Press: Cambridge, MA, USA; London, UK, 1991. [Google Scholar]
- Nash, J. Equilibrium points in n-person games. Proc. Natl. Acad. Sci.
**1950**, 36, 48–49. [Google Scholar] [CrossRef] [PubMed] - Nash, J. Non-cooperative games. Ann. Math.
**1951**, 54, 286–295. [Google Scholar] [CrossRef] - Robbins, J.; Krueger, J. Social projection to ingroups and outgroups: A review and meta-analysis. Personal. Soc. Psychol. Rev.
**2005**, 9, 32–47. [Google Scholar] [CrossRef] [PubMed] - Riketta, M.; Sacramento, C. They Cooperate With Us, So They Are Like Me: Perceived Intergroup Relationships Moderates Projection >From Self to Outgroups. Group Process. Intergroup Relat.
**2008**, 11, 115–131. [Google Scholar] [CrossRef] - Weber, M. The protestant ethic and the spirit of capitalism. Archiv für Sozialwissenschaft und Sozialpolitik
**1904**, 20–21, 1–110. [Google Scholar] - McClelland, D. The Achieving Society; Van Nostrand: New York, NY, USA, 1961. [Google Scholar]
- Nozick, R. The Nature of Rationality; Princeton University Press: Princeton, NJ, USA, 1993. [Google Scholar]
- Colman, A.; Pulford, B. Problems and pseudo-problems in understanding cooperation in social dilemmas. Psychol. Inq.
**2012**, 23, 39–47. [Google Scholar] [CrossRef] - Gintis, H. The Bounds of Reason: Game Theory and the Unification of the Behavioral Sciences; Princeton University Press: Princeton, NJ, USA, 2009. [Google Scholar]
- Rapoport, A. Two-Person Game Theory; Michigan University Press: Ann Arbor, MI, USA, 1966. [Google Scholar]
- Nash, J. The bargaining problem. Econometrica
**1950**, 18, 155–162. [Google Scholar] [CrossRef] - Nash, J. Two-Person Cooperative Games. Econometrica
**1953**, 21, 128–140. [Google Scholar] [CrossRef] - Osborne, M.; Rubinstein, A. Bargaining and Markets; Academic Press: Waltham, MA, USA, 1990. [Google Scholar]
- Waichman, I.; Requate, T.; Siang, C.K. A Cournot experiment with managers and students: Evidence from Germany and Malaysia. B.E. J. Econ. Anal. Policy
**2010**, 10, 1–17. [Google Scholar] [CrossRef] - Schelling, T. The Strategy of Conflict; Harvard University Press: Cambridge, MA, USA, 1960. [Google Scholar]
- Sugden, R. Thinking as a team: Towards an explanation of non-selfish behaviour. Soc. Philos. Policy
**1993**, 10, 69–89. [Google Scholar] [CrossRef] - Bacharach, M. Interactive team reasoning: A contribution to the theory of cooperation. Res. Econ.
**1999**, 53, 117–147. [Google Scholar] [CrossRef] - Bacharach, M. Beyond Individual Choice: Teams and Frames in Game Theory; Gold, N., Sugden, R., Eds.; Princeton University Press: Princeton, NJ, USA, 2006. [Google Scholar]
- Kreps, D.; Milgrom, P.; Roberts, J.; Wilson, R. Rational Cooperation in the Finitely Repeated Prisoner’s Dilemma. J. Econ. Theory
**1982**, 17, 245–252. [Google Scholar] [CrossRef] - Fehr, E.; Schmidt, K. A theory of fairness, competition and cooperation. Q. J. Econ.
**1999**, 114, 817–868. [Google Scholar] [CrossRef] - Fehr, E.; Schmidt, K. On inequity aversion: A reply to Binmore and Shaked. J. Econ. Behav. Organ.
**2010**, 73, 101–108. [Google Scholar] [CrossRef][Green Version] - Aumann, R.; Brandenburger, A. Epistemic Conditions for Nash Equilibrium. Econometrica
**1995**, 63, 1161–1180. [Google Scholar] [CrossRef]

^{1.}In Bayesian games, like-minded players are often used to denote the case where players share the same priors. However, in this paper we use like-mindedness in its more general psychological sense.^{2.}Also known as social projection (see Krueger [34]).^{3.}According to Krueger ([34], p. 291) “… people use their own choices to predict the choices of others and then select the strategy that is best for them”. Note well that Krueger uses the word predict, not cause.^{4.}See, for example, Fudenberg and Tirole [45], Part I.^{5.}We use the term social projection function because, on the one hand, it is obviously connected to social projection and evidential reasoning and, on the other hand, to distinguish it from the term projection function as commonly used in mathematics.^{7.}We are grateful to Andrew Colman for drawing our attention to this example.^{8.}We are grateful to Ludovic Renou for drawing our attention to this problem.^{9.}“One states as axioms several properties that it would seem natural for the solution to have and then one discovers that the axioms actually determine the solution uniquely.” Nash [57].^{10.}The interested reader can consult the bibliography in Requate and Waichman [27].^{11.}For a review, see Colman and Pulford [53].^{12.}We are grateful to Andrew Colman for drawing our attention to this example.

© 2015 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/4.0/).