1. Logical Axioms and Rules
In logic, one studies how conclusions follow from certain given premisses. We shall restrict ourselves here to propositional logic, i.e., we shall restrict ourselves to arguments in which all premisses as well as the conclusion consist of propositions, atomic or built up from simpler propositions by means of the connectives ∧ (and), ∨ (or), → (if …, then …), and ¬ (not). The atomic propositions are rendered by letters P, Q, and R, and by means of the connectives one may build more complex propositions, such as . The latter formula may stand for ‘if it is raining or snowing, then we do not go for a walk’, for instance.
Now suppose that
P and
are given premisses and that they are both true; then
must also be true. Why? Supposing
P is true, then
is also true, and because
is supposed to be true, it follows that
must also be true. Schematically,
Note that the interpretation of the symbols
P,
Q, and
R does not matter for the validity of the argument.
P may stand for ‘it is raining’, but it may also stand for ‘I like honey’, for instance. Additionally,
Q may stand for ‘I like sugar’ and
R for ‘I like salt’, for instance. If the premisses
P and
are both true, then the conclusion
must also be true, no matter what the precise interpretation (meaning) of
P,
Q, and
R is. The validity of the argument is due to the meaning of the propositional connectives ∧ (and), ∨ (or), → (if
…, then
…), and ¬ (not) and does not depend on the content of
P,
Q, and
R.
So, by definition, a formula B is a valid consequence of given premisses iff for every interpretation of the atomic formulas P, Q, R, … occurring in , and B, whenever the premisses are all true (under the given interpretation), then the conclusion B must also be true (under the given interpretation). Notation: . As we have seen: .
One should realize that the validity of an argument does not say anything about the truth of the premisses. It only says that if the premisses are true (given a particular interpretation), then the conclusion must also be true (given the interpretation in question). For instance, the proposition ‘it is raining’ (P) may be false, in which case the conclusion ‘we do not go for a walk’ () may be false too.
Now, it turns out that one may select a finite, small number of (logical) axioms, which characterize the meaning of the propositional connectives, and one rule, called Modus Ponens (MP), such that every valid consequence
B of given premisses
can be deduced from the given premisses by applications of Modus Ponens and the axioms. Here are the (logical) axioms for the connectives:
| Introduction Elimination |
∧ | |
| |
| |
∨ | |
| |
| |
→ | |
| |
¬ | |
Additionally, the rule Modus Ponens (MP) is the following one: .
For instance, the valid consequence of the premisses P and can be deduced from the given premisses by applications of the logical axioms and two applications of Modus Ponens:
The schema above is called a deduction of from the premisses P and . Additionally, is called deducible from P and , because there exists a deduction of from the premisses P and ; notation: . More generally, B is deducible from, notation , if one can make a deduction of B from , using only Modus Ponens (MP) and the logical axioms.
It is easy to see that our logical deduction system is sound, i.e., logical deduction cannot lead from true premisses to false conclusions. That is, any formula B deducible from premisses is a valid consequence of these premisses. In other words, if , then . This is the soundness theorem for (classical) propositional logic.
Conversely, the completeness theorem for (classical) propositional logic says that every valid consequence
B of given premisses
can be deduced from the premisses by using only the logical axioms for the connectives and Modus Ponens. In short: if
, then
. For a proof, see any logic textbook, for instance de Swart [
1]. This theorem says that any valid human mathematical argumentation can be simulated by using only the logical axioms and the rule Modus Ponens. In other words, the logical axioms for the connectives together with the rule Modus Ponens are a characterization of human mathematical reasoning. Because the rule Modus Ponens and these logical axioms can be captured in a computer program, in theory such a computer program can simulate human (mathematical) reasoning and hence is equipped with artificial intelligence.
It is important to notice that the right selection of the logical axioms and rules is a subtle one. For instance, although the formula of Peirce,
, only contains the logical connective →, it cannot be deduced from the two axioms for → alone, see Exercise 2.44 in de Swart [
1]. In addition, one needs the axioms for ¬. Additionally, in order to make a deduction of
(without using any premisses), one makes a deduction of
and next uses the axiom
and Modus Ponens, with
C replaced by
. If one replaces the axiom
, for instance, by its converse,
, then the completeness theorem does not hold anymore and
cannot be deduced anymore.
Interestingly, if one replaces the logical axiom
by
, leaving all other axioms unchanged, a completely different logic results, called intuitionistic logic; see, for instance Chapter 8 in de Swart [
1]. In intuitionistic logic, although
is deducible,
is no longer deducible and it should not be, because that would mean that for any proposition
A, one has a proof of
A or a proof of
, which evidently is not the case: there are many unsolved problems
A.
Additionally, it should be noted that mathematical (propositional) reasoning may be characterized by different sets of axioms. For instance, it turns out that the three axioms , , and , together with Modus Ponens, constitute a different characterization, taking into account that conjunction (∧) and disjunction (∨) can be defined in terms of implication (→) and negation (¬). However, be aware: replacing by destroys the completeness theorem!
Basic propositions such as ‘john is a Philosopher’ typically consist of a subject (john) and a predicate (is a Philosopher). In predicate logic, one renders the subject–predicate structure of this basic proposition by, for instance,
: the predicate ‘(being a) P(hilosopher)’ applies to j(ohn). In predicate logic, this subject–predicate structure of basic propositions is expressed by using predicate symbols and symbols to denote individuals. Consequently, one may deal with sentences such as ‘for all persons
x,
x is a Philosopher’, rendered by
, and with sentences such as ‘there is at least one person
x such that
x is a Philosopher’, rendered by
. The axioms for propositional logic now have to be supplemented by axioms for ∀ and ∃:
and
, where
c may indicate an individual such as j(ohn) or a variable such as
. Additionally, apart from the rule Modus Ponens, in predicate logic one also needs a rule for ∀ and a rule for ∃; in order to avoid some technical details we refer to, for instance, de Swart [
1], Chapter 4.
The soundness and completeness theorem also hold for predicate logic: if , then , and conversely, if , then . The first one says that our deduction system is sound, i.e., it does not lead from true premisses to false conclusions. In other words, if B is deducible from and are true given an interpretation, then also B is true (given the same interpretation). The latter theorem says that we have enough axioms and rules to deduce any valid consequence of given premisses from those premisses by using the logical axioms and rules.
2. Arithmetical Axioms
In elementary arithmetic, one studies the properties of addition, +, and multiplication, ·, of the natural numbers . From now on, we shall indicate the natural numbers by , where the symbol s stands for the successor function. We shall use letters as variables over natural numbers.
It became gradually clear that many arithmetical facts, such as
, can be logically deduced from a small number of more basic facts. In 1889, G. Peano proposed the following set of (arithmetical) axioms for addition and multiplication:
| |
| |
Induction axiom: let
express a property of a natural number
n.
For instance, to deduce
from the Peano axioms, one needs the induction axiom. Together with the axioms for the successor function and equality (=), these axioms constitute the so-called
Peano axioms for elementary arithmetic:
|
and |
|
(equality is reflexive)
|
(equality is symmetric) |
(equality is transitive)
|
Let us indicate the set of Peano axioms by
. Then, we know from the soundness and completeness theorem for predicate logic that
iff
. The latter means that any interpretation of
and · that makes the Peano axioms true, will also make B true. However, doing arithmetic, we are not interested in all possible interpretations that make the Peano axioms true; we are only interested in the standard interpretation in which the variables are interpreted as natural numbers (and not as persons, for instance) and in which
and · are interpreted as the successor function and the addition and multiplication of natural numbers, respectively (and not otherwise). Let us denote this standard interpretation by
. Then, evidently, if
, then
B is true under the standard interpretation
because the Peano axioms are true under
. Experience now suggests that any formula
B, such as
, which is true under the standard interpretation, may be deduced from the Peano axioms. That is, one might expect that if
B is true under the standard interpretation
, then
. In fact, Kleene [
2] shows that many such formulas
B actually can be deduced from the Peano axioms.
It was quite a surprise when K. Gödel, in 1931, proved that there are formulas
B that are true under the standard interpretation
but that cannot be deduced from the Peano axioms. This is called Gődel’s incompleteness theorem: the Peano axioms are incomplete, i.e., they cannot deduce all formulas
B that are true under the standard interpretation. In other words, it is impossible to fully characterize elementary number theory. Even more interestingly, the Peano axioms cannot even be extended in a consistent way in order to obtain the desired result. However, the formula
B constructed by Gődel is a very special one: it says of itself that it is not deducible from the Peano axioms. For more details see, for instance, Kleene [
2] or de Swart [
1].
3. Axioms in Social Choice Theory
In traditional social choice theory, one assumes that each voter i gives as input a preference ordering of the alternatives or candidates and a profile or vote configuration p is then a mapping that assigns to each voter i his preference ordering . Given such a profile p, a social ranking rule F assigns to p a social ranking of the candidates and the highest ranked candidate in is considered to be the winner.
:= and , i.e., given profile p, the social ranking rule F declares x and y to be indifferent. Additionally, := and not , i.e., given profile p, the social ranking rule F prefers x to y.
The plurality rule (PR), for instance, is defined by
iff the number of first votes for
x in
p is larger than or equal to the number of first votes for
y in
p. The majority rule (MR), on the other hand, is defined by
iff
x beats
y, i.e., there are at least as many voters who prefer
x to
y in
p than there are voters who prefer
y to
x in
p. For illustration, consider the following profile
p with five voters, numbered 1, 2, 3, 4 and 5, over four candidates,
, and
d.
|
|
|
|
|
Then
:
, i.e.,
, where ≈ denotes indifference, because
a is two times first in
p, while
b,
c and
d are only once first in
p, while
:
, i.e.,
because
c beats
a with 3 against 2,
a beats
b with 3 against 2, and
b beats
d with 4 against 1. So, given this profile, the plurality rule would make
a the winner, while the majority rule would make
c the winner.
Notice that if there are only two alternatives, the plurality rule and the majority rule coincide.
Already in 1952, K. May [
3] gave a characterization of the majority rule or the plurality rule in the case of only two alternatives.
Theorem 1. In the case of only two alternatives, the majority (or plurality) rule is the only social ranking rule F that satisfies the following six axioms:
1 Based on comparing: the input for the social ranking rule F consists of profiles or vote configurations, which represent the individual preference orderings of the voters over the candidates.
2 Unrestricted domain: the social ranking rule F is defined over the set ofallpossible profiles, i.e., the social ranking rule F assigns a social ranking to every (logically) possible vote configuration p.
3 Anonymity: the outcome of the social ranking rule F does not depend on the names of the voters.
4 Neutrality: the outcome of the social ranking rule F does not depend on the names of the candidates. More precisely, let q be the profile resulting from profile p by interchanging the names A and B and suppose , then .
5 Monotonicity: if and A gets more support by at least one voter resulting in profile q, then .
6 Completeness: for each profile p and for all candidates A and B, or or .
Proof. Here, we present the proof as given by Balinski and Laraki [
4]. Suppose the social ranking rule
F satisfies all of the six axioms above. By anonymity, it follows that only the numbers count. Let
be the number of voters in profile
p who prefer
A to
B,
the number of voters in
p who prefer
B to
A, and
the number of voters in
p who are indifferent between
A and
B. Completeness guarantees that there must be an outcome.
Suppose in profile p and . Neutrality implies that if q is the profile resulting from interchanging the names of A and B in p, . This is impossible because the new profile q is identical to the original profile p. Therefore, if in p, then . (*)
Suppose in p. Change the preferences of voters who prefer A to B in p into indifferences, resulting in a valid profile q in which the number of voters who prefer A to B equals the number of voters who prefer B to A; according to axiom 2, all profiles are allowed. For this new profile q, we have because of (*). Next, by going back from profile q to the original profile p, the monotonicity axiom guarantees that . □
Assuming any number of candidates, Balinski and Laraki add the following two axioms to the six axioms just mentioned above:
7 Transitivity: if and , then .
We shall show below that the majority rule (MR) is not transitive, while the plurality rule (PR) clearly is.
8 Independence of Irrelevant Alternatives (IIA): If and q is a profile resulting from profile p by adding or removing candidates, then still .
From its definition, it follows immediately that the majority rule is IIA, but we shall see below that the plurality rule is not.
In order to show that the majority rule (MR) is not transitive, one may consider the so-called Condorcet profile
q:
There is a majority, consisting of the first and the third group, which prefers A to B; therefore, A
B. There is another majority, consisting of the first and second group, which prefers B to C; therefore, B
C. But there is also a majority, consisting of the second and third group, which prefers C to A; therefore, C
A and not A
C, as transitivity would suggest. So, application of the majority rule to this Condorcet profile
q yields a cyclic outcome: A
B
C
A. There is no winner when the majority rule is applied to the Condorcet profile
q.
In the literature, this is known as
Condorcet’s paradox. One might guess that this problem hardly occurs in practice. However, Gehrlein [
5] proved that in the case of three candidates and a large number of voters, the probability of a cyclic outcome under majority rule is about 1/16, assuming the impartial anonymous culture (IAC).
Next, we provide a real-life example showing that the plurality rule (PR) is not IIA. Let q be the following profile, which actually appeared in the year 2000 at the presidential elections in Florida.
profile
qprofile q | 2.912.790 | 49.18 % : | Bush | | |
2.912.253 | 49.17 % : | Gore | | |
97.488 | 01.65 % : | Nader | Gore | Bush |
Let
p be the profile resulting from profile
q by leaving out candidate Nader. Then Gore
Bush, because Gore has more first votes in
p than Bush. However, profile
q results from profile
p by adding candidate Nader and applying plurality rule (PR) to profile
q, Bush wins from Gore: Bush
Gore, because the additon of candidate Nader lowers the number of voters for Gore. So, under the plurality rule in the competition between Bush and Gore, a third (irrelevant) candidate Nader plays a decisive role.
Theorem 2. (Balinski and Laraki’s version of Arrow’s impossibility theorem): for more than two candidates, there is no social ranking rule F satisfying all axioms 1–8.
Proof. (Balinski and Laraki [
4]) Suppose
F satisfies all eight axioms and
q is a profile. IIA says that in order to determine whether
, we can leave out all other candidates and consider only these two. From axioms 1 till 6, it follows by May’s theorem that the social ranking rule
F must be the majority rule (MR). However, the domain is unrestricted (axiom 2), so any Condorcet profile is allowed, and hence transitivity is violated. Therefore, there can be no social ranking rule that satisfies all axioms 1–8. □
In his original impossibility theorem, instead of the monotonicity axiom, Arrow [
6] considered
Pareto’s principle, which says that if all voters weakly prefer
A to
B (
for all voters
i), then also society must weakly prefer
A to
B (
). Arrow proved that any social ranking rule
F that satisfies axioms 1–4, 6–8, and Pareto’s principle is a dictatorship, i.e., there is a voter
d, called the dictator, such that the social outcome
is always the preference ordering
of
d. In other words, axioms 1–4, 6–8, Pareto’s principle, and non-dictatorship are logically inconsistent; there can be no social ranking rule
F that satisfies all these axioms.
Moreover, while the proof of Arrow’s impossibility theorem is relatively complicated, the proof of Balinski and Laraki’s version of Arrow’s theorem is relatively simple.
Van der Hout [
7] et. al. proved that the plurality ranking (PR) rule can be characterized by the following three axioms or properties for a social ranking rule
F:
Consistency: Let and be disjoint sets of voters, a profile for , a profile for , and the profile for that corresponds with on and with on . F is consistent iff for all alternatives , if and , then .
Faithfulness: If F is applied to the profile of alone, then the first element in the social preference ordering is the first element of i’s individual preference ordering .
FS-cancellation (FS stands for first score): given a profile
p such that the number of voters who prefer
x most in
p equals the number of voters who prefer
y most in
p, then
.
Theorem 3 ([
7]).
The plurality ranking (PR) rule is the only social ranking rule that satisfies consistency, faithfulness, and FS-cancellation. Proof. Let F be a social ranking rule satisfying the three axioms just mentioned. For p, a profile on the set V of voters, let be the number of voters i in V for whom x is the most preferref alternative.
FS-cancellation says if , then . So, in order to show that F is the plurality rule, we still have to show that if , then . So, suppose .
Case 1: if x is the most preferred alternative of all voters i, then—because F is faithful—x is the first element of for all voters i and hence, by repeated application of consistency, x is the first element of the social ranking and hence . (*)
Case 2: suppose on the other hand that there is some voter i in V for whom x is not the most preferred alternative in . Then, the set V of voters may be split into three disjoint subsets , , and such that (1) all voters in have x as the most preferred alternative, (2) the number of voters in with x as most preferred alternative equals the number of voters in with y as most preferred alternative, and (3) the voters in have neither x nor y as most preferred alternative. For instance, if profile p looks as follows: , then , , and . Let be the profiles on respectively , corresponding with p. We have already seen in (*) that . By FS-cancellation, . Additionally, because , again by FS cancellation, . Finally, since F is consistent, it follows that , i.e., . □
In [
7], several other characterizations of the plurality ranking rule are given. Additionally, in the literature characterizations are given of several other social ranking and choice rules; see, for instance, [
8,
9,
10].
4. Axioms in Game Theory
Let denote a set of players, and let v be a function that assigns a real number to each non-empty subset S or coalition of N, such that . Then, the pair is called a cooperative transferable utility (TU) game. When N is clear from the context, we simply speak of the game v. A (0-1)-game is a game in which or . It is a simple game if it is not identically 0 and satisfies the condition of monotonicity: whenever . In these games, a coalition S is called winning if and losing if .
Collective decision-making procedures can be modelled by a (0-1)-game whose winning coalitions are those that can make a decision without the vote of the remaining players.
A winning coalition is minimal if it does not contain any other winning coalition. A player i is said to be a swinger in a coalition S, if S is winning and is not. A player i is said to be a null player if he or she is not a swinger in any coalition, i.e., for all S. In a simple game v, , respectively, , denotes the set of all winning, respectively, all minimal winning, coalitions. Any of these sets, or , fully characterizes the game v.
A TU game is superadditive if whenever . In the context of simple games, the superadditivity property is equivalent to the following condition: for all .
Let denote the set of all simple superadditive n-person games.
A power index is a function
that assigns to each simple superadditive game
v a vector
whose
ith component is interpreted as a measure of the influence that player
i can exert on the outcome. The best known power indices are the Shapley–Shubik [
11] index and the Banzhaf [
12] index. Shapley and Shubik interpreted the power of a player as the chance to be critical for the passage of a decision, while Banzhaf interpreted the power of a player as the probability that his vote is decisive assuming that all vote configurations are equally likely.
Example 1. Suppose and a proposal is accepted iff at least three players/voters say ‘yes’; in other words, iff S contains at least three players.
In order to determine the power of say voter/player 1, Banzhaf considers all
coalitions
S of which player 1 is a member and counts all such coalitions in which player 1 is decisive, i.e., if 1 withdraws from the coalition, then it is no longer winning; in other words,
. Below in
Table 1, we provide an overview of all coalitions player 1 belongs to and in which of these coalitions player 1 is decisive, given the game
v:
So, in this example voter 1 is decisive in 3 out of 8 coalitions he belongs to. Therefore, Banzhaf defines the power of voter 1 to be .
More generally, given a simple superadditive
n-person game
, the
Banzhaf power index of player/voter
i is the probability that
i is decisive, i.e., the number of winning coalitions
i belongs to and such that without
i the coalition becomes losing instead of winning, divided by the total number of coalitions
i belongs to. Rendered in a formula:
The
Shapley–Shubik index is based on a different idea. In the case of four voters there are 4! = 24 different ways in which a coalition may be formed. For instance, the order of joining the coalition may be 2 3 1 4 or 3 2 1 4, or 2 4 1 3 or 4 2 1 3, or 3 4 1 2 or 4 3 1 2. The total number of different orders of joining a coalition is 4! = 24 in case of 4 players. In order to determine the power of player 1, Shapley and Shubik look at the number of times that player 1 is critical for the passage of the decision. In our example, player 1 is critical for the passage of the decision if the coalition is formed in one of the six orders just mentioned. Therefore, the Shaply–Shubik power index of player 1, given the game
with
and
iff
S contains at least three players, will be
.
More generally, for a game
the Shapley–Shubik power index of player
i is given by
where
s denotes the size of
S, i.e.,
. In our example, the summation is over the sets
or
,
or
and
or
(
,
). Notice that when
, in other words, when player
i is critical for the pasage of the decision, then
is the number of different orders of the players entering the coalition before
i and
is the number of different orders of the players entering the coalition after
i.
In [
13] Annick Laruelle and Federico Valenciano provide a good characterization of both the Banzhaf and the Shapley–Shubik index in terms of four axioms for each of them, where the first three axioms are the same for both, and the difference is in the fourth axiom:
Anonymity (An): the power of each player in a simple superadditive game does not depend on how the players are labelled. In other words, for all , any permutation of N and any player , .
Null Player* (NP*): being a null player is the worst role any player can expect to play, the role that yields a minimal measure of power. In other words, for all and all , i is a null player in v iff for all , .
Symmetric Gain–Loss (SymGL): the effect of eliminating a minimal winning coalition is the same for any two players belonging to it and for any two players outside it. In other words, for all , for all (), and for all , respectively, , = . Here, denotes the set of minimal winning coalitions and the modified game results from v by deleting just one single coalition S from the set of winning ones.
Let us illustrate this axiom for the Banzhaf and the Shapley–Shubik indices of the game
v from our example above. Consider the minimal winning coalition
, and let
be the game resulting from
v by deleting
from the set of winning ones. In
Table 2 below, we give an overview of all coalitions player 1 belongs to and in which of these coalitions player 1 is decisive, but now given the game
:
So, and . More generally, one easily checks that for all , . However, for player we have .
For the Shapley–Shubik index, notice that given game , player 1 is not critical anymore for the passage of the decision in the sequences 2 3 1 4 and 3 2 1 4 because is no longer winning anymore. At the same time, player 4 is now critical for the passage of the decision in the sequences 1 2 3 4, 1 3 2 4, 2 1 3 4, 2 3 1 4, 3 1 2 4, and 3 2 1 4, as well as for the passage of the decision in the sequences 1 2 4 3, 2 1 4 3, 1 3 4 2, 3 1 4 2, 2 3 4 1, and 3 2 4 1. So, . More generally, for all , . However, for player , .
Both the Shapley–Shubik and the Banzhaf index satisfy the three axioms just mentioned. What distinguishes them is constant total gain–loss balance (CTGLB) and constant average gain–loss balance (CAGLB). Both axioms concern the effect of dropping a minimal winning coalition from the set of winning ones. The constant total, respectively, average gain–loss balance, postulates that the total, respectively, average loss of the players in the deleted coalition equals the total, respectively, average gain of the players outside it.
CTGLB: For all
and all
(
),
For instance, given the games v and above with , the total loss of the players in S equals , while the total gain of the players outside S, i.e, , also equals .
CAGLB: For all
and all
(
),
For instance, given the games
v and
above with
, the average loss of the players in
S, i.e.,
, equals
, while the average gain of the players outside
S, i.e.,
, i.e.,
, also equals
.
In [
13], the authors prove that the Shapley–Shubik index is characterized by (An), (NP*), (SymGL), and (C
TGLB), while the Banzhaf index is characterized by (An), (NP*), (SymGL), and (C
AGLB).
Theorem 4 ([
13]).
Let .(i) Φ satisfies (An), (NP*), (SymGL), and constanttotalgain–loss balance (CTGLB) iff , for some and , where 1:= . (ii) Φ satisfies (An), (NP*), (SymGL), and constantaveragegain–loss balance (CAGLB) iff , for some and .
For the proof, consisting of several pages, see [
13].