2.1. Naive Form
Numerous early variations of the TEP exist. One formulation is that of [
5], in which two men bet on the value of their respective neckties, with both neckties going to the man with the less valuable item. Another formulation, involving explicit distributions of monetary values printed on both sides of randomly drawn playing cards, is found in [
6] and attributed to the physicist Erwin Schrödinger. In the simplest form of the classical TEP (see [
1]), a risk-neutral decision maker is shown two identical envelopes, one of which contains a check for
X units of money, and the other a check for
units, where
X is an unknown positive real number. The two envelopes then are dropped into a box, from which the decision maker draws one at random. At this point, the decision maker is asked whether he or she would like to keep the chosen envelope, or trade for the other one. Seeing no benefit from trading (because of the obvious symmetry of the alternatives), the decision maker then opens the chosen envelope to find a check for
units. Asked again whether he or she would like to trade envelopes, the decision maker calculates the expected value of the amount in the other envelope,
Z, as
and discovers that it is better to trade, because the two possibilities,
i.e.,
and
, are presumably equally likely. Paradoxically, the decision maker’s behavior is the same for all values of
y—implying no benefit from opening the envelope—but learning the value of
y does in fact change the decision maker’s preference. In short, the decision maker’s failure to prefer
Z to
Y a priori is a violation of dominance reasoning.
This naive TEP is typically resolved by observing that and cannot be equally likely for all positive real numbers y, because that would require Z to possess a (nonexistent) uniform probability distribution over the positive real numbers.
2.2. Enhanced Form
Nalebuff offered a more sophisticated variation of the classical TEP in which the random gain
X is drawn from a distribution function (known to the decision maker) that always implies a positive expected benefit from trading once
has been observed [
1], so that
for all possible values of
y. (Actually, Nalebuff inadvertently claimed only that the expected benefit is nonnegative. His example employs the distribution function in Equation (2) of the present article, with
and
.) A necessary condition for these results is that
Such “paradoxical” distribution functions for
X can be either continuous or discrete, and [
2] offered one example of each:
in the continuous case, and
in the discrete case. In the present work, it will be useful to embed each of the above distributions into a natural, two-parameter generalization. For continuous
X, this is the Pareto II family:
for any
α in the interval
and positive
θ; whereas for discrete
X, it is the log negative binomial family:
for any positive integer
m and
p in the interval
. Note that although not immediately apparent, the distributions in Equations (1) and (2) are close analogs because the Pareto II distribution with
is obtained by exponentiating and shifting an exponential random variable, and the log negative binomial distribution with
is obtained by exponentiating a geometric random variable (
i.e., the discrete analog of the exponential random variable). The characterization of all possible “paradoxical” distributions is beyond the scope of the present work.
Our analysis will focus on the continuous case because it retains the infinitely divisible nature of X from the naive TEP. However, we also will briefly summarize the corresponding results for the discrete case.
2.3. Continuous Payoffs
Relaxing the assumption that the decision maker is risk neutral, let the decision maker’s VNM utility function be some continuous increasing function
defined for all real numbers greater than or equal to the decision maker’s initial wealth,
W. For
X with a Pareto II distribution, it is easy to see that the paradox persists for this generalization if and only if both of the following conditions are satisfied:
and
for all positive
y. In other words, the random variables
Y and
Z are equally desirable unconditionally (
i.e., prior to opening the decision maker’s envelope), but
Z is strictly preferable to
Y conditionally (
i.e., after opening the envelope to reveal
), thereby violating dominance reasoning.
Given that the random variables
Y and
Z are unconditionally independent and identically distributed, Equation (3) is always true. Therefore, to avoid the inconsistency of Equations (3) and (4) (and thus preserve dominance reasoning), we must require that
or equivalently,
for at least one continuous interval of
y. If Equation (5) does not hold for such an interval, then it constitutes an event “of probability zero” that has no practical impact on the decision maker’s behavior. Proposition 1 provides necessary and sufficient restrictions on the utility function to satisfy this inequality, and is proved in the
Appendix (as are all subsequent propositions).
Proposition 1: For any continuous increasing utility function and distribution parameters
α and
θ, dominance reasoning is preserved if and only if the recurrence inequality
holds for at least one interval of positive
y.
From the expression for
employed in the proof of Proposition 1, it is easy to show that this probability is increasing over
y for fixed
α and
θ, from a lower bound of
at
to an upper bound of
as
. Consequently, one can use the lower bound to derive the following sufficient (
i.e., stronger) condition for Equations (5) and (6):
and the upper bound to derive the following necessary (
i.e., weaker) condition:
These last two conditions are more readily interpreted than Equation (6) because their corresponding equalities describe concave-downward curves of the forms
and
respectively, where
and
b are real constants determined by any two points satisfying the indicated equation. Note that these results follow from straightforward applications of recurrence-relation techniques, and are easily confirmed by substituting the right hand sides of Equations (9) and (10) into the equalities associated with Equations (7) and (8), respectively. Thus, if the piecewise-linear functions passing through
,
and
are bounded below locally by the hyperbolas of Equation (9) for all
y in the indicated interval, then Equation (6) must hold. Here, by “bounded below locally,” we mean that if
a and
b are chosen such that
and
, then
. Furthermore, if Equation (6) holds, then the piecewise-linear functions passing through
,
and
must be bounded below locally (like stated above) by the translated power functions of Equation (10) for all
y in the indicated interval.
Although the function in Equation (9) is bounded above by the constant
b, it is important to note that Proposition 1 does not require that the decision maker’s utility function be bounded by a constant, or even that the expected utility be finite. For example, the unbounded piecewise utility function
generates, for
, infinite expected utility because it is linear for values greater than 3, but satisfies Equation (7) on the interval
because it possesses the form Equation (9) for values greater than
and less than or equal to 3. The various constants in the function are chosen to confer a high degree of smoothness (
i.e., continuous differentiability).