1. Introduction
Nozick’s [
1] libertarian theory of justice rests on two pillars:
justice in transfer argues that holdings acquired through voluntary exchange are just, assuming that the parties concerned held legitimate title to the exchanged holdings;
justice in acquisition, meanwhile, argues that the claim to ownership of a previously unowned resource by an individual is just, provided that it leave nobody else worse off. Both principles have been the subject of controversy (e.g., [
2]), but here I focus on justice in acquisition—in particular, its
relevance rather than its validity. This is particularly important for the application of libertarianism, given the practical impossibility of satisfying this principle. If, as argued recently by Piketty [
3], initial asset holdings have enduring distributive effects, then they are of critical importance in both the theory of justice and the practice of egalitarianism. If, by contrast, the effects of unjust acquisition vanish over time, it offers little cause for concern in implementing a libertarian theory of justice.
In this note, I provide conditions under which justice in acquisition is irrelevant in this way. In particular, I model the evolution of property rights as a stochastic process on the space of shares of society’s wealth. Such a process may or may not satisfy the property of ergodicity, under which every path of the process is representative of the whole; or, in other words, the initial conditions are irrelevant to “long-run” behaviour. By providing conditions under which the evolution of the societal division of wealth is ergodic, I show when unjust acquisition becomes irrelevant.
Of course, the significance of this enterprise is determined by the strength of my conditions. I make three main assumptions, each of which I argue to be “weak”. First, I assume that the stochastic process governing wealth shares is a Markov chain. This essentially involves assuming that the division of wealth in the next period is (probabilistically) dependent on the current division of wealth, but not on divisions in previous periods. For this to be appealing, we must simply take a sufficiently long period length; presumably a generation would be adequate, and we can henceforth think in these terms. Second, for any given current division of wealth, there is a positive probability that it change in any “direction” in the next period; i.e., there is some chance that any given individual will be slightly better (or worse) off in the next period. This chance could be very small, but it must be positive.
1 Third, small changes in the current division of wealth should not unduly affect its evolution in the next period.
Under these conditions, the long-run division of wealth is independent of its starting point, and unjust acquisition thus becomes irrelevant over time.
2 However, the question then arises of how much time is required to reach—or at least come close to—this “long-run” division. If, for example, we require more periods than there are atoms in the universe, then we may not think the result so interesting. Rates of convergence are, unfortunately, difficult to determine in general. I can establish that ergodic behaviour is approached at a geometric rate, but this rate could still be very slow; between this observation and the lengthened period required for the Markov assumption, unjust acquisition may remain relevant for a very long time. Nonetheless, geometric ergodicity establishes that the relevance of unjust acquisition diminishes period-by-period. Moreover, for any given approximating neighborhood of the ergodic distribution of wealth, there exists a finite length of time after which the process will always belong to that neighborhood.
But with finitely lived agents then, does the long-run nature of the analysis doom its own conclusions to irrelevance, particularly with a generation as the period length? If I take all of your money now, for instance, the opportunity many generations hence for your descendants to rob my descendants would be a weak argument for the justice of your current poverty. This is not, however, the domain of justice in acquisition, but rather of Nozick’s other libertarian pillar, justice in transfer. For my forced appropriation of your assets is a violation of voluntary exchange, rather than an unjust acquisition of a previously unowned resource. Put differently, the ergodic distribution to which society’s division of wealth tends may well be unjust, but this is a failure of the mechanisms of exchange that determine that distribution (i.e., injustice in transfer), rather than the initial societal division of wealth (i.e., injustice in acquisition). It is with conditions for the irrelevance of the latter that I am concerned here.
There is of course a developed literature endogenising the distribution of wealth in a variety of economic models, some of which produce ergodic distributions as their outcome [
4,
5,
6,
7,
8]. Where these models are rich in their analysis of the details of societal wealth evolution, my model is deliberately sparse, and seeks to provide a simple set of sufficient conditions for ergodicity that are satisfied by these papers. I discuss them, along with other phenomena that can and cannot fit within my framework, in the final section.
2. The Evolution of the Division of Wealth
Consider a population of
N individuals engaged in voluntary exchange through infinite discrete time
. Individual
i has a
wealth share in period
t of
, with
describing the
state of the process at time
t, belonging to the
state space of possible divisions of wealth.
3Assumption 1 (Markov).
The path of over time is governed by a time-homogeneous Markov chain , taking values in X, and constructed from a set of transition probabilities , where is the Borel σ-field on X, is a non-negative measurable function on X for each , and is a probability measure on .
As mentioned in the Introduction, this assumption can be made appealing by taking a sufficiently long period length. The next assumption, meanwhile, is the driving force of the analysis.
Assumption 2 (Local mobility).
For every state , there exists a neighbourhood such that for all .
Thus, the division of wealth may, at any point, move in any “direction”, i.e., there is some chance that any given individual will be slightly better (or worse) off in the next period.
A function
h from
X to
is called
lower semicontinuous if
If
is a lower semicontinuous function for any open set
, then
Φ is called a
(weak) Feller chain.
Assumption 3 (Downward smoothness).
Φ is a weak Feller chain.
Intuitively, the Feller property requires that, if we change the current division of wealth slightly, the chance of next period’s division of wealth shifting in a given way either increases or changes only slightly (in other words, this chance cannot jump dramatically downwards). This more technical assumption is harder to interpret intuitively, but it seems reasonable that small changes in the current division of wealth should not unduly affect its evolution in the next period.
3. Ergodicity
If
μ is a signed measure
4 on
, then the
total variation norm is
For the present purposes, the key limit of interest to us is of the form
where
π is an
invariant measure of the process, i.e., a
σ-finite measure on
with the property
If this sort of limit holds, then the long-run behavior of the process is described by the invariant measure
π, independent of the initial measure from which the process starts. In particular if, for any initial measure
λ,
then the process is said to be
ergodic.
To get to this point, I will require some additional apparatus.
5 Φ is called
ϕ-irreducible if there exists a measure
ϕ on
such that, for all
, whenever
, there exists some
, possibly depending on both
A and
x, such that
. It is called
ψ-irreducible if it is
ϕ-irreducible for some
ϕ and the measure
ψ is a “maximal irreducibility measure”, guaranteed to exist by Meyn and Tweedie’s [
9] Proposition 4.2.2. Letting
, if
Φ is
ψ-irreducible and every set in
is expected to be visited by
Φ infinitely often irrespective of the initial state, i.e.,
,
,
, then
Φ is called
recurrent. If it is
ψ-irreducible and the probability that every set in
is visited by
Φ infinitely often is 1 irrespective of the initial state, then
Φ is called
Harris recurrent. If it is
ψ-irreducible and admits an invariant measure
π, then
Φ is called a
positive chain. Finally, a set
is called
-
small if there exists an
and a non-trivial measure
on
such that, for all
and all
,
Proposition 4. Φ is ergodic.
Proof. Under Assumption 2, the process is
ϕ-irreducible for any
ϕ, and hence is trivially
ψ-irreducible for any
ψ with full support, i.e., any
ψ such that
,
. Hence,
, and the recurrence of
Φ follows trivially from
ψ-irreducibility. Since there are no
ψ-null, transient sets, it follows from Meyn and Tweedie’s [
9] Theorem 9.0.1 that
Φ is Harris recurrent. Moreover, it follows from their Theorem 10.4.4 that
Φ has a unique invariant measure
π, and is hence a positive chain.
Now, suppose that
is
-small for some
, with
, and let
By lower semicontinuity of , there exists such that for any , and hence C is also -small, with for some , , by Meyn and Tweedie’s Proposition 5.2.4(i). Thus, the greatest common divisor of the set is 1, i.e., the process is aperiodic. The result then follows by Meyn and Tweedie’s Theorem 13.3.3. ☐
Thus, the state of the process is independent of the initial distribution λ after a sufficiently long period of time has passed.
But how long is “a sufficiently long period of time”? This question is difficult to address without significantly stronger assumptions, but a little more can be said in general, once I have introduced some final apparatus. A set
is called
-petite if it satisfies the bound
for all
,
, where
is a non-trivial measure on
and
is a probability measure on
. Clearly every small set is petite. Lastly, if
Φ is positive Harris and there exists a constant
such that
then
Φ is called
geometrically ergodic.
Proposition 5. Φ is geometrically ergodic.
Proof. Since
Φ is
ψ-irreducible with the Feller property, and
ψ has full support on
X,
X is petite by Meyn and Tweedie’s [
9] Proposition 6.2.8. Condition (iii) of their Theorem 15.0.1 is then trivially satisfied for
and any
. This implies that there exist constants
,
such that for any
establishing the result. ☐
Thus converges to π at a geometric rate.