1. Introduction
There exists by now extensive literature in applied mathematics and probability theory dealing with
stochastic recurrence equations of the form
where
are pairs of continuous, real-valued, i.i.d. non-negative random variables. While early studies mostly focused on issues of existence and uniqueness of strictly stationary solutions
to (
1) and properties of the underlying limit support, particular emphasis has recently been placed on the characterization of the tail behavior of the marginal distributions of such solutions, a main result being that, under mild regularity conditions on the distribution of inputs
, the tails of the output
are asymptotic to a power law, see, e.g.,
Kesten (
1973),
Goldie (
1991) and
Buraczewski et al. (
2016) for details.
1In this paper, we derive novel order-preserving properties in the context of linear random recurrences, which enable us to obtain pairwise comparison theorems about well-known measures of
conditional size,
tail variability and
skewness across stationary solutions to (
1). Although a full grasp of our results requires introducing the reader to basic notions of stochastic ordering of probability distributions, we can summarize them in a non-technical fashion as follows: consider two alternative input pairs
and
for the linear model (
1), and let
and
be the corresponding stationary solutions to (
1), provided they exist, then:
- 1.
Size preservation—If and/or are smaller (in a well-defined stochastic sense) than A and/or B, respectively, then is itself smaller than and features lower conditional right-tail expectations than ’s.
- 2.
Variability preservation—If
is
less dispersed (in a well-defined stochastic sense) than
A (for the same
B) and such dispersion is preserved between
and
(
Di Pietro and Sorge 2018a), then
features lower
conditional right-tail variances than
’s.
- 3.
Variability and skewness comparison—If and/or are smaller than A and/or B, respectively, and and are both non-negative, then the distribution of cannot be more skewed to the right (in a well-defined stochastic sense) than that of whenever features larger (finite) unconditional variance than ’s.
How structural properties of a given model change as its constituent components vary is a central question in many areas of economics. By offering new insights on conditional first and higher moments of stationary distributions emerging from linear random recurrences (
1), our results are clearly valuable in theoretical explorations of wealth dynamics and within-group inequality, when the recurrence under scrutiny captures the evolution of wealth over time and/or across generations. In doing so, we complement already existing analytical results that solely focus on the determination of Pareto exponents via asymptotic equivalence analysis, e.g.,
Benhabib et al. (
2011). By the same token, our study can easily be exploited to conduct comparative statics exercises in dynamic models featuring Kesten-type reduced-form representations, e.g.,
Acemoglu and Robinson (
2015),
Dave and Sorge (
2020), and it can provide some discipline for simulation-based validation of theoretical mechanisms proposed in the literature to account for observed higher-order properties of macroeconomic and financial time series, e.g.,
Gabaix (
2009).
In order to illustrate the type of research questions that our study can address, we offer two simple economic applications: the analysis of the role of preference parameters (e.g., bequest intensity) in shaping the long-run distribution of wealth in a stylized dynamic economy with intergenerational wealth transfers and uninsurable risk; and the analysis of the impact of labor earnings taxation on asset accumulation and long-run properties of the wealth distribution in an infinite-horizon economy.
Upon surveying the strands of literature our paper relates to (
Section 2), we begin
Section 3 with an overview of the pairwise stochastic orderings between random variables (probability distributions), which paves the way to presenting our main results (
Section 4).
Section 5 lays down two analytically tractable models to frame our comparison theorems in terms of long-run distributional features of wealth and asset accumulation processes. In
Section 6, the findings are confronted with existing theoretical studies on the topic.
Section 7 offers concluding remarks.
2. Literature Review
There are at least two strands of literature our paper speaks to and to which it purports to contribute.
First, linear random recurrences of the form (
1) have been extensively used to model the evolution of wealth distributions in the presence of uninsured investment shocks (idiosyncratic returns on financial wealth, embodied in the multiplicative input
A) and non-diversifiable earnings risk (embodied in the additive input
B). A burgeoning number of structural dynamic models have been developed that provide micro-foundations for the intergenerational (and/or intertemporal) transmission of wealth, in order to shed light on the sources of wealth inequality across families, the underlying patterns of social mobility and the effects of corrective policies on top wealth concentration.
Benhabib et al. (
2011) analyzed the evolution of the wealth distribution in an overlapping generations (OLG) model with a joy-of-giving bequest motive, in which agents face random labor (additive) and capital (multiplicative) income shocks, which can persist over time as a function of a latent Markov state. A main finding of their analysis is that, in the presence of uninsurable risk, the upper tail of the stationary wealth distribution is asymptotic to a Pareto law.
Benhabib et al. (
2015) formally established an analogous
fat upper tail result for the endogenous distribution of wealth in Bewley economies in which ex-post heterogeneous agents solve an infinite horizon consumption-saving problem with borrowing limits and idiosyncratic shocks to the rates of return on wealth.
2 Interpreting (
1) as a wealth accumulation process in an incomplete market framework and using the so-called Lorenz dominance as an indicator of inequality,
Zhu (
2013) shows that a mean-preserving rise in the variability of
A (capital income risk) and/or
B (earnings risk) entails a less equal stationary distribution of the recurrence (
1), provided that it exists. Focusing on the same framework,
Peng (
2018) jointly studied inequality and mobility dynamics by means of the copula approach, which allows for specifying the joint distribution of percentile ranks of different cross-sectional distributions. In particular, advocating the so-called dispersive order to rank probability distributions in terms of intrinsic variability, it was therein established that inequality is affected by shifts in either type of risk, while mobility reacts only to changes in the capital income one.
Di Pietro and Sorge (
2018a) singled out a number of inaccuracies in
Peng (
2018)’s analytical proofs, showing that relatively stronger conditions are needed for the stationary solution of (
1) to inherit the stochastic features of the multiplicative input
A.
In a similar vein,
Di Pietro and Sorge (
2018b) studied the comparative statics of a well-defined class of wealth transition equations when allowing for stochastically ordered shifts in the multiplicative input. They identified an order-contingent monotone property according to which pure increases in risk foster top wealth concentration, whereas random shifts that involve the average unconditional return process rather lower inequality at the upper end of the stationary distribution. Based on a simple OLG model with intergenerational altruism,
Di Pietro and Sorge (
2018b) emphasized the potentially ambiguous effects on top wealth inequality of introducing or modifying capital income tax treatments in the presence of non-diversifiable investment risk.
Our paper is also closely related to the ample mathematical work on linear random recurrences, surveyed in, e.g.,
Buraczewski et al. (
2016), which is generally concerned with: (i) the characterization of existence of stationary solutions and the ensuing distributional properties (limit support, unconditional moments, tails); and (ii) the Markov chain properties of stationary solutions such as irreducibility, a-periodicity, mixing, and absolute continuity of the Markov kernel. A small number of papers, such as
Müller and Stoyan (
2002) and
Zhu (
2013), have built on this literature by exploring the consequences of distributional shifts in either the multiplicative input
A or the additive input
B (or both) on the characteristics of the ensuing stationary distribution for the state variable
X.
Our study complements the findings from both strands of literature by developing novel comparison theorems that rely on conditional measure of size and variability not explored thus far, theorems that can be applied broadly to any economic or financial model entailing Kesten-type reduced-form dynamics.
3. Preliminaries
Consider two continuous, real-valued random variables
Y and
with absolutely continuous cumulative distribution functions (CDFs)
and
, respectively. Let
and
,
, be the corresponding quantile functions, and
the supports.
3 Then
Definition 1. Y is said to be smaller than in the
- (a)
usual stochastic order – written – if and only if - (b)
dispersive order – written – if and only if - (c)
star order – written – if and only ifwhen and almost surely.
The
usual stochastic order is the most common criterion for comparing the location or the magnitude (size) of random variables. It simply states that
Y is less likely than
to take on large values, i.e., all values larger than
z for any arbitrarily chosen
z, see e.g.,
Shaked and Shanthikumar (
2007).
The
dispersive order, among others, can be meaningfully used to compare variability or spread between probability distributions, for it requires that the difference between any two quantiles of
Y be smaller than the corresponding quantiles of
, see e.g.,
Lewis and Thompson (
1981).
Lastly, the
star order is a weakening of the well-known
Van Zwet (
1964)’s convex order, and it has been introduced in the literature to compare the skewness of probability distributions, given its characterization in terms of increasing failure rate on average, see e.g.,
Oja (
1981).
4Consider again the linear recurrence (
1), with non-negative inputs
. Our main object of interest is the stationary solution to such an equation, defined as follows:
Definition 2. An ergodic, stationary and causal solution to the linear recurrence (1) is a process such that: (i) for all t, is a measurable function of ; (ii) is the unique random variable that satisfies , where denotes equality in distribution; and (iii) starting from any initial , converges in distribution to as . 4. Comparison Theorems
Using the stochastic orders defined above, we now present our formal results about pairwise comparisons of stationary solutions to linear random recurrences (
1), as enforced by the stochastic inputs
. In the following,
and
denote the unconditional expectation (mean) and unconditional variance of some real-valued random variable
Y, respectively; and
and
denote the conditional expectation and the conditional variance of
Y with respect to the
-algebra generated by a real-valued random variable
Z (both defined on the same probability space), provided they exist, which we assume throughout the paper. To facilitate reading, the proofs of the theorems are collected in
Appendix A.
Theorem 1 (Size preservation).
Consider two linear recurrenceswhere- i.
, , and are i.i.d. non-negative random variables;
- ii.
and are statistically independent, and are statistically independent;
- iii.
, , , ;
- iv.
and/or for all
Then, there exist stationary solutions under inputs and under inputs , with CDFs and respectively, such thatfor all . Theorem 1 stipulates that the stationary solution to (
1) inherits the size properties of the inputs
. As a main consequence, any conditional expectation in the right tail (i.e., the conditional mean with respect to the sub-
-algebra generated by any quantile of order
) of the stochastically smaller (in the usual order) stationary distribution never exceeds its counterpart enforced by the stochastically larger input
A.
Theorem 2 (Variability preservation).
Consider two linear recurrenceswhere- i.
, and are i.i.d. non-negative random variables;
- ii.
and are statistically independent, and are statistically independent;
- iii.
, , ;
- iv.
for all ;
- v.
has log-concave density.
Then, there exist stationary solutions under inputs and under inputs , with CDFs and , respectively, such that if , it holdsfor all . Unlike the usual stochastic order, the dispersive ordering is not invariant under monotone transformations, nor is it generically closed under a product of non-negative random variables. In particular, the dispersive order does not necessarily imply (or is implied by) the usual stochastic order. Theorem 2 asserts that, when conditions for preservation of the dispersive order are fulfilled, e.g.,
Di Pietro and Sorge (
2018a), then any conditional variance in the right tail (i.e., the conditional variance with respect to the sub-
-algebra generated by any quantile of order
) of the less dispersed stationary distribution is bounded from above by its counterpart enforced by the more dispersed input
A.
Theorem 3 (Skewnees and variability).
Consider two linear recurrenceswhere- i.
, , and are i.i.d. non-negative random variables;
- ii.
and are statistically independent, and are statistically independent;
- iii.
, , , ;
- iv.
and/or for all
Then, there exist stationary solutions under inputs and under inputs such that if and almost surely and their first moments are finite, it holds Theorem 3 concerns the relationship between the stochastic magnitude of the multiplicative input
A, on the one hand, and both the upper tail (conditional) variability and the skewness of the distributions of the stationary solution to (
1), on the other. It states that, in the presence of inputs
A (and/or
B) and
(and/or
) that are ranked by their size, and in the case in which the ensuing stationary solutions are non-negative, then the one entailing larger unconditional variance cannot exhibit stronger positive skew than its analogue’s.
6. Discussion
As stressed in our literature review, extant studies on the topic of stochastic comparisons of probability distributions have established a number of analytical results concerning either
Pareto exponents as an indicator for top wealth inequality, e.g.,
Benhabib et al. (
2011,
2015),
Di Pietro and Sorge (
2018b), or
unconditional measures of spread (dispersion), when framed in the context of dynamic wealth accumulation processes that exhibit ergodic behavior, e.g.,
Zhu (
2013),
Peng (
2018),
Di Pietro and Sorge (
2018a). In the present paper, we formalize stochastic comparisons in terms of
conditional first and second moments as well as positive skew of stationary distributions that are not limited to the asymptotic properties of the counter-cumulative distribution function, for they readily apply to
any given quantile that a researcher might be interested in, e.g., identifying the middle and upper class of a given socio-economic system, or even the whole distribution except for the lowest region of its support.
In terms of studies of the long-run effects of earnings taxation in incomplete market economies, a key result in
Benhabib et al. (
2011) implies that the Pareto exponent—which is inversely related to top wealth concentration—would be invariant with respect to the implementation of such a fiscal policy, under conditions that warrant existence of a power–law approximation of the right tail of the stationary distribution. Our analysis (Proposition 2) complements
Benhabib et al. (
2011)’s result in terms of right conditional expectations, even in cases where the stationary distribution exhibits no fat-tailed behavior, showing that uniform earnings taxation affects expected concentration of wealth in higher (even extreme) quantiles. A corollary of this result is that capital income and earnings taxes produce qualitatively similar effects in terms of conditional tail expectations: the higher the tax rate, the lower the conditional measure of size for the underlying stationary distribution, whatever the fiscal policy in place.
In the same vein, provided that the mild sufficient conditions for the preservation of the dispersive order developed in
Di Pietro and Sorge (
2018a) are met, Theorem 2 can be advocated to claim that the conditional right tail variability of wealth increases with the tax rate on labor earnings; this suggests that top wealth inequality, as measured by the dispersion of wealth holdings in extreme right tail quantiles,
can be affected by the earnings tax regime, while the tail index (if it exists) offers no insights on this (being solely driven by the properties of the multiplicative input
A), as shown in
Benhabib et al. (
2011).