A Statistical Characterization of Median-Based Inequality Measures

Beach, Charles M.; Davidson, Russell

doi:10.3390/econometrics13030031

Open AccessArticle

A Statistical Characterization of Median-Based Inequality Measures

by

Charles M. Beach

^1,† and

Russell Davidson

^2,3,*,†

¹

Department of Economics, Queen’s University, Kingston, ON K7L 3N6, Canada

²

Department of Economics and CIREQ, McGill University, Montréal, QC H3A 2T7, Canada

³

AMSE-GREQAM, 5 Boulevard Maurice Bourdet, CS 50498, 13205 Marseille cedex 01, France

^*

Author to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Econometrics 2025, 13(3), 31; https://doi.org/10.3390/econometrics13030031

Submission received: 28 May 2025 / Revised: 23 July 2025 / Accepted: 30 July 2025 / Published: 9 August 2025

Download Review Reports Versions Notes

Abstract

For income distributions divided into middle, lower, and higher regions based on scalar median cut-offs, this paper establishes the asymptotic distribution properties—including explicit empirically applicable variance formulas and hence standard errors—of sample estimates of the proportion of the population within the group, their share of total income, and the groups’ mean incomes. It then applies these results for relative mean income ratios, various polarization measures, and decile-mean income ratios. Since the derived formulas are not distribution-free, the study advises using a density estimation technique proposed by Comte and Genon-Catalot. A shrinking middle-income group with declining relative incomes and marked upper-tail polarization among men’s incomes are all found to be highly statistically significant.

Keywords:

median-based inequality; income shares; population shares; income polarization

JEL Classification:

C10; C42; D31

1. Introduction

Two major distributional changes have characterized many developed economies since around 1980: declining middle-class incomes and rising top incomes Hoffman et al. (2020); Blanchet et al. (2022); and Guvenen et al. (2022). For example, in the case of full-time full-year workers in Canada between 1970 and 2005, the proportion of workers who received middle-class earnings fell by 11.5 percentage points among men (from 74.3 to 62.8 percent) and by 13.4 percentage points among women (from 76.5 to 63.1 percent), while the proportion of higher earners rose by 3.4 percentage points for men and 4.9 percentage points for women, and the proportion of lower-earning workers went up by 5.1 and 5.7 points, respectively. Over the same period, the corresponding shares of total earnings received by middle-class earners fell by 16.9 points for men and 17.8 points for women, while the earnings shares of higher earners rose by over 13 percentage points for both men (18.5 to 32.0 percent) and women (11.4 to 25.0 percent) (Beach, 2016, Tables 1 and 6). It would clearly be useful to be able to capture both of these sets of changes efficiently in a simple empirical framework that allows for a conventional statistical inference methodology, so one can test for the statistical significance of such changes over time.

The distributional measures that are typically used to examine these patterns of distributional change are the income shares of middle- and upper-income groups, the relative sizes of these groups, and the relative incomes of these groups. In examining these changes, Beach (2016) demonstrated the usefulness of characterizing the income groups in terms of their relationship to the median income level. So, for example, the middle-income group (M) could be defined as including those with incomes between, say, fifty percent and two hundred percent of the median, the upper group (H) as those with incomes above twice the median, and the lower group (L) as those with incomes below half the median. This allows one to obtain separate estimates for group income shares (

I S_{i}

,

i = L, M, H

) and for the proportion of recipients within the group (or population share)

P S_{i}

, as well as for the group mean incomes (

μ_{i}

). This distributional framework allows a more insightful interpretation of distributional change, since one can then analyze both the size (

P S_{i}

) and the relative prosperity (

μ_{i} / μ

) of the income group separately. (Percentile- or quantile-based measures, by construction, assign the size of the income groups as a prespecified percentage such as the top decile or 10% of all income recipients.) Characterizing group size and prosperity allows one to capture the quantity dimension of a change in the group’s total income separately from the income per recipient. This in turn can be used to help identify the relative strength of demand-side or supply-side driving factors behind observed distributional change Katz and Murphy (1992). Such insights, though, have heretofore been based on the relative magnitude of these effects, not on their statistical significance. This framework also allows for a richer and more extensive set of measures of income polarization, in terms of both quantity and relative income dimensions at the tails of the distribution.

Any summary or scalar inequality index (such as the Gini coefficient) does not capture the complex mix of distributional changes that have been occurring and does not allow one to identify where the major changes are occurring (and hence possible appropriate policy concerns). A three-way (or more detailed) distributional characterization of these income inequality changes is required.

Davidson (2018) provided an empirical approach to calculate asymptotic variances and covariances for sample estimates of

I S_{M}

and

P S_{M}

for middle-group income recipients within the median-based empirical framework, thus enabling formal statistical inference on these measures. The present paper extends Davidson’s statistical analysis to apply to lower- and upper-income groups as well (all defined in terms of the median), so that one can examine a full set of population subsets covering an income distribution (i.e., for L, M, and H subsets) jointly. The analysis shows how this approach leads to explicit formulas for asymptotic variances and standard errors, which can be easily programmed, for

{\hat{I S}}_{i}

and

{\hat{P S}}_{i}

, for all of

i = L, M, H

income groups. The paper extends the set of distributional measures to a relative mean income statistic

{\hat{μ}}_{i} / \hat{μ}

, where

μ_{i}

is the mean of group i incomes, and

μ

is the overall population mean, and also to

{\hat{μ}}_{i}

itself, so that one can test for the statistical significance of growing income gaps between income groups.

The paper thus proposes a general framework for median-based income inequality analysis, based on asymptotic statistical inference. The derived formulas for variances and covariances of the various statistics are directly empirically applicable to available public microdata files such as those commonly used by research and public policy analysts. The present study serves as a complement to a separate piece by the authors (Beach & Davidson, 2025) that developed a comparable framework for inequality measures, based on quantile income shares as typically published by government statistical agencies. Together, the two papers provide the basis for a toolbox set of calculations that can be readily implemented to allow standard statistical inference for frequently used statistics of disaggregated income inequality change.

The paper first outlines the stochastic quantile function approach to statistical inference. It then extends Davidson’s (2018) middle-class group results for estimated income shares and population shares to corresponding lower- and upper-income groups as well and expresses the asymptotic variance results in terms of simple explicit formulas that can be estimated from available microdata. The extension of these results to group mean income measures is also presented. In Section 3, the results in Section 2 are used to obtain results for relative group mean incomes, measures of polarization, and mean–decile distribution functions. Section 4 provides an empirical application of the Section 2 theoretical results to Canadian Census earnings data. Section 5 summarizes the main results of the paper and notes some implications.

2. Basic Asymptotic Analysis

Let F be the population distribution of income recipients, and let Y denote a random variable of which the cumulative distribution function (CDF) is F. We make the following somewhat restrictive assumption:

Assumption 1.

The CDF F is differentiable and strictly increasing on its compact support.

The assumption is made for convenience and in order to simplify the asymptotic analysis. If it is not satisfied, various asymptotically negligible terms appear in the estimators of group population and income shares, which complicate the analysis.

2.1. Population Shares

Let m denote the median of the distribution F. Then, the population share of those recipients with income no greater than

b m

for some

b > 0

is

F (b m)

. If we have a random sample from the population of size N, we can estimate the distribution by the empirical distribution function (EDF)

\hat{F}

, defined as follows:

\hat{F} (y) = \frac{1}{N} \sum_{i = 1}^{N} I (y_{i} \leq y),

where the

y_{i}

,

i = 1, \dots, N

, are the observed incomes in the sample, and I is the indicator function, with value 1 if its argument is true and 0 if it is false. The sample median

\hat{m}

is defined as usual:

\hat{m} = \{\begin{matrix} y_{(n + 1)} & if N = 2 n + 1 (N odd) \\ (y_{(n)} + y_{(n + 1)}) / 2 & if N = 2 n (N even) . \end{matrix}

The natural estimate of the population share is

\hat{F} (b \hat{m})

. We have

\begin{matrix} \hat{F} (b \hat{m}) - F (b m) & = \int_{0}^{b \hat{m}} d \hat{F} (y) - \int_{0}^{b m} d F (y) \\ = \int_{0}^{b m} d (\hat{F} - F) (y) + \int_{b m}^{b \hat{m}} d F (y) + \int_{b m}^{b \hat{m}} d (\hat{F} - F) (y) . \end{matrix}

(1)

Under our Assumption 1 and also under less restrictive but still conventional regularity conditions, the first two terms above are of order

N^{- 1 / 2}

, while the last, being of order

N^{- 1}

, can be ignored for asymptotic analysis. Then, to leading order, we see that

N^{1 / 2} (\hat{F} (b \hat{m}) - F (b m)) = N^{- 1 / 2} \sum_{i = 1}^{N} (I (y_{i} \leq b m) - F (b m)) + b f (b m) (\hat{m} - m),

where

f (y) = F^{'} (y)

is the population density function. According to the Bahadur (1966) representation of quantiles,

\hat{m} - m = - \frac{1}{N f (m)} \sum_{i = 1}^{N} [I (y_{i} \leq m) - \frac{1}{2}] + O (N^{- 3 / 4} {(log N)}^{3 / 4}),

(2)

and so

\begin{matrix} N^{1 / 2} (\hat{F} (b \hat{m}) - F (b m)) = \\ N^{- 1 / 2} \sum_{i = 1}^{N} [I (y_{i} \leq b m) - F (b m) - \frac{b f (b m)}{f (m)} [I (y_{i} \leq m) - \frac{1}{2}]] + o_{p} (1) . \end{matrix}

(3)

Let B be equal to

b f (b m) / f (m)

and consider the random variable

U (b)

defined as follows:

U (b) \equiv I (Y \leq b m) - B I (Y \leq m),

(4)

where Y is a variable that has the distribution F. Then, clearly

E (U (b)) = F (b m) - B / 2 .

(5)

The terms in the sum in (3) can be seen to be IID realizations of the random variable

U (b) - E (U (b))

, and so it follows that

N^{1 / 2} (\hat{F} (b \hat{m}) - F (b m))

is asymptotically equal in distribution to

U (b) - E (U (b))

. Asymptotic normality follows from the central-limit theorem. The variance of the limiting distribution, which, following standard terminology, we refer to as the asymptotic variance of

\hat{F} (b \hat{m})

, is then just

Var (U (b))

. In order to estimate this variance, let

{\hat{u}}_{i} (b) = I (y_{i} \leq b \hat{m}) - \hat{B} I (y_{i} \leq \hat{m}), i = 1, \dots, N,

(6)

with

\hat{B} = b \hat{f} (b \hat{m}) / \hat{f} (\hat{m})

, using appropriate estimates

\hat{f}

of the density. Then, to leading order,

N^{1 / 2} (\hat{F} (b \hat{m}) - F (b m)) = N^{- 1 / 2} \sum_{i = 1}^{N} [{\hat{u}}_{i} (b) - \hat{E} (U (b))],

with, from (5),

\hat{E} (U (b)) = \hat{F} (b \hat{m}) - \hat{B} / 2 .

Var (U (b))

can be estimated by the sample variance of the

{\hat{u}}_{i} (b)

.

A possibly better approach is simply to compute

Var (U (b))

directly and then estimate the result. It is easy to see from (4) that

U^{2} (b) = I (Y \leq b m) + B^{2} I (Y \leq m) - 2 B I (Y \leq min (m, b m)),

whence

E (U^{2} (b)) = F (b m) + \frac{1}{2} B^{2} - 2 B min (F (b m), \frac{1}{2}) .

(7)

Next,

Var (U (b)) = E (U^{2} (b)) - {(E (U))}^{2}

, and so from (5), for

b < 1

, we have

Var (U (b)) = F (b m) (1 - F (b m)) + \frac{1}{4} B^{2} - B F (b m) .

(8)

We see that

Var (U (b))

can be estimated in a distribution-free manner by

\hat{Var} (U (b)) = \hat{F} (b \hat{m}) (1 - \hat{F} (b \hat{m})) + \frac{1}{4} {\hat{B}}^{2} - \hat{B} \hat{F} (b \hat{m}) .

Let

a > b

, and make the definitions

U (a) = I (Y \leq a m) - A I (Y \leq m); A = a f (a m) / f (m),

(9)

and, for

a > 1

Var (U (a)) = F (a m) (1 - F (a m)) + \frac{1}{4} A^{2} - A (1 - F (a m)) .

(10)

Then,

N^{1 / 2} (\hat{F} (a \hat{m}) - F (a m))

is asymptotically equal in distribution to

U (a) - E (U (a))

.

Some comments are in order concerning the “appropriate” estimates

\hat{f} (b \hat{m})

amd

\hat{f} (\hat{m})

. In Appendix B, we sketch an alternative to conventional kernel density estimation that works much better with distributions that have support only on the positive real line or a subset of it. Here, we follow the work of Comte and Genon-Catalot (2012).

The analysis so far developed is sufficient for estimating and providing standard errors for the population share with income less than

b m

or greater than

b m

. However, in order to estimate the population share of recipients with income in some interval

b m < y \leq a m

,

a, b > 0

,

b < a

, as in Davidson (2018), one needs not only the variances of

\hat{F} (a \hat{m})

and

\hat{F} (b \hat{m})

but also their covariance. The asymptotic covariance of

N^{1 / 2} (\hat{F} (a \hat{m}) - F (a m))

and

N^{1 / 2} (\hat{F} (b \hat{m}) - F (b m))

is the covariance of

U (a)

and

U (b)

.

Make the definitions

m_{a} = min (a m, m)

and

m_{b} = min (b m, m)

. Then,

\begin{matrix} U (a) U (b) & = (I (Y \leq a m) - A I (Y \leq m)) (I (Y \leq b m) - B I (Y \leq m)) \\ = I (Y \leq b m) - A I (Y \leq m_{b}) - B I (Y \leq m_{a}) + A B (Y \leq m), \end{matrix}

whence

E (U (a) U (b)) = F (b m) - A F (m_{b}) - B F (m_{a}) + \frac{1}{2} A B,

(11)

whereas

\begin{matrix} E (U (a)) E (U (b)) & = (F (a m) - \frac{1}{2} A) (F (b m) - \frac{1}{2} B) \\ = F (b m) F (a m) - \frac{1}{2} (B F (a m) + A F (b m)) + \frac{1}{4} A B . \end{matrix}

From this, we see immediately that

\begin{matrix} cov (U (a), U (b)) = E (U (a) U (b)) - E (U (a)) E (U (b)) \\ = F (b m) (1 - F (a m)) - A (F (m_{b}) - \frac{1}{2} F (b m)) - B (F (m_{a}) - \frac{1}{2} F (a m)) + \frac{1}{4} A B, \end{matrix}

(12)

and this can be estimated in a distribution-free manner.

Although the results of this section so far are quite general, for most of the rest of the paper, interest will be focused on the case with

b < 1 < a

. The share of the population with income not exceeding

b m

, that is,

F (b m)

, will be denoted by

P S_{L}

, where ‘L’ stands for the group of lower-income recipients. The population share of the middle-income group is

F (a m) - F (b m)

; it is denoted by

P S_{M}

. The share of the higher-income group,

1 - F (a m)

, is denoted by

P S_{H}

.

It is clear from (8) that

Asy var ({\hat{P S}}_{L}) = Var (U (b)) = P S_{L} (1 - P S_{L}) + B^{2} / 4 - B P S_{L},

(13)

and from (10) that

Asy var ({\hat{P S}}_{H}) = Var (U (a)) = P S_{H} (1 - P S_{H}) + A^{2} / 4 - A P S_{H} .

(14)

Note that the terms on the right-hand sides of these equations have simple intuitive interpretations. The first (product) term corresponds to the variance of random recipients lying within the respective population share, the second (squares) term corresponds to the variance of the estimated median-based cut-off points, and the last term corresponds to the covariance or interaction between the first two components.

The population share of recipients of incomes between

b m

and

a m

is

P S_{M} = F (a m) - F (b m)

, and the limiting variance of

N^{1 / 2} (\hat{P} S_{M} - P S_{M})

is equal to

Var (U (a) - U (b)) = Var (U (a)) + Var (U (b)) - 2 cov (U (a), U (b))

. The covariance (12) can now be rewritten as

cov (U (a), U (b)) = P S_{L} P S_{H} - A P S_{L} / 2 - B P S_{H} / 2 + A B / 4,

(15)

and so the asymptotic variance of

{\hat{P S}}_{M}

, after a little algebra based on (13), (14), and (15), can be seen to be

P S_{M} (1 - P S_{M}) + \frac{1}{4} {(A - B)}^{2} - (A - B) (P S_{H} - P S_{L}) .

(16)

The same expression results from calculating

E ({(U (a) - U (b))}^{2}) - {(E (U (a) - U (b)))}^{2}

directly. Let

C = A - B

. Then, (16) can also be written as

Asy var ({\hat{P S}}_{M}) = P S_{M} (1 - P S_{M}) + C^{2} / 4 - C (P S_{H} - P S_{L}) .

(17)

2.2. Income Shares

We begin by considering the income share of recipients of incomes no greater than

b m

, with

b < 1

. The average income earned by these recipients is

n (b m)

, defined as follows:

n (b m) = \int_{0}^{b m} y d F (y), estimated by \hat{n} (b \hat{m}) = \int_{0}^{b \hat{m}} y d \hat{F} (y),

(18)

and the income share is

n (b m) / μ

, where

μ \equiv \int_{0}^{\infty} y d F (y)

is the mean income of the population, estimated by

\int_{0}^{\infty} y d \hat{F} (y)

. Note that

μ = n (\infty)

and

\hat{μ} = \hat{n} (\infty)

. With

b < 1

, we denote

n (b m)

and

\hat{n} (b \hat{m})

by

n_{L}

and

{\hat{n}}_{L}

respectively, and we denote the income share of the lower-income group by

I S_{L}

. Clearly

I S_{L} = n_{L} / μ

.

For incomes greater than

a m

, with

a > 1

, the average income is

μ - n_{H}

with

n_{H} = n (a m)

defined just as in (18), replacing b by a. The income share is

I S_{H} = (μ - n_{H}) / μ = 1 - n_{H} / μ

. For the middle-income group, the average income is

n_{H} - n_{L}

, and the income share is

I S_{M} = (n_{H} - n_{L}) / μ = 1 - I S_{H} - I S_{L}

.

By analogy with (1) for population shares, we have

\begin{matrix} {\hat{n}}_{L} - n_{L} & = \int_{0}^{b \hat{m}} y d \hat{F} (y) - \int_{0}^{b m} y d F (y) \\ = \int_{0}^{b m} y d (\hat{F} - F) (y) + \int_{b m}^{b \hat{m}} y d F (y) + \int_{b m}^{b \hat{m}} y d (\hat{F} - F) (y), \end{matrix}

where the third term can be ignored asymptotically. With a random sample of size N, as in the preceding subsection, the first term is exactly equal to

N^{- 1} \sum_{i = 1}^{N} [y_{i} I (y_{i} \leq b m) - n (b m)],

and the second term can be approximated to leading order by

b (\hat{m} - m) b m f (b m)

, and, by (2), that approximation is to leading order equal to

- N^{- 1} \sum_{i = 1}^{N} \frac{b^{2} m f (b m)}{f (m)} (I (y_{i} \leq m) - \frac{1}{2}) .

This leads to

N^{1 / 2} ({\hat{n}}_{L} - n_{L}) = N^{- 1 / 2} \sum_{i = 1}^{N} [y_{i} I (y_{i} \leq b m) - b m B I (y_{i} \leq m) - n (b m) + \frac{1}{2} b m B] + o_{p} (1) .

(19)

Next, we define the random variable

U_{1} (b)

as

U_{1} (b) = Y I (Y \leq b m) - b m B I (Y \leq m),

(20)

noting that

E (U_{1} (b)) = n_{L} - b m B / 2

. It follows now that

N^{1 / 2} ({\hat{n}}_{L} - n_{L})

is asymptotically equal in distribution to

U_{1} (b) - E (U_{1} (b))

.

Similarly, for

a > 1

, we can define

U_{1} (a) = Y I (Y \leq a m) - a m A I (Y \leq m),

where

E (U_{1} (a)) = n (a m) - a m A / 2 = n_{H} - a m A / 2

, and

N^{1 / 2} ({\hat{n}}_{H} - n_{H})

is asymptotically equal in distribution to

U_{1} (a) - E (U_{1} (a))

.

For the variance of

U_{1} (b)

, we compute as follows:

U_{1}^{2} (b) = Y^{2} I (Y \leq b m) + {(b m B)}^{2} I (Y \leq m) - 2 b m B Y I (Y \leq b m)),

(21)

so that

E (U_{1}^{2} (b)) = n_{2, L} + \frac{1}{2} {(b m B)}^{2} - 2 b m B n_{L},

(22)

where we define

n_{2, L} = \int_{0}^{b m} y^{2} d F (y)

. It follows that

Var (U_{1} (b)) = E (U_{1}^{2} (b)) - {(E (U_{1} (b)))}^{2} = n_{2, L} - n_{L}^{2} + \frac{1}{4} {(b m B)}^{2} - b m B n_{L} .

In the same way, we find that

Var (U_{1} (a)) = E (U_{1}^{2} (a)) - {(E (U_{1} (a)))}^{2} = n_{2, H} - n_{H}^{2} + \frac{1}{4} {(a m A)}^{2} + a m A (n_{H} - 2 n_{med}),

where

n_{2, H} = \int_{0}^{a m} y^{2} d F (y)

and

n_{med} = n (m) = \int_{0}^{m} y d F (y)

. Everything here can be straightforwardly estimated in a distribution-free manner.

Alternatively, by setting

{\hat{u}}_{1 i} (b) = y_{i} I (y_{i} \leq b \hat{m}) - b \hat{m} \hat{B} I (y_{i} \leq \hat{m}), and {\hat{u}}_{1 i} (a) = y_{i} I (y_{i} \leq a \hat{m}) - a \hat{m} \hat{A} I (y_{i} \leq \hat{m})

(23)

for

i = 1, \dots, N

, the variance of

U_{1} (b)

and that of

U_{1} (a)

can be estimated by the sample variances of the

{\hat{u}}_{1 i} (b)

and the

{\hat{u}}_{1 i} (a)

, respectively.

The income share of the low-income group is

I S_{L} = n_{L} / μ

, and this income share can be estimated by

{\hat{I S}}_{L} = {\hat{n}}_{L} / \hat{μ}

. We have

\begin{matrix} N^{1 / 2} ({\hat{I S}}_{L} & - I S_{L}) = N^{1 / 2} [\frac{{\hat{n}}_{L}}{\hat{μ}} - \frac{n_{L}}{μ}] = N^{1 / 2} \frac{(μ {\hat{n}}_{L} - \hat{μ} n_{L})}{μ \hat{μ}} \\ = \frac{1}{μ \hat{μ}} [μ N^{1 / 2} ({\hat{n}}_{L} - n_{L}) - n_{L} N^{1 / 2} (\hat{μ} - μ)] . \end{matrix}

(24)

Now since

\hat{μ} = μ + O_{p} (n^{- 1 / 2})

, for the purposes of our asymptotic analysis, we can replace the denominator

μ \hat{μ}

by

μ^{2}

. Given (19) and the definition (20) of the random variable

U_{1} (b)

, and the fact that

N^{1 / 2} (\hat{μ} - μ) = N^{- 1 / 2} \sum_{i = 1}^{N} (y_{i} - μ)

, we are led to define the random variable

W (b) = U_{1} (b) / μ - n_{L} Y / μ^{2}

and to conclude that (24) is asymptotically equal in distribution to

W (b) - E (W (b))

. First, note that

E (W (b)) = n_{L} / μ - \frac{1}{2} b m B / μ - n_{L} / μ = - \frac{1}{2} b m B / μ .

(25)

For the variance, we have

W^{2} (b) = U_{1}^{2} (b) / μ^{2} + n_{L}^{2} Y^{2} / μ^{4} - 2 n_{L} U_{1} (b) Y / μ^{3} .

Now,

\begin{matrix} U_{1} (b) Y & = Y^{2} I (Y \leq b m) - b m B Y I (Y \leq m) so that \\ E (U_{1} (b) Y) & = n_{2, L} - b m B n_{med} . \end{matrix}

(26)

Then, from (22) and (26), we see that the asymptotic variance of

{\hat{I S}}_{L}

is

\begin{matrix} Var (W (b)) & = E (W^{2} (b)) - {(E (W (b)))}^{2} = \frac{1}{μ^{2}} [n_{2, L} + \frac{1}{4} {(b m B)}^{2} - 2 b m B n_{L}] \\ + \frac{1}{μ^{4}} n_{L}^{2} μ_{2} - \frac{2 n_{L}}{μ^{3}} [n_{2, L} - b m B n_{med}], \end{matrix}

(27)

where

μ_{2} = \int_{0}^{\infty} y^{2} d F (y)

. Note that the term involving

B^{2}

corresponds to the variability of the estimated median-based cut-off point about its true population cut-off value. The terms without any B in them correspond to the variability of random recipients lying within the true median-based cut-off range. Terms involving only B then correspond to the covariance or interaction between the first two components.

Since the income share of the high-income group is

I S_{H} = 1 - n_{H} / μ

, similarly to (24), we see that

N^{1 / 2} ({\hat{I S}}_{H} - I S_{H}) = - \frac{1}{μ^{2}} [μ N^{1 / 2} ({\hat{n}}_{H} - n_{H}) - n_{H} N^{1 / 2} (\hat{μ} - μ)] + o_{p} (1) .

Make the definition

W (a) = U_{1} (a) / μ - n_{H} Y / μ^{2}

; the asymptotic variance of

{\hat{I S}}_{H}

is then

Var (W (a))

, and after some algebra, we see that this is

Var (W (a)) = \frac{1}{μ^{2}} [n_{2, H} + \frac{1}{4} {(a m A)}^{2} - 2 a m A n_{med}] + \frac{1}{μ^{4}} n_{H}^{2} μ_{2} - \frac{2 n_{H}}{μ^{3}} [n_{2, H} - a m A n_{med}] .

(28)

For the middle-income group, we have

I S_{M} = (n_{H} - n_{L}) / μ

, and so, again similarly to (24), we find that

N^{1 / 2} ({\hat{I S}}_{M} - I S_{M}) = \frac{1}{μ^{2}} [μ N^{1 / 2} ({\hat{n}}_{H} - {\hat{n}}_{L} - n_{H} + n_{L}) - (n_{H} - n_{L}) N^{1 / 2} (\hat{μ} - μ)] + o_{p} (1) .

Define the random variable

W (a, b) = U_{1} (a) - U_{1} (b) - I S_{M} Y .

It is easy to check that the asymptotic variance of

{\hat{I S}}_{M}

is

Var (W (a, b) / μ)

. First,

E (W (a, b)) = n_{H} - n_{L} - m (a A - b B) / 2 - I S_{M} μ = - m (a A - b B) / 2 .

Then,

W^{2} (a, b) = {(U_{1} (a) - U_{1} (b))}^{2} + I S_{M}^{2} Y^{2} - 2 I S_{M} Y (U_{1} (a) - U_{1} (b)) .

(29)

Since

U_{1} (a) - U_{1} (b) = Y I (b m < Y \leq a m) - m (a A - b B) I (Y \leq m),

it follows that

E (U_{1} (a) - U_{1} (b)) = n_{H} - n_{L} - m (a A - b B) / 2,

(30)

and since

\begin{matrix} {(U_{1} (a) - U_{1} (b))}^{2} = Y^{2} I (b m < Y \leq a m) + m^{2} {(a A - b B)}^{2} I (Y \leq m) \\ - 2 m (a A - b B) Y I (b m < y \leq m), \end{matrix}

it follows that

E [{(U_{1} (a) - U_{1} (b))}^{2}] = n_{2, H} - n_{2, L} + m^{2} {(a A - b B)}^{2} / 2 - 2 m (a A - b B) (n_{med} - n_{L}) .

(31)

From (29)–(31), we conclude after a bit of algebra that

\begin{matrix} Var (W (a, b) / μ)) = [(n_{2, H} - n_{2, L}) (1 - 2 I S_{M}) + m^{2} {(a A - b B)}^{2} / 4 + μ_{2} I S_{M}^{2} \\ + 2 m (a A - b B) ((I S_{M} - 1) n_{med} + n_{L})] / μ^{2} . \end{matrix}

(32)

Another way to estimate

Var (W (b))

and

Var (W (a))

is to define

\begin{matrix} w_{i} (b) & = {\hat{μ}}^{- 1} {\hat{u}}_{1 i} (b) - {\hat{μ}}^{- 2} y_{i} {\hat{n}}_{L}, and \\ w_{i} (a) & = {\hat{μ}}^{- 1} {\hat{u}}_{1 i} (a) - {\hat{μ}}^{- 2} y_{i} {\hat{n}}_{H} \end{matrix}

for

i = 1, \dots, N

and use the sample variances of the

w_{i} (b)

and

w_{i} (a)

as

\hat{Var} (W (b))

and

\hat{Var} (W (a))

, respectively; recall the definitions (23). Further, if we define

w_{i} (a, b) = {\hat{μ}}^{- 1} [{\hat{u}}_{1 i} (a) - {\hat{u}}_{1 i} (b) - {\hat{I S}}_{M} y_{i}],

the sample variance of the

{\hat{w}}_{i} (a, b)

estimates

Var (W (a, b) / μ)

.

2.3. Income Group Means

The mean income of recipients with income no greater than

b m

is denoted

μ_{L}

and is equal to

μ_{L} \equiv E (Y ∣ Y \leq b m) = \int_{0}^{b m} y d F (y) / \int_{0}^{b m} d F (y) = n_{L} / P S_{L},

estimated by

{\hat{μ}}_{L} \equiv {\hat{n}}_{L} / {\hat{P S}}_{L}

. From this, we have to leading order,

N^{1 / 2} ({\hat{μ}}_{L} - μ_{L})) = \frac{1}{P S_{L}} [N^{1 / 2} ({\hat{n}}_{L} - n_{L}) - μ_{L} N^{1 / 2} ({\hat{P S}}_{L} - P S_{L})] .

This suggests the definition of a new random variable

X (b)

, as follows:

X (b) = \frac{1}{P S_{L}} (U_{1} (b) - μ_{L} U (b));

(33)

recall the definitions (4) and (20). Then,

N^{1 / 2} ({\hat{μ}}_{L} - μ_{L})

is asymptotically equal in distribution to

X (b) - E (X (b))

. Details of the calculation of the variance of

X (b)

are in Appendix A(a), although it is also possible to make the definition for

i = 1, \dots, N

{\hat{x}}_{i} (b) = \frac{1}{{\hat{P S}}_{L}} ({\hat{u}}_{1 i} (b) - {\hat{μ}}_{L} {\hat{u}}_{i} (b)),

(34)

with

{\hat{u}}_{i} (b)

and

{\hat{u}}_{1 i} (b)

defined, respectively, by (6) and (23), and use the sample variance of the

{\hat{x}}_{i} (b)

as an estimate of

Var (X)

. The calculation in Appendix A(a) leads to a rather simple expression for

Var (X (b))

, as follows:

Var (X (b)) = \frac{1}{P S_{L}^{2}} [n_{2, L} - P S_{L} μ_{L}^{2} + \frac{1}{4} B^{2} {(b m - μ_{L})}^{2}] .

(35)

Note that

Var (Y ∣ Y \leq b m) = \frac{\int_{0}^{b m} y^{2} d F (y)}{\int_{0}^{b m} d F (y)} - {(\frac{\int_{0}^{b m} y d F (y)}{\int_{0}^{b m} d F (y)})}^{2} = n_{2, L} / P S_{L} - μ_{L}^{2},

and so, writing

σ_{L}^{2} = Var (Y ∣ Y \leq b m)

, we can reformulate (35) as

Asy var ({\hat{μ}}_{L}) = \frac{1}{P S_{L}^{2}} {(μ_{L} - b m)}^{2} B^{2} / 4 + \frac{1}{P S_{L}} σ_{L}^{2} .

(36)

Note once more that the second term in this expression corresponds to the variance of

{\hat{μ}}_{L}

based on the true

b m

cut-off value, while the first term corresponds to the variability associated with the randomness of the cut-off

b \hat{m}

about its population value

b m

.

The mean of incomes greater than

a m

is

μ_{H} \equiv E (Y ∣ Y > a m) = \int_{a m}^{\infty} y d F (y) / \int_{a m}^{\infty} d F (y) = \frac{μ - n_{H}}{P S_{H}},

(37)

estimated by

{\hat{μ}}_{H} = (\hat{μ} - {\hat{n}}_{H}) / {\hat{P S}}_{H}

. Then,

N^{1 / 2} ({\hat{μ}}_{H} - μ_{H})

is asymptotically equal in distribution to the random variable

X (a) = \frac{1}{P S_{H}} [Y - U_{1} (a) - μ_{H} (1 - U (a))]

(38)

minus its expectation. Note that

\begin{matrix} 1 - U (a) = I (Y > a m) + A I (Y \leq m), & E (1 - U (a)) = P S_{H} + A / 2, \\ Y - U_{1} (a) = Y I (Y > a m) + a m A I (Y \leq m), & E (Y - U_{1} (a)) = μ - n_{H} + a m A / 2, \end{matrix}

(39)

so that

E (X (a)) = - \frac{A}{2 P S_{H}} (μ_{H} - a m) .

(40)

The variance of

X (a)

, derived in detail in Appendix A(b), is

Var (X (a)) = \frac{1}{P S_{H}^{2}} [μ_{2} - n_{2, H} - P S_{H} μ_{H}^{2} + \frac{1}{4} A^{2} {(μ_{H} - a m)}^{2}] .

(41)

Now, if we define the conditional variance

Var (Y ∣ Y > a m) = n_{2, H} / P S_{H} - μ_{H}^{2} \equiv σ_{H}^{2},

then (41) can also be expressed as

Asy var ({\hat{μ}}_{H}) = \frac{1}{P S_{H}^{2}} {(μ_{H} - a m)}^{2} A^{2} / 4 + \frac{1}{P S_{H}} σ_{H}^{2} .

(42)

Alternatively, for

i = 1, \dots, N

, make the definition

{\hat{x}}_{i} (a) = \frac{(y_{i} - {\hat{u}}_{1 i} (a)) - {\hat{μ}}_{H} (1 - {\hat{u}}_{i} (a))}{{\hat{P S}}_{H}} .

The variance of the limiting distribution of

N^{1 / 2} ({\hat{μ}}_{H} - μ_{H}))

can then be estimated by the sample variance of the

{\hat{x}}_{i} (a)

. (Recall definitions (6) and (23) for

{\hat{u}}_{i} ()

and

{\hat{u}}_{1 i} ()

.)

The mean of the incomes between

b m

and

a m

is

μ_{M} \equiv E (Y ∣ b m < Y \leq a m) = \int_{b m}^{a m} y d F (y) / \int_{b m}^{a m} d F (y) = \frac{n_{H} - n_{L}}{P S_{M}},

(43)

estimated by

{\hat{μ}}_{M} = ({\hat{n}}_{H} - {\hat{n}}_{L}) / {\hat{P S}}_{M}

. Thus,

N^{1 / 2} ({\hat{μ}}_{M} - μ_{M})

is asymptotically equal in distribution to the random variable

\begin{matrix} X (b, a) & = \frac{P S_{M} (U_{1} (a) - U_{1} (b)) - (n_{H} - n_{L}) (U (a) - U (b))}{P S_{M}^{2}} \\ = \frac{1}{P S_{M}} [(U_{1} (a) - U_{1} (b)) - μ_{M} (U (a) - U (b))], \end{matrix}

(44)

minus its expectation.

Note that

μ_{M}

is not a function of

μ_{H}

and

μ_{L}

alone. Estimating it poses no problem, but a new calculation is needed to find an expression for its asymptotic variance. The variance of

X (b, a)

is derived in Appendix A(c). It is

\begin{matrix} Var (X (b, a)) = E [W^{2} (b, a)] - {[E (X (b, a))]}^{2} \\ = \frac{1}{P S_{M}^{2}} [n_{2, H} - n_{2, L} - μ_{M}^{2} P S_{M} + D^{2} / 4 + D [2 (n_{med} - n_{L}) + 2 μ_{M} P S_{L} - μ_{M}]], \end{matrix}

(45)

where we have made the definition

D = μ_{M} (A - B) - m (a A - b B) .

(46)

Another conditional variance:

Var (Y ∣ b m < Y \leq a m) = (n_{2, H} - n_{2, L}) / P S_{M} - μ_{H}^{2} \equiv σ_{M}^{2},

so that (45) reformulated becomes

Asy var ({\hat{μ}}_{M}) = \frac{1}{P S_{M}^{2}} [D^{2} / 4 + D (2 (n_{med} - n_{L}) + 2 μ_{M} P S_{L} - μ_{M})] + \frac{1}{P S_{M}} σ_{M}^{2} .

(47)

In order to estimate the variance of the limiting distribution of

N^{1 / 2} ({\hat{μ}}_{M} - μ_{M})

, another way to proceed is to make the definition, for

i = 1, \dots, N

,

{\hat{x}}_{i} (b, a) = \frac{{\hat{u}}_{1 i} (a) - {\hat{u}}_{1 i} (b) - {\hat{μ}}_{M} ({\hat{u}}_{i} (a) - {\hat{u}}_{i} (b))}{{\hat{P S}}_{M}}

and use the sample variance of the

{\hat{x}}_{i} (b, a)

to estimate the desired variance.

2.4. Summary of Main Results

2.4.1. Population Shares

From the results (8), (10), and (17), we obtain directly that

\begin{matrix} Asy var ({\hat{P S}}_{L}) & = P S_{L} (1 - P S_{L}) + B^{2} / 4 - P S_{L} B, \\ where B = b f (b m) / f (m); \\ Asy var ({\hat{P S}}_{H}) & = P S_{H} (1 - P S_{H}) + A^{2} / 4 - P S_{H} A, \\ where A = a f (a m) / f (m); \\ Asy var ({\hat{P S}}_{M}) & = P S_{M} (1 - P S_{M}) + C^{2} / 4 - (P S_{H} - P S_{L}) C, \\ where C = A - B . \end{matrix}

2.4.2. Income Shares

From the results (27), (28), and (32), we obtain

\begin{matrix} Asy var ({\hat{I S}}_{L}) = \frac{1}{μ^{2}} [n_{2, L} + \frac{1}{4} {(b m B)}^{2} - 2 b m B n_{L}] + \frac{1}{μ^{4}} n_{L}^{2} μ_{2} - \frac{2 n_{L}}{μ^{3}} [n_{2, L} - b m B n_{med}]; \\ Asy var ({\hat{I S}}_{H}) = \frac{1}{μ^{2}} [n_{2, H} + \frac{1}{4} {(a m A)}^{2} - 2 a m A n_{med}] + \frac{1}{μ^{4}} n_{H}^{2} μ_{2} - \frac{2 n_{H}}{μ^{3}} [n_{2, H} - a m A n_{med}]; \end{matrix}

\begin{matrix} Asy var ({\hat{I S}}_{M}) = \frac{1}{μ^{2}} [(n_{2, H} - n_{2, L}) (1 - 2 I S_{M}) + m^{2} {(a A - b B)}^{2} / 4 + μ_{2} I S_{M}^{2} \\ + 2 m (a A - b B) ((I S_{M} - 1) n_{med} + n_{L})] . \end{matrix}

2.4.3. Income Group Means

From the results (35), (41), and (45), we obtain

\begin{matrix} Asy var ({\hat{μ}}_{L}) & = \frac{1}{P S_{L}^{2}} [n_{2, L} - P S_{L} μ_{L}^{2} + \frac{1}{4} B^{2} {(b m - μ_{L})}^{2}]; \\ Asy var ({\hat{μ}}_{H}) & = \frac{1}{P S_{H}^{2}} [μ_{2} - n_{2, H} - P S_{H} μ_{H}^{2} + \frac{1}{4} A^{2} {(μ_{H} - a m)}^{2}]; \\ Asy var ({\hat{μ}}_{M}) & = \frac{1}{P S_{M}^{2}} [n_{2, H} - n_{2, L} - μ_{M}^{2} P S_{M} + D^{2} / 4 + D (2 (n_{med} - n_{L}) \\ + 2 μ_{M} P S_{L} - μ_{M})], where D = μ_{M} (A - B) - m (a A - b B) . \end{matrix}

Expressed somewhat differently, these results also follow from (36), (42), and (47):

\begin{matrix} Asy var ({\hat{μ}}_{L}) & = \frac{1}{P S_{L}^{2}} {(μ_{L} - b m)}^{2} B^{2} / 4 + \frac{1}{P S_{L}} σ_{L}^{2}; \\ Asy var ({\hat{μ}}_{H}) & = \frac{1}{P S_{H}^{2}} {(μ_{H} - a m)}^{2} A^{2} / 4 + \frac{1}{P S_{H}} σ_{H}^{2}; \\ Asy var ({\hat{μ}}_{M}) & = \frac{1}{P S_{M}^{2}} [D^{2} / 4 + D (2 (n_{med} - n_{L}) + 2 μ_{M} P S_{L} - μ_{M})] + \frac{1}{P S_{M}} σ_{M}^{2} . \end{matrix}

Note that the general framework of this paper allows for more and for more refined income groups than just the three employed here—so long as the cut-off points between income groups are expressed in terms of multiples of the median.

If, as for instance with stratified sampling, observations are not equally weighted, our analysis can still be applied if the number of actual observations N is replaced by the sum of the weights over the sample.

3. Inference on Related Distributional Statistics

This section considers three sets of distributional statistics that involve applications of the analytical results developed in the previous section. As there, we restrict attention to the case in which

b < 1 < a

, thus defining three income groups: the lower group L, for incomes less than or equal to

b m

; the middle group M, with incomes between

b m

and

a m

; and the higher group H, with incomes greater than

a m

.

3.1. Relative Mean Income Ratios

The relative mean income for each income group is the ratio of the group’s mean income to the overall mean income of the distribution:

{RMI}_{i} = μ_{i} / μ for i = L, M, H .

(48)

For example, in recent decades for many countries, the lower-income ratio

{\hat{μ}}_{L} / \hat{μ}

has not changed much, while the upper-income ratio

{\hat{μ}}_{H} / \hat{μ}

has risen substantially. It would be useful to know whether the changes in both ratios are statistically significant or only the latter.

The relative mean income ratio can be estimated directly as

{\hat{RMI}}_{i} = {\hat{μ}}_{i} / \hat{μ} .

However, from the definitions of

μ_{L}

,

μ_{H}

, and

μ_{M}

, we have

μ_{L} / μ = n_{L} / (μ P S_{L}) = I S_{L} / P S_{L}

,

μ_{H} / μ = (μ - n_{H}) / (μ P S_{H}) = I S_{H} / P S_{H}

, and

μ_{M} / μ = (n_{H} - n_{L}) / (μ P S_{M}) = I S_{M} / P S_{M}

, and so for

i = L, M, H

,

{RMI}_{i} = I S_{i} / P S_{i}

. Thus, to leading order

N^{1 / 2} ({\hat{RMI}}_{i} - {RMI}_{i}) = \frac{1}{P S_{i}} [N^{1 / 2} ({\hat{I S}}_{i} - I S_{i}) - {RMI}_{i} N^{1 / 2} ({\hat{P S}}_{i} - P S_{i})] .

(49)

In Appendix A(d), explicit expressions are derived for the asymptotic variances of

{\hat{RMI}}_{i}

,

i = L, M, H

. The results are as follows:

\begin{matrix} Asy var ({\hat{RMI}}_{L}) = \\ \frac{1}{P S_{L}^{2}} [(1 - 2 I S_{L}) (n_{2, L} / μ^{2} - I S_{L} {RMI}_{L}) + I S_{L}^{2} μ_{2} / μ^{2} + \frac{1}{4} B^{2} {({RMI}_{L} - b m / μ)}^{2} \\ - I S_{L}^{2} - I S_{L} B ({RMI}_{L} - b m / μ) (2 n_{med} / μ - 1)], \end{matrix}

(50)

\begin{matrix} Asy var ({\hat{RMI}}_{H}) = \\ \frac{1}{P S_{H}^{2}} [(2 I S_{H} - 1) n_{2, H} / μ^{2} + n_{H}^{2} μ_{2} / μ^{4} + \frac{1}{4} A^{2} {({RMI}_{H} - a m / μ)}^{2} \\ - I S_{H}^{2} + I S_{H}^{2} / P S_{H} - 2 I S_{H}^{2} / P S_{H}^{2} + I S_{H} A ({RMI}_{H} - a m / μ) (2 n_{med} / μ - 1)] . \end{matrix}

(51)

\begin{matrix} Asy var ({\hat{RMI}}_{M}) = \\ \frac{1}{P S_{M}^{2}} [Asy var ({\hat{I S}}_{M}) + {RMI}_{M}^{2} Asy var ({\hat{P S}}_{M}) - 2 {RMI}_{M} Asy cov ({\hat{I S}}_{M}, {\hat{P S}}_{M})] . \end{matrix}

(52)

The details of the calculation of the covariance needed in (52) are relegated to Appendix A(e). The result is

\begin{matrix} Asy cov ({\hat{P S}}_{M}, {\hat{I S}}_{M}) = I S_{M} (1 - I S_{M}) - \frac{1}{2 μ} m (a A - b B) (P S_{H} - P S_{L}) \\ + \frac{C}{μ} [I S_{M} n_{med} - n_{med} + n_{L} + \frac{1}{4} m (a A - b B)], \end{matrix}

(53)

with

C = A - B

.

3.2. Polarization Measures

The rise in upper incomes, resulting in a growing separation between high-income recipients and middle-class workers, has led to concern about the degree of polarization in income distributions. The concept of polarization can be viewed as having two quite distinct dimensions. One is the size dimension or relative mass at the two ends of the distribution (see for example Wolfson (1994)), which we label tail-frequency polarization and capture here as the proportion of recipients in the lower or higher income groups—what we are referring to here as

P S_{L}

and

P S_{H}

. Such measures then are

{\hat{P S}}_{L}

,

{\hat{P S}}_{H}

, and

{\hat{P S}}_{L} + {\hat{P S}}_{H}

. Asymptotic variances for the first two have already been obtained in Section 2.4 above. For

{\hat{P S}}_{L} + {\hat{P S}}_{H}

, note that the sum of the three population shares is one, and so the asymptotic variance of

{\hat{P S}}_{L} + {\hat{P S}}_{H}

is simply that of the middle group,

{\hat{P S}}_{M}

, which again we already have in (17).

The other aspect of polarization is the distance dimension or income-gap polarization, represented here by

{\hat{μ}}_{H} - {\hat{μ}}_{M}

,

{\hat{μ}}_{M} - {\hat{μ}}_{L}

, or

{\hat{μ}}_{H} - {\hat{μ}}_{L}

. Both sets of measures provide useful insights, and both can be implemented in our analytical framework. In the case of the income-gap polarization measures, again, the asymptotic variances of

{\hat{μ}}_{H}

,

{\hat{μ}}_{M}

, and

{\hat{μ}}_{L}

have been established in Section 2.4.2. For the differences in income group means, recall that

Asy var ({\hat{μ}}_{i} - {\hat{μ}}_{j}) = Asy var ({\hat{μ}}_{i}) + Asy var ({\hat{μ}}_{j}) - 2 Asy cov ({\hat{μ}}_{i}, {\hat{μ}}_{j})

for

i \neq j

. The three required covariances are provided in Appendix A(f). Thus, again, standard errors of the income-gap polarization measures can be computed in the usual fashion.

One could also posit a set of compound polarization measures, which capture both of these dimensions together:

C P_{H} \equiv P S_{H} (μ_{H} - μ_{M}))

,

C P_{L} \equiv P S_{L} (μ_{M} - μ_{L})

, and also

C P \equiv (P S_{H} + P S_{L}) (μ_{H} - μ_{L}) = (1 - P S_{M}) (μ_{H} - μ_{L})

.

Analogously, one could further identify a compound measure to capture the evident decline in the economic situation of the middle dlass in many countries over recent decades as

P S_{M} \cdot μ_{M}

. This would allow one, for example, to use logarithmic derivatives to estimate the relative importance of changes in the relative size of the middle class (

Δ P S_{M}

) versus changes in their average real incomes (

Δ μ_{M}

) in this decline.

One can use the results of Section 2 to work out the asymptotic variances of these various estimated compound measures; see Appendix A(g) for details.

3.3. Mean–Decile Functions

In an environment where higher incomes have been rising dramatically relative to the rest of the distribution, one measure of interest could be an indication of skewness of the distribution, as measured by the difference between the overall mean and median of the income distribution,

\hat{μ} - \hat{m}

or

\hat{m} / \hat{μ}

. However,

\hat{m}

is simply the fifth decile of the distribution. One could, more generally, define a mean–decile function.

Choose some proportions

p_{i}

,

i = 1, \dots, m

with

p_{i} < p_{j}

for

i < j

. For deciles, we would have

p_{i} = i / 10

,

i = 1, 2, \dots, 9

. Let

ξ_{i}

be the

p_{i}

-quantile of the distribution: the proportion of incomes less than

ξ_{i}

is

p_{i}

, and let

{\hat{ξ}}_{i}

be the corresponding sample quantile. Possible mean–decile functions could take on values

{\hat{ξ}}_{i} - \hat{μ}

, or alternatively

{\hat{ξ}}_{i} / \hat{μ}

, for the

i^{th}

decile of the distribution as a further way of capturing growing income differences over various ranges of the distribution.

Here, we can make use of the work of Lin et al. (1980). These authors show that, under general regularity conditions, the

{\hat{ξ}}_{i}

and

\hat{μ}

are asymptotically joint normally distributed. We denote the asymptotic variance–covariance matrix by

Σ

: it is an

(m + 1) \times (m + 1)

matrix, where the index

i = 0

refers, not to a quantile, but to

μ

. Then, for

0 < i \leq j \leq m

, the elements of

Σ

are

\begin{matrix} σ_{i j} & = p_{i} (1 - p_{j}) / [f (ξ_{i}) f (ξ_{j})], \\ σ_{00} & = σ^{2}, \\ σ_{0 i} & = p_{i} (μ - μ_{i}) / f (ξ_{i}), \end{matrix}

where

f (ξ_{i})

is the density at

ξ_{i}

,

μ_{i} = E (Y ∣ Y \leq ξ_{i}) = (1 / p_{i}) \int_{0}^{ξ_{i}} y d F (y)

, and

σ^{2} = Var (Y)

.

Thus, for the mean–decile distribution defined in levels as

{\hat{ξ}}_{i} - \hat{μ}

, we have

\begin{matrix} Asy var ({\hat{ξ}}_{i} - \hat{μ}) & = Asy var ({\hat{ξ}}_{i}) + Asy var (\hat{μ}) - 2 Asy cov ({\hat{ξ}}_{i}, \hat{μ}) \\ = \frac{p_{i} (1 - p_{i})}{f^{2} (ξ_{i})} + σ^{2} - \frac{2 p_{i} (μ - μ_{i})}{f (ξ_{i})} . \end{matrix}

In relative or proportional terms,

\begin{matrix} Asy var ({\hat{ξ}}_{i} / \hat{μ}) & = [1 / μ - ξ_{i} / μ^{2}] Σ_{0 i} [\begin{matrix} 1 / μ \\ - ξ_{i} / μ^{2} \end{matrix}] \\ = \frac{σ^{2}}{μ^{2}} + \frac{ξ_{i}^{2} p_{i} (1 - p_{i})}{μ^{4} f^{2} (ξ_{i})} - \frac{2 ξ_{i} p_{i} (μ - μ_{i})}{μ^{3} f (ξ_{i})} . \end{matrix}

Note that the density appears as such in the denominator of the above expressions rather than as a ratio

f (a m) / f (m)

or

f (b m) / f (m)

as elsewhere in this paper. However,

f (ξ_{i})

can be estimated in the same way as the other densities used; see Appendix B. Standard errors can be calculated accordingly.

3.4. Relation with the Bootstrap

Given the fact that the bootstrap has become an almost universal tool for reliable statistical inference, it is incumbent on us to outline how the material in this paper can be used in connection with bootstrap methods. It has been suggested that the asymptotic variances and standard errors provided here are unnecessary, as they can be obtained in a finite-sample context by use of the bootstrap. However, Horowitz (2001) points out that naive bootstrap standard errors are unlikely to be any better than asymptotic ones and may well be worse. What he and numerous other authors recommend is using an asymptotic standard error in order to construct an asymptotically pivotal quantity by studentizing, that is, dividing the quantity of interest, supposed to have expectation zero, by its standard error. The studentized quantity can then be bootstrapped in order to obtain a bootstrap P value for some null hypothesis, or to construct a bootstrap confidence interval for a parameter of interest.

Our results can be applied readily to such a bootstrap exercise. For instance, a test of a hypothesis that

P S_{M}

is equal to some given value M can be based on bootstrapping

({\hat{P S}}_{M} - M) / s e_{P S M}

, where

s e_{P S M}

is the square root of the asymptotic variance of

{\hat{P S}}_{M}

given by (17). Similarly a bootstrap confidence for

P S_{M}

can be constructed by conventional means.

Another reason to exercise care in applying the bootstrap to the data used in this paper is set out in Davidson (2018). The incomes given for individuals in the census data are often, indeed usually, rounded to multiples of USD 500 or USD 1000. This means that the empirical distribution of the sample of incomes is not smooth, and this is known to cause problems for a conventional resampling bootstrap. We verified that this is the case with our samples. Asymptotic variances as given by the formulas of this paper, and variances derived from a conventional resampling bootstrap, were compared in the context of a simulation experiment that used samples of 200,000 observations realized from a lognormal distribution. The results were comparable, as might be expected with such large samples. When the same exercise was repeated with the sample of men’s incomes in 2000, the bootstrap variances were very different from the asymptotic ones.

Another point of interest for practitioners is that all the asymptotic standard errors reported in Table 1 Fortunately, no renumbering is needed. were computed in a quarter of a second, whereas the corresponding bootstrap standard errors, with 999 bootstrap repetitions, took 80 s.

4. Empirical Study

In this section, we present results obtained using data from the Canadian Census Public Use Microdata Files (PUMF) for Individuals for 2000 and 2005, as recorded in the 2001 and 2006 censuses. We preferred these datasets to more up-to-date ones since the 2015–2020 census interval has results that are massively affected by the Canadian federal government’s response to the COVID-19 pandemic in the form of major temporary income support programs. In addition, the 2011 Census used a changed methodology (to save money) that made the income data for 2010 non-comparable to the other censuses.

We treat men and women separately, as their wages and labor-market participation rates were quite different. Accordingly, for each census year, two samples, one for each sex, are extracted from the census data files and are treated separately. In both cases, individuals younger than 15 years of age are dropped from the sample, as well as individuals who did not work in that year or for whom the information on weeks worked is missing. Earnings here refers to annual wage and salary income and net self-employment income. Statistics Canada typically rounds incomes to integer multiples of CAN 1000. Earnings are stated in thousands of 2005 (Canadian) dollars.

Given Assumption 1, it is important to see to what extent the rounding of incomes, which inevitably creates an empirical distribution more discontinuous than one generated by sampling from a genuinely differentiable distribution, has an effect on our asymptotic standard errors. We took a subsample of just 1000 observations from the dataset for men from the 2000 census, and smoothed the data by adding noise generated by the Epanechnikov kernel. To each income y, measured in dollars rather than thousands of dollars, the added noise is given by

2 \sqrt{5} min (h, y / 2) cos ((2 π - {cos}^{- 1} (1 - 2 p)) / 3),

where the bandwidth

h = 1500

. The asymptotic standard errors computed from the smoothed data differed by less than one percent from those computed from the census data.

Density estimates were given by the approach outlined in Appendix B. We experimented with different values of the parameter n using samples drawn from the lognormal distribution, for which the density is known analytically. It appeared that a larger value of n gave more accurate estimates, but that numerical overflow occurred in the computation of the gamma function for values of n greater than around 170. We found that setting

n = 100

gave satisfactory results, although other choices in the neighborhood of 100 gave results that were not markedly different.

In Table 1, results are shown for men in 2000. The entries for

\hat{ξ}

are the upper income cutoff for group L and the lower income cutoff for group H. For group M, the entry is the sample median. Asymptotic standard errors are in brackets.

Table 1. Men in 2000.

	$\hat{ξ}$	$\hat{PS}$	$\hat{IS}$	$\hat{μ}$	$\hat{RMI}$
L	17.7420	0.2702	0.0500	7.7588	0.1851
		(0.0007)	(0.0002)	(0.0271)	(0.0006)
M	35.4840	0.5811	0.5745	41.4371	0.9886
	(0.0770)	(0.0012)	(0.0019)	(0.0937)	(0.0018)
H	70.9681	0.1487	0.3755	105.8242	2.5248
		(0.0009)	(0.0019)	(0.3020)	(0.0045)

Sample size is 227,828, and the estimate

\hat{A}

= 0.8603, and

\hat{B}

= 0.4362.

Table 2 shows the corresponding results for women in 2000.

In Table 3 and Table 4, there are similar results for men and women respectively in 2005.

The sample sizes for these four tables of basic distributional results are quite large; so, it should perhaps not be surprising that the asymptotic standard errors are quite small, and all the reported statistics in these basic tables are highly statistically significant. They involve averages or proportions, which seem to be robustly estimated. The estimates of A and B are also all quite sensible in that they imply that the estimated density ratio

\hat{f} (b \hat{m}) / \hat{f} (\hat{m})

is considerably larger than

\hat{f} (a \hat{m}) / \hat{f} (\hat{m})

—which is what one would expect for a right-skewed distribution such as for an earnings distribution.

Table 5 and Table 6 show the differences in outcomes between men and women for the years 2000 and 2005, with asymptotic standard errors for these differences in parentheses. A positive difference means that the relevant outcome is greater for men than for women; a negative difference means the reverse. Again, all the differences are highly statistically significant. Two results are evident. In both years, men were relatively more concentrated in the middle-income group with women relatively more concentrated in the lower- and higher- income groups within each distribution. This is consistent with more part-time women workers as well as generally higher levels of education for women than for men in recent decades. Second, the earnings gap between men and women changed very little within the lower and middle income groups over 2000–2005. But in the higher income group, men’s earnings shot up quite dramatically compared to women’s over this period.

Table 7 and Table 8 present differences or changes over time in the distributional outcome measures between 2000 and 2005, separately for men and women. For outcomes that were greater in 2005 than in 2000, the differences are positive. Again, asymptotic standard errors are in parentheses, and again, all but one of the changes are highly statistically significant. Here, the changes are quite dramatic given that major distributional changes have typically been rather slow and gradual over time. For both men and women, the proportion of workers in the middle-income group fell substantially between 2000 and 2005, as did the relative-mean incomes of the middle group. On the other hand, mean earnings levels in the higher-income group went up dramatically. As a result, the earnings share of the middle group of so-called middle-class earners markedly declined and was made up by a corresponding dramatic rise in the earnings share of the higher-income group. This pattern occurred for both women and men in the Canadian labor market between 2000 and 2005, but the changes were two to three times stronger in the earnings distribution for men than for women.

Table 9 and Table 10 further pursue this significant pattern of change and show results for several measures of polarization within the earnings distributions (see Section 3.2 above). Table 9 focuses on population shares or the proportion of workers towards the two ends of the distributions, while Table 10 bases alternative polarization measures on mean earnings gaps over the ends of the distributions. Again, in both sets of polarization measures, one finds broadly similar patterns of change for both men and women (though with some differences). In the case of

P S

-based measures (Table 9), the general polarization of workers out of the middle-class region was driven by an increased proportion of workers in the H earnings group among men but by an increased proportion of workers in the L earnings group among women. In the case of the earnings-gap measures (Table 10), the greatly widening gaps in earnings between groups in the distributions is almost entirely driven by the widening gap between middle-class and higher earnings levels—for both men and women in the labor market. Again, the changes are about twice as strong among men than among women workers, and again, the results are highly statistically significant.

Finally, Table 11 and Table 12 display estimates of and changes in the compound polarization measures (in Section 3.2) that combine the population share and earnings gap dimensions. As can be seen, for both men and women, changes in the upper end of the earnings distributions over the 2000–2005 period were much greater than changes in the lower end of the distributions. For women, the changes were about twice as large, while for men it was about eight times. Clearly, the large changes have been occurring between the middle-class earnings group and the higher-earnings group. This recommends the use of separate polarization measures for the lower and upper ends of the distribution rather than one that blends or combines the two and thus potentially hides the basic structural changes that are going on over the different regions of the distribution and in the Canadian labor market. Note also that, for men, both components of

C P_{H}

contribute to the large increases in earnings polarization—both increases in

P S_{H}

, as well as the rising earnings gap (

μ_{H} - μ_{M}

)—while for women, the increase in

C P_{H}

is driven completely by rapidly rising upper earnings levels. Again, these polarization changes are all highly statistically significant. Because our sample sizes are large, our asymptotic results seem to be reliable, as illustrated by the simulation evidence presented in Appendix D.

As actual explanations for these major changes are fairly complex and overlapping (some examples: skill-biased automation, globalization and deindustrialization, sectoral and demographic shifts, increased industrial concentration, and weakened private-sector unionization rates); for more extensive discussion, we prefer to refer to (Beach, 2016, 2025), among others, where one can find more extensive discussion of the leading structural explanations of the observed distributional changes and possible policy implications of these changes.

One might want to follow up on the above results by investigating possible intra-group dynamics within any of the income groups.1 Since the choice of the a and b cut-off scalars is arbitrary, one could redo part or all of the above empirical analysis with different values of a and b, possibly highlighting specific narrower regions of the income distribution. Instead, the authors would recommend using a—possibly quite refined—quantile-based analysis as provided in Beach and Davidson (2025). The corresponding variance–covariance formulas in a quantile-based approach are simpler to use and are distribution-free, so that no density estimation steps need to be undertaken. Indeed, the authors view these two papers to be complementary, and between them they provide a quite extensive tool box of distributional statistics to look at possibly quite disaggregative patterns of distributional change.

5. Conclusions

This paper considers income distributions that are divided into lower, middle, and upper regions based on separating points that are scalar multiples of the median. For example, the lower region (L) could consist of recipients with incomes less than half the median, the middle group (M) includes those with incomes between 50 percent and 200 percent of the median, and those with incomes above twice the median lie in the higher income group (H). Such a characterization of an income distribution is very useful in evaluating changes over time in the economic experience of the middle-class income group and in the nature of polarization in the distribution. For each of these three income groups, separate estimates are obtained for their income shares (

I S_{i}

), group size or population shares (

P S_{i}

) and their mean income levels (

μ_{i}

). The paper derives explicit formulas for the asymptotic variances (and, hence, standard errors) of sample estimates of the groups’ population shares, income shares, and mean incomes. It is shown that these formulas are not distribution-free, but that a density-estimation technique of Comte and Genon-Catalot (2012) is well-suited to provide needed data-based density estimates in empirical income distribution analyses. The results are then applied to derive asymptotic variances for relative-mean income ratios, for each income group, for various polarization measures, and for decile–mean income ratios. This statistical framework is implemented with Canadian Census public-use microdata files in order to investigate some of the key features of changes in the Canadian earnings distribution.

It is found that population and income shares and income-group means can indeed be estimated with a high degree of reliability. Major patterns of distributional change that have been previously highlighted in the literature have indeed been found to be highly statistically significant. The distributional framework and statistical approach used in this paper thus allow one to move beyond descriptive analysis of distributional change to a formal framework of statistical inference and hypothesis testing.

Further, since

I S_{i} = P S_{i} \cdot {RMI}_{i}

, changes in group income shares have been found to arise from changes in both population shares and relative mean incomes. Estimating these two dimensions separately allows for (i) a rich economic interpretation and testing of the driving factors behind distributional change and (ii) an extensive characterization (and hence better understanding) of polarization as a key aspect of on-going distributional change.

The results of this paper suggest that official government statistical agencies—such as Statistics Canada and the U.S. Bureau of the Census—may wish to consider providing median-based estimates of population shares, income shares and income-group means to complement their regularly published series on decile income shares and decile means. They could also provide user information on the general reliability of these estimates. Since the deciles and decile means, which official agencies already provide, and the median-based statistics provided in this paper are usefully complementary, they together would offer a much better source of distributional information on which to base possible policy initiatives to improve policy design and targetting. For example, one might ask what the appropriate income range is for so-called middle-class income tax cuts, COVID-19-response temporary income support programs, or possibly for wage or employment adjustment programs in face of major tariff impact adjustments.

Author Contributions

Conceptualization, C.M.B. and R.D.; methodology, C.M.B. and R.D.; software, R.D.; formal analysis, R.D.; writing—original draft, C.M.B.; writing—review and editing, R.D.; All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data used to generate the results in this paper were all extracted from the Canadian Census Public Use Microdata Files (PUMF).

Acknowledgments

Davidson’s research was supported by a Distinguished James McGill Professorship at McGill University. We thank participants at the 2024 meeting of the Canadian Econometric Study Group, especially Pujee Tuvaandorj, for valuable comments on the paper. We are grateful to the late Aidan Worswick, research assistant to both authors, for providing us data in a manageable form.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

CDF	cumulative distribution function
EDF	empirical distribution function
RMI	relative mean income

Appendix A. Detailed Calculations

(a): Variance of ${\hat{μ}}_{L}$

Recall from (33) that

X (b) = \frac{1}{P S_{L}} (U_{1} (b) - μ_{L} U (b)) .

Since

E (U (b)) = P S_{L} - B / 2

,

E (U_{1} (b)) = n_{L} - b m B / 2

, and

μ_{L} = n_{L} / P S_{L}

, it follows that

E (X (b)) = \frac{B}{2 P S_{L}} (μ_{L} - b m) .

(A1)

Next,

X^{2} (b) = \frac{1}{P S_{L}^{2}} (U_{1}^{2} (b) + μ_{L}^{2} U^{2} (b) - 2 μ_{L} U (b) U_{1} (b)) .

For the expectation of this, we use (7) for

E (U^{2} (b))

, (22) for

E (U_{1}^{2} (b))

, and (A8) for

E (U (b) U_{1} (b))

. Thus,

\begin{matrix} E (X^{2} (b)) = \frac{1}{P S_{L}^{2}} [n_{2, L} + \frac{1}{2} {(b m B)}^{2} - 2 b m B n_{L} \\ + μ_{L}^{2} (P S_{L} (1 - 2 B) + \frac{1}{2} B^{2}) - 2 μ_{L} n_{L} (1 - B) + 2 b m B n_{L} - μ_{L} b m B^{2}]] . \end{matrix}

By collecting coefficients of powers of B, we see that

E (X^{2} (b)) = \frac{1}{P S_{L}^{2}} [n_{2, L} - n_{L} μ_{L} + \frac{1}{2} B^{2} {(b m - μ_{L})}^{2}],

while from (A1), we have

{(E (X (b)))}^{2} = \frac{B^{2}}{4 P S_{L}^{2}} {(b m - μ_{L})}^{2},

and so

Var (X (b)) = \frac{1}{P S_{L}^{2}} [n_{2, L} - n_{L} μ_{L} + \frac{1}{4} B^{2} {(b m - μ_{L})}^{2}] .

(b): Variance of ${\hat{μ}}_{H}$

From (38), we have

X (a) = \frac{1}{P S_{H}} [Y - U_{1} (a) - μ_{H} (1 - U (a))],

and so

X^{2} (a) = \frac{1}{P S_{H}^{2}} [{(Y - U_{1} (a))}^{2} + μ_{H}^{2} {(1 - U (a))}^{2} - 2 μ_{H} (Y - U_{1} (a)) (1 - U (a))] .

From (39), it is easy to see that

\begin{matrix} E [{(Y - U_{1} (a))}^{2}] & = μ_{2} - n_{2, H} + {(a m A)}^{2} / 2; \\ E [{(1 - U (a))}^{2}] & = P S_{H} + A^{2} / 2; \\ E [(Y - U_{1} (a)) (1 - U (a))] & = μ - n_{H} + a m A^{2} / 2 . \end{matrix}

It then follows that

E (X^{2} (a))

is

\begin{matrix} \frac{1}{P S_{H}^{2}} [μ_{2} - n_{2, H} + {(a m A)}^{2} / 2 + μ_{H}^{2} (P S_{H} + A^{2} / 2) - 2 μ_{H} [μ - n_{H} + a m A^{2} / 2] \\ = & \frac{1}{P S_{H}^{2}} (μ_{2} - n_{2, H} - P S_{H} μ_{H}^{2} + A^{2} {(μ_{H} - a m)}^{2} / 2) . \end{matrix}

From (40), we have

{[E (X (a))]}^{2} = \frac{A^{2}}{4 P S_{H}^{2}} {(μ_{h} - a m)}^{2},

and so we conclude that the asymptotic variance of

{\hat{μ}}_{H}

is

Var (X (a)) = \frac{1}{P S_{H}^{2}} [μ_{2} - n_{2, H} - P S_{H} μ_{H}^{2} + \frac{1}{4} A^{2} {(μ_{H} - a m)}^{2}] .

(A2)

(c): Variance of ${\hat{μ}}_{M}$

Recall the definition (44):

X (b, a) = \frac{1}{P S_{M}} [(U_{1} (a) - U_{1} (b)) - μ_{M} (U (a) - U (b))],

whence

\begin{matrix} X^{2} (b, a) = \frac{1}{P S_{M}^{2}} [{(U_{1} (a) - U_{1} (b))}^{2} + μ_{M}^{2} {(U (a) - U (b))}^{2} \\ - 2 μ_{M} (U_{1} (a) - U_{1} (b)) (U (a) - U (b))] . \end{matrix}

(A3)

Note that

\begin{matrix} U (a) - U (b) & = I (b m < Y \leq a m) - (A - B) I (Y \leq m) and \\ U_{1} (a) - U_{1} (b) & = Y I (b m < Y \leq a m) - m (a A - b B) I (Y \leq m), \end{matrix}

so that

E (U (a - U (b))) = P S_{M} - (A - B) / 2

and

E (U_{1} (a) - U_{1} (b)) = n_{H} - n_{L} - m (a A - b B) / 2

. Further,

{(U (a) - U (b))}^{2} = I (b m < Y \leq a m) + {(A - B)}^{2} I (Y \leq m) - 2 (A - B) I (b m < Y \leq m),

so that

E [{(U (a) - U (b))}^{2}] = P S_{M} + {(A - B)}^{2} / 2 - (A - B) (1 - 2 P S_{L}) .

(A4)

Next,

\begin{matrix} {(U_{1} (a) - U_{1} (b))}^{2} & = Y^{2} I (b m < Y \leq a m) + m^{2} {(a A - b B)}^{2} I (Y \leq m) \\ - 2 m (a A - b B) Y I (b m < Y \leq m), \end{matrix}

so that

E [{(U_{1} (a) - U_{1} (b))}^{2}] = n_{2, H} - n_{2, L} + m^{2} {(a A - b B)}^{2} / 2 - 2 m (a A - b B) (n_{med} - n_{L}) .

(A5)

Then,

\begin{matrix} (U (a) - U (b)) (U_{1} (a) - U_{1} (b)) & = Y I (b m < Y \leq a m) + m (A - B) (a A - b B) I (Y \leq m) \\ - [(A - B) Y + m (a A - b B)] I (b m < Y \leq m), \end{matrix}

so that

\begin{matrix} E [(U (a) - U (b)) & (U_{1} (a) - U_{1} (b))] = n_{H} - n_{L} + m (A - B) (a A - b B) / 2 \\ - (A - B) (n_{med} - n_{L}) - m (a A - b B) (\frac{1}{2} - P S_{L}) . \end{matrix}

From all this, we see that

\begin{matrix} E [X^{2} (b, a)] = \frac{1}{P S_{M}^{2}} [n_{2, H} - n_{2, L} + \frac{1}{2} [m^{2} {(a A - b B)}^{2} + μ_{M}^{2} {(A - B)}^{2} - 2 μ_{M} (A - B) m (a A - b B)] \\ + 2 (μ_{M} (A - B) - m (a A - b B)) (n_{med} - n_{L}) - μ_{M} (μ_{M} (A - B) - m (a A - b B)) (1 - 2 P S_{L}) \\ + μ_{M}^{2} P S_{M} - 2 μ_{M} (n_{H} - n_{L})] . \end{matrix}

To ease notation in the above expression, write

D = μ_{M} (A - B) - m (a A - b B),

as in (46). We get

E [X^{2} (b, a)] = \frac{1}{P S_{M}^{2}} [n_{2, H} - n_{2, L} - μ_{M}^{2} P S_{M} + D^{2} / 2 + D (2 (n_{med} - n_{L}) + 2 μ_{M} P S_{L} - μ_{M})] .

Now,

E (X (b, a)) = \frac{1}{2 P S_{M}} (μ_{M} (A - B) - m (a A - b B)) = \frac{D}{2 P S_{M}},

(A6)

and so

\begin{matrix} Asy var ({\hat{μ}}_{M}) = Var (X (b, a)) = E [W^{2} (b, a)] - {[E (X (b, a))]}^{2} \\ = \frac{1}{P S_{M}^{2}} [n_{2, H} - n_{2, L} - μ_{M}^{2} P S_{M} + D^{2} / 4 + D (2 (n_{med} - n_{L}) + 2 μ_{M} P S_{L} - μ_{M})], \end{matrix}

as stated in (45).

(d): Variances of the ${\hat{RMI}}_{i}$ , $i = L, M, H$

Consider first

{\hat{RMI}}_{L}

. With

i = L

, (49) suggests the random variable

R (b) = W (b) - {RMI}_{L} U (b) = 1 / μ U_{1} (b) - n_{L} Y / μ^{2} - {RMI}_{L} U (b) .

The asymptotic variance of

{\hat{RMI}}_{L}

is then

Var (R (b) / P S_{L})

. An easy calculation shows that

E (R (b)) = \frac{1}{2} B ({RMI}_{L} - b m / μ) - I S_{L} .

(A7)

Then,

R^{2} (b) = W^{2} (b) + {RMI}_{L}^{2} U^{2} (b) - 2 {RMI}_{L} U (b) W (b) .

The expectation of

W^{2} (b)

follows from (27), and that of

U^{2} (b)

is given by (7). We have

U (b) W (b) = \frac{1}{μ} U (b) U_{1} (b) - \frac{n_{L}}{μ^{2}} U (b) Y .

It is easy to show that

\begin{matrix} U (b) U_{1} (b) & = Y I (Y \leq b m) (1 - B) - b m B I (Y \leq b m) + b m B^{2} I (Y \leq m) and \\ U (b) Y & = Y I (Y \leq b m) - B Y I (Y \leq m), \end{matrix}

so that

\begin{matrix} E (U (b) U_{1} (b)) & = n_{L} (1 - B) - b m B P S_{L} + b m B^{2} / 2 and \end{matrix}

(A8)

\begin{matrix} E (U (b) Y) & = n_{L} - B n_{med} . \end{matrix}

(A9)

Thus, we have

\begin{matrix} E (U (b) W (b)) & = \frac{1}{μ} [n_{L} (1 - B) - b m B P S_{L} + b m B^{2} / 2] - \frac{1}{μ^{2}} n_{L} (n_{L} - B n_{med}) \\ = P S_{L} (1 - P S_{L}) - B I S_{L} (1 - n_{med} / μ) - b m B P S_{L} + \frac{1}{2} b m B^{2} . \end{matrix}

(A10)

Some algebra lets us calculate

Var ({RMI}_{L})

from (27), (7), (A7), and (A10). The result is

\begin{matrix} Asy var ({\hat{RMI}}_{L}) = Var (R (b) / P S_{L}) = \\ \frac{1}{P S_{L}^{2}} [(1 - 2 I S_{L}) (n_{2, L} / μ^{2} - I S_{L} {RMI}_{L}) + I S_{L}^{2} μ_{2} / μ^{2} + \frac{1}{4} B^{2} {({RMI}_{L} - b m / μ)}^{2} \\ - I S_{L}^{2} - I S_{L} B ({RMI}_{L} - b m / μ) (2 n_{med} / μ - 1)], \end{matrix}

(A11)

Similarly, for

{\hat{RMI}}_{H}

, we consider the random variable

R (a) = W (a) - R M I_{H} U (a) = 1 / μ U_{1} (a) - n_{H} Y / μ^{2} - R M I_{H} U (a),

and

R^{2} (a) = W^{2} (a) + R M I_{H}^{2} U^{2} (a) - 2 R M I_{H} U (a) W (a) .

(A12)

First,

E (R (a)) = \frac{1}{2} A (R M I_{H} - a m / μ) + I S_{H} - R M I_{H} .

From (28), we can deduce that

\begin{matrix} E (W^{2} (a)) & = \frac{1}{μ^{2}} [n_{2, H} + \frac{1}{2} {(a m A)}^{2} - 2 a m A n_{med})] + \frac{n_{H}^{2} μ_{2}}{μ^{4}} - \frac{2 n_{H}}{μ^{3}} [n_{2, H} - a m A n_{med}] \\ = \frac{1}{μ^{2}} [n_{2, H} (2 I S_{H} - 1) + \frac{1}{2} {(a m A)}^{2} - 2 a m A I S_{H} n_{med}] . \end{matrix}

(A13)

It is immediate from (9) that

E (U^{2} (a)) = 1 - P S_{H} + A^{2} / 2 - A .

(A14)

Analogously to (A8) and (A9), we find that

\begin{matrix} E (U (a) U_{1} (a)) & = n_{H} - A n_{med} - a m A / 2 + a m A^{2} / 2 and \end{matrix}

(A15)

\begin{matrix} E (U (a) Y) & = n_{H} - A n_{med} . \end{matrix}

(A16)

Now,

U (a) W (a) = U (a) U_{1} (a) / μ - n_{H} U (a) Y / μ^{2}

, and so, from (A15) and (A16), we see that

\begin{matrix} E (U (a) W (a)) & = \frac{1}{μ} (n_{H} - A n_{med} - \frac{1}{2} a m A + \frac{1}{2} a m A^{2}) - \frac{n_{H}}{μ^{2}} (n_{H} - A n_{med}) \\ = I S_{H} (1 - I S_{H}) - A I S_{H} n_{med} / μ - a m A (1 - A) / (2 μ) . \end{matrix}

(A17)

So, from (A12)–(A14), and (A17), we obtain the result

\begin{matrix} Asy var ({\hat{RMI}}_{H}) = Var (R (a) / P S_{H}) = \\ \frac{1}{P S_{H}^{2}} [(2 I S_{H} - 1) n_{2, H} / μ^{2} + n_{H}^{2} μ_{2} / μ^{4} + \frac{1}{4} A^{2} {({RMI}_{H} - a m / μ)}^{2} \\ - I S_{H}^{2} + I S_{H}^{2} / P S_{H} - 2 I S_{H}^{2} / P S_{H}^{2} + I S_{H} A ({RMI}_{H} - a m / μ) (2 n_{med} / μ - 1)] . \end{matrix}

(A18)

Although we can derive the asymptotic variance of

{\hat{RMI}}_{M}

along similar lines as above for

{\hat{RMI}}_{L}

and

{\hat{RMI}}_{H}

, this leads to expressions that are neither simple nor intuitive. A simpler procedure is to note from (49) that the asymptotic variance of

{\hat{RMI}}_{M}

is equal to

\frac{1}{P S_{M}^{2}} [Asy var ({\hat{I S}}_{M}) + {RMI}_{M}^{2} Asy var ({\hat{P S}}_{M}) - 2 {RMI}_{M} Asy cov ({\hat{I S}}_{M}, {\hat{P S}}_{M})] .

The asymptotic variances of

{\hat{I S}}_{M}

and

{\hat{P S}}_{M}

are given by (32) and (17), respectively; see also the summary of results Section 2.4.

The asymptotic covariance of

{\hat{P S}}_{M}

and

{\hat{I S}}_{M}

is the covariance of

U (a) - U (b)

and

W (a) - W (b)

. The details of the calculation of the covariance are in the next subsection (e) of this Appendix A.

(e): Covariance of ${\hat{P S}}_{M}$ and ${\hat{I S}}_{M}$

Recall that what we need is the covariance of

U (a) - U (b)

and

W (a) - W (b)

. With

C = A - B

, we have

\begin{matrix} U (a) - U (b) & = I (b m < Y \leq a m) - C I (Y \leq m), and \\ W (a) - W (b) & = \frac{1}{μ^{2}} [Y μ I (b m < Y \leq a m) - Y (n_{H} - n_{L}) - μ m (a A - b B) I (Y \leq m)], \end{matrix}

from which, we see that

\begin{matrix} (U (a) - U (b)) (W (a) - W (b)) = \frac{1}{μ^{2}} [Y μ I (b m < Y \leq a m) - Y (n_{H} - n_{L}) I (b m < Y \leq a m) \\ - μ m (a A - b B) I (b m < Y \leq m) - μ C Y I (b m < Y \leq m) + C Y (n_{H} - n_{L}) I (Y \leq m) \\ + μ C m (a A - b B) I (Y \leq m)] . \end{matrix}

Thus,

\begin{matrix} E (U (a) - U (b)) & = P S_{M} - C / 2, and \\ E (W (a) - W (b)) & = - m (a A - b B) / (2 μ), \end{matrix}

and

\begin{matrix} E [(U (a) - U (b)) (W (a) - W (b))] = I S_{M} (1 - I S_{M}) - \frac{1}{2 μ} m (a A - b B) (1 - 2 P S_{L}) \\ + \frac{C}{μ} [I S_{M} n_{med} - n_{med} + n_{L} + \frac{1}{2} m (a A - b B)] . \end{matrix}

Therefore,

\begin{matrix} cov ({\hat{P S}}_{M}, {\hat{I S}}_{M}) = E [(U (a) - U (b)) (W (a) - W (b))] - E (U (a) - U (b)) E (W (a) - W (b)) \\ = I S_{M} (1 - I S_{M}) - \frac{1}{2 μ} m (a A - b B) (P S_{H} - P S_{L}) \\ + \frac{C}{μ} [I S_{M} n_{med} - n_{med} + n_{L} + \frac{1}{4} m (a A - b B)] . \end{matrix}

(f): Covariances of estimates of income group means

For the purposes of evaluating the reliability of income polarization estimates,

{\hat{μ}}_{H} - {\hat{μ}}_{L}

,

{\hat{μ}}_{H} - {\hat{μ}}_{M}

, and

{\hat{μ}}_{M} - {\hat{μ}}_{L}

, it is necessary to calculate the asymptotic covariances of the income group means. For the case of

{\hat{μ}}_{H} - {\hat{μ}}_{L}

, we use the result that

Asy cov ({\hat{μ}}_{H}, {\hat{μ}}_{L}) = E [X (b) X (a)] - E [X (b)] E [X (a)] .

By use of the same approach to the evaluation of asymptotic variances for income group means as set out in Section 2 one obtains

Asy cov ({\hat{μ}}_{H}, {\hat{μ}}_{L}) = \frac{1}{4 P S_{L} \cdot P S_{H}} (μ_{L} - b m) (a m - μ_{H}) A B .

(A19)

Since

μ_{L} < b m

and

μ_{H} > a m

, it follows that this is strictly positive.

For the case of

μ_{M} - μ_{L}

, we have

\begin{matrix} Asy cov ({\hat{μ}}_{M}, {\hat{μ}}_{L}) & = E [X (b) X (b, a)] - E [X (b)] E [X (b, a)] \\ = \frac{1}{4 P S_{L} \cdot P S_{M}} (μ_{L} - b m) B C \\ + \frac{1}{2 P S_{L} \cdot P S_{M}} (μ_{L} - b m) B (μ_{med} + P S_{L} (μ_{M} - μ_{L})) . \end{matrix}

(A20)

For

μ_{H} - μ_{M}

, we have

\begin{matrix} Asy cov ({\hat{μ}}_{H}, {\hat{μ}}_{M}) & = E [X (a) X (b, a)] - E [X (a)] E [X (b, a)] \\ = \frac{1}{4 P S_{H} \cdot P S_{M}} (a m - μ_{H}) A C \\ + \frac{1}{P S_{H} \cdot P S_{M}} (a m - μ_{H}) A (P S_{L} (μ_{M} - μ_{L}) - (μ_{M} - μ_{med}) / 2) . \end{matrix}

(A21)

(g): Compound measures

Throughout this section, the results collected in the Table A1 will be freely used in the calculations.

Each of the compound measures in Section 3.2 involves the product of two terms, for instance,

C P_{L} \equiv P S_{L} (μ_{M} - μ_{L}) .

We see that

\begin{matrix} Asy var ({\hat{C P}}_{L}) & = {(μ_{M} - μ_{L})}^{2} Asy var ({\hat{P S}}_{L}) \\ + P S_{L}^{2} (Asy var ({\hat{μ}}_{M}) + Asy var ({\hat{μ}}_{L}) - 2 Asy cov ({\hat{μ}}_{M}, {\hat{μ}}_{L})) \\ + 2 C P_{L} (Asy cov ({\hat{P S}}_{L}, {\hat{μ}}_{M}) - Asy cov ({\hat{P S}}_{L}, {\hat{μ}}_{L})) . \end{matrix}

All of the asymptotic variances above are given in Section 2.4 and the covariance of

{\hat{μ}}_{M}

and

{\hat{μ}}_{L}

in Equation (A20). What remains is to compute the two asymptotic covariances with

{\hat{P S}}_{L}

.

First we consider

Asy cov ({\hat{P S}}_{L}, {\hat{μ}}_{L})

. It is equal to the covariance of

U (b)

in (4) and

X (b)

in (33):

\begin{matrix} cov (U (b), & X (b)) = E (U (b) X (b)) - E (U (b)) E (X (b)) \\ = \frac{1}{P S_{L}} [E (U (b) U_{1} (b)) - μ_{L} E (U^{2} (b))] + \frac{B}{2 P S_{L}} (b m - μ_{L}) (P S_{L} - B / 2) \\ = \frac{1}{P S_{L}} [n_{L} (1 - B) - b m B P S_{L} + b m B^{2} / 2 - μ_{L} P S_{L} (1 - 2 B) - μ_{L} B^{2} / 2] \\ = \frac{1}{4} B (b m - μ_{L}) (B / P S_{L} - 2) . \end{matrix}

(A22)

In similar fashion, the asymptotic covariance of

{\hat{P S}}_{L}

and

{\hat{μ}}_{M}

is the covariance of

U (b)

and

X (b, a)

in (44):

cov (U (b), X (b, a)) = E (U (b) X (b, a)) - E (U (b)) E (X (b, a)) .

Here,

E (U (b)) E (X (b, a)) = \frac{D}{4 P S_{M}} (2 P S_{L} - B),

while

\begin{matrix} E (U (b) X (b, a)) & = \frac{1}{P S_{M}} E (U (b) U_{1} (a) - U (b) U_{1} (b) - μ_{M} U (b) U (a) + μ_{M} U^{2} (b)) \\ = \frac{1}{P S_{M}} [B (n_{L} - n_{med} + μ_{M} / 2) + \frac{1}{2} D (2 P S_{L} - B)]; \end{matrix}

recall the definition (46) of D. Thus,

cov (U (b), X (b, a)) = \frac{1}{P S_{M}} [B (n_{L} - n_{med} + μ_{M} / 2) + \frac{1}{4} D (2 P S_{L} - B)] .

(A23)

Consider next the case of

C P_{H} \equiv P S_{H} (μ_{H} - μ_{M})

, for which we need the asymptotic covariances with

{\hat{P S}}_{H}

of

{\hat{μ}}_{H}

and

{\hat{μ}}_{M}

. The first of these is

- Asy cov (1 - {\hat{P S}}_{H}, {\hat{μ}}_{H}) = - cov (U (a), X (a)),

which, after some algebra, becomes

\frac{1}{4} A (μ_{H} - a m) (2 - A / P S_{H}) .

(A24)

Similarly, the asymptotic covariance of

{\hat{P S}}_{H}

and

{\hat{μ}}_{M}

is

- cov (U (a), X (b, a))

, where

cov (U (a), X (b, a)) = \frac{1}{P S_{M}} [\frac{1}{4} D (2 P S_{H} - A) - A (n_{med} - n_{L}) - A P S_{L} μ_{M})] .

(A25)

The last compound polarization measure defined in Section 3.2 is

C P

, which was defined as

(1 - P S_{M}) (μ_{H} - μ_{L})

. For this, we need the covariances with

{\hat{P S}}_{M}

of

{\hat{μ}}_{H}

and

{\hat{μ}}_{L}

. First,

\begin{matrix} Asy cov ({\hat{P S}}_{M}, {\hat{μ}}_{L}) = cov (U (a) - U (b), X (b)) = \frac{1}{P S_{L}} cov [U (a) - U (b), U_{1} (b) - μ_{L} U (b)] \\ = \frac{B}{4 P S_{L}} (b m - μ_{L}) (C + 2 (P S_{L} - P S_{H})) . \end{matrix}

In addition,

\begin{matrix} Asy cov ({\hat{P S}}_{M}, {\hat{μ}}_{H}) & = cov (U (a) - U (b), X (a)) \\ = \frac{1}{P S_{H}} cov [U (a) - U (b), Y - U_{1} (a) - μ_{H} (1 - U (a))] \\ = \frac{A}{4 P S_{H}} (μ_{H} - a m) (C + 2 (P S_{L} - P S_{H})) . \end{matrix}

(A26)

Finally, consider the compound middle-class measure

C M = P S_{M} μ_{M}

. Then,

Asy var (\hat{C M}) = μ_{M}^{2} Asy var ({\hat{P S}}_{M}) + P S_{M}^{2} Asy var ({\hat{μ}}_{M}) + 2 C M Asy cov ({\hat{P S}}_{M}, {\hat{μ}}_{M}) .

As before,

Asy var ({\hat{P S}}_{M})

is given by (17), and

Asy var ({\hat{μ}}_{M})

is given by (45). For the covariance,

Asy cov ({\hat{P S}}_{M}, {\hat{μ}}_{M}) = cov (U (a) - U (b), X (b, a)) .

For this,

cov (U (a), X (b, a))

is given by (A25) and

cov (U (b), X (b, a))

by (A23).

Appendix B. Density Estimation on the Positive Real Line

In most applications, the support of the distribution F is a subset of the positive real line. However, it is known that, in this case, ordinary kernel density estimates are biased downwards. A possible way around this difficulty is to transform the data, by taking logarithms for instance, and getting kernel density estimates of the transformed data, which can then be multiplied by the Jacobian of the transformation to obtain estimates of the density of the positive data.

A better approach is suggested by Comte and Genon-Catalot (2012), where it is unnecessary to transform the data. Here is a brief description of their approach, roughly quoted from their paper. Instead of a Gaussian or Epanechnikov kernel defined for both positive and negative arguments, consider a density function

K (u)

defined on the positive real line, with expectation equal to 1. Let

U_{1}, \dots, U_{n}

be an IID set of random variables with distribution characterized by the density K. Then, the density of the mean

\bar{U} = (U_{1} + \dots + U_{n}) / n

is given by

K_{n} (u) = n K^{* n} (n u)

, where

K^{* n}

is the n-fold convolution of K with itself. As

n \to \infty

, the distribution with density

K_{n} (u)

converges to a point mass at 1. The proposal is to estimate the density

f (x)

for

x > 0

by

{\hat{f}}_{n} (x) = \frac{1}{N x} \sum_{i = 1}^{N} K_{n} (y_{i} / x),

(A27)

using the random sample

y_{i}

,

i = 1, \dots, N

. The motivation they give is as follows:

In usual kernel methods, the intuition is that the estimation at x counts the number of observations $X_{k}$ such that $X_{k} - x$ is close to 0. In our strategy, the intuition is that the estimator at x counts the number of observations $X_{k}$ such that $X_{k} / x$ is close to 1.

They also point out that

n^{- 1 / 2}

plays the same role here as does the bandwidth in conventional kernel methods.

The paper provides some examples of functions K for which the corresponding

K_{n}

can be computed analytically. The easiest of these has K equal to the density of the exponential distribution, which is also the gamma distribution with parameter unity:

K (u) = e^{- u}

, from which it can be shown that

K_{n} (u) = \frac{1}{Γ (n)} e^{- n u} n^{n} u^{n - 1} .

With this choice, (A27) becomes

{\hat{f}}_{n} (x) = \frac{n^{n}}{x N Γ (n)} \sum_{i = 1}^{N} exp (- n y_{i} / x) {(y_{i} / x)}^{n - 1} .

Asymptotic theory requires that

n \to \infty

as

N \to \infty

, but the guidelines as to how fast or how slowly in Comte and Genon-Catalot are very loose:

n = k^{2} : log (N) \leq k \leq N / log (N) .

In Section 4, we discuss how we chose n for the datasets considered in the empirical work.

We conducted a small simulation experiment to see to what extent the approach of Comte and Genon-Catalot described here is reliable and to compare its performance with that of conventional kernel density estimates. We used the Epanechnikov kernel with bandwidth proportional to

n^{- 1 / 5}

times the interquartile range of the sample of size n. For 10,000 replications with samples of size 10,001 (an odd number, so that the sample median is uniquely defined) drawn from the lognormal distribution, we computed realizations of

f (b m)

,

f (m)

, and B for

b = 0.3

, with the densities estimated as kernel density estimates and as described here, and compared them with the true values for the lognormal distribution. The results, shown below, leave no doubt as to the reliability of the method described here and to the unreliability of the kernel density estimates.

	$f (bm)$	$f (m)$	$B$
True value	0.644203	0.398942	0.484433
Kernel density estimate	0.533028	0.403896	0.396132
This method	0.640649	0.401252	0.482013

Appendix C. Algorithm

Here is a detailed algorithm for the computation of estimates of the numerous measures presented in this paper and of their standard errors.

Select the cut-off parameters a and b needed to define the three income groups. (We used $b = 0.5$ , $a = 2$ .)
Choose a base unit of account. (Here, it has been thousands of 2005 constant Canadian dollars.) Convert raw income measures in the sample to the chosen unit of account, and sort the converted data.
Compute the mean income $\hat{μ}$ , the mean squared income ${\hat{μ}}_{2}$ , and the variance ${\hat{σ}}^{2}$ of the sample.
Compute the sample median $\hat{m}$ and the two cut-off incomes, $b \hat{m}$ and $a \hat{m}$ .
By use of the approach described in Appendix B, or otherwise, obtain the estimates $\hat{f} (b \hat{m})$ and $\hat{f} (a \hat{m})$ of the density at the cut-off incomes, and the estimates $\hat{A}$ and $\hat{B}$ .
Count the number of data points with incomes in the three groups defined, respectively, by $[0, b \hat{m}]$ , $(b \hat{m}, a \hat{m}]$ , and $(a \hat{m}, \infty)$ , and divide these numbers by the sample size N in order to obtain ${\hat{P S}}_{L}$ , ${\hat{P S}}_{M}$ , and ${\hat{P S}}_{H}$ .
Compute asymptotic standard errors for the estimated population shares using the formulas in Section 2.4.
Obtain estimates ${\hat{n}}_{L}$ , ${\hat{n}}_{med}$ , and ${\hat{n}}_{H}$ of the quantities $n (b m)$ , $n (m)$ , and $n (a m)$ , respectively. This can be achieved by averaging the incomes in the low-income group, incomes less than the median, and those in the low- and middle-income groups combined, respectively. In addition, obtain estimates ${\hat{n}}_{2, L}$ and ${\hat{n}}_{2, H}$ by averaging squared incomes in the relevant groups.
Compute the estimated income shares: ${\hat{I S}}_{L} = {\hat{n}}_{L} / \hat{μ}$ ; ${\hat{I S}}_{M} = ({\hat{n}}_{H} - {\hat{n}}_{L}) / \hat{μ}$ ; ${\hat{I S}}_{H} = 1 - {\hat{n}}_{H} / \hat{μ}$ .
Compute the estimated income group means: ${\hat{μ}}_{L} = {\hat{n}}_{L} / {\hat{P S}}_{L}$ , ${\hat{μ}}_{M} = ({\hat{n}}_{H} - {\hat{n}}_{L}) / {\hat{P S}}_{M}$ , ${\hat{μ}}_{H} = (\hat{μ} - {\hat{n}}_{H}) / {\hat{P S}}_{H}$ , and ${\hat{μ}}_{med} = 2 {\hat{n}}_{med}$ . In addition, obtain ${\hat{μ}}_{2, L} = {\hat{n}}_{2, L} / {\hat{P S}}_{L}$ , ${\hat{μ}}_{2, M} = ({\hat{n}}_{2, H} - {\hat{n}}_{2, L}) / {\hat{P S}}_{M}$ , ${\hat{μ}}_{2, H} = ({\hat{μ}}_{2} - {\hat{n}}_{2, H}) / {\hat{P S}}_{H}$ .
Compute the estimated relative mean income ratios using (48).
Obtain the estimated asymptotic variances for population shares, income shares, and group mean incomes by use of the formulas in Section 2.4. For the relative mean income ratios, estimated asymptotic covariances are given by (A11), (A18), and (53).
Standard errors are found by dividing the asymptotic variances by the sample size N and taking square roots.
The above computations provide all information necessary for the polarization measures introduced in Section 3.

Table A1. Table of expectations.

Reference	Random Variable	Expectation
	Y	$μ$
	$Y^{2}$	$μ_{2}$
(5)	$U (b)$	$P S_{L} - B / 2$
(5)	$U (a)$	$1 - P S_{H} - A / 2$
	$U_{1} (b)$	$n_{L} - b m B / 2$
	$U_{1} (a)$	$n_{H} - a m A / 2$
(A9)	$U (b) Y$	$n_{L} - B n_{med}$
(A16)	$U (a) Y$	$n_{H} - A n_{med}$
(26)	$U_{1} (b) Y$	$n_{2, L} - b m B n_{med}$
(26)	$U_{1} (a) Y$	$n_{2, H} - a m A n_{med}$
(11)	$U (a) U (b)$	$P S_{L} (1 - A) - \frac{1}{2} B (1 - A)$
	$U_{1} (a) U (b)$	$P S_{L} μ_{L} + a m A B / 2 - a m A P S_{L} - B n_{med}$
	$U (a) U_{1} (b)$	$P S_{L} μ_{L} (1 - A) + b m A B / 2 - b m B / 2$
	$U_{1} (a) U_{1} (b)$	$P S_{L} μ_{2, L} + m^{2} a b A B / 2 - b m B μ_{med} / 2 - a m A P S_{L} μ_{L}$
(7)	$U^{2} (b)$	$P S_{L} (1 - 2 B) + B^{2} / 2$
(7)	$U^{2} (a)$	$(1 - P S_{H}) + A^{2} / 2 - A$
(22)	$U_{1}^{2} (b)$	$n_{2, L} + {(b m B)}^{2} / 2 - 2 b m B n_{L}$
	$U_{1}^{2} (a)$	$n_{2, H} + {(a m A)}^{2} / 2 - 2 a m A n_{med}$
(A8)	$U (b) U_{1} (b)$	$n_{L} (1 - B) - b m B P S_{L} + b m B^{2} / 2$
(A15)	$U (a) U_{1} (a)$	$n_{H} - A (a m + 2 n_{med}) / 2 + a m A^{2} / 2$
(25)	$W (b)$	$- b m B / (2 μ)$
	$W (a)$	$- a m A / (2 μ)$
	$W^{2} (b)$	$μ^{- 2} [n_{2, L} + {(b m B)}^{2} / 2 - 2 b m B n_{L}] + μ^{- 4} n_{L}^{2} μ_{2} - 2 μ^{- 3} n_{L} [n_{2, L} - b m B n_{med}]$
(A1)	$X (b)$	$B (μ_{L} - b m) / (2 P S_{L})$
(40)	$X (a)$	$- A (μ_{H} - a m) / (2 P S_{H})$
(A6)	$X (b, a)$	$D / (2 P S_{M})$

Appendix D. Simulation Evidence

Simulations were run in order to see to what extent the numerous estimates produced by the algorithm do indeed approximate finite-sample properties. The simulated data were generated using a lognormal distribution The simulated samples contained

n = 1001

IID drawings from this distribution. As in the empirical work Section 4, the parameters a and b are set to 2.0 and 0.5, respectively. The true values of all the estimated properties are readily computed for the lognormal distribution.

For each of 100,000 replications, realizations were obtained for

{\hat{P S}}_{i}

,

{\hat{I S}}_{i}

, and

{\hat{μ}}_{i}

, for

i = L, M, H

. The variances of these realizations were computed and then multiplied by the sample size n, since the theoretical work concerns asymptotic variances. The estimates of the theoretical asymptotic variances, as given in the summary of results, Section 2.4, were also computed for each replication and then averaged over all of them. In some cases, a second estimate of an asymptotic variance was obtained for each replication as the sample variance of quantities like the

{\hat{u}}_{i} (b)

defined in (6). These too are averaged over the replications. In Table A2 below, the averages of the point estimates are given and in Table A3 the averages of the variance estimates.

Table A2. Point estimates.

	$PS$	$IS$	$μ$
Value for low incomes	0.2441	0.0452	0.3054
Estimated value	0.2439	0.0453	0.3052
Value for middle incomes	0.5118	0.3343	1.0768
Estimated value	0.5122	0.3352	1.0777
Value for high incomes	0.2441	0.6205	4.1910
Estimated value	0.2439	0.6195	4.1926

Table A3. Estimates of asymptotic variances.

	$PS$	$IS$	$μ$
${Var}_{L} (α)$	0.1472	0.0104	0.1551
${Var}_{L} (β)$	0.1467	0.0103	0.1548
${Var}_{L} (γ)$	0.1480	0.0104	0.1546
${Var}_{L} (δ)$	0.1483	0.0105	0.1547
${Var}_{M} (α)$	0.2499	0.4282	2.4579
${Var}_{M} (β)$	0.2512	0.4317	2.4434
${Var}_{M} (γ)$	0.2519	0.4359	2.4882
${Var}_{M} (δ)$	0.2522	0.4327	2.4956
${Var}_{H} (α)$	0.1472	0.4938	52.6442
${Var}_{H} (β)$	0.1475	0.4934	52.6775
${Var}_{H} (γ)$	0.1504	0.4971	53.1611
${Var}_{H} (δ)$	0.1504	0.4976	53.2143

The asymptotic variances denoted

{Var}_{i} (α)

for

i = L, M, H

are the theoretical variances as described in the summary of results Section 2.4 with the true values computed for the lognormal distribution; those denoted

{Var}_{i} (β)

are the variances of the sets of point estimates from all the replications; those denoted

{Var}_{i} (γ)

are the estimates of the theoretical variances averaged over the replications; and those denoted

{Var}_{i} (δ)

are the sample variances of quantities like the

{\hat{u}}_{i} (b)

in (6), again averaged over the replications.

Note

1	The authors wish to thank an anonymous referee for raising this question.

References

Bahadur, R. R. (1966). A note on quantiles in large samples. Annals of Mathematical Statistics, 37, 577–580. [Google Scholar] [CrossRef]
Beach, C. M. (2016). Changing income inequality: A distributional paradigm for Canada. Canadian Journal of Economics, 49(4), 1229–1292. [Google Scholar] [CrossRef]
Beach, C. M. (2025). Testing for canadian distributional change: Declining middle class, rising top income shares and widening income gaps. Department of Economics (Working Paper No. 1531). Queen’s University. [Google Scholar]
Beach, C. M., & Davidson, R. (2025). Quantile means and quantile share standard errors and a toolbox of distributional statistics. Econometric Reviews, 44, 1166–1185. [Google Scholar] [CrossRef]
Blanchet, T., Saez, E., & Zucman, G. (2022). Real-time inequality. Working Paper 30229. NBER. [Google Scholar]
Comte, F., & Genon-Catalot, V. (2012). Density estimation for non negative random variables. Journal of Statistical Planning and Inference, 142, 1698–1715. [Google Scholar] [CrossRef]
Davidson, R. (2018). Statistical inference on the canadian middle class. Econometrics, 6(1), 14. [Google Scholar] [CrossRef]
Guvenen, F., Pistaferri, L., & Violante, G. L. (2022). Global trends in income inequality and income dynamics: New insights from GRID. Quantitative Economics, 13, 1321–1360. [Google Scholar] [CrossRef]
Hoffman, F., Lee, D. S., & Lemieux, T. (2020). Growing income inequality in the United States and other advanced economies. Journal of Economic Perspectives, 34, 52–78. [Google Scholar] [CrossRef]
Horowitz, J. L. (2001). The bootstrap. In J. L. Heckman, & E. Leamer (Eds.), Handbook of econometrics (Vol. 5, pp. 3159–3228). Elsevier Science, B.V. [Google Scholar]
Katz, L. F., & Murphy, K. M. (1992). Changes in relative wages, 1963–1987: Supply and demand factors. The Quarterly Journal of Economics, 107, 35–78. [Google Scholar] [CrossRef]
Lin, P.-E., Wu, K.-T., & Ahmad, I. A. (1980). Asymptotic joint distributions of sample quantiles and sample mean with applications. Communications in Statistics-Theory and Methods, 9(1), 51–60. [Google Scholar] [CrossRef]
Wolfson, M. C. (1994). When inequalities diverge. American Economic Review, 84, 353–358. [Google Scholar]

Table 2. Women in 2000.

	$\hat{ξ}$	$\hat{PS}$	$\hat{IS}$	$\hat{μ}$	$\hat{RMI}$
L	11.1937	0.2925	0.0564	5.2879	0.1929
		(0.0008)	(0.0002)	(0.0200)	(0.0006)
M	22.3874	0.5296	0.5205	26.9463	0.9829
	(0.0640)	(0.0013)	(0.0022)	(0.0828)	(0.0022)
H	44.7748	0.1779	0.4231	65.2021	2.3783
		(0.0011)	(0.0022)	(0.1770)	(0.0038)

Sample size is 202,491,

\hat{A}

= 1.1229,

\hat{B}

= 0.6276.

Table 3. Men in 2005.

	$\hat{ξ}$	$\hat{PS}$	$\hat{IS}$	$\hat{μ}$	$\hat{RMI}$
L	17.5000	0.2742	0.0466	8.0874	0.1701
		(0.0007)	(0.0002)	(0.0267)	(0.0006)
M	35.0000	0.5538	0.4762	40.8862	0.8598
	(0.0828)	(0.0012)	(0.0021)	(0.1031)	(0.0027)
H	70.0000	0.1719	0.4772	131.9640	2.7752
		(0.0009)	(0.0022)	(0.7438)	(0.0088)

Sample size is 238,356,

\hat{A}

= 0.9659,

\hat{B}

= 0.5133.

Table 4. Women in 2005.

	$\hat{ξ}$	$\hat{PS}$	$\hat{IS}$	$\hat{μ}$	$\hat{RMI}$
L	12.0000	0.3034	0.0605	5.9934	0.1993
		(0.0007)	(0.0002)	(0.0195)	(0.0006)
M	24.0000	0.5190	0.4867	28.2055	0.9378
	(0.0670)	(0.0012)	(0.0020)	(0.0822)	(0.0023)
H	48.0000	0.1775	0.4528	76.7076	2.5504
		(0.0010)	(0.0021)	(0.2834)	(0.0056)

Sample size is 218,253,

\hat{A}

= 1.0437,

\hat{B}

= 0.6432.

Table 5. Differences men–women in 2000.

	$Δ \hat{PS}$	$Δ \hat{IS}$	$Δ \hat{μ}$	$Δ \hat{RMI}$
L	−0.0224	−0.0064	2.4709	−0.0078
	(0.0011)	(0.0003)	(0.0337)	(0.0008)
M	0.0516	0.0540	14.4908	0.0057
	(0.0018)	(0.0029)	(0.1251)	(0.0029)
H	−0.0292	−0.0476	40.6221	0.1465
	(0.0014)	(0.0029)	(0.3500)	(0.0059)

Table 6. Differences men–women in 2005.

	$Δ \hat{PS}$	$Δ \hat{IS}$	$Δ \hat{μ}$	$Δ \hat{RMI}$
L	−0.0292	−0.0138	2.0940	−0.0292
	(0.0010)	(0.0003)	(0.0331)	(0.0009)
M	0.0348	−0.0106	12.6807	−0.0780
	(0.0017)	(0.0029)	(0.1319)	(0.0035)
H	−0.0056	0.0244	55.2564	0.2248
	(0.0014)	(0.0030)	(0.7949)	(0.0104)

Table 7. Differences 2000–2005 for men.

	$Δ \hat{PS}$	$Δ \hat{IS}$	$Δ \hat{μ}$	$Δ \hat{RMI}$
L	0.0041	−0.0034	0.3285	−0.0150
	(0.0010)	(0.0003)	(0.0380)	(0.0008)
M	−0.0273	−0.0983	−0.5509	−0.1288
	(0.0017)	(0.0028)	(0.1394)	(0.0032)
H	0.0232	0.1017	26.1398	0.2504
	(0.0013)	(0.0029)	(0.8027)	(0.0099)

Table 8. Differences 2000–2005 for women.

	$Δ \hat{PS}$	$Δ \hat{IS}$	$Δ \hat{μ}$	$Δ \hat{RMI}$
L	0.0109	0.0040	0.7054	0.0064
	(0.0011)	(0.0003)	(0.0279)	(0.0008)
M	−0.0105	−0.0338	1.2592	−0.0451
	(0.0018)	(0.0030)	(0.1167)	(0.0032)
H	−0.0004	0.0297	11.5055	0.1721
	(0.0015)	(0.0031)	(0.3316)	(0.0067)

Table 9. Measures of polarization I.

	${\hat{PS}}_{L}$	${\hat{PS}}_{H}$	${\hat{PS}}_{L} + {\hat{PS}}_{H}$
Men, 2000	0.2702	0.1487	0.4189
	(0.0007)	(0.0009)	(0.0012)
Women, 2000	0.2925	0.1779	0.4704
	(0.0008)	(0.0011)	(0.0013)
Men, 2005	0.2742	0.1719	0.4462
	(0.0007)	(0.0009)	(0.0012)
Women, 2005	0.3034	0.1775	0.4810
	(0.0007)	(0.0010)	(0.0012)

Table 10. Measures of polarization II.

	${\hat{μ}}_{H} - {\hat{μ}}_{M}$	${\hat{μ}}_{M} - {\hat{μ}}_{L}$	${\hat{μ}}_{H} - {\hat{μ}}_{L}$
Men, 2000	64.3871	33.6783	98.0654
	(0.2785)	(0.1080)	(0.2912)
Women, 2000	38.2558	21.6584	59.9142
	(0.1659)	(0.0930)	(0.1664)
Men, 2005	91.0779	32.7988	123.8767
	(0.7251)	(0.1169)	(0.7356)
Women, 2005	48.5021	22.2121	70.7142
	(0.2687)	(0.0925)	(0.2722)

Table 11. Compound polarization measures.

	${\hat{CP}}_{L}$	${\hat{CP}}_{H}$	$\hat{CP}$
Men, 2000	9.0985	9.5755	41.0773
	(0.0305)	(0.1294)	(0.0995)
Women, 2000	6.3360	6.8053	28.1855
	(0.0296)	(0.0784)	(0.0705)
Men, 2005	8.9947	15.6603	55.2714
	(0.0339)	(0.1704)	(0.1504)
Women, 2005	6.7398	8.6109	34.0111
	(0.0307)	(0.0961)	(0.0824)

Table 12. Changes in polarization measures 2000–2005.

	$Δ {\hat{CP}}_{L}$	$Δ {\hat{CP}}_{H}$	$Δ \hat{CP}$
Men	−0.1039	6.0848	14.1941
	(0.0456)	(0.2140)	(0.1804)
Women	0.4038	1.8056	5.8256
	(0.0426)	(0.1240)	(0.1084)

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Beach, C.M.; Davidson, R. A Statistical Characterization of Median-Based Inequality Measures. Econometrics 2025, 13, 31. https://doi.org/10.3390/econometrics13030031

AMA Style

Beach CM, Davidson R. A Statistical Characterization of Median-Based Inequality Measures. Econometrics. 2025; 13(3):31. https://doi.org/10.3390/econometrics13030031

Chicago/Turabian Style

Beach, Charles M., and Russell Davidson. 2025. "A Statistical Characterization of Median-Based Inequality Measures" Econometrics 13, no. 3: 31. https://doi.org/10.3390/econometrics13030031

APA Style

Beach, C. M., & Davidson, R. (2025). A Statistical Characterization of Median-Based Inequality Measures. Econometrics, 13(3), 31. https://doi.org/10.3390/econometrics13030031

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Statistical Characterization of Median-Based Inequality Measures

Abstract

1. Introduction

2. Basic Asymptotic Analysis

2.1. Population Shares

2.2. Income Shares

2.3. Income Group Means

2.4. Summary of Main Results

2.4.1. Population Shares

2.4.2. Income Shares

2.4.3. Income Group Means

3. Inference on Related Distributional Statistics

3.1. Relative Mean Income Ratios

3.2. Polarization Measures

3.3. Mean–Decile Functions

3.4. Relation with the Bootstrap

4. Empirical Study

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

Appendix A. Detailed Calculations

Appendix B. Density Estimation on the Positive Real Line

Appendix C. Algorithm

Appendix D. Simulation Evidence

Note

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI