1. Introduction
Two major distributional changes have characterized many developed economies since around 1980: declining middle-class incomes and rising top incomes
Hoffman et al. (
2020);
Blanchet et al. (
2022); and
Guvenen et al. (
2022). For example, in the case of full-time full-year workers in Canada between 1970 and 2005, the proportion of workers who received middle-class earnings fell by 11.5 percentage points among men (from 74.3 to 62.8 percent) and by 13.4 percentage points among women (from 76.5 to 63.1 percent), while the proportion of higher earners rose by 3.4 percentage points for men and 4.9 percentage points for women, and the proportion of lower-earning workers went up by 5.1 and 5.7 points, respectively. Over the same period, the corresponding shares of total earnings received by middle-class earners fell by 16.9 points for men and 17.8 points for women, while the earnings shares of higher earners rose by over 13 percentage points for both men (18.5 to 32.0 percent) and women (11.4 to 25.0 percent) (
Beach, 2016, Tables 1 and 6). It would clearly be useful to be able to capture both of these sets of changes efficiently in a simple empirical framework that allows for a conventional statistical inference methodology, so one can test for the statistical significance of such changes over time.
The distributional measures that are typically used to examine these patterns of distributional change are the income shares of middle- and upper-income groups, the relative sizes of these groups, and the relative incomes of these groups. In examining these changes,
Beach (
2016) demonstrated the usefulness of characterizing the income groups in terms of their relationship to the median income level. So, for example, the middle-income group (
M) could be defined as including those with incomes between, say, fifty percent and two hundred percent of the median, the upper group (
H) as those with incomes above twice the median, and the lower group (
L) as those with incomes below half the median. This allows one to obtain separate estimates for group income shares (
,
) and for the proportion of recipients within the group (or population share)
, as well as for the group mean incomes (
). This distributional framework allows a more insightful interpretation of distributional change, since one can then analyze both the size (
) and the relative prosperity (
) of the income group separately. (Percentile- or quantile-based measures, by construction, assign the size of the income groups as a prespecified percentage such as the top decile or 10% of all income recipients.) Characterizing group size and prosperity allows one to capture the quantity dimension of a change in the group’s total income separately from the income per recipient. This in turn can be used to help identify the relative strength of demand-side or supply-side driving factors behind observed distributional change
Katz and Murphy (
1992). Such insights, though, have heretofore been based on the relative magnitude of these effects, not on their statistical significance. This framework also allows for a richer and more extensive set of measures of income polarization, in terms of both quantity and relative income dimensions at the tails of the distribution.
Any summary or scalar inequality index (such as the Gini coefficient) does not capture the complex mix of distributional changes that have been occurring and does not allow one to identify where the major changes are occurring (and hence possible appropriate policy concerns). A three-way (or more detailed) distributional characterization of these income inequality changes is required.
Davidson (
2018) provided an empirical approach to calculate asymptotic variances and covariances for sample estimates of
and
for middle-group income recipients within the median-based empirical framework, thus enabling formal statistical inference on these measures. The present paper extends Davidson’s statistical analysis to apply to lower- and upper-income groups as well (all defined in terms of the median), so that one can examine a full set of population subsets covering an income distribution (i.e., for
L,
M, and
H subsets) jointly. The analysis shows how this approach leads to explicit formulas for asymptotic variances and standard errors, which can be easily programmed, for
and
, for all of
income groups. The paper extends the set of distributional measures to a relative mean income statistic
, where
is the mean of group
i incomes, and
is the overall population mean, and also to
itself, so that one can test for the statistical significance of growing income gaps between income groups.
The paper thus proposes a general framework for median-based income inequality analysis, based on asymptotic statistical inference. The derived formulas for variances and covariances of the various statistics are directly empirically applicable to available public microdata files such as those commonly used by research and public policy analysts. The present study serves as a complement to a separate piece by the authors (
Beach & Davidson, 2025) that developed a comparable framework for inequality measures, based on quantile income shares as typically published by government statistical agencies. Together, the two papers provide the basis for a toolbox set of calculations that can be readily implemented to allow standard statistical inference for frequently used statistics of disaggregated income inequality change.
The paper first outlines the stochastic quantile function approach to statistical inference. It then extends
Davidson’s (
2018) middle-class group results for estimated income shares and population shares to corresponding lower- and upper-income groups as well and expresses the asymptotic variance results in terms of simple explicit formulas that can be estimated from available microdata. The extension of these results to group mean income measures is also presented. In
Section 3, the results in
Section 2 are used to obtain results for relative group mean incomes, measures of polarization, and mean–decile distribution functions.
Section 4 provides an empirical application of the
Section 2 theoretical results to Canadian Census earnings data.
Section 5 summarizes the main results of the paper and notes some implications.
3. Inference on Related Distributional Statistics
This section considers three sets of distributional statistics that involve applications of the analytical results developed in the previous section. As there, we restrict attention to the case in which , thus defining three income groups: the lower group L, for incomes less than or equal to ; the middle group M, with incomes between and ; and the higher group H, with incomes greater than .
3.1. Relative Mean Income Ratios
The relative mean income for each income group is the ratio of the group’s mean income to the overall mean income of the distribution:
For example, in recent decades for many countries, the lower-income ratio
has not changed much, while the upper-income ratio
has risen substantially. It would be useful to know whether the changes in both ratios are statistically significant or only the latter.
The relative mean income ratio can be estimated directly as
However, from the definitions of
,
, and
, we have
,
, and
, and so for
,
. Thus, to leading order
In
Appendix A(d), explicit expressions are derived for the asymptotic variances of
,
. The results are as follows:
The details of the calculation of the covariance needed in (
52) are relegated to
Appendix A(e). The result is
with
.
3.2. Polarization Measures
The rise in upper incomes, resulting in a growing separation between high-income recipients and middle-class workers, has led to concern about the degree of polarization in income distributions. The concept of polarization can be viewed as having two quite distinct dimensions. One is the size dimension or relative mass at the two ends of the distribution (see for example
Wolfson (
1994)), which we label tail-frequency polarization and capture here as the proportion of recipients in the lower or higher income groups—what we are referring to here as
and
. Such measures then are
,
, and
. Asymptotic variances for the first two have already been obtained in
Section 2.4 above. For
, note that the sum of the three population shares is one, and so the asymptotic variance of
is simply that of the middle group,
, which again we already have in (
17).
The other aspect of polarization is the distance dimension or income-gap polarization, represented here by
,
, or
. Both sets of measures provide useful insights, and both can be implemented in our analytical framework. In the case of the income-gap polarization measures, again, the asymptotic variances of
,
, and
have been established in
Section 2.4.2. For the differences in income group means, recall that
for
. The three required covariances are provided in
Appendix A(f). Thus, again, standard errors of the income-gap polarization measures can be computed in the usual fashion.
One could also posit a set of compound polarization measures, which capture both of these dimensions together: , , and also .
Analogously, one could further identify a compound measure to capture the evident decline in the economic situation of the middle dlass in many countries over recent decades as . This would allow one, for example, to use logarithmic derivatives to estimate the relative importance of changes in the relative size of the middle class () versus changes in their average real incomes () in this decline.
One can use the results of
Section 2 to work out the asymptotic variances of these various estimated compound measures; see
Appendix A(g) for details.
3.3. Mean–Decile Functions
In an environment where higher incomes have been rising dramatically relative to the rest of the distribution, one measure of interest could be an indication of skewness of the distribution, as measured by the difference between the overall mean and median of the income distribution, or . However, is simply the fifth decile of the distribution. One could, more generally, define a mean–decile function.
Choose some proportions , with for . For deciles, we would have , . Let be the -quantile of the distribution: the proportion of incomes less than is , and let be the corresponding sample quantile. Possible mean–decile functions could take on values , or alternatively , for the decile of the distribution as a further way of capturing growing income differences over various ranges of the distribution.
Here, we can make use of the work of
Lin et al. (
1980). These authors show that, under general regularity conditions, the
and
are asymptotically joint normally distributed. We denote the asymptotic variance–covariance matrix by
: it is an
matrix, where the index
refers, not to a quantile, but to
. Then, for
, the elements of
are
where
is the density at
,
, and
.
Thus, for the mean–decile distribution defined in levels as
, we have
In relative or proportional terms,
Note that the density appears as such in the denominator of the above expressions rather than as a ratio
or
as elsewhere in this paper. However,
can be estimated in the same way as the other densities used; see
Appendix B. Standard errors can be calculated accordingly.
3.4. Relation with the Bootstrap
Given the fact that the bootstrap has become an almost universal tool for reliable statistical inference, it is incumbent on us to outline how the material in this paper can be used in connection with bootstrap methods. It has been suggested that the asymptotic variances and standard errors provided here are unnecessary, as they can be obtained in a finite-sample context by use of the bootstrap. However,
Horowitz (
2001) points out that naive bootstrap standard errors are unlikely to be any better than asymptotic ones and may well be worse. What he and numerous other authors recommend is using an asymptotic standard error in order to construct an asymptotically pivotal quantity by studentizing, that is, dividing the quantity of interest, supposed to have expectation zero, by its standard error. The studentized quantity can then be bootstrapped in order to obtain a bootstrap
P value for some null hypothesis, or to construct a bootstrap confidence interval for a parameter of interest.
Our results can be applied readily to such a bootstrap exercise. For instance, a test of a hypothesis that
is equal to some given value
M can be based on bootstrapping
, where
is the square root of the asymptotic variance of
given by (
17). Similarly a bootstrap confidence for
can be constructed by conventional means.
Another reason to exercise care in applying the bootstrap to the data used in this paper is set out in
Davidson (
2018). The incomes given for individuals in the census data are often, indeed usually, rounded to multiples of USD 500 or USD 1000. This means that the empirical distribution of the sample of incomes is not smooth, and this is known to cause problems for a conventional resampling bootstrap. We verified that this is the case with our samples. Asymptotic variances as given by the formulas of this paper, and variances derived from a conventional resampling bootstrap, were compared in the context of a simulation experiment that used samples of 200,000 observations realized from a lognormal distribution. The results were comparable, as might be expected with such large samples. When the same exercise was repeated with the sample of men’s incomes in 2000, the bootstrap variances were very different from the asymptotic ones.
Another point of interest for practitioners is that all the asymptotic standard errors reported in
Table 1 Fortunately, no renumbering is needed. were computed in a quarter of a second, whereas the corresponding bootstrap standard errors, with 999 bootstrap repetitions, took 80 s.
4. Empirical Study
In this section, we present results obtained using data from the Canadian Census Public Use Microdata Files (PUMF) for Individuals for 2000 and 2005, as recorded in the 2001 and 2006 censuses. We preferred these datasets to more up-to-date ones since the 2015–2020 census interval has results that are massively affected by the Canadian federal government’s response to the COVID-19 pandemic in the form of major temporary income support programs. In addition, the 2011 Census used a changed methodology (to save money) that made the income data for 2010 non-comparable to the other censuses.
We treat men and women separately, as their wages and labor-market participation rates were quite different. Accordingly, for each census year, two samples, one for each sex, are extracted from the census data files and are treated separately. In both cases, individuals younger than 15 years of age are dropped from the sample, as well as individuals who did not work in that year or for whom the information on weeks worked is missing. Earnings here refers to annual wage and salary income and net self-employment income. Statistics Canada typically rounds incomes to integer multiples of CAN 1000. Earnings are stated in thousands of 2005 (Canadian) dollars.
Given Assumption 1, it is important to see to what extent the rounding of incomes, which inevitably creates an empirical distribution more discontinuous than one generated by sampling from a genuinely differentiable distribution, has an effect on our asymptotic standard errors. We took a subsample of just 1000 observations from the dataset for men from the 2000 census, and smoothed the data by adding noise generated by the Epanechnikov kernel. To each income
y, measured in dollars rather than thousands of dollars, the added noise is given by
where the bandwidth
. The asymptotic standard errors computed from the smoothed data differed by less than one percent from those computed from the census data.
Density estimates were given by the approach outlined in
Appendix B. We experimented with different values of the parameter
n using samples drawn from the lognormal distribution, for which the density is known analytically. It appeared that a larger value of
n gave more accurate estimates, but that numerical overflow occurred in the computation of the gamma function for values of
n greater than around 170. We found that setting
gave satisfactory results, although other choices in the neighborhood of 100 gave results that were not markedly different.
In
Table 1, results are shown for men in 2000. The entries for
are the upper income cutoff for group
L and the lower income cutoff for group
H. For group
M, the entry is the sample median. Asymptotic standard errors are in brackets.
Table 1.
Men in 2000.
| | | | | |
---|
L | 17.7420 | 0.2702 | 0.0500 | 7.7588 | 0.1851 |
| | (0.0007) | (0.0002) | (0.0271) | (0.0006) |
M | 35.4840 | 0.5811 | 0.5745 | 41.4371 | 0.9886 |
| (0.0770) | (0.0012) | (0.0019) | (0.0937) | (0.0018) |
H | 70.9681 | 0.1487 | 0.3755 | 105.8242 | 2.5248 |
| | (0.0009) | (0.0019) | (0.3020) | (0.0045) |
Table 2 shows the corresponding results for women in 2000.
In
Table 3 and
Table 4, there are similar results for men and women respectively in 2005.
The sample sizes for these four tables of basic distributional results are quite large; so, it should perhaps not be surprising that the asymptotic standard errors are quite small, and all the reported statistics in these basic tables are highly statistically significant. They involve averages or proportions, which seem to be robustly estimated. The estimates of A and B are also all quite sensible in that they imply that the estimated density ratio is considerably larger than —which is what one would expect for a right-skewed distribution such as for an earnings distribution.
Table 5 and
Table 6 show the differences in outcomes between men and women for the years 2000 and 2005, with asymptotic standard errors for these differences in parentheses. A positive difference means that the relevant outcome is greater for men than for women; a negative difference means the reverse. Again, all the differences are highly statistically significant. Two results are evident. In both years, men were relatively more concentrated in the middle-income group with women relatively more concentrated in the lower- and higher- income groups within each distribution. This is consistent with more part-time women workers as well as generally higher levels of education for women than for men in recent decades. Second, the earnings gap between men and women changed very little within the lower and middle income groups over 2000–2005. But in the higher income group, men’s earnings shot up quite dramatically compared to women’s over this period.
Table 7 and
Table 8 present differences or changes over time in the distributional outcome measures between 2000 and 2005, separately for men and women. For outcomes that were greater in 2005 than in 2000, the differences are positive. Again, asymptotic standard errors are in parentheses, and again, all but one of the changes are highly statistically significant. Here, the changes are quite dramatic given that major distributional changes have typically been rather slow and gradual over time. For both men and women, the proportion of workers in the middle-income group fell substantially between 2000 and 2005, as did the relative-mean incomes of the middle group. On the other hand, mean earnings levels in the higher-income group went up dramatically. As a result, the earnings share of the middle group of so-called middle-class earners markedly declined and was made up by a corresponding dramatic rise in the earnings share of the higher-income group. This pattern occurred for both women and men in the Canadian labor market between 2000 and 2005, but the changes were two to three times stronger in the earnings distribution for men than for women.
Table 9 and
Table 10 further pursue this significant pattern of change and show results for several measures of polarization within the earnings distributions (see
Section 3.2 above).
Table 9 focuses on population shares or the proportion of workers towards the two ends of the distributions, while
Table 10 bases alternative polarization measures on mean earnings gaps over the ends of the distributions. Again, in both sets of polarization measures, one finds broadly similar patterns of change for both men and women (though with some differences). In the case of
-based measures (
Table 9), the general polarization of workers out of the middle-class region was driven by an increased proportion of workers in the
H earnings group among men but by an increased proportion of workers in the
L earnings group among women. In the case of the earnings-gap measures (
Table 10), the greatly widening gaps in earnings between groups in the distributions is almost entirely driven by the widening gap between middle-class and higher earnings levels—for both men and women in the labor market. Again, the changes are about twice as strong among men than among women workers, and again, the results are highly statistically significant.
Finally,
Table 11 and
Table 12 display estimates of and changes in the compound polarization measures (in
Section 3.2) that combine the population share and earnings gap dimensions. As can be seen, for both men and women, changes in the upper end of the earnings distributions over the 2000–2005 period were much greater than changes in the lower end of the distributions. For women, the changes were about twice as large, while for men it was about eight times. Clearly, the large changes have been occurring between the middle-class earnings group and the higher-earnings group. This recommends the use of separate polarization measures for the lower and upper ends of the distribution rather than one that blends or combines the two and thus potentially hides the basic structural changes that are going on over the different regions of the distribution and in the Canadian labor market. Note also that, for men, both components of
contribute to the large increases in earnings polarization—both increases in
, as well as the rising earnings gap (
)—while for women, the increase in
is driven completely by rapidly rising upper earnings levels. Again, these polarization changes are all highly statistically significant. Because our sample sizes are large, our asymptotic results seem to be reliable, as illustrated by the simulation evidence presented in
Appendix D.
As actual explanations for these major changes are fairly complex and overlapping (some examples: skill-biased automation, globalization and deindustrialization, sectoral and demographic shifts, increased industrial concentration, and weakened private-sector unionization rates); for more extensive discussion, we prefer to refer to (
Beach, 2016,
2025), among others, where one can find more extensive discussion of the leading structural explanations of the observed distributional changes and possible policy implications of these changes.
One might want to follow up on the above results by investigating possible intra-group dynamics within any of the income groups.
1 Since the choice of the
a and
b cut-off scalars is arbitrary, one could redo part or all of the above empirical analysis with different values of
a and
b, possibly highlighting specific narrower regions of the income distribution. Instead, the authors would recommend using a—possibly quite refined—quantile-based analysis as provided in
Beach and Davidson (
2025). The corresponding variance–covariance formulas in a quantile-based approach are simpler to use and are distribution-free, so that no density estimation steps need to be undertaken. Indeed, the authors view these two papers to be complementary, and between them they provide a quite extensive tool box of distributional statistics to look at possibly quite disaggregative patterns of distributional change.
5. Conclusions
This paper considers income distributions that are divided into lower, middle, and upper regions based on separating points that are scalar multiples of the median. For example, the lower region (L) could consist of recipients with incomes less than half the median, the middle group (M) includes those with incomes between 50 percent and 200 percent of the median, and those with incomes above twice the median lie in the higher income group (H). Such a characterization of an income distribution is very useful in evaluating changes over time in the economic experience of the middle-class income group and in the nature of polarization in the distribution. For each of these three income groups, separate estimates are obtained for their income shares (
), group size or population shares (
) and their mean income levels (
). The paper derives explicit formulas for the asymptotic variances (and, hence, standard errors) of sample estimates of the groups’ population shares, income shares, and mean incomes. It is shown that these formulas are not distribution-free, but that a density-estimation technique of
Comte and Genon-Catalot (
2012) is well-suited to provide needed data-based density estimates in empirical income distribution analyses. The results are then applied to derive asymptotic variances for relative-mean income ratios, for each income group, for various polarization measures, and for decile–mean income ratios. This statistical framework is implemented with Canadian Census public-use microdata files in order to investigate some of the key features of changes in the Canadian earnings distribution.
It is found that population and income shares and income-group means can indeed be estimated with a high degree of reliability. Major patterns of distributional change that have been previously highlighted in the literature have indeed been found to be highly statistically significant. The distributional framework and statistical approach used in this paper thus allow one to move beyond descriptive analysis of distributional change to a formal framework of statistical inference and hypothesis testing.
Further, since , changes in group income shares have been found to arise from changes in both population shares and relative mean incomes. Estimating these two dimensions separately allows for (i) a rich economic interpretation and testing of the driving factors behind distributional change and (ii) an extensive characterization (and hence better understanding) of polarization as a key aspect of on-going distributional change.
The results of this paper suggest that official government statistical agencies—such as Statistics Canada and the U.S. Bureau of the Census—may wish to consider providing median-based estimates of population shares, income shares and income-group means to complement their regularly published series on decile income shares and decile means. They could also provide user information on the general reliability of these estimates. Since the deciles and decile means, which official agencies already provide, and the median-based statistics provided in this paper are usefully complementary, they together would offer a much better source of distributional information on which to base possible policy initiatives to improve policy design and targetting. For example, one might ask what the appropriate income range is for so-called middle-class income tax cuts, COVID-19-response temporary income support programs, or possibly for wage or employment adjustment programs in face of major tariff impact adjustments.