Next Article in Journal
Design of Trabecular Bone Mimicking Voronoi Lattice-Based Scaffolds and CFD Modelling of Non-Newtonian Power Law Blood Flow Behaviour
Previous Article in Journal
Dynamic Modeling of Bacterial Cellulose Production Using Combined Substrate- and Biomass-Dependent Kinetics
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Theil’s Index of Inequality: Computation of Value-Validity Correction

by
Tarald O. Kvålseth
1,2
1
Department of Mechanical Engineering, University of Minnesota, Minneapolis, MN 55455, USA
2
Department of Industrial & Systems Engineering, University of Minnesota, Minneapolis, MN 55455, USA
Computation 2024, 12(12), 240; https://doi.org/10.3390/computation12120240
Submission received: 28 October 2024 / Revised: 27 November 2024 / Accepted: 3 December 2024 / Published: 5 December 2024
(This article belongs to the Section Computational Social Science)

Abstract

:
The Theil index is one of the most popular indices of economic inequality, one reason for which is no doubt due to its convenient additive decomposition property. One of its weaknesses, however, is its lack of any intuitively meaningful interpretations. Another, and more serious, limitation of Theil’s index, as argued in this paper, is its lack of the value-validity property. That is, this index does not meet a particular condition based on metric distances between income-share distributions required in order for the range of potential index values to provide true, realistic, and valid representations of the economic inequality characteristic. After outlining the value-validity condition, this paper derives a simple transformation of Theil’s index that meets this condition to a high degree of approximation. Randomly generated income-share distributions are used to demonstrate and verify the validity of the corrected index. The new index formulation, which is simply a power function of Theil’s index, can then be used to make appropriate and reliable representations of absolute and relative difference comparisons of economic inequalities.

1. Introduction

Among the various measures of economic inequality that have been previously proposed, one of the most popular is the Theil index, named after Theil [1,2], and generally expressed as
( Y n ) = 1 n i = 1 n ( y i y ¯ ) log ( y i y ¯ )
where Y n = ( y 1 , , y n ) is the set of incomes of n individuals or income intervals and y ¯ = i = 1 n y i / n (e.g., [3,4,5,6,7]). In terms of proportions (income shares) p i = y i / n y ¯   ( i = 1 , , n ) , Theil’s index can be expressed as
T ( P n ) = log n H ( P n ) ,   H ( P n ) = i = 1 n p i log p i
where H ( P n ) is recognized as the entropy of Shannon [8] for the distribution P n = ( p 1 , , p n ) with p i 0 for i = 1 , , n and i = 1 n p i = 1 . The natural (base-e) logarithm is used in (1) and (2).
While this index can certainly be criticized for its lack of intuitive sense [9], its popularity is probably due to its useful decomposition property, as follows: T can be decomposed additively into the inequality “between” and “within” different subgroups (e.g., [1,2,10,11]). This decomposability property is useful in empirical studies and can be used by policymakers when trying to identify sources of economic inequality (e.g., [6,10,12,13,14]). The U.S. Census Bureau produces estimates of the Theil index.
A recognized practical disadvantage of Theil’s T is that its values are not always comparable across different units (such as countries) since, although its lower bound is 0, the upper bound log n of T is not fixed, but depends on n. Another limitation of T, that has not so far been reported or discussed, relates specifically to the values taken on by T. While various properties relevant to T that are basically mathematical have been widely discussed (symmetry, scale invariance, population replication, Pigou–Dalton transfer principle; e.g., [5]), any concern about T lacking the value-validity property has not so far been discussed, but will be in this paper. This property, first introduced by Kvålseth [15], ensures that an inequality index takes on values throughout its range that provide representations of the inequality characteristic that are true, realistic, and valid with respect to a generally acceptable criterion.
It becomes immediately evident that T does not meet the condition required by the value-validity property and can therefore lead to unreliable, inappropriate, and misleading results and conclusions. Consequently, the objective of this paper is to explore some alternative formulation as a correction of T that satisfies the value-validity condition, at least as a good approximation. The exploratory analysis is based on randomly generated income-share distributions P n = ( p 1 , , p n ) as well as the so-called lambda distribution [16].

2. Value-Validity

Since the value-validity property and its conditions have been discussed extensively by Kvålseth [14,16,17], only a brief outline will be provided here. Thus, consider a generic economic inequality measure EI whose value becomes E I ( P n ) for the income-share distribution P n = ( p 1 , , p n ) and with the extreme values E I ( P n 0 ) and E I ( P n 1 ) for the two distributions
P n 0 = ( 1 n , , 1 n ) ,   P n 1 = ( 1 ,   0 , ,   0 ) .
While the strictly correct notation would be for EI to denote a measure or function and E I ( P n ) to denote its value for some P n , EI may be used in this paper to denote both a measure (index) and its value to simplify the notation when there is no chance of ambiguity.
As a convenient starting point to introduce the value-validity concept, consider the following delta distribution introduced by Kvålseth [16]:
P n λ = ( λ + 1 λ n , 1 λ n , , 1 λ n ) ,   0 λ 1
where λ can be considered as an inequality parameter. The P n 0 and P n 1 in (3) are seen to be extreme members of (4). Thus, for any given n, λ = 0 in (4) represents the income-share distribution with perfect equality while λ = 1 corresponds to the distribution with maximum income-share inequality. When considering some condition for the value-validity of an economic inequality index EI, the special distribution in (4) can conveniently be used because of the following relationship:
E I ( P n ) = E I ( P n λ )   for   one   unique   λ
for any given income-share distribution P n = ( p 1 , , p n ) and single-valued EI with E I ( P n 0 ) E I ( P n ) E I ( P n 1 ) . Of course, there can be any number of different P n , for which (5) would hold for the same λ -value.
The distribution in (4) can be viewed as a so-called mixture distribution, since it is the following weighted mean of P n 0 and P n 1 in (3):
P n λ = ( 1 λ ) P n 0 + λ P n 1 ,   0 λ 1 .
As a basis for the value-validity condition for an economic inequality index EI, the following linearity (mean-value) requirement for (5) is proposed:
E I ( P n λ ) = ( 1 λ ) E I ( P n 0 ) + λ E I ( P n 1 )
for all n and 0 λ 1 . This linear relationship can equivalently be expressed in terms of the normalized form
E I * ( P n λ ) = E I ( P n λ ) E I ( P n 0 ) E I ( P n 1 ) E I ( P n 0 ) = λ .
For any given λ , E I ( P n λ ) in (7) becomes a linear function of the two variables E I ( P n 0 ) and E I ( P n 1 ) and, for any given (fixed) E I ( P n 0 ) and E I ( P n 1 ) , E I ( P n λ ) is a linear function of λ .
Besides the linearity proposition in (7), this relationship can also be justified or explained in terms of metric distances between income-share distributions. Thus, by considering the distributions P n λ , P n 0 ,   and   P n 1 as points (vectors) in n-dimensional Euclidean space, λ can be expressed in terms of Euclidean distances d as
d ( P n λ , P n 0 ) d ( P n 1 , P n 0 ) = d * ( P n λ ) = λ .
Then, from (8) and (9), the value-validity condition can be expressed in terms of the normalized index EI* and distance d* as
E I * ( P n λ ) = d * ( P n λ ) = λ .
While the condition in (10) is based on the specific distribution in (4), there are more general implications from the equality in (5). Consequently, it can be expected that the first equality in (10) becomes an approximate equality for any income-share distribution P n = ( p 1 , , p n ) , i.e.,
E I * ( P n ) d ( P n , P n 0 ) d ( P n 1 , P n 0 ) = d * ( P n ) = n i = 1 n p i 2 1 n 1 .

3. Critical Assessment of T

It is readily seen from its definition in (2) that the Theil index T does not meet the value-validity condition in (10). It is clear from numerical examples that T substantially understates the true extent of the economic inequality. For a simple example P 2 = ( 0.75 ,   0.25 ) , it follows from (2) that T ( P 2 ) = 0.13 so that since T ( P 2 0 ) = 0 and since T ( P 2 1 ) = log 2 , then the normalized index value becomes T * ( P 2 ) = 0.19 . This P 2 distribution is equivalent to P 2 0.5 in (4) so that, according to the requirement in (10), the normalized index value should be 0.50 rather than the T * -value of 0.19.
For an inequality index ranging in potential values between 0 and log n , as is the case for Theil’s index with T ( P n 0 ) = 0 and T ( P n 1 ) = log n , an equivalent index T V complying with the value-validity condition in (10) can be expressed as
T V ( P n λ ) = ( log n ) λ ,   T V ( P n ) = ( log n ) d * ( P n ) = ( log n ) n i = 1 n p i 2 1 n 1
for the P n λ in (4) and P n = ( p 1 , , p n ) . The extent to which T lacks the value-validity property can conveniently be analyzed by comparing T ( P n λ ) with the T V ( P n λ ) in (12).
Although the extent of the inequalities T ( P n λ ) < T V ( P n λ ) and T ( P n ) < T V ( P n ) become readily apparent from numerical data considered below, this can be performed analytically in terms of P n λ and by defining the value bias of T ( P n λ ) as
V B T ( P n λ ) = T ( P n λ ) T V ( P n λ ) = ( 1 λ ) log n H ( P n λ )
for the T V ( P n λ ) in (12) and with H ( P n λ ) being the entropy in (2) for the distribution P n λ in (4). In terms of partial derivatives,
V B T λ = ( 1 1 n ) log [ 1 + n λ ( 1 λ ) 1 ] log n
and 2 V B T / λ 2 0 . For any given n, it is found from (14) that VBT becomes maximum for λ values ranging from λ = 3 / 5 for n = 2 and λ = 1 / 2 for n . Furthermore, by treating n as a continuous variable for mathematical purposes, it is found that V B T ( P n λ ) / n 0 . This analysis shows that the value bias V B T ( P n ) from (13) tends to become increasingly negative with increasing n and as P n approaches the mean ( P n 0 + P n 1 ) / 2 of the two extreme distributions in (3).
While the sensitivity of T V ( P n λ ) in (12) to changes in the inequality or concentration parameter λ in (4) remains constant for any given n, that of T ( P n λ ) varies substantially with λ . Specifically, it is found that
T ( P n λ ) λ = ( 1 1 n ) log ( 1 + n λ 1 λ ) > 0 ,   2 T ( P n λ ) λ 2 = n 1 ( 1 λ ) ( n λ + 1 λ ) > 0
i.e., for any given n, the sensitivity of T ( P n λ ) to small changes in λ increases with λ and at an increasing rate. For any income-share distribution P n = ( p 1 , , p n ) and from (5), the implication is clear: the sensitivity of T ( P n ) to small changes in the inequality (concentration, unevenness) of the components of P n is not constant for any fixed n, but increases with increasing inequality.

4. Correction of T

4.1. Specific Objective

In order to determine if T can be corrected so as to comply with the value-validity condition in (8), an obvious approach would be to explore whether some systematic relationship exists between T and T V in (8). If the dimension n of the income-share distribution P n = ( p 1 , , p n ) or the number n of income earners is known, the results from Kvålseth [15] could be used to explore such a potential relationship. However, when various studies, organizations, or agencies provide reports on economic inequality, the values of indices such as Theil’s T are typically given without specifying values of n (e.g., [10,11,12,13,18,19]).
Therefore, for practical purposes, it would be most useful if a value-validity correction T C could be formulated at least approximately as a simple function of T, i.e.,
T V ( P n ) T C ( P n ) = f [ T ( P n ) ] .
Exploratory statistical analyses will be used to explore the function f in (15).

4.2. Data

To obtain the necessary data for analyzing the potential relationship in (15), two sources of data were used. First, randomly generated lambda distributions P n λ in (4) were obtained by generating n as a random integer between 2 and 100, inclusive, and λ as a random number (to 2 decimal places) such that 0 < λ < 1 .
Second, randomly generated distributions P n = ( p 1 , , p n ) were produced and based on the following computer algorithm. First, n was generated as a random integer between 2 and 100, inclusive. Then, for each such generated n, each p i was generated in descending order ( p 1 p 2 p n ) as random numbers within the following respective intervals:
1 n p i 1
1 j = 1 i 1 p j n ( i 1 ) p i min { p i 1 , 1 j = 1 i 1 p j }   for   i = 2 , , n 1
p n = 1 j = 1 n 1 p j .
Some distributions P n λ and P n were excluded when they produced near identical (repeat) results or when they resulted in values of T > 1, since such T-values would be unrealistic of real reported economic data. Thus, a total of 35 of each of the two types of distributions were used in the analysis.

4.3. Results

The results from using the randomly generated P n λ in (4) and P n = ( p 1 , , p n ) are summarized in Table 1 and Table 2, respectively. An immediate observation from these results is how far T deviates from the corresponding values of T V in (12). The values of T ( P n λ ) and T ( P n ) differ greatly from the respective values of T V ( P n λ ) in Table 1 and T V ( P n ) in Table 2. These results support the above analysis that T consistently and substantially understates the true inequality.
Perhaps the most interesting and promising result from Table 1 and Table 2 is the apparent indication that, although the values of T and T V can differ greatly, they appear to be systematically related. In fact, when the values T V ( P n λ ) versus T ( P n λ ) and T V ( P n ) versus T ( P n ) are represented by the scatter diagram in Figure 1, it becomes clear that a functional relationship, as in (15), could be formulated.
It is evident from this scatter diagram that a simple power function may indeed be an appropriate correction for T, i.e.,
T C ( P n ) = α [ T ( P n ) ] β
where α and β are the parameters. The adequacy of this formulation would depend on how closely the values of T C from (16) approximate those of T V from (12).
From regression analysis of T V on T, the following parameter estimates from (16) are obtained: α ^ = 1.57 and β ^ = 0.64 for the data in Table 1 and α ^ = 1.53 and β ^ = 0.66 for Table 2. When combining the data from both tables into 70 data points, α ^ = 1.55 and β ^ = 0.65 , which turns out to be the means of the other two sets of parameter estimates. Consequently, the following value-validity correction of Theil’s T is proposed:
T ^ V ( P n ) = T C ( P n ) = ( 1.55 ) [ T ( P n ) ] 0.65
which is the curve shown in Figure 1.
When comparing the values of T V and T C in Table 1 and Table 2 and from the scatter diagram in Figure 1, it becomes apparent that the T C in (17) has the value-validity property since the values of T V and T C are approximately equal, to a reasonable degree. Specifically, if T C is used to predict T V , it is found that the coefficient of determination R 2 , when properly computed [20], becomes R 2 = 1 ( T V T C ) 2 / ( T V T ¯ V ) 2 = 0.987 for the fitted model T ^ V = T C and the 70 data sets combined from Table 1 and Table 2. That is, 99% of the total variation of T V (about its mean) is explained (accounted for) by the model T ^ V = T C . Also, R S M E ( T V , T C ) = 1 70 ( T V T C ) 2 / 70 = 0.045 . It is also clear, based on the residuals from Figure 1, that the formulation in (17) is an appropriate one and that no alternative needs to be considered.
It is also of interest to note the close comparative results when based on the distribution P n λ in (4) versus the general distribution P n = ( p 1 , , p n ) . From the data in Table 1 and Table 2 and from the scatter diagram in Figure 1, it is evident that the results from the two different types of distribution are highly comparable. In fact, such correspondence is not surprising, in view of the relationship in (5) involving T and the equivalent one in terms of the corrected T C in (17).

4.4. Real Data Results

In addition to the results from randomly generated data, as discussed in Section 4.3, it may also be of interest to perform the same analysis using some real income data. Also, while the focus of this paper is on the important Theil index, the results from the real data will also be used to make a comparison with another index, the most popular Gini’s index [21], which does, in fact, have the value-validity property.
By definition, if the income shares are rank ordered such that p [ 1 ] p [ 2 ] , p [ n ] , Gini’s index G can be expressed as
G ( P n ) = n + 1 n 2 n i = 1 n i p [ i ]  
with tied (equal) p [ i ] ’s being placed in any order. For the lambda distribution in (4), it is determined from (18) that
G ( P n λ ) = ( 1 1 n ) λ ,   G * ( P n λ ) = λ
and hence G meets the value-validity condition in (10).
In order to compare values of the indices G, T, T V ,   and   T C in (18), (2), (12), and (17), respectively, for some real economic income data, U.S. Census Bureau data were used, as reported by Semega and Kollar [22] (Table A2), for total household income and all ethnic groups for various years. Nine income intervals were reported, ranging from “under USD 15,000” to “USD 200,000 and over”. The results are summarized in Table 3 (to 3 decimal places in order to discriminate between some of the small index values).
It is clear from the data in Table 3 that T C in (17) is closely related to both T V in (12) and G in (18). However, the values of T C and T V do not correspond as closely as they do for the random-based data in Table 1 and Table 2. Of course, the range of values of T C and T V is much greater in Table 1 and Table 2 than in Table 3. Also, the results in Table 3 are based on a fixed number of a few income categories, n = 9 , whereas those in Table 1 and Table 2 are based on n ranging from 2 to 100.
There is, however, close linear relationships between the indices based on the data in Table 3. In fact, the following fitted regression models are obtained from the data in Table 3:
T ^ C = 0.127 + 1.436 T V ,   R 2 = 0.991
T ^ C = 0.135 + 2.068 G ,   R 2 = 0.990
G = 0.005 + 0.692 T V ,   R 2 = 0.993
showing that the variation of one index (about its mean) is nearly perfectly explained (accounted for) by its linear relationship to another index. Consequently, when making difference (interval) comparisons, the indices T V ,   T C ,   and   G can generally be expected to provide similar results, since each complies with the value-validity condition in (10) and (11).

5. Concluding Comments

The single most significant result in this paper is the simple formulation in (17) that provides a correction of Theil’s economic inequality index T to incorporate the value-validity property as a good approximation. The corrected index T C is only a function of T and does not explicitly depend on the number of income units n. This is important when using T C to correct published data for T, since such data typically do not specify n. In fact, this was the motivation behind searching for a potential relationship, as in (17), rather than considering some T C as a function of both T and n.
While Theil’s T has a number of desirable properties, none of those relate specifically to the potential numerical values of T and whether those values can be justified as truly representing the economic inequality characteristic. This limitation of T is addressed by T C and its value-validity property: T C transforms understated T-values into realistic, reliable, and valid inequality representations.
Various economic inequality indices, such as Gini’s G in (18) and Theil’s T, are commonly used to make absolute and relative comparisons between individual values and differences (intervals). The T C , because of its additional value-validity property, has the advantage of providing more representative economic inequality comparisons.
An interesting inconsistency occurs between T C and T when making absolute and relative comparisons. That is, for any two values t 1 and t 2 > t 1 of T and the corresponding values t C 1 and t C 2 > > t C 1 from (17), a general difference between the two indices becomes:
t C 2 t C 1 > t 2 t 1 ,   but   ( t C 2 t C 1 ) / t C 1 < ( t 2 t 1 ) / t 1 .
This inconsistency can be verified from the form of (17).
What sets Theil’s T apart from other inequality indices is its desirable decomposition property. That is, T can be decomposed into within ( T W ) and between ( T B ) inequalities, such that T = T W + T B when, for example, considering global economic inequality versus the inequalities within and between countries or regions (see, e.g., [12,13,23]). While the additive decomposition does not hold for the correction in (17), i.e., T C T W C + T B C , ratio comparisons could still be corrected, such as T W C / T C = ( T W / T ) 0.65 , T B C / T C = ( T B / T ) 0.65 , or T B C / T W C = ( T B / T C ) 0.65 .
The T C does have a clear limitation, as does T. Neither index has any intuitively appealing or meaningful interpretation. Nevertheless, as a simple quantitative measure of economic inequality, T C has the important advantage over T of having the value-validity property. Consequently, when compared to T, the corrected form T C provides more realistic and true inequality representations and comparative results from real economic evaluations.

Funding

This research received no external funding.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The author declares no conflicts of interest.

References

  1. Theil, H. Economics and Information Theory; American Elsevier Company: New York, NY, USA, 1967. [Google Scholar]
  2. Theil, H. Statistical Decomposition Analysis; North-Holland Publishing Company: Amsterdam, The Netherlands, 1972. [Google Scholar]
  3. Cowell, F.A. Measuring Inequality, 3rd ed.; Oxford University Press: Oxford, UK, 2011. [Google Scholar]
  4. Coulter, P.B. Measuring Inequality: A Methodological Handbook; Routledge: London, UK, 1989. [Google Scholar]
  5. McGregor, T.; Smith, B.; Wills, S. Measuring inequality. Oxf. Rev. Econ. Policy 2019, 35, 368–395. [Google Scholar] [CrossRef]
  6. Trapeznikova, I. Measuring inequality. IZA World Labor 2019, 462, 1–11. [Google Scholar] [CrossRef]
  7. Bellù, L.G.; Liberati, P. Policy Impacts on Inequality: Simple Inequality Measures; Food and Agriculture Organizations of the United Nations, FAO: Rome, Italy, 2006. [Google Scholar]
  8. Shannon, C.E. A mathematical theory of communication. Bell Syst. Tech. J. 1948, 27, 379–423, 623–656. [Google Scholar] [CrossRef]
  9. Sen, A.K.; Foster, J.E. On Economic Inequality, 2nd ed.; Clarendon Press: Oxford, UK, 1997. [Google Scholar]
  10. Muszyńska, J.; Oczki, J.; Wędrowska, E. Income inequality in Poland and the United Kingdom. Decomposition of the Theil index. Folia Oeconomica Stetin. 2018, 18, 108–122. [Google Scholar] [CrossRef]
  11. Liao, T.F. Evaluating distributional differences in income inequality. Socius 2016, 2, 1–14. [Google Scholar] [CrossRef]
  12. Carvajal, C.P.; Rodríguez, M.A.; Cuartas, B.M. Determinants of income inequality reduction in the Latin American countries. Cepal. Rev. 2018, 126, 80–98. [Google Scholar] [CrossRef]
  13. Morrisson, C.; Murtin, F. Average Income Inequality Between Countries (1700–2030); FERDI La Fondation pour les études et Recherches sur le Développment International: Paris, France, 2011; pp. 1–16. [Google Scholar]
  14. Kranzinger, S. The decomposition of income inequality in the EU-28. Empirica 2020, 47, 643–668. [Google Scholar] [CrossRef]
  15. Kvålseth, T.O. Entropy evaluation based on value validity. Entropy 2014, 16, 4855–4873. [Google Scholar] [CrossRef]
  16. Kvålseth, T.O. The lambda distribution and its application to categorical summary measures. Adv. Appl. Stat. 2011, 24, 83–106. [Google Scholar]
  17. Kvålseth, T.O. Measurement of market (industry) concentration based on value validity. PLoS ONE 2022, 17, e0264613. [Google Scholar] [CrossRef] [PubMed]
  18. Buettgens, M.; Blavin, F.; Pan, C. The Affordable Care Act reduced income inequality in the US. Health Aff. 2021, 40, 121–129. [Google Scholar] [CrossRef] [PubMed]
  19. Amarante, V.; Galván, M.; Mancero, X. Inequality in Latin America: A global measurement. Cepal Rev. 2018, 118, 25–44. [Google Scholar] [CrossRef]
  20. Kvålseth, T.O. Cautionary note about R2. Am. Stat. 1985, 39, 279–285. [Google Scholar]
  21. Gini, C. Variabilità e mutabilità. In Studi Economico-Giuridici. Pubblicati per cura della Facoltà di Giurisprudenza della R. Università di Cagliari; Tipogr. di P. Cuppini: Bologna, Italy, 1912; Volume 3, pp. 1–158. [Google Scholar]
  22. Semega, J.; Kollar, M. Income in the United States: 2021; Current Population Reports, P60-276; U.S. Census Bureau: Suitland-Silver Hill, MD, USA; U.S. Government Publishing Office: Washington, DC, USA, 2022. [Google Scholar]
  23. Ma, N.; Cheong, T.S.; Li, J. Evaluating global inequality using decomposition approach. Front. Psych. 2022, 12, 809670. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Scatter diagram of T V in (12) versus Theil’s T from the data in Table 1 (red dots) and Table 2 (blue dots). The curve represents the fitted model for the corrected index T C in (17).
Figure 1. Scatter diagram of T V in (12) versus Theil’s T from the data in Table 1 (red dots) and Table 2 (blue dots). The curve represents the fitted model for the corrected index T C in (17).
Computation 12 00240 g001
Table 1. Values of T in (2), TV in (12), and T c in (17) for the distribution P n λ in (4) with randomly generated λ ( 0 ,   1 ) and n [ 2 ,   100 ] .
Table 1. Values of T in (2), TV in (12), and T c in (17) for the distribution P n λ in (4) with randomly generated λ ( 0 ,   1 ) and n [ 2 ,   100 ] .
λ n T ( P n λ ) T V ( P n λ ) T c ( P n λ )
0.16740.360.690.80
0.10780.160.440.47
0.10470.120.390.39
0.08800.110.350.37
0.16270.170.530.49
0.25720.551.071.05
0.5260.470.930.95
0.38310.721.301.25
0.35140.420.920.88
0.29640.651.211.17
0.32920.851.451.39
0.41160.591.141.10
0.06600.050.250.22
0.35480.771.351.31
0.24850.571.071.08
0.19350.270.680.66
0.26270.370.860.81
0.44380.981.601.53
0.16600.260.660.65
0.26590.551.061.05
0.4180.380.850.83
0.51200.931.531.48
0.20720.400.860.85
0.26350.430.920.90
0.25660.531.051.03
0.27790.651.181.17
0.06110.020.140.12
0.2850.130.450.41
0.23120.190.570.53
0.30180.380.870.83
0.22470.380.850.83
0.34560.781.371.32
0.36100.360.830.80
0.07860.090.310.32
0.15550.220.600.58
Table 2. Values of T in (2), T V in (12), d *   in (11), and T c in (17) for randomly generated income-share distributions P n = ( p 1 , , p n ) with n [ 2 ,   100 ] .
Table 2. Values of T in (2), T V in (12), d *   in (11), and T c in (17) for randomly generated income-share distributions P n = ( p 1 , , p n ) with n [ 2 ,   100 ] .
n T ( P n ) T V ( P n ) d * ( P n ) T c ( P n )
360.030.140.040.16
130.160.490.190.47
190.250.650.220.63
470.921.420.371.47
920.470.900.200.95
700.040.170.040.19
670.080.250.060.30
140.020.130.050.12
280.200.530.160.54
250.681.260.391.21
40.230.550.400.60
550.600.960.241.11
240.290.670.210.69
760.260.610.140.65
30.180.420.380.51
720.591.030.241.10
510.941.530.391.49
870.180.450.100.51
340.761.340.381.30
150.380.870.320.83
630.130.370.090.41
770.240.610.140.61
250.100.390.120.35
740.470.820.190.95
140.150.480.180.45
690.871.440.341.42
890.380.810.180.83
410.541.080.291.04
860.851.340.301.39
360.090.320.090.32
90.330.730.330.75
460.511.030.271.00
570.340.730.180.77
80.140.480.230.43
980.771.330.291.31
Table 3. Value of G in (18), T in (2), T V in (12), and T C in (17) for total U.S. household income over n = 9 income categories for different years. Source: U.S. Census Bureau.
Table 3. Value of G in (18), T in (2), T V in (12), and T C in (17) for total U.S. household income over n = 9 income categories for different years. Source: U.S. Census Bureau.
YearGT T V T C
20210.1480.0350.2100.176
20200.1440.0330.2110.170
20190.1540.0390.2190.187
20180.1470.0340.2130.172
20170.1410.0330.1880.169
20160.1460.0350.2090.174
20150.1420.0320.2060.165
20140.1460.0340.2040.171
20130.1500.0340.1970.172
20120.1580.0390.2210.187
20110.1640.0410.2290.195
20100.1610.0400.2240.190
20050.1690.0440.2400.204
20000.1780.0520.2500.227
19950.2060.0690.2840.273
19900.2270.0850.3110.312
19850.2370.1020.3330.351
19800.2540.1240.3630.399
19750.2790.1550.3970.461
19700.3020.1710.4340.492
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Kvålseth, T.O. Theil’s Index of Inequality: Computation of Value-Validity Correction. Computation 2024, 12, 240. https://doi.org/10.3390/computation12120240

AMA Style

Kvålseth TO. Theil’s Index of Inequality: Computation of Value-Validity Correction. Computation. 2024; 12(12):240. https://doi.org/10.3390/computation12120240

Chicago/Turabian Style

Kvålseth, Tarald O. 2024. "Theil’s Index of Inequality: Computation of Value-Validity Correction" Computation 12, no. 12: 240. https://doi.org/10.3390/computation12120240

APA Style

Kvålseth, T. O. (2024). Theil’s Index of Inequality: Computation of Value-Validity Correction. Computation, 12(12), 240. https://doi.org/10.3390/computation12120240

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop