Trade-Off Between Entropy and Gini Index in Income Distribution

Koutsoyiannis, Demetris; Sargentis, G.-Fivos

doi:10.3390/e28010035

Open AccessArticle

Trade-Off Between Entropy and Gini Index in Income Distribution

by

Demetris Koutsoyiannis

^*

and

G.-Fivos Sargentis

Department of Water Resources and Environmental Engineering, School of Civil Engineering, National Technical University of Athens, Heroon Polytechneiou 5, 15772 Zographou, Greece

^*

Author to whom correspondence should be addressed.

Entropy 2026, 28(1), 35; https://doi.org/10.3390/e28010035

Submission received: 2 November 2025 / Revised: 22 December 2025 / Accepted: 24 December 2025 / Published: 26 December 2025

Download

Browse Figures

Review Reports Versions Notes

Abstract

We investigate the fundamental trade-off between entropy and the Gini index within income distributions, employing a stochastic framework to expose deficiencies in conventional inequality metrics. Anchored in the principle of maximum entropy (ME), we position entropy as a key marker of societal robustness, while the Gini index, identical to the (second-order) K-spread coefficient, captures spread but neglects dynamics in distribution tails. We recommend supplanting Lorenz profiles with simpler graphs such as the odds and probability density functions, and a core set of numerical indicators (K-spread

K_{2} / μ

, standardized entropy

Φ_{μ}

, and upper and lower tail indices,

ξ, ζ

) for deeper diagnostics. This approach fuses ME into disparity evaluation, highlighting a path to harmonize fairness with structural endurance. Drawing from percentile records in the World Income Inequality Database from 1947 to 2023, we fit flexible models (Pareto–Burr–Feller, Dagum) and extract K-moments and tail indices. The results unveil a concave frontier: moderate Gini reductions have little effect on entropy, but aggressive equalization incurs steep stability costs. Country-level analyses (Argentina, Brazil, South Africa, Bulgaria) link entropy declines to political ruptures, positioning low entropy as a precursor to instability. On the other hand, analyses based on the core set of indicators for present-day geopolitical powers show that they are positioned in a high stability area.

Keywords:

entropy; principle of maximum entropy; K-moments; stochastics; wealth; income profiles; Gini index; inequality; stability

κατὰ μὲν τὴν οὐσίαν καὶ τὸν λόγον τὸν τὸ τί ἦν εἶναι λέγοντα μεσότης ἐστὶν ἡ ἀρετή, κατὰ δὲ τὸ ἄριστον καὶ τὸ εὖ ἀκρότης. (In terms of its essence and the definition of its nature, virtue is the mean, but in terms of excellence and rightness, [virtue is] the extreme)
Aristotle [1]

1. Introduction

The modern world is characterized by high interdependent complexities upon which its social prosperity is founded [2]. In this context, events of varying scales can trigger cascading effects, leading to dynamics of collapse that impact the entire social structure [3,4]. One of the most critical and intricate parameters, which has been the subject of extensive theoretical and empirical analysis, is income distribution, a foundational factor for social stability [5,6,7].

Based on income distribution, we can derive useful indices describing in an intuitive manner important aspects of a country’s society such as (in)equality and (in)stability. Among these indices, of particular interest are entropy and the Gini coefficient. Our previous research (Koutsoyiannis and Sargentis [7]) delineated entropy’s role across physical and economic systems. Extending this, here we investigate the dynamics between entropy and the Gini coefficient across income structures.

The Gini index, here reformulated as the second-order K-moment standardized by the mean,

K_{2} / μ

(see Section 3.1.3), and referred to as the K-spread coefficient, leads inequality assessments. Still, it masks extremes, i.e., the tail behaviour of the distributions: profiles with matching Gini indices can show stark contrasts in income extremes. Entropy instead gauges overall variability and uncertainty and, through the principle of maximum entropy (ME), identifies the likeliest—and, hence, most resilient—structure under given constraints. The ME principle was originally formalized by Jaynes [8] as a method to infer the most probable distribution given constraints. In economic contexts, ME posits that income distributions tend toward states of maximum entropy under real-world constraints.

Under constraints of specified mean,

μ

, and K-spread (Gini),

K_{2} / μ

, we demonstrate here that entropy maximization results in a generalized half logistic (GHL) distribution, a limiting case of which is the exponential distribution. The latter materializes the peak entropy pole, as (

K_{2} / μ = 1 / 2, Φ_{μ} = 1)

. The distance from this pole is another indicator of the resilience or stability of an economy, with a small distance denoting small instability.

The key conjecture in this study is that income profiles would naturally tend toward peak entropy given real constraints. On the other hand, pushing the economic state to depart from the peak entropy and creating large and persistent deviations from it herald fragility. Hence, a scale-invariant entropy form, or standardized entropy,

Φ_{μ} \leq 1

[7], functions as a resilience gauge, complementing

K_{2} / μ

as a spread gauge. In simpler words, economic structures with low entropy will tend to higher entropy and hence are unstable, while those leading to maximum (standardized) entropy will be stable as the entropy cannot be increased further.

To ratify the key conjecture, we use data (percentile records) from the World Income Inequality Database (WIID), to which we fit flexible models (Pareto–Burr–Feller, Dagum) and extract K-moments and tail indices. We conduct country-level analyses (Argentina, Brazil, South Africa, Bulgaria) to investigate whether low entropy states evolve to higher entropy over the course of a country’s history, and whether this evolution can be linked to political ruptures. In addition we analyse the core set of indicators in countries with the largest populations (exceeding 50 million), including present-day geopolitical powers (China, India, USA, Russia, EU), to assess their stability based on the criteria developed.

The data sources are outlined in Section 2. Section 3 summarizes stochastic tools and links entropy and Gini. Section 4 details applications and stories. Section 5 weighs meanings and suggests paths ahead.

2. Data

The real-world applications in this study are based on the World Income Inequality Database [9,10], developed and maintained by the United Nations University World Institute for Development Economics Research. More specifically, the product used is the WIID Companion Country Dataset, which reports inequality data by country. This was selected because of its unparalleled comprehensiveness and global scope in providing income inequality statistics. The dataset encompasses 202 nations (including 4 historical entities), starting in 1947 (for USA; later for other countries) and extending to 2023, offering 2546 distinct country-year records. This broad coverage enables detailed analyses of inequality trends across diverse geopolitical contexts and historical eras, which is essential for our examination of the trade-off between entropy and the Gini index. WIID allows us to explore extreme Gini values and long-term patterns in countries like Argentina, Bulgaria, Brazil, and South Africa, as well as in major geopolitical powers such as the USA, China, EU, Russia, and India, ensuring a robust empirical foundation for applying theoretical stochastic tools to real-world income disparities.

A key reason for choosing WIID is its provision of percentile-based income data, which aligns perfectly with the methodological requirements of our research. The specific data used are those named “p1”–“p100”, which represent the income per capita (based on GDP) per percentile, standardized to have an average of 1. These facilitate the calculation of Lorenz curves, Gini indices, and entropy indicators directly. This granularity in percentiles supports our use of K-moments, the ME principle, and tail index estimations, allowing for precise comparisons between relevant probabilistic distributions. Unlike more aggregated datasets, WIID’s percentile-level detail enables us to handle grouped data effectively, reliably estimate empirical statistics, and visualize trade-offs in figures. We note though that for some countries percentile-level data are missing; these were excluded from our analyses.

WIID’s reliability and methodological rigour enable comparability across countries and over time, as well as the extraction of information for multi-country entities. Curated by inequality experts, the database incorporates adjustments for data quality and consistency, with transparent documentation in user guides, technical notes, and replication tools. This minimizes biases in cross-national comparisons, which is critical for our analysis of social stability and political histories in case studies. Widely recognized in academic research for monitoring global inequality trends, WIID outperforms alternative sources by offering both raw and processed data, ensuring that our findings on the limitations of the Gini index and the advantages of entropy-based approaches are grounded in high-quality, verifiable evidence.

3. Methods

3.1. Basic Stochastic Tools

3.1.1. Distribution Function and Relative Concepts; Expectation and Moments

Let

\underline{x}

be a stochastic (random) variable of continuous type (i.e., taking on values that are real numbers); notice that we underline stochastic variables to distinguish them from regular variables. We denote its distribution function (i.e., probability of non-exceedance) and its tail function (i.e., probability of exceedance), respectively, as:

F (x) : = P \{\underline{x} \leq x\}, \bar{F} (x) : = 1 - F (x) = P \{\underline{x} > x\}

(1)

where P denotes probability. A useful derived function is the so-called odds function:

O (x) : = \frac{F (x)}{\bar{F} (x)} = \frac{F (x)}{1 - F (x)}

(2)

Both

F (x)

and

O (x)

are nondecreasing functions, and since the variable

\underline{x}

is continuous, the inverse functions exist. The inverse of

F (x)

, denoted as

x (F)

is called the quantile function. The derivative of the distribution function:

f (x) : = \frac{d F (x)}{d x}

(3)

is the probability density function and obeys the obvious relationship:

\int_{- \infty}^{\infty} f (x) d x = 1

(4)

Any deterministic function of

\underline{x}

,

g (\underline{x})

, is a stochastic variable per se, because its argument is stochastic. The expectation of the stochastic variable

g (\underline{x})

is defined as:

E [g (\underline{x})] : = \int_{- \infty}^{\infty} g (x) f (x) d x

(5)

For

g (\underline{x}) = \underline{x}

and

g (\underline{x}) = {(\underline{x} - μ)}^{2}

we get, respectively, the mean,

μ

, and the variance,

γ

, of

\underline{x}

:

μ : = E [\underline{x}] = \int_{- \infty}^{\infty} x f (x) d x, γ : = E [{(\underline{x} - μ)}^{2}] = \int_{- \infty}^{\infty} {(x - μ)}^{2} f (x) d x

(6)

The variance is necessarily nonnegative and its square root,

σ : = \sqrt{γ}

, is the standard deviation. For nonnegative variables, the limit

- \infty

in the above integrals is replaced by 0, while the ratio

σ / μ

, termed the coefficient of variation, is a useful dimensionless index of the variability of a system.

3.1.2. Entropy and Standardized Entropy

It is possible to define a function

g ()

in terms of not the variable

\underline{x}

but the probability density per se, i.e.,

g (\underline{x}) = h (f (\underline{x}))

, where

h ()

is any specified function. Among the several choices of

h ()

, most useful is the logarithmic function, which results in the definition of entropy,

Φ [\underline{x}]

. The emergence of the logarithm in the definition of entropy follows some postulates set up by Shannon (1948, [11]) for stochastic variables of discrete type. Extension for a continuous stochastic variable

\underline{x}

was not contained in Shannon’s original work but was given later (see e.g., [12,13] p. 375) as:

Φ [\underline{x}] : = E [- \ln \frac{f (\underline{x})}{β (\underline{x})}] = - \int_{- \infty}^{\infty} \ln \frac{f (x)}{β (x)} f (x) d x

(7)

where

β (x)

is a background measure density that can be any probability density, proper (with integral equal to 1, as in Equation (4)) or improper (meaning that its integral diverges). Typically, it is an (improper) Lebesgue density, i.e., a constant. We note that most texts do not include the background measure density

β (\underline{x})

in the definition (or set

β (\underline{x}) \equiv 1

) but, in terms of physical consistency, this is an error, because, in order to take the logarithm of a quantity, this quantity must be dimensionless. The density function has units

[f (x)] = [x^{- 1}]

and therefore we need to divide it by a quantity with the same units before taking the logarithm. Even if we choose the Lebesgue measure as the background, with

β (x) = 1 / λ,

(constant), where

λ

is the unit used to measure

x

, still the entropy depends on the unit. It can easily be verified that, if we measure

x

with two different units

λ_{1}

and

λ_{2}

, the respective entropies

Φ_{1} [\underline{x}]

and

Φ_{2} [\underline{x}]

will differ by a constant:

Φ_{1} [\underline{x}] - Φ_{2} [\underline{x}] = \ln \frac{λ_{2}}{λ_{1}}

(8)

The entropy

Φ [\underline{x}]

per se is always dimensionless and, for continuous variables, it can be either positive or negative, depending on the assumed

β (x)

, ranging from

- \infty

to a maximum value, depending on the system and, in particular, on its constraints.

Entropy quantifies uncertainty and its importance lies in the principle of maximum entropy, formally introduced in 1957 by Jaynes [8]. This postulates that the entropy of a stochastic system should be at maximum, under some conditions, formulated as constraints, which incorporate the information that is given about this system. The meaning of the principle is that the maximum entropy state of a system is the most probable one that is allowed by its degrees of freedom and not disallowed by its constraints. Therefore, entropy is also an index of stability: a state that is far apart from the maximum entropy state is unstable as it will tend to change toward maximizing entropy. The principle can be used for logical inference as well as for modelling physical systems. In this respect, the tendency of entropy to become maximal (as in the second law of thermodynamics), which drives natural change, can result from this principle. On the other hand, the principle equips the entropy concept with a powerful tool for logical inference.

In application in economics [7,14], for a constant background density equal to the inverse of the monetary unit (i.e.,

1 / λ w i t h λ

equal, e.g., to USD 1), the entropy provides an indicator of society’s wealth (even if

x

expresses income). If we set the background measure density to the value

1 / μ, w h e r e μ

is the mean income, we get the standardized entropy, which from Equation (8) is obtained as:

Φ_{μ} [\underline{x}] = Φ [\underline{x}] - \ln \frac{μ}{λ}

(9)

This quantity, which cannot exceed a maximum value of 1 for nonnegative continuous variables (see Section 3.3), has been originally introduced [7,14] as an index of inequality. However, as we will see below, it can better be thought of as an indicator of stability, while the notion of K-moments (see next) can better characterize inequality.

3.1.3. K-Moments

While in classical statistics moments of orders higher than 2 are defined and used (by substituting

{(\underline{x} - μ)}^{p}, p > 2

, for

{(\underline{x} - μ)}^{2}

in Equation (6)), these cannot be reliably estimated from samples [14,15]. However, the concept of knowable moments or K-moments [15] can reliably provide estimates for high-order moments.

The K-moments are defined as follows. We consider a sample of a stochastic variable

\underline{x}

, i.e., a number

p

of independent copies of the stochastic variable

\underline{x}

, i.e.,

{\underline{x}}_{1}, {\underline{x}}_{2}, \dots, {\underline{x}}_{p}

. If we arrange the variables in ascending order, the ith smallest, denoted as

{\underline{x}}_{(i : p)}, i = 1, \dots, p

is termed the ith order statistic. The largest (pth) order statistic is:

{\underline{x}}_{(p)} : = {\underline{x}}_{(p : p)} = \max ({\underline{x}}_{1}, {\underline{x}}_{2}, \dots, {\underline{x}}_{p})

(10)

and the smallest (first) is:

{\underline{x}}_{(1 : p)} = \min ({\underline{x}}_{1}, {\underline{x}}_{2}, \dots, {\underline{x}}_{p})

(11)

We define the upper knowable moment (K-moment) of order p as the expectation of the largest of the p variables

{\underline{x}}_{(p)}

:

K_{p}^{'} : = E [{\underline{x}}_{(p)}] = E [\max ({\underline{x}}_{1}, {\underline{x}}_{2}, \dots, {\underline{x}}_{p})]

(12)

and the lower knowable moment (K-moment) of order p as the expectation of the smallest of the p variables

{\underline{x}}_{(1 : p)}

:

{\bar{K}}_{p}^{'} : = E [{\underline{x}}_{(1 : p)}] = E [\min ({\underline{x}}_{1}, {\underline{x}}_{2}, \dots, {\underline{x}}_{p})]

(13)

An important property, directly resulting from their definition, is that the K-moments are ordered as follows:

{\bar{K}}_{p}^{'} \leq \dots \leq {\bar{K}}_{2}^{'} \leq {\bar{K}}_{1}^{'} = K_{1}^{'} = μ \leq K_{2}^{'} \leq \dots \leq K_{p}^{'}

(14)

These moments are noncentral and we can also define central moments as:

K_{p} : = K_{p}^{'} - K_{1}^{'}, {\bar{K}}_{p} : = {\bar{K}}_{1}^{'} - {\bar{K}}_{p}^{'}, K_{p}, {\bar{K}}_{p} \geq 0

(15)

As shown in Chapter 6 in [15], for a stochastic variable

\underline{x}

of continuous type, the upper K-moment of order p of

\underline{x}

, is theoretically calculated as:

K_{p}^{'} = p E [{(F (\underline{x}))}^{p - 1} \underline{x}] = p \int_{- \infty}^{\infty} {(F (x))}^{p - 1} x f (x) d x = p \int_{0}^{1} x (F) F^{p - 1} d F

(16)

Likewise, the lower K-moment of order p is theoretically calculated as:

{\bar{K}}_{p}^{'} = p E [{(\bar{F} (\underline{x}))}^{p - 1} \underline{x}] = p \int_{- \infty}^{\infty} {(\bar{F} (x))}^{p - 1} x f (x) d x = p \int_{0}^{1} \bar{x} (\bar{F}) {\bar{F}}^{p - 1} d \bar{F}

(17)

The unbiased estimator of the upper K-moment

{\underline{K}}_{p}^{'}

from a sample of size

n

is:

{\underline{\hat{K}}}_{p}^{'} = \sum_{i = 1}^{n} b_{i n p} {\underline{x}}_{(i : n)}

(18)

and that of the lower K-moment is:

{\hat{\bar{\underline{K}}}}_{p}^{'} = \sum_{i = 1}^{n} b_{i n p} {\underline{x}}_{(n - i + 1 : n)} = \sum_{i = 1}^{n} b_{n - i + 1, n, p} {\underline{x}}_{(i : n)}

(19)

where

b_{i n p} = \{\begin{array}{l} 0, & i < p \\ p \frac{Γ (n - p + 1)}{Γ (n + 1)} \frac{Γ (i)}{Γ (i - p + 1)}, & i \geq p \geq 0 \end{array}

(20)

and

Γ ()

is the gamma function. For data that are grouped in classes, the resulting modified estimator is shown in Appendix A.1.

Based on the K-moments, we define the K-centre of order

p

,

C_{p}

, and the K-spread of order

p

,

D_{p}

, as:

C_{p} : = \frac{K_{p}^{'} + {\bar{K}}_{p}^{'}}{2}, D_{p} : = \frac{K_{p}^{'} - {\bar{K}}_{p}^{'}}{2}

(21)

where

D_{p} \geq 0

. The least-order meaningful values thereof are:

C_{1} = K_{1}^{'} = {\bar{K}}_{1}^{'} = μ, D_{2} = \frac{{K_{2}^{'} - \bar{K}}_{2}^{'}}{2} = K_{2} = {\bar{K}}_{2}

(22)

Since

{\bar{K}}_{2}^{'} = {2 K}_{1}^{'} - K_{2}^{'}

[15], we have

C_{2} = (1 / 2) (K_{2}^{'} + {\bar{K}}_{2}^{'}) = K_{1}^{'} = C_{1}

, i.e., the first and second order K-centre parameters are equal to each other and equal to the mean. The standardized parameter

\frac{D_{2}}{μ} = \frac{K_{2}}{μ} = \frac{{\bar{K}}_{2}}{μ}

(23)

is a characteristic spread index, similar to the coefficient of variation

σ / μ

used in classical statistics, and will be referred to as the K-spread coefficient. Furthermore, the standardized parameter

D_{p} / D_{2}

is also a spread index which will be referred to as the K-spread ratio of order

p

.

3.1.4. Specific Distribution Functions and Tail Indices

Here we use several distribution functions resulting from entropy maximization, which are summarized in Table 1, along with their characteristics. Among them, the three-parameter distributions, namely the Pareto–Burr–Feller (PBF) and the Dagum distributions are quite flexible and can describe most real-world systems. The Pareto, Weibull, and exponential distributions are special cases of the PBF distribution. The logistic and the generalized half logistic (GHL) constitute another form of distribution, resulting from entropy maximization with constrained mean and K-moment of order 2. The log-logistic distribution is a special case of both the PBF and Dagum distributions.

In all distributions listed in Table 1,

λ

is a scale parameter with dimensions identical to those of the variable

\underline{x}

, and

ξ

and

ζ

are dimensionless parameters, representing the upper and lower tail indices, respectively. For a variable

\underline{x}

with domain

(0, \infty)

, their definitions are based on the limiting relationships:

\lim_{x \to \infty} x^{1 / ξ} \bar{F} (x) = l_{U}, \lim_{x \to 0} x^{- ζ} F (x) = l_{L}

(24)

where

l_{U}

and

l_{L}

are nonzero and finite constants. Both can be also determined from the odds function by:

ξ = 1 / O^{#} (\infty), ζ = O^{#} (0)

(25)

where

O^{#} (x)

denotes the log-log derivative (LLD) of the odds function, defined as:

O^{#} (x) : = \frac{d (\ln O (x))}{d (\ln x)} = \frac{x O^{'} (x)}{O (x)}

(26)

The tail indices are important characteristics of a distribution. A distribution with upper tail index

ξ = 0

(e.g., the exponential) is light-tailed, while one with

0 < ξ < 1

is a heavy-tailed distribution. In a distribution with

ζ < 1

, the density

f (x)

is necessarily a decreasing function, at least close to the origin, with

\lim_{x \to 0} f (x) = \infty

. In contrast, when

ζ > 1

, the density

f (x)

is an increasing function close to the origin, with

f (0) = 0

, and is usually bell-shaped. The particular case

ζ = 1

is characteristic of the exponential and Pareto distributions, where

f (0)

is finite and the density

f (x)

is a decreasing function.

3.2. The Lorenz Curve and the Gini Index

The economics literature makes extensive use of the Lorenz curve and the Gini index. If

x (F)

is the quantile function of a probability distribution, then the Lorenz curve is simply its integral standardized by the mean, i.e.,

L (F) = \frac{1}{μ} \int_{0}^{F} x (u) d u \Leftrightarrow x (F) = μ L^{'} (F)

(27)

The Gini index is the ratio of the area between the equality line and the Lorenz curve to the area under the equality line (which is 1/2), i.e.,

G = \int_{0}^{1} (F - L (F)) d F / \frac{1}{2} = 1 - 2 \int_{0}^{1} L (F) d F

(28)

where, to obtain the rightmost result, we observe that

\int_{0}^{1} F d F = 1 / 2

. It is easily shown (see Appendix A.2) that the Gini index is simply the K-spread coefficient:

G = \frac{K_{2}}{μ}

(29)

Once we know the K-spread coefficient and the tail indices of a distribution, we can effectively approximate it by the following proposed relationship:

L (F) = A F (1 - (1 - \frac{1}{A}) F^{1 / ζ}) - \frac{A (1 - F) ({(1 - F)}^{- ξ} - 1)}{ξ}

(30)

where

A = \frac{(2 - ξ)}{2 ζ + ξ - 1} ((2 ζ + 1) \frac{K_{2}}{μ} - 1)

(31)

As shown in Appendix A.2, the approximation preserves the mean and

K_{2}

moment, and the two tail indices of the exact distribution. Figure 1 shows that the above approximation is almost perfect for the PBF distribution. Appendix A.2 shows cases where the approximation is exact. Figure 2 shows a similar behaviour of the approximation for the Dagum distribution.

Therefore, one can replace the Lorenz curve altogether with three parameters,

K_{2} / μ, ξ, ζ

. Actually, these three parameters provide much richer information than the Lorenz curve per se. This is illustrated in Figure 3, which compares two distributions with very different behaviour, a PBF and a Dagum, which have the same

K_{2} / μ

but different tail indices. It is seen that the Lorenz curves do not give any indication of the different behaviours of the two distributions.

For this reason, we contend that the Lorenz curve, despite its popularity, is not a useful tool to understand the income distribution. A better tool, visualizing the distribution behaviour, is the double logarithmic plot of the odds function, also seen in Figure 3, along with plots of the density function, in linear or logarithmic axes. All three additional plots do not hide the information of the differences, with the logarithmic plots also visualizing the tail indices.

Additional evidence that the Lorenz curve is not a truthful stochastic tool is provided by Figure 4, which is based on actual income distribution data for Bulgaria in 1971. (Nb., we investigate Bulgaria in more detail in Section 4.3.4). The Lorenz curve is smooth and provides no information on the peculiarity of the income distribution in this case. Specifically, the density function plot shows that there is a huge peak at an income slightly lower than the mean, suggesting that most of the population had income close to this value. The Lorenz curve totally hides this fact. Even the K-centre and K-spread plots (lower left panel of Figure 4), while clearly showing the huge departure from the ME exponential distribution, do not provide insight into the extent of the departure from the PBF distribution. On the contrary, the latter is visible in the probability density plot, as well as in the odds function plot. In the latter, it appears as a big plateau at an income slightly lower than the mean, and at odds values around 1.

For these reasons, while for completeness we occasionally show some Lorenz curves in our applications, we do not recommend their use and we strongly propose the odds function double logarithmic plot as a replacement.

For completeness we note that the fitting of the PBF and Dagum distributions, shown in both right-hand panels of Figure 4 and substantially departing from the empirical distribution, was carried out by a least squares method on the odds function. Specifically, the sum of squared differences between the logarithms of the empirical and theoretical odds functions was minimized. The empirical values are readily available given that the data are provided in percentiles of the distribution function. The theoretical values are calculated by the formulae given in Table 1 along with Equation (2). The minimization was performed by a standard nonlinear solver that determined the values of distributional parameters.

The theory of statistics provides several tools to assess the appropriateness of a fitted distribution (e.g., chi-squared test, Kolmogorov–Smirnov test, probability plot correlation coefficient test). Such tests provide quantified validation for the selection of a probabilistic model. However, the scope of our study is not to provide insights into model selection and validation (this topic is covered in the recent (2025) study by Koutsoyiannis [16]); rather, it is to propose and assess indicators of (in)equality and (in)stability based on data of income distribution. For the purposes of our study, graphical tools such as those shown in Figure 4 and subsequent illustrations are more fit to purpose. A big advantage of the plots in both right-hand panels of Figure 4 is that they not only show the inappropriateness of the three theoretical models depicted but they also provide insights on the reasons why the departures of theoretical models appear. In this case, one may diagnose that there was forced equality for the middle class, reflected as a peak in the density function and a plateau in the odds function.

3.3. Maximum Entropy Distributions

3.3.1. Unconstrained Bounded Variables

Using calculus of variations, we can determine which is the probability density

f (x)

that maximizes the entropy, defined in Equation (7), under given constraints. If there is no constraint about the system, apart from the range where the variable lies, specified by the inequality constraint:

0 \leq x \leq Ω

(32)

then, maximization of entropy results in uniformity, i.e.,

f (x) = 1 / Ω

, while the maximum entropy, the standardized maximum entropy, and the K-spread coefficient are:

Φ [\underline{x}] = \ln \frac{Ω}{λ}, Φ_{μ} [\underline{x}] = \ln 2, \frac{K_{2}}{μ} = \frac{1}{3}

(33)

3.3.2. Constrained Mean

However, a system becomes more interesting when, in addition to inequality constraints like (32), or even in their absence, there appear equality constraints, corresponding to the information that is known about a system represented by the variable

\underline{x}

. In studying the material wealth (or income) in a certain society, we assume two characteristic quantities: the mean μ, which is related to the total energy available to the society [7], and an upper limit of wealth (or income)

Ω

, which is mainly determined by the available technology (knowhow) and thus we call it the technological upper limit. One may assert that real income distributions are unbounded from above, but a finite upper limit

Ω

helps to better understand the framework and may also be useful to model historical situations of the past, in which the technology was elementary. Furthermore the upper limit can easily be removed (and will actually be removed in the next sections) by letting

Ω \to \infty

.

The constraints for entropy maximization are thus:

\int_{- \infty}^{\infty} x f (x) d x = μ, 0 \leq x \leq Ω

(34)

Assuming a Lebesgue background measure density with

β (x) = 1 / λ

, with λ being a monetary unit (e.g., λ = USD 1), the entropy maximizing probability density is [7]:

f (x) = \frac{1}{λ} \frac{e^{- x / λ}}{1 - e^{- Ω / λ}}

(35)

which is a (doubly) bounded exponential distribution. The particular characteristics of the distribution are given in Table 1. Illustrations of the density function for two values of the upper bound

Ω

are seen in Figure 5 (left). In addition, Figure 5 (right) shows the variation of the mean, K-spread coefficient, and standardized entropy, with the upper limit

Ω,

standardized by the scale parameter

λ

. All three quantities increase with the increase of

Ω . A s Ω / λ \to \infty

, the K-spread coefficient tends to the value

K_{2} / μ = 1 / 2

and the standardized entropy tends to

Φ_{μ} = 1

.

As the tendency of entropy is to grow, one may understand that human societies would push the technological limit to high values and this has actually happened historically [7]. In other words—and despite the bad name of entropy because of misunderstanding its meaning—the tendency of entropy to become maximal is the agent of change and technological progress. As seen in Figure 5 (right), once the technological limit became high enough, say

Ω / λ \approx 10

, it could be neglected as if

Ω / λ = \infty

. In this case we obtain the standard (unbounded) exponential distribution, also shown in Table 1. The characteristics of the latter distribution, namely K-spread coefficient

K_{2} / μ = 1 / 2

and standardized entropy

Φ_{μ} = 1

, define a characteristic point or a pole on a 2D plane (

K_{2} / μ, Φ_{μ}

), which (provided that our variable is nonnegative) cannot be surpassed in the sense that

Φ_{μ}

cannot take any value higher than 1, and the highest value of 1 can be achieved only if

K_{2} / μ = 1 / 2

, otherwise it would necessarily be smaller. Hence the distance of a specific country’s state from this pole, i.e.,

d_{p} : = \sqrt{{(\frac{K_{2}}{μ} - \frac{1}{2})}^{2} + {(Φ_{μ} - 1)}^{2}}

(36)

is a useful index for the characterization of an economy’s state, additional to those already discussed.

It can be shown (and confirmed in Figure 5, right) that, as

Ω / λ \to 0, t h e m e a n μ t e n d s t o Ω / 2

, the K-spread coefficient tends to the value

K_{2} / μ = 1 / 3

, and the standardized entropy tends to

Φ_{μ} = \ln 2

. This signifies a uniform distribution (see Equation (33)). Furthermore, even though not shown in Figure 5, Equation (35) allows for values

λ < 0

and in this case we have the (bounded) anti-exponential distribution. As

Ω / λ \to - \infty

, the mean

μ t e n d s t o Ω

, the K-spread coefficient tends to the value

K_{2} / μ = 0

, and the standardized entropy tends to

Φ_{μ} = - \infty

. This signifies a distribution with all probability mass concentrated at

x = Ω = μ = 0

(an impulse). The impulse represents certainty, with full equality of the population in economic terms. Stochastically, the reason for the emergence of these types of distribution, which must have been materialized in the far distant past at the cradle of human societies, is the very low technological limit, which did not allow any options for diversity in income. Interestingly, Marx and Engels [17] and their followers interpreted this situation of misery as representing an ancient classless society, and envisaged recreating it in the future.

3.3.3. Constrained Mean and K-Spread

For the next step, we pose an additional constraint, namely of a fixed K-spread coefficient, or equivalently a fixed

K_{2}

moment, also removing the upper limit. In this case, the determination of the resulting entropy maximizing distribution is cumbersome and is given in Appendix A.3. The result, if the domain of the variable is the entire line of reals, is the logistic distribution:

F (x) = 1 - \frac{1}{1 + e^{- ς + x / λ}}

(37)

where

ς

is a parameter. If

\underline{x} \geq 0

, as in the case of income, the resulting distribution is the generalized half-logistic (GHL), whose expression is:

F (x) = 1 - \frac{1}{1 - e^{- ς} + e^{- ς + x / λ}}

(38)

It can be easily verified that, if

ς = 0 (a n d s o e^{- ς} = 1)

, in the case of Equation (37), we get the standard logistic distribution, while, in that of the Equation (38), the distribution becomes identical to the exponential one. Thus the GHL distribution contains the exponential distribution as a special case.

The details of both distributions are contained in Table 1. In the case of GHL, the equation giving the K-spread coefficient, albeit simple, cannot be solved explicitly for the parameter

ς

; yet, if

K_{2} / μ

is known, it is easy to find

ς

numerically and then determine the standardized entropy from

ς

. A good analytical approximation of

ς

is given by:

ς \approx \{\begin{array}{l} \frac{0.975}{K_{2} / μ} - 5.85 K_{2} / μ + 1 & 0 < K_{2} / μ < 1 / 2 \\ \frac{0.975}{K_{2} / μ - 1} - 5.85 (K_{2} / μ - 1) - 1 & 1 / 2 < K_{2} / μ < 1 \end{array}

(39)

and of standardized entropy by:

Φ_{μ} \approx 1 - 6 {(\frac{K_{2}}{μ} - \frac{1}{2})}^{2} - \frac{1}{3} {(- \ln (1 - 4 {(\frac{K_{2}}{μ} - \frac{1}{2})}^{2}))}^{\frac{5}{3}}

(40)

The thus established relationship between

K_{2} / μ a n d Φ_{μ}

is illustrated in Figure 6 (curve named “exact ME GHL distribution”), which also shows the satisfactory behaviour of the approximation in Equation (40) (curve named “explicit ME GHL approximation”).

This relationship represents the trade-off between entropy and the K-spread coefficient (Gini index). As

K_{2} / μ

departs from the pole (point (1/2, 1)), the entropy decreases in a symmetrical manner around the line

K_{2} / μ = 1 / 2

. Points below the curve of Figure 6 are mathematically (and practically) feasible, while those above the curve are infeasible. Values

K_{2} / μ < 1 / 2

indicate low stratification of society in terms of income, and values

K_{2} / μ > 1 / 2

indicate high stratification. As per entropy, we have (arbitrarily) partitioned the area below the curve of Figure 6 into three parts, based on the distance from the pole

d_{p}

. Values of

d_{p}

between 2/3 and 1 indicate high stability of economy, those between 1/3 and 2/3 indicate low stability, and those below 1/3 indicate high instability. The latter area also includes negative entropy values, which in reality are feasible but not quite common.

It is useful to approximate the curve of Figure 6 with more general distributions, such as PBF and Dagum, as well as special cases thereof, such as Pareto and Weibull. As seen in Figure 6, which also compares the exact and approximate curves, and in Figure 7 (left), which shows the approximation errors, all four distributions provide good approximations. Figure 7 (right) shows that, in the approximating distributions, the tail indices are no longer

ξ = 0

,

ζ = 1

but vary depending on the K-variation coefficient.

Most promising are the approximations by the two-parameter distributions Weibull and Pareto. The Weibull distribution provides good approximation for the low stratification part of the curve (

0 < K_{2} / μ < 1 / 2

). The upper tail index is

ξ = 0

, while the required value of the lower tail index

ζ

and the achieved standardized entropy

Φ_{μ}

are:

ζ = - \frac{\ln 2}{\ln (1 - K_{2} / μ)}, Φ_{μ} = 1 + (1 + \frac{\ln (1 - K_{2} / μ)}{\ln 2}) γ - l n Γ (- \frac{\ln (1 - K_{2} / μ)}{\ln 2})

(41)

The Pareto distribution provides good approximation for the high stratification part of the curve (

1 / 2 < K_{2} / μ < 1

). The lower tail index is

ζ = 1

, while the required value of the upper tail index

ξ

and the achieved standardized entropy

Φ_{μ}

are:

ξ = 2 - \frac{1}{K_{2} / μ}, Φ_{μ} = 3 - \frac{1}{K_{2} / μ} + \ln (\frac{1}{K_{2} / μ} - 1)

(42)

3.3.4. Notes on the Tail Indices

As will be seen in the applications (Section 4), while the curve shown in Figure 6 effectively captures the feasible space of the covariation of entropy and K-spread, the underlying GHL distribution proves not appropriate for modelling the income distribution. The main obstacle is its tail indices, which are

ξ = 0

,

ζ = 1

, same as in the exponential distribution. In reality, however, any deviation from the exponential distribution is due to different tail indices rather than due to a different K-spread with the same tail indices.

In all applications, the lower tail index

ζ

is always >1 and the upper tail index

ξ

is always >0. Hence neither the exponential nor the GHL distribution can accurately model reality. Moreover, these two tail indices are too important to ignore in economic studies. In fact, there are good reasons that they differ from their ME values [7]. A

ζ > 1

reflects the role of a state in an organized society to redistribute income and wealth through their transferal from richer individuals to poorer by means of several mechanisms, such as taxation, public services, land reform, monetary policies, and others. On the other hand,

ξ > 0

reflects the politico-economic power of the richest, who pursue a greater share of the community’s wealth, thus tending to modify mostly the income distribution tail, converting it from exponential to power-law. At the same time, this advances both the technological limit and the average wealth—a positive side of the elites’ actions for the entire society.

One would think about imposing additional constraints that would modify the distribution from exponential to power law with

ξ > 0, ζ > 1

. This is not so easy, but adopting a non-Lebesgue background measure density

β (x)

makes it easier without imposing additional constraints [15]. With a proper background measure, the resulting ME distribution becomes PBF or Dagum. If we adhere to the Lebesgue measure, then neither of these two distributions is an entropy maximizing distribution, as it can be shown that their density functions are not even local (let alone global) maximizers of entropy under specified mean and K-spread coefficients.

Nevertheless, both these distributions can provide high standardized entropies, depending on the specific values of the indices

ξ, ζ

, and could approximate with high accuracy even the GHL distribution as discussed in Section 3.3.3. Yet, the total effect of specifying values of

ξ > 0 o r ζ > 1

or both is the shift of the entropy curve lower than that of GHL. Some examples of this effect are shown in Figure 8.

The PBF and Dagum distributions are both related and complementary to each other. Obviously, one of the two would provide a better fit to the empirical distribution than the other and, depending on the specific values of the indices

ξ, ζ

, will give higher entropy than the other. The areas of the combinations of parameter values that lead each one to yield higher entropy than the other are depicted in Figure 9, constructed after a systematic numerical investigation. Details on the information provided by the figure are shown in its caption.

In many applications, the PBF and Dagum distributions are thought to lead to overfitting, especially when dealing with limited observed samples, and the estimates of their tail indices are regarded as too uncertain. However, this is not the case with the data used here, as they are summarized data from huge datasets, provided in terms of percentiles. As already discussed, the tail indices are very important and cannot be neglected, while, additionally to those, the two distributions have only one scale parameter, which is again necessary to consider. Hence these distributions are not over-parametrized, but rather have a minimal parameter set. Moreover, the estimates of the tail indices can be carried out even without fitting a probabilistic model by means of the empirical odds function using Equation (25) and determining the log-log slopes with the first or last handful of data points. Actually, the values provided in the application below were estimated in this way and later confirmed by fitting the PBF and Dagum models.

4. Application

4.1. General Setting

Here we apply the framework theoretically developed in Section 3 to the income data of several countries and different periods. This application has two parts. In the first we examine the characteristics of the countries with large populations—a total of 17 countries with populations over 50 million and with data availability for the year 2022, which is the reference year. Our aim here was to use the most recent year, which in the database is 2023, but there were too many missing data and we preferred to use 2022. In addition, we compiled merged datasets for two multi-country entities, namely the European Union (27 countries) and the World (69 countries, for which data from 2022 are available, with a total population of 5.4 billion). The compilation process is described in Appendix B.1. The total of 19 entities examined include major geopolitical powers, such as the USA, China, India, the Russian Federation, and the European Union, belonging either to the G7 or the BRICS group. Other countries of these groups (e.g., France, Germany, the United Kingdom for G7) are also regarded by many as geopolitical powers, but the results of our analyses (and many more indications) will not confirm such a characterization. In any case, such characterization is subjective and time changing, and does not affect the objective results of our analyses.

In this second part, we examine the history of the evolution of two major economic indices, the K-spread (Gini index) and standardized entropy, as well as the distance from the pole of maximum entropy for four countries, in correlation with the political history of each of them. These countries were chosen for the peculiarities in their politico-economic evolution, provided that their data also exhibit sufficient historical depth. Specifically, Argentina (1953–2023) was selected as a case study of a country that experienced successive military coups and political turmoil during the period 1953–1980. Bulgaria was chosen as an example of a Soviet satellite state attempting to implement the communist system and an egalitarian income distribution. Brazil (also included in the first group) was included as a country that, despite its severe inequalities, has historically recorded Gini values near 1/2. Finally, South Africa (also included in the first group) was selected due to its pronounced economic disparities and its persistent and extremely rigid social stratification.

A brief political overview of each country under examination is provided, beginning with a brief history and the political landscape to the time series examined. This allows us to form a concise understanding of broader societal perceptions regarding social stratification in different regions. Even if the available data for our analysis refer mainly to the second half of the 20th century and the early years of the 21st century, with these tools, we are trying to evaluate the social dynamics and the historical evolution of each country.

In all cases, from the available data, all information referred to in Section 3 was extracted but presented only partly to avoid a very long text. The processing of the data is described in Appendix B.2 for the assignment of empirical values of the distribution function and density function for observed values of the income (given in percentiles), and in Appendix B.3 for the calculation of empirical values of entropy. The PBF and Dagum distributions were fitted in all cases, and the numerical values of indices were calculated both directly from the data (as already described in Section 3.3.4) and indirectly from the fitted distribution. The direct and indirect values were very similar and hence only the former are reported here. We note though that, in extreme cases of large deviations of data from the two distributions, like in Bulgaria, 1971, shown in Figure 4, the direct empirical values differ from the indirect ones.

4.2. The Status of the Major Countries in 2022

In 2022, countries with populations over 50 million, and in particular major geopolitical powers, exhibited distinct perceptions of inequality shaped by historical legacies, economic structures, and policy approaches, influencing their social and political landscapes [18,19]. The United States framed inequality as a consequence of market-driven innovation, with policymakers often downplaying wealth concentration among the top 1% while public discourse highlighted racial and economic divides, amplifying polarization [20,21]. On the other hand, China’s leadership viewed inequality as a manageable byproduct of rapid growth since the 1978 reforms, prioritizing urban development and poverty reduction while tolerating wealth concentration among elites, with the “common prosperity” initiative signalling a shift toward addressing urban–rural disparities through targeted redistribution [22,23].

India perceived inequality as a structural challenge rooted in colonial land systems and informal economies, with elites accepting stark wealth gaps as a trade-off for growth, though public frustration over stagnant wages and urban–rural divides fuelled demands for reform without cohesive policy action [24,25]. The European Union saw inequality as a threat to social cohesion, emphasizing robust welfare systems and progressive taxation to ensure equitable income distribution, though regional variations and crises like inflation sparked debates over deeper fiscal unity [26]. Russia perceived inequality as secondary to state stability, with economic measures boosting lower incomes but elite wealth concentration and regional disparities accepted as entrenched features of its resource-driven system [27].

Detailed graphs of the economic status (similar to those for Bulgaria in Figure 4) are given in Figure 10 for the USA, in Figure 11 for China, and in Figure 12 for the World. The graphical depictions for the USA and China show a close similarity between the two cases, with the most visible difference being the smaller upper tail index of China, visualized by the slope of the rightmost part of the odds function curve.

On the other hand, the graphs for the composite sample of the World show distinct differences from those of USA and China, with much greater inequalities, mostly reflected in the K-spread profile, which is higher than that of the exponential distribution. In all cases, though, the PBF and the Dagum models provide good fits to the empirical distributions.

The K-spread vs. standardized entropy curve for all 19 entities examined are shown in comparison to each other and to the GHL curve in the upper left panel of Figure 13, while in the upper right graph the distances from the pole of maximum entropy are compared. For completeness, the same graph in its lower panels provides information on the gross domestic product (GDP) per capita and gross domestic product based on purchasing power parity (GDP-PPP) per capita. These are important indices of prosperity, but they were not investigated in detail here, as the focus is on (in)equality and (in)stability of economy.

The composite case of the World appears to have higher entropy than individual countries, which is expected because it incorporates many very different economic models, leading to high composite uncertainty. Observing the position of the geopolitical players in 2022 from the perspective outlined in the methodology above, we see that the K-spread coefficient (Gini index) does not reflect China’s political intentions regarding common prosperity, since it appears rather high. In contrast, evaluation through entropy captures the social dynamics more accurately, as China appears to be the most stable of all individual entities (just below the World), exhibiting the smallest distance from the pole of maximum entropy. India, although positioned according to the Gini index as having the potential to achieve maximum entropy, does not succeed in remaining close to the pole. The United States is at a greater distance than China and India, yet still within a stable framework; the European Union and Russia are located close to the boundary of high stability, while the United Kingdom, France, and Germany lie at the area of low stability. Except for these three, all other countries and composite entities examined lie in the high stability area.

In addition, Table 2 summarizes the main numerical indices resulting from this analysis. The lowest or highest values of the indices that favour equality or stability are highlighted in bold and it can be seen that China and Russia are the most notable in good performance in this respect, with Italy and Germany (in terms of equality) following. At the other end, of not good performance are Germany (in terms of stability), India, Turkey, and South Africa.

4.3. A Brief Political History and the Evolution of Economic Indices in Specific Countries

4.3.1. Argentina

Argentina’s political history began with independence from Spain in 1816, leading to a period of civil wars between federalists and unitarians, eventually stabilizing under a federal constitution in 1853. The late 19th century saw economic prosperity driven by agricultural exports and European immigration but also rising social tensions and oligarchic rule, culminating in the 1916 introduction of universal male suffrage and the Radical Civic Union’s ascent. The 1930 military coup marked the start of instability, followed by the 1943 coup that propelled Perón to power in 1946, establishing Peronism as a transformative force blending populism and authoritarianism [30].

In the 1950s, Argentina’s political and social perceptions were profoundly shaped by Peronism, a dominant political culture, emphasizing social justice, economic independence, and political sovereignty as a “third way” between communism and capitalism. Argentines under Peronism sought a balanced system that incorporated elements of state intervention in the economy without full communist collectivization, fostering a populist identity that prioritized national sovereignty over ideological extremes [31,32].

From the 1950s to the 1980s, Argentina experienced recurrent military interventions, including the 1955 coup that ousted Perón, the 1962 and 1966 coups against civilian governments, and the 1976 coup that initiated the brutal Dirty War under a military junta [33]. This era saw alternating periods of restricted democracy and outright dictatorship, and social unrest (Figure 14). The frequent upheavals stemmed from deep-seated political polarization, economic volatility including hyperinflation and debt crises, and a tradition of military involvement in politics [34].

Since the return to democracy in 1983, Argentina’s political history has been characterized by efforts to consolidate democratic institutions amid economic challenges [35]. In the 1990s, neoliberal reforms were introduced, followed by the 2001–2002 economic collapse and a series of short-lived presidencies. The period 2003–2015 saw progressive policies and debt restructuring, followed by shifts toward market-oriented reforms [36] and libertarian austerity measures, reflecting ongoing struggles for stability in a polarized landscape [37].

Observing Argentina’s history from the perspective outlined in the methodology above, we can see that the Gini index does not provide us with information regarding the stability and condition of social inequalities. Before 1980, this index ranged between 0.32 and 0.41, with an average value of 0.35; during 1980–1999, it ranged between 0.38 and 0.48, with an average of 0.44; and in the period since 2000, it ranged between 0.38 and 0.50, with an average of 0.42. Thus, if we were to evaluate social status solely through this index, we would conclude that the period before 1980 was of greatest social harmony—something that clearly did not occur.

Figure 14. Characteristic violent events in Argentina in the 20th century: (left) civilian casualties after the air attack and massacre on Plaza de Mayo, June 1955 [38]; (right) Cordobazo general strike in protest against the political and economic decisions of the military dictatorship, Bulevar San Juan, Córdoba Capital, May 1969 [39].

On the contrary, if we examine entropic indices, we observe that, before 1980, the entropy ranged between −0.12 and 0.59, with an average of 0.25; during 1980–1999 between 0.69 and 0.78, with an average of 0.74; and since 2000 between 0.71 and 0.82, with an average of 0.77. Clearly, in the period before 1980, the annual entopic indices lie in the areas of low stability to high instability, thereby explaining the social instability, the unrest, and the coups of that period, since the social structure was fragile—something not reflected in the Gini index (Figure 15 left).

By combining the two indicators, using the distance from the pole of maximum entropy as the indicator of stability, we find that the distances of the distributions before 1980 range between 0.44 and 1.12, with an average of 0.76; during 1980–1999 between 0.23 and 0.32, with an average of 0.27; and since 2000 between 0.19 and 0.30, with an average of 0.24. Taking into account that a smaller distance from the pole of maximum entropy indicates greater stability in the distribution, the large distances once again of the period before 1980 explain the upheavals and coups (Figure 15 right).

4.3.2. Brazil

Brazil’s political history began with independence in 1822 as a monarchy transitioning to a republic in 1889 amid the abolition of slavery and coffee boom-driven growth [40]. The Old Republic (1889–1930) was dominated by oligarchs, followed by the authoritarian Estado Novo (1937–1945), which introduced labour rights but centralized power [41]. Post-1945 democracy was interrupted by the 1964 military coup, establishing a dictatorship until 1985 that accelerated industrialization but deepened inequalities through repression and economic policies favouring elites.

Brazil’s political perceptions have been shaped by a legacy of colonialism, slavery, and elite dominance, leading to extreme inequalities rooted in the latifundia system where large landowners controlled vast estates, exploiting labour without significant redistribution. Favelas (Figure 16) emerged in the late 19th century as informal settlements for freed slaves and rural migrants, exacerbated by urbanization without social reforms [42]. Such inequalities persisted without major revolutions due to a tradition of clientelism, military repression, and co-optation of dissent through gradual reforms, preventing widespread uprisings despite stark disparities [43].

From 1980 to the present, Brazil transitioned from military rule [45] to democracy, with the 1988 Constitution emphasizing social rights [46]. The 1990s focused on economic stabilization via the Real Plan, reducing inflation. The 21st century was marked by attempts for poverty reduction coexisting with corruption scandals, political polarization, and ongoing challenges like inequality and environmental issues [47].

Observing history after 1980 (when data are available), from the perspective described in the methodology above, we see that the Gini index ranges roughly between 0.48 and 0.58, with a mean of around 0.53, higher than in the other countries examined above yet not reflecting extreme inequalities. However, the upper tail index

ξ

(not shown in the graphs) has a very high value, more than 0.5 and reaching 0.6, and this better reflects the fact that the social stratification system is far from ideal, as also depicted in the favelas and elsewhere. It appears that the inequalities stem from the political–cultural legacy of Brazil’s earlier colonial era, and that modern policies have not been able to fully eradicate those practices. If we examine social stability from the viewpoint of entropy, we find that it ranges between about 0.77 and 0.87, except for year 2005, when it was 0.63 (Figure 17). Generally, this level does not indicate political instability, and indeed no major instability has been observed in the period under review. Notably, in the year 2005, a major political scandal (the Mensalão scandal) broke out, in which the ruling Workers’ Party was accused of monthly payments to deputies to vote as the government wished [48].

The scandal had institutional consequences: public pressure, resignations of senior government officials, and suspicions of broader corruption. Although Brazil’s economy at that time did not collapse, the entropy derived from the income distribution registered an impressive drop, signifying that a destabilization event did occur—one that was later rectified.

4.3.3. South Africa

South Africa’s political history involved Dutch settlement in 1652, British conquest in the early 1800s, and the 1910 Union formation excluding Black participation. The National Party’s 1948 victory institutionalized apartheid, enforcing racial separation and suppressing resistance through events like the 1960 Sharpeville Massacre and 1976 Soweto Uprising. International sanctions and internal protests in the 1980s eroded the regime, leading to 1990 reforms and initiating negotiations [49,50].

South Africa’s political perceptions were shaped by centuries of colonialism and racial domination, with extreme inequalities stemming from the exploitation of Black labour under Dutch and British rule, formalized in apartheid from 1948. This tradition of segregation, including land dispossession and pass laws, created vast disparities without immediate overthrow due to military suppression and divide-and-rule tactics [51]. The anti-apartheid struggle culminating in the 1990–1994 transition, with Nelson Mandela’s release and the 1994 democratic elections marking the end of white minority rule [52].

From 1990 to the present, South Africa transitioned to democracy with Mandela’s 1994 presidency, focusing on reconciliation via the Truth and Reconciliation Commission and affirmative action. The 21st century developments are not free of corruption scandals [53], while inequalities persist, fuelling protests [54,55].

Observing history from the perspective of our methodology, and in particular the evolution of the Gini index, we see that, although apartheid was overturned in the period under examination (after 1993), inequalities in South Africa continued to be extremely large—with the index fluctuating between 0.67 and 0.74 and an average of 0.70 (Figure 18). It appears that, in South Africa, the intense inequalities stem from the political–cultural legacy of earlier eras, while modern policies have not managed to smooth them out. If we examine social stability from the lens of entropy, we find that it ranged between 0.51 and 0.72, with a mean around 0.62, indicating the country is in a state of low stability, as also suggested by high corruption indices [56] and elevated crime rates [57,58,59].

4.3.4. Bulgaria

Bulgaria’s modern political history started with its independence in 1878 after centuries under Ottoman control, followed by a monarchy and participation in the Balkan Wars. Its alignment with the Axis in WWII led to Soviet occupation in 1944, installing a communist regime under Georgi Dimitrov by 1946, with purges and nationalization. Todor Zhivkov’s long rule from 1954 emphasized Soviet loyalty, culminating in 1989 protests and the regime’s fall amid perestroika [60,61].

Communism arose through Soviet imposition rather than a strong indigenous tradition of equality or social stratification redistribution, though some agrarian reforms built on pre-existing peasant movements. There was no deep-rooted culture of egalitarian distribution, as the system was enforced top-down amid purges and collectivization. Figure 19 shows the architectural expression of the communist era, when all houses were quite similar, in huge blocks.

From 1960 to 1989, Bulgaria pursued industrialization and cultural assimilation policies. The 1989 ouster led to multiparty elections in 1990, with socialist governments initially dominating the transition to market economy, followed by EU accession in 2007 [62].

Observing history from the perspective of our methodology, we see a very low Gini index during the period 1963–1989, ranging between 0.18 and 0.26, with an average of 0.23 (Figure 20). This means that a high level of equality was achieved, according to the goals of the communist regime. In the period 1992–2023, when Bulgaria entered the free market, inequalities amplified and the Gini index increased to 0.31–0.42, with an average of 0.36.

From the perspective of entropy, we observe that during 1963–1989 it ranged between −0.67 and 0.39, with an average of 0.12, whereas during 1992–2023 it ranged between 0.60 and 0.80, with an average of 0.72. It is noteworthy that negative entropy values appear in several years of the communist period, with the lowest value in 1971, a turbulent period for Bulgaria (Figure 20, left) [63]. Indicatively, on 16 May 1971, the referendum on the Zhivkov Constitution was held, with voter turnout at 99.7% and approval also at 99.7% [64]. These extraordinarily high rates suggest that political practices were democratic only in appearance and the system had limited real alternatives. This manifests as a strikingly low entropy of the social structure of that time.

Figure 19. Apartment blocks in Sveta Troitsa, Sofia, Bulgaria, at the northwest end of the district, next to a train station [65].

Figure 20. Characteristic graphs for the evolution of major economic indices in Bulgaria: (left) standardized entropy vs. K-spread coefficient (Gini index), plotted alongside the maximum entropy vs. K-spread curve; (right) distance from the pole of maximum entropy. The cyan rectangles represent the era of Soviet influence and the dark red diamonds the era of free market. Purple dashed lines show the boundaries between the partitioned areas.

It is interesting that almost the entire communist period falls within the area we have characterized as one of high instability, with notable episodes such as in 1971, already mentioned above, as well as the years just before the collapse of the Soviet Union (1987–1988), during which the system also exhibits negative entropy and large distance from the pole of maximum entropy (Figure 20, right)—something that explains the eventual overturning of this political system.

5. Discussion and Conclusions

Entropy carries a bad reputation in both scientific and public discourse [7], but this can be attributed to the fact that its meaning is greatly misunderstood because it is a stochastic concept, while the education system is based on the deterministic paradigm. Far from signifying decay, decadence, or disorder as usually thought, entropy is a formal quantification of uncertainty, the dominant feature in complex real-world systems. The tendency of entropy to increase and the related principle of maximum entropy formally describe the natural tendency of complex systems to move from less probable to more probable states. High entropy corresponds to a greater multiplicity of states, hence expanded freedom of choice, more opportunities, and structural resilience.

Being a non-conservation law, entropy maximization is also a driver of change. This is also the case in economics and we have shown that, starting from a bounded distribution that has low entropy, the inevitable tendency of entropy to grow would push the technological limits to high values—a pattern historically confirmed. Technological progress as well as growth of wealth are not merely compatible with entropy increase, they are its direct expression.

The typical tools used in economic analyses, namely the Lorenz curve and the Gini index, totally miss accounting for entropy. Here we showed that Lorenz profiles are a poor representation of the economic states and hence we recommend replacing them with simpler graphs such as the odds and probability density functions. The Gini index, which we showed is identical to the (second-order) K-spread coefficient,

K_{2} / μ

, is a good indicator of (in)equality but neglects dynamics in distribution tails. Therefore, we propose complementing it with upper and lower tail indices,

ξ, ζ

and also accompanying it with a standardized form of entropy,

Φ_{μ}

.

We also demonstrated here that, under constraints of specified mean,

μ

, and K-spread,

K_{2} / μ

, the maximum entropy distribution is the GHL distribution, a limiting case of which is the exponential distribution. The latter materializes the peak entropy pole, as (

K_{2} / μ = 1 / 2, Φ_{μ} = 1)

. The limiting curve of

Φ_{μ}

vs.

K_{2} / μ

, or else the maximum entropy vs. K-spread curve, turns out to be a parabola-like shape symmetrically arranged below this pole. The distance from this pole is another indicator of resilience or stability of an economy, with a small distance denoting small instability.

The real-world applications with data (percentile records) from the World Income Inequality Database, illustrated the theoretical framework and provided support to its hypotheses and results. The country-level analyses (Argentina, Brazil, South Africa, Bulgaria) showed that entropy declines can be linked to political ruptures. In addition, the analyses of the core set of indicators in present-day geopolitical powers (China, India, USA, Russia, the EU) affirm their stability based on the criteria developed, even though some EU countries lie in the low stability area. Interestingly, in all latter cases, the K-spread index is lower than 1/2, positioning these geopolitical powers to the low stratification area of the maximum entropy vs. K-spread graph.

High stratification is rarer, but it was affirmed in the case of South Africa, where in recent years a tendency to increased entropy is noted, albeit without one to decreased stratification. In contrast, very low stratification, quantified by the K-spread coefficient, was the case in former socialist countries, of which Bulgaria was studied in detail. Interestingly, even in this case, higher order spread parameters, such as

D_{10} / D_{2}

kept high values, despite the low

K_{2} / μ

. Naturally, the entropy in this period was too low, placing the country in high instability. This radically changed after the fall of the communist regime, with the entropy substantially increasing, thus leading to higher stability.

Apparently, entropy, K-spread, and the other indices studied do not provide a complete picture of prosperity. Absolute indicators such as the GDP per capita and the GDP-PPP per capita should also be considered, but they were not the focus of this study—even though we also provided these indicators for the entities examined. Indices of “real economy” (dealing with goods and services that satisfy human needs and desires, such as agriculture, manufacturing, construction, and services), as contrasted with the “financial economy” (dealing with financial assets like stocks and bonds), are also most important but outside the scope of this study. Societal aspects such as equal opportunities, freedom of choice, and creative expression, and ultimately a meritocratic structure that would not be influenced by hereditary or entrenched class constraints, are also important drivers of economy. Our data do not allow us to make this kind of approach, but it would be interesting to explore it in future research.

The findings of our study, and in particular the proposed indicators (K-spread, standardized entropy, tail indices) and the concave frontier between entropy and K-spread, may have practical implications. Yet we avoid discussing them or providing policy recommendations, actionable insights for policymakers, or usable information for policy decisions, such as tax reforms, social welfare programs, or regulatory interventions. We prefer to adhere to the scientific part of the subject matter, leaving the study of such implications to policy experts.

Although our analysis focused on the country and international level, the proposed methodology could be applied at smaller scales, such as in businesses, organizations, or local communities. In these contexts, calculating similar indicators of relevant variables, e.g., wage distribution within a company, could provide valuable insights into the quality, effectiveness, and stability of the policies implemented.

The country-level analyses revealed that, while the maximum entropy vs. K-spread curve is a tool of high explanatory potential, the underlying GHL distribution is hardly representative of the actual statistical behaviour. Its specified tail indices at

ξ = 0

,

ζ = 1

do not correspond to real situations where both tail indices turn out to be higher than the GHL values. Therefore, our framework included the flexible PBF and Dagum distributions, which usually had excellent performance in terms of fitting in real-world data. Yet, there is space for future research with constraints different from a specified K-spread coefficient, or with varying background measures, which would result in better agreement between theory and real-world data.

Hopefully, our framework transforms inequality analysis: entropy is not a penalty on growth but its engine. By embracing uncertainty as freedom, we reconcile equity with innovation—a synthesis that Aristotle intuited: virtue lies in the mean but excellence in the extreme.

Author Contributions

Conceptualization, D.K. and G.-F.S.; methodology, D.K.; software, D.K.; validation, D.K. and G.-F.S.; formal analysis, D.K. and G.-F.S.; investigation, D.K. and G.-F.S.; resources, D.K. and G.-F.S.; data curation, D.K. and G.-F.S.; writing—original draft preparation, D.K. and G.-F.S.; writing—review and editing, D.K. and G.-F.S.; visualization, D.K. and G.-F.S.; supervision, D.K.; project administration, D.K.; funding acquisition, Not applicable. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding but was conducted out of scientific curiosity.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

No new data were created; the data used are described in Section 2 and are publicly available in the given link.

Acknowledgments

During the preparation of this manuscript, the authors chatted with Grok 4 (created by xAI) for the purposes of checking and summarizing texts, and helping in mathematical derivations. The authors have considered the chats’ output and take full responsibility for the content of this publication. Two reviewers provided constructive comments that helped substantially improve and expand this paper. Dedicated to the memory of Katerina Souliou-Patrikiou and Ioanna Koutsoyianni-Christofaki (daughter in law and sister of DK, respectively), who left this world while this research was conducted.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

GDP	Gross domestic product per capita
GDP-PPP	Gross domestic product based on purchasing power parity
GHL	Generalized half exponential (distribution)
LLD	Log-log derivative
ME	Maximum entropy
PBF	Pareto–Burr–Feller (distribution)
WIID	World Income Inequality Database

Appendix A. Mathematical Derivations

Appendix A.1. Estimation of K-Moments for Data That Are Grouped in Percentiles

The general K-moment unbiased estimator is given by Equation (18), with the coefficients

b_{i n p}

given by Equation (20). If the data values are too many, they are typically summarized in a more manageable form. Usually, they are partitioned into classes, each of which contains tied data. In this case, the calculations can be simplified and accelerated by applying a single coefficient to each value appearing in the sample. Assuming that a certain value appears

l

times in the sample, namely from positions

j \geq p

to

j + l - 1

in the ordered sample, i.e.,

x_{(j : n)} = \dots = x_{(j + l - 1 : n)} = y

, the value

y

should be multiplied by a bulk coefficient equal to the sum

\sum_{i = j}^{j + l - 1} b_{i n p}

. This sum is easy to calculate analytically, resulting in a concise expression:

b_{l, j, n, p} : = \sum_{i = j}^{j + l - 1} b_{i, n, p} = \frac{j + l - 1}{p} b_{j + l, n, p} - (\frac{j}{p} - 1) b_{j, n, p}

(A1)

It is easy to verify that for

l = 1

, the result is

b_{1, j, n, p} = b_{j, n, p}

, as it should be. Moreover, if

l = 0

, which means that there is no appearance of a particular value, then the result is

b_{0, j, n, p} = 0

, as required.

Now we assume that the observed sample is composed of

m

classes, each of which contains an equal number

l

of tied values, so that

n = m l

. The kth class contains the items

x_{((k - 1) l + 1 : n)} = \dots = x_{(k l : n)} = : y_{(k : m)}

. Hence the bulk coefficient if this class will be:

c_{l, k, m, p} : = b_{l, (k - 1) l + 1, m l, p} = (\frac{k l + 1}{p} - 1) b_{k l + 1, m l, p} - (\frac{(k - 1) l + 1}{p} - 1) b_{(k - 1) l + 1, m l, p}

(A2)

Furthermore, we attempt approximate

c_{l, k, m, p}

with

b_{k, m, p}

by totally ignoring

l

. In this case

b_{k, m, p}

applies to the kth item of the ordered sample

y_{(1 : m)}, y_{(2 : m)}, \dots, y_{(m : m)}

of size m. The approximation will be good if the error

D_{l, k, m, p} : = c_{l, k, m, p} - b_{k, m, p}

(A3)

is small. An extended numerical investigation showed that this is indeed the case. An example is shown in Figure A1, where it is seen that the absolute value of the error (a) is small, not exceeding 0.004, (b) increases with moment order, p, and class number, k, and (c) is practically independent of the class size, l. These observations allow us to locate the most extreme error as the quantity

D_{l, m, m, p}

for a sufficiently high number of l. We further observe that, as

l \to \infty

, the limit of

D_{l, m, m, p}

has a simple expression:

\lim_{l \to \infty} D_{l, m, m, p} = 1 - \frac{p}{m} - {(1 - \frac{1}{m})}^{p}

(A4)

which for large

m

or small

p

tends to zero. For

m = 100

, as in the applications of this study whose data are given at percentiles, and for

p = 10

the error is −0.0044, i.e., negligible. However, for higher values of

p

it would become non-negligible. Therefore, it is not recommended to estimate K-moments of order higher than 10, if the data are summarized in classes.

The above analysis justifies a procedure in which the K-moments are estimated from the sample

y_{i}

, instead of

x_{i}

, as if each class contained just one value.

Figure A1. Density plots of the error

D_{l, k, m, p}

for a number of classes

m

= 100, varying moment order

p

, and class size

l

, and for the indicates class numbers

k

.

Figure A1. Density plots of the error

D_{l, k, m, p}

for a number of classes

m

= 100, varying moment order

p

, and class size

l

, and for the indicates class numbers

k

.

Appendix A.2. Derivations About the Lorenz Curve and the Gini Index

Given Equation (28), to find the Gini index, we integrate by parts and get:

\int_{0}^{1} L (F) d F = \int_{0}^{1} L (F) F^{'} d F = {[L (F) F]}_{0}^{1} - \int_{0}^{1} L^{'} (F) F d F = 1 - \frac{1}{μ} \int_{0}^{1} x (F) F d F

(A5)

and using Equation (6.22) in Koutsoyiannis [15] we finally get:

G = 1 - 2 (1 - \frac{1}{μ} \int_{0}^{1} x (F) F d F) = 2 \frac{1}{μ} \int_{0}^{1} x (F) F d F - 1 = \frac{K_{2}^{'}}{μ} - 1 = \frac{K_{2}}{μ}

(A6)

The above derivation is general, based on the definition of the Lorenz curve, as no specific expression or approximation of the Lorenz curve has been used.

Yet the approximation proposed in Section 3.2 is useful in several applications and can be exact in some cases, as specified in Table A1.

Table A1. Special cases of the ME GHL approximation.

Case	$L (F)$	$K_{2} / μ$	$Distribution for Which the Expression for L (F)$ Is Exact
General	$A F - (A - 1) F^{1 + 1 / ζ} - \frac{A (1 - F) ({(1 - F)}^{- ξ} - 1)}{ξ}$	$\frac{A}{2 - ξ} + \frac{1 - A}{1 + 2 ζ}$
$A = 1$	$F - \frac{(1 - F) ({(1 - F)}^{- ξ} - 1)}{ξ}$	$\frac{1}{2 - ξ}$	Pareto
$ζ = \infty$	$1 - A (1 - F) (1 + \frac{({(1 - F)}^{- ξ} - 1)}{ξ})$	$\frac{A}{2 - ξ}$	Pareto with origin shifted to $(1 - A) μ$
$ξ = 0$	$A F - (A - 1) F^{1 + 1 / ζ} + A (1 - F) \ln (1 - F)$	$\frac{A}{2} + \frac{1 - A}{1 + 2 ζ}$
$A = 1, ξ = 0$	$F + (1 - F) \ln (1 - F)$	$\frac{1}{2}$	Exponential
$ζ = \infty, ξ = 0$	$A F + A (1 - F) \ln (1 - F)$	$\frac{A}{2}$	Exponential with origin shifted to $(1 - A) μ$

To verify the goodness of the approximation, we first find the quantile function from Equation (27):

x (F) = μ (\frac{A (1 - ξ)}{ξ} ({(1 - F)}^{- ξ} - 1) + \frac{(A - 1) (1 + ζ)}{ζ} F^{1 / ζ})

(A7)

By integration we find:

\int_{0}^{1} x (F) d F = μ

(A8)

which shows that the approximate expression has the same mean as the exact distribution. Furthermore:

2 \int_{0}^{1} x (F) F d F = Κ_{2} + μ

(A9)

which means that the approximate expression has the same

K_{2}

moment as the exact distribution. In addition, by taking the LLD of

x

with respect to

F

, we find that:

\lim_{F \to 0} x^{#} (F) = \frac{1}{ζ}

(A10)

which means that the approximate expression has the same lower tail index as the exact distribution. Finally, by expressing the distribution quantile as a function of the tail function,

\bar{x} (\bar{F})

, taking the LLD with respect to

\bar{F}

, we find that:

\lim_{F \to 0} {\bar{x}}^{#} (\bar{F}) = ξ

(A11)

It is noted that Equation (A11) is valid for

ζ \geq 1

, while for

ζ < 1

we have

\lim_{F \to 0} x^{#} (F) = 1

and, hence, in this case the approximation is not perfect in terms of the lower tail index.

Appendix A.3. Maximum Entropy Distribution for Fixed Mean and Second K-Moment

The goal is to find the probability density function

f (x)

that maximizes the entropy

Φ [f] = - \int_{0}^{\infty} f (x) l n f (x) d x

(A12)

subject to the following constraints:

Total probability constraint:

$\int_{0}^{\infty} f (x) d x = 1$

(A13)
Mean constraint:

$\int_{0}^{\infty} x f (x) d x = μ$

(A14)
$K_{2}$ constraint:

$H [f] : = \int_{0}^{\infty} 2 x f (x) F (x) d x = K_{2} + μ$

(A15)

We use calculus of variations to solve the constrained optimization problem. Introducing Lagrange multipliers

ν_{1}, ν_{2}, ν_{3}

for the three constraints, we formulate the functional to extremize as:

L [f] = - \int_{0}^{\infty} f (x) \ln f (x) d x + ν_{1} (1 - \int_{0}^{\infty} f (x) d x) + ν_{2} (μ - \int_{0}^{\infty} x f (x) d x) + ν_{3} (K_{2} + μ - \int_{0}^{\infty} 2 x f (x) F (x) d x)

(A16)

The maximum occurs where the functional derivative vanishes, i.e., when

δ L / δ f (y) = 0

for all

y > 0

. Here, we consider a change

δ f (y)

localized at

y

(in the form of a Dirac-like bump) and calculate the changes in the different terms. The change in the entropy term is:

\frac{δ}{δ f (y)} (- \int_{0}^{\infty} f (x) \ln f (x) d x) = - \ln f (y) - 1

(A17)

where only the term at point

y

is considered from the integral, because

δ f (x) = 0

for any

x \neq y

. The normalization term contributes

- ν_{1}

and the mean term contributes

- ν_{2} y

.

For the

K

-constraint, according to the Leibniz product rule, the change

δ H

has two parts:

δ H : = \int_{0}^{\infty} 2 x δ f (x) F (x) d x + \int_{0}^{\infty} 2 x f (x) δ F (x) d x

(A18)

In the first term,

F (x)

is treated as fixed. Since

δ f (x) = 0

for any

x \neq y

, the integral is:

\int_{0}^{\infty} 2 x δ f (x) F (x) d x = 2 y δ f (y) F (y)

(A19)

For the second term, we observe that

F (x)

depends on

f

, and so

δ F (x)

is not zero. Rather:

δ F (x) = \{\begin{array}{l} 0, & x < y, \\ δ f (y), & x \geq y \end{array}

(A20)

Hence the integral in the second term is:

\int_{0}^{\infty} 2 x f (x) δ F (x) d x = \int_{y}^{\infty} 2 x f (x) δ f (y) d x = 2 δ f (y) \int_{y}^{\infty} x f (x) d x

(A21)

Consequently,

\frac{δ H}{δ f (y)} = 2 y F (y) + 2 \int_{y}^{\infty} x f (x) d x

(A22)

Setting

δ L / δ f (y) = 0

yields:

- l n f (y) - 1 - ν_{1} - ν_{2} y - 2 ν_{3} y F (y) - 2 ν_{3} \int_{y}^{\infty} x f (x) d x = 0

(A23)

or

\ln f (y) = - 1 - ν_{1} - ν_{2} y - 2 ν_{3} y F (y) - 2 ν_{3} \int_{y}^{\infty} x f (x) d x

(A24)

Differentiating both sides with respect to

y

, we find:

\frac{f^{'} (y)}{f (y)} = - ν_{2} - 2 ν_{3} \frac{d}{d y} (y F (y) + \int_{y}^{\infty} x f (x) d x)

(A25)

The derivative on the right-hand side is:

\frac{d}{d y} (y F (y) + \int_{y}^{\infty} x f (x) d x) = F (y) + y f (y) - y f (y) = F (y)

(A26)

and so, the differential equation becomes:

\frac{F^{″} (y)}{F^{'} (y)} = - ν_{2} - 2 ν_{3} F (y)

(A27)

which is a second-order ordinary differential equation. Since

{f (y) = F}^{'} (y) \geq 0

, we can take its logarithm and define

G (y) = \ln f (y) \Leftrightarrow f (y) = e^{G (y)}

(A28)

We observe that the derivative of

G (y)

equals the left-hand side of Equation (A27). We can then write it as:

G^{'} (y) = - ν_{2} - 2 ν_{3} \int e^{G (y)} d y

(A29)

Taking derivatives on both sides, we find the transformed differential equation:

G^{″} (y) = - 2 ν_{3} e^{G (y)}

(A30)

whose general solution is:

G (y) = \ln (c_{1}^{2} {sech (\frac{1}{2} c_{1} (c_{2} + y))}^{2} / 4 ν_{3})

(A31)

where

c_{1}

and

c_{2}

are integration constants. To verify this solution, we calculate the left- and right-hand side of Equation (A30), which are, respectively:

G^{″} (y) = - \frac{1}{2} c_{1}^{2} {sech (\frac{1}{2} c_{1} (c_{2} + y))}^{2}, - 2 ν_{3} e^{G (y)} = - 2 ν_{3} c_{1}^{2} {sech (\frac{1}{2} c_{1} (c_{2} + y))}^{2} / 4 ν_{3}

(A32)

and hence they are equal to each other. Consequently,

f (y) = c_{1}^{2} {sech (\frac{1}{2} c_{1} (c_{2} + y))}^{2} / 4 ν_{3}

(A33)

By integration, we find:

F (y) = c_{1} \tanh (\frac{1}{2} c_{1} (c_{2} + y)) / 2 ν_{3} + c_{3}

(A34)

where

c_{3}

is a third integration constant. For

c_{1} > 0

,

F (y)

takes on the following specific values, which are used as initial conditions to find the integration constants:

F (- \infty) = c_{3} - \frac{c_{1}}{2 ν_{3}}, F (\infty) = c_{3} + \frac{c_{1}}{2 ν_{3}}, F (0) = c_{3} + \frac{c_{1} \tanh (\frac{c_{1} c_{2}}{2})}{2 ν_{3}}

(A35)

If the domain of the variable

y

is the entire real line, then we use the first two initial conditions in Equation (A35):

F (- \infty) = c_{3} - \frac{c_{1}}{2 ν_{3}} = 0, F (\infty) = c_{3} + \frac{c_{1}}{2 ν_{3}} = 1

(A36)

and, after algebraic manipulations, find:

c_{1} = ν_{3}, c_{3} = \frac{1}{2}, F (y) = 1 - \frac{1}{1 + e^{ν_{3} (c_{2} + y)}}

(A37)

Reparametrizing by

λ : = 1 / ν_{3}, ς : = - ν_{3} c_{2}

we obtain Equation (37).

If the domain of the variable

y

is the set of positive reals, then we use the last two initial conditions in Equation (A35):

F (\infty) = c_{3} + \frac{c_{1}}{2 ν_{3}} = 1, F (0) = c_{3} + \frac{c_{1} \tanh (\frac{c_{1} c_{2}}{2})}{2 ν_{3}} = 0

(A38)

and, after algebraic manipulations, find:

c_{2} = \frac{2 \tanh^{- 1} (1 - 2 ν_{3} / c_{1})}{c_{1}}, c_{3} = 1 - \frac{c_{1}}{2 ν_{3}}, F (y) = 1 - \frac{1}{ν_{3} / c_{1} + (1 - ν_{3} / c_{1}) e^{c_{1} y}}

(A39)

Reparametrizing by

λ : = 1 / c_{1}, e^{- ς} = 1 - ν_{3} / c_{1}

, we obtain Equation (38).

As a final step, we should connect the parameters

λ a n d ς w i t h t h e g i v e n μ

and

K_{2}

. For the case

y \in R

, by integration we find the mean,

μ

, and the K-moment

K_{2}

as:

μ = λ ς, K_{2} + μ = λ (1 + ς)

(A40)

and hence, by solving the system of two equations, the final results are:

λ = K_{2} + 2 μ, ς = \frac{μ}{λ}, F (y) = 1 - \frac{1}{1 + e^{\frac{y - μ}{K_{2} + 2 μ}}}

(A41)

For the case

y \geq 0

, the respective equations do not have an analytical solution, and a numerical procedure is needed to find the parameters

λ

and

ς

from

μ

and

K_{2}

. All required equations, obtained through integration using the definitions of the different quantities, are gathered in Table 1. Alternatively, the approximate Equations (39) and (40) can be used for direct calculations of all required quantities.

Appendix B. Data Processing

Appendix B.1. Compilation of Datasets for Multi-Country Entities

Let

Π_{i}

be the population of country

i = 1, \dots, m,

and

s_{i j}, j = 1, \dots, 100

, the average income in the jth percentile (in monetary values, not standardized, available from the database). The entire population of the composite entity is

Π_{c} = \sum_{i = 1}^{m} Π_{i}

. We perform the following steps:

For each country we make an array of 100 pairs ( $π_{i}, s_{i j}$ ), where $π_{i} = Π_{i} / 100$ .
We merge all pairs ( $π_{i}, s_{i j}$ ) for all countries thus forming a table of $100 m$ pairs in total.
We sort the table in ascending order of $y$ thus making a table of $100 m$ pairs $(π_{k}, s_{k}), k = 1, \dots, 100 m$ .
We split the latter table into 100 classes (percentiles), so that each one has population $π_{c} = Π_{c} / 100$ .
For each class $j$ we calculate the average income as $y_{j} = \sum_{k i n c l a s s j} π_{k} s_{k} / π_{c}$ .
We standardize $s_{j}$ to a sum of 100 by $y_{j} = 100 s_{j} / \sum_{k = 1}^{100} s_{k}$ .

Appendix B.2. Assignment of Distribution Function Values for the Observed Sample Values

Assuming that the observed sample is given as a sequence of values

y_{i}

representative for consecutive percentiles

i

, it is reasonable to assume that the value

y_{1}

corresponds to a value of the distribution function

F_{1}

= 0.01/2 = 0.005, the value

y_{2}

corresponds to

F_{2}

= (0.01 + 0.02)/2 = 0.015, and so on, up to the value

y_{99}

that corresponds to

F_{99}

= (0.98 + 0.99)/2 = 0.985. However, this simple technique is not reliable for the last point

y_{100}

, while the estimation of

F_{100}

is crucial for determining the upper tail of the distribution. Here, we follow a more reliable procedure.

As a first step we estimate the upper tail index

ξ

as the regression slope of

\ln O_{i}

vs.

\ln y_{i}

(where

O_{i} = F_{i /} {\bar{F}}_{i}

is the value of the odds function) for the highest few (say, 5–6) points

(O_{i}, y_{i})

, excluding the point

(O_{100}, y_{100})

, as at this phase

O_{100}

is unknown. (Likewise, we estimate the lower tail index

ζ

, but by using the lowest few points

(y_{i}, O_{i}) .

) Then we assume that a Pareto tail applies beyond

c : = x_{99}

, i.e.,

\bar{F} (y) = \bar{F} (c) {(1 + ξ \frac{(y - c)}{l})}^{- 1 / ξ}

(A42)

The mean value beyond

c

is:

μ_{c} : = \int_{c}^{\infty} - {\bar{F}}^{'} (y) y d y / \int_{c}^{\infty} - {\bar{F}}^{'} (y) d y = c + \frac{l}{1 - ξ}

(A43)

and the required value of the distribution function at

μ_{c} = y_{100}

is:

F (μ_{c}) = 1 - \bar{F} (μ_{c}) = 1 - \bar{F} (c) {(1 - ξ)}^{1 / ξ}

(A44)

where in our case

\bar{F} (c) = \bar{F} (y_{99}) = 0.015

. For

ξ = 0

the limiting expression has the form

F (μ_{c}) = 1 - \bar{F} (μ_{c}) = 1 - \bar{F} (c) / e

(A45)

Appendix B.3. Calculation of Empirical Entropy

For each of the first 99 percentiles, we estimate the

f (y_{i}) \approx Δ F / (y_{i} - y_{i - 1})

, where in our case

Δ F

= 0.01. The contribution of each

f (y_{i})

to entropy for unit background measure density is then

- f (y_{i}) \ln f (y_{i})

. However, according to the Pareto tail assumed in Appendix B.2, the contribution of the last percentile is:

Φ_{c} : = \int_{c}^{\infty} (- \ln (- {\bar{F}}^{'} (y))) (- {\bar{F}}^{'} (y)) d y = \bar{F} (c) (1 + ξ - \ln (\frac{\bar{F} (c)}{l}))

(A46)

where, for the Pareto tail after algebraic manipulations, we have:

- {\bar{F}}^{'} (y) = \frac{\bar{F} (y)}{l + (x - c) ξ} \Rightarrow f (c) = \frac{\bar{F} (x)}{l}

(A47)

and hence:

Φ_{c} = \bar{F} (c) (1 + ξ - \ln (f (c)))

(A48)

where in our case

\bar{F} (c) = \bar{F} (y_{99}) = 0.015

and

f (c) \approx 0.01 / (y_{99} - y_{98})

.

References

Aριστοτέλους, Hθικά Νικομάχεια (Aristotle, Nicomachean Ethics, 1107a.1). Available online: https://www.mikrosapoplous.gr/aristotle/nicom2b.htm (accessed on 22 December 2025).
Sargentis, G.-F. Fragility in Human Progress. A Perspective on Governance, Technology and Societal Resilience Front. Complex Syst. 2025, 3, 1609467. [Google Scholar] [CrossRef]
Sargentis, G.-F.; Lagaros, N.D.; Cascella, G.L.; Koutsoyiannis, D. Threats in Water–Energy–Food–Land Nexus by the 2022 Military and Economic Conflict. Land 2022, 11, 1569. [Google Scholar] [CrossRef]
Sargentis, G.-F. Entropy and War, Toy Models. Recent Prog. Sci. Eng. 2025, 1, 7. [Google Scholar] [CrossRef]
Sargentis, G.-F.; Koutsoyiannis, D.; Angelakis, A.; Christy, J.; Tsonis, A.A. Environmental Determinism vs. Social Dynamics: Prehistorical and Historical Examples. World 2022, 3, 357–388. [Google Scholar] [CrossRef]
Sargentis, G.-F.; Iliopoulou, T.; Dimitriadis, P.; Mamassis, N.; Koutsoyiannis, D. Stratification: An Entropic View of Society’s Structure. World 2021, 2, 153–174. [Google Scholar] [CrossRef]
Koutsoyiannis, D.; Sargentis, G.-F. Entropy and wealth. Entropy 2021, 23, 1356. [Google Scholar] [CrossRef] [PubMed]
Jaynes, E.T. Information theory and statistical mechanics. Phys. Rev. 1957, 106, 620–630. [Google Scholar] [CrossRef]
UNU-WIDER. World Income Inequality Database (WIID) Companion Dataset (Wiidcountry); UNU-WIDER: Helsinki, Finland, 2025. [Google Scholar] [CrossRef]
World Income Inequality Database (WIID), WIID Companion User Guide. Available online: https://www.wider.unu.edu/sites/default/files/WIID/WIID-Companion-User-Guide-29April2025.pdf (accessed on 1 November 2025).
Shannon, C.E. The mathematical theory of communication. Bell Syst. Tech. J. 1948, 27, 379–423. [Google Scholar] [CrossRef]
Jaynes, E.T. Probability Theory: The Logic of Science; Cambridge Univ. Press: Cambridge, UK, 2003; p. 728. [Google Scholar]
Uffink, J. Can the maximum entropy principle be explained as a consistency requirement? Stud. Hist. Philos. Mod. Phys. 1995, 26, 223–261. [Google Scholar] [CrossRef]
Lombardo, F.; Volpi, E.; Koutsoyiannis, D.; Papalexiou, S.M. Just two moments! A cautionary note against use of high-order moments in multifractal models in hydrology. Hydrol. Earth Syst. Sci. 2014, 18, 243–255. [Google Scholar] [CrossRef]
Koutsoyiannis, D. Stochastics of Hydroclimatic Extremes–A Cool Look at Risk, 4th ed.; Kallipos: Athens, Greece, 2024; 400p, ISBN 978-618-85370-0-2. Available online: https://www.itia.ntua.gr/2000/ (accessed on 22 December 2025).
Koutsoyiannis, D. When are models useful? Revisiting the quantification of reality checks. Water 2025, 17, 264. [Google Scholar] [CrossRef]
Marx, K.; Engels, F. The German Ideology; International Publishers: New York, NY, USA, 1970; Volume 1, Available online: https://archive.org/details/germanideology00marx/ (accessed on 1 November 2025).
World Inequality Report 2022–Executive Summary. Available online: https://wir2022.wid.world/executive-summary/ (accessed on 1 November 2025).
Global Inequalities—IMF. Available online: https://www.imf.org/en/Publications/fandd/issues/2022/03/Global-inequalities-Stanley (accessed on 1 November 2025).
2022 Income Inequality Decreased for First Time Since 2007—Census.gov. Available online: https://www.census.gov/library/stories/2023/09/income-inequality.html (accessed on 1 November 2025).
Income and Wealth Inequality in America, 1949–2016—Federal Reserve Bank of Minneapolis. Available online: https://www.minneapolisfed.org/research/institute-working-papers/income-and-wealth-inequality-in-america-1949-2016 (accessed on 1 November 2025).
Inequality in China: The Basics. Available online: https://www.csis.org/analysis/how-inequality-undermining-chinas-prosperity#h2-inequality-in-china-the-basics (accessed on 1 November 2025).
Xi Jinping: We Must Adhere to the People-Centred Development Philosophy. Friends of Socialist China. Available online: https://socialistchina.org/2025/05/02/xi-jinping-we-must-adhere-to-the-people-centred-development-philosophy/ (accessed on 1 November 2025).
India Ranks 4th Globally in Income Equality, Shows World Bank Data. Available online: https://timesofindia.indiatimes.com/india/india-ranks-4th-globally-in-income-equality-shows-world-bank-data/articleshow/122272852.cms (accessed on 1 November 2025).
What Is the State of Inequality in India?—The Hindu. Available online: https://www.thehindu.com/business/Economy/what-is-the-state-of-inequality-in-india/article69805101.ece (accessed on 1 November 2025).
Living Conditions in Europe–Income Distribution and Income Inequality—Eurostat. Available online: https://ec.europa.eu/eurostat/statistics-explained/index.php/Living_conditions_in_Europe_-_income_distribution_and_income_inequality (accessed on 1 November 2025).
Russia Economic Report—World Bank. Available online: https://www.worldbank.org/en/country/russia/publication/rer (accessed on 1 November 2025).
GDP Per Capita (Current US$)—China, India, United States, Russian Federation, European Union. Available online: https://data.worldbank.org/indicator/NY.GDP.PCAP.CD?end=2024&locations=CN-IN-US-RU-EU&start=1960&view=chart (accessed on 1 November 2025).
GDP Per Capita, PPP (Current International $)—China, India, United States, Russian Federation, European Union. Available online: https://data.worldbank.org/indicator/NY.GDP.PCAP.PP.CD?end=2024&locations=CN-IN-US-RU-EU&start=1960&view=chart (accessed on 1 November 2025).
Lewis, D.K. The History of Argentina; Bloomsbury Publishing: New York, NY, USA, 2014. [Google Scholar]
Argentina After World War II: From Peronism to Dictatorship. Available online: https://oercommons.org/courseware/lesson/88088/student-old/?task=2 (accessed on 1 November 2025).
Little, W. Party and State in Peronist Argentina, 1945–1955. Hisp. Am. Hist. Rev. 1973, 53, 644–662. [Google Scholar] [CrossRef][Green Version]
Katie, A. Dirty War Argentina [1976–1983]. Britannica. Available online: https://www.britannica.com/event/Dirty-War-Argentina (accessed on 1 November 2025).
Pion-Berlin, D. Argentina: The Journey from Military Intervention to Subordination. Oxf. Res. Encycl. Politics 2020. [Google Scholar] [CrossRef]
Romero, L.A. A History of Argentina in the Twentieth Century, Updated and revised ed.; Penn State Press: University Park, PA, USA, 2013. [Google Scholar]
Spruk, R. The rise and fall of Argentina. Lat. Am. Econ. Rev. 2019, 28, 16. [Google Scholar] [CrossRef]
Ferre, J.C. The Rise of Javier Milei and the Emergence of Authoritarian Liberalism in Argentina. Lat. Am. Res. Rev. 2025, 60, 965–976. [Google Scholar] [CrossRef]
Civilian Casualties After the Air Attack and Massacre on Plaza de Mayo, June 1955. Available online: https://en.wikipedia.org/wiki/Revoluci%C3%B3n_Libertadora#/media/File:Plaza-Mayo-bombardeo-1955.JPG (accessed on 1 November 2025).
Cordobazo. General Strike in Protest Against the Political and Economic Decisions of the Military Dictatorship. Bulevar San Juan, Córdoba Capital. In That Place Was Murdered That Day, the SMATA Worker Máximo Mena. Available online: https://en.wikipedia.org/wiki/Cordobazo#/media/File:Cordobazo.jpg (accessed on 1 November 2025).
Fernando Luiz, L.; Koury, A.P. In the beginning, There Was land. Street Matters: A Critical History of Twentieth-Century Urban Policy in Brazil; University of Pittsburgh Press: Pittsburgh, PA, USA, 2022; pp. 19–36. [Google Scholar] [CrossRef]
Carter, M. Social Inequality, Agrarian Reform, and Democracy in Brazil. Available online: https://static1.squarespace.com/static/5bbd787251f4d47ff1881d9b/t/5cdc38b4a4222fbfda5e7e86/ (accessed on 1 November 2025).
Fischer, B. Favelas and Politics in Brazil, 1890–1960. In Oxford Research Encyclopedia of Latin American History; Oxford University Press: Oxford, UK, 2019. [Google Scholar] [CrossRef]
Brazil Profile—Timeline. BBC. 3 January 2019. Available online: https://www.bbc.com/news/world-latin-america-19359111 (accessed on 1 November 2025).
Panoramic View of Rio’s Rocinha Favela. Wikipedia. Available online: https://en.wikipedia.org/wiki/Favela#/media/File:1_rocinha_panorama_2014.jpg (accessed on 1 November 2025).
Library of Congress. Brazil-US Relations. Military Dictatorship (1964–1985). Available online: https://guides.loc.gov/brazil-us-relations/military-dictatorship (accessed on 1 November 2025).
Talarico, A. Deeply Divided Brazil. Available online: https://www.beyondintractability.org/casestudy/deeply-divided-brazil (accessed on 1 November 2025).
Klein, H.S.; Luna, F.V. Brazil: An Economic and Social History from Early Man to the 21st Century. Hisp. Am. Hist. Rev. 2024, 104, 692–694. [Google Scholar] [CrossRef]
Michener, G.; Pereira, C. A Great Leap Forward for Democracy and the Rule of Law? Brazil’s Mensalão Trial. J. Lat. Am. Stud. 2016, 48, 477–507. [Google Scholar] [CrossRef]
The Editors of Encyclopaedia Britannica. “Apartheid”. Encyclopedia Britannica. 17 September 2025. Available online: https://www.britannica.com/topic/apartheid (accessed on 25 October 2025).
South African History Online. A History of Apartheid in South Africa. South African History Online. 6 May 2016. Available online: https://sahistory.org.za/article/history-apartheid-south-africa (accessed on 1 November 2025).
Larson, Z. South Africa: Twenty-Five Years Since Apartheid. Origins. Current Events in Historical Perspective. Available online: https://origins.osu.edu/article/south-africa-mandela-apartheid-ramaphosa-zuma-corruption (accessed on 1 November 2025).
Mandela, N. Long Walk to Freedom: The Autobiography of Nelson Mandela; Hachette UK: London, UK, 2008; Available online: https://books.google.gr/books?hl=en&lr=&id=jc41AQAAQBAJ&oi=fnd&pg (accessed on 1 November 2025).
Netshitenzhe, J. Inequality matters: South African trends and interventions. New Agenda S. Afr. J. Soc. Econ. Policy 2014, 53, 8–13. Available online: https://www.ajol.info/index.php/na/article/view/111806/101572 (accessed on 1 November 2025).
Lawal, S. South Africa: 30 Years After Apartheid, What Has Changed? Al Jazeera 27 April 2024. Available online: https://www.aljazeera.com/news/2024/4/27/south-africa-30-years-after-apartheid-what-has-changed (accessed on 1 November 2025).
Makgetla, N. Inequality in South Africa: An Overview. September 2020. Available online: https://tips.org.za/images/TIPS_Working_Paper_Inequality_in_South_Africa_An_Overview_September_2020.pdf (accessed on 1 November 2025).
United Nations. Corruption & Economic Crime. Available online: https://dataunodc.un.org/dp-crime-corruption-offences (accessed on 1 November 2025).
World Bank Group. International Homicides (Per 100,000 People)—South Africa. Available online: https://data.worldbank.org/indicator/VC.IHR.PSRC.P5?locations=ZA (accessed on 1 November 2025).
United Nations. International Homicide. Available online: https://dataunodc.un.org/dp-intentional-homicide-victims (accessed on 1 November 2025).
Global Organized Crime Index. Available online: https://ocindex.net/country/south_africa (accessed on 1 November 2025).
Bulgaria Country Profile. BBC. Available online: https://www.bbc.com/news/world-europe-17202996 (accessed on 1 November 2025).
The History of Communism in Bulgaria. Available online: https://openendedsocialstudies.org/2018/01/11/the-history-of-communism-in-bulgaria/ (accessed on 1 November 2025).
Todorov, A. The State of the Right: Bulgaria. Fondation Pour L’innovation Politique (Fondapol). Available online: https://www.fondapol.org/en/study/the-state-of-the-right-bulgaria/ (accessed on 1 November 2025).
Zmigrodzki, M. Issues facing the transformation of the political system in Bulgaria. Glob. Econ. Rev. 1992, 21, 95–108. [Google Scholar] [CrossRef]
Hill, R.J.; White, S. Referendums in Russia, the Former Soviet Union and Eastern Europe. In Referendums Around the World; Qvortrup, M., Ed.; Palgrave Macmillan: London, UK, 2014. [Google Scholar] [CrossRef]
Apartment Block in District of Sveta Troitsa, Sofia, Bulgaria. Available online: https://commons.wikimedia.org/wiki/File:Apartment_block_in_district_of_Sveta_Troitsa,_Sofia,_Bulgaria.jpg (accessed on 1 November 2025).

Figure 1. Visual comparison of the exact Lorenz curves for the PBF distribution function with four different parameter sets (shown in each of the four panels) with the approximation of Equation (30). Except for the Pareto case, the

ξ

parameter was determined to maximize entropy for a chosen

ζ

parameter.

Figure 1. Visual comparison of the exact Lorenz curves for the PBF distribution function with four different parameter sets (shown in each of the four panels) with the approximation of Equation (30). Except for the Pareto case, the

ξ

parameter was determined to maximize entropy for a chosen

ζ

parameter.

Figure 2. Visual comparison of the exact Lorenz curves for the Dagum distribution function with four different parameter sets (shown in each of the four panels) with the approximation of Equation (30). Except for the upper-left case, the

ξ

parameter was determined to maximize entropy for a chosen

ζ

parameter.

Figure 2. Visual comparison of the exact Lorenz curves for the Dagum distribution function with four different parameter sets (shown in each of the four panels) with the approximation of Equation (30). Except for the upper-left case, the

ξ

parameter was determined to maximize entropy for a chosen

ζ

parameter.

Figure 3. Visual comparison of statistical characteristics of two distribution functions, PBF and Dagum, with parameters as shown in the legend, having the same K-spread coefficient (Gini index),

K_{2} / μ = 0.43

, and different standardized entropies

Φ_{μ} =

0.88 and 0.97, respectively. (Upper left) Lorenz curves, which are very similar for the two distributions; (upper right) distribution functions plotted in the form of the variable

x

vs. the odds function

F (x) / (1 - F (x))

, which shows the substantially different behaviour of the distributions, especially in the tails; (lower) probability density function on (left) linear and (right) logarithmic plots.

Figure 3. Visual comparison of statistical characteristics of two distribution functions, PBF and Dagum, with parameters as shown in the legend, having the same K-spread coefficient (Gini index),

K_{2} / μ = 0.43

, and different standardized entropies

Φ_{μ} =

0.88 and 0.97, respectively. (Upper left) Lorenz curves, which are very similar for the two distributions; (upper right) distribution functions plotted in the form of the variable

x

vs. the odds function

F (x) / (1 - F (x))

, which shows the substantially different behaviour of the distributions, especially in the tails; (lower) probability density function on (left) linear and (right) logarithmic plots.

Figure 4. Detailed graphs of economic indicators of Bulgaria in 1971 (clockwise from upper left): Lorenz curve; odds function; probability density function; and graph of K-centre and K-spread vs. K-moment order, where for reference the theoretical curves of the ME exponential distribution are also plotted in dotted lines.

Figure 5. Visualization of the behaviour of the bounded exponential distribution, resulting from maximization of entropy for fixed mean

μ

and upper bound

Ω

: (left) two instances of the probability density function for the indicated values of the upper bound and for scale parameter

λ = 1

; (right) variation of standardized mean, K-spread coefficient (Gini index) and standardized entropy, for varying upper bound

Ω / λ

.

Figure 5. Visualization of the behaviour of the bounded exponential distribution, resulting from maximization of entropy for fixed mean

μ

and upper bound

Ω

: (left) two instances of the probability density function for the indicated values of the upper bound and for scale parameter

λ = 1

; (right) variation of standardized mean, K-spread coefficient (Gini index) and standardized entropy, for varying upper bound

Ω / λ

.

Figure 6. Maximum entropy vs. K-spread curve: maximum standardized entropy

Φ_{μ}

that is feasible for a specified K-spread coefficient

K_{2} / μ

(Gini index). A particular state, defined as a point (

K_{2} / μ, Φ_{μ}

) is feasible only if it lies below this curve. The exact curve corresponds to the generalized half logistic (GHL) distribution, while different approximations (practically indistinguishable from the exact curve) are also plotted. Purple dashed lines show the boundaries between the partitioned areas.

Figure 6. Maximum entropy vs. K-spread curve: maximum standardized entropy

Φ_{μ}

that is feasible for a specified K-spread coefficient

K_{2} / μ

(Gini index). A particular state, defined as a point (

K_{2} / μ, Φ_{μ}

) is feasible only if it lies below this curve. The exact curve corresponds to the generalized half logistic (GHL) distribution, while different approximations (practically indistinguishable from the exact curve) are also plotted. Purple dashed lines show the boundaries between the partitioned areas.

Figure 7. Approximations of the maximum entropy vs. K-spread curve: (left) approximation errors, defined as differences of the standardized entropy derived from approximations minus the exact values of the ME GHL distribution; (right) tail indices of the approximating distributions.

Figure 8. Maximum standardized entropy, as a function of the K-spread coefficient, attained by the PBF and Dagum distributions when (left) the lower tail index

ζ

is specified to the values shown in the legend and (right) the upper tail index

ξ

is specified to the values shown in the legend. Purple dashed lines show the boundaries between the partitioned areas.

Figure 8. Maximum standardized entropy, as a function of the K-spread coefficient, attained by the PBF and Dagum distributions when (left) the lower tail index

ζ

is specified to the values shown in the legend and (right) the upper tail index

ξ

is specified to the values shown in the legend. Purple dashed lines show the boundaries between the partitioned areas.

Figure 9. Contour plot of the attained maximum standardized entropy

Φ_{μ}

by the PBF and the Dagum distributions as a function of the lower and upper tail index

ζ, ξ

, respectively. The two lines plotted in white separate the total area into four parts; in two, noted as “PBF”, the maximum is attained by the PBF distribution, while in the other two, noted as “Dagum”, the maximum is attained by the Dagum distribution. One of the two boundary lines depicts the log-logistic distribution, which is a special case of both the PBF and the Dagum distributions. The other boundary curve is derived after a systematic numerical investigation. The exponential, Pareto, and Weibull distributions, all of which are special cases of the PBF distribution, are also shown.

Figure 9. Contour plot of the attained maximum standardized entropy

Φ_{μ}

by the PBF and the Dagum distributions as a function of the lower and upper tail index

ζ, ξ

, respectively. The two lines plotted in white separate the total area into four parts; in two, noted as “PBF”, the maximum is attained by the PBF distribution, while in the other two, noted as “Dagum”, the maximum is attained by the Dagum distribution. One of the two boundary lines depicts the log-logistic distribution, which is a special case of both the PBF and the Dagum distributions. The other boundary curve is derived after a systematic numerical investigation. The exponential, Pareto, and Weibull distributions, all of which are special cases of the PBF distribution, are also shown.

Figure 10. Detailed graphs of economic indicators in the USA in 2022 (clockwise from upper left): Lorenz curve; odds function; probability density function; and graph of K-centre and K-spread vs. K-moment order, where, for reference, the theoretical curves of the ME exponential distribution are also plotted in dotted lines.

Figure 11. Detailed graphs of economic indicators in China in 2022 (clockwise from upper left): Lorenz curve; odds function; probability density function; and graph of K-centre and K-spread vs. K-moment order, where, for reference, the theoretical curves of the ME exponential distribution are also plotted in dotted lines.

Figure 12. Detailed graphs of economic indicators of the World (more specifically, 69 countries from which data for 2022 are available, with a total population of 5.4 billion), (clockwise from upper left): Lorenz curve; odds function; probability density function; and graph of K-centre and K-spread vs. K-moment order, where, for reference, the theoretical curves of the ME exponential distribution are also plotted in dotted lines.

Figure 13. Characteristic graphs for the examined large population countries with data availability in 2022 (clockwise from upper left): standardized entropy vs. K-spread coefficient (Gini index), plotted alongside the maximum entropy vs. K-spread curve (purple dashed lines show the boundaries between the partitioned areas); distance from the pole of maximum entropy; GDP per capita [28]; GDP-PPP per capita, [29]. The inset in the upper left graph contains the entire range as in Figure 6. Five major geopolitical players are flagged. South Africa is included as one of the case studies (see below) even though the latest available data are for 2017.

Figure 15. Characteristic graphs for the evolution of major economic indices in Argentina: (left): standardized entropy vs. K-spread coefficient (Gini index), plotted alongside the maximum entropy vs. K-spread curve; (right) distance from the pole of maximum entropy with the grey lines indicating the times of coups d’état. Purple dashed lines show the boundaries between the partitioned areas.

Figure 16. Panoramic view of Rio’s Rocinha favela, contrasted by high-rise buildings (condominiums) near the coast (of South Atlantic Ocean) in São Conrado [44].

Figure 17. Characteristic graphs for the evolution of major economic indices in Brazil: (left): standardized entropy vs. K-spread coefficient (Gini index), plotted alongside the maximum entropy vs. K-spread curve; (right) distance from the pole of maximum entropy. Purple dashed lines show the boundaries between the partitioned areas.

Figure 18. Characteristic graphs for the evolution of major economic indices in South Africa: (left): standardized entropy vs. K-spread coefficient (Gini index), plotted alongside the maximum entropy vs. K-spread curve; (right) distance from the pole of maximum entropy. Purple dashed lines show the boundaries between the partitioned areas.

Table 1. Distribution functions used in this study and their main characteristics *.

$Distribution, F (x)$	$Density, f (x)$	$K-Moments, K_{p}^{'}$ $and / or {\bar{K}}_{p}^{'}$	$Mean, μ$	$K-Spread Coefficient, K_{2} / μ$	$Standardized Entropy, Φ_{μ}$
Exponential, $1 - e^{- x / λ}$	$\frac{1}{λ} \bar{F} (x)$	$K_{p}^{'} = λ H_{p}, {\bar{K}}_{p}^{'} = \frac{λ}{p}$	$λ$	$\frac{1}{2}$	$1$
Bounded exponential, $\frac{1 - e^{- x / λ}}{1 - e^{- Ω / λ}}$	$\frac{1}{λ} \frac{1}{e^{x / λ} - 1} F (x)$	$K_{p}^{'} = Ω - λ {(1 - e^{- \frac{Ω}{λ}})}^{- p} B_{1 - e^{- Ω / λ}} (p + 1, 0)$	$λ + \frac{Ω}{1 - e^{Ω / λ}}$	$\frac{1}{2} (\frac{Ω / λ}{Ω / λ - e^{Ω / λ} + 1} + \coth (\frac{Ω}{2 λ}))$	$1 + \frac{Ω / λ}{1 - e^{Ω / λ}} + \ln (\frac{2 - 2 \cosh (Ω / λ)}{Ω / λ - e^{Ω / λ} + 1})$
Logistic, $1 - \frac{1}{1 + e^{- ς + x / λ}}$	$\frac{e^{- ς + x / λ}}{λ} \bar{F} {(x)}^{2}$	$K_{p}^{'} = λ (H_{p - 1} + ς)$	$λ ς$	$\frac{1}{ς}$	$2 - \ln (ς)$
GHL, $1 - \frac{1}{1 - e^{- ς} + e^{- ς + x / λ}}$	$\frac{ς e^{- ς + x / λ}}{λ} \bar{F} {(x)}^{2}$	$K_{p}^{'} = λ (H_{p} + ς + \frac{B_{1 - e^{ς}} (p + 1, 0)}{{(1 - e^{ς})}^{p}})$	$\frac{λ ς}{1 - e^{- ς}}$	$\frac{1}{ς} (1 - \frac{ς}{e^{ς} - 1})$	$2 - \frac{ς}{1 - e^{- ς}} - \ln (\frac{ς}{e^{ς} - 1})$
Pareto ( $ζ = 1$ ), ${1 - (1 + ξ \frac{x}{λ})}^{- \frac{1}{ξ}}$	$\frac{1}{λ + ξ x} \bar{F} (x)$	$K_{p}^{'} = \frac{λ}{ξ} (p B (p, 1 - ξ) - 1), {\bar{K}}_{p}^{'} = \frac{λ}{p - ξ}$	$\frac{λ}{1 - ξ}$	$\frac{1}{2 - ξ}$	$1 + ξ + \ln (1 - ξ)$
Weibull ( $ξ = 0$ ), $1 - \exp (- {(\frac{x}{λ})}^{ζ})$	$\frac{ζ}{λ} {(\frac{x}{λ})}^{ζ - 1} \bar{F} (x)$	${\bar{K}}_{p}^{'} = λ p^{- 1 / ζ} Γ (1 + \frac{1}{ζ})$	$\frac{λ}{ζ} Γ (\frac{1}{ζ})$	$1 - 2^{- 1 / ζ}$	$1 + (1 - \frac{1}{ζ}) γ - l n Γ (\frac{1}{ζ})$
PBF, $1 - {(1 + ζ ξ {(\frac{x}{λ})}^{ζ})}^{- \frac{1}{ζ ξ}}$	$\frac{ζ / x}{{(\frac{x}{λ})}^{- ζ} + ζ ξ} \bar{F} (x)$	${\bar{K}}_{p}^{'} = \frac{λ p}{{(ζ ξ)}^{1 / ζ}} B (1 + \frac{1}{ζ}, \frac{p}{ζ ξ} - \frac{1}{ζ})$	As ${\bar{K}}_{1}^{'}$	$1 - \frac{B (\frac{1}{ζ}, \frac{2 - ξ}{ζ ξ})}{B (\frac{1}{ζ}, \frac{1 - ξ}{ζ ξ})}$	$1 + ζ ξ + \ln (ζ ξ) + (1 - \frac{1}{ζ}) (ψ (\frac{1}{ζ ξ}) + γ) - l n B (\frac{1}{ζ}, \frac{1 - ξ}{ζ ξ})$
Dagum, ${(1 + \frac{1}{ζ ξ} {(\frac{x}{λ})}^{- \frac{1}{ξ}})}^{- ζ ξ}$	$\frac{ζ / x}{{(\frac{x}{λ})}^{1 / ξ} + ζ ξ} F (x)$	$K_{p}^{'} = λ p {(ξ ζ)}^{1 - ξ} B (1 - ξ, p ζ ξ + ξ)$	As $K_{1}^{'}$	$\frac{2 B (1 - ξ, 2 ζ ξ + ξ)}{B (1 - ξ, ζ ξ + ξ)} - 1$	$1 + \frac{1}{ζ ξ} - \ln (ζ^{2} ξ) + (1 + ξ) (ψ (ζ ξ) + γ) - l n B (1 - ξ, ζ ξ + ξ)$
Log-logistic, $1 / (1 + {(\frac{x}{λ})}^{- ζ})$	$\frac{ζ / x}{{(\frac{x}{λ})}^{ζ} + 1} F (x)$	$K_{p}^{'} = λ p B (1 - \frac{1}{ζ}, p + \frac{1}{ζ})$ ${\bar{K}}_{p}^{'} = λ p B (1 + \frac{1}{ζ}, p - \frac{1}{ζ})$	$\frac{π λ}{ζ \sin \frac{π}{ζ}}$	$\frac{1}{ζ}$	$\ln (\frac{1}{π} \sin \frac{π}{ζ})$

* Clarifications of symbols:

γ = 0.577216

is the Euler’s constant;

Γ (\cdot)

and

B (\cdot, \cdot)

are the gamma and beta functions;

ψ (\cdot)

is the digamma function;

H_{p}

is the pth harmonic number;

λ

is a scale parameter;

ξ

and

ζ

are the upper and lower tail indices, respectively;

ς

is a shape parameter. The support of the bounded exponential distribution is

(0, Ω)

. The support of the logistic distribution is

(- \infty, \infty)

and its upper and lower tail indices are

ξ = ξ^{'} = 0

. For all other distributions, the support is

(0, \infty)

and the upper and lower tail indices are

ξ a n d ζ

, respectively; when expressions do not include either of the two, their values are

ξ = 0

,

ζ = 1

. Note that when

ξ = 1 / ζ

the PBF and Dagum distributions yield the log-logistic.

Table 2. Characteristic indices for the economies of the major countries and composite entities in 2022.

Country, Year	$K_{2} / μ$	$Φ_{μ}$	$d_{p}$	$D_{10} / D_{2}$	$ξ$	$ζ$
World	0.57	0.91	0.11	3.11	0.41	1.26
China	0.44	0.89	0.12	2.97	0.34	1.98
Colombia	0.56	0.87	0.14	3.31	0.53	1.67
Brazil	0.48	0.86	0.14	3.22	0.49	1.86
India	0.49	0.85	0.15	3.13	0.50	1.02
Bangladesh	0.50	0.83	0.17	3.44	0.69	2.59
Mexico	0.44	0.80	0.21	3.23	0.53	2.45
USA	0.40	0.79	0.23	3.08	0.41	2.31
Turkey	0.46	0.75	0.25	3.39	0.72	2.29
Italy	0.36	0.75	0.29	2.94	0.34	2.25
European Union	0.34	0.74	0.30	3.00	0.35	1.70
Indonesia	0.38	0.71	0.31	3.04	0.33	2.11
Viet Nam	0.39	0.71	0.31	3.14	0.41	2.65
Iran	0.36	0.72	0.32	3.03	0.41	2.24
Russian Federation	0.35	0.72	0.32	2.97	0.31	3.48
South Africa (2017)	0.68	0.72	0.33	3.45	0.55	2.24
United Kingdom	0.33	0.69	0.35	2.98	0.33	3.01
France	0.32	0.66	0.38	3.05	0.41	2.98
Germany	0.31	0.65	0.40	2.98	0.32	3.51
Bulgaria (1971)	0.23	−0.67	1.69	3.34	0.20	5.84

Note: All data are for the year 2022 except South Africa (2017). Bulgaria (1971), also shown in Figure 4, is included as an extreme case for comparison. The lowest or highest values among countries of the indices that favour equality or stability are highlighted in bold and those disfavouring them in italics, namely equality is manifested by low

K_{2} / μ, D_{10} / D_{2}, ξ

and high

ζ

, while stability is reflected in high

Φ_{μ}

and low

d_{p}

.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Koutsoyiannis, D.; Sargentis, G.-F. Trade-Off Between Entropy and Gini Index in Income Distribution. Entropy 2026, 28, 35. https://doi.org/10.3390/e28010035

AMA Style

Koutsoyiannis D, Sargentis G-F. Trade-Off Between Entropy and Gini Index in Income Distribution. Entropy. 2026; 28(1):35. https://doi.org/10.3390/e28010035

Chicago/Turabian Style

Koutsoyiannis, Demetris, and G.-Fivos Sargentis. 2026. "Trade-Off Between Entropy and Gini Index in Income Distribution" Entropy 28, no. 1: 35. https://doi.org/10.3390/e28010035

APA Style

Koutsoyiannis, D., & Sargentis, G.-F. (2026). Trade-Off Between Entropy and Gini Index in Income Distribution. Entropy, 28(1), 35. https://doi.org/10.3390/e28010035

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Trade-Off Between Entropy and Gini Index in Income Distribution

Abstract

1. Introduction

2. Data

3. Methods

3.1. Basic Stochastic Tools

3.1.1. Distribution Function and Relative Concepts; Expectation and Moments

3.1.2. Entropy and Standardized Entropy

3.1.3. K-Moments

3.1.4. Specific Distribution Functions and Tail Indices

3.2. The Lorenz Curve and the Gini Index

3.3. Maximum Entropy Distributions

3.3.1. Unconstrained Bounded Variables

3.3.2. Constrained Mean

3.3.3. Constrained Mean and K-Spread

3.3.4. Notes on the Tail Indices

4. Application

4.1. General Setting

4.2. The Status of the Major Countries in 2022

4.3. A Brief Political History and the Evolution of Economic Indices in Specific Countries

4.3.1. Argentina

4.3.2. Brazil

4.3.3. South Africa

4.3.4. Bulgaria

5. Discussion and Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

Appendix A. Mathematical Derivations

Appendix A.1. Estimation of K-Moments for Data That Are Grouped in Percentiles

Appendix A.2. Derivations About the Lorenz Curve and the Gini Index

Appendix A.3. Maximum Entropy Distribution for Fixed Mean and Second K-Moment

Appendix B. Data Processing

Appendix B.1. Compilation of Datasets for Multi-Country Entities

Appendix B.2. Assignment of Distribution Function Values for the Observed Sample Values

Appendix B.3. Calculation of Empirical Entropy

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI