2.1. The Metalog Distribution Method
Let x represent the observable data used to measure the individual’s position in the social scale, such as individual income. We define the cumulative probability function (CPF) as , where is the fraction of the population with values below x. Thus the fraction of the population between and (where ) is given by . The cumulative probability function satisfies and , where is the maximum observed value within the group.
The quantile probability function (QPF), denoted as , is defined as the inverse function of the cumulative probability function . It provides the value of the observable data corresponding to a given fraction (y) of the population, such that , .
For a tiny interval
corresponding to a tiny interval
, there exists a function
that maps one interval into the other:
. This function
is the probability density function (PDF) and represents the likelihood of finding individuals within the interval between
x and
:
Therefore, as indicated by Equation (
1), once the quantile probability function is derived from the survey data, its derivative yields the probability density function of the distribution.
A sigmoidal distribution with a cumulative probability function
is associated with bell-shaped probability density function (PDF), denoted as
. We refer to this PDF as
. The function
describes the distribution of the observable (variable
x) around
with a spread,
s. The probability
y of observing a value of
is
Similarly, the probability density function (PDF) for the distribution of variable
y is the derivative of the quantile probability function (QPF) given in Equation (
1), which is also known as the quantile density function (q-PDF).
In this study, we employed the sigmoidal metalogistic probability function (metalog distribution) [
26] due to its flexibility in shaping the distribution. Introduced in 2016, the metalog distribution has been widely applied across various fields. It extends the logistic distribution, offering enhanced adaptability for modeling diverse data patterns:
with a power series expansion for location (
) and scale parameters (
s):
The expansions in Equation (
4) use the property that, for a sigmoidal distribution with a cumulative probability function
, the function
y is approximately linear in
x within a small neighborhood around
. Alternatively, if we treat
and
s as functions of
y, they can be expressed as power series expansions around fixed values
and
. This approach serves as the foundation for defining the metalogistic distribution. Note that Equation (
4) do not represent a polynomial expansion of the cumulative probability function itself; rather, they describe expansions for the location and scale parameters of the quantile probability function. The fitting parameters
are utilized in the expansions for
and
s. We present these expansions in the following traditional sequence: the first parameter is for
, the second and third parameters are for
s, the fourth and fifth parameters are for
, and from the sixth parameter onward, they alternate between
and
s, continuing up to the
k-th parameter.
We hypothesize that class boundaries are likely indicated by changes in the trend of the probability density function (PDF), either increasing or decreasing. It is natural to associate these boundaries with inflection points in the PDF, where the second derivative of the PDF is zero.
Associating class boundaries with the zeros of the second derivative of the probability density function (PDF) evokes concepts from the Lee–Yang theory [
27,
28], which is used to describe phase transitions in the thermodynamic limit. In this context, the number of particles and the system volume approach infinity while the particle density remains finite. A phase transition occurs when there is a sudden change—whether continuous or discontinuous—in a system property. The Lee–Yang theory links phase transitions to the zeros of the partition function in the complex plane [
29,
30].
The PDF is relevant primarily for very dense systems or in the thermodynamic limit of finite systems. The zeros of the second derivative of the PDF indicate sudden changes—whether continuous or discontinuous—in the characteristics of classes, thereby establishing the boundaries of these classes on the social scale. The Lee–Yang theory has broad applications in various fields, including protein folding, complex networks, and percolation [
31,
32,
33,
34,
35,
36,
37,
38,
39].
2.2. Case Study: Analyzing Brazilian Income Distribution
As a case study, we examine the per capita income distribution in Brazil, a country with a long history of affirmative action policies. These policies include financial support for low-income families, preferential admission to public universities, access to public services, and eligibility for political party candidacies, among other areas. Brazil has had a minimum income policy for over three decades, with its values and designations varying according to the ruling party.
In Brazil, social classes are commonly categorized into three broad groups based on family income: upper class, middle class, and lower class. However, the economic classification system employed by the Secretariat of Strategic Affairs (SAE) and the Brazilian Association of Research Companies (ABEP) provides a more detailed breakdown. This system further divides these broad categories into more specific classifications, denoted by letters. These classifications are the following:
Class A: A1, A2.
Class B: B1, B2.
Class C: C1, C2.
Class D.
Class E.
Among these classifications, Class A1 represents the highest economic status, characterized by superior quality of life and greater purchasing power. In contrast, Class E signifies the lowest economic status, with lower purchasing power and reduced quality of life. This classification takes into account factors such as family income, assets, and education levels. However, our study focuses solely on income values, excluding other social parameters.
For consistency, we use the Brazilian national currency as the unit of per capita income throughout this work. As of 2024, the minimum wage in Brazil is BRL 1412 (approximately USD 300). However, given the significant presence of the informal labor market in Brazil, the minimum wage may not fully reflect the economic reality for many individuals.
The Brazilian Institute of Geography and Statistics (IBGE) categorizes social classes based on monthly family income into five main groups:
Class A: Above 20 minimum wages (≥BRL 28,240).
Class B: From 10 to 20 minimum wages (BRL 14,120–BRL 28,240).
Class C: From 4 to 10 minimum wages (BRL 5648–BRL 14,120).
Class D: From 2 to 4 minimum wages (BRL 2824–BRL 5648).
Class E: Up to 2 minimum wages (≤BRL 2824).
The classifications mentioned are purely economic and static, focusing on family incomes. It is known that the broad category of the lower class generally includes families with a larger number of members compared to those in the middle and upper classes. In contrast, our criterion focuses on individual income, allowing a dynamic analysis of class boundaries. This approach enables us to examine the movement of individuals between classes and provides a more nuanced understanding of social mobility.
Our case study utilizes data from the Brazilian Institute of Geography and Statistics (IBGE), the Brazilian agency responsible for collecting and analyzing data to inform governmental strategies [
40].
Table 1 presents the
per capita income distribution derived from eleven consecutive surveys conducted between 2012 and 2022. The data are categorized by IBGE into twelve percentiles, each representing 10% of the population, with the exception of the final two percentiles, which cover the 90–95% and 95–99% ranges. The values in the columns represent the highest income within each percentile slice.
The top 1% is not included in the table, as the income distribution in this segment spans a broad range of values, starting from those listed in the last row of
Table 1. This omission would complete the twelve slices of Brazil’s income distribution. It is important to note that the table shows the upper
per capita income for fixed population fractions. Hence, if we overlook the gradual increase in population over the eleven-year period, the number of individuals in each percentile slice remains constant over time.
Table 1 clearly illustrates a striking disparity in income distribution among the population. The extent of this income dispersion is so significant that we had to use a logarithmic scale to ensure a more accurate statistical analysis. Despite Brazil experiencing several decades without major natural or economic disasters, and remaining unaffected by war or sudden political upheavals, the income inequality has persisted. This inequality has endured even as Brazil maintained its position among the world’s top ten largest economies.
Low-income families in Brazil are estimated to number around 30 million out of a total population of approximately 220 million. These families receive support from various social programs provided by the Brazilian government, including the Bolsa Família program, which offers BRL 600 per family, subject to certain conditions. Additional benefits are provided for each child in the family.
Brazil, a Federative Republic, holds presidential elections every four years. The election prior to 2012 was held in 2010, covering the presidential term from 2011 to 2015. The data presented in
Table 1 span from 2012 to 2022, encompassing four presidential terms: 2011–2014 under Luiz Inácio Lula da Silva, 2015–2018 under Dilma Rousseff and Michel Temer, and 2019–2022 under Jair Messias Bolsonaro.