Next Article in Journal
A Theoretical–Methodological Model for Communities Engaged in Alternative Tourism: Integrating the 2030 Agenda and Sustainable Development Structures
Previous Article in Journal
Human Energy Management System (HEMS) for Workforce Sustainability in Industry 5.0
Previous Article in Special Issue
Fostering Sustainable Female Participation in STEM Through Ecological Systems Theory: A Comparative Study in Three African Countries
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multidimensional Educational Inequality in Italy: A Stacking-Based Approach for Gender and Territorial Analysis

1
Department of Economics and Business, University of Genova, 16126 Genova, Italy
2
Faculty of Communications, IULM University, 20122 Milan, Italy
*
Author to whom correspondence should be addressed.
Sustainability 2025, 17(14), 6243; https://doi.org/10.3390/su17146243
Submission received: 31 May 2025 / Revised: 22 June 2025 / Accepted: 27 June 2025 / Published: 8 July 2025

Abstract

This study investigates regional and gender disparities in educational attainment across Italy in 2021, drawing on the Fair and Sustainable Well-being (BES) dataset from ISTAT. By applying cluster analysis and composite indicators—including the Mazziotta–Pareto Index (MPI), geometric and arithmetic means, min-max normalization, and principal component analysis (PCA)—we assess the robustness and consistency of educational performance across regions. A key methodological innovation is the use of the stacking method to ensure comparability between genders. Results show persistent North–South educational divides and a consistent female advantage across all indicators. The paper contributes to Sustainable Development Goals by providing empirical insights into SDG 4 (Quality Education) through measurement of educational inequality and access; SDG 5 (Gender Equality) by highlighting structural advantages of women in educational outcomes; and SDG 10 (Reduced Inequalities) through a territorial analysis of disparities and policy implications. The findings offer both a methodological contribution—by testing multiple aggregation techniques—and a practical tool for policy evaluation, emphasizing the importance of multidimensional and gender-sensitive approaches in achieving educational sustainability.

1. Introduction

The level of education is a crucial indicator of a country’s social and economic development, directly influencing the quality of life, employment opportunities, and the competitiveness of the labor market. An efficient and accessible education system not only promotes individual growth but also contributes to collective progress, reducing social and territorial inequalities [1]. In Italy, educational inequalities between different geographical areas and between genders are a structural issue, with significant impacts on social cohesion and economic growth [2]. Empirical evidence has long documented a significant gap between the Northern and Southern regions of the country: the former has higher education rates, while the latter records higher levels of school dropout and more limited access to tertiary education [3,4]. This imbalance is also reflected in the labor market, generating differentials in terms of employment and remuneration. The causes of these disparities are multiple and attributable to economic, infrastructural, social, and cultural factors, which affect the quality of the training offered and educational opportunities [5]. In this context, the scientific literature has significantly deepened the analysis of educational inequalities, highlighting the importance of adopting methodological tools capable of synthesizing complex phenomena in a reliable manner. The complex and multidimensional nature of socioeconomic phenomena requires the adoption of different measures to analyze and understand them. The measurement process in the social sciences is associated with the construction of systems of indicators, which makes it possible to measure phenomena that would not otherwise be measurable [6]. In this study, we use non-compensatory aggregate indices that allow different aspects of the phenomenon investigated to be included in a single variable. Composite indicators for the measurement of multidimensional phenomena have become very popular in various social, economic, and political fields [7].
In line with the United Nations 2030 Agenda [8], education represents a foundational pillar for sustainable development. This study contributes directly to SDG 4 (Quality Education) by assessing the distribution and quality of educational outcomes; to SDG 5 (Gender Equality) by systematically comparing male and female educational performance; and to SDG 10 (Reduced Inequalities) by exploring territorial disparities in access and outcomes. Understanding these multidimensional gaps is essential for designing inclusive, effective, and evidence-based education policies that leave no one behind. Education is not only a right but also a driver of equity, economic resilience, and civic participation [9]. Reducing educational inequalities strengthens long-term social sustainability by empowering individuals, enhancing human capital, and fostering social mobility [10]. In contexts characterized by persistent territorial imbalances, such as Italy, promoting educational justice is crucial to mitigate systemic vulnerabilities and ensure sustainable regional development.
Despite a growing body of literature on educational disparities, there remains a lack of studies that apply advanced aggregation methods—such as the stacking technique—to explore multidimensional educational inequality with gender disaggregation. Most existing works focus either on single indicators (e.g., dropout rates or tertiary access) or rely on traditional composite measures, without addressing the internal heterogeneity of gender-based educational performance. This study seeks to fill this gap by offering an integrated approach that combines classical aggregation methods with the stacking procedure, allowing for a more robust comparison across population groups.
Moreover, although regional disparities in Italian education have been widely documented, fewer studies have contextualized these patterns in relation to international comparisons. Educational inequalities between regions—such as those observed in Spain [11], Germany [12], or the United States [13]—underscore the global relevance of spatial segmentation in education policy. By situating the Italian case within this broader framework, this study contributes to the global discourse on regional equity and educational justice.
Ivaldi and Ciacci [2] analyze the validity of the Gender Equality Index (GEI), proposed by the European Institute for Gender Equality, to measure gender inequalities in the Italian educational context, highlighting regional disparities in the period 2006–2014. These inequalities can be traced back to multiple factors, including economic conditions, infrastructure, and spatial factors [3].
From a methodological perspective, Montorsi and Gigliarano [5] propose spatial composite indicators built through Bayesian latent factor models in order to assess the territorial well-being of Italian provinces, also integrating the spatial dimension in the measurement. In line with this approach, Mazziotta and Pareto [7] introduce the non-compensatory indicator (MPI) of the same name, frequently adopted to summarize complex socioeconomic dimensions. This index has been applied in various contexts, such as the measurement of urban deprivation [14] and depopulation in Italian municipalities.
Numerous studies highlight how the choice of the summary index can significantly influence the results. For this reason, some authors have compared MPI with alternative methods, including principal component analysis (PCA), which is useful for the reduction of dimensionality [10] and the identification of latent patterns in educational variables [3].
However, an appropriate measurement model must support a PCA-based composite index to function properly [7]. Similarly, cluster analysis represents a widely used technique to segment the territory into homogeneous groups, allowing for highlighting the dichotomy between the North and South and to identify areas with atypical educational performances [14,15].
Studies such as Nardone et al. [4] have highlighted how the polarization between Northern and Southern Italy is not uniform but is articulated in sub-dynamics that also involve cultural, economic, and infrastructural factors. A particularly relevant contribution is provided by Alaimo [9], who uses an adapted version of the MPI, called AMPI, to classify Italian regions according to different areas of well-being, including education, confirming the validity of these tools also for comparative analyses on a national scale.
Finally, the coherence between different aggregation methods can be assessed through Spearman correlation analysis, a tool that allows us to verify the solidity of the results and the non-dependence of the conclusions on the synthesis technique adopted [1]. This comparative approach guarantees greater robustness in the interpretation of the data and contributes to the validation of the methodological choices in the field of measuring educational well-being.
This study aims to analyze the education levels of the Italian regions in the year 2021, distinguishing between men and women, through the use of the Fair and Sustainable Well-being (BES) data published by ISTAT. The analysis involves a segmentation of the regions using cluster analysis techniques applied to a set of educational variables, followed by the calculation of the Mazziotta–Pareto Indicator (MPI) through the application of the stacking method in order to make the index values comparable between the two genders. Furthermore, to test the robustness and consistency of the results obtained, additional aggregation methods were applied: arithmetic mean, geometric mean, min-max normalization, and principal component analysis (PCA) [7,10]. All techniques were adapted to the comparison between genders through stacking to ensure the methodological homogeneity of the approach. The consistency between the results obtained with the different methods was verified through Spearman correlation analysis [1].
The originality of this study lies precisely in the integration between classic aggregative approaches and the stacking technique applied systematically, to allow an accurate comparison between the education levels of the two genders. This approach, in addition to offering an innovative methodological contribution, can support the formulation of educational policies aimed at reducing territorial and gender inequalities in the Italian education system.
By employing multiple aggregation techniques and validating results across methods, this study offers a replicable framework for monitoring educational disparities. This methodological robustness enhances the policy relevance of the findings, supporting data-informed decision-making and the territorial targeting of educational interventions aligned with sustainable development principles.

2. Materials and Methods

The proposed analysis is based on the elaboration of a multidimensional set of educational indicators extracted from the ISTAT database of Fair and Sustainable Well-being (BES) for the year 2021 [16]. The selected variables represent different aspects of the regional education level: (a) population with at least a high school diploma (25–64 years), (b) transition to university, (c) early school dropout, (d) young people studying or working, (e) participation in continuing education, (f) tertiary qualifications in the STEM field, (g) cultural participation outside the home, (h) reading books/newspapers, (i) basic digital skills, and (j) graduates in the 25–34 age group.
Each of these ten variables was selected based on both theoretical relevance and empirical availability. For instance, “people with at least a high school diploma (25–64 years)” indicates the basic human capital and the minimum threshold for inclusion in skilled labor; a study related to the Chinese context uses this variable to analyze the state of the workforce and evaluate whether it is in line with the country’s sustainable development goals [17].
“Non-early school dropout” and “transition to university” capture the accessibility and continuity of the educational path. Asif and Searcy believe that evaluating the performance of higher education institutions can better guide the direction of funding in education [18]. While “STEM qualifications” and “basic digital skills” reflect the alignment of education with labor market demands, which is based on digitalization and innovations [19]. “Cultural participation” and “reading habits” represent broader cognitive and social capital dimensions [20]. “Participation in continuing education” is aligned with SDG4.3, which is the goal of lifelong and sustainable learning.
Table 1 summarizes the variables, descriptions, units of measure, statistical distributions, and data sources for each indicator during the year 2021. This detailed description ensures transparency in the construction of the composite index and allows for critical evaluation of its components.
First, a cluster analysis was carried out, using the K-means algorithm with k = 3, applied to the entire national sample (not distinguished by gender). The choice of k = 3 was guided by both statistical and substantive considerations. We tested several values of k and found that the Calinski–Harabasz and Silhouette coefficients reached optimal values at k = 3, indicating a well-defined clustering structure. Moreover, this solution aligns with the well-established North–Center–South tripartition commonly used in Italian territorial policy analysis, enhancing both interpretability and policy relevance.
The aim was to obtain a segmentation of the Italian regions into homogeneous groups based on the selected educational variables. This procedure made it possible to reduce the information complexity and identify recurring territorial patterns. The analysis was implemented using STATA 18 SE software.
The k-means algorithm minimizes the intra-cluster variance and maximizes the inter-cluster variance, optimizing the following objective function:
J = i = 1 k x j C i x j μ i 2
where
-
k is the default number of clusters;
-
C i is the i-th cluster;
-
x j is the feature vector of region j;
-
μ i is the centroid of the cluster C i ;
-
x j μ i 2 is the Euclidean distance between a point and the cluster centroid.
This first investigation allows us to assign each region to the cluster with the closest centroid and obtain a regional breakdown based on the totality of the explanatory variables of the level of education.
Subsequently, for each cluster, the Mazziotta–Pareto Indicator (MPI) was calculated [21] in the decreasing version (MPI–) for the entire population and, separately, for each gender. In order to guarantee the comparability between male and female values, the stacking method was adopted [22,23,24] following the approach adapted by Bartimoro and Ivaldi [25,26], which allows comparative aggregation between distinct groups (males and females).
The formulation is as follows:
μ j = i = 1 n x i , j , F + i = 1 n x i , j , M     2 n
σ j = i = 1 n ( x i , j , F   μ j ) + i = 1 n ( x i , j , M   μ j ) 2 n
where i = 1, 2, …, 20 are the Italian regions, j = 1, 2, …, 10 are the explanatory variables of education, and F and M are the two genders analyzed.
The MPI is a non-compensating composite index used to aggregate multiple variables into a single synthetic value, maintaining the structure of the original distributions. This method is particularly useful when you want to avoid extreme values of a variable compensate for low values of another. To calculate it, it is necessary to normalize the variables so that they have the same unit of measurement and can be compared with each other, after having performed a correction of their direction that makes them all positively oriented. Among the available aggregation techniques, the Mazziotta–Pareto Index (MPI) offers a non-compensatory approach particularly suited for sustainability analysis, as it penalizes unbalanced performance across dimensions and prevents compensation between poor and strong values. This characteristic is especially relevant when measuring environmental sustainability, where trade-offs may mask systemic weaknesses [21,27].
The synthetic index, for unit i at time t, is then calculated by applying the following formula:
M P I i t + / = M z i t ± S z i t c v i t
where M z i t = j = 1 m z i j t m ; S z i t = j = 1 m ( z i j t M z i t ) 2 m ; c v i t = S z i t M z i t .
The aggregate MPI for each region was obtained by taking the arithmetic mean of the male and female values. The means of the aggregate values within each cluster were used to reorder the classes based on the overall level of education. The same ordering was repeated separately for the two genders, thus allowing direct comparison of the regional hierarchies within the same clustered structure.
A leave-one-out (jack-knife) sensitivity check was performed: the composite index was recalculated after sequentially removing each individual indicator to assess the influence of single variables on the overall result.
To ensure the robustness of the results, the procedure was replicated by adopting additional aggregation methods: arithmetic mean, geometric mean, min-max normalization, and PCA. In each case, the entire process was performed using the stacking methodology, thus ensuring analytical uniformity and comparability between genders. The simultaneous adoption of different techniques allows for the verification of the influence of the chosen methodology on the results obtained.
Finally, the Spearman correlation matrix was calculated between the indices obtained with the different aggregation methods in order to assess the degree of consistency between the measures and identify any discrepancies. A high degree of correlation would confirm the stability of the results and the validity of the conclusions, regardless of the aggregation approach used.
Similar studies investigate complex phenomena through a multidimensional approach. Hassan explores the multidimensional nature of DEI (Diversity, Equity, and Inclusion) with practical measurement tools for cultivating fair, respectful, and inclusive environments across workplaces, educational institutions, and communities [28]. Educational disparities have also been analyzed to improve student success and retention in educational context, considering different factors: clear student guidelines, integrating first-year transition coursework, intrusive academic advising to treat the nonacademic and personal factors, and traditional developmental education coursework and tutoring to address academic factors [29]. Also, in the Malaysian context, a comparison of gender enrolment across three dimensions has been studied to examine the gender disparity in higher education across different higher education institutions [30].

3. Results

This section presents the results obtained from the analysis of Italian regional educational data for the year 2021. The elaborations were carried out by distinguishing by gender and applying the stacking method to make the indicators comparable between the two components of the population.
First, Figure 1 and Figure 2 present the radar graphs for the ten educational variables and MPI in Italy by region, comparing them by gender. The pink lines indicate the values for females, while the blue lines indicate the values for males.
This visualization allows you to immediately grasp the territorial and gender differences in the variables.
The radar graphs also highlight the degree of homogeneity: more regular polygons indicate a more uniform distribution among the regions, while marked deformations suggest specific critical issues in the variable.
After that, the classification of the Italian regions into three homogeneous clusters is reported, obtained through the K-means algorithm and based on the set of educational variables considered. The ordering of the clusters was established on the basis of the average value of the Mazziotta–Pareto indicator (aggregate MPI), calculated as the arithmetic mean between the MPI of females and that of males (Table 2).
We decided to label clusters in increasing order as follows: “Low-performing”, “Intermediate-performing”, and “High-performing”, based on the aggregate MPI values, so in cluster “Low-performing”, the aggregate MPI values vary between a minimum of 86.07 (Sicilia) and a maximum of 97.60 (Molise). In cluster “Intermediate-performing”, the range is between 96.90 (Sardegna) and 101.03 (Trentino-Alto Adige). Finally, cluster “High-performing” records the highest values, with a range from 102.31 (Piemonte) to 107.00 (Friuli-Venezia Giulia), as summarized in Table 2.
For visual support, a map of Italy is presented that graphically highlights the distribution of the regions in the three identified clusters (Figure 3).
Subsequently, all the indicators calculated with the stacking method are reported in Table 3, separately for gender. In particular, the following are presented: the Mazziotta–Pareto indicator (MPI), the geometric mean, the arithmetic mean, the normalized value through min-max scaling, and the score of the first principal component obtained by PCA.
The leave-one-out test confirmed the robustness of the index: omitting any single indicator changed the composite scores by less than 2% and did not alter the regional ranking.
In female MPI values, the range extends from 86.72 (Friuli-Venezia Giulia) to 108.35 (Puglia); for men, the corresponding range goes from 85.43 (Friuli-Venezia Giulia) to 105.72 (Piemonte). The female geometric mean varies between 25.39 (Friuli-Venezia Giulia) and 38.15 (Puglia), while the male mean varies between 25.24 (Friuli-Venezia Giulia) and 36.20 (Puglia). The min-max scaling values range from 0.16 to 1.00 for women and from 0.00 to 0.80 for men.
The female arithmetic mean values range from 35.62 (Friuli-Venezia Giulia) to 48.59 (Piemonte), while the male values range between 34.07 (Friuli-Venezia Giulia) and 45.97 (Piemonte). Finally, the PCA scores range between –1.87 and +1.88 for women and between –2.80 and +0.69 for men.
Table 4 shows the averages of each indicator within the clusters, again broken down by gender. This allows us to observe the average behavior of the indicators within each regional group and verify any systematic patterns between the clusters.
In the Low-performing cluster, the female MPI average is 92.11 against 90.13 for males; in the Intermediate-performing cluster, the averages are 102.17 (females) and 96.87 (males); finally, in the High-performing cluster, the values are 105.02 (females) and 102.61 (males). For the geometric average, the values vary from 27.70 to 35.67 (females) and from 26.86 to 34.42 (males). In the min-max, women range from 0.34 to 0.85 and men from 0.24 to 0.63. The female arithmetic mean grows from 38.91 to 46.63, while the male arithmetic mean grows from 36.67 to 44.25. Finally, the average PCA values rise from –0.89 to +1.28 for women and from –1.85 to +0.18 for men.
Table 5 and Table 6 present the Spearman correlation matrix calculated between all the composite indicators considered. This matrix allows for evaluating the coherence and robustness of the measures adopted, providing an indication of the convergence between the different aggregation techniques applied.
For women, the highest correlations are between MPI and arithmetic mean (ρ = 0.9729) and between MPI and PCA (ρ = 0.9684). For men, the correlation between MPI and arithmetic mean is 0.9970, while that between MPI and PCA is 0.9910. These values indicate a high consistency between the different synthesis methods.

4. Discussion

The results of the analysis confirm the existence of significant territorial and gender disparities in education levels in Italy in 2021. The cluster analysis, conducted with the k-means algorithm, allowed us to group the Italian regions into three homogeneous clusters based on ten educational variables extracted from the BES database. The cluster with the highest scores of the aggregate Mazziotta–Pareto indicator (MPI)—above 102 points—mainly includes regions in the Centre-North, such as Emilia-Romagna (106.16), Friuli-Venezia Giulia (107.00), Lazio (106.42), and Marche (102.32). On the contrary, regions in the South, such as Campania (88.02), Calabria (89.42), and Sicilia (86.07), are placed in the cluster with the lowest values, highlighting a persistent North–South fracture in the distribution of educational opportunities. This result is consistent with the existing literature, including the studies of Nardone et al. [4], which emphasize that the educational gap between north and south is not only expressed in school dropout or access to tertiary education but also in the quality and inclusiveness of the educational offer. These disparities directly impact the acquisition of cognitive and digital skills, undermining equitable human capital development [31].
From the perspective of the United Nations 2030 Agenda [8], these findings raise critical concerns for achieving SDG 4 (Quality Education) and SDG 10 (Reduced Inequalities). Territorial inequalities in education hinder the creation of inclusive and sustainable societies, weakening social cohesion and economic resilience in less advantaged areas.
A second dimension of great importance is gender. The application of the stacking method made it possible to perform a direct and methodologically homogeneous comparison between men and women for each composite indicator. The results show that, across all methodologies used, women consistently score higher than men [32,33,34,35,36,37,38,39]. For example, in the High-performing cluster, the female MPI average reaches 105.02, compared to 102.61 for men. Even in clusters with lower overall performance—such as the Low-performing cluster—women maintain an advantage: 92.11 versus 90.13. Similar trends are observed in other indicators—in geometric mean, PCA scores, and min-max normalization. Even in disadvantaged regions such as Calabria, female scores outperform male ones. This trend supports the goals of SDG 5 (Gender Equality) by evidencing a structural and persistent female advantage in educational outcomes, even in socioeconomically fragile territories. These findings resonate with the conclusions of Ivaldi and Ciacci [2], who demonstrate a long-term improvement in female educational performance across Italian regions. The observed resilience of the female segment in education highlights a paradox when contrasted with the more precarious position that women continue to hold in the labor market, emphasizing the need for integrated, cross-sectoral policies [40,41,42,43,44,45,46,47].
Another relevant aspect concerns the consistency across the synthesis methods adopted. The Spearman correlation matrix, calculated separately by gender, reveals very high coefficients between indicators. For women, the correlation between MPI and arithmetic mean is 0.9729, and 0.9684 with PCA. Among men, the MPI correlates 0.9970 with the arithmetic mean and 0.9910 with PCA. These values confirm the robustness of the analytical framework. As Alaimo and Maggino [1] suggest, strong correlations between results obtained through different techniques enhance the empirical validity and policy relevance of composite indicators. The integration of PCA, a compensatory method, with non-compensatory indices such as MPI is particularly useful in validating latent structures, as also noted by Mazziotta and Pareto [7].
In the context of sustainability, these methodological choices offer a replicable and robust framework for monitoring progress toward inclusive and equitable education, aligned with SDG reporting and local policy needs. The hybridization of classical and innovative approaches—such as stacking—enables more sensitive detection of regional and gender-specific patterns, enriching the decision-making process. Overall, the integration between traditional aggregation approaches and newer methodologies such as stacking has proven particularly effective in producing a robust, comparable analysis of educational inequality in Italy. This work not only offers an accurate picture of existing disparities but also a valuable methodological contribution for future research and sustainable policy design. As highlighted by Montorsi and Gigliarano [5], conventional well-being models risk overlooking spatial dynamics and latent heterogeneity. The application of hybrid methods, such as stacked MPI and PCA, can address this limitation, enhancing both analytical sensitivity and territorial equity—core principles of sustainable development.
The policy implications of this study are closely aligned with the territorial clustering results. For low-performing Southern regions (Cluster Low-performing), policy makers should prioritize early intervention strategies, including investment in school infrastructure, expanded access to early childhood education, and targeted financial support for students at risk of dropout. For intermediate regions (Cluster Intermediate-performing), measures could focus on strengthening transitions from secondary to tertiary education and enhancing vocational pathways, especially in underdeveloped inland areas. In high-performing regions (Cluster High-performing), policy efforts may concentrate on consolidating existing strengths, such as promoting excellence programs, fostering lifelong learning, and addressing gender imbalances in STEM education.
Moreover, the consistent female advantage observed across all indicators calls for integrated gender-sensitive strategies [48]. These should not only recognize women’s educational achievements but also ensure that these are effectively translated into labor market opportunities. Gender mainstreaming in education policy, combined with labor-market reforms, is necessary to close the gap between educational performance and employment outcomes. While this study focuses on descriptive and comparative multidimensional analysis, future developments could incorporate inferential techniques to better understand the causal mechanisms behind regional and gender disparities in education. In particular, the use of regression models—both linear and multilevel—could allow researchers to test the impact of structural factors like household income [49,50,51,52], school infrastructure [30,53], and urban–rural location, as well as psychological or individual factors such as field of study (Wan, 2018), academic self-esteem, and intrinsic vs. extrinsic motivation [51,54,55], on composite educational outcomes. Additionally, structural equation modeling (SEM) could offer a more integrated framework to explore latent constructs and indirect effects among educational determinants. These analytical extensions, while beyond the scope of the present study, would further enrich the explanatory power of the multidimensional approach presented here.

5. Conclusions

This work has provided a detailed and methodologically robust analysis of educational inequalities in Italy, highlighting the disparities that persist between regions and between genders in the year 2021. The integration of cluster analysis and the Mazziotta–Pareto Indicator (MPI), enhanced by the systematic application of the stacking method, allowed for a solid and comparable classification of regional educational performance. The territorial segmentation of Italian regions into three clusters revealed a well-documented North–South divide, reinforcing the need for geographically differentiated policy interventions that can address structural deficiencies in the South and support excellence in the Center and North.
A particularly significant finding is the consistent educational advantage of women across all regions and all synthesis methods adopted. This result, often underemphasized in public discourse, emerges as a structural pattern and not as an isolated trend. It suggests that, at least from an educational perspective, female human capital in Italy is already stronger than male, raising important questions about the mismatch between educational attainment and labor market participation. Bridging this gap requires integrated and gender-sensitive policies, aimed not only at supporting education but also at ensuring the recognition and valorization of women’s skills in professional and societal contexts.
These conclusions speak directly to the objectives of SDG 4 (Quality Education), SDG 5 (Gender Equality), and SDG 10 (Reduced Inequalities). Ensuring equitable access to education, acknowledging the structural resilience of women in the educational sphere, and reducing territorial gaps are key priorities in the achievement of a sustainable, inclusive, and just society.
From a methodological standpoint, the study contributes to the growing literature on the use of composite indicators in educational research. By adopting and comparing multiple aggregation techniques—including arithmetic mean, geometric mean, min-max normalization, and principal component analysis—the analysis confirmed the internal consistency of the findings. The high Spearman correlations between indicators validated the reliability of the approach and its replicability across different territorial and temporal contexts. The application of the stacking method, originally employed in predictive modelling, proved effective in constructing composite indicators for non-overlapping population groups, reinforcing comparability and analytical clarity.
The contributions of Alaimo and Maggino [1] have been particularly valuable in reframing educational well-being as a multidimensional and interconnected system rather than as a sum of isolated indicators. The empirical validation of hybrid aggregative methods consolidates their role as effective tools for evaluating complex phenomena in line with sustainability assessment principles.
Despite its strengths, this study also presents some limitations. Composite indices, even when non-compensatory, may still mask internal imbalances between indicators. Although the use of Spearman correlations and multiple synthesis techniques improves robustness, a full sensitivity analysis of indicator weights could be implemented in future work. In addition, while stacking offers enhanced comparability, it does not address causality. The integration of regression models and SEM frameworks is recommended for future research to deepen the understanding of structural determinants and interactions within educational inequality. Finally, future developments could explore spatially aware composite indicators, such as those built on Bayesian latent factor models, to account for territorial contiguity and spillover effects.
In conclusion, the study not only offers a precise diagnosis of educational inequalities in Italy but also provides practical insights for developing fair and sustainable policy strategies. Reducing regional disparities requires targeted investment in infrastructure, accessibility, and quality of education in disadvantaged territories. Addressing the gender gap demands measures that ensure the full valorization of women’s educational assets, especially in the school-to-work transition.
Finally, the methodology proposed in this study represents a replicable and adaptable tool for monitoring educational equity over time and supporting evidence-based planning. Future research should further explore the integration of spatially sensitive composite indicators, such as those developed by Montorsi and Gigliarano [5], which leverage Bayesian latent factor models to capture territorial contiguity and diffusion dynamics. This line of inquiry holds promise for improving the predictive capacity and local policy relevance of educational sustainability metrics.

Author Contributions

Conceptualization, E.I. and M.D.A.; methodology, E.I.; formal analysis, M.D.A.; writing—original draft preparation, E.I. and M.D.A.; supervision, E.I. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data are available from the authors.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Alaimo, L.S.; Maggino, F. Sustainable development goals indicators at territorial level: Conceptual and methodological issues—The Italian perspective. Soc. Indic. Res. 2020, 147, 383–419. [Google Scholar] [CrossRef]
  2. Ivaldi, E.; Ciacci, A. Measuring Gender Gap in Education in Different Italian Regions under Sustainable Development Framework: Is GEI a Good Tool? In Social Sector Development and Governance; Routledge India: New Delhi, India, 2022; pp. 218–233. [Google Scholar]
  3. Bertolini, P.; Pagliacci, F. Quality of life and territorial imbalances. A focus on Italian inner and rural areas. Bio-Based Appl. Econ. 2017, 6, 183–208. [Google Scholar]
  4. Nardone, P.; Odoardi, I.; Liberatore, A.; D’Ingiullo, D. Where the north–south gap in human capital begins: An analysis of educational outcomes across the Italian regions. J. Urban Reg. Anal. 2023, 15, 191–219. [Google Scholar] [CrossRef]
  5. Montorsi, C.; Gigliarano, C. Spatial comprehensive well-being composite indicators based on Bayesian latent factor model: Evidence from Italian provinces. Soc. Indic. Res. 2024, 175, 347–383. [Google Scholar] [CrossRef]
  6. Alaimo, L.S. The complexity of social phenomena and the construction of indicators. In Measuring Gender Equality; Springer: Cham, Switzerland, 2023; p. 31. [Google Scholar]
  7. Mazziotta, M.; Pareto, A. Principal component analysis for constructing socio-economic composite indicators: Theoretical and empirical considerations. SN Soc. Sci. 2024, 4, 114. [Google Scholar] [CrossRef]
  8. Transforming Our World: The 2030 Agenda for Sustainable Development. Available online: https://sdgs.un.org/2030agenda (accessed on 30 May 2025).
  9. Alaimo, L.S. Complexity of Social Phenomena: Measurements, Analysis, Representations and Synthesis; Sapienza Università Editrice: Rome, Italy, 2022; Volume 127. [Google Scholar]
  10. Jolliffe, I.T.; Cadima, J. Principal component analysis: A review and recent developments. Philos. Trans. R. Soc. A Math. Phys. Eng. Sci. 2016, 374, 20150202. [Google Scholar] [CrossRef]
  11. Education and Culture. Available online: https://www.ine.es/dyngs/INEbase/en/categoria.htm?c=Estadistica_P&cid=1254735573113 (accessed on 20 June 2025).
  12. Bildung, Forschung und Kultur. Available online: https://www.destatis.de/DE/Themen/Gesellschaft-Umwelt/Bildung-Forschung-Kultur/Bildungsstand/_inhalt.html (accessed on 20 June 2025).
  13. Condition of Education 2022. Available online: https://nces.ed.gov/use-work/resource-library/report/compendium/condition-education-2022 (accessed on 20 June 2025).
  14. Ivaldi, E.; Parra Saiani, P.; Primosich, J.J.; Bruzzi, C. Health and deprivation: A new approach applied to 32 Argentinian urban areas. Soc. Indic. Res. 2020, 151, 155–179. [Google Scholar] [CrossRef]
  15. Pratesi, M.; Quattrociocchi, L.; Bertarelli, G.; Gemignani, A.; Giusti, C. Spatial distribution of multidimensional educational poverty in Italy using small area estimation. Soc. Indic. Res. 2021, 156, 563–586. [Google Scholar] [CrossRef]
  16. Gli Indicatori del BES. Available online: https://www.istat.it/statistiche-per-temi/focus/benessere-e-sostenibilita/la-misurazione-del-benessere-bes/gli-indicatori-del-bes/ (accessed on 30 May 2025).
  17. Khor, N.; Pang, L.; Liu, C.; Chang, F.; Mo, D.; Loyalka, P.; Rozelle, S. China’s looming human capital crisis: Upper secondary educational attainment rates and the middle-income trap. China Q. 2016, 228, 905–926. [Google Scholar] [CrossRef]
  18. Asif, M.; Searcy, C. A composite index for measuring performance in higher education institutions. Int. J. Qual. Reliab. Manag. 2014, 31, 983–1001. [Google Scholar] [CrossRef]
  19. Pagani, L.; Argentin, G.; Gui, M.; Stanca, L. The impact of digital skills on educational outcomes: Evidence from performance tests. Educ. Stud. 2016, 42, 137–162. [Google Scholar] [CrossRef]
  20. Rao, S.; Gupta, R.; Gupta, H.K.; Karia, D. Cultivating a Reading Culture: A Design Intervention to Inculcate Reading Habits. In Proceedings of the 2023 IEEE Global Humanitarian Technology Conference (GHTC), San Francisco, CA, USA, 12–15 October 2023; pp. 386–392. [Google Scholar]
  21. Mazziotta, M.; Pareto, A. On a generalized non-compensatory composite index for measuring socio-economic phenomena. Soc. Indic. Res. 2016, 127, 983–1003. [Google Scholar] [CrossRef]
  22. Mazziotta, M.; Pareto, A. A non-compensatory approach for the measurement of the quality of life. In Quality of Life in Italy: Researches and Reflections; Maggino, F., Nuvolati, P., Eds.; Social Indicators Research Series; Springer: Dordrecht, The Netherlands, 2012; pp. 27–40. [Google Scholar]
  23. Norman, P. Identifying change over time in small area socio-economic deprivation. Appl. Spat. Anal. Policy 2010, 3, 107–138. [Google Scholar] [CrossRef]
  24. Landi, S.; Ivaldi, E.; Testi, A. Measuring change over time in socio-economic deprivation and health in an urban context: The case study of Genoa. Soc. Indic. Res. 2017, 139, 745–785. [Google Scholar] [CrossRef]
  25. Bartiromo, M.; Ivaldi, E. The impact of COVID-19 on sustainable development in Europe: A temporal analysis. In COVID-19 Pandemic and Global Inequality: Reflections in Labour Market, Business and Social Sectors; Springer: Singapore, 2023; pp. 25–42. [Google Scholar]
  26. Bartiromo, M.; Ivaldi, E. A gender sustainable development index for Italian regions. In Gender Inequality and Its Implications on Education and Health: A Global Perspective; Emerald Publishing: Bingley, UK, 2023; pp. 247–258. [Google Scholar]
  27. The Climate Action Monitor 2023. Available online: https://www.oecd.org/en/publications/the-climate-action-monitor-2023_60e338a2-en.html (accessed on 20 June 2025).
  28. Hassan, S.T. Measuring Diversity, Equity, and Inclusivity in an Organisation: Scale Development for DEI Index. In Diversity, AI, and Sustainability for Financial Growth; IGI Global: Hershey, PA, USA, 2023. [Google Scholar]
  29. Fowler, P.R.; Boylan, H.R. Increasing student success and retention: A multidimensional approach. J. Dev. Educ. 2010, 34, 2. [Google Scholar]
  30. Wan, C.D. Student enrolment in Malaysian higher education: Is there gender disparity and what can we learn from the disparity? Comp. J. Comp. Int. Educ. 2018, 48, 244–261. [Google Scholar] [CrossRef]
  31. Cotter, D.A.; Hermsen, J.M.; Vanneman, R. Gender Inequality at Work; Russell Sage Foundation: New York, NY, USA, 2004; p. 110. [Google Scholar]
  32. Morris, L.V. Women in higher education: Access, success, and the future. Innov. High. Educ. 2011, 36, 145–147. [Google Scholar] [CrossRef]
  33. Parvazian, S.; Gill, J.; Chiera, B. Higher education, women, and sociocultural change: A closer look at the statistics. SAGE Open 2017, 7, 2158244017700230. [Google Scholar] [CrossRef]
  34. Van Bavel, J.; Schwartz, C.R.; Esteve, A. The reversal of the gender gap in education and its consequences for family life. Annu. Rev. Sociol. 2018, 44, 341–360. [Google Scholar] [CrossRef]
  35. DiPrete, T.A.; Buchmann, C. Gender disparities in educational attainment in the new century: Trends, causes and consequences. In US2010: America in the First Decade of the New Century; Russell Sage Foundation: New York, NY, USA, 2013. [Google Scholar]
  36. England, P.; Allison, P.; Li, S.; Mark, N.; Thompson, J.; Budig, M.J.; Sun, H. Why are some academic fields tipping toward female? The sex composition of US fields of doctoral degree receipt, 1971–2002. Sociol. Educ. 2007, 80, 23–42. [Google Scholar] [CrossRef]
  37. Buchmann, C.; DiPrete, T.A.; McDaniel, A. Gender inequalities in education. Annu. Rev. Sociol. 2008, 34, 319–337. [Google Scholar] [CrossRef]
  38. Vincent-Lancrin, S. The reversal of gender inequalities in higher education: An on-going trend. Higher Educ. Manag. Policy 2008, 20, 1–20. [Google Scholar]
  39. Global Education Monitoring Team. Gender Review: Meeting Our Commitments to Gender Equality in Education; UNESCO: Paris, France, 2018. [Google Scholar]
  40. Kricheli-Katz, T. Gender inequalities in markets. Annu. Rev. Law Soc. Sci. 2021, 17, 109–122. [Google Scholar] [CrossRef]
  41. England, P.; Budig, M.; Folbre, N. Wages of virtue: The relative pay of care work. Soc. Probl. 2002, 49, 455–473. [Google Scholar] [CrossRef]
  42. England, P.; Folbre, N. Gender and economic sociology. In The Handbook of Economic Sociology; Smelser, N.J., Swedberg, R., Eds.; Princeton University Press: Princeton, NJ, USA, 2010; p. 627. [Google Scholar]
  43. England, P. The gender revolution: Uneven and stalled. Gend. Soc. 2010, 24, 149–166. [Google Scholar] [CrossRef]
  44. England, P.; Levine, A.; Mishel, E. Progress toward gender equality in the United States has slowed or stalled. Proc. Natl. Acad. Sci. USA 2020, 117, 6990–6997. [Google Scholar] [CrossRef]
  45. Mandel, H. Up the down staircase: Women’s upward mobility and the wage penalty for occupational feminization, 1970–2007. Soc. Forces 2013, 91, 1183–1207. [Google Scholar] [CrossRef]
  46. Smith, R.A. Race, gender, and authority in the workplace: Theory and research. Annu. Rev. Sociol. 2002, 28, 509–542. [Google Scholar] [CrossRef]
  47. Charles, M.; Grusky, D.B. Occupational Ghettos: The Worldwide Segregation of Women and Men; Stanford University Press: Stanford, CA, USA, 2005; Volume 200. [Google Scholar]
  48. Saadat, Z.; Alam, S.; Rehman, M. Review of factors affecting gender disparity in higher education. Cogent Soc. Sci. 2022, 8, 2076794. [Google Scholar] [CrossRef]
  49. Terriquez, V.; Gurantz, O. Financial challenges in emerging adulthood and students’ decisions to stop out of college. Emerg. Adulthood 2015, 3, 204–214. [Google Scholar] [CrossRef]
  50. Britt, S.L.; Ammerman, D.A.; Barrett, S.F.; Jones, S. Student loans, financial stress, and college student retention. J. Stud. Financ. Aid 2017, 47, 3. [Google Scholar] [CrossRef]
  51. Naz, S.; Shah, S.A.; Qayum, A. Gender differences in motivation and academic achievement: A study of the university students of KP, Pakistan. Glob. Reg. Rev. 2020, V, 67–75. [Google Scholar] [CrossRef]
  52. Benner, A.D.; Boyle, A.E.; Sadler, S. Parental involvement and adolescents’ educational success: The roles of prior achievement and socioeconomic status. J. Youth Adolesc. 2016, 45, 1053–1064. [Google Scholar] [CrossRef]
  53. Alcaine, J.G. Factors Affecting Institutional Performance at High and Very High Research Universities: Policy Implications. Ph.D. Thesis, University of Southern California, Los Angeles, CA, USA, 2016. [Google Scholar]
  54. Ghazvini, S.D.; Khajehpour, M.; Cunningham, M. Gender differences in factors affecting academic performance of high school students. Procedia Soc. Behav. Sci. 2011, 15, 1040–1045. [Google Scholar] [CrossRef]
  55. Lim, H.; Kim, S.; Chung, K.-M.; Lee, K.; Kim, T.; Heo, J. Is college students’ trajectory associated with academic performance? Comput. Educ. 2022, 178, 104397. [Google Scholar] [CrossRef]
Figure 1. Education variables from variable 1 to variable 6 in Italian regions by gender (year 2021).
Figure 1. Education variables from variable 1 to variable 6 in Italian regions by gender (year 2021).
Sustainability 17 06243 g001
Figure 2. Education variables (from variable 7 to variable 10) and MPI in Italian regions by gender (year 2021).
Figure 2. Education variables (from variable 7 to variable 10) and MPI in Italian regions by gender (year 2021).
Sustainability 17 06243 g002
Figure 3. Clusters sorted on the average of the aggregate MPI.
Figure 3. Clusters sorted on the average of the aggregate MPI.
Sustainability 17 06243 g003
Table 1. Variables, descriptions, units of measure, statistical distributions, and sources (year 2021).
Table 1. Variables, descriptions, units of measure, statistical distributions, and sources (year 2021).
VariableDescriptionUnits of MeasureStatistical DistributionsSources
1People with at least a high school diploma (25–64 years)Percentage valuesPercentage of population aged 25–64ISTAT—Labour Force Survey
2Transition to universityCohort-specific rateRate of high school graduates entering universityMinistry of Education; Ministry of University and Research
3Not leaving education and training earlyPercentage valuesPercentage of 18- to 24-year-olds not in education/trainingRe-elaborations on ISTAT data—Labour Force Survey
4Young people in employment and education (NEET)Percentage values% of 15- to 29-year-olds not in employment, education, or trainingRe-elaborations on ISTAT data—Labour Force Survey
5Participation in continuing educationPercentage values% of adults aged 25–64 in formal/non-formal educationISTAT—Labour Force Survey
6People who obtain a tertiary STEM qualification in the yearPer 1000 residents aged 20–29Absolute rate per 1000 of the population 20–29ISTAT—Ministry of University and Research data
7Cultural participation outside the homePercentage values% of people participating in cultural activitiesISTAT—Survey Aspects of Daily Life
8Reading books and newspapersPercentage values% of people who read at least one book or newspaperISTAT—Survey Aspects of Daily Life
9At least basic digital skillsPercentage values% of population with basic ICT skillsISTAT—Survey Aspects of Daily Life
10Graduates and other tertiary qualifications (25–34 years)Percentage values% of population aged 25–34 with a degreeISTAT—Survey Aspects of Daily Life
Table 2. Total MPI and cluster.
Table 2. Total MPI and cluster.
RegionClusterAggregate MPI
AbruzzoHigh-performing100.79
BasilicataLow-performing95.92
CalabriaLow-performing89.42
CampaniaLow-performing88.02
Emilia-RomagnaHigh-performing106.16
Friuli-Venezia GiuliaHigh-performing107.00
LazioHigh-performing106.42
LiguriaHigh-performing103.24
LombardiaHigh-performing102.84
MarcheHigh-performing102.32
MoliseLow-performing97.60
PiemonteHigh-performing102.31
PugliaLow-performing89.69
SardegnaIntermediate-performing96.90
SiciliaLow-performing86.07
ToscanaHigh-performing102.90
Trentino-Alto AdigeIntermediate-performing101.03
UmbriaHigh-performing104.04
Valle d’AostaIntermediate-performing100.64
VenetoHigh-performing103.97
Table 3. Composite indicator values by gender: MPI, geometric and arithmetic means, min-max normalization, and PCA scores.
Table 3. Composite indicator values by gender: MPI, geometric and arithmetic means, min-max normalization, and PCA scores.
RegionMPI FemaleMPI MaleGeometric Mean FemaleGeometric Mean MaleMin-Max FemaleMin-Max Male Arithmetic Mean FemaleArithmetic Mean MalePCA FemalePCA Male
Abruzzo97.7694.0829.1728.460.630.4942.5238.650.35−1.10
Basilicata90.1188.7326.0825.820.270.2337.6935.91−1.16−2.08
Calabria88.9387.1126.8125.610.210.1036.2334.60−1.68−2.49
Campania98.4896.7131.4429.460.570.5343.2740.550.30−0.38
Emilia-Romagna90.6488.7527.3226.540.160.0238.1536.12−1.30−2.27
Friuli-Venezia Giulia86.7285.4325.3925.240.180.0535.6234.16−1.87−2.80
Lazio100.3693.4432.8829.130.380.0043.5038.840.31−1.42
Liguria102.2099.8532.8932.500.940.7245.7343.251.020.01
Lombardia103.9697.3335.4031.580.720.3046.3241.611.02−0.62
Marche102.9498.6434.0230.870.840.6845.6842.431.10−0.37
Molise107.19105.1237.1036.080.930.6247.6745.611.700.57
Piemonte108.28105.7237.8935.910.930.7848.5945.971.880.69
Puglia108.35104.4938.1536.200.970.8048.1344.891.720.53
Sardegna104.37102.1135.1634.360.880.7046.2944.101.080.12
Sicilia103.66102.0334.8134.590.780.4746.0944.071.010.01
Toscana104.42100.2334.4232.660.770.5746.1443.281.39−0.20
Trentino-Alto Adige102.54102.0734.7134.340.690.4945.0043.790.640.03
Umbria104.12101.6735.3433.950.810.4645.8543.601.040.04
Valle d’Aosta105.04103.0435.9634.321.000.7647.1244.191.390.30
Veneto104.34103.6034.8435.300.740.5546.3344.821.170.30
Table 4. Average values of composite indicators within each cluster, by gender.
Table 4. Average values of composite indicators within each cluster, by gender.
ClusterMPI FemaleMPI MaleGeometric Mean FemaleGeometric Mean MaleMin-Max FemaleMin-Max Male Arithmetic Mean FemaleArithmetic Mean MalePCA FemalePCA Male
Low-performing92.1190.1327.7026.860.340.2438.9136.67−0.89−1.85
Intermediate-performing102.1796.8733.7231.070.680.3445.1841.230.79−0.68
High-performing105.02102.6135.6734.420.850.6346.6344.251.280.18
Table 5. Spearman correlation matrix among composite indicators for the female population.
Table 5. Spearman correlation matrix among composite indicators for the female population.
Geometric Mean FemaleMin-Max FemaleMPI FemaleArithmetic Mean FemalePCA Female
Geometric mean female1.0000
Min-max female0.82061.0000
MPI female0.95040.84471.0000
Arithmetic mean female0.96090.83940.97291.0000
PCA female0.89770.87700.96840.94741.0000
Table 6. Spearman correlation matrix among composite indicators for the male population.
Table 6. Spearman correlation matrix among composite indicators for the male population.
Geometric Mean MaleMin-Max MaleMPI MaleArithmetic Mean MalePCA Male
Geometric mean male1.0000
Min-max male0.72481.0000
MPI male0.97740.79251.0000
Arithmetic mean male0.98200.77740.99701.0000
PCA male0.96240.80450.99100.98801.0000
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

De Anna, M.; Ivaldi, E. Multidimensional Educational Inequality in Italy: A Stacking-Based Approach for Gender and Territorial Analysis. Sustainability 2025, 17, 6243. https://doi.org/10.3390/su17146243

AMA Style

De Anna M, Ivaldi E. Multidimensional Educational Inequality in Italy: A Stacking-Based Approach for Gender and Territorial Analysis. Sustainability. 2025; 17(14):6243. https://doi.org/10.3390/su17146243

Chicago/Turabian Style

De Anna, Martina, and Enrico Ivaldi. 2025. "Multidimensional Educational Inequality in Italy: A Stacking-Based Approach for Gender and Territorial Analysis" Sustainability 17, no. 14: 6243. https://doi.org/10.3390/su17146243

APA Style

De Anna, M., & Ivaldi, E. (2025). Multidimensional Educational Inequality in Italy: A Stacking-Based Approach for Gender and Territorial Analysis. Sustainability, 17(14), 6243. https://doi.org/10.3390/su17146243

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop