1. Introduction
The Member States of the European Union have intensified their efforts to enhance economic performance by launching a series of reforms and policy initiatives aimed at improving public sector efficiency and fostering innovation [
1]. In the context of accelerated social and digital transformations, the European Union is under increasing pressure to adapt and respond to multiple challenges, including institutional inefficiency, disruptive digitalization, declining social well-being, and public health concerns [
2].
This paper highlights the importance of developing accurate performance measurement indicators and strong empirical evidence to refine policies and strategies at the level of European Union countries and regions. The complexity of these challenges, amplified by the global COVID-19 pandemic and economic uncertainties, requires advanced methodological approaches to capture the multidimensional nature of performance.
While various qualitative and bibliometric approaches have supported the strategic framing of ESG policies in the past, this paper takes a distinct empirical direction by relying primarily on structured regional-level panel data and applying econometric and clustering techniques to identify performance patterns.
This research highlights the potential of comprehensive assessments and the application of innovative econometric models, such as Partial Least Squares (PLS), to assess the efficiency and sustainability of regional development within the European Union. However, the systematic use of such methods at the regional level remains limited, and their capacity to inform policy interventions requires further development.
Therefore, the present study proposes a data-driven, quantitative analysis of ESG performance using a multidimensional methodological framework, focusing on observable indicators at the NUTS-2 regional level across the EU. This paper aims to bridge these gaps by using a holistic approach to assess economic performance and sustainability in European Union countries. Specifically, this includes a combination of PLS regression (to identify latent relationships among ESG indicators), LOESS-based GDP deflation (to normalize economic outputs over time), and hierarchical cluster analysis (to classify EU regions based on performance similarities). The data used in this research consists of over 10 years of structured panel datasets (2010–2023), covering 27 EU countries, with indicators selected from Eurostat, the World Bank, and the World Health Organization (WHO).
This approach allows us to systematically explore and identify performance trends, contributing to the literature with new methodological perspectives and empirical evidence on the performance of European Union countries and regions.
Further, this paper addresses a research gap in the literature by applying ESG analysis at the subnational (regional) level. To date, ESG analysis has been studied at the firm and national levels, but few studies have examined regional ESG performance among EU Member States [
3,
4,
5]. These studies highlight the significance of institutional quality, social capital, and spatial development as determinants of regional sustainability performance.
Additionally, our proposed approach builds on composite indices such as the OECD Regional Well-Being Index [
6], which, while widely used for descriptive purposes, also relies on additive aggregation and equal-weighting schemes. However, such methods have not sufficiently addressed issues of non-linearity and multicollinearity or considered temporal differences and trends. In contrast, LOESS deflation for GDP and PLS regressions used in this paper allow for the dynamic normalization of economic indicators and multidimensional sustainability outcomes marked with enhanced statistical rigor.
The general research question guiding this study is: “How can the performance of European Union countries on various dimensions, including governance, social and economic aspects, be accurately measured and analyzed using a new methodological framework?”. Using an innovative methodological approach encompassing PLS econometric modelling and cluster analysis, this study aims to develop a global understanding of performance dynamics in EU regions. This research explores (i) which ESG pillars contribute most to regional convergence or divergence across the EU, (ii) how ESG performance has evolved over time in response to external shocks such as the COVID-19 pandemic, and (iii) what structural characteristics define clusters of countries that follow similar ESG trajectories.
In line with these questions, we formulate and test the following hypotheses:
H1. Governance is the strongest driver of composite ESG performance in the EU Member States.
H2. Northern and Western European countries exhibit greater ESG convergence due to institutional robustness and policy alignment.
H3. The environmental pillar shows the greatest spatial divergence due to unequal enforcement capacity and ecological exposure.
These hypotheses are tested using a multidimensional methodological framework that integrates PLS regression, LOESS-based GDP-deflation, and hierarchical cluster analysis. This methodology allows for a robust investigation of the key drivers, trends, and disparities in regional ESG sustainability performance across the EU.
Over the past two decades, ESG considerations have become integral to assessing sustainability performance across both public and private sectors. Rather than offering a general definition, this study focuses on how ESG metrics can be operationalized to capture regional disparities, track long-term progress, and inform investment and policy decisions at the EU level [
7].
Despite the expansion of ESG frameworks, one persistent challenge is the inconsistency and divergence of ESG scoring systems provided by various rating agencies. Dumrose’s paper analyzes the European Union’s taxonomy in the context of ESG scores. Firm-level ESG scores tend to differ between ESG data providers, influencing investment decisions due to uncertainty about a firm’s sustainability performance. The European Union taxonomy can support the reduction of this divergence [
8].
The results highlight a positive and significant relationship between E ratings and firm-level taxonomy performance. Thus, investment and political decision-makers can be established, i.e., investors could anticipate the potential implications of the taxonomy on their investments, such as changes in the cost of capital or capital reallocations by other investors [
8]. Policymakers can build on our findings to develop effective regulations for sustainable finance and improve quality.
Several empirical studies have investigated the correlation between ESG scores and firm profitability. For example, ref. [
9] conducted a study on the European tourism sector and identified a statistically significant negative correlation between ESG scores and return on assets. This research specifically aimed to investigate the existence of a significant relationship between ESG scores and firms’ profitability, defined by return on assets, and whether there is a positive, negative, or neutral relationship between them.
Based on the regression model applied, the results highlighted a negative relationship between ESG scores and firm performance, as described by return on assets, which is statistically significant for a materiality threshold of 5%. They suggest that firms must implement effective and appropriate practices that take into account ESG score information. Moreover, companies should promote the long-term benefits they derive from applying these practices and comply with their standards.
In contrast, ref. [
10] applied machine learning models to assess ESG scores within EuroStoxx-600 companies, finding a positive relationship between ESG performance and EBIT, especially post-2018, following EU sustainability reforms. He investigated the relationship between ESG scores and earnings before interest and taxes (EBIT) and assessed the extent to which ESG scores significantly influence companies’ profitability using machine learning methods. After the European Commission implemented a sustainable growth program in 2018, ESG scores began to rise. Thus, it was of interest to determine what happened in the two years of analysis.
The machine learning algorithms utilized indicated a score greater than 75 for the data from 2020, which is indicative of an increase in EBIT value when ESG scores are increased. A similar behavior was observed within regions where ESG scores for the entire dataset were in the range of 50–75. Therefore, to have any increase in EBIT value, it suggests that the firm will need to have strong, sustainable investments. Indeed, ESG scores have been shown to impact the profitability of firms, as represented by companies’ EBIT. In contrast, for a low ESG score, we can assume that the firm will be less focused on the objective of sustainability and the sustained use of sustainability. It does not add an additional profit margin. These complex or contradictory findings speak to the contextuality of the correlation found in ESG financial performance, but the implications of sector and geographic conditions need to be differentiated and applied in ESG modeling. These studies increase our ability to understand the implications of ESG impact at the micro level but lack application to regional policy and development plans. The transition from a firm-level to a regional level will need to rely on stronger, more sophisticated frameworks to reflect multivariate, spatially variable behavior more clearly. Further, ESG conditions have also been shown to influence systemic financial stability. Research by [
11,
12] shows that institutions with strong ESG performance are less likely to experience financial distress, especially during constraints like those attached to the global financial crisis and the recent COVID-19 pandemic. Additionally, environmental performance appears to positively influence economic profitability and market performance; the governance pillar is observed to negatively affect financial and operational performance [
11]. In a complementary study, ref. [
12] tested five hypotheses connecting ESG quality to banking stability and found that social and fairness can help mitigate financial risk, especially during constraints like those during the COVID-19 pandemic. These studies demonstrate how ESG can be used as a predictive and diagnostic measure of institutional resilience. By extension, for example, at the regional level, ESG can act as a predictive early warning system for vulnerabilities in public administration, health services, and social infrastructure. Ref. [
12]’s conclusions can have clearer implications for EU cohesion policy and perhaps ease the measuring of regional convergence and resilience. Furthermore, by applying ESG elements to fiscal and social risk indicators and spatial risk models using appropriate econometric methods (many of which are well suited to ESG, as in the case of PLS regression, as proven by authors like [
13,
14], researchers can quantify these variables in a robust manner, thereby enhancing the evaluative capacity of regional planning tools and aligning social sustainability performance measurement with long-term EU objectives for development and resilience. While conventional econometric models often dominate this type of research, PLS and similar approaches have recently emerged as viable techniques to organize complex high-dimensional data, mitigated by low sample sizes and inherent collinearity, which is often present in ESG material and data. PLS, as undertaken by [
13,
14], uses latent variables to help researchers feasibly analyze and work with data according to the two stated recognizable characteristics of ESG data. The relationship between corporate strategy, ESG performance, and bankruptcy risk in U.S. firms was studied from 2016 to 2020 using Partial Least Squares Structural Equation Modeling (PLS-SEM) [
14]. The authors find that successful cost management strategies are positively associated with ESG scores and lower probabilities of financial distress. Furthermore, ESG and financial performance acted as mediating variables between corporate strategy and bankruptcy likelihood, reinforcing the importance of long-term corporate strategizing. The authors also provide evidence that financial performance plays an important role in mitigating the risk of insolvency. The authors argue that firms and new policies should prioritize the continuous evaluation of ESG and need supportive regulatory environments that support this approach [
14]. Relatedly, ref. [
13] captures how ESG factors affect firm-level risks in Europe. The authors distinguish between systematic, idiosyncratic, and total risks and use regression and causality analyses to account for the differential effects of each ESG pillar. The effects of ESG performance are statistically significant, reflecting that all three types of risks are lower as ESG scores are higher, with the social aspect having the greatest protective effect against risk. Idiosyncratic risk was reduced with better environmental performance; however, total and systematic risks were higher for green industries. Corporate governance was not directly related to firm value; however, a bidirectional relationship existed between corporate governance and systematic risk. All the findings support the idea that ESG practices could be better incorporated by businesses and would contribute to decreasing risky activity and increasing resilience, driven strongly by socially oriented performance. However, despite the advantages described, PLS can be underutilized in regional studies. Most existing applications are oriented toward corporate purposes and do not use the full potential of the method in the field of public policy evaluation or regional diagnostics. In this paper, themethodology is applied with the mindset of public policy evaluation since the European Union engages in evidence-based policymaking and systemic evaluation under frameworks such as the Green Deal or NextGenerationEU. At the sovereign level, refs. [
15,
16] identify a negative relationship between a country’s ESG performance and its sovereign bond spreads across all components, especially governance and social components. The evidence suggests a credibility premium for sustainability-oriented countries in the international capital market. Although national governments serve as the focus of the studies noted above, the findings provide evidence to support the economic rationale for taking the next step into the subnational level. Whereas [
17] found both ESG disclosure and actual ESG performance to have positive impacts on the borrowing costs of non-financial firms in the EU, the distinction between ESG disclosure and actual ESG performance remains an important one, particularly at the regional scale, where the quality and transparency of ESG indicators vary considerably. The dilemma inherent in ESG ratings as lax rating standards allow those subject to ESG ratings to manipulate ratings, leading to calls for a more systematic approach to theory paradigms, recognizing that they do not distinguish between more conservative or more progressive rating systems, nor necessarily the complexities of the sustainability concept [
18]. This problem has been addressed directly in this study by introducing rigor into the methodology and normalizing indicators. Among the earliest attempts to examine sustainability at the macroeconomic level using ESG metrics, ref. [
15] examined the influence of government ESG performance on sovereign bond markets. Using a panel-level regression model, this study tests the hypothesis that higher ESG ratings are related to lower borrowing costs. They found that there was a relationship, but the effect of ESG ratings was much weaker—about a third of the effect of traditional financial ratings from S&P. Thus, while investors are increasingly using ESG factors as a source of value relevant to their risk assessment, ESG factors are not a substitute for financial metrics when assessing sovereign debt. Extending this study in [
10], measured the differential impact of environmental, social, and governance components on sovereign bond spreads. A panel regression model was constructed, and it was found that a 10% increase in the composite ESG was, in turn, correlated with a 15% reduction in sovereign spreads. The governance and social dimensions of ESG have statistically significant impacts on spreads, without any conclusive impact of the environmental dimension. In this way, although the findings of their analysis are limited by measurement concerns and data availability, the synthesis of ESG performance, especially governance, is important for understanding these variables to strategically assess sovereign risk and allocate capital. While these studies provide valuable insights, very few have applied ESG modeling at the subnational level, especially within the European Union context. The biggest issue with existing research is that it tends to suffer from fragmentation in methodology and/or limited scope, focusing primarily on the non-linear and multidimensional aspects of regional sustainability. Based on the EU’s aggressive objectives and aspirations outlined in the European Green Deal and the 2021–2027 Cohesion policy, there is a genuine need to have a more nuanced, data-driven, and methodical footprint. This study aims to respond to that need by applying PLS regression and cluster analysis on over 10 years of regional panel data, so that we can hopefully offer a replicable model for capturing the variation of ESG performance differences across EU regions that could be supportive for policy evaluation that aligns to the EU’s long-term objective of achieving resilient, sustainable, and inclusive territorial development. The remainder of the paper proceeds as follows:
Section 2 presents the materials and methods, including the data sources, selected indicators, and econometric techniques utilized.
Section 3 presents the empirical results regarding the patterns of ESG performance and their determinants across EU countries.
Section 4 discusses the results with respect to the existing literature and their implications for policy. Finally,
Section 5 outlines the key conclusions and possible implications for future research and sustainable regional development strategies.
2. Materials and Methods
This section highlights the methodological steps taken to build and conduct ESG performance scores across European Union Member States from 2010 to 2023. The methodological approach uses a series of statistical methods to address data quality issues, standardize measures, and create usable cross-country comparisons. After a preparatory step in data organization, treatment of missing data points and outliers, and smoothing and normalizing to account for typical economic differences, composite scores for each ESG pillar are developed using PLS regression, which is appropriate for research based on many correlated indicators. The ESG performance scores are then analyzed to measure changes in performance over time via annual growth rates calculated using the Compound Annual Growth Rate (CAGR). Cluster analysis is then used to group countries with identifiable similarities in ESG performance.
The analysis is based on a dataset developed around the three pillars of ESG performance: Environmental, Social, and Governance. The measures were selected from indicators provided by Eurostat, the World Bank, and the WHO for the years 2010 to 2023. These sources were selected because they offer internationally comparable and methodologically consistent datasets that are widely used in academic and policy research [
19,
20,
21]. In total, 178 measures were selected-50 measures corresponding to environmental performance outcomes, 102 measures reflecting indicators of social performance, and 26 measures related to governance performance. This equates to 2492 observations of the member countries of the European Union. The variable names and sources are listed in
Table A1,
Table A2 and
Table A3.
The selection of indicators followed a screening process to ensure conceptual coherence. First, a broad review of all ESG-related indicators available through Eurostat, the World Bank and WHO was conducted. Second, the indicators were retained based on three main criteria: (1) relevance to ESG performance as defined by established frameworks like the EU Taxonomy Regulation [
22], Sustainable Finance Disclosure Regulation [
23], and the World Bank ESG Data Portal [
24]; (2) sufficient data availability across EU Member States over the period 2010–2023; and (3) cross-country comparability through methodological consistency. Preference was given to outcome-based indicators rather than inputs or intentions so that the realized ESG performance would be better reflected.
A major limitation is data availability, especially for the dimension of governance at the sub-national level. Institutional arrangements, structures, and governance processes are sometimes not regularly reported, limiting research quality and resulting in far fewer indicators for that dimension. Regional-level data are also sparse and, therefore, noncomparable because of inconsistent reporting by Member States. To manage this discontinuity in indicators at the regional level, national-level data analysis was first performed, subsequently subjecting the indicator to a regionalization procedure based on GDP per capita, which provided an approximation of subnational ESG performance.
The final estimations of the indicators excluded the United Kingdom. While the time period includes times during which the UK was a member of the EU, we have omitted it to achieve greater consistency when interpreting trends and avoid inferred distortions to comparative scores.
Due to the significant disparity in economic size across EU countries, all values for GDP per capita were log-transformed to improve the statistical properties of the data and allow comparability across countries and years.
Missing data were handled using a multi-step imputation process. The first step was to assess each time series for missing data across all 178 indicators and the 27 EU Member States. If a missing observation occurred at the beginning or the end of a time series, the gap was filled using the last observation carried forward (LOCF) or first observation carried backward (FOCB), respectively. This approach ensured temporal continuity when the missing data were boundary values. However, when the missing values fell between two valid observations, the series was completed using linear interpolation, estimating the annual growth rate based on the two closest valid points and inferring the intermediate values. For years after 2021, in the absence of new data, we assumed that the last known growth rate would hold and extrapolated forward. In cases where gaps could not be filled by interpolation or extrapolation, the missing values were median-imputed. This ensured the preservation of the distributional characteristics. These procedures were uniformly applied across all indicators to maintain cross-country comparability and temporal consistency.
While more advanced methods, such as multiple imputation or model-based imputation, were considered, they were not appropriate given the high dimensionality and heterogeneous missingness patterns across countries and indicators. However, we acknowledge that simpler imputation techniques may introduce bias in some cases.
After imputation, all indicators underwent a winsorization procedure to manage outliers and reduce the influence of the extreme values. Values below the 5th percentile were replaced with the 5th percentile value, and values above the 95th percentile were capped at the 95th percentile. Winsorization was applied only after imputation and before normalization so that imputed distributions remained intact and extreme outliers did not skew the rescaling.
Next, normalization was performed using min-max scaling, which transformed the composite ESG performance to a 0–100 range. This step was essential for facilitating comparisons across countries and years. Normalization was applied after the GDP-adjusted scores were computed and after the PLS regression weighting was completed so that the scaling reflected only the final composite output.
Each indicator was smoothed with respect to GDP per capita (
x-axis) using LOESS (Locally Estimated Scatterplot Smoothing), a non-parametric method particularly useful for modeling complex, nonlinear relationships and for being robust to outliers [
25,
26].
In the LOESS framework, GDP per capita was used as the independent variable (
x-axis), while the indicator values were treated as the dependent variable (
y-axis). For a given point
x, a local subset of data is weighted using a tricubic kernel:
where
denotes the normalized distance of point
i from the focal value
x. A local polynomial regression model is then fitted using weighted least squares to minimize the following objective function:
where
are the observed indicator values,
are the GDP per capita values,
are the LOESS weights, and
,
, …,
are the coefficients of the polynomial. This process was applied individually to each of the 37 ESG-related indicators. The result is a smoothed expected value function for each indicator, conditional on GDP per capita, which allows subsequent analysis to isolate sustainability performance from the confounding effects of the economic scale.
To ensure that ESG scores accurately reflect an underlying sustainability performance metric rather than an economic capacity metric, each component indicator was regressed on GDP per capita, with GDP as the independent variable. The adjustment was made to better isolate the sustainability signal and account for the structural disadvantages faced by lower-income countries [
27] because, without a correction for GDP, larger economies may be seen to perform better solely because they have larger resources, potentially masking true differences. By controlling for GDP, the analysis allows for a fairer comparison across countries.
This study adopted a regression framework known as PLS, originally described by [
28] but formalized in the soft modeling literature by [
29] to construct composite ESG performance scores.
The matrix X contains the adjusted indicator values across countries, and Y represents the target variable—overall ESG performance or pillar-specific performance.
PLS identifies latent components in X that maximize covariance with Y. The weight vectors w and c are computed to maximize the correlation between the projected spaces Xw and Yc. Component scores t = Xw are extracted iteratively, and the PLS model is estimated using these scores to predict Y. The resulting regression coefficients serve as the weights for each indicator, quantifying their relative contribution to the composite score.
The optimal number of latent components in each PLS model was selected using 10-fold cross-validation, minimizing the Root Mean Square Error of Prediction (RMSEP). This approach avoids overfitting by retaining only the components that contribute meaningfully to the model performance and controlling for noise. The final number of components was selected where the RMSEP first reached a local minimum or leveled off, in line with the elbow rule, which is commonly used in dimensionality reduction.
Given the model’s robustness in reducing dimensionality while preserving the variance, three PLS models were implemented. One for each ESG pillar uses indicators as predictors and log GDP per capita as a control. The final performance scores for each pillar are computed as weighted averages of the indicator performances:
These raw scores are rescaled to a standardized 0–100 range using min–max normalization:
This procedure enables coherent cross-country comparisons of ESG performance, over time.
In the final stage of the analysis, the temporal evolution of ESG performance is evaluated using the CAGR, which is a standard metric for measuring the average annual growth of an indicator over a fixed time horizon. The CAGR for country
a is computed as:
where
denotes the ESG score in the most recent year of data availability (
t = 2023), and
represents the baseline value in the reference year (
= 2010). This metric provides a smoothed estimate of the average annual rate of change, mitigating the influence of short-term fluctuations and enabling meaningful comparisons across countries and indicators.
The estimated growth rates are classified into four qualitative growth categories, which reflect both the degree and direction of change. A growth rate above 1% per year is considered reflective of significant movement toward progress, thus supporting long-term sustainability targets. Growth rates of 0.5–1% are fair in terms of progress but demonstrate that continued improvement is needed. Growth rates on the order of −1% to 0.5% essentially demonstrate no or limited changes. Finally, all values below −1% are indicative of declining ESG performance. This classification system provides a recognizable utility for potentially tracking progression and determining whether policies should be adjusted. This classification typically retains the integrity of the benchmark metric and provides quantitative benefits.
Cluster analysis was used to group countries based on their ESG performance, which allows the identification of natural groupings in the data. The best number of clusters is identified with the elbow method that plots the within-cluster sum of squares (WSS) versus a range of values of
k. The optimal number of clusters is when the rate of decrease in WSS decreases dramatically, as in [
30,
31]. Next, the K-means algorithm, introduced by [
32], is used to partition countries into distinct clusters by minimizing the within-cluster variance. This approach is consistent with the clustering practices outlined by [
33,
34]. Lastly, to examine the structure of the data further, hierarchical clustering is used, using Ward’s minimum variance method [
35] to minimize within-cluster variance at all steps of the agglomerative process. This produces a dendrogram that illustrates the nested cluster structure and improves the interpretability. This method is based on [
36,
37], who posit that it is reliable for social and economic research. These techniques provide tactical insights useful for benchmarking and policy development at the European level.
4. Discussion
This study provides a comprehensive macroeconomic analysis of sustainability outcomes across EU regions, supporting the contextualization of ESG indicators against long-run economic resilience and structural convergence of economies. Employing PLS regression, this study identifies significant latent factors in regional differences in ESG indicators while circumventing methodological problems that have plagued previous research, including multicollinearity and overlap between indicators related to environmental performance and low sample size per regional dimension.
From a macroeconomic perspective, one of the most notable findings is the central role played by governance. Compared to environmental and social performance, the analysis shows that governance contributes more significantly to the variation in composite ESG scores, especially through indicators such as “Voice and Accountability” and “Control of Corruption”.
This result likely stems from greater institutional heterogeneity across EU Member States, especially between newer and older members, compared to the more convergent nature of environmental and social policy frameworks that are more closely aligned with EU directives [
4,
77]. While environmental and social standards may benefit from harmonization through EU regulations, governance remains deeply embedded in national political-administrative traditions, resulting in greater variability and higher explanatory power in the PLS models.
These results reinforce earlier findings on the role of institutional factors in shaping sustainable development trajectories, as seen in [
78], who emphasized the relevance of governance actions and social norms in fostering sustainability-oriented intentions at the enterprise level through PLS-SEM modeling.
This finding contrasts with firm-level ESG studies, in which environmental factors often dominate due to market expectations, reporting pressure, and reputational risks [
79,
80]. In our regional-level model, governance emerges as a systemic enabler of sustainable development, influencing not only financial and institutional resilience but also the efficacy of environmental and social policy implementations.
These results are also consistent with the existing literature presented in the previous sections, including the works of [
15,
16], which indicate a relationship between institutional quality and improved sovereign risk ratings along with lower sovereign bond spreads. Additionally, as noted by [
17], governance actors were shown to be paramount determinants of operational and financial performance in the banking industry, and in that case, a form of governance was useful in shaping sustainable finance. However, in this case, governance has broader implications—it not only enhances financial stability but also multiplies the impact of environmental and social policies across member regimes.
From a macroeconomic perspective, environmental performance reveals stark regional disparities closely linked to the varying levels of institutional development and economic capacity among EU Member States. Countries in Northern and Western Europe, such as Sweden and the Netherlands, continue to achieve solid results due to past investments in renewable energy and waste systems which are common characteristics of the circular economy. Countries in the Southern and Eastern regions, such as Bulgaria, Romania, and Greece, experienced challenges related to diminished fiscal capacity, fixed assets nearing the end of life, and enforcement challenges.
This regional divergence is in line with the findings of [
79], who highlighted the complex interplay between labor market conditions, productivity, and institutional structures in shaping macroeconomic outcomes through panel and cluster analyses across EU countries.
In addition, [
80] provides supporting evidence through a DEA-based efficiency analysis of urban development in European and global cities, reinforcing the importance of spatial disparities and institutional gaps—elements that directly align with our findings on governance bottlenecks and ESG heterogeneity across NUTS2 regions.
Nevertheless, consistent with [
13] ’s findings, the shift to environmental improvement is contingent on investment and institutional quality. Furthermore, utilizing GDP normalization and then LOESS smoothing allowed a more reflective view of environmental improvements by decoupling it from the size of the economy, removing an ingrained systematic bias of comparative ESG scoring.
The social dimension has experienced an even more uneven progression, as COVID-19 disrupted human capital development, access to services and demographic stability across nearly all Member States. The PLS analysis highlighted the roles of adult literacy, urbanization, and population growth, suggesting that social sustainability can be both a function of structural investments, such as infrastructure, and demographic vitality.
Our findings echo those of [
81], who demonstrated that informality and undeclared work—often concentrated in less developed EU regions—interact with social cohesion and inclusive growth, highlighting the multidimensional nature of social performance in ESG assessment.
These findings align with [
12], who showed that social engagement and quality of governance can help mitigate the risk of compounding crises in the banking industry. With respect to higher-income countries, such as Germany and the Netherlands, environmental and social sustainability performance improved compared to previous years, reflecting relative resilience. However, both countries also experienced a slowdown in the rate of improvement in their social ESG scores. Evidence and experience from this cross-time interval raise urgent policy challenge questions about the capacity of social and labor market safety net regimes to cope with compounding crises year after year.
This study reinforces calls in the literature for reforming social policy to be adaptive, inclusive, and digitally supported, especially under conditions of demographic aging and labor market change [
15]. With respect to wider macroeconomic convergence, this study shows that the ESG trajectory across and within the EU is incomplete and uneven. The CAGR illustrates how several mid-performing countries made progress in the early 2010s, only to stagnate or regress in subsequent years—often due to crisis-related shocks or persistent policy inconsistencies. This path-dependent progression mirrors the growing concerns highlighted in the cohesion literature regarding structural divergences and limited absorption capacity in the newer EU Member States.
In line with [
82], who stressed the need for integrated frameworks to interpret sustainability transitions across complex systems, this study leverages multi-level and multidimensional modeling to provide a scalable, replicable ESG framework aligned with broader digital and economic transformation agendas.
Our hierarchical clustering reveals that an “Emerging South-East” group—including Romania, Bulgaria, and Croatia—has led to recent improvements in ESG performance but continues to face structural governance and institutional bottlenecks. These insights suggest that the EU’s cohesion policy could benefit from differentiated, place-based strategies that prioritize governance capacity building in converging regions. This could include tailored funding through instruments such as REACT-EU, enhanced support under the Technical Support Instrument (TSI), or closer alignment with European Semester country-specific recommendations [
83].
It is important to note that ESG trends are fluid, and their evolution depends heavily on both political developments and economic disruptions. This necessitates adaptive strategy calibration and continuous learning mechanisms in policymaking. We emphasize that the regional institutional context is crucial and must be considered when translating ESG diagnostics into policy.
By design, our study also provides an open-source model for replicating an ESG performance framework matched with EU macro-regional planning instruments. By controlling for GDP in our PLS models and incorporating LOESS smoothing, we can disregard economic size while maintaining sensitivity to growth momentum. While using national-level proxies does come with limitations when considering local-level ESG indicators, we are constrained by data variability at this point in time. The process of putting multidimensional indicators into composite scores to produce interpretable forms also enhances the use of ESG in practice for regional administrators and in financial planning.
From a macro-policy perspective, we verify that robust governance is the anchor of sustainability and resilience. More than just a technical route to development, the advancement of ESG is a matter of commitment, coherence, a long-term strategic perspective, and political economy. We advocate for investments that are integrated and treat environmental, social, and governance systems as interdependent systems rather than separate sectors. Alongside an intensifying focus related to the EU Green Deal or Recovery and Resilience Facility, if this research can provide the timeliest empirical evidence for meaningful, strategic resource allocation and/or regional resilience planning vis-à-vis ESG, this research should be useful.
5. Conclusions
This article presents a replicable framework for assessing ESG performance in EU regions that is particularly useful for policymaking, as it addresses both the multidimensionality and economic scale biases of sustainability measures. By combining GDP normalization, LOESS smoothing, and PLS regression, the model reveals variations in ESG results at the subnational scale.
The analysis determined that the quality of governance, especially the dimensions of Voice and Accountability and Control of Corruption, measured subnationally, is the most powerful determinant of composite ESG performance. This finding is particularly relevant in the EU institutional context, where the effectiveness of mechanisms such as the Stability and Growth Pact, European Semester, and RRF conditionalities depends on the available quality of governance. The absence of institutional capacity reduces both the likelihood of ESG convergence and compliance with the EU fiscal rules.
To advance this research and strengthen the framework’s applicability, several areas merit further exploration:
Testing the framework’s resilience under external shocks, such as energy security disruptions resulting from the Ukraine crisis or climate-induced disasters.
Incorporating digital sustainability measures (e.g., inclusivity and diversity of access to technology and AI adoption in public service delivery) to reflect the growing role of technological governance in sustainable development.
Expanding the analysis of PLS causality and thresholds through more sophisticated machine learning and spatial econometric approaches.
Exploring links between ESG performance and EU-level monitoring instruments, including the Macroeconomic Imbalance Procedure and the EU Social Scoreboard.
We encourage EU Member States to consider place-based approaches, which will allow regional funding and cohesion policy to align with ESG objectives. Delivering capacity building for governance would hopefully receive support from the TSI, and conditional funding would be based on progress towards transparency. It may be possible to provide accountability by using governance indicators as criteria for green bonds or ESG-linked debt. Including ESG in the European Semester may, in a strict sense, refer to the set of actions a government undertakes in relation to reforms, but it will shape those actions to support such reforms. In regions where governance is weak, investors must downscale their due diligence and determine the risk premium, acknowledging that they have a limited governance system.
Overall, this study contributes to advancing evidence-based policymaking in the EU by providing a methodological tool for monitoring convergence, informing investment allocation, and supporting the strategic alignment of sustainability objectives with institutional reform.
From a practical perspective, these findings can serve the interests of different stakeholders in many ways. Policymakers may use the regional ESG diagnostic of performance to inform investment priorities in institutional capacity and governance interventions in regions that lag behind, which can then enhance the absorption capacity of EU funds and improve resilience in regions to future crises. Investors, such as sovereign bondholders and green finance actors, can then consider governance-adjusted ESG scores in their risk assessment frameworks, based on the assumption that more reliable “good governance” is indicative of the effective implementation of sustainability policies. Businesses may compare their own ESG strategies to regional performance baselines, which are aligned with the interests and priorities of the place, in preparation for regulatory frameworks that will be applied to shape their operations. Civil society organizations and other local stakeholders may also consider using ESG information and data to advocate for specific interventions that address chronic weaknesses in governance.