1. Introduction
In the globalized world of today, logistics systems’ productivity and resilience are essential drivers of competitiveness at the national level as well as of economic development and sustainability. The empirical organization of supply chains, developments in technology and global trade intensification have brought the performance of logistics to the forefront of both economic policy and corporate decision-making. In parallel to these developments has been the rise of the Environmental, Social, and Governance (ESG) paradigm as the leading framework used to evaluate sustainable economic performance, transcending conventional financial measurements to consider broader societal and environmental consequences [
1,
2].
Justification for the study. Despite the increasing importance of both the logistics performance perspective and the environmental, social, and governance (ESG) framework in the design of sustainable economic systems, the intersection between the two areas remains largely [
3,
4]. Currently, existing literature focuses predominantly on logistics performance as economic infrastructure, with most economic studies being carried out on a firm level, whereas, conversely, most existing studies on ESG frameworks focus mostly on a firm level paradigm, thereby largely ignoring their systemic dynamics within the country’s economic infrastructure. Currently, this systemic divide also leaves a huge knowledge gap, particularly with increasing recognition being accorded to the roles of logistics systems, as either facilitators or barriers, within environmental sustainability, as well as social well-being, and within governance dynamics [
3]. Knowledge within this intersection is also highly required, particularly since logistics infrastructure designs significantly impact issues such as energy consumption, carbon emissions, resource use efficiency, working conditions, inclusive supply chains, and transparency in governance [
4]. In addition, most global commitments, such as the United Nations Sustainable Development Goals, significantly depend on the assurance of sustainable logistics systems, so empirical studies within this field, particularly within the intersection of systemic dynamics within logistics infrastructure design, within ESG frameworks within different countries, remain largely uncharted.
In the midst of these twin evolutions, a recurring and relatively unexamined question sits at its core:
In contrast to the expanding real-world applicability of both ESG and logistics globally, academic work connecting the two is relatively rare. Most research on the Logistics Performance Index (LPI) targets economic metrics like trade levels, industrial competitiveness, and infrastructure quality [
4], whereas ESG scholarship is typically centered around firm-level sustainability, ethical investment practices, and policy at a high level [
5]. Consequently, our knowledge base is missing a systematic exploration of how logistics capabilities impact environmental sustainability, social fairness, and governance quality at the country level. That is a stark deficiency, given how essential sustainable logistics has become to attainment of the United Nations Sustainable Development Goals (SDGs) [
1]. This study has as its objective bridging that gap through a data-driven examination of how disaggregated ESG indicator variables correlate with logistics performance. In contrast to research using composite ESG indices, however, the research takes a disaggregated framework and looks at how infrastructure and efficiency in operations independently impact environmental (E), social (S), and governance (G) dimensions [
2]. The research question is simple but fundamental:
This research advances the frontier of merging sustainable development and logistics research by offering practical lessons for both governments and MNCs. The key strength of this research is that it employs multiple methods, combining econometric analysis with ML. Endogeneity in this research has been controlled using instrumental-variable panel-data regression techniques, namely 2SLS and G2SLS, on a balanced panel of 163 countries from 2007 to 2023. This tackles endogeneity by improving the accuracy of results by mitigating the challenges posed by unobserved variables and reverse causality. To complement the econometric model, this research applies both supervised machine learning algorithms (Random Forests, k-Nearest Neighbors, Support Vector Machines) and unsupervised clustering algorithms (Density-Based, Fuzzy C-Means, Hierarchical, Model-Based, Neighborhood Clusters). This dual modeling approach not only provides robustness testing rigor but also identifies nonlinear behaviors and hidden patterns that are not accounted for or apparent in traditional statistical modeling. The ever-deepening integration of ML applications in the sustainability literature has led to greater predictive precision and the detection of patterns in high-dimensional data [
6,
7]. The dual-methodology approach thus strengthens internal validity and enhances generalizability, in line with prevailing research perspectives that intertwine ML and econometric analysis in the realm of ESG studies [
8]. An important aspect of the analysis involves using ESG factors decomposed across the environmental, social, and governance pillars, rather than considering overall ESG scores, to identify the individual relationships of each factor with the Logistics Performance Index (LPI). Environmental factors are measured based on pressures exerted by emissions, air pollution, and land use; social factors are measured based on factors such as education access, service delivery, income levels, and child labor; and governance factors are measured based on the rule of law, regulatory quality, and innovation. This type of analysis has not been carried out in the literature with the depth and rigor reported here [
6,
7]. The conclusive evidence indicates that the ESG and logistics nexus involves various aspects of LPI improvements, including significant positive outcomes and challenges in environmental sustainability, tangible social impacts, applicable risks, and significant improvements in governance. However, effective logistics may exacerbate environmental or social inequalities in the absence of strengthened regulatory protection. Descriptive observations from all the collective data confirm the imperative of congruent policies that align with the evolution of logistics and the universal principles of ESG goals.
Study purpose. This research aims to develop a paradigm that establishes empirical links between logistics performance metrics and the Environmental, Social, and Governance (ESG) aspects of Sustainable Development. Recent studies indicate that national logistics performance closely correlates with both social and environmental aspects [
3]. Although the importance of the world’s logistics capabilities as basic determinants of economic competitiveness, trade efficiency, and development is well-appreciated, their relationship with Sustainable Development remains uncharted. There still remains a chasm in knowledge regarding the independent and cumulative roles of different logistics capabilities, particularly with regard to their effects on ESG metrics, given recent debates stratified by their effects on Sustainable Development. Studies on the effects of logistics performance on sustainability suggest that the G20 nations’ sustainability levels remain largely driven by their logistics performance, underscoring the importance of joint policy formulation [
9]. This assumption’s importance lies in its effort to bridge this chasm by exploring determinants of ESG, along with the interaction between logistical efficiency and ESG dimensions, as defined in the existing literature. Empirical studies suggest that ESG practices in logistics, environmental compliance, social responsibility, and corporate governance exert independent as well as cumulative effects at the firm and macroeconomic levels [
4]. Similarly, improvements in ESG capabilities, particularly as applicable to Small, Medium, and Large Enterprises, remain identified as an effective approach for sustainable and human-oriented practices [
2]. However, the use of digital technologies, along with Industry 4.0 technologies, in logistics remains effective in integrating urban and corporate logistics strategies with ESG dimensions [
10]. Therefore, within this background, this proposal’s subsequent analysis shall conduct a rigorous, sequential, bi-variate analysis of the Logistics Performance Index’s variables, prepared through thorough verification against detailed ESG variables, within the realm of its governance units spanning 2007–2023, totaling 163 governance units. Specifically, the analysis will focus on: (1) the correspondence between improved logistics capabilities and environmental stress, as opposed to environmental efficiency (e.g., customs clearance and lead-time reliability); (2) the social determinants as antecedents of logistics capabilities, shedding light on education, working conditions, demographics, and accessibility issues (e.g., ease of arranging shipments and customer satisfaction); (3) the governance quality, with a focus on its constitutive aspects: enabling institutions, regulatory systems, scientific productivity, as well as enabling governance structures (e.g., tracking and tracing capabilities and supply chain transparency). Using instrumental variables panel regression analyses and sophisticated machine learning models, this analysis will examine the two-way relationships between logistics capabilities and sustainable development.
Study Purpose and Research Themes. This proposed research aims to develop a theoretical framework that explains the relationship between Logistics Capabilities Metrics and Environmental, Social, and Governance (ESG) factors within Sustainable Development. The emerging literature shows a strong correlation between national logistics capabilities metrics and social or environmental issues [
3]. Although the correlation between global logistics capabilities metrics and Sustainable Development remains uncharted, the general implication is that logistics capabilities significantly affect economic competitiveness, trade, and Sustainable Development. However, there seems to be limited insight into the independent or cumulative aspects of such logistics capabilities, particularly as a Sustainable Development factor. With regard to the literature established in the precedent of existing literature on logistics capabilities metrics/sustainability, more recent literature asserts that the Sustainability path within G20 nations is largely dependent on their capabilities, thus establishing that policy as collectively decisive [
9]. By implication, that this literature fills a much-needed aspect, within existing Sustainable literature, that might explore this correlation between determinants of ESG factors, through measures of Logistics Efficiency established within pertinent literature, as emergent empirical notions demonstrate that ESG Logistics, defined through necessary environmental, social, or governance protocols, remain collectively independent, with negative, positive, notions within the macroeconomic paradigm [
4]. Improving ESG Capabilities, particularly within the realm of small-to-large-scale businesses, demonstrates a push towards more Sustainable, People-centric paradigms [
2]. Simultaneously, technology such as Industry 4.0 remains effective within the strategic paradigm of transforming urban, corporate, or logistics infrastructure, as defined through the resultant ESG paradigm [
10]. With this background, the proposed analysis will conduct a meticulous, sequential, multivariate analysis of Logistics Performance Index variables, cross-checked against more refined ESG variables, focusing on a dataset comprising 163 governance units for the years 2007–2023. To increase the validity of the results, posterior predictive checks and robustness analyses will be used. This will allow for issues of model specification parsimony and sensitivity issues that might qualify the conclusions. More specifically, this analysis will focus on the interaction between improved logistics capabilities under environmental stress and environmental efficiency, as captured by variables such as customs simplicity, customs clearance, and lead time reliability. Social-influence variables, such as education, working conditions, demographics, accessibility, ease of arranging shipments, and customer satisfaction, will be considered antecedents of logistics capabilities. More specifically, the knowledge generated by this analysis of education and working conditions might serve as a basis for formulating personnel management policies through train-and-develop programs or by establishing benchmark standards for laboratory practices, thus further reinforcing the social component of ESG. Governance quality will be analyzed through its constituent parts, including governance structures, governance frameworks, scientific productivity, and facilitative governance frameworks, which comprise tracking and tracing capabilities and transparency. By combining instrumental-variable panel regression with more sophisticated machine-learning analytics, this analysis will explore the two-way interaction between logistics capabilities and sustainable development.
Study Hypotheses. With the aforementioned research questions as the background, this study formulates three inclusive hypotheses that provide direction for the analysis, thereby aligning the conceptual framework with the methodology. These hypotheses assume that the correlation between logistics performance metrics and sustainability outcomes is complex, interacting with environmental, social, and governance factors within the ESG framework [
11].
H1. Logistics performance shows a systematic relationship with mixed environmental effects, reflecting trade-offs between development and the environment. This hypothesis argues that improvements in logistics infrastructure can minimize resource use and some types of pollutants, but simultaneously increase other pollutants, such as GHG emissions. The existing literature suggests that ESG innovations focused on logistics and transportation can improve environmental efficiency while addressing new environmental pressures, such as increased energy use and GHG emissions [1,12]. Using disaggregated measures of environmental effects, such as air, GHG emissions, heat stress, and land use, this research will examine the impact of environmental stresses and efficiencies as forces behind changes in the Logistics Performance Index [11]. H2. Socio-economic variables significantly and diversely affect logistics performance. This hypothesis assesses the influence of education, basic service accessibility, demographics, working conditions, and income distribution on logistics performance. Evidence confirms that socio-economic variables, such as employee education, fair working conditions, and service accessibility, affect logistics efficiency [1]. Social determinants, such as education, access to basic services, demographics, working conditions, and income distribution, create inequality in human capital, working conditions, or both, affecting the efficiency of global logistics. H3. Improving governance quality promotes a positive outcome on logistics performance. This hypothesis assumes that a high-quality institution, with attributes of proper regulation, the rule of law, efficient administration, and scientific strength, fosters a supportive environment that facilitates the establishment of a sound, modern, and trustworthy logistics infrastructure. Empirical evidence shows that sound governance principles or regulations can enhance ESG practices and sustainable development across nations [12,13]. These hypotheses collectively form the focal point of this analysis, through which the rest of this report will explore the relationships that exist between logistical performance and the environmental, social, and governance aspects of sustainable development. The research is organized as follows.
Section 2 reviews the existing literature, identifying the main conceptual frameworks and empirical findings to date.
Section 3 presents the data sources, sample characteristics, and the econometric and machine learning methodologies employed.
Section 4,
Section 5 and
Section 6 are dedicated, respectively, to the analysis of the relationships between LPI and the Environmental, Social, and Governance components, detailing both the regression-based and clustering-based results.
Section 7 concludes with a discussion of policy implications, limitations, and directions for future research. Furthermore,
Appendix A presents the hyperparameter settings of the regression algorithms,
Appendix B presents the hyperparameter settings of the clustering algorithms,
Appendix C presents the summary statistics of the environmental (E) indicators,
Appendix D presents the summary statistics of the social (S) indicators, and
Appendix E presents the summary statistics of the governance (G) indicators.
2. Literature Review
The existing literature presents informed but incomplete insights into the interrelation between ESG outcomes and logistic performance tending to lack the level of systemic integration and granularity desired by this study. The research by [
4,
5] has as its main objective assessing the financial impact of adopting ESG in the case of logistic firms but does not reveal its investigation to wider systemic interactions unfolding from country-wide metrics such as the Logistics Performance Index (LPI). While suggesting that the impact of ESG schemes is mediated by logistic performance and economic results, ref. [
14] does fail to differentiate the ESG pillars and does not treat direct causality, a concern treated by this research. The issue of ESG challenges and opportunities in the post-COVID-19 context is broached by [
2,
15], albeit in a way failing to integrate results systematically to transportation efficiency metrics such as the LPI. In the same spirit, research by [
1,
16] analyzes ESG’s impact on competitiveness and on stock performance but falls short of considering logistic infrastructure as country-wide driver of sustainability. Refs. [
10,
17] deal with smart and digitalized logistic as ESG enablers and participate in thematic add-ons short of adopting serious quantitative research practices like in the research presented here. The effect on firm performance of green logistic action is demonstrated by [
18,
19,
20], the latter focused on the dimension of ESG transparency but both are subject to micro perspectives. The use of technology is analyzed by [
21,
22], and [
23] but short of structural embedding of country-wide logistic performance in ESG effect. The research by [
24] generalizes ESG discourse to maritime and seaport logistic industries but fails to systematically analyze environmental, social, and governance dimensions separately vis-à-vis the LPI as it does in this study.
Refs. [
25,
26] acknowledge transport and logistic firms to be influenced by ESG but reduce ESG to aggregate scores and fail to identify pillar-specific effects as identified here. Refs. [
27,
28] discuss communication and perception dimensions of ESG in the logistic sector but fail to attain econometric robustness. Refs. [
29,
30] discuss impact of ESG on supply chains but by a generalized application by qualitative methods and non-dynamic panel data methods or by using machine learning algorithms. Refs. [
31,
32] include governance variables like board diversity but fail to capture how the impact of logistic infrastructure performance on ESG is systematically captured. Refs. [
33,
34,
35], and associate ESG and operation efficiency and productivity in the supply chain but to firm-specific or industry-specific studies and to system levels in countries by using LPI. Ref. [
36] associate climate policy uncertainty and logistic stock returns and ESG scores but fail to include pillar disaggregation. Refs. [
37,
38] document sustainable optimization of the logistic industry but fail to document how optimization practices are associated with larger ESG systems in countries. Ref. [
39] calculate competitiveness on efficiency of the logistic sector but their work does not systematically rule out environmental and social spillovers identified here. Ref. [
40] discuss digitization and benefits to ESG and [
41] discuss sustainable infrastructure but both fail to utilize instrumental variable panel data methods or machine learning regressions.
Research by [
42,
43] focuses on sustainability and governance in logistics companies but lacks generalizability at a country level. Refs. [
44,
45] design ESG assessment models but work primarily at conceptual or firm levels and lack the cross-country and long-dimensioned data included in this study. Refs. [
46,
47] connect ESG to credit risk at the firm level but do not conceptualize the firm as a fundamental unit of analysis as they do so. Refs. [
48,
49] acknowledge the role supply chain digitalization plays in improving ESG but do not systematically tie it to LPI measurements. Refs. [
50,
51] emphasize the predictive ability of sustainability initiatives and ESG outcomes but fail to discuss drivers exclusive to the logistics sector at the country level. Refs. [
52,
53] equate ESG with efficiency at the terminals and ports and get close to LPI issues but keep to a sectorial scope. Refs. [
54,
55] discuss procurement benefits and circular economy models but fail to consider logistics performance as a systemic driver. Together, this study is the first to combine both econometric and machine learning approaches to reveal LPI to be a first-order determinant of ESG outcomes and not a secondary measure and to do so across countries, filling gaps in existing research.
3. Data and Methodology
One of the main methodological difficulties faced in the current research stems from the non-existence of a continuous historical time series of the Logistics Performance Index (LPI). The available LPI data intermittently between the period of 2007–2023 pose a number of missing values by country and year and thereby complicate the creation of a full and balanced panel dataset adequate to perform rigorous econometric and machine learning analysis. In a bid to overcome this problem and maintain the consistency and integrity of the data’s longitudinal form, a polynomial-regression-based interpolation scheme was utilized. Polynomial fitting was used to fill in missing values on a country-wise basis to rebuild realistic historical traces of the LPI values and avoid risks of injecting spurious biases using simpler linear interpolation methods. The methodology is informed by existing research suggesting the benefits of using imputation as well as advanced interpolation methods in LPI research ranging from genetic algorithm-based weights to imputation methods using regression [
56]. The second core analytic decision concerns ESG disaggregation. In contrast to keeping ESG as a combined or aggregate indicator, the research systematically breaks up the model into its three pillars—Environmental (E), Social (S), and Governance (G)—and studies the interrelation of LPI across each of these dimensions in turn. The pillar-wise design allows a finer and more detailed understanding of how the interactions between logistics performance and sustainability outcomes unfold than has been the case with prior research which tended to work with ESG as a uniform block. The research design is aligned with contemporary research underlining the different and diverging influence of a particular ESG dimension on firm and sector performance [
4,
57]. In keeping with the research question’s adverseness to simplicity, the analytic design follows both conventional econometric and sophisticated ML approaches. The econometric analysis was conducted by using Instrumental Variables (IV) panel regressions comprising both Two-Stage Least Squares (2SLS) and Generalized Two-Stage Least Squares (G2SLS) models to rigorously contend with endogeneity issues and ascertain causal interpretation of the estimated coefficients. Complementarily to the above, machine learning methodologies were implemented in both the regression and clustering tasks—utilizing Random Forest, k-Nearest Neighbors, Support Vector Machines, Decision Tree Regression, Boosting Regression, and Lasso in the case of the former and Density-Based Clustering, Fuzzy c-Means, Model-Based Clustering, Neighborhood Clustering, Random Forest Clustering, and Hierarchical Clustering in the case of the latter. The interplay between the econometric and machine learning models facilitates both the verification of outcomes by means of different methodological perspectives and the determination of nonlinear and latent patterns likely to pass under the radar of conventional regression analysis. These combined methodological options respond to the requirements of data constraints but also intensify the robustness, exhaustiveness, and novelty of the research’s empirical contribution to the extant literature on the topic of logistic performance and sustainable development (
Figure 1).
Study model. In this case, the proposed research will use a multi-method design to investigate the relationship between logistics performance and the different dimensions of Environmental, Social, and Governance (ESG). In this design, the Logistics Performance Index (LPI) will be the dependent variable, with the environmental, social, and governance dimensions as determinants. Such designs have been used previously in other studies that investigated the effects of different dimensions of ESG issues on the quality of institutions and the economic aspects of different countries [
58]. To specify the nature of this link, the analysis resorts to Instrumental Variable (IV) panel fixed-effect regression models, namely Two-Stage Least Squares (2SLS) and generalized (2SLS). These econometric models address endogeneity, missing variables, and reverse causality. Finally, the models assess the specific drivers of environmental, social, and governance variables on logistics performance across 163 countries spanning 2007–2023. Previous studies have shown that IV models, as well as panel regression models, may effectively examine the links between logistics performance, innovation, and environmental issues [
59,
60]. Apart from this established framework of causality, the study uses more complex machine learning models to examine nonlinear correlations, thereby improving predictability and the ability to identify hidden dynamics across nations. Regression models (Random Forest, Support Vector Machines, k-Nearest Neighbors, Decision Trees, Boosting, Lasso, or Elastic Net) will be used for the analysis of accuracy, while the application of clustering models (DBSCAN, Fuzzy C-Means, Hierarchical, Model-based, Neighborhood-based, or Random Forest) will identify structural variations, more specifically within models linked with different nations. Applying such models aligns with recent improvements in predictive analytics, where machine learning algorithms were rigorously tested for feature selection and model accuracy assessment in logistics models. By embracing this convergence, analyses that treat ESG variables as discrete will seek to identify their net impact on the logistics industry as a whole, delivering valuable insights into any correlations within the realm of sustainability studies. These analyses will also position the logistics industry as a key environmental agent, demonstrating the positive impact that increased adoption of best practices can have on minimizing resource use. Case analyses will also demonstrate the use of freight consolidation, routing, and other solutions that address industry improvement as a tool for environmental remediation, as defined by [
59] and subsequent studies such as [
60].
Study analysis. This empirical analysis combines econometric identification with predictions derived from machine learning models, focusing on the impact of environmental, social, and governance (ESG) factors on the logistics performance of 163 different nations from 2007 through 2023. Similar studies combining multiple models into a single methodology were recently used in tandem with other studies involving artificial intelligence models to achieve more realistic results through the intersection of economic models with AI models [
61]. Using instrumental variables (IV) panel regression analysis, the empirical results show a twofold, double-edged, but mostly negative implication of environmental variables for every Logistics Performance Index (LPI). More specifically, greenhouse gas (GHG) emissions, agricultural value added, air pollutants, and extensive agricultural use are positively or negatively associated with logistics efficiency. This illustrates that environmental and governance variables often have both positive and negative implications for economic performance, depending on the relevant environmental conditions and economic factors [
62]. Specifically, variables such as water accessibility, sanitation facilities, aging, education, and increasing elementary education enrollment rates reflect modest negative adjustments, whereas child labor reflects higher LPI levels. However, income inequality has strong negative effects on logistics activity, suggesting that stronger social development, with reduced income inequality, facilitates efficient value chain management. By contrast, predictions from machine learning models indicate that IV estimates of environmental stress, education, and demographics remain applicable, valid, and accurate. Integration between the two models results in more robust models with enhanced predictability, a methodological improvement supported by previous studies on machine learning-based predictions of logistics performance. Clustering analyses identify distinct elements within each country, defined by attributes such as air pollutants, extreme temperatures, and agricultural intensity, and determine distinct loci for each country. Thus, the empirical results for this topic show that multifaceted logistics sustainability prevails, implying that the well-balanced evolution of logistics must address, alongside environmental enhancement, improved social conditions and proper governance [
62].
Limitations. Across all analyses, the dataset includes 163 countries. Although such a large cross-country dataset helps derive corresponding cross-country correlations with relative ease, this inevitably affects the level of granularity available to inspect individual national settings. To address this tension, the necessity of accounting for national cross-country variability is carefully explained in this manuscript through complementary cross-country analysis strategies, including cluster and machine-learning algorithms.
5. Exploring the Interaction Between Social Factors and LPI in an ESG Context
This part examines the causality between the Logistics Performance Index (LPI) and the Social (S) pillar of the ESG framework in 163 nations from the period 2007 to 2023. Employing two-stage least squares (TSLS) and generalized two-stage least squares (G2SLS) techniques, the research looks at how important social variables like water and sanitation accessibility, education, population structure, income distribution and labor conditions influence the efficiency of logistics. Accounting for endogeneity by using a comprehensive set of instrumental variables, the outcomes show social development drivers to be important influencers of logistic performance and prove why socially inclusive approaches are required to boost supply chain systems everywhere.
5.1. Analyzing the S-Social Component’s Impact on Logistics Performance
This section explores the relationship between the Logistics Performance Index (LPI) and the Social (S) pillar of the ESG model. Using fixed-effects two-stage least squares (TSLS) and generalized two-stage least squares (G2SLS) methods, the study investigates how social factors—such as access to basic services, education, income distribution, labor market conditions, and demographic structures—impact logistics performance. The results reveal that improvements in social indicators can have both positive and negative effects on LPI, highlighting the intricate connections between human development, equity, and logistics efficiency within a sustainable growth framework.
We have estimated the following model:
(First Stage)
i = 163
t = [2007; 2023].
This research examines the determinants of the Logistics Performance Index (LPI) of 163 countries over a period of 17 years using fixed-effects two-stage least squares (TSLS) and generalized two-stage least squares (G2SLS) models with random effects. The framework includes a broad range of instruments capturing economic, demographic, governance, and environmental data. One key finding from the research has a direct bearing on the Social (S) component of the ESG framework. The endogenous variables—i.e., access to safely managed drinking water (PSMWS) and sanitation services (PSMS), elderly population percentage (PA65A), primary school enrollment (SEP), employment of children (CET), prevalence of overweight adults (POA), and income share held by the poorest 20% (ISL20)—all are dimensions of social development considered essential. The correlation unfolds as follows: More widespread provision of simple services like water and sanitation is somewhat counterintuitively negatively related to the variable of logistics performance. While statistically robust, however, the effect is small and implies high-performance social service provision may be related to more rigorous regulatory systems or greater operational costs marginally impacting the effectiveness of logistics. Demographic issues are also seen: a larger percentage of aging population and more enrollment in schools is negatively related to LPI. This may represent the effects of changing labor market fundamentals, whereby aging societies and higher education enrollment fewer youth in the workforce temporarily limit the labor available to the heavily labor-intensive industries like logistics. The opposite effect is identified in the case of the prevalence of child labor (CET), which has a strong positive effect on LPI—a worrying indicator. This indicates improving the performance of logistics in less developed economies may depend partly on exploitative employment arrangements. This has a fundamental social sustainability issue at its core: efficiency gains at the expense of youth welfare and human rights are unacceptable if it goes against the core tenet under the Social pillar of ESG. Equally, the positive effect of overweight prevalence (POA) on the variable of logistics performance is likely a reflection of deeper patterns of economic prosperity and consumerism requiring more sophisticated systems of logistics. This also has social concerns related to modern lifestyles and unjust food systems. The negative correlation of income inequality (ISL20) and the variable of logistics performance is a fundamental finding. In economies in which the bottom 20% of the population possess less income, logistics systems look less efficient. More economic inequality contributes to fragmented markets, stagnant mobility, and lower human capital, all of which contribute to less smooth logistics operations. From an ESG-Social stance, this result confirms that more inclusive economic development bolsters better-performing logistics and supply chain systems. The extensive range of tools utilized—and range of indicators including internet penetration and rule of law, female labor force participation and governance—also highlight social and institutional environments as the determinants of the performance of logistics. More robust social structures, improved legal protections and more inclusive labor markets are not social goods alone but also efficiency enablers of global supply chain operations. In general, this examination makes it evident that social development underpins the performance of logistics. Education, services provision, equality of condition, labor quality and provision of health services all play important parts. Logistics infrastructures policies to enhance them must be strongly integrated with social investment plans to guarantee progress in the area of logistic infrastructures does not happen at the expense of the development of humanity but hand in hand with it and in full coherence with ESG-S objectives.
Causality. The causal identification strategy employed—fixed-effects TSLS and G2SLS with a rich instrument set—permits a strong identification of the causal impact of social variables on the Logistics Performance Index (LPI). The coefficients imply the causal influence of variations in social development indicators on logistics performance and do not simply correlate with it. In particular, better access to safely managed water (PSMWS) and sanitation (PSMS), a larger elderly population percentage (PA65A), and increased school enrollment (SEP) are causally associated with a marginally declining LPI, possibly through augmented regulatory costs or labor force shortages. More troublingly, the causal positive effect of child labor (CET) on LPI illustrates how, in certain settings, improving the efficiency of logistics depends on unsustainable and ethically challenged forms of labor. The causal negative effect of income inequality (ISL20) on LPI also shows how more equal income distribution facilitates the efficiency of the logistics system. Significantly, the instrumental variables technique enhances the causal assertions by reducing endogeneity generated by reverse causality or missing variable bias. Nevertheless, low R2 values signify how social variables have statistically significant causal impacts but account for a minimal share of overall variance in the performance of the logistics system and argue in favor of combining social interventions with more general economic and infrastructural reforms.
Overall impact of the S-Social component within the ESG model. The evidence presents unequivocal empirical proof that the Social (S) pillar of the ESG framework has a causal and sizable yet multifaceted effect on the performance of logistics. Social improvements in indicators have a positive or negative impact on the Logistics Performance Index (LPI), highlighting the subtle tradeoff between operational efficiency and human development. The provision of fundamental services such as safely managed drinking water (PSMWS) and sanitation (PSMS), demographic transitions like population aging (PA65A), and increased enrollment in schools (SEP) are causally linked to declines of minor magnitude in the performance of logistics, probably indicative of increased regulatory costs or labor shortage. The worrying causal positive effect of child labor (CET) on LPI also indicates the persistence of socially unsustainable patterns supporting the efficiency of logistics in some economies. The positive causal effect of overweight prevalence (POA) on LPI also shows stronger consumer-led logistic requirements, while income inequality (ISL20) has a negative effect on logistic efficiency and highlights the importance of equalized growth. Although the causal evidence is statistically strong because a rich list of instrumental variables was used, the low values of R2 reveal a minimal share of variance explained by social variables. Summing up, the development of logistic performance has to be coordinated with socially sustainable development policies completely aligned with ESG-S principles.
5.2. Machine Learning Estimation of Socio-Economic Impacts on Logistics Performance
This section applies machine learning methods to estimate the relationship between socio-economic variables and the Logistics Performance Index (LPI). Several algorithms—including Boosting, Decision Trees, Random Forests, and Support Vector Machines—are evaluated based on normalized performance metrics. The K-Nearest Neighbors (KNN) algorithm emerges as the most accurate and robust model, achieving the lowest prediction errors and the highest explanatory power. Further analysis identifies key social predictors, such as school enrollment, overweight prevalence, and child labor incidence, highlighting the critical influence of human development factors on logistics performance. These results underline the complex interplay between social structures and logistic efficiency (
Table 10).
This cluster is seen to represent the overall or “baseline” population. Cluster 2, though having very small number of observations (8), also has a very high silhouette value of 0.791 as a testament to good clustering and separation between groups. The average values confirm positive NOE (+0.423), very low PM2.5 exposure levels (−2.623), very low agricultural land usage (−2.766), and high value added from agriculture, forestry, and fishing (+0.843). This proves that Cluster 2 consists of countries or regions with high productivity in terms of agriculture and good air quality despite relatively high nitrous oxide emissions [
87]. Cluster 3 with 238 has a high Heat Index 35 (+3.250), indicating extreme exposure to hot air and heat stress, with associated positive departures of NOE (+0.684) and PM2.5 exposure (+0.606). The silhouette value of 0.523 indicates good but imperfect separation of the groups. This group appears to represent countries or regions with both high exposure to heat and air pollution levels as per conclusions drawn in recent semi-supervised PM2.5 clustering and air pollution patterns by region by [
88,
89]. Within the quality of clustering, the silhouette values range from 0.382 to 0.791 across groups and are representative of an acceptable but imperfect data partitioning. The within-cluster sum of squares is very high on Cluster 1 (12,160.403), as a marker of data variability internally in the group and is very low on Cluster 2 (3.617), as an indication of the closeness of the small group. Generally, the model is capable of separating groups at the extremes of data distribution but a majority of the data fall into a very large heterogeneous core group [
87].
Using the K-Nearest Neighbors (KNN) algorithm to forecast the Logistic Performance Index (LPI) on the basis of socio-economic and demographic variables produces results both statistically robust and informative in terms of substance. Primary school enrollment (SEP) is the most significant predictor identified by feature importance assessment expressed as mean dropout loss (28.085), followed by adult overweight prevalence (POA, 26.403) and child labor (CET, 26.196). Other variables, such as access to safely managed sanitation services (PSMS), population percentage aged 65 and above (PA65A), income share of the lowest 20% (ISL20), and percentage of population with access to safely managed drinking water services (PSMWS), are also contributory but to a lesser magnitude. These results imply educational level, labor and public health indicators are fundamental determinants of logistic capacities at the national level (
Figure 6).
The additive feature attribute analysis of the test dataset better represents the effects of single predictors on the model’s predictions. In all scenarios, the base prediction, the model’s prediction when particularized feature effects are removed, is a fixed value of 10.241. Deviations from the baseline represent the subtle interactions among variables: School enrollment (SEP) has a consistent strong positive effect on LPI predictions everywhere, especially in cases 1 to 4. Contrariwise, access to drinking water services (PSMWS) consistently has a negative effect, especially in cases 2, 3, and 4, and represents a mediated association with logistic performance by other infrastructural or governance variables. The negative effects of overweight prevalence (POA) and child labor (CET) also demonstrate the adverse effect of labor market distortions and healthcare on logistic efficiency. These inferences are consistent with recent studies using SHAP (Shapley Add ExPlanations), which demonstrate the capacity of the technique to identify the marginal effect of predictors on models with a high degree of complexity [
90,
91]. Overall, the KNN model not only makes good LPI predictions but also allows better interpretation by quantifying the marginal effects of key socio-economic variables, in a manner analogous to the SHAP-based explanations used in the prediction of the attrition of employees and diagnostics in healthcare [
91,
92]. These inferences demonstrate interdependencies between logistic output and human development indicators and represent the significance of social policy considerations in logistic performance maximization plans (
Figure 7).
5.3. Clustering to Verify the Relationship Between LPI and the S-Social Component of the ESG Model
This study examines the predictive correlation between the Logistics Performance Index (LPI) and a range of socio-economic and demographic variables using machine learning regression methods. Comparing different algorithms using normalized performance measurements highlights K-Nearest Neighbors (KNN) as the optimal technique to capture the underlying variance in logistics performance. Not only does KNN perform better in terms of predictive precision, but it also provides innovative insights into relative importance values of important social variables like education, health, and labor conditions. The investigation underscores how socio-economic development indicators play a pivotal role in determining logistics outcomes, thus supporting socially inclusive logistics approaches in the ESG framework (
Table 11).
Based on normalized performance measurements, Neighborhood-Based Clustering is the most suitable out of the methods considered. This is evident in better performance on a set of core clustering validity measurements. Notably, it has the best R
2 value with a higher percentage variance explained compared to other methods. Moreover, it has a high Silhouette score, reflecting good internal cohesion and good separation between groups—properties of paramount importance to measuring the quality of a clustering structure [
93]. In addition to that, its strategically low maximum diameter and acceptable minimum separation values further attest to Neighborhood-Based Clustering to effectively minimize within-cluster dispersion and maintain different groups separated. Though it fails to achieve the best AIC and BIC values to evaluate model simplicity and goodness of fit, its performance remains competitive considering the merit of structural clarity and interpretableness to clustering analysis [
93]. Density-Based Clustering approaches, for example, despite having best scores on maximum diameter and Dunn index scores, register poor Silhouette values and weaker R
2 values and demonstrate weaker model robustness in the respective setting of this type of application [
94]. Likewise, while targeted metrics have good performance by Random Forest Clustering, it does not outperform consistently on all dimensions. Although it has good performance on certain dimensions of the clustering problem, its stability and interpretableness are unstable on different datasets [
95]. Neighborhood-Based Clustering therefore has the best trade-off among the considered methods between separation and compactness and model explanatory power and stability. Overall performance also means it is best suited to applications requiring consistent group distinction as well as internal consistency to exist and best used in the setting of the current investigation (
Table 12).
Applying Neighborhood-Based Clustering to the chosen socio-economic and demographic variables confirms a significant splitting of the dataset into ten groups with different profiles by logistic performance and corresponding indicators of human development. The silhouette values are mostly average but confirm acceptable cohesion among the groups, with cluster groups 8 and 10 sharing the highest internal consistency (0.450 and 0.430, respectively), suggesting consistency in relatively homogeneous patterns in the data [
96]. The explained percentage of heterogeneity among the groups further confirms adequacy in the model, as in Cluster 5, the low percentage of heterogeneity (0.041) and a high cluster center LPI value (3.309) pick out a distinctive group with high logistic performance. Clusters 5 and 10 are indeed the most differentiated structural groups and show much higher Logistic Performance Index values compared to other groups with central values around negative LPIs [
97]. A look at the cluster centers picks out significant socio-economic contrasts. The groups found to have a higher LPI values are predominantly marked by improved coverage in terms of sanitation (high scores on PSMS), relatively higher proportions of elderly population (PA65A), improved coverage of safely managed drinking water (PSMWS), and more balanced income distribution (ISL20). The groups found to have low LPI centers (now classified as groups 3 and 7) are marked by negative performance in all of the above dimensions combined with increased prevalence of child labor (CET) and decreased enrolment in schools (SEP), suggesting structural weaknesses [
96]. Surprisingly, Cluster 8 has a positive logistic profile even though it has low scores on water service indicators, implicating the hypothesis that education and income distribution may in this group make up deficits in infrastructure. These patterns amplify the importance of the inclusion of socio-economic dimensions in clustering methods in the case of logistics and infrastructure evaluation, as shown in previous examples of clustering in supply chain and logistic environments [
95]. Overall, the results demonstrate that logistic performance is closely intertwined with broader social determinants, including education access, labor market conditions, health outcomes, and basic service provision, confirming the multi-dimensional nature of logistics capacity within national and regional contexts [
97].
7. Policy Implications
The results of this research have vast implications for policymakers seeking to coordinate improvements in Logistics Performance with overall Environmental, Social, and Governance factors. The outcomes of this research reveal strong correlations between improvements in the Logistics Performance Index (LPI) and key factors in the Environmental, Social, and Governance domain, thereby validating current idioms regarding the essential role of logistics infrastructure in Sustainable Development Policies [
120]. From a governance perspective, the strong correlations between LPI and other factors such as Government Effectiveness (GEE) and Regression Estimate of Regulation Quality (RQE) highlight the need for more open and effective governance in logistics [
121]. From a social perspective, the study shows that effective logistics networks support overall achievement of social rights, as evidenced by Economic and Social Rights Performance Scores (ESRPS). This indicates that effective logistics networks contribute positively to the access of necessary products and services in society, underscoring the strategic need to align logistics network investment with clear social aims. An environmental perspective analysis indicates that effective improvements in the sector should be aligned with robust environmental regulatory frameworks and incentives that neutralize the negative environmental impacts of supply chains [
120]. The clustering analysis also illustrates the diversified patterns of country-level ESG and logistics achievements, thus emphasizing the need for differentiated approaches [
96]. Countries with weaker LPI and poor ESG outcomes should focus on institutional and infrastructural improvements, whereas more successful countries should focus on further improving environmental and social sustainability dimensions [
5]. The significance of scientific achievement (STJA) in improving logistics achievement also draws attention in this study, suggesting that innovation and research strategies should align with or follow overall logistics improvements [
4]. Finally, political stability and overall values (PSAOV) become essential factors in ensuring effective logistics systems, underscoring the need for intersectoral governance and policymaking.
Theoretical implications. Several theoretical implications of this analysis emerge as a consequence of its results. Among them, the most central implication lies within the assumption of ‘two-sided logistics’ that not only captures its enabling dimensions but is also sensitive to any negative externality that might emerge as a consequence of deregulation [
122]. From a theoretical perspective, this report clearly shows that, through the use of appropriate causal analysis, logistics, as well as sustainability, move beyond traditional linear or unidirectional notions. Rather, they assume multiple-layered, structurally defined trade-off forms that transform and evolve in specifically dynamic ways [
123]. The application of instrumental variables with machine learning tools in logistics and sustainability pushes beyond conventional descriptive or correlation-based literature, opening theoretically innovative dimensions in defining logistics and sustainability as theoretically robust. Logistics, on the one hand, can be regarded as enabling infrastructure for achieving logistics sustainability, whereas, on the other hand, negative externalities may emerge with unregulated logistics [
10]. Empirical confirmations of the Logistics Performance Index (LPI) as well as its improvements that contribute, along with their effects, towards achieving logistics sustainability exist. Improvements in LPI will enhance social sustainability, such as better education or the absence of child labor in some industries and countries. However, LPI, along with logistics, may cause negative social imbalances or inequalities that may call for structural and meaningful corrective measures [
43]. Moreover, environmental sustainability may emerge due to the promotion of clean technologies along with more effective resource management. However, negative environmental imbalances, such as pollution, may emerge from such LPI, along with their effects that may call for structural and meaningful remedies [
123]. Thus, theories recognize the application of “two-sided logistics sustainability” that relies heavily on, as well as being dependent on, some extremely specific assumptions: that logistics may enhance social sustainability in multiple aspects (improving educations or eliminating child labor in some industries) or may generate social inequities requiring attention, along with contributing towards environmental sustainability in some ways, however, with the possibility of environmental pollutions [
10,
122]. Further, this analysis refutes the existing theoretical consideration that takes a marginal stance concerning the role of environmental and demographic variables in the field of logistics. However, the causal dynamics of air pollutants, GHG, heat stress, and logistics efficiency identify the importance of theoretically formulating logistics systems as units that respond to environmental dynamics [
124]. This goes beyond the existing theoretical focus that considers infrastructure, cost, and trading volume. An extra theoretical implication relates to the formulation of ESG variables. By employing a disaggregated framework, there would be the possibility of proving that the interaction of the three variables of ESG with logistics system dynamics differs in ways that would not be recognized via the aggregate metric [
43,
124]. Finally, in light of the above, the interaction of machine learning with econometric insights within the methodology is a material theoretical component, clearly showing that large-scale sustainability dynamics are, in fact, nonlinear, such that country-level profiles must be factored in for their effective probing. In other words, this theoretical exercise again vindicates the need for hybrid studies in the theory of supply chain dynamics and sustainability [
122,
123]. Moving ahead, a practical policy implication would be that carbon pricing policies must be adopted with a focus on keeping corporations on their toes in terms of carbon emissions, along with providing incentives for more investments in green technology. At the same time, green freight routes may significantly enhance the efficiency of logistics along with its positive effects on the environment [
10].
Practical implications. A number of the results have implications that are important for policymakers, decision-making bodies in international forums, investment circles, and managers responsible for logistics. Recent studies suggest that improvements in logistics do not always yield positive outcomes for environmental, social, and governance (ESG) factors, requiring policy measures to reconcile efficiency with sustainability [
125]. However, it seems entirely valid that improvements in logistics will increasingly necessitate more focused strategies, as greater efficiency may increase costs in terms of sustainability. With regard to environmental aspects, stronger links in the logistics system exhibit a strong positive correlation with NOx emissions, whereas air pollutants, specifically particulate matter (PM2.5), negatively affect efficiency. These findings align with existing empirical insights that link logistics practices with material environmental spillovers, such as higher NOx levels, when effective sustainability-mitigating strategies are not considered [
19,
126]. This further explains that any project linked to logistics practices must effectively address comprehensive environmental protection, including decarbonization, environmentally friendly transportation, ecologically safe warehouses, and climate-proof infrastructure, as mentioned by [
19]. Socially, the topic highlights the importance of incorporating human development considerations into logistics planning. Specifically, the positive causal link between Child Labor (CET) and the Logistics Performance Index (LPI) across different societies highlights that some logistics improvements may have stemmed from problematic social behaviors. These results confirm that logistics improvement must operate within the parameters of responsibility, so that the development of the value chain occurs in a fair, humane manner [
126]. Also, the negative impacts of demographics, such as aging societies or increasing school enrollment, on logistics efficiency indicate that the adoption of effective active labor policies, as well as the promotion of automated technology, would be necessary as a buffer against changes in the unavailability of the workforce [
127]. Thus, social logistics and sustainable logistics practices must be seen as structural aspects that ensure the supply chain’s resilience, rather than mere afterthoughts. With regard to governance, nations with better quality governance, defined as better regulation, rule of law, accountability, as well as scientific capabilities, show better logistics capabilities [
125]. This explicitly indicates that betterment in the Logistics Performance Index requires improvements in governance quality [
19]. Machine learning analyses provide additional helpful tools for decision-makers. Prediction algorithms such as Random Forests or K-Nearest Neighbors (KNN) may help governments and companies predict logistics efficiency. In addition, cluster analysis helps identify different national profiles. This will allow global organizations and development finance institutions to develop strategies grounded in concrete environmental or structural issues [
126]. In conclusion, the implications of the mentioned dimensions are that Sustainable Logistics requires that the processes of infrastructure development, environmental issues, social aspects, and the quality of institutions must move forward together. Thus, logistics transformation must be incorporated into broader sustainable development paradigms so that efficiency for both the economy and the environment can be achieved [
125,
127].
8. Discussion
In this regard, the analysis illustrates the complex but intricate link between logistics performance and the trilogy of E(S)G issues with sufficient evidence that shows that improvements in the Logistics Performance Index (LPI) positively correlate with environmental, social, as well as governance issues, although with different degrees of intensity [
4,
124]. Regardless of the models, be it economics, AI, or cluster models, the fact remains that logistics efficiency is both driven as well as a driver of sustainability, with the LPI being the mediator that facilitates the link between environmental sustainability, social issues, and good governance [
43]. Environmentally, the findings indicate a strong trade-off among economic development, resource use, and environmental conditions. There is a positive correlation between higher LPI rankings and greater GHG emissions, such as nitrogen oxides, indicating that as economic development occurs, associated carbon-intensive practices often increase. This observation aligns with the traditional diseconomies observed in early-stage industrialization, where increased transport infrastructure, warehousing, and freight activity lead to higher associated GHG emissions [
15]. At the same time, environmental degradation, particularly air pollutants such as PM2.5, significantly costs the logistics system. Air pollutants contribute to poor logistics performance, leading to reduced productivity and logistical infrastructure disruptions [
43].
It is further found that there is a positive correlation between extreme heat exposure and LPI, which is explained by the efforts of hot-region countries that invested significantly in resilient logistics structures to overcome inefficiencies caused by weather conditions [
124]. Land use factors further interact with this aspect, such that a higher percentage of agricultural land is associated with poorly developed logistics infrastructure, but higher value addition in agriculture is associated with better logistics efficiency. This difference indicates that subsistence agriculture does not enhance logistics efficiency, but a more commercialized agricultural sector promotes investment in the Cold Chain, exports, and transport [
4]. References confirm the importance of environmental variables, pinpointing agricultural land, nitrous oxide, PM2.5 air quality, and heat stress as the top determinants of LPI [
43]. Taken together, this literature collectively suggests that, contrary to the topic’s periphery, environmental issues play a fundamental, defining role within the efficiency of logistics, finding that any improvement within this realm will necessarily require addressing issues of environmental degradation as well as those of climate change [
15]. Socially, the implication of this finding is that there is a complex interaction of positive and negative effects, pointing towards a convergence of logistics performance with human development. Higher LPI scores indicate improved education, as evident through higher education enrollment and fewer children in the workforce, as the workforce, processes, and technology required in logistics systems depend on human knowledge, competencies, and technology-enabled human resource capabilities, thus their development in societies with positive education dynamics [
4]. However, income inequality appears to be a robust negative factor for logistics performance, suggesting that societies with income inequality will suffer from inefficient human resource allocation, unconsolidated service value, and inadequate infrastructure accessibility. Simultaneously, other dimensions, such as an aging population and imbalanced access to basic services, appear as weak negative determinants of logistics efficiency. Such results indicate that, despite human development, logistics development may increase inequalities in societies, which may further strain them if proper social policies do not channel appropriate attention [
15]. The results confirm that logistics performance extends beyond techno-infrastructural aspects, encompassing social cohesion, human capital development, and equal opportunities for economic participation [
124]. In terms of governance, the results are more definitive, being strongly positive. There is a positive correlation between higher governance, as indicated by more effective governments, the rule of law, the quality of regulation, scientific productivity, and logistics performance. This occurs because the enabling effects of governance shape proper regulations, customs policies, the enforceability of contracts, and stable environments that attract investment in logistics infrastructure [
43]. Voice and accountability, combined with active scientific production, further enhance the enabling effects of innovation-modified logistics systems [
124]. Similarly, machine learning models identify governance variables as robust predictors of LPI, with cluster analyses affirming that nations with strong governance, as identified via systematic clustering, position closely with strong logistics practices [
4]. These indicate that governance as a consideration is essential, as it promotes efforts toward logistics modernization, allowing better alignment between the scale of logistics development and ESG principles [
15]. In any case, the conclusions of this research indicate that logistics performance is firmly positioned at the center of the dynamics of ESG. Environmental considerations call for proper management via greener innovation as well as more climate-resilient infrastructure, with the social factor pinpointing that logistics systems always remain strong within more equitable, more educated societies, contrary to effects that generate reduced efficiency within more inequitable, poorly developed societies, with governance proving as a cornerstone much more closely associated with improved logistics performance [
43,
124]. See
Table 21.