The Contribution of Sustainability and Governance Signals to Return on Equity Prediction: Evidence from Tree-Based Machine Learning, Bootstrapped Grouped CV and SHAP

Talaş, Hasan; Gök, Ela Naz; Akçakanat, Özen; Gültekin, Gürkan; Terzioğlu, Mustafa; Tutcu, Burçin; Ünal Uyar, Güler Ferhan

doi:10.3390/jrfm19020106

Open AccessArticle

The Contribution of Sustainability and Governance Signals to Return on Equity Prediction: Evidence from Tree-Based Machine Learning, Bootstrapped Grouped CV and SHAP

by

Hasan Talaş

¹

,

Ela Naz Gök

²

,

Özen Akçakanat

³,

Gürkan Gültekin

⁴

,

Mustafa Terzioğlu

¹

,

Burçin Tutcu

¹ and

Güler Ferhan Ünal Uyar

^5,*

¹

Accounting and Tax Department, Korkuteli Vocational School, Akdeniz University, Antalya 07058, Türkiye

²

Independent Researcher, Antalya 07058, Türkiye

³

Department of Banking and Finance, Faculty of Economics and Administrative Sciences, Süleyman Demirel University, Isparta 32260, Türkiye

⁴

Keçiborlu Vocational School, Applied Sciences University of Isparta, Isparta 32200, Türkiye

⁵

Faculty of Economics and Administrative Sciences, Akdeniz University, Antalya 07058, Türkiye

^*

Author to whom correspondence should be addressed.

J. Risk Financial Manag. 2026, 19(2), 106; https://doi.org/10.3390/jrfm19020106

Submission received: 8 January 2026 / Revised: 27 January 2026 / Accepted: 29 January 2026 / Published: 3 February 2026

(This article belongs to the Section Financial Technology and Innovation)

Download

Browse Figure

Versions Notes

Abstract

In the global economy, traditional accounting-based ratios alone are often insufficient to fully explain firm performance, increasing the importance of complementary information sources such as sustainability and governance disclosures. In this context, environmental, social, and governance (ESG) indicators, together with corporate governance signals, have increasingly been recognized as important drivers of firm performance. However, the literature does not provide a clear and generalizable view on the impact of ESG indicators on profitability. This study aims to examine whether sustainability and corporate governance signals provide additional information value beyond traditional financial ratios in predicting ROE. To this end, two models were compared using a sample of 428 non-financial publicly traded companies operating in Turkey. The firm-level dataset was constructed using financial statements and independent audit disclosures obtained from the Turkish Public Disclosure Platform (KAP). Tree-based machine learning models were employed to capture potential nonlinear relationships and complex interactions between financial and non-financial indicators. Model performance was evaluated within a Bootstrapped Grouped Cross-Validation framework that considered firm-level dependency; the statistical reliability of performance differences was tested using bootstrap-based confidence intervals and matched tests. Among the evaluated models, Random Forest achieved the strongest overall predictive performance. In conclusion, this study demonstrates that sustainability and corporate governance disclosures provide statistically significant additional information value to ROE prediction. Due to the use of multiple algorithms, it contributes to the literature in a generalizable manner.

Keywords:

corporate sustainability; corporate governance; machine learning; bootstrapped grouped CV; financial performance; ROE

1. Introduction

It is well known that, from a traditional business perspective, accounting-based ratios are the fundamental criteria for measuring financial performance. However, crises arising from different perspectives in the global world, such as climate risk, social inequalities, corporate scandals, and management failures, reveal that profitability cannot be explained solely by the income statement and balance sheet. To address this shortcoming, complementary sources of information have become necessary. Businesses have consequently become familiar with the ESG approach, which encompasses environmental, social, and governance dimensions. This approach provides businesses with a framework for assessing their risk profiles, strategic orientations, and performance sustainability.

In this context, a review of the literature reveals that there is no consensus among studies examining the impact of ESG indicators on financial performance. Particularly when examining performance metrics that directly concern investors, such as Return on Equity (ROE), the impact of ESG indicators cannot be clearly articulated. It is thought that the most significant reason for this is that the studies conducted are predominantly based on linear regression models and linear propositions.

The increasing use of machine learning methods in the literature contributes to the modelling of non-linear relationships. Nevertheless, significant methodological limitations are apparent. The majority of studies focus on a single algorithm, and performance differences between models cannot be statistically reported. As a result, it is not possible to clearly explain the impact of ESG indicators.

The primary motivation behind this study is to test, in a generalizable manner, whether ESG indicators can provide information beyond financial ratios for ROE prediction. To this end, two separate models are compared. The first model incorporates financial ratios, while the second model incorporates sustainability and governance signals. In this study, governance signals refer to firm-specific corporate governance disclosures (such as board structure and audit-related information), which are treated separately from aggregated ESG “G” components in order to capture more granular governance effects. The most important feature that distinguishes this study from others is the use of the Bootstrapped Grouped Cross-Validation approach. This method allows for the statistical reporting of model performance. In this respect, it sheds light on a gap in the literature. Furthermore, the combined use of different tree-based algorithms such as Random Forest, XGBoost, LightGBM, and CatBoost in the study enhances methodological robustness by testing the algorithm-sensitivity of the findings. Consequently, this study offers a unique contribution to the literature by testing the additional information value that sustainability and corporate governance indicators provide for ROE prediction within a multi-algorithmic, information leakage-resistant, and statistically reliable comparison framework. From a conceptual perspective, financial performance reflects not only firms’ operational efficiency but also their governance quality, risk management practices, and strategic sustainability orientation. Accordingly, indicators related to profitability, liquidity, leverage, and corporate governance are expected to capture complementary dimensions of firm performance, making them suitable predictors within a machine learning framework.

Based on the above discussion, this study tests the following hypotheses:

H1.

A model incorporating sustainability and corporate governance indicators provides higher out-of-sample predictive performance for ROE than a model based solely on financial ratios.

H2.

The improvement in out-of-sample predictive performance obtained by adding sustainability and governance indicators to traditional financial ratios is statistically significant but limited in magnitude.

H3.

The contribution of sustainability and governance indicators to ROE prediction is algorithm-sensitive and more pronounced in boosting-based tree models.

2. Conceptual Framework

2.1. Corporate Sustainability

Traditionally, businesses have focused on producing goods and services at profitable and competitive prices to meet customer needs. However, due to limited resources, ecosystem degradation and accelerating climate change, it is not possible for businesses to contribute to sustainable development through their traditional role (Danciu, 2013). Therefore, corporate sustainability is now considered not as an alternative to economic growth and profitability, but as a complementary element. In line with this trend, it is predicted that sustainability will be the real business success strategy in the business environment in the coming period. In this context, it is not sufficient for companies to focus solely on growth and profitability targets; it is also crucial that they fulfil their environmental responsibilities, support social justice and equality, and ensure economic development in line with sustainability principles (Wilson, 2003; Danciu, 2013).

The concept of sustainability aims to meet the needs and expectations of the present generation without compromising the ability of future generations to meet their own needs (Brundtland Report, 1987). The concept of sustainability has been shaped by the convergence of environmental, economic, and social dimensions to achieve this long-term balance (Cox & Cusick, 2006; Hahn & Kühnen, 2013). Each of these dimensions necessitates that business activities be evaluated holistically, not only in terms of financial indicators but also from environmental, social, and economic perspectives (Lu et al., 2016).

With the increasing importance of the concept of corporate sustainability today, businesses are adopting various sustainability reporting frameworks to identify appropriate methods for better explaining their activities in environmental, social and economic matters to target stakeholders (Goswami et al., 2023). The effectiveness of these frameworks is related to how the relevant reports are prepared and reported. Among these, the sustainability reporting frameworks and standards that are prominent and widely used today are the Global Reporting Initiative (GRI), the International Integrated Reporting Council (IIRC), the Sustainability Accounting Standards Board (SASB), the Climate Disclosure Project (CDP), and the Climate Disclosure Standards Board (CDSB) (Bose, 2020).

Each of the relevant reporting frameworks and standards has been developed to ensure that sustainability reports are shared transparently and accurately with stakeholders. However, preparing, publishing and keeping up to date with the latest developments in sustainability reporting is challenging and time-consuming. Despite all these difficulties, the guidelines published by the Global Reporting Initiative (GRI) among these standards are frequently used in sustainability reporting and stand out as one of the most widely used methods in the literature (Isaksson, 2019). The GRI Standards outline stakeholder engagement, sustainability context, materiality, and the requirements for the accuracy, comparability, and reporting process in creating a report. According to this standard, an organization completes the reporting process by determining its long-term environmental, social, and economic strategies, risks, and objectives (GRI 101, 2016).

In this context, sustainability reports provide additional information on the environmental, social and economic conditions of businesses, offering a much more comprehensive set of information on long-term performance, risk management and transparency. Thanks to these features, sustainability reports become a valuable source of information for investors in terms of assessing the long-term sustainability of companies (Dhaliwal et al., 2011). Therefore, sustainability reports, together with financial reports, offer higher information value for investors. In Turkey, financial reporting for publicly listed firms is mandatory under Capital Markets Board (CMB) regulations, while sustainability reporting remains largely voluntary. Companies disclose financial statements, independent audit reports, and selected corporate governance information through the Public Disclosure Platform (KAP). Governance-related disclosures typically include board structure, ownership concentration, audit committee characteristics, and compliance with corporate governance principles. Although sustainability reporting is encouraged, only a subset of firms provides standalone sustainability reports or detailed ESG-related disclosures. This institutional setting implies substantial heterogeneity in non-financial transparency, making governance signals particularly informative in explaining firm-level performance.

2.2. Corporate Governance and Sustainability—Governance Interaction

Corporate governance is the entire structure that forms the basis of the correct management approach that determines the path to be followed to achieve corporate goals, in addition to ensuring that businesses comply with certain rules (Naciti et al., 2022). Furthermore, corporate governance is defined as a set of rules that ensure investors who provide funds to the business receive a profitable return on their investment (Shleifer & Vishny, 1997). In this regard, businesses contribute positively to shareholder value in the long term by increasing the accountability of managers and the quality of decision-making through corporate governance mechanisms (Khan, 2011).

The importance of the concept of corporate governance has increased further with the growing distinction between ownership and control among shareholders and company managers today. In this context, the corporate governance structure shapes the company’s decision-making processes by defining the distribution of rights, responsibilities and authority among the board of directors, managers and other stakeholders. This creates a comprehensive system for setting company objectives, establishing the processes necessary to achieve these objectives, and monitoring business performance (OECD, 2004).

Ultimately, the fundamental purpose of successful corporate governance is to add value to the company and to ensure the participation of everyone who contributes directly or indirectly to the creation of this value. Therefore, proper governance activities aim to ensure that shareholders are rewarded for the capital they provide, that managers and employees receive compensation for their labor, that customers are provided with higher quality products and services, that suppliers are paid on time and regularly, and that lenders are guaranteed timely repayment of the financing provided (Castrillón, 2021).

Establishing a sound governance system in companies sends strong signals that the company is properly managed and monitored, and that stakeholder interests are prioritized. Consequently, this system ensures that all stakeholders have access to accurate information. This is because the transparent and explanatory implementation and monitoring of sustainability reports is achieved through the governance structure (Michelon & Parbonetti, 2012). A company’s board of directors plays a critical role in the adoption of corporate sustainability and corporate governance within the company, in ensuring stakeholder participation, and in measuring and explaining non-financial information. Furthermore, companies with a strong position in terms of corporate sustainability have been seen to achieve better results in the stock market and financial performance in the long term compared to their competitors (Eccles et al., 2014). In this context, the contributions of the corporate governance structure to the adoption of sustainability principles at the corporate level have gained a more holistic dimension with the integration of the increasingly important ESG-focused management approach into company strategies. The increasing importance of disclosures related to the environmental (ENV), social (SOC) and governance (GOV) dimensions of ESG indicates that businesses are moving towards sustainable and responsible management practices. Today, companies are making ESG factors an integral part of their corporate strategies in order to meet the expectations of investors, stakeholders and regulatory bodies. ESG performance and corporate governance practices support businesses’ resilience, risk management capacity, and long-term financial success by reducing environmental risks, strengthening social responsibility, and increasing the transparency of management processes. In academic literature, it is widely accepted that high ESG scores and effective corporate governance mechanisms have a positive impact on company performance by reducing information asymmetry, enhancing corporate reputation, and strengthening stakeholder confidence. The fact that investors prefer such companies because they associate them with a low risk profile and more stable returns further increases the strategic importance of ESG and corporate governance on financial performance. In this context, addressing ESG performance and corporate governance together provides a critical analytical approach to understanding companies’ sustainable competitive strength, risk mitigation capacity, and long-term value creation ability (Basali, 2025; Buchetti et al., 2025).

The implementation of corporate governance practices offers numerous significant benefits to businesses. These benefits enable companies to increase their operational efficiency and performance by using their resources more efficiently, attract new investors, increase company value by reducing capital costs, and achieve greater visibility and reputation in the public eye (International Finance Corporation, 2010). Furthermore, a robust governance structure provides additional information on internal risks, management quality, and decision-making processes not included in financial reports, enabling investors to make more accurate assessments of the company. Corporate governance indicators contribute to a more accurate, comprehensive and multidimensional analysis of company performance by providing additional explanatory power in the assessment of financial reports, thereby reducing uncertainty in investors’ decision-making processes (Gompers et al., 2003; Bushman et al., 2004).

3. Literature Review

Table 1 summarises selected empirical studies examining the relationship between ESG indicators, corporate governance, and firm performance.

Table 1 summarises the literature examining the relationship between ESG indicators, corporate governance, and financial performance in chronological order. When current studies are evaluated thematically, the literature is seen to be concentrated on four main axes.

The first strand of the literature consists of studies that directly test the relationship between ESG performance and firm profitability and financial outcomes. A significant portion of these studies suggests that ESG indicators may be positively related to accounting-based performance measures such as ROA, ROE, and EBIT (e.g., De Lucia et al., 2020; Herman et al., 2025; D’Amato et al., 2021). However, this literature generally examines the impact of ESG on financial performance through a single model or limited set of methods; while partially capturing non-linear structures, it does not assess the statistical reliability of performance differences between models.

The second group of literature focuses on modelling and explaining ESG indicators or ESG scores using machine learning and deep learning methods. Studies in this context show that ESG rating processes can be re-estimated (Del Vitto et al., 2023), ESG data can be analysed using NLP and deep learning techniques (Lee et al., 2022), and the determinants underlying ESG scores can be revealed using explainable ML tools (Kim & Lee, 2025; Pei, 2025). While these studies offer significant methodological contributions to the ESG measurement process, they do not directly test the marginal contribution of ESG to financial performance prediction and mostly focus on the internal structure of a single model.

The third strand of literature examines the impact of ESG and sustainability disclosures on forecast accuracy, information asymmetry, and prediction errors. Studies on analyst forecasts and financial prediction accuracy show that sustainability disclosures can improve the information environment and reduce forecast errors (Acheampong & Elshandidy, 2025). Conversely, some studies reveal that ESG’s contribution to financial forecast accuracy may be limited or model-sensitive (Dincă et al., 2025; Dossa et al., 2025). A key methodological issue in this literature is the failure to account for firm-level dependence in panel-like data structures and the insufficient consideration of the risk of information leakage that may arise in the training–testing split. Furthermore, supporting performance differences between models with confidence intervals and statistically testing them is largely neglected.

The final group of literature focuses on systematic reviews and bibliometric analyses of studies in the AI–ESG field (Davidescu et al., 2025; Ferrari et al., 2025; Mohsin & Nasim, 2025). These studies show that the ESG and machine learning literature has grown rapidly since 2015, with explainability, risk management, and ESG measurement standards coming to the fore. However, these reviews emphasise that the fundamental problem in the field is the lack of data standardisation and methodological consistency; they offer limited contributions to model comparisons at the empirical level and the statistical reliability of prediction performance.

When the current literature is evaluated overall, it is seen that the majority of studies have structural limitations, such as relying on a single model, not using group-based validation techniques that prevent information leakage, not statistically testing performance differences between models, and not testing the additional information value that ESG provides to financial performance from a generalisability perspective.

This study systematically tests the marginal information value that ESG provides to ROE prediction by comparing a basic model based solely on financial ratios with an enriched model that includes sustainability/governance indicators in addition to financial ratios on the same sample. Grouped Cross-Validation prevents information leakage arising from firm-level dependencies; the statistical reliability of model performance differences is assessed through bootstrap-based confidence intervals and matched tests. Furthermore, applying SHAP analysis to all models allows for examining whether variable contributions are consistent independently of the model, rather than being model-specific. In these respects, the study offers a unique and robust contribution to the ESG–financial performance literature, both methodologically and empirically. Unlike prior studies, this paper explicitly quantifies the marginal contribution of sustainability and governance indicators by formally testing performance differences between competing models under a grouped cross-validation design. By combining multi-algorithm benchmarking, leakage-aware validation, and bootstrap-based statistical inference, the study moves beyond descriptive model comparisons and provides reproducible evidence on whether ESG-related information adds economically and statistically meaningful predictive value. Beyond statistical significance, the observed improvements are also economically meaningful, as even modest gains in ROE prediction accuracy may translate into substantial differences in firm valuation, risk assessment, and investment decision-making. These performance differences are formally evaluated using matched statistical tests alongside bootstrap confidence intervals, allowing us to distinguish systematic information gains from random variation. Accordingly, the reported improvements reflect both statistical reliability and economically meaningful effect sizes, rather than model-specific artifacts.

4. Methodology and Approach

The aim of the study is to test whether sustainability/governance indicators provide additional information value in predicting companies’ financial performance (ROE) beyond financial ratios. To this end, two model families were compared on the same sample. These are (i) the basic model based solely on financial ratios (Model 1) and (ii) the enriched model that includes financial ratios and sustainability/governance indicators derived from companies’ annual reports (Model 2). Tree-based ensemble models were selected because of their ability to capture nonlinear relationships and higher-order interactions commonly observed in financial data. In particular, Random Forest benefits from variance reduction through bagging, while boosting-based methods (XGBoost and LightGBM) sequentially improve weak learners by focusing on residual errors, allowing governance and sustainability signals to interact with financial ratios in a more flexible manner. This design enables a direct assessment of whether ESG-related variables provide incremental predictive content beyond traditional financial fundamentals.

To establish the forecasting framework and reduce information leakage, the dependent variable ROE_(i,t) (ROE for 2024 in this study) has been defined. All independent variables are taken from period t − 1 (2023 in this study).

{R O E}_{i, t} = f (x_{i, t - 1})

Thus, the ROE (for year 2024) forecast is produced using only financial information disclosed to the public at the end of 2023 and governance/sustainability disclosures in the 2023 activity report.

Financial data and activity reports were obtained through the Public Disclosure Platform (KAP) operating in Turkey. KAP is an official public platform where publicly traded companies disclose their financial reports and news to the public in accordance with the legislation. The Model 1 dataset (financial ratios) and Model 2 dataset (financial ratios and governance/sustainability indicators) belong to a total of 428 companies. Banks and insurance companies were not included in the study due to differences in the presentation of financial statements. The sample structure across these different sectors allows us to test whether the findings are specific to a particular industry. Thus, the proposition that “governance/sustainability indicators add value to ROE forecasting” can be tested not only in a single sector but also in a broader corporate environment. In terms of analytical methods, the sample structure of companies within the same sector also produces natural variance compared to the sample structure across different sectors, particularly in financial ratios and reporting practices. This variance expands the information content that machine learning models can “learn.” Consequently, it may be misleading regarding whether the added governance/sustainability indicators are truly discriminative. Furthermore, the sample structure of companies in the same sector may sometimes be captured as an “easy signal” by machine learning methods when making predictions, due to sector-specific circumstances, and show high success. In a multi-sector data set, the model must learn generalizable patterns because it does not rely on the patterns of a single industry. This subjects model performance to a “more difficult but more reliable” test.

The model variables for Model 1 and Model 2 designed in the study are provided in Table 2 and Table 3.

The selection of the dependent variable (ROE) and independent variable sets (financial ratios and sustainability/governance indicators) in the study is based on the literature stating that financial performance can be explained through both accounting-based profitability drivers and corporate governance/transparency channels, in line with the “predicting profitability in period t using t − 1 information” approach.

The primary reason for selecting ROE (return on equity) as the dependent variable is that ROE is one of the most common accounting-based performance measures reflecting the company’s periodic value creation from the shareholders’ perspective. The continuity of profitability components over time and their predictive power for the future have also been systematically examined in the financial ratio analysis literature (Nissim & Penman, 2001). ROE is considered a fundamental “outcome variable,” particularly in valuation and profitability analyses (based on DuPont logic). Furthermore, ROE is a frequently used performance target in both corporate finance and financial reporting research because capital structure, operating success, and margin dynamics can be read simultaneously through ROE (Selling & Stickney, 1989).

The financial ratios used in Model 1 (INKM, IFMA, IROI, ICRT, ILEV) have economic content that is highlighted in the literature as key determinants of ROE, but do not completely overlap with each other. Net profit margin (INKM) and operating margin (IFMA) represent the “margin/operational efficiency” channel of profitability. The sensitivity of margins to both cross-sector structural differences and the strategic competitive environment has been demonstrated in classical ratio analysis studies (Selling & Stickney, 1989). The ratio analysis and valuation literature emphasizes that margin and operating performance carry critical information for understanding future levels of profitability (Nissim & Penman, 2001). The investment return (IROI) variable also aims to capture the “investment efficiency” channel of ROE by representing the firm’s investment/asset utilization efficiency and the capacity of investments to generate profitability. This variable also plays a central role in DuPont-based studies discussing the continuity and risk of profitability components (Li et al., 2014). The current ratio (ICRT), representing liquidity, and leverage (ILEV), representing leverage and capital structure, are expected to bring the financial policy dimension of ROE into the model. The relationship between liquidity and profitability is addressed in the literature within a “trade-off” framework. While high levels of liquidity provide a safety buffer, excessive liquidity can suppress profitability due to financing costs or idle resources. This relationship has been empirically tested, particularly through liquidity indicators such as the current ratio (Eljelly, 2004). Similarly, the link between working capital management and profitability points to short-term financing/operating mechanisms affecting profitability (Deloof, 2003). The choice of leverage (ILEV) relates the impact of capital structure on profitability and shareholder returns to both theoretical and empirical literature. As is well known, agency theory suggests that the use of external sources of funds may create a disciplinary effect on managers, but that these sources may also generate agency costs (Jensen & Meckling, 1976). Empirically, the relationship between capital structure components and ROE has been directly tested in different samples (Abor, 2005).

In Model 2, sustainability/governance indicators (IKYI, ISUS, IESG, IRSK, IMNG) were added as independent variables. They were used as independent variables in the study based on the assumption that, beyond financial ratios, they could provide additional information on the prediction of ROE regarding “corporate transparency, oversight, and risk management capacity.” This selection has two main theoretical underpinnings. The first is the agency-based approach (Jensen & Meckling, 1976), which argues that corporate governance and stakeholder oversight may be related to performance by disciplining firm behavior. The second pillar is the disclosure literature, which emphasizes that public disclosure and reporting can reduce information asymmetry, thereby generating economic outcomes through more efficient pricing in capital markets, lower uncertainty, and potentially lower capital costs (Healy & Palepu, 2001). Thus, the relationship between corporate governance quality and performance has also found empirical support in studies within the literature that analyze the link between shareholder rights and governance mechanisms and valuation and performance indicators (Gompers et al., 2003; Bhagat & Bolton, 2008).

The presence of the risk management section (IRSK) among these sustainability/governance indicators in Model 2 is consistent with the risk disclosure literature, which indicates that risk reporting is a measurable dimension of annual reports and may be related to corporate characteristics (Linsley & Shrives, 2006; Abraham & Cox, 2007). The sustainability committee (ISUS) was selected because it is consistent with studies discussing that sustainability-focused corporate governance mechanisms may be related to reporting and auditing practices (particularly through mechanisms such as environmental/corporate social responsibility (CSR)/sustainability committees at the board level) (Peters & Romi, 2015). The reporting of ESG indicators (IESG) and the corporate governance compliance report/information form (IKYI, IMNG) connect with the extensive literature showing that corporate social responsibility/ESG disclosures may have economic consequences. The relationship between CSR and ESG and financial performance is mostly reported as non-negative/positive in meta-analytic evidence (Friede et al., 2015; Orlitzky et al., 2003) and that CSR reporting has effects on the cost of capital and the information environment (Dhaliwal et al., 2011). Therefore, the indicators in Model 2 enable a literature-based test of the hypothesis that ROE can be better predicted by “financial and governance/sustainability disclosures” rather than “financial ratios alone”.

The analysis of the study is designed to test whether sustainability/governance indicators provide additional information value in ROE prediction beyond financial ratios. Therefore, two model families are constructed on the same sample and compared using the same validation framework. The analysis is conducted using a four-stage comparison logic: (i) the performance of Model 1 based on the algorithm, (ii) the performance of Model 2 based on the algorithm, (iii) a comparison of the models’ performance, and (iv) an interpretability analysis. The study’s findings are based on predictive power and additional information value rather than on a claim of causality.

In this study, machine learning algorithms were selected as tree-based methods to ensure that the predictive performance of Model 1 and Model 2 does not depend on the structural assumptions of a single algorithm and to test the algorithm-sensitivity of the findings. For both models, Random Forest, XGBoost, LightGBM, and CatBoost algorithms are used as tree-based methods. Tree-based algorithms rely on decision trees that produce a prediction for the target variable (ROE in this study) in each region by sequentially splitting the input (X) space. The fundamental statistical/algorithmic framework of decision trees has been systematised in the Classification and Regression Trees (CART) literature (Breiman et al., 1984). Although a single tree is interpretable, individual trees often exhibit a high tendency for overfitting. Consequently, success in modern applications is largely achieved through ensemble approaches (Hastie et al., 2009). Therefore, all four methods used in this study are approaches that combine decision trees using an “ensemble” logic.

Random Forest (RF) is an ensemble method that trains multiple decision trees on bootstrap samples and combines their results by averaging (Breiman, 2001). Its fundamental basis is to reduce high variance using the “bagging (bootstrap aggregating)” approach (Breiman, 1996). In RF, inter-tree correlation is also reduced by using a random feature subset in each split, thereby strengthening generalisation performance (Breiman, 2001). XGBoost is an optimised application of Friedman’s gradient boosting idea (Friedman, 2001) for high efficiency and scalability. XGBoost builds trees sequentially, with each new tree attempting to reduce the errors of the previous one (Chen & Guestrin, 2016). One of XGBoost’s distinguishing features is its regularised objective function, sparse data-sensitive splitting, and various sampling/learning rate mechanisms that attempt to control overfitting (Chen & Guestrin, 2016); LightGBM is also in the gradient-boosted trees class. It is designed for scalability/computational efficiency (Ke et al., 2017). One of its most notable differences from other tree-based methods is that it favours a leaf-wise (best-first) growth strategy over the level-wise (depth-based) approach used by most gradient-boosted tree algorithms. This strategy can enable faster convergence by growing the leaf that reduces the loss the most (Ke et al., 2017). CatBoost is also a gradient-boosted tree method. Its prominent difference in the literature is its ordered boosting approach, which aims to reduce the “prediction shift” type of bias seen in the boosting process, and its methods developed for categorical variables (Prokhorenkova et al., 2018). Although the sustainability/governance indicators in the Model 2 dataset of the study are binary (0/1), and thus do not directly experience the “categorical variable explosion” problem, CatBoost’s ordered boosting approach has been added to the analysis with the aim of reducing the risk of boosting-induced leakage/bias (Prokhorenkova et al., 2018) and providing methodological diversity. The model performance of the analysis will be evaluated for each algorithm using the R² determination coefficient and the RMSE and MAE error metrics.

The objective of this study is to compare the predictive performance of two model families established on the same sample (Model 1 based solely on financial ratios and Model 2 incorporating sustainability/governance indicators in addition to financial ratios) and to test whether the enriched variable set provides “additional information value” from a generalizability perspective. For this reason, the analysis design requires that performance measurement be conducted with a validation framework that is free from information leakage and reports uncertainty components, beyond algorithm selection. Accordingly, tree-based estimators such as Random Forest, XGBoost, LightGBM, and CatBoost were set up for both models. For all algorithms, hyperparameters were tuned using grid search within the training folds only, embedded in the grouped cross-validation procedure, in order to avoid any information leakage from the test sets. The same tuning protocol was applied consistently across all models to ensure a fair benchmarking framework. The tuned hyperparameters included tree depth, learning rate, number of estimators, and minimum node size (model-specific), with search ranges defined based on standard practice in the literature and preliminary experimentation. These algorithms were selected as state-of-the-art tree-based ensemble methods widely used in tabular financial data, allowing comparison across both bagging-based (Random Forest) and boosting-based learners (XGBoost, LightGBM, and CatBoost). These models were chosen because they represent complementary ensemble paradigms (bagging versus gradient boosting), have demonstrated strong performance in structured financial datasets, and differ in their handling of feature interactions, regularization, and categorical information, thereby providing a diverse yet comparable benchmarking set. However, the inter-model comparison was performed using the Bootstrapped Grouped Cross-Validation Model Comparison (BG-CVMC) framework rather than relying on a single random data split. This framework aims to (i) prevent information leakage through group-protected splitting in data structures containing firm-level dependencies, and (ii) quantify CV-induced randomness using bootstrap sampling to generate confidence intervals for both model performance and performance differences.

The BG-CVMC application consists of three components. Firstly, the data set has been grouped according to company identities, and observations belonging to the same company have been prevented from falling into both the training and test sets within the same fold. This step reduces the risk of “information leakage” that could arise from dependencies generated by the panel/hierarchical structure in the study findings. Thus, observations belonging to the same company are either entirely in the training set or entirely in the test set in any fold, ensuring that the training-test separation remains “clean” at the company level (Roberts et al., 2017). Second, RMSE, MAE, and R² metrics were calculated for each split through grouped CV splits. Since cross-validation outputs contain random components in a single run, reporting only average performance values may be insufficient for model evaluation. Therefore, in BG-CVMC, the error metrics calculated for each split/fold (e.g., RMSE, MAE, R²) are resampled using bootstrap to obtain point estimates and 95% confidence intervals for each metric. The bootstrap approach approximates the distribution of performance estimates through resampling and allows for the quantitative reporting of uncertainty (Efron & Tibshirani, 1994; Davison & Hinkley, 1997). In the third and final step, split-based performance values are resampled using bootstrap, and point estimates with 95% confidence intervals are reported for each model and each metric. To strengthen the statistical basis of the model comparison, within the scope of BG-CVMC, not only the average performances of the two models but also the bootstrap distribution of the model differences calculated over the same splits are obtained. The magnitude predicted by cross-validation estimation and how the quality of the prediction is affected (particularly due to the variance/dependence structure) have been discussed in detail in the literature (Bates et al., 2021). Furthermore, the theoretical foundations and limitations of approaches for producing confidence intervals for CV-based test errors have been developed, taking into account dependencies within CV (Bayle et al., 2020). BG-CVMC combines this theoretical background with group-based partitioning and multiple bootstrap repetitions to establish a more robust basis for comparison in data structures containing dependencies at the firm level.

In this study, to address the uncertainty of CV estimation in a more efficient and computable manner, a framework compatible with the recently proposed accelerated bootstrap-based variance estimation approach has been adopted (Cai et al., 2025). This enables the generation of confidence intervals not only for each model’s performance but also for the performance difference between Model 2 and Model 1. It is possible to assess whether the difference is statistically distinct from zero. The statistical significance of the differences between models was examined using both parametric and non-parametric tests, while maintaining the split-based matched structure. Company codes were defined as group labels, and the data was split according to these labels using a repeated K-fold Cross-Validation (CV) structure. In this context, the paired t-test (Student, 1908) and the Wilcoxon signed-rank test (Wilcoxon, 1945) were used to assess whether the observed performance differences stemmed from random fluctuations or systematic superiority. In the analysis, company codes were defined as group labels, and the data was divided according to these labels using a repeated-

K

-fold Cross-Validation (CV) structure.

The Bootstrapped Grouped Cross-Validation Model Comparison (BG-CVMC) applied in the previous stage serves to compare the generalizable prediction performance of Model 1 and Model 2 (and the tree-based predictors trained on them), taking into account firm-group dependency, and to report the performance difference along with its uncertainty. Accordingly, performance differences between Model 1 and Model 2 are formally evaluated using matched statistical tests alongside bootstrap-based confidence intervals, allowing systematic information gains to be distinguished from random variation. The findings obtained in this stage revealed which model/algorithm combination was more successful in terms of predictive power. Subsequently, in the final step of the study, an interpretability layer was implemented for the final model with high performance. In this context, the Shapley Additive Explanations (SHAP) analysis is not an alternative to BG-CVMC, which serves to “prove” prediction performance, but rather a complementary method that explains the model’s decision logic (Lundberg & Lee, 2017). SHAP is an additive explanation approach that decomposes a machine learning model’s prediction for a specific observation into the contribution (attribution) components of each explanatory variable. SHAP’s theoretical basis is rooted in the Shapley value concept from cooperative game theory. In the analysis, variables are considered as “players” and the model prediction as “payoff,” and the marginal contribution of each variable is systematically allocated (Shapley, 1953; Lundberg & Lee, 2017). In this study, variable contributions were reported using SHAP (TreeSHAP) because it provides a computable and consistent framework for tree-based methods, enabling the interpretation of global patterns by combining local explanations (Lundberg & Lee, 2017). In this study, Shapley-based attribution was interpreted as an intra-model contribution rather than a causal effect (Aas et al., 2021).

The analysis results obtained as a result of these procedures are reported in Section 5.

5. Findings

The study is based on the concept of forecasting ROE (t) for 2024 using 2023 (t − 1) data; thus, the risk of information leakage is minimized by using only publicly disclosed historical financial information and activity report disclosures in the forecast. The sample consists of a total of 428 companies; banks and insurance companies are excluded due to differences in the presentation of financial statements. The study compares two model families on the same sample. These are (i) the basic model based solely on financial ratios (Model 1) and (ii) the enriched model that includes financial ratios and sustainability/governance indicators derived from the companies’ activity reports (Model 2).

When comparing the error metrics for the four tree-based algorithms for both models (lower RMSE and MAE, higher R² indicates better performance), Table 4 shows that Random Forest achieves the best overall performance.

As shown in Table 4, Model 2 exhibits a tendency to increase R² across all algorithms and reduce error metrics in most cases.

The marginal contribution of Model 2 compared to Model 1 is shown in Table 5 on an algorithm-by-algorithm basis.

Upon examining the table, the error reductions in Model 2 are particularly noticeable in boosting-based models (XGBoost, LightGBM). This finding indicates that the governance/sustainability signal can be captured more effectively in interaction with financial ratios in some algorithms. However, rather than relying on a single split/single run for the final decision, an assessment should be made using the BG-CVMC presented below, taking into account the associated uncertainty.

In this study, the Bootstrapped Grouped Cross-Validation Model Comparison (BG-CVMC), which considers firm-level dependency, prevents the mixing of observations from the same firm in training and testing; furthermore, the randomness of performance estimation and CV uncertainty are quantified using bootstrap. This approach is consistent with cross-validation strategies proposed for grouped/dependent structures (e.g., Roberts et al., 2017) and with the literature focusing on bootstrap-based confidence interval production for CV uncertainty (e.g., Bayle et al., 2020; Cai et al., 2025; Efron & Tibshirani, 1994). The models’ BG-CVMC Performance Summary is shown in Table 6.

Table 6 shows the BG-CVMC performance summary (mean and 95% GA), indicating a small but consistent advantage for Model 2.

The BG-CVMC results of the models’ differences are presented in Table 7.

When examining Table 7 where model differences are reported, the positive differences in (M1 − M2) RMSE and MAE and the negative difference in R² indicate that Model 2 performs better in terms of error metrics and has higher explanatory power.

As can be seen in Table 7, the BG-CVMC analysis results indicate statistically significant differences between the performance of the two models. Model 2’s average performance is better than Model 1’s in terms of both RMSE and MAE as well as R² metrics, and the 95% confidence intervals for the model differences are outside zero. The statistical significance of these performance differences is further confirmed by paired t-tests and Wilcoxon signed-rank tests, as reported in Table 8. Split-based paired t-tests and Wilcoxon tests also support this finding; positive mean differences for RMSE and MAE confirm that Model 2 produces lower errors, while the negative difference for R² confirms that Model 2 has higher explanatory power. The p-values being below 0.05 in all tests demonstrate that the observed differences are not random and that Model 2 performs statistically significantly better than Model 1.

Overall, these results support H1, indicating that the model incorporating sustainability and corporate governance indicators consistently outperforms the baseline financial model in out-of-sample ROE prediction.

Consistent with H2, the performance improvements observed are statistically significant but limited in magnitude, as reflected by the small average differences and the associated confidence intervals.

Finally, the algorithm-specific results reported in Table 4 and Table 5 provide support for H3, showing that the incremental contribution of sustainability and governance indicators is more pronounced in boosting-based tree models.

Table 8 reports the results of the paired t-tests and Wilcoxon signed-rank tests, confirming that the performance differences between Model 1 and Model 2 are statistically significant across all evaluation metrics. Overall, the findings of the BG-CVMC analysis indicate that Model 2 provides statistically significant but limited “additional information value” compared to Model 1. This supports that the contribution of sustainability/governance indicators to ROE prediction is marginal; however, it differs from zero even when group dependency and CV uncertainty are taken into account (Bates et al., 2021; Bayle et al., 2020; Cai et al., 2025).

When BG-CVMC and key performance metrics (RMSE, MAE, R²) were evaluated together, the best generalization performance for both Model 1 and Model 2 was achieved using the RandomForest algorithm. Therefore, the final model selected for SHAP analysis, RandomForest, is reported in Table 9 and Figure 1.

The Random Forest SHAP results presented in Table 9 and Figure 1 indicate that the most dominant determinant of ROE prediction is Net Profit Margin (INKM) (Mean|SHAP| = 10.9589; positive effect ratio = 0.611). Return on Investment (ROI) ranks second, and the average contribution direction appears to be predominantly negative (Mean|SHAP = −0.4997; Mean|SHAP| = 2.3100; negative ratio = 0.709). The negative average contribution of ROI can be interpreted in the context of mean reversion and profitability normalization effects, particularly in emerging markets. Firms exhibiting unusually high ROI in period t − 1 may experience subsequent performance moderation due to competitive pressures, capacity constraints, or transitory gains, leading to a negative marginal association with next-period ROE. Similarly, the negative contribution of the Net Margin growth indicator (IMNG) may reflect adjustment dynamics whereby short-term margin expansions—often driven by temporary cost reductions, pricing anomalies, or one-off operational effects—do not persist into future profitability. In addition, rapid margin growth may coincide with increased risk-taking or earnings volatility, which can weaken the stability of subsequent ROE. These findings are consistent with prior evidence suggesting that extreme profitability signals tend to partially reverse over time, implying that exceptionally strong short-run margins or returns may not translate proportionally into sustainable future performance. Among the financial ratios, the Operating Margin (IFMA) (Mean|SHAP| = 0.8627) and Current Ratio (ICRT) (Mean|SHAP| = 0.6267) follow with positive contributions, while Leverage (ILEV) provides a more limited positive contribution (Mean|SHAP| = 0.4656). The mean absolute SHAP values of the sustainability/governance indicators specific to Model 2 (IKYI, ISUS, IESG, IRSK, IMNG) are lower than those of the financial ratios but are not entirely negligible (e.g., IMNG Mean|SHAP| = 0.2290; IESG Mean|SHAP| = 0.1355). This pattern, consistent with the “small but systematic performance improvement of Model 2” observed in the BG-CVMC, indicates that the governance/sustainability signal makes a marginal but measurable contribution to ROE prediction.

6. Discussion

This study examined whether sustainability and corporate governance indicators provide measurable additional information value in return on equity (ROE) prediction compared to models based solely on financial ratios, using advanced machine learning algorithms and validation techniques that prevent information leakage. The findings reveal that ESG and governance signals statistically significantly improve ROE prediction performance, albeit to a limited extent. From an economic perspective, the magnitude of the observed performance improvement is modest. This finding is consistent with the nature of ESG and governance information, which is not expected to replace core financial fundamentals but rather to complement them. Financial ratios remain the dominant drivers of profitability, while sustainability and governance disclosures function as secondary signals that refine predictive accuracy at the margin. In this sense, the results suggest that ESG-related information should be interpreted as an incremental layer of informational content rather than as a primary determinant of firm-level profitability.

When examining algorithm-based results, it is observed that the highest prediction success in both model families was achieved with the Random Forest algorithm. In Model 1, which is based solely on financial ratios, the R² value for Random Forest was calculated as 0.5497, RMSE as 11.50, and MAE as 7.54. In Model 2, which includes sustainability/governance indicators in addition to financial ratios, the R² value for the same algorithm increased to 0.5525, while the RMSE decreased to 11.47 and the MAE decreased to 7.52. These results indicate that, although the absolute magnitude of the performance improvement is limited, its direction is consistently in favour of Model 2.

A similar pattern is observed more distinctly in boosting-based algorithms. For example, for XGBoost, the R² value increased from 0.5142 in Model 1 to 0.5270 in Model 2; RMSE decreased from 11.95 to 11.78, and MAE decreased from 8.08 to 7.97. LightGBM results also support this trend; the R² value increased from 0.4972 to 0.5062, while a decrease was observed in the error metrics. These findings indicate that sustainability and governance indicators can be more effectively reflected in the model, particularly in boosting-based algorithms that are strong at capturing non-linear interactions.

However, beyond one-off algorithm comparisons, the fundamental contribution of this study is that these differences have been evaluated together with their uncertainties using the Bootstrapped Grouped Cross-Validation Model Comparison (BG-CVMC) framework. The BG-CVMC results show that Model 2 has an average RMSE value of 11.4134 (95% GA: [11.1111, 11.7219]), while Model 1 had a value of 11.4336 (95% GA: [11.1312, 11.7458]). Similarly, the average MAE was calculated as 7.6377 in Model 1 and 7.6208 in Model 2; the average R² increased from 0.5497 to 0.5515.

In the BG-CVMC difference analysis, where model differences are directly evaluated, the RMSE difference (Model 1 − Model 2) averages 0.0202, with a 95% confidence interval ranging from [0.0012, 0.0383]. The mean MAE difference was 0.0170 (95% CI: [0.0007, 0.0332]) and the R² difference was −0.0018 (95% CI: [−0.0032, −0.0004]). The fact that the confidence intervals for all three metrics exclude zero indicates that the contribution of sustainability/governance indicators to ROE prediction is statistically significant. The paired t-test and Wilcoxon tests also support this result; p-values below 0.05 for all metrics indicate that the observed differences are not random.

These quantitative findings are consistent with the “weak but positive” impact patterns frequently reported in ESG literature. Sustainability and governance signals are not dominant determinants replacing financial ratios; they function as complementary layers of information added to financial fundamentals in ROE estimation. This suggests that the role of ESG indicators on financial performance should be assessed without exaggeration, but also without being completely disregarded.

The SHAP analysis results also support this interpretation. For the Random Forest model, the highest mean absolute SHAP value belongs to the Net Profit Margin (INKM) variable (Mean|SHAP| = 10.96). This is followed by Return on Investment (ROI) (Mean|SHAP| = 2.31) and Operating Margin (IFMA) (Mean|SHAP| = 0.86). In contrast, the average absolute SHAP values for sustainability and governance indicators are lower (e.g., IMNG = 0.229, IESG = 0.136). However, the fact that these values are significantly different from zero indicates that these variables are not entirely ineffective within the model; on the contrary, they make marginal contributions consistent with the performance increase observed in BG-CVMC.

These findings are consistent with prior studies reporting a weak but statistically significant association between ESG dimensions and firm performance, suggesting that sustainability-related information primarily operates as a refinement mechanism rather than a substitute for financial fundamentals. From a theoretical perspective, governance and sustainability disclosures may reduce information asymmetry, signal managerial quality, and reflect long-term risk management practices, thereby supporting future profitability in an indirect manner. However, their relatively small effect sizes indicate that such signals are absorbed gradually by markets and materialize mainly through interaction with core financial drivers. Accordingly, the results support an incremental-information view of ESG, where governance and sustainability indicators enhance prediction accuracy at the margin while financial ratios remain the primary determinants of ROE.

7. Conclusions

This study examined whether sustainability and corporate governance indicators provide additional and statistically significant information value in ROE forecasting compared to traditional models based on financial ratios. The results obtained reveal that ESG and governance signals enhance prediction performance; however, this increase is limited in magnitude and exhibits a structure dominated by financial fundamentals.

The BG-CVMC results show that Model 2 reduces the RMSE and MAE by approximately 0.2–0.3 per cent compared to Model 1 and increases the R² by approximately 0.18 points. While this improvement does not represent a significant leap in economic terms, it is statistically reliable and exhibits a consistent pattern across different algorithms. Therefore, sustainability and governance disclosures produce small but systematic signals rather than “noise” in ROE estimation. The robustness of these results is supported by the use of grouped cross-validation to prevent information leakage, bootstrap-based confidence intervals, paired statistical tests, and consistent performance patterns observed across multiple machine learning algorithms. Together, these elements indicate that the reported improvements reflect systematic informational gains rather than random variation or model-specific artifacts.

The findings of the study present an important balance point regarding how ESG data should be positioned in financial performance analyses. ESG indicators are not independent performance determinants that replace financial ratios; however, when considered alongside financial fundamentals, they can statistically significantly improve prediction accuracy. This sheds light on the methodological sources of conflicting results in the academic literature and demonstrates that ESG reporting should be evaluated with realistic expectations from the perspective of both investors and regulatory bodies.

For future studies, examining sustainability and governance indicators not only in binary terms (present/absent) but also in terms of intensity, quality and continuity over time, as well as investigating dynamic effects in long-term panel data structures, could deepen the knowledge base in this field. This study contributes to the ESG–financial performance literature with measured, methodologically sound and numerically supported findings.

Overall, the findings of this study suggest that sustainability and corporate governance disclosures should be viewed neither as substitutes for financial fundamentals nor as negligible sources of information. Instead, they function as complementary signals that provide incremental informational content in profitability prediction. From this perspective, the contribution of ESG-related information is modest in magnitude but statistically reliable, highlighting its role as a refinement mechanism rather than a primary driver of firm-level financial performance.

Despite its contributions, this study is subject to certain limitations, which also point to avenues for future research. First, the sustainability and governance indicators employed in the analysis are based on binary disclosures, which capture the presence of reporting practices rather than their depth, quality, or intensity. Future studies could extend this framework by incorporating more granular ESG measures or textual-based indicators that reflect the qualitative aspects of sustainability reporting. Second, the analysis focuses on a single forecasting horizon using lagged information; exploring dynamic or multi-period prediction settings may provide further insights into the long-term informational role of sustainability and governance disclosures.

Author Contributions

Conceptualization, G.F.Ü.U. and Ö.A.; methodology, E.N.G. and M.T.; software, E.N.G.; validation, E.N.G., M.T. and G.G.; formal analysis, E.N.G. and M.T.; investigation, H.T. and B.T.; resources, H.T. and B.T.; data curation, H.T. and B.T.; writing—original draft preparation, G.F.Ü.U., Ö.A. and H.T.; writing—review and editing, G.G., M.T. and B.T.; visualization, E.N.G. and G.G.; supervision, G.F.Ü.U. and Ö.A.; project administration, G.F.Ü.U.; funding acquisition, not applicable. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original data presented in this study are openly available from the Turkish Public Disclosure Platform (KAP) at https://www.kap.org.tr/tr/ (accessed on 15 December 2024). The authors manually collected firm-level financial information and independent audit reports by accessing individual company disclosures on the KAP website. No proprietary data were used.

Conflicts of Interest

The authors declare no conflict of interest.

References

Aas, K., Jullum, M., & Løland, A. (2021). Explaining individual predictions when features are dependent: More accurate approximations to Shapley values. Artificial Intelligence, 298, 103502. [Google Scholar] [CrossRef]
Abor, J. (2005). The effect of capital structure on profitability: An empirical analysis of listed firms in Ghana. Journal of Risk Finance, 6(5), 438–445. [Google Scholar] [CrossRef]
Abraham, S., & Cox, P. (2007). Analysing the determinants of narrative risk information in UK FTSE 100 annual reports. The British Accounting Review, 39(3), 227–248. [Google Scholar] [CrossRef]
Acheampong, A., & Elshandidy, T. (2025). Does sustainability disclosure improve analysts’ forecast accuracy? Evidence from European banks. Financial Innovation, 11, 25. [Google Scholar] [CrossRef]
Basali, M. (2025). Impact of financial performance and corporate governance on ESG disclosure: Evidence from Saudi Arabia. Sustainability, 17(18), 8473. [Google Scholar] [CrossRef]
Bates, S., Hastie, T., & Tibshirani, R. (2021). Cross-validation: What does it estimate and how well does it do it? arXiv, arXiv:2104.00673. [Google Scholar] [CrossRef]
Bayle, P., Bayle, A., Janson, L., & Mackey, L. W. (2020, December 6–12). Cross-validation confidence intervals for test error. 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, BC, Canada. Available online: https://lucasjanson.fas.harvard.edu/papers/Cross_Validation_Confidence_Intervals_For_Test_Error-Bayle_ea-2020.pdf (accessed on 20 December 2025).
Bhagat, S., & Bolton, B. (2008). Corporate governance and firm performance. Journal of Corporate Finance, 14(3), 257–273. [Google Scholar] [CrossRef]
Bose, S. (2020). Evolution of ESG reporting frameworks. In Values at work: Sustainable investing and ESG reporting (pp. 13–33). Springer International Publishing. [Google Scholar] [CrossRef]
Breiman, L. (1996). Bagging predictors. Machine Learning, 24(2), 123–140. Available online: http://link.springer.com/article/10.1007/BF00058655 (accessed on 20 December 2025). [CrossRef]
Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32. [Google Scholar] [CrossRef]
Breiman, L., Friedman, J., Olshen, R. A., & Stone, C. J. (1984). Classification and regression trees (1st ed.). Chapman and Hall/CRC. [Google Scholar] [CrossRef]
Brundtland Report. (1987). Our common future: World commission on environment and development. Available online: https://cdn.iuc.edu.tr/FileHandler2.ashx?f=rapor_638035862645304306.pdf (accessed on 22 December 2025).
Buchetti, B., Arduino, F. R., & Perdichizzi, S. (2025). A literature review on corporate governance and ESG research: Emerging trends and future directions. International Review of Financial Analysis, 97, 103759. [Google Scholar] [CrossRef]
Bushman, R., Chen, Q., Engel, E., & Smith, A. (2004). Financial accounting information, organisational complexity and corporate governance systems. Journal of Accounting and Economics, 37(2), 167–201. [Google Scholar] [CrossRef]
Cai, B., Luo, Y., Guo, X., Pellegrini, F., Pang, M., de Moor, C., Shen, C., Charu, V., & Tian, L. (2025). Bootstrapping the cross-validation estimate. The Annals of Applied Statistics, 19(4), 2981–3002. [Google Scholar] [CrossRef]
Castrillón, M. A. G. (2021). The concept of corporate governance. Revista Científica “Visión de Futuro”, 25(2), 178–194. [Google Scholar]
Chen, T., & Guestrin, C. (2016, August 13–17). XGBoost: A scalable tree boosting system. 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 785–794), San Francisco, CA, USA. [Google Scholar] [CrossRef]
Cox, L. J., & Cusick, J. (2006). What is sustainable development? Available online: https://scholarspace.manoa.hawaii.edu/server/api/core/bitstreams/1bfa27e9-6919-4322-ba7b-eaa8a1417a89/content (accessed on 20 December 2025).
Dağıstanlı, H. A., Özen, F., & Saraçoğlu, İ. (2024). Forecasting sustainability reports with financial performance indicators using random forest for feature selection and gradient boosting for learning. Journal of Defence Sciences, 20(2), 279–302. [Google Scholar] [CrossRef]
D’Amato, V., D’Ecclesia, R., & Levantesi, S. (2021). Fundamental ratios as predictors of ESG scores: A machine learning approach. Decisions in Economics and Finance, 44, 1087–1110. [Google Scholar] [CrossRef]
Danciu, V. (2013). The sustainable company: New challenges and strategies for more sustainability. Theoretical and Applied Economics, 20(9), 7–26. Available online: https://ideas.repec.org/a/agr/journl/vxxy2013i9(586)p7-26.html (accessed on 22 December 2025).
Davidescu, A. A. M., Bîrlan, I., Manta, E. M., & Geambașu, C. M. (2025, March 20–22). Artificial intelligence in ESG and sustainable finance: A bibliometric analysis of research trends. 19th İnternational Conference on Business Excellence (pp. 1506–1517), Bucharest, Romania. Available online: https://sciendo.com/2/v2/download/article/10.2478/picbe-2025-0117.pdf (accessed on 22 December 2025).
Davison, A. C., & Hinkley, D. V. (1997). Bootstrap methods and their application. Cambridge University Press. [Google Scholar] [CrossRef]
Deloof, M. (2003). Does working capital management affect profitability of Belgian firms? Journal of Business Finance & Accounting, 30(3–4), 573–588. [Google Scholar] [CrossRef]
De Lucia, C., Pazienza, P., & Bartlett, M. (2020). Does good ESG lead to better financial performances by firms? Machine learning and logistic regression models of public enterprises in Europe. Sustainability, 12(13), 5317. [Google Scholar] [CrossRef]
Del Vitto, A., Marazzina, D., & Stocco, D. (2023). ESG ratings explainability through machine learning techniques. Annals of Operations Research. [Google Scholar] [CrossRef]
Dhaliwal, D. S., Li, O. Z., Tsang, A., & Yang, Y. G. (2011). Voluntary nonfinancial disclosure and the cost of equity capital: The initiation of corporate social responsibility reporting. The Accounting Review, 86(1), 59–100. [Google Scholar] [CrossRef]
Dincă, G., Ciotlăuși, G., & Akomeah, M. (2025). Estimating the ımpact of ESG on financial forecast predictability using machine learning models. International Journal of Financial Studies, 13(3), 166. [Google Scholar] [CrossRef]
Dossa, J. V., Ukwuoma, C. C., Thomas, D., Dossa, J. M., & Gopang, A. A. (2025). Prediction of nexus among ESG disclosure and firm performance: Applicability, explainability and implications. Innovation and Green Development, 4, 100261. Available online: https://ideas.repec.org/a/eee/ingrde/v4y2025i4s294975312500058x.html (accessed on 15 December 2025). [CrossRef]
Eccles, R. G., Ioannou, I., & Serafeim, G. (2014). The impact of corporate sustainability on organisational processes and performance. Management Science, 60(11), 2835–2857. [Google Scholar] [CrossRef]
Efron, B., & Tibshirani, R. J. (1994). An introduction to the bootstrap. Chapman & Hall/CRC. [Google Scholar]
Eljelly, A. M. A. (2004). Liquidity–profitability tradeoff: An empirical investigation in an emerging market. International Journal of Commerce and Management, 14(2), 48–61. [Google Scholar] [CrossRef]
Ferrari, A., Cini, F., & Castellano, R. (2025). Beyond traditional metrics: ESG and financial performance in innovation-driven sectors. Journal of Financial Management, Markets and Institutions, 13(1), 2550001. [Google Scholar] [CrossRef]
Friede, G., Busch, T., & Bassen, A. (2015). ESG and financial performance: Aggregated evidence from more than 2000 empirical studies. Journal of Sustainable Finance & Investment, 5(4), 210–233. [Google Scholar] [CrossRef]
Friedman, J. H. (2001). Greedy function approximation: A gradient boosting machine. The Annals of Statistics, 29(5), 1189–1232. Available online: https://www.jstor.org/stable/2699986?seq=1 (accessed on 20 December 2025). [CrossRef]
Giudici, P., & Wu, L. (2025). Sustainable artificial intelligence in finance: Impact of ESG factors. Frontiers in Artificial Intelligence, 8, 1566197. [Google Scholar] [CrossRef]
Gompers, P. A., Ishii, J. L., & Metrick, A. (2003). Corporate governance and equity prices. The Quarterly Journal of Economics, 118(1), 107–155. [Google Scholar] [CrossRef]
Goswami, K., Islam, M. K. S., & Evers, W. (2023). A case study on the blended reporting phenomenon: A comparative analysis of voluntary reporting frameworks and standards—GRI, IR, SASB, and CDP. The International Journal of Sustainability Policy and Practice, 19(2), 35. Available online: https://acsdri.com/wp-content/uploads/2023/09/2023-A-case-study-on-the-blended-reporting-phenomenon-GRI-IR-SASB-CDP-Goswami-Islam-Evers_compressed.pdf (accessed on 20 December 2025). [CrossRef]
GRI 101. (2016). Foundation 101—Reporting principles. Available online: www.globalreporting.org/standards/gri-standards-download-center/ (accessed on 20 December 2025).
Hahn, R., & Kühnen, M. (2013). Determinants of sustainability reporting: A review of results, trends, theory, and opportunities in an expanding field of research. Journal of Cleaner Production, 59, 5–21. [Google Scholar] [CrossRef]
Hastie, T., Tibshirani, R., & Friedman, J. (2009). The elements of statistical learning: Data mining, inference, and prediction (Latest edition). Springer. [Google Scholar]
Healy, P. M., & Palepu, K. G. (2001). Information asymmetry, corporate disclosure, and the capital markets: A review of the empirical disclosure literature. Journal of Accounting and Economics, 31(1–3), 405–440. [Google Scholar] [CrossRef]
Herman, A., Oplotnik, Ž. J., & Jagrič, T. (2025). The impact of ESG on business performance: An empirical analysis of NASDAQ–NYSE-listed companies. Sustainability, 17, 9683. [Google Scholar] [CrossRef]
International Finance Corporation. (2010). IFC annual report 2010. World Bank Group. [Google Scholar]
Isaksson, R. (2019). A proposed preliminary maturity grid for assessing sustainability reporting based on quality management principles. The TQM Journal, 31(3), 451–466. [Google Scholar] [CrossRef]
Jensen, M. C., & Meckling, W. H. (1976). Theory of the firm: Managerial behaviour, agency costs and ownership structure. Journal of Financial Economics, 3(4), 305–360. [Google Scholar] [CrossRef]
Ke, G., Meng, Q., Finley, T., Wang, T., Chen, W., Ma, W., Ye, Q., & Liu, T. Y. (2017). LightGBM: A highly efficient gradient boosting decision tree. Advances in Neural Information Processing Systems, 30, 1–9. Available online: https://proceedings.neurips.cc/paper_files/paper/2017/file/6449f44a102fde848669bdd9eb6b76fa-Paper.pdf (accessed on 20 December 2025).
Khan, H. (2011, December). A literature review of corporate governance. In International conference on E-business, management and economics (Vol. 25, No. 1, pp. 1–5). IACSIT Press. [Google Scholar]
Kim, H., & Lee, M. (2025). Unravelling the drivers of ESG performance in Chinese firms: An explainable machine-learning approach. Systems, 13, 578. [Google Scholar] [CrossRef]
Lanza, A. A. G., Bernardini, E., & Faiella, I. (2023). Machine learning, ESG indicators, and sustainable investment. In A. Scalia (Ed.), Financial risk management and climate change risk (pp. 223–248). Springer Nature Switzerland. [Google Scholar] [CrossRef]
Lee, O., Joo, H., Choi, H., & Cheon, M. (2022). Proposing an integrated approach to analysing ESG data via machine learning and deep learning algorithms. Sustainability, 14(14), 8745. [Google Scholar] [CrossRef]
Li, M., Nissim, D., & Penman, S. H. (2014). Profitability decomposition and operating risk. Review of Accounting Studies. [Google Scholar] [CrossRef]
Linsley, P. M., & Shrives, P. J. (2006). Risk reporting: A study of risk disclosures in the annual reports of UK companies. The British Accounting Review, 38(4), 387–404. Available online: https://ideas.repec.org/a/eee/bracre/v38y2006i4p387-404.html (accessed on 22 December 2025). [CrossRef]
Lu, C. S., Lai, P. L., & Chiang, Y. P. (2016). Container terminal employees’ perceptions of the effects of sustainable supply chain management on sustainability performance. Maritime Policy & Management, 43(5), 597–613. [Google Scholar] [CrossRef]
Lundberg, S. M., & Lee, S. I. (2017). A unified approach to interpreting model predictions. Advances in Neural İnformation Processing Systems, 30, 4765–4774. [Google Scholar]
Michelon, G., & Parbonetti, A. (2012). The effect of corporate governance on sustainability disclosure. Journal of Management & Governance, 16(3), 477–509. [Google Scholar] [CrossRef]
Mohsin, M. T., & Nasim, N. B. (2025). Explaining the unexplainable: A systematic review of explainable AI in finance. International Journal of Science and Research Archive, 16(3), 476–497. [Google Scholar] [CrossRef]
Naciti, V., Cesaroni, F., & Pulejo, L. (2022). Corporate governance and sustainability: A review of the existing literature. Journal of Management and Governance, 26(1), 55–74. [Google Scholar] [CrossRef]
Nissim, D., & Penman, S. H. (2001). Ratio analysis and equity valuation: From research to practice. Review of Accounting Studies, 6, 109–154. [Google Scholar] [CrossRef]
OECD. (2004). OECD principles of corporate governance. OECD Publishing. [Google Scholar]
Orlitzky, M., Schmidt, F. L., & Rynes, S. L. (2003). Corporate social and financial performance: A meta-analysis. Organisation Studies, 24(3), 403–441. [Google Scholar] [CrossRef]
Pei, P. (2025). Artificial intelligence in ESG investing: A scoring model for accuracy and accountability. SHS Web of Conferences, 225, 03017. [Google Scholar] [CrossRef]
Peters, G. F., & Romi, A. M. (2015). The association between sustainability governance characteristics and the assurance of corporate sustainability reports. Auditing: A Journal of Practice & Theory, 34(1), 163–198. [Google Scholar] [CrossRef]
Prokhorenkova, L., Gusev, G., Vorobev, A., Dorogush, A. V., & Gulin, A. (2018). CatBoost: Unbiased boosting with categorical features. Advances in Neural İnformation Processing Systems, 31, 5898–5908. [Google Scholar]
Roberts, D. R., Bahn, V., Ciuti, S., Boyce, M. S., Elith, J., Guillera-Arroita, G., Hauenstein, S., Lahoz-Monfort, J. J., Schröder, B., Thuiller, W., Warton, D. I., Wintle, B. A., Hartig, F., & Dormann, C. F. (2017). Cross-validation strategies for data with temporal, spatial, hierarchical, or phylogenetic structure. Ecography, 40(8), 913–929. [Google Scholar] [CrossRef]
Selling, T. I., & Stickney, C. P. (1989). The effects of business environment and strategy on a firm’s rate of return on assets. Financial Analysts Journal, 45(1), 43–52. [Google Scholar] [CrossRef]
Shapley, L. S. (1953). A value for n-person games. In H. W. Kuhn, & A. W. Tucker (Eds.), Contributions to the theory of games, Volume II (pp. 307–317). Princeton University Press. [Google Scholar] [CrossRef]
Shleifer, A., & Vishny, R. W. (1997). A survey of corporate governance. The Journal of Finance, 52(2), 737–783. [Google Scholar] [CrossRef]
Student. (1908). The probable error of a mean. Biometrika, 6(1), 1–25. [Google Scholar] [CrossRef]
Van der Heever, W., Satapathy, R., Park, J. M., & Cambria, E. (2024). Understanding public opinion towards ESG and green finance with the use of explainable artificial intelligence. Mathematics, 12, 3119. [Google Scholar] [CrossRef]
Wilcoxon, F. (1945). Individual comparisons by ranking methods. Biometrics Bulletin, 1(6), 80–83. [Google Scholar] [CrossRef]
Wilson, M. (2003). Corporate sustainability: What is it and where does it come from. Ivey Business Journal, 67(6), 1–5. [Google Scholar]

Figure 1. Random Forest Mean SHAP Values.

Table 1. Literature Study.

Author/Year	Objective	Method	Findings/Results
De Lucia et al. (2020)	To examine the impact of ESG practices on financial performance (ROA, ROE) in European public companies.	Machine learning models with ordered logistic regression; 1038 European public companies in the 2018–2019 period, Thomson Reuters ESG data.	ESG indicators can significantly predict ROA and ROE. Environmental innovation, employment efficiency, and diversity/equal opportunity policies are particularly positively correlated with financial performance.
D’Amato et al. (2021)	Analysing whether companies can explain their ESG scores through their financial statement ratios and examining the relationship between ESG ratings and key financial indicators.	Random Forest and GLM for comparison purposes; 109 companies listed on the STOXX Europe 600 index between 2014 and 2018; Bloomberg ESG scores and financial ratios.	Financial ratios can explain ESG scores in a meaningful and non-linear way. In particular, Net Income/Sales, Sales/Assets and debt ratios are the strongest predictors of ESG scores. Random Forest shows higher prediction success compared to classical regression.
Lee et al. (2022)	To enable the integrated analysis of ESG data using machine learning and deep-learning algorithms; to propose a practical approach for predicting companies’ ESG rankings.	A five-stage experimental design encompassing investment classification, adversarial attack analysis, financial return prediction, ESG news classification, and ESG score prediction.	Machine learning and deep learning models can analyse ESG data with high accuracy and predict financial performance with low error rates. NLP-based models classify ESG news with approximately 90% accuracy. Furthermore, ESG datasets are susceptible to noise and adversarial attacks.
Del Vitto et al. (2023)	To increase the transparency of ESG rating processes, reconstruct and explain how ESG scores are produced using machine learning.	White-box and black-box machine learning models using Refinitiv ESG data; model interpretability with SHAP.	ESG rating algorithms can be re-estimated with high accuracy. However, structural noise remains that cannot be completely eliminated. ESG scores rely heavily on a limited number of critical indicators.
Lanza et al. (2023)	Predicting performance using raw ESG indicators with ML.	252 companies; 220+ ESG indicators; DT-ML; FF & BIRR comparisons.	ML models create portfolios that generate positive alpha using raw ESG indicators. Environmental indicators provide the strongest contribution. ML offers higher explanatory power than traditional models.
Dağıstanlı et al. (2024)	Predicting SR publication using FP indicators; measuring classification success.	XUSRD 44 firms; Profit, ROA, ROE, ROS, Size, Leverage; GBC is the best model.	ML models predict SR issuance with high accuracy. GBC shows the best performance. The insignificance of the Profit variable indicates that SR behaviour is related to factors other than profit.
D’Amato et al. (2021)	Investigating the effect of ESG on EBIT using ML.	EuroStoxx600; GLM, DT, Bagging, RF; RF (R² = 88.39%).	The effect of ESG on EBIT exhibits a non-linear structure, with the positive effect strengthening at high ESG scores. The RF model performs best and creates significant value for ESG profitability.
Van der Heever et al. (2024)	Analysing public sentiment towards ESG/green finance using XAI.	20,961 tweets; ABSA, BERT; LIME, SHAP.	Social media sentiment towards ESG is generally positive; the environmental dimension is strong, while governance appears weak. BERT and ABSA provide successful classification. XAI methods increase model transparency.
Acheampong and Elshandidy (2025)	Examining the impact of SD on analyst prediction accuracy.	145 banks; 2005–2017; ML text mining; RMMA.	Sustainability disclosures reduce analyst forecast errors. The effect is strong for 0–2 years and increases after 2014/95/EU. There are effects that reduce information asymmetry.
Davidescu et al. (2025)	Analysing the AI–ESG literature (2004–2025).	898 WoS articles; Bibliometrix; co-citation, thematic mapping.	The AI–ESG literature has grown rapidly since 2015. ESG disclosures, risk management, and XAI themes are coming to the fore. The most fundamental problem is the lack of data and methodological standards.
Dincă et al. (2025)	To measure the contribution of ESG to the accuracy of financial forecasts.	2548 companies; ARIMA, RF, XGBoost; t-test.	ESG scores do not increase financial prediction accuracy, with the best performance achieved by the ARIMA model. Adding ESG to ML models does not improve accuracy.
Dossa et al. (2025)	Measuring the impact of ESG disclosures on company performance using ML; evaluating E–S–G components separately.	China A-Shares; 15 ML models; Extra Trees (R² ≈ 0.97); SHAP, PDP.	ESG disclosures have a weak but positive effect on company performance. SHAP analyses show that the environmental component is the strongest explanatory factor, while the S and G dimensions make limited contributions. The information value of ESG varies depending on company characteristics.
Ferrari et al. (2025)	Systematically review the ML–ESG literature.	2010–2023 SLR; RF, XGBoost, LSTM; SHAP/LIME.	ML models perform particularly well on the environmental dimension. The social and governance dimensions perform poorly due to data deficiencies, and the limitations of the literature are ESG data standardization.
Giudici and Wu (2025)	Investigate the impact of ESG on credit ratings using explainable ML.	Ensemble models, gradient boosting, SHAP.	ML models are better at capturing the relationship between ESG and credit ratings. SHAP analyses particularly highlight the importance of E and G indicators. ESG acts as a complementary signal in credit risk.
Herman et al. (2025)	Examining the impact of ESG on ROA–ROE; testing linear & non-linear relationships.	NASDAQ/NYSE; 6681 observations; stepwise regression; FFNN.	ESG scores have a positive effect on both ROA and ROE. A non-linear threshold effect is observed in ROA, while a linear relationship is seen in ROE. ESG affects profitability through different mechanisms.
Kim and Lee (2025)	Examining ESG determinants using explainable ML; testing sectoral differences.	1608 companies; 2009–2021; RF, OLS, SHAP, PFI.	The RF model is the best method for explaining ESG performance. Patents, emissions, and CSR training are the most critical determinants. E and G factors stand out in sectoral differences.
Mohsin and Nasim (2025)	Systematically review the financial XAI literature.	Scopus; 6086 → 323 → 30 articles; 7 categories.	XAI techniques are rapidly spreading in the financial sector; SHAP and attention are the most frequently used methods. However, the lack of standards and ethical issues persist.
Pei (2025)	Reducing ESG scoring inconsistency; developing an XAI-based ESG model.	FinBERT, RF, XGBoost, SHAP, LIME.	NLP–ML integration strengthens the ESG signal. SHAP/LIME increases explainability and reduces score inconsistency. The XAI-based approach appears more reliable than traditional scoring methods.

Table 2. Model 1 Baseline Financial Model (BFM).

Dependent Variable	Code
ROE (Return on Equity)	DROE
Independent Variables
Net Profit Margin	INKM
Operating Margin	IFMA
Return on Investment	IROI
Current Ratio	ICRT
Leverage Ratio	ILEV

Table 3. Model 2 Extended ESG-Governance Model (GEM).

Dependent Variable	Code
ROE (Return on Equity)	DROE
Independent Variables
Net Profit Margin	INKM
Operating Margin	IFMA
Return on Investment	IROI
Current Ratio	ICRT
Leverage Ratio	ILEV
Is there a Corporate Governance Compliance Report in the Activity Report? (Y/N)	IKYI
Is there a Sustainability Committee operating within the Board of Directors? (Y/N)	ISUS
Are ESG indicators provided in the Activity Report? (Y/N)	IESG
Is there a section titled ‘Risk’ or ‘Risk Management’ in the Activity Report? (Y/N)	IRSK
Is there a ‘Corporate Governance Information Form’ in the Activity Report? (Y/N)	IMNG

Table 4. Algorithm-Based Performance Analysis for Model 1 and Model 2.

Model 1 Baseline Financial Model (BFM)	RMSE	MAE	R²
RandomForest	11.50449	7.544685	0.549698
CatBoost	11.66561	7.710918	0.539142
XGBoost	11.94938	8.080732	0.514166
LightGBM	12.18481	8.434944	0.497158
Model 2 Extended ESG-Governance Model (GEM)	RMSE	MAE	R²
Random Forest	11.4742	7.516016	0.552463
CatBoost	11.62065	7.714267	0.541885
XGBoost	11.77804	7.965747	0.527005
LightGBM	12.07769	8.392546	0.506239

Table 5. Marginal Contribution of Independent Variables.

Algorithm	Metric	Model 1	Model 2	Change_(M2-M1)
Random Forest	RMSE	11.50449	11.4742	−0.030291575
	MAE	7.544685	7.516016	−0.028669014
	R2	0.549698	0.552463	0.00276513
CatBoost	RMSE	11.66561	11.62065	−0.044960242
	MAE	7.710918	7.714267	0.003348352
	R2	0.539142	0.541885	0.002743246
XGBoost	RMSE	11.94938	11.77804	−0.171336121
	MAE	8.080732	7.965747	−0.114985472
	R2	0.514166	0.527005	0.012839374
LightGBM	RMSE	12.18481	12.07769	−0.107122565
	MAE	8.434944	8.392546	−0.042398112
	R2	0.497158	0.506239	0.00908092

Table 6. BG-CVMC Performance Summary of Models.

Metric	Model 1 Average	Model 1 95% GA	Model 2 Average	Model 2 95% GA
RMSE	11.4336	[11.1312, 11.7458]	11.4134	[11.1111, 11.7219]
MAE	7.6377	[7.4621, 7.8080]	7.6208	[7.4460, 7.7902]
R²	0.5497	[0.5282, 0.5702]	0.5515	[0.5306, 0.5720]

Table 7. BG-CVMC Differences of Models BG-CVMC Results.

Metric	Average Difference (M1 − M2)	95% GA Lower Bound	95% GA Upper
RMSE	0.0202	0.0012	0.0383
MAE	0.0170	0.0007	0.0332
R²	−0.0018	−0.0032	−0.0004

Table 8. Paired Tests (t-test and Wilcoxon).

Metric	t-Statistic	p (t-Test)	Wilcoxon W	p (Wilcoxon)
RMSE	2.1155	0.0369	1654.0000	0.0027
MAE	2.0448	0.0435	1840.0000	0.0185
R²	−2.4622	0.0155	1635.0000	0.0022

Table 9. RandomForest SHAP Analysis.

Variable	Mean_SHAP	Mean_Abs_SHAP	Positive Rate	Negative Ratio	Effect Direction
INKM	0.302	10.9589	0.611	0.389	positive
IFMA	0.1007	0.8627	0.494	0.506	positive
IROI	−0.4997	2.3100	0.291	0.709	negative
ICRT	0.0760	0.6267	0.615	0.385	positive
ILEV	0.0447	0.4656	0.543	0.457	positive
IKYI	0.0039	0.0754	0.424	0.576	positive
ISUS	0.0011	0.0621	0.263	0.737	positive
IESG	0.0315	0.1355	0.524	0.476	positive
IRSK	0.0026	0.0515	0.643	0.357	positive
IMNG	−0.0164	0.2290	0.392	0.608	negative

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Talaş, H.; Gök, E.N.; Akçakanat, Ö.; Gültekin, G.; Terzioğlu, M.; Tutcu, B.; Ünal Uyar, G.F. The Contribution of Sustainability and Governance Signals to Return on Equity Prediction: Evidence from Tree-Based Machine Learning, Bootstrapped Grouped CV and SHAP. J. Risk Financial Manag. 2026, 19, 106. https://doi.org/10.3390/jrfm19020106

AMA Style

Talaş H, Gök EN, Akçakanat Ö, Gültekin G, Terzioğlu M, Tutcu B, Ünal Uyar GF. The Contribution of Sustainability and Governance Signals to Return on Equity Prediction: Evidence from Tree-Based Machine Learning, Bootstrapped Grouped CV and SHAP. Journal of Risk and Financial Management. 2026; 19(2):106. https://doi.org/10.3390/jrfm19020106

Chicago/Turabian Style

Talaş, Hasan, Ela Naz Gök, Özen Akçakanat, Gürkan Gültekin, Mustafa Terzioğlu, Burçin Tutcu, and Güler Ferhan Ünal Uyar. 2026. "The Contribution of Sustainability and Governance Signals to Return on Equity Prediction: Evidence from Tree-Based Machine Learning, Bootstrapped Grouped CV and SHAP" Journal of Risk and Financial Management 19, no. 2: 106. https://doi.org/10.3390/jrfm19020106

APA Style

Talaş, H., Gök, E. N., Akçakanat, Ö., Gültekin, G., Terzioğlu, M., Tutcu, B., & Ünal Uyar, G. F. (2026). The Contribution of Sustainability and Governance Signals to Return on Equity Prediction: Evidence from Tree-Based Machine Learning, Bootstrapped Grouped CV and SHAP. Journal of Risk and Financial Management, 19(2), 106. https://doi.org/10.3390/jrfm19020106

Article Menu

The Contribution of Sustainability and Governance Signals to Return on Equity Prediction: Evidence from Tree-Based Machine Learning, Bootstrapped Grouped CV and SHAP

Abstract

1. Introduction

2. Conceptual Framework

2.1. Corporate Sustainability

2.2. Corporate Governance and Sustainability—Governance Interaction

3. Literature Review

4. Methodology and Approach

5. Findings

6. Discussion

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI