Climate Risk, CEO Risk Preference, and Corporate Greenwashing in High-Emission Industry: A Debiased Machine Learning Approach

Ma, Shijie; Hou, Jingzhi; Niu, Haoran; Chen, Hsing Hung

doi:10.3390/su18105174

Open AccessArticle

Climate Risk, CEO Risk Preference, and Corporate Greenwashing in High-Emission Industry: A Debiased Machine Learning Approach

¹

School of Business, Macao University of Science and Technology, Macao 999078, China

²

The Institute for Sustainable Development, Macao University of Science and Technology, Macao 999078, China

^*

Author to whom correspondence should be addressed.

Sustainability 2026, 18(10), 5174; https://doi.org/10.3390/su18105174

Submission received: 27 April 2026 / Revised: 16 May 2026 / Accepted: 18 May 2026 / Published: 20 May 2026

Download

Browse Figures

Versions Notes

Abstract

The transition to a low-carbon economy is the cornerstone of global sustainability, requiring high-emission enterprises to shift from carbon-intensive production to genuine green innovation. However, this study uncovers a significant structural impediment to this transition: the “defensive greenwashing” response to climate stress. Focusing on listed companies in China’s high-emission industries (2009–2024), we employ a Debiased Machine Learning (DML) framework and Causal Forest analysis to capture the non-linear impacts of multi-dimensional climate risks. Our findings reveal a robust “threshold-trigger” mechanism: once climate pressures—whether physical shocks or policy-induced transition risks—exceed corporate endurance levels, firms aggressively pivot toward strategic “information arbitrage” rather than substantive decarbonization. We identify a profound “capability paradox” in sustainability governance, where firms with higher digital maturity and resource slack leverage their technical prowess to “calibrate” sophisticated narratives, thereby widening the monitoring gap and distorting green asset pricing. Furthermore, CEO risk preference acts as a psychological accelerator, amplifying strategic decoupling, particularly under transition-risk-induced uncertainty. By demonstrating how climate stress inadvertently incentivizes symbolic compliance over sustainable transformation, this research offers critical micro-level insights for policymakers. These findings are vital for refining sustainability oversight and ensuring that capital allocation fosters a resilient, equitable transition toward true ecological and economic decoupling.

Keywords:

climate risk; greenwashing; CEO risk preference; debiased machine learning; sustainable development; climate change

1. Introduction

With the intensification of global climate change, climate risk has evolved from a macro environmental issue to a core variable that cannot be ignored in the micro corporate finance field, posing a fundamental challenge to global sustainability. The existing literature generally summarizes such shocks as Physical Risk and Transition Risk [1]. Physical risks cause direct damage to corporate assets and supply chains through extreme weather, significantly increasing default probability [2,3], while transition risk arises from the tightening of low-carbon policies, leading to a significant increase in the financing constraints of enterprises with high emissions [4]. Under the background of China’s “dual carbon” target, listed companies in high-emission industries—the primary battlefield for environmental sustainability—are at the center of the dual pressure of institutional regulation and market expectation. However, our empirical evidence suggests that in the face of escalating climate pressure, enterprises do not always choose substantive green transformation. Instead, many pivot toward strategic measures to protect their valuation, making the relationship between multi-dimensional climate risk and corporate greenwashing a critical impediment to achieving long-term sustainable development.

The research in this paper is rooted in Legitimacy Theory and Upper Echelons Theory [5,6]. According to the legitimacy theory, when facing negative environmental shocks, enterprises tend to modify their image by increasing the complexity and frequency of environmental information disclosure to maintain their “social contract” [7,8]. This study extends this logic by identifying a “defensive greenwashing” mechanism: as climate risks exceed a firm’s endurance threshold, the marginal cost of symbolic signaling becomes substantially lower than the marginal cost of radical industrial innovation [9]. Furthermore, the Upper Echelons Theory suggests that such strategic responses are filtered through executive psychology [10]. We specifically examine how CEO risk preference acts as a catalyst in this process, potentially amplifying regulatory arbitrage behaviors in high-emission sectors where the stakes of a sustainability transition are highest [11].

Despite these theoretical foundations, several critical research gaps remain in the existing literature. First, while the impact of environmental regulation is documented, most studies assume a linear response, failing to account for the “threshold effects” where extreme climate shocks trigger aggressive defensive shifts in high-emission industries—a domain where traditional econometrics often struggle with high-dimensional confounding [12]. Second, there is a lack of evidence regarding the “Capability Paradox” in sustainability governance, specifically how firms with significant resource slack and digital maturity might use these advantages to widen the monitoring gap rather than closing it [13,14]. Finally, the micro-level interplay between executive psychology and climate-driven strategic decoupling remains under-explored, particularly regarding how policy-induced transition risks influence “information arbitrage” differently than physical disasters.

Methodologically, this paper addresses these gaps by employing a Debiased Machine Learning (DML) framework [15]. By utilizing DML and Causal Forest analysis [16], we are able to capture the “threshold-trigger” effect of climate shocks and provide a more robust estimation of heterogeneous treatment effects. This approach allows us to uncover the aforementioned paradox: firms with higher digital maturity and resource slack may instead leverage their technical prowess to “calibrate” environmental narratives more sophisticatedly, thereby obstructing true sustainability transition.

The contributions of this study are three-fold. First, we enrich the literature on the economic consequences of climate risk by providing empirical evidence of its non-linear impact on greenwashing within China’s high-emission industries, highlighting the systemic risks this poses to green asset pricing. Second, by integrating CEO risk preference into the DML framework, we offer a micro-behavioral explanation for the failure of climate policies in certain organizational contexts [17]. Finally, we contribute to the sustainability discourse by revealing how digitalization and resource slack can paradoxically facilitate strategic deception, providing critical insights for policymakers to transition from “disclosure-based” to “performance-linked” sustainability oversight [18].

The remainder of this paper is organized as follows: Section 2 reviews the relevant literature; Section 3 describes the data, variables, and the Debiased Machine Learning methodology; Section 4 presents the empirical results and mechanism analysis; Section 5 provides the discussion; and Section 6 concludes with policy implications and limitations. Figure 1 presents the empirical framework.

2. Literature Review

2.1. The Microeconomic Consequences of Climate Risks

Climate risks shift from macro climate issues to the core considerations in micro corporate finance. Existing literature generally classifies climate shocks as physical risks and transition risk [1]. In the physical dimension, extreme weather directly undermines a company’s cash flow by damaging fixed assets, interrupting supply chains, and increasing insurance costs [19], and leads to significant equity risk premiums [20] and an increase in default probability [2,3]. In the transition dimension, the tightening of low-carbon policies and the implementation of carbon tariffs trigger the stranded asset risk, significantly increasing the financing constraints of high-emission enterprises [4,21].

Regarding how enterprises respond to these risks, there are obvious logical differences in the academic community. The classic “Porter Hypothesis” holds that environmental regulations can produce an innovation compensation effect [22], driving enterprises to increase green R&D [23,24]. However, recent studies targeting high-emission industries point out that when enterprises face severe financial difficulties or technological path dependence, climate pressure generates a resource extrusion effect [25,26]. Under such pressure, enterprises tend to adopt strategic defense, manipulating environmental information disclosure to maintain valuation [17,27,28,29]. The latest evidence indicates that this defensive motivation directly induces fraudulent narratives in the MD&A text by management [30,31,32]. Existing research mostly explores the impact of climate risks on the objective performance of enterprises, but rarely deeply examines how enterprises, when the “substantial transformation” path is blocked, adopt symbolic disclosure strategies to hedge risks, especially lacking systematic analysis of how the dual risks of physical and transition risks synergistically induce defensive greenwashing behavior.

2.2. The Driving Mechanism of Enterprises’ Greenwashing Behavior

Greenwashing is defined as the systematic deviation between corporate environmental rhetoric (Symbolic) and substantive performance [14,33]. From the perspective of Legitimacy Theory, when facing negative environmental impacts, enterprises tend to modify their image by increasing the complexity and frequency of environmental rhetoric to avoid “organizational stigma” [5,8,34]. Based on Agency Theory, in order to pursue short-term professional reputation or ESG-linked compensation, managers use information asymmetry to implement strategic disclosure [35,36].

Under the framework of the signaling game, greenwashing is regarded as a low-cost signal transmission tool to capture the green premium in the capital market [37]. Although existing studies identify that external monitoring mechanisms, such as media attention [9], audit quality [38], analyst coverage [39], and institutional investors [40], inhibit greenwashing, the effectiveness of such monitoring faces significant challenges amidst the uncertainty of global climate governance [41]. In addition, financial pressure, government regulation, and industry competition structures are also proven to be the key micro-factors driving greenwashing [18,42,43]. The current empirical research on the driving factors of greenwashing is mostly based on traditional linear regression, which finds it difficult to describe the complex non-linear impact of high-dimensional and non-stationary external shocks such as climate risks on greenwashing behavior, and often lacks the robust causal identification offered by Debiased Machine Learning when dealing with high-dimensional nuisance variables.

2.3. Executive Traits and the Technological Paradox

Upper Echelons Theory posits that the strategic choices of enterprises are essentially the projections of executives’ cognitive traits and psychological tendencies [6,10]. In response to climate uncertainty, CEO risk preference plays a decisive moderating role [44]. Managers with risk-seeking characteristics tend to exhibit stronger overconfidence and myopia, regarding climate regulation as an opportunity for “risk arbitrage” rather than a catalyst for transformation [45,46].

At the same time, the role of digitalization in climate governance triggers a wide discussion on the “Digital Green Paradox” [47]. While the traditional view suggests that digital technology can improve the transparency of environmental monitoring and curb rent-seeking [48,49], recent research proposes the “enabling camouflage” hypothesis. Under this hypothesis, high-ability management may use artificial intelligence and big data tools to precisely optimize environmental narratives, thereby implementing more subtle strategic decoupling [50,51]. This coupling of technical tools and managers’ speculative psychology makes high-emission enterprises—especially those facing severe financing constraints and lacking technological accumulation—more likely to collapse in front of the climate defense and turn to higher-order greenwashing fraud [52,53,54,55]. However, there is little literature that combines managers’ psychological characteristics with organizational technical characteristics to explore their synergistic effect on climate risk response, and there is even less use of cutting-edge methods such as Debiased Machine Learning to capture the dynamic threshold effects of these micro-characteristics in greenwashing decisions.

3. Materials and Methods

A Debiased Machine Learning (or Double Machine Learning, DML) framework [15] is suggested in this section. The working flow of Debiased Machine Learning is depicted in Figure 2. Data collection, data preprocessing, learner selection, residual regression and model training, cross validation, significance test, and CAME estimations are the seven processes in the procedure.

3.1. Data Collecting

Our initial sample spans Chinese A-share listed enterprises operating in high-emission industries from 2009 to 2024. The target industries are designated based on the Key Energy-Consuming Industries Directory promulgated by the Ministry of Ecology and Environment of China, capturing eight carbon-intensive sectors: power, steel, building materials, nonferrous metals, chemicals, petrochemicals, papermaking, and aviation.

Firm-level financial indicators, internal governance structures, and executive socio-demographic profiles are obtained from the China Stock Market Accounting Research (CSMAR) database. To robustly capture the tension between firms’ ecological commitments and operational realities, we construct multi-dimensional metrics for environmental performance: symbolic corporate greenwashing (GW_score) and substantive eco-efficiency (Real_Eff). Following standard data-cleaning conventions, we exclude: (i) Special Treatment (ST and *ST) firms due to imminent financial distress (see Supplementary Information S5.3 for full sample results), and (ii) observations with missing structural information. All continuous variables are winsorized at the 1st and 99th percentiles to eliminate outlier distortions, yielding a final unbalanced panel of 4395 firm-year observations for the core DML analysis.

3.2. Climate Risk

3.2.1. Physical Risk

Physical risk measures the objective exposure level of an enterprise’s operational location to extreme weather conditions, such as extreme high temperatures, floods, droughts, and typhoons. This paper manually collects the latitude and longitude coordinates of the office addresses of A-share high-emission enterprises and performs spatial matching with a high-resolution historical database of extreme disasters from the China Meteorological Data Network.

Specifically, this study constructs a physical risk exposure index based on the attenuation of geographical distance. For the physical risk faced by enterprise i in year t, the calculation formula is as follows:

P h y s i c a l_R i s k_{i, t} = \sum_{k = 1}^{K} ω_{k} \cdot \frac{D_{k, t}}{1 + f (d_{i, k} / \bar{d})}

(1)

where

D_{k, t}

represents the sum of disaster k in year t (such as rainfall deviation, typhoon level, or the number of hot days) [56], the intensity has been standardized;

d_{i, k}

is the spherical distance between enterprise i and the center of disaster k;

\bar{d}

is the preset distance threshold and is set as 200 km, which is approximately the radius covering prefecture-level cities or neighboring cities within the same province. Within this distance, climate disasters (such as typhoons and extreme heavy rains) not only damage the factory assets of enterprises, but also significantly impact the local supply chain, logistics and power supply. This is the most significant radius where physical risks come into play; and

ω_{k}

is the weight coefficient for different types of disasters. This indicator achieves the transition from perceiving risks to objective exposure and more accurately captures exogenous shocks determined by geographical location [44,57]. The robustness checks by changing the threshold are reported in Supplementary Information S5.3.

3.2.2. Transition Risk

Transition risks reflect the pressure and legal constraints faced by enterprises regarding low-carbon policies. This paper employs text mining methods to extract keywords related to climate policies from the Management Discussion and Analysis (MD&A) sections of enterprise annual reports, thereby measuring the sensitivity of management to the external regulatory environment. The definition is as follows:

T r a n s i t i o n_R i s k_{i, t} = (\frac{\sum K e y w o r d_{t r a n s, i, t}}{S e n t e n c e_C o u n t_{i, t}}) - Median {(\frac{\sum K e y w o r d_{t r a n s, j, t}}{S e n t e n c e_C o u n t_{j, t}})}_{j \in Industry}

(2)

The keyword library for transition risk encompasses specific regulatory terms such as “carbon peak,” “emission quotas,” “green taxation,” and “energy structure adjustment.” Unlike the geographical nature of physical risks, transition risk focuses on the objective regulatory compulsion that constrains an enterprise’s strategic decisions, such as greenwashing or substantive transformation. To ensure the validity of this measure and mitigate confounding effects from textual verbosity or general industry trends, we employ a Relative Textual Density (RTD) approach. First, to avoid “false alarms,” we conduct manual and corpus-based cleaning to eliminate sentences that merely describe generic industry prospects or provide boilerplate compliance statements. Second, we define the raw risk intensity as the frequency of these refined keywords normalized by the total number of sentences in the MD&A section. This sentence-level density is more robust than word-count normalization, as it effectively isolates specific information from the “filler-word” strategies or redundant linguistic styles often used by management. Finally, we perform an Industry-Year Median Adjustment by subtracting the concurrent industry median from each firm-year observation. This procedure effectively filters out macro-level policy shocks—such as the 2020 “Dual Carbon” target announcement—that would otherwise cause a systemic rise in keyword frequency across all firms. The resulting

T r a n s i t i o n_R i s k_{i, t}

indicator accurately captures the incremental transformational pressure an enterprise faces relative to its peers. To verify this indicator’s effectiveness, Table 1 reports the correlation analysis between our measure and the intensity of actual macro-environmental regulations at the provincial level.

The results show that the enterprise transformation risk calculated in this study is significantly positively correlated with the actual number of low-carbon policies implemented by each province at the 1% level (with a correlation coefficient of 0.345). This strongly proves that the risk perception mentioned by the management in the MD&A is not ‘empty talk’, but truly reflects the external policy pressure and compliance costs faced by the enterprise. This consistency lays a reliable measurement foundation for the subsequent research of this paper on the impact of climate risks on greenwashing behavior.’

3.3. CEO Risk Preference

This study believes that executives’ attitude towards risk will inevitably be mapped into their behavioral decisions [58], and then affect the operating results of the company. Referring to the previous literature [59,60], this study selects the following six financial indicators from the five dimensions of asset structure, solvency, profit structure, profit distribution and cash flow, and uses principal component analysis (PCA) to extract the principal components reflecting the risk preference level of executives. In addition, a CEO behavior and personal characteristics is added to the analysis to further recognize the impact of behavior traits and CEO background [54,61,62,63,64]. To address the potential reverse causality issue, this paper uses a one-period lagged variable of CEO risk preference in the baseline regression:

C E O_r i s k_p r e f = \sum_{j = 1}^{m} W_{j} F_{j}, W_{j} = \frac{λ_{j}}{\sum_{k = 1}^{n} λ_{k}}

(3)

For detailed features, please refer to Supplementary Information S1. To eliminate the excessive dependence of the PCA comprehensive index on a single original variable, this paper conducted a sensitivity test using the Leave-one-variable-out method. Specifically, we successively removed one variable from

X_{1} - X_{7}

, and then re-performed principal component analysis using the remaining 6 variables. The results (Figure 3) show that the correlation coefficient between the new principal component scores obtained after removing any one variable and the benchmark scores is all higher than 0.90. This strongly proves that the CEO risk preference index constructed in this paper has extremely strong internal consistency. Its measurement results are not disturbed by abnormal fluctuations of specific single indicators (such as the debt ratio or the CEO’s background), thus ensuring the robustness of the empirical conclusion. To avoid mechanical multicollinearity, we exclude

X_{1} - X_{7}

from covariates when CEO risk preference is considered in the model.

3.4. Real Pollution Control Efficiency

This paper employs the Slack-Based Measure [65] that takes into account undesirable outputs to calculate the pollution control efficiency of enterprises in reducing pollution emissions while maintaining economic output. The efficiency is calculated as:

R e a l_E f f (ρ^{*}) = \min \frac{1 - \frac{1}{3} (\frac{s_{L a b o r}^{-}}{x_{L a b o r, 0}} + \frac{s_{C a p i t a l}^{-}}{x_{C a p i t a l, 0}} + \frac{s_{E n e r g y}^{-}}{x_{E n e r g y, 0}})}{1 + \frac{1}{1 + 2} (\frac{s_{V a l u e}^{g}}{y_{V a l u e, 0}^{g}} + \frac{s_{W a t e r}^{b}}{y_{W a t e r, 0}^{b}} + \frac{s_{A i r}^{b}}{y_{A i r, 0}^{b}})}

(4)

See Supplementary Information S2 for details.

3.5. Greenwash

According to previous definition [31], greenwashing refers to the act of enterprises making false or misleading statements about their environmental protection commitments. To accurately quantify this behavior, this paper constructs two-dimensional indicators: Symbolic disclosure level (

E R_d i s

) and the substantive performance level (

E R_p e r

) following relative studies [66,67]:

G W_s c o r e_{i, t} = \frac{E R_d i s_{i, t} - {\bar{E R_d i s}}_{t}}{σ_{d i s, t}} - \frac{E R_p e r_{i, t} - {\bar{E R_p e r}}_{t}}{σ_{p e r, t}}

(5)

where Symbolic disclosure level (

E R_d i s

) uses the Bloomberg environmental disclosure score. This score specifically assesses the detail of information disclosed by enterprises in their ESG reports, representing the “verbal commitment” of the enterprises. And the substantive performance level (

E R_p e r

) is measured by the Wind environmental score. This score places greater emphasis on the actual pollution discharge indicators, environmental violation records, and substantive emission reduction achievements of the enterprises, representing their “actual actions”.

In order to eliminate the influence of different rating systems on the dimension, this paper standardizes the above two scores at the “annual-industry” level.

\bar{E R}

and

σ

represent the average and standard deviation of the corresponding scores within the same year and the same industry, respectively.

Furthermore, in order to verify the robustness of the conclusion, this paper also conducted alternative tests in the following sections by using a binary greenwashing identification variable (DGW) constructed based on “annual report keyword frequency” and “administrative penalty records”. Firstly, we need to separately evaluate the oral “green propaganda” (Oral) and the actual environmental performance (Actual) of the enterprises [68,69]. We constructed a term set related to the degree of greenness or the environment, including terms such as “green”, “environmental protection”, “low carbon”, and “environment”. Then, for each enterprise’s annual observation data, we analyzed the frequency of these terms appearing in the management and disclosure sections of the annual reports. For Oral, when the frequency of occurrence in the observation data is higher than the median of the same industry in the same period, the value is 1; otherwise, it is 0. For the Actual value, when the enterprise’s annual observation data is subject to environmental administrative penalties, the value is 1; otherwise, it is 0. In summary, the calculation method of the proxy variable for “green whitewashing” (DGW) of the enterprise is as follows:

D G W_{i, t} = \{\begin{matrix} 1, & if O r a l_{i, t} = 1 and A c t u a l_{i, t} = 1 \\ 0, & otherwise \end{matrix}

(6)

The validation of greenwash index is reported in Table S11, Supplementary Information S8.

3.6. Mechanisms: CEO Myopia and Disclosure Opacity

To overcome the potential measurement bias in the text analysis method, this paper adopts the CEO compensation duration (Duration) as a proxy variable for short-sighted behavior [70]. This indicator measures the degree to which the interests of the management are tied to the long-term value of the company by calculating the weighted average of the expiration times of various components (base salary, bonus, restricted stocks, options) in the CEO’s compensation package:

D u r a t i o n_{i} = \frac{\sum_{t = 1}^{T} t \times P a y_{i t}}{\sum_{t = 1}^{T} P a y_{i t}}

(7)

where

P a y_{i t}

represents the expected salary cash flow for the CEO in the t-th year. T represents the unlocking or exercise period for each component of the compensation. Among them, the basic salary and the annual bonus’ duration is set to 0 (to be paid in the same year, representing extremely short-term incentives). For restricted stocks and stock options, the weighted time is calculated based on the specific annual vesting plans of each type.

This paper measures the opacity of enterprise information disclosure using Analyst Forecast Dispersion. a metric widely recognized in the previous literature as a reliable proxy for the informational environment quality [71,72]. The underlying economic intuition is that professional analysts act as sophisticated information intermediaries; when a firm’s disclosures are transparent and predictable, analysts are more likely to reach a consensus on its future performance. Conversely, as climate-related transition risks escalate, they introduce substantial uncertainty regarding asset valuation and regulatory compliance costs, which “shrouds” the firm’s true economic state. We define and calculate

O p a c i t y_{i, t}

as the standard deviation of earnings-per-share (EPS) forecasts for firm i in year t, scaled by the absolute value of the mean forecast to ensure cross-sectional comparability:

O p a c i t y_{i, t} = \frac{σ (F o r e c a s t_{i, t, j})}{|M e a n (F o r e c a s t_{i, t, j})|}

(8)

where j denotes the individual analysts providing forecasts for the target firm. A higher Opacity value indicates a greater degree of disagreement among market experts, reflecting a more “noisy” or “opaque” information environment. By adopting this market-based measure rather than a self-reported disclosure index, we capture the realized informational friction resulting from climate shocks.

3.7. Covariates

Following previous studies [25,32,47,73], this study adds 33 different firm features and covariates for DML. See Supplementary Information S3 for detailed variable definitions and summary statistics.

3.8. Residual Regression and DML Model Training

The fundamental principle of Debiased Machine Learning is “orthogonalized residual regression,” which reduces the impact of initial-stage estimation errors on causal inference after computing the conditional expectation functions [15]. For a detailed explanation of the specific learners employed in the DML framework, please refer to Supplementary Information S4. The first step in this estimation procedure is residualization:

{\tilde{Y}}_{i} = Y_{i} - \hat{g} (X_{i})

(9)

{\tilde{D}}_{i} = D_{i} - \hat{m} (X_{i})

(10)

The residuals

{\tilde{Y}}_{i}

and

{\tilde{D}}_{i}

represent the dependent variable and the treatment variable after removing the influence of the covariate

X_{i}

, capturing the parts of change in Y and D that cannot be explained by X. Then the model is implemented with OLS regression:

{\tilde{Y}}_{i} = θ {\tilde{D}}_{i} + μ_{i}

(11)

where ATE is denote by the coefficient

θ

. In addition, to avoid overfitting, K-Fold cross-validation (K = 5) is used for sample hierarchical training and prediction residual construction. First, Divide the population sample set D into K parts:

D = \cup_{K = 1}^{K} D_{K}, D_{k} \cap D_{k^{'}} = \emptyset for K = k^{'}

(12)

Next, the first k compromise uses all training machine learning algorithms to provide predictions based on

D_{k}

in addition to

D_{k}

:

\hat{g} (X_{i}) = {\hat{g}}^{(- k)} (X_{i}), \hat{m} (X_{i}) = {\hat{m}}^{(- k)} (X_{i}), \forall X_{i} \in D_{k}

(13)

That is, the predicted value of

X_{i}

for each sample is given by a model trained without its own subsamples, ensuring that the predictions are unbiased. Therefore, the average prediction error of the entire K-fold cross-validation can be written as:

Error = \frac{1}{n} \sum_{i = 1}^{n} L (Y_{i}, {\hat{g}}^{(- k (i))} (X_{i}))

(14)

where L is the loss function. In addition, this study employs the causal forest framework [16], which enables the estimation of Conditional Average Marginal Effects (CAME) for each individual firm. This approach allows for a granular exploration of how the treatment effect of climate risk on greenwashing behavior varies across diverse firm characteristics. By capturing such individual-level heterogeneity, the model identifies the specific conditions under which enterprises are more predisposed to strategic disclosure in response to environmental shocks.

The target of interest for each firm is:

τ (X_{i t}) = \partial E [Y_{i t} - D_{i t} | X_{i t}] / \partial D_{i t}

(15)

The implementation of the causal forest in this research follows classical framework [74] and begins with the honest splitting of samples, where the observations are divided into distinct training and estimation subsets to effectively mitigate the risk of overfitting. Within each generated causal tree, node splitting is performed by identifying the specific variables that best distinguish differences in the Conditional Maximum Effect. The model then achieves stable estimation through forest aggregation, which involves calculating the average treatment effect across a multitude of individual trees. The resulting output encompasses the Conditional Average Marginal Effect (CAME) at the individual level, alongside the mean, median, and overall distribution of the CAME. Furthermore, the model identifies significant variations in variable heterogeneity after performing a sample grouping based on the median values. The final CAME derived from the causal forest estimation is formulated as follows:

CAME = (1 / n) \cdot \sum_{i = 1}^{n} τ (X_{i t})

(16)

4. Results

4.1. DML Settings

4.1.1. Hyperparameter Settings

To avoid estimation bias arising from potential misspecification of the first-stage nuisance models, we adopt the Debiased Machine Learning (DML) framework [15]. We simultaneously employ two complementary learners—Random Forest (RF) and Lasso—to estimate the conditional expectation functions E[Y|X] and E[T|X]. RF captures potential interaction effects and non-linear relationships among high-dimensional covariates, while Lasso performs variable selection and sparse modeling via L1 regularization, making it well suited for screening high-dimensional control variables. The simultaneous use of RF and Lasso is strategic: RF is uniquely positioned to capture the non-linear and interactive influences of CEO risk preference on greenwashing behaviors, while Lasso provides interpretable variable selection for a wide range of provincial-level macro-controls.

For RF, we use GridSearchCV with 5-fold cross-validation to tune the hyperparameters. Following the recommendations related literature [75], we search: n_estimators in {300, 500, 1000} to ensure convergence; max_depth in {5, 7, 10} to prevent overfitting; and min_samples_leaf in {5, 10, 20} to enhance generalization. The scoring metric is cross-validated R². The optimal hyperparameters are max_depth = 7, min_samples_leaf = 10, and n_estimators = 500. For Lasso, we rely on the built-in 5-fold cross-validation of the LassoCV class to automatically select the regularization parameter alpha. This procedure minimizes the cross-validated prediction error and avoids subjective bias in manual alpha tuning.

Before DML estimation, we apply the double-selection method [76]. Specifically, we take the union of covariates retained by Lasso when regressing each outcome variable Y and each treatment variable T on X, and use this union as the final covariate set. This alleviates concerns about over-control and collider bias arising from high-dimensional controls.

For sample splitting, we adopt the standard K = 5-fold cross-fitting strategy. This choice balances the reduction of overfitting bias with computational efficiency. Standard cross-fitting mitigates the variability of the estimated causal parameter that may result from random sample splits, while avoiding the heavy computational burden of a larger number of folds. Because a single random split could still affect estimation efficiency, we repeat the cross-fitting procedure 10 times (n_rep = 10) and report the average estimate, thereby improving the stability of the estimator and its standard errors.

To control for unobserved heterogeneity across industries and over time, we further include industry and year fixed effects as additional binary covariates in the covariate set. These fixed effects are treated as ordinary control variables and are absorbed into the nuisance functions E[Y|X] and E[T|X] during the first-stage estimation. The orthogonalization step then partials out the influence of both the continuous controls and the fixed effects, ensuring that the estimated causal parameters are identified from within-industry and within-year variation after removing time-invariant industry characteristics and common time shocks. To account for potentially correlated shocks within firms over time, we employ cluster-robust standard errors at the firm level across all repetitions of the DML procedure.

Table 2 summarizes the configurations of the two learners and their hyperparameter settings. Hyperparameter tuning results based on different model settings and learners are reported in Supplementary Information S5.

4.1.2. Covariates Selection

To address the potential issues of over-control and collider bias arising from high-dimensional control variables, we adopt the Double Selection method [76] and conduct variable screening prior to DML estimation. Specifically, for each treatment variable (Physical_Risk, Transition_Risk) and each outcome variable (GW_score, Real_Eff), we employ Lasso regression with a regularization parameter alpha = 0.01 to select variables with non-zero coefficients from the 33 initial control variables. The choice of alpha = 0.01 reflects a conservative screening strategy, allowing Lasso to retain a broader set of potentially relevant variables and thereby avoiding the omission of important confounders.

Table 3 reports the Lasso selection results for each target variable, as well as the final union set. The results indicate that 19 variables are retained when the target is GW_score, 13 variables are retained for Real_Eff, and 22 variables are retained for each of the climate risk variables (Physical_Risk and Transition_Risk). The variation in selected variables across target variables suggests that different outcomes and treatments are driven by different sets of covariates. For example, Real_Eff retains only 13 variables, implying that its actual emission reduction efficiency is mainly explained by a few core financial and governance indicators (e.g., ROA, cash flow, quick ratio, growth rate, board size). In contrast, the climate risk variables retain a larger set of covariates (22 variables each), indicating that climate risk is influenced by a broader range of financial and governance characteristics.

Taking the union of the four Lasso selection results yields a final set of 29 variables—only 4 of the original 33 covariates are excluded by all models. This nearly complete union selection strategy leverages the advantage of Lasso in removing noise variables from individual regressions, while the union operation prevents the omission of potentially important covariates that may be relevant only for specific target variables. Another benefit of retaining almost all variables is that in the first stage of DML, the learners (Random Forest and Lasso) can still exploit the full set of original information, while the double-selection step mainly serves to exclude clearly irrelevant variables. As a result, the approach controls dimensionality while maintaining inclusiveness for potential confounders. All covariates are predetermined (measured before the outcome and treatment) to avoid post-treatment bias, and the double-selection union approach retains confounders conservatively.

Overall, our variable selection strategy satisfies the robustness requirements of high-dimensional control without excessively reducing the covariate set, thereby providing a sufficiently rich conditioning set for the subsequent DML estimation. The detailed variables selected is reported in Supplementary Information S6.

4.1.3. Orthogonality Diagnostics

Table 4 reports the orthogonal diagnostic results and cross-validation R² of the first-stage conditional expectation function under each outcome variable, treatment variable, and learner combination. The main findings are as follows:

First, the absolute values of the residual correlation coefficients ρ for all combinations are all less than 0.1, and the corresponding p-values are all greater than 0.10 (ranging from 0.16 to 0.91). The non-significant correlation between the residuals suggests that, after partialling out the influence of high-dimensional observed characteristics, no simple linear association remains between the residualized treatment and outcome variables. This alignment with the expectations of the DML first-stage ‘orthogonalization’ ensures that the identified causal parameter

θ

is immune to biases arising from omitted covariates or functional misspecification. Therefore, the setting of the conditional expectation function E[Y∣X] and E[T∣X] is reasonable, and the orthogonality condition is satisfied, which lays the foundation for subsequent unbiased causal inference.

Second, the cross-validation R² of the first-stage model for E[Y∣X] and E[T∣X] is all positive and of a certain magnitude. For the outcome variable GW_score (greenwashing index), the CV R² ranges from 0.154 (Lasso and Transition_Risk combination) to 0.298 (RF and Physical_Risk combination); for Real_Eff (real pollution control efficiency), the CV R² reaches a maximum of 0.705 (Lasso). These positive values indicate that the selected control variables and the learner can effectively explain the variation in the outcome variable and the treatment variable. Although DML does not require the first-stage model to have very high prediction accuracy, the moderate R² values further confirm the effectiveness of the anonymous model (nuisance models) and the rationality of the orthogonalization step. The variation in R² across different models reflects the heterogeneous predictive power of financial covariates on substantive efficiency versus strategic disclosure behaviors.

Thirdly, the orthogonal diagnostic results are highly consistent across the two learners (RF and Lasso) and the two climate risk indicators. The orthogonality condition is satisfied under all settings, and the predictive performance shows good stability. In summary, these diagnostic tests confirm the appropriateness of our DML setup and provide support for the causal estimation results reported in Table 4.

4.2. DML Main Results

Table 5 presents the estimated associations between climate risks and corporate greenwashing as well as real emission efficiency under the DML framework. While DML helps to control for high-dimensional confounding, the results should be interpreted with the usual caution regarding causal inference from observational data.

For the continuous greenwashing score (GW_score), both physical and transition risks exhibit positive and statistically significant coefficients across the two learners. Specifically, the coefficients for physical risk are 0.7115 (t = 2.906) under RF and 0.3884 (t = 8.455) under Lasso; those for transition risk are 0.8204 (t = 3.035) and 0.4037 (t = 6.611). The binary greenwashing indicator (DGW) yields consistently positive and significant results (all t-values > 5.0). These findings suggest that firms facing higher climate risks tend to engage in more symbolic environmental disclosure or “greenwashing”. The pattern is consistent with a defensive greenwashing motivation, whereby firms may resort to strategic disclosure in response to climate pressures rather than undertaking substantive green transformation. In particular, the differences between the RF and Lasso estimation results imply that the impact of climate risks on greenwashing may exhibit nonlinear characteristics: when the risk pressure is low, the strategic responses of enterprises are relatively weak; but once the risk exceeds a certain threshold, the behavior of greenwashing may rapidly intensify. The consistently higher coefficients observed under the RF learner compared to Lasso (e.g., 0.7115 vs. 0.3884 for Physical Risk) suggest that climate-induced greenwashing is not a simple linear process. Rather, it may involve threshold effects or complex interactions with CEO traits, where the motivation to greenwash escalates exponentially once risk pressures surpass critical corporate endurance levels.

The reported “Residual R²” values are small (ranging from 0.01 to 0.11), which is typical for DML regressions. Nevertheless, the consistently significant coefficients and the orthogonality diagnostics (see Table 4) support the reliability of the estimates.

Taken together, the DML analysis reveals a robust positive statistical association between climate risks and corporate greenwashing, while no such association is found for real emission efficiency. Crucially, across all DML specifications and alternative measures (SBM vs. CCR-DEA, see Supplementary Information S5.3), we find no statistical evidence that climate risks promote substantive green transformation (Real_Eff). This decoupling of symbolic disclosure from actual performance provides empirical support for the ‘Defensive Greenwashing’ hypothesis. While the lack of a significant impact on Real_Eff could also reflect operational rigidities or resource constraints in the short term, its coexistence with significantly increased symbolic disclosure suggests a tendency toward strategic signaling rather than fundamental reconfiguration. However, a persistent strategic behavior of greenwash is identified by System GMM (See Supplementary Information S9 for details).

4.3. Moderating Role of CEO Risk Preferences

To further explore the potential heterogeneity in how climate risks relate to corporate behavior, this paper employs a lagged CEO risk preference (at t − 1) as a moderating feature within the Causal Forest DML framework. By employing the lagged (t − 1) CEO risk preference as the moderating feature, we mitigate simultaneity bias and ensure that the captured heterogeneity reflects the modulation effect of pre-existing psychological traits on subsequent climate-related decision-making, rather than a synchronized reaction to the same economic shock. Figure 4 illustrates the estimated Conditional Average Treatment Effect (CATE), revealing non-linear patterns that suggest a complex interplay between managerial style and risk exposure. In Panel A (Physical Risk), the results indicate that the association between physical risk and greenwashing is relatively suppressed when the lagged CEO risk preference is low. Notably, a non-linear transition appears as the risk preference moves into the positive domain, where the estimated treatment effect exhibits a step-wise increase. This pattern suggests a “threshold-trigger” mechanism: managerial risk tolerance may act as a catalyst that amplifies the corporate tendency to use symbolic disclosure as a buffer against physical disasters, but this amplification only becomes prominent after a specific psychological threshold is crossed. The ATE of 2.072 (p < 0.01) provides empirical support for the statistical significance of this observed heterogeneity. In Panel B (Transition Risk), the CATE estimates display a distinct asymmetric structure with a more pronounced sensitivity compared to physical risk. While transition-related pressures show a limited relationship with greenwashing among risk-averse managers, the link intensifies markedly as the lagged risk preference turns positive. The CATE curve for transition risk exhibits an inverted U-shaped tendency at high levels of risk preference, suggesting a saturation point. This implies that while risk-seeking managers are more prone to strategic greenwashing, there may be a ‘self-constraint’ mechanism or a fear of heightened regulatory scrutiny (reputation cost) that prevents greenwashing from escalating indefinitely, even under extreme risk-taking orientations. The larger ATE for transition risk (3.186, Std.Err: 0.952) compared to physical risk (2.072, Std.Err: 0.301) suggests that the corporate strategic response to policy-induced shocks is more volatile and sensitive to managerial styles than the response to tangible physical disasters.

4.4. Mechanism Channels

After identifying the main effect and the moderating role of managerial style, this paper employs a moderated mediation framework to explore the potential channels through which climate risk is associated with corporate greenwashing behavior. Using segmented regression under the DML framework and causal forest-based estimation, this study identifies two plausible mechanisms that appear to be regulated by CEO risk preference: the induction of managerial myopia and the amplification of information opacity.

In this analysis, the focus shifts to transition risks. This choice is grounded in the observed divergence of strategic motivations under different climate pressures. While physical risks typically manifest as uncontrollable external forces causing tangible asset damage, transition risks—arising from policy shifts and technological paradigms—form a complex institutional pressure. Such pressure may enter the decision-making function more directly, suggesting a potential strategic trade-off between long-term sustainability and immediate valuation survival. To assess the internal validity of these channels, we map our assumptions using a Directed Acyclic Graph (DAG) (see Figure 5).

Regarding the Managerial Myopia Channel, Table 6 reports a significant positive association between transition risk and myopia

(β

= 0.3005, t = 3.538) and between myopia and greenwashing

(β

= 1.2463, t = 6.690). This is further elucidated in Panel A of Figure 6, where the Step 1 plot demonstrates a monotonic relationship. Notably, the Step 2 plot reveals that the marginal link between myopia and greenwashing is not uniform; instead, it exhibits a sharp upward “jump” once short-termism crosses a critical threshold. This suggests that the indirect association of 0.3745 reported in Table 6 may be predominantly driven by firms under acute psychological strain.

Simultaneously, for the Information Opacity Channel, Table 6 provides evidence that transition risk is associated with increased opacity

(β

= 0.2007, t = 8.037), which in turn facilitates intensified greenwashing (

β

= 1.3642, t = 2.625). As transparency is steadily eroded (Step 1), firms may face fewer informational constraints, making deceptive signaling a more accessible strategic outlet (Step 2). The indirect effect of 0.2738 characterizes this channel as a consistent “informational shroud” that may enable firms to capture green premiums while evading effective scrutiny.

Figure 6 employs non-parametric DML estimations to unveil these distinct trajectories. In Panel A, the widening confidence intervals at higher risk levels depict a heterogeneous decision-making landscape: faced with similar regulatory pressures, some management teams maintain resilience while others appear to succumb to existential anxiety. The “non-linear leap” in Step 2 reflects what could be interpreted as a state of “desperate gambling”: when psychological pressure reaches a breaking point, greenwashing may cease to be a subtle adjustment and instead become a “last-resort” manipulation.

In contrast, Panel B depicts a process of “incremental erosion.” As transition risk intensifies, analyst disagreement expands at a nearly constant rate, reflecting how climate policy complexity may act as a thickening “fog.” The robust, linear correlation in Step 2 suggests that every incremental loss in transparency is converted into expanded “strategic space” for greenwashing. While the myopia channel represents an “acute stress response,” the opacity channel functions more like “institutional arbitrage,” where firms may capitalize on a deteriorating informational environment.

To account for potential sequential ignorability violations, we incorporate E-value sensitivity analysis (Table 6). The calculated E-values (3.459 for myopia; 2.118 for opacity) indicate that any unobserved confounder would require a substantial risk ratio to invalidate these findings—a scenario that is less probable given our extensive control for 29 firm-specific covariates and the temporal precedence established in Supplementary Information S7.

4.5. Heterogeneity Analysis Based on Causal Forest

4.5.1. Features Contribution

To delineate the boundaries and conditional factors governing the “symbolic vs. substantial” response gap, this study employs the Causal Forest algorithm [16] to calculate the Heterogeneity Discovery Importance (MDI) of covariates (see Figure 7). To enhance the robustness of our findings and address potential high-dimensional confounding, we implement a Double Selection procedure. Among the 33 candidate covariates, 29 were retained based on their predictive power for both climate risk exposure and disclosure indicators.

Figure 7 presents the MDI scores, revealing which firm-level characteristics are most closely associated with the variations in the Conditional Average Marginal Effects (CAME). The results suggest that Digitalization ranks as the most significant factor in explaining heterogeneity. This finding is consistent with the information opacity mechanism: a high level of digitalization is associated with increased disclosure complexity, which may lead to diminished clarity in environmental reporting when firms face external financing incentives.

Following Digitalization, financial indicators such as ROA, Growth, and Tang_Ratio emerge as important moderators. The high ranking of financial stability metrics suggests that for firms with constrained liquidity, the costs of climate transition may be more significant, potentially correlating with a tendency to prioritize symbolic reporting over substantive changes. Furthermore, the role of Subsidy and RD_Invest suggests that resource constraints may influence how firms navigate transition pressures. In summary, the MDI ranking indicates that corporate responses to climate risk are multi-dimensional, reflecting an interplay between digital capabilities, financial conditions, and the firm’s oversight environment.

4.5.2. Heterogeneous Marginal Effect Trends

Figure 8 presents the heterogeneous marginal effects estimated via the generalized random forest algorithm. To enhance reliability, the estimation incorporates the Honest Inference framework alongside Out-of-Bag (OOB) predictions. This procedure ensures that the marginal effect for each observation is generated using a subsample not utilized during the training phase, addressing potential concerns regarding over-fitting. In this visualization, the red solid line represents the predicted conditional marginal effect of transition risk, the blue dashed line represents that of physical risk, and the vertical axis captures the estimated tendency toward symbolic disclosure.

The results for Digitalization reveal a distinct divergence in corporate responses. In the Digitalization subgraph, as digital maturity increases, the marginal effect of transition risks exhibits a non-linear upward shift, while the impact of physical risks demonstrates a downward trend. This divergence is consistent with the hypothesis that digital transformation is associated with a greater capacity for adjusting disclosure in response to policy-driven transition pressures. The heightened sensitivity of the transition risk curve may reflect the role of digital tools in managing complex environmental narratives under regulatory stress, rather than purely substantive improvements.

The trends for profitability (ROA) and growth potential (Growth) align with differing response patterns based on financial health. For transition risk, the ROA subgraph shows a clear negative slope: as profitability improves, the estimated tendency toward symbolic disclosure significantly weakens. This suggests that firms with lower financial performance show a higher correlation with such behaviors, potentially reflecting a low-cost response to climate-induced financial pressure. In contrast, the Growth subgraph exhibits an upward trajectory for transition risk, indicating that firms in rapid expansion phases—which may face higher pressure to maintain a “green” market image—are more likely to exhibit symbolic responses when sensing policy shifts.

The analysis of financial constraints and Subsidy further refines the micro-landscape. The downward slope in the Leverage (Lev) subgraph for physical risk suggests that highly leveraged firms may exhibit different response patterns, possibly due to heightened monitoring that limits disclosure flexibility. Additionally, the relative stability across Subsidy and Tang_Ratio suggests that these responses are more sensitive to internal financial and digital profiles than to external governmental support.

In summary, the heterogeneous responses captured in Figure 8 indicate that symbolic environmental disclosure is associated with a complex interplay of digital capabilities, financial health (ROA), and market expansion needs (Growth).

4.5.3. SHAP Analysis

After characterizing the global marginal trends, this study employs SHapley Additive exPlanations (SHAP) to decompose the predicted heterogeneity at the granular sample level. While the marginal trend plots (Figure 8) visualize the structural functional forms, the SHAP scatter plots (Figure 9) provide a granular validation by illustrating the contribution of each specific observation to the estimated effects.

To ensure the statistical reliability of these decompositions, we first address potential concerns regarding feature correlations. As illustrated in the Pearson correlation matrix (Figure 10), the pairwise correlations among the ten variables are generally low-to-moderate, with all coefficients remaining below 0.6. Furthermore, we employ a SHAP explainer that accounts for the conditional dependence of features, ensuring that each covariate’s contribution reflects its role within the observed joint distribution rather than unrealistic feature combinations.

Panel A of Figure 9 (Transition Risk) identifies Wage and Subsidy as prominent factors associated with response divergence, followed by ROA. Observations with lower wage levels and poorer profitability (low ROA) are predominantly concentrated in the positive SHAP region. This suggests that firms characterized by higher labor costs or financial constraints exhibit a higher estimated tendency toward symbolic disclosure when facing transition pressures. Notably, Digitalization exhibits a distinct pattern where high feature values concentrate in the positive SHAP interval. This is consistent with the hypothesis that digital tools, in certain contexts, may be associated with more complex disclosure adjustments rather than immediate substantive efficiency gains.

Panel B of Figure 9 (Physical Risk) reveals that Cashflow and Wage are closely linked to the heterogeneity, reflecting a mechanism potentially related to financial profiles. Observations with low cashflow are densely packed in the positive SHAP region, with an impact magnitude exceeding that of other governance variables. This indicates that liquidity stress is associated with a higher frequency of symbolic disclosures, possibly acting as a buffer against the perceived financial risks of physical shocks. The relative stability across dimensions like Tang_Ratio and Subsidy further suggests that the response to physical risk is more closely aligned with internal financial conditions than with institutional support or asset structure.

In conclusion, the alignment between the global patterns (Figure 8) and the local contributions (Figure 9) demonstrates the robustness of our findings. The heterogeneity in corporate climate responses reflects systematic patterns associated with digital capabilities, financial conditions, and resource constraints, rather than isolated outliers.

4.5.4. Firm Profiling: Who Are the Greenwashers

After completing the heterogeneity analysis, this study constructs representative profiles of enterprises exhibiting distinct estimated responses to climate risk. To enhance statistical reliability and mitigate potential over-fitting, we implement a split-sample validation protocol. The DML framework is trained on an 80% hold-out set, while the firm profiling and mean-difference tests are conducted exclusively on the remaining 20% independent testing set (N = 879). Observations in the testing set are categorized into a High Sensitivity Group and a Low Sensitivity Group based on their Conditional Average Marginal Effect (CAME) values.

As reported in Table 7, firms in the High Sensitivity Group exhibit characteristics consistent with high resource availability and digital maturity. In the dimensions of Digitalization and ROA, the high-sensitivity group shows significantly higher mean values (differences of 0.1275 and 0.4559, respectively; p < 0.01). This is consistent with the hypothesis that firms with higher information-processing capabilities and financial slack may have a greater capacity for complex environmental reporting. Furthermore, this group demonstrates robust financial flexibility, characterized by significantly higher quick ratios and cashflow (Diff = 0.2371, p < 0.01), coupled with lower leverage and wage burdens. This suggests that the tendency toward symbolic disclosure is more pronounced among financially healthy firms, which may utilize their resource positions to manage market perceptions under climate-related transition pressures.

To further validate that these profiling results reflect structural patterns rather than statistical noise, we conduct a placebo grouping test (Table 8). By randomly assigning sensitivity labels, we observe a systematic collapse of statistical significance across all ten dimensions. The T-values, which were significant in the primary classification, vanish in the placebo set, supporting the robustness of our sensitivity-based stratification.

In summary, this section identifies the typical profile of a firm with a higher estimated propensity for symbolic environmental responses: high digital maturity, strong liquidity, and lower labor-cost pressure. These findings suggest that under certain conditions, enterprises may leverage their technical and financial positions to prioritize signaling-based disclosure, reflecting a potential gap between reported commitments and substantive green competitiveness.

4.6. Endogeneity Problems and Robustness Checks

Based on the IV-DML results reported in Table 9, we construct two time-varying instrumental variables to address potential endogeneity. For transition risk, the instrument is the interaction between the one-year lagged Climate Policy Uncertainty Index (CPU_lg1) and a firm-specific Sensitivity Index. For physical risk, we employ the interaction between the one-year lagged extreme event frequency (EVO_lg1) and the same Sensitivity Index. This index, derived via PCA from 29 base-period characteristics, captures persistent heterogeneity in firms’ responsiveness while allowing for firm-year variation through interaction with aggregate shocks.

The first-stage results confirm strong relevance: the CPU_lg1×Sensitivity Index interaction is positively associated with transition risk (coef = 0.3103, t = 3.667), and the EVO_lg1×Sensitivity Index interaction is associated with physical risk (coef = 0.2653, t = 4.294). The Kleibergen-Paap Wald F-statistics (26.88 and 37.16) exceed the Stock-Yogo critical value of 10, rejecting weak instrument concerns. In the second-stage IV-DML estimation, the coefficients for physical risk (0.4038, t = 3.389) and transition risk (0.3006, t = 4.358) remain positive and significant at the 1% level. Notably, the control function residuals enter the second stage insignificantly, supporting the exclusion restriction.

Complementing the IV analysis, we performed a series of robustness checks detailed in Supplementary Information S5. The ablation study (Table S5) demonstrates that the full DML + RF framework is essential, as simpler OLS or non-cross-fitted models suffer from either linear bias or overfitting. Furthermore, results remain stable across alternative physical risk thresholds (50 km, 100 km), exponential decay functions, and when using environmental penalties as an alternative measure for transition risk (Table S7). These extensive tests alleviate concerns that the results are artifacts of arbitrary modeling choices or measurement bias.

5. Discussion

5.1. The “Threshold-Trigger” Mechanism: Why Climate Risks Drive Non-Linear Defensive Greenwashing

The DML results (Table 5) reveal a positive association between climate risks and symbolic disclosure while highlighting a notable methodological divergence: the coefficients estimated via Random Forest (RF) are consistently higher than those from Lasso (e.g., 0.7115 vs. 0.3884 for physical risk). In terms of economic significance, considering the standardized variables, a one-standard-deviation increase in physical risk is associated with a 0.7115 unit increase in the GW_score under the RF specification. Given that the baseline (mean) GW_score in our sample is approximately 0.148, this represents a substantial relative escalation in greenwashing intensity.

This discrepancy suggests that the relationship between climate shocks and corporate disclosure is characterized by non-linearities that linear models fail to capture. For high-emission enterprises, the response to climate pressures appears more pronounced as risk levels intensify. This “threshold effect” implies that policy interventions based on linear assumptions may significantly underestimate the surge in symbolic signaling when climate risks cross critical corporate endurance levels. The observed gap between reporting intensity and substantive efficiency (Real_Eff) reflects a decoupling where disclosure serves as a signaling buffer to maintain legitimacy amidst rising decarbonization costs.

From a modeling perspective, the gap in estimated effects between RF and Lasso underscores the importance of accounting for complexity in high-emission sectors. These results suggest that as climate shocks increase, firms may find it more feasible to adjust their narratives than to undergo immediate, high-cost technological overhauls. This decoupling implies a potential risk of capital misallocation, as linear models may underestimate the extent of symbolic signaling. Such findings suggest the need for more nuanced green asset pricing models to better account for the varying nature of corporate climate responses during the low-carbon transition.

5.2. Managerial Risk-Taking: The Psychological “Accelerator” of Regulatory Arbitrage

The economic responses to climate shocks are contingent upon executive characteristics. Our GRF analysis indicates that CEO risk preference functions as a significant moderator, with the effect for transition risk (3.186) being notably higher than that for physical risk (2.072). This implies that for every one-standard-deviation shift in a CEO’s risk-taking propensity, the firm’s sensitivity to transition-related regulatory shocks increases by more than three times the baseline response.

For firms led by high-risk-preference executives, stringent climate policies—such as the tightening of the Carbon ETS—are associated with a more pronounced tendency toward symbolic disclosure. This pattern suggests that such firms seek to manage the perceived costs of compliance through “regulatory arbitrage,” favoring narrative adjustments over the high capital expenditure (CapEx) required for substantive carbon reduction.

However, the observed diminishing marginal effects at extreme risk-preference levels may indicate a potential boundary condition. This could reflect the influence of external monitoring mechanisms—such as the oversight of ESG-conscious institutional investors—which may impose a latent constraint on the extent of symbolic reporting. For high-emission industries, these results imply that while executive risk profiles are associated with variations in climate disclosure, the capital market’s monitoring environment remains a critical factor in shaping corporate reporting behavior.

5.3. The Capability Paradox: Digitalization and the Efficiency of Deception

The SHAP decomposition (Figure 9) and firm profiling (Table 7) highlight a “Capability-Response Gap”. Firms with high digital maturity and financial liquidity exhibit a more pronounced tendency toward symbolic disclosure rather than substantive transition. Economically, this is driven by the fact that the “marginal cost of signaling” (e.g., refining ESG reports via digital tools) is substantially lower than the “marginal cost of radical innovation” (e.g., implementing CCS technology).

This incentive structure presents a major challenge for policymakers: First, information asymmetry: Digital tools, while intended for efficiency, may be leveraged to increase disclosure complexity, making it harder for regulators to verify substantive performance. Moreover, capital misallocation: If firms with high digital capabilities secure “green premiums” through symbolic disclosure, they may lower their financing costs despite a lack of real emission reductions. Lastly, policy intervention: To counter this, oversight must shift from “disclosure-based” to “performance-linked” mechanisms. If a 1-SD increase in risk leads to a huge increase in greenwashing among the most “capable” firms, the public cost of verifying transition claims rises exponentially, necessitating targeted audits for digitally mature leaders in high-polluting sectors.

In terms of economic significance, this pattern suggests a potential friction in the allocation of green capital within high-emission industry. The SHAP values (Figure 9) and MDI rankings (Figure 7) indicate that digitalization is a primary determinant of the variance in corporate responses. The resulting information environment implies that firms with high digital capabilities may be more effective at securing “green premiums,” potentially lowering their cost of financing despite their actual environmental footprint.

5.4. Boundary Conditions and Generalizability

While this study focuses on Chinese renewable energy and high-emission enterprises, the findings offer broader implications for the global low-carbon transition, albeit with specific boundary conditions: First, emerging market context. The “Capability Paradox”—where high digital maturity facilitates symbolic signaling—is particularly relevant for emerging markets. In these settings, the rapid adoption of digital tools often outpaces the development of independent environmental verification infrastructure, lowering the relative cost of strategic disclosure. Second, regulatory maturity. A key boundary condition is the stringency of the legal environment. In jurisdictions with robust anti-greenwashing litigation and mature carbon auditing (e.g., the EU), the “defensive greenwashing” observed here might be mitigated by higher legal and reputational costs. Last, future research. Due to data constraints, a direct cross-country comparison was not feasible in this study. Future research should leverage international datasets to examine how different corporate governance norms and climate policy stringency moderate the impact of physical and transition risks on the decoupling of disclosure from substantive performance.

6. Conclusions and Policy Recommendations

6.1. Conclusions

This study investigates the relationship between multi-dimensional climate risks and corporate environmental disclosure behavior within China’s high-emission industries from 2009 to 2024. By utilizing a Double Machine Learning (DML) framework and Generalized Random Forest analysis, we address high-dimensional confounding—a critical factor in heavy industries with complex production linkages—to provide a more robust estimation of these associations. The empirical findings suggest several key insights:

First, both physical and transition climate risks are positively associated with an increase in symbolic greenwashing behavior. In high-emission sectors, where structural decarbonization and energy system overhauls require massive, irreversible capital expenditure, heightened climate pressure is associated with a tendency toward defensive reporting adjustments. Faced with significant transition costs and stringent environmental mandates, these firms may utilize expanded disclosure as a strategic buffer to manage legitimacy pressures and mitigate the perceived impact of climate shocks on their market valuation. Notably, the lack of significant improvement in substantive emission-reduction efficiency suggests a potential decoupling between reporting intensity and actual environmental performance.

Second, the association between climate risk and symbolic disclosure exhibits substantial heterogeneity. Specifically, firms led by CEOs with a higher risk-taking propensity and those with greater “resource slack” (e.g., higher liquidity and digital maturity) exhibit a more pronounced tendency toward this response gap. In the context of heavy industry, this pattern is consistent with an economic trade-off: firms may prioritize managing the immediate information environment to navigate the high marginal costs associated with radical green innovation, such as CCUS technology or hydrogen smelting.

Notably, our analysis reveals a “digitalization-response gap” that is particularly acute in high-emission samples. While digital transformation is a potential catalyst for operational efficiency, it may also provide carbon-intensive firms with the technical capacity to manage environmental narratives with greater complexity. This increased disclosure sophistication may inadvertently widen the “monitoring gap” for regulators in sectors where actual emissions are costly to verify. These results remain robust across various alternative specifications, validation tests, and identification strategies.

Overall, the findings suggest that for high-emission industries, climate-related pressures may lead to a divergence between symbolic signaling and substantive action. This underscores the importance of developing more stringent, performance-linked oversight mechanisms to ensure that climate governance promotes genuine industrial decarbonization rather than strategic information adjustments.

6.2. Policy Recommendations

Based on the empirical evidence, this study proposes the following policy interventions to transition from “symbolic compliance” to “substantive transformation”:

First, implement “performance-linked” green financing for high emission industry. Transition from disclosure-based incentives to performance-linked financing. For high-emission firms, capital access and interest rates should be pegged to verified reductions in carbon intensity and energy-saving efficiency, rather than the presence of “green keywords” in annual reports.

Second, bridge the “digital monitoring gap”: Regulators should deploy AI-driven audit tools to detect “textual calibration” in digitally mature firms. Given that these firms possess the resources for sophisticated manipulation, oversight must move toward real-time, IoT-based monitoring of physical emission points to close the gap between digital narratives and industrial reality.

Third, mandate standardized transition path disclosures: To counter the “Capability Paradox,” policymakers should require high-emission firms to disclose specific, stage-based technical pathways (e.g., equipment upgrade schedules). This increases the “marginal cost of deception” by making vague, symbolic claims easier to debunk.

6.3. Limitations

Despite the methodological rigor provided by the DML framework, this study has the following limitations:

Endogeneity Challenges: Although instrumental variable (IV) strategies were employed, the reliance on observational data means that unobserved time-varying confounders cannot be entirely ruled out. Future research could utilize exogenous policy shocks or natural experiments to further strengthen the causal identification of “defensive greenwashing.”

Generalizability and Context: The findings are situated within the specific institutional and regulatory environment of China’s high-emission industries. The relationship between climate risk and greenwashing may vary across countries with different levels of ESG market maturity and regulatory stringency, warranting further cross-national comparative studies to verify the boundary conditions.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/su18105174/s1, Table S1: Index for SBM; Table S2: Variable definition; Table S3: Summary Statistics; Table S4: Robustness check; Table S5: Ablation test; Table S6: Measurement of pollution control efficiency; Table S7: Table S8: Full sample results; Table S9: Double selection results; Table S10: Mediation robustness checks; Table S11: Validation of the greenwashing index; Table S12: System GMM results with references [15,54,58,59,60,61,62,63,64,65,77,78,79,80,81,82,83].

Author Contributions

Conceptualization, methodology, formal analysis, investigation, writing—original draft preparation, S.M.; writing—original draft preparation, J.H.; writing—original draft preparation, H.N. writing—review and editing, supervision, H.H.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Engle, R.F.; Giglio, S.; Kelly, B.; Lee, H.; Stroebel, J. Hedging Climate Change News. Rev. Financ. Stud. 2020, 33, 1184–1216. [Google Scholar] [CrossRef]
Giglio, S.; Maggiori, M.; Rao, K.; Stroebel, J.; Weber, A. Climate Change and Long-Run Discount Rates: Evidence from Real Estate. Rev. Financ. Stud. 2021, 34, 3527–3571. [Google Scholar] [CrossRef]
Hong, H.; Li, F.W.; Xu, J. Climate Risks and Market Efficiency. J. Econom. 2019, 208, 265–281. [Google Scholar] [CrossRef]
Bolton, P.; Kacperczyk, M. Do Investors Care about Carbon Risk? J. Financ. Econ. 2021, 142, 517–549. [Google Scholar] [CrossRef]
Dowling, J.; Pfeffer, J. Organizational Legitimacy: Social Values and Organizational Behavior. Pac. Sociol. Rev. 1975, 18, 122–136. [Google Scholar] [CrossRef] [PubMed]
Hambrick, D.C. Upper Echelons Theory: An Update. Acad. Manag. Rev. 2007, 32, 334–343. [Google Scholar] [CrossRef]
Laufer, W.S. Social Accountability and Corporate Greenwashing. J. Bus. Ethics 2003, 43, 253–261. [Google Scholar] [CrossRef]
Cho, C.H.; Laine, M.; Roberts, R.W.; Rodrigue, M. Organized Hypocrisy, Organizational Façades, and Sustainability Reporting. Account. Organ. Soc. 2015, 40, 78–94. [Google Scholar] [CrossRef]
Lyon, T.P.; Montgomery, A.W. The Means and End of Greenwash. Organ. Environ. 2015, 28, 223–249. [Google Scholar] [CrossRef]
Hambrick, D.C.; Mason, P.A. Upper Echelons: The Organization as a Reflection of Its Top Managers. Acad. Manag. Rev. 1984, 9, 193. [Google Scholar] [CrossRef]
Wang, L.; Fu, L.; Yu, B. CEO Industry Expertise and Corporate ESG Performance. Int. Rev. Econ. Financ. 2026, 105, 104815. [Google Scholar] [CrossRef]
Varian, H.R. Big Data: New Tricks for Econometrics. J. Econ. Perspect. 2014, 28, 3–28. [Google Scholar] [CrossRef]
Fiss, P.C.; Zajac, E.J. The Symbolic Management of Strategic Change: Sensegiving Via Framing and Decoupling. Acad. Manag. J. 2006, 49, 1173–1193. [Google Scholar] [CrossRef]
Marquis, C.; Toffel, M.W.; Zhou, Y. Scrutiny, Norms, and Selective Disclosure: A Global Study of Greenwashing. Organ. Sci. 2016, 27, 483–504. [Google Scholar] [CrossRef]
Chernozhukov, V.; Chetverikov, D.; Demirer, M.; Duflo, E.; Hansen, C.; Newey, W.; Robins, J. Double/Debiased Machine Learning for Treatment and Structural Parameters. Econom. J. 2018, 21, C1–C68. [Google Scholar] [CrossRef]
Athey, S.; Tibshirani, J.; Wager, S. Generalized Random Forests. Ann. Stat. 2019, 47, 1148–1178. [Google Scholar] [CrossRef]
Bartram, S.M.; Hou, K.; Kim, S. Real Effects of Climate Policy: Financial Constraints and Spillovers. J. Financ. Econ. 2022, 143, 668–696. [Google Scholar] [CrossRef]
Walker, K.; Wan, F. The Harm of Symbolic Actions and Green-Washing: Corporate Actions and Communications on Environmental Performance and Their Financial Implications. J. Bus. Ethics 2012, 109, 227–242. [Google Scholar] [CrossRef]
Pankratz, N.; Bauer, R.; Derwall, J. Climate Change, Firm Performance, and Investor Surprises. Manag. Sci. 2023, 69, 7352–7398. [Google Scholar] [CrossRef]
Huynh, T.D.; Nguyen, T.H.; Truong, C. Climate Risk: The Price of Drought. J. Corp. Financ. 2020, 65, 101750. [Google Scholar] [CrossRef]
Ilhan, E.; Sautner, Z.; Vilkov, G. Carbon Tail Risk. Rev. Financ. Stud. 2021, 34, 1540–1571. [Google Scholar] [CrossRef]
Porter, M.E.; Linde, C.V.D. Toward a New Conception of the Environment-Competitiveness Relationship. J. Econ. Perspect. 1995, 9, 97–118. [Google Scholar] [CrossRef]
Ambec, S.; Cohen, M.A.; Elgie, S.; Lanoie, P. The Porter Hypothesis at 20: Can Environmental Regulation Enhance Innovation and Competitiveness? Rev. Environ. Econ. Policy 2013, 7, 2–22. [Google Scholar] [CrossRef]
Dechezleprêtre, A.; Sato, M. The Impacts of Environmental Regulations on Competitiveness. Rev. Environ. Econ. Policy 2017, 11, 183–206. [Google Scholar] [CrossRef]
Krueger, P.; Sautner, Z.; Starks, L.T. The Importance of Climate Risks for Institutional Investors. Rev. Financ. Stud. 2020, 33, 1067–1111. [Google Scholar] [CrossRef]
Stroebel, J.; Wurgler, J. What Do You Think about Climate Finance? J. Financ. Econ. 2021, 142, 487–498. [Google Scholar] [CrossRef]
Berg, F.; Kölbel, J.F.; Rigobon, R. Aggregate Confusion: The Divergence of ESG Ratings. Rev. Financ. 2022, 26, 1315–1344. [Google Scholar] [CrossRef]
Pástor, Ľ.; Stambaugh, R.F.; Taylor, L.A. Dissecting Green Returns. J. Financ. Econ. 2022, 146, 403–424. [Google Scholar] [CrossRef]
Sautner, Z.; Van Lent, L.; Vilkov, G.; Zhang, R. Firm-Level Climate Change Exposure. J. Financ. 2023, 78, 1449–1498. [Google Scholar] [CrossRef]
Azar, J.; Duro, M.; Kadach, I.; Ormazabal, G. The Big Three and Corporate Carbon Emissions around the World. J. Financ. Econ. 2021, 142, 674–696. [Google Scholar] [CrossRef]
Flammer, C. Corporate Green Bonds. J. Financ. Econ. 2021, 142, 499–516. [Google Scholar] [CrossRef]
Zhou, H.; Chen, J.; Ding, X.; Qin, Q.; Han, L. Future Climate Change Will Strengthen Cotton Production but Have Substantial Environmental Costs—A Focus on Xinjiang by APSIM Modelling. J. Clean. Prod. 2025, 491, 144803. [Google Scholar] [CrossRef]
Tashman, P.; Marano, V.; Kostova, T. Walking the Walk or Talking the Talk? Corporate Social Responsibility Decoupling in Emerging Market Multinationals. J. Int. Bus. Stud. 2019, 50, 153–171. [Google Scholar] [CrossRef]
Delmas, M.A.; Burbano, V.C. The Drivers of Greenwashing. Calif. Manag. Rev. 2011, 54, 64–87. [Google Scholar] [CrossRef]
Jensen, M.C.; Meckling, W.H. Theory of the Firm: Managerial Behavior, Agency Costs and Ownership Structure. J. Financ. Econ. 1976, 3, 305–360. [Google Scholar] [CrossRef]
Luo, T.; Gilmanova, L.; Kaskel, S. Advances of MOFs and COFs for Photocatalytic CO₂ Reduction, H₂ Evolution and Organic Redox Transformations. Coord. Chem. Rev. 2023, 490, 215210. [Google Scholar] [CrossRef]
Xu, G.; Lu, N.; Tong, Y. Greenwashing and Credit Spread: Evidence from the Chinese Green Bond Market. Financ. Res. Lett. 2022, 48, 102927. [Google Scholar] [CrossRef]
Becker, C.L.; Defond, M.L.; Jiambalvo, J.; Subramanyam, K.R. The Effect of Audit Quality on Earnings Management*. Contemp. Account. Res. 1998, 15, 1–24. [Google Scholar] [CrossRef]
Yu, F. Analyst Coverage and Earnings Management. J. Financ. Econ. 2008, 88, 245–271. [Google Scholar] [CrossRef]
Dyck, A.; Lins, K.V.; Roth, L.; Wagner, H.F. Do Institutional Investors Drive Corporate Social Responsibility? International Evidence. J. Financ. Econ. 2019, 131, 693–714. [Google Scholar] [CrossRef]
Mahran, K.; Elamer, A.A. Chief Executive Officer (CEO) and Corporate Environmental Sustainability: A Systematic Literature Review and Avenues for Future Research. Bus. Strategy Environ. 2024, 33, 1977–2003. [Google Scholar] [CrossRef]
Kim, E.-H.; Lyon, T.P. Greenwash vs. Brownwash: Exaggeration and Undue Modesty in Corporate Sustainability Disclosure. Organ. Sci. 2015, 26, 705–723. [Google Scholar] [CrossRef]
Teti, E.; Etro, L.L.; Pausini, L. Does Greenwashing Affect Company’s Stock Price? Evidence from Europe. Int. Rev. Financ. Anal. 2024, 93, 103195. [Google Scholar] [CrossRef]
Bernile, G.; Bhagwat, V.; Rau, P.R. What Doesn’t Kill You Will Only Make You More Risk-Loving: Early-Life Disasters and CEO Behavior. J. Financ. 2017, 72, 167–206. [Google Scholar] [CrossRef]
Malmendier, U.; Tate, G. CEO Overconfidence and Corporate Investment. J. Financ. 2005, 60, 2661–2700. [Google Scholar] [CrossRef]
Peters, F.S.; Wagner, A.F. The Executive Turnover Risk Premium. J. Financ. 2014, 69, 1529–1563. [Google Scholar] [CrossRef]
Chouksey, A.; Rajan, A.K.; Gurjar, V.; Tiwari, R.; Mishra, P.K. The Green Paradox: The Climate, Environmental, and Sustainability Implications of Artificial Intelligence. Glob. Environ. Change Adv. 2026, 6, 100029. [Google Scholar] [CrossRef]
Fich, E.M.; Harford, J.; Tran, A.L. Disloyal Managers and Shareholders’ Wealth. Rev. Financ. Stud. 2023, 36, 1837–1888. [Google Scholar] [CrossRef]
Xu, X. Generalist CEOs, Management Risk and Internal Control Weaknesses. J. Bus. Financ. Account. 2024, 51, 209–239. [Google Scholar] [CrossRef]
Tian, B.; Yu, J.; Tian, Z. The Impact of Market-Based Environmental Regulation on Corporate ESG Performance: A Quasi-Natural Experiment Based on China’s Carbon Emission Trading Scheme. Heliyon 2024, 10, e26687. [Google Scholar] [CrossRef]
Bai, H.; Yu, M.; Park, J.-M. Can Digital Transformation Cure “Greenwashing”? Financ. Res. Lett. 2025, 86, 108747. [Google Scholar] [CrossRef]
Hirshleifer, D.; Low, A.; Teoh, S.H. Are Overconfident CEOs Better Innovators? J. Financ. 2012, 67, 1457–1498. [Google Scholar] [CrossRef]
Custódio, C.; Ferreira, M.A.; Matos, P. Generalists versus Specialists: Lifetime Work Experience and Chief Executive Officer Pay. J. Financ. Econ. 2013, 108, 471–492. [Google Scholar] [CrossRef]
Custódio, C.; Ferreira, M.A.; Matos, P. Do General Managerial Skills Spur Innovation? Manag. Sci. 2019, 65, 459–476. [Google Scholar] [CrossRef]
Ferris, G.R.; Ellen, B.P.; McAllister, C.P.; Maher, L.P. Reorganizing Organizational Politics Research: A Review of the Literature and Identification of Future Research Directions. Annu. Rev. Organ. Psychol. Organ. Behav. 2019, 6, 299–323. [Google Scholar] [CrossRef]
Guo, K.; Ji, Q.; Zhang, D. A Dataset to Measure Global Climate Physical Risk. Data Brief 2024, 54, 110502. [Google Scholar] [CrossRef]
Xie, S.; Duan, J.; Yang, Y. Climate Change and Corporate Operating Risk: Evidence from China. Sustainability 2025, 17, 3683. [Google Scholar] [CrossRef]
Malmendier, U.; Tate, G. Who Makes Acquisitions? CEO Overconfidence and the Market’s Reaction. J. Financ. Econ. 2008, 89, 20–43. [Google Scholar] [CrossRef]
Goel, A.M.; Thakor, A.V. Overconfidence, CEO Selection, and Corporate Governance. J. Financ. 2008, 63, 2737–2784. [Google Scholar] [CrossRef]
Bernile, G.; Bhagwat, V.; Yonker, S. Board Diversity, Firm Risk, and Corporate Policies. J. Financ. Econ. 2018, 127, 588–612. [Google Scholar] [CrossRef]
Malmendier, U.; Tate, G.; Yan, J. Overconfidence and Early-Life Experiences: The Effect of Managerial Traits on Corporate Financial Policies. J. Financ. 2011, 66, 1687–1733. [Google Scholar] [CrossRef]
Yim, S. The Acquisitiveness of Youth: CEO Age and Acquisition Behavior. J. Financ. Econ. 2013, 108, 250–273. [Google Scholar] [CrossRef]
Huang, J.; Kisgen, D.J. Gender and Corporate Finance: Are Male Executives Overconfident Relative to Female Executives? J. Financ. Econ. 2013, 108, 822–839. [Google Scholar] [CrossRef]
Giannetti, M.; Zhao, M. Board Ancestral Diversity and Firm-Performance Volatility. J. Financ. Quant. Anal. 2019, 54, 1117–1155. [Google Scholar] [CrossRef]
Tone, K. A Slacks-Based Measure of Efficiency in Data Envelopment Analysis. Eur. J. Oper. Res. 2001, 130, 498–509. [Google Scholar] [CrossRef]
Zhang, D. Green Financial System Regulation Shock and Greenwashing Behaviors: Evidence from Chinese Firms. Energy Econ. 2022, 111, 106064. [Google Scholar] [CrossRef]
Hu, X.; Hua, R.; Liu, Q.; Wang, C. The Green Fog: Environmental Rating Disagreement and Corporate Greenwashing. Pac.-Basin Financ. J. 2023, 78, 101952. [Google Scholar] [CrossRef]
Loughran, T.; McDonald, B.; Yun, H. A Wolf in Sheep’s Clothing: The Use of Ethics-Related Terms in 10-K Reports. J. Bus. Ethics 2009, 89, 39–49. [Google Scholar] [CrossRef]
Hoberg, G.; Lewis, C. Do Fraudulent Firms Produce Abnormal Disclosure? J. Corp. Financ. 2017, 43, 58–85. [Google Scholar] [CrossRef]
Gopalan, R.; Milbourn, T.; Song, F.; Thakor, A.V. Duration of Executive Compensation. J. Financ. 2014, 69, 2777–2817. [Google Scholar] [CrossRef]
Lang, M.; Lundholm, R. Corporate Disclosure Policy and Analyst Behavior. Account. Rev. 1996, 71, 467–492. [Google Scholar] [CrossRef]
Liu, Y.; Han, J. Climate Risk Disclosure and Financial Analysts’ Forecasts: Evidence from China. Sustainability 2025, 17, 3178. [Google Scholar] [CrossRef]
Wahyuningrum, I.F.S.; Baroroh, N.; Yanto, H.; Hidayah, R.; Puspita, A.S.; Elviana, L.D. Corporate Governance: Driving Climate Change Disclosure and Advancing SDGs. J. Risk Financ. Manag. 2025, 18, 234. [Google Scholar] [CrossRef]
Kennedy, E.H. Towards Optimal Doubly Robust Estimation of Heterogeneous Causal Effects. Electron. J. Stat. 2023, 17, 3008–3049. [Google Scholar] [CrossRef]
Liaw, A.; Wiener, M. Classification and Regression by randomForest. R News 2002, 2, 18–22. [Google Scholar]
Belloni, A.; Chernozhukov, V.; Hansen, C. High-Dimensional Methods and Inference on Structural and Treatment Effects. J. Econ. Perspect. 2014, 28, 29–50. [Google Scholar] [CrossRef]
Arellano, M.; Bover, O. Another Look at the Instrumental Variable Estimation of Error-Components Models. J. Econom. 1995, 68, 29–51. [Google Scholar] [CrossRef]
Tibshirani, R. Regression Shrinkage and Selection Via the Lasso. J. R. Stat. Soc. Ser. B Stat. Methodol. 1996, 58, 267–288. [Google Scholar] [CrossRef]
Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
Liu, W.B.; Meng, W.; Li, X.X.; Zhang, D.Q. DEA Models with Undesirable Inputs and Outputs. Ann. Oper. Res. 2010, 173, 177–194. [Google Scholar] [CrossRef]
Ke, G.; Meng, Q.; Finley, T.; Wang, T.; Chen, W.; Ma, W.; Ye, Q.; Liu, T.-Y. LightGBM: A Highly Efficient Gradient Boosting Decision Tree. In Proceedings of the Advances in Neural Information Processing Systems; Guyon, I., Luxburg, U.V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., Garnett, R., Eds.; Curran Associates, Inc.: Red Hook, NY, USA, 2017; Volume 30. [Google Scholar]
Farbmacher, H.; Huber, M.; Lafférs, L.; Langen, H.; Spindler, M. Causal Mediation Analysis with Double Machine Learning. Econom. J. 2022, 25, 277–300. [Google Scholar] [CrossRef]
Chernozhukov, V.; Newey, W.K.; Singh, R. Automatic Debiased Machine Learning of Causal and Structural Effects. Econometrica 2022, 90, 967–1027. [Google Scholar] [CrossRef]

Figure 1. Empirical Framework.

Figure 2. DML Framework.

Figure 3. Robustness of PCA.

Figure 4. Moderating effects of CEO risk preferences.

Figure 5. Mechanism DAG.

Figure 6. Moderated mechanism channels.

Figure 7. Features contribution.

Figure 8. Multi-dimensional heterogeneous marginal effect trends.

Figure 9. SHAP analysis.

Figure 10. Correlation matrix of selected variables.

Table 1. Validity testing of enterprise transformation risk indicators.

Variable	Transition_Risk	No. of Provincial Low-Carbon Policies	Investment in Industrial Pollution Control
Transition_Risk	1.000
No. of provincial low-carbon policies	0.345 ***	1.000
Investment in industrial pollution control	0.288 ***	0.413	1.000

Note: *** denoting significance at the 1% level.

Table 2. Hyperparameter grid search configuration and optimal results.

Learner	Hyperparameter	Search Scope/Tuning Method	Selected Value	Select Criteria/Reasons
Random Forest	N_estimators	{300, 500,1000}	500	5-folds cross-validation R² maximization
	Max_depth	{5, 7, 10}	7	Control the depth of the tree to avoid overfitting, 5-folds cross-validation R² maximization
	Min_sample_leaf	{5, 10, 20}	10	5-folds cross-validation R² maximization
Lasso	alpha	LassoCV is equipped with an automatic search for cross-validation	Automatic
Overall DML Settings	N_folds	5 (Standard)	5	Standardized setting
	N_rep	10 (Standard)	10	Standardized setting

Table 3. Double Selection Results (Lasso with alpha = 0.01).

Target Variable	Threshold α	Original #(number) of X	Selected
GW_score	0.01	33	19
Real_Eff (Y)	0.01	33	13
Physical_Risk (T)	0.01	33	22
Transition_Risk (T)	0.01	33	22
Final Union	0.01	33	29

Table 4. Orthogonality diagnostics—residual correlation coefficients and significance tests.

Outcome (Y)	Treatment (T)	Model	CV R² (E[Y\|X])	CV R² (E[T\|X])	ρ (Residual Correlation)	p-Value
GW_score	Physical_Risk	RF	0.298	0.435	0.044	0.49
GW_score	Physical_Risk	Lasso	0.254	0.353	0.045	0.37
GW_score	Transition_Risk	RF	0.219	0.558	0.019	0.16
GW_score	Transition_Risk	Lasso	0.154	0.579	0.099	0.32
Real_Eff	Physical_Risk	RF	0.566	0.455	0.021	0.53
Real_Eff	Physical_Risk	Lasso	0.705	0.353	−0.011	0.40
Real_Eff	Transition_Risk	RF	0.566	0.574	0.033	0.91
Real_Eff	Transition_Risk	Lasso	0.705	0.579	−0.005	0.72

Note: The ρ and p values are used to test orthogonality; CV R² represents the cross-validation goodness-of-fit of the first-stage model for E[Y|X] and E[T|X] (5-fold), and a positive value indicates that the model has basic predictive capabilities.

Table 5. DML main results.

Variables	GW_score	GW_score	DGW	DGW	Real_Eff	Real_Eff	GW_score	GW_score	DGW	DGW	Real_Eff	Real_Eff
Physical Risk	0.7115 ***	0.3884 ***	0.1226 ***	0.1308 ***	0.7688	0.8204
	(2.906)	(8.455)	(5.256)	(6.393)	(1.295)	(1.296)
Transition Risk							0.8204 ***	0.4037 ***	0.1221 ***	0.1313 ***	1.1043	0.9119
							(3.035)	(6.611)	(5.185)	(6.449)	(1.579)	(1.289)
Controls	Yes	Yes	Yes	Yes	Yes	Yes	Yes	Yes	Yes	Yes	Yes	Yes
FEs	Yes	Yes	Yes	Yes	Yes	Yes	Yes	Yes	Yes	Yes	Yes	Yes
Model	RF	Lasso	RF	Lasso	RF	Lasso	RF	Lasso	RF	Lasso	RF	Lasso
Residual R²	0.03	0.01	0.01	0.01	0.04	0.04	0.04	0.01	0.01	0.02	0.11	0.02
N	4395	4395	4395	4395	4395	4395	4395	4395	4395	4395	4395	4395

Note: This table reports the baseline DML estimation results using Random Forest and Lasso learners across 29 high-dimensional covariates after double selection (standardized), t-statistics based on robust standard errors are in parentheses, with *** denoting significance at the 1% levels.

Table 6. DML Mechanisms.

	CEO Myopia Channel		Disclosure Opacity Channel
Variables	Myopia	GW_score	Opacity	GW_score
Transition_Risk	0.3005 ***		0.2007 ***
	(3.538)		(8.037)
Myopia		1.2463 ***
		(6.690)
Opacity				1.3642 ***
				(2.625)
Indirect effect	0.3745 *** (5.122)		0.2738 ** (2.459)
Outcome SD	0.5942		0.2255
E-Value for point estimate of indirect effect	3.459		2.118
E-value for lower 95% CI bound of indirect effect	2.461		1.352
Moderator	CEO_risk_pref	CEO_risk_pref	CEO_risk_pref	CEO_risk_pref
Controls	Yes	Yes	Yes	Yes
FEs	Yes	Yes	Yes	Yes
Model	RF	RF	RF	RF
Residual R-squared	0.03	0.01	0.04	0.04
Observations	4395	4395	4395	4395

Note: This table reports the DML mediation estimation results moderated by CEO risk preference using Random Forest learner across 29 high-dimensional covariates (standardized) after double selection, t-statistics based on robust standard errors are in parentheses, with *** and ** denoting significance at the 1%, 5% levels, respectively. The indirect effect (product of coefficients a·b) is computed via the product method, and its t-statistic as well as the corresponding p-value are obtained using the Delta method to approximate the standard error. The E-value quantifies the minimum strength of association that an unmeasured confounder would need to have with both the treatment (climate transition risk) and the outcome (greenwashing) to fully explain away the estimated indirect effect, with point estimates and lower bounds of the 95% confidence interval reported. R-squared values are predictive (based on the underlying random forest models) and are provided for descriptive purposes only; they do not represent causal goodness-of-fit. All models include firm and year fixed effects where applicable.

Table 7. Firm profiling.

Variable	Low Sens. (Mean)	High Sens. (Mean)	Diff	T-Value
Digitalization	0.0125	0.1300	0.1275 ***	3.968
ROA	−0.1509	0.3050	0.4559 ***	6.973
Growth	0.0102	0.1148	0.1046	0.759
Tang_Ratio	0.0131	−0.0323	−0.0454	−0.677
Quick	−0.3413	0.4414	0.7827 ***	13.493
Lev	0.6791	−0.9532	−1.6323 ***	−43.414
Wage	0.1427	−0.0594	−0.2020 **	−2.090
Asset_Growth	0.0869	−0.0418	−0.1287	−1.595
Cashflow	−0.0890	0.1482	0.2371 ***	3.491
Subsidy	0.0471	−0.0069	−0.0540	−0.794

Note: This profiling is conducted on an independent testing set (N = 879), representing 20% of the total observations to avoid over-fitting. *** and ** denoting significance at the 1% and 5% levels, respectively.

Table 8. Firm profiling placebo test.

Variable	Low Sens. (Mean)	High Sens. (Mean)	Diff	T-Value
Digitalization	0.0220	−0.1413	−0.1633	−1.109
ROA	0.0605	0.0518	−0.0088	−0.131
Growth	0.1210	−0.0179	−0.1389	−1.009
Tang_Ratio	0.0108	−0.0295	−0.0403	−0.601
Quick	−0.0236	0.0610	0.0847	1.330
Lev	−0.0430	−0.0885	−0.0454	−0.681
Wage	0.0664	0.0320	−0.0343	−0.354
Asset_Growth	0.0543	−0.0028	−0.0571	−0.707
Cashflow	0.0222	0.0150	−0.0072	−0.105
Subsidy	0.0507	−0.0112	−0.0620	−0.912

Note: Group labels are randomly assigned to test the null hypothesis. T-values and significance markers are expected to vanish.

Table 9. IV-DML.

Variables	Stage I: IV		Stage II: IV-DML
Variables	Transition_Risk	Physical_Risk	GW_score
CPU_lg1×Sensitivity Index	0.3103 *** (3.667)
EVO_lg1×Sensitivity Index		0.2653 *** (4.294)
Phy_risk_hat			0.4038 *** (3.389)
Tra_risk_hat			0.3006 *** (4.358)
Controls	Yes	Yes	Yes
FEs	Yes	Yes	Yes
AR_P	0.00	0.00
Wald_F-statistics	26.88	37.16
Model	RF	RF	RF
N	4395	4395	4395

Note: t-statistics in the parentheses, *** denoting significance at the 1% level. Control function residuals enter the second stage insignificantly (p = 0.342 for Trans, p = 0.287 for Phys), supporting the exclusion restriction.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Ma, S.; Hou, J.; Niu, H.; Chen, H.H. Climate Risk, CEO Risk Preference, and Corporate Greenwashing in High-Emission Industry: A Debiased Machine Learning Approach. Sustainability 2026, 18, 5174. https://doi.org/10.3390/su18105174

AMA Style

Ma S, Hou J, Niu H, Chen HH. Climate Risk, CEO Risk Preference, and Corporate Greenwashing in High-Emission Industry: A Debiased Machine Learning Approach. Sustainability. 2026; 18(10):5174. https://doi.org/10.3390/su18105174

Chicago/Turabian Style

Ma, Shijie, Jingzhi Hou, Haoran Niu, and Hsing Hung Chen. 2026. "Climate Risk, CEO Risk Preference, and Corporate Greenwashing in High-Emission Industry: A Debiased Machine Learning Approach" Sustainability 18, no. 10: 5174. https://doi.org/10.3390/su18105174

APA Style

Ma, S., Hou, J., Niu, H., & Chen, H. H. (2026). Climate Risk, CEO Risk Preference, and Corporate Greenwashing in High-Emission Industry: A Debiased Machine Learning Approach. Sustainability, 18(10), 5174. https://doi.org/10.3390/su18105174

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Climate Risk, CEO Risk Preference, and Corporate Greenwashing in High-Emission Industry: A Debiased Machine Learning Approach

Abstract

1. Introduction

2. Literature Review

2.1. The Microeconomic Consequences of Climate Risks

2.2. The Driving Mechanism of Enterprises’ Greenwashing Behavior

2.3. Executive Traits and the Technological Paradox

3. Materials and Methods

3.1. Data Collecting

3.2. Climate Risk

3.2.1. Physical Risk

3.2.2. Transition Risk

3.3. CEO Risk Preference

3.4. Real Pollution Control Efficiency

3.5. Greenwash

3.6. Mechanisms: CEO Myopia and Disclosure Opacity

3.7. Covariates

3.8. Residual Regression and DML Model Training

4. Results

4.1. DML Settings

4.1.1. Hyperparameter Settings

4.1.2. Covariates Selection

4.1.3. Orthogonality Diagnostics

4.2. DML Main Results

4.3. Moderating Role of CEO Risk Preferences

4.4. Mechanism Channels

4.5. Heterogeneity Analysis Based on Causal Forest

4.5.1. Features Contribution

4.5.2. Heterogeneous Marginal Effect Trends

4.5.3. SHAP Analysis

4.5.4. Firm Profiling: Who Are the Greenwashers

4.6. Endogeneity Problems and Robustness Checks

5. Discussion

5.1. The “Threshold-Trigger” Mechanism: Why Climate Risks Drive Non-Linear Defensive Greenwashing

5.2. Managerial Risk-Taking: The Psychological “Accelerator” of Regulatory Arbitrage

5.3. The Capability Paradox: Digitalization and the Efficiency of Deception

5.4. Boundary Conditions and Generalizability

6. Conclusions and Policy Recommendations

6.1. Conclusions

6.2. Policy Recommendations

6.3. Limitations

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI