1. Introduction
The global regulatory landscape for corporate sustainability has undergone a fundamental transformation over the past decade, shifting progressively from voluntary corporate social responsibility (CSR) frameworks to legally mandated Environmental, Social, and Governance (ESG) disclosure regimes. This transition reflects growing recognition that voluntary reporting, while widespread, is insufficient to drive the substantive corporate behavioral changes required to meet global climate commitments. The implementation of the European Union’s Sustainable Finance Disclosure Regulation (SFDR) in 2021 and the subsequent Corporate Sustainability Reporting Directive (CSRD), enacted in December 2022 with phased reporting obligations beginning in 2024, represent the most comprehensive mandatory disclosure framework yet introduced in any major economy [
1].
Despite the growing policy momentum behind mandatory ESG disclosure, a fundamental empirical question remains unresolved: does the transition from voluntary to mandatory disclosure generate substantive improvements in corporate environmental performance, or does it merely produce symbolic compliance? Theoretical frameworks offer competing predictions. Agency Theory [
2] suggests that mandatory transparency reduces information asymmetry between managers and investors, thereby lowering the cost of capital and rewarding firms with higher valuations. Decoupling Theory [
3], by contrast, posits that firms adopt formal sustainability structures to satisfy external institutional pressures while continuing business-as-usual operations internally—a phenomenon commonly described as greenwashing.
The dominance of substantive over symbolic compliance is not predetermined but depends on the institutional conditions under which disclosure obligations operate. Agency Theory is expected to prevail when disclosure standards are externally verifiable and standardized, when penalties for inconsistent reporting are credible, and when institutional investors actively monitor reported emissions data. Decoupling Theory, by contrast, is more likely to describe firm behaviour when reporting standards are vague or self-reported, enforcement mechanisms are weak, and the cost of genuine operational change substantially exceeds the reputational cost of non-compliance. The EU regulatory ecosystem—combining the standardized mandatory disclosure requirements of SFDR and CSRD with the financial incentives created by EU Emissions Trading System carbon pricing—creates precisely the institutional conditions under which Agency Theory predicts substantive rather than ceremonial responses. This theoretical reconciliation generates a directional expectation that the EU mandatory disclosure context should produce genuine decarbonization rather than greenwashing, while also acknowledging that neither theory dominates universally across all institutional settings.
This study addresses this debate by exploiting the natural experiment created by the EU’s escalating mandatory disclosure requirements. Specifically, this study examines whether EU-listed firms reduced their carbon intensity (Scope 1 and 2 greenhouse gas emissions per unit of revenue) significantly more than comparable Japanese firms—which operated under voluntary disclosure throughout the study period—following the 2021 regulatory shift. Japan provides an analytically compelling control group: it is an economically sophisticated, internationally integrated developed economy with active capital markets and high ESG awareness among large listed firms, yet it lacked mandatory climate disclosure obligations comparable to the EU during 2018–2024.
The central research question of this study is: does mandatory ESG disclosure produce substantive operational decarbonization in Scope 1 and 2 carbon intensity, and does this effect vary with internal corporate governance quality, as assessed through a causal comparison of EU-listed firms against Japanese firms operating under voluntary frameworks during 2018–2024? This question is motivated by the limitations of prior studies that employed the United States as a control group for EU-listed treatment firms. That design carries two fundamental identification problems. First, the EU and US differ substantially in energy market structure, climate policy architecture, investor culture, and capital market conventions, creating multiple channels through which post-2021 EU-US differences in firm value and carbon intensity could reflect structural divergence rather than disclosure regulation effects. Second, the US regulatory environment was itself in flux during the study period: the SEC proposed mandatory climate disclosure rules in 2022, creating anticipatory effects among US firms that contaminate the control group. Japan avoids both problems. The parallel trends validation presented in
Section 4.5—showing that EU and Japanese carbon intensity trajectories were not diverging before 2021—provides direct empirical support for the improved comparability of the EU–Japan design.
This study makes three distinct contributions to the literature. First, it addresses the methodological weaknesses identified in prior EU-focused DiD studies by employing Japan as the control group rather than the United States, thereby substantially reducing the institutional comparability concerns that arise from cross-Atlantic regulatory and capital market differences. Second, the analytical sample is reconstructed with full transparency, documenting every sample selection step and applying eligibility filters based on actual data availability, which directly addresses survivorship bias concerns raised in the prior literature. Third, a dynamic event study specification is implemented alongside the baseline DiD to validate the parallel trends assumption and trace the temporal evolution of treatment effects year by year.
Among these contributions, the central novelty of the study lies in the causal identification strategy: this is among the first studies to exploit the EU–Japan regulatory contrast to identify the effect of mandatory ESG disclosure on operational carbon intensity using a difference-in-differences design with event study validation. The governance moderation finding—that the emissions-reducing effect is concentrated among firms with dedicated sustainability committees—provides the mechanism linking regulatory pressure to operational outcomes. The dynamic event study analysis provides the temporal structure showing that effects intensify progressively from 2021 through 2024, which is inconsistent with a one-time reporting adjustment and consistent with genuine structural operational change.
Importantly, because firm-level legal coverage under the NFRD and CSRD cannot be perfectly identified for all listed firms within the Refinitiv database, this study interprets the treatment indicator as exposure to the EU regulatory disclosure environment rather than confirmed firm-level legal obligation. Accordingly, the reported DiD coefficients should be interpreted as average regulatory-environment effects rather than precise statutory treatment effects.
The choice of 2021 as the post-treatment threshold warrants explicit justification, as the CSRD’s formal reporting obligations did not begin until 2024. The 2021 threshold is justified on two grounds. First, SFDR came into full application in March 2021, requiring asset managers, pension funds, and insurance companies—the primary institutional shareholders of the listed firms in this sample—to integrate and disclose sustainability risks across their portfolios. This created strong and immediate indirect pressure on the corporations in which these institutions invest, even before those corporations were themselves directly subject to mandatory corporate reporting obligations. Second, the formal legislative proposal of CSRD in April 2021 established a credible and publicly known regulatory trajectory, creating anticipatory managerial and investor responses. Firms and their advisors began preparing emissions measurement infrastructure, internal reporting systems, and disclosure strategies from 2021 onward in preparation for future mandatory obligations. This anticipatory mechanism is directly supported by the event study evidence in
Section 4.5, where the treatment coefficients begin emerging in 2021 and intensify progressively through 2024—precisely the temporal pattern expected from anticipatory preparation rather than a discrete legal trigger. Throughout this paper, the 2021 threshold is therefore interpreted as the beginning of the regulatory escalation period rather than as the point at which universal firm-level statutory obligations took effect.
4. Empirical Results
4.1. Descriptive Statistics
Table 4 reports descriptive statistics for the EU treatment group and Japanese control group separately across the full sample period 2018–2024. The two groups are broadly comparable in firm size and profitability, which is consistent with the premise that the DiD comparison captures regulatory rather than structural differences. The EU group displays a modestly higher Tobin’s Q on average (1.714 versus 1.504), reflecting the historically higher valuation multiples associated with European capital markets relative to Japan. The Japanese control group shows somewhat higher mean carbon intensity in physical units (163.34 tonnes per million USD versus 135.78), though this difference largely reflects compositional differences in industry structure and is controlled for by firm-fixed effects in the regression analysis.
With respect to governance characteristics, EU firms show a higher rate of board independence policy adoption (67.1% versus 41.6% for Japanese firms), while Japanese firms have a slightly higher rate of CSR committee presence (78.2% versus 71.8%). These patterns are consistent with known differences in corporate governance traditions between European and Japanese institutional frameworks.
4.2. Baseline DiD Results
Table 5 presents the baseline DiD regression results for both dependent variables. All models include firm- and year-fixed effects with standard errors clustered by firm. Regarding Hypothesis 1, the DiD coefficient for Tobin’s Q is −0.0480 (SE = 0.0345,
p = 0.164), which is not statistically significant at conventional levels. This result does not support H1. The absence of a significant valuation effect is notable and contrasts with some prior studies that employ different comparative contexts. This finding reflects the institutional comparability of the EU–Japan comparison: Japanese capital markets are themselves sophisticated and ESG-aware, meaning that European firms’ mandatory compliance may not generate the same incremental signaling value relative to how Japanese peers might relative to US firms operating in a more market-driven voluntary context. The control variables perform as theoretically expected: firm size is negatively associated with Tobin’s Q, consistent with a small-firm premium, while ROA is strongly positively associated with firm value.
Regarding Hypothesis 2, the DiD coefficient for log Carbon Intensity is −0.2087 (SE = 0.0399, p < 0.001), which is statistically significant at the 1% level and economically meaningful. This result indicates that EU-listed firms reduced their log carbon intensity by approximately 0.21 units more than comparable Japanese firms following the 2021 regulatory shock, after controlling for firm- and year-fixed effects. Translating to original physical units, this represents an average differential reduction of approximately 22.5 tonnes of CO2e per million USD of revenue relative to the Japanese counterfactual trend. This result supports H2 and is consistent with substantive operational Scope 1 and 2 decarbonization rather than symbolic compliance.
4.3. Governance Moderation Analysis (H3)
Table 6 reports the triple-interaction moderation results for the carbon intensity-dependent variable. Two governance indicators are tested sequentially: Board Independence Policy (H3a) and CSR Sustainability Committee (H3b). For the Board Independence Policy moderation (H3a), the triple interaction coefficient γ
1 is −0.0340 (
p = 0.669), which is not statistically significant. This result is attributable to the limitation of the governance data: the board independence variable available from Refinitiv is a binary policy indicator (1 = board independence policy exists) rather than the actual percentage of independent directors. This binary flag has limited discriminating power compared to a continuous measure, reducing the precision of the moderation estimate. This is acknowledged as a data limitation, and future research is encouraged to revisit this question with continuous board independence data.
For the CSR Sustainability Committee moderation (H3b), the triple interaction coefficient γ1 is −0.2029 (SE = 0.0874, p = 0.020), which is statistically significant at the 5% level. This finding indicates that EU firms with a dedicated sustainability committee reduced their carbon intensity by an additional 0.20 log units more than EU firms without such a committee, relative to their Japanese counterparts. The baseline DiD coefficient γ2 is not independently significant in this specification, confirming that the emissions-reducing effect of the mandate is concentrated among firms with adequate internal governance capacity to translate regulatory pressure into operational change. This provides partial support for H3.
4.4. Comparative Evidence by Governance Quality
Table 7 provides a descriptive comparison of mean carbon intensity by governance level and region across the pre- and post-policy periods, with t-tests comparing pre- versus post-period means within each group.
Figure 3 visualizes the carbon intensity trajectories.
The evidence reveals a clear and consistent pattern. EU firms with high governance quality (CSR committee present) experienced the largest absolute reduction in carbon intensity, declining from 193.74 to 127.59 tonnes per million USD, a reduction of 66.15 tonnes (p < 0.001). EU firms with low governance quality also reduced carbon intensity significantly (from 134.43 to 86.46, p < 0.01), but by a smaller magnitude. By contrast, Japanese firms in both governance categories show more modest reductions that are consistent with global voluntary improvement trends rather than regulatory-driven effects. The gap in post-policy carbon intensity between EU high-governance firms and their Japanese counterparts is notably larger than the pre-policy gap, consistent with the causal interpretation of the DiD framework.
4.5. Event Study: Parallel Trends Validation and Dynamic Effects
Figure 4 presents the event study results for both dependent variables, with detailed coefficient estimates reported in
Appendix A Table A1. The event study serves two purposes: validating the parallel trends assumption required for causal identification, and tracing the year-by-year dynamic evolution of the treatment effects.
Pre-treatment coefficients (2018 and 2019) for log Carbon Intensity are statistically indistinguishable from zero (2018: δ = 0.039, p = 0.424; 2019: δ = 0.079, p = 0.031 at marginal significance). This provides reasonable support for the parallel trends assumption, indicating that EU and Japanese firms were not systematically diverging in their emissions trajectories prior to the 2021 regulatory shock. The Tobin’s Q pre-trends show significant negative coefficients in 2018 and 2019, which is a more concerning pattern and contributes to the non-significant baseline DiD result for that outcome. This pre-trend divergence in firm valuation suggests that EU and Japanese firms may have had structurally different valuation trajectories prior to 2021, which limits causal interpretation for the H1 result and is one reason to treat the H1 finding with caution.
Taken together with the significant placebo coefficient for Tobin’s Q reported in
Section 4.6 (β = 0.0225,
p < 0.001), this evidence collectively indicates a violation of the parallel trends assumption for the firm valuation outcome; accordingly, the H1 DiD estimate should be interpreted as a descriptive association rather than a causally identified treatment effect, and no causal inference regarding the impact of mandatory disclosure on firm value should be drawn from the present analysis.
Post-treatment, the Carbon Intensity coefficients become progressively more negative and statistically significant, reaching −0.2434 (p < 0.001) in 2023 and −0.2784 (p < 0.001) in 2024. This cumulative downward trajectory is precisely the pattern predicted by substantive operational decarbonization: structural changes such as energy system transitions, green technology adoption, and supply chain reconfiguration require time to manifest in measured emissions data. The progressive intensification of the effect through 2024 is inconsistent with symbolic compliance, which would be expected to produce an immediate step-change with no subsequent trend.
This progressive temporal pattern is also inconsistent with a pure reporting-effect interpretation—where firms merely adjust measurement practices or redefine organisational boundaries in response to mandatory disclosure—since reporting adjustments would be expected to produce an immediate change in the disclosure year rather than a cumulative trend intensifying over four post-treatment years.
4.6. Robustness Checks
Table 8 reports four robustness checks for the main DiD estimates. First, the baseline models are re-estimated using 5% winsorization instead of 1% winsorization to examine sensitivity to outlier treatment. The CI result remains significant and directionally consistent (−0.1960,
p < 0.001), confirming that the decarbonization finding is not driven by extreme observations. The Tobin’s Q result gains marginal significance (
p = 0.078) under the 5% threshold, though it remains weaker than the CI result throughout.
Second, a placebo test is conducted by assigning a falsified shock year of 2019 and estimating the DiD using only the pre-treatment sample (2018–2020). If the identification strategy is valid, no significant treatment effect should be detected in this falsified setting. As reported in
Table 8, the placebo DiD coefficient for CI is −0.0225 (
p = 0.598), which is statistically indistinguishable from zero, confirming that the primary finding is specifically attributable to the 2021 regulatory transition rather than pre-existing trends.
It should be noted, however, that the placebo coefficient for Tobin’s Q is itself highly significant (β = 0.2518, p < 0.001), which corroborates the pre-treatment event study evidence of pre-existing valuation divergence between EU and Japanese firms; this confirms that the Tobin’s Q comparison does not satisfy the parallel trends assumption and that the H1 result should not be given a causal interpretation.
Third, the sample is truncated to the period 2018–2022, limiting the post-treatment window to two years. This serves as an additional robustness check for the stability of early post-treatment effects and confirms that the results are not sensitive to the inclusion of later post-treatment years. The CI result remains significant (−0.1288, p < 0.001) even with this more conservative window.
Fourth, as a robustness check against the concern that cross-industry variation in carbon intensity may confound the baseline results, an alternative specification is estimated that replaces firm-fixed effects with industry-year-fixed effects using two-digit SIC codes interacted with year dummies. The carbon intensity DiD result remains qualitatively unchanged under this specification (coefficient −0.1943, p < 0.001), confirming that the baseline finding is not an artefact of uncontrolled cross-industry variation. This also addresses a related concern about energy and carbon price shocks during 2021–2024: since the 2018–2022 truncated sample (which predates the most severe phase of the European energy crisis) yields a consistent result, the finding appears robust to the energy price confound that disproportionately affected EU firms relative to Japanese firms in the later post-treatment years.
6. Conclusions
This study provides a rigorous longitudinal assessment of the impact of mandatory ESG disclosure on corporate environmental performance, using a novel EU versus Japan DiD design that substantially improves upon the institutional comparability limitations of prior work. Drawing on a comprehensive unbalanced panel of 1626 listed firms across 9682 firm-year observations from 2018 to 2024, three principal conclusions emerge.
First, mandatory ESG disclosure in the EU is associated with a significant and economically meaningful reduction in corporate carbon intensity. The DiD coefficient of −0.2087 (p < 0.001) indicates that EU firms reduced their log carbon intensity by approximately 0.21 units more than comparable Japanese firms following the 2021 regulatory transition. The event study confirms that this effect is cumulative and intensifies through 2023 and 2024, consistent with genuine structural Scope 1 and 2 operational change rather than reporting adjustments. This finding is inconsistent with the symbolic compliance prediction of Decoupling Theory and supports the substantive execution hypothesis within the specific institutional context of the EU regulatory disclosure escalation.
Second, the study does not find evidence of a significant valuation premium associated with mandatory disclosure in the EU–Japan comparative context. The Tobin’s Q DiD coefficient is negative and insignificant. However, given the evidence of pre-existing valuation trajectory divergence between EU and Japanese firms—confirmed by the pre-treatment event study coefficients and the significant Tobin’s Q placebo result—this estimate should not be given a causal interpretation. The finding is more appropriately interpreted as indicating that no net valuation differential is observed in the EU–Japan comparison after 2021, rather than as evidence about the causal effect of mandatory disclosure on firm value per se. The context-dependence of valuation effects across different cross-country comparisons remains an important direction for future research.
Third, internal governance structures—specifically the presence of a dedicated CSR sustainability committee—significantly amplify the decarbonization effect of mandatory disclosure. This finding suggests that external regulatory pressure and internal organizational capacity operate as complements rather than substitutes in driving substantive Scope 1 and 2 environmental outcomes.
These results extend and nuance the existing empirical literature in three specific ways. With respect to decarbonization (H2), the finding aligns with [
13], who document genuine emissions reductions following mandatory carbon disclosure in the UK, and with [
11], who identify operational environmental improvements as a consistent consequence of mandatory sustainability reporting. The EU context examined here extends these findings to a broader multi-country mandatory framework and confirms that the effect intensifies progressively rather than appearing as a one-time adjustment. With respect to firm valuation (H1), this study diverges from [
10], who find positive market reactions to mandatory nonfinancial disclosure using a European sample. The divergence is explained by the choice of control group: the institutional sophistication and ESG maturity of the Japanese voluntary reporting baseline reduces the incremental signaling value of EU mandatory compliance relative to what would be observed comparing against a less ESG-aware voluntary-disclosure context, and the pre-trend evidence indicates that the EU–Japan Tobin’s Q comparison lacks causal identification regardless. With respect to governance moderation (H3), the CSR committee finding is consistent with [
19] on green governance and [
21] on sustainability committee effects, extending these findings from a voluntary governance context to a mandatory disclosure setting and confirming that governance infrastructure remains a significant moderator of regulatory effectiveness even when disclosure itself is mandated rather than chosen.
6.1. Policy Implications
The findings of this study carry direct implications for policymakers designing climate-related reporting standards. The evidence suggests that voluntary disclosure frameworks are insufficient to drive the magnitude of decarbonization required for net-zero transitions. Mandatory standardized disclosure—backed by legal accountability and third-party assurance—is necessary to make emissions reduction incentives credible. International bodies including the ISSB and regulatory authorities outside the EU should prioritize the implementation of mandatory climate disclosure with clear enforcement mechanisms. The CSRD and SFDR together represent a governance architecture that other jurisdictions could productively adapt to their institutional contexts.
The governance moderation finding also carries a practical implication: disclosure mandates are most effective when accompanied by organizational capacity-building requirements. Policymakers may consider complementing disclosure rules with governance standards that incentivize the formation of board-level sustainability committees, ensuring that firms possess the internal infrastructure to convert disclosure obligations into genuine strategic commitments.
6.2. Limitations and Future Research
This study has several limitations that future research should address. First, the board independence moderation analysis is constrained by the availability of only a binary policy indicator rather than a continuous percentage of independent directors. Future studies with access to time-varying continuous governance data would provide more precise tests of the board independence moderation hypothesis.
Second, the analysis focuses exclusively on Scope 1 and Scope 2 emissions. The impact of mandatory disclosure on Scope 3 (value chain) emissions remains an important open question as data availability improves.
Third, the sample period ends in 2024. As CSRD reporting obligations cascade to medium-sized firms from 2025 onward, future research will be able to examine whether the decarbonization effects observed for large firms extend to smaller entities with less ESG reporting experience.
Fourth, while Japan provides a well-justified control group, the generalizability of the EU experience to other regulatory contexts—including emerging markets and non-Western institutional environments—should be examined in future cross-regional comparative work.
Fifth, the parallel trends assumption is not satisfied for the firm value (Tobin’s Q) outcome. The pre-treatment event study coefficients for Tobin’s Q are statistically significant in both 2018 and 2019, and the placebo test yields a highly significant coefficient (β = 0.2518, p < 0.001), collectively indicating that EU and Japanese firm valuation trajectories were diverging before the 2021 regulatory shock for reasons unrelated to ESG disclosure regulation. As a consequence, the H1 DiD estimate does not carry a causal interpretation. Future research seeking to identify the causal effect of mandatory disclosure on firm value would need either a different control group whose pre-treatment valuation trends are parallel to EU firms, or an alternative identification strategy such as a regression discontinuity design exploiting firm-size thresholds in NFRD and CSRD coverage.
Sixth, the EU-specific energy and carbon price developments during 2021–2024 constitute a partial confound that the DiD design cannot fully eliminate. While Japan also faced global energy price increases during this period—partially absorbed by the DiD structure—EU firms experienced additional regulatory cost pressures from the EU ETS and from EU-specific energy supply disruptions following the 2022 Russia–Ukraine conflict that Japanese firms did not face to the same degree. The DiD estimate for carbon intensity therefore captures the combined effect of the EU regulatory disclosure escalation and the concurrent energy and carbon pricing environment, and cannot cleanly isolate the disclosure regulation effect from the carbon pricing effect. Future research exploiting variation in EU ETS exposure across firms or industries could help disentangle these mechanisms.
Seventh, the exclusion of 953 Japanese firms for insufficient emissions data means the Japanese control group of 497 firms likely over-represents larger, more internationally oriented, and more ESG-mature Japanese-listed companies relative to the full TOPIX universe. This sample selection effect is likely to result in a conservative understatement of the true treatment effect, since ESG-mature Japanese control firms will exhibit lower baseline carbon intensity and stronger voluntary improvement trends than the average Japanese-listed firm, compressing the measured EU–Japan differential.
Eighth, the treatment variable captures exposure to the EU regulatory disclosure environment rather than confirmed firm-level statutory obligation under the NFRD or CSRD. Some EU firms in the sample may have been directly legally covered by NFRD since 2017, while others may not yet face mandatory reporting obligations under CSRD during the study period. This heterogeneity in legal coverage means the DiD estimates represent the average effect of operating within the EU mandatory disclosure ecosystem rather than the effect of a precisely defined statutory obligation. Future research with firm-level regulatory coverage data—identifying which specific firms are subject to which specific instruments in which specific years—would enable more precise treatment assignment and a more credible causal estimate.