1. Introduction
Green innovation has evolved into an essential driving force for major economies globally to promote economic transformation and move towards sustainable development. Amid worsening global environmental pollution, accelerating the green transformation of firms is essential. (Bai et al., 2023; Jin et al., 2022) [
1,
2]. As a major engine of global economic growth, China is also facing severe challenges from increasing environmental pollution and excessive greenhouse gas emissions in the process of achieving rapid economic growth (Hu et al., 2023) [
3]. To address these challenges, China is actively promoting a green development strategy (Jiang and Raza, 2023; Zhang et al., 2024) [
4,
5]. As the core of the green development strategy, green innovation is being recognized as an essential option for addressing global environmental issues. (Han et al., 2024; Wang and Sheng, 2024) [
6,
7]. Green innovation can effectively reduce the environmental risks of traditional technologies, reduce pollution emissions at the source, and boost energy efficiency (Bai et al., 2023) [
1]. Nevertheless, the realization of green innovation encounters numerous challenges. Due to its high investment, high risk, and high uncertainty, enterprises often face cost pressure, technological integration difficulties, and limited financing channels when carrying out green innovation (Chen et al., 2022) [
8]. These factors are intertwined, resulting in enterprises lacking sufficient internal motivation to pursue green innovation (Xu et al., 2024) [
9]. Meanwhile, digital technology is developing rapidly, and digital transformation is steadily turning into a key driver of innovation and transformation among global enterprises (Tang et al., 2024) [
10]; the advent of digital transformation has created fresh opportunities for enterprises to advance green innovation (Cao, 2023) [
11]. With its high dependence and integration characteristics, digital transformation has shown unique advantages in transforming production processes and improving resource utilization efficiency (Liu et al., 2023) [
12]. In addition, digital technology can effectively integrate technology, data, and knowledge resources, which not only transforms the traditional innovation model of enterprises and the combination of innovation elements, but also plays a crucial part in alleviating the financial pressure of companies throughout the procedure of sustainable innovation, breaking the barriers of information asymmetry and helping firms to overcome technical difficulties, thereby facilitating conducive environments for firm green innovation (Luo et al., 2023) [
13].
At present, studies examining how digital transformation affects corporate green innovation can be extensively separated into three viewpoints. Firstly, digital transformation positively promotes corporate green innovation (Shen and Tan, 2022; Liu et al., 2023) [
14,
15]. Mubarak et al. (2021) [
16] discovered that the implementation of fourth industrial revolution technologies may boost the green innovation capacities of organizations. Song et al. (2022) [
17] demonstrated that digital transformation greatly enhances the green innovation of extremely polluting industries. Secondly, in the course of digital transformation, firms may reallocate internal resources, which increases cost pressures and, ultimately, reduces their willingness to engage in green innovation (Liu et al., 2024; Yang et al., 2024; Du and Cao, 2023) [
18,
19,
20]. Thirdly, other studies have suggested that they may follow a nonlinear inverted U-shaped pattern, with the relationship initially increasing, and then decreasing (Ning et al., 2024) [
21].
The researchers examined the mechanisms between them from two different perspectives: external and internal. At the external environment level, existing research shows that, when companies face stronger external regulatory pressure, greater government innovation subsidies, and higher media attention, the impact becomes increasingly significant. (Zhang et al., 2025; Guo et al., 2023) [
22,
23]. At the internal factor level, prior studies have primarily focused on how digital transformation boosts green innovation through diminishing communication obstacles, mitigating agency conflicts, and strengthening internal control (Zhang et al., 2021; Li et al., 2023; Duan et al., 2024) [
24,
25,
26].
Relative to extant studies, the incremental benefit provided by our study is evident in each of the following three dimensions: Firstly, in terms of research data, prior studies have predominantly measured corporate digital transformation by the ratio of hardware investments in digital technologies to total assets (Song et al., 2022) [
17]. However, this indicator neglects both the outputs and actual effectiveness of digital transformation, as well as its multidimensional nature. To address this limitation, this study employs Python (3.8.8)-based text mining to examine the yearly documents of companies that are listed on A-shares spanning from 2010 to 2023, establishing a metric for digital transformation by quantifying the frequency of terms associated with “digital transformation” in the reports. Secondly, in terms of research perspective, while earlier studies have primarily concentrated on the quantity of green innovation, they have overlooked its influence on the quality of green innovation (Li et al., 2024) [
27]. The quality of green innovation plays an essential role in facilitating a firm’s green transformation and ensuring long-term sustainable development. We perform a comprehensive analysis from both the “innovation quantity” and “innovation quality” dimensions, examining the effects of digital transformation on each. Thirdly, in terms of research content, this article enriches the influence mechanism between them. Whereas existing research has mostly centered on pathways such as environmental regulation, financial performance, and internal control, providing insufficient insights into the underlying mechanisms (Yang et al., 2025; Liu et al., 2023) [
28,
29], this study highlights two key channels: information interaction and resource integration.
3. Materials and Methods
This study initially collected data from Chinese A-share listed companies on the Shanghai and Shenzhen Stock Exchanges for the period 2010–2023. To ensure data quality and validity, this paper removed ST and ST* firms, firms in the financial and insurance sectors, as well as samples exhibiting significant missing signs or anomalous values in the control variables. Additionally, we also winsorized all continuous variables at the first and 99th percentiles to mitigate the impact of extreme values. The total dataset has 43,616 firm–year observations. Financial data for A-share listed companies was obtained from the China Stock Market & Accounting Research (CSMAR) database; meanwhile, information on corporate green invention patents and their citations was obtained from the Green Patent Research Database (GPRD). The corporate digital transformation was manually collected from the annual reports of the listed companies throughout the sample period.
The dependent variable is measured using two indicators: the quantity and the quality of corporate green innovation. First, for the quantity of corporate green innovation (lnGreenInnovation), since green patents represent the highest level of technological content in corporate green knowledge, following the existing literature (Messeni et al., 2011; Zhang et al., 2024) [
43,
44], we quantify green innovation intensity by the count of green invention applications for patents and utilize the natural logarithm of this value plus one to mitigate data skewness and accommodate instances with zero observations. Second, the quality of corporate green innovation (lnCite), as patent citation frequency is an important indicator of the technological impact of a patent, following previous studies (Xu et al., 2024) [
9], we utilize the citation count of green invention patents as an indicator of green innovation quality and apply the natural logarithm of this count plus one to normalize the data and mitigate the impact of outliers.
The core explanatory variable of this paper is digital transformation (lnDigital). Drawing on the studies by Tang et al. (2023), Wu et al. (2021), and Yuan et al. (2024) [
10,
45,
46], we assess the extent of the core explanatory variable by quantifying the total frequency of digital transformation-related terms in the yearly documents of publicly traded companies. Specifically, we first constructed a dictionary of enterprise digitalization terms. Combining relevant academic research and economic policy text documents, 173 digitalization-related terms were selected from the four areas of technology—big data, cloud computing, artificial intelligence, and blockchain—to form an enterprise digitalization terminology dictionary. Secondly, text mining was performed through Python software to systematically arrange the yearly statements of A-share publicly traded enterprises from 2010 to 2023. With the help of the PDFplumber library, all PDF files were batch converted into txt format, and all text content was extracted to provide basic data for subsequent feature word screening. Next, given the complexity of annual report content and the tendency for companies to present their business operations, development plans, and related information in the “Management Discussion and Analysis” (MD & A) section, we concentrated on analyzing the text in this section. After splitting the MD&A section, we divided the sentences into individual word tokens, and for each occurrence of the keyword “digital transformation,” anchored it and examined a fixed-length window of words to its left. If any negation terms (e.g., “not,” “no,” “none”), or words referring to entities other than the focal firm (e.g., the company’s shareholders, customers, suppliers, or executives), appeared within this window, we marked the keyword for deletion; otherwise, we retained it. In this way, expressions containing negation words preceding the keyword, as well as instances of “digital transformation” that did not refer to the focal firm itself, were removed. Next, the Jieba library was used to extract keywords from the specified text file and count the frequency of occurrence of these keywords. Finally, the number of keyword occurrences in the four dimensions of enterprise digital transformation was summarized to obtain the aggregate word frequency of enterprise digital transformation. To eliminate the influence of outliers, we logarithmized the total word frequency to obtain quantitative indicators that characterize enterprise digital transformation.
To control other influences and more precisely assess the effect of digital transformation on corporate green innovation, drawing on the research of Chen et al. (2022), Cao et al. (2022), and Wang et al.(2021) [
8,
11,
34], the following variables for control were chosen based on enterprise characteristics, financial condition, and corporate governance capacity: enterprise size (Size), company establishment year (FirmAge), asset–liability ratio (Lev), profit from assets (ROA), cash flow to asset ratio (Cash-flow), net profit growth rate (NetProfitGrowth), management shareholding ratio (Mshare), the share of independent directors (Indep), and Tobin’s Q ratio (TobinQ). The precise definitions of the primary variables are presented in
Table 1.
This study employs regression analysis using a multi-dimensional fixed-effects model. The first explained variable is given by the quantity of green innovations of enterprises (
lnGreenInnovation), and its model is shown in Formula (1), as follows:
The second explained variable is the enterprise green innovation quality (
lnCite), and its model is shown in Formula (2), as follows:
In the above two models, the subscripts i, j, s, t represent the company, industry, province, and time, respectively. is a constant term; is the principal explanatory variable of this paper, representing the extent of digital transformation of enterprises in the s province and j industry in t year; signifies a set of control variables; represents the firm fixed effect. To mitigate the influence of time-varying unobservable characteristics at both the industry and provincial levels, we also incorporated industry–year fixed effects and province–year fixed effects ; represents the random disturbance term, for which the clustered robust standard error at the industry–year level is used. All regression results were completed using STATA17 software.
We used a multivariate fixed-effects model for analysis. Compared to ordinary least squares (OLS) regression, the multivariate fixed-effects model can control firm-level heterogeneity while simultaneously eliminating the confounding of time-varying unobservable factors across industries and regions. Corporate innovation initiatives are influenced by numerous factors, including industry cycles and regional policy discrepancies. Traditional OLS regression alone would be susceptible to omitted variable bias. The multivariate fixed-effects model effectively controls these potential confounding factors and eliminates the influence of time-varying firm-level idiosyncrasies, such as historical R & D accumulation. Industry–year fixed effects control for common shocks faced by firms in different industries across different years (such as technology cycles and industry policy adjustments). Province–year fixed effects control for regional and year-to-year differences in the macroenvironment (such as local government green development policies and the intensity of regional environmental regulation). By employing these multidimensional controls, the model minimizes the confounding of unobservable variables on the estimation results.
This paper examines the validity of the regression coefficients. (When the coefficient results are significant, it demonstrates that digital transformation strongly influences corporate green innovation, thus verifying Hypothesis 1. Secondly, to evaluate the overall explanatory strength of the regression model, this study presents the adjusted R-squared value (Adjusted R2). Furthermore, in the robustness check section, this study further validates the multivariate fixed-effects results by substituting the dependent variables, replacing the key explanatory variables, modifying the model estimation method, and applying instrumental variable analysis to strengthen the reliability of the findings.
5. Conclusions
Drawing on the regression analysis, this study derives the findings regarding the mechanisms through which digital transformation affects corporate green innovation. (1) The findings indicate that digital transformation markedly improves both quantity and quality of corporate green innovations, with a stronger effect on the quantity. These results remain robust after conducting various robustness checks, including substituting the dependent variables, replacing the key explanatory variables, adjusting the estimation method, and employing the instrumental variable approach, indicating that Hypothesis 1, “Digital transformation can promote the incremental improvement of corporate green innovation,” is established. (2) Regarding Hypothesis 2, “Digital transformation promotes corporate green innovation by improving resource allocation efficiency,” the results of the mechanism analysis show that digital transformation increases the quantity of corporate green innovation by improving resource integration efficiency, but has no significant impact on the quality of green innovation. Hypothesis 2 holds true for the quantity of green innovation, but not for its quality. Meanwhile, regarding Hypothesis 3, “Digital transformation reduces the risk of information asymmetry in enterprises, improves the information interaction capabilities of enterprises, and motivates enterprises to carry out green innovation,” the results demonstrate that digital transformation enhances two categories of green innovation in enterprises via information interaction effects, and Hypothesis 3 is valid. (3) The heterogeneity analysis indicates that digital transformation influences both the quantity and quality of green innovation more profoundly in state-owned firms than in privately held ones, and in firms situated in the eastern region compared to those in the central and western regions. The influence of digital transformation is particularly significant in high-tech firms on both the quantity and quality of green innovation.
6. Practical Implications
This research reveals the micro-status of Chinese listed companies in promoting digitalization and innovation, offering an empirical foundation for developing policies that integrate industrial green upgrading with digital transformation at the macro level. Drawing from the study results, this paper outlines the following practical recommendations: (1) The government should encourage and support enterprises to apply technology in digital form, such as cloud computing, blockchain and big data production, management, and innovation. By formulating differentiated policies, providing special subsidies and regulatory guidance, we can promote the digitalization of enterprises to empower green innovation and promote sustainable economic development. Secondly, the government ought to augment investment in digital infrastructure, such as artificial intelligence, to narrow the regional “digital divide” and provide basic support for enterprises’ green innovations. (2) Enterprises may fully leverage digital technology to build internal and external information sharing and resource integration platforms, improve information transparency and efficiency of resource allocation, and enhance green innovation capabilities. Meanwhile, enterprises should formulate phased transformation plans based on their own characteristics, draw lessons from the practices of state-owned and high-tech enterprises, and give priority to promoting key business links of digital empowerment within the scope of their own conditions. Finally, enterprise managers can also promote the network collaboration of green innovation through digital resource integration and knowledge sharing to achieve low-carbon development of enterprises. In summary, this study offers practical guidance for governments, firms, and participants in the innovation ecosystem; highlights the main pathways and priority areas through which digital transformation fosters corporate green innovation; and provides an evidence-based foundation for promoting sustainable economic and social development.