1. Introduction
As the digital economy accelerates, digital transformation has become increasingly important for firms seeking innovation-driven and high-quality development (Jiang and Li, 2024 [
1]; Duan and Zhang, 2025 [
2]). Digital technologies such as artificial intelligence, big data analytics, cloud computing, and blockchain are no longer merely technical tools; rather, they are increasingly embedded in firms’ production systems, management processes, and business models (ElMassah and Mohieldin, 2020 [
3]; Su and Wu, 2024 [
4]). In China, digital transformation has also been promoted as a strategic priority through policies encouraging the deeper integration of digital technologies with the real economy (Liu et al., 2022 [
5]; Li et al., 2022 [
6]). However, although digital transformation has attracted growing scholarly attention, how it is associated with firm innovation, through which internal mechanisms this relationship operates, and under what conditions the relationship becomes stronger remain insufficiently clarified.
From a theoretical perspective, innovation is a core driver of firms’ sustained competitive advantage. Innovation output is often shaped by sustained R&D spending, effective resource allocation, and a sound governance environment (Fabrizio and Tsolmon, 2014 [
7]; Massey and Johnston-Miller, 2016 [
8]; Yin et al., 2023 [
9]). Digital transformation influences these critical factors through multiple channels. On one hand, it helps break down information barriers, reduce asymmetries and financing constraints, thereby enabling increased R&D investment and efficiency (Aben et al., 2021 [
10]; Antonio and Sorin, 2023 [
11]; Zhang et al., 2025 [
12]). On the other hand, it facilitates improvements in internal controls, enhances information transparency, and strengthens external supervision—thus fostering a more stable institutional environment conducive to innovation (Keller et al., 2021 [
13]; Cordery et al., 2023 [
14]). However, the literature remains divided regarding the precise pathways and boundary conditions through which digital transformation affects innovation. While some studies affirm its significant role in improving R&D productivity and overall innovative performance (Li et al., 2023 [
15]), others argue that such effects may vary depending on firm size, financing conditions, and regional development levels (Yang and Han, 2023 [
16]). This divergence calls for a more systematic empirical investigation grounded in a coherent theoretical framework and supported by large-scale micro-level data.
From a practical perspective, Chinese firms display considerable variation in their levels of digital transformation. On one hand, some large enterprises and firms located in eastern regions have leveraged superior resources and policy support to achieve relatively advanced levels of digitalization and intelligent operations, accompanied by strong R&D capabilities and innovation outputs. On the other hand, constraints in financing, human capital, and access to technology leave SMEs and firms in central and western regions at a clear disadvantage, leaving their digital transformation relatively behind. Such an imbalance can weaken firms’ innovation capability and, at the same time, impede the formation of a balanced and resilient cross-industry innovation ecosystem (Hu et al., 2023 [
17]; Li et al., 2023 [
18]; Wu et al., 2024 [
19]). Therefore, a systematic investigation of how digital transformation influences innovation output, including its underlying mechanisms, will not only contribute to a better understanding of firm-level innovation dynamics but also inform regional development and industrial upgrading policies.
Despite the growing literature on digital transformation and corporate innovation, several important questions remain insufficiently clarified. This study is therefore guided by three research questions. First, is corporate digital transformation positively associated with firm innovation output? Second, do R&D investment and the governance environment provide mechanism-consistent channels through which digital transformation relates to innovation? Third, under what firm-level and regional conditions is the digital transformation–innovation relationship more pronounced? By addressing these questions, this study aims to provide a more structured understanding of how digital transformation is linked to innovation output and why this relationship may vary across firms.
We study Chinese A-share non-financial listed firms from 2009 to 2023, focusing on how digital transformation relates to innovation output and on the mechanism pathways behind this relationship. Methodologically, we develop a firm-level index of digital transformation by applying text-mining methods to the MD&A narratives in annual reports. Firm and year effects are included in the baseline model; endogeneity is examined via instrumental variables; robustness is further assessed through multiple checks. Furthermore, we identify two parallel mechanisms: (1) the R&D investment mechanism, whereby digital transformation improves R&D intensity, capitalization, efficiency, and investment growth; and (2) the governance environment mechanism, through which digital transformation enhances internal control, improves information disclosure, and strengthens audit supervision. Heterogeneity is explored by stratifying firms by key attributes and external conditions and comparing the estimated innovation effects of digital transformation.
Although prior studies have provided important insights into the relationship between digital transformation and corporate innovation, the existing literature remains limited in three respects. First, many studies emphasize the resource-allocation or efficiency-enhancing role of digital transformation, but pay less attention to how digital transformation reshapes the governance and information environment in which innovation decisions are made. Second, studies that examine governance or information effects often treat them separately from firms’ R&D resource allocation, leaving the interaction between internal innovation-input mechanisms and external governance-support mechanisms insufficiently theorized. Third, existing research has not fully explained why the innovation implications of digital transformation differ across firms with different resource endowments, financing conditions, ownership structures, and regional environments. Therefore, this study seeks to provide an integrated dual-path framework that links digital transformation to firm innovation through both R&D resource allocation and governance environment improvement.
The principal contribution of this study lies in developing and empirically examining a dual-path framework that links digital transformation to firm innovation through R&D resource allocation and the governance/information environment. The text-based measurement, Chinese listed-firm panel evidence, and heterogeneity analysis are used to support and contextualize this theoretical framework rather than being treated as separate core contributions.
This study makes three main contributions. First, it develops a dual-path theoretical framework that explains how digital transformation is associated with firm innovation through both R&D resource allocation and the governance environment. This framework moves beyond a single resource-efficiency explanation and highlights the coexistence of internal innovation-input mechanisms and external governance-support mechanisms.
Second, this study deepens the theoretical understanding of digital transformation by treating it not merely as a process of technological adoption but as an organizational transformation process that reshapes knowledge processing, resource allocation, and information transparency. In doing so, the paper connects digital transformation more explicitly with the organizational foundations of innovation.
Third, this study clarifies the boundary conditions of the digital transformation-innovation relationship by examining how financing constraints, firm size, regional location, and ownership structure condition the strength of this association. This contributes to a more differentiated understanding of why the innovation implications of digital transformation vary across firms and institutional environments.
2. Theoretical Analysis and Research Hypothesis
2.1. Core Theoretical Foundation: Resource-Based View and Complementary Perspectives
This study adopts the Resource-Based View as its central theoretical foundation. From this perspective, digital transformation can be understood as a process through which firms convert data resources, digital technologies, organizational routines, and knowledge-processing capabilities into strategic resources that support innovation. This theoretical lens is appropriate because the innovation implications of digital transformation depend not only on the adoption of digital tools but also on whether firms can transform these tools into valuable, rare, and difficult-to-imitate organizational capabilities. Digital transformation may therefore enhance firms’ innovation capacity by improving the availability, recombination, and utilization of strategic resources.
Dynamic capability theory and information asymmetry theory are used as complementary perspectives. Dynamic capability theory helps explain how firms reconfigure digital resources and organizational processes in response to technological and market changes. Information asymmetry theory and agency theory help explain why digital transformation may improve the governance and information environment surrounding innovation activities. In this sense, the theoretical framework of this study is centered on the Resource-Based View, while the complementary theories are used to explain the two specific channels emphasized in this paper: R&D resource allocation and governance environment improvement.
In the era of the digital economy, corporate innovation models and sources of competitive advantage are profoundly shaped by digital transformation (DT). Unlike the traditional perception of digitalization as a mere adoption of information technology tools, DT refers to a systemic transformation that integrates frontier digital tools, such as AI, data analytics, cloud infrastructure, and blockchain solutions, into firms’ production, management, and decision-making. This transformation restructures resources, reengineers workflows, and optimizes governance systems, thereby exerting significant influence on innovation (Miklosik and Evans, 2020 [
20]; Massaro, 2023 [
21]; Ma and Chang, 2024 [
22]). Its impact can be interpreted from three interrelated perspectives.
First, the resource and capability perspective. RBV argues that an enduring competitive advantage arises when firms control resources that are scarce and not easily imitated or replaced. DT enables the datafication of resources and the sharing of information, which facilitates the rapid integration of knowledge, technologies, and market intelligence. This process reduces information processing and communication costs in R&D and enhances the efficiency of knowledge recombination and application (Li et al., 2021 [
23]). In terms of dynamic capability theory, DT enhances organizational capabilities in opportunity sensing, knowledge absorption, and agile responses to uncertainty. Consequently, firms can better balance exploratory and exploitative innovation, thereby maintaining innovation vitality in dynamic environments (Ceipek et al., 2021 [
24]; Kowalski et al., 2024 [
25]).
Second, an R&D-based view suggests that innovation outcomes hinge on how much firms invest in R&D, how those inputs are allocated, and how efficiently they are converted into outputs. R&D decisions are inherently linked to financing availability, cost efficiency, and strategic orientation—all of which are significantly influenced by digitalization. DT lowers transaction and organizational costs in R&D activities by enabling intelligent resource allocation and optimized information flows (Liu et al., 2023 [
26]; Cai et al., 2024 [
27]). It also reinforces data-driven strategic judgments, encouraging firms to engage in long-term, capitalized R&D projects that generate higher-value innovation outputs (Appio et al., 2021 [
28]). In this sense, DT promotes innovation indirectly through the “cost reduction—efficiency improvement–structural optimization” pathway, which simultaneously enhances the intensity and quality of R&D activities.
Third, from a governance and information-transparency angle, innovation inherently involves substantial risk and uncertainty, which can intensify agency conflicts and deter investors. From the lens of principal–agent theory and information asymmetry theory, DT can improve governance by enhancing internal control, information disclosure, and audit mechanisms (Manita et al., 2020 [
29]; Li et al., 2024 [
30]). On one hand, digital tools strengthen the authenticity and traceability of internal information, thereby mitigating managerial opportunism (Yang et al., 2021 [
31]). On the other hand, improved disclosure quality and more effective audit supervision enhance investor confidence in firms’ R&D strategies and innovation projects, increasing both capital availability and market recognition (Zhang et al., 2025 [
32]). Thus, improvements in governance and information transparency not only lower agency costs but also amplify the marginal effects of DT on innovation outcomes.
Taken together, DT contributes to corporate innovation through both direct and indirect channels. Directly, it enhances knowledge recombination and innovation efficiency. Indirectly, it increases R&D investment and strengthens governance mechanisms, thereby creating a more supportive institutional environment for innovation.
2.2. Mathematical Model Derivation
Before presenting the hypotheses, we provide a simplified analytical model to formalize the theoretical logic discussed above. The purpose of this model is not to establish an independent structural estimation framework or to provide causal identification. Rather, it serves as a supplementary theoretical device that clarifies why digital transformation may be linked to innovation through R&D investment and the governance environment. Therefore, the model should be read as an analytical extension of the theoretical discussion rather than as a substitute for the empirical research design.
Corporate innovation output reflects both knowledge accumulation and allocation efficiency in resource use. Digital transformation (DT) enhances innovative output by raising the efficiency of firms’ data processing and knowledge recombination. Let the innovation output of firm i at time t be denoted as Innov, which is a function of R&D investment (RD), governance quality (Gov), and the degree of digital transformation (DIGI). A simplified representation can be expressed as:
Here, reflects the digital-transformation–induced gain in the efficiency of knowledge transformation, and and captures the extent to which governance quality amplifies innovation output. This relationship indicates that digital transformation not only can directly increase the knowledge productivity of enterprises, but also operates through R&D activities and the governance environment to shape innovation indirectly.
From the perspective of R&D decision-making, firms weigh the marginal benefits against the marginal costs of R&D investment. While R&D costs typically increase with investment scale, DT reduces effective costs by mitigating financing constraints and minimizing information frictions. The R&D cost function can be expressed as:
Among them, reflects the effect of the alleviation of financing constraints brought about by digital transformation. The denominator is increasing in , which means that digitalization reduces the marginal cost of one additional unit of R&D.
The expected return from R&D depends on both the scale of investment and knowledge transformation efficiency. The return function can be defined as:
Among them,
represents the extent of digital transformation–driven improvements in R&D output efficiency. Optimal R&D occurs at the level where the marginal benefit of R&D just matches its marginal cost:
Substituting (2) and (3) yields:
Solving this condition indicates that:
This result implies that higher levels of DT reduce the effective marginal cost of R&D while simultaneously enhancing R&D output efficiency, thereby increasing optimal R&D investment and indirectly promoting innovation output. This provides theoretical support for the R&D mechanism proposed in this study.
Beyond the R&D channel, governance mechanisms also constitute a critical pathway through which DT influences innovation. Given the high risk and uncertainty of innovation, external supervision and governance quality become indispensable. Innovation output can thus be further expressed as:
Here, represents the degree of support provided by the external governance environment for innovation, and is greater than 0. This functional form shows that better governance strengthens the innovation gains associated with digital transformation at the margin.
Here, improvements in the governance environment strengthen DT’s innovation effect at the margin. Decomposing governance quality, we assume it is jointly determined by internal control (IC), disclosure quality (KV), and audit quality (AQ):
Substituting (8) into (7) gives:
Equation (9) illustrates the complementarity between governance and digital transformation: when internal controls are stronger, disclosures are more transparent, and audits more effective, the innovation-enhancing effect of DT is amplified.
2.3. Research Hypotheses
Building on the preceding analysis, we argue that digital transformation affects innovation output both directly and indirectly, with R&D investment and governance serving as the key transmission mechanisms. On this basis, we formulate three hypotheses as follows.
Digital transformation, as a systematic change, can directly enhance the innovation efficiency of enterprises by improving the rate of knowledge conversion, optimizing the methods of information processing and resource allocation. In Equation (1), the positive effect of implies that higher digitalization speeds up knowledge creation and application, thereby raising innovation output. Therefore, Hypothesis 1 is proposed:
H1. Digital transformation exerts a significantly positive impact on enterprises’ innovation output.
R&D activities are a central engine of innovation output, and the amount invested in R&D as well as the efficiency of that investment jointly determine the quantity and quality of innovation outcomes. Models (2)–(6) imply that digital transformation raises the optimal R&D investment level by easing financing constraints and lowering marginal costs, while also improving R&D output efficiency. Put differently, digital transformation improves innovation efficiency directly and also boosts innovation output through higher R&D intensity and a more optimized R&D structure. The enhancement should be reflected across multiple R&D metrics, including intensity, capitalization, growth, per capita input, and the expense ratio. Therefore, Hypothesis 2 is proposed:
H2. Digital transformation is positively associated with R&D investment and R&D efficiency, which provide a plausible mechanism-consistent channel linking digital transformation to firm innovation.
Innovative activities typically involve substantial risk and long horizons, which can intensify agency conflicts and undermine external investors’ trust. Models (7)–(9) suggest complementarity between digital transformation and governance quality, such that stronger governance amplifies the innovation-enhancing effect of digital transformation. Specifically, digital tools can strengthen external monitoring and resource support by enhancing the effectiveness of internal controls, increasing disclosure transparency, and reinforcing audit oversight. These changes mitigate information frictions and agency costs during innovation activities, thereby amplifying DT’s marginal innovation payoff. Thus, Hypothesis 3 is proposed:
H3. Digital transformation is positively associated with improvements in the governance and information environment, which provide a plausible mechanism-consistent channel linking digital transformation to firm innovation.
4. Empirical Results
4.1. Benchmark Regression
Table 3 reports the baseline regression estimates linking digital transformation to corporate innovation output. The specification in Column (1) excludes both control variables and fixed effects. Even without any controls or fixed effects (Column 1), the estimated DIGI coefficient remains positive and precisely estimated (0.019,
p < 0.01), indicating that digitally transformed firms file more patent applications in this simplest specification.
Moving from Column (2) to Column (5), the model is enriched by absorbing firm- and year-level effects and by adding a broader set of controls. Although the estimated coefficient on DIGI becomes smaller, the coefficient remains positive with strong statistical support (p < 0.01) and is approximately 0.003 in magnitude. After absorbing unobserved firm heterogeneity and year effects and adding comprehensive controls, the estimated DT effect on innovation output remains positive, pointing to robust results.
Taken together, the baseline estimates are consistent with H1, consistent with higher innovation output among firms with stronger digital transformation, with the effect estimated precisely. Similar estimates across alternative specifications strengthen confidence in the results. Specifically, digital transformation not only improves information processing and resource allocation efficiency, but also exerts a stable and generalizable impact on innovation activities, even in complex business and governance environments. These results lay a solid empirical foundation for the subsequent mechanism and heterogeneity analyses.
4.2. Endogeneity Test
Endogeneity is an important concern when examining the relationship between digital transformation and firm innovation. Firms with stronger innovation capabilities may be more willing and better able to implement digital transformation, leading to potential reverse causality. In addition, unobserved factors, such as managerial ability, organizational culture, industry-level technological opportunities, and firms’ long-term strategic orientation, may simultaneously affect both digital transformation and innovation output. To further alleviate these concerns, this study employs an instrumental-variable approach based on U.S. industry-level industrial robot penetration.
The instrumental variable is constructed from U.S. industry-level industrial robot data and matched to Chinese listed firms according to industry classification. The rationale for using this variable is that industrial robot penetration in the United States reflects frontier automation, intelligent manufacturing, and digital-technology diffusion at the global industry level. As the United States is one of the major economies leading global technological change, robot adoption in U.S. industries can capture external technological trends and automation pressure faced by comparable industries worldwide. When robot penetration increases in a corresponding U.S. industry, Chinese firms in similar industries are more likely to be exposed to international technology diffusion, supply-chain upgrading pressure, competitive imitation, and digital transformation incentives. Therefore, U.S. industry-level industrial robot penetration is expected to be positively associated with the digital transformation of Chinese firms, satisfying the relevance condition of the instrumental variable.
Regarding the exclusion restriction, the instrument is based on industry-level robot penetration in the United States rather than on the digital transformation behavior or innovation activities of individual Chinese firms. Therefore, it is unlikely to be directly determined by the innovation output of Chinese listed companies. Moreover, the instrument varies at the external industry level, which helps reduce concerns that it is driven by firm-specific unobserved characteristics, such as managerial ability, internal organizational culture, or persistent innovation orientation. Since the regressions include firm fixed effects and year fixed effects, time-invariant firm heterogeneity and common macroeconomic shocks are further controlled for. The inclusion of firm-level control variables also helps mitigate the influence of observable differences in firm size, profitability, leverage, growth, ownership structure, and governance characteristics.
In conceptual terms, the identifying logic is that U.S. industry-level industrial robot penetration affects the digital transformation of Chinese firms through external technology diffusion and industry-level digitalization pressure. However, it is less likely to directly affect the innovation output of a particular Chinese listed firm except through its influence on the firm’s digital transformation. This setting helps alleviate concerns about reverse causality and omitted firm-level determinants of innovation. Nevertheless, as industry-level global technology trends may also be related to broader innovation opportunities, the IV results should still be interpreted cautiously as evidence that helps reduce endogeneity concerns rather than as definitive causal proof.
Table 4 reports the instrumental-variable estimation results. Column (1) presents the first-stage regression results. The coefficient of the instrumental variable is 0.024 and statistically significant at the 5% level, indicating that U.S. industry-level industrial robot penetration is positively associated with firm-level digital transformation in China. This result supports the relevance condition of the instrument. Column (2) reports the second-stage regression results. The estimated coefficient of DIGI is 0.423 and statistically significant at the 5% level, suggesting that digital transformation remains positively associated with firm innovation output after using the external industry-level instrument.
4.3. Robustness Test
The robustness tests are designed to examine whether the baseline findings are sensitive to alternative specifications, variable measurements, sample restrictions, and external shocks. Each test addresses a specific empirical concern. Alternative model specifications are used to examine whether the results are driven by omitted regional or industry-level factors. Alternative dependent variables are used to assess whether the findings depend on the measurement of innovation output. Additional regional controls are included to account for macroeconomic and financial conditions at the city level. Excluding major event years helps reduce the influence of large external shocks. Excluding municipalities addresses the concern that firms located in administratively special and economically advanced cities may disproportionately influence the results. Finally, the balanced-panel test examines whether the findings are affected by sample entry and exit over time.
Because some robustness specifications include different combinations of fixed effects and controls, coefficient magnitudes are not expected to be directly identical across all columns. Therefore, the robustness analysis focuses primarily on whether the sign, statistical significance, and substantive interpretation of the DIGI coefficient remain stable across alternative specifications.
4.3.1. Alternative Model Specifications
This test addresses the concern that the baseline results may be driven by unobserved city-level or industry-level characteristics. By adding city and industry fixed effects, we examine whether the digital transformation–innovation association remains stable after accounting for regional and sectoral heterogeneity
To evaluate robustness, we vary the specification and re-estimate the baseline models accordingly. In particular, we sequentially absorb city and industry effects to capture unobserved differences across locations and sectors. By controlling for regional and industry influences, we mitigate concerns that the results merely reflect regional development differences or sectoral traits rather than digital transformation itself.
The estimates reported in
Table 5 show that the coefficient on DIGI remains positive with strong statistical support (
p < 0.01) under all specifications, ranging from 0.003 to 0.007. The estimates are broadly comparable to the benchmark regression, supporting the stability of the positive digital transformation effect on innovation output under different specifications.
4.3.2. Alternative Dependent Variables
This test addresses the concern that the findings depend on the use of invention patent applications as the innovation measure. We therefore use invention patent grants as an alternative dependent variable, which captures more realized and quality-screened innovation outcomes.
To test whether the results depend on how innovation output is measured, we adopt an alternative proxy. Beyond invention patent applications, we also use the log-transformed count of invention patent grants (Innov2) as the dependent variable. As opposed to patent applications, granted patents involve stricter examination and higher thresholds, thereby better reflecting the quality and practical implementation of innovation outcomes. If the results remain consistent under this alternative specification, concerns about inflated application counts or strategic patent filings are alleviated, thus reinforcing the credibility of our conclusions.
Regardless of the specification,
Table 6 yields a positive and precisely estimated DIGI coefficient (
p < 0.01). The coefficient remains of a similar size to the benchmark results, with estimates ranging from 0.005 to 0.009. Taken together, digital transformation boosts both “potential” innovation output (i.e., patent applications) and “realized” innovation outcomes (i.e., patent grants).
In other words, digital transformation not only stimulates firms to increase R&D investment and pursue more innovation attempts but also improves the quality and conversion efficiency of innovation activities, enabling a higher proportion of R&D projects to translate into granted patents. Innovation gains from digital transformation are two-dimensional, affecting not only quantity but also quality.
4.3.3. Adding Additional Control Variables
This test addresses the concern that regional macroeconomic and financial conditions may simultaneously affect digital transformation and innovation output. We therefore add city-level economic growth, industrial structure, per capita GDP, and financial loan variables.
To account for regional macro conditions, we enrich the baseline specification by including macroeconomic variables at the region level, which also serves as a robustness exercise. The added regional controls comprise city-level GDP growth (GDP), the GDP shares of the secondary (I2) and tertiary (I3) sectors, the log of per capita GDP (PerGDP), and the loan balance of financial institutions (Loan). The inclusion of these variables allows us to control for influences on innovation output arising from firm-level resources and governance, in addition to effects stemming from the broader economy and regional development.
Table 7 indicates that the DIGI coefficient is consistently positive and highly significant (
p < 0.01) across specifications, with estimates between 0.002 and 0.007. After controlling for regional economic conditions and industrial composition, the estimated link between digital transformation and innovation output remains positive, indicating that these factors do not drive the findings.
4.3.4. Excluding Major Events
This test addresses the concern that major external shocks may confound the estimated relationship between digital transformation and innovation. We therefore exclude years affected by representative macroeconomic or global disruptions.
To address the concern that external shocks may confound the relationship between digital transformation and innovation output, we re-estimate the baseline model after excluding observations from three representative years characterized by major external disruptions: 2015, when China’s stock market experienced severe turbulence that undermined firms’ financing environment and investor confidence; 2018, when the escalation of the U.S.–China trade conflict created uncertainty for export-oriented and high-tech enterprises; and 2020, when the COVID-19 outbreak interrupted global supply chains and precipitated a sharp economic downturn. These events could systematically affect both digital transformation strategies and innovation activities, thereby biasing the estimated relationship.
As shown in
Table 8, the DIGI coefficient remains positive and highly significant (
p < 0.01) in every specification, with estimates spanning 0.003 to 0.007. After excluding years affected by major shocks, the estimated digital transformation effect stays positive and broadly consistent with the benchmark regression.
4.3.5. Excluding Four Municipalities
To assess robustness, we re-estimate the models after excluding firms located in Beijing, Shanghai, Tianjin, and Chongqing, the four directly administered municipalities. These cities occupy a unique position in China’s economic landscape, characterized by advanced levels of digital economy development, strong innovation capacity, and pronounced agglomeration effects. In addition, they benefit disproportionately from policy support, capital availability, and market advantages compared to other regions. Including these municipalities without distinction may bias the estimates due to their extreme values and exceptional development trajectories.
Table 9 indicates that the DIGI coefficient is consistently positive and highly significant (
p < 0.01) across specifications, with estimates between 0.002 and 0.008. Both the coefficient signs and their statistical significance align with the baseline estimates, s which alleviates concerns that the estimated effect is concentrated in only a few highly developed municipalities. Rather, the effect is broadly applicable to firms across different regions.
4.3.6. Balanced Panel Estimation
As an additional robustness check, we re-run the regressions on a balanced panel that includes only firms observed in every year of the sample period. Compared with an unbalanced panel, the use of a balanced panel reduces biases arising from missing data and improves the comparability of results across firms and over time. However, this approach also entails a substantial reduction in sample size, which may lower statistical precision. If the results remain consistent under the balanced sample, the robustness of our conclusions will be further reinforced.
As reported in
Table 10, DIGI is consistently positive and statistically significant, and its estimated coefficient ranges from 0.003 to 0.007. Although the number of observations decreases markedly from over 20,000 firm-year samples to approximately 4600, the results are qualitatively unchanged compared with the baseline findings.
6. Discussion
6.1. Theoretical Implications
The findings of this study offer several theoretical implications for research on digital transformation and corporate innovation. First, this study extends the literature on the relationship between digital transformation and innovation by proposing a dual-path framework. Rather than treating digital transformation solely as a technological input or an efficiency-enhancing tool, this study conceptualizes it as a broader organizational transformation process that influences innovation through both R&D resource allocation and the governance environment. This perspective helps explain why digital transformation may promote innovation not only by increasing firms’ innovation inputs, but also by improving the institutional, informational, and managerial conditions under which innovation activities take place.
Second, this study contributes to the Resource-Based View by clarifying how digital resources can evolve into innovation-related strategic resources. Digital transformation does not automatically translate into innovation output. Its innovation value depends on whether firms can transform digital technologies, data resources, and organizational routines into capabilities that support knowledge recombination, R&D decision-making, and innovation governance. This interpretation also helps explain why the innovation effects of digital transformation may vary across firms with different resource endowments, ownership structures, financing conditions, and regional environments.
6.2. Interpretation of Mechanism-Consistent Evidence
The mechanism analysis should be interpreted as evidence consistent with the proposed channels rather than as definitive causal mediation evidence. The empirical results show that digital transformation is positively associated with several R&D-related and governance-related indicators. These findings are in line with the theoretical argument that digital transformation may promote corporate innovation by strengthening R&D investment and improving the governance environment. However, because this study is based on observational panel data, the results should not be interpreted as fully proving a causal mediation chain. Instead, they provide empirical support for the plausibility of the proposed dual-path framework.
This cautious interpretation is necessary because innovation is a cumulative, dynamic, and uncertain process. R&D investment, governance quality, and innovation output may influence one another over time. Therefore, the mechanism results are best understood as showing that the observed empirical patterns are consistent with the theoretical logic of this study. Future research may further examine the causal mediation mechanisms through more refined data, policy shocks, quasi-natural experiments, or other identification strategies.
6.3. Managerial and Policy Implications
The findings provide meaningful implications for both firms and policymakers. For firms, digital transformation should not be understood merely as the adoption of digital tools. Instead, firms need to embed digital technologies into R&D management, resource allocation, internal control, and information disclosure systems. Only when digital transformation is integrated with organizational capability building can it be more effectively converted into innovation output.
For policymakers, the heterogeneous results suggest that digital transformation policies should avoid a one-size-fits-all approach. Firms with high financing constraints, smaller scale, non-state ownership, or weaker regional digital infrastructure may face greater difficulty in transforming digital initiatives into innovation outcomes. Therefore, policy support should focus not only on encouraging the adoption of digital technologies, but also on improving financing access, digital infrastructure, governance standards, and innovation-supporting services for firms with relatively weak resource endowments. Such differentiated policy design can help narrow the innovation gap among firms and enhance the inclusive effects of digital transformation.
6.4. Limitations and Future Research
This study has several limitations. First, the text-based DIGI index constructed from annual reports may capture both substantive digital transformation and disclosure intensity. Although annual-report text provides useful information about firms’ digital orientation, it may also reflect managerial rhetoric or symbolic disclosure. Future research may combine textual measures with indicators such as digital investment, software assets, IT capital, digital patents, or other measures of actual digital capability.
Second, although this study includes firm and year fixed effects, instrumental-variable estimation, and several robustness checks, the observational panel-data design cannot fully eliminate endogeneity concerns. Therefore, the findings should be interpreted as robust associations rather than definitive causal estimates. Future studies may use quasi-natural experiments, policy shocks, or more rigorous causal identification strategies to further examine the relationship between digital transformation and innovation.
Third, this study focuses mainly on innovation output measured by invention patents and granted patents. Future research may further distinguish between different types of innovation, such as exploratory and exploitative innovation, radical and incremental innovation, or green and non-green innovation. Such extensions would provide a more nuanced understanding of how digital transformation shapes the direction, quality, and sustainability of corporate innovation.
7. Conclusions
This study investigates the innovation consequences of digital transformation among Chinese A-share listed firms from 2009 to 2023, with particular attention to the roles of R&D investment and governance improvement. Based on panel data analysis and a series of robustness checks, the study finds that digital transformation is positively associated with corporate innovation output. This conclusion remains consistent when innovation is measured by patent applications and patent grants, and it is further supported by alternative model specifications and instrumental-variable estimation.
The empirical findings lead to three main conclusions. First, firms with a higher degree of digital transformation tend to exhibit stronger innovation performance, suggesting that digitalization has become an important driver of corporate innovation. Second, the mechanism analysis shows that this relationship is consistent with two complementary channels: digital transformation is associated with stronger R&D inputs and a more supportive governance environment. Specifically, digital tools may improve resource allocation, enhance R&D efficiency, and support longer-term innovation investment, while also strengthening internal control, information disclosure, and external supervision. Third, the heterogeneity analysis indicates that the innovation effect of digital transformation is more pronounced among firms with lower financing constraints, larger enterprises, firms located in eastern regions, and state-owned enterprises. This suggests that firm characteristics and external conditions play an important role in shaping the innovation outcomes of digital transformation.
Overall, this study shows that digital transformation is not only a technological process, but also an organizational and institutional process that supports corporate innovation. The findings highlight the need for firms to integrate digital strategies with R&D management and governance improvement, while also suggesting that policymakers should create more inclusive conditions for digital innovation by improving infrastructure, financing access, and institutional support for firms with weaker resource endowments. Future research could extend this study by incorporating micro-level behavioral data, distinguishing between different types of innovation, or conducting cross-country comparative analysis. These extensions would deepen our understanding of how digital transformation interacts with corporate innovation and provide more precise insights for policy design and managerial practice.