1. Introduction
The United Nations Environment Programme (UNEP) reports that, despite recent climate pledges, global temperatures are still projected to rise by 2.3–2.5 °C by the end of the century, falling short of the Paris Agreement’s 1.5 °C target (UNEP, 2025 [
1]). Reinforcing this concern, discussions at the United Nations Climate Change Conference (COP30) emphasized the urgent need to strengthen the implementation and effectiveness of low-carbon policies, particularly through enhanced policy coordination, innovation incentives, and market-based instruments (UNFCCC, 2025 [
2]). Against this backdrop, this study analyzes low-carbon policy dynamics in a duopoly with differentiated products, green R&D, and knowledge spillovers under Cournot and Bertrand competition with both commitment and non-commitment regimes. Specifically, we (i) quantify the impacts of low-carbon policies on R&D, emissions, profits, and welfare across alternative competition modes, policy-timing regimes, and R&D organization structures; (ii) examine the roles of key policy parameters across all scenarios; and (iii) provide an integrated and intuitive interpretation of the underlying economic mechanisms. By integrating game theory and environmental policy, this paper sheds light on how market structure and firm strategy shape low-carbon policy effectiveness, thereby informing the design of more efficient and credible climate interventions.
Within a three-stage game-theoretic framework, we construct a duopoly model in which firms invest in green R&D to reduce emissions, with product differentiation and knowledge spillovers playing central roles. In the Cournot non-commitment scenario, firms first choose green R&D, the government then sets a welfare-maximizing carbon tax, and firms subsequently compete à la Cournot. The Bertrand non-commitment scenario follows the same timing, except that firms engage in price competition in the final stage. Under commitment, the government sets the carbon tax in the first stage, firms then choose green R&D, and the final stage involves either Cournot or Bertrand competition, depending on the competitive mode. We consider eight scenarios defined by the following: (i) the competition mode (Cournot or Bertrand), (ii) the regulator’s decision timing (commitment or non-commitment), and (iii) the R&D regime (R&D competition or R&D cooperation). The scenarios are summarized in
Section 4 (CCN, CCC, BCN, BCC, CNN, CNC, BNN, and BNC).
This study makes several contributions. To the best of our knowledge, no existing research examines the effects of low-carbon policies on economic, environmental, and social equilibria in a polluting duopoly with differentiated products, green R&D, and knowledge spillovers. Moreover, prior work does not provide a systematic comparison of equilibrium R&D, emissions, industry profits, and welfare across alternative R&D organizations and competition modes under regimes in which the government does or does not precommit to a carbon tax. Methodologically, this study integrates analytical derivations, graphical analysis, and numerical simulations to investigate the determinants and outcomes of R&D, emissions, industry profits, and welfare, as well as the broader implications of low-carbon policies. Model solutions are visualized using a comprehensive multidimensional graph, which simultaneously captures multiple variable dimensions, structural relationships, and scenario-specific contrasts within a unified framework. By adopting a unified perspective encompassing governments, firms, and consumers, this study fills important gaps in the literature and provides novel insights into strategic low-carbon policymaking aimed at achieving multiple objectives. Our analysis identifies the conditions under which superior outcomes in R&D, emissions, industry profits, and welfare can be attained, thereby extending the literature on time-consistent and time-inconsistent environmental policies with abatement R&D and emissions regulation (Ouchida and Goto, 2011, 2014, 2016a, 2016b; Petrakis and Poyago-Theotoky, 2002; Poyago-Theotoky, 2007, 2010; Poyago-Theotoky and Teerasuwannajak, 2002; Wang, 2020, 2021; Wang and Atallah, 2025a, 2025b [
3,
4,
5,
6,
7,
8,
9,
10,
11,
12,
13]).
This study yields six key insights. First, in the CCN scenario, lower product substitutability and weaker knowledge spillovers jointly increase equilibrium R&D. In other scenarios, lower product substitutability alone consistently raises equilibrium R&D. Cournot and Bertrand contours are nearly collinear in the CNN, BNN, CNC, and BNC scenarios. However, Cournot outcomes diverge significantly from Bertrand in the CCN, BCN, CCC, and BCC scenarios. As product substitutability decreases, R&D levels under CNC and BNC achieve the first-best outcomes. Second, in the CCN, BCN, CCC, and BCC scenarios, higher product substitutability and stronger knowledge spillovers systematically decrease equilibrium emissions. In the other scenarios, higher product substitutability alone reduces emissions. While Cournot and Bertrand contours are nearly collinear in the CNN and BNN scenarios, Cournot outcomes diverge notably from Bertrand when products are complementary or homogeneous, as observed in the CCN, BCN, CCC, and BCC scenarios. As product substitutability increases, emission levels under CCC and BCC reach the first-best outcomes.
Third, lower product substitutability and stronger knowledge spillovers consistently increase equilibrium industry profits. As product substitutability decreases, Cournot outcomes diverge substantially from Bertrand in the CNN, BNN, CNC, and BNC scenarios. In contrast, this divergence becomes more pronounced in the CCN, BCN, CCC, and BCC scenarios when lower substitutability is combined with stronger knowledge spillovers. As product substitutability decreases, industry profits under CNC and BNC reach the first-best outcomes. Fourth, lower product substitutability and stronger knowledge spillovers consistently increase equilibrium welfare. While Cournot and Bertrand contours are nearly collinear in the CNN, BNN, CNC, and BNC scenarios, they diverge in the CCN, BCN, CCC, and BCC scenarios when products become complementary or homogeneous. As product substitutability decreases, welfare under CNC and BNC reaches the first-best outcomes. Fifth, as R&D efficiency decreases, the contour orderings of R&D, emissions, industry profits, and welfare remain qualitatively similar. However, equilibrium R&D, industry profits, and welfare decline, while emissions increase. As environmental damage increases, the contour orderings remain unchanged, but R&D increases, while emissions, industry profits, and welfare decline. Sixth, in the Cournot model, compared to Bertrand, there is a special case where the carbon tax can be negative if the goods are highly complementary and knowledge spillovers are extremely high.
The paper is structured as follows.
Section 2 reviews the relevant literature,
Section 3 presents the model, and
Section 4 analyzes it under Cournot and Bertrand competition, with both non-commitment and commitment.
Section 5 evaluates the economic and environmental performance across scenarios, and
Section 6 concludes with the main findings and policy implications.
3. The Model
In the context of low-carbon policy, this study develops a duopoly model with differentiated products, green R&D, and knowledge spillovers (the model in this study builds on established frameworks of abatement R&D and emissions regulation (Ouchida and Goto, 2011, 2014, 2016a, 2016b; Petrakis and Poyago-Theotoky, 2002; Poyago-Theotoky, 2007, 2010; Poyago-Theotoky and Teerasuwannajak, 2002; Wang, 2020, 2021; Wang and Atallah, 2025a, 2025b) [
3,
4,
5,
6,
7,
8,
9,
10,
11,
12,
13]). Production generates pollution, firms invest in R&D to abate emissions, and R&D creates positive spillovers. We employ a three-stage game-theoretic framework under two competition modes (Cournot and Bertrand) and two policy-timing scenarios (commitment and non-commitment) and within each case consider R&D competition and R&D cooperation. In the Cournot non-commitment scenario, firms first choose R&D (cooperatively or non-cooperatively), the government then sets a welfare-maximizing carbon tax, and firms finally compete à la Cournot. The Bertrand non-commitment scenario follows the same timing, but firms engage in price competition in the last stage. Under commitment, the government first sets the carbon tax, firms subsequently choose R&D (cooperatively or non-cooperatively), and the final stage features Cournot or Bertrand competition, depending on the mode. (If the tax is determined through international climate negotiations with long planning horizons, regulators cannot credibly precommit to its level in subsequent negotiation rounds. Given that green R&D is also long-term, assuming no tax precommitment is therefore reasonable. Fukuda and Ouchida (2020) [
23] further identify two mechanisms underlying this lack of commitment in environmental regulation: first, firms unilaterally invest in green R&D before policy decisions to induce laxer future regulation; second, governments subsequently revise policies in response to firms’ strategic behavior.)
Two firms,
and
, engage in quantity competition in differentiated products, choosing outputs
and
. Each firm faces the following linear demand function:
where
is a market-size parameter. Product differentiation is captured by
. When
, the goods are complements; when
, they are independent; when
, they are imperfect substitutes; and as
, they approach homogeneity.
Let
denote consumption of a numeraire good, defined as follows:
where
denotes the consumer’s budget. (The model adopts a representative consumer rather than explicitly modeling heterogeneous individuals. Equation (3) specifies this consumer’s utility function, a standard device for aggregating preferences in macro- and industrial-organization models. The demand system in Equation (1) is derived from the representative consumer’s utility maximization under the aggregate budget constraint, so that market demand reflects the collective behavior of all consumers. As Caselli and Ventura (2000) [
41] emphasize, the representative-consumer framework does not preclude heterogeneity; it simply imposes enough structure on individual differences that the aggregate behaves as if generated by a single, fictitious consumer.)
The representative consumer’s utility function is given by
Consumer surplus is given by
The firm’s emissions function is specified as follows:
All firms adopt end-of-pipe technologies to reduce emissions, which abate pollution by capturing it at the end of the production process. The R&D spillover parameter is . Firm benefits from the rival’s R&D through a positive externality .
Profit is defined as follows:
where
denotes the carbon tax and
is the marginal cost parameter.
Industry profit is given by
Tax revenue is given by
Emissions generate environmental damage, which is measured as follows:
where
is the environmental damage parameter. (Ouchida and Goto (2016b) [
6] argue that the exogenous environmental damage parameter
should be grounded in findings from environmental epidemiology and public health and may be influenced by population growth, depopulation, and pollutant toxicity. Antelo and Loureiro (2009) [
42] further emphasize that
reflects the regulator’s overall valuation of environmental quality and its preferences regarding the distribution of environmental damage.)
Each firm’s emissions intensity is normalized to one unit per unit of output, and firm
’s R&D effort is denoted by
. An effort level
allows firm
to abate
units of emissions. Firm
benefits from both its own and its rival’s R&D efforts. R&D expenditures are assumed to be quadratic, implying decreasing returns to R&D (following Poyago-Theotoky (1999 [
43], 2007, 2010 [
8]) and Ouchida and Goto (2011, 2014, 2016a,b) [
3,
4,
5,
6], we also assume a quadratic R&D expenditure function). For abatement levels of
and
, R&D cost is given by
where
is the R&D efficiency parameter, with a lower
indicating higher R&D efficiency (the total cost function of each firm is additively separable into production costs and R&D expenditures (Wang, 2020, 2021; Wang and Atallah, 2025a, 2025b) [
10,
11,
12,
13]).
Pollution reduction is defined as follows:
Emissions are defined as follows:
Operational profit is defined as follows:
Joint profit is given by
Welfare is given by
Equations (1)–(15) are formulated under Cournot competition. Under Bertrand competition, Equation (1) is replaced by the following expression, and the model is correspondingly reformulated in terms of prices (see
Supplementary Materials):
The model variables are summarized in
Table 1.
5. Results
This study integrates analytical derivations, graphical analysis, and numerical simulations to examine low-carbon policy in a duopoly with differentiated products, green R&D, and knowledge spillovers, offering a systematic Cournot–Bertrand comparison. Model outcomes are visualized using a comprehensive multidimensional graph, which simultaneously represents multiple variable dimensions, structural relationships, and scenario-specific contrasts within a unified graphical framework. Unlike traditional univariate or two-dimensional plots, this approach employs contour maps, multi-panel layouts, or multidimensional-scaling projections to capture interactions among key parameters, decision variables, and institutional regimes. It reveals global patterns, scenario heterogeneity, cross-variable interaction effects, and policy-relevant insights inherent in high-dimensional economic models, thereby providing a more integrated and intuitive understanding of the underlying mechanisms. Propositions 1–6 compare economic and environmental performance across all scenarios and summarize the main findings.
Using the baseline parameters
,
,
, and
, and allowing
and
, the comprehensive multidimensional graphs (
Figure 1,
Figure 2,
Figure 3,
Figure 4 and
Figure 5) provide a horizontal comparison of R&D, emissions, industry profits, and welfare across the eight scenarios under commitment and non-commitment. The qualitative results are invariant to
and
. In these figures,
is shown on the horizontal axis and
on the vertical axis, while three solid and three dashed contour lines represent Cournot and Bertrand outcomes at low, medium, and high levels.
Table 3,
Table 4,
Table 5,
Table 6,
Table 7,
Table 8,
Table 9 and
Table 10 further report numerical simulations that compare scenario outcomes for selected parameter values.
Proposition 1. (i) In the CCN scenario, the contour ordering satisfies whenever
and
, indicating that lower product substitutability and weaker knowledge spillovers jointly increase equilibrium R&D. In the remaining scenarios, the ordering satisfies whenever
, implying that lower product substitutability alone systematically raises equilibrium R&D. (ii) Cournot and Bertrand contours are nearly collinear in the CNN and BNN scenarios, as well as in the CNC and BNC scenarios. However, Cournot outcomes diverge significantly from Bertrand in the CCN and BCN scenarios, as well as in the CCC and BCC scenarios. (iii) As product substitutability decreases, R&D levels follow the ordering and
.
Proof. The proposition is supported by the closed-form solutions derived in the
Supplementary Materials and is illustrated graphically in
Figure 1, where the contour ordering and cross-scenario rankings are visually evident across the admissible parameter domain. □
In the CCN scenario, lower product substitutability and weaker knowledge spillovers jointly increase equilibrium R&D. In all other scenarios, lower product substitutability alone systematically raises equilibrium R&D. Proposition 1(i) implies that, in the CCN scenario, the regulator precommits to the carbon tax and firms choose R&D non-cooperatively. As a result, the ratchet effect associated with non-commitment is absent (the ratchet effect under non-commitment stimulates R&D investment: when the government sets the carbon tax after firms’ R&D decisions, the tax increases the private returns to an additional unit of R&D, thereby incentivizing firms to invest more in the first stage (Hepburn, 2006; Puller, 2006; Brunner et al., 2012) [
44,
45,
46]), while R&D market failure arising from positive spillovers persists, and both factors jointly lead to lower R&D investment. Consequently, R&D incentives in CCN are strongly shaped by both product substitutability and knowledge spillovers. The intensity of product-market competition is governed by
: a lower
softens competition and increases the marginal return to R&D, while a lower
weakens the information-sharing improvement effect, thereby requiring greater R&D investment (the information-sharing improvement effect enhances R&D efficiency: higher spillovers mitigate information-sharing distortions and improve abatement efficiency, thereby reducing the amount of R&D required (Wang and Atallah, 2025b) [
13]). Equilibrium R&D therefore increases in both directions, i.e., as
and
decrease. As reported in
Table 2, CCN serves as the benchmark scenario. In the remaining scenarios, either policy is set under non-commitment or R&D is coordinated, capturing the ratchet effect of non-commitment or the correction of R&D market failure through cooperation, respectively. In both cases, the effective spillover externality is largely neutralized, rendering equilibrium R&D almost insensitive to
and leaving
as the primary determinant of R&D. This explains why higher
values are systematically located toward the southwest region of the
plane, whereas the other scenarios exhibit nearly vertical contours dominated by a monotonic effect of
alone.
Based on Proposition 1(ii), the near collinearity of the contours for and , as well as for and , reflects the fact that once policy is set under non-commitment—regardless of whether R&D is organized non-cooperatively or cooperatively—the market-competition asymmetry between quantity and price competition is largely neutralized by the ratchet effect associated with R&D-enhancing non-commitment. Consequently, the mapping from to equilibrium R&D becomes almost identical under Cournot and Bertrand competition. By contrast, the divergence between the contours of and , as well as between and , reflects the commitment structure, under which the tax cannot fully adjust to firms’ R&D incentives. This rigidity further suppresses R&D and amplifies the role of the competition mode. Cournot firms, facing weaker price incentives but stronger quantity-based strategic effects, adjust R&D more aggressively in response to changes in and , whereas Bertrand firms respond more weakly.
Proposition 1(iii) establishes the R&D rankings and illustrates how regulatory commitment and R&D organization jointly shape firms’ investment incentives. Two mechanisms enhance R&D under non-commitment with cooperative R&D (
,
): the ratchet effect associated with non-commitment and the joint profit-maximization effect arising from the internalization of free-riding through R&D coordination (firms have an additional incentive to invest in R&D due to the joint profit-maximization effect (Wang and Atallah, 2025b) [
13]). Together, these forces raise the marginal profitability of R&D and generate the strongest incentives, yielding the highest R&D levels. Eliminating cooperation while retaining non-commitment weakens these incentives, producing the intermediate outcomes (
,
). Under commitment, the tax is fixed ex ante and cannot respond to additional R&D. Although free-riding effects are still internalized through R&D coordination, the overall incentive effect is weaker, leading to lower R&D levels (
,
) (R&D cooperation solving the R&D market failure, reducing unnecessary duplication, and promoting market competition (Amir et al., 2008 [
22]; Cellini and Lambertini, 2009) [
47]). Finally, the combination of commitment and non-cooperative R&D (
,
) provides the least support for abatement investment. This logic applies symmetrically under Cournot and Bertrand competition, implying identical R&D rankings across the two competition modes (we adopt the baseline parameters
,
,
, and
, and set
, with
;
and
).
Following Petrakis and Poyago-Theotoky (2002) [
7], the rationale underlying the present framework lies in the presence of three distinct market failures: (i) R&D market failure arising from zero or positive spillovers, which generate strategic overinvestment and underinvestment incentives; (ii) information-sharing market failure, whereby firms are not adequately rewarded for sharing innovations, resulting in suboptimal information diffusion; and (iii) an overproduction market failure associated with pollution-generating activities. Accordingly, the regulator can deploy an optimal mix of low-carbon policy instruments to correct these distortions. A carbon tax combined with improvements in R&D efficiency addresses the environmental externality, while policies promoting R&D cooperation and research joint ventures mitigate R&D and information-sharing market failures.
Proposition 2. (i) In the CCN, BCN, CCC, and BCC scenarios, the contour ordering satisfies
whenever
and
, indicating that higher product substitutability and stronger knowledge spillovers systematically decrease equilibrium emissions. In the remaining scenarios, the ordering satisfies
whenever
, implying that higher product substitutability alone systematically reduces equilibrium emissions. (ii) Cournot and Bertrand contours are nearly colinear in the CNN and BNN scenarios, as well as in the CNC and BNC scenarios, whereas Cournot outcomes diverge markedly from Bertrand when products become either complementary or homogeneous in the CCN and BCN scenarios, as well as in the CCC and BCC scenarios. (iii) As product substitutability increases, emission levels follow the ordering
and
.
Proof. The proposition is supported by the closed-form solutions derived in the
Supplementary Materials and is illustrated graphically in
Figure 2, where the contour ordering and cross-scenario rankings are visually evident across the admissible parameter domain. □
In the CCN, BCN, CCC, and BCC scenarios, higher product substitutability and stronger knowledge spillovers jointly reduce equilibrium emissions. In the remaining scenarios, higher product substitutability alone systematically lowers equilibrium emissions. According to Proposition 2(i), in the CCN, BCN, CCC, and BCC scenarios, the carbon tax is predetermined and cannot adjust to firms’ R&D decisions; as a result, equilibrium emissions respond directly to technological and competitive forces. The market-competition effect implies that higher product substitutability (
) intensifies competition and compresses polluting output, while the information-sharing improvement effect indicates that stronger knowledge spillovers (
) enhance the effectiveness of R&D (the information-sharing improvement effect enhances the efficiency of R&D activities: high spillovers mitigate information-sharing distortions and increase abatement productivity, thereby reducing the amount of R&D required (Wang and Atallah, 2025b) [
13]). Both forces jointly reduce emissions, producing contour maps whose values uniformly decline from the southwest to the northeast. By contrast, in the CNN, BNN, CNC, and BNC scenarios, the carbon tax policy is non-commitment, so improvements in spillover effectiveness are largely offset by policy adjustments or by the internalization of spillover externalities. Consequently, emissions become almost insensitive to
and depend primarily on the market-competition effect captured by
, yielding nearly vertical contours and a monotonic decline in emissions with respect to
alone.
Proposition 2(ii) indicates that in the non-commitment scenarios of CNN, BNN, CNC, and BNC, the carbon tax adjusts endogenously to firms’ R&D decisions, rendering the effective strategic environment under Cournot and Bertrand almost identical. The endogenous tax absorbs most differences in strategic substitutability between price and quantity competition. Consequently, the marginal emissions response to () is driven mainly by the market-competition effect rather than by the mode of competition. This explains why the Cournot and Bertrand contours in these panels are nearly colinear. By contrast, in the commitment scenarios of CCN, BCN, CCC, and BCC, the tax is fixed ex ante and cannot offset strategic differences between Cournot and Bertrand. As products approach homogeneity or complementarity, the market-competition effect becomes dominant, and Cournot outcomes diverge markedly from Bertrand. Because the regulator no longer neutralizes these competitive effects, emissions become substantially more sensitive to whether firms compete in prices or quantities. This generates the pronounced Cournot–Bertrand deviations observed in the CCN/BCN and CCC/BCC panels. In sum, Cournot–Bertrand differences are negligible under non-commitment but become significant under commitment.
Proposition 2(iii) summarizes the rankings: under commitment with cooperative R&D of CCC and BCC, firms internalize free-riding effects while facing a fixed, pre-announced carbon tax. As product substitutability increases, the combined market-competition effect and R&D coordination effect compress the gap between total output and abatement, thereby reducing emissions. Consequently,
and
attain the lowest emission levels.
Table 3 confirms that as product substitutability increases, emissions follow the ordering
and
. If
rises to 0.8, the same ranking applies.
Proposition 3. (i) The contour ordering satisfies
whenever
and
, indicating that lower product substitutability and stronger knowledge spillovers systematically increase equilibrium industry profits. (ii) As product substitutability decreases, Cournot outcomes diverge substantially from those under Bertrand competition in the CNN and BNN scenarios, as well as in the CNC and BNC scenarios. By contrast, in the CCN and BCN scenarios and in the CCC and BCC scenarios, such divergence becomes pronounced only when lower product substitutability is accompanied by stronger knowledge spillovers. (iii) As product substitutability decreases, industry profits follow the ordering
and
.
Proof. The proposition is supported by the closed-form solutions derived in
Supplementary Materials and is illustrated graphically in
Figure 3, where the contour ordering and cross-scenario rankings are visually evident across the admissible parameter domain. □
Lower product substitutability and stronger knowledge spillovers systematically increase equilibrium industry profits. Proposition 3(i) demonstrates that, across all scenarios, the contour ordering indicates that lower product substitutability weakens competitive pressure, allowing firms to appropriate a larger share of surplus and thereby increasing operational profits. At the same time, stronger knowledge spillovers generate an information-sharing improvement effect that amplifies the effective return to R&D by enabling firms to benefit more from their rivals’ innovation efforts, enhancing abatement and joint profitability. When these two forces operate jointly, higher operational profits translate into increased equilibrium industry profits.
Table 4 and
Table 5 show that, as parameters shift from
and
to
and
, all scenarios experience higher operational and industry profits. For example,
increases from 3318.18 to 5609.92, while
rises from 5396.28 to 7559.82.
Proposition 3(ii) reveals that the divergence between Cournot and Bertrand outcomes as product substitutability decreases can be understood by decomposing industry profits into operational profits, abatement costs, and R&D expenditures. First, the market-competition effect implies that lower substitutability relaxes product-market competition and increases operational profits under both competition modes. In the CNN and BNN scenarios, as well as in the CNC and BNC scenarios, weaker competition raises equilibrium R&D (as established in Proposition 1(i)), which increases R&D expenditures. (
Table 4 and
Table 5 show that as the parameter shifts from
to
, all scenarios exhibit higher R&D expenditures. For example, in
Table 5,
increases from 2522.67 to 5423.03.) Under Cournot competition, the profit-enhancing effect of higher operational profits dominates the profit-reducing effects of higher abatement costs and R&D expenditures, whereas under Bertrand these effects are weaker; consequently, the profit gap widens as substitutability falls. (
Table 4 shows that as the parameter shifts from
to
,
and
exhibit larger increases in industry profits, operational profits, abatement costs, and R&D expenditures. The profit increase for
is 1208.31, and that for
is 1463.78, both exceeding the corresponding increases for
(653.32) and
(675.74).) Second, in these non-commitment scenarios, the information-sharing improvement effect of knowledge spillovers is largely offset by the ratchet effect associated with non-commitment, rendering spillovers relatively unimportant and making declining substitutability alone sufficient to generate Cournot–Bertrand divergence. Third, in the CCN and BCN scenarios and in the CCC and BCC scenarios, the absence of the ratchet effect implies that the market-competition effect and the information-sharing improvement effect jointly operate, so pronounced divergence arises only when lower product substitutability is combined with stronger knowledge spillovers (
Table 4 and
Table 5 show that as the parameters shift from
and
to
and
,
increases from 3318.18 to 5609.92,
rises from 3499.61 to 6874.36,
increases from 3296.72 to 6321.76, and
increases from 3458.01 to 7685.65).
Table 4 and
Table 5 further confirm Proposition 3(iii): as product substitutability decreases, industry profits follow the ordering
and
.
Proposition 4. (i) The contour ordering satisfies
whenever
and
, indicating that lower product substitutability and stronger knowledge spillovers systematically increase equilibrium welfare. (ii) Cournot and Bertrand contours are nearly colinear in the CNN and BNN scenarios, as well as in the CNC and BNC scenarios. By contrast, Cournot outcomes diverge markedly from Bertrand when products become either complementary or homogeneous, as observed in the CCN and BCN scenarios and in the CCC and BCC scenarios. (iii) As product substitutability decreases, welfare follows the ordering
and .
Proof. The proposition is supported by the closed-form solutions derived in the
Supplementary Materials and is illustrated graphically in
Figure 4, where the contour ordering and cross-scenario rankings are visually evident across the admissible parameter domain. □
Lower product substitutability and stronger knowledge spillovers systematically increase equilibrium welfare. The preceding discussion provides the foundation for interpreting Proposition 4. As indicated by the welfare function , welfare outcomes reflect the joint influence of the market-competition effect associated with lower product substitutability, the information-sharing improvement effect arising from stronger knowledge spillovers, the ratchet effect under non-commitment, and the R&D coordination effect stemming from the internalization of free-riding. These mechanisms jointly affect the four components of welfare and are reflected in the contour ordering.
Based on
Table 6 and
Table 7, several results emerge. (1) For a given level of spillovers (
or
), welfare increases in all scenarios as
decreases from −0.2 to −0.8. The welfare-enhancing effects of higher industry profits, stemming from greater market power, as established in Proposition 3, and higher tax revenues, driven by increased emissions, outweigh the welfare-reducing effects of lower consumer surplus due to firms’ market power and increased environmental damage associated with higher emissions, as established in Proposition 2. (2) Compared with the lower spillover case (
), a higher level of spillovers (
) amplifies the information-sharing improvement effect, generating larger welfare gains across all scenarios as
decreases from −0.2 to −0.8. For example, when
, the welfare gain for
is 1027.13, whereas it increases to 1605.30 when
. (3) For each scenario, when product substitutability is lower and knowledge spillovers are stronger, i.e., as parameters shift from
,
to
,
, welfare attains its largest increase. For example,
rises from 3780.53 to 6340.25, yielding a welfare gain of 2559.72. This reflects the joint welfare-enhancing effects of the market-competition effect associated with lower product substitutability and the information-sharing improvement effect associated with stronger knowledge spillovers. (4) When the environment features lower product substitutability and stronger knowledge spillovers, i.e., as parameters shift from
,
to
,
, all scenarios attain their largest welfare gains. Among them, non-commitment generates additional welfare improvements through the ratchet effect: for example, the welfare gain of
(4481.80) exceeds that of
(2559.72). (5) Building on the preceding effects and further incorporating the R&D coordination effect arising from the internalization of free-riding market failure, welfare gains are maximized in the
and
scenarios. Specifically,
achieves a welfare gain of 5750.86, while
attains a gain of 5671.75. (6) When all four effects are present, the Cournot outcome yields the largest welfare gain, namely
.
Table 6 and
Table 7 further confirm Proposition 4 (ii) and (iii). (1) Cournot and Bertrand welfare contours are nearly colinear in the CNN and BNN scenarios, as well as in the CNC and BNC scenarios. For example, when
and
, welfare levels are similar, with
and
. By contrast, Cournot outcomes diverge markedly from Bertrand when products become either complementary or homogeneous, as observed in the CCN and BCN scenarios and in the CCC and BCC scenarios. Under the same parameter configuration (
and
), welfare differs substantially, with
and
. These differences stem from variations in welfare components—consumer surplus, industry profits, tax revenues, and environmental damage—between CCN and BCN. Taken together, these patterns indicate that, relative to commitment, the ratchet effect under non-commitment shrinks Cournot–Bertrand welfare differences by reshaping firms’ strategic responses through endogenous policy adjustment. (2) As product substitutability decreases, welfare follows the ordering
and
.
Proposition 5. (i) As R&D efficiency decreases, the contour orderings of R&D, emissions, industry profits, and welfare remain qualitatively similar; however, equilibrium R&D, industry profits, and welfare decline, while emissions increase. (ii) As environmental damage increases, the contour orderings of R&D, emissions, industry profits, and welfare remain qualitatively similar; however, equilibrium R&D increases, while emissions, industry profits, and welfare decline.
Proof. The proposition is supported by the closed-form solutions derived in
Supplementary Materials and is illustrated graphically in
Figure 5, where the contour ordering and cross-scenario rankings are visually evident across the admissible parameter domain. Figures for the remaining variables are omitted for brevity. □
As stated in Proposition 5(i), a reduction in R&D efficiency proportionally lowers the return on R&D investment across all scenarios, prompting firms to optimally scale back R&D, which directly leads to higher equilibrium emissions. This decline in R&D reduces industry profits due to increased emission costs and carbon taxes, while also raising environmental damages, ultimately lowering overall welfare. Since R&D efficiency affects all policy regimes, competition modes, and cooperation structures symmetrically, it shifts all equilibria in the same direction without altering the underlying strategic trade-offs. As a result, the qualitative contour orderings of R&D, emissions, industry profits, and welfare remain unchanged, though their levels move monotonically.
Table 8 presents these results.
Proposition 5(ii) further shows that, as environmental damage intensifies, the strategic trade-offs between R&D investment, emissions, industry profits, and welfare remain consistent in terms of their qualitative relationships. Specifically, firms respond to increased environmental damage by adjusting their R&D investments to mitigate pollution. However, this heightened R&D effort incurs costs. Consequently, while R&D increases in response to greater environmental damage (as firms innovate to reduce emissions), overall emissions, industry profits, and welfare decline. The welfare reduction is driven by the negative effects of lower consumer surplus, industry profits, and tax revenue, which outweigh the welfare gains from reduced environmental damage.
Table 9 presents these results.
Proposition 6. In the Cournot model, compared to Bertrand, there exists a special case where the carbon tax can be negative if the goods are highly complementary and knowledge spillovers are extremely high.
Table 10 shows that when
and
, indicating highly complementary goods and extremely high knowledge spillovers, the CNC scenario generates a negative carbon tax (i.e., an emissions subsidy) compared to the BNC scenario.
The trends in the carbon tax are driven by two opposing effects. The first is a pollution-reducing effect, which leads to an increase in the carbon tax as the government raises it to control pollution, resulting in lower overall pollution. The second is a production-increasing effect, which reduces the carbon tax. Since a carbon tax raises a firm’s emission expenditures, firms reduce output to minimize these costs; however, the welfare equation indicates that the government desires higher output (
). Therefore, by lowering the carbon tax, the government can incentivize firms to increase production (Wang, 2020, 2021) [
10,
11].
As products become highly complementary in a high knowledge spillover environment, under Cournot’s CNC scenario, the production-increasing effect dominates the pollution-reducing effect. This outcome results from the combined influence of the market-competition effect due to lower product substitutability (), the information-sharing improvement effect from stronger knowledge spillovers (), the ratchet effect under non-commitment (CNC), and the R&D coordination effect from the internalization of free-riding (CNC). As a result, the carbon tax () becomes negative (i.e., an emissions subsidy), while , , , , and all increase.
Furthermore, this outcome supports Poyago-Theotoky (2010) [
8] by demonstrating that the carbon tax rate can be negative in equilibrium if the environmental damage is relatively insignificant and R&D efficiency is relatively high. In such cases, an emissions subsidy partially corrects the inefficiency caused by firms’ market power, which can be seen as an efficiency improvement effect associated with the subsidy.
In summary, the following conclusions are drawn. First, in the CCN scenario, lower product substitutability and weaker knowledge spillovers jointly increase equilibrium R&D. In other scenarios, lower product substitutability alone raises equilibrium R&D. Cournot and Bertrand contours are nearly collinear in the CNN and BNN scenarios, as well as in the CNC and BNC scenarios. However, Cournot outcomes diverge significantly from Bertrand in the CCN, BCN, CCC, and BCC scenarios. As product substitutability decreases, R&D levels under CNC and BNC reach the first-best outcomes. Second, in the CCN, BCN, CCC, and BCC scenarios, higher product substitutability and stronger knowledge spillovers systematically reduce equilibrium emissions. In other scenarios, higher product substitutability alone lowers emissions. Cournot and Bertrand contours are nearly collinear in the CNN and BNN scenarios, as well as in the CNC and BNC scenarios. However, they diverge when products become complementary or homogeneous in the CCN, BCN, CCC, and BCC scenarios. As product substitutability increases, emissions under CCC and BCC achieve the first-best outcomes.
Third, lower product substitutability and stronger knowledge spillovers systematically increase equilibrium industry profits. As product substitutability decreases, Cournot outcomes diverge more from Bertrand in the CNN, BNN, CNC, and BNC scenarios. This divergence becomes more pronounced in the CCN, BCN, CCC, and BCC scenarios when lower substitutability is paired with stronger knowledge spillovers. As product substitutability decreases, industry profits under CNC and BNC reach the first-best outcomes. Fourth, lower product substitutability and stronger knowledge spillovers increase equilibrium welfare. Cournot and Bertrand contours are nearly collinear in the CNN, BNN, CNC, and BNC scenarios but diverge when products become complementary or homogeneous in the CCN, BCN, CCC, and BCC scenarios. As product substitutability decreases, welfare under CNC and BNC reaches the first-best outcomes. Fifth, as R&D efficiency decreases, the contour orderings of R&D, emissions, industry profits, and welfare remain qualitatively similar. However, equilibrium R&D, industry profits, and welfare decline, while emissions increase. As environmental damage increases, the contour orderings remain similar, but R&D increases, while emissions, industry profits, and welfare decline. Sixth, in the Cournot model, compared to Bertrand, there is a special case where the carbon tax can be negative if goods are highly complementary and knowledge spillovers are extremely high.