Next Article in Journal
Mapping Urban Green and Its Ecosystem Services at Microscale—A Methodological Approach for Climate Adaptation and Biodiversity
Next Article in Special Issue
Towards Sustainable Development: A Study of Cross-Regional Collaborative Carbon Emission Reduction in China
Previous Article in Journal
Farmers’ Demand for Climate Information Services: A Systematic Review
Previous Article in Special Issue
Economic Growth, CO2 Emissions Quota and Optimal Allocation under Uncertainty
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Optimal Environmental Policy in a Dynamic Transboundary Pollution Game: Emission Standards, Taxes, and Permit Trading

1
School of Economics and Management, Southwest Petroleum University, Chengdu 610500, China
2
School of Economics and Management, Southwest Jiaotong University, Chengdu 610031, China
*
Author to whom correspondence should be addressed.
Sustainability 2022, 14(15), 9028; https://doi.org/10.3390/su14159028
Submission received: 28 June 2022 / Revised: 20 July 2022 / Accepted: 21 July 2022 / Published: 23 July 2022

Abstract

:
Global environmental problems such as transboundary pollution and global warming have been recognized as major issues around the world. In practice, governments of all countries are actively exploring various environmental policies to control pollution. The government needs to consider the impact of neighboring regions when formulating environmental policies, especially in the context of transboundary pollution. However, the above problems are less studied, to bridge this gap and aim at solving problems in existing practices, we consider a differential game model of transboundary pollution control to examine which policy is more effective in promoting environmental quality and social welfare in a dynamic and accumulative global pollution context. Three alternative policy instruments, namely emission standards, emission taxes, and emission permit trading, are considered and compared. The results show that the social welfare of each region is the lowest and the total pollution stock is the highest under the emission tax policy due to the “rent-shifting,” “policy-leakage,” and “free-riding” effects. Moreover, the realized level of the environmental policy in the Nash equilibrium of the policy game is distorted away from the socially optimal level. The emission standards policy is found to be better than the emission tax policy and characterized by initiating the rent-shifting effect without the policy-leakage effect. Moreover, the pollution stock of two regions is found to be the lowest and the social welfare is found to be the highest under the emission permit trading policy, which is not associated with any of the three effects. Finally, a numerical example is used to illustrate the results, and a sensitivity analysis is performed in the steady state.

1. Introduction

According to the report of the global Environmental Performance Index (EPI) released by Yale University and Columbia University in 2018 ([1]), the world is still far from achieving international environmental goals; the EPI score was reported to be only 46.16. This score reflects the current environmental condition and the implementation effects of environmental policies. The EPI reveals that the environmental quality of the world is improving as compared with the last few decades, but pollution remains severe, especially in developing countries and regions. Environmental pollution originates from the excessive discharge of pollutants by industrial firms. However, due to the pollutant attributes of externality and public goods, firms are reluctant to reduce their emissions or invest in emission abatement activities, resulting in market failure. Therefore, governments have adopted command-and-control or market-based environmental policies, such as emission standards, taxes, permit trading, etc., to govern the environment. As the most important means of pollution control, the above environmental policies have received extensive attention from academia and government [2,3,4]. Emission standards set quantitative limits on the permissible amount of specific air pollutants that may be released from specific sources over specific timeframes. As a command-and-control policy, if the pollutant emissions of enterprises exceed this standard, they will face serious environmental penalties [5]. Both emission taxes and emission permit trading are market-based environmental policies. The most significant difference between the two is the uncertainty of emission reduction and emission permit price. Emission tax is a price control policy, the tax rate is set by the policy makers, and the emission reduction is determined by the market. While the emission permit trading is a quantitative policy, the total emission is set by the policy maker, and the trading price is determined by the market mechanism. When the permit price or total emission is set at the place where the marginal abatement cost is equal to the marginal abatement income, the emission tax and emission permit trading can implement the same outcome under the completely competitive market conditions. Scholars have carried out long-term and systematic research on the comparison of the above policies. However, uncertainties in reality make all kinds of policies unable to achieve their theoretical optimal effects, and all kinds of policies need to be reasonably selected according to the specific environment [2,6].
What is more, a typical feature of pollution is that pollutants are released from one region and then migrate to another; this is called transboundary pollution, which often occurs in two or more neighboring regions. Many or all regions generate emissions, and many or all are also suffering from them. A celebrated example is the emission of greenhouse gases (GHG), sulfur dioxide (SO2), and ozone, which cause global warming and environmental degradation [7]. Transboundary pollution not only destroys the ecological environment, but also endangers human health. Outdoor air pollution caused by PM2.5, SO2 and ozone leads to premature death for more than 3 million people worldwide every year [8]. Therefore, in view of the impact of neighboring regions, when the government regulates the environmental policies of local enterprises, it should not only consider the impact of local environmental pollution, but also consider the impact of the behavior of the government and enterprises in adjacent areas, which increases the difficulty of decision-making and may cause new changes in the optimal choice of environmental policies [9]. Most related studies conducted previously were limited to the optimal environmental policy of either local or domestic governments, and some scholars compared the emission taxes, standards, and permit trading policies under the conditions of different market situations and implementation environments [10,11,12]. However, these studies did not compare the environmental and social welfare effects of the three environmental policies, particularly the conditions of the optimal policy choice for the transboundary pollution problem, which is the main objective of the present study. The consideration of transboundary pollution is unique; for two or more adjacent regions, each region suffers not only from the damage caused by the pollution stock generated by local firms, but also from the pollution stock generated by firms in the neighboring region [13,14]. Therefore, the government of each region will also be affected by the neighboring region when implementing environmental policies.
Two adjacent regions are considered in this study. Each region has a representative firm that competes with the representative firm of the other region and sells homogeneous goods, and the firm in each region will emit pollutants during the production process. Consumers in the two regions can also freely purchase the products produced by the firms in the two regions. The goal of each government is to maximize the local social welfare under the constraint of the total pollution stock of the two regions. Moreover, the two regions are considered to be suffering from the same environmental damage as a result of the total emissions. Thus, differential game models of the optimal choice problem are first formulated in the following three scenarios: emission standards, emission taxes, and emission permit trading. The feedback Nash equilibrium solutions of the total pollution stock and the social welfare of each region are then derived, and the three policies are compared. The results are illustrated with a numerical example.
The main contributions of this paper include the following: (1) Most of the existing studies are limited to the static situation and focus on the relationship between a single government and enterprises. This paper studies the optimal choice of government environmental policy under the dynamic change of transboundary pollution and pollution capacity; this is the main expansion of and improvement on the existing research; (2) emission standards, emission taxes and emission permit trading were analyzed in our paper, which not only systematically compares the differences in environmental quality of emission standards, emission taxes and emission permits trading environmental policies, but also compares the impact on social welfare in each region, and analyzes the reasons for the differences. It has important practical significance for the control of transboundary pollution.
The main structure of this paper is as follows: Section 1 of the paper introduces the research background and describes the purpose of the work and its significance. Section 2 summarizes the relevant literature and presents the contributions of scholars in this field and the research innovation of this paper. Section 3 provides the game formulation between the government and firms under the condition of transboundary pollution. Subsequently, Section 4 introduces the following scenarios: (i) the emission standards policy in Section 4.1; (ii) the emission tax policy in Section 4.2; (iii) the emission permit trading policy in Section 4.3. Section 4.4 compares two aspects of the three environmental policies, namely the total pollution stock and the social welfare benefits for the government in each region. In Section 5, the results are illustrated with a numerical example. Finally, Section 6 concludes this study with a brief summary and suggests future research directions.

2. Literature Review

The predominant strategy used to reduce pollution is environmental policy. Thus, topics related to environmental policy and the comparison of different policies have frequently been examined in the field of environmental economics and management [15,16,17,18,19]. These investigations have tried to establish which type of regulation is desirable in terms of social welfare or environmental quality. Requate [10] first compared the tax and permit policies for an asymmetric quantity duopoly setting. In general, the two policies have no significant difference and neither taxes nor permits result in the social optimum due to imperfect competition; however, for a considerable range of parameters, the permit policy is better than taxes in terms of social welfare. Masoudi and Zaccour [17] compared price-based (taxes) and quantity-based (quotas) environmental policies. One significant distinction of this study is that the authors introduced market uncertainty, and the regulator was considered to have Bayesian learning curve characteristics. The results showed that the emission levels under the tax policy and quota policy were the same, but from the perspective of social welfare, the tax policy was found to be better than the quota policy, which is due to the impact of uncertainty. Feenstra et al. [20] analyzed the effects of emission taxes and standards on environmental and abatement investment in the context of an international duopoly; they proved that among the feedback strategies, emission standards are better than emission taxes, as taxes induce the substitution between capital and pollution input. Hoel and Karp [21] compared the environmental effects of tax and quota policies. When the damages suffered by the government depend on the flow of pollution, the effects of the two policies depend on the discount rate and the environmental decay rate; a higher discount rate and environmental decay rate increase the likelihood of the government’s preference of taxes. Lee and Park [22] demonstrated that the differentiation of the product in a duopoly can play a significant role in the policy choice of the government; it was found that when the product difference between firms is large, the consumer surplus under the condition of the emission permit trading policy is lower than that under the condition of the command-and-control policy. Kato [11] considered the environmental policies of taxes and quotas set by the government in a mixed duopoly under both uniform and differentiated situations; it was found that social welfare is the greatest under differentiated emission quotas. Gacia et al. [23] compared the differences between the emission tax and emission permit trading policies under the credible commitment and non-commitment policies of the government. The authors established a Cournot model, and found that the effects of the two policies are the same under the government’s commitment to implement a certain environmental policy, but the tax policy causes less environmental damage under the non-commitment policy. Feichtinger et al. [24] studied the environmental externalities caused by enterprise production and emissions on the basis of oligopoly differential game, and supervised them simultaneously through an emission tax and emission cap policy. The results showed that the green research and development investment curve of enterprises showed an inverted U shape over time and was closely related to the government’s environmental policy. Moner-Colonques and Rubio [25] examined the effects that the strategic use of environmental innovation has on environmental policy (taxes and standard) and its welfare implications in a duopoly. The results showed that the strategic behavior of firms is welfare-improving and may induce more environmental innovation than under regulatory commitment only when a tax is used to control pollution and the convexity of investment costs is relatively more important than that of environmental damages.
However, these studies only considered domestic regions, and ignored the impact of pollution emissions from neighboring regions or countries. Therefore, some scholars have investigated the environmental policy issues among two or multiple regions. For example, Ulph [26] considered the choice of environmental policy instruments (taxes or standards) in the context of a model of strategic international trade between countries. The results revealed that if trade is modeled as a one-off Cournot equilibrium, then there is no difference between the two policies for all countries. However, if trade is modeled as a Stackelberg game, then both countries have higher producer surpluses under the emission standard policy. Lai and Hu [27] investigated the import tariff and tax policy instruments between two countries, each of which has one firm producing a differentiated good. The main results indicated that the two countries should subsidize the imported dirty goods in the presence of strong transboundary pollution. On the other hand, if the transboundary pollution is relatively weak, then the countries should set positive tariffs. Glachant et al. [28] proposes a partial equilibrium model with imperfect competition in both the North and South regional and international polluting product markets. They analyzed the impact on technology transfer when the North and South governments set emission quotas non-cooperatively or cooperatively.
A differential game is an effective tool with which to study environmental pollution control problems and analyze the interactions between the strategic behaviors of participants [29]. The dynamic game of transboundary pollution was considered by Dockner and Van Long [30], Jorgensen and Zaccour [31,32], Breton et al. [33], Li [34], Bertinelli et al. [35], and El Ouardighi et al. [36]. Considering the characteristics of transboundary pollution involving two neighboring countries, Dockner and Van Long [30] characterized cooperative and non-cooperative pollution control strategies of the governments of the two countries that maximize the discounted stream of net benefits of a representative consumer, he found that the Markov-perfect equilibrium can be achieved when governments use non-linear strategies. Li [34] established a differential game of the transboundary policy that allows emission permit trading to be carried out between asymmetric regions. The paper mainly found that the optimal cooperative emission rates are lower than the optimal non-cooperative emission rates and the optimal cooperative quantity of purchased/sold emission permits is lower than the optimal non-cooperative quantity of purchased/sold emission permits. Yeung and Petrosyan [37] first presented a cooperative stochastic differential game of transboundary industrial pollution with two features. One is that the pollution can be divided into short- and long-term impacts, another feature is that they explored a payment distribution mechanism. Xu and Tan [38] considered the abatement technology licensing between governments in two adjacent regions; in contrast, the current work considers the optimal environmental policy between the government and firms in two adjacent regions. Marsiglio and Masoudi [39] constructed a transboundary pollution control problem in a two-country differential game, they suggest that a universally homogeneous environmental tax may not be either desirable or optimal in spite of the non-cooperative and the cooperative solutions. Li and Guo [40] developed a dynamic decision model of transboundary basin pollution with emission permits trading and pollution abatement investment. De Frutos et al. [41] used a differential game model to compare the equilibrium trajectories of the stocks of pollution and cleaner technology as well the regions’ welfare. De Frutos and Martín-Herrán [42] analyzed a transboundary pollution differential game and introduced a spatial dimension to capture the geographical relationships among regions. Yanase [43] examined the effects of environmental policy on the total global environment as an international public good with a stock externality, and found that the emission tax game brings about larger strategic distortions than the emission permit game in the absence of global cooperation. Yanase [44] also considered the market structure of enterprises in two countries competing in a third country, and the author analyzed the impacts of emission tax and emission standard policies adopted by the governments of the two countries on the total pollution stock. The results revealed that the emission standards policy is better than the emission tax policy in terms of both the environment and social welfare. Yanase and Kamei [45] develop a two-country differential game model of transboundary pollution control with a continuum of polluting and oligopolistic industries. Governments choose the path of their emission permits and the price of the emission permits is determined by the market-clearing mechanism.
Although the academic community has carried out a number of studies on environmental policy and pollution control, there is still room for further expansion, as follows: (1) in the past, scholars have mainly studied the current situation and problems of environmental policy, but few scholars have studied the optimal environmental policy under transboundary pollution. This research expands and improves upon the existing research; (2) the existing studies did not explicitly consider inter-regional trade in goods or resources that cause pollution; and (3) discussions on the evaluation of alternative environmental policies, such as emission taxes, emission standards, or marketable emission permits, are absent from the literature. These characteristics reflect the main variations between the present study and previous research.

3. Parameter Description, Assumptions and Methods

3.1. Parameter Description and Assumptions

For convenience, the main parameters of this paper are described in Table 1.
A transboundary pollution model comprising two adjacent regions or countries, namely region i and region j , is considered. In each region, consumers purchase the same homogeneous product in the common market formed by the two regions. According to Glachant et al. [28], the demand function of consumers in each region is assumed to be χ m = α m p 2 , m = i ,   j , where χ m represents the quantity of goods purchased by consumers in region i or region j , α m is a positive constant parameter measuring the reservation price (alternatively, it is a measure of the market size), and p is the price of the product, which is the same in both regions. Accordingly, the consumer surplus in each region can be calculated as follows.
C S m = p 2 α m ( α m p 2 ) d p = ( 2 α m p ) 2 4
According to the consumer demand function, the inverse demand function of products in the common market can be obtained as follows.
p ( c i + c j ) = α i + α j χ i χ j
The product is produced by two firms, one in each region, which compete “à la Cournot.” In the usual manner, it is assumed that the market size in each region is the same, so α i = α j = α . Production generates pollution, and, for simplicity, each unit of output for the firm at time t is considered to create one unit of pollutant, q i ( t ) or q j ( t ) [6]. The firm can reduce emissions by end-of-pipe management, and each firm located in region i or region j is supposed to have a quadratic total abatement cost of γ m 2 r m 2 ( t ) , m = i ,   j , where r m ( t ) is the total pollution abatement level for each firm at time t , and γ m is the cost coefficient of emission reduction. The total cost of firm m is additively separable and given by c q m + γ m 2 r m 2 ( t ) , where c is the unit cost of production. The proximity of the two regions leads to the cross-flow of pollutants emitted by the firms, and the focus of this paper is the stock of pollutants, which evolves according to the following differential equation [46]:
x ˙ ( t ) = q i ( t ) + q j ( t ) r i ( t ) r j ( t ) δ x ( t ) ,   x ( 0 ) = x 0 ,   x ( t ) 0
where x ( t ) represents the pollution stock and δ > 0 is the natural decay rate of pollution. An advantage of the application of a differential game to problems of pollution is the opportunity to model damage caused by the stock of accumulated pollution [46,47]. According to Masoudi and Zaccour [17], Breton [33], and Menezes and Pereira [46], the damage caused by the stock of pollution for region m at time t can be measured by d m x ( t ) , where d m > 0 is the damage parameter. The damage d m x ( t ) means that each region suffers not only from the damage caused by the emissions of its own firm, but also from the damage caused by the emissions of the firm in the neighboring region. The objective of the firm in each region is to choose its output and reductions to maximize its profits. On the other hand, to reduce the environmental damage caused by the emissions of the firm, the government in each region chooses the optimal policy with the aim of maximizing social welfare, which is defined as the sum of the consumer surplus and producer surplus minus the environmental damages subject to differential Equation (3). Three alternative policy regimes are considered, namely emission standards, emission taxes, and emission permit trading. It is assumed that the government has more information about the total environmental quality and its impact on local welfare than do the firms. Specifically, the pollution stock and environmental damage of each region in Equation (3) can be observed by the governments, but it cannot be judged by the firms. Therefore, the firm cannot predict policy information, such as the emission standards, emission tax rates, or initial emission permits set by the government. These assumptions indicate that the local government can obtain information about the firm’s feedback strategy and the damage caused by the total pollution in the two regions to formulate the optimal temporal path under each environmental policy to maximize social welfare.

3.2. Methods

3.2.1. Differential Game

Differential games have distinct advantages in order to represent the interdependencies among time, strategic behavior and participants in mathematical models in the fields of environmental economics and optimal pollution control. Firstly, an advantage of differential games in applications to problems of pollution is the opportunity to model damage caused by the stock of accumulated pollution (Jørgensen et al., 2010). Therefore, one of the key assumptions of this paper is that emissions by either region contribute to the stock of pollution and the two regions face the same pollution stock. Secondly, the strategy structure of the differential game solution reflects the interaction of the behaviors between the participants. By establishing the Hamilton–Jacobi–Bellman equation (HJB), the Markov-perfect Nash equilibrium (MPNE) is obtained. Under this equilibrium, the participants not only consider the dynamic changes of state variables, but also adjust their own strategies based on the decision-making choices of other participants, and finally achieve time consistency and subgame perfection. However, for multiple state equations or non-linear situations, the differential game cannot achieve its analytical solution, meaning that numerical analysis can be used to verify the results, which is its limitation

3.2.2. Stackelberg Game

The Stackelberg leadership model is a strategic game in economics in which the leader firm moves first and then the follower firms move sequentially. In our paper, it is assumed that the government has more information about the total environmental quality and its impact on local welfare than do the firms. Specifically, the pollution stock and environmental damage of each region in Equation (3) can be observed by the governments (Leaders), but it cannot be judged by the firms (Followers). Therefore, the firm cannot predict policy information, such as the emission standards, emission tax rates, or initial emission permits set by the government. These assumptions indicate that the local government can obtain information about the firm’s feedback strategy and the damage caused by the total pollution in the two regions to formulate the optimal temporal path under each environmental policy to maximize social welfare. In particular, a three-stage Stackelberg game is considered in which: (1) the government in each region sets the standards/taxes/quotas to maximize social welfare, (2) firms determine the abatement level, and (3) market competition occurs (firms determine output). We can use backward induction to solve this problem and guarantee the equilibrium solution concept is in perfect subgame equilibrium [48,49].

4. Dynamic Games of Environmental Policies

4.1. Emission Standards (Scenario S)

Under the policy of emission standards, we consider the case in which the government in each region pre-commits to the level of environmental policy. In particular, a three-stage game is considered in which: (1) the government in each region sets the standards to maximize social welfare, (2) firms determine the abatement level, and (3) market competition occurs (firms determine output). The equilibrium solution concept is in perfect subgame equilibrium with backward induction. The output of the firm in region m is the function of the standard and abatement level: q m ( t ) = θ m ( t ) + r m ( t ) , where θ m ( t ) is the standard set by the local government in each region at time t . According to the inverse demand function and market clearing condition χ i + χ j = q i + q j , p = 2 a q i q j can be obtained. Then, the profit of each firm in region i and region j at time t can be expressed as follows.
  π i ( t ) q i , r i = [ α + α q i ( t ) q j ( t ) ] q i ( t ) c q i ( t ) γ i 2 r i 2 ( t )
  π j ( t ) q j , r j = [ α + α q i ( t ) q j ( t ) ] q j ( t ) c q j ( t ) γ j 2 r j 2 ( t )
In the last stage, namely the market competition stage, the firm in each region chooses its output to maximize Equations (4) and (5);   π m ( t ) is concave in q m ( t ) because 2 π m ( t ) / q m 2 ( t ) = 2 < 0 ,   m = i , j . The optimal outputs q i ( t ) and q j ( t ) can then be obtained as follows (the subscript S represents the emission standard situation, and the time t is omitted for simplicity):
q i S ( θ i , θ j ) = 2 ( a + γ i θ i ) + γ j ( 2 a + γ i θ i θ j ) 3 + 2 γ i + 2 γ j + γ i γ j ,    q j S ( θ i , θ j ) = 2 ( a + γ j θ j ) + γ i ( a + γ j θ j ) 3 + 2 γ i + 2 γ j + γ i γ j
where, for notational simplicity, a = α c is set, and a measures the market size. In the second stage, the firm in each region determines its abatement level to maximize its payoffs;   π m ( t ) is concave in q m ( t ) because 2 π m ( t ) / r m 2 ( t ) = γ i < 0 ,   m = i , j . Solving these problems yields the equilibrium abatement levels, as follows.
r i S ( θ i , θ j ) = 2 a 3 θ i + γ j ( 2 a 2 θ i θ j ) 3 + 2 γ i + 2 γ j + γ i γ j ,    r j S ( θ i , θ j ) = 2 a 3 θ j + γ i ( 2 a 2 θ j θ i ) 3 + 2 γ i + 2 γ j + γ i γ j
The condition to ensure that the results are non-negative is a > max { ( 2 θ i + θ j ) / 2 ,   ( 2 θ j + θ i ) / 2 } . The output and abatement level of each firm depend on the allowable emissions in two regions; q i S ( θ i ,   θ j ) / θ i > 0 , q j S ( θ i ,   θ j ) / θ i < 0 , which means that when the emission standard level in region i is more restrictive (i.e., a decrease in θ i ), the output of firm in region i will decrease correspondingly, while the output of the firm in region j will increase. This phenomenon is called the “rent-shifting” effect [41,50], i.e., local strict environmental policies will reduce the output of the local firm, but will simultaneously increase the output of the firm in the neighboring region. From the reaction function of the pollution abatement level of the firm in region i , r i S ( θ i , θ j ) / θ i < 0 , r j S ( θ i , θ j ) / θ i < 0 , and the net emissions E i S ( θ i , θ j ) / θ i > 0 and E j S ( θ i , θ j ) / θ i = 0 can be obtained. This means that a stricter emission standard in region i will increase the level of pollution abatement of the firm in region i , and will also increase the pollution abatement level of the firm in the neighboring region; however, a stricter emission standard in region i will have no effect on the net emissions in the neighboring region j .
In the first stage, the government in each region selects the standard that maximizes the discounted present value of social welfare. Thus, the objective function and the constraint condition under the emission standards policy can be expressed as follows:
max θ m = 0 e ρ t { CS m ( t ) + π m ( t ) d m x ( t ) } dt ,
s . t .    x ˙ ( t ) = q i ( t ) + q j ( t ) r i ( t ) r j ( t ) δ x ( t ) ,   x ( 0 ) = x 0 ,   x ( t ) 0 ,
where the parameter ρ ( 0 < ρ < 1 ) is the discount rate. The problem can be solved by using the Hamilton–Jacobi–Bellman (HJB) equation [29]. By substituting Equations (6) and (7) into the government’s objective function, and via symmetry ( d i = d j ,   γ i = γ j ), the HJB equations of region i and region j are obtained, respectively, as follows:
ρ V i S ( x ) = max θ i { ( 4 a + γ θ i + γ θ j ) 2 4 ( 3 + γ ) 2 ( 2 a ( 1 + γ ) + γ θ i ( 2 + γ ) γ θ j ) ( 2 a ( 1 + γ ) + γ θ i + γ θ j ) ( 3 + γ ) ( 3 + 4 γ + γ 2 )        γ ( 2 a ( 1 + γ ) + ( 3 + 2 γ ) θ i + γ θ j ) 2 2 ( 3 + 4 γ + γ 2 ) 2   d x ( t ) + V i S ( x ) ( θ i + θ j δ x ( t ) ) }
ρ V j S ( x ) = max θ j { ( 4 a + γ θ i + γ θ j ) 2 4 ( 3 + γ ) 2 ( 2 a ( 1 + γ ) + γ θ j ( 2 + γ ) γ θ i ) ( 2 a ( 1 + γ ) + γ θ i + γ θ j ) ( 3 + γ ) ( 3 + 4 γ + γ 2 )          γ ( 2 a ( 1 + γ ) + ( 3 + 2 γ ) θ j + γ θ i ) 2 2 ( 3 + 4 γ + γ 2 ) 2   d x ( t ) + V j S ( x ) ( θ i + θ j δ x ( t ) ) }
where V i S ( x ) and V j S ( x ) are the value functions of region i and region j , respectively. Following the solutions provided by Dockner et al. [29], the feedback Nash equilibrium solutions of the optimal standards θ i and θ j , the total pollution stock trajectory of both regions x S ( t ) , and the optimal social welfare of both regions V i S ( x ) and V j S ( x ) can be obtained, as given by Proposition 1.
Proposition 1.
The feedback Nash equilibrium solutions of the optimal standards, the total pollution stock trajectory of both regions, and the optimal social welfare of each region are given, respectively, by the following.
θ i = θ j = 2 a γ ( 5 + 5 γ + γ 2 ) ( ρ + δ ) d >( 1 + γ >) >( 3 + γ >) 2 >( 9 + 10 γ + 2 γ 2 >) >( ρ + δ >)
x S ( t ) = ( x 0 x S S ) e δ t + x S S
V i S ( x ) = V j S ( x ) = d 2 ( 3 + γ ) 2 ( 27 + 56 γ + 35 γ 2 + 6 γ 3 ) 2 γ ρ ( 9 + 10 γ + 2 γ 2 ) 2 ( ρ + δ ) 2 4 a d γ ( 81 + 175 γ + 131 γ 2 + 39 γ 3 + 4 γ 4 ) γ ρ ( 9 + 10 γ + 2 γ 2 ) 2 ( ρ + δ ) 2   + 4 a 2 γ ( 36 + 80 γ + 62 γ 2 + 19 γ 3 + 2 γ 4 ) γ ρ ( 9 + 10 γ + 2 γ 2 ) 2 ( ρ + δ ) 2 d ρ + δ x S
The proof for Proposition 1 is presented in the Appendix A.
In Equations (10)–(12), x S S = 2 ( 2 a γ ( 5 + 5 γ + γ 2 ) ( ρ + δ ) d ( 1 + γ ) ( 3 + γ ) 2 ) γ δ ( 9 + 10 γ + 2 γ 2 ) ( ρ + δ ) represents the steady state of the total pollution stock in the two regions, and x 0 is the initial condition of the pollution stock. When x 0 < x S S , x S / t > 0 , it means that if the initial pollution stock of the regions is less than that of the steady state, the evolution of the pollution stock is an accumulative process; when x 0 > x S S , x S / t < 0 , the evolution of the pollution stock is a dissipative process; when x 0 = x S S , x S / t = 0 , the pollution stock is constant.
Under the government’s optimal emission standards policy, q i S ,    q j S ,    r i S ,    r j S are, respectively, the equilibrium outputs and emission reductions of the firms in the two regions under the optimal emission standards, which are defined, respectively, as follows.
q i S = q j S = ( 3 + 4 γ + γ 2 ) ( 2 a ( ρ + δ ) d ) ( 9 + 10 γ + 2 γ 2 ) ( ρ + δ )
r i S = r j S = 3 d ( 3 + 4 γ + γ 2 ) 2 a γ ( 2 + γ ) ( ρ + δ ) γ ( 9 + 10 γ + 2 γ 2 ) ( ρ + δ )

4.2. Emission Taxes (Scenario T)

Similar to the description of Section 4.1, in the third stage of emission taxes, the firms in the two regions choose their outputs to maximize their respective payoffs. Given the emission tax rate τ m ( t ) , the profit function of the firm in each region is as follows.
max q m ,   r m   π m ( t ) = ( α + α q i ( t ) q j ( t ) ) q m ( t ) c q m ( t ) γ m 2 r m ( t ) 2 τ m ( t ) ( q m ( t ) r m ( t ) )
  π m ( t ) is concave in q m ( t ) because 2 π m ( t ) / q m 2 ( t ) = 2 < 0 ,   m = i , j . Given the emission tax rate, the first-order conditions give the following equilibrium output level of each firm and the total outputs:
q i T ( τ i ,   τ j ) = 2 a 2 τ i + τ j 3 ,    q j T ( τ i ,   τ j ) = 2 a 2 τ j + τ i 3
where a = α c . In the second stage, firms choose their abatement level to maximize their respective payoffs in Equation (15).   π m ( t ) is concave in r m ( t ) because 2 π m ( t ) / r m 2 ( t ) = γ m < 0 ,   m = i , j . The first-order conditions give the following equilibrium pollution abatement levels as a function of the tax.
r i T ( τ i ,   τ j ) = τ i γ i ,    r j T ( τ i ,   τ j ) = τ j γ j
The condition to ensure that the results are non-negative is a > max { ( 2 τ i τ j ) / 2 ,   ( 2 τ j τ i ) / 2 } . Based on Equation (15), q i T ( τ i ,   τ j ) / τ i < 0 , q j T ( τ i ,   τ j ) / τ i > 0 , q i T ( τ i ,   τ j ) / τ j > 0 , and q j T ( τ i ,   τ j ) / τ j < 0 . This means that the increase in the emission tax rate in the local region (i.e., region i ) reduces the output of the firm in the local region, whereas it increases the output of the firm in the neighboring region (i.e., region j ), i.e., the rent-shifting phenomenon. Regarding the abatement levels of the firms, r i ( τ i ,   τ j ) / τ i > 0 ,   r j ( τ i ,   τ j ) / τ j > 0 , the emission abatement levels of the firm in region i increase with the increase in τ i , but the firm in region j is not affected by the increase in τ i . Regarding the net emissions of each region, E i T ( τ i ,   τ j ) / τ i < 0 , E j T ( τ i ,   τ j ) / τ i > 0 , E i T ( τ i ,   τ j ) / τ j > 0 , and E j T ( τ i ,   τ j ) / τ j < 0 ; this indicates that with the increase in the emission tax rate imposed by the government of the local region (region i ), the net emissions of the local region (i.e., region i ) decrease, while those of the neighboring region (i.e., region j ) increase. This can be interpreted as the “policy leakage” of the environmental policy, which is different from that under the emission standards policy. Under the emission standards policy, the strict environmental policy in the local region (region i ) has no impact on the net emissions of the firm in the neighboring region. Based on the response function of each firm, the government maximizes social welfare subject to the dynamics of x . Analogously to the emission standards policy game, the value functions of region i and region j are denoted, respectively, as V i T ( x ) and V j T ( x ) . The HJB equations then become the following.
ρ V i T ( x ) = max τ i { ( 4 a + τ i + τ j ) 2 36 + 9 τ i 2 + 2 γ ( 2 τ i 2 τ i ( 2 a + τ j ) + ( 2 a + τ j ) 2 ) 18 γ i d x ( t )   + V i T ( x ) ( 1 3 ( 4 a τ i τ j ) τ i + τ j γ δ x ( t ) ) }
ρ V j T ( x ) = max τ j { ( 4 a + τ i + τ j ) 2 36 + 9 τ j 2 + 2 γ j ( 4 a 2 + τ i 2 + τ i ( 4 a τ j ) 2 a τ j 2 τ j 2 ) 18 γ j   d j x ( t ) + V j T ( x ) ( 1 3 ( 4 a τ i τ j ) τ i + τ j γ i δ x ( t ) ) }
The feedback Nash equilibrium solutions of the optimal emission taxes of each region τ i and τ j , the total pollution stock trajectory of both regions x T ( t ) , and the optimal social welfare of each region V i T ( x ) and V j T ( x ) can therefore be obtained, as given by Proposition 2.
Proposition 2.
The feedback Nash equilibrium solutions of the optimal emission taxes, the total pollution stock trajectory of both regions, and the optimal social welfare of each region are given, respectively, by
τ i = τ j = 3 d ( 3 + γ ) 4 a γ ( ρ + δ ) ( 9 + 4 γ ) ( ρ + δ )
x T ( t ) = ( x 0 x T S ) e δ t + x T S
V i T = V j T = d ρ + δ x T + d 2 ( 3 + γ ) 2 ( 27 + 14 γ ) 4 a d γ ( 81 + 73 γ + 16 γ 2 ) ( ρ + δ ) 2 γ ( 9 + 4 γ ) 2 ρ ( ρ + δ ) 2   + 16 a 2 γ ( 9 + 8 γ + 2 γ 2 ) ( ρ + δ ) 2 2 γ ρ ( 9 + 4 γ ) 2 ( ρ + δ ) 2
where x T S = 2 ( 2 a γ ( 5 + 2 γ ) ( ρ + δ ) d ( 3 + γ ) 2 ) γ δ ( 9 + 4 γ ) ( ρ + δ ) represents the steady state of the total pollution stock in the two regions.
The proof is presented in the Appendix B.
Moreover, q i T ,   q j T ,   r i T ,   r j T are the equilibrium outputs and emission reductions, respectively, of the firms in two regions under the optimal emission taxes, and are defined, respectively, as follows.
q i T = q j T = 2 a ( 3 + 2 γ ) ( ρ + δ ) d ( 3 + γ ) ( 9 + 4 γ ) ( ρ + δ )
r i T = r j T = 3 d ( 3 + γ ) 4 a γ ( ρ + δ ) ( 9 + 4 γ ) ( ρ + δ )

4.3. Tradable Emission Permits (Scenario P)

In this scenario, E ¯ i ( t ) and E ¯ j ( t ) are the emission quotas of the firms assigned by the government in each region at time t . Given the emission quotas, the governments also allow the firms to trade emission permits in the emission trading market. The emission trading market is assumed to be a centralized competitive market in which emission trading occurs via the market clearing price. Thus, if the net demand of the firm in region m is defined as D m = q m r m E ¯ m , the total net demand of emission permits is zero at the market equilibrium D i + D j = 0 . This also implies that both firms do not have market power in the emission market, and they therefore behave as price takers in the centralized competitive emission trading market in which firms trade permits at the realization of the market clearing price [19].
In the third stage, the firm in each region chooses its outputs to maximize the following function.
max q m ,   r m   π m ( t ) = ( α + α q i ( t ) q j ( t ) ) q m ( t ) c q m 1 2 γ m r m ( t ) 2 λ m ( q m ( t ) r m ( t ) E ¯ m ( t ) )
  π m ( t ) is concave in q m ( t ) because 2 π m ( t ) / q m 2 ( t ) = 2 < 0 ,   m = i , j . The equilibrium output as the function of the emission permit price λ m is obtained as follows:
q i P ( λ i , λ j ) = 1 3 ( 2 a 2 λ i + λ j ) ,   q j P ( λ i , λ j ) = 1 3 ( 2 a + λ i 2 λ j ) ,
where a = α c . Subscript T is employed to denote the equilibrium under the tradable permits policy. In the second stage, firms choose their abatement levels to maximize their payoffs.   π m ( t ) is concave in r m ( t ) because 2 π m ( t ) / r m 2 ( t ) = γ m < 0 . The optimal abatement levels can be determined as follows.
r i P ( λ i ) = λ i γ i ,   r j P ( λ j ) = λ j γ j
To ensure that the results are non-negative, a > max { ( 2 λ i λ j ) / 2 ,   ( 2 λ j λ i ) / 2 } . The emission trading market is assumed to be a centralized competitive market in which emission trading occurs via the market clearing price, and where E ¯ i + E ¯ j = 1 3 ( 2 a 2 λ i + λ j ) λ i γ i + 1 3 ( 2 a 2 λ j + λ i ) λ j γ j , λ i = λ j = λ . The following can then be obtained.
λ = γ i γ j ( 4 a 3 E ¯ i 3 E ¯ j ) 3 γ i + 3 γ j + 2 γ i γ j
By substituting Equation (28) into Equations (26) and (27), the outputs and emission abatement levels are obtained as functions of the emission quotas, as follows.
q i P ( E ¯ i , E ¯ j ) = 2 a γ j + γ i ( 2 a + E ¯ i γ j + E ¯ j γ j ) 3 γ i + 3 γ j + 2 γ i γ j ,   q j P ( E ¯ i , E ¯ j ) = 2 a γ j + γ i ( 2 a + E ¯ i γ j + E ¯ j γ j ) 3 γ i + 3 γ j + 2 γ i γ j
r i P ( E ¯ i , E ¯ j ) = ( 4 a 3 E ¯ i 3 E ¯ j ) γ j 3 γ i + 3 γ j + 2 γ i γ j ,   r j P ( E ¯ i , E ¯ j ) = ( 4 a 3 E ¯ i 3 E ¯ j ) γ i 3 γ i + 3 γ j + 2 γ i γ j
q i P ( E ¯ i , E ¯ j ) / E ¯ i > 0 , q j P ( E ¯ i , E ¯ j ) / E ¯ i > 0 , q i P ( E ¯ i , E ¯ j ) / E ¯ j   > 0 and q j P ( E ¯ i , E ¯ j ) / E ¯ j > 0 ; these relations state that when the local government reduces the initial emission quota assigned to the local firm (strict policy), the output of the firm will decrease, and the output of the firm in the neighboring region will also decrease accordingly. This means that there is no rent-shifting effect under the emission permit trading policy. Regarding the abatement levels of the firms, r i P ( E ¯ i ,   E ¯ j ) / E ¯ i < 0 , r j P ( E ¯ i ,   E ¯ j ) / E ¯ i < 0 and r i P ( E ¯ i ,   E ¯ j ) / E ¯ j < 0 , r j P ( E ¯ i ,   E ¯ j ) / E ¯ j < 0 . It can also be demonstrated that E i P ( E ¯ i ,   E ¯ j ) / E ¯ i > 0 , E i P ( E ¯ i ,   E ¯ j ) / E ¯ j > 0 , E j P ( E ¯ i ,   E ¯ j ) / E ¯ j > 0 , and E j P ( E ¯ i ,   E ¯ j ) / E ¯ i > 0 . These relations state that the reduction in the initial emission permit quota in the local region will reduce the net emissions of firms in both regions, and there is no policy leakage. Based on the response function of each firm, the government maximizes social welfare subject to the dynamics of x . The HJB equations of the government in each region can then be obtained as follows.
ρ V i P ( x ) = max e i { ( 4 a + γ E ¯ i + γ E ¯ j ) 2 4 ( 3 + γ ) 2 + 16 a 2 ( 2 + γ ) γ E ¯ i 2 ( 27 + 10 γ ) 8 a γ E ¯ j 8 ( 3 + γ ) 2 +   γ ( 9 + 2 γ ) E ¯ j 2 + 2 γ E ¯ i i ( 4 a ( 5 + 2 γ ) ( 9 + 4 γ ) E ¯ j ) 8 ( 3 + γ ) 2 d x ( t ) + V i P ( x ) ( E ¯ i + E ¯ j δ x ( t ) ) }
ρ V j P ( x ) = max e j { ( 4 a + γ E ¯ i + γ E ¯ j ) 2 4 ( 3 + γ ) 2 + 16 a 2 ( 2 + γ ) + γ E ¯ i 2 ( 9 + 2 γ ) + 8 a γ E ¯ j ( 5 + 2 γ ) 8 ( 3 + γ ) 2 γ ( 27 + 10 γ ) E ¯ j 2 2 γ E ¯ i ( 4 a E ¯ j ( 9 + 4 γ ) ) 8 ( 3 + γ ) 2 d x ( t ) + V j P ( x ) ( E ¯ i + E ¯ j δ x ( t ) ) }
Similar to the previous analysis, the feedback Nash equilibrium solutions of the optimal initial permit quotas E ¯ i and E ¯ j , the total pollution stock trajectory of both regions x P ( t ) , and the optimal social welfare of each region V i P ( x ) and V j P ( x ) can be obtained, as given by Proposition 3.
Proposition 3.
The feedback Nash equilibrium solutions of the optimal emission permit quotas, the total pollution stock trajectory of both regions, and the optimal social welfare of each region are given, respectively, by
E ¯ i = E ¯ j = 2 ( a γ ( 7 + 2 γ ) ( ρ + δ ) d ( 3 + γ ) 2 ) γ ( 18 + 5 γ ) ( ρ + δ )
x P ( t ) = ( x 0 x P S ) e ρ t + x P S
V i T = V j T = 2 d 2 ( 3 + γ ) 2 ( 27 + 8 γ ) 4 a d γ ( 135 + 77 γ + 11 γ 2 ) ( ρ + δ ) γ ρ ( 18 + 5 γ ) 2 ( ρ + δ ) 2 +   2 a 2 γ ( 144 + 83 γ + 12 γ 2 ) γ ρ ( 18 + 5 γ ) 2 d ρ + δ x P
where x P S = 4 ( a γ ( 7 + 2 γ ) ( ρ + δ ) d ( 3 + γ ) 2 ) γ δ ( 18 + 5 γ ) ( ρ + δ ) expresses the steady state of the total pollution stock in the two regions.
The proof is provided in Appendix C.
Moreover, q i P ,   q j P ,   r i P ,   r j P are the equilibrium outputs and abatement levels of the firms, respectively, in the two regions under the emission permit trading policy.
q i P = q j P = 2 ( 3 + γ ) ( 2 a ( ρ + δ ) d ) ( 18 + 5 γ ) ( ρ + δ )
r i P = r j P = 2 ( 3 d ( 3 + γ ) a γ ( ρ + δ ) ) γ ( 18 + 5 γ ) ( ρ + δ )

4.4. Comparing Emission Standards, Taxes, and Permit Trading

The feedback Nash equilibrium results of the total pollution stock and the social welfare of each region under the three environmental policies were obtained. In this section, the different effects of the three policies comprising pollution stock are compared in terms of the environmental quality and social welfare of each region.
(1)
Difference in the trajectory and steady state of the pollution stock
From Propositions 1–3, the different results of the optimal trajectories of the pollution stock under the three environmental policies can be derived as follows.
x T x S = 2 ( 1 e δ t ) ( 3 + γ ) ( ( 9 + 9 γ + 2 γ 2 ) + 2 a γ ( ρ + δ ) ) ( 9 + 4 γ ) ( 9 + 10 γ + 2 γ 2 ) ( ρ + δ ) > 0
x T x P = 2 ( 3 + γ ) ( 1 e δ t ) ( 3 d ( 3 + γ ) + 2 a ( 9 + 2 γ ) ( ρ + δ ) ) ( 9 + 4 γ ) ( 18 + 5 γ ) ( ρ + δ ) > 0
x S x P = 2 ( 3 + γ ) 3 ( 1 e δ t ) ( 2 a ( ρ + δ ) d ) ( 18 + 5 γ ) ( 9 + 10 γ + 2 γ 2 ) ( ρ + δ ) > 0
x T S x S S = 2 ( 3 + γ ) ( ( 9 + 9 γ + 2 γ 2 ) + 2 a γ ( ρ + δ ) ) δ ( 9 + 4 γ ) ( 9 + 10 γ + 2 γ 2 ) ( ρ + δ ) > 0
x T S x P S = 2 ( 3 + γ ) [ 3 d ( 3 + γ ) + 2 a ( 9 + 2 γ ) ( ρ + δ ) ] δ ( 9 + 4 γ ) ( 18 + 5 γ ) ( ρ + δ ) > 0
x S S x P S = 2 ( 3 + γ ) 3 [ 2 a ( ρ + δ ) d ] δ ( 18 + 5 γ ) ( 9 + 10 γ + 2 γ 2 ) ( ρ + δ ) > 0
From Equation (35), 2 a ( ρ + δ ) d > 0 was determined. Moreover, regarding the environmental decay rate, 0 < δ < 1 and 1 e δ t > 0 , based on which the following proposition is obtained.
Proposition 4.
The equilibrium trajectory and steady state of the pollution stock under the emission tax policy are larger than those under the emission standards and emission permit trading policies. Furthermore, the equilibrium trajectory and steady state of the pollution stock are the lowest under the emission permit trading policy, i.e., x T > x S > x P and x T S > x S S > x P S .
Proposition 4 demonstrates that when the government in each region implements environmental policy under the condition of transboundary pollution, the total pollution stock of the two regions under the emission tax policy is the highest from the perspective of the environmental effect. Furthermore, among the three policies, the total pollution stock under the emission permit trading policy is the lowest. Our conclusion is partially consistent with the views of Ulph [26] and Yanase [44]. For example, Ulph considered the choice of environmental policy instruments (taxes or standards) in the context of a model of strategic international trade between countries. The results showed that if trade is modeled as a Stackelberg game, then both countries have higher producer surpluses under the emission standard policy. Yanase [44] considered the market structure of enterprises in two countries competing in a third country, and he analyzed the impacts of tax and standard policies adopted by the governments of the two countries on the total pollution stock. The results revealed that the emission standards policy is better than the emission tax policy in environmental quality. However, these scholars did not participate in the analysis of emission permit trading policy, and did not compare the comprehensive impact on the environment and social welfare. In addition, they also did not analyze the deep-seated reasons for the above results. Our conclusion will fill these research gaps.
The reasons for this can be explained as follows. As presented in Section 4.1, Section 4.2 and Section 4.3, under the emission tax policy, when the government of the local region (i.e., region i ) increases the emission tax rate (the policy tends to be strict), the firm in the local region will decrease its outputs, whereas the firm in the neighboring region (i.e., region j ) will increase its outputs accordingly; this is the rent-shifting effect. Moreover, the increase in the emission tax rate by the local government will decrease the net emissions of the local region, but will lead to an increase in the net emissions in the neighboring region, resulting in policy leakage. This means that when the government attempts to improve the environmental quality by increasing the emission tax rate, “free-riding” behavior will occur in the neighboring region. For these reasons, when determining the emission tax rate, the government in each region will predict and consider the free-riding behavior of the other government. Therefore, the emission tax rates of both sides will be lower than the optimal level, which will lead to increases in the net emissions and pollution stock. Some of the literatures on environmental economics have revealed that local governments tend to lower the standards of environmental regulation to attract scarce working capital and enterprises to enter the local area, resulting in inferior competition between governments and intensifying environmental pollution [51,52].
Similarly, when the emission standard level in the local region (region i ) is more restrictive (i.e., a decrease in emission standards), the firm in the local region will decrease its outputs, but the firm in the neighboring region will increase its outputs. However, as compared with the emission tax policy, the rent-shifting effect is relatively weak. The reason for this is that the reduction in emission standards in the local region increases the emission abatement levels of not only the local firm, but also those of the firm in the neighboring region. Therefore, a portion of the increasing emissions in the neighboring region is offset, and no policy leakage will occur. Taking into account these reactions, the rent-shifting effect will be stronger under the emission tax policy by relaxing the environmental policy. In other words, compared with the emission standards policy, the environmental policy of the local government under the emission tax policy is more relaxed, thereby leading to a larger pollution stock.
Finally, in terms of the emission permit trading policy, the reduction in the initial emission quota of the firm in the local region (i.e., region i ) will reduce not only the output of the local firm, but also that of the firm in the neighboring region (i.e., region j ). Consequently, the rent-shifting effect does not exist. Furthermore, the reduction in the emission quota in the local region will lead to the increase in the emission abatement levels in both regions. Therefore, the reduction in the emission quota in the local region will increase the net emissions of the firms in both the local and neighboring regions, and no policy leakage problem will exist. There will also be no free-riding, which would cause the governments of the two regions to relax their environmental policies. According to the preceding analysis, among the three policies, the pollution stock under the emission permit trading policy will be the lowest, and the environmental performance will be the best.
(2)
Difference in social welfare under three policies
From Propositions 1–3, the following different results in the optimal welfare under the three policies ( V i S ( x ) = V j S ( x ) = V S ( x ) , V i T ( x ) = V j T ( x ) = V T ( x ) , and V i P ( x ) = V j P ( x ) = V P ( x ) ) can be obtained.
V S ( x ) V T ( x ) = ( 54 + 51 γ + 10 γ 2 ) [ d ( 9 + 9 γ + 2 γ 2 ) + 2 a γ ( ρ + δ ) ] 2 2 ρ ( 9 + 4 γ ) 2 ( 9 + 10 γ + 2 γ 2 ) 2 ( ρ + δ ) 2   d ρ + δ ( x S x T ) > 0
From Equation (37), x S x T < 0 can be determined; then, V S ( x ) V T ( x ) > 0 .
V P ( x ) V S ( x ) = ( 2 a ( ρ + δ ) d ) ( 3 + γ ) 3 2 ρ ( ρ + δ ) 2 ( 18 + 5 γ ) 2 ( 2 γ 2 + 10 γ + 9 ) 2 [ d ( 324 + 441 γ + 177 γ 2 + 22 γ 3 )   2 a γ ( 5 γ + 2 γ 2 ) ( ρ + δ ) + 18 a γ ( ρ + δ ) ] d ρ + δ ( x P x S )
From Equation (36), 3 d ( 3 + γ ) 4 a γ ( ρ + δ ) > 0 . For γ > 0 , via the multiplication of both sides of the inequality by 5 γ + 2 γ 2 2 , the following inequality can be obtained.
3 2 d ( 3 + γ ) ( 5 γ + 2 γ 2 ) > 2 a γ ( ρ + δ ) ( 5 γ + 2 γ 2 )
From Equation (45), the following results can be obtained.
d ( 324 + 441 γ + 177 γ 2 + 22 γ 3 ) 2 a γ ( 5 γ + 2 γ 2 ) ( ρ + δ ) > 0
Finally, from Equations (39) and (35), x P x S < 0 and 2 a ( ρ + δ ) d > 0 . Moreover, the result of Equation (44) is V P ( x ) V S ( x ) > 0 . Then, from V S ( x ) V T ( x ) > 0 and V P ( x ) V S ( x ) > 0 , Proposition 5 on the difference in social welfare can be obtained as follows.
Proposition 5.
When the government in each region implements an environmental policy under transboundary pollution, the social welfare of each region under the emission tax policy will be less than that under the emission standards policy, V S ( x ) V T ( x ) > 0 . Moreover, the social welfare of each region under the emission standards policy will be less than that under the emission permit trading policy, V P ( x ) V S ( x ) > 0 . In other words, the social welfare under the emission permit trading policy is the highest, that under the emission standards policy is the second highest, and that under the emission tax policy is the lowest.
Proposition 5 reveals that, from the perspective of the social welfare of each region, the emission tax policy is the worst, while the emission permit trading policy is the best. According to the previous hypothesis, the social welfare for region i or region j is the sum of the consumer surplus ( C S m ) and the profit of the firm in each region minus environmental damage. It is known that the consumer surplus is the same under the three environmental policies because the market price of commodities is the same. Therefore, the social welfare of each region primarily depends on the differences in the profits of the firm and the environmental damage suffered by each region. Due to the rent-shifting and policy-leakage effects, the environmental policy under the emission tax policy is looser than that under the emission standards policy; furthermore, due to the policy-leakage effect, the environmental policy under the emission standards policy is looser than that under the emission permit trading policy. This means that, compared with the other two policies, the firm under the emission tax policy has the highest output and the lowest emission abatement level (the abatement cost is the lowest). However, the response of the firm in each region to the policy of the neighboring government must be considered.
For the local region (region i ), when the government adopts the loose policy, the output q i T increases and the abatement level r i T decreases with the decrease in the emission tax rate τ i . Similarly, the output q i S or q i P increases and the abatement level r i S or r i P decreases with the increase in the emission standard θ i or emission quota E ¯ i . However, when considering the policy of the neighboring region, the output q i T decreases with the decrease in the emission tax rate τ j , but there is no impact on the abatement level r i T . Moreover, output q i S decreases and r i S decreases with the increase in the emission standard θ j of region j , and the output q i P increases and the abatement level r i P decreases with the increase in the initial quota E ¯ j of region j . This demonstrates that a decline in the emission tax rate in the neighboring region lowers the output of the firm in the local region, and for the emission standards policy, an increase in the emission standard level in the neighboring region lowers not only the output of the local firm, but also its abatement level, which reduces the abatement cost of the local firm. Finally, an increase in the emission quota in the neighboring region increases the output of the local firm and simultaneously lowers its abatement level. The preceding analysis demonstrates that the difference between the output and abatement level (abatement cost) under the emission permit trading policy is the greatest, that under the emission standards policy is the second greatest, and that under the emission tax policy is the least.
According to Proposition 4, the total pollution stock in the two regions is the highest under the emission tax policy, and is the lowest under the emission trading policy (the environmental damage suffered by each region is the lowest). Therefore, Proposition 5 can be interpreted from the preceding analysis.

5. Analysis of the Equilibrium Results

5.1. Equilibrium Trajectories

In this section, the equilibrium results of each region under the three policies and under different initial pollution stocks are analyzed. Reference is made to several previous studies [6,53], and representative parameter settings are considered as a basic example to illustrate the findings. The basic parameter settings are given as follows: the market size a = 60 , the abatement of the firm in each region γ i = γ j = 1 , the pollution damage suffered by each region d i = d j = 4 , the discount rate ρ = 0.05 , and the environmental decay rate δ = 0.1 .
Figure 1 presents the equilibrium trajectories of the pollution stock when the total initial pollution stock x 0 in the environment is 800 and 100, respectively.
Figure 1 presents the numerical results of the three scenarios over time. The total pollution stock of the two regions under the emission tax policy was found to be the highest among the three policies. Via the comparison of the emission standard and emission permit trading policies, it was found that the total pollution stock under the emission standards policy is higher than that under the emission permit trading policy. Moreover, when the initial pollution stock of the two regions is higher than the pollution stock under the steady state, the trajectories first descend and then converge to stationary levels over time. On the contrary, if the initial pollution stock of the two regions is less than the pollution stock under the steady state, the trajectories first ascend and then converge to stationary levels over time. Figure 2 exhibits the revenue trajectories of each region under the three policies when the initial pollution stock x 0 is 800 and 100. The emission permit trading policy was found to always be more beneficial than the emission standards policy, and both were found to be more beneficial than the emission tax policy. Thus, in terms of both social welfare and environmental quality, the emission permit trading policy is always better than the emission standard and emission tax policies.

5.2. Steady-State Equilibrium Results

Section 5.1 shows that the Markov-perfect Nash equilibrium of the regions converge to the stationary levels over time. In this section, we examine the impact of changes in environmental damage coefficients, market size and natural decay rate on total pollution stock, and social welfare in each region in a steady-state situation. Figure 3a shows that in the steady state, the total pollution stock of the two regions decreases as the environment damage parameter of region i increases. Figure 3b,c shows the impact of changes in the damage parameter of region i on the social welfare of each region. An increase in the damage parameter of region i will increase the damage by pollution stock, thereby reducing the output of the firm in the local region. For region j , however, the social welfare increases as the damage parameter of region i increases because the total pollution stock is reduced due to the reduction in the output of the firm in region i .
Figure 4 shows the impact of changes in market size a i on the total pollution stock and social welfare. It can be seen from Figure 4a that with the expansion of the market size of region i , the total pollution stock of two regions increases due to the increases in the output of the firm and the total pollution stock. Although the total pollution stock of the two regions has increased, the increase in producer surplus in region i causes social welfare to increase with the expansion of the market size a i . Compared with region i , due to the increase in pollution stock, region j suffers more damage without increasing producer surplus, meaning that its social welfare will be reduced.
Finally, the influence of the natural decay rate on the steady-state equilibrium results for the two regions is assessed. Figure 5a shows that the total pollution stock of the two regions decreases with the increase in δ , and as the natural decay rate increases, the difference between the three environmental policies becomes smaller and smaller. Figure 5b also shows that when the natural decay rate gradually increases, the social welfare of the two regions under the three environmental policies becomes higher and higher and the welfare difference becomes smaller and smaller.

6. Conclusions and Applications

6.1. Conclusions

The literature on environmental policy usually concentrates on a single region or country. However, a substantial amount of environmental pollution in the world is caused not only by firms in a domestic region, but also by firms in neighboring regions. Therefore, governments will also be affected by neighboring regions when creating environmental policies. In this paper, a transboundary pollution problem was investigated in which there is a Stackelberg game between firms and their local governments, and the government in each region chooses one of three different environmental policies, namely emission standards, emission taxes, and emission permit trading. The feedback Nash equilibrium solutions in the three scenarios were derived, and the three environmental policies were compared from the perspectives of environmental quality and social welfare. The results revealed that: (1) when the emission abatement cost coefficient of the firms in the two regions is the same, the two regions suffer the same environmental damage. Due to the rent-shifting, policy-leakage, and free-riding effects, the total pollution stock was found to be the highest and the social welfare of each region was found to be the lowest under the emission tax policy. The emission standards policy was found to be better than the emission tax policy due to the presence of the rent-shifting effect but the absence of the policy-leakage effect. Finally, the pollution stock of the two regions was found to be the lowest and the social welfare was found to be the highest under the emission permit trading policy, which does not have any of the three effects; (2) the dynamic trajectories of the equilibrium results and the sensitivity analysis of the parameters in the steady state are discussed numerically, and the basic results reveal that when the initial pollution stock of the total regions is higher (lower) than the pollution stock in the steady state, the trajectory first decreases (rises) and converges to the steady level with the passage of time. In addition, the trajectory of the social welfare curve is similar to that of the pollution stock. With the initial pollution stock higher or lower than that the steady state, the social welfare of each region will increase or decrease to the steady-state level accordingly. (3) We analyzed the impact of changes in parameters such as the environmental damage coefficient, market size, and natural decay rate on the total pollution stock and social welfare in each region under steady-state conditions.
In order to fully recognize the significance of the environmental policy under transboundary pollution and improve the total environmental quality, the government should mainly focus on the following four aspects: first, for some countries or regions with serious cross-border pollution, the implementation of regional emissions trading policies may be a better environmental governance model. For example, in order to alleviate greenhouse gas emissions, the EU implemented the European Union Emissions Trading System (EUETS), which has a significant inhibitory effect on the emissions of carbon dioxide and other air pollutants in the European Union [50]. Second, although the research conclusion of this paper states that the emission tax has the worst effect on environment and welfare, for some developing countries such as China or India, the level of marketization is relatively low, so it can still be combined with the emission tax policy and the emission standard policy to control the pollution.

6.2. Limitations and Prospects

The present work discussed the differential game problem of transboundary pollution. However, the difference in the market structure was not considered, which may also yield interesting and comparable results. For example, it would be interesting to know how the environmental policy would change if the market structure was different. In addition, the results are based on the traditional set-up, such as the costate equation for pollution stock and the pollutants damage. Particularly in recent years, a single source of emissions is typically comprised of multiple pollutants which also lead to regional and global negative externalities in reality. Finally, this paper assumes that the government utilizes a grandfathering approach for the initial allocation of costless permits to the firms. More and more governments, however, are now using auctions to deal with the initial allocation of emission rights [54,55]. These issues represent possible extensions of the present study.

Author Contributions

Conceptualization, H.X. and M.L.; methodology, H.X.; software, H.X.; validation, H.X. and M.L.; formal analysis, M.L.; resources, H.X.; writing—original draft preparation, H.X.; writing—review and editing, M.L.; visualization, H.X.; supervision, M.L.; project administration, H.X.; funding acquisition, H.X. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by [National Natural Science Foundation of China] (Grant No. 71571149).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors express their gratitude to the anonymous reviewers and editors for their helpful comments about this paper.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. For Proposition 1

The first-order condition of θ i , θ j from Equations (7) and (8) are as follows:
θ i = V i S ( 1 + γ ) ( 3 + γ ) 2 + 2 a γ ( 5 + 5 γ + γ 2 ) γ ( 9 + 10 γ + 2 γ 2 )
θ j = V j S ( 1 + γ ) ( 3 + γ ) 2 + 2 a γ ( 5 + 5 γ + γ 2 ) γ ( 9 + 10 γ + 2 γ 2 )
From symmetry we have V i S ( x ) = V j S ( x ) = V S ( x ) , substituting (A1) and (A2) into (7) and (8), we then attain the simplified HJB equation:
ρ V S ( x ) = ( 2 a + V S ( x ) ) 2 ( 1 + γ ) 2 ( 3 + γ ) 2 ( 9 + 10 γ + 2 γ 2 ) 2 ( V S ( x ) ) 2 ( 9 + 4 γ ) ( 3 + 4 γ + γ 2 ) 2 2 γ ( 9 + 10 γ + 2 γ 2 ) 2   + 4 a γ V S ( x ) ( 3 + γ ) 2 ( 3 + 5 γ + 2 γ 2 ) 4 a 2 γ ( 18 + 32 γ + 18 γ 2 + 3 γ 3 ) 2 γ ( 9 + 10 γ + 2 γ 2 ) 2 d x ( t )   + V S ( x ) ( V S ( x ) ( 1 + γ ) ( 3 + γ ) 2 + 2 a γ ( 5 + 5 γ + γ 2 ) γ ( 9 + 10 γ + 2 γ 2 ) δ x ( t ) )
where V S ( x ) = V i S ( x ) = V j S ( x ) , from Equation (3), we guess the form of the value function V S ( x ) to be linear in x , that is,
V i S ( x ) = V j S ( x ) = V S ( x ) = A S x + B S
where A S and B S are constant coefficients. Then, we obtain these constant coefficients to verify that our guess is correct, that means the exact expressions of the value functions are linear. Differentiating Equation (A4) with respect to x and substituting into (A3), we can derive A S and B S as follows:
A S = d ρ + δ
B S = d 2 ( 3 + γ ) 2 ( 27 + 56 γ + 35 γ 2 + 6 γ 3 ) 2 γ ( 9 + 10 γ + 2 γ 2 ) 2 ρ ( ρ + δ ) 2 4 a d γ ( 81 + 175 γ + 131 γ 2 + 39 γ 3 + 4 γ 4 ) 2 γ ( 9 + 10 γ + 2 γ 2 ) 2 ρ ( ρ + δ )   + 4 a 2 γ ( 36 + 80 γ + 62 γ 2 + 19 γ 3 + 2 γ 4 ) 2 γ ( 9 + 10 γ + 2 γ 2 ) 2 ρ
Substituting (A5) (A6) into V S ( x ) and from (A1) and (A2), we can determine the feedback Nash equilibrium standards θ i , θ j . By substituting θ i and θ j into Equations (5) and (6), we have the optimal output and emission reduction in the firm in each region as follows:
q i S = q j S = ( 3 + 4 γ + γ 2 ) ( 2 a ( ρ + δ ) d ) ( 9 + 10 γ + 2 γ 2 ) ( ρ + δ )
r i S = r j S = 3 d ( 3 + 4 γ + γ 2 ) 2 a γ ( 2 + γ ) ( ρ + δ ) γ ( 9 + 10 γ + 2 γ 2 ) ( ρ + δ )
Substituting Equations (A7) and (A8) into (3) and the steady state of the pollution stock faced by two regions, denoted by x S S , is determined by setting x ˙ S ( t ) = 0 :
x S S = 2 ( 2 a γ ( 5 + 5 γ + γ 2 ) ( ρ + δ ) d ( 1 + γ ) ( 3 + γ ) 2 ) γ δ ( 9 + 10 γ + 2 γ 2 ) ( ρ + δ )
Then, solving the ordinary differential equation, we calculate the following optimal trajectory of the pollution stock:
x S ( t ) = ( x 0 x S S ) e ρ t + x S S

Appendix B. For Proposition 2

Differentiating the right-hand sides of Equations (17) and (18) with respect to τ i , τ j , we can attain the equilibrium emission tax rate as follows:
τ i = ( 18 + 7 γ ) V i T ( x ) + γ ( 8 a + V j T ( x ) ) 18 + 8 γ
τ j = ( 18 + 7 γ ) V j T ( x ) + γ ( 8 a + V i T ( x ) ) 18 + 8 γ
By substituting (A9) and (A10) into (17) and (18), and by symmetry V i T ( x ) = V j T ( x ) = V T ( x ) , we have
ρ V T ( x ) = 12 a V T ( x ) ( 3 + γ ) + ( V T ( x ) ) 2 ( 3 + γ ) 2 ( 9 + 2 γ ) 16 a 2 λ ( 9 + 8 γ + 2 γ 2 ) 2 γ ( 9 + 4 γ ) 2   d x ( t ) + V T ( x ) ( 2 V ( x ) ( 3 + γ ) 2 + 4 a γ ( 5 + 2 γ ) γ ( 9 + 4 γ ) δ x ( t ) )
From (A11), we conjecture that the structure of (A11) can be regarded as linear value function:
V i T ( x ) = V j T ( x ) = V T ( x ) = A T x + B T
Substituting V T ( x ) and V T ( x ) into (A11), we can determine the coefficients A T and B T as follows:
A T = d ρ + δ
B T = d 2 ( 3 + γ ) 2 ( 27 + 14 γ ) 4 a d γ ( 81 + 73 γ + 16 γ 2 ) ( ρ + δ ) + 16 a 2 γ ( 9 + 8 γ + 2 γ 2 ) ( ρ + δ ) 2 2 γ ( 9 + 4 γ ) 2 ρ ( ρ + δ ) 2
Substituting (A13) and (A14) into V T ( x ) and referring to (A9) and (A10), we obtain the equilibrium emission taxes τ i , τ j . By substituting τ i and τ j into Equations (15) and (16), we attain the optimal output and emission abatement level of the firm in each region as follows:
q i T = q j T = 2 a ( 3 + 2 γ ) ( ρ + δ ) d ( 3 + γ ) ( 9 + 4 γ ) ( ρ + δ )
r i T = r j T = 3 d ( 3 + γ ) 4 a γ ( ρ + δ ) ( 9 + 4 γ ) ( ρ + δ )
We can attain the steady state of the pollution stock by substituting Equations (A15) and (A16) into (3), and by setting x ˙ T ( t ) = 0 :
x T S = 2 ( 2 a γ ( 5 + 2 γ ) ( ρ + δ ) d ( 3 + γ ) 2 ) γ δ ( 9 + 4 γ ) ( ρ + δ )
Then, solving the ordinary differential equation, we calculate the following optimal trajectory of the pollution stock:
x T ( t ) = ( x 0 x T S ) e ρ t + x T S

Appendix C. For Proposition 3

From the first-order condition of Equations (30) and (31) we obtain the optimal emission quota:
E ¯ i = V i P ( x ) + γ ( 4 a + γ E ¯ i + γ E ¯ j ) 2 ( 3 + γ ) 2 + 2 γ ( 27 + 10 γ ) E ¯ i + 2 γ ( 4 a ( 5 + 2 γ ) ( 9 + 4 γ ) E ¯ j ) 8 ( 3 + γ ) 2
E ¯ j = V j P ( x ) + γ ( 4 a + γ E ¯ i + γ E ¯ j ) 2 ( 3 + γ ) 2 + 2 γ ( 27 + 10 γ ) E ¯ j + 2 γ ( 4 a ( 5 + 2 γ ) ( 9 + 4 γ ) E ¯ i ) 8 ( 3 + γ ) 2
By symmetry V i P ( x ) = V j P ( x ) = V P ( x ) , we have
E ¯ i = E ¯ j = 2 ( V P ( x ) ( 3 + γ ) 2 + a γ ( 7 + 2 γ ) ) γ ( 18 + 5 γ )
Substituting (A19) into HJB equation and we obtain
ρ V P ( x ) = 4 ( 2 a + V P ( x ) ) 2 ( 3 + γ ) 2 ( 18 + 5 γ ) 2 2 [ 6 a γ V P ( x ) ( 3 + γ ) 2 + ( V P ( x ) ) 2 ( 3 + γ ) 2 ( 9 + 4 γ ) ] γ ( 18 + 5 γ ) 2 a 2 γ ( 72 + 35 γ + 4 γ 2 ) γ ( 18 + 5 γ ) 2 d x ( t ) + V P ( x ) [ 4 ( V P ( x ) ( 3 + γ ) 2 + a γ ( 7 + 2 γ ) ) γ ( 18 + 5 γ ) δ x ( t ) ]
We can also conjecture that the value function of region i or region j is linear in x , that is,
V i P ( x ) = V j P ( x ) = V P ( x ) = A P x + B P
Similar to the aforementioned analysis, we can determine A P and B P by substituting (A21) and V P ( x ) into (A20), as follows,
A P = d ρ + δ
B P = 2 d 2 ( 3 + γ ) 2 ( 27 + 8 γ ) 4 a d γ ( 135 + 77 γ + 11 γ 2 ) ( ρ + δ ) γ ( 18 + 5 γ ) 2 ρ ( ρ + δ ) 2 + 2 a 2 γ ( 144 + 83 γ + 12 γ 2 ) γ ( 18 + 5 γ ) 2 ρ
Then, substituting (A22) and (A23) into V P ( x ) , and referring to (A19), we can derive the equilibrium emission quotas, outputs and abatement levels:
q i P = q j P = 2 ( 3 + γ ) ( 2 a ( ρ + δ ) d ) ( 18 + 5 γ ) ( ρ + δ )
r i P = r j P = 2 ( 3 d ( 3 + γ ) a γ ( ρ + δ ) ) γ ( 18 + 5 γ ) ( ρ + δ )
Substituting Equations (A24) and (A25) into (3), and by setting x ˙ P ( t ) = 0 , the steady state of the pollution stock under permits trading policy can be obtained:
x P S = 4 ( a γ ( 7 + 2 γ ) ( ρ + δ ) d ( 3 + γ ) 2 ) γ δ ( 18 + 5 γ ) ( ρ + δ )
Then, solving the ordinary differential equation, we calculate the optimal trajectory of the pollution stock as in Proposition 3.

References

  1. Wendling, Z.A.; Emerson, J.W.; Esty, D.C.; Levy, M.A.; de Sherbinin, A. The 2018 Environmental Performance Index Report; Yale Center for Environmental Law & Policy: New Haven, CT, USA, 2018. [Google Scholar]
  2. Weitzman, M.L. Prices vs. quantities. Rev. Econ. Stud. 1974, 41, 477–491. [Google Scholar] [CrossRef]
  3. Newell, R.G.; Pizer, W.A. Regulating stock externalities under uncertainty. J. Environ. Econ. Manag. 2003, 45, 416–432. [Google Scholar] [CrossRef] [Green Version]
  4. Weber, T.A.; Neuhoff, K. Carbon markets and technological innovation. J. Environ. Econ. Manag. 2010, 60, 115–132. [Google Scholar] [CrossRef] [Green Version]
  5. Arguedas, C.; Cabo, F.; Martín-Herrán, G. Optimal pollution standards and non-compliance in a dynamic framework. Environ. Resour. Econ. 2017, 68, 537–567. [Google Scholar] [CrossRef] [Green Version]
  6. Martín-Herrán, G.; Rubio, S.J. Optimal environmental policy for a polluting monopoly with abatement costs: Taxes versus standards. Environ. Model. Assess. 2018, 23, 671–689. [Google Scholar] [CrossRef]
  7. Jørgensen, S.; Martín-Herrán, G.; Zaccour, G. Dynamic games in the economics and management of pollution. Environ. Model. Assess. 2010, 15, 433–467. [Google Scholar] [CrossRef]
  8. Zhang, Q.; Jiang, X.; Tong, D.; Davis, S.J.; Zhao, H.; Geng, G.; Feng, T.; Zheng, B.; Lu, Z.; Streets, D.G.; et al. Transboundary health impacts of transported global air pollution and international trade. Nature 2017, 543, 705–709. [Google Scholar] [CrossRef] [Green Version]
  9. Ambec, S.; Coria, J. Prices vs quantities with multiple pollutants. J. Environ. Econ. Manag. 2013, 66, 123–140. [Google Scholar] [CrossRef] [Green Version]
  10. Requate, T. Pollution control in a cournot duopoly via taxes or permits. J. Econ. 1993, 58, 255–291. [Google Scholar] [CrossRef]
  11. Kato, K. Emission quota versus emission tax in a mixed duopoly. Environ. Econ. Policy Stud. 2011, 13, 43–63. [Google Scholar] [CrossRef]
  12. Wirl, F. Taxes versus permits as incentive for the intertemporal supply of a clean technology by a monopoly. Resour. Energy Econ. 2014, 36, 248–269. [Google Scholar] [CrossRef]
  13. Huang, X.; He, P.; Zhang, W. A cooperative differential game of transboundary industrial pollution between two regions. J. Clean Prod. 2016, 120, 43–52. [Google Scholar] [CrossRef]
  14. Van Long, N. A Survey of Dynamic Games in Economics; World Scientific: Singapore, 2010; pp. 71–104. [Google Scholar]
  15. Benchekroun, H.; Van Long, N. Efficiency inducing taxation for polluting oligopolists. J. Public Econ. 1998, 70, 325–342. [Google Scholar] [CrossRef] [Green Version]
  16. Storrøsten, H.B. Prices versus quantities: Technology choice, uncertainty and welfare. Environ. Resour. Econ. 2014, 59, 275–293. [Google Scholar] [CrossRef]
  17. Masoudi, N.; Zaccour, G. Emissions control policies under uncertainty and rational learning in a linear-state dynamic model. Automatica 2014, 50, 719–726. [Google Scholar] [CrossRef]
  18. Masoudi, N. Environmental policies in the presence of more than one externality and of strategic firms. J. Environ. Plan. Manag. 2022, 65, 168–185. [Google Scholar] [CrossRef]
  19. Garcia, A.; Leal, M.; Lee, S.H. Optimal policy mix in an endogenous timing with a consumer-friendly public firm. Econ. Bull. 2018, 38, 1438–1445. [Google Scholar]
  20. Feenstra, T.; Kort, P.M.; de Zeeuw, A. Environmental policy instruments in an international duopoly with feedback in-vestment strategies. J. Econ. Dyn. Control. 2001, 25, 1665–1687. [Google Scholar] [CrossRef]
  21. Hoel, M.; Karp, L. Taxes versus quotas for a stock pollutant. Resour. Energy Econ. 2002, 24, 367–384. [Google Scholar] [CrossRef] [Green Version]
  22. Lee, S.H.; Park, S.H. Tradable emission permits regulations: The role of product differentiation. Int. J. Bus. Econ. 2005, 4, 249–261. [Google Scholar]
  23. Garcia, A.; Leal, M.; Lee, S.H. Time-inconsistent environmental policies with a consumer-friendly firm: Tradable permits versus emission tax. Int. Rev. Econ. Financ. 2018, 58, 523–537. [Google Scholar] [CrossRef] [Green Version]
  24. Feichtinger, G.; Lambertini, L.; Leitmann, G.; Wrzaczek, S. R&D for green technologies in a dynamic oligopoly: Schumpeter, arrow and inverted-U’s. Eur. J. Oper. Res. 2016, 249, 1131–1138. [Google Scholar]
  25. Moner-Colonques, R.; Rubio, S. The timing of environmental policy in a duopolistic market. Econ. Agrar. Recur. Nat. 2015, 15, 11–40. [Google Scholar] [CrossRef] [Green Version]
  26. Ulph, A. The choice of environmental policy instruments and strategic international trade. In Conflicts and Cooperation in Managing Environmental Resources; Springer: Berlin/Heidelberg, Germany, 1992; pp. 111–132. [Google Scholar]
  27. Lai, Y.; Hu, C. Trade agreements, domestic environmental regulation, and transboundary pollution. Resour. Energy Econ. 2008, 30, 209–228. [Google Scholar] [CrossRef]
  28. Glachant, M.; Ing, J.; Nicolai, J.P. The incentives for north-south transfer of climate-mitigation technologies with trade in polluting goods. Environ. Resour. Econ. 2017, 66, 435–456. [Google Scholar] [CrossRef] [Green Version]
  29. Dockner, E.J.; Jorgensen, S.; Van Long, N.; Sorger, G. Differential Games in Economics and Management Science; Cambridge University Press: Cambridge, UK, 2000; pp. 37–80. [Google Scholar]
  30. Dockner, E.J.; Van Long, N. International pollution control: Cooperative versus noncooperative strategies. J. Environ. Econ. Manag. 1993, 25, 13–29. [Google Scholar] [CrossRef] [Green Version]
  31. Jørgensen, S.; Zaccour, G. Time consistent side payments in a dynamic game of downstream pollution. J. Econ. Dyn. Control. 2001, 25, 1973–1987. [Google Scholar] [CrossRef]
  32. Jørgensen, S.; Martín-Herrán, G.; Zaccour, G. Agreeability and time consistency in linear-state differential games. J. Optim. Theory Appl. 2003, 119, 49–63. [Google Scholar] [CrossRef]
  33. Breton, M.; Sbragia, L.; Zaccour, G. A Dynamic model for international environmental agreements. Environ. Resour. Econ. 2010, 45, 25–48. [Google Scholar] [CrossRef]
  34. Li, S. A Differential game of transboundary industrial pollution with emission permits trading. J. Optim. Theory Appl. 2014, 163, 642–659. [Google Scholar] [CrossRef]
  35. Bertinelli, L.; Camacho, C.; Zou, B. Carbon capture and storage and transboundary pollution: A differential game approach. Eur. J. Oper. Res. 2014, 237, 721–728. [Google Scholar] [CrossRef]
  36. El Ouardighi, F.; Kogan, K.; Gnecco, G.; Sanguineti, M. Transboundary pollution control and environmental absorption efficiency management. Ann. Oper. Res. 2020, 287, 653–681. [Google Scholar] [CrossRef]
  37. Yeung, D.W.; Petrosyan, L.A. A Cooperative stochastic differential game of transboundary industrial pollution. Automatica 2008, 44, 1532–1544. [Google Scholar] [CrossRef]
  38. Xu, H.; Tan, D. Optimal abatement technology licensing in a dynamic transboundary pollution game: Fixed fee versus royalty. Comput. Econ. 2019; in press. [Google Scholar] [CrossRef]
  39. Marsiglio, S.; Masoudi, N. Transboundary pollution control and competitiveness concerns in a two-country differential game. Environ. Model. Assess. 2022, 27, 105–118. [Google Scholar] [CrossRef]
  40. Huiquan, L.; Genlong, G. Dynamic decision of transboundary basin pollution under emission permits and pollution abatement. Physica A 2019, 532, 121869. [Google Scholar]
  41. de Frutos, J.; Gatón, V.; López-Pérez, P.M.; Martín-Herrán, G. Investment in cleaner technologies in a transboundary pollution dynamic game: A numerical investigation. Dyn. Games Appl. 2022; in press. [Google Scholar] [CrossRef]
  42. de Frutos, J.; Martín-Herrán, G. Spatial vs. non-spatial transboundary pollution control in a class of cooperative and non-cooperative dynamic games. Eur. J. Oper. Res. 2019, 276, 379–394. [Google Scholar] [CrossRef] [Green Version]
  43. Yanase, A. Dynamic games of environmental policy in a global economy: Taxes versus quotas. Rev. Int. Econ. 2007, 15, 592–611. [Google Scholar] [CrossRef]
  44. Yanase, A. Global environment and dynamic games of environmental policy in an international duopoly. J. Econ. 2009, 97, 121–140. [Google Scholar] [CrossRef]
  45. Yanase, A.; Kamei, K. Dynamic game of international pollution control with general oligopolistic equilibrium: Neary meets dockner and long. Dyn. Games Appl. 2022; in press. [Google Scholar] [CrossRef]
  46. Menezes, F.M.; Pereira, J. Emissions abatement R&D: Dynamic competition in supply schedules. J. Public. Econ. Theory. 2017, 19, 841–859. [Google Scholar]
  47. Benchekroun, H.; Chaudhuri, A.R. Transboundary pollution and clean technologies. Resour. Energy Econ. 2014, 36, 601–619. [Google Scholar] [CrossRef] [Green Version]
  48. Martín-Herrán, G.; Rubio, S.J. On coincidence of feedback and global stackelberg equilibria in a class of differential games. Eur. J. Oper. Res. 2021, 293, 761–772. [Google Scholar] [CrossRef]
  49. Cheng, K.F.; Tsai, C.S.; Hsu, C.C.; Lin, S.-C.; Tsai, T.-C.; Lee, J.-Y. Emission tax and compensation subsidy with cross-industry pollution. Sustainability 2019, 11, 998. [Google Scholar] [CrossRef] [Green Version]
  50. Helm, D. The european framework for energy and climate policies. Energy Policy 2014, 64, 29–35. [Google Scholar] [CrossRef]
  51. Conrad, K. Taxes and subsidies for pollution-intensive industries as trade policy. J. Environ. Econ. Manag. 1993, 25, 121–135. [Google Scholar] [CrossRef]
  52. Konisky, D.M. Regulatory competition and environmental enforcement: Is there a race to the bottom? Am. J. Polit. Sci. 2007, 51, 853–872. [Google Scholar] [CrossRef]
  53. Lambertini, L.; Poyago-Theotoky, J.; Tampieri, A. Cournot competition and “green” innovation: An inverted-u relationship. Energy Econ. 2017, 68, 116–123. [Google Scholar] [CrossRef] [Green Version]
  54. Burtraw, D.; McCormack, K. Consignment auctions of free emissions allowances. Energy Policy 2017, 107, 337–344. [Google Scholar] [CrossRef]
  55. Khezr, P.; MacKenzie, I.A. Consignment auctions. J. Environ. Econ. Manag. 2018, 87, 42–51. [Google Scholar] [CrossRef] [Green Version]
Figure 1. The equilibrium trajectories of the pollution stock in the three scenarios under the initial. pollution stock x 0 = 800 (left), and x 0 = 100 (right).
Figure 1. The equilibrium trajectories of the pollution stock in the three scenarios under the initial. pollution stock x 0 = 800 (left), and x 0 = 100 (right).
Sustainability 14 09028 g001
Figure 2. The equilibrium trajectories of the revenues in the three scenarios under the initial pollution. stock x 0 = 800 (left), and x 0 = 100 (right).
Figure 2. The equilibrium trajectories of the revenues in the three scenarios under the initial pollution. stock x 0 = 800 (left), and x 0 = 100 (right).
Sustainability 14 09028 g002
Figure 3. Steady-state equilibrium and damage parameter d i ; (a) Steady-state equilibrium pollution stock; (b) Steady-state net revenue of region i ; (c) Steady-state net revenue of region j .
Figure 3. Steady-state equilibrium and damage parameter d i ; (a) Steady-state equilibrium pollution stock; (b) Steady-state net revenue of region i ; (c) Steady-state net revenue of region j .
Sustainability 14 09028 g003
Figure 4. Steady-state equilibrium and the market size a i ; (a) Steady-state equilibrium pollution stock; (b) Steady-state net revenue of region i ; (c) Steady-state net revenue of region j .
Figure 4. Steady-state equilibrium and the market size a i ; (a) Steady-state equilibrium pollution stock; (b) Steady-state net revenue of region i ; (c) Steady-state net revenue of region j .
Sustainability 14 09028 g004
Figure 5. Steady-state equilibrium and the natural decay rate δ ; (a) Steady-state equilibrium pollution stock; (b) Steady-state net revenue of each region.
Figure 5. Steady-state equilibrium and the natural decay rate δ ; (a) Steady-state equilibrium pollution stock; (b) Steady-state net revenue of each region.
Sustainability 14 09028 g005
Table 1. Notations and descriptions.
Table 1. Notations and descriptions.
NotationDescription
α m The positive constant parameter measuring the reservation price, m = i , j
a The market size of each region
χ m The quantity of the product purchased by consumers in each region, m = i , j
γ m The cost coefficient of emission reduction, m = i , j
δ The natural decay rate
ρ The discount rate
d m The damage parameter, m = i , j
x ( 0 ) The initial level of the pollution stock, x ( 0 ) = x 0 > 0
p ( t ) The product price at time t
q m ( t ) The output of the firm at time t ,   m = i , j
r m ( t ) The pollution abatement level at time t ,   m = i , j
x ( t ) The pollution stock in the two regions at time t
π m ( t ) The instantaneous profit of the firm at time t ,   m = i , j
θ m ( t ) The emission standard set by the government in each region at time t ,   m = i , j
τ m ( t ) The emission tax rate at time t ,   m = i , j
E ¯ m ( t ) The emission quota of the firm assigned by the government in each region at time t ,   m = i , j
V m The value function of each region, m = i , j
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Xu, H.; Luo, M. Optimal Environmental Policy in a Dynamic Transboundary Pollution Game: Emission Standards, Taxes, and Permit Trading. Sustainability 2022, 14, 9028. https://doi.org/10.3390/su14159028

AMA Style

Xu H, Luo M. Optimal Environmental Policy in a Dynamic Transboundary Pollution Game: Emission Standards, Taxes, and Permit Trading. Sustainability. 2022; 14(15):9028. https://doi.org/10.3390/su14159028

Chicago/Turabian Style

Xu, Hao, and Ming Luo. 2022. "Optimal Environmental Policy in a Dynamic Transboundary Pollution Game: Emission Standards, Taxes, and Permit Trading" Sustainability 14, no. 15: 9028. https://doi.org/10.3390/su14159028

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop