Abstract
We study the strategic behavior of firms competing in the exploitation of a common-access productive asset, in the presence of pollution externalities. We consider a differential game with two state variables (asset stock and pollution stock), and by using a piecewise-linear approximation of the nonlinear asset growth function, we provide a tractable characterization of the symmetric feedback–Nash equilibrium with asymptotically stable steady state(s). The results show that the firm’s strategy takes three forms depending on the pair of state variables and that different options for the model parameters lead to contrasting outcomes in both the short- and long-run equilibria.
1. Introduction
There is an extensive literature in economics, operations research, game theory, and dynamic optimization dealing with the exploitation of renewable resources (e.g., a fishery or a forest). One main question shared by all parties, namely firms, governments (regulators), and citizens, is how to exploit these resources in a sustainable way. In the vast dynamic games literature, where players (firms and regulators) interact strategically over time, the models have very often focused on the resource itself, without any other considerations. Typically, these models start with a dynamic system describing the evolution of the stock, and next, they characterize the equilibrium strategies under different assumptions related (i) to the information structure (e.g., open-loop, feedback, or closed-loop with or without memory) and (ii) to the players’ behavior (cooperative or noncooperative).
There is ample evidence showing that the evolution of renewable resources depends not only on natural variations and on human intervention (harvesting, deforestation, etc.) but also on the accumulated pollution. An illustrative example is the recent discovery—devastating for biomass—of over 5 trillion pieces of plastic, weighing a total of 250,000 tons, afloat on the Pacific Ocean. More than half of this island of plastic is made of fishing gear, i.e., is the result of resource exploitation [1]. In addition to the adverse effects of accumulated pollution on the resources, the literature documented various direct economic costs borne by firms operating in these industries. For instance, the fishing industry faces costs arising from the need to repair or replace gear that has been damaged or lost because of encounters with abandoned, lost, or otherwise discarded fishing gear (ALDFG) [2,3]. The impacts of ALDFG on fisheries also include hazards to navigation and safety at sea, which increase fuel costs and reduce fishing time, and loss of earnings due to reduced or contaminated catches including ghost fishing [4,5]. It is estimated that the cost of marine debris damage on the fishing industry in 2008 was US$364 million for the 21 Pacific Rim economies [6], and the derelict pot removal program between 2008–2014 in the largest estuary of the United States, the Chesapeake Bay, led to an additional harvest valued at US$21.3 million—a 27% increase above that which would have occurred without removals [7].
Scientists from various disciplines have proposed models to integrate the influence of other (state) variables—e.g., pollution stock, marine environmental quality, and habitat quality—in their analysis (e.g., [8]), omitting, however, the strategic interactions between the various parties involved. In this paper, we consider an oligopoly exploiting a renewable resource (a productive asset) and contribute to the literature by having a more realistic model where strategic behavior and pollution externalities are present. Indeed, strategic thinking has often been ignored in large-scale ecosystem models and in representative-agent frameworks and pollution has been disregarded in games of renewable-resource exploitation.
To represent the habitat’s limited carrying capacity, the rate of growth of the productive asset is typically modeled as a nonlinear, inverted U-shaped function of the asset stock (see, e.g., [9]). The author of [10,11] approximated this nonlinear rate of growth by an inverted-V function, which allowed for tractable characterization of the equilibrium strategies and payoffs. In this work, which is a revised version of [12], we adopt a differential game framework that extends the productive-asset model in [11] by introducing a second state variable, namely pollution stock, while retaining his approximation approach. The game is played by n identical firms competing à la Cournot over an infinite planning horizon. Each player aims to maximize their stream of discounted payoff, taking into account the market structure, the initial size of each stock, and their dynamics. For the present work, we abstract from the effects of pollution on the reproductive capacity of the asset and consider only the direct economic costs of pollution incurred by the firms as negative externality. The problem is technically an infinite horizon linear-quadratic n-player differential game with two state variables and one control variable for each player, which influences the two dynamic processes.
We characterize fully analytically the symmetric feedback–Nash equilibrium, where the firm’s strategy is a piecewise-linear function in the two state variables, for which the shape depends on the position of the game in the state space. Thus, the contribution of this work includes obtaining a closed-form solution in a model with two state variables and a piecewise-linear resource growth function. We show that the state space is divided into three regions, namely scarcity, abundance, and no exploitation. Three equilibrium cases are identified and are shown to depend on the relationship between the asset growth rate, the discount rate, and the pollution decay rate. This equilibrium is sustainable; that is, for any given pair of initial asset–pollution stocks, it converges to a stable steady state with positive exploitation and positive values of state variables. When there are at least two players, the equilibrium includes either one stable or two locally stable steady states; however, when the industry is monopolistic, the optimal solution includes only one stable steady state.
Our work belongs quite naturally to the literature on the exploitation of productive assets and the management of resources under pollution control. Early contributions in this area include [13,14], both of which explored the set of equilibria in a dynamic game framework. The characterization of the equilibrium in [10,11] showed that the firm’s strategy and the value function take piecewise forms, which depends on the asset stock level. There can be a unique or multiple steady states depending on the asset growth rate, and the decision of a single firm to unilaterally decrease its exploitation may result in a decrease in the asset stock. These contributions led to extensive research on various topics related to the strategic exploitation of common access resources that also took into account the nonlinearity of the growth rule. The literature has examined issues such as optimal taxation [15], losses from competition [16], the role of property rights, and convergence to the Cournot equilibrium [17,18]. More recently, [19] showed results on pre-emption, voracity, and exhaustion. The authors of [20] showed that nonlinear feedback strategies are unstable in a dynamic duopoly game with renewable resource exploitation. The effects of mergers were analyzed in [21], the impact of social status concern in these industries were studied in [22], and [23] investigated the incentives in a duopoly by considering a finite planning horizon.
In parallel to this dynamic-games literature on resources, a significant literature has dealt with the strategic behavior of agents under pollution externalities (see the surveys in [24,25]). The pioneering contributions are [26,27], where both noncooperative and cooperative solutions were characterized and contrasted. Many papers have ensued, focusing on issues such as taxation (e.g., [28,29]), sustainability and uncertainty (e.g., [30,31]), international environmental agreements (e.g., [32,33]), and technical change and R&D (e.g., [34]).
Some studies have integrated pollution accumulation in the exploitation of a renewable resource (see, e.g., [35,36,37]). However, these contributions have not accounted for strategic behavior or, when they have, they omitted the feature that the resource growth rate is not linear. In a closely related work to ours, [38] considered a similar model with two state variables, namely biomass stock and pollution stock. They studied a sustainable cooperative agreement in an open-access fishery and characterized the symmetric noncooperative and cooperative solutions. They focused only on an interior solution that includes one part of the solution (which we name the scarcity region). Moreover, they considered the damage of pollution on the resource dynamics together with the damage on welfare, and in the present paper, we study the effect of taking into account the direct economic damage on the firms’ profits. In addition, we characterize the equilibrium for the whole state space and study in detail the boundary cases and the values of parameters that allow for a feedback–Nash equilibrium.
In a nutshell, our paper attempts to integrate within the same framework the exploitation of a productive asset in the presence of pollution externalities and strategic behavior. We approximate the nonlinear growth function by a piecewise linear function, which allows us to have a tractable linear-quadratic dynamic game, for which the equilibria can be characterized analytically.
2. The Model
We consider an n-player infinite-horizon differential game, with the asset stock S and the pollution stock Z being the two state variables. The model extends the framework in [11] by introducing pollution externalities in the firms’ decision-making problem. At each date , n-firms exploit the common-access asset in quantities , and compete à la Cournot. We assume that the exploitation strategy is nonnegative and bounded above, that is, with the positive constant M sufficiently large (see [14,39] that considered similar constraints). We consider the transformation rate of the asset to the final product to be one-to-one and the unit cost of exploitation to be zero for simplicity. The price p is determined by the linear inverse-demand function given by , where is the total quantity of supply and where and .
The growth rate of the productive asset (e.g., a fishery, a forest, etc.) is assumed to be nonlinear in an inverted U–shape in the asset stock. Following the literature, we adopt the following piecewise-linear approximation:
where denotes the intrinsic growth rate of the asset and is the level of asset that leads to the so-called maximum sustainable yield () with being the carrying capacity of the habitat. Note that if , if , and if . Taking into account an asset growth function that includes pollution (i.e., ) would allow to capture the negative impacts of pollution on the asset stock. This is considered in [38], where they present the closed-form solutions for noncooperative and cooperative cases for a part of the interior solution. In this work, we refrain from adding pollution and use function to keep the presentation of results simpler.
Taking into account the firms’ exploitation, the change in the asset stock at date t is governed by the following differential equation:
The firms’ activities generate emissions as a by-product and add up to the pollution stock, which evolves over time as follows:
where denotes the amount of emissions resulting from exploiting a unit of asset, and is the pollution decay rate.
Denote by the (symmetric) damage cost of player i. We suppose that this environmental cost is convex increasing in the pollution stock Z and satisfies the property . For tractability, we adopt the quadratic functional form , where .
Assuming that each player maximizes their discounted stream of profit, the optimization problem of player i is then as follows:
where denotes the common discount rate.
3. The Equilibrium
We consider the equilibrium where firms use feedback information in their decision-making. This is a subgame-perfect equilibrium (also called a Markovian Perfect Nash Equilibrium (MPNE)), in which the firm’s strategy is state dependent and strongly time consistent [40,41]. Denote by the value function of firm i, which is the discounted sum of profits that the firm obtains in the game starting in state . Unless an ambiguity arises, we shall from now on omit the time argument. Introduce the Hamilton–Jacobi–Bellman (HJB) equation associated to firm i’s maximization problem, that is,
for , where the partial derivative represents the shadow price (or value) of the asset stock (also called scarcity rent) and denotes the shadow value of the pollution stock. Taking into account the nonnegativity restriction on in problem (4), maximizing the right-hand side of (5) yields the following condition:
with at least one of the inequalities being equality. The condition (6) must hold together with the terminal condition for every admissible trajectory. The left-hand side of the second inequality in (6) is the marginal revenue of firm i for given quantities of competitors. The right-hand side represents the marginal (opportunity) cost of a unit of exploitation, which comprises the shadow prices of the asset stock and pollution stock. Since we are considering costless exploitation, the opportunity cost consists only of the shadow prices of the state variables. We focus on a symmetric equilibrium in which all firms have the same value function, and they exploit the same quantities ( and for all ). Consequently, (6) becomes
for . Condition (7) results in two possibilities for the equilibrium strategy, that is, or . There may exist different cases where depending on the signs of the partial derivatives of the function . We consider the cases in which the asset has a scarcity rent () or not (), and we consider all possible cases for the effect of pollution on the value of the firm; thus, is free. By using (7), we write these cases in the following definition:
Definition 1.
The three regions are as follows:
- Scarcity region (): with :
- Abundance region (): with :
- No-exploitation region ()::
In region , the pair of state variables is such that it is profitable to exploit the asset, and the asset has a scarcity rent. The firms view the level of the asset stock as scarce, and they consider their impact on the asset stock in their strategy. In region , the asset stock is too high, so having an additional unit in the stock brings no value to firms. Players consider only the pollution externality as an intertemporal effect of exploitation, and the asset stock does not play a role in their decision. In region , the marginal revenue of an initial asset supply (given by the price ) is lower than its marginal cost, which depends on the shadow prices of the asset stock and pollution stock. Hence, exploitation is not dynamically profitable and the equilibrium strategy is to wait for the asset to replenish and for pollution to decline.
Since the function V is not known, it is not clear beforehand whether any of these three cases exist. To proceed with an analytically tractable characterization, we focus on the strategies that are linear functions of the state variables. Furthermore, we introduce the following assumptions:
Assumption 1.
where the terms and are given by
where we have for and .
Assumption 2.
where .
The assumptions above require the intrinsic growth rate of the asset () to be sufficiently high (which is the same condition required in [11]) but also bounded from above with a threshold that depends on the pollution decay rate (k) and the number of firms (n). Another restriction is imposed on the marginal damage parameter , which is required to be sufficiently low. For too high values of , the condition is revised to be more strict for the parameter . We will discuss the roles of these restrictions in more detail in Section 3.1, where we study the properties of the equilibrium. In the following theorem, we characterize the symmetric equilibrium:
Theorem 1.
Suppose that Assumptions 1 and 2 are satisfied.
- (a)
- Equilibrium strategies:The strategy profile for whereconstitutes a symmetric feedback–Nash equilibrium.The terms and , which depend on the exogenous model parameters, are written as follows:andwhere the term λ is given byThe regions are written as follows:
- (b)
- Value functions:The discounted sum of profits obtained by each firm is given by the following value function:for , which is continuously differentiable .The function is written as follows:whereThe function is written as follows:whereThe function is written as follows:withand the function is defined implicitly by the following system of equations:The partitions of region , denoted by and are written as follows:where the curve denotes the boundary between and given bywith the constants and given as follows:
Proof.
The long proof is built throughout the paper, and the details are provided in Appendix A, which has several subsections. The road map to complete the proof of the theorem is as follows:
- In Appendix A.1, we state some preliminaries and introduce the methodological approach.
- In Appendix A.2, we study the case . By guessing a piecewise-quadratic form for the value function and by applying the undetermined coefficient method, we obtain the functions and and the solutions associated with their coefficients. Then, we analyze the boundary cases and their positions in .
- Lemma A1 in Appendix A.2.1 shows that, under Assumptions 1 and 2, the function satisfies the HJB equation and that strategy profile satisfies (7) with and .
- Lemma A2 in Appendix A.2.2 shows that, under Assumptions 1 and 2, the function is continuously differentiable in S and Z and satisfies the HJB equation . The strategy profile satisfies (7) with and .
- Appendix A.3 looks at the case . Lemma A3 obtains the function and shows that it is continuously differentiable in S and , and on the boundary cases of .
□
Theorem 1 characterizes the symmetric equilibrium strategy of firms for any given pair . The strategy is a piecewise-linear function in S and Z with coefficients () and () that correspond to the coefficients of the marginal cost function given in (6) (see (A7) in Appendix A.2 in the Appendix A).
For further analysis, we define the boundary lines between the regions as follows:
Definition 2.
Let denote the boundary line in the plane between the regions and as function of S. We have
- (i)
- The boundary line between and :
- (ii)
- The boundary line between and :
- (iii)
- The boundary line between and :
We now proceed with studying the general properties of the equilibrium in the next subsection.
3.1. The Properties of the Equilibrium
We briefly explain the methods used for obtaining the equilibrium strategies and then investigate their properties. Using a linear-quadratic model with the piecewise-linear approximation of the asset growth function enables us to guess the form of the value function as a polynomial of degree 2 in S and Z within an interior solution. We obtain the six-dimensional equation system associated with the coefficients of and then reduce it into a system of two equations in () given in (A5) and (A6). This system yields four solutions: two include and the other two include . Among all solutions, only one pair makes it possible to characterize a feedback–Nash equilibrium with a stable steady state.
We use the solutions for the strategies and the value function to derive the analytical formulation of the case , which is given in (19). We obtain the linear functions associated with its boundary cases, where and , and then study their positions in by analyzing the signs of () given in Equations (12) and (13), which result in three cases that differ in the relationship among the dynamic model parameters, i.e., . In all cases, and , but the sign of differs, i.e., . Using these results, for , we write the following cases:
Since is the coefficient associated with the level of pollution in the equilibrium strategy in region , this difference leads to contrasting results in the equilibrium responses to pollution in , which will be discussed in detail below.
We analyze the properties of the equilibrium by using the diagram in Figure 1, which shows the shapes and positions of the regions given in Definition 1 in the plane.
Figure 1.
Illustration of the regions and their boundaries in .
- (i)
- There are four regions: the value function takes a different form in each region.
- (ii)
- in , and in .
- (iii)
- in , and in .
- (iv)
- The boundary case given in (37) denotes the threshold where and , beyond which the firms voluntarily cease exploitation and wait for the asset to replenish and pollution to decline. The sign of its slope depends on .
- (v)
- The boundary case given in (38) denotes the threshold where and . It decreases in S in all cases.
- (vi)
- The boundary given in (39), which is a positive constant, denotes the threshold level of pollution, where and . The firms refrain from any exploitation if the level of pollution is above this threshold.
- (vii)
- The boundary cases of intersect with the axis at points and , and their intersection point is denoted by , where the closed-form solutions of , , and are given in (A9), (A10), (35), and (36). When Assumptions 1 and 2 are satisfied, the ordering of these points is given as follows: and , .
The qualitative properties of equilibrium in region remain the same for all values of parameters that satisfy Assumptions 1 and 2. This is because the firm’s strategy and the value function in this region do not depend on the intrinsic growth rate . Within this region, the equilibrium behavior is that of the dynamic oligopoly with pollution externalities, where higher pollution induces the firms to exploit less of the asset (). Since the slope of the boundary in all cases, for a fixed point on this boundary, for lower levels of pollution and if pollution is high.
By contrast, the behavior in region differs depending on . In the following, we show the properties of the equilibrium strategies in each case:
- Case 1:
- , and (Figure 1 and Figure 2a): The equilibrium level of exploitation is faster if the asset stock is larger (), and it is slower if the level of pollution is higher (). Since , for a fixed pair of state variables on this threshold, firms will exploit the resource if pollution is sufficiently low but will not exploit if pollution is too high.
Figure 2. Two cases of region positioning under Assumptions 1 and 2. - Case 2:
- , and : The firm’s strategy includes only the asset stock and is independent of the level of pollution. The boundary case where and becomes a constant .
- Case 3:
- , and (Figure 2b): In this case, the slope of the threshold becomes negative. Firms exploit the asset faster if the level of pollution is higher (). Since , for a fixed a point on this threshold, firms exploit the asset if the level of pollution is high and do not exploit it if pollution is low. The opportunity cost of exploitation given in (6) decreases in pollution.
The results presented above highlight the differences in equilibrium behavior among the three cases. For a relatively slow growing asset, the response to pollution is to decrease exploitation; however, if the asset growth rate is sufficiently large, the equilibrium response to pollution is reversed, i.e., to exploit more under higher pollution.
Since we make an extension of [11], it will be useful to compare and contrast our results. Naturally, our solutions share many similarities with [11]. The three regions and the piecewise-linear strategies we obtain are also found in [11], where they depend only on the level of asset stock. This is indeed the consequence of considering a piecewise-linear growth function. Moreover, the solution we present is equivalent to the one given in [11] in the limit case, where the damage function parameter (see Remark A1 in Appendix A.2 in the Appendix A). Furthermore, as the damage cost parameter increases, the equilibrium behavior is more affected by the level of pollution. Our results show that, if the pollution level is too high, the firms do not exploit the asset for any given level of asset stock. Also, for a given set of other parameters, above a level of given in Assumption 1 ( or depending on , k, and r), we cannot characterize a feedback–Nash equilibrium for the whole state space. Last but not least, the different types of equilibrium responses in cases 1 to 3 appear due to the presence of pollution in the problem.
As mentioned above, when firms do not have information on the pollution externality or when they do not take it into account in their decision-making (), the equilibrium behavior is as given in [11]. One question is whether the firms benefit from taking into account the damage cost () in their profits. When there are direct economic costs due to pollution but the firm disregards it, then the discounted sum of profits the firm obtains will be lower. We elaborate on this by discussing the two following cases: one firm chooses another strategy while all other firms stick with the feedback–Nash strategy with d(Z) in their profit function, and all firms disregard the damage in their profit. For the first case, in which all other firms choose the feedback–Nash strategy ( given in Equation (11)), one firm deviating and choosing another strategy (denote by with ) would yield a lower value, which is immediate because the equilibrium we characterize is subgame-perfect (i.e., , where denotes all other firms’ strategy except firm i). For the second case, let denote the symmetric equilibrium strategy in the game without pollution, which is given in [11]. When all firms disregard the damage in their profit and all choose the strategy , one firm can deviate and make larger profits by taking into account since is not maximizer of the problem with , thus it would not be an equilibrium. For these reasons, it is beneficial for firms to take into account the costs arising from accumulated pollution in their decision-making process.
We can now discuss the roles that Assumptions 1 and 2 play on the characterization of the equilibrium. The first part of Assumption 1, which refers to the case , ensures that the asset quantity . In addition, having the slope of no-exploitation boundary increasing (see Figure 2a) leads to the result that the equilibrium behavior is not to exploit the asset for too small values of asset stock. In the second part of Assumption 1 (), ensuring is not sufficient to characterize a feedback–Nash equilibrium. The reason is that the slope of no-exploitation boundary decreases (see Figure 2b), and if then there may exist equilibrium trajectories leading to asset exhaustion. Hence, we modify the condition to ensure . Then, in both cases, Assumption 1 guarantees that the equilibrium behavior around the asset level is not to exploit the asset, which renders the asset growth to be positive () in the neighborhood of the axis for all levels of pollution, and thus, the asset exhaustion never occurs in the equilibrium. Furthermore, Assumption 2 is to ensure that the value function is continuously differentiable for all , particularly for , where the asset growth function in (2) is not continuously differentiable. Also note that, for a given parameter calibration that satisfies Assumption 1, the carrying capacity () can be set such that Assumption 2 is satisfied as well.
Lastly, we turn to the case where the equilibrium strategy is to not exploit the asset. To characterize the value function in region given in (21), we use the fact that, in this region, for all ; thus, the asset stock grows at rate and the pollution stock declines at rate k without an intervention. At a certain date , depending on the initial state , the pair reaches either one of the boundary cases where firms begin exploitation. The value of the game starting at a point in depends on this boundary point associated with itself, which can be computed. The following steps are taken in Appendix A.3:
- (i)
- We define an implicit function denoted by written in the system of equations in (31), which yields the point (and the date) at which the firms launch their exploitation for .
- (ii)
- Using this function, we obtain the curve denoted by given in (34) associated with the intersection point of the three regions where .
- (iii)
- This curve enables us to partition region into two parts, denoted by and given in (32) and (33), such that the boundary for launching exploitation is known for a given .
Having obtained the boundary point associated with all points in , in Lemma A3, we find that satisfies the HJB equation with and show its continuity. The function does not have an analytical solution for , except in special cases and (see Remark A2 in Appendix A.3 in the Appendix A); nevertheless the characterization of remains tractable. For any other parameter setting, this function has to be computed numerically in order to obtain the value of a point in . In the other partition, , the value function has an analytical form.
3.2. The Equilibrium Dynamics, Steady States, and Stability
In order to analyze the stable steady states of the equilibrium, we derive the set of points such that and , respectively. Then, we obtain the steady states and study their stability by using the methods provided in [42]. The results are shown in the following theorem:
Theorem 2.
Under Assumptions 1 and 2, the steady state(s) may be single or multiple depending on the parameters.
- (i)
- The steady state in region is denoted by and written as follows:The steady state always exists, and it is asymptotically stable for .
- (ii)
- The steady states in region are denoted by and and written as follows:
The existence of steady states and depends on the following condition:
Proof.
See Appendix B. □
Figure 3 illustrates these results for a case in which multiple steady states exist, which also shows the positions of the loci and .
Figure 3.
Sample of the case with multiple steady states.
The cases of unique and multiple steady states are illustrated in the diagrams in Figure 4, which also show the equilibrium trajectories in the plane. These trajectories are obtained by using the differential equation system resulting from replacing in (11) into (2) and (3). For a given initial state, the equilibrium strategy may shift from one to another as the values of the state variables cross the thresholds between the regions.
Figure 4.
Sample equilibria with unique and multiple steady states.
The continuity of on the boundary cases ensures a smooth transition between the regions with continuous ; consequently, the strategies converge to a steady state with for all initial states and constitute a symmetric feedback–Nash equilibrium.
We now investigate how the equilibrium responses to marginal increases in asset and pollution stocks vary with the changes in the parameter values. In the following proposition, we analyze these comparative statics for the number of players (n), the marginal damage coefficient (), and the intrinsic growth rate of the asset .
Proposition 1.
The partial derivatives of coefficients of the equilibrium strategy given in (11) with respect to the selected parameters have the following signs:
Proof.
See Appendix C. □
The results of Proposition 1 can be summarized as follows: the equilibrium response to an increase in asset stock is lower if the number of players is higher, and the response to an increase in pollution stock depends on . Recall that ; then, with a higher number of players, the equilibrium response to higher pollution is to decrease exploitation less for and increase exploitation less for . The results are reversed for the marginal damage parameter. When the growth rate of the asset is higher, the response to an increase in asset stock is higher and the response to an increase in pollution stock is also higher, which is in contrast with the effects of the other parameters.
Another example in which the qualitative properties differ is the case of a monopoly. The following proposition presents the result in this case:
Proposition 2.
Under Assumptions 1 and 2, for a monopoly (), the optimal solution includes only one stable steady state, which is in region .
Proof.
See Appendix D. □
Proposition 2 shows that, in the monopoly case, the steady state lies on the boundary and coincides with the unstable steady state in , i.e., . Hence, for a monopoly, this point is unstable but it is sustainable. Analysis regarding the stability of the other steady state in () remains valid, and thus, is the only steady state that is asymptotically stable. Therefore, for a monopoly, the optimal solution includes two steady states with one of them stable and the other one unstable but sustainable.
4. Concluding Remarks
We characterized the symmetric feedback–Nash equilibrium and showed its existence within a certain range of model parameters given in Assumptions 1 and 2. The equilibrium path always reaches a steady state that is sustainable. For a set of parameters outside this range, there may still exist local equilibria for some levels of asset–pollution stock pairs.
The framework we present includes various simplifications and abstractions, which made the characterization of the equilibrium more conveniently tractable. We introduced the pollution externalities in a simple way in order to guarantee that variations in the equilibrium results between the outcomes with and without pollution externalities could be studied through a single exogenous parameter.
The methodology used to characterize the equilibrium can be applied in problems involving similar features by considering different objective functions and state dynamics. Some examples are the issues relating to the open-access fisheries shared by multiple countries, analysis of cooperation and stability of coalitions, welfare analysis, spillover effects, and the interactions between other possible state variables and the asset stock.
Author Contributions
Conceptualization, N.B.V. and G.Z.; methodology, N.B.V. and G.Z.; formal analysis, N.B.V.; writing—original draft preparation, N.B.V.; writing—review and editing, N.B.V. and G.Z.; visualization, N.B.V.; supervision, G.Z. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by Social Sciences and Humanities Research Council, Canada, grant 435-2013-0532.
Acknowledgments
We wish to thank three anonymous reviewers for their very helpful comments and suggestions which improved the paper.
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A. Proof of Theorem 1
Appendix A.1. Some Preliminaries
A symmetric equilibrium exists if there exists a function that is continuous and continuously differentiable in S and Z, which satisfies the HJB equation in (5), the first-order condition in (7), and the terminal condition for every admissible trajectory (see [40,41]). Due to the nonnegativity restriction on in problem (4), we consider the two following cases:
Moreover, the constraint in (4) guarantees that the asset stock S cannot be negative.
In order to characterize an equilibrium strategy that is defined for the whole state space which converges to a stable steady state with , we look for the parameter constellations under which the following two conditions are satisfied:
- , which is to ensure that asset exhaustion never occurs and there exist steady state(s) with positive asset level.
- , which is to ensure the continuity of on where the asset growth function is not continuously differentiable.
More specifically,
- if is not satisfied, then and then there may exist an initial state such that and ; then, would not be a feedback–Nash equilibrium that includes stable steady state(s) with positive asset levels.
- if is not satisfied, then the discontinuity point given in (2) is included in the case and . For this reason, we cannot find a function that is continuously differentiable in S, and does not satisfy the HJB equation for ; thus, in that case, together with do not fulfill any currently known sufficient condition.
In the following subsections, we use the methods developed in the literature to study the cases given in Definition 1. We focus on linear strategies, obtain the function , and analyze a number of closed-form formulas to identify the restrictions on the model parameters under which the conditions discussed above are satisfied, which allows us to characterize the equilibrium.
Appendix A.2. Case with Positive Exploitation ()
Since the model is linear-quadratic, we make the informed guess that, within an interior solution (), the value function is a polynomial of degree 2 in S and Z. We consider the function given in Equation (23).
The maximized HJB equation is obtained by replacing in (7) into (5). Using results in an equation that is a polynomial of degree 2 in S and Z. We then apply the method of undetermined coefficients (see [41]) by identification, and after simplifications, the system of equations in is written as follows:
Introduce the following changes of variables:
Replacing (A4) into the equations for C and E, the term is written as a function of and , and is given in (14). Furthermore, using the equations for , and F in (A1)–(A3) with (A4) enables us to write the system of equations for and as follows:
Hence, we reduced the six-dimensional equation system in (A1)–(A3) into a system of two equations in and , which contain polynomials of degree 2. Equation (A5) has two solutions for , i.e., . Inserting these values into (A6) yields four solutions to the system in (A5) and (A6):
The solutions given above consist only of the exogenously given model parameters. Therefore, all solutions for can be obtained by replacing (A4) in (A1)–(A3) and by inserting the solutions given in (12)–(16).
In solutions A and , ; thus, they are candidates for the case and . In solutions B and , we have , and they are candidates for the case and . Therefore, we consider solutions A and for the function given in (23), and for solutions B and , where , we define the function denoted by , which is a polynomial of degree 2 in Z, written in (27). It can be verified that both functions and satisfy the HJB equation in (5) for all choices of solutions.
Remark A1.
In the limit case where , the problem reduces to a game with one state variable (S). Solution A reduces to , , and , which results in , and , , . Solution B vanishes with , which leads to the equilibrium outcome of the static Cournot oligopoly, i.e., and . These outcomes are identical to their corresponding terms in the solution provided in [11].
The equilibrium strategy is written by using (7) and (A4) with :
which is linear in S and Z. Note that, by using (A4), the RHS of (6) can be written as ; hence, these terms correspond to the coefficients of the marginal cost function.
In the following sections, we use the functions and to study the two cases where and .
Appendix A.2.1. Case with and
We first obtain the analytical formulation of . For , from (A7), we obtain , and for , we use (A1) to (A4). Since in all cases for , writing both inequalities in S enables us to obtain the following region and its boundary cases:
- (i)
- and for all where is given in (19); the boundary line associated with the first inequality, which is denoted by , is written in (37); and the sign of its slope is given by .
- if , then ;
- if , Equation (37) reduces to ; and
- if , then .
- (ii)
- The set of points such that is defined as follows:which is obtained by using .
- (iii)
- The set of points such that is given by the linear function in S written in Equation (38), and the sign of its slope is given by = . Since = and , we have in all cases where .
We now calculate the points at which the linear functions obtained for the boundary cases intersect with the axis and each other, i.e., , , and . These formulas are first written in terms of , and then, after inserting the solutions in (12)–(16), they are written in closed form, using in (18) to study their signs.
- (i)
- : and where
- (ii)
- : where
- (iii)
- : is written in Equations (35) and (36) and note that .
By using (A9) and (A10), the difference is given by
Thus, for , intersects with the axis at and . By using (A9), we obtain the conditions under which :
where is given in (8).
For , if (A12) and (A13) are satisfied, and for , if
where is given in (9), and note that this condition is more strict than the one in (8) (i.e., ).
Lemma A1.
Proof.
The sign analysis conducted in (A9) to (A15) shows that, if the conditions given in Assumptions 1 and 2 are satisfied, then we have and . We study the slopes of the two boundary lines of for each case. The slope of depends on , and the slope if ; then, the three cases are written as follows:
- Case 1:
- Case 2:
- Case 3:
- : In this case and , and hence, both boundary lines decrease in S. The difference in their slopes is given by . Since for we have and , the slopes compare as follows:thus, is steeper than , and , which can also be seen in (36). If ((A14) and (A15)) and are satisfied, then (see Figure 2b), and the function satisfies the HJB Equation (5) and condition (7) with .
□
Note that, when using (19), we eliminate the possibility of choosing solution A for (where ) and solution for (where and ). In these cases, the inequalities in (19) and (A8) change their directions. In step (A16), we eliminate the possibility of choosing solution for (where and ), as it leads to , which does not allow to characterize a feedback–Nash equilibrium.
Appendix A.2.2. Case with and
We now consider the function given in (27), and by using , which is given by with (19), (28), and (38), we have
- (i)
- and for all , where is given in (20).
- (ii)
Lemma A2.
Proof.
The functions and are polynomials of degree 2 and thus continuously differentiable in S and Z and , respectively. For continuity of on the boundary, as :
where (A17) is obtained by using (where ). By using (24) to (28), it can be verified that
Therefore, for , and thus, is continuous on .
Note that the four solutions to the system in (A5) and (A6) are paired such that (A19) is true for solutions (A and B) and ( and ) while (A19) is not true for ( and B) and (A and ); therefore, in order to have a function that is continuously differentiable on the boundary where , either one of the pair of solutions must be selected.
To constitute a Nash equilibrium, the strategies are required to be defined for the whole state space; hence, we study the case in the next subsection.
Appendix A.3. Case with No Exploitation ()
By combining the results in (39) and (A8), we obtain the set of points such that () given in (21). The value function for this region, denoted by , must satisfy the HJB equation in (5) with for , i.e.,
The above equation is a first-order linear partial differential equation (PDE), and must be continuously differentiable in and on its boundary cases. In order to obtain this function, we begin by deriving the boundary case associated to a given point in .
Under Assumptions 1 and 2, for , such that , where denotes either the boundary between and (i.e., ) or the boundary between and (i.e., ). This point is denoted by and found by solving the following system of equations:
where (A21) is found by using , which implies and . By using (21), there are three cases in which Equation (A22) holds true:
- (i)
- ; then, the point is given by the system of equations written in Equation (31).
- (ii)
- with ; then, the point is given by
- (iii)
- ; then, the point is given by
The system in (31) does not have an analytical solution, and the pair of equations are written as implicit functions denoted by and . For a fixed point on the boundary , , which will be used later. The functions and have to be computed numerically except some special cases given below:
Remark A2.
Since decreases (by (A21)), for where , we have such that . However, for , depending on the position of , may lie on or . In order to precisely determine the boundary associated to every point in , we first obtain the curve associated with the intersection of the boundary cases of (denoted by given in Equations (35) and (36) and recall that ). The set of points such that where is given by the curve denoted by that is written in Equation (34), which is found by solving (A21) with , and note that . By using (34), we define the partitions of region , denoted by and , given in Equations (32) and (33).
Consider a point such that , where and . It satisfies . Denote by such that . By using (A21), we obtain . Then:
- for where , we have with ; hence , , and . Then, , and the point is given by (31).
- for where , we have with ; hence the point is given by (A23).
To summarize, if , then , and if , then .
Since we obtained the boundary case associated with every point in , by applying the method of characteristics (see [43]), we obtain the solution to (A20) given in the following lemma:
Lemma A3.
The value function in is written in Equation (29). The function is continuously differentiable in S and Z and on the boundary cases of .
Proof.
We first consider Equation (A20) such that with a constant , where is an analytical function of S and Z, which is a Cauchy problem. Suppose that a function u, which is parameterized by , is a solution to Equation (A20) such that . Introduce the system of ordinary differential equations (ODEs), which are called the characteristic system for the PDE in (A20):
Solving the last two equations in (A27) with their initial conditions results in
By solving for and , we obtain and that satisfy (A29):
We insert given in (A29) into the first ODE in (A27) and obtain the following equation:
Solving (A31) with its initial condition in (A28) yields
Replacing (A30) into (A32) yields
Hence, we obtained the solution to (A20), which is verified for any analytical function . Further, in order to satisfy the boundary conditions in (A21) and (A22) , we replace and obtain (29). We then write (A20) by using (29), which simplifies to the following equation:
We showed that, for where , we have . For , satisfies the HJB Equation (5) with , i.e.,
and for where , we have . When , satisfies (5) with ,
For both cases, the third term in (A34) is zero and the equation holds true; hence, the function given in (29) satisfies the HJB equation in (5) with and .
For the partial derivatives of with respect to S and Z, by collecting the terms multiplied with and , using (A35) and (A36) allows us to simplify to the following:
The function is continuously differentiable in S and Z, , and , since on this curve, and .
- Continuity of on the boundary cases of :
There are two cases: and with .
- (a)
- On this boundary case, and we have ,Thus, is continuous on . For continuity of its partial derivatives, by using (A37) and (A38), we obtainHence, is continuously differentiable in S and Z on .
- (b)
- with :On this boundary case, we have with . For continuity, we haveThus, is continuous on . For continuity of its partial derivatives, by using (A38), we haveThus, is continuously differentiable on with .Therefore, the function is continuously differentiable in S and Z in and both its boundary cases.
□
Consequently, by using Lemmas A1–A3, under Assumptions 1 and 2, the function given in (22) is continuously differentiable in S and Z and satisfies the HJB equation in (5), and the strategy profile given in (11) satisfies condition (7) for all . The terminal condition is satisfied because and the set of all S is bounded, while the set of all can be divided into two parts: a bounded interval and a set in which every admissible trajectory Z decreases so every admissible trajectory is bounded. Thus, by continuity of , for any given initial condition , the limit of exists and equals 0. Therefore, constitutes a symmetric feedback–Nash equilibrium, which proves Theorem 1.
Appendix B. Proof of Theorem 2
In the following, we derive the set of points such that and , respectively, and then obtain the steady states and study their stability.
We begin with . The locus is given by . By solving for Z, we obtain the following linear function in S:
The locus is given by and defined by the following linear function in S:
The point at which the two loci intersect, i.e., such that and , is denoted by and written in Equations in (40) and (41).
We analyze the stability of the steady states by studying the signs of the determinants and traces of the Jacobian matrix associated with the differential equation system in the neighborhood of the critical points ([42]); see [44] for a similar analysis. The Jacobian matrix associated with the point is given by
Its determinant and trace are written as follows:
It can be verified that and if . This is a weaker condition than the first part of Assumption 1; thus, is a stable steady state if Assumption 1 is satisfied.
We now turn to . The locus is given by the following piecewise-linear function in S:
Since , we have and . Thus, for , the minimum level of Z such that occurs at is given by . The locus is given by , which results in the constant denoted by that is given in (44) (i.e., ). Then, in order to have and exist, must be satisfied, which is written in (45). When condition (45) is true, the intersection points denoted by and are given in Equations (42)–(44).
To check stability, we obtain the Jacobian matrices that are associated with these critical points, written as follows:
For the first matrix, we have
Since , is not a stable steady state. For the second one,
therefore, is a stable steady state.
We can conclude that, under Assumptions 1 and 2, stable steady states are either unique or multiple depending on condition (45). In both cases, converges to a steady state with , .
In the analysis of stability, in step (A46), we eliminate the possibility of using solution for , since replacing with in (A46) leads to , which means that the point is not stable. Thus, combining with the previous results in steps (19), (A16), and (A19), the strategy profile in (11) given by solutions A and B is the only solution that satisfies the sufficient conditions for symmetric feedback-Nash equilibrium.
Appendix C. Proof of Proposition 1
The partial derivatives of the coefficients of equilibrium strategies with respect to the parameters n, , and are written as follows:
Appendix D. Proof of Proposition 2
For , the steady state in , i.e., given in (40) and (41) simplifies to
The point lies on the boundary given in (38), i.e., . Furthermore, it coincides with (given in (44)) and (given in (42)); then, the point coincides with the unstable point in , i.e., for , , and thus, it is an unstable steady state; nevertheless, it is sustainable. The analysis on the stability of remains valid for ; therefore, given in (43) and (44) is the only stable steady state.
References
- Eriksen, M.; Lebreton, L.C.; Carson, H.S.; Thiel, M.; Moore, C.J.; Borerro, J.C.; Reisser, J. Plastic pollution in the world’s oceans: More than 5 trillion plastic pieces weighing over 250,000 tons afloat at sea. PLoS ONE 2014, 9, e111913. [Google Scholar] [CrossRef] [PubMed]
- Newman, S.; Watkins, E.; Farmer, A.; Ten Brink, P.; Schweitzer, J.P. The economics of marine litter. In Marine Anthropogenic Litter; Bergmann, M., Gutow, L., Klages, M., Eds.; Springer: Cham, Switzerland, 2015; pp. 367–394. [Google Scholar]
- Richardson, K.; Hardesty, B.D.; Wilcox, C. Estimates of fishing gear loss rates at a global scale: A literature review and meta-analysis. Fish Fish. 2019, 20, 1218–1231. [Google Scholar] [CrossRef]
- Gilman, E. Status of international monitoring and management of abandoned, lost and discarded fishing gear and ghost fishing. Mar. Policy 2015, 60, 225–239. [Google Scholar] [CrossRef]
- Macfadyen, G.; Huntington, T.; Cappell, R. Abandoned, Lost or Otherwise Discarded Fishing Gear; UNEP Regional Seas Reports and Studies No. 185; FAO Fisheries and Aquaculture Technical Paper No. 523; UNEP/FAO: Rome, Italy, 2009. [Google Scholar]
- McIlgorm, A.; Campbell, H.F.; Rule, M.J. The economic cost and control of marine debris damage in the Asia-Pacific region. Ocean Coast. Manag. 2011, 54, 643–651. [Google Scholar] [CrossRef]
- Scheld, A.M.; Bilkovic, D.M.; Havens, K.J. The dilemma of derelict gear. Sci. Rep. 2016, 6, 19671. [Google Scholar] [CrossRef] [PubMed]
- Ryan, R.W.; Holl, D.S.; Herrera, G.E. Ecosystem externalities in fisheries. Mar. Res. Econ. 2014, 29, 39–53. [Google Scholar] [CrossRef]
- Clark, C.W. Mathematical Bioeconomics: The Optimal Management of Renewable Resources; Wiley: New York, NY, USA, 1990. [Google Scholar]
- Benchekroun, H. Unilateral production restrictions in a dynamic duopoly. J. Econ. Theory 2003, 111, 214–239. [Google Scholar] [CrossRef]
- Benchekroun, H. Comparative dynamics in a productive asset oligopoly. J. Econ. Theory 2008, 138, 237–261. [Google Scholar] [CrossRef]
- Vardar, B.; Zaccour, G. Exploitation of a Productive Asset in the Presence of Strategic Behavior and Pollution Externalities; Les Cahiers du GERAD G-2018-43; HEC Montréal: Montréal, QC, Canada, 2018. [Google Scholar]
- Benhabib, J.; Radner, R. The joint exploitation of a productive asset: A game-theoretic approach. Econ. Theory 1992, 2, 155–190. [Google Scholar] [CrossRef]
- Dockner, E.J.; Sorger, G. Existence and properties of equilibria for a dynamic game on productive assets. J. Econ. Theory 1996, 71, 209–227. [Google Scholar] [CrossRef]
- Kossioris, G.; Plexousakis, M.; Xepapadeas, A.; de Zeeuw, A. On the optimal taxation of common-pool resources. J. Econ. Dyn. Control 2011, 35, 1868–1879. [Google Scholar] [CrossRef][Green Version]
- Fujiwara, K. Losses from competition in a dynamic game model of a renewable resource oligopoly. Res. Energy Econ. 2011, 33, 1–11. [Google Scholar] [CrossRef]
- Colombo, L.; Labrecciosa, P. On the convergence to the Cournot equilibrium in a productive asset oligopoly. J. Math. Econ. 2013, 49, 441–445. [Google Scholar] [CrossRef]
- Colombo, L.; Labrecciosa, P. Oligopoly exploitation of a private property productive asset. J. Econ. Dyn. Control 2013, 37, 838–853. [Google Scholar] [CrossRef]
- Lambertini, L.; Mantovani, A. Feedback equilibria in a dynamic renewable resource oligopoly: Pre-emption, voracity and exhaustion. J. Econ. Dyn. Control 2014, 47, 115–122. [Google Scholar] [CrossRef][Green Version]
- Lambertini, L.; Mantovani, A. On the (in) stability of nonlinear feedback solutions in a dynamic duopoly with renewable resource exploitation. Econ. Lett. 2016, 143, 9–12. [Google Scholar] [CrossRef][Green Version]
- Benchekroun, H.; Gaudet, G. On the effects of mergers on equilibrium outcomes in a common property renewable asset oligopoly. J. Econ. Dyn. Control 2015, 52, 209–223. [Google Scholar] [CrossRef]
- Benchekroun, H.; Long, V.N. Status concern and the exploitation of common pool renewable resources. Ecol. Econ. 2016, 125, 70–82. [Google Scholar] [CrossRef][Green Version]
- Grilli, L.; Bisceglia, M. A duopoly with common renewable resource and incentives. Intl. Game Theory Rev. 2017, 19, 1750018. [Google Scholar] [CrossRef]
- Jørgensen, S.; Martín-Herrán, G.; Zaccour, G. Dynamic games in the economics and management of pollution. Environ. Model. Assess. 2010, 15, 433–467. [Google Scholar] [CrossRef]
- Long, V.N. Dynamic games in the economics of natural resources: A survey. Dyn. Games Appl. 2011, 1, 115–148. [Google Scholar] [CrossRef]
- Van der Ploeg, F.; de Zeeuw, A.J. International aspects of pollution control. Environ. Res. Econ. 1992, 2, 117–139. [Google Scholar] [CrossRef]
- Dockner, E.J.; Long, V.N. International pollution control: Cooperative versus noncooperative strategies. J. Environ. Econ. Manag. 1993, 25, 13–29. [Google Scholar] [CrossRef]
- Benchekroun, H.; Long, V.N. Efficiency inducing taxation for polluting oligopolists. J. Public Econ. 1998, 70, 325–342. [Google Scholar] [CrossRef]
- Rubio, S.J.; Escriche, L. Strategic Pigouvian taxation, stock externalities and polluting non-renewable resources. J. Public Econ. 2001, 79, 297–313. [Google Scholar] [CrossRef]
- Wirl, F. Pigouvian taxation of energy for flow and stock externalities and strategic, noncompetitive energy pricing. J. Environ. Econ. Manag. 1994, 26, 1–18. [Google Scholar] [CrossRef]
- Wirl, F. Tragedy of the commons in a stochastic game of a stock externality. J. Public Econ. Theory 2008, 10, 99–124. [Google Scholar] [CrossRef]
- Germain, M.; Toint, P.; Tulkens, H.; de Zeeuw, A. Transfers to sustain dynamic core-theoretic cooperation in international stock pollutant control. J. Econ. Dyn. Control 2003, 28, 79–99. [Google Scholar] [CrossRef]
- Petrosjan, L.; Zaccour, G. Time-consistent Shapley value allocation of pollution cost reduction. J. Econ. Dyn. Ctrl. 2003, 27, 381–398. [Google Scholar] [CrossRef]
- Xepapadeas, A. Induced technical change and international agreements under greenhouse warming. Res. Energy Econ. 1995, 17, 1–23. [Google Scholar] [CrossRef]
- Tahvonen, O. On the dynamics of renewable resource harvesting and pollution control. Environ. Res. Econ. 1991, 1, 97–117. [Google Scholar]
- Xepapadeas, A. Managing the international commons: Resource use and pollution control. Environ. Res. Econ. 1995, 5, 375–391. [Google Scholar] [CrossRef]
- Wirl, F. Sustainable growth, renewable resources and pollution: Thresholds and cycles. J. Econ. Dyn. Control 2004, 28, 1149–1157. [Google Scholar] [CrossRef]
- Dahmouni, I.; Vardar, B.; Zaccour, G. A fair and time-consistent sharing of the joint exploitation payoff of a fishery. Nat. Res. Model. 2019, 32, e12216. [Google Scholar] [CrossRef]
- Singh, R.; Dwivedi, A.D.; Srivastava, G.; Wiszniewska-Matyszkiel, A.; Cheng, X. A game theoretic analysis of resource mining in blockchain. Clust. Comput. 2020, 23, 2035–2046. [Google Scholar] [CrossRef]
- Dockner, E.J.; Jørgensen, S.; Long, V.N.; Sorger, G. Differential Games in Economics and Management Science; Cambridge University Press: Cambridge, UK, 2000. [Google Scholar]
- Haurie, A.; Krawczyk, J.B.; Zaccour, G. Games and Dynamic Games; World Scientific Books: Singapore, 2012. [Google Scholar]
- Takayama, A. Analytical Methods in Economics; University of Michigan Press: Ann Arbor, MI, USA, 1993. [Google Scholar]
- Melikyan, A. Generalized Characteristics of First Order PDEs: Applications in Optimal Control and Differential Games; Springer Science and Business Media: New York, NY, USA, 2012. [Google Scholar]
- Jun, B.; Vives, X. Strategic incentives in dynamic duopoly. J. Econ. Theory 2004, 116, 249–281. [Google Scholar] [CrossRef]
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).