Next Article in Journal
Optimal Control Applied to Vaccination and Testing Policies for COVID-19
Next Article in Special Issue
Estimation of Initial Stock in Pollution Control Problem
Previous Article in Journal
Mathematical Modeling of Lymph Node Drainage Function by Neural Network
Previous Article in Special Issue
The Cost of Work Discrimination: A Market Capture Differential Game Model
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

On the Estimation of the Initial Stock in the Problem of Resource Extraction

Faculty of Applied Mathematics and Control Processes, St Petersburg State University, University Emb. 7/9, 199034 St. Petersburg, Russia
*
Author to whom correspondence should be addressed.
Mathematics 2021, 9(23), 3099; https://doi.org/10.3390/math9233099
Submission received: 7 November 2021 / Revised: 28 November 2021 / Accepted: 29 November 2021 / Published: 1 December 2021

Abstract

:
We consider a differential game of non-renewable resource extraction, in which the players do not know the precise value of the resource stock and, thus, have to make an estimate. We define the value of information about the initial stock and give recommendations for the choice of the estimate depending on the parameters of the problem. Further, we consider the situation where the players only know the bounds for the stock of the resource and solve the problem of computing the optimal estimate, such that it minimizes the players’ losses in the worst-case scenario. The analysis allows us to give a simple rule for the choice of the optimal estimate of the resource stock.

1. Introduction

In the problems surrounding resource exploitation—accuracy of information about the initial stock of the resource, or its availability, plays a crucial role, as it influences the future profit of the company [1,2]. To characterize the profit that can be gained based on precise information, one may look at the notion of value of information (VoI). Based on this characteristic, the decision maker may make a decision on whether it is profitable to invest in acquiring more precise information or not. An overview of the recent results devoted to VoI can be found in [3,4], while the papers by [5,6] detail the use of value of information in resource extraction applications.
In this paper, we continue the line of research initiated in [7], and consider the problem of determining the value of information in a continuous time differential game of non-renewable resource extraction with n agents (players). Specifically, we compute the value of information on the initial stock of the resource. We first define the value of information for a cooperative scenario and later extend it to the case of the Nash equilibrium solution.
While the computation of the value of information about the initial stock of resource is of great importance, in most situations one does not know how much the initial estimate differs from the exact value. This implies that the exact value of information cannot be computed. In this case, both the initial stock estimate and the value of information are considered within the probabilistic framework, by assuming an a priori probability distribution for the different values of the resource stock. In contrast to that, we present an approach aimed at determining the optimal estimate from purely deterministic considerations.
The rest of the paper is structured as follows. In Section 2, we describe the problem statement, compute the cooperative solution for the considered differential game, and define the normalized value of information. In Section 3, it is shown that, under certain conditions, on the parameters of the system, the previously computed value of information equals the value of information computed for each individual player. Section 4 is devoted to the problem of computing the optimal estimate for the initial resource stock if the only available information is about the lower and upper bounds of the resource stock. Finally, Section 5 concludes the paper.

2. Problem Statement

Below we consider a classical game–the theoretic model of the non-renewable resource extraction. A similar problem was studied by Dockner et al. [8] [Sec. 12.1]. Our problem differs from the one presented in the cited reference in that we consider asymmetric players and finite time-horizon. Moreover, the results related to the computation of the value of information and the optimal estimation of the resource stock are completely new.
Let n players (firms or countries) exploit simultaneously a non-renewable natural resource. We will denote the set of all players by N, i.e., N = { 1 , , n } . Further, x ( t ) denotes the state corresponding to the resource stock at time t available to the extraction. The dynamics of the stock is given by the following differential equation with the initial condition x 0 > 0 :
x ˙ ( t ) = i N k i u i ( t ) , x ( t 0 ) = x 0 .
Here, u i ( t ) denotes the extraction effort of player i at time t, and the coefficient k i is used to convert the ith player’s effort into the extraction intensity.
In compliance with the physical nature of the problem, we require that u i ( t ) 0 and x ( t ) 0 for all t t 0 , and that, if x ( t ) = 0 , then the only feasible rate of extraction is u i ( t ) = 0 for all i = 1 , , n .
Further, it is assumed that the game is played over a finite time interval [ t 0 , T ] , the instantaneous payoff is discounted at a constant rate r > 0 , and the utility function has a constant elasticity of the marginal utility 0 < μ < 1 (see, e.g., Amigues and Moreaux [9]). The latter implies that the marginal utility not only decreases, but its relative change, with respect to the relative change of extraction u, is constant, and is equal to μ . The parameter μ is typically interpreted as the measure of risk-aversion, see, e.g., Arrow [10]. The values of μ within the range ( 0 , 1 ) correspond to the risk averse model of utility.
The objective function of the ith player is defined as
J i ( x 0 , u 1 , , u n ) = t 0 T e r t u i 1 μ ( t ) 1 μ d t .
We consider the situation when the game is played in a cooperative mode and the players cooperate in order to achieve the maximum total payoff:
i N J i ( x 0 , u 1 , , u n ) = i N t 0 T e r t u i 1 μ ( t ) 1 μ d t .
The optimal open-loop cooperative strategies of players u ¯ ( t ) = ( u ¯ 1 ( t ) , , u ¯ n ( t ) ) , t [ t 0 ; T ] are found from solving the optimization problem:
u ¯ ( t ) = arg max u 1 ( t ) , , u n ( t ) i N t 0 T e r t u i 1 μ ( t ) 1 μ d t .
Note that the problem (1), (4) is well defined for μ ( 0 , 1 ) .
The trajectory corresponding to the optimal cooperative strategies u ¯ ( t ) is called the optimal cooperative trajectory and is denoted by x ¯ ( t ) .
Following Pontryagin et al. [11], we define the Hamiltonian function as
H = i N e r t u i 1 μ ( t ) 1 μ ψ ( t ) i N k i u i .
Taking into account that x ( t ) 0 and u ( t ) 0 , the optimal controls u ¯ i are obtained from the first-order optimality conditions H u i = 0 as
u ¯ i ( t ) = e r t ψ ( t ) k i 1 μ .
Solving the respective canonical system with the transversality condition ψ ( T ) x ( T ) = 0 , and noting that the adjoint variable is constant, ψ ( t ) = ψ and, furthermore, has to satisfy ψ > 0 so that the expression for u ¯ i is well-defined, we obtain
ψ = μ K e r t 0 μ e r T μ r x 0 μ ,
where K = i N k i 1 1 μ . The optimal state trajectory is thus
x ¯ ( t ) = x 0 + x 0 e r t μ e r t 0 μ e r t 0 μ e r T μ .
The optimal value of the total payoff then equals
i N J i ( x 0 , u ¯ ) = x 0 1 μ 1 μ K μ r μ e r t 0 μ e r T μ μ .
Suppose that the information about the initial resource stock x 0 is not available to the players and they overestimate the initial value of the resource stock to be equal to x ^ 0 > x 0 . In this case, the controls chosen by the players would have the form
u ^ i ( t ) = r x ^ 0 e r t μ μ K k i 1 μ e r t 0 μ e r T μ ,
and the trajectory corresponding to these controls is
x ^ ( t ) = x 0 + x ^ 0 e r t μ e r t 0 μ e r t 0 μ e r T μ .
Should the players use the controls u ^ i ( t ) , the resource stock will be exhausted by the time t ^ < T , which satisfies
e r t 0 μ e r t ^ μ = x 0 x ^ 0 e r t 0 μ e r T μ .
Taking in account that
J i ( x 0 , u ^ ) = t 0 t ^ e r t u ^ i 1 μ ( t ) 1 μ d t = x ^ 0 1 μ 1 μ μ r μ x 0 e r t 0 μ e r T μ μ k i 1 1 μ x ^ 0 K 1 μ ,
we obtain the following relation:
J i ( x 0 , u ^ ) = x 0 x ^ 0 μ J i ( x 0 , u ¯ ) .
It is obvious that the total payoff obtained under the assumption of exact information will never be less than the payoff obtained for imprecise information. To characterize the impact of the accuracy of the available information on the amount of the obtained payoff, we introduce the notion of value of information (VoI).
Definition 1.
The (normalized) value of information in a cooperative game is defined as
V C = i N J i ( x 0 , u ¯ ) i N J i ( x 0 , u ^ ) i N J i ( x 0 , u ¯ ) ,
where i N J i ( x 0 , u ¯ ) is the total payoff obtained for the exact information about the values of parameters, while i N J i ( x 0 , u ^ ) is the total payoff obtained for an imprecise estimation of the values.
VoI characterizes the maximal amount, the decision maker may wish to pay to achieve the precise information about the value of the parameter of interest. To put it differently, the larger the value of V C is, the bigger the amount that the decision maker looses when using imprecise information. We have chosen to normalize V C , such that it belongs to the interval [ 0 , 1 ] .
Taking into account (6), the value of information about the initial resource stock is computed as
V C = x ^ 0 μ x 0 μ x ^ 0 μ = 1 x 0 x ^ 0 μ .
In the case when the players underestimate the initial stock, i.e., use the value x ^ 0 < x 0 as an initial condition, the ith player’s payoff is
J i ( x 0 , u ^ ) = x ^ 0 1 μ 1 μ K μ r μ e r t 0 μ e r T μ μ = x ^ 0 x 0 1 μ J i ( x 0 , u ¯ ) .
and the value of information is
V C = x 0 1 μ x ^ 0 1 μ x 0 1 μ = 1 x ^ 0 x 0 1 μ .
To illustrate the obtained relations, we write x ^ 0 in terms of x 0 as x ^ 0 = α x 0 . Here, α < 1 corresponds to an underestimation of the resource stock, while α > 1 corresponds to an overestimation thereof. The resulting expression for the value of information thus reads
V = 1 α 1 μ , α < 1 , 0 , α = 1 , 1 α μ , α > 1 .
Figure 1 plots the normalized values of information as a function of α for different values of μ .
One can see that if α = 1 , the decision maker possesses the exact information and is not willing to pay anything to improve this knowledge. However, as the information becomes more imperfect (i.e., α deviates from 1), the value of perfect information grows.
One can easily observe that when μ is small, it is cheaper to overestimate the resource stock, while for larger values of μ , one pays less when underestimating. Let, for instance, μ = 0.25 . Then an estimation of the available stock of resource to be the half of the actual value would cost approximately 2.5 times more than if one assumes that the resource stock is twice the actual value (for α = 0.5 , V C 0.4 , while for α = 2 , V C 0.16 ). If, in turn, the value of μ is close to 1 2 , both overestimation and underestimation result in approximately equivalent losses.

3. Nash Equilibrium

While the notion of VoI was previously introduced for a coalition, one can define value of information for an individual player. In this case, the definition of VoI would be based upon the use of the Nash equilibrium solution. We wish to study the relation between these two notions (cooperative and individual VoI) for the considered problem.
To approach this question, we need to check whether the cooperative solution in this game can be achieved as a Nash equilibrium. It was shown in [8] for an infinite horizon resource extraction game that the answer depends on the values of parameters. Here, we obtain a similar result for the game with finite duration. Similar to the cooperative case, we consider Nash equilibria within the class of open-loop solutions.
Assume that the player i deviates from the optimal cooperative strategy u ¯ i ( t ) . Such behavior can be beneficial only if the resource is exhausted by some time T i < T , which means that the deviating player chooses to extract beyond the allocated limit. To illustrate this, suppose that the game ends at the time T i > T . This implies that the ith player extracted less by the time T, which is clearly not optimal. The case T i = T cannot happen as well, as it would imply that the previously obtained cooperative solution can be improved.
Thus, the player i solves the following optimization problem:
max u i J i ( x 0 , u ¯ 1 , , u ¯ i 1 , u i , u ¯ i + 1 , , u ¯ n ) , x ˙ ( t ) = k i u i ( t ) j i k j u ¯ j ( t ) , x ( t 0 ) = x 0 , x ( T i ) = 0 .
Let, further, u ˜ i denote the solution to (8). To find u ˜ i ( t ) , consider the Hamiltonian function for the player i:
H i = e r t u i 1 μ ( t ) 1 μ ψ i ( t ) ( k i u i ( t ) j i k j u ¯ j ( t ) ) .
Solving the respective canonical system we obtain u ˜ i = u ¯ i d , where
d = K e r t 0 μ e r T μ j i k j μ 1 μ e r t 0 μ e r T i μ k i μ 1 μ e r t 0 μ e r T i μ .
Then the corresponding value of the i player’s payoff function is
J i ( x 0 , u ¯ 1 , , u ¯ i 1 , u ˜ i , u ¯ i + 1 , , u ¯ n ) = e r t 0 μ e r T i μ μ r μ x 0 d K e r t 0 μ e r T μ 1 μ .
The expression (9) attains its maximal value at T i = T if
μ 1 K j i k j μ 1 μ
This yields the following.
Lemma 1.
Let the parameters μ and k i satisfy (10) for each i N . Then the cooperative solution in this game is a Nash equilibrium.
This result implies that value of information computed on the basis of this particular Nash equilibrium solution will coincide with the value of information obtained for the cooperative case: V i N E = V C for all i N .

4. Optimal Estimate for the Initial Resource Stock

Now return to the cooperative game and suppose that the players do not have the information about the exact value of the initial resource stock x 0 , but they have an a priori estimate that x 0 [ x 1 , x 2 ] . Thus the players may decide to agree upon a guess x ^ 0 [ x 1 , x 2 ] that would minimize the possible loss in a worst-case scenario. To do so, one solves the following minimax problem:
min x ^ 0 [ x 1 , x 2 ] max x 0 [ x 1 , x 2 ] i N J i ( x 0 , u ¯ ) i N J i ( x 0 , u ^ ) ,
where x ^ 0 is the guess of the initial condition, and x 0 is the actual value thereof.
First consider the maximization problem. We rewrite it as
max x 0 [ x 1 , x 2 ] i N J i ( x 0 , u ¯ ) i N J i ( x 0 , u ^ ) = max max x 0 [ x 1 , x ^ 0 ] i N J i ( x 0 , u ¯ ) i N J i ( x 0 , u ^ ) , max x 0 [ x ^ 0 , x 2 ] i N J i ( x 0 , u ¯ ) i N J i ( x 0 , u ^ ) .
Taking into account (6) and (7), we obtain
max x 0 [ x 1 , x 2 ] i N J i ( x 0 , u ¯ ) i N J i ( x 0 , u ^ ) = C max ( x 1 1 μ x 1 x ^ 0 μ ) , ( x 2 1 μ x ^ 0 1 μ ) ,
where C = K μ r μ 1 1 μ e r t 0 μ e r T μ μ .
The above optimization problem can be interpreted as a game against nature. For each initial guess x ^ 0 that the players make, nature chooses the value of x 0 that minimizes the total payoff of the players. It turns out that when the admissible range of values for x 0 is [ x 1 , x 2 ] , the best value nature can take would be either x 1 or x 2 , depending on x ^ 0 .
Thus, the best in the sense of loss minimization estimate of the initial condition is given by
x 0 * = arg min x ^ 0 max { ( x 1 1 μ x 1 x ^ 0 μ ) , ( x 2 1 μ x ^ 0 1 μ ) } .
The following theorem provides a characterization of the sought-for solution.
Theorem 1.
The optimal estimate of the initial stock of resource x 0 * that solves (11) is given as the solution of the equation
x 1 1 μ x 1 ( x 0 * ) μ = x 2 1 μ ( x 0 * ) 1 μ .
Proof. 
Note that over the interval [ x 1 , x 2 ] , x 1 1 μ x 1 ( x ^ 0 ) μ is an increasing function of x ^ 0 that changes from 0 to x 1 1 μ x 1 ( x 2 ) μ > 0 , while x 2 1 μ ( x ^ 0 ) 1 μ is a decreasing function of x ^ 0 that changes from x 2 1 μ x 1 1 μ > 0 to 0.
Thus, we conclude that the graphs of these functions intersect on the interval [ x 1 , x 2 ] . Therefore, the solution of problem (11) corresponds to the minimum of the upper envelope of both graphs, which is the point of their intersection. □
When μ = 1 2 , x 0 * can be expressed explicitly as
x 0 * = x 2 x 1 + 5 x 1 + x 2 2 x 1 x 2 2 2 .
To illustrate the formulated theorem, we assume the following values of parameters: x 1 = 100 , x 2 = 200 . Figure 2 demonstrates the graphs of both x 1 1 μ x 1 ( x ^ 0 ) μ and x 2 1 μ ( x ^ 0 ) 1 μ as well as their upper envelope for μ = 1 2 .
Theorem 1 allows us to correct the intuitive rule: if the sought-for value is confined to the interval [ x 1 , x 2 ] , take the value at the center of the interval. In our case, the “center” of the interval is computed as a solution to a nonlinear equation and its location depends on the value of the elasticity of the marginal utility μ . To analyze this dependence, observe that (14) can be expressed through the fractions as follows:
x 2 x 1 1 μ 1 = x 0 * x 1 1 μ x 0 * x 1 μ .
This equation demonstrates that the exact values of x 1 and x 2 are not substantial, but rather their ratio x 2 / x 1 . The same is true for x 0 * : Equation (15) specifies it in relation to x 1 . Figure 3 shows the location of x 0 * relative to x 1 and x 2 for different values of μ and for different ratios x 2 / x 1 .
One can easily observe that the exact value of the ratio x 2 x 1 has no substantial influence on the choice of the estimate x 0 * . For small values of μ , the estimate of the initial stock should be chosen close to the upper bound x 2 , while for the values of μ sufficiently close to 1, one should choose the value of x ^ 0 that is close to x 1 . For μ = 1 / 2 , the estimate should be taken as the center of the interval, ( x 1 + x 2 ) / 2 .
It is instructive to see how the choice of the optimal initial guess affects the value of information. Let x 1 = 100 and x 2 = 200 and assume that the initial guess x ^ 0 = x 0 * is obtained as the solution to (14). In Figure 4, we show the value of information computed for different μ and for different actual values of the resource stock x 0 . The points where the value of information is equal to 0 correspond to the case x 0 = x 0 * , while the largest values of V C are attained at the endpoints of the interval.

5. Conclusions

The carried out analysis can be summarized as follows. If there is no information about the actual resource stock and μ is small, it is safe to overestimate, while for the values of μ closer to 1, one should rather underestimate the initial resource stock. If one has an estimation of the bounds of the resource stock, the optimal estimate can be chosen along a straight line that goes from x 2 at μ 0 to x 1 at μ 1 .
While the advantage of the obtained results consists in their deterministic nature, it would be interesting to establish a connection to the results obtained within a probabilistic framework. This will be the subject of our future research.

Author Contributions

Conceptualization, E.G.; methodology, A.T.; validation, A.T., E.G. and D.G.; investigation, A.T. and D.G.; writing—original draft preparation, A.T.; writing—review and editing, E.G. and D.G.; visualization, A.T. and D.G.; project administration, E.G.; funding acquisition, D.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by RFBR and DFG, project number 21-51-12007.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Sadorsky, P.A. Measuring resource scarcity in non-renewable resource with an application to oil and natural gas in Alberta. Appl. Econ. 1991, 23, 975–984. [Google Scholar] [CrossRef]
  2. Hart, R. Non-renewable resources in the long run. J. Econ. Dyn. Control 2016, 71, 1–20. [Google Scholar] [CrossRef]
  3. Yokota, F.; Thompson, K.M. Value of information analysis in environmental health risk management decisions: Past, present, and future. Risk Anal. Int. J. 2004, 24, 635–650. [Google Scholar] [CrossRef] [PubMed]
  4. Keisler, J.M.; Collier, Z.A.; Chu, E.; Sinatra, N.; Linkov, I. Value of information analysis: The state of application. Environ. Syst. Decis. 2014, 34, 3–23. [Google Scholar] [CrossRef]
  5. Bratvold, R.B.; Bickel, J.E.; Lohne, H.P. Value of Information in the Oil and Gas Industry: Past, Present, and Future. SPE Reserv. Eval. Eng. 2009, 12, 630–638. [Google Scholar] [CrossRef] [Green Version]
  6. Peskova, D.; Sizykh, A.; Rukavishnikov, V. Oil and gas exploration planning using VoI technique. In Proceedings of the IOP Conference Series: Earth and Environmental Science, Tomsk, Russia, 23–27 November 2015; IOP Publishing: Bristol, UK, 2016; Volume 33, p. 012032. [Google Scholar]
  7. Chebotareva, A.; Su, S.; Tretyakova, S.; Gromova, E. On the Value of the Preexisting Knowledge in an Optimal Control of Pollution Emissions. Contrib. Game Theory Manag. 2021, 14, 49–58. [Google Scholar] [CrossRef]
  8. Dockner, E.J.; Jorgensen, S.; Van Long, N.; Sorger, G. Differential Games in Economics and Management Science; Cambridge University Press: Cambridge, UK, 2000. [Google Scholar]
  9. Amigues, J.P.; Moreaux, M. Contrasting how to overcome the resource constraints in the non renewable and renewable economies. Does the shape of the utility function matter? Int. J. Environ. Dev. 2004, 1, 301–319. [Google Scholar]
  10. Arrow, K.J. The theory of risk aversion. In Essays in the Theory of Risk-Bearing; Markham Publishing Company: Chicago, IL, USA, 1971; pp. 90–119. [Google Scholar]
  11. Pontryagin, L.S.; Boltyanskii, V.G.; Gamkrelidze, R.V.; Mishchenko, E.F. Mathematical Theory of Optimal Processes; CRC Press: Boca Raton, FL, USA, 1987. [Google Scholar]
Figure 1. The normalized value of information for different values of μ and different initial guesses.
Figure 1. The normalized value of information for different values of μ and different initial guesses.
Mathematics 09 03099 g001
Figure 2. Illustration of Theorem 1 for x 1 = 100 , x 2 = 200 , and μ = 1 2 .
Figure 2. Illustration of Theorem 1 for x 1 = 100 , x 2 = 200 , and μ = 1 2 .
Mathematics 09 03099 g002
Figure 3. Location of the optimal in the sense of (11) estimate x 0 * as a function of μ . The position of x 0 * is shown relative to the boundary values x 1 and x 2 .
Figure 3. Location of the optimal in the sense of (11) estimate x 0 * as a function of μ . The position of x 0 * is shown relative to the boundary values x 1 and x 2 .
Mathematics 09 03099 g003
Figure 4. Value of information for different x 0 and different μ under the assumption that the initial guess x ^ 0 is chosen optimally in the sense of (11).
Figure 4. Value of information for different x 0 and different μ under the assumption that the initial guess x ^ 0 is chosen optimally in the sense of (11).
Mathematics 09 03099 g004
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Tur, A.; Gromova, E.; Gromov, D. On the Estimation of the Initial Stock in the Problem of Resource Extraction. Mathematics 2021, 9, 3099. https://doi.org/10.3390/math9233099

AMA Style

Tur A, Gromova E, Gromov D. On the Estimation of the Initial Stock in the Problem of Resource Extraction. Mathematics. 2021; 9(23):3099. https://doi.org/10.3390/math9233099

Chicago/Turabian Style

Tur, Anna, Ekaterina Gromova, and Dmitry Gromov. 2021. "On the Estimation of the Initial Stock in the Problem of Resource Extraction" Mathematics 9, no. 23: 3099. https://doi.org/10.3390/math9233099

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop