Non-Renewable Resource Extraction Model with Uncertainties

Ye, Peichen; Tur, Anna; Wu, Yilun

doi:10.3390/g16050052

Open AccessArticle

Non-Renewable Resource Extraction Model with Uncertainties

by

Peichen Ye

^*

,

Anna Tur

and

Yilun Wu

Faculty of Applied Mathematics and Control Processes, St. Petersburg State University, 199034 St. Petersburg, Russia

^*

Author to whom correspondence should be addressed.

Games 2025, 16(5), 52; https://doi.org/10.3390/g16050052

Submission received: 22 June 2025 / Revised: 5 September 2025 / Accepted: 22 September 2025 / Published: 9 October 2025

Download

Browse Figures

Versions Notes

Abstract

This paper delves into a multi-player non-renewable resource extraction differential game model, where the duration of the game is a random variable with a composite distribution function. We first explore the conditions under which the cooperative solution also constitutes a Nash equilibrium, thereby extending the theoretical framework from a fixed duration to the more complex and realistic setting of random duration. Assuming that players are unaware of the switching moment of the distribution function, we derive optimal estimates in both time-dependent and state-dependent cases. The findings contribute to a deeper understanding of strategic decision-making in resource extraction under uncertainty and have implications for various fields where random durations and cooperative strategies are relevant.

Keywords:

differential game; non-renewable resource extraction; random duration; uncertainty; minimax problem

1. Introduction

Non-renewable resource extraction inherently involves strategic conflicts among multiple stakeholders operating under profound uncertainties. As demonstrated in Epaulard (1998), uncertainties regarding resource stock and technological progress profoundly affect extraction paths and decision-makers’ intertemporal decisions, adding more complex dimensions to this field full of strategic conflicts and uncertainties. Differential games, as mathematical frameworks capturing strategic interactions over time, offer powerful tools to model these conflicts Isaacs (1999). Traditional formulations of such games usually assume a fixed game duration (finite time duration) or rely on an infinite time duration. However, real-world scenarios related to non-renewable resource extraction, including equipment failures, policy shifts and environmental disruptions, introduce duration uncertainty. In these cases, the duration of the game cannot be determined a priori and depends on a number of unknown factors. This type of uncertainty gives rise to the fascinating field of differential games with random duration. The random nature of the duration of the game affects the optimal strategies of the players. Understanding this process is critical to developing decision-making mechanisms under uncertainty.

This class of games was first introduced in Petrosjan and Mursov (1966), which studied differential zero-sum games with terminal payoff at a random time horizon. Subsequently, Boukas et al. (1990) conducted a general study on an optimal control problem with random duration. The study of cooperative and non-cooperative differential games with random duration was continued by Shevkoplyas and Petrosyan in Petrosjan and Shevkoplyas (2003) and Shevkoplyas (2014). The form of integral payoff in differential games with random duration was investigated in (E. Gromova & Tur, 2017; Shevkoplyas & Kostyunin, 2013). This class of games was further extended to the case where the distribution function of the random terminal time of the game has a composite form (Gromov & Gromova, 2014, 2017). Specifically, it was assumed that the probability density function of the terminal time may change depending on certain conditions, which can be expressed as a function of time and state. This modification of games can be particularly useful in environmental models due to potential environmental disasters and climate change, as well as in technical models accounting for equipment failures or different modes of technical equipment operation. The study of such models was continued in Zaremba et al. (2020) for discontinuous distributions and in Balas and Tur (2023) for the case of feedback strategies. In parallel, Wu et al. (2023) focuses on sustainable optimal control for a switched pollution-control problem with random duration.

As mentioned above, incidents such as equipment failures may occur during resource extraction, causing switches in the game’s dynamic system. This can lead to changes in aspects of the game, including its payoff structure, state equations, or termination conditions. Stuermer and Schwerhoff (2015) studied how the geological distribution of the non-renewable resource interacts with technological change. The modelling of emission-reduction technology adoption as an endogenous threshold-triggered switch is presented in Parilina et al. (2024). The establishment of political regime switches as drivers of extraction voracity in non-renewable resources is addressed in Van der Ploeg (2024). The synthesis of stochastic equipment failures and periodic purification switching, along with the proof that pollution states converge to unique hybrid limit cycles, is conducted in Wu et al. (2025b). Additionally, the study of an multi-player hybrid pollution-control problem that considers switching behavior and uncertain game duration is reported in Wu et al. (2025a). The empirical validation of phase-specific efficiency switching in R&D competition, as well as the confirmation of the regime-dependent nature of duration effects, is carried out in Huang (2024). While existing studies have advanced the understanding of uncertainty and switching in resource-related games, they often lack a targeted analysis of how such integration operates in multi-player non-renewable resource extraction scenarios, leaving room to explore unaddressed problems like unknown distribution switching moment estimation.

In this paper, we consider a model of non-renewable resource extraction by multiple participants with random duration. The peculiarity of the model under consideration is that the distribution of the random terminal time of the game is composite. The first problem we address is the need to verify the preservation of the property proved in Dockner (2000) for a new formulation of the problem. In Dockner (2000), conditions were obtained under which the cooperative solution is also a Nash equilibrium in a similar problem with fixed duration. Our goal is to obtain similar conditions for a problem with a random duration and composite distribution. Furthermore, under the assumption that the players do not know the moment of switching of the distribution function, we study the problem of obtaining an optimal estimate of this unknown moment, as in Ye et al. (2024). Notably, a distinct model featuring random initial times of player entry is presented in E. V. Gromova and López-Barrientos (2016). While their work focuses on HJB equations and imputation distribution procedures for cooperative solutions under uncertain start times, our model addresses fundamentally different challenges: random terminal times with composite distributions and unknown switching mechanisms. This distinction positions our work as advancing the theoretical frontier in duration uncertainty rather than entry uncertainty, with direct implications for sustainability planning under environmental disruptions. Additionally, we derive optimal estimates in state-dependent cases. By delving into these aspects, we strive not only to enhance the theoretical framework of differential games in non-renewable resource extraction but also to offer practical strategies that can assist industry stakeholders in making more informed decisions. A comparative summary of our work alongside key related works is provided in Table A1 of the Appendix A.

This paper makes the following pivotal contributions:

The construction of optimal cooperative and Nash equilibrium strategies of players in the differential non-renewable resource extraction game with a composite distribution function of the game’s random duration.
The derivation of sufficient conditions under which the cooperative solution constitutes a Nash equilibrium within this model.
The definition of the optimal estimation of unknown parameters in a differential game of non-renewable resource extraction.
The development of a method of constructing the optimal estimation of unknown parameters.
Optimal parameter estimates for both time-dependent and state-dependent cases.

This paper is organized as follows. In Section 2, we present the formulation of the problem. Section 3 proves that, in this model, the cooperative solution is a Nash equilibrium under certain conditions. Assuming that the players do not know the switching moment of the distribution function, we obtain the optimal estimate in the time-dependent case in Section 4 and Section 5. In Section 6 and Section 7, the optimal estimate in state-dependent case is obtained. In Section 8, we present a detailed example related to real-world oil extraction field development. Finally, in Section 9, we present our conclusion.

2. Problem Statement

We first summarize all key parameters of the model, along with their definitions, in Table 1 for reference.

Consider an n-player differential game

Γ (x_{0}, T)

of non-renewable resource extraction. The duration T of the game is a random variable following a ceratin distribution, whose cumulative distribution function is assumed to be an absolutely continuous nondecreasing function. Correspondingly, we adopt two distinct exponential cumulative distribution functions:

1 - e^{- λ_{1} t}

describes the termination probability before the switching moment

t_{1}

, and

1 - e^{- λ_{2} t}

describes that after

t_{1}

. This composite structure captures a system where hazard rates change from

λ_{1} > 0

to

λ_{2} > 0

.

At the switching moment

t_{1}

, the left limit must be equal to the function value:

\lim_{t \to t_{1}^{-}} F (t) = 1 - e^{- λ_{1} t_{1}} .

Assume an exponential structure when

t \geq t_{1}

:

F (t) = 1 - Q \cdot e^{- λ_{2} t} .

According to continuity, we have

\lim_{t \to t_{1}^{-}} F (t) = \lim_{t \to t_{1}^{+}} F (t),

i.e.,

1 - e^{- λ_{1} t_{1}} = 1 - Q \cdot e^{- λ_{2} t_{1}},

we can obtain

Q = e^{(λ_{2} - λ_{1}) t_{1}} .

Therefore, the duration T of the game has a composite cumulative distribution function:

F (t) = \{\begin{matrix} 0, & t \in (- \infty, 0), \\ 1 - e^{- λ_{1} t}, & t \in [0, t_{1}), \\ 1 - e^{- λ_{2} t} e^{t_{1} (λ_{2} - λ_{1})}, & t \in [t_{1}, \infty) . \end{matrix}

(1)

Let

x (t)

denote the state variable representing the resource stock available for extraction at time t. The dynamics of the stock are shown by the following differential equation with the initial condition

x_{0} > 0

:

\dot{x} (t) = - \sum_{i = 1}^{n} k_{i} u_{i} (t), x (0) = x_{0} .

(2)

Here,

u_{i} (t)

denotes the extraction effort of player i at time t, and the coefficient

k_{i} > 0

is used to convert the effort of the i-th player into the extraction intensity. In accordance with the physical nature of the problem, we impose the constraints that

u_{i} (t) \geq 0

and

x (t) \geq 0

for all

t \geq 0

. Moreover, if

x (t) = 0

, then the only feasible rate of extraction is

u_{i} (t) = 0

for all

i = 1, \dots, n

. To simplify the notation, we denote

u = (u_{1}, \dots, u_{n})

. We consider the problem within the framework of open-loop strategies.

The expected integral payoff of player i,

i = 1, \dots, n

is evaluated by the following formula:

J_{i} (x_{0}, u) = \int_{0}^{\infty} \int_{0}^{t} u_{i}^{μ} (s) d s d F (t),

(3)

where

μ \in (0, 1)

. According to Shevkoplyas and Kostyunin (2013), Equation (3) can be written as follows:

J_{i} (x_{0}, u) = \int_{0}^{\infty} (1 - F (t)) u_{i}^{μ} (t) d t = \int_{0}^{t_{1}} e^{- λ_{1} t} u_{i}^{μ} (t) d t + \int_{t_{1}}^{\infty} e^{- λ_{2} t} e^{t_{1} (λ_{2} - λ_{1})} u_{i}^{μ} (t) d t .

(4)

In this study, the model aims to represent a typical dynamic decision-making problem faced by a coalition extracting a non-renewable resource (e.g., petroleum, natural gas, or mineral resources). Player i’s extraction effort

u_{i} (t)

can be interpreted as its invested capital, equipment, or number of drilling rigs. The coefficient

k_{i}

characterizes player i’s technical efficiency in extraction; a higher

k_{i}

value implies a greater extraction intensity for the same level of effort. The parameter

μ \in (0, 1)

captures the diminishing marginal returns of capital investment, a common assumption in resource economics. The random duration T could represent the time of resource exhaustion or the random time at which extraction activities are forcibly terminated due to external uncertainties such as risks of accidents and technical failures, economic constraints, new environmental policies, or technological revolutions. Thus, each player’s objective is to maximize their expected total payoff under the dual constraints of dynamic resource depletion and future uncertainty.

We hypothesize that players cooperate so as to achieve the maximum total payoff:

\sum_{i = 1}^{n} J_{i} (x_{0}, u) = \sum_{i = 1}^{n} (\int_{0}^{t_{1}} e^{- λ_{1} t} u_{i}^{μ} (t) d t + \int_{t_{1}}^{\infty} e^{- λ_{2} t} e^{t_{1} (λ_{2} - λ_{1})} u_{i}^{μ} (t) d t) .

The optimal control problem can be divided into two sub-problems, corresponding to intervals

I_{1} = [0, t_{1})

and

I_{2} = [t_{1}, \infty)

. Over every interval, we employ the Pontryagin maximum principle Pontryagin (2018).

On the interval $I_{2} = [t_{1}, \infty)$
The Hamiltonian function is written as

$H_{2} (x, u, ψ, t) = e^{t_{1} (λ_{2} - λ_{1})} e^{- λ_{2} t} \sum_{i = 1}^{n} u_{i}^{μ} + ψ_{2} (- \sum_{i = 1}^{n} k_{i} u_{i}),$

(5)

where $ψ_{2} (t)$ is the adjoint variable.
The optimal controls ${\bar{u}}_{i} (t)$ are obtained from the first-order optimality conditions $\frac{\partial H_{2}}{\partial u_{i}} = 0$ :

${\bar{u}}_{i} (t) = {(\frac{ψ_{2} k_{i} e^{λ_{2} t} e^{- t_{1} (λ_{2} - λ_{1})}}{μ})}^{\frac{1}{μ - 1}} .$

The second derivative of $H_{2}$ ensures that the obtained optimal controls are maximum

$\frac{\partial^{2} H_{2} (x, u, ψ, t)}{\partial {u_{i}}^{2}} = e^{t_{1} (λ_{2} - λ_{1})} e^{- λ_{2} t} μ (μ - 1) u_{i}^{μ - 2} < 0 .$

The equation for the adjoint variable takes the following form

${\dot{ψ}}_{2} = - \frac{\partial H_{2} (x, u, ψ, t)}{\partial x} = 0,$

from which we obtain $ψ_{2} (t) = C = c o n s t$ . Using transversality condition $\lim_{t \to \infty} ψ_{2} (t) x (t) = 0$ , we have the following form for the optimal trajectory on the interval $I_{2}$ :

${\bar{x}}_{2} (t) = \frac{(1 - μ) K}{λ_{2}} {(\frac{C e^{λ_{2} t} e^{- t_{1} (λ_{2} - λ_{1})}}{μ})}^{\frac{1}{μ - 1}},$

where $K = \sum_{i = 1}^{n} k_{i}^{\frac{μ}{μ - 1}}$ .
On the interval $I_{1} = [0, t_{1})$
In the same way, we define the Hamiltonian function

$H_{1} (x, u, ψ, t) = e^{- λ_{1} t} \sum_{i = 1}^{n} u_{i}^{μ} + ψ_{1} (- \sum_{i = 1}^{n} k_{i} u_{i}) .$

(6)

The optimal controls ${\bar{u}}_{i} (t)$ are obtained from the first-order optimality conditions

${\bar{u}}_{i} (t) = {(\frac{ψ_{1} k_{i} e^{λ_{1} t}}{μ})}^{\frac{1}{μ - 1}} .$

The canonical system is

${\dot{ψ}}_{1} = - \frac{\partial H_{1} (x, u, ψ, t)}{\partial x} = 0,$

subject to the boundary condition $ψ_{1} (t_{1}) = ψ_{2} (t_{1}) = C$ . We can obtain that $ψ_{1} (t) = ψ_{2} (t) = C$ . By leveraging the initial condition $x (0) = x_{0}$ , we have the following form for the optimal trajectory on the interval $I_{1}$ :

${\bar{x}}_{1} (t) = x_{0} - \frac{(1 - μ) K}{λ_{1}} {(\frac{C}{μ})}^{\frac{1}{μ - 1}} + \frac{(1 - μ) K}{λ_{1}} {(\frac{C e^{λ_{1} t}}{μ})}^{\frac{1}{μ - 1}},$

where $C = μ {(\frac{x_{0} λ_{1} λ_{2}}{K (1 - μ) ((λ_{1} - λ_{2}) e^{\frac{λ_{1} t_{1}}{μ - 1}} + λ_{2})})}^{μ - 1}$ , which is obtained using the condition $x_{1} (t_{1}) = x_{2} (t_{1})$ .

The optimal cooperative strategies have the following form:

{\bar{u}}_{i} (t) = \{\begin{matrix} \frac{x_{0} λ_{1} λ_{2} k_{i}^{\frac{1}{μ - 1}} e^{\frac{λ_{1} (t - t_{1})}{μ - 1}}}{K (1 - μ) (λ_{1} + λ_{2} (e^{\frac{λ_{1} t_{1}}{1 - μ}} - 1))}, & t \in [0, t_{1}), \\ \frac{x_{0} λ_{1} λ_{2} k_{i}^{\frac{1}{μ - 1}} e^{\frac{λ_{2} (t - t_{1})}{μ - 1}}}{K (1 - μ) (λ_{1} + λ_{2} (e^{\frac{λ_{1} t_{1}}{1 - μ}} - 1))}, & t \in [t_{1}, \infty) . \end{matrix}

(7)

The cooperative trajectory, corresponding to (7), takes the following form:

\bar{x} (t) = \{\begin{matrix} \frac{x_{0} (λ_{1} + λ_{2} (e^{\frac{λ_{1} (t - t_{1})}{μ - 1}} - 1))}{λ_{1} + λ_{2} (e^{\frac{λ_{1} t_{1}}{1 - μ}} - 1)}, & t \in [0, t_{1}), \\ \frac{x_{0} λ_{1} e^{\frac{λ_{2} (t - t_{1})}{μ - 1}}}{λ_{1} + λ_{2} (e^{\frac{λ_{1} t_{1}}{1 - μ}} - 1)}, & t \in [t_{1}, \infty) . \end{matrix}

(8)

The total payoff is

J (x_{0}, \bar{u}) = \sum_{i = 1}^{n} J_{i} (x_{0}, \bar{u}) = x_{0}^{μ} {(\frac{K (1 - μ) (e^{\frac{λ_{1} t_{1}}{μ - 1}} (λ_{1} - λ_{2}) + λ_{2})}{λ_{1} λ_{2}})}^{1 - μ} .

3. Nash Equilibrium

In the work by Dockner (2000), an intriguing question regarding the non-renewable resource extraction game was considered. Specifically, it was investigated whether the cooperative solution in this game can be achieved as a Nash equilibrium of a non-cooperative game. It turns out that the answer depends on the parameter values of the models. We also study this question for a game with random duration. As is standard in optimal control theory with a random horizon, the expectation involved in the game’s objective function can be transformed into an equivalent deterministic problem. Although the model’s duration T is random, the verification of the Nash equilibrium, which specifically involves ensuring no player has an incentive to unilaterally deviate from their strategy, leads to a deterministic optimal control problem for any deviating player. Theorem 1 shows the results obtained.

Theorem 1.

If

λ_{1} > λ_{2}

, and for each player

i \in N

the inequality

1 - μ \geq \frac{1}{K} \sum_{j \neq i} k_{j}^{\frac{μ}{μ - 1}}

is satisfied, then the cooperative solution in this game is a Nash equilibrium.

Proof of Theorem 1.

Suppose that player i deviates from the optimal cooperative behaviour using strategy

{\tilde{u}}_{i} \neq {\bar{u}}_{i}

. It is worth noting that if, under the situation

({\bar{u}}_{1}, \dots, {\bar{u}}_{i - 1}, {\tilde{u}}_{i}, {\bar{u}}_{i + 1}, \dots, {\bar{u}}_{n})

, the resource is not exhausted by some finite point in time, then we have

J_{j} (x_{0}, {\bar{u}}_{1}, \dots, {\bar{u}}_{i - 1}, {\tilde{u}}_{i}, {\bar{u}}_{i + 1}, \dots, {\bar{u}}_{n}) = J_{j} (x_{0}, {\bar{u}}_{1}, \dots, {\bar{u}}_{i - 1}, {\bar{u}}_{i}, {\bar{u}}_{i + 1}, \dots, {\bar{u}}_{n})

for

j \in N ∖ i

. This indicates that player i cannot achieve a higher payoff in this situation compared to the situation

\bar{u}

, as such an outcome would contradict the fact that the sum of players’ payoffs is maximized in the situation

\bar{u}

. Accordingly, a deviation from

{\bar{u}}_{i}

can be beneficial for player i only if, in the situation

({\bar{u}}_{1}, \dots, {\bar{u}}_{i - 1}, {\tilde{u}}_{i}, {\bar{u}}_{i + 1}, \dots, {\bar{u}}_{n})

, the resource is exhausted by some time

T_{i} < \infty

.

First, consider the case where $T_{i} \leq t_{1}$ .
Player i solves the following optimization problem:

$\begin{matrix} \max_{u_{i}} E {J_{i} (x_{0}, {\bar{u}}_{1}, \dots, {\bar{u}}_{i - 1}, u_{i}, {\bar{u}}_{i + 1}, \dots, {\bar{u}}_{n}, T_{i})} = \max_{u_{i}} \int_{0}^{T_{i}} e^{- λ_{1} t} u_{i}^{μ} (t) d t, \\ \dot{x} (t) = - k_{i} u_{i} (t) - \sum_{j \neq i} k_{j} {\bar{u}}_{j} (t), x (0) = x_{0}, x (T_{i}) = 0 . \end{matrix}$

(9)

Let ${\tilde{u}}_{i}$ further denote the solution to (9). To determine ${\tilde{u}}_{i} (t)$ , consider the Hamiltonian function for player i:

$H_{i} = e^{- λ_{1} t} u_{i}^{μ} (t) + ψ_{i} (t) (- k_{i} u_{i} (t) - \sum_{j \neq i} k_{j} {\bar{u}}_{j} (t)) .$

By solving the respective canonical system, we obtain

${\tilde{u}}_{i} = \frac{(x_{0} + A (1 - e^{\frac{λ_{1} T_{i}}{μ - 1}}) e^{\frac{λ_{1} t_{1}}{1 - μ}} \frac{μ - 1}{λ_{1}}) λ_{1}}{(1 - μ) (1 - e^{\frac{λ_{1} T_{i}}{μ - 1}}) k_{i}} e^{\frac{λ_{1} t}{μ - 1}},$

where $A = \frac{x_{0} λ_{1} λ_{2} \sum_{j \neq i} k_{j}^{\frac{μ}{μ - 1}}}{K (1 - μ) (λ_{1} - λ_{2} + λ_{2} e^{\frac{λ_{1} t_{1}}{1 - μ}})}$ .
Thus, the corresponding value of the payoff function of player i is

$J_{i} (x_{0}, {\bar{u}}_{1}, \dots, {\bar{u}}_{i - 1}, {\tilde{u}}_{i}, {\bar{u}}_{i + 1}, \dots, {\bar{u}}_{n}, T_{i}) = \frac{λ_{1}^{μ - 1} {(x_{0} + A (1 - e^{\frac{λ_{1} T_{i}}{μ - 1}}) e^{\frac{λ_{1} t_{1}}{1 - μ}} \frac{μ - 1}{λ_{1}})}^{μ}}{k_{i}^{μ} {(1 - μ)}^{μ - 1} {(1 - e^{\frac{λ_{1} T_{i}}{μ - 1}})}^{μ - 1}} .$

(10)

Then, we solve the problem $\max_{T_{i} \in [0, t_{1}]} J_{i} (x_{0}, {\bar{u}}_{1}, \dots, {\bar{u}}_{i - 1}, {\tilde{u}}_{i}, {\bar{u}}_{i + 1}, \dots, {\bar{u}}_{n}, T_{i})$ .
Find the first derivative of $J_{i} (x_{0}, {\bar{u}}_{1}, \dots, {\bar{u}}_{i - 1}, {\tilde{u}}_{i}, {\bar{u}}_{i + 1}, \dots, {\bar{u}}_{n}, T_{i})$ with respect to the variable $T_{i}$ :

$\begin{matrix} \frac{\partial J_{i} (x_{0}, {\bar{u}}_{1}, \dots, {\bar{u}}_{i - 1}, {\tilde{u}}_{i}, {\bar{u}}_{i + 1}, \dots, {\bar{u}}_{n}, T_{i})}{\partial T_{i}} = (1 - \frac{\sum_{j \neq i} k_{j}^{\frac{μ}{μ - 1}}}{K (1 - μ)} \frac{λ_{2} e^{\frac{λ_{1} t_{1}}{1 - μ}}}{(λ_{1} - λ_{2} + λ_{2} e^{\frac{λ_{1} t_{1}}{1 - μ}})} (1 - e^{\frac{λ_{1} T_{i}}{μ - 1}})) \\ \frac{e^{\frac{λ_{1} T_{i}}{μ - 1}}}{{(1 - μ)}^{μ - 1}} {(\frac{λ_{1} x_{0}}{k_{i} (1 - e^{\frac{λ_{1} T_{i}}{μ - 1}})})}^{μ} {(1 - \frac{\sum_{j \neq i} k_{j}^{\frac{μ}{μ - 1}}}{K} \frac{λ_{2} e^{\frac{λ_{1} t_{1}}{1 - μ}}}{(λ_{1} - λ_{2} + λ_{2} e^{\frac{λ_{1} t_{1}}{1 - μ}})} (1 - e^{\frac{λ_{1} T_{i}}{μ - 1}}))}^{μ - 1} . \end{matrix}$

Note that $\frac{\sum_{j \neq i} k_{j}^{\frac{μ}{μ - 1}}}{K} < 1$ , $1 - e^{\frac{λ_{1} T_{i}}{μ - 1}} < 1$ . If $λ_{1} > λ_{2}$ , then $\frac{λ_{2} e^{\frac{λ_{1} t_{1}}{1 - μ}}}{λ_{1} - λ_{2} + λ_{2} e^{\frac{λ_{1} t_{1}}{1 - μ}}} < 1$ . So, $1 - \frac{\sum_{j \neq i} k_{j}^{\frac{μ}{μ - 1}}}{K} \frac{λ_{2} e^{\frac{λ_{1} t_{1}}{1 - μ}}}{(λ_{1} - λ_{2} + λ_{2} e^{\frac{λ_{1} t_{1}}{1 - μ}})} (1 - e^{\frac{λ_{1} T_{i}}{μ - 1}}) > 0$ . And if also $1 - μ \geq \frac{1}{K} \sum_{j \neq i} k_{j}^{\frac{μ}{μ - 1}}$ , then $1 - \frac{\sum_{j \neq i} k_{j}^{\frac{μ}{μ - 1}}}{K (1 - μ)} \frac{λ_{2} e^{\frac{λ_{1} t_{1}}{1 - μ}}}{(λ_{1} - λ_{2} + λ_{2} e^{\frac{λ_{1} t_{1}}{1 - μ}})} (1 - e^{\frac{λ_{1} T_{i}}{μ - 1}}) > 0$ .
It can be concluded that if $λ_{1} > λ_{2}$ and $1 - μ \geq \frac{1}{K} \sum_{j \neq i} k_{j}^{\frac{μ}{μ - 1}}$ , then

$\frac{\partial J_{i} (x_{0}, {\bar{u}}_{1}, \dots, {\bar{u}}_{i - 1}, {\tilde{u}}_{i}, {\bar{u}}_{i + 1}, \dots, {\bar{u}}_{n}, T_{i})}{\partial T_{i}} > 0$

and

$\arg \max_{T_{i} \in [0, t_{1}]} J_{i} (x_{0}, {\bar{u}}_{1}, \dots, {\bar{u}}_{i - 1}, {\tilde{u}}_{i}, {\bar{u}}_{i + 1}, \dots, {\bar{u}}_{n}, T_{i}) = t_{1} .$
Now consider the case where $T_{i} > t_{1}$ .
Player i solves the following optimization problem:

$\begin{matrix} \max_{u_{i}} E {J_{i} (x_{0}, {\bar{u}}_{1}, \dots, {\bar{u}}_{i - 1}, u_{i}, {\bar{u}}_{i + 1}, \dots, {\bar{u}}_{n}, T_{i})} \\ = \max_{u_{i}} \{\int_{0}^{t_{1}} e^{- λ_{1} t} u_{i}^{μ} (t) d t + \int_{t_{1}}^{T_{i}} e^{- λ_{2} t} e^{t_{1} (λ_{2} - λ_{1})} u_{i}^{μ} (t) d t\}, \\ \dot{x} (t) = - k_{i} u_{i} (t) - \sum_{j \neq i} k_{j} {\bar{u}}_{j} (t), x (0) = x_{0}, x (T_{i}) = 0 . \end{matrix}$

(11)

Let ${\overset{ˇ}{u}}_{i} (t)$ further denote the solution to (11). The corresponding trajectory is

$\overset{ˇ}{x} (t) = \{\begin{matrix} {\overset{ˇ}{x}}_{1} (t), & t \in [0, t_{1}], \\ {\overset{ˇ}{x}}_{2} (t), & t \in [t_{1}, T] . \end{matrix}$

We construct the Hamiltonians for the intervals $[0, t_{1}]$ and $[t_{1}, T_{i}]$ , respectively.

$\begin{matrix} H_{i 1} = e^{- λ_{1} t} u_{i}^{μ} (t) + ψ_{i 1} (t) (- k_{i} u_{i} (t) - \sum_{j \neq i} k_{j} {\bar{u}}_{j} (t)), \\ H_{i 2} = e^{t_{1} (λ_{2} - λ_{1})} e^{- λ_{2} t} u_{i}^{μ} (t) + ψ_{i 2} (t) (- k_{i} u_{i} (t) - \sum_{j \neq i} k_{j} {\bar{u}}_{j} (t)) . \end{matrix}$

Using the boundary conditions ${\overset{ˇ}{x}}_{1} (0) = x_{0}$ , ${\overset{ˇ}{x}}_{2} (T_{i}) = 0$ , ${\overset{ˇ}{x}}_{1} (t_{1}) = {\overset{ˇ}{x}}_{2} (t_{1})$ , $ψ_{i 1} (t_{1}) = ψ_{i 2} (t_{1})$ , the solution can be obtained

${\overset{ˇ}{u}}_{i} (t) = \{\begin{matrix} \frac{x_{0} - A (1 - μ) B e^{\frac{λ_{1} t_{1}}{1 - μ}}}{k_{i} B (1 - μ)} e^{\frac{λ_{1} t}{μ - 1}}, & t \in [0, t_{1}], \\ \frac{x_{0} - A (1 - μ) B e^{\frac{λ_{1} t_{1}}{1 - μ}}}{k_{i} B (1 - μ)} e^{\frac{λ_{2} t}{μ - 1}} e^{\frac{t_{1} (λ_{2} - λ_{1})}{1 - μ}}, & t \in [t_{1}, T], \end{matrix}$

(12)

where $B = \frac{λ_{2} (1 - e^{\frac{λ_{1} t_{1}}{μ - 1}}) + λ_{1} e^{\frac{λ_{1} t_{1}}{μ - 1}} (1 - e^{\frac{λ_{2} (T_{i} - t_{1})}{μ - 1}})}{λ_{1} λ_{2}}$ .
The corresponding value of the payoff function of player i is

$J_{i} (x_{0}, {\bar{u}}_{1}, \dots, {\bar{u}}_{i - 1}, {\overset{ˇ}{u}}_{i}, {\bar{u}}_{i + 1}, \dots, {\bar{u}}_{n}) = (1 - μ) {(\frac{x_{0} - (1 - μ) A B e^{\frac{λ_{1} t_{1}}{1 - μ}}}{k_{i} (1 - μ)})}^{μ} B^{1 - μ} .$

(13)

Its first derivative over $T_{i}$ is

$\begin{matrix} \frac{\partial J_{i} (x_{0}, {\bar{u}}_{1}, \dots, {\bar{u}}_{i - 1}, {\overset{ˇ}{u}}_{i}, {\bar{u}}_{i + 1}, \dots, {\bar{u}}_{n}, T_{i})}{\partial T_{i}} = {(1 - μ)}^{1 - μ} \frac{x_{0}^{μ}}{{(k_{i} B)}^{μ}} e^{\frac{λ_{1} t_{1}}{μ - 1}} e^{\frac{λ_{2} (T_{i} - t_{1})}{μ - 1}} \\ (1 - \frac{\sum_{j \neq i} k_{j}^{\frac{μ}{μ - 1}}}{K (1 - μ)} (1 - \frac{λ_{1} e^{\frac{λ_{2} (T_{i} - t_{1})}{μ - 1}}}{λ_{1} - λ_{2} + λ_{2} e^{\frac{λ_{1} t_{1}}{1 - μ}}})) {(1 - \frac{\sum_{j \neq i} k_{j}^{\frac{μ}{μ - 1}}}{K} (1 - \frac{λ_{1} e^{\frac{λ_{2} (T_{i} - t_{1})}{μ - 1}}}{λ_{1} - λ_{2} + λ_{2} e^{\frac{λ_{1} t_{1}}{1 - μ}}}))}^{μ - 1} . \end{matrix}$

Note that if $λ_{1} > λ_{2}$ , then $1 - \frac{\sum_{j \neq i} k_{j}^{\frac{μ}{μ - 1}}}{K} (1 - \frac{λ_{1} e^{\frac{λ_{2} (T_{i} - t_{1})}{μ - 1}}}{λ_{1} - λ_{2} + λ_{2} e^{\frac{λ_{1} t_{1}}{1 - μ}}}) > 0$ , since $\frac{\sum_{j \neq i} k_{j}^{\frac{μ}{μ - 1}}}{K} < 1$ and $\frac{λ_{1} e^{\frac{λ_{2} (T_{i} - t_{1})}{μ - 1}}}{λ_{1} - λ_{2} + λ_{2} e^{\frac{λ_{1} t_{1}}{1 - μ}}} < 1$ . Furthermore, if $1 - μ \geq \frac{1}{K} \sum_{j \neq i} k_{j}^{\frac{μ}{μ - 1}}$ , then $1 - \frac{\sum_{j \neq i} k_{j}^{\frac{μ}{μ - 1}}}{K (1 - μ)} (1 - \frac{λ_{1} e^{\frac{λ_{2} (T_{i} - t_{1})}{μ - 1}}}{λ_{1} - λ_{2} + λ_{2} e^{\frac{λ_{1} t_{1}}{1 - μ}}}) < 1$ .
It can be concluded that if $λ_{1} > λ_{2}$ and $1 - μ \geq \frac{1}{K} \sum_{j \neq i} k_{j}^{\frac{μ}{μ - 1}}$ , then

$\frac{\partial J_{i} (x_{0}, {\bar{u}}_{1}, \dots, {\bar{u}}_{i - 1}, {\overset{ˇ}{u}}_{i}, {\bar{u}}_{i + 1}, \dots, {\bar{u}}_{n}, T_{i})}{\partial T_{i}} > 0 .$

Therefore, $J_{i} (x_{0}, {\bar{u}}_{1}, \dots, {\bar{u}}_{i - 1}, {\overset{ˇ}{u}}_{i}, {\bar{u}}_{i + 1}, \dots, {\bar{u}}_{n}, T_{i})$ is an increasing function with respect to a variable $T_{i}$ .

Since

J_{i} (x_{0}, {\bar{u}}_{1}, \dots, {\bar{u}}_{i - 1}, {\tilde{u}}_{i}, {\bar{u}}_{i + 1}, \dots, {\bar{u}}_{n}, t_{1}) = J_{i} (x_{0}, {\bar{u}}_{1}, \dots, {\bar{u}}_{i - 1}, {\overset{ˇ}{u}}_{i}, {\bar{u}}_{i + 1}, \dots, {\bar{u}}_{n}, t_{1})

, we conclude that the payoff of player i in the case

T_{i} \in [0, t_{1}]

is no more than his payoff in the case

T_{i} > t_{1}

. Taking into account the increase in the function

J_{i} (x_{0}, {\bar{u}}_{1}, \dots, {\bar{u}}_{i - 1}, {\overset{ˇ}{u}}_{i}, {\bar{u}}_{i + 1}, \dots, {\bar{u}}_{n}, T_{i})

with respect to variable

T_{i}

, and the fact that

\lim_{T_{i} \to \infty} J_{i} (x_{0}, {\bar{u}}_{1}, \dots, {\bar{u}}_{i - 1}, {\overset{ˇ}{u}}_{i}, {\bar{u}}_{i + 1}, \dots, {\bar{u}}_{n}, T_{i}) = J_{i} (x_{0}, {\bar{u}}_{1}, \dots, {\bar{u}}_{i - 1}, {\bar{u}}_{i}, {\bar{u}}_{i + 1}, \dots, {\bar{u}}_{n}),

we conclude that that if

λ_{1} > λ_{2}

and

1 - μ \geq \frac{1}{K} \sum_{j \neq i} k_{j}^{\frac{μ}{μ - 1}}

, then for any finite value of

T_{i}

,

J_{i} (x_{0}, {\bar{u}}_{1}, \dots, {\bar{u}}_{i - 1}, {\overset{ˇ}{u}}_{i}, {\bar{u}}_{i + 1}, \dots, {\bar{u}}_{n}, T_{i}) \leq J_{i} (x_{0}, {\bar{u}}_{1}, \dots, {\bar{u}}_{i - 1}, {\bar{u}}_{i}, {\bar{u}}_{i + 1}, \dots, {\bar{u}}_{n}) .

This means that no player i benefits from deviating from the cooperative trajectory, i.e., the cooperative solution in this game is a Nash equilibrium. □

This Theorem has significant practical implications. Its conditions

λ_{1} > λ_{2}

and

1 - μ \geq \frac{1}{K} \sum_{j \neq i} k_{j}^{\frac{μ}{μ - 1}}

indicate that whether the cooperative solution can be a Nash equilibrium depends on three key factors: the hazard rates

λ_{1}

and

λ_{2}

, the marginal returns rate

μ

, and the technical efficiency

k_{j}

of other players within the coalition. In practical resource extraction scenarios, this implies that, for a coalition to maintain stability, two prerequisites must be met. On the one hand, the risk of unexpected project termination in the early stage (before

t_{1}

) must be significantly higher than that in the later stage (after

t_{1}

). On the other hand, the technical efficiencies among coalition members must not differ excessively. Alternatively, if there are technologically advanced players, the effective benefit space

1 - μ

must be large enough to suppress their incentive to unilaterally expand production and violate the cooperative agreement.

4. Time-Dependent Case

Let us now focus on the scenario where

k_{i} = 1

for all

i = 1, \dots, n

and

μ = \frac{1}{2}

. It is worth noting that for such parameter values, the condition stated in Theorem 1 holds only if

n = 2

. Nevertheless, for the sake of generality, we consider the problem in general for any number n. Although these parameter assignments are hypothetical, their values and relative relationships refer to stylized facts in resource economics to ensure the numerical results exhibit economic rationality.

Suppose that players do not have information about the exact value of the switching moment

t_{1}

. They use an estimated switching moment

{\hat{t}}_{1}

of the switching moment in the control (7) instead of the exact value

t_{1}

. This estimation arises due to the inherent uncertainty in the system and the lack of precise information. The exact value of

t_{1}

is not directly observable, as it is influenced by multiple factors, including the dynamic interactions among players, system parameters, and external disturbances. Players use

{\hat{t}}_{1}

based on available information, historical data, or heuristic predictions, which serves as a reasonable approximation under these uncertain conditions.

Then, their controls have the following form:

{\hat{u}}_{i} (t) = \{\begin{matrix} \frac{2 x_{0} λ_{1} λ_{2} e^{2 λ_{1} ({\hat{t}}_{1} - t)}}{n (λ_{1} - λ_{2} + λ_{2} e^{2 λ_{1} {\hat{t}}_{1}})}, & t \in [0, {\hat{t}}_{1}), \\ \frac{2 x_{0} λ_{1} λ_{2} e^{2 λ_{2} ({\hat{t}}_{1} - t)}}{n (λ_{1} - λ_{2} + λ_{2} e^{2 λ_{1} {\hat{t}}_{1}})}, & t \in [{\hat{t}}_{1}, \infty) . \end{matrix}

(14)

For convenience, we denote the control of player i on the interval

[0, {\hat{t}}_{1})

as

{\hat{u}}_{i 1} (t)

and the control on the interval

[{\hat{t}}_{1}, \infty)

as

{\hat{u}}_{i 2} (t)

.

The trajectory corresponding to these controls is

\hat{x} (t) = \{\begin{matrix} x_{0} \frac{λ_{1} + λ_{2} (e^{2 λ_{1} ({\hat{t}}_{1} - t)} - 1)}{λ_{1} + λ_{2} (e^{2 λ_{1} {\hat{t}}_{1}} - 1)}, & t \in [0, {\hat{t}}_{1}), \\ x_{0} \frac{λ_{1} e^{2 λ_{2} ({\hat{t}}_{1} - t)}}{λ_{1} + λ_{2} (e^{2 λ_{1} {\hat{t}}_{1}} - 1)}, & t \in [{\hat{t}}_{1}, \infty) . \end{matrix}

(15)

The form of players’ payoff in this scenario depends on the relationship between the values of

t_{1}

and

{\hat{t}}_{1}

, which is expressed as follows:

J (x_{0}, \hat{u}) = \{\begin{matrix} J_{I_{1}} (x_{0}, \hat{u}), & t_{1} \in [0, {\hat{t}}_{1}), \\ J_{I_{2}} (x_{0}, \hat{u}), & t_{1} \in [{\hat{t}}_{1}, \infty) . \end{matrix}

If $t_{1} \in [0, {\hat{t}}_{1})$ , the total payoff has the following form:

$\begin{matrix} J_{I_{1}} (x_{0}, \hat{u}) = \sum_{i = 1}^{n} J_{i} (x_{0}, \hat{u}) \\ = \sum_{i = 1}^{n} (\int_{0}^{t_{1}} e^{- λ_{1} t} \sqrt{{\hat{u}}_{i 1}} d t + \int_{t_{1}}^{{\hat{t}}_{1}} e^{- λ_{2} t} e^{t_{1} (λ_{2} - λ_{1})} \sqrt{{\hat{u}}_{i 1}} d t + \int_{{\hat{t}}_{1}}^{\infty} e^{- λ_{2} t} e^{t_{1} (λ_{2} - λ_{1})} \sqrt{{\hat{u}}_{i 2}} d t) \\ = \frac{1}{λ_{1} + λ_{2}} \sqrt{\frac{n x_{0}}{2 λ_{1} λ_{2} (λ_{1} + λ_{2} (e^{2 λ_{1} {\hat{t}}_{1}} - 1))}} (λ_{2} (λ_{1} + λ_{2}) e^{λ_{1} {\hat{t}}_{1}} + λ_{2} (λ_{1} - λ_{2}) e^{λ_{1} ({\hat{t}}_{1} - 2 t_{1})} \\ + λ_{1} (λ_{1} - λ_{2}) e^{t_{1} (λ_{2} - λ_{1}) - λ_{2} {\hat{t}}_{1}}) . \end{matrix}$
If $t_{1} \in [{\hat{t}}_{1}, \infty)$ , the total payoff has the following form:

$\begin{matrix} J_{I_{2}} (x_{0}, \hat{u}) = \sum_{i = 1}^{n} J_{i} (x_{0}, \hat{u}) \\ = \sum_{i = 1}^{n} (\int_{0}^{{\hat{t}}_{1}} e^{- λ_{1} t} \sqrt{{\hat{u}}_{i 1}} d t + \int_{{\hat{t}}_{1}}^{t_{1}} e^{- λ_{1} t} \sqrt{{\hat{u}}_{i 2}} d t + \int_{t_{1}}^{\infty} e^{- λ_{2} t} e^{t_{1} (λ_{2} - λ_{1})} \sqrt{{\hat{u}}_{i 2}} d t) \\ = \frac{1}{λ_{1} + λ_{2}} \sqrt{\frac{n x_{0}}{2 λ_{1} λ_{2} (λ_{1} + λ_{2} (e^{2 λ_{1} {\hat{t}}_{1}} - 1))}} (λ_{2} (λ_{1} + λ_{2}) (e^{λ_{1} {\hat{t}}_{1}} - e^{- λ_{1} {\hat{t}}_{1}}) \\ + 2 λ_{1} λ_{2} (e^{- λ_{1} {\hat{t}}_{1}} - e^{- λ_{1} t_{1} - λ_{2} (t_{1} - {\hat{t}}_{1})}) + λ_{1} (λ_{1} + λ_{2}) e^{- λ_{1} t_{1} - λ_{2} (t_{1} - {\hat{t}}_{1})}) . \end{matrix}$

The subsequent discussion focuses on the optimal determination of

{\hat{t}}_{1}

.

5. Optimal Estimate

To minimize potential risks, players may reach an agreement on a guess

{\hat{t}}_{1} \in [0, \infty)

, which minimizes the worst-case loss. Consequently, the following minimax problem needs to be solved:

\inf_{{\hat{t}}_{1} \in [0, \infty)} \sup_{t_{1} \in [0, \infty)} (J (x_{0}, \bar{u}) - J (x_{0}, \hat{u})),

(16)

where

{\hat{t}}_{1}

is the estimated value of the switching moment, and

t_{1}

is the actual value.

Denote

m = \frac{λ_{2}}{λ_{1}}

. The following theorem provides a solution.

Theorem 2.

If

λ_{1} > λ_{2}

, then the optimal estimate

{\hat{t}}_{1}^{*}

of the unknown switching moment

t_{1}

that solves (16) is

{\hat{t}}_{1}^{*} = - \frac{\ln p}{λ_{1}},

where p is the solution of the equation

(1 + m) \sqrt{p^{2} + m (1 - p^{2})} = (1 + \sqrt{m}) (m + p^{m + 1} - m p^{2}) .

Proof of Theorem 2.

Denote

\begin{matrix} D_{1} (t_{1}, {\hat{t}}_{1}) = J (x_{0}, \bar{u}) - J_{I_{1}} (x_{0}, \hat{u}), \\ D_{2} (t_{1}, {\hat{t}}_{1}) = J (x_{0}, \bar{u}) - J_{I_{2}} (x_{0}, \hat{u}) . \end{matrix}

By applying the calculation results from Section 2 and Section 4, we can obtain

\begin{matrix} D_{1} (t_{1}, {\hat{t}}_{1}) = & \sqrt{\frac{n x_{0} (e^{- 2 λ_{1} t_{1}} (λ_{1} - λ_{2}) + λ_{2})}{2 λ_{1} λ_{2}}} - \frac{1}{λ_{1} + λ_{2}} \sqrt{\frac{n x_{0}}{2 λ_{1} λ_{2} (λ_{1} + λ_{2} (e^{2 λ_{1} {\hat{t}}_{1}} - 1))}} \\ (λ_{2} (λ_{1} + λ_{2}) e^{λ_{1} {\hat{t}}_{1}} + λ_{2} (λ_{1} - λ_{2}) e^{λ_{1} ({\hat{t}}_{1} - 2 t_{1})} + λ_{1} (λ_{1} - λ_{2}) e^{t_{1} (λ_{2} - λ_{1}) - λ_{2} {\hat{t}}_{1}}), \\ D_{2} (t_{1}, {\hat{t}}_{1}) = & \sqrt{\frac{n x_{0} (e^{- 2 λ_{1} t_{1}} (λ_{1} - λ_{2}) + λ_{2})}{2 λ_{1} λ_{2}}} - \frac{1}{λ_{1} + λ_{2}} \sqrt{\frac{n x_{0}}{2 λ_{1} λ_{2} (λ_{1} + λ_{2} (e^{2 λ_{1} {\hat{t}}_{1}} - 1))}} \\ (λ_{2} (λ_{1} + λ_{2}) (e^{λ_{1} {\hat{t}}_{1}} - e^{- λ_{1} {\hat{t}}_{1}}) + 2 λ_{1} λ_{2} (e^{- λ_{1} {\hat{t}}_{1}} - e^{- λ_{1} t_{1} - λ_{2} (t_{1} - {\hat{t}}_{1})}) \\ + λ_{1} (λ_{1} + λ_{2}) e^{- λ_{1} t_{1} - λ_{2} (t_{1} - {\hat{t}}_{1})}) . \end{matrix}

First, we consider the maximization problem which can be rewritten as

$\sup_{t_{1} \in [0, \infty)} (J (x_{0}, \bar{u}) - J (x_{0}, \hat{u})) = \max \{\sup_{t_{1} \in [0, {\hat{t}}_{1})} D_{1} (t_{1}, {\hat{t}}_{1}), \sup_{t_{1} \in [{\hat{t}}_{1}, \infty)} D_{2} (t_{1}, {\hat{t}}_{1})\} .$

Consider the behaviour of functions $D_{1}$ and $D_{2}$ to solve the maximization problem.
- When $t_{1} \in [0, {\hat{t}}_{1})$ ,
  
  $\begin{matrix} \frac{\partial D_{1}}{\partial t_{1}} = e^{- 2 λ_{1} t_{1}} (λ_{2} - λ_{1}) \sqrt{\frac{n x_{0} λ_{1}}{2 λ_{2}}} \\ (\frac{1}{\sqrt{e^{- 2 λ_{1} t_{1}} (λ_{1} - λ_{2}) + λ_{2}}} - \frac{(λ_{1} - λ_{2}) e^{(λ_{1} + λ_{2}) (t_{1} - {\hat{t}}_{1})} + 2 λ_{2}}{(λ_{1} + λ_{2}) \sqrt{e^{- 2 λ_{1} {\hat{t}}_{1}} (λ_{1} - λ_{2}) + λ_{2}}}) . \end{matrix}$
  
  Note that if $λ_{1} > λ_{2}$ and $t_{1} < {\hat{t}}_{1}$ , then
  
  $\frac{1}{\sqrt{e^{- 2 λ_{1} t_{1}} (λ_{1} - λ_{2}) + λ_{2}}} > \frac{(λ_{1} - λ_{2}) e^{(λ_{1} + λ_{2}) (t_{1} - {\hat{t}}_{1})} + 2 λ_{2}}{(λ_{1} + λ_{2}) \sqrt{e^{- 2 λ_{1} {\hat{t}}_{1}} (λ_{1} - λ_{2}) + λ_{2}}} .$
  
  Refer to Appendix B for the proof of this fact.
  It can be concluded from this that $\frac{\partial D_{1}}{\partial t_{1}} < 0$ for $t_{1} < {\hat{t}}_{1}$ and $\frac{\partial D_{1}}{\partial t_{1}} |_{t_{1} = {\hat{t}}_{1}} = 0$ . This means that $D_{1} (t_{1}, {\hat{t}}_{1})$ is a decreasing function of $t_{1}$ when $t_{1} \in [0, {\hat{t}}_{1})$ ; then,
  
  $\begin{matrix} \sup_{t_{1} \in [0, {\hat{t}}_{1})} D_{1} (t_{1}, {\hat{t}}_{1}) = D_{1} (0, {\hat{t}}_{1}) = \\ \sqrt{\frac{n x_{0}}{2 λ_{2}}} (1 - \frac{1}{λ_{1} + λ_{2}} \sqrt{\frac{λ_{1}}{λ_{1} + λ_{2} (e^{2 λ_{1} {\hat{t}}_{1}} - 1)}} (2 λ_{2} e^{λ_{1} {\hat{t}}_{1}} + e^{- λ_{2} {\hat{t}}_{1}} (λ_{1} - λ_{2}))) . \end{matrix}$
- When $t_{1} \in [{\hat{t}}_{1}, \infty)$ ,
  
  $\begin{matrix} \frac{\partial D_{2}}{\partial t_{1}} = e^{- 2 λ_{1} t_{1}} (λ_{2} - λ_{1}) \sqrt{\frac{n x_{0} λ_{1}}{2 λ_{2}}} \\ (\frac{1}{\sqrt{e^{- 2 λ_{1} t_{1}} (λ_{1} - λ_{2}) + λ_{2}}} - \frac{e^{(λ_{2} - λ_{1}) ({\hat{t}}_{1} - t_{1})}}{\sqrt{e^{- 2 λ_{1} {\hat{t}}_{1}} (λ_{1} - λ_{2}) + λ_{2}}}) . \end{matrix}$
  
  Note that if $λ_{1} > λ_{2}$ and $t_{1} > {\hat{t}}_{1}$ , then (see Appendix C)
  
  $\frac{1}{\sqrt{e^{- 2 λ_{1} t_{1}} (λ_{1} - λ_{2}) + λ_{2}}} < \frac{e^{(λ_{2} - λ_{1}) ({\hat{t}}_{1} - t_{1})}}{\sqrt{e^{- 2 λ_{1} {\hat{t}}_{1}} (λ_{1} - λ_{2}) + λ_{2}}} .$
  
  Therefore, $\frac{\partial D_{2}}{\partial t_{1}} > 0$ for $t_{1} > {\hat{t}}_{1}$ and $\frac{\partial D_{2}}{\partial t_{1}} |_{t_{1} = {\hat{t}}_{1}} = 0$ . This means that $D_{2} (t_{1}, {\hat{t}}_{1})$ is an increasing function of $t_{1}$ when $t_{1} \in [{\hat{t}}_{1}, \infty)$ ; then,
  
  $\begin{matrix} \sup_{t_{1} \in [{\hat{t}}_{1}, \infty)} D_{2} (t_{1}, {\hat{t}}_{1}) = \lim_{t_{1} \to \infty} D_{2} (t_{1}, {\hat{t}}_{1}) = \\ \sqrt{\frac{n x_{0}}{2 λ_{1}}} (1 - \frac{1}{λ_{1} + λ_{2}} \sqrt{\frac{λ_{2}}{λ_{1} + λ_{2} (e^{2 λ_{1} {\hat{t}}_{1}} - 1)}} ((λ_{1} + λ_{2}) e^{λ_{1} {\hat{t}}_{1}} + (λ_{1} - λ_{2}) e^{- λ_{1} {\hat{t}}_{1}})) . \end{matrix}$
Let $\sup_{t_{1} \in [0, {\hat{t}}_{1})} D_{1} (t_{1}, {\hat{t}}_{1}) = L_{1} ({\hat{t}}_{1})$ , $\sup_{t_{1} \in [{\hat{t}}_{1}, \infty)} D_{2} (t_{1}, {\hat{t}}_{1}) = L_{2} ({\hat{t}}_{1})$ .
Then, the problem (16) is transformed into the following:

$\inf_{{\hat{t}}_{1} \in [0, \infty)} \max {L_{1} ({\hat{t}}_{1}), L_{2} ({\hat{t}}_{1})} .$

Note that $L_{1} ({\hat{t}}_{1})$ is an increasing function and $L_{2} ({\hat{t}}_{1})$ is a decreasing function (see the proof in Appendix D). Given that $L_{1} (0) < L_{2} (0)$ and $\lim_{{\hat{t}}_{1} \to \infty} L_{1} ({\hat{t}}_{1}) > \lim_{{\hat{t}}_{1} \to \infty} L_{2} ({\hat{t}}_{1})$ , it can be deduced that Equation $L_{1} ({\hat{t}}_{1}) = L_{2} ({\hat{t}}_{1})$ has only one root. Considering the behavior of these two functions, we can conclude that this root is the point of the minimum of the upper envelope of $L_{1} ({\hat{t}}_{1})$ and $L_{2} ({\hat{t}}_{1})$ graphs (the point of their intersection). This means that this root is the solution to the problem (16).
To find the root, we need to solve the following:

$\begin{matrix} \sqrt{\frac{1}{λ_{2}}} (1 - \frac{1}{λ_{1} + λ_{2}} \sqrt{\frac{λ_{1}}{λ_{1} + λ_{2} (e^{2 λ_{1} {\hat{t}}_{1}} - 1)}} (2 λ_{2} e^{λ_{1} {\hat{t}}_{1}} + e^{- λ_{2} {\hat{t}}_{1}} (λ_{1} - λ_{2}))) = \\ \sqrt{\frac{1}{λ_{1}}} (1 - \frac{1}{λ_{1} + λ_{2}} \sqrt{\frac{λ_{2}}{λ_{1} + λ_{2} (e^{2 λ_{1} {\hat{t}}_{1}} - 1)}} ((λ_{1} + λ_{2}) e^{λ_{1} {\hat{t}}_{1}} + (λ_{1} - λ_{2}) e^{- λ_{1} {\hat{t}}_{1}})) . \end{matrix}$

(17)

Let $e^{- λ_{1} {\hat{t}}_{1}} = p$ , $λ_{2} = m λ_{1} (0 < m < 1)$ , then (17) can be transformed into

$(1 + m) \sqrt{p^{2} + m (1 - p^{2})} = (1 + \sqrt{m}) (m + p^{m + 1} - m p^{2}) .$

This concludes the proof. □

To illustrate the result of Theorem 2, we present a numerical example with the following values of parameters:

x_{0} = 100

,

n = 3

,

λ_{1} = 0.5

,

λ_{2} = 0.25

. Figure 1 shows the graphs of

L_{1}

and

L_{2}

under these conditions. It can be observed that the intersection point of these graphs corresponds to the minimum of their upper envelope.

Finally, Table 2 shows the optimal estimates for different values of the parameter m. Figure 2 shows the maximum difference function

L_{1} ({\hat{t}}_{1})

and

L_{2} ({\hat{t}}_{1})

under these m values. The visualization results clearly reveal the mapping relationship between system parameters and functional behavior. Analysis demonstrates that m, as a key regulatory parameter, significantly influences the morphology of functions

L_{1}

and

L_{2}

and their intersection point: as m decreases, the intersection point of the two functions continuously moves upper-right, the optimal switching time

{\hat{t}}_{1}^{*}

increases substantially, and the corresponding function value also rises. This sensitivity analysis provides an intuitive basis for parameter optimization, indicating that system performance can be precisely modulated at different operating points by adjusting m.

In the context of resource extraction, the switching moment

t_{1}

can be interpreted as the anticipated time of a significant event, such as the enactment of new environmental regulations, the expected adoption time of a substitute technology, or a predicted market price inflection point. This time is unknown to the extractors due to incomplete information. The optimal estimate

{\hat{t}}_{1}^{*}

provided by Theorem 2 offers extractors a robust forecasting and decision-making tool. It also shows that

{\hat{t}}_{1}^{*}

is related to the parameter m, which is the hazard rate ratio between the time before and after the switch. Employing this estimate minimizes potential losses in the worst-case scenario, even if prediction errors exist, which is crucial for long-term investment and extraction planning under uncertainty.

6. State-Dependent Case

Following (Gromov & Gromova, 2014, 2017) and Balas and Tur (2023), we now assume that the stock level of the resource can influence the probability of a regime shift. Consequently, the switching does not occur at a fixed point in time but is triggered when a certain condition on the trajectory is satisfied. Within the framework of the model under consideration, such a condition could be the attainment of a predetermined level of resource stock.

Assume that

x_{1}

is fixed, with

x_{0} > x_{1} > 0

. The switching moment

t_{1}

for the composite distribution function (1) is determined by the condition

x (t_{1}) = x_{1}

.

Consider the Hamiltonian (6) in the interval

I_{1}

with boundary conditions

x (0) = x_{0}

,

x (t_{1}) = x_{1}

, and the Hamiltonian (5) in the interval

I_{2}

with boundary condition

x (t_{1}) = x_{1}

, along with the transversality condition

\lim_{t \to \infty} ψ_{2} (t) x (t) = 0

, players’ controls are obtained in the following form:

{\bar{\bar{u}}}_{i} (t) = \{\begin{matrix} \frac{2 λ_{1} (x_{0} - x_{1}) e^{- 2 λ_{1} t}}{n (1 - e^{- 2 λ_{1} t_{1}})}, & t \in [0, t_{1}), \\ \frac{2 λ_{2} x_{1} e^{2 λ_{2} (t_{1} - t)}}{n}, & t \in [t_{1}, \infty), \end{matrix}

(18)

To find the optimal solution, we also need to solve the following problem:

\max_{t_{1} > 0} \sum_{i = 1}^{n} J_{i} (x_{0}, \bar{\bar{u}}),

where

\sum_{i = 1}^{n} J_{i} (x_{0}, \bar{\bar{u}}) = \sqrt{\frac{n (x_{0} - x_{1}) (1 - e^{- 2 λ_{1} t_{1}})}{2 λ_{1}}} + \sqrt{\frac{n x_{1}}{2 λ_{2}}} e^{- λ_{1} t_{1}} .

The optimal switching moment

{\bar{t}}_{1}

is:

{\bar{t}}_{1} = \arg \max_{t_{1}} \sum_{i = 1}^{n} J_{i} (x_{0}, \bar{\bar{u}}) = \frac{1}{2 λ_{1}} \ln (1 + \frac{λ_{2} (x_{0} - x_{1})}{λ_{1} x_{1}}) .

In summary, the cooperative trajectory

\bar{x} (t)

and the optimal cooperative controls

{\bar{u}}_{i} (t)

at intervals

I_{1}

and

I_{2}

have the following form:

\bar{x} (t) = \{\begin{matrix} x_{0} - \frac{(x_{0} - x_{1}) (1 - e^{- 2 λ_{1} t})}{1 - e^{- 2 λ_{1} {\bar{t}}_{1}}} & t \in [0, {\bar{t}}_{1}), \\ x_{1} e^{2 λ_{2} ({\bar{t}}_{1} - t)}, & t \in [{\bar{t}}_{1}, \infty) . \end{matrix}

(19)

{\bar{u}}_{i} (t) = \{\begin{matrix} \frac{2 λ_{1} (x_{0} - x_{1}) e^{- 2 λ_{1} t}}{n (1 - e^{- 2 λ_{1} {\bar{t}}_{1}})}, & t \in [0, {\bar{t}}_{1}), \\ \frac{2 λ_{2} x_{1} e^{2 λ_{2} ({\bar{t}}_{1} - t)}}{n}, & t \in [{\bar{t}}_{1}, \infty) . \end{matrix}

(20)

And the total payoff

\sum_{i \in N} J_{i} (x_{0}, \bar{u}) = \sqrt{\frac{n (λ_{1} x_{1} - λ_{2} x_{1} + λ_{2} x_{0})}{2 λ_{1} λ_{2}}} .

7. Information Uncertainty

Suppose now that the value of

x_{1}

is unknown to the players. They use

{\hat{x}}_{1}

instead of

x_{1}

. Then, their strategies are

{\hat{u}}_{i} (t) = \{\begin{matrix} \frac{2 λ_{1} (x_{0} - {\hat{x}}_{1}) e^{- 2 λ_{1} t}}{n (1 - e^{- 2 λ_{1} {\hat{t}}_{1}})}, & t \in [0, {\hat{t}}_{1}), \\ \frac{2 λ_{2} {\hat{x}}_{1} e^{2 λ_{2} ({\hat{t}}_{1} - t)}}{n}, & t \in [{\hat{t}}_{1}, \infty), \end{matrix}

(21)

where

{\hat{t}}_{1} = \frac{1}{2 λ_{1}} \ln (1 + \frac{λ_{2} (x_{0} - {\hat{x}}_{1})}{λ_{1} {\hat{x}}_{1}}) .

The corresponding trajectory has the following form:

\hat{x} (t) = \{\begin{matrix} x_{0} - \frac{(x_{0} - {\hat{x}}_{1}) (1 - e^{- 2 λ_{1} t})}{1 - e^{- 2 λ_{1} {\hat{t}}_{1}}}, & t \in [0, {\hat{t}}_{1}), \\ {\hat{x}}_{1} e^{2 λ_{2} ({\hat{t}}_{1} - t)}, & t \in [{\hat{t}}_{1}, \infty) . \end{matrix}

(22)

To find the optimal estimate of

x_{1}

, which minimizes the worst case possible loss in accordance with (16), we consider the minimax problem:

\inf_{{\hat{x}}_{1} \in [0, x_{0}]} \sup_{x_{1} \in [0, x_{0}]} (\sum_{i \in N} J_{i} (x_{0}, \bar{u}) - \sum_{i \in N} J_{i} (x_{0}, \hat{u})) .

(23)

First, consider the maximization problem, which can be reformulated as

\begin{matrix} \sup_{x_{1}} (\sum_{i \in N} J_{i} (x_{0}, \bar{u}) - \sum_{i \in N} J_{i} (x_{0}, \hat{u})) = \\ \max \{\sup_{x_{1} < {\hat{x}}_{1}} (\sum_{i \in N} J_{i} (x_{0}, \bar{u}) - \sum_{i \in N} J_{i} (x_{0}, \hat{u})), \sup_{x_{1} > {\hat{x}}_{1}} (\sum_{i \in N} J_{i} (x_{0}, \bar{u}) - \sum_{i \in N} J_{i} (x_{0}, \hat{u}))\} . \end{matrix}

(24)

When ${\hat{x}}_{1} < x_{1}$ , we have $t_{1} < {\hat{t}}_{1}$ , since the resource diminishes over time, where $t_{1}$ is the switching time of the composite distribution function.
The value of $t_{1}$ could be obtained from Equation $\hat{x} (t_{1}) = x_{1}$ , i.e.,

$x_{0} - \frac{(x_{0} - {\hat{x}}_{1}) (1 - e^{- 2 λ_{1} t_{1}})}{1 - e^{- 2 λ_{1} {\hat{t}}_{1}}} = x_{1} .$

We can get

$t_{1} = \frac{1}{2 λ_{1}} \ln \frac{(λ_{1} - λ_{2}) {\hat{x}}_{1} + λ_{2} x_{0}}{(λ_{1} - λ_{2}) {\hat{x}}_{1} + λ_{2} x_{1}}$

and then the total payoff is

$\begin{matrix} J_{I_{1}} (x_{0}, \hat{u}) = \sum_{i \in N} J_{i} (x_{0}, \hat{u}) \\ = \sum_{i = 1}^{n} (\int_{0}^{t_{1}} e^{- λ_{1} t} \sqrt{{\hat{u}}_{i 1}} d t + \int_{t_{1}}^{{\hat{t}}_{1}} e^{- λ_{2} t} e^{t_{1} (λ_{2} - λ_{1})} \sqrt{{\hat{u}}_{i 1}} d t + \int_{{\hat{t}}_{1}}^{\infty} e^{- λ_{2} t} e^{t_{1} (λ_{2} - λ_{1})} \sqrt{{\hat{u}}_{i 2}} d t) \\ = \sqrt{\frac{n m}{2 λ_{1} ((1 - m) {\hat{x}}_{1} + m x_{0})}} (x_{0} + \frac{(1 - m) x_{1}}{m + 1} + \frac{2 (1 - m) {\hat{x}}_{1}}{m (m + 1)} \\ + \frac{(m - 1) {\hat{x}}_{1}^{\frac{m + 1}{2}}}{m (m + 1) {((1 - m) {\hat{x}}_{1} + m x_{1})}^{\frac{m - 1}{2}}}), \end{matrix}$

(25)

Here, $m = \frac{λ_{2}}{λ_{1}}$ . We limit our analysis to the case of $m < 1$ .
Denote $S_{1} (x_{1}, {\hat{x}}_{1}) = J (x_{0}, \bar{u}) - J_{I_{1}} (x_{0}, \hat{u})$ . Note that

$\max_{x_{1} : x_{1} > {\hat{x}}_{1}} S_{1} (x_{1}, {\hat{x}}_{1}) = S (x_{0}, {\hat{x}}_{1}),$

since function $S_{1} (x_{1}, {\hat{x}}_{1})$ is increasing over $x_{1}$ when $m < 1$ .
When ${\hat{x}}_{1} > x_{1}$ , we have $t_{1} > {\hat{t}}_{1}$ , where $t_{1}$ could be obtained from Equation $\hat{x} (t_{1}) = x_{1}$ , i.e., ${\hat{x}}_{1} e^{2 λ_{2} ({\hat{t}}_{1} - t_{1})} = x_{1}$ . We can obtain

$t_{1} = {\hat{t}}_{1} + \frac{1}{2 λ_{2}} l n \frac{{\hat{x}}_{1}}{x_{1}} .$

Then

$\begin{matrix} J_{I_{2}} (x_{0}, \hat{u}) = \sum_{i \in N} J_{i} (x_{0}, \hat{u}) \\ = \int_{0}^{{\hat{t}}_{1}} e^{- λ_{1} t} \sum_{i = 1}^{n} \sqrt{{\hat{u}}_{i 1}} d t + \int_{{\hat{t}}_{1}}^{t_{1}} e^{- λ_{1} t} \sum_{i = 1}^{n} \sqrt{{\hat{u}}_{i 2}} d t + \int_{t_{1}}^{\infty} e^{- λ_{2} t} e^{t_{1} (λ_{2} - λ_{1})} \sum_{i = 1}^{n} \sqrt{{\hat{u}}_{i 2}} d t \\ = \sqrt{\frac{n m}{2 λ_{1} ((1 - m) {\hat{x}}_{1} + m x_{0})}} (x_{0} + \frac{(1 - m) {\hat{x}}_{1}}{m + 1} + \frac{(1 - m) {\hat{x}}_{1}^{\frac{m - 1}{2 m}} x_{1}^{\frac{m + 1}{2 m}}}{m (m + 1)}) . \end{matrix}$

(26)

Denote $S_{2} (x_{1}, {\hat{x}}_{1}) = J (x_{0}, \bar{u}) - J_{I_{2}} (x_{0}, \hat{u})$ . In order to find $\max_{x_{1} : x_{1} < {\hat{x}}_{1}} S_{2} (x_{1}, {\hat{x}}_{1})$ , we need to solve $\frac{\partial S_{2} (x_{1}, {\hat{x}}_{1})}{\partial x_{1}} = 0$ , where $\frac{\partial S_{2} (x_{1}, {\hat{x}}_{1})}{\partial x_{1}}$ changes the sign from positive to negative at this point. Let ${\tilde{x}}_{1} = \arg \max_{x_{1} : x_{1} < {\hat{x}}_{1}} S_{2} (x_{1}, {\hat{x}}_{1})$ . Then, ${\tilde{x}}_{1}$ is the root of the following equation:

$(1 - m) x_{1}^{\frac{1}{m}} + m x_{0} x_{1}^{\frac{1}{m} - 1} - m^{2} {\hat{x}}_{1}^{\frac{1 - m}{m}} ((1 - m) {\hat{x}}_{1} + m x_{0}) = 0 .$

For $m = \frac{1}{2}$ this equation takes the following form:

$4 x_{1}^{2} + 4 x_{0} x_{1} - {\hat{x}}_{1} ({\hat{x}}_{1} + x_{0}) = 0,$

and

${\tilde{x}}_{1} = \frac{- x_{0} + \sqrt{x_{0}^{2} + {\hat{x}}_{1} ({\hat{x}}_{1} + x_{0})}}{2} .$

Now, problem (23) could be rewritten as follows:

\inf_{{\hat{x}}_{1} \in [0, x_{0}]} \sup_{x_{1} \in [0, x_{0}]} (\sum_{i \in N} J_{i} (x_{0}, \bar{u}) - \sum_{i \in N} J_{i} (x_{0}, \hat{u})) = \inf_{{\hat{x}}_{1} \in [0, x_{0}]} \max {S_{1} (x_{0}, {\hat{x}}_{1}), S_{2} ({\tilde{x}}_{1}, {\hat{x}}_{1})} .

Let us demonstrate the numerical solution of this problem with different values of m.

An illustration of the solution for

m = \frac{1}{2}

is provided in Figure 3. We assume

x_{0} = 10

,

λ_{1} = 0.2

,

n = 10

for the case shown on the left side of Figure 3. And,

x_{0} = 20

,

λ_{1} = 0.5

,

n = 5

for the right side of Figure 3. Interestingly, a rather general result is obtained that does not depend on the values of the parameters n and

λ_{1}

. In all cases, for

m = \frac{1}{2}

, the optimal estimate will be

{\hat{x}}_{1}^{*} \approx 0.528 x_{0}

.

Let’s continue with the parameters

x_{0} = 10

,

λ_{1} = 0.2

,

n = 10

. The left side of Figure 4 shows the solution for

m = \frac{1}{4}

. In this case,

{\hat{x}}_{1}^{*} \approx 0.434 x_{0}

. For

m = \frac{1}{5}

, we showed that

{\hat{x}}_{1}^{*} \approx 0.409 x_{0}

. This is shown in the right side of Figure 4.

Comparing the optimal estimation in the time-dependent and state-dependent cases, we can see that in the first case, the optimal estimation depends only on the values of parameters m and

λ_{1}

, whereas in the second case, it also depends on

x_{0}

.

8. Example: Oil Extraction Field with Equipment Modernization

Consider three companies (players) operating symmetrically in a shared oil field with an initial stock of 100 million tons (i.e.,

x_{0} = 10

). These companies are aware that new technologies are being developed which have the potential to improve extraction processes and reduce the risk of catastrophic accidents. However, the exact timing of the implementation of these innovations remains uncertain.

Before this modernization (

t < t_{1}

), the equipment is older and more prone to failures, leading to a higher risk of a catastrophic accident that would terminate extraction operations. After modernization (

t \geq t_{1}

), the new, more reliable equipment significantly reduces this risk. Consequently, the hazard rate

λ_{1}

before modernization is higher than the hazard rate afterwards

λ_{2}

.

The technical efficiency parameters are set to be equal for all players due to symmetry:

k_{1} = k_{2} = k_{3} = 1.0

. The output elasticity is set to

μ = 0.5

. The hazard rates are configured as

λ_{1} = 0.2

for the high-risk period before modernization and

λ_{2} = 0.05

for the lower-risk period after modernization.

The central question for the companies is to determine optimal extraction strategies under this operational risk uncertainty.

1. Time-dependent switching estimation

The optimal estimate for the switching moment (modernization time),

{\hat{t}}_{1}^{*}

, is derived from the model (substituting the parameter values

p = 0.347, λ_{1} = 0.2

)

{\hat{t}}_{1}^{*} = - \frac{\ln p}{λ_{1}} \approx 5.29 years .

2. State-dependent switching estimation

The optimal estimated switching moment occurs when the resource level reaches

{\hat{x}}_{1}^{*} \approx 0.434 x_{0} \approx 4.34 million tons .

Despite the planned modernization being 10 years away, the optimal robust strategy derived from our model suggests that the companies should behave as if the transition to the lower-risk environment will occur in approximately 5.29 years (time-dependent) or when the oil reserve depletes to about 43.4 million tons (state-dependent). This result indicates that optimal strategy requires adopting more conservative extraction measures in the near term to mitigate the higher operational risks associated with the older equipment, rather than waiting for the scheduled modernization.

This example demonstrates how our model provides practical insights for resource extraction industries facing operational risk uncertainties, enabling more informed decision-making for equipment upgrade scheduling and risk management strategies.

9. Conclusions

A model of multi-player non-renewable resource extraction with a random duration is considered. It is assumed that the distribution of the random duration of the game is composite. The conditions under which the cooperative solution is also a Nash equilibrium are obtained. For the case where players do not know the exact moment of switching of the distribution function, the optimal estimate of this unknown moment is obtained for both the time-dependent and state-dependent cases.

Furthermore, this research offers practical insights for managing non-renewable resources, suggesting that the stability of extraction coalitions depends on the balance of technical efficiencies among players and hazard rates, and provides a robust tool for estimating uncertain future policy or market shifts.

However, this study has limitations, as our model relies on specific assumptions regarding the functional form of the composite distribution and player preferences, which may not fully capture the complexity of real-world scenarios. While providing a foundational framework, these limitations also open several promising research avenues. Future work should extend beyond our assumptions by investigating more general distribution forms for the random duration and by incorporating asymmetric information and heterogeneous among players, particularly use a Bayesian Nash equilibrium approach. Other promising directions include endogenizing the switching mechanism itself and exploring more complex preference structures.

Author Contributions

Methodology, P.Y. and A.T.; investigation, Y.W.; writing—original draft preparation, P.Y.; writing—review and editing, A.T. All authors have read and agreed to the published version of the manuscript.

Funding

The work of the second author (Anna Tur) was supported by the Russian Science Foundation grant number 24-21-00302, https://rscf.ru/en/project/24-21-00302/ (accessed on 21 September 2025).

Data Availability Statement

No new data were created or analyzed in this study.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Table A1 presents a comparative summary of our study alongside the main relevant literature.

Table A1. Comparison of this study with relevant literature.

	Duration	Switching	Uncertainty
Petrosjan and Shevkoplyas (2003), Shevkoplyas (2014), Shevkoplyas and Kostyunin (2013)	Random	×	terminal time
Gromov and Gromova (2014), Zaremba et al. (2020), Wu et al. (2023, 2025a)	Random	switching of distribution function	terminal time
E. Gromova and Tur (2017)	Random	switching of distribution function	terminal time and initial time
Balas and Tur (2023)	Random	switching of distribution function, different switching rules	terminal time
Wu et al. (2025b)	Random	switching of distribution function, regime shift	terminal time
Gromov and Gromova (2017)	Infinite	different switching rules	×
Stuermer and Schwerhoff (2015)	Infinite	technological change	×
Parilina et al. (2024)	Infinite	switching of variables, reputation, emission	×
Van der Ploeg (2024)	Infinite	regime switches	regime switch time
E. V. Gromova and López-Barrientos (2016)	Infinite	changes in the number of players and game model	initial time
Huang (2024)	Finite	switching of distribution function	time of the completion of the project
Ye et al. (2024)	Finite	utility function switching	switching moment
This study	Random	switching of distribution function, different switching rules	terminal time and switching moment

Appendix B

In this appendix we are going to prove that if

λ_{1} > λ_{2}

and

0 \leq t_{1} < {\hat{t}}_{1}

, then

\frac{\sqrt{e^{- 2 λ_{1} {\hat{t}}_{1}} (λ_{1} - λ_{2}) + λ_{2}}}{\sqrt{e^{- 2 λ_{1} t_{1}} (λ_{1} - λ_{2}) + λ_{2}}} > \frac{(λ_{1} - λ_{2}) e^{(λ_{1} + λ_{2}) (t_{1} - {\hat{t}}_{1})} + 2 λ_{2}}{(λ_{1} + λ_{2})} .

(A1)

Denote

g_{1} (t_{1}, {\hat{t}}_{1}) = \frac{\sqrt{e^{- 2 λ_{1} {\hat{t}}_{1}} (λ_{1} - λ_{2}) + λ_{2}}}{\sqrt{e^{- 2 λ_{1} t_{1}} (λ_{1} - λ_{2}) + λ_{2}}}

,

g_{2} (t_{1}, {\hat{t}}_{1}) = \frac{(λ_{1} - λ_{2}) e^{(λ_{1} + λ_{2}) (t_{1} - {\hat{t}}_{1})} + 2 λ_{2}}{(λ_{1} + λ_{2})}

.

Note that if

λ_{1} > λ_{2}

and

0 \leq t_{1} < {\hat{t}}_{1}

, then

g_{1} (t_{1}, {\hat{t}}_{1})

is a concave-down increasing function of

t_{1}

, since

\frac{\partial g_{1} (t_{1}, {\hat{t}}_{1})}{\partial t_{1}} > 0

and

\frac{\partial^{2} g_{1} (t_{1}, {\hat{t}}_{1})}{\partial t_{1}^{2}} < 0

. In contrast,

g_{2} (t_{1}, {\hat{t}}_{1})

is a concave-up increasing function of

t_{1}

, since

\frac{\partial g_{1} (t_{1}, {\hat{t}}_{1})}{\partial t_{1}} > 0

and

\frac{\partial^{2} g_{1} (t_{1}, {\hat{t}}_{1})}{\partial t_{1}^{2}} > 0

. It can also be observed that

g_{1} ({\hat{t}}_{1}, {\hat{t}}_{1}) = g_{2} ({\hat{t}}_{1}, {\hat{t}}_{1})

. Figure A1 is an illustration of the behavior of the functions

g_{1} (t_{1}, {\hat{t}}_{1})

and

g_{2} (t_{1}, {\hat{t}}_{1})

.

Figure A1. Illustration of the behavior of

g_{1} (t_{1}, {\hat{t}}_{1})

and

g_{2} (t_{1}, {\hat{t}}_{1})

.

Figure A1. Illustration of the behavior of

g_{1} (t_{1}, {\hat{t}}_{1})

and

g_{2} (t_{1}, {\hat{t}}_{1})

.

Thus, to prove the inequality (A1), it is sufficient to prove

g_{1} (0, {\hat{t}}_{1}) > g_{2} (0, {\hat{t}}_{1})

.

To achieve this, consider the following difference:

g_{1}^{2} (0, {\hat{t}}_{1}) - g_{2}^{2} (0, {\hat{t}}_{1}) = \frac{e^{- 2 λ_{1} {\hat{t}}_{1}} (λ_{1} - λ_{2}) + λ_{2}}{λ_{1}} - \frac{{((λ_{1} - λ_{2}) e^{- (λ_{1} + λ_{2}) {\hat{t}}_{1}} + 2 λ_{2})}^{2}}{{(λ_{1} + λ_{2})}^{2}},

and its first partial derivative

\frac{\partial (g_{1}^{2} (0, {\hat{t}}_{1}) - g_{2}^{2} (0, {\hat{t}}_{1}))}{\partial {\hat{t}}_{1}} = \frac{2 (λ_{1} - λ_{2}) e^{- 2 λ_{1} {\hat{t}}_{1}}}{λ_{1} + λ_{2}} ((λ_{1} - λ_{2}) e^{- 2 λ_{2} {\hat{t}}_{1}} + 2 λ_{2} e^{(λ_{1} - λ_{2}) {\hat{t}}_{1}} - λ_{1} - λ_{2}) .

Let us define

s ({\hat{t}}_{1}) = (λ_{1} - λ_{2}) e^{- 2 λ_{2} {\hat{t}}_{1}} + 2 λ_{2} e^{(λ_{1} - λ_{2}) {\hat{t}}_{1}} - λ_{1} - λ_{2}

. Note that

\frac{\partial s ({\hat{t}}_{1})}{\partial {\hat{t}}_{1}} = 2 λ_{2} (λ_{1} - λ_{2}) (e^{(λ_{1} - λ_{2}) {\hat{t}}_{1}} - e^{- 2 λ_{2} {\hat{t}}_{1}}) > 0

This means that

s ({\hat{t}}_{1})

is an increasing function of

{\hat{t}}_{1}

. Moreover, since

s (0) = 0

, it follows that

s ({\hat{t}}_{1}) > 0

for

{\hat{t}}_{1} > 0

.

Then

\frac{\partial (g_{1}^{2} (0, {\hat{t}}_{1}) - g_{2}^{2} (0, {\hat{t}}_{1}))}{\partial {\hat{t}}_{1}} > 0

if

{\hat{t}}_{1} > 0

, which means that

g_{1}^{2} (0, {\hat{t}}_{1}) - g_{2}^{2} (0, {\hat{t}}_{1})

is an increasing function of

{\hat{t}}_{1}

. Additionally, since

g_{1}^{2} (0, 0) - g_{2}^{2} (0, 0) = 0

, it follows that

g_{1}^{2} (0, {\hat{t}}_{1}) - g_{2}^{2} (0, {\hat{t}}_{1}) > 0

if

{\hat{t}}_{1} > 0

. From this we can directly conclude that

g_{1} (0, {\hat{t}}_{1}) > g_{2} (0, {\hat{t}}_{1})

. This concludes the proof of the inequality (A1).

Appendix C

In this appendix we are going to prove that if

λ_{1} > λ_{2}

and

t_{1} > {\hat{t}}_{1}

, then

\frac{\sqrt{e^{- 2 λ_{1} {\hat{t}}_{1}} (λ_{1} - λ_{2}) + λ_{2}}}{\sqrt{e^{- 2 λ_{1} t_{1}} (λ_{1} - λ_{2}) + λ_{2}}} < e^{(λ_{2} - λ_{1}) ({\hat{t}}_{1} - t_{1})} .

(A2)

For simplicity, rather than directly proving inequality (A2),we will instead demonstrate the equivalent inequality:

\frac{e^{- 2 λ_{1} {\hat{t}}_{1}} (λ_{1} - λ_{2}) + λ_{2}}{e^{- 2 λ_{1} t_{1}} (λ_{1} - λ_{2}) + λ_{2}} < e^{2 (λ_{2} - λ_{1}) ({\hat{t}}_{1} - t_{1})} .

Consider the difference:

\begin{matrix} \frac{e^{- 2 λ_{1} {\hat{t}}_{1}} (λ_{1} - λ_{2}) + λ_{2}}{e^{- 2 λ_{1} t_{1}} (λ_{1} - λ_{2}) + λ_{2}} - e^{2 (λ_{2} - λ_{1}) ({\hat{t}}_{1} - t_{1})} = \\ \frac{e^{- 2 λ_{1} {\hat{t}}_{1}} (λ_{1} - λ_{2}) + λ_{2} - λ_{2} e^{2 (λ_{2} - λ_{1}) ({\hat{t}}_{1} - t_{1})} - e^{- 2 λ_{1} {\hat{t}}_{1}} e^{2 λ_{2} ({\hat{t}}_{1} - t_{1})} (λ_{1} - λ_{2})}{e^{- 2 λ_{1} t_{1}} (λ_{1} - λ_{2}) + λ_{2}} . \end{matrix}

Let

y (t_{1}, {\hat{t}}_{1})

represent the numerator of this fraction, i.e.,

y (t_{1}, {\hat{t}}_{1}) = e^{- 2 λ_{1} {\hat{t}}_{1}} (λ_{1} - λ_{2}) + λ_{2} - λ_{2} e^{2 (λ_{2} - λ_{1}) ({\hat{t}}_{1} - t_{1})} - e^{- 2 λ_{1} {\hat{t}}_{1}} e^{2 λ_{2} ({\hat{t}}_{1} - t_{1})} (λ_{1} - λ_{2}) .

The derivative of

y (t_{1}, {\hat{t}}_{1})

with respect to

t_{1}

is given by

\frac{\partial y (t_{1}, {\hat{t}}_{1})}{\partial t_{1}} = 2 λ_{2} (λ_{1} - λ_{2}) e^{- 2 λ_{1} {\hat{t}}_{1}} e^{2 λ_{2} ({\hat{t}}_{1} - t_{1})} (1 - e^{2 λ_{1} t_{1}}) .

It is evident that

\frac{\partial y (t_{1}, {\hat{t}}_{1})}{\partial t_{1}} \leq 0

when

λ_{1} > λ_{2}

, with equality occurring only when

t_{1} = 0

. This implies that

y (t_{1}, {\hat{t}}_{1})

is a decreasing function of

t_{1}

. Since

y ({\hat{t}}_{1}, {\hat{t}}_{1}) = 0

, we can conclude that

y (t_{1}, {\hat{t}}_{1}) < 0

if

t_{1} > {\hat{t}}_{1}

. Consequently, we have:

\frac{e^{- 2 λ_{1} {\hat{t}}_{1}} (λ_{1} - λ_{2}) + λ_{2}}{e^{- 2 λ_{1} t_{1}} (λ_{1} - λ_{2}) + λ_{2}} - e^{2 (λ_{2} - λ_{1}) ({\hat{t}}_{1} - t_{1})} < 0,

which proves the inequality (A2).

Appendix D

In this appendix we are going to prove that if

λ_{1} > λ_{2}

, then

L_{1} ({\hat{t}}_{1})

is an increasing function, here

L_{1} ({\hat{t}}_{1}) = \sqrt{\frac{n x_{0}}{2 λ_{2}}} (1 - \frac{1}{λ_{1} + λ_{2}} \sqrt{\frac{λ_{1}}{λ_{1} + λ_{2} (e^{2 λ_{1} {\hat{t}}_{1}} - 1)}} (2 λ_{2} e^{λ_{1} {\hat{t}}_{1}} + e^{- λ_{2} {\hat{t}}_{1}} (λ_{1} - λ_{2}))) .

Its derivative with respect to

{\hat{t}}_{1}

is

\frac{\partial L_{1} ({\hat{t}}_{1})}{\partial {\hat{t}}_{1}} = - \frac{\sqrt{n x_{0} λ_{1} λ_{2}} (λ_{1} - λ_{2}) e^{- λ_{2} {\hat{t}}_{1}}}{\sqrt{2} (λ_{1} + λ_{2}) {(λ_{1} + λ_{2} (e^{2 λ_{1} {\hat{t}}_{1}} - 1))}^{\frac{3}{2}}} l_{1} ({\hat{t}}_{1}),

where

l_{1} ({\hat{t}}_{1}) = 2 λ_{1} e^{(λ_{1} + λ_{2}) {\hat{t}}_{1}} - (λ_{1} + λ_{2}) e^{2 λ_{1} {\hat{t}}_{1}} + λ_{2} - λ_{1}

.

Note that

l_{1} (0) = 0

and

\frac{\partial l_{1} ({\hat{t}}_{1})}{\partial {\hat{t}}_{1}} = 2 λ_{1} (λ_{1} + λ_{2}) (e^{(λ_{1} + λ_{2}) {\hat{t}}_{1}} - e^{2 λ_{1} {\hat{t}}_{1}}) < 0

when

{\hat{t}}_{1} > 0

, since

λ_{1} > λ_{2}

. It follows that

l_{1} ({\hat{t}}_{1}) < 0

and

\frac{\partial L_{1} ({\hat{t}}_{1})}{\partial {\hat{t}}_{1}} > 0

when

{\hat{t}}_{1} > 0

and

λ_{1} > λ_{2}

. Then we can conclude that

L_{1} ({\hat{t}}_{1})

is an increasing function of

{\hat{t}}_{1}

. In the same way, it can be proved that

L_{2} ({\hat{t}}_{1})

is a decreasing function of

{\hat{t}}_{1}

.

References

Balas, T., & Tur, A. (2023). The Hamilton–Jacobi–Bellman equation for differential games with composite distribution of random time horizon. Mathematics, 11(2), 462. [Google Scholar] [CrossRef]
Boukas, E. K., Haurie, A., & Michel, P. (1990). An optimal control problem with a random stopping time. Journal of Optimization Theory and Applications, 64(3), 471–480. [Google Scholar] [CrossRef]
Dockner, E. (2000). Differential games in economics and management science. Cambridge University Press. [Google Scholar]
Epaulard, A., & Pommeret, A. (1998). Does uncertainty lead to a more conservative use of a non renewable resource? A recursive utility approach. Journées de l’AFSE sur Économie de l’Environnement et des Ressources Naturelles, 11–12. Available online: https://www.researchgate.net/publication/228912969_Does_uncertainty_lead_to_a_more_conservative_use_of_a_non_renewable_resource_A_recursive_utility_approach (accessed on 21 September 2025).
Gromov, D., & Gromova, E. (2014). Differential games with random duration: A hybrid systems formulation. Contributions to Game Theory and Management, 7, 104–119. [Google Scholar]
Gromov, D., & Gromova, E. (2017). On a class of hybrid differential games. Dynamic Games and Applications, 7(2), 266–288. [Google Scholar] [CrossRef]
Gromova, E., & Tur, A. (2017, October 26–28). On the form of integral payoff in differential games with random duration. 2017 XXVI International Conference on Information, Communication and Automation Technologies (ICAT) (pp. 1–6), Sarajevo, Bosnia and Herzegovina. [Google Scholar]
Gromova, E. V., & López-Barrientos, J. D. (2016). A differential game model for the extraction of nonrenewable resources with random initial times—The cooperative and competitive cases. International Game Theory Review, 18(2), 1640004. [Google Scholar] [CrossRef]
Huang, X. (2024). Differential games of R&D competition with switching dynamics. Contributions to Game Theory and Management, 17, 38–50. [Google Scholar]
Isaacs, R. (1999). Differential games: A mathematical theory with applications to warfare and pursuit, control and optimization. Courier Corporation. [Google Scholar]
Parilina, E., Yao, F., & Zaccour, G. (2024). Pricing and investment in manufacturing and logistics when environmental reputation matters. Transportation Research Part E: Logistics and Transportation Review, 184, 103468. [Google Scholar] [CrossRef]
Petrosjan, L. A., & Mursov, N. V. (1966). Game theoretical problems in mechanics. Lithuanian Mathematical Journal, 6(3), 423–433. [Google Scholar] [CrossRef]
Petrosjan, L. A., & Shevkoplyas, E. V. (2003). Cooperative solution for games with random duration. Game Theory and Applications, 9, 125–139. [Google Scholar]
Pontryagin, L. S. (2018). Mathematical theory of optimal processes. Routledge. [Google Scholar]
Shevkoplyas, E. V. (2014). The Hamilton-Jacobi-Bellman equation for a class of differential games with random duration. Automation and Remote Control, 75, 959–970. [Google Scholar] [CrossRef]
Shevkoplyas, E. V., & Kostyunin, S. Y. (2013). A class of differential games with random terminal time. Game Theory and Applications, 16, 177–192. [Google Scholar]
Stuermer, M., & Schwerhoff, G. (2015). Non-renewable resources, extraction technology, and endogenous growth. FRB of Dallas Working Paper, No. 1506. Federal Reserve Bank of Dallas. [Google Scholar]
Van der Ploeg, F. (2024). Benefits of rent sharing in dynamic resource games. Dynamic Games and Applications, 14(1), 20–32. [Google Scholar] [CrossRef]
Wu, Y., Tur, A., & Wang, H. (2023). Sustainable optimal control for switched pollution-control problem with random duration. Entropy, 25(10), 1426. [Google Scholar] [CrossRef] [PubMed]
Wu, Y., Tur, A., & Ye, P. (2025a). Sustainable cooperation on the hybrid pollution-control game with heterogeneous players. arXiv, arXiv:2504.12059. [Google Scholar] [CrossRef]
Wu, Y., Tur, A., & Ye, P. (2025b). Sustainable solution for hybrid differential game with regime shifts and random duration. Nonlinear Analysis: Hybrid Systems, 55, 101553. [Google Scholar] [CrossRef]
Ye, P., Tur, A., & Wu, Y. (2024). On the estimation of the switching moment of utility functions in cooperative differential games. Kybernetes. [Google Scholar] [CrossRef]
Zaremba, A., Gromova, E., & Tur, A. (2020). A differential game with random time horizon and discontinuous distribution. Mathematics, 8(12), 2185. [Google Scholar] [CrossRef]

Figure 1. Illustration of Theorem 2.

Figure 2. Illustration of Table 2.

Figure 3. (a) Optimal solution for

m = \frac{1}{2}

,

x_{0} = 10

,

λ_{1} = 0.2

,

n = 10

; (b) Optimal solution for

m = \frac{1}{2}

,

x_{0} = 20

,

λ_{1} = 0.5

,

n = 5

.

Figure 3. (a) Optimal solution for

m = \frac{1}{2}

,

x_{0} = 10

,

λ_{1} = 0.2

,

n = 10

; (b) Optimal solution for

m = \frac{1}{2}

,

x_{0} = 20

,

λ_{1} = 0.5

,

n = 5

.

Figure 4. (a) Optimal solution for

m = \frac{1}{4}

; (b) Optimal solution for

m = \frac{1}{5}

.

Figure 4. (a) Optimal solution for

m = \frac{1}{4}

; (b) Optimal solution for

m = \frac{1}{5}

.

Table 1. Summary of model parameters.

Symbol	Description
n	Number of players
T	Random duration of the game
$F (t)$	Composite cumulative distribution function of T
$t_{1}$	Switching moment
$λ_{1}$	Hazard rate before time $t_{1}$
$λ_{2}$	Hazard rate after time $t_{1}$
$x (t)$	Resource stock at time t
$x_{0}$	Initial resource stock
$u_{i} (t)$	Extraction effort of player i at time t
$k_{i}$	Efficiency coefficient of player i
$μ$	Elasticity coefficient ( $0 < μ < 1$ )
${\hat{t}}_{1}$	Estimate of the switching moment $t_{1}$
${\hat{t}}_{1}^{*}$	Optimal estimate of the switching time $t_{1}$
m	Ratio of hazard rates ( $m = λ_{2} / λ_{1}$ )
$x_{1}$	Resource stock at time $t_{1}$ (critical stock level for switch)
${\hat{x}}_{1}$	Estimate of the resource stock $x_{1}$
${\hat{x}}_{1}^{*}$	Optimal estimate of the resource stock $x_{1}$

Table 2. Optimal estimates for different values of m.

m	$p^{*}$	${\hat{t}}_{1}^{*}$
0.5	0.388	$0.947 / λ_{1}$
1/3	0.364	$1.01 / λ_{1}$
0.25	0.347	$1.058 / λ_{1}$
0.1	0.296	$1.217 / λ_{1}$

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Ye, P.; Tur, A.; Wu, Y. Non-Renewable Resource Extraction Model with Uncertainties. Games 2025, 16, 52. https://doi.org/10.3390/g16050052

AMA Style

Ye P, Tur A, Wu Y. Non-Renewable Resource Extraction Model with Uncertainties. Games. 2025; 16(5):52. https://doi.org/10.3390/g16050052

Chicago/Turabian Style

Ye, Peichen, Anna Tur, and Yilun Wu. 2025. "Non-Renewable Resource Extraction Model with Uncertainties" Games 16, no. 5: 52. https://doi.org/10.3390/g16050052

APA Style

Ye, P., Tur, A., & Wu, Y. (2025). Non-Renewable Resource Extraction Model with Uncertainties. Games, 16(5), 52. https://doi.org/10.3390/g16050052

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Non-Renewable Resource Extraction Model with Uncertainties

Abstract

1. Introduction

2. Problem Statement

3. Nash Equilibrium

4. Time-Dependent Case

5. Optimal Estimate

6. State-Dependent Case

7. Information Uncertainty

8. Example: Oil Extraction Field with Equipment Modernization

9. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A

Appendix B

Appendix C

Appendix D

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI