Interception Domain Approach to Orbital Multi-Player “Encirclement-Capture” Games: Theoretical Foundations and Solutions

Xingchen Li; Xiao Zhou; Xiaodong Yu; Guangyu Zhao; Yidan Liu

doi:10.3390/aerospace12100875

Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China

* Author to whom correspondence should be addressed.

Aerospace 2025, 12(10), 875; https://doi.org/10.3390/aerospace12100875

This article belongs to the Special Issue Dynamics and Control of Space On-Orbit Operations

Abstract

In recent years, with the development of micro-satellite clusters and large-scale satellite constellations, the likelihood of multiple spacecraft engaging in orbital pursuit–evasion games has increased. This paper establishes a novel interception domain theory for planar orbital multi-player “encirclement-capture” differential games, and it proves the partitioned structure and classification properties of Nash equilibrium solutions. The main contributions of our study are the following: (1) Proposing the first rigorous definition of interception domains in orbital pursuit–evasion games, proving their convexity, developing computational methods for domain intersections, and establishing a complete classification of equilibrium for planar multi-pursuer interception games, which establishes a theoretical foundation for analyzing multi-spacecraft orbital pursuit–evasion games. (2) Analyzing Nash equilibrium properties for “encirclement-capture” differential games with two, three, or more pursuers, classifying degenerate/non-degenerate scenarios via spatial inclusion relationships. The equilibrium results indicate that as the number of pursuers increases, the game tends toward a degenerate scenario where the likelihood of redundant pursuers (whose actions do not affect the game outcome) rises.

Keywords: orbital pursuit–evasion game; differential game; multi-player game; encirclement-capture

1. Introduction

Spacecraft orbital games describe strategic interactions between adversarial spacecraft groups governed by orbital dynamics and operational constraints. The theoretical foundation for the differential game approach was established in the 1970s when Anderson and Grazier [1] pioneered the application of Isaacs’ differential game framework [2] to low Earth orbit scenarios, formulating the first orbital pursuit–evasion differential game model for spacecraft employing continuous low-thrust propulsion systems. With the growing significance of the space domain, orbital differential pursuit–evasion game modeling has emerged as a critical research focus [3]. Current studies encompass scenarios where both parties optimize their control strategies based on objectives such as relative distance, fuel consumption, and game duration.

In recent years, the advancement of micro-satellite clusters and large-scale satellite constellations (e.g., EAGLE) has significantly increased the occurrence of multi-spacecraft orbital games [4]. In such settings, engagements with only one or two pursuers constitute special cases. This trend has drawn growing attention to orbital pursuit–evasion problems involving multiple participants. A particularly important variant is the multi-player “encirclement-capture” game, in which multiple cooperative pursuers strategically coordinate to capture a single evader.

Studying such multi-pursuer scenarios is essential. Cooperative pursuers can leverage tactical advantages—such as coordinated maneuvering and spatial occlusion—to achieve interception more quickly and reliably than in single-pursuer engagements. Moreover, as the number of pursuers increases, new strategic dilemmas and equilibrium types emerge—reflecting qualitative, not merely quantitative, changes in game outcomes. The strategies and outcomes vary considerably depending on the number and configuration of pursuers. Therefore, a systematic classification of game scenarios is crucial both for theoretical understanding and practical application. These capabilities play a critical role in applications such as constellation configuration design and space situational awareness.

Within that context, this paper seeks to address the following research questions:

What are the optimal pursuit strategies for the pursuer group and the optimal evasion strategies for the evader?
Can all pursuers either intercept the evader or meaningfully influence the game outcome?
How does the number of pursuers affect interception performance and game dynamics?

Existing research on orbital pursuit–evasion games has primarily focused on one-to-one scenarios. For instance, Pontani and Conway [5] investigated interception scenarios between two remote spacecraft employing constant-thrust propulsion. Building upon this work, Shen and Casalino [3] extended the framework by incorporating constraints such as minimum orbital altitude requirements and time-varying spacecraft mass dynamics. Stupik et al. [6] examined games involving spacecraft satisfying the Clohessy–Wiltshire (CW) conditions, which were further studied by Zhang et al. [7] using deep neural network methodologies. The pursuit–evasion games under imperfect information have been analyzed using various approaches, including compensation control strategies [8], different sensor schemes [9], a two-stage game model [10], mode-matched smooth variable structure filters [11], and a parameter-optimized control method based on the receding horizon framework [12]. Li et al. [13] further expanded the model to account for J2 gravitational perturbations. The pursuit–evasion games with fuel consumption as the optimization objective have been covered by the studies of discrete thrust conditions [14], optimization of total velocity increments [15], and reachable domains with fixed velocity increments [16]. The pursuit–evasion games with sun angle constraints have been covered by the studies of inspection games [17] and observation and counter-observation games [18]. Separately, Jagat and Sinclair conducted studies on orbital rendezvous scenarios, analyzing both linear [19] and nonlinear [20] dynamic systems, with optimization objectives centered on minimizing relative distance and fuel expenditure during maneuvering phases. Furthermore, research on multi-spacecraft orbital games has largely concentrated on pursuit–evasion–defense tripartite games, where an evader deploys a defender to intercept the pursuer. 
This problem has roots in studies on missile–target–interceptor tripartite games [21]. Within this context, Li et al. [22] analyzed the control strategies and winning conditions of the pursuer and defender by solving the Nash equilibrium under the assumption that the evader cannot maneuver. Li [23] compared scenarios where the evader and defender adopted cooperative versus non-cooperative strategies. They found that cooperative strategies significantly reduce the risk of the evader being captured.

To the best of our knowledge, there are three existing studies on orbital multi-player “encirclement-capture” games, which were conducted by Sun et al. [24], Jansson and Harris [25], and Li et al. [4].

Sun et al. [24] employed differential game theory to investigate the specific case involving exactly two pursuers. They considered non-degenerate scenarios where both pursuers simultaneously intercept the evader and degenerate scenarios where one pursuer cannot influence the game outcome, providing corresponding degeneracy determination algorithms. However, they did not provide a systematic theoretical proof for this classification—for example, why there cannot be a non-degenerate scenario where only one pursuer achieves interception. As demonstrated by the results in this paper, there are essential differences between two-pursuer and three-or-more-pursuer games, which indicates that their approach has limitations, whereas the interception-domain-based methodology proposed in this paper offers greater universality for this problem. The part of our research on two-versus-one games essentially provides a theoretical verification and supplementary proof for their study, while also offering a scenario discrimination algorithm with lower computational dimensionality.

Jansson and Harris [25] proposed a geometric algorithm based on reachable sets for a general number of pursuers. Their algorithm first computes the time-evolving reachable sets for each spacecraft and then iterates over time to identify the moment and location where the evader’s reachable set first becomes fully covered by the pursuers’ collective reachable sets. While their reachable set approaches may compute different degenerate or non-degenerate cases, they cannot answer which scenarios are possible and which are impossible—a key research question addressed in this paper. Furthermore, our method is distributed and hierarchical: it first efficiently identifies active pursuers via low-dimensional calculations before solving for trajectories, making it particularly suitable for rapid decision-making in large-scale encounters.

Li et al. [4] derived cooperative capture strategies for multiple spacecraft using discrete impulses as the maneuvering method, along with corresponding evasion strategies, via an effective deep reinforcement learning algorithm designed for multi-player cooperative games. Unlike their work, which relies on discrete impulses and learning algorithms, this paper presents theoretical Nash equilibrium solutions under the assumption of continuous low-thrust maneuvers. Thus, our results are more applicable to low-thrust propulsion systems such as electric propulsion satellites. In addition, compared to results derived from deep reinforcement learning, the differential game-based solutions obtained in this study possess theoretical optimality guarantees and offer higher interpretability.

Additionally, some studies have explored general multi-player “encirclement-capture” games without orbital dynamics constraints. For instance, Jin and Qu [26] investigated the barriers of the game under scenarios where players can control both the magnitude and direction of their velocity in real time, while Chen et al. [27] studied the barriers when players can control their velocity magnitude and angular velocity magnitude of direction in real time. In contrast, to align with practical spacecraft maneuvering, the scenario considered in this paper is more complex. First, in our model, spacecraft can only control their thrust direction, not their velocity directly. Second, all spacecraft in our model remain subject to gravitational fields. As will be shown later, these differences significantly increase the complexity of solving the proposed model.

To fill the theoretical gap in orbital “encirclement-capture” games, particularly concerning the classification of scenarios involving more than two pursuers, this paper establishes a differential game model for such games. We develop a novel interception domain theory that yields a complete classification of equilibria for planar multi-pursuer interception games. The main contributions of our study are summarized as follows: (1) We propose and define the interception domain, which establishes a theoretical foundation for analyzing multi-spacecraft orbital pursuit–evasion games. (2) We model “encirclement-capture” differential games involving two, three, or more pursuers, determine their Nash equilibrium solutions, and prove the partitioned structure and classification properties of the solutions.

The remainder of this paper is organized as follows: Section 2 introduces the setup of the differential model for the “encirclement-capture” game and defines the problem to be solved. Section 3 defines and discusses the interception domain. Section 4 describes the derivation of the two-pursuer Nash equilibrium solution. Section 5 describes the derivation of the three-pursuer Nash equilibrium solution. Section 6 describes the derivation of the Nash equilibrium solution when there are more than three pursuers. Section 7 shows simulation examples. Section 8 concludes the paper.

2. Differential Game Model

Consider a game scenario involving multiple pursuers collaborating to intercept an evader. Upon initiation of the game, all players acquire real-time state information of other players, including positional coordinates, velocity vectors, and maneuvering acceleration magnitudes. Each player executes corresponding continuous control strategies optimized for their respective objectives. The game terminates when at least one pursuer achieves physical coincidence with the evader. During the game process, the pursuers aim to minimize mission completion time, while the evader seeks to maximize it.

The dynamics in this work are restricted to planar motion governed by CW dynamics, a simplification justified by the specific orbital environment under consideration—namely, geostationary orbit (GEO), which is the main scenario of orbital games, especially multi-spacecraft orbital games [28]. Satellites in GEO are required to perform strict north–south station-keeping maneuvers to maintain their orbital inclination within a very small range (typically within ±0.05° to ±0.1° [29]). This practice effectively suppresses out-of-plane motion, making the planar assumption a valid and widely adopted approximation for analyzing proximate orbital maneuvers in this regime [1,25]. This model allows us to derive fundamental insights into the multi-pursuer interception problem, which serves as a critical foundation for future extensions.

Denote $k$ as the number of pursuers. Let the position and velocity vectors (as shown in Figure 1) be defined in a rotating reference frame centered on a circular reference orbit:

x_{i} = [r_{i x}, r_{i y}, v_{i x}, v_{i y}]^{T}, \quad i \in \{E, P_{1}, P_{2}, \dots, P_{k}\},

(1)

where the subscripts $P_{1}, P_{2}, \dots, P_{k}$ indicate the variables of the $k$ pursuers, respectively, and the subscript $E$ indicates the variables of the evader. These vectors serve as the state variables of the differential game.

Figure 1. State variables of the game.

The maneuvering accelerations of the players are described by a thrust magnitude $T_{i}$ and a direction $(\cos β_{i}, \sin β_{i})^{T}$ for $i \in \{E, P_{1}, P_{2}, \dots, P_{k}\}$, where $β_{i}$ varies continuously over the interval $[0, 2 π)$ and $T_{i}$ is assumed to be constant for each player. This fixed-thrust-magnitude assumption is standard in time-optimal orbital pursuit–evasion games [3,5]. It is justified by the fact that for a linear system of differential equations, each player will always maintain their maximum thrust in the game equilibrium [2].

Building on prior research [8], let $w$ denote the angular velocity of the reference circular orbit. The dynamic equations for each spacecraft are expressed as follows:

\{\begin{matrix} \dot{r}_{i x} = v_{i x} \\ \dot{r}_{i y} = v_{i y} \\ \dot{v}_{i x} = 2 w v_{i y} + 3 w^{2} r_{i x} + T_{i} \cos β_{i} \\ \dot{v}_{i y} = - 2 w v_{i x} + T_{i} \sin β_{i} \end{matrix}

(2)
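As a minimal sketch of how the dynamics in (2) could be evaluated numerically for one spacecraft, the following Python snippet implements the right-hand side and a classical fourth-order Runge–Kutta step. The function names `cw_rhs` and `rk4_step`, the choice of integrator, and holding the thrust angle fixed over one step are illustrative assumptions, not part of the original formulation.

```python
import math

def cw_rhs(state, w, thrust, beta):
    """Right-hand side of (2) for one spacecraft.

    state  -- (r_x, r_y, v_x, v_y) in the rotating frame
    w      -- angular velocity of the reference orbit
    thrust -- constant thrust acceleration magnitude T_i
    beta   -- thrust direction angle beta_i in [0, 2*pi)
    """
    r_x, r_y, v_x, v_y = state
    return (
        v_x,
        v_y,
        2.0 * w * v_y + 3.0 * w**2 * r_x + thrust * math.cos(beta),
        -2.0 * w * v_x + thrust * math.sin(beta),
    )

def rk4_step(state, w, thrust, beta, dt):
    """Advance the state by dt with one RK4 step (beta held fixed, an assumption)."""
    def shifted(base, slope, h):
        return tuple(b + h * s for b, s in zip(base, slope))

    k1 = cw_rhs(state, w, thrust, beta)
    k2 = cw_rhs(shifted(state, k1, dt / 2.0), w, thrust, beta)
    k3 = cw_rhs(shifted(state, k2, dt / 2.0), w, thrust, beta)
    k4 = cw_rhs(shifted(state, k3, dt), w, thrust, beta)
    return tuple(
        s + dt / 6.0 * (a + 2.0 * b + 2.0 * c + d)
        for s, a, b, c, d in zip(state, k1, k2, k3, k4)
    )
```

A quick sanity check of such a sketch is the unforced drift solution of (2): with $T_{i} = 0$, $v_{i x} = 0$, and $v_{i y} = -1.5 w r_{i x}$, both acceleration components vanish, so the radial offset stays constant while the along-track coordinate changes linearly.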

Since $i \in \{E, P_{1}, P_{2}, \dots, P_{k}\}$, there are $4 (k + 1)$ equations in (2). Denote their right-hand sides as $f_{j}$, $j = 1, 2, \dots, 4 (k + 1)$. The decision variables of this game are the thrust direction angles of all players, which are functions of time, $β_{i} (t)$. Under a fixed initial state, the game duration $∆ t$ is a function of the decision variables alone. At a Nash equilibrium of the game, the decision variables satisfy the following conditions:

\{\begin{matrix} (β_{P_{1}}^{*}, β_{P_{2}}^{*}, \dots, β_{P_{k}}^{*}) = \arg\min_{β_{P_{1}}, β_{P_{2}}, \dots, β_{P_{k}}} [∆ t (β_{P_{1}}, β_{P_{2}}, \dots, β_{P_{k}}, β_{E}^{*})] \\ β_{E}^{*} = \arg\max_{β_{E}} [∆ t (β_{P_{1}}^{*}, β_{P_{2}}^{*}, \dots, β_{P_{k}}^{*}, β_{E})] \end{matrix}

(3)

Within the framework of optimal control theory, solving constrained optimization problems whose state trajectories obey dynamic equations requires the introduction of costate variables of the same dimension as the state. For this system, the costate variables are denoted by $λ_{i^{1}}, λ_{i^{2}}, λ_{i^{3}}, λ_{i^{4}}$, $i \in \{E, P_{1}, \dots, P_{k}\}$. Then, according to Isaacs’ differential game theory [2], the second form of the main equation is

\sum_{j = 1}^{4} (λ_{{P_{1}}^{j}} f_{j} + λ_{{P_{2}}^{j}} f_{j + 4} + \dots + λ_{{P_{k}}^{j}} f_{j + 4 (k - 1)} + λ_{E^{j}} f_{j + 4 k}) + G = 0,

(4)

where $G$ is the integrand of the time-accumulated objective function of the game. In this paper, $G \equiv 1$ since the objective function is the game duration. Guided by the Path Equation Theorem, after differentiating (4) with respect to each state variable, the dynamic equations of the costate variables under the Nash equilibrium can be obtained:

{\dot{λ}}_{i^{1}} = \sum_{j = 1}^{4} (λ_{{P_{1}}^{j}} \frac{\partial f_{j}}{\partial r_{i x}} + λ_{{P_{2}}^{j}} \frac{\partial f_{j + 4}}{\partial r_{i x}} + \dots + λ_{{P_{k}}^{j}} \frac{\partial f_{j + 4 (k - 1)}}{\partial r_{i x}} + λ_{E^{j}} \frac{\partial f_{j + 4 k}}{\partial r_{i x}}) = 3 w^{2} λ_{i^{3}}, {\dot{λ}}_{i^{2}} = \sum_{j = 1}^{4} (λ_{{P_{1}}^{j}} \frac{\partial f_{j}}{\partial r_{i y}} + λ_{{P_{2}}^{j}} \frac{\partial f_{j + 4}}{\partial r_{i y}} + \dots + λ_{{P_{k}}^{j}} \frac{\partial f_{j + 4 (k - 1)}}{\partial r_{i y}} + λ_{E^{j}} \frac{\partial f_{j + 4 k}}{\partial r_{i y}}) = 0, {\dot{λ}}_{i^{3}} = \sum_{j = 1}^{4} (λ_{{P_{1}}^{j}} \frac{\partial f_{j}}{\partial v_{i x}} + λ_{{P_{2}}^{j}} \frac{\partial f_{j + 4}}{\partial v_{i x}} + \dots + λ_{{P_{k}}^{j}} \frac{\partial f_{j + 4 (k - 1)}}{\partial v_{i x}} + λ_{E^{j}} \frac{\partial f_{j + 4 k}}{\partial v_{i x}}) = λ_{i^{1}} - 2 w λ_{i^{4}}, {\dot{λ}}_{i^{4}} = \sum_{j = 1}^{4} (λ_{{P_{1}}^{j}} \frac{\partial f_{j}}{\partial v_{i y}} + λ_{{P_{2}}^{j}} \frac{\partial f_{j + 4}}{\partial v_{i y}} + \dots + λ_{{P_{k}}^{j}} \frac{\partial f_{j + 4 (k - 1)}}{\partial v_{i y}} + λ_{E^{j}} \frac{\partial f_{j + 4 k}}{\partial v_{i y}}) = λ_{i^{2}} + 2 w λ_{i^{3}} .

(5)

Note that (5) is a linear system of differential equations, so the time histories of the costate variables are fully determined by their boundary values at one end. Next, according to the main equation theorem [2], the decision variables satisfy the following conditions:

\{\begin{matrix} (β_{P_{1}}^{*}, β_{P_{2}}^{*}, \dots, β_{P_{k}}^{*}) = \arg\min_{β_{P_{1}}, β_{P_{2}}, \dots, β_{P_{k}}} [\sum_{j = 1}^{4} (λ_{{P_{1}}^{j}} f_{j} + λ_{{P_{2}}^{j}} f_{j + 4} + \dots + λ_{{P_{k}}^{j}} f_{j + 4 (k - 1)} + λ_{E^{j}} f_{j + 4 k}) + 1] \\ β_{E}^{*} = \arg\max_{β_{E}} [\sum_{j = 1}^{4} (λ_{{P_{1}}^{j}} f_{j} + λ_{{P_{2}}^{j}} f_{j + 4} + \dots + λ_{{P_{k}}^{j}} f_{j + 4 (k - 1)} + λ_{E^{j}} f_{j + 4 k}) + 1] \end{matrix}

(6)

The above formulas are equivalent to

\{\begin{matrix} (\cos β_{i}^{*}, \sin β_{i}^{*}) = - \frac{(λ_{i^{3}}, λ_{i^{4}})}{\sqrt{λ_{i^{3}}^{2} + λ_{i^{4}}^{2}}}, \quad i \in \{P_{1}, P_{2}, \dots, P_{k}\} \\ (\cos β_{E}^{*}, \sin β_{E}^{*}) = \frac{(λ_{E^{3}}, λ_{E^{4}})}{\sqrt{λ_{E^{3}}^{2} + λ_{E^{4}}^{2}}} \end{matrix}

(7)
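The control law in (7) can be sketched in a few lines of Python: pursuers thrust against the velocity costates $(λ_{i^{3}}, λ_{i^{4}})$, while the evader thrusts along them. The function names `optimal_thrust_direction` and `thrust_angle` are hypothetical, and raising an error when both velocity costates vanish is our own handling of a case that (7) leaves undefined.

```python
import math

def optimal_thrust_direction(lam3, lam4, is_pursuer):
    """Unit thrust direction (cos beta*, sin beta*) from the velocity costates, per (7)."""
    norm = math.hypot(lam3, lam4)
    if norm == 0.0:
        # (7) divides by this norm, so the direction is undefined here.
        raise ValueError("thrust direction undefined when both velocity costates vanish")
    sign = -1.0 if is_pursuer else 1.0  # pursuers minimize, the evader maximizes
    return (sign * lam3 / norm, sign * lam4 / norm)

def thrust_angle(lam3, lam4, is_pursuer):
    """Thrust angle beta* mapped into [0, 2*pi)."""
    c, s = optimal_thrust_direction(lam3, lam4, is_pursuer)
    return math.atan2(s, c) % (2.0 * math.pi)
```

Because only the ratio of the two costates enters, multiplying both by the same positive factor leaves the commanded direction unchanged.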

The optimal direction angle from (7) is computed in real time and provides an inertial pointing command. The spacecraft’s attitude control system must align the thruster with this direction, followed by firing at maximum thrust. This reflects the standard guidance–actuation separation in spacecraft implementation.

By substituting (7) into (2), we obtain a linear system of differential equations governing the state variables. The resulting equations for the pursuers are as follows:

{\dot{r}}_{i x} = v_{i x}, {\dot{r}}_{i y} = v_{i y}, {\dot{v}}_{i x} = 2 w v_{i y} + 3 w^{2} r_{i x} - T_{i} \frac{λ_{i^{3}}}{\sqrt{λ_{i^{3}}^{2} + λ_{i^{4}}^{2}}}, {\dot{v}}_{i y} = - 2 w v_{i x} - T_{i} \frac{λ_{i^{4}}}{\sqrt{λ_{i^{3}}^{2} + λ_{i^{4}}^{2}}},

(8)

where $i \in \{P_{1}, \dots, P_{k}\}$. The resulting equations for the evader are as follows:

{\dot{r}}_{E x} = v_{E x}, {\dot{r}}_{E y} = v_{E y}, {\dot{v}}_{E x} = 2 w v_{E y} + 3 w^{2} r_{E x} + T_{E} \frac{λ_{E^{3}}}{\sqrt{λ_{E^{3}}^{2} + λ_{E^{4}}^{2}}}, {\dot{v}}_{E y} = - 2 w v_{E x} + T_{E} \frac{λ_{E^{4}}}{\sqrt{λ_{E^{3}}^{2} + λ_{E^{4}}^{2}}},

(9)

In general differential games, coupling (2) and (5) transforms the problem into a two-point boundary value problem (TPBVP). In this TPBVP, the initial states are known, and the terminal conditions are defined by the game’s termination region. However, in our problem, the termination region involves complex conditional criteria that prevent the direct application of the standard TPBVP approach. Due to this increased complexity, we instead conduct a geometric analysis of the equilibrium properties.

3. Interception Domain

Consider a scenario where a pursuer ($P$) attempts to intercept an evader ($E$) in minimum time, assuming $P$ possesses superior maneuvering capability compared to $E$. In this case, if the evader aims to maximize the interception time, the equilibrium for this game has been analyzed in prior studies [3,5]. Due to the uniqueness of the Nash equilibrium in this differential game, the evader’s trajectory from the start of the game until interception is deterministic. If $E$ deviates from this equilibrium strategy for any purpose, it will be intercepted by $P$ in less time and via a different trajectory. Evidently, the region reachable by $E$ before interception is finite. We define this region as the interception domain. The distinction between the concepts of the interception domain and the reachable set [25] is summarized in Table 1.

Table 1. Distinction between the concepts of the interception domain and the reachable set.

It is important to note that, distinct from the concept of the reachable set [25], the interception domain is defined by the pair of pursuer and evader, not by a single spacecraft.

We now present the equivalent mathematical definition of the interception domain. For any $θ \in [0, 2 π]$, consider the following differential pursuit–evasion game: $E$ aims to maximize the dot product between its position vector at interception $(r_{x}^{f}, r_{y}^{f})$ and a given unit vector $(\cos θ, \sin θ)$. Conversely, $P$ aims to minimize this value. This game can also be interpreted as a reach–avoid game [30] in which the target line is orthogonal to the vector $(\cos θ, \sin θ)$. The state and costate dynamics of this game correspond to the single-pursuer case given by (2) and (5) in this paper. The terminal region is the six-dimensional manifold where the position vectors of $P$ and $E$ coincide. Note that in this game, the objective function $H = \cos θ \, r_{e x}^{f} + \sin θ \, r_{e y}^{f}$ is not cumulative over time. By combining the differentiation of $H$ with respect to each element of the terminal region and the second form of the main equation, the terminal boundary conditions of the TPBVP can be derived as follows:

\{\begin{matrix} (r_{p x}^{f}, r_{p y}^{f}) = (r_{e x}^{f}, r_{e y}^{f}) \\ λ_{P^{1}} + λ_{E^{1}} = \cos θ \\ λ_{P^{2}} + λ_{E^{2}} = \sin θ \\ λ_{P^{1}} v_{p x} + λ_{P^{2}} v_{p y} + λ_{E^{1}} v_{e x} + λ_{E^{2}} v_{e y} = 0 \\ λ_{P^{3}} = λ_{P^{4}} = λ_{E^{3}} = λ_{E^{4}} = 0 \end{matrix}

(10)

The solution of the above TPBVP can be written as

s (θ) = (∆ t (θ), r_{x}^{f} (θ), r_{y}^{f} (θ), v_{p x}^{f} (θ), v_{p y}^{f} (θ), v_{e x}^{f} (θ), v_{e y}^{f} (θ), λ_{P^{1}}^{f} (θ), λ_{P^{2}}^{f} (θ), λ_{E^{1}}^{f} (θ), λ_{E^{2}}^{f} (θ)),

(11)

where the superscript $f$ denotes the value of the corresponding state or costate at termination.

Due to the smoothness of the solution, the point set $l = \{(r_{x}^{f} (θ), r_{y}^{f} (θ)) \mid θ \in [0, 2 π]\}$ forms a closed, smooth curve in the plane. Consider the region $d$ bounded by this curve. From the game formulation, it readily follows that at any point $(r_{x}^{f} (θ_{0}), r_{y}^{f} (θ_{0}))$ on $l$, the tangent line to the curve is the straight line passing through that point and orthogonal to the vector $(\cos θ_{0}, \sin θ_{0})$. Furthermore, all points within the region $d$ lie on the same side of this tangent line. This implies that the region $d$ is convex.
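When the boundary is only available as a list of numerically computed points, the convexity claim can be checked with a simple turning-direction test. The sketch below is an illustration under our own assumptions: the points are ordered along the curve, the curve does not cross itself, and the helper name `is_convex_closed_curve` is hypothetical. It is not the authors' procedure.

```python
def is_convex_closed_curve(points, tol=1e-12):
    """Return True if an ordered, simple closed polygonal curve is convex.

    points -- list of (x, y) samples along the boundary, without repeating
              the first point at the end. The curve is assumed not to
              self-intersect; the test only inspects turning directions.
    """
    n = len(points)
    if n < 3:
        return False
    turn_sign = 0
    for i in range(n):
        x0, y0 = points[i]
        x1, y1 = points[(i + 1) % n]
        x2, y2 = points[(i + 2) % n]
        # z-component of the cross product of two consecutive edges
        cross = (x1 - x0) * (y2 - y1) - (y1 - y0) * (x2 - x1)
        if abs(cross) <= tol:
            continue  # nearly collinear samples carry no turning information
        current = 1 if cross > 0 else -1
        if turn_sign == 0:
            turn_sign = current
        elif current != turn_sign:
            return False  # the curve turns both left and right
    return turn_sign != 0
```

In use, the samples would come from sweeping $θ$ over $[0, 2 π]$ and collecting the terminal positions of the solved boundary value problems; an ellipse sampled at uniformly spaced angles is a convenient stand-in for testing.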

Theorem 1. The region $d$ is the interception domain defined by $P$ and $E$.

Proof.

Note that the curve $l$ lies within the interception domain $d$.

We first prove that points outside $d$ do not belong to the interception domain. Suppose there exists a point $(x_{0}, y_{0})$ outside $d$ that the evader $E$ can reach before being intercepted by $P$. By the convexity of $d$, there must exist a boundary point $(r_{x}^{f} (θ_{0}), r_{y}^{f} (θ_{0}))$ such that $(x_{0}, y_{0})$ lies on the opposite side of the tangent line to $d$ at this point. According to the definition of $d$, $(r_{x}^{f} (θ_{0}), r_{y}^{f} (θ_{0}))$ attains the maximum dot product with the vector $(\cos θ_{0}, \sin θ_{0})$ that $E$ can achieve under interception. However, the dot product of $(x_{0}, y_{0})$ with $(\cos θ_{0}, \sin θ_{0})$ is strictly greater, leading to a contradiction.

Points inside $d$ clearly belong to the interception domain, though unlike boundary points, trajectories ending at interior points are not unique. For example, $E$ can wait for a period of time before maneuvering. □

We have now completed the definition of the interception domain and proven its convexity. As a concrete example, Figure 2 shows a numerical simulation result of an interception domain.

Figure 2. Numerical simulation result of an interception domain.

In addition, Appendix B explains that under the assumptions of negligible gravitational influence and zero initial velocity for both $P$ and $E$, the interception domain is an Apollonius circle containing $E$, scaled according to the ratio of the players’ accelerations (as shown in Figure 3).

Figure 3. Interception domain as an Apollonius circle.
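The Apollonius-circle limit lends itself to a short worked sketch. Appendix B is not reproduced here, so the relation used below is our own reading of the stated assumptions: without gravity and starting from rest, a player with constant acceleration $a$ covers a distance $D$ in time $\sqrt{2 D / a}$, so equal arrival times correspond to a distance ratio $|X - E| / |X - P| = a_{E} / a_{P}$. The function names are hypothetical.

```python
import math

def min_time_from_rest(distance, accel):
    """Minimum time to cover `distance` from rest at constant acceleration (no gravity)."""
    return math.sqrt(2.0 * distance / accel)

def apollonius_circle(evader, pursuer, a_e, a_p):
    """Center and radius of the circle {X : |X - E| = rho * |X - P|}, rho = a_e / a_p.

    Assumes the pursuer out-accelerates the evader (0 < a_e < a_p), so that
    the locus is a circle enclosing the evader's initial position.
    """
    if not 0.0 < a_e < a_p:
        raise ValueError("requires 0 < a_e < a_p")
    rho = a_e / a_p
    ex, ey = evader
    px, py = pursuer
    denom = 1.0 - rho**2
    center = ((ex - rho**2 * px) / denom, (ey - rho**2 * py) / denom)
    radius = rho * math.hypot(px - ex, py - ey) / denom
    return center, radius
```

For example, with the evader at the origin, the pursuer at $(3, 0)$, and accelerations $a_{E} = 1$ and $a_{P} = 2$ (arbitrary units), the circle is centered at $(-1, 0)$ with radius 2. The boundary point $(1, 0)$ lies at distance 1 from $E$ and 2 from $P$, and both players need the same time $\sqrt{2}$ to reach it.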

In practice, the study by Jansson and Harris [25] found that the relative initial velocity of satellites in geosynchronous orbits and the rotational angular velocity of the coordinate system have a negligible impact on the shape of the reachable set. The reachable set can be closely approximated as a circle centered at the spacecraft. Similarly, our calculations reveal that the interception domain of satellites in geosynchronous orbits is closely approximated by an Apollonius circle, as in Figure 3. Consequently, in subsequent analyses, we assume that the interception domains—or the interception domains relative to the reachable sets—intersect at no more than two points.

In what follows, we focus on the boundary $l$ of $d$, which has the following properties.

Proposition 1. For any point $(r_{x}^{f} (θ_{0}), r_{y}^{f} (θ_{0}))$ on the curve $l$, the trajectories (i.e., the time histories of the state variables) along which $E$ and $P$ simultaneously reach this point, as determined by (8) and (9), are also the minimum-time trajectories for $E$ and $P$ to reach $(r_{x}^{f} (θ_{0}), r_{y}^{f} (θ_{0}))$ individually.

Proof.

Consider the minimum-time trajectory problem for $E$ to reach the point $(r_{x}^{f} (θ_{0}), r_{y}^{f} (θ_{0}))$. Following the same methodology as in Section 2, we can formulate the state and costate dynamics equations. It can be readily shown that the terminal boundary conditions of its TPBVP are as follows:

\{\begin{matrix} (r_{e x}^{f}, r_{e y}^{f}) = (r_{x}^{f} (θ_{0}), r_{y}^{f} (θ_{0})) \\ λ_{E^{1}} v_{e x} + λ_{E^{2}} v_{e y} + 1 = 0 \\ λ_{E^{3}} = λ_{E^{4}} = 0 \end{matrix}

(12)

The unique solution to the above TPBVP can be expressed as $s^{E} = (Δ t^{E}, v_{E x}^{E}, v_{E y}^{E}, λ_{E^{1}}^{E}, λ_{E^{2}}^{E})$, representing the minimum time, the evader’s velocity components, and its first two costate components at termination, respectively. Now, consider $s_{θ_{0}}^{E} = (Δ t (θ_{0}), v_{E x}^{f} (θ_{0}), v_{E y}^{f} (θ_{0}), k λ_{E^{1}}^{f} (θ_{0}), k λ_{E^{2}}^{f} (θ_{0}))$, where $k = \frac{1}{λ_{P^{1}}^{f} (θ_{0}) v_{p x}^{f} (θ_{0}) + λ_{P^{2}}^{f} (θ_{0}) v_{p y}^{f} (θ_{0})}$. Here $k$ denotes a scaling factor, distinct from the number of pursuers. All elements in $s_{θ_{0}}^{E}$ are derived from $s (θ_{0})$ defined by (11).

We now prove that $s^{E} = s_{θ_{0}}^{E}$, which is equivalent to verifying that $s_{θ_{0}}^{E}$ also satisfies (12). From the last two equality conditions in (10), it follows directly that the last two conditions of (12) are satisfied. The first condition of (10) implies that $(Δ t (θ_{0}), v_{E x}^{f} (θ_{0}), v_{E y}^{f} (θ_{0}), λ_{E^{1}}^{f} (θ_{0}), λ_{E^{2}}^{f} (θ_{0}))$ satisfies the first condition in (12). Therefore, we need only prove that the proportional scaling of the first two costate components at termination does not affect the state dynamics governed by (2).

Consider the costate dynamics for $E$ governed by (5). When the game duration is $∆ t (θ_{0})$ and the terminal costate values are $(λ_{E^{1}}^{f} (θ_{0}), λ_{E^{2}}^{f} (θ_{0}), 0, 0)$, the time evolution of the costate vector is given by

(\begin{matrix} λ_{E^{1}} \\ λ_{E^{2}} \\ λ_{E^{3}} \\ λ_{E^{4}} \end{matrix}) = λ_{E^{1}}^{f} (θ_{0}) (\begin{matrix} 4 - 3 \cos w (∆ t (θ_{0}) - t) \\ 0 \\ \frac{1}{w} \sin w (∆ t (θ_{0}) - t) \\ \frac{2}{w} (1 - \cos w (∆ t (θ_{0}) - t)) \end{matrix}) + λ_{E^{2}}^{f} (θ_{0}) (\begin{matrix} 6 \sin w (∆ t (θ_{0}) - t) - 6 w (∆ t (θ_{0}) - t) \\ 1 \\ - \frac{2}{w} (1 - \cos w (∆ t (θ_{0}) - t)) \\ \frac{4}{w} \sin w (∆ t (θ_{0}) - t) - 3 (∆ t (θ_{0}) - t) \end{matrix}) .

(13)

Equation (13) demonstrates that proportional scaling of the first two terminal costates results in proportional scaling of the costate vector at every time instant. By (7), such proportional scaling of the costates does not alter the state dynamics governed by (2). Therefore, $s^{E} = s_{θ_{0}}^{E}$.

The equivalence for the pursuer’s minimum-time trajectory to $(r_{x}^{f} (θ_{0}), r_{y}^{f} (θ_{0}))$ follows analogously by considering the time-optimal control problem of $P$. □
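The scaling argument used in the proof can be checked numerically without relying on any closed-form expression. The sketch below integrates the costate system (5) from terminal values of the form $(λ^{1}, λ^{2}, 0, 0)$ and compares an unscaled run with a run whose terminal values are multiplied by a positive factor. Treating the integration variable as the time-to-go $∆ t - t$, the RK4 integrator, the positive scale factor, and all numerical values are assumptions made for illustration only.

```python
import math

def costate_rhs(lam, w):
    """Right-hand side of the costate system (5) for one player."""
    l1, l2, l3, l4 = lam
    return (3.0 * w**2 * l3, 0.0, l1 - 2.0 * w * l4, l2 + 2.0 * w * l3)

def propagate_costates(lam_terminal, w, tau, steps=3000):
    """RK4 integration of (5) over a time-to-go tau, starting from terminal values."""
    lam = tuple(lam_terminal)
    h = tau / steps
    for _ in range(steps):
        k1 = costate_rhs(lam, w)
        k2 = costate_rhs(tuple(x + 0.5 * h * k for x, k in zip(lam, k1)), w)
        k3 = costate_rhs(tuple(x + 0.5 * h * k for x, k in zip(lam, k2)), w)
        k4 = costate_rhs(tuple(x + h * k for x, k in zip(lam, k3)), w)
        lam = tuple(
            x + h / 6.0 * (a + 2.0 * b + 2.0 * c + d)
            for x, a, b, c, d in zip(lam, k1, k2, k3, k4)
        )
    return lam

def thrust_direction(lam):
    """Unit vector along (lambda_3, lambda_4), the quantity that enters (7)."""
    norm = math.hypot(lam[2], lam[3])
    return (lam[2] / norm, lam[3] / norm)

# Hypothetical values: orbit rate, time-to-go, and a positive scale factor.
w, tau, scale = 1.0e-3, 2000.0, 2.5
base = propagate_costates((0.3, -0.7, 0.0, 0.0), w, tau)
scaled = propagate_costates((scale * 0.3, scale * -0.7, 0.0, 0.0), w, tau)
```

Each component of `scaled` equals `scale` times the corresponding component of `base` up to rounding, and `thrust_direction` returns the same unit vector for both, which is the property the proof relies on.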

Proposition 1 establishes an exact mathematical property of the interception domain. The uniqueness of the domain itself is inherent to its definition by the initial conditions. While numerical computation of the domain boundary (by solving the associated TPBVP) is subject to the usual sensitivities of boundary value problems, the theoretical scaling law holds exactly for the true domain. In our simulations, the multiple shooting method [22], a robust numerical scheme, was employed to ensure accurate solutions.

Based on this property, we derive the following corollary, which forms the foundation for applying the concept of the interception domain to the “encirclement-capture” game.

Corollary 1. If the boundary $l_{1}$ of the interception domain defined by $P_{1}$ and $E$ intersects the boundary $l_{2}$ of the interception domain defined by $P_{2}$ and $E$ at a point $(x_{0}, y_{0})$, then the minimum times for $P_{1}$, $P_{2}$, and $E$ to reach $(x_{0}, y_{0})$ are identical. The trajectory for $E$ to reach $(x_{0}, y_{0})$ is identical in both interception domains and constitutes its minimum-time trajectory (as shown in Figure 4).

Figure 4. Trajectory of $E$ to the intersection point.

The intersection points of interception domain boundaries, as established in Corollary 1, play a foundational role in defining simultaneous interception events in multi-pursuer games. The property that all players share an identical minimum arrival time at such a point provides the crucial spatiotemporal agreement necessary to coordinate multiple pursuers against a single evader.

For any given pair of pursuers and an evader, the resulting pair of intersection points is unique under specific initial conditions, as it is geometrically determined by their uniquely defined interception domains.

Additionally, an alternative equivalent definition of the interception domain can be given based on Corollary 1: the interception domain is the set of points satisfying the notion that the minimum time for

E

to reach the point is less than or equal to that for

P

. On the boundary of the interception domain, the minimum times for

E

and

P

are exactly equal; in the interior of the interception domain,

E

can arrive before

P

.
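The equivalent definition can be illustrated with a minimal Python sketch. It assumes a deliberately simplified model that is not the paper's CW dynamics: both players start at rest, there is no gravity-gradient term, and each applies constant thrust along a straight line, so the minimum time over a distance d with acceleration a is sqrt(2d/a). The function names and the numerical values are hypothetical.

```python
import math

def min_time_rest(start, target, accel):
    """Minimum time to reach `target` from rest with constant thrust.

    Toy model only: zero initial velocity and no orbital dynamics.
    """
    dist = math.dist(start, target)
    return math.sqrt(2.0 * dist / accel)

def in_interception_domain(point, evader, pursuer, a_e, a_p, tol=1e-9):
    """True when E can reach `point` no later than P (equivalent definition)."""
    t_e = min_time_rest(evader, point, a_e)
    t_p = min_time_rest(pursuer, point, a_p)
    return t_e <= t_p + tol

# Hypothetical configuration: evader at the origin, faster pursuer on the x-axis.
evader, pursuer = (0.0, 0.0), (10.0, 0.0)
a_e, a_p = 1.0, 2.0

print(in_interception_domain((0.0, 0.0), evader, pursuer, a_e, a_p))  # evader's start
print(in_interception_domain((9.0, 0.0), evader, pursuer, a_e, a_p))  # close to the pursuer
```

In this toy setting the boundary is the set of points where the two arrival times coincide, for example (10/3, 0) on the line joining the players.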

4. Orbital Two-Versus-One Pursuit–Evasion Game

Consider a game with two pursuers (

P_{1}, P_{2}

) and one evader (

E

). Without loss of generality, we set

E

as the reference satellite in the relative coordinate system. Define

d_{1}, d_{2}

as the interception domains determined by

(P_{1}, E)

and

(P_{2}, E)

, with boundaries

l_{1}, l_{2}

, respectively. Under the Nash equilibrium for the one-versus-one pursuit–evasion game between

P_{1}

and

E

, both players reach point

N_{1} \in l_{1}

at time

t_{1}^{*}

. Similarly, for

P_{2}

and

E

, they reach point

N_{2} \in l_{2}

at time

t_{2}^{*}

under their Nash equilibrium. For the sake of conciseness, this paper does not address cases where conditional judgments yield equality, as such scenarios occur with Lebesgue measure zero in practice and can be arbitrarily approximated by strict inequality conditions. Thus, without loss of generality, assume

t_{1}^{*} < t_{2}^{*}

.

Consider the relationship between the two interception domains. By definition, both interception domains contain the origin since it is the initial position of

E

, implying a non-empty overlap region

d_{12}^{'} = d_{1} \cap d_{2}

. Consequently, the region reachable by

E

prior to interception is precisely

d_{12}^{'}

. Define

T_{i}^{*} (x)

as the minimum-time function for player

i \in \{E, P_{1}, P_{2}\}

to reach point

x

. Under game equilibrium, the position where

E

is intercepted (hereafter termed the interception point) must satisfy

x^{*} = \arg\max_{x} \min_{i} T_{i}^{*} (x), \quad \text{for } i \in \{P_{1}, P_{2}\} .

(14)

In addition, by definition, the minimum time for

E

to reach any point within

d_{1}

is strictly less than

t_{1}^{*}

. Consequently,

N_{2}

must lie outside

d_{12}^{'}

. At this stage, the sole determinant of the game equilibrium is whether

N_{1}

lies within

d_{12}^{'}

(as illustrated in Figure 5,

d_{1}

and

d_{2}

are shown by blue and green regions respectively. Definitions of all points in sketches of the interception domain are summarized in Appendix B). Theorem 2 characterizes the Nash equilibrium solutions under these two distinct scenarios.

Figure 5. Sketch of interception domains in two-versus-one pursuit–evasion game.

Theorem 2.

The Nash equilibrium solution depends on whether

N_{1}

lies within

d_{12}^{'}

, as follows:

(1): If $N_{1}$ lies within $d_{12}^{'}$ , the game degenerates into a one-versus-one pursuit–evasion game between $P_{1}$ and $E$ . Under equilibrium, $P_{1}$ intercepts $E$ at point $N_{1}$ ;
(2): If $N_{1}$ lies outside $d_{12}^{'}$ , under equilibrium, $P_{1}$ and $P_{2}$ simultaneously intercept $E .$

Proof.

The proof of (1) is straightforward. Since

E

can reach

N_{1}

at time

t_{1}^{*}

before interception, and

t_{1}^{*}

is the maximum interception time for

E

within

d_{1}

, it is also the maximum interception time within

d_{12}^{'}

.

The proof of (2) proceeds in two steps. First, note that the two interception domains are not nested. Let their intersection points be

J_{1}

and

J_{2}

, and assume without loss of generality that

T_{E}^{*} (J_{1}) > T_{E}^{*} (J_{2})

. Thus,

T_{E}^{*} (J_{1}) = T_{p_{1}}^{*} (J_{1}) = T_{p_{2}}^{*} (J_{1}) > T_{E}^{*} (J_{2}) = T_{p_{1}}^{*} (J_{2}) = T_{p_{2}}^{*} (J_{2})

. We now analyze two types of points in

d_{12}^{'}

.

Boundary points of

d_{12}^{'}

: Suppose there exists a point

x_{1}

on the boundary segment

l_{1}

of

d_{1}

within

d_{12}^{'}

such that

T_{p_{1}}^{*} (x_{1}) > T_{p_{1}}^{*} (J_{1})

. Since

T_{p_{1}}^{*} (x_{1}) < T_{p_{1}}^{*} (N_{1})

, continuity implies the following: (a) On the segment

J_{1}

to

N_{1}

(excluding

J_{2}

), there exists

x_{2}

satisfying

T_{p_{1}}^{*} (x_{2}) = T_{p_{1}}^{*} (x_{1})

. (b) On the segment

J_{2}

to

N_{1}

(excluding

J_{1}

), there exists

x_{3}

satisfying

T_{p_{1}}^{*} (x_{3}) = T_{p_{1}}^{*} (x_{1})

. However,

P_{1}

’s reachable set at time

T_{p_{1}}^{*} (x_{1})

intersects

d_{1}

at most twice, leading to a contradiction. Thus, no point on

l_{1}

within

d_{12}^{'}

except

J_{1}

can be an equilibrium interception point.

Interior points of

d_{12}^{'}

: Suppose an interior point

x_{1}

satisfies

T_{p_{1}}^{*} (x_{1}) \neq T_{p_{2}}^{*} (x_{1})

, e.g.,

T_{p_{1}}^{*} (x_{1}) > T_{p_{2}}^{*} (x_{1})

. Since

T_{p_{2}}^{*} (x)

must have an ascending gradient at

x_{1}

, there exists a neighboring point

x_{2} \in d_{12}^{'}

such that

T_{p_{1}}^{*} (x_{2}) > T_{p_{2}}^{*} (x_{2}) > T_{p_{2}}^{*} (x_{1})

. Here,

E

would be intercepted later at

x_{2}

, contradicting optimality. Thus, interior capture points must satisfy

T_{p_{1}}^{*} (x) = T_{p_{2}}^{*} (x)

.

In summary,

E

will be intercepted simultaneously by

P_{1}

and

P_{2}

in (2). □

This paper hereafter refers to the two scenarios in Theorem 2 as the degenerate scenarios and non-degenerate scenarios, respectively. Theorem 2 not only explains the classification of scenarios in Sun et al. [24] from the perspective of interception domain theory but also demonstrates that distinguishing between these two scenarios is equivalent to determining whether the interception point with the shorter interception time (from the Nash equilibrium of a one-versus-one game) lies within the other pursuer’s interception domain. This, in turn, is equivalent to checking whether the minimum time for the other pursuer to reach that interception point exceeds the interception time of the one-versus-one equilibrium. Based on this, we design the following judgment algorithm:

Step 1: Solve the Nash equilibrium for the one-versus-one pursuit–evasion game between $P_{1}$ and $E$ , and between $P_{2}$ and $E$ , respectively. Obtain the capture points $N_{1}$ , $N_{2}$ and interception times $t_{1}^{*}$ , $t_{2}^{*}$ .
Step 2: Compare $t_{1}^{*}$ and $t_{2}^{*}$ . Without loss of generality, assume $t_{1}^{*} < t_{2}^{*}$ .
Step 3: Compute the minimum time $T_{p_{2}}^{*} (N_{1})$ for $P_{2}$ to reach $N_{1}$ . Compare $T_{p_{2}}^{*} (N_{1})$ and $t_{1}^{*}$ . If $T_{p_{2}}^{*} (N_{1}) < t_{1}^{*}$ , the game is non-degenerate. Otherwise, the game is degenerate.

The above algorithm for degeneracy determination in the two-versus-one pursuit–evasion game requires solving two 16-dimensional TPBVPs in the first step and one 8-dimensional TPBVP in the third step. The overall computational complexity is

O (m_{t} n)

, where

m_{t}

denotes the maximum number of iterations for solving each TPBVP and

n

represents the number of discretization nodes used in the numerical solution of the differential equations. When

m_{t} = 300

and

n = 100

, on a system with 8 GB RAM and an i5-6500 3.2 GHz processor, the total computation time typically ranges between 0.05 and 0.1 s.

The initial states in Figure 5a,b are listed in Section 7.1 and Section 7.2. In Figure 5a,

t_{1}^{*} = 104.09 s

,

t_{2}^{*} = 109.55 s

, and

T_{p_{2}}^{*} (N_{1}) = 105.01 s

. As

t_{1}^{*} < t_{2}^{*}

and

T_{p_{2}}^{*} (N_{1}) > t_{1}^{*}

, the game is degenerate. In Figure 5b,

t_{1}^{*} = 114.86 s

,

t_{2}^{*} = 109.55 s

, and

T_{p_{1}}^{*} (N_{2}) = 76.59 s

. As

t_{1}^{*} > t_{2}^{*}

and

T_{p_{1}}^{*} (N_{2}) < t_{2}^{*}

, the game is non-degenerate.
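The comparison in the judgment algorithm can be sketched in a few lines of Python. The function name and argument names are hypothetical; the three times are assumed to have been obtained beforehand from the TPBVP solutions, and the values below are those quoted for Figure 5a,b.

```python
def classify_two_vs_one(t_fast, t_slow, t_other_at_fast_point):
    """Degeneracy test for a two-versus-one game.

    t_fast, t_slow: one-versus-one interception times, ordered so that
        t_fast < t_slow.
    t_other_at_fast_point: minimum time for the pursuer of the slower game
        to reach the interception point of the faster game.
    """
    if not t_fast < t_slow:
        raise ValueError("label the pursuers so that t_fast < t_slow")
    # If the other pursuer can arrive first, the point lies outside its
    # interception domain, so both pursuers take part in the capture.
    if t_other_at_fast_point < t_fast:
        return "non-degenerate"
    return "degenerate"

# Figure 5a: t1* = 104.09 s, t2* = 109.55 s, T_p2*(N1) = 105.01 s
print(classify_two_vs_one(104.09, 109.55, 105.01))  # degenerate
# Figure 5b: t2* = 109.55 s < t1* = 114.86 s, T_p1*(N2) = 76.59 s
print(classify_two_vs_one(109.55, 114.86, 76.59))   # non-degenerate
```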

T_{p_{2}}^{*} (N_{1})

is obtained by solving the TPBVP defined by (2), (5), and (12). Compared to Sun et al. [24], the primary distinction of this degeneracy determination lies in Step 3: It requires computing only the minimum time for a single spacecraft to reach a fixed point, rather than the interception time for a single spacecraft to capture another spacecraft following a known trajectory. Consequently, the TPBVP solved in this paper is lower-dimensional, offering significant advantages in computational efficiency.

Regarding the specific computational methods for both scenarios after degeneracy determination, Sun et al. [24] provide sufficiently detailed definitions of the corresponding TPBVPs. To avoid redundancy, we omit restating these formulations in this paper.

Denote the interception point under the equilibrium of the two-versus-one pursuit–evasion game defined by (

P_{1}, P_{2}, E

) as

x_{12}^{*}

. To generalize to broader scenarios, we further characterize

x_{12}^{*}

in the non-degenerate case. From the proof of Theorem 2, the minimum times for

P_{1}

and

P_{2}

to reach

x_{12}^{*}

are identical, implying that

x_{12}^{*}

lies on the arc segment between endpoints

J_{1}

and

J_{2}

within the interception domain

d_{12}

defined by (

P_{1}, P_{2}

) (illustrated in Figure 6). This arc segment denoted as

\hat{h_{12}}

is uniquely determined by the initial states of (

P_{1}, P_{2}, E

).

Figure 6. The interception point in the non-degenerate scenarios.

5. Orbital Three-Versus-One Pursuit–Evasion Game

Consider a game with three pursuers (

P_{1}, P_{2}, P_{3}

) and one evader (

E

). Following the same methodology, we set

E

as the reference satellite in the relative coordinate system. Define

d_{1}, d_{2}, d_{3}

as the interception domains determined by (

P_{1}, E

), (

P_{2}, E

), and (

P_{3}, E

), with boundaries

l_{1}, l_{2}, l_{3}

, respectively. Analogously, define the interception points in a one-versus-one game as

N_{1}, N_{2}, N_{3}

.

Since

d_{1}, d_{2}, d_{3}

all contain the origin, if any domain is nested within another, the game degenerates to one with fewer pursuers. Thus, we consider only the case where the three interception domains pairwise intersect (as shown in Figure 7,

d_{1}

,

d_{2}

and

d_{3}

are shown by blue, green and red regions respectively). Denote the intersection points of the boundaries

l_{1}, l_{2}

as

J_{1}, J_{2}

, those of

l_{1}, l_{3}

as

K_{1}, K_{2}

, and those of

l_{2}, l_{3}

as

L_{1}, L_{2}

. The arc segments

\hat{h_{12}}, \hat{h_{13}}, \hat{h_{23}}

(defined in the previous section) are indicated by red dashed lines. The intersection region of

d_{1}, d_{2}, d_{3}

is denoted as

d_{13}^{'}

. Similarly, the interception point in

d_{13}^{'}

must satisfy

x^{*} = \arg\max_{x} \min_{i} T_{i}^{*} (x), \quad \text{for } i \in \{P_{1}, P_{2}, P_{3}\} .

(15)

Figure 7. Sketch of interception domains in three-versus-one pursuit–evasion game.

We first perform degeneracy determination based on initial states. The variety and complexity of cases here far exceed those in two-versus-one pursuit–evasion games. We initially divide them into degenerate and non-degenerate scenarios, then categorize the equilibrium interception points under both scenarios.

As shown in Table 2, degenerate scenarios can be classified into two types according to whether the game degenerates into a one-versus-one game or a two-versus-one game. If two equilibrium interception points from degenerate games coexist in

d_{13}^{'}

, the point with the longer interception time must lie within the interception domain of the other degenerate game. This contradicts the definition of the equilibrium with the shorter interception time. Therefore, these two types are mutually exclusive, and the corresponding degenerate game is unique.

Table 2. Two types of the degenerate scenarios in three-versus-one game.

Note that Figure 7a is a Type II degenerate scenario since

\hat{h_{12}} \subset d_{13}^{'}

. The initial states and game equilibrium in Figure 7b are presented in Section 7.3. In addition, if any interception domain contains another (e.g.,

d_{i} \supseteq d_{j}

), the pursuer corresponding to the containing domain does not affect the game outcome. Such scenarios are necessarily degenerate scenarios.

Based on the above analysis, we summarize the algorithm for determining degeneracy and solving degenerate scenarios in three-vs.-one games as follows:

Step 1: Solve the Nash equilibrium for the one-vs.-one games between $(P_{1}, E)$ , $(P_{2}, E)$ , and $(P_{3}, E)$ , obtaining interception points $N_{1}, N_{2}, N_{3}$ and interception times $t_{1}^{*}, t_{2}^{*}, t_{3}^{*}$ . Compare $t_{1}^{*}, t_{2}^{*}, t_{3}^{*}$ . Assume $t_{1}^{*} < t_{2}^{*} < t_{3}^{*}$ without loss of generality.
Step 2: Compute $T_{p_{2}}^{*} (N_{1})$ , $T_{p_{3}}^{*} (N_{1})$ , and $T_{p_{3}}^{*} (N_{2})$ . Check if both $T_{p_{2}}^{*} (N_{1}) > t_{1}^{*}$ and $T_{p_{3}}^{*} (N_{1}) > t_{1}^{*}$ hold. If true, the game is a Type I degenerate scenario: $P_{1}$ intercepts $E$ at $N_{1}$ . Else, proceed.
Step 3: If $T_{p_{2}}^{*} (N_{1}) < t_{1}^{*}$ , compute the interception point $x_{12}^{*}$ and interception time $t_{12}^{*}$ for the non-degenerate two-vs.-one game defined by $(P_{1}, P_{2}, E)$ . Similarly, if $T_{p_{3}}^{*} (N_{1}) < t_{1}^{*}$ , compute $x_{13}^{*}$ and $t_{13}^{*}$ for $(P_{1}, P_{3}, E)$ . If $T_{p_{3}}^{*} (N_{2}) < t_{2}^{*}$ , compute $x_{23}^{*}$ and $t_{23}^{*}$ for $(P_{2}, P_{3}, E)$ .
Step 4: For each $x_{i j}^{*}$ computed in Step 3, determine $T_{p_{k}}^{*} (x_{i j}^{*})$ (where $k$ is the third pursuer). If any $T_{p_{k}}^{*} (x_{i j}^{*}) > t_{i j}^{*}$ , the game is a Type II degenerate scenario: $P_{i}$ and $P_{j}$ intercept $E$ at $x_{i j}^{*}$ . Otherwise, the game is non-degenerate.

The above algorithm for degeneracy determination in the three-versus-one pursuit–evasion game requires solving three 16-dimensional TPBVPs in the first step, three 8-dimensional TPBVPs in the second step, and up to three additional 8-dimensional TPBVPs in the fourth step. The overall computational complexity is

O (m_{t} n)

. When

m_{t} = 300

and

n = 100

, the total computation time typically ranges between 0.1 and 0.15 s. The decision tree presented in Steps 1–4 is summarized in Figure 8.

Figure 8. Decision tree for determining degeneracy.
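The decision logic of Steps 2–4 can be sketched as follows. This is only an outline under stated assumptions: the TPBVP solvers are replaced by two hypothetical callables, `reach_time(k, point)` for the minimum time of pursuer k to reach a point and `solve_pair(i, j)` for the non-degenerate two-vs.-one equilibrium, and the numbers in the usage example are made up for illustration rather than taken from the paper.

```python
def classify_three_vs_one(t, N, reach_time, solve_pair):
    """Degeneracy determination for a three-vs.-one game.

    t, N: dicts keyed 1..3 holding the one-vs.-one interception times and
        points, labelled so that t[1] < t[2] < t[3] (Step 1).
    reach_time(k, point): hypothetical callable returning T*_{p_k}(point).
    solve_pair(i, j): hypothetical callable returning (x_ij, t_ij).
    """
    # Step 2: can any other pursuer reach N1 before P1 captures E there?
    T21 = reach_time(2, N[1])
    T31 = reach_time(3, N[1])
    T32 = reach_time(3, N[2])
    if T21 > t[1] and T31 > t[1]:
        return ("degenerate type I", (1,), N[1])

    # Step 3: collect the pairs whose two-vs.-one game is non-degenerate.
    pairs = []
    if T21 < t[1]:
        pairs.append((1, 2))
    if T31 < t[1]:
        pairs.append((1, 3))
    if T32 < t[2]:
        pairs.append((2, 3))

    # Step 4: the pair equilibrium stands if the third pursuer arrives too late.
    for i, j in pairs:
        x_ij, t_ij = solve_pair(i, j)
        k = 6 - i - j  # index of the remaining pursuer
        if reach_time(k, x_ij) > t_ij:
            return ("degenerate type II", (i, j), x_ij)
    return ("non-degenerate", (1, 2, 3), None)

# Illustrative stub data (made-up numbers, points represented by labels).
t = {1: 100.0, 2: 110.0, 3: 120.0}
N = {1: "N1", 2: "N2", 3: "N3"}
table = {(2, "N1"): 90.0, (3, "N1"): 130.0, (3, "N2"): 125.0, (3, "x12"): 99.0}
result = classify_three_vs_one(
    t, N,
    reach_time=lambda k, p: table[(k, p)],
    solve_pair=lambda i, j: ("x12", 95.0),
)
print(result)
```

Passing the solvers as callables keeps the branching logic separate from the numerical machinery, so the tree can be checked with stub values before any TPBVP is solved.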

Due to the extreme complexity of classifying non-degenerate scenarios, we forgo rigorously characterizing equilibrium interception points. Instead, we identify candidate points for verification. Without loss of generality, assume the maneuver accelerations of the three pursuers satisfy

T_{P_{1}} > T_{P_{2}} > T_{P_{3}}

. Thus, we can define the interception domains

d_{12}

,

d_{13}

,

d_{23}

for pairs

(P_{1}, P_{2})

,

(P_{1}, P_{3})

, and

(P_{2}, P_{3})

. Note that intersection points of

d_{12}

and

d_{13}

satisfy

T_{p_{1}}^{*} (x) = T_{p_{2}}^{*} (x) = T_{p_{3}}^{*} (x)

and must lie on the boundary of

d_{23}

. Thus,

d_{12}

,

d_{13}

, and

d_{23}

intersect at two points

o_{1}

and

o_{2}

. By definition, if

o_{1}, o_{2} \in d_{13}^{'}

, they must be intersection points of the arcs

\hat{h_{12}}

,

\hat{h_{23}}

, and

\hat{h_{13}}

.

Theorem 3.

In the non-degenerate scenarios of a three-vs.-one game, only two types of equilibrium exist:

Type I: The interception point is located at pairwise intersection points of

d_{1}, d_{2}, d_{3}

:

J_{1}, J_{2}, K_{1}, K_{2}, L_{1}, L_{2}

.

Type II: The interception point is located at intersection points

o_{1}, o_{2}

of

d_{12}, d_{13}, d_{23}

.

Proof.

First consider the boundary of

d_{13}^{'}

. Similar to the proof of Theorem 2.(1), the interception point cannot lie strictly within any interior segment of

l_{i}

; thus, only pairwise intersections

J_{1}, J_{2}, K_{1}, K_{2}, L_{1}, L_{2}

are candidate points.

Next, consider the interior of

d_{13}^{'}

. Analogous to Theorem 2.(2), any candidate point

x

must satisfy

T_{p_{i}}^{*} (x) = T_{p_{j}}^{*} (x) \leq T_{p_{k}}^{*} (x)

. If the interception point

x

satisfies

T_{p_{i}}^{*} (x) = T_{p_{j}}^{*} (x) < T_{p_{k}}^{*} (x)

,

x

is the point with the longest interception time within its neighborhood in

\hat{h_{i j}}

. However, this point must be

x_{i j}^{*}

, leading to a contradiction. Therefore,

x

must satisfy

T_{p_{1}}^{*} (x) = T_{p_{2}}^{*} (x) = T_{p_{3}}^{*} (x)

.

By the equivalent definition of the interception domain, the set of points where

T_{p_{1}}^{*} (x) = T_{p_{3}}^{*} (x)

must lie on the boundary of the domain

d_{13}

, and similarly, points satisfying

T_{p_{2}}^{*} (x) = T_{p_{3}}^{*} (x)

must lie on the boundary of

d_{23}

. Therefore, any point

x

satisfying

T_{p_{1}}^{*} (x) = T_{p_{2}}^{*} (x) = T_{p_{3}}^{*} (x)

must lie simultaneously on the boundary of

d_{13}

and the boundary of

d_{23}

. By definition, the intersection of the two boundaries is exactly the set

\{o_{1}, o_{2}\}

. □

In the Type I non-degenerate scenario, although only two pursuers simultaneously intercept the evader, the presence of the third pursuer influences the evader’s strategy, resulting in an interception time shorter than in a game with only the other two pursuers. In the Type II non-degenerate scenario, all three pursuers intercept the evader simultaneously.

The key to computing the three-versus-one game equilibrium of the non-degenerate scenario lies in calculating intersections of pairwise interception domains. As discussed in Section 3, the interception domain can be approximated by an Apollonius circle scaled according to the ratio of the players’ accelerations. Therefore, the method for computing interception domain intersections involves first solving for intersections of these Apollonius circles, then using a homotopy method to gradually reintroduce the effects of relative velocity and gravitational field. The specific algorithm steps are as follows:

Step 1: Calculate the equations of the two Apollonius circles based on the two sets of satellite initial positions and acceleration magnitudes, respectively. For initial positions $x_{1}, x_{2}$ and acceleration magnitudes $a_{1}, a_{2}$ (assume without loss of generality that $a_{1} > a_{2}$ ), the center of the corresponding Apollonius circle is $(a_{1}^{2} x_{2} - a_{2}^{2} x_{1}) / (a_{1}^{2} - a_{2}^{2})$ , and the radius is $‖ (a_{1} a_{2} (x_{2} - x_{1})) / (a_{1}^{2} - a_{2}^{2}) ‖$ . Calculate the intersection points of the two Apollonius circles, denoted as $x_{1}^{f}, x_{2}^{f}$ .
Step 2: Define the evaluation function $h (n, ϕ, x) = |T_{p_{1}}^{*} (x) - T_{E}^{*} (x)| + |T_{p_{2}}^{*} (x) - T_{E}^{*} (x)|$ , which sums, over the two pursuer–evader pairs, the absolute difference between the minimum times of the pursuer and the evader to reach point $x$ , evaluated when the angular velocity of the relative coordinate frame is $n$ and the pursuer’s initial velocity is scaled to $v_{0} ϕ$ . By definition, $x_{1}^{f}$ and $x_{2}^{f}$ are the zeros of $h (0, 0, x)$ . Using $x_{1}^{f}$ and $x_{2}^{f}$ as starting points for the interior-point algorithm, obtain two zeros of $h (n, 0, x)$ , denoted as $x_{1}^{0}$ and $x_{2}^{0}$ .
Step 3: Set the number of iteration steps $m$ , and sequentially calculate the two zeros of $h (n, 1 / m, x)$ , $h (n, 2 / m, x)$ , $\dots$ , $h (n, m / m, x)$ . The two zeros of $h (n, 1 / m, x)$ , denoted as $x_{1}^{(1 / m)}$ and $x_{2}^{(1 / m)}$ , are computed using $x_{1}^{0}$ and $x_{2}^{0}$ as starting points for the interior-point algorithm, respectively. Similarly, for $k = 2,3, \dots, m$ , the two zeros of $h (n, k / m, x)$ , denoted as $x_{1}^{(k / m)}$ and $x_{2}^{(k / m)}$ , are computed using $x_{1}^{((k - 1) / m)}$ and $x_{2}^{((k - 1) / m)}$ as starting points for the interior-point algorithm, respectively. The final two zeros of $h (n, 1, x)$ , denoted as $x_{1}^{1}$ and $x_{2}^{1}$ , are the intersection points of the two interception domains.
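The first step of the procedure above, which provides the starting points for the homotopy, can be sketched in Python. The center and radius follow the formulas quoted in the text; the circle-circle intersection is the standard planar construction. The function names and the example configuration (evader at the origin, two pursuers with twice its acceleration) are hypothetical.

```python
import math

def apollonius_circle(x1, x2, a1, a2):
    """Center and radius of the Apollonius circle for positions x1, x2.

    Assumes a1 > a2, i.e. x1 belongs to the player with the larger acceleration.
    """
    den = a1**2 - a2**2
    center = ((a1**2 * x2[0] - a2**2 * x1[0]) / den,
              (a1**2 * x2[1] - a2**2 * x1[1]) / den)
    radius = a1 * a2 * math.dist(x1, x2) / den
    return center, radius

def circle_intersections(c1, r1, c2, r2):
    """Intersection points of two circles (empty list if they do not cross)."""
    d = math.dist(c1, c2)
    if d == 0 or d > r1 + r2 or d < abs(r1 - r2):
        return []
    a = (r1**2 - r2**2 + d**2) / (2 * d)      # distance from c1 to the chord
    h = math.sqrt(max(r1**2 - a**2, 0.0))     # half-length of the chord
    mx = c1[0] + a * (c2[0] - c1[0]) / d
    my = c1[1] + a * (c2[1] - c1[1]) / d
    ox = h * (c2[1] - c1[1]) / d
    oy = h * (c2[0] - c1[0]) / d
    return [(mx + ox, my - oy), (mx - ox, my + oy)]

# Hypothetical configuration in the evader-centered frame.
evader, p1, p2 = (0.0, 0.0), (10.0, 0.0), (0.0, 10.0)
c1, r1 = apollonius_circle(p1, evader, 2.0, 1.0)
c2, r2 = apollonius_circle(p2, evader, 2.0, 1.0)
points = circle_intersections(c1, r1, c2, r2)
print(c1, r1)
print(points)
```

At each returned point the distance-to-acceleration ratios of the evader and both pursuers coincide, which is the zero-velocity, gravity-free analogue of the equal-time property in Corollary 1.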

The above algorithm for computing interception domain intersections involves invoking the interior-point method

m

times. Let

m_{i}

denote the maximum number of iterations required by the interior-point solver. Each invocation requires solving up to

m_{i}

sets of three four-dimensional TPBVPs. Thus, the overall computational complexity is

O (m m_{i} m_{t} n)

. In this paper, each intermediate problem (for

ϕ = k / m, k = 0, \dots, m

) is solved using MATLAB R2022a’s fmincon routine with a step-size tolerance of

1 \times 10^{- 6}

and a constraint tolerance of

1 \times 10^{- 8}

. When

m_{t} = 300

,

n = 100

,

m_{i} = 3000,

and

m = 100

, the overall numerical error of the algorithm is on the order of

1 \times 10^{- 6}

. The total computation time ranges from 100 to 150 s.
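The warm-started continuation idea can be illustrated on a toy problem. The sketch below is not the paper's solver: it swaps the interior-point minimization of h for a plain Newton iteration on a two-equation residual, and the homotopy parameter phi merely shifts the center of one circle instead of reintroducing relative velocity and gravity. All names and numbers are hypothetical.

```python
def newton_2d(g, x, tol=1e-10, max_iter=50, eps=1e-7):
    """Solve g(x) = 0 for x in R^2 using a finite-difference Jacobian."""
    for _ in range(max_iter):
        f0 = g(x)
        if abs(f0[0]) + abs(f0[1]) < tol:
            break
        fx = g((x[0] + eps, x[1]))
        fy = g((x[0], x[1] + eps))
        j11, j12 = (fx[0] - f0[0]) / eps, (fy[0] - f0[0]) / eps
        j21, j22 = (fx[1] - f0[1]) / eps, (fy[1] - f0[1]) / eps
        det = j11 * j22 - j12 * j21
        # Cramer's rule for J * step = -f0
        dx = (-f0[0] * j22 + f0[1] * j12) / det
        dy = (-f0[1] * j11 + f0[0] * j21) / det
        x = (x[0] + dx, x[1] + dy)
    return x

def residual(phi, x):
    """Toy stand-in for the boundary conditions: two circles of radius 5.

    The first center moves from (0, 0) at phi = 0 to (2, 0) at phi = 1.
    """
    cx = 2.0 * phi
    return ((x[0] - cx) ** 2 + x[1] ** 2 - 25.0,
            (x[0] - 6.0) ** 2 + x[1] ** 2 - 25.0)

m = 10                 # number of homotopy steps
x = (3.0, 4.0)         # known solution of the phi = 0 problem
for k in range(1, m + 1):
    phi = k / m
    # Warm start: the previous solution seeds the next, slightly harder problem.
    x = newton_2d(lambda p, phi=phi: residual(phi, p), x)
print(x)
```

The design point carried over from the algorithm is that each intermediate problem starts from the previous solution, so the solver tracks one branch of intersection points rather than jumping to the other.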

Based on the aforementioned algorithm for solving intersections of pairwise interception domains, the algorithm for finding interception points in non-degenerate scenarios can be derived from Theorem 3. The specific steps are as follows:

Step 1: Calculate the intersections of pairwise interception domains $J_{1}, J_{2}$ (for $d_{1}$ and $d_{2}$ ), $K_{1}, K_{2}$ (for $d_{1}$ and $d_{3}$ ), and $L_{1}, L_{2}$ (for $d_{2}$ and $d_{3}$ ), as well as the intersection points $o_{1}, o_{2}$ of $d_{12}$ , $d_{13}$ , and $d_{23}$ . Compute the shortest arrival times for $P_{1}, P_{2}, P_{3}$ , and $E$ to these points.
Step 2: Determine whether these points lie within $d_{13}^{'}$ . This is equivalent to verifying whether they satisfy $T_{E}^{*} (x) \leq \min_{i} T_{p_{i}}^{*} (x)$ for $i = 1, 2, 3$ .
Step 3: For the points identified in Step 2 that lie within $d_{13}^{'}$ , return the point that maximizes $\min_{i} T_{p_{i}}^{*} (x)$ for $i = 1, 2, 3$ .
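The filtering and selection in the last two steps reduce to a few lines once the arrival times are available. The sketch below assumes the times have already been computed for each candidate point; the function name, the tolerance, and the numbers in the example are hypothetical and serve only to show the max-min rule.

```python
def select_interception_point(candidates, tol=1e-9):
    """Pick the feasible candidate with the largest earliest-pursuer time.

    candidates: dict mapping a point label to (T_E, [T_p1, T_p2, ...]),
        the minimum arrival times of the evader and of each pursuer.
    Returns the selected label, or None if no candidate is feasible.
    """
    feasible = {}
    for name, (t_e, t_p) in candidates.items():
        earliest_pursuer = min(t_p)
        # Membership in the common region: E arrives no later than any pursuer.
        if t_e <= earliest_pursuer + tol:
            feasible[name] = earliest_pursuer
    if not feasible:
        return None
    return max(feasible, key=feasible.get)

# Made-up arrival times in seconds, for illustration only.
candidates = {
    "J1": (100.0, [100.0, 100.0, 104.0]),
    "K1": (98.0, [98.0, 101.0, 98.0]),
    "L1": (103.0, [99.0, 103.0, 103.0]),   # P1 arrives first, so L1 is infeasible
    "o1": (97.0, [102.0, 102.0, 102.0]),
}
print(select_interception_point(candidates))  # o1
```

The small tolerance matters in practice because candidates on a domain boundary satisfy the membership condition with equality, which numerical solutions reproduce only approximately.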

The above algorithm for computing interception points in non-degenerate three-versus-one pursuit–evasion scenarios requires four invocations of the interception domain intersection algorithm. The overall computational complexity is

O (m m_{i} m_{t} n)

. When

m_{t} = 300

,

n = 100

,

m_{i} = 3000,

and

m = 100

, the total computation time ranges approximately from 300 to 500 s.

Finally, we consider the trajectory determination in non-degenerate scenarios. For the Type I scenario, the evader and the two simultaneously intercepting pursuers follow their respective minimum-time trajectories to the interception point. The remaining pursuer also follows its minimum-time path to the interception point (though it does not arrive by the game’s conclusion). For the Type II scenario, all three pursuers follow minimum-time trajectories to the interception point. Here, the evader arrives earlier than required, so its trajectory is non-unique. In subsequent simulations, for demonstration purposes, we assume the evader first maintains no maneuver before switching to its minimum-time path to reach the interception point.

6. Orbital Multi-Versus-One Pursuit–Evasion Game

This section discusses pursuit–evasion games with

k \geq 4

pursuers. Analogous to Section 5, define interception domains

d_{i}

for each pursuer

P_{i}

and the evader

E

, where

i \in \{1, 2, \dots, k\}

, with their intersection denoted as

d_{1 k}^{'} = ⋂_{i = 1}^{k} d_{i}

. Define pairwise interception domains

d_{i j}

for

i, j \in \{1, 2, \dots, k\}, i < j

, each determined by pursuers

P_{i}

and

P_{j}

. The interception point in

d_{1 k}^{'}

must satisfy

x^{*} = \arg\max_{x} \min_{i} T_{i}^{*} (x), \quad \text{for } i \in \{P_{1}, P_{2}, \dots, P_{k}\} .

(16)

Since the probability measure of three circular-approximating convex domains with overlapping areas intersecting at a single point in the plane is zero, we disregard the possibility of all

d_{i j}

sharing a common intersection point for

k \geq 4

. Consequently, simultaneous interception by all pursuers becomes impossible. The game equilibrium exhibits fundamentally distinct characteristics beyond this demarcation threshold, justifying our focus on

k = 4

as the critical boundary. Here, the value of the threshold

k = 4

is a consequence of the planar problem geometry. An analysis of the same game in three dimensions would likely yield a higher threshold, as the increased spatial dimensionality allows for more complex domain intersections.

As the number of pursuers increases, an intuitive trend emerges: the likelihood of degenerate scenarios rises significantly. With high probability, not all pursuers are involved in the game. Conversely, solving non-degenerate scenarios becomes more tractable, with the primary computational bottleneck shifting to degeneracy determination.

Theorem 4.

In multi-vs.-one pursuit–evasion games with

k \geq 4

pursuers, the interception point in non-degenerate scenarios can only lie at pairwise intersection points of interception domains

d_{i}, i \in \{1, 2, \dots, k\}

.

Proof.

First, consider the boundary points of

d_{1 k}^{'}

. Without loss of generality, assume the interception point

x_{1}

is on the boundary segment

l_{1}

of

d_{1}

within

d_{1 k}^{'}

but not the intersection point of interception domains. Then,

T_{E}^{*} (x_{1}) = T_{p_{1}}^{*} (x_{1}) < T_{p_{i}}^{*} (x_{1}), \text{ for } i \in \{2, \dots, k\}

. There exists a neighborhood of

x_{1}

on

l_{1}

within

d_{1 k}^{'}

, which satisfies

T_{E}^{*} (x) = T_{p_{1}}^{*} (x) < T_{p_{i}}^{*} (x)

. Thus, according to (16),

x_{1}

must be the point with the largest

T_{p_{1}}^{*} (x)

in this neighborhood. However, this point can only be

N_{1}

.

N_{1} \in d_{1 k}^{'}

indicates that the game degenerates into a one-versus-one game defined by (

P_{1}, E

), which contradicts the non-degeneracy assumption.

Next, consider the interior of

d_{1 k}^{'}

. Without loss of generality, assume the interception point

x_{1}

satisfies

T_{p_{2}}^{*} (x_{1}) \leq T_{p_{1}}^{*} (x_{1}) \leq T_{p_{i}}^{*} (x_{1}), \text{ for } i \in \{3, \dots, k\}

.

Assume

T_{p_{1}}^{*} (x_{1}) > T_{p_{2}}^{*} (x_{1})

. Since

T_{p_{2}}^{*} (x)

must have an ascending gradient at

x_{1}

, there exists a neighboring point

x_{2} \in d_{1 k}^{'}

such that

T_{p_{1}}^{*} (x_{2}) > T_{p_{2}}^{*} (x_{2}) > T_{p_{2}}^{*} (x_{1})

. Here,

E

will be intercepted later at

x_{2}

, contradicting optimality. Thus,

x_{1}

must satisfy

T_{p_{1}}^{*} (x_{1}) = T_{p_{2}}^{*} (x_{1})

.

Assume there exists

i \in \{3, \dots, k\}

that satisfies

T_{p_{1}}^{*} (x_{1}) = T_{p_{2}}^{*} (x_{1}) < T_{p_{i}}^{*} (x_{1})

. Then

x_{1}

is the point with the longest interception time within its neighborhood in

\hat{h_{12}}

. However, this point must be

x_{12}^{*}

, leading to a contradiction. Thus, an interior interception point would require

T_{p_{1}}^{*} (x) = T_{p_{2}}^{*} (x) = \dots = T_{p_{k}}^{*} (x)

. However, such a point cannot exist for

k \geq 4

due to its measure-zero probability. □

Based on Theorem 4, we derive the method for computing the interception point. Due to the extreme complexity of degeneracy determination, we provide an iterative decision framework instead of explicit computational rules. The algorithm proceeds as follows:

Step 1: Solve $k$ reduced games by removing each pursuer $P_{i} (i = 1, 2, \dots, k)$ in turn, yielding games defined by $(P_{1}, \dots, \hat{P_{i}}, \dots, P_{k}, E)$ . Check if their interception points lie in $d_{1 k}^{'}$ (equivalent to verifying $T_{E}^{*} (x) \leq \min_{j = 1, \dots, k} T_{p_{j}}^{*} (x)$ ). If any interception point satisfies this, terminate: the game degenerates to this $k - 1$ -pursuer scenario.
Step 2: If Step 1 does not terminate, the game is non-degenerate. Compute all pairwise intersections of $d_{i} (i = 1, 2, \dots, k)$ . Calculate minimum times for $P_{1}, \dots, P_{k}$ and $E$ to reach these points.
Step 3: For intersections from Step 2, verify membership in $d_{1 k}^{'}$ (equivalent to $T_{E}^{*} (x) \leq \min_{i = 1, \dots, k} T_{p_{i}}^{*} (x)$ ).
Step 4: Among feasible intersections from Step 3, select the point maximizing $\min_{i = 1, \dots, k} T_{p_{i}}^{*} (x)$ .

The computational complexity of the above algorithm for solving interception points in pursuit–evasion games with

k

pursuers is dominated by the

C_{k}^{2}

computations of interception domain intersections. Therefore, the overall computational complexity is

O (k^{2} m m_{i} m_{t} n)

.
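As a quick check of the quadratic factor, assuming each pairwise intersection costs the same as in Section 5, the number of pairs and the resulting bound can be written as:

```latex
C_{k}^{2} = \binom{k}{2} = \frac{k (k - 1)}{2},
\qquad
\frac{k (k - 1)}{2} \cdot O (m m_{i} m_{t} n) = O (k^{2} m m_{i} m_{t} n),
\qquad
C_{4}^{2} = 6 .
```

For the smallest case covered by this section, k = 4, six pairwise intersection computations are therefore required.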

Upon determining the interception point, the trajectories of all players under game equilibrium are determined as their respective minimum-time paths to this point.

7. Simulation Example

To validate the algorithm’s efficacy, we simulate two two-vs.-one, three three-vs.-one, and one four-vs.-one pursuit–evasion examples. The two two-vs.-one examples correspond to the non-degenerate and degenerate scenarios in Section 4, respectively; the three three-vs.-one examples correspond to the two types of non-degenerate scenarios and the degenerate scenario in Section 5, respectively.

In all scenarios, the evader (reference satellite) is assumed to be in a geostationary orbit with angular velocity

ω = 7.27 \times 10^{- 5} rad / s

, orbital radius

r = 42,165.8 km

, and Earth-centered inertial coordinates

(29,815.7 km, 29,815.7 km)

. To keep the pursuers’ initial states physically consistent with the evader’s orbit, they are initialized on circular orbits in the same orbital plane. Consequently, their positions and velocities in the Earth-centered inertial frame satisfy

(v_{x}, v_{y}) = \sqrt{\frac{μ}{‖(r_{x}, r_{y})‖}} (\cos (\arctan (\frac{r_{x}}{r_{y}}) + \frac{π}{2}), \sin (\arctan (\frac{r_{x}}{r_{y}}) + \frac{π}{2})) .

(17)

where

μ

is Earth’s gravitational parameter. In the following, we present the Nash equilibrium results of the proposed algorithm in this paper under the aforementioned assumptions across four mission scenarios.
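The circular-orbit initialization can be sketched numerically. The speed is the circular value sqrt(μ/‖r‖) from Equation (17); for the direction, the sketch uses the unit vector perpendicular to the position (counter-clockwise motion) directly rather than the arctangent form, which is an assumption about the intended convention. The value of μ is the commonly quoted one and is not stated in the paper; the function name is hypothetical.

```python
import math

# Assumed value of Earth's gravitational parameter in km^3/s^2.
MU_EARTH = 398600.4418

def circular_velocity(r_x, r_y, mu=MU_EARTH):
    """Planar velocity (km/s) of a circular orbit through (r_x, r_y) in km.

    Direction: position vector rotated by +90 degrees (assumed prograde,
    counter-clockwise convention).
    """
    r = math.hypot(r_x, r_y)
    speed = math.sqrt(mu / r)
    return (-speed * r_y / r, speed * r_x / r)

# Evader position quoted in the text: (29,815.7 km, 29,815.7 km).
r_x, r_y = 29815.7, 29815.7
v_x, v_y = circular_velocity(r_x, r_y)
omega = math.hypot(v_x, v_y) / math.hypot(r_x, r_y)
print(v_x, v_y)
print(omega)  # rad/s, close to the geostationary angular velocity
```

With these inputs the angular rate comes out within about one percent of the value quoted for the evader, which is a convenient sanity check on the units.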

To ensure the credibility of the simulation results, a validation strategy based on internal consistency is employed. The following checks are performed: (1) The simulations are indeed based on the CW dynamics stated in Equation (2). The trajectories of all agents are generated by integrating these equations using their optimal control laws derived from (4)–(7). (2) For non-degenerate cases, the simultaneous interception time of the pursuers must agree within a numerical tolerance (<0.1 s). (3) The value of the Hamiltonian is monitored within a numerical tolerance (<

10^{- 5}

) throughout the trajectory.

7.1. Example 1

The initial states (position and velocity in the relative coordinate system) and maneuvering accelerations for both pursuers and the evader are provided in Table 3.

Table 3. Initial states of the players in Example 1.

Figure 9 illustrates the interception domains for Example 1. The equilibrium trajectories of all players and the distances between each pursuer and the evader are shown in Figure 10 and Figure 11, respectively. The results confirm that Example 1 represents a non-degenerate two-vs.-one pursuit–evasion scenario, where both pursuers simultaneously intercept the evader.

Figure 9. Sketch of the interception domain in Example 1.

Figure 10. Trajectories of the players in Example 1.

Figure 11. Distance between the evader and the two pursuers in Example 1.

7.2. Example 2

The initial states and maneuvering accelerations for both pursuers and the evader are provided in Table 4.

Table 4. Initial states of the players in Example 2.

Figure 12 illustrates the interception domains for Example 2. The equilibrium trajectories of all players and the distances between each pursuer and the evader are shown in Figure 13 and Figure 14, respectively. The results indicate that Example 2 represents a degenerate scenario in the two-vs.-one pursuit–evasion game. Only one pursuer intercepts the evader, while the other exerts no influence on the game outcome. Therefore, we assume that the non-participating pursuer

P_{2}

executes no maneuver.

Figure 12. Sketch of the interception domain in Example 2.

Figure 13. Trajectories of the players in Example 2.

Figure 14. Distance between the evader and the two pursuers in Example 2.

7.3. Example 3

The initial states and maneuvering accelerations for the three pursuers and the evader are provided in Table 5.

Table 5. Initial states of the players in Example 3.

Figure 15 illustrates the interception domains for Example 3. The equilibrium trajectories of all players and the distances between each pursuer and the evader are shown in Figure 16 and Figure 17, respectively. The results show that Example 3 represents a Type II non-degenerate scenario in the three-vs.-one pursuit–evasion game, where the three pursuers simultaneously intercept the evader.

Figure 15. Sketch of the interception domain in Example 3.

Figure 16. Trajectories of the players in Example 3.

Figure 17. Distance between the evader and the three pursuers in Example 3.

7.4. Example 4

The initial states and maneuvering accelerations for the three pursuers and the evader are provided in Table 6.

Table 6. Initial states of the players in Example 4.

Figure 18 illustrates the interception domains for Example 4. The equilibrium trajectories of all players and the distances between each pursuer and the evader are shown in Figure 19 and Figure 20, respectively. The results show that Example 4 represents a Type II non-degenerate scenario in the three-vs.-one pursuit–evasion game: pursuer $P_{2}$ constrains the evader's maneuvering, and under this constraint the remaining two pursuers, $P_{1}$ and $P_{3}$, intercept the evader simultaneously.

Figure 18. Sketch of the interception domain in Example 4.

Figure 19. Trajectories of the players in Example 4.

Figure 20. Distance between the evader and the three pursuers in Example 4.

7.5. Example 5

The initial states and maneuvering accelerations for the three pursuers and the evader are provided in Table 7.

Table 7. Initial states of the players in Example 5.

Figure 21 illustrates the interception domains for Example 5. The equilibrium trajectories of all players and the distances between each pursuer and the evader are shown in Figure 22 and Figure 23, respectively. Note that the initial states of Example 5 are the same as those of Example 3. However, the difference in maneuvering accelerations results in a degenerate scenario: $P_{1}$ exerts no influence on the game outcome, while $P_{2}$ and $P_{3}$ simultaneously intercept the evader.

Figure 21. Sketch of the interception domain in Example 5.

Figure 22. Trajectories of the players in Example 5.

Figure 23. Distance between the evader and the three pursuers in Example 5.

7.6. Example 6

The initial states and maneuvering accelerations for the four pursuers and the evader are provided in Table 8.

Table 8. Initial states of the players in Example 6.

Through numerical computation, we have determined that the three-versus-one pursuit–evasion game defined by $(P_{1}, P_{2}, P_{3}, E)$ constitutes a non-degenerate scenario in which all three pursuers simultaneously intercept the evader. The interception point $x_{123}^{*}$ satisfies $T_{P_{4}}^{*}(x_{123}^{*}) = 60.27\ \text{s} > T_{P_{1}}^{*}(x_{123}^{*}) = 56.41\ \text{s}$. Since $P_{4}$ cannot reach the interception point before the evader is captured, it does not participate in the game. Therefore, the overall game degenerates to a three-pursuer engagement. Specifically, it corresponds to the Type I non-degenerate scenario described in Section 5. The equilibrium trajectories of all players and the distances between each pursuer and the evader are shown in Figure 24 and Figure 25, respectively.

Figure 24. Trajectories of the players in Example 6.

Figure 25. Distance between the evader and the three pursuers in Example 6.
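The redundancy check on $P_{4}$ can be illustrated with a short sketch. It is not the solver used in this paper: it assumes the simplified setting of Appendix A (gravity neglected, zero initial velocities, straight-line motion at constant maximum acceleration), so that each arrival time is $t = \sqrt{2d/T}$ for a distance $d$. The function names `arrival_time` and `equal_time_point` are hypothetical, and taking the smaller root of the quadratic as the earliest common arrival is also an assumption of this sketch. Positions and accelerations are those of Table 8.

```python
import numpy as np

# Table 8: initial positions [km] and maximum accelerations [km/s^2].
PURSUERS = {
    "P1": (np.array([0.0, 5.0]), 3.2e-3),
    "P2": (np.array([0.0, -5.0]), 3.1e-3),
    "P3": (np.array([-5.0, 1.0]), 3.0e-3),
    "P4": (np.array([2.0, -5.0]), 3.0e-3),
}


def arrival_time(pos, acc, point):
    """Time to cover the straight-line distance from rest: d = 0.5 * acc * t**2."""
    return float(np.sqrt(2.0 * np.linalg.norm(point - pos) / acc))


def equal_time_point(p1, a1, p2, a2, p3, a3):
    """Point that three pursuers reach at the same time (simplified model).

    With u = (t**2 / 2)**2, pursuer i satisfies |x - p_i|^2 = a_i^2 * u.
    Subtracting the first equation from the other two is linear in x,
    so x = c0 + c1 * u; the first equation then becomes a quadratic in u.
    """
    A = 2.0 * np.array([p2 - p1, p3 - p1])
    b0 = np.array([p2 @ p2 - p1 @ p1, p3 @ p3 - p1 @ p1])
    b1 = -np.array([a2**2 - a1**2, a3**2 - a1**2])
    c0, c1 = np.linalg.solve(A, b0), np.linalg.solve(A, b1)
    e = c0 - p1
    qa, qb, qc = c1 @ c1, 2.0 * (e @ c1) - a1**2, e @ e
    disc = qb**2 - 4.0 * qa * qc
    if disc < 0.0 or qb >= 0.0:
        raise ValueError("no common arrival point in this simplified model")
    u = 2.0 * qc / (-qb + np.sqrt(disc))  # smaller positive root, stable form
    return c0 + c1 * u


(p1, a1), (p2, a2), (p3, a3) = (PURSUERS[k] for k in ("P1", "P2", "P3"))
x_star = equal_time_point(p1, a1, p2, a2, p3, a3)
times = {k: arrival_time(p, a, x_star) for k, (p, a) in PURSUERS.items()}
# P4 is redundant if it would arrive after the three-pursuer interception.
redundant = [k for k in ("P4",) if times[k] > times["P1"]]
print(x_star, times, redundant)
```

Under these assumptions the sketch places the common arrival time of $P_{1}$ to $P_{3}$ near 56.4 s and that of $P_{4}$ near 60.3 s, in line with the values reported above, so $P_{4}$ is flagged as redundant. With non-zero relative velocities or longer engagements the simplification no longer applies, and the full dynamics would be needed.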

As analyzed in Section 5 regarding the computational load of the algorithm, the primary computational cost of our method stems from the homotopy iteration over initial velocities. Because the relative initial velocity in Example 6 is zero, this step was unnecessary, and the computation time for this example was approximately 12.6 s, far less than that required by the reachable set algorithm of Jansson and Harris [25] for a similar scenario. Although the computational performance of their method under non-zero initial velocity remains unclear, this result indicates that our algorithm holds a computational efficiency advantage in scenarios with small relative initial velocities.

8. Conclusions

This paper establishes the interception domain theory and applies it to planar orbital multi-player “encirclement-capture” differential games. The main contributions are as follows:

(1) We propose the first rigorous definition of interception domains for orbital pursuit–evasion games, prove their convexity, and develop computational methods for domain intersections. This theoretical framework enriches the methodological toolbox for orbital pursuit–evasion problems, providing a novel perspective for analyzing complex multi-spacecraft interactions.
(2) Based on this theoretical foundation, we establish a complete classification of equilibrium outcomes for planar multi-pursuer interception games. This classification exhibits a hierarchical structure as the number of pursuers $k$ increases. For $k = 2$, the solution degenerates to a single-pursuer interception, or both pursuers intercept simultaneously. For $k = 3$, in addition to the aforementioned outcomes, new equilibria emerge where the evader is intercepted simultaneously by all three pursuers, or by a pair of pursuers under the constraining influence of the third. For $k \geq 4$, our analysis proves that no more than three pursuers can simultaneously intercept the evader. The results reveal a phenomenon of diminishing returns beyond three pursuers, establishing that a three-pursuer configuration generally provides the most efficient resource utilization for planar orbital defense while maintaining strong interception efficiency.
(3) We analyze the Nash equilibrium properties for these games and develop efficient solution algorithms, including novel low-complexity methods for degeneracy determination. Compared to existing methods, our approach is particularly suitable for scenarios requiring rapid situation assessment.

Regarding the differential game model of orbital multi-player “encirclement-capture” games, several research directions warrant further investigation: First, while this work focuses on planar scenarios, the interception domain framework can be naturally extended to three-dimensional orbital environments. In such settings, the interception domain becomes a volume, and the critical number of pursuers required for simultaneous interception is expected to increase—a promising direction for future research. Second, the degeneracy determination method for large-scale pursuer scenarios remains complex. Future work should establish concise criteria for evaluating individual pursuers’ strategic influence. Finally, leveraging “encirclement-capture” solutions could optimize the configuration design (quantity/spatial deployment) of pursuers.

Author Contributions

Conceptualization, X.L.; formal analysis, G.Z.; investigation, X.L.; methodology, X.L.; resources, Y.L.; software, X.Y.; supervision, X.Z.; validation, X.Z.; writing—original draft, X.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data is contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

TPBVP	Two-point boundary value problem
CW	Clohessy–Wiltshire
GEO	Geosynchronous Earth orbit

Appendix A

The Apollonius circle is defined as the set of points in a plane with a constant ratio of distances to two fixed points:

$$s = \left\{ (x, y) \;\middle|\; \frac{(x - x_{a})^{2} + (y - y_{a})^{2}}{(x - x_{b})^{2} + (y - y_{b})^{2}} = \lambda \right\}. \tag{A1}$$

When $\lambda > 0$ and $\lambda \neq 1$, (A1) describes a circle.

In the orbital one-versus-one game governed by dynamics (2), if gravitational influence is neglected ($\omega = 0$) and initial velocities are zero, the game reduces to a classical problem thoroughly analyzed in [2]. Under any boundary conditions specified in (7), the acceleration directions of $P$ and $E$ remain constant in the Nash equilibrium, so each player covers a distance of $\frac{1}{2} T t^{2}$ along a straight line. Consequently, every point on the interception domain boundary satisfies

$$\frac{(x - x_{P})^{2} + (y - y_{P})^{2}}{(x - x_{E})^{2} + (y - y_{E})^{2}} = \frac{\left(\frac{1}{2} T_{P} t^{2}\right)^{2}}{\left(\frac{1}{2} T_{E} t^{2}\right)^{2}} = \frac{T_{P}^{2}}{T_{E}^{2}}, \tag{A2}$$

where $t$ is the time for both players to reach the point simultaneously. Given $T_{P} > T_{E} > 0$, the interception domain is an Apollonius circle containing $E$, scaled according to the ratio of the players’ accelerations.
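As a numerical illustration of (A1), the following sketch computes the center and radius of an Apollonius circle and checks that it contains $E$. It relies on two assumptions that are not stated in the text: the closed-form center $(A - \lambda B)/(1 - \lambda)$ and radius $\sqrt{\lambda}\,|A - B| / |1 - \lambda|$, obtained by completing the square in (A1), and $\lambda = (T_{P}/T_{E})^{2}$, which follows from both players covering a distance $\frac{1}{2} T t^{2}$ from rest. The function name `apollonius_circle` and the numerical values are hypothetical.

```python
import math


def apollonius_circle(a, b, lam):
    """Center and radius of {X : |X - a|^2 / |X - b|^2 = lam}.

    Requires lam > 0 and lam != 1 (for lam == 1 the locus is a straight line).
    """
    if lam <= 0 or lam == 1:
        raise ValueError("lam must be positive and different from 1")
    cx = (a[0] - lam * b[0]) / (1.0 - lam)
    cy = (a[1] - lam * b[1]) / (1.0 - lam)
    radius = math.sqrt(lam) * math.dist(a, b) / abs(1.0 - lam)
    return (cx, cy), radius


# Illustrative values only: positions in km, accelerations in km/s^2,
# zero initial velocities and no gravity (the setting of this appendix).
pursuer, evader = (0.0, -6.0), (0.0, 0.0)
acc_p, acc_e = 2e-3, 1e-3

lam = (acc_p / acc_e) ** 2          # ratio of squared distances
center, radius = apollonius_circle(pursuer, evader, lam)
evader_inside = math.dist(evader, center) < radius
print(center, radius, evader_inside)
```

For these values the circle is centered on the line through $P$ and $E$, on the far side of $E$ from $P$, and the evader lies strictly inside it, consistent with the statement that the interception domain contains $E$ when $T_{P} > T_{E}$.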

Appendix B

The definitions of all points in sketches of the interception domain are as follows.

Table A1. Definitions of all points in sketches of the interception domain.

Point	Definition
$P_{1}$	The initial position of $P_{1}$ in the relative coordinate system.
$P_{2}$	The initial position of $P_{2}$ in the relative coordinate system.
$P_{3}$	The initial position of $P_{3}$ in the relative coordinate system.
$E$	The initial position of $E$ in the relative coordinate system.
$N_{1}$	The interception point in the one-versus-one pursuit–evasion game between $P_{1}$ and $E$.
$N_{2}$	The interception point in the one-versus-one pursuit–evasion game between $P_{2}$ and $E$.
$J_{1}, J_{2}$	The intersection points of $d_{1}$ and $d_{2}$.
$K_{1}, K_{2}$	The intersection points of $d_{1}$ and $d_{3}$.
$L_{1}, L_{2}$	The intersection points of $d_{2}$ and $d_{3}$.
$o$	The intersection point of $d_{12}$, $d_{13}$, and $d_{23}$.

References

1. Anderson, G.M.; Grazier, V.W. Barrier in Pursuit-Evasion Problems Between Two Low-Thrust Orbital Spacecraft. AIAA J. 1976, 14, 158–163.
2. Vajda, S.; Isaacs, R. Differential Games; Wiley: New York, NY, USA, 1965.
3. Shen, H.X.; Casalino, L. Revisit of the Three-Dimensional Orbital Pursuit-Evasion Game. J. Guid. Control Dyn. 2018, 41, 1823–1831.
4. Li, Z.Y.; Chen, S.; Zhou, C.; Sun, W. Orbital Multi-Player Pursuit-Evasion Game with Deep Reinforcement Learning. J. Astronaut. Sci. 2025, 72, 1.
5. Pontani, M.; Conway, B.A. Numerical Solution of the Three-Dimensional Orbital Pursuit-Evasion Game. J. Guid. Control Dyn. 2009, 32, 474–487.
6. Stupik, J.; Pontani, M.; Conway, B. Optimal Pursuit/Evasion Spacecraft Trajectories in the Hill Reference Frame. In Proceedings of the AIAA/AAS Astrodynamics Specialist Conference, Minneapolis, MN, USA, 13–16 August 2012; AIAA: Reston, VA, USA, 2012.
7. Zhang, C.G.; Zhu, Y.W.; Yang, L.P. An Optimal Guidance Method for Free-Time Orbital Pursuit-Evasion Game. J. Syst. Eng. Electron. 2022, 33, 1294–1308.
8. Zhou, J.F.; Zhao, L.; Li, H.; Cheng, J.H.; Wang, S. Compensation Control Strategy for Orbital Pursuit-Evasion Problem with Imperfect Information. Appl. Sci. 2020, 11, 1400.
9. Wang, Z.; Gong, B.; Yuan, Y.; Ding, X. Incomplete Information Pursuit-Evasion Game Control for a Space Non-Cooperative Target. Aerospace 2021, 8, 211.
10. Yang, B.; Liu, P.; Feng, J.; Li, S. Two-Stage Pursuit Strategy for Incomplete-Information Impulsive Space Pursuit-Evasion Mission Using Reinforcement Learning. Aerospace 2021, 8, 299.
11. Tang, X.; Ye, D.; Luo, S.; Low, K.-S.; Sun, Z. A Hybrid Game Strategy for the Pursuit of Out-of-Control Spacecraft under Incomplete-Information. Aerospace 2022, 9, 455.
12. Liu, P.; Yang, B.; Li, S.; Xin, M. Parameter-Optimized Pursuit Strategy for Orbital Games with Incomplete Information. J. Guid. Control Dyn. 2025, 48, 8.
13. Li, Z.Y.; Zhu, H.; Yang, Z. Saddle Point of Orbital Pursuit-Evasion Game Under J2-Perturbed Dynamics. J. Guid. Control Dyn. 2020, 43, 1733–1739.
14. Wang, H.; Zhang, Y.; Liu, H.; Zhang, K. Impulsive thrust strategy for orbital pursuit-evasion games based on impulse-like constraint. Chin. J. Aeronaut. 2025, 38, 103180.
15. Han, H.; Dang, Z. Optimal delta-v-based strategies in orbital pursuit-evasion games. Adv. Space Res. 2023, 72, 243–256.
16. Ma, H.; Zhang, G. Delta-V analysis for impulsive orbital pursuit-evasion based on reachable domain coverage. Aerosp. Sci. Technol. 2024, 150, 109243.
17. Li, Z.Y.; Zhu, H.; Luo, Y.Z. Orbital Inspection Game Formulation and Epsilon-Nash Equilibrium Solution. J. Spacecr. Rockets 2024, 61, 157–172.
18. Wang, C.; Chen, D.; Liao, W. Research on Maneuver Strategy in Satellite Observation and Counter-Observation Game. Adv. Space Res. 2024, 74, 3170–3185.
19. Jagat, A.; Sinclair, A.J. Optimization of Spacecraft Pursuit-Evasion Game Trajectories in the Euler-Hill Reference Frame. In Proceedings of the AIAA/AAS Astrodynamics Specialist Conference, San Diego, CA, USA, 4–7 August 2014; AIAA: Reston, VA, USA, 2014.
20. Jagat, A.; Sinclair, A.J. Nonlinear Control for Spacecraft Pursuit-Evasion Game Using the State-Dependent Riccati Equation Method. IEEE Trans. Aerosp. Electron. Syst. 2017, 53, 3032–3042.
21. Ratnoo, A.; Shima, T. Guidance laws against defended aerial targets. In Proceedings of the AIAA Guidance, Navigation, and Control Conference, Portland, OR, USA, 8–11 August 2011.
22. Li, Y.; Liang, X.; Dang, Z. Nash-equilibrium strategies of orbital Target-Attacker-Defender game with a non-maneuvering target. Chin. J. Aeronaut. 2024, 37, 365–379.
23. Li, Z.-Y. Orbital Pursuit–Evasion–Defense Linear-Quadratic Differential Game. Aerospace 2024, 11, 443.
24. Sun, S.; Zhu, H.; Wang, W. Orbital Three-Player Pursuit-Evasion Game. J. Astronaut. Sci. 2025, 72, 22.
25. Jansson, O.; Harris, M.W. A Geometrical, Reachable Set Approach for Constrained Pursuit–Evasion Games with Multiple Pursuers and Evaders. Aerospace 2023, 10, 477.
26. Jin, S.Y.; Qu, Z.H. Pursuit-Evasion Games with Multi-Pursuer vs. One Fast Evader. In Proceedings of the 2010 8th World Congress on Intelligent Control and Automation (WCICA), Jinan, China, 7–9 July 2010.
27. Chen, J.; Zha, W.; Peng, Z.; Gu, D. Multi-player pursuit-evasion games with one superior evader. Automatica 2016, 71, 24–32.
28. Zhao, L.; Zhang, Y.; Dang, Z. PRD-MADDPG: An Efficient Learning-Based Algorithm for Orbital Pursuit-Evasion Game with Impulsive Maneuvers. Adv. Space Res. 2023, 72, 211–230.
29. Shinar, J. Solution techniques for realistic pursuit-evasion games. Control Dyn. Syst. 1981, 17, 63–124.
30. Yan, R.; Shi, Z.; Zhong, Y. Reach-Avoid Games with Two Defenders and One Attacker: An Analytical Approach. IEEE Trans. Cybern. 2019, 49, 1035–1046.

Figure 1. State variables of the game.

Figure 2. Numerical simulation result of an interception domain.

Figure 3. Interception domain as an Apollonius circle.

Figure 4. Trajectory of $E$ to the intersection point.

Figure 5. Sketch of interception domains in two-versus-one pursuit–evasion game.

Figure 6. The interception point in the non-degenerate scenarios.

Figure 7. Sketch of interception domains in three-versus-one pursuit–evasion game.

Figure 8. Decision tree for determining degeneracy.

Table 1. Distinction between the concepts of the interception domain and the reachable set.

	Interception Domain	Reachable Set
Definition	The region reachable by a spacecraft before interception by another spacecraft.	The region reachable by a spacecraft within a specific time.
Parameter	Initial states of the two spacecrafts.	Initial states of the single spacecraft and the time.

Table 2. Two types of the degenerate scenarios in three-versus-one game.

Type I	Type II
There exists $N_{i} \in d_{13}^{'}$	There exists $x_{i j}^{*} \in d_{13}^{'}$ on the arc segment $\hat{h_{i j}}$
Degenerate into a one-versus-one game defined by $(P_{i}, E)$	Degenerate into a two-versus-one game defined by $(P_{i}, P_{j}, E)$

Table 3. Initial states of the players in Example 1.

Player	$x$ [km]	$y$ [km]	$v_{x}$ [km/s]	$v_{y}$ [km/s]	$T$ [km/s$^{2}$]	Orbit radius [km]
$P_{1}$	−4.08	2.90	$1.05 \times 10^{- 4}$	$- 2.98 \times 10^{- 4}$	$1.75 \times 10^{- 3}$	42,164.9
$P_{2}$	0	−6	$- 2.20 \times 10^{- 4}$	0	$2 \times 10^{- 3}$	42,161.5
$E$	0	0	0	0	$1 \times 10^{- 3}$	42,165.8

Table 4. Initial states of the players in Example 2.

Player	$x$ [km]	$y$ [km]	$v_{x}$ [km/s]	$v_{y}$ [km/s]	$T$ [km/s$^{2}$]	Orbit radius [km]
$P_{1}$	−1.97	−3.37	$- 1.24 \times 10^{- 4}$	$- 1.44 \times 10^{- 4}$	$1.72 \times 10^{- 3}$	42,161.9
$P_{2}$	0	−6	$- 2.20 \times 10^{- 4}$	0	$2 \times 10^{- 3}$	42,161.5
$E$	0	0	0	0	$1 \times 10^{- 3}$	42,165.8

Table 5. Initial states of the players in Example 3.

Player	$x$ [km]	$y$ [km]	$v_{x}$ [km/s]	$v_{y}$ [km/s]	$T$ [km/s$^{2}$]	Orbit radius [km]
$P_{1}$	7.46	5.42	$1.97 \times 10^{- 4}$	$5.44 \times 10^{- 4}$	$2.13 \times 10^{- 3}$	42,173.5
$P_{2}$	0	−6	$- 2.20 \times 10^{- 4}$	0	$2 \times 10^{- 3}$	42,161.5
$P_{3}$	−4.08	2.90	$1.05 \times 10^{- 4}$	$- 2.98 \times 10^{- 4}$	$1.72 \times 10^{- 3}$	42,164.9
$E$	0	0	0	0	$1 \times 10^{- 3}$	42,165.8

Table 6. Initial states of the players in Example 4.

Player	$x$ [km]	$y$ [km]	$v_{x}$ [km/s]	$v_{y}$ [km/s]	$T$ [km/s$^{2}$]	Orbit radius [km]
$P_{1}$	4.81	5.47	$1.99 \times 10^{- 4}$	$3.51 \times 10^{- 4}$	$3.3 \times 10^{- 3}$	42,173.0
$P_{2}$	−4.26	−3.76	$- 1.38 \times 10^{- 4}$	$- 3.11 \times 10^{- 4}$	$2.5 \times 10^{- 3}$	42,160.1
$P_{3}$	0	−6	$- 2.20 \times 10^{- 4}$	0	$2 \times 10^{- 3}$	42,164.9
$E$	0	0	0	0	$1 \times 10^{- 3}$	42,165.8

Table 7. Initial states of the players in Example 5.

Player	$x$ [km]	$y$ [km]	$v_{x}$ [km/s]	$v_{y}$ [km/s]	$T$ [km/s$^{2}$]	Orbit radius [km]
$P_{1}$	7.46	5.42	$1.97 \times 10^{- 4}$	$5.44 \times 10^{- 4}$	$2.22 \times 10^{- 3}$	42,173.5
$P_{2}$	0	−6	$- 2.20 \times 10^{- 4}$	0	$2 \times 10^{- 3}$	42,161.5
$P_{3}$	−4.08	2.90	$1.05 \times 10^{- 4}$	$- 2.98 \times 10^{- 4}$	$2.7 \times 10^{- 3}$	42,164.9
$E$	0	0	0	0	$1 \times 10^{- 3}$	42,165.8

Table 8. Initial states of the players in Example 6.

Player	$x$ [km]	$y$ [km]	$v_{x}$ [km/s]	$v_{y}$ [km/s]	$T$ [km/s$^{2}$]	Orbit radius [km]
$P_{1}$	0	5	0	0	$3.2 \times 10^{- 3}$	42,170.8
$P_{2}$	0	−5	0	0	$3.1 \times 10^{- 3}$	42,160.8
$P_{3}$	−5	1	0	0	$3 \times 10^{- 3}$	42,162.9
$P_{4}$	2	−5	0	0	$3 \times 10^{- 3}$	42,163.6
$E$	0	0	0	0	$1 \times 10^{- 3}$	42,165.8

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
