Next Article in Journal
Asymptotically Optimal Status Update Compression in Multi-Source System: Age–Distortion Tradeoff
Previous Article in Journal
Electrochemical Modeling Applied to Intercalation Phenomena Using Lattice Kinetic Monte Carlo Simulations: Galvanostatic Simulations
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Simulation-Based Two-Stage Scheduling Optimization Method for Carrier-Based Aircraft Launch and Departure Operations

College of Mechanical and Electrical Engineering, Harbin Engineering University, No. 145 Nantong Street, Nangang District, Harbin 150001, China
*
Author to whom correspondence should be addressed.
Entropy 2025, 27(7), 662; https://doi.org/10.3390/e27070662
Submission received: 22 May 2025 / Revised: 18 June 2025 / Accepted: 19 June 2025 / Published: 20 June 2025
(This article belongs to the Section Multidisciplinary Applications)

Abstract

The scheduling of carrier-based aircraft departure operations is subject to stringent temporal, spatial, and resource constraints. Conventional approaches struggle to yield exact solutions or provide a comprehensive mathematical description of this complex, dynamic process. This study proposes a simulation-based optimization method, establishing a high-fidelity simulation model for aircraft departure scheduling. To address the coupled challenges of path planning under spatial constraints and station matching/sequencing under operational constraints, we developed (1) a deep reinforcement learning (DRL)-based path planning algorithm (AAE-SAC), and (2) an enhanced particle swarm optimization (PSO) algorithm (LTA-HPSO). This integrated two-stage framework, termed LTA-HPSO + AAE-SAC, facilitates efficient, collision-free departure scheduling optimization. Simulation experiments across varying sortie scales were conducted to validate the framework’s effectiveness and robustness. Notably, for a complex scenario involving 24 aircraft with diverse priorities and stringent spatial constraints, LTA-HPSO + AAE-SAC achieved an average solution time of 185.19 s, reducing scheduling time by 26.18% and 49.54% compared to benchmark algorithms (PSO + Heuristic and PSO + SAC, respectively). The proposed LTA-HPSO + AAE-SAC framework significantly enhances the quality and robustness of carrier-based aircraft departure scheduling.

1. Introduction

Aircraft carriers are critical combat assets, and the safe and efficient execution of aircraft launch and recovery operations directly determines sustained strike capability. Enhancing launch and recovery efficiency has, thus, remained a focal point in carrier-based aircraft support scheduling research. Carrier-based aircraft operations have a significantly different launch phase compared with land-based aircraft operations [1,2]. This can be attributed to (1) increased scheduling complexity due to confined deck space; (2) tighter departure sequencing due to limited takeoff runway length and quantity; and (3) takeoff challenges due to dynamic deck motion during maritime operations. Hence, the development of rational departure scheduling plans to minimize the total launch time, reduce resource occupation duration, and maintain safe, orderly, and efficient deck operations is paramount for optimizing the departure scheduling process of shipboard aircraft.
Existing research on carrier-based aircraft scheduling has predominantly addressed full-process launch and recovery operations. Early studies, such as Jewell et al. [3], analyzed the impact of support operations on operational capability. Subsequent initiatives aimed to reduce planning time through decision support systems [4,5,6]. A significant milestone was the development of MIT’s Deck Operations Course of Action Planner (DCAP) [7], integrating autonomous scheduling algorithms with simulation validation to enable human-in-the-loop decision-making. Recent research has increasingly leveraged intelligent algorithms and simulation technologies. Ghosh Dastidar and Frazzoli [8] optimized support operations using queuing networks and differential evolution, considering nominal failure rates. Michini and How [9] formulated a Markov decision process incorporating inverse reinforcement learning for optimal scheduling policies, albeit facing scalability challenges with larger fleets. Zhang et al. [6] developed a simulation-based deck operation planning system incorporating human–computer interaction. Yuan et al. [10] modeled deck operations as an integer programming problem with hybrid periodic/event-driven scheduling, solved via a dual-population genetic algorithm, and demonstrated its effectiveness in dynamic environments.
However, dedicated research specifically focusing on the departure sub-process remains relatively limited. Wan et al. [11] abstracted the departure scheduling problem as a hybrid flow-shop model with blocking constraints, solved using cross-entropy algorithms and heuristic rules. Liu et al. [12] approached the same problem as a flexible job-shop scheduling problem, employing mixed-integer programming and a hybrid genetic-simulated annealing algorithm with rescheduling strategies. While these studies advance launch capability optimization, key limitations persist: (1) insufficient granularity in modeling the departure sub-process; (2) inadequate incorporation of critical spatial constraints, such as path interference and wake turbulence; and (3) premature convergence and susceptibility to local optima in single-model, single-algorithm approaches.
To address these gaps, this research focuses on optimizing the scheduling problem for carrier-based aircraft departure operations, explicitly emphasizing the unique characteristics of the constrained deck operation space. We adopt a simulation-based optimization approach, establishing both a high-fidelity simulation model and an associated mathematical programming model for aircraft departure operations. Crucially, the simulation model captures, in real-time, critical factors like path conflicts, wake interference, and multi-aircraft coordination—factors challenging to observe in conventional numerical simulations yet critically impacting operational capacity. Specifically, we developed a two-stage optimization framework: Stage 1 employs a deep reinforcement learning (DRL)-based path planning algorithm (AAE-SAC) to solve the spatial path optimization problem under dynamic constraints; Stage 2 utilizes an enhanced particle swarm optimization (PSO) algorithm (LTA-HPSO) to solve the resource allocation (station selection) and sequencing (departure order) problem. This integrated two-stage framework is termed LTA-HPSO + AAE-SAC. Comprehensive simulation experiments and comparative performance analysis were conducted to validate the feasibility and superiority of the proposed LTA-HPSO + AAE-SAC framework for carrier-based aircraft departure scheduling. While two-stage optimization has demonstrated feasibility in other scheduling domains [13], our work represents its first application to the complex problem of coordinated multi-aircraft departure scheduling within the highly constrained deck environment of an aircraft carrier.

2. Problem Description and Modeling

2.1. Problem Description

In naval carrier operations, carrier-based aircrafts typically execute missions in continuous, high-intensity, or mixed-task sortie patterns. Prior to mission execution, aircraft undergo maintenance, logistical support, and armament loading at designated support positions. To prevent interference with subsequent operations, serviced aircraft are relocated to assigned parking positions. Subsequently, under deck officers’ commands, they taxi to designated launch positions to await takeoff authorization [14]. The departure workflow comprises five operational phases (Figure 1):

2.1.1. Aircraft–Position Matching

Upon mission initiation, a predefined number of aircraft must depart to reach designated airspace. The required number of aircraft is determined by operational demands. When the number of serviced aircraft (post maintenance and logistical support) meets mission requirements, they are allocated to preassigned parking positions to await departure orders. If deck resources are insufficient, additional aircraft are transported from hangars to assigned parking positions.

2.1.2. Aircraft Transfer and Taxiing

Post-service, aircrafts are towed by tractors to parking positions. During standby, engines are activated for warm-up. Upon receiving departure instructions, pilots taxi the aircraft to predetermined launch-ready positions.

2.1.3. Jet Blast Deflector (JBD) Cooling and Reset

The JBD redirects engine exhaust during takeoff to mitigate deck interference, enhancing operational efficiency. Post-launch, the JBD undergoes mandatory cooling (requiring several minutes) to dissipate thermal loads before resetting. Only after JBD reset can subsequent aircraft advance from launch-ready positions to the catapult-assisted runway.

2.1.4. Preflight Inspection

Prior to launch, deck crews verify aircraft status, JBD alignment, catapult integrity, runway conditions, and pilot readiness.

2.1.5. Takeoff and Departure

Post-verification, pilots and launch officers execute departure upon deck commander’s command. Aircraft are then guided by launch officers to initiate catapult-assisted takeoff, completing the departure sequence.

2.2. Notation

  • Indices
i , j : Number of aircraft in the carrier, i = 1 , , n , j = 1 , , n , i j ;
  • Sets
P _ C : Set of all coordinate points within the 2D deck plane, P _ C = L p M p R p ;
L p : Coordinate set of the deck port-side area, L p P _ C ;
M p : Coordinate set of the deck landing area, M p P _ C ;
R p : Coordinate set of the deck starboard area, R p P _ C ;
P _ C F : Set of points of solid obstacles in the 2D deck plane, P _ C F P _ C ;
P i , P j : Set of geometric center points of the outer contours of aircraft i and j ;
A n g l e _ P i , A n g l e _ P j : Angle sets of the encountered aircraft i and j in the deck coordinate system;
P F j : Coordinate of the geometric center point of the j -th obstacle;
  • Parameter
v i   , v j : Current traveling speeds of the encountering aircraft i and j ;
Δ v i   , Δ v j : Speed increment of aircraft i and j ;
Δ θ i   , Δ θ j : Increment of the turning angle of aircraft i and j ;
Δ θ i max ,   Δ θ j max : Maximum turning angles of aircraft i and j ;
l F : Geometric length of aircraft within the 2D plane of the deck;
l C : Geometric length of the tractor in the 2D plane of the deck;
l W : Maximum geometric length of the wake generated by the aircraft during taxiing in a 2D plane;
d S min : Minimum safe distance before aircraft collision;
d F min : Minimum safe distance between the outer contours of aircraft and obstacles;
d S L i : Distance between the aircraft i and the outer contour of the deck port-side area;
d F L i : Distance between the aircraft i and the obstacles in the port-side area;
d S R i : Distance between the aircraft i and the outer contour of the deck starboard area;
d F R i : Distance between the aircraft i and the obstacles in the starboard area;
d F M i : Distance between the aircraft i and the obstacles in the landing area;
Pr i O : Initial priority of aircraft i departure operations;
Pr C i : Priority value of the type of aircraft i ;
Pr M i : Priority value of the mission type of aircraft i ;
Pr S i : Priority value of the total travel distance of aircraft i ;
Pr W i : Waiting time priority value for the departure operation of aircraft i ;
Pr S R i : Priority value of the remaining distance ratio of the aircraft i ;
S i , S j : Total length of the travel paths of aircraft i and i ;
T W i , T W j : Already waiting time for the transfer process of aircraft i and i ;
S C i , S C j : Remaining distance of aircraft i and j at the current moment;
  • Decision variables
d S : Safe distances for aircraft collision detection;
Pr i : Dynamic priority of aircraft i at the current moment;
S R i ,   S R j : Ratio of the remaining distances of aircraft i and j at the current moment;
C i : Whether conflicts occur during the scheduling of aircraft i or not;
T max : The maximum scheduling time for aircraft depature operation;
x i , P p : Whether aircraft i occupy the parking position ( P p ) or not;

2.3. Simulation Model Development for Carrier-Based Aircraft Departure Operations

We developed a discrete-event simulation platform using C + + and O2DES framework [15] to model spatial deck constraints. The main components of the platform are as follows:

2.3.1. Environmental Resource Modeling

The environmental resource model (Figure 2) includes deck, hangar, shipboard aircraft, tractors, parking positions, support positions, take-off positions, deflector plates, and catapults. After integrating the current mainstream aircraft carrier deck shapes, carrier-based aircraft shapes, and support and take-off models, a 2D simulation model of the deck operations was established, including 18 one-stop support positions [16], laying a foundation for visualizing the entire scheduling process for sortie departure operations. More details can be found in Figure 2. The one-stop maintenance position enables carrier-based aircraft to complete all pre-takeoff maintenance and support procedures at one location, eliminating the need to transfer between multiple different types of maintenance positions as in the past. This significantly reduces the time required for conventional support operation and represents the current design trend for advanced aircraft carriers and the future direction for unmanned aircraft carriers.

2.3.2. Collision Detection System

The basic collision model of carrier-based aircraft is constructed using the convex hull of the aircraft’s outer contour [17] and ray scanning [18], which is used to perceive obstacles in the deck operation environment, as shown in Figure 3. Under the convex-hull-model conditions, only the Euclidean distance between the aircraft and the target point needs to be calculated to determine whether there is a collision, significantly reducing the problem of increased computational load due to differences in the shape boundaries of different carrier-based aircraft models. Equation (1) is the condition formula for triggering the above judgment. Figure 3a,b show the pentagonal outer-contour shell model of the carrier-based aircraft in the folded and extended wing states, respectively. Figure 3c shows the collision detection model of the carrier-based aircraft. During the deck transportation process, 12 rays are formed with the geometric center point of the 2D model of the carrier-based aircraft as the center and a radius of three times the fuselage length to surround the aircraft and perceive the surrounding obstacles.
d S = 2 l F + 1.5 l C , Stage   of   transfer 2 l F + l W , Stage   of   taxiing R d = ξ d S ,
where ξ denotes the deck-specific safety coefficient, ξ 1 , 2 .

2.4. Collision Avoidance Strategies for Multi-Aircraft Coordination

Given the constrained deck space, the following hierarchical strategies can address collision risks during concurrent aircraft movements:

2.4.1. Basal Path Selection Protocol

Leveraging our prior inverse reinforcement learning (IRL) framework [19], we utilize a validated path repository containing the following:
  • Geo-spatial way-points (x, y coordinates);
  • Orientation angles at critical maneuvering nodes;
  • Temporal sequencing between consecutive way-points;
  • Velocity modulation patterns across path segments;
  • Integrated metrics for path length and execution duration.
In this work, these baseline trajectories were extended and refined through advanced collision avoidance strategies and dynamic priority mechanisms, thereby enhancing the operational applicability of the repository in complex multi-aircraft scheduling scenarios. This evolutionary approach ensures the availability of both theoretically sound and practically executable path alternatives while maintaining continuity with our previously validated methodologies.

2.4.2. Collision Avoidance Strategies for Aircraft Encounters

During multi-aircraft departure operations, three encounter scenarios may occur between moving aircraft, as shown in Figure 4.
(a)
Following scenario (A → C/D in Figure 4)
Priority-based yielding is implemented in constrained deck environments where overtaking is prohibited. Its strategy can be described as in Equation (2):
A n g l e _ p i A n g l e _ p j 0 ° , 5 ° 355 ° , 360 ° , d s = < P i P j , v i = v j = P i P j , v i = v i , v j = v j Δ v j > P i P j , v i = v i , v j = 0 v i = 0 , v j = v j ,
Here, P i P j represents the straight-line distance between the aircraft and the nearest point on the geometric outer contour. When d s > P i P j , it is further determined which of the two aircraft in the encounter will adopt the waiting and evasive strategy through priority. That is, at this time, the aircraft with higher priority continues to move, while the other waits in place.
(b)
Head-on scenario (A → B in Figure 4)
Conflicting aircraft execute synchronized maneuvers; this strategy can be expressed as in Equation (3):
A n g l e _ p i A n g l e _ p j 175 ° , 180 ° 180 ° , 185 ° , d s = < P i P j , v i = v j , θ i = θ i + Δ θ i , θ j = θ j + Δ θ j = P i P j , v i = v i Δ v i , v j = v j Δ v j , θ i = θ i + Δ θ i , θ j = θ j + Δ θ j > P i P j , v i = v i Δ v i , v j = v j Δ v j , θ i = θ i + Δ θ i max , θ j = θ j + Δ θ j max ,
Here, Δ v i , Δ v j , Δ θ i , Δ θ j are the increments in the speed and steering angle of the carrier-based aircraft during movement. When the safety distance is gradually reduced, encountering carrier-based aircraft avoid collision by decelerating and increasing the steering angle on the same side.
(c)
Crossing scenario (A → E/F/G/H in Figure 4)
For intersecting trajectories with maximum collision risk:
  • Mandatory velocity reduction
  • Differential steering governed by Equations (4) and (5):
A n g l e _ P i A n g l e _ P j 5 ° , 175 ° , d s = < P i P j , v i = v i , v j = v j Δ v j , θ i = θ i , θ j = θ j Δ θ j = P i P j , v i = v i , v j = v j Δ v j , θ i = θ i Δ θ i , θ j = θ j Δ θ j > P i P j , v i = v i , v j = 0 , θ i = θ i Δ θ i max , θ j = θ j Δ θ j max ,
A n g l e _ P i A n g l e _ P j 185 ° , 355 ° , d s = < P i P j , v i = v i , v j = v j Δ v j , θ i = θ i , θ j = θ j + Δ θ j = P i P j , v i = v i , v j = v j Δ v j , θ i = θ i + Δ θ i , θ j = θ j + Δ θ j > P i P j , v i = v i , v j = 0 , θ i = θ i + Δ θ i max , θ j = θ j + Δ θ j max ,

2.4.3. Obstacle Avoidance Protocol

Static/dynamic obstacles (deck edges, islands, parked aircraft) trigger:
When the obstacle potentially colliding with the carrier-based aircraft is the outer outline of the deck itself, as shown in Figure 5, the following three cases can be considered:
(1).
When the aircraft is located within the port-side area of the deck ( L p ), with its coordinate being P i , its collision avoidance strategy is determined based on Equation (6).
P i L p , d S min < d S L i d S , v i = v i Δ v i , θ i = θ i + Δ θ i = d S L i , v i = 0 , Wait   for   rescheduling ,
(2).
When the aircraft is located within the starboard-side area of the deck ( R p ), with its coordinate being P i , its collision avoidance strategy is determined based on Equations (7).
P i R p , d S min < d S R i d S , v i = v i Δ v i , θ i = θ i Δ θ i = d S R i , v i = 0 , Wait   for   rescheduling ,
(3).
When the carrier-based aircraft is in the deck landing area ( M p ), it will not collide with any physical boundary on the deck. Therefore, its motion remains unchanged as expressed in Equations (8).
P i M p , v i = v i , θ i = θ i ,
In the case of obstacles that can potentially collide with other aircraft, tractors, or other shipboard equipment, the situation can be divided into the following:
(4).
When the obstacle ( P F j ) is not a deck entity boundary and is located in the port-side area of the deck, then it satisfies the following expression:
P F j P _ C F L p , d F min = d S min , d F L i = d S L i , it s   handled   according   to   Equation   ( 6 )
(5).
When the obstacle ( P F j ) is not a deck entity boundary and is located in the starboard area of the deck, then it satisfies the following expression:
P F j P _ C F R p , d F min = d S min , d F R i = d S R i , it s   handled   according   to   Equation ( 7 )
(6).
When the obstacle is not a deck entity boundary and is located within the landing area of the deck, the judgment should be made based on the relative distance between the current aircraft ( P i ) and the encountered aircraft ( P F j ), as expressed in Equation (9).
P F j P _ C F M p , d F M i = P i P F j > d S , v i = v i Δ v i , θ i = θ i ± Δ θ i P i P F j d S , v i = v i Δ v i , θ i = θ i ± Δ θ i max P i P F j = d S min , v i = 0 , Wait   for   rescheduling ,

2.4.4. Dynamic Priority Assignment

Carrier-based aircraft are initially assigned departure priorities based on mission requirements. However, sole reliance on these pre-assigned priorities [20] fails to effectively coordinate multi-aircraft operations in dynamic deck environments, compromising operational efficiency and introducing safety hazards. Although collision avoidance measures (e.g., path replanning, velocity modulation, and course correction) mitigate conflicts, residual scheduling collisions persist. Ad hoc rescheduling for conflict resolution substantially degrades operational throughput and predisposes subsequent optimization processes to local optima. To address these limitations, this study develops an adaptive and real-time priority allocation mechanism that dynamically resolves scheduling conflicts during coordinated multi-aircraft operations.
  • Definition of initial priority
Typically, carrier-based aircraft departure priorities are primarily determined by their type and mission profiles, as detailed in Table 1. Within the same aircraft category, ground attack missions receive higher priority than air-to-air combat or routine patrols due to their increased armament payloads and extended operational range requirements.
As specified in Table 1, the initial priority of a carrier-based aircraft scheduled for departure operations is quantified using Equation (10), where a higher summation value corresponds to elevated mission priority.
Pr i O = Pr C i + Pr M i ,
  • Definition of dynamic priority
The dynamic priority of carrier-based aircraft during departure operations is computed through three critical parameters:
Total travel path length:
S i > S j , Pr S i = 0 , Pr S j = 1 S i S j , Pr S i = 1 , Pr S j = 0 ,
Accumulated waiting duration:
T W i > T W j , Pr W i = 1 , Pr W j = 0 T W i T W j , Pr W i = 0 , Pr W j = 1 ,
Remaining path ratio:
Let Remaining   Path   Ratio S R i = Length   of   aircraft   remaining   distance Total   length   of   aircraft   path = S C i S i , then the relationship between the remaining distance ratio and the priority can be expressed as in Equation (13):
S R i > S R j , Pr S R i = 0 , Pr S R j = 1 S R i S R j , Pr S R i = 1 , Pr S R j = 0 ,
The composite dynamic priority, defined by Equation (14), increases with higher computed values. During conflict resolution, aircraft with elevated priority retain operational continuity, mandating collision avoidance actions from lower-priority counterparts.
Pr i = Pr i O + Pr S i + Pr W i + Pr S R i ,

2.4.5. Rescheduling Protocol

Should persistent collisions occur after implementing avoidance strategies, the following protocol is activated:
  • Priority-driven re-queuing: Aircrafts with lower priority are re-queued according to the established hierarchy, resetting their scheduling sequence to the terminal position.
  • Conflict flagging: Higher-priority aircrafts maintain operational continuity while receiving conflict markers (denoted by C i ) for trajectory re-calibration.

2.5. Two-Stage Scheduling Optimization Framework for Aircraft Departure Operations

Based on defined operational characteristics, the scheduling optimization model incorporates the following assumptions:
(1)
Non-interruptible process: Departure operations proceed without suspension.
(2)
Predetermined positional states: Deployment coordinates for aircraft and support/parking/launch positions are predefined.
(3)
Persistent operational readiness: All equipment maintain baseline functionality throughout operations.
(4)
Exclusive tractor assignment: Each tractor tows a single aircraft with independent workflows.
(5)
Aircraft-specific timing: Warm-up, wake separation, and pre-launch durations are type-dependent constants.
To reflect the operational integrity and deck space characteristics of the aircraft, a two-stage scheduling optimization model is proposed. First, the rationality of the station selection and departure order decision in departure operation scheduling should be ensured. Second, considering the dynamic changes in the deck space and the characteristics of multi-aircraft cooperative operation, the rationality of the carrier-based aircraft scheduling path in sortie and departure operations should be ensured. For details, see Equations (15) and (16)–(27) as the constraint conditions.
min G = min λ 1 T max + λ 2 C o n f l i c t + λ 3 V a r L + λ 4 C o n t r o l ,
S.→t.
min T max = min T _ t r a c t o r max + T _ s t a n d b y max + T _ t a x i i n g max + T _ p r e f l i g h t max + T _ c a t a p u l t max
λ 2 c o n f l i c t = λ 2 C a t a p l u t k = 1 M i = 1 K i 1 max 0 , Δ T C a t a p l u t t k , i + t k , i + 1 + λ 2 C o l l i s i o n i = 1 N C i
C i = 0 , no   collision   in   the   path   of   aircraft   i 1 , collision   occurred
V a r L = 1 N i = 1 N L i L a v e r a g e 2
L i = L i , In   the   coordinated   collision   avoidance   strategy ,   the   aircraft   i   does   not   need   to   wait L i + Δ l i , In   the   coordinated   collision   avoidance   strategy ,   the   aircraft   i   does   not   need   to   wait
L a v e r a g e = 1 N i = 1 N L i
Δ l i = T W i v i
C o n t r o l = 1 T i 0 T i λ s p e e d a s p e e d t + λ s t e e r ψ s t e e r t d t
Pr i > Pr j T _ o f f k , i < T _ o f f k , j
i = 1 N p x i , P p 1 , and   P p The   set   of   parking   position   of   aircraft , x i , P p 0 , 1
i j , When   aircraft   i   is   behind   aircraft   j ,   then t i t j Δ T w a k e
min T max T B a s e
Here, G represents the total dispatch completion time for the entire departure operation, λ 1 ~ λ 4 represent the weight coefficients of each sub-item in the target optimization function, which is used to reflect the importance of the optimization target; Equation (16) is the form of the optimization function for the scheduling duration of carrier-based aircraft departure operations, where T _ t r a c t o r max is the maximum towing time from the support position to the parking position of the aircraft; T _ s t a n d b y max is the maximum waiting time at the parking position; T _ t a x i i n g max is the maximum taxiing time from the parking position to the take-off waiting position; T _ p r e f l i g h t max is the maximum waiting time at the take-off waiting position for the aircraft to complete several pre-takeoff procedures such as waiting for the deflector to cool and reset, entering the take-off position, connecting the catapult device, and pre-flight inspection.
Since these procedures have strict precedence and subsequent relationships, and there are no other aircrafts interfering after entering the take-off waiting position, the completion time of these procedures is collectively represented; T _ c a t a p u l t max is the maximum take-off time of the aircraft on the catapult device, comprising five parts; Equation (17) is the joint penalty term in the optimization process of departure operation scheduling, that is, when the time ( t k , i + 1 ) interval between the arrival of the aircraft ( i + 1 ) at the take-off waiting position ( k ) and the catapult time ( t k , i ) of the preceding aircraft is too small, less than the standard interval time ( Δ T c a t a p u l t ), a penalty will be triggered; a penalty is also triggered if a collision occurs during its scheduling path. Equation (18) is the constraint condition of the collision penalty indicator function; Equation (19) is the sum of variances of the aircraft paths in the current scheduling plan, used to reflect the balance of the scheduling paths; Equations (20)–(22) are the constraint conditions for the sum of variances of the aircraft scheduling paths; Equation (23) is the aircraft path control smoothing function; Equation (24) is the consistency constraint for the aircraft departure operation scheduling, that is, if aircraft i is at the takeoff position k and its dynamic priority is higher than aircraft j , then its catapult departure time will be earlier; T _ o f f k , i < T _ o f f k , j , T _ o f f k represents the actual departure time of the aircraft on the catapult runway k ; Equation (25) is the uniqueness constraint for deck parking position allocation, that is, at most one aircraft can be parked at each parking position; Equation (26) is the wake interval constraint during the aircraft scheduling process; Equation (27) is the overall time constraint of the aircraft departure operation scheduling.
The T _ c a t a p u l t max term in Equation (16) can be disregarded in actual optimization as the catapult take-off process is typically completed within a few seconds and is related to its mechanical device, which is outside the scope of scheduling optimization. Therefore, Equation (16) can be updated to Equation (28):
min T max = min T _ t r a c t o r max + T _ s t a n d b y max + T _ t a x i i n g max + T _ p r e f l i g h t max

3. Design of a Two-Stage Scheduling Model Solving Algorithm for Carrier-Based Aircraft Departure Operations

Existing research on carrier-based operations frequently neglects critical deck spatial characteristics or isolates specific problems such as scheduling resource allocation [21] and path planning [22]. However, practical operational scenarios necessitate integrated scheduling–path coordination. Although heuristic-based optimization methods demonstrate efficacy for small-scale scheduling, their performance degrades significantly under randomly scaled scenarios with dynamic parameters, often yielding suboptimal efficiency and solution quality.
Recent advances in deep architectures have substantially enhanced deep reinforcement learning (DRL) performance across industrial domains [23,24,25,26,27]. While prior investigations on deck scheduling paths [19] acquired feasible trajectories for carrier-based aircraft operations, most paths originated from imitation learning in constrained scenarios—limited to single-aircraft scheduling, static obstacle avoidance, or isolated encounter resolution. Consequently, these paths fail to capture spatial dynamics and multi-aircraft coordination imperatives inherent to departure operations. Nevertheless, this study repurposes prior path data as an alternative library and initial training corpus for DRL. These trajectories populate an experience replay pool, enabling DRL-based refinement of multi-aircraft collaborative scheduling paths. Furthermore, we retain the particle swarm optimization (PSO) algorithm [28,29,30]—widely adopted for sequence decision-making—to resolve the following:
(1)
Aircraft parking position assignment;
(2)
Take-off station selection;
(3)
Take-off sequence determination;
Section 3.1 details DRL implementation with problem-specific adaptations, while Section 3.2 elaborates the enhanced PSO framework.

3.1. Design of AAE–SAC Algorithm

3.1.1. Design of State Space, Action Space, and Reward Function

The trajectory optimization problem in carrier-based aircraft departure operations can be considered a Markov decision process (MDP). In this study, since a 2D continuous visualization simulation was adopted to study scheduling optimization problems, the state transition of the MDP was implicitly realized through dynamic changes in the deck operation environment in the simulation system, rather than through a conventional explicit structure of the state transition matrix. Therefore, the core of this problem was to reasonably design and describe its state space (S), action space (A), and reward function (R).
Under the premise of carrier-based aircraft departure operations in coordination, the state space must consider the state of the aircraft itself, as well as the states of at least two aircrafts around it, and the environmental information of the deck space. Therefore, the state space can be expressed as in Equation (29):
S = State   of   itself 5 Dimension , State   of   other   aircraft 10 Dimension , Environmental   information 6 Dimension ,
Here, the self-state includes the current position coordinates x i , y i of aircraft, the speed v i , the heading angle θ i , and the dynamic priority Pr i ; the environmental information includes the coordinates x o b s , y o b s of the nearest obstacle to the aircraft, its target position coordinates x t arg e t , y t arg e t , and the direction-angle deviation Δ φ x , Δ φ y = sin θ i θ t arg e t , cos θ i θ t arg e t between the current position and the target position of the aircraft.
In the two-stage scheduling optimization problem of departure operations on an aircraft carrier, the first stage generates and optimizes the station position and take-off sequence problem in its scheduling scheme, while the path planning problem in the second stage runs through the entire departure operation from beginning to end. Therefore, the action at any moment in the MDP can be represented by the acceleration and angular acceleration of the carrier-based aircraft, as expressed in Equation (30):
A = a s p e e d ψ s t e e r , and   a s p e e d 0.5 , 0.5 m s 2 ψ s t e e r 30 , 30 ° s ,
Although preliminary work has obtained the basic path library and a clear collision avoidance strategy, achieving better and more realistic new paths in the simulation optimization process that are closer to the actual situation of departure operations is anticipated. Therefore, the design of the reward function should not only consider the completion time reward of the path but also consider the quality of the path, priority, and safety in collaborative operation, etc., which can be expressed as in Equation (31):
R = i = 1 N α r t i m e T i + β r p a t h i + δ Pr k r p r i o r i t y T i + ε r s a f e t y i ,
α + β + δ + ε = 1 , α , β , δ , ε 0 , 1 ,
r t i m e T i = e T t / T b a s e _ a l t 2 , T T b a s e _ a l t log T T b a s e _ a l t + 1 , T > T b a s e _ a l t ,
r p a t h i = γ s i m P i , P b a s e _ a l t + η k = 1 K I n e w _ p a t h ,
I n e w _ p a t h = 1 , When   the   k - th   sub - path   becomes   the   new   path 0 , O t h e r s ,
s i m P i , P b a s e _ a l t = 1 1 + min π i , j π τ i τ j b a s e _ a l t 2 ,
τ i = Δ x Δ T , Δ y Δ T , max a t , s t d θ t ,
r p r i o r i t y T i = 2 1 T i T b a s e _ a l t , T i < 0.9 T b a s e _ a l t 1 T i T b a s e _ a l t 2 , O t h e r s ,
Pr k = 1.5 , Pr i 6 1 , Pr i 4 0.5 , O t h e r s ,
r s a f e t y i = 100 i π C i + 20 i = 1 N I w a k e i + 5 d s a f e _ a l t d min i ,
I w a k e i = 1 , Δ T i , j Δ T w a k e 0 , O t h e r s ,
Equation (32) represents the weight factors for each sub-item reward. Equations (33)–(41) are the sub-item constraints of the joint reward function of Equation (31), where Equation (33) is the reward function for the completion time of the scheduling path, adopting a nonlinear formula to process it in stages, which is beneficial to improving the training efficiency and enhancing the marginal effect; Equation (34) is the reward function for exploring new paths, based on the existing paths; this paper introduces a dynamic time warping mechanism [31,32] to divide the current path into multiple sub-path segments and compare them with the basic path to improve the efficiency of exploring new paths; Equations (35)–(37) are the constraints of the reward function for exploring new paths; Equations (38) and (39) constitute the reward function for different task priorities, emphasizing the dynamic reward mechanism of prioritizing high-priority tasks while completing path planning; Equation (40) is the reward function for path safety, which includes penalties for unsafe behaviors, such as collision, wake turbulence, and safe driving distance; Equation (41) is the constraint condition of the wake turbulence penalty term.

3.1.2. Dynamic Target Entropy and Attention Mechanism

The soft actor–critic (SAC) is a reinforcement learning algorithm based on the actor and critic (A–C) framework, capable of efficiently handling optimization problems in continuous action spaces. Therefore, in this study, a deep reinforcement learning network based on the SAC algorithm was adopted to solve the multi-aircraft cooperative path planning problem in the optimization of carrier-based aircraft departure operations. However, unlike the commonly used fixed-entropy strategy, for aircraft operating in a known basic path and in a confined space, the issues of multi-aircraft cooperation and the utilization of the basic path still needs to be addressed. This makes it difficult for the fixed-entropy strategy to adapt to such a complex operational environment in terms of exploration ability and efficiency. Hence, an improvement is proposed by dynamically adjusting the target entropy [33], and its mathematical expression is expressed as in Equation (45). At the same time, an attention mechanism [34] is introduced into the algorithm, focusing on the nearest obstacle and its dynamic priority changes in the flight path of the carrier-based aircraft, thus achieving focused learning of high-dimensional key features in complex simulation environments. Compared with previous studies [35] that determined the observation focus of carrier-based aircraft departure and recovery capabilities through extensive calculations and the establishment of system evaluation indicators, the SAC algorithm improved with the attention and entropy adjustment strategy and has better adaptability and learning efficiency for the carrier-based aircraft scheduling optimization problem. This algorithm is named attentive adaptive entropy SAC (AAE–SAC) and is mathematically expressed as in Equations (42)–(44). Figure 6 shows the network structure diagram.
s S R 21 , w s = Softmax W 2 ReLU W 1 s + b 1 + b 2 ,
s a t t n = w v , θ , p , Δ x = w v v , w θ θ , w p p , w Δ Δ x ,
w s R 4 , w = w v , w θ , w p , w Δ   and   w i = 1 ,
The MDP state space of the carrier-based aircraft in the departure operation environment can be described as a 21-dimensional state space. When inputted to the SAC deep reinforcement learning network, the 21-dimensional state features can be weighted and focused on the nearest obstacle and its dynamic priority with respect to the current carrier-based aircraft through Equations (42) and (43) through the attention mechanism, thereby guiding the training direction of the actor policy network; W 1 , W 2 , b 1 , b 2 are network parameters.
H t arg e t k t = H h _ b a s e k + β C i k C max + α a t t n 1 w p ,
H h _ b a s e k = dim a 1 T h _ b e s t k T c u r r e n t k ,
π S g , α R   and   H π t = E s ~ p π log π a s ,
α t + 1 = α t exp η H π t H t arg e t k t ,
Equation (45) is the mathematical expression for the dynamic target entropy; α a t t n 1 w p represents the compensation term of the policy entropy when the priority weight obtained by the current attention mechanism is insufficient; Equation (46) is the expression form of the reference entropy, the value of which is related to the dimension of the action space and the time used for scheduling; Equation (47) is the expression form of the entropy under the current policy. Through Equations (45)–(47), the update rule of the entropy coefficient in Equation (48) can be obtained, thereby achieving feedback and optimization of the policy of the actor network.
Following the above discussion, the network structure of the AAE–SAC algorithm can be summarized as mainly comprising the actor network, the critic main network, the critic target network, and the experience replay pool containing known path samples. Among them, the critic main network adopts a dual Q-network design [36], and its corresponding target network is also a dual-target network. The dual-network structure can avoid the overestimation of the Q-values and poor stability caused by noise in a single network or abnormal sample data, thereby avoiding high-risk actions during carrier-based aircraft scheduling and improving operational safety. The pseudocode specification for the AAE-SAC algorithm is formally detailed in Algorithm 1.
Specifically, the actor policy network includes an input layer, which is used to receive the state space parameters of the carrier-based aircraft sortie and departure operation simulation, and two fully connected neural networks with 64 neurons in the middle layer. The structure of the critic dual network and target network is the same, both including an output layer and two fully connected neural networks with 128 neurons. The critic target network synchronizes the parameters of the critic main network through a soft update mechanism to improve training stability; the activation function of all the above networks is the ReLU function.
Algorithm 1. Attentive Adaptive Entropy SAC (AAE–SAC) Algorithm
Require: Number of aircraft: N , State   space :   S R 21 , Action   space :   A = a s p e e d ψ s t e e r , Base   paths :   P b a s e _ a l t , Maximum   episodes :   E max
Ensure:                      Optimized   policy   π , Learned Q-functions: Q ϕ 1 , Q ϕ 2
1: Initialize networks:
2:       Actor   π θ ,   Critics   Q ϕ 1 , Q ϕ 2
3:       Target   networks   Q ¯ ϕ 1 Q ϕ 1 , Q ¯ ϕ 2 Q ϕ 2
4:       Replay   buffer   D
5:       Temperature   α 1.0
6: for episode  e 1  to  E max  do
7:            Initialize   state   s 0
8:            for   step   t 1   to   T max  do
9:                    Compute attention weights:
                                   w s Softmax W 2 ReLU W 1 s + b 1 + b 2
10:                         Apply attention:
                                     s a t t n w v v , w θ θ , w p p , w Δ Δ x
11:                         Select action:
                                     a t π θ s a t t n + N 0 , σ
12:                           Execute   a t ,   observe   r t   and   s t + 1
13:                        Compute reward components:
                            r t i = 1 N α r t i m e T i + β r p a t h i + δ Pr k r p r i o r i t y T i + ε r s a f e t y i
14:                          Store   transition   s t , a t , r t , s t + 1 in D
15:                          if update time then
16:                                Sample batch from D
17:                                Compute target Q-values:
                                       Q ~ r + γ min j = 1 , 2   Q ¯ ϕ j s , π θ s α log π θ a s
18:                                   Update critics:
                                                    ϕ j ϕ j η Q ϕ j 1 B Q ϕ j s , a Q ~ 2
19:                                   Update policy:
                                    θ θ η π θ 1 B α log π θ a s min j = 1 , 2 Q ϕ j s , π θ s
20:                                   Update temperature:
                                                   α t + 1 α t exp η H π t H t arg e t k t
21:                                 end if
22:                     end for
23:       end for
24:  return π θ ,   Q ϕ 1 , Q ϕ 2

3.2. Design of LTA-HPSO Algorithm

3.2.1. Particle Swarm Coding and Decoding

As mentioned previously, this paper adopts the PSO algorithm to solve station selection and take-off sequencing problems in shipboard aircraft departure operations. As a swarm intelligence algorithm, PSO features simple implementation, rapid convergence, and excellent hybridization compatibility. Its explicit encoding scheme provides superior interpretability for scheduling optimization. To simultaneously capture critical information on parking positions, take-off positions, and take-off sequences, we design a hierarchical encoding method [37] as follows:
For station selection from the support position to the parking position, we suppose there are currently five parking positions available on the deck, and x i is a continuous-value variable reflecting the selection of the parking position by the carrier-based aircraft. If x i 0.6 , 0.8 0 , 1 , it indicates that aircraft i has chosen the fourth parking position among the five available parking positions.
For station selection from the parking position to the take-off position, we suppose there are currently three take-off positions available on the deck, and y i is a continuous-value variable reflecting the selection of the take-off position by the carrier-based aircraft. If y i 0.33 , 0.66 0 , 1 , it indicates that the aircraft i has chosen the second take-off position among the three available take-off positions.
The determination of the take-off sequence is not only based on the sequence of the operation times but also should fully consider the priority of the departure tasks. Therefore, the take-off sequence is related to the priority of the aircraft operation. Suppose z i is a variable reflecting the take-off sequence of the aircraft, and z i = 1 Pr i 0 , 1 , the lower the value, the earlier the take-off sequence.
For the departure operation of N carrier-based aircraft on the deck, the complete form of the particle swarm coding that determines the station selection and take-off sequence can be expressed as in Equation (49), with a total particle dimension of 3N.
X = x 1 , x 2 , , x N Code   of   parking   position , y 1 , y 2 , , y N Code   of   take - off   position , z 1 , z 2 , , z N Code   of   take - off   sequence ,

3.2.2. Population Division

If the flight deck is considered the search space and the carrier-based aircraft as particles, then for a deck space with M parking positions, L take-off positions, and N carrier-based aircraft, its search space can be expressed as in Equation (50). In medium- and large-scale continuous scheduling of carrier-based aircraft operations, the search space will be significantly large. To reduce the computational load caused by the large search space, based on the idea of dynamic cluster division [38], the carrier-based aircraft particles in the deck space are divided into multiple sub-populations according to the task coupling degree and sub-population size control, as expressed in Equations (51)–(54); subsequently, multiple sub-populations are optimized in parallel, and the optimal solutions of each sub-population are merged to form the optimal dispatching plan for the departure of this batch of carrier-based aircraft.
S e a r c h S p a c e = M N × L N × N ! ,
M C o u p l i n g i j = α Task   temporal   overlap   degree i , j + β The   degree   of   competition   for   take - off   positon i , j
Here, α , β are the weight coefficients in the emphasis direction of the task coupling relationship, α , β 0 , 1 ;
Task   temporal   overlap   degree i , j = max 0 , min T e i T e j max T s i T s j max T e i T e j min T s i T s j
The   degree   of   competition   for   take - off   position i , j = l = 1 L P o f f i l P o f f j l
The   number   of   aircraft   in   the   sub - population : 1 N _ S u b max = N

3.2.3. Fitness Function

During the scheduling process, an improved algorithm based on PSO is adopted to handle the station selection and take-off sequence decision-making problems in the operation, while the AAE–SAC algorithm solves the real-time control of the path. Therefore, it is inevitable that the scheduling plan generated by PSO cannot necessarily ensure that all carrier-based aircraft can complete the departure operation smoothly. That is, there may be conflicts when the time intervals between the arrival of certain carrier-based aircrafts at the take-off position are too small. There may also be situations where the operation is completed but does not necessarily conform to the priority order of the carrier-based aircraft. Therefore, in the design of the fitness function, it is not only necessary to consider whether the final completion time of the optimization plan is optimal but also whether the handling of high-priority tasks in the plan is reasonable and whether the take-off sequence meets the safety intervals of the take-off time. The specific form is expressed as in Equations (55)–(57).
f = max Pr i T i + λ P l a u n c h Time   conflict   penalty   term ,
λ t + 1 = λ t 1 + l = 1 L C o f f l ,
F = 1 f ,
Equation (55) is the expression for the fitness function variable f that reflects the relationship between priority and time conflict functions; Equation (56) is the adaptive adjustment function for the penalty time coefficient in Equation (55), that is, the greater the number of take-off interval conflicts in the current iteration process, the greater the penalty time coefficient in the next round of iteration; and Equation (57) is the fitness function expression.

3.2.4. Update Mechanism

In the proposed particle update mechanism, adaptive adjustments to the learning factors in the particle velocity update are introduced (Equations (58)–(61)), replacing conventional static methods. Laplace-distributed perturbation is applied to these learning factors c 1 , c 2 [39], where the long-tail characteristic facilitates exploration beyond current optimal regions, thereby enhancing the algorithm’s ability to escape local optima. Concurrently, a tabu list mechanism [40] is incorporated to systematically exclude non-improving or revisited solutions during iterations (Equations (62)–(63)), significantly boosting computational efficiency while ensuring solution quality.
v i t + 1 = ω v i t + c 1 L a p l a c e 0 , b Individual   perturbation p b e s t x i t + c 2 L a p l a c e 0 , b Social   perturbation g b e s t x i t
x i t + 1 = x i t + v i t + 1 ,
ω = ω max ω max ω min t t max ,
b = t t max ,
Here, L a p l a c e 0 , b is the Laplace distribution with location parameter 0 and scale parameter b ; t is the current iteration number, and t max is the total number of iterations.
f x n e w > μ f + α σ f ,
T n e w = T b a s e + β f x f b e s t f b e s t ,
Equation (62) represents the degree of deterioration of the solution generated in the current iteration. If the fitness value f x n e w of the current solution is greater than the sum of the average fitness value μ f and the weighted fitness standard deviation α σ f of the current population, where α 1 , 3 is the coefficient of solution deterioration significance, it indicates that the current solution has significantly deteriorated and should be placed in the tabu list. Equation (63) is the expression for the dynamic validity period of the tabu list, that is, the worse the current solution f x compared with the historical optimal solution f b e s t , the longer the validity period of its tabu list. T b a s e is the initial validity period of the tabu list, and β 1 , 3 is the sensitivity coefficient of the dynamic validity period.
In summary, a detailed discussion on the encoding and decoding, population division, fitness function, and particle update mechanism related to the PSO algorithm was made. Thus, the optimization algorithm LTA-HPSO was established for solving the problem of choosing the station position and determining the take-off sequence of the carrier-based aircraft. Figure 7 shows the algorithm flowchart. The pseudocode specification for the LTA-HPSO algorithm is formally detailed in Algorithm 2.
Algorithm 2. Laplacian–Tabu Augmented Hierarchical PSO (LTA-HPSO) Algorithm
Require: Number of aircraft: N
              Number of parking positions: M
               Number of departure positions: L
                Maximum   iterations :   t max
                Initial   population :   X = x 1 , x 2 , , x N , y 1 , y 2 , , y N , z 1 , z 2 , , z N
                Priority   of   each   aircraft :   Pr i   for   i = 1 , 2 , , N
Ensure: Optimal parking position selection and departure order: X b e s t
               Minimum fitness value: F b e s t
1: Initialize population X   and   velocities   v i for all particles
2: Initialize X b e s t X
3: Initialize   F b e s t
4: for t 1   to   t max  do
5:       for each particle i in the population do
6:               Update   velocity   v i using Laplacian perturbation:
  v i t + 1 ω v i t + c 1 L a p l a c e 0 , b p b e s t x i t + c 2 L a p l a c e 0 , b g b e s t x i t
7:               Update   position   x i :
                                                  x i t + 1 x i t + v i t + 1
8:                Evaluate   fitness   F x i t + 1 :
                                              F x i t + 1 1 max Pr i T i + λ P l a u n c h
9:                 if F x i t + 1 < F b e s t  then
10:                  X b e s t x i t + 1
11:                  F b e s t F x i t + 1
12:              end if
13:          end for
14:          Update penalty coefficient λ :
                                                    λ t + 1 λ t 1 + l = 1 L C o f f l
15:            Update inertia weight ω :
                                                   ω ω max ω max ω min t t max
16:            Update Laplacian scale parameter b :
                                                   b t t max
17:            Update tabu table:
18:            if f x n e w > μ f + α σ f  then
19:                      Add   x n e w to tabu table
20:           end if
21: end for
22: return X b e s t , F b e s t

3.3. Summary of Solution Methods

The previous two subsections expounded on the real-time planning of the scheduling path for departure operations on an aircraft carrier through deep reinforcement learning methods, establishing the AAE–SAC algorithm for solving the path planning problem. We also explained the optimization process of the decision-making schemes for the parking positions, take-off positions, and take-off sequence of the aircraft during the departure operation through the PSO method, establishing the LTA-HPSO algorithm for solving the station matching and sequence decision-making problem. Figure 8 shows a flowchart of the two-stage scheduling optimization algorithm for solving departure operations of the carrier-based aircraft. After the simulation program is started and initialization is completed, the user can choose to have the system generate combat tasks or manually define these tasks. The process of the LTA-HPSO sub-program is shown in Figure 7. The simulation system generates a scheduling plan based on the LTA-HPSO algorithm to match the corresponding station positions and then executes the simulation process in sequence. During this period, the AAE–SAC algorithm is used as a sub-program to perform real-time control of the scheduling path; its process is shown in Figure 6. If the final completion time of the real-time path in the simulation is less than or equal to 90% of the time taken by the original matching path (from [19]), a new path is considered to have been generated and is then saved. Otherwise, when the simulation reaches the termination condition, the Gantt chart of its current optimal operation process is outputted, and the simulation is terminated.

4. Simulation Experiment and Result Analysis

Taking a typical aircraft carrier deck as an example, this study performed simulation optimization experiments on carrier-based aircraft departure operations. The flight deck was assumed to have 18 one-stop support positions, 16 fixed parking positions (numbered A01–A16), 20 temporary parking positions (numbered B01–B20), three elevators, three catapult take-off positions, and runways. Table 2 and Table 3 present the algorithm parameters and operation-related parameters, respectively. The simulation environment was a carrier-based aircraft departure operation simulation platform developed using C + + in a Windows 10 operating system running on a PC with 8 GB of memory and a main frequency of 2.30 GHz.

4.1. Case 1

Case 1 involved 12 stealth fighter jets on the flight deck at support positions and fully supported. Some combat missions may require all existing carrier-based aircraft to be dispatched. Figure 9 shows the initial deck status and position information. Under the conditions of this case, the proposed algorithm was compared with two benchmark algorithms, namely, the conventional PSO algorithm and the deep reinforcement learning method. Each algorithm independently ran 10 simulations of the scheduling process. Table 4 presents the solution results obtained by the three algorithms for Case 1. The LTA-HPSO + AAE–SAC algorithm exhibited the best optimal scheduling time of 609 s. Table 5 presents the corresponding optimal scheduling plan.
The Gantt chart of the departure operation of 12 shipboard aircraft is shown in Figure 10. Below the chart is the description of the operation content represented by different color blocks. The departure operation color blocks at the end of each Gantt chart contain the take-off sequence of the carrier-based aircraft in decimal form. For example, the take-off sequence of aircraft No. 1 is 3.1, where the integer part 3 indicates that its take-off runway is located at the catapult No. C3, and the decimal part 1 indicates that it takes off in the first sequence on the C3 runway. The take-off sequence of other aircraft can be inferred in the same way. Figure 11 verifies the optimization effect of the algorithm proposed in this paper on the departure operation scheduling of carrier-based aircraft from the perspective of convergence.

4.2. Case 2

In Case 2, the sortie scale of the carrier-based aircraft was increased to 24 to explore the ability of the LTA-HPSO + AAE–SAC algorithm in solving large-scale, multi-constraint scheduling optimization problems. Accordingly, the test design involved 18 aircraft on deck in the one-stop support position and supported, including two early warning aircraft (located at maintenance positions P1 and P2, from left to right) and two heavy carrier-based aircraft (located at maintenance positions P10 and P11). There were sufficient spare aircraft in the hangar. Figure 12 shows the initial state of the deck and position information. Under the conditions of this case, each of the three algorithms independently ran 10 simulations of the scheduling process. Table 6 presents the solution results for Case 2. Figure 13 shows the iterative curves of the solutions obtained by the three algorithms for Case 2. The LTA-HPSO + AAE–SAC algorithm required 984 s for optimal scheduling; Table 7 presents the corresponding optimal scheduling plan. Figure 14 shows the Gantt chart of the departure operation of 24 aircraft.
Case 2, in addition to having a different scale of scheduling from Case 1, also includes different types of shipboard aircraft, which makes the scheduling optimization not only need to consider the change in quantity but also deal with multiple scheduling demands such as different priority aircraft and different operation tasks. Therefore, we use Figure 15 and Figure 16 to demonstrate the LTA-HPSO + AAE-SAC algorithm’s ability to collaboratively optimize multi-type aircraft and differentiated operations in large-scale scheduling. Figure 15 shows the instantaneous image at 195 s of the simulation experiment, from which it can be seen that the algorithm prioritizes the handling of early warning aircraft with higher priority, and its scheduling path is safe and reasonable. Figure 16 reflects the collaborative path planning process of aircraft taking off and leaving the field when the station positions are strongly constrained by deck space. At 504 s of the simulation experiment, the four aircrafts (numbered: 9, 12, 13, 14) located at parking positions A9-A12 have to detour their scheduling paths due to the occupation of their forward support stations (P3-P6), and there are multiple constraints such as operation space and wake interference. Under such circumstances, the scheduling path trajectory and the aircraft’s outer contour and wake envelope line planned by the algorithm from the side reflect the algorithm’s outstanding optimization ability for the departure scheduling problem under changes in priority and operation space, and the decision path is safe and reasonable.

4.3. Analysis of Algorithm Performance

4.3.1. Algorithm Comparison

In the case of the departure operation scheduling of 12 shipboard aircraft, we have sorted out the following information in Table 8 based on the solution results of three algorithms. Furthermore, we conducted pairwise t-tests between the LTA-HPSO + AAE-SAC algorithm and the PSO + heuristic rule algorithm, as well as the PSO + SAC algorithm. The statistical results show that in the one-sided t-test with the PSO + heuristic rule algorithm, the statistical result is t 9 = 3.515 > 1.833 ,   p = 0.007 < 0.05 , and the corresponding p-value is much smaller than the null hypothesis, indicating a significant difference in the optimization performance of the two algorithms. In the one-sided t-test with the PSO + SAC algorithm, the statistical result is t 18 = 5.73 > 1.734 ,   p = 0.00028 < 0.05 , and the corresponding p-value is much smaller than the null hypothesis, indicating a significant difference in the optimization performance of the two algorithms. Meanwhile, the average time taken by the LTA-HPSO + AAE-SAC algorithm to obtain the optimal scheduling is 652.6 s, with an average running time of approximately 125 s. Compared with the PSO + SAC algorithm under the same framework, the average scheduling time is shortened by 13.75%, and the operation time is reduced by 30.48%. Although compared with the traditional PSO + heuristic rule, in solving small-scale scheduling, the operation time cost does not show an advantage, the average scheduling time of the optimal solution of the LTA-HPSO + AAE-SAC algorithm is shortened by 31.27%.
In the large-scale scheduling case of 24 shipboard aircrafts taking off and leaving the field, the solution results of the three algorithms are summarized in Table 9. A t-test was conducted for pairwise comparisons between the LTA-HPSO + AAE-SAC algorithm and the PSO + heuristic rule algorithm, as well as the PSO + SAC algorithm. The statistical results show that in the one-sided t-test with the PSO + heuristic rule algorithm, t 9 = 8.445 > 1.833 ,   p = 0.00001 < 0.05 , which is significantly smaller than the null hypothesis, indicating a significant difference in the optimization performance of the two algorithms. In the one-sided t-test with the PSO + SAC algorithm, t 9 = 9.918 > 1.833 ,   p = 0.00000 < 0.05 , which is significantly smaller than the null hypothesis, indicating a significant difference in the optimization performance of the two algorithms. Moreover, all t-test results are far smaller than the null hypothesis, fully demonstrating that the algorithm proposed in this paper has a more prominent optimization capability in large-scale scheduling problems. Although the scheduling of 24 aircraft involves different priorities, hangar scheduling, and deck support, which increases the complexity of the takeoff and departure operations, the average scheduling time obtained by the LTA-HPSO + AAE-SAC algorithm is still 49.54% shorter than that of the PSO + heuristic rule algorithm and 26.81% shorter than that of the PSO + SAC algorithm. At the same time, the average running time of the algorithm has only increased by 48.04% compared to itself, and compared to the two benchmark algorithms, it shows improved superiority and good solution stability.

4.3.2. Sensitivity Analysis

In the preceding section, we demonstrated the superiority of the LTA-HPSO + AAE-SAC algorithm by comparing it with two other benchmark algorithms. In this section, we provide a detailed analysis to verify the rationality and feasibility of the improvements made to it. Specifically, to enhance the coordination capability and new-path exploration of the AAE-SAC algorithm for shipboard aircraft deck scheduling via the dynamic target entropy adjustment mechanism (DTEAM), we performed a sensitivity analysis on the parameter β in Equation (44). Using a scenario with 24 aircraft departures, we conducted 100 independent simulation experiments. The β value range was set to (0, 3]. We examined the effects of varying β values on aircraft departure operation time and the relationships between β and the occurrences of conflicts and path exploration during cooperative scheduling. Figure 17 presents a statistical chart illustrating the relationships among the dynamic target entropy parameter β , scheduling time, number of conflicts, and proportion of new paths. Based on the results shown in the figure, we summarize the following observations: as the β value increases, the scheduling time for shipboard aircraft departures (black line) increases, the number of conflicts, such as aircraft collisions, wake interference, and position occupation (red line), decreases, and the proportion of new paths (blue line) employed decreases. Consequently, we conclude that when the parameter β falls within the range [0.8, 1.5], the AAE-SAC algorithm achieves effective performance in collaborative path planning for large-scale aircraft departure operations and optimally balances conflict avoidance and path exploration during the scheduling process.
In the enhancement of the Particle Swarm Optimization (PSO) algorithm, a Laplace perturbation is incorporated into the original single-learning factor, enabling this parameter to vary dynamically throughout the iterative procedure. This modification enhances the algorithm’s optimization capability within the particle solution space. For the scale parameter b of the Laplace perturbation function, the linear form (Equation (61)) is adopted. To validate the design rationale, a comparative experiment was conducted involving 30 independent simulation trials, contrasting the linear form with an exponential form defined as b = 0.5 t / t max . Using a case study of 24-aircraft scheduling with 100 generations per trial, the statistical mean distribution is shown in Figure 18. The blue trajectory illustrates the impact of the exponential-form Laplace scale parameter on the algorithm’s fitness value, which fully converges by generation 50. In contrast, the red trajectory depicting the linear-form’s influence converges at generation 68 but achieves a significantly higher optimal fitness value than the exponential counterpart. These results demonstrate that the linear-form Laplace perturbation facilitates the LTA-HPSO algorithm in attaining superior minimum scheduling time solutions for aircraft departure scheduling optimization.

5. Conclusions

This paper investigates the scheduling optimization problem in deck departure operations of shipboard aircraft. We propose a novel two-stage optimization algorithm (LTA-HPSO + AAE-SAC) to address this problem. Building upon the particle swarm optimization (PSO) algorithm, we introduce a population segmentation strategy based on task sequence and take-off sequence. By incorporating Laplacian perturbation learning factors and a dynamic tabu table, we develop the enhanced LTA-HPSO algorithm, significantly improving optimization quality and efficiency. Concurrently, the AAE-SAC algorithm is proposed by dynamically adjusting policy entropy in the SAC algorithm and integrating an attention mechanism. This enhancement effectively mitigates the limitations in collaborative trajectory optimization for large-scale aircraft departure operations. The integration of PSO and deep reinforcement learning (DRL) within a two-stage framework represents a novel approach in carrier-based departure operations research, resolving the key limitation of existing studies which treated scheduling schemes and spatial paths in isolation.
Case simulation results demonstrate that the LTA-HPSO + AAE-SAC algorithm effectively tackles departure scheduling problem across different scales, outperforming two benchmark algorithms in both solution quality and stability. Although the traditional approach (PSO + heuristic rules) offers faster solution times for small-scale problems, LTA-HPSO + AAE-SAC incurs computational overhead due to sample training and real-time path control. However, this investment resolves the inherent imbalance in traditional two-stage scheduling, characterized by low coherence between scheduling schemes and their corresponding paths, as well as path discontinuity. The average scheduling time achieved by the optimal solution using LTA-HPSO + AAE-SAC is reduced by 31.27% compared to PSO + heuristic rules and by 13.75% compared to PSO + SAC. For a complex scenario involving 24 aircraft with varying priorities and heterogeneous tasks, the algorithm enables real-time collaborative path planning on the deck, with only a 48.04% increase in computational time. Notably, it enhances multi-aircraft operational safety while demonstrating superior robustness for complex scheduling scenarios.
Therefore, this study provides an effective solution for scheduling optimization within the trend of large-scale, high-dynamic, and highly collaborative aircraft departure operations, thereby enriching theoretical frameworks for deck operations and enhancing aircraft departure efficiency. Furthermore, the methodological framework holds applicability potential for scheduling problems requiring integrated path consideration, such as workshop scheduling.
The dynamic aspects considered in this study pertain primarily to deck space and priority changes. However, we did not account for uncertainties involved in hardware operations, such as aircraft warm-up time and deflector plate cooling time. Additionally, the impact of support personnel on operation times and the scheduling challenges in mixed manned/unmanned scenarios require further investigation. While the AAE-SAC algorithm utilizes an attention mechanism to reduce the dimensionality of deck state space, exploring its limitations under increased complexity involving more aircraft, personnel states, and surrounding factors will be a key focus of future research.

Author Contributions

Conceptualization, J.L.; Methodology, J.L. and N.W.; Software, J.L.; Validation, J.L.; Formal analysis, J.L.; Investigation, J.L.; Resources, J.L.; Data curation, J.L.; Writing – original draft, J.L.; Writing – review & editing, J.L.; Visualization, J.L.; Supervision, N.W.; Project administration, N.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Wei, M.; Yang, S.; Wu, W.; Sun, B. A multi-objective fuzzy optimization model for multi-type aircraft flight scheduling problem. Transport 2024, 39, 313–322. [Google Scholar] [CrossRef]
  2. Sun, B.; Xu, Z.; Wei, M.; Wang, X. A study on the strategic behavior of players participating in air-rail intermodal transportation based on evolutionary games. J. Air Transp. Manag. 2025, 126, 102793. [Google Scholar] [CrossRef]
  3. Jewell, A.; Wigge, M.A.; Gagnon, C.M.; Lynn, L.A.; Kirk, K.M.; Berg, K.K.; Roberts, T.A.; Hale, A.J.; Jones, W.L.; Matheny, A.M.; et al. USS Nimitz and Carrier Airwing Nine Surge Demonstration; Center for Naval Analyses: Alexandria, Egypt, 1998. [Google Scholar]
  4. Michels, A.S.; Thate, T. Requirements for Digitized Aircraft Spotting (Ouija) Board for Use on US Navy Aircraft Carriers; Naval Postgraduate School: Monterey, CA, USA, 2002. [Google Scholar]
  5. US Navy Air Systems Command. Navy Training System Plan for Aviation Data Management and Control System. US Navy. 2002. Available online: https://www.globalsecurity.org/military/library/policy/navy/ntsp/admacs-a_2002.pdf (accessed on 20 May 2025).
  6. Zhang, Z.; Lin, S.; Dong, R.; Zhu, Q. Designing a Human-Computer Cooperation Decision Planning System for Aircraft Carrier Deck Scheduling. In Proceedings of the AIAA Infotech@ Aerospace, Kissimmee, FL, USA, 5–9 January 2015; p. 1111. [Google Scholar]
  7. Ryan, J.; Cummings, M.; Roy, N.; Banerjee, A.; Schulte, A. Designing an Interactive Local and Global Decision Support System for Aircraft Carrier Deck Scheduling. In Proceedings of the Infotech@ Aerospace 2011, St. Louis, MO, USA, 29–31 March 2011; p. 1516. [Google Scholar]
  8. Ghosh Dastidar, R.; Frazzoli, E. A Queueing Network Based Approach to Distributed Aircraft Carrier Deck Scheduling. In Proceedings of the Infotech@ Aerospace 2011, St. Louis, MO, USA, 29–31 March 2011; p. 1514. [Google Scholar]
  9. Michini, B.; How, J. A Human-Interactive Course of Action Planner for Aircraft Carrier Deck Operations. In Proceedings of the Infotech@ Aerospace 2011, St. Louis, MO, USA, 29–31 March 2011; p. 1515. [Google Scholar]
  10. Yuan, P.; Han, W.; Su, X.; Liu, J.; Song, J. A Dynamic Scheduling Method for Carrier Aircraft Support Operation Under Uncertain Conditions Based on Rolling Horizon Strategy. Appl. Sci. 2018, 8, 1546. [Google Scholar] [CrossRef]
  11. Wan, B.; Han, W.; Su, X.C.; Liu, J. Carrier-Based Aircraft Departure Scheduling Optimization Based on CE-PF Algorithm. J. Beijing Univ. Aeronaut. Astronaut. 2022, 48, 771–785. (In Chinese) [Google Scholar]
  12. Liu, Z.X.; Wan, B.; Su, X.C.; Guo, F.; Liu, Y.J. Sortie Scheduling Method of Carrier Aircraft Based on HA Algorithm. Syst. Eng. Electron. 2024, 46, 1691–1702. (In Chinese) [Google Scholar]
  13. Wang, H.; Jiang, Z.; Wang, Y.; Zhang, H.; Wang, Y. A two-stage optimization method for energy-saving flexible job-shop scheduling based on energy dynamic characterization. J. Clean. Prod. 2018, 192, 534–546. [Google Scholar] [CrossRef]
  14. Si, W.; Sun, T.; Song, C.; Zhang, J. Design and Verification of a Transfer Path Optimization Method for an Aircraft on the Aircraft Carrier Flight Deck. Front. Inf. Technol. Electron. Eng. 2021, 22, 1221–1233. [Google Scholar] [CrossRef]
  15. Li, H.; Zhu, Y.; Chen, Y.; Pedrielli, G.; Pujowidianto, N.A. The Object-Oriented Discrete Event Simulation Modeling: A Case Study on Aircraft Spare Part Management. In Proceedings of the 2015 Winter Simulation Conference (WSC), Huntington Beach, CA, USA, 6–9 December 2015. [Google Scholar]
  16. McWhite, J.D. CVNX—Expanded Capability Baseline Aircraft Carrier Design Study. Nav. Eng. J. 2000, 112, 47–57. [Google Scholar] [CrossRef]
  17. Nemirko, A.P.; Dulá, J.H. Machine Learning Algorithm Based on Convex Hull Analysis. Procedia Comput. Sci. 2021, 186, 381–386. [Google Scholar] [CrossRef]
  18. Jiang, L.; Fang, D.J.; Zhou, H.W.; Huang, H.B. Improved Global Path Planning Algorithm Based on Ray Model. Acta Electron. Sin. 2022, 50, 548–556. (In Chinese) [Google Scholar]
  19. Liu, J. Research on Path Planning for Carrier-Based Aircraft Tractor Based on Inverse Reinforcement Learning. Master’s Thesis, Harbin Engineering University, Harbin, China, 2017. [Google Scholar]
  20. Meng, S.; Zhu, Q.; Xia, F. Improvement of the Dynamic Priority Scheduling Algorithm Based on a Heapsort. IEEE Access 2019, 7, 68503–68510. [Google Scholar] [CrossRef]
  21. Tian, B.; Xiao, G.; Shen, Y. A deep reinforcement learning approach for dynamic task scheduling of flight tests. J. Supercomput. 2024, 80, 18761–18796. [Google Scholar] [CrossRef]
  22. Wu, Y.; Hu, N.; Qu, X. A General Trajectory Optimization Method for Aircraft Taxiing on Flight Deck of Carrier. Proc. Inst. Mech. Eng. Pt. G. J. Aerosp. Eng. 2019, 233, 1340–1353. [Google Scholar] [CrossRef]
  23. Christiano, P.F.; Leike, J.; Brown, T.; Martic, M.; Legg, S.; Amodei, D. Deep reinforcement learning from human preferences. Adv. Neural Inf. Process. Syst. 2017, 30, 4302–4310. [Google Scholar]
  24. Li, F.; Lang, S.; Hong, B.; Reggelin, T. A two-stage RNN-based deep reinforcement learning approach for solving the parallel machine scheduling problem with due dates and family setups. J. Intell. Manuf. 2024, 35, 1107–1140. [Google Scholar] [CrossRef]
  25. Yan, Q.; Wang, H.; Wu, F. Digital Twin-Enabled Dynamic Scheduling with Preventive Maintenance Using a Double-Layer Q-Learning Algorithm. Comput. Oper. Res. 2022, 144, 105823. [Google Scholar] [CrossRef]
  26. Wang, Z.; Cai, B.; Li, J.; Yang, D.; Zhao, Y.; Xie, H. Solving Non-Permutation Flow-Shop Scheduling Problem via a Novel Deep Reinforcement Learning Approach. Comput. Oper. Res. 2023, 151, 106095. [Google Scholar] [CrossRef]
  27. Wong, M.F.; Tan, C.W. Aligning Crowd-Sourced Human Feedback for Reinforcement Learning on Code Generation by Large Language Models. IEEE Trans. Big Data 2024. [Google Scholar] [CrossRef]
  28. Zhang, Y.; Zhu, H.; Tang, D. An Improved Hybrid Particle Swarm Optimization for Multi-Objective Flexible Job-Shop Scheduling Problem. Kybernetes 2020, 49, 2873–2892. [Google Scholar] [CrossRef]
  29. Liu, L.L.; Hu, R.S.; Hu, X.P.; Zhao, G.P.; Wang, S. A Hybrid PSO-GA Algorithm for Job Shop Scheduling in Machine Tool Production. Int. J. Prod. Res. 2015, 53, 5755–5781. [Google Scholar] [CrossRef]
  30. Ding, H.; Gu, X. Improved Particle Swarm Optimization Algorithm Based Novel Encoding and Decoding Schemes for Flexible Job Shop Scheduling Problem. Comput. Oper. Res. 2020, 121, 104951. [Google Scholar] [CrossRef]
  31. Vakanski, A.; Mantegh, I.; Irish, A.; Janabi-Sharifi, F. Trajectory Learning for Robot Programming by Demonstration Using Hidden Markov Model and Dynamic Time Warping. IEEE Trans. Syst. Man Cybern. Part B (Cybern.) 2012, 42, 1039–1052. [Google Scholar] [CrossRef] [PubMed]
  32. Giorgino, T. Computing and Visualizing Dynamic Time Warping Alignments in R: The dtw Package. J. Stat. Softw. 2009, 31, 1–24. [Google Scholar] [CrossRef]
  33. Boularias, A.; Kober, J.; Peters, J. Relative Entropy Inverse Reinforcement Learning. Proc. Fourteenth Int. Conf. Artif. Intell. Stat. 2011, 15, 182–189. [Google Scholar]
  34. Nica, A.; Khetarpal, K.; Precup, D. The Paradox of Choice: Using Attention in Hierarchical Reinforcement Learning. arXiv 2022, arXiv:2201.09653. Available online: https://arxiv.org/abs/2201.09653 (accessed on 20 May 2025).
  35. Xia, G.; Luan, T.; Sun, M. Evaluation Analysis for Sortie Generation of Carrier Aircrafts Based on Nonlinear Fuzzy Matter-Element Method. J. Intell. Fuzzy Syst. 2016, 31, 3055–3066. [Google Scholar] [CrossRef]
  36. Fujimoto, S.; Hoof, H.; Meger, D. Addressing Function Approximation Error in Actor-Critic Methods. Int. Conf. Mach. Learn. 2018, 80, 1587–1596. [Google Scholar]
  37. Zhang, X.; Guo, P.; Zhang, H.; Yao, J. Hybrid Particle Swarm Optimization Algorithm for Process Planning. Mathematics 2020, 8, 1745. [Google Scholar] [CrossRef]
  38. Kuo, R.J.; Syu, Y.J.; Chen, Z.Y.; Tien, F.C. Integration of Particle Swarm Optimization and Genetic Algorithm for Dynamic Clustering. Inf. Sci. 2012, 195, 124–140. [Google Scholar] [CrossRef]
  39. Pant, M.; Thangaraj, R.; Abraham, A.; Grosan, C. Differential Evolution with Laplace Mutation Operator. 2009 IEEE Congr. Evol. Comput. 2009, 2841–2849. [Google Scholar]
  40. Peng, C.; Wu, G.; Liao, T.W.; Wang, H. Research on Multi-Agent Genetic Algorithm Based on Tabu Search for the Job Shop Scheduling Problem. PLoS ONE 2019, 14, e0223182. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Operational flowchart for carrier-based aircraft departure.
Figure 1. Operational flowchart for carrier-based aircraft departure.
Entropy 27 00662 g001
Figure 2. Resource model of carrier-based aircraft departure operations.
Figure 2. Resource model of carrier-based aircraft departure operations.
Entropy 27 00662 g002
Figure 3. Pentagonal outer-contour shell model of the aircraft in the (a) folded wing state, (b) extended wing state, and (c) collision detection model of the carrier-based aircraft.
Figure 3. Pentagonal outer-contour shell model of the aircraft in the (a) folded wing state, (b) extended wing state, and (c) collision detection model of the carrier-based aircraft.
Entropy 27 00662 g003
Figure 4. Encounter model of carrier-based aircraft deck operation.
Figure 4. Encounter model of carrier-based aircraft deck operation.
Entropy 27 00662 g004
Figure 5. Deck obstacle and work area division diagram for an aircraft carrier.
Figure 5. Deck obstacle and work area division diagram for an aircraft carrier.
Entropy 27 00662 g005
Figure 6. Schematic of the network structure of the AAE–SAC algorithm.
Figure 6. Schematic of the network structure of the AAE–SAC algorithm.
Entropy 27 00662 g006
Figure 7. Flowchart of the LTA-HPSO algorithm.
Figure 7. Flowchart of the LTA-HPSO algorithm.
Entropy 27 00662 g007
Figure 8. Simulation flowchart of a two-stage scheduling optimization algorithm for aircraft departure operations.
Figure 8. Simulation flowchart of a two-stage scheduling optimization algorithm for aircraft departure operations.
Entropy 27 00662 g008
Figure 9. Initial station map of Case 1.
Figure 9. Initial station map of Case 1.
Entropy 27 00662 g009
Figure 10. Gantt chart of 12 carrier-based aircraft departure operation.
Figure 10. Gantt chart of 12 carrier-based aircraft departure operation.
Entropy 27 00662 g010
Figure 11. Iterative curves of the solutions obtained by three algorithms in Case 1.
Figure 11. Iterative curves of the solutions obtained by three algorithms in Case 1.
Entropy 27 00662 g011
Figure 12. Initial station map of Case 2.
Figure 12. Initial station map of Case 2.
Entropy 27 00662 g012
Figure 13. Iterative curves of the solutions obtained by three algorithms in Case 2.
Figure 13. Iterative curves of the solutions obtained by three algorithms in Case 2.
Entropy 27 00662 g013
Figure 14. Gantt chart of 24 carrier-based aircraft departure operations.
Figure 14. Gantt chart of 24 carrier-based aircraft departure operations.
Entropy 27 00662 g014
Figure 15. Snapshot of the departure operation simulation test with early-warning aircraft.
Figure 15. Snapshot of the departure operation simulation test with early-warning aircraft.
Entropy 27 00662 g015
Figure 16. Snapshot of the simulation test for the departure operation of an aircraft with strong constraints on deck space for positions.
Figure 16. Snapshot of the simulation test for the departure operation of an aircraft with strong constraints on deck space for positions.
Entropy 27 00662 g016
Figure 17. The influence of the value of the dynamic target entropy parameter β in the AAE-SAC algorithm on the performance of the two-stage scheduling optimization algorithm.
Figure 17. The influence of the value of the dynamic target entropy parameter β in the AAE-SAC algorithm on the performance of the two-stage scheduling optimization algorithm.
Entropy 27 00662 g017
Figure 18. Performance analysis of the impact of scale parameter b in Laplace perturbation on the fitness of LTA-HPSO algorithm.
Figure 18. Performance analysis of the impact of scale parameter b in Laplace perturbation on the fitness of LTA-HPSO algorithm.
Entropy 27 00662 g018
Table 1. Priority relationships in carrier-based aircraft departure operation.
Table 1. Priority relationships in carrier-based aircraft departure operation.
Type of AircraftPriority Rank Value   of   Pr C i Type of MissionPriority Rank Value   of   Pr M i
Early-warning aircraftI3Ground strikeI3
Heavy aircraftII2Air-to-airII2
Stealth/electronic aircraftIII1CruiseIII1
Table 2. Parameters of the LTA-HPSO algorithm.
Table 2. Parameters of the LTA-HPSO algorithm.
Parameter NameValue
Population size100
Maximum number of iterations100
Base   individual   learning   factor   c 1 0.5
Base   social   learning   factor   c 2 0.5
Inertia   weight   ω max 1.2
Inertia   weight   ω min 0.15
Table 3. Aircraft departure operation-related parameters of the simulation system.
Table 3. Aircraft departure operation-related parameters of the simulation system.
Parameter NameValues and UnitsParameter NameValues and Units
Landing Gear Retention Time for Aircraft40 sPreparation Time for Bend Plate30 s
Release Time for Landing Gear of Aircraft30 sCooling and Reset Time for Bend Plate30 s
Maximum Traction Speed1.5 m/sPre-Takeoff Inspection Time30 s
Maximum Taxiing Speed3.0 m/sInterval Time of take-off30 s
Transport Time for Lifting Gear60 sEngine Warm-Up Time60 s
One-stop Support Time8 min
Table 4. Results of Case 1 where 10 simulations of aircraft departure operations are solved by three algorithms.
Table 4. Results of Case 1 where 10 simulations of aircraft departure operations are solved by three algorithms.
Number of Algorithm ExecutionsPSO + HeuristicPSO + SACLTA-HPSO + AAE–SAC
Optimal Solution Scheduling Time/sAlgorithm Running Time/sOptimal Solution Scheduling Time/sAlgorithm Running Time/sOptimal Solution Scheduling Time/sAlgorithm Running Time/s
Ex184138.64709157.64692136.56
Ex287079.87734150.87635115.85
Ex384868.65706119.84699134.05
Ex481515.36768200.63661128.72
Ex586464.62811271.58716143.21
Ex684434.35738162.23625108.46
Ex71663129.25835256.83653140.84
Ex886869.35792223.59622126.23
Ex9106696.62710122.68609102.63
Ex1081621.44763133.65614114.42
Table 5. Scheduling plan of Case 1.
Table 5. Scheduling plan of Case 1.
Aircraft NumberPriorityParking Position NumberTake-Off Position NumberTake-Off Sequence
13A2C31
23A1C34
33B8C26
43B7C15
53B13C12
63B14C37
73A9C23
83A10C18
93A11C29
103A12C310
113B15C111
123B16C212
Table 6. Results of Case 2 in which 10 simulations of aircraft departure operations are solved by three algorithms.
Table 6. Results of Case 2 in which 10 simulations of aircraft departure operations are solved by three algorithms.
Number of Algorithm ExecutionsPSO + HeuristicPSO + SACLTA-HPSO + AAE–SAC
Optimal Solution Scheduling Time/sAlgorithm Running Time/sOptimal Solution Scheduling Time/sAlgorithm Running Time/sOptimal Solution Scheduling Time/sAlgorithm Running Time/s
Ex12013635.1871354365.354991193.820
Ex22251678.5341314354.581985201.801
Ex32595751.6841473439.950984176.725
Ex41743594.9531311355.3151062214.395
Ex51976628.5121387412.790995191.963
Ex62639819.9261293349.2011010205.662
Ex71640583.5411554437.1651076186.946
Ex81797581.6641391381.3001093234.839
Ex91951679.2241606540.937988166.274
Ex101874597.8211332414.9561074189.460
Table 7. Scheduling plan for Case 2.
Table 7. Scheduling plan for Case 2.
Aircraft NumberPriorityParking Position NumberTake-Off Position NumberTake-Off Sequence
15A4C31
25A3C22
33A2C13
43A1C34
53A5C37
63A6C28
73A7C19
83A8C310
93A9C115
104B2C25
114B1C16
123A10C214
133A11C112
143A12C211
153A13C313
163A14C316
173A15C217
183A16C118
193B8 → P1 → B13C123
203B7 → P2 → B14C224
213B13 → P3 → A9C319
223B14 → P4 → A10C120
233B15 → P5 → A11C221
243B16 → P6 → A12C322
Table 8. Statistical analysis results of aircraft departure operations obtained by three algorithms in Case 1.
Table 8. Statistical analysis results of aircraft departure operations obtained by three algorithms in Case 1.
Algorithm NameOptimal Scheduling Time (s)Average Scheduling Time (±Standard Deviation)Average Algorithm Running Time (s)
PSO + heuristic815949.50 ± 260.7861.81 ± 35.39
PSO + SAC706756.60 ± 45.19179.95 ± 55.10
LTA-HPSO + AAE–SAC609652.60 ± 38.28125.09 ± 14.07
Table 9. Statistical analysis results of aircraft departure operation obtained by three algorithms in Case 2.
Table 9. Statistical analysis results of aircraft departure operation obtained by three algorithms in Case 2.
Algorithm NameOptimal Scheduling Time (s)Average Scheduling Time (±Standard Deviation)Average Algorithm Running Time (s)
PSO + heuristic16402032.90 ± 354.45655.10 ± 78.98
PSO + SAC12931401.50 ± 108.12405.15 ± 58.83
LTA-HPSO + AAE–SAC9841025.80 ± 44.62185.19 ± 17.18
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Liu, J.; Wang, N. Simulation-Based Two-Stage Scheduling Optimization Method for Carrier-Based Aircraft Launch and Departure Operations. Entropy 2025, 27, 662. https://doi.org/10.3390/e27070662

AMA Style

Liu J, Wang N. Simulation-Based Two-Stage Scheduling Optimization Method for Carrier-Based Aircraft Launch and Departure Operations. Entropy. 2025; 27(7):662. https://doi.org/10.3390/e27070662

Chicago/Turabian Style

Liu, Jue, and Nengjian Wang. 2025. "Simulation-Based Two-Stage Scheduling Optimization Method for Carrier-Based Aircraft Launch and Departure Operations" Entropy 27, no. 7: 662. https://doi.org/10.3390/e27070662

APA Style

Liu, J., & Wang, N. (2025). Simulation-Based Two-Stage Scheduling Optimization Method for Carrier-Based Aircraft Launch and Departure Operations. Entropy, 27(7), 662. https://doi.org/10.3390/e27070662

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop