Container Slot Allocation with Empty Container Repositioning: A Multi-Objective Optimization Approach

Huang, Lei; Sha, Mei; Guo, Wenwen; Gao, Yinping

doi:10.3390/jmse14050424

Open AccessArticle

Container Slot Allocation with Empty Container Repositioning: A Multi-Objective Optimization Approach

by

Lei Huang

,

Mei Sha

^*,

Wenwen Guo

and

Yinping Gao

College of Transport & Communications, Shanghai Maritime University, Shanghai 201306, China

^*

Author to whom correspondence should be addressed.

J. Mar. Sci. Eng. 2026, 14(5), 424; https://doi.org/10.3390/jmse14050424

Submission received: 28 January 2026 / Revised: 18 February 2026 / Accepted: 24 February 2026 / Published: 25 February 2026

(This article belongs to the Section Ocean Engineering)

Download

Browse Figures

Versions Notes

Abstract

Trade imbalances and equipment shortages are making it increasingly important to coordinate container slot allocation with empty container repositioning on liner services. This paper develops an integrated bi-objective mixed-integer model for voyage-level slot planning on a fixed cyclic route. The model jointly decides booking acceptance, inter-voyage shipment, and empty repositioning with port-level empty-inventory dynamics and leg-based vessel capacity constraints. We optimize two conflicting objectives: maximizing operational profit and minimizing empty container TEU-miles. To solve the model at practical scales, we propose a hybrid evolutionary framework, NSGA-II-RL, which uses a lightweight Q-learning controller to adapt operator and repair choices during NSGA-II evolution. Computational experiments on representative service route instances show that NSGA-II-RL produces diverse Pareto-efficient solutions and improves hypervolume relative to fixed-operator and random-control variants, revealing clear trade-offs between profitability and repositioning intensity.

Keywords:

container slot allocation; empty container repositioning; multi-objective optimization; NSGA-II; reinforcement learning

1. Introduction

Container shipping has established itself as the fundamental element of global trade, carrying more than 80% of world cargo by volume [1]. Despite recent raises in freight rates, liner shipping companies nevertheless face substantial operating cost constraints and uncertainties that limit their total profitability. First, it is expected that carrier operating expenses would increase in order to comply with new environmental laws, such as stricter fuel emission standards and carbon intensity requirements [2]. Second, operational volatility has lately increased due to geopolitical interruptions. In late 2023 and early 2024, container shipping rates sharply and unpredictably increased due to rerouting brought on by the Red Sea crisis and restrictions in the Suez and Panama Canals. The authors of Sun et al. [3] estimated that Red Sea rerouting alone raised overall route economic costs by at least 51% using a fleet size optimization model and an AIS-based bottom–up routing model. Third, enduring structural overcapacity continues to pose a significant long-term challenge. As noted by UNCTAD [1], between 2010 and 2023, global fleet capacity increased by 78.5%, whereas demand grew by merely 34%, with some of this discrepancy obscured by rises in distance-adjusted demand. These factors suggest that capacity efficiency remains critical for carriers aiming to remain competitive in increasingly current operating environments.

Against this background, the efficient allocation of limited vessel capacity has become a central issue for liner shipping companies. In the short term, shipping routes, service schedules, and vessel capacities are predominantly fixed. As a result, profitability largely relies on the allocation of available slot capacity among containers with heterogeneous demand characteristics and freight rates [4]. This decision problem, widely known as the container slot allocation problem (CSAP), lies at the core of liner shipping operations and has attracted sustained attention from both practitioners and researchers. Under the domain of revenue management, CSAP focuses on dynamically accepting or rejecting booking requests in order to maximize revenue under capacity constraints [5]. Nonetheless, the majority of CSAP models focus exclusively on capacity allocation for laden containers. This simplification overlooks the fact that empty containers must also be relocated utilizing the same limited vessel slots, thereby directly competing with revenue-generating cargo for capacity [6,7].

Empty container repositioning (ECR) constitutes one of the most enduring structural challenges within liner shipping and is broadly acknowledged as a core planning issue in container shipping [8,9]. Persistent trade imbalances between export-dominant and import-dominant regions create systematic mismatches between the places where empty containers become available and the places where export demand arises across ports. As a result, liner shipping companies have to reposition large volumes of empty containers from surplus ports to deficit ports at their own expense, making the economic and operational significance of ECR difficult to ignore [10,11]. Industry estimates suggest that empty container repositioning costs carriers between USD 15 and 20 billion annually, accounting for approximately 5% to 8% of total operating costs [12].

From a capacity occupation perspective, ECR also constitutes a considerable share of global container shipping activity. Drewry estimates that, historically, approximately 20% of all ocean container movements have involved the repositioning of empty containers since 1998 [13]. Furthermore, since ECR often involves long-haul movements along major trade lanes, it consumes a disproportionately large amount of vessel slot capacity. Recent industry analyses further indicate that, when measured in terms of empty container TEU-miles (empty TEU-miles), empty containers may account for up to 40% of global container shipping activity [14]. Consequently, ECR and CSAP are inherently coupled through shared vessel capacity. Slots assigned to empty containers reduce the space available for revenue-generating laden cargo in the short term, while insufficient repositioning may lead to empty container shortages at export-oriented ports and lost bookings in subsequent periods. This interdependence highlights the necessity of considering ECR jointly with CSAP, rather than treating ECR as a separate downstream operational task.

In practice, carriers care about both profitability and capacity efficiency when empty repositioning competes with laden shipments for limited vessel slots [15]. When empty repositioning is involved, this trade-off becomes more evident. Allocating more slots to empty containers can secure equipment availability for future demand, but it directly reduces the slots available for laden containers and may lower current-period laden slot share and overall profit [16,17].

Motivated by the coupling between CSAP and ECR, this study formulates an integrated multi-objective optimization model for voyage-level slot planning on a given liner shipping service. The model determines how limited vessel slot capacity is allocated to laden containers and empty containers across voyages. Two objectives are considered to reflect key managerial concerns. The first objective maximizes total profit calculated by total freight revenue minus operational cost. The second objective minimizes empty TEU-miles, which discourages excessive empty movements and thus reflects utilization-related efficiency.

Solving this model requires both effective exploration of trade-offs and robust feasibility handling. We therefore develop an enhanced NSGA-II solution approach with reinforcement-learning-guided operator control. A Q-learning agent observes the population state during the evolutionary process and adaptively selects genetic operators, while receiving rewards based on hypervolume improvement and feasibility-related signals. This design improves search efficiency and solution stability under tight capacity and inventory constraints, and it produces a diverse set of Pareto-efficient solutions that make the trade-off between profit and empty TEU-miles explicit. Numerical experiments based on representative liner service scenarios are conducted to evaluate solution quality and computational performance, and to derive managerial insights on when capacity should be reserved for empty container repositioning versus allocated to revenue-generating laden cargo.

The main contributions of this work are summarized as follows. First, we develop an integrated voyage-level formulation that jointly determines CSAP and ECR decisions under shared vessel capacity. Second, we formulate a profit–empty TEU-miles bi-objective optimization model to explicitly capture the resulting trade-off. Third, we design an efficient solution approach that can generate diverse Pareto-efficient solutions for realistic instance scales.

The remainder of this paper is organized as follows. Section 2 reviews related studies on CSAP, ECR and multi-objective optimization methods. Section 3 presents the integrated multi-objective formulation and key modeling assumptions. Section 4 describes the proposed NSGA-II based solution approach with reinforcement-learning-guided operator control. Section 5 reports experimental settings and computational results. Section 6 discusses managerial insights of the results and outlines directions for future work. Section 7 concludes the paper.

2. Literature Review

2.1. CSAP in Liner Shipping

In the short term, liner carriers typically operate pre-announced services with fixed shipping routes, service schedules, and vessel capacities. Under such a setting, operational revenue management largely depends on how vessel slots are allocated to heterogeneous bookings, which has a direct impact on carriers’ profitability and service performance [4,18]. Typical decisions of CSAP include accepting or rejecting booking requests and allocating slots across OD flows and customer groups, subject to operational constraints such as vessel capacity, reefer-plug limits, weight and slot-type restrictions, and the fixed schedules of liner services. Due to the large capacity scale of modern containerships and the leg-coupled capacity interactions induced by network operations, CSAP models often lead to large-scale mixed-integer programs that are computationally challenging [4].

A central modeling ingredient in CSAP is customer and product segmentation, because different demand segments contribute differently to volume, revenue, and service obligations. In particular, liner demand is often partitioned into contract and spot segments [4]. Contract customers typically commit volumes under longer-term agreements with relatively stable rates and implicit service expectations. This can stabilize demand for the carrier, but it also constrains operational flexibility when capacity becomes tight. Spot demand is more price-sensitive and uncertain, which often requires different acceptance and allocation decisions. This segmentation motivates deterministic slot allocation models that compute segment-specific allocations or acceptance thresholds as a planning backbone [5,16,19,20,21]. For instance, Wang et al. [20] studied seasonal-scale revenue management and formulated a deterministic optimization model that selects and routes multiple container types under volume and weight capacities while linking operating costs to operational decisions such as sailing speed and service deployment requirements. This study shows that even under deterministic demand, CSAP is tightly coupled with multiple capacity dimensions and cost drivers. In a later study, Wang et al. [21] investigated overbooking and delivery-delay-allowed strategies for slot allocation to reduce capacity waste and improve profitability when actual demand realization or shipment execution deviates from plan, thereby moving from pure static planning toward implementable operational control rules. More recently, CSAP research has increasingly incorporated stochastic demand and execution uncertainty, developing uncertainty-aware allocation models and control policies that hedge against demand volatility while maintaining feasibility under network capacity constraints [22,23,24,25,26,27,28].

In addition, CSAP decisions are rarely assessed solely by short-term profitability. Capacity efficiency is often treated as a key operational indicator in liner services, because insufficient utilization weakens scale economies and raises unit costs, yet maintaining some slack capacity may be necessary for operations, implying an inherent profit–utilization trade-off [15]. In this study, we characterize this trade-off using a profit–empty TEU-miles bi-objective setting, where empty TEU-miles measure the distance-weighted transport work associated with empty movements. This practical emphasis is also reflected in the slot allocation literature. For example, Lu et al. [16] formulate an integer-programming planning model that maximizes potential profits and report that it also provides a higher slot utilization rate. More explicitly, Wong et al. [17] develop a three-echelon slot allocation framework for yield and capacity utilization management, highlighting that utilization and revenue considerations are often treated jointly in operational planning.

2.2. Integrated CSAP and ECR Models

Beyond allocating scarce slots to laden container bookings, liner carriers must ensure that empty containers are available at the right ports and times to support export demand, which gives rise to the ECR problem [29]. Existing ECR studies can be broadly distinguished by the underlying decision network, including vessel-based repositioning on seaborne shipping networks, inland or inter-modal repositioning across depots and hinterland nodes, and ECR embedded in broader fleet or network planning problems [8,9,29,30]. Among these scopes, the seaborne liner shipping setting is particularly relevant to slot allocation, because empty containers are transported on the same maritime legs as laden cargo and therefore consume the same vessel slots. Consequently, ECR is inherently capacity-coupled with slot allocation in seaborne shipping networks, motivating integrated CSAP and ECR models that jointly determine laden allocations and empty flows under shared leg-capacity constraints.

A stream of studies couples empty and laden decisions within a single voyage or planning cycle, while treating empty containers as exogenous inputs to the allocation model. The authors of Ting and Tzeng [5] developed an integer-programming model that simultaneously allocates slots to laden and empty containers between port pairs, and enforces empty container repositioning through dedicated port-level replenishment requirements specified exogenous. Extending this idea from a single voyage to a seasonal planning horizon, Lu et al. [16] optimized slot allocation for a cyclic liner service and explicitly recognized that slot spaces must be assigned across heterogeneous container categories under practical vessel limits, while still relying on forecast- or policy-driven empty needs rather than endogenizing container circulation. Moving beyond allocation on a fixed service, Meng and Wang [31] integrated ECR into liner shipping service network design, showing how empty container flows can be embedded in broader network-structure decisions under maritime capacity constraints. More recently, Wong et al. [17] proposed a three-echelon slot allocation framework for yield and utilization management across planning levels, where empty container movements are part of the planning background but the emphasis is on coordinated slot allocation mechanisms. Overall, these studies demonstrate how empty-related considerations can be embedded into CSAP under shared capacity, yet most describe empty needs through exogenous targets or inputs rather than explicitly modeling inter-temporal container circulation and availability within the allocation model.

In contrast, another stream introduced port-side empty container inventories and balance over time, thereby linking container allocation decisions to the feasibility of empty container supply. The authors of Zurheide and Fischer [19] developed a network-based revenue-management formulation in which the service network is indexed with explicit time and ship-cycle relations and slot capacity is allocated to both laden and empty containers accordingly. More recently, Wang et al. [7] modeled port-side empty container state transitions and incorporated auxiliary expressions for remaining onboard containers across voyages. However, the correspondence between the voyage index and physical vessel cycles in a multi-vessel, multi-voyage service is left implicit, making the cross-voyage port-and-onboard container flows unclear. Moreover, to the best of our knowledge, existing integrated CSAP and ECR models are still predominantly formulated with a single economic objective, and explicit Pareto-based multi-objective formulations that jointly optimize profit and ECR effort remain scarce.

Table 1 summarizes representative studies across three closely related streams: CSAP, ECR, and integrated planning where CSAP and ECR are jointly considered. As the table shows, most CSAP studies are profit-driven and focus on booking acceptance and slot allocation decisions, whereas ECR is often handled separately through balancing, leasing, and inventory decisions, or is treated as an exogenous requirement. While integrated models exist, explicitly capturing the interaction between laden and empty decisions under shared vessel capacity remains limited in the literature. Moreover, only a limited number of studies go beyond purely profit-oriented objectives to explicitly quantify the operational burden of empty movements, and even fewer model and optimize the profit–empty TEU-miles trade-off in an integrated setting where laden and empty containers compete for the same vessel capacity. These observations motivate our profit–empty TEU-miles bi-objective formulation and the solution framework presented next.

2.3. Multi-Objective Optimization Methods in Maritime Logistics

In maritime logistics and liner operations, decision quality is usually assessed against multiple criteria rather than profit alone, such as capacity efficiency, equipment-balance stability, and operational robustness. These criteria can be intrinsically conflicting. For example, aggressively prioritizing short-term profit by allocating scarce slots to laden shipments may crowd out empty repositioning moves, which can exacerbate equipment imbalance and increase shortage-related operational costs later on.

Multi-objective optimization formalizes such trade-offs by seeking a set of Pareto-efficient solutions, where improving one objective necessarily worsens at least one other objective. Compared with scalarization approaches such as weighted-sum or

ϵ

-constraint methods, which can be sensitive to parameter choices and often require repeated solves to map a Pareto frontier, evolutionary multi-objective optimization offers a practical alternative for complex discrete planning models by generating a diverse approximation of the non-dominated set in a single run [32,33].

Among multi-objective evolutionary algorithms, the NSGA-II remains one of the most widely used baselines due to its fast non-dominated sorting, elitist selection, and crowding-distance mechanism for diversity preservation [34,35]. These features make NSGA-II particularly suitable for constrained, leg-coupled, integer decision problems in maritime logistics, where problem-specific encoding and feasibility-handling operators can be embedded to exploit structure and maintain solution feasibility during evolution.

Evidence from the maritime and port-operations literature further supports the suitability of NSGA-II for networked operational problems with competing objectives and complex feasibility structures. Representative port-side applications include multi-objective berth allocation and integrated berth–quay–crane planning at container terminals, where modified or enhanced NSGA-II variants embed constraint-handling and local search to cope with tightly coupled spatial–temporal constraints [36,37,38]. On the seaborne side, Song et al. [39] formulate a stochastic multi-objective tactical planning problem for liner services under uncertain port times and solve it using a simulation-based NSGA-II, while Ma et al. [40] apply NSGA-II to route and speed optimization under weather conditions and emission control area regulations to generate explicit cost–time trade-offs. These studies suggest that NSGA-II provides a natural foundation for our setting, where we aim to reveal the trade-off between profit and empty repositioning effort, quantified by empty TEU-miles, in an integrated CSAP and ECR model.

3. The Integrated Slot Allocation and Empty Repositioning Problem

3.1. Problem Setting

We study a short-term slot planning problem on a given liner service network. Route design, service schedules, and vessel deployment are taken as exogenous, and the focus is on voyage-level slot allocation for laden container shipments and empty container repositioning. Under this setting, the carrier uses limited vessel slot capacity both to transport laden containers requested by shippers and to reposition empty containers to mitigate imbalances across ports. Laden and empty containers therefore compete for the same slots, implying that allocation and repositioning decisions should be determined jointly rather than in isolation.

Laden container booking requests are heterogeneous. Following common practice, we distinguish contract shipments from spot-market shipments [28]. Contract shippers book capacity under long-term agreements and typically receive lower rates. In return, the carrier is expected to provide a guaranteed level of fulfillment over the planning horizon, which may be specified through minimum volume commitments or agreed reliability targets. Importantly, freight is generally charged on actually shipped volume, while failing to meet contractual commitments may lead to penalties or loss of future business. Spot shippers, in contrast, request capacity on a short-term basis with higher market-driven rates and without explicit fulfillment commitments. This heterogeneity implies that the carrier must balance contractual fulfillment and revenue opportunities, while reserving sufficient capacity for ECR to ensure equipment availability at export-oriented ports in subsequent voyages.

3.1.1. Service Network and Planning Horizon

We consider a liner company operating a cyclic service with a fixed port rotation. Let

L = {1, 2, \dots, L}

denote the ordered set of port-call indices along the route, where a physical port may appear multiple times and thus have multiple indices. Each consecutive pair of port calls defines a directed shipping leg. We denote the set of legs by

A

, indexed by a, where legs are in one-to-one correspondence with origin port-call indices. Specifically,

a (i) = (i, i + 1)

for

i \in {1, \dots, L - 1}

, and the wrap-around leg is

a (L) = (L, 1)

. The planning horizon contains a sequence of voyages

V = {1, 2, \dots, V}

, and voyage v is operated by a predetermined vessel with slot capacity

C a p^{v}

(in TEU). A container loaded at port-call index i and discharged at index j occupies slots on every leg along the forward path from i to j on the cyclic route. Once discharged, those slots become available again for downstream shipments, so OD flows compete for capacity on overlapping legs. To represent transportation requests on this cyclic service, we define the set of origin–destination pairs as

OD

, indexed by

(i, j)

, allowing both

i < j

and

j < i

to capture wrap-around demand. Laden booking requests are classified into a finite set of service classes

S

that reflect heterogeneous rates and service-level requirements. In particular, we distinguish two classes, namely contract shipments and spot-market shipments. Contract shipments are associated with explicit service-fulfillment requirements specified in long-term agreements, whereas spot-market shipments are accepted on a short-term basis without such commitments and typically yield higher market-driven rates.

3.1.2. Integrated Decisions and Container Sourcing

The integrated planning problem jointly determines (i) voyage-level acceptance and execution of laden container bookings and (ii) empty container repositioning over the planning horizon. For each voyage

v \in V

, OD pair

(i, j) \in OD

, and service class

s \in S

, the carrier decides the accepted bookings

z_{i j}^{s v}

and the corresponding executed shipments. We distinguish between shipments executed with carrier-owned equipment and those executed with externally leased equipment. Let

x_{i j}^{s v}

denote the shipped laden volume (in TEU) of class s on OD pair

(i, j)

in voyage v using owned containers. When local availability of owned empties is insufficient, the carrier may lease containers from external lessors. Let

r_{i j}^{s v}

denote the shipped laden volume executed using leased containers. The unit leasing cost

ρ_{i j}^{v}

is modeled as a daily rental rate multiplied by an expected on-hire duration, approximated by the OD transportation duration

t_{i j}

.

Due to spatial mismatches between where empty containers become available and where laden demand arises, the carrier may reposition empty containers between ports. Let

x_{i j}^{E v}

denote the repositioning flow (in TEU) of empty containers shipped from i to j in voyage v. At each port-call index, owned empty containers from three sources can be used to support outbound laden shipments or be sent as repositioned empties, including on-hand inventory, inbound repositioned empties, and empties generated by discharging owned laden containers. Accordingly, the remaining owned empty inventory after serving port-call index i in voyage v is denoted by

I_{i}^{v}

, which becomes the available stock for subsequent decisions. We assume leased containers are on-hired at the origin and off-hired upon arrival, so shipments executed with owned containers

x_{i j}^{s v}

participate in container circulation and affect the evolution of owned empty inventories, whereas leased shipments

r_{i j}^{s v}

do not.

Finally, accepted bookings may not be executed on the intended voyage due to slot or empty shortages. Such quantities are deferred to subsequent voyages. We use

h_{i j}^{s v}

to track stranded bookings, which incur delay-type costs when carried over to later voyages, while any remaining stranded quantity at the end of the planning horizon is penalized as unserved demand.

3.1.3. Performance Measures

The above decisions are evaluated from both financial and operational perspectives. On the one hand, the carrier aims to maximize profit by allocating scarce slots to revenue-generating laden container shipments while accounting for transportation, leasing, inventory holding, and delay or shortage penalties. On the other hand, ECR consumes vessel capacity without generating freight revenue and can be substantial on cyclic services [17]. We therefore measure the operational burden of repositioning using a distance-weighted metric in empty TEU-miles, which captures the transport work performed for empty containers over all legs in the planning horizon. Accordingly, we formulate a bi-objective optimization model that maximizes total profit while minimizing empty TEU-miles, making the trade-off between profitability and ECR intensity explicit.

3.2. Model Assumptions

To keep the formulation tractable while preserving the key operational features of the integrated CSAP and ECR problem, we make the following assumptions:

Fixed service network and sailing schedule. The shipping routes, service schedules, and vessel capacities over the planning horizon are given. The model optimizes how to use the announced capacity through laden acceptance, empty repositioning, and leasing decisions, rather than adjusting service frequency, speed, or port calls.
Contract fulfillment requirement over the planning horizon. Contract demand is subject to an explicit fulfillment requirement over the planning horizon. We enforce a reliability-based service target by requiring that the cumulative shipped volume for each contract OD pair covers the cumulative contract demand with at least a prescribed fulfillment rate.
Deferred fulfillment of unserved shipments with delay cost. When demand cannot be loaded on the intended sailing due to slot or empty shortages, it is stranded and may be served by subsequent sailings within the horizon. Each period of delay incurs a demurrage or holding penalty to reflect service deterioration.
External leasing as an empty container sourcing option. If internal repositioning and local stocks are insufficient, the carrier may lease empties at origin ports. Leasing is priced on a per-day basis, and the expected on-hire duration is OD-dependent, reflecting heterogeneous transit and turnaround cycles. Leased containers are treated as exogenous supply and are not assumed to permanently expand the carrier-owned container storage.

3.3. Mathematical Formulation

In this subsection, we formulate a bi-objective mixed-integer model for the integrated CSAP and ECR problem over multiple voyages. The formulation jointly determines booking acceptance, shipment execution, and empty procurement/repositioning decisions under vessel capacity and inventory feasibility requirements.

3.3.1. Notations

The notations used throughout the formulation are summarized in Table 2.

We introduced a set of auxiliary variables here, denoted as

y_{a}^{v}

, representing for the total onboard volume (in TEU) on shipping leg

a \in A

during voyage

v \in V

. On a cyclic route, an OD flow occupies slots on every leg along its transportation path. In our setting, laden shipments may be carried either by owned containers (

x_{j k}^{s v}

) or by leased containers (

r_{j k}^{s v}

). Although leased containers are off-hired upon arrival and thus do not contribute to the carrier’s owned empty-inventory evolution, they occupy vessel slots during transportation and therefore must be counted toward onboard slot occupancy. In addition, empty repositioning flows

x_{j k}^{E v}

also consume slots along the legs they traverse.

Voyages are indexed in chronological order by their departures from the origin port. We assume a regular cyclic deployment with M vessels on the service. Accordingly, a given vessel operates the voyage set

{v + k M ∣ k = 0, 1, 2, \dots}

within the planning horizon. This deployment implies that, for wrap-around OD pairs, the onboard volume observed on an early leg of voyage v may include containers that were loaded in the previous voyage operated by the same vessel, namely voyage

v - M

. To capture this effect, we use

o (a)

to denote the origin (tail) port-call index of leg a. For an OD pair

(j, k)

traversing leg a, if the loading index j has already been visited within the current voyage before reaching leg a (i.e.,

j \leq o (a)

), the onboard contribution comes from voyage v; otherwise (i.e.,

j > o (a)

), the onboard contribution comes from voyage

v - M

.

For the first M voyages, we assume no pre-existing onboard containers from prior operations at the beginning of the planning horizon. Hence,

y_{a}^{v} = \sum_{\begin{matrix} (j, k) \in OD : \\ j \leq o (a) \end{matrix}} δ_{j k}^{a} (\sum_{s \in S} (x_{j k}^{s v} + r_{j k}^{s v}) + x_{j k}^{E v}), \forall a \in A, \forall v \in {1, \dots, M} .

(1)

For subsequent voyages, the onboard volume on leg a in voyage v consists of two parts: (i) containers loaded in voyage v whose loading indices satisfy

j \leq o (a)

, and (ii) wrap-around containers loaded in voyage

v - M

whose loading indices satisfy

j > o (a)

:

\begin{matrix} y_{a}^{v} = \sum_{\begin{matrix} (j, k) \in OD : \\ j \leq o (a) \end{matrix}} δ_{j k}^{a} (\sum_{s \in S} (x_{j k}^{s v} + r_{j k}^{s v}) + x_{j k}^{E v}) + \sum_{\begin{matrix} (j, k) \in OD : \\ j > o (a) \end{matrix}} δ_{j k}^{a} (\sum_{s \in S} (x_{j k}^{s (v - M)} + r_{j k}^{s (v - M)}) + x_{j k}^{E (v - M)}), \\ \forall a \in A, \forall v \in {M + 1, \dots, V} . \end{matrix}

(2)

To explain the cross-voyage accounting in (1) and (2), Figure 1 illustrates three representative cases on a cyclic route and clarifies why splitting OD pairs by

j \leq o (a)

and

j > o (a)

is sufficient to determine whether the onboard contribution on leg a in voyage v comes from variables of voyage v or from the previous cycle

v - M

. Panels (a) and (c) correspond to part (i) in (2): leg a is traversed after the vessel visits the loading port j within voyage v, so the containers occupying leg a are those loaded in voyage v. This situation is captured by

j \leq o (a)

and is therefore accounted for by the first summation term. Only in the wrap-around case in panel (b) (

o (a) < k < j

) is leg a traversed before visiting j in voyage v; in this case, the containers occupying leg a in voyage v must have been loaded in voyage

v - M

by the same vessel. This situation is captured by

j > o (a)

and is therefore accounted for by the second summation term in (2). Accordingly, (1) and (2) provide a compact onboard-accounting equality under regular cyclic deployment, compared with the more general formulation in Wang et al. [7].

3.3.2. Objective Functions

The proposed problem is inherently multi-objective. For clarity and focus, we consider two objectives in this study. On the one hand, the carrier aims to maximize profit by allocating limited vessel slots to revenue-generating laden shipments, while accounting for transportation costs of both laden and empty moves, external leasing or procurement costs when owned empty containers are insufficient, inventory holding costs for residual empties at ports, and penalty costs associated with deferred and ultimately unshipped accepted bookings. On the other hand, since empty repositioning consumes slot capacity and transportation effort without directly generating freight revenue, the carrier also seeks to limit unnecessary empty movements. We therefore minimize the total empty TEU-miles over the planning horizon. These two objectives are generally conflicting, leading to a set of Pareto-optimal solutions that reveal the trade-off between profit and empty repositioning effort [32]. The two objective functions are defined as follows.

Objective 1: Profit maximization.

The profit objective equals the freight revenue generated by shipped laden containers minus the container transportation costs, empty container repositioning costs, external leasing costs, inventory holding costs, and penalty costs arising from postponed and ultimately unshipped accepted bookings:

\begin{matrix} max Z_{1} = & \sum_{v \in V} \sum_{(i, j) \in OD} \sum_{s \in S} f_{i j}^{s v} (x_{i j}^{s v} + r_{i j}^{s v}) - \sum_{v \in V} \sum_{(i, j) \in OD} \sum_{s \in S} c_{i j}^{L v} (x_{i j}^{s v} + r_{i j}^{s v}) - \sum_{v \in V} \sum_{(i, j) \in OD} c_{i j}^{E v} x_{i j}^{E v} \\ - \sum_{v \in V} \sum_{(i, j) \in OD} \sum_{s \in S} ρ_{i j}^{v} r_{i j}^{s v} - \sum_{v \in V} \sum_{i \in L} ζ_{i} I_{i}^{v} - \sum_{v \in V ∖ {V}} \sum_{(i, j) \in OD} \sum_{s \in S} κ_{i j}^{s v} h_{i j}^{s v} \\ - \sum_{(i, j) \in OD} \sum_{s \in S} π_{i j}^{s} h_{i j}^{s V} . \end{matrix}

(3)

For

v \in V ∖ {V}

, stranded bookings

h_{i j}^{s v}

are carried over to subsequent voyages and incur a postponement cost

κ_{i j}^{s v}

, capturing demurrage-type charges or service-recovery expenses due to delayed fulfillment. At the end of the planning horizon, any remaining stranded bookings

h_{i j}^{s V}

cannot be postponed further and therefore incur a terminal penalty

π_{i j}^{s}

, representing contractual compensation, loss of goodwill, or the cost of outsourcing transportation to alternative carriers.

Objective 2: Minimization of empty TEU-miles.

To limit unnecessary empty repositioning effort, we minimize the total transport work induced by repositioned empty containers over the planning horizon. Accordingly, the second objective is defined as

\begin{matrix} min Z_{2} = \sum_{v \in V} \sum_{a \in A} \sum_{(j, k) \in OD} δ_{j k}^{a} d_{a} x_{j k}^{E v} . \end{matrix}

(4)

3.3.3. Model Constraints

The above objectives are optimized subject to operational constraints on (i) booking acceptance and contract fulfillment, (ii) inter-voyage shipment execution and postponement handling, (iii) vessel-slot capacity on each shipping leg, and (iv) owned empty-inventory dynamics across voyages. Capacity is enforced on every leg along the cyclic route, reflecting that each OD flow occupies slots on all intermediate legs until discharge. External leasing provides an additional equipment-sourcing option when local owned empty availability is insufficient. The complete bi-objective model is given as follows.

\begin{matrix} max & Z_{1} \\ min & Z_{2} \end{matrix}

\begin{matrix} s . t . & z_{i j}^{s v} \leq q_{i j}^{s v} & \forall (i, j) \in OD, \forall s \in S, \forall v \in V . \end{matrix}

(5)

\begin{matrix} \sum_{v \in V} (x_{i j}^{s v} + r_{i j}^{s v}) \geq β_{i j} \sum_{v \in V} q_{i j}^{s v}, & \forall (i, j) \in OD, s = C 2 \end{matrix}

(6)

\begin{matrix} h_{i j}^{s 0} = 0, & \forall (i, j) \in OD, \forall s \in S \end{matrix}

(7)

(1) and (2),

\begin{matrix} h_{i j}^{s (v - 1)} + z_{i j}^{s v} = x_{i j}^{s v} + r_{i j}^{s v} + h_{i j}^{s v}, & \forall (i, j) \in OD, \forall s \in S, \forall v \in V \\ () - - (), \end{matrix}

(8)

\begin{matrix} y_{a}^{v} \leq C a p^{v}, & \forall a \in A, \forall v \in V \\ I_{i}^{v} = I_{i}^{v - 1} + \sum_{j : (j, i) \in OD} \sum_{s \in S} x_{j i}^{s v} + \sum_{j : (j, i) \in OD} x_{j i}^{E v}, \end{matrix}

(9)

\begin{matrix} - \sum_{k : (i, k) \in OD} \sum_{s \in S} x_{i k}^{s v} - \sum_{k : (i, k) \in OD} x_{i k}^{E v}, & \forall i \in L, \forall v \in V \end{matrix}

(10)

\begin{matrix} z_{i j}^{s v}, x_{i j}^{s v}, x_{i j}^{E v}, r_{i j}^{s v}, h_{i j}^{s v} \in Z, & \forall (i, j) \in OD, \forall s \in S, \forall v \in V \end{matrix}

(11)

\begin{matrix} I_{i}^{v} \geq 0, & \forall i \in L, \forall v \in V \end{matrix}

(12)

\begin{matrix} y_{a}^{v} \geq 0, & \forall a \in A, \forall v \in V \end{matrix}

(13)

Constraint (5) limits booking acceptance. For each voyage v, OD pair

(i, j)

, and service class s, the accepted bookings

z_{i j}^{s v}

cannot exceed the corresponding demand

q_{i j}^{s v}

.

Constraint (6) enforces the minimum contract-fulfillment requirement over the planning horizon. It requires that the cumulative executed volume of contract shipments on each OD pair, including both owned-container and leased-container executions, reaches at least a prescribed fraction

β_{i j}

of the planned contract demand.

Constraints (7) and (8) govern shipment execution and the carryover of unshipped bookings across voyages. We initialize stranded bookings by (7). In each voyage v, the total amount to be served consists of newly accepted bookings

z_{i j}^{s v}

and the previously stranded amount

h_{i j}^{s (v - 1)}

. Constraint (8) balances these quantities by splitting them into executed shipments using owned containers

x_{i j}^{s v}

, executed shipments using leased containers

r_{i j}^{s v}

, and the remaining stranded amount

h_{i j}^{s v}

carried forward. These stranded quantities are penalized in the profit objective through postponement costs for

v < V

and terminal penalties at

v = V

.

Constraints (1) and (2) together with (9) enforce vessel capacity on each shipping leg. The onboard volume

y_{a}^{v}

aggregates all laden and empty flows traversing leg a via the OD–leg incidence parameter

δ_{i j}^{a}

, including owned-container shipments

x_{i j}^{s v}

, leased-container shipments

r_{i j}^{s v}

, and repositioned empties

x_{i j}^{E v}

. Constraint (9) then limits

y_{a}^{v}

by the slot capacity

C a p^{v}

on every leg.

Constraint (10) describes the evolution of owned empty inventories across voyages. The state variable

I_{i}^{v}

is updated by adding owned empties generated by discharging owned-container shipments and inbound repositioned empties from the previous voyage, and subtracting owned empties consumed by loading owned-container shipments and by outbound repositioning departures in voyage v. Leased containers are assumed to be on-hired at the origin and off-hired at the destination. Therefore, the leased-shipment decision

r_{i j}^{s v}

occupies vessel slots during transportation but does not enter the owned empty-inventory balance.

Finally, constraints (11)–(13) specify variable domains.

4. Solution Algorithm

This section presents a hybrid solution framework that integrates a multi-objective evolutionary algorithm with reinforcement learning to solve the integrated slot allocation and empty container repositioning problem. The proposed approach is designed to handle the high-dimensional decision space, nonlinear interactions between laden and empty flows, and tight vessel slot-capacity constraints via repair and penalty mechanisms.

4.1. Overall Framework

We develop a hybrid learning-augmented heuristic that integrates NSGA-II with a lightweight Q-learning controller, named NSGA-II-RL, to solve the integrated slot allocation and empty container repositioning problem, balancing profit and repositioning effort. The method combines evolutionary search for Pareto approximation with online learning for adaptive operator control, and the overall procedure is summarized in Algorithm 1.

Algorithm 1: NSGA-II-RL

The problem involves a high-dimensional decision space and strong nonlinear coupling between laden container acceptance, empty container repositioning, and voyage-leg capacity usage. The resulting trade-offs between economic performance and repositioning effort are difficult to reflect with a pre-specified scalarization and can exhibit multiple locally efficient patterns. NSGA-II is therefore adopted as the main search engine because it can efficiently approximate a diverse set of non-dominated solutions in multi-objective settings while maintaining frontier diversity through elitist selection and crowding-based preservation.

Although NSGA-II provides a robust baseline, its performance is sensitive to operator configurations, especially under a discrete decode–repair–evaluate routine. Early generations often benefit from stronger exploration, whereas later generations require more conservative variation for refinement. In addition, feasibility is challenging due to capacity coupling across voyage legs. If repair is too aggressive, diversity may collapse; if the repair is too weak, infeasible offspring may dominate and slow down progress. We therefore introduce an RL controller to adapt mutation intensity and repair behavior online based on population-level feedback, rather than relying on fixed parameters.

The hybrid method can be viewed as a two-level procedure. At the population level, NSGA-II iteratively generates offspring and updates the population through selection and elitist replacement. At the individual level, every offspring must pass through a fixed processing routine before it can be compared and selected. This routine links the real-coded search space with discrete operational decisions and feasibility handling, and it forms the main interface where evolutionary operators and problem constraints interact.

Each candidate solution is represented as a real-coded chromosome. For evaluation, the chromosome is decoded into integer voyage-level decisions for laden acceptance and empty repositioning. The decoded solution may then be repaired to mitigate capacity overloads on sailing legs; remaining overloads are handled through a penalty term during evaluation. After repair, the phenotype is written back to the chromosome as a Lamarckian update, so that subsequent genetic operators act on an already feasible or near-feasible representation. Finally, the repaired solution is evaluated by sequential inventory propagation along the port-call sequence, where owned empty containers are allocated to repositioning first, and remaining inventory is used to serve laden shipments with leasing covering any unmet laden demand.

The RL controller operates at the generation level and only controls operator and repair settings, while the decode–repair–evaluate routine described above remains unchanged. At the beginning of each generation, the agent observes a compact state that summarizes search progress, population diversity, and feasibility status of the current population. It then selects an action that determines the operator configuration for the current generation, including mutation intensity and repair behavior used during offspring creation and feasibility handling. After NSGA-II completes offspring evaluation and elitist environmental selection, the agent receives a reward based on hypervolume improvement together with an additional penalty when the population feasibility rate falls below a target level, and updates its action value table for subsequent generations.

4.2. NSGA-II-Based Evolutionary Search

We adopt NSGA-II as the main evolutionary engine to approximate the trade-off frontier between the two objectives. Each solution is encoded as a real-coded chromosome that concatenates the voyage-level laden allocation variables and the empty repositioning variables. During evaluation, chromosomes are decoded into integer decisions by rounding, and the laden part is further capped by demand to avoid systematic overflow caused by rounding. This design allows standard genetic operators to work in a continuous search space, while the decoded phenotype remains consistent with discrete operational decisions. Let

n_{var}

denote the chromosome length. The procedure Evaluate

(\cdot)

in Algorithm 1 implements this fixed decode–repair–evaluate routine and returns objective values together with the capacity-violation metric.

As formulated in Section 3, let

Z_{1}

denote the total profit and

Z_{2}

denote the empty TEU-miles. Since NSGA-II is implemented in a minimization form, we use the following objective vector for evolutionary search:

\begin{matrix} f_{1} = - Z_{1} + λ_{cap} CapViol, \end{matrix}

(14)

\begin{matrix} f_{2} = Z_{2}, \end{matrix}

(15)

where

CapViol

is the aggregated vessel slot capacity violation over all sailing legs, and

λ_{cap}

is the penalty coefficient.

The initial population is generated by a mixed strategy that balances exploration and informed seeding. Most individuals are sampled uniformly within variable bounds. A subset is generated by a greedy heuristic biased toward moving empties to OD pairs associated with higher leasing costs. Another subset sets all empty-repositioning genes to zero to provide a profit-oriented baseline. The population is then shuffled to avoid positional bias before evolution starts.

At each generation, parent solutions are selected using binary tournament selection based on Pareto rank and crowding distance. Selected parents undergo simulated binary crossover with crossover probability

p_{cx}

and distribution index

η_{c}

, followed by polynomial mutation with mutation probability

p_{mut}

and distribution index

η_{m}

. When not specified, the default mutation probability is set to

p_{mut} = 1 / n_{var}

. All operators are applied in the real-coded chromosome space, and offspring genes are clipped to their bounds to preserve feasibility of the encoding. The procedure Variation

(\cdot)

consists of binary tournament selection, SBX crossover, and polynomial mutation under the parameter set

Θ_{g}

.

4.2.1. Offspring Evaluation and Feasibility Handling

Each offspring chromosome is processed by a fixed decode–repair–evaluate routine before it can participate in non-dominated sorting. First, genes are decoded and rounded to obtain integer voyage-level decisions, and the laden decisions are capped by the corresponding demands to eliminate rounding-induced overflow. The resulting decisions are then evaluated through sequential inventory propagation along the port-call sequence, where owned empty containers are allocated to repositioning first, and remaining inventory is used to serve laden shipments. Any unmet laden demand is covered by leasing, so infeasibility due to insufficient inventories is avoided by construction.

Vessel slot-capacity feasibility on each leg is handled by a dedicated repair operator that scales down flows on overloaded legs under different prioritization modes, including laden-first, empty-first, and balanced. The repair is applied before evaluation, possibly under a prescribed probability in the NSGA-II-RL variant. If overloads remain after repair, they are quantified by

CapViol

and penalized in

f_{1}

via the coefficient

λ_{cap}

. After repair, the repaired phenotype is written back into the chromosome as a Lamarckian update, so that subsequent crossover and mutation operate on an already feasible or near-feasible genotype.

4.2.2. Environmental Selection

NSGA-II adopts elitist environmental selection to form the next generation. Parent and offspring populations are merged and ranked by fast non-dominated sorting into Pareto fronts. Fronts are inserted into the next generation in ascending order of rank until the population limit is reached. If the last admissible front cannot be fully included, individuals in that front are selected in descending order of crowding distance to preserve diversity along the frontier. The algorithm outputs the first non-dominated front of the final population in Algorithm 1.

In the NSGA-II-RL variant, the evolutionary loop described above remains unchanged. The Q-learning controller only adjusts operator and repair settings across generations, while the decode–repair–evaluate routine stays the same. The state design, action-to-parameter mapping, and reward definition are detailed in the next subsection.

4.3. Reinforcement Learning Assisted Operator Control

NSGA-II provides a strong baseline for multi-objective search, yet its performance is sensitive to operator configurations at different stages of evolution. Fixed variation settings may lead to insufficient exploration in early generations or slow refinement in later generations, especially under the discrete decode–repair–evaluate routine. We therefore introduce a lightweight Q-learning controller to adapt mutation intensity and repair behavior online based on population-level feedback, while keeping the decoding, repair, and evaluation pipeline unchanged.

4.3.1. MDP Formulation and Controlled Parameters

The interaction between NSGA-II and the RL controller is formulated as a generation-level Markov decision process. At generation g, the environment is characterized by the current population

P_{g}

and summary indicators of its diversity and feasibility. The agent selects an action

a_{g}

, which is mapped to a generation-specific operator configuration

\begin{matrix} Θ_{g} = (p_{mut}^{(g)}, η_{c}^{(g)}, η_{m}^{(g)}, p_{rep}^{(g)}, χ_{rep}^{(g)}) . \end{matrix}

(16)

Here

p_{mut}^{(g)}

is the mutation probability;

η_{c}^{(g)}

and

η_{m}^{(g)}

are the distribution indices of SBX crossover and polynomial mutation;

p_{rep}^{(g)}

is the probability of applying the capacity repair; and

χ_{rep}^{(g)}

specifies the repair mode. The agent does not modify decision variables directly. Instead, it steers the search dynamics by adjusting variation and feasibility-handling behaviors across generations. The baseline parameter set

Θ^{0}

in Algorithm 1 provides default values for these controlled parameters when actions do not overwrite them.

4.3.2. State Representation

We use a compact discrete state to summarize search progress, population diversity, and feasibility. Progress is categorized into three stages based on the ratio

g / G

. Diversity is measured by the average finite crowding distance of the current population, and feasibility is measured by the fraction of solutions whose total constraint violation is below a tolerance. In this study, the violation metric mainly reflects voyage-leg capacity overloads, since inventory-related shortfalls are resolved by truncation of empty repositioning and leasing for laden demand during evaluation. All thresholds and tolerances used for state discretization are user-specified constants. The resulting state space contains

3 \times 2 \times 2 = 12

states and is indexed by

\begin{matrix} s_{g} = 4 p_{g} + 2 d_{g} + b_{g}, \end{matrix}

(17)

where

p_{g} \in {0, 1, 2}

is the progress stage,

d_{g} \in {0, 1}

indicates whether diversity exceeds a threshold, and

b_{g} \in {0, 1}

indicates whether the feasibility rate exceeds a target level.

4.3.3. Action Design and Parameter Mapping

The agent selects one of three actions representing exploration, balance, and exploitation. Each action corresponds to a predefined mapping from

a_{g}

to

Θ_{g}

. This mapping implements MapAction

(\cdot)

in Algorithm 1. The exploration action increases variation intensity and applies repair with moderate probability to avoid population collapse. The balanced action uses baseline variation settings and higher repair probability to stabilize feasibility. The exploitation action reduces variation intensity for refinement and applies repair more aggressively with the laden-first mode.

4.3.4. Reward and Q-Learning Update

After offspring evaluation and elitist environmental selection, the agent receives a scalar reward based on population-level improvement. This reward computation corresponds to Reward

(\cdot)

in Algorithm 1. We employ the hypervolume improvement of the first non-dominated front as the primary signal and add an extra penalty when the population feasibility rate falls below a target level. Hypervolume is computed using a fixed reference point and consistent normalization across generations. The Q-table is updated online using the standard Q-learning rule with an

ϵ

-greedy behavior policy, where

ϵ

is the exploration rate,

α

is the learning rate, and

γ

is the discount factor. The learned Q-table can be stored and reused to warm-start subsequent runs.

5. Numerical Experiments

5.1. Data Description

We use the standardized benchmark dataset LINERLIB proposed by Brouer et al. [41] for the Liner Shipping Network Design Problem. The dataset provides real-world instances with port sets, inter-port distances, fleet information, and an origin–destination demand matrix. Since our study focuses on the downstream operational stage given a fixed service network, we do not use the original design instances directly. Instead, we randomly select five services the published solution of Brouer et al. [41]. Each service is operated as a weekly string. Let

T_{r}

denote the round-trip cycle time of route r measured in weeks. The vessel capacity on each route is taken from the deployed vessel class in the corresponding solution and is reported in TEU. Table 3 summarizes the selected routes and their key attributes.

The original LINERLIB solution provides a representative weekly OD demand matrix and the inter-port distances. To construct a multi-week planning horizon, we generate weekly demand scenarios following Wang et al. [7]. Specifically, for each OD pair, the realized weekly demand is sampled around its nominal value with a coefficient of variation

CV = σ / μ = 0.1

. For each OD pair and each week, the realized demand is split into two parts, with

50 %

treated as contract demand and the remaining

50 %

treated as spot demand.

The revenue and cost parameters are calibrated based on Wang et al. [7] and adjusted to match the scale of the selected services. The resulting parameter values are summarized in Table 4. Note that we do not assume exogenous empty container demand. Empty repositioning is modeled as a derived operational decision driven by imbalanced laden flows and the availability of empty inventories at ports.

5.2. Experimental Setup and Evaluation Protocol

We evaluate the proposed NSGA-II-RL against two reference methods. The first is a baseline NSGA-II where the Q-learning controller is disabled and the operator configuration remains fixed throughout the run (Algorithm A1). The second is a random-control variant that samples an action uniformly at random at each generation and applies the corresponding operator and repair settings, while disabling Q-learning updates. All methods share the same solution encoding, decode–repair–evaluate routine (Evaluate), feasibility repair mechanism, objective evaluation procedure, population size N, and stopping criterion (generations G) to ensure a fair comparison.

Solution quality is measured by the hypervolume (HV) indicator computed from the non-dominated set returned by each run. We report the mean and standard deviation of the final-generation HV over independent runs, together with the best HV observed. To characterize feasibility and search stability, we track the feasibility rate, defined as the fraction of solutions whose total constraint violation is below a tolerance

ϵ_{cv}

, and report its trajectory across generations. Runtime is recorded under the same implementation environment for all methods. Hypervolume is computed on normalized objectives using fixed bounds

f^{min}

and

f^{max}

and a fixed reference point

r

. The same normalization bounds and reference point are used for all methods and all runs on the same instance. Detailed hyperparameter values, including normalization bounds and the reference point, are provided in Appendix B.

The Q-learning controller operates at the generation level. It observes a compact discrete state derived from progress stage, diversity level, and feasibility level. Progress is categorized into three stages based on the ratio

g / G

. Diversity is measured by the average crowding distance of the current population and compared with a threshold

τ_{C D}

. Feasibility is measured by the feasibility rate

ϕ^{g}

and compared with a target level

τ_{ϕ}

, where feasibility is judged using tolerance

ϵ_{cv}

on total constraint violation. The resulting state space contains

3 \times 2 \times 2 = 12

discrete states. The action set contains three actions corresponding to exploration, balance, and exploitation. Each action maps to a predefined operator and repair configuration Θ_g = Mapaction (a_g, Θ⁰) , which determines the settings used by Variation

(\cdot)

and Evaluate

(\cdot)

. The reward is based on hypervolume improvement with an additional penalty when the population feasibility rate falls below the target. The controller adopts an

ϵ

-greedy behavior policy and updates action values using learning rate

α

and discount factor

γ

. The Q-table is initialized as zeros and can be warm-started by loading a previously saved table.

5.3. Computational Results

We report results on the five routes presented in Table 3. For each route, we generate three instance sizes by scaling the planning horizon length as

V_{r, k} = k \cdot n_{r}

with

k \in {2, 5, 10}

, where

n_{r}

is the cycle length in weeks under weekly departures. For each route–size pair, each method is independently executed for 10 runs with different random seeds.

5.3.1. Overall Comparison on Solution Quality

Table 5 reports the overall performance across the 5 selected routes and three horizon scales. Several consistent patterns can be observed. All statistics are computed over 10 independent runs per route–size case.

First, NSGA-II-RL achieves the highest mean hypervolume in all route–size cases. Across the 15 route–size pairs, the average improvement in HV mean over the Baseline is

0.00125

. The gain increases with the planning-horizon scale, with average improvements of

0.00074

for the short cases,

0.00136

for the medium cases, and

0.00164

for the long cases. Although the absolute differences are numerically small, the improvement is consistent and becomes more pronounced when the horizon is sufficiently long, where the controller can accumulate feedback and stabilize its action selection.

Second, feasibility behavior clearly separates the three methods. Both NSGA-II-RL and the Baseline achieve a final feasibility rate of

1.0

on all cases, meaning every run terminates with fully feasible solutions under the adopted tolerance. In contrast, the random-control variant shows systematically lower final feasibility rates, with an average of about

0.953

across cases. This is consistent with the role of operator control in our framework, where mismatched operators can slow down feasibility recovery and leave part of the population infeasible at termination.

Finally, runtime is comparable across methods. Runtime increases with the number of voyages as expected, while the differences between NSGA-II-RL and the Baseline are small. In most cases, NSGA-II-RL incurs no noticeable computational overhead despite the additional Q-learning update.

5.3.2. Convergence Behavior and Stability

To reveal how the performance differences arise, we examine the generation-wise evolution of hypervolume (HV) and feasibility rate. Figure 2 provides a consolidated view across Routes 1–5 and three horizon scales

k \in {2, 5, 10}

, where rows correspond to routes and columns correspond to horizon scales. Each panel is a dual-axis plot, with HV on the left axis and feasibility rate on the right axis. Curves show the mean trajectory across runs and the shaded band indicates one standard deviation.

Overall, NSGA-II-RL attains higher HV than the two reference methods across the tested settings. The random-control variant exhibits lower HV and larger fluctuations, and its feasibility rate is less stable over generations, which is consistent with the final feasibility reported in Table 5.

5.3.3. Pareto Front Visualization and Trade-Off Interpretation

To visualize the trade-off between profit maximization and empty repositioning effort, we plot the Pareto front approximations across Routes 1–5 under three horizon scales

k \in {2, 5, 10}

. For each method, we merge the final non-dominated sets from 10 independent runs and extract the non-dominated subset of the union for visualization. This merged front reduces sensitivity to any single seed and better reflects the attainable trade-off set of each method.

Figure 3 shows that NSGA-II-RL yields a stronger trade-off set overall across routes and horizon scales. The RL-controlled front is shifted toward the upper-left region, indicating that it can achieve higher profit with the same empty TEU-miles, or equivalently reduce empty repositioning effort without sacrificing profit. The advantage is often most visible in the mid-range of the front where operational decisions face stronger tension between allocating slots to laden demand and repositioning empties. In contrast, the baseline and random-control variants exhibit very similar coverage in many panels, with only modest differences over certain ranges.

6. Discussion

6.1. Managerial Interpretation of the Bi-Objective Trade-Off

Our model highlights a fundamental profit–empty repositioning effort trade-off in integrated slot allocation and empty repositioning, where laden and empty containers compete for the same vessel capacity. In this study, we operationalize empty repositioning effort by empty TEU-miles, which reflects the transport work devoted to empty movements over the planning horizon.

Profit-oriented solutions allocate more capacity to revenue-generating laden flows and tend to reduce or postpone empty repositioning on some legs. Such solutions are attractive when freight rates are high or when capacity is tight, but they may exacerbate equipment imbalance at certain ports in later voyages and increase the risk of shortage-related costs or external sourcing needs, depending on the operational context. In contrast, solutions with lower empty TEU-miles emphasize a more conservative repositioning plan in terms of empty repositioning effort, which can be preferable when empty repositioning is expensive or operationally constrained.

From a planning perspective, the Pareto front provides a set of implementable policies rather than a single solution. Decision makers can select a point that matches current market conditions and operating priorities, and the framework also supports sensitivity analysis to examine how the profit–empty repositioning effort trade-off shifts under changes in demand mix, capacity tightness, and relevant cost parameters (e.g., shortage/holding costs or external sourcing penalties).

6.2. Horizon Length and Scalability

An important deployment question is how model size and runtime scale when the planning horizon is extended. For a fixed service network and fixed operational rules, the formulation scales linearly with the voyage count

| V |

in the sense that most decision variables and constraints are voyage-indexed. Adding one voyage introduces an additional block of voyage-indexed variables and constraints with the same structural pattern; therefore, increasing

| V |

by a factor k leads to a proportional increase in the number of voyage-indexed variables and constraints, i.e., the model size grows on the order of

O (| V |)

with respect to

| V |

.

This structural scaling is consistent with the computational results summarized in Table 5. When the horizon is extended so that the voyage count increases by a fixed multiplier, the observed runtime under the same solver configuration exhibits a broadly proportional increase, therefore the empirical pattern suggests that extending the horizon yields a predictable increase in computational effort. Practically, these observations provide guidance for selecting horizon length.

6.3. Limitations and Future Research

Several limitations define the current scope and motivate future extensions.

First, the current formulation focuses on a bi-objective trade-off between profit and empty TEU-miles, while service-related requirements are already reflected through the fulfillment-related constraints in our model, practical planning may still involve additional criteria that are not explicitly modeled as objectives, such as emissions and energy costs, schedule robustness to operational disruptions, and richer service-quality measures beyond the current fulfillment targets. The proposed framework is not restricted to two objectives and can be extended to multi-objective settings by incorporating additional performance metrics within the same Pareto-based solution paradigm. Future research can investigate tri-objective or many-objective variants and study how preference articulation and interactive decision support can be integrated into the planning process.

Second, our computational study relies on benchmark-derived services and scenario-based demand generation with calibrated parameters. Further validation using practical operational data would strengthen the empirical credibility of the findings. Future research can conduct practical case studies using operational records, such as booking data and equipment availability information, to evaluate how the recommended policies perform in real settings.

Third, the RL-based operator controller is implemented using a compact tabular Q-learning design for stability and interpretability. We do not provide a systematic comparison with alternative adaptive schemes, such as continuous-state parameter adaptation or deep-RL controllers. Benchmarking these alternatives in terms of solution quality, feasibility stability, and computational overhead, especially on larger instances, remains a meaningful direction for future work.

7. Conclusions

This paper studied an integrated short-term planning problem on a fixed cyclic liner service, where laden shipment execution and empty container repositioning compete for the same vessel slots across multiple voyages. We formulated a bi-objective mixed-integer model that maximizes profit while minimizing empty TEU-miles, and it explicitly captures key operational elements including stranded bookings, port-level empty-inventory evolution, and the use of externally leased containers as emergency supply that occupies slots but does not enter the owned-inventory balance. The resulting Pareto set reveals an operational trade-off between profitability and repositioning effort, providing decision support for carriers facing simultaneous revenue pressure and equipment imbalance.

To solve the bi-objective problem at practical scales, we developed an NSGA-II-RL approach that equips the NSGA-II framework with Q-learning based operator control to adapt variation and feasibility-handling behavior across generations. Computational experiments on five services derived from LINERLIB show that NSGA-II-RL consistently improves solution quality compared with both a fixed-operator baseline and a random-control variant. In particular, it achieves the best mean hypervolume across all tested route and horizon settings, and its advantage becomes more pronounced as the planning horizon length increases, suggesting that learning-based control is especially beneficial when richer feedback can be accumulated over a longer search process. Moreover, the feasibility trajectory stabilizes more clearly under NSGA-II-RL, indicating that the approach can navigate tight capacity and inventory constraints without sacrificing runtime. A discussion of limitations and potential extensions is provided in Section 6.3. Overall, the proposed model and solution framework offer a practical tool for jointly balancing profit and empty TEU-miles in integrated slot allocation and empty repositioning decisions.

Author Contributions

Conceptualization, L.H. and M.S.; Methodology, L.H. and M.S.; Software, L.H. and W.G.; Validation, W.G.; Formal analysis, L.H.; Investigation, W.G.; Resources, M.S.; Data curation, Y.G.; Writing—original draft preparation, L.H. and M.S.; Writing—review and editing, L.H., M.S., W.G. and Y.G.; Visualization, W.G. and Y.G.; Supervision, M.S.; Project administration, L.H.; Funding acquisition, L.H., M.S., W.G. and Y.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (Grant No. 72401219), the Shanghai Sailing Program (Grant No. 23YF1416500), and the National Natural Science Foundation of China (Grant No. 72501171).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets used and analyzed during the current study are available from the corresponding author upon reasonable request.

Acknowledgments

During the preparation of this manuscript, the authors used ChatGPT (GPT-5, OpenAI) to improve the clarity and fluency of the English language. The authors have carefully reviewed and edited all AI-assisted outputs and take full responsibility for the final content of this publication.

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

Appendix A. Baseline NSGA-II Algorithm Without Reinforcement Learning

For comparison, this appendix reports a baseline variant in which the reinforcement learning controller is disabled. The search is conducted by standard NSGA-II. Solution encoding, the decode–repair–evaluate routine, and objective evaluation are identical to those in the hybrid framework; only the generation-level parameter adaptation is removed.

Algorithm A1: Baseline NSGA-II

Appendix B. Algorithmic Hyperparameters and Evaluation Settings

This appendix reports the algorithmic hyperparameters and evaluation settings used in all experiments. Unless otherwise stated, the same parameter values are applied across all routes and instance sizes.

Hypervolume (HV) is computed on normalized objective vectors to ensure comparability across instances with different scales. Let

f (x) = (f_{1} (x), f_{2} (x))

denote the objective vector of solution

x

. Given fixed bounds

f^{min}

and

f^{max}

, we normalize each component as

{\tilde{f}}_{m} (x) = \frac{f_{m} (x) - f_{m}^{min}}{f_{m}^{max} - f_{m}^{min}}, m \in {1, 2} .

(A1)

HV is then computed in the normalized space with a common reference point

r

. The same

f^{min}

,

f^{max}

, and

r

are used for all methods and all runs on the same instance.

Table A1 summarizes the algorithmic hyperparameters and evaluation settings used throughout the experiments.

Table A1. Algorithmic hyperparameters and evaluation settings.

Item	Value
Population size N	50
Generations G	100
Crossover probability $p_{cx}$	$0.9$
Mutation probability $p_{mut}$	$1 / n_{var}$ if not specified
Baseline parameter set $Θ^{0}$	fixed operator and repair configuration in baseline NSGA-II
Penalty coefficient $λ_{cap}$	capacity-violation penalty in $f_{1}$
Feasibility tolerance $ϵ_{cv}$	$10^{- 6}$ on total violation
Target feasibility rate $τ_{ϕ}$	$0.9$
Crowding-distance threshold $τ_{C D}$	$0.5$
Normalization bounds $f^{min}, f^{max}$	$f^{min} = (- 5 \times 10^{10}, 0)$ , $f^{max} = (5 \times 10^{10}, 5 \times 10^{10})$
Reference point $r$ for hypervolume	$r = (1.1, 1.1)$ in normalized space
$ϵ$ -greedy parameter $ϵ$	$0.1$
Learning rate $α$	$0.1$
Discount factor $γ$	$0.9$
Number of independent runs (seeds)	10

References

UNCTAD. Review of Maritime Transport 2024. 2024. Available online: https://unctad.org/publication/review-maritime-transport-2024 (accessed on 17 December 2025).
International Maritime Organization (IMO). EEXI and CII—Ship Carbon Intensity and Rating System (FAQ). Available online: https://www.imo.org/en/mediacentre/hottopics/pages/eexi-cii-faq.aspx (accessed on 17 December 2025).
Sun, R.; Abouarghoub, W.; Demir, E.; Potter, A. Geopolitical Disruptions and Maritime Transitions: Environmental and Economic Costs of Rerouting. Transp. Res. Part A Policy Pract. 2026, 203, 104737. [Google Scholar] [CrossRef]
Meng, Q.; Zhao, H.; Wang, Y. Revenue Management for Container Liner Shipping Services: Critical Review and Future Research Directions. Transp. Res. Part E Logist. Transp. Rev. 2019, 128, 280–292. [Google Scholar] [CrossRef]
Ting, S.C.; Tzeng, G.H. An Optimal Containership Slot Allocation for Liner Shipping Revenue Management. Marit. Policy Manag. 2004, 31, 199–211. [Google Scholar] [CrossRef]
Song, D.P.; Dong, J.X. Cargo Routing and Empty Container Repositioning in Multiple Shipping Service Routes. Transp. Res. Part B Methodol. 2012, 46, 1556–1575. [Google Scholar] [CrossRef]
Wang, W.; Yang, Z.; Diao, C.; Jin, Z. Collaborative Optimization of Container Liner Slot Allocation and Empty Container Repositioning Based on Online Booking Platform. Appl. Sci. 2024, 14, 11092. [Google Scholar] [CrossRef]
Song, D.P.; Dong, J.X. Empty Container Repositioning. In Handbook of Ocean Container Transport Logistics; Lee, C.Y., Meng, Q., Eds.; Springer International Publishing: Berlin/Heidelberg, Germany, 2015; Volume 220, pp. 163–208. [Google Scholar] [CrossRef]
Abdelshafie, A.; Salah, M.; Kramberger, T.; Dragan, D. Repositioning and Optimal Re-Allocation of Empty Containers: A Review of Methods, Models, and Applications. Sustainability 2022, 14, 6655. [Google Scholar] [CrossRef]
Lee, C.Y.; Song, D.P. Ocean Container Transport in Global Supply Chains: Overview and Research Opportunities. Transp. Res. Part B Methodol. 2017, 95, 442–474. [Google Scholar] [CrossRef]
Gençer, H.; Demir, M.H. Optimization of Empty Container Repositioning in Liner Shipping. Bus. Manag. Horizons 2020, 8, 1. [Google Scholar] [CrossRef]
Sanders, U.; Kloppsteck, L.; Roeloffs, C.; Schlingmeier, J.; Riedl, J. Think Outside Your Boxes: Solving the Global Container-Repositioning Puzzle. Available online: https://www.bcg.com/publications/2015/transportation-travel-logistics-think-outside-your-boxes-solving-global-container-repositioning-puzzle (accessed on 17 December 2025).
Drewry Shipping Consultants Ltd. Annual Container Market Review & Forecast 2006/07. 2006. Available online: https://www.drewry.co.uk (accessed on 17 December 2025).
Sea-Intelligence. Approximately 41% of Container Transport Is Empty When Measured in TEU-Miles. Industry Analysis, Reported by Maritime Optima. 2025. Available online: https://maritimeoptima.com/maritime-news/41-percent-of-container-transport-is-running-empty (accessed on 22 December 2025).
Styhre, L. Strategies for Capacity Utilisation in Short Sea Shipping. Marit. Econ. Logist. 2009, 11, 418–437. [Google Scholar] [CrossRef]
Lu, H.A.; Chu, C.W.; Che, P.Y. Seasonal Slot Allocation Planning for a Container Liner Shipping Service. J. Mar. Sci. Technol. 2010, 18, 10. [Google Scholar] [CrossRef]
Wong, E.Y.C.; Ling, K.K.T.; Tai, A.H.; Lam, J.S.L.; Zhang, X. Three-Echelon Slot Allocation for Yield and Utilisation Management in Ship Liner Operations. Comput. Oper. Res. 2022, 148, 105983. [Google Scholar] [CrossRef]
Zurheide, S.; Fischer, K. Revenue Management Methods for the Liner Shipping Industry. Flex. Serv. Manuf. J. 2015, 27, 200–223. [Google Scholar] [CrossRef]
Zurheide, S.; Fischer, K. A Revenue Management Slot Allocation Model for Liner Shipping Networks. Marit. Econ. Logist. 2012, 14, 334–361. [Google Scholar] [CrossRef]
Wang, Y.; Meng, Q.; Du, Y. Liner Container Seasonal Shipping Revenue Management. Transp. Res. Part Methodol. 2015, 82, 141–161. [Google Scholar] [CrossRef]
Wang, T.; Xing, Z.; Hu, H.; Qu, X. Overbooking and Delivery-Delay-Allowed Strategies for Container Slot Allocation. Transp. Res. Part E Logist. Transp. Rev. 2019, 122, 433–447. [Google Scholar] [CrossRef]
Dong, J.X.; Lee, C.Y.; Song, D.P. Joint Service Capacity Planning and Dynamic Container Routing in Shipping Network with Uncertain Demands. Transp. Res. Part B Methodol. 2015, 78, 404–421. [Google Scholar] [CrossRef]
Fu, Y.; Song, L.; Lai, K.K.; Liang, L. Slot Allocation with Minimum Quantity Commitment in Container Liner Revenue Management: A Robust Optimization Approach. Int. J. Logist. Manag. 2016, 27, 650–667. [Google Scholar] [CrossRef]
Wang, X. Stochastic Resource Allocation for Containerized Cargo Transportation Networks When Capacities Are Uncertain. Transp. Res. Part E Logist. Transp. Rev. 2016, 93, 334–357. [Google Scholar] [CrossRef]
Wang, K.; Liu, W.; Wang, S.; Liu, Z. Optimal Reefer Slot Conversion for Container Freight Transportation. Marit. Policy Manag. 2017, 44, 727–743. [Google Scholar] [CrossRef]
Wang, T.; Meng, Q.; Wang, S.; Qu, X. A Two-Stage Stochastic Nonlinear Integer-Programming Model for Slot Allocation of a Liner Container Shipping Service. Transp. Res. Part B Methodol. 2021, 150, 143–160. [Google Scholar] [CrossRef]
Zhao, H.; Meng, Q.; Wang, Y. Robust Container Slot Allocation with Uncertain Demand for Liner Shipping Services. Flex. Serv. Manuf. J. 2022, 34, 551–579. [Google Scholar] [CrossRef]
Liang, J.; Li, L.; Zheng, J.; Tan, Z. Service-Oriented Container Slot Allocation Policy under Stochastic Demand. Transp. Res. Part B Methodol. 2023, 176, 102799. [Google Scholar] [CrossRef]
Song, D.P.; Carter, J. Empty Container Repositioning in Liner Shipping1. Marit. Policy Manag. 2009, 36, 291–307. [Google Scholar] [CrossRef]
Braekers, K.; Janssens, G.K.; Caris, A. Challenges in Managing Empty Container Movements at Multiple Planning Levels. Transp. Rev. 2011, 31, 681–708. [Google Scholar] [CrossRef]
Meng, Q.; Wang, S. Liner Shipping Service Network Design with Empty Container Repositioning. Transp. Res. Part E Logist. Transp. Rev. 2011, 47, 695–708. [Google Scholar] [CrossRef]
Ehrgott, M. Multicriteria Optimization; Springer: Berlin/Heidelberg, Germany, 2005. [Google Scholar]
Sharma, S.; Kumar, V. A Comprehensive Review on Multi-Objective Optimization Techniques: Past, Present and Future. Arch. Comput. Methods Eng. 2022, 29, 5605–5633. [Google Scholar] [CrossRef]
Deb, K.; Pratap, A.; Agarwal, S.; Meyarivan, T. A Fast and Elitist Multiobjective Genetic Algorithm: NSGA-II. IEEE Trans. Evol. Comput. 2002, 6, 182–197. [Google Scholar] [CrossRef]
Ma, H.; Zhang, Y.; Sun, S.; Liu, T.; Shan, Y. A Comprehensive Survey on NSGA-II for Multi-Objective Optimization and Applications. Artif. Intell. Rev. 2023, 56, 15217–15270. [Google Scholar] [CrossRef]
Ji, B.; Yuan, X.; Yuan, Y. Modified NSGA-II for Solving Continuous Berth Allocation Problem: Using Multiobjective Constraint-Handling Strategy. IEEE Trans. Cybern. 2017, 47, 2885–2895. [Google Scholar] [CrossRef]
Ji, B.; Huang, H.; Yu, S.S. An Enhanced NSGA-II for Solving Berth Allocation and Quay Crane Assignment Problem with Stochastic Arrival Times. IEEE Trans. Intell. Transp. Syst. 2023, 24, 459–473. [Google Scholar] [CrossRef]
Hu, Z.H. Multi-Objective Genetic Algorithm for Berth Allocation Problem Considering Daytime Preference. Comput. Ind. Eng. 2015, 89, 2–14. [Google Scholar] [CrossRef]
Song, D.P.; Li, D.; Drake, P. Multi-Objective Optimization for Planning Liner Shipping Service with Uncertain Port Times. Transp. Res. Part E Logist. Transp. Rev. 2015, 84, 1–22. [Google Scholar] [CrossRef]
Ma, W.; Lu, T.; Ma, D.; Wang, D.; Qu, F. Ship Route and Speed Multi-Objective Optimization Considering Weather Conditions and Emission Control Area Regulations. Marit. Policy Manag. 2021, 48, 1053–1068. [Google Scholar] [CrossRef]
Brouer, B.D.; Alvarez, J.F.; Plum, C.E.M.; Pisinger, D.; Sigurd, M.M. A Base Integer Programming Model and Benchmark Suite for Liner-Shipping Network Design. Transp. Sci. 2014, 48, 281–312. [Google Scholar] [CrossRef]

Figure 1. Illustration of onboard-container accounting on a cyclic route, where

o (a)

lies on the transportation path of OD pair

(j, k)

: (a)

j < o (a) < k

; (b)

o (a) < k < j

(wrap-around); (c)

k < j < o (a)

. (a,c) are accounted for by the first summation in (2) (voyage v variables,

j \leq o (a)

), while (b) is accounted for by the second summation (voyage

v - M

variables,

j > o (a)

).

Figure 1. Illustration of onboard-container accounting on a cyclic route, where

o (a)

lies on the transportation path of OD pair

(j, k)

: (a)

j < o (a) < k

; (b)

o (a) < k < j

(wrap-around); (c)

k < j < o (a)

. (a,c) are accounted for by the first summation in (2) (voyage v variables,

j \leq o (a)

), while (b) is accounted for by the second summation (voyage

v - M

variables,

j > o (a)

).

Figure 2. Convergence curves across Routes 1–5 and three horizon scales

k \in {2, 5, 10}

. Rows correspond to routes and columns correspond to horizon scales. Each panel is a dual-axis plot, with hypervolume (left axis) and feasibility rate (right axis). Curves show the mean across runs and shaded bands show one standard deviation.

Figure 2. Convergence curves across Routes 1–5 and three horizon scales

k \in {2, 5, 10}

. Rows correspond to routes and columns correspond to horizon scales. Each panel is a dual-axis plot, with hypervolume (left axis) and feasibility rate (right axis). Curves show the mean across runs and shaded bands show one standard deviation.

Figure 3. Merged Pareto front approximations (10 seeds) across Routes 1–5 under three horizon scales

k \in {2, 5, 10}

. The horizontal axis is empty TEU-miles (minimize) and the vertical axis is total profit (maximize).

Figure 3. Merged Pareto front approximations (10 seeds) across Routes 1–5 under three horizon scales

k \in {2, 5, 10}

. The horizontal axis is empty TEU-miles (minimize) and the vertical axis is total profit (maximize).

Table 1. Representative studies on slot allocation, empty container repositioning, and integrated planning.

Paper	CSAP Decisions	ECR Decisions	Obj.	Method
Ting 2004 [5]	Accept/reject bookings; allocate laden slots across OD pairs	Allocate empty slots/flows with exogenous replenishment requirements	Profit	Mathematical programming
Lu 2010 [16]	Seasonal slot allocation planning across OD flows	Allocate vessel slots to empty containers	Profit	Mathematical programming
Meng 2011 [31]	Service network design decisions	Reposition empty container flows on the designed network	Cost	Mixed-integer programming
Zurheide 2012 [19]	Network-based slot allocation on a time-expanded service network	Port–time empty balancing with repositioning, leasing, and storage decisions	Profit	Rolling re-optimization
Wang 2015 [20]	Assignment multi-type laden containers to container routes; fleet size and sailing speed	—	Profit	Mixed-integer nonlinear programming model; global optimization
Fu 2016 [23]	Slot allocation with minimum quantity commitment under uncertain demand	—	Robust profit/ commitment satisfaction	Robust optimization
Wang 2016 [24]	Stochastic resource allocation for containerized cargo networks with uncertain capacities	—	Expected profit/ risk-aware feasibility	Stochastic programming; sample average approximation
Wang 2017 [25]	Tactical slot-configuration and deployment control	—	Minimum delay of dry/reefer containers	Two-stage simulation-based optimization
Wang 2019 [21]	Slot allocation with overbooking and delivery-delay-allowed control	—	Profit	Lagrangian relaxation; surrogate subgradient method
Wang 2021 [26]	Two-stage stochastic slot allocation differentiating contract vs. spot segments	—	Expected profit	Two-stage stochastic programming; sample average approximation
Wong 2022 [17]	Three-echelon slot allocation planning with cargo shifting and slot exchange	Allocate vessel slots to empty containers	Yield and utilization	Branch-and-bound search; genetic algorithm; deep neural network
Zhao 2022 [27]	Planning-level slot allocation under uncertain demand	—	Robust profit	Distributionally robust optimization
Liang 2023 [28]	Service-oriented slot allocation policy with explicit fulfillment rate under stochastic demand	—	Profit; service level	Stochastic policy
Wang 2024 [7]	Collaborative slot allocation for laden containers	Empty repositioning decisions	Profit	Branch-and-cut algorithm

Table 2. List of notations.

Sets
$L$	A set of port-call indices along the cyclic liner route, $L = {1, 2, \dots, L}$ , indexed by i (and j and k when needed);
$A$	A set of directed shipping legs on the service network, indexed by a;
	On the cyclic route, legs are in one-to-one correspondence with origin port-call indices $i \in L$ , i.e., $a = a (i) = (i, i + 1)$ and the wrap-around leg is $(L, 1)$ ;
$V$	A set of voyages in the planning horizon, $V = {1, 2, \dots, V}$ , indexed by v;
$OD$	A set of origin–destination pairs on the liner service, indexed by $(i, j)$ ;
	OD pairs may satisfy either $i < j$ or $j < i$ to represent wrap-around demand on the cyclic route;
$S$	A set of laden container service classes, indexed by s. Specifically, we set $S = {C 1, C 2}$ , corresponding to contract and spot demand, respectively;
Parameters
$C a p^{v}$	Total slot capacity (in TEU) of the vessel operating voyage v;
$o (a)$	Origin port-call index of shipping leg $a \in A$ ; on the cyclic route, $o (a (i)) = i$ ;
$δ_{i j}^{a}$	OD–leg incidence: $δ_{i j}^{a} = 1$ if leg a lies on the path from i to j on the cyclic route, and 0 otherwise;
$t_{i j}$	Transportation duration (in days) for OD pair $(i, j) \in OD;$
$d_{a}$	Sailing distance (in miles) of shipping leg $a \in A$ ;
$d_{i j}$	Sailing distance (in miles) of OD pair $(i, j)$ , with $d_{i j} = \sum_{a \in A} δ_{i j}^{a} d_{a}$ ;
$q_{i j}^{s v}$	Demand (in TEU) of laden shipments for OD pair $(i, j) \in OD$ with service class $s \in S$ in voyage $v \in V$ ;
$β_{i j}$	Minimum service level (fill-rate) requirement for contract demand on OD pair $(i, j)$ over the planning horizon;
$f_{i j}^{s v}$	Freight rate (revenue) per TEU for OD pair $(i, j) \in OD$ with service class $s \in S$ in voyage $v \in V$ ;
$c_{i j}^{L v}$	Unit transportation cost per TEU for laden containers on OD pair $(i, j)$ in voyage v, assumed identical across demand classes $s \in S$ ;
$c_{i j}^{E v}$	Unit transportation cost per TEU for empty containers on OD pair $(i, j)$ in voyage v;
${\bar{ρ}}^{v}$	Daily rental rate per TEU-day for externally leasing empty containers in voyage v;
$ρ_{i j}^{v}$	Unit leasing cost per TEU for OD pair $(i, j)$ in voyage v, where $ρ_{i j}^{v} = {\bar{ρ}}^{v} t_{i j}$ ;
$ζ_{i}$	Unit inventory holding cost per TEU of empty containers remaining at port-call index i at the end of each voyage;
$κ_{i j}^{s v}$	Delay cost per TEU for accepted laden demand of class s on OD pair $(i, j)$ that is postponed from voyage v to subsequent voyages;
$π_{i j}^{s}$	Terminal penalty cost per TEU for accepted laden demand of class s on OD pair $(i, j)$ that remains unshipped by the end of the planning horizon;
$I_{i}^{0}$	Initial inventory (in TEU) of empty containers at port-call index i at the beginning of the planning horizon;
M	Number of vessels deployed on the liner service;
Variables
$z_{i j}^{s v}$	Accepted booking quantity (in TEU) for laden demand of class $s \in S$ on OD pair $(i, j)$ in voyage v;
$x_{i j}^{s v}$	Shipped quantity (in TEU) of laden containers of class $s \in S$ on OD pair $(i, j)$ in voyage v using carrier-owned containers;
$x_{i j}^{E v}$	Repositioned empty container flow (in TEU) shipped from i to j in voyage v;
$r_{i j}^{s v}$	Shipped laden quantity (in TEU) of class $s \in S$ on OD pair $(i, j)$ in voyage v executed using leased containers;
Auxiliary variables
$I_{i}^{v}$	Remaining empty container inventory (in TEU) at port-call index i immediately after departure in voyage v;
$y_{a}^{v}$	Total onboard volume (in TEU) on shipping leg $a \in A$ during voyage v;
$h_{i j}^{s v}$	Stranded laden bookings (in TEU) of class $s \in S$ on OD pair $(i, j)$ after voyage v, which are deferred to subsequent voyages and incur a demurrage-type postponement penalty;

Table 3. Selected routes and service attributes.

Route ID	Port Calls	Cycle Time (Weeks)	Vessels	Capacity (TEU)
1	Tanjung Pelepas–Port Klang–Port Said–Salalah–Tanjung Pelepas–Singapore–Hong Kong–Busan–Qingdao–Los Angeles–Shanghai–Kaohsiung	13	13	9600
2	Vancouver–Brisbane–Auckland–Brisbane–Port Klang–Tanjung Pelepas–Kaohsiung	8	8	9600
3	Algeciras–Tangier–Port Said–Jeddah–Gioia Tauro–Jeddah–Newark–Charleston–Felixstowe–Bremerhaven	10	10	9600
4	Busan–Vancouver–Los Angeles–Yokohama–Tanjung Pelepas–Colombo–Tanjung Pelepas–Hong Kong–Shenzhen	9	9	9600
5	Newark–Los Angeles–Balboa–Manzanillo–Santos–Apapa–Algeciras–Gioia Tauro–Algeciras–Tangier	14	14	9600

Table 4. Parameter settings used in numerical experiments.

Parameter	Value/Range
$f_{i j}^{s v} / d_{i j} (s = C 1)$	$0.70$
$f_{i j}^{s v} / d_{i j} (s = C 2)$	$0.50$
$β_{i j}$	$[0.70, 0.95]$
$c_{i j}^{L v} / f_{i j}^{s v}$	$[0.40, 0.50]$
$c_{i j}^{E v} / c_{i j}^{L v}$	$[0.40, 0.50]$
${\bar{ρ}}^{v}$	$[50, 60]$
$κ_{i j}^{s v} / f_{i j}^{s v}$	$[0.10, 0.15]$
$π_{i j}^{s} / f_{i j}^{s v}$	$[2.0, 5.0]$
$I_{i}^{0} (i = 1)$	$[5000, 6000]$
$I_{i}^{0} (i \neq 1)$	$[200, 500]$
$ζ_{i}$	$[50, 60]$

Table 5. Overall comparison on solution quality, runtime, and feasibility.

Route	Size	Voyages	Method	HV Mean	HV Std	Best HV	Runtime Mean(s)	Feasibility Final
1	S	26	NSGA-II-RL	0.8375	0.0021	0.8401	85.72	1
			Baseline	0.8363	0.0015	0.8389	86.88	1
			Random	0.8340	0.0015	0.8363	87.75	0.970
1	M	65	NSGA-II-RL	0.8285	0.0008	0.8298	181.10	1
			Baseline	0.8270	0.0009	0.8285	185.25	1
			Random	0.8248	0.0016	0.8279	183.26	0.9218
1	L	130	NSGA-II-RL	0.8236	0.0017	0.8262	331.62	1
			Baseline	0.8204	0.0007	0.8217	335.63	1
			Random	0.8188	0.0006	0.8195	333.37	0.944
2	S	16	NSGA-II-RL	0.8518	0.0012	0.8529	34.39	1
			Baseline	0.8502	0.0017	0.8525	35.58	1
			Random	0.8483	0.0016	0.8498	34.89	0.978
2	M	40	NSGA-II-RL	0.8431	0.0006	0.8445	67.82	1
			Baseline	0.8420	0.0010	0.8434	70.68	1
			Random	0.8400	0.0007	0.8413	69.95	0.9691
2	L	80	NSGA-II-RL	0.8374	0.0006	0.8386	140.49	1
			Baseline	0.8369	0.0006	0.8378	144.76	1
			Random	0.8357	0.0007	0.8369	143.19	0.966
3	S	20	NSGA-II-RL	0.8466	0.0010	0.8478	38.30	1
			Baseline	0.8460	0.0008	0.8474	37.99	1
			Random	0.8446	0.0007	0.8457	38.01	0.964
3	M	50	NSGA-II-RL	0.8391	0.0011	0.8414	95.49	1
			Baseline	0.8376	0.0006	0.8385	97.28	1
			Random	0.8352	0.0016	0.8375	97.11	0.938
3	L	100	NSGA-II-RL	0.8344	0.0011	0.8361	176.20	1
			Baseline	0.8322	0.0009	0.8337	177.33	1
			Random	0.8309	0.0008	0.8323	178.32	0.962
4	S	18	NSGA-II-RL	0.8423	0.0019	0.8453	51.34	1
			Baseline	0.8422	0.0023	0.8459	52.78	1
			Random	0.8391	0.0018	0.8421	54.09	0.968
4	M	45	NSGA-II-RL	0.8322	0.0012	0.8352	106.19	1
			Baseline	0.8300	0.0012	0.8318	108.66	1
			Random	0.8274	0.0017	0.8300	104.13	0.966
4	L	90	NSGA-II-RL	0.8248	0.0009	0.8266	212.48	1
			Baseline	0.8235	0.0007	0.8248	217.32	1
			Random	0.8218	0.0011	0.8229	217.01	0.948
5	S	28	NSGA-II-RL	0.8450	0.0005	0.8457	52.36	1
			Baseline	0.8448	0.0009	0.8465	53.18	1
			Random	0.8437	0.0008	0.8448	53.40	0.938
5	M	70	NSGA-II-RL	0.8341	0.0009	0.8355	113.52	1
			Baseline	0.8336	0.0010	0.8347	114.87	1
			Random	0.8316	0.0010	0.8335	114.77	0.922
5	L	140	NSGA-II-RL	0.8257	0.0007	0.8272	159.51	1
			Baseline	0.8247	0.0007	0.8260	156.20	1
			Random	0.8245	0.0012	0.8263	157.76	0.938

Notes: (1) S, M, and L denote

k \in {2, 5, 10}

, respectively. (2) Within each (Route, Size) block, bold numbers indicate the best-performing method for HV mean and best HV (higher is better) and for runtime mean (lower is better); bold “1” indicates full final feasibility.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Huang, L.; Sha, M.; Guo, W.; Gao, Y. Container Slot Allocation with Empty Container Repositioning: A Multi-Objective Optimization Approach. J. Mar. Sci. Eng. 2026, 14, 424. https://doi.org/10.3390/jmse14050424

AMA Style

Huang L, Sha M, Guo W, Gao Y. Container Slot Allocation with Empty Container Repositioning: A Multi-Objective Optimization Approach. Journal of Marine Science and Engineering. 2026; 14(5):424. https://doi.org/10.3390/jmse14050424

Chicago/Turabian Style

Huang, Lei, Mei Sha, Wenwen Guo, and Yinping Gao. 2026. "Container Slot Allocation with Empty Container Repositioning: A Multi-Objective Optimization Approach" Journal of Marine Science and Engineering 14, no. 5: 424. https://doi.org/10.3390/jmse14050424

APA Style

Huang, L., Sha, M., Guo, W., & Gao, Y. (2026). Container Slot Allocation with Empty Container Repositioning: A Multi-Objective Optimization Approach. Journal of Marine Science and Engineering, 14(5), 424. https://doi.org/10.3390/jmse14050424

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Container Slot Allocation with Empty Container Repositioning: A Multi-Objective Optimization Approach

Abstract

1. Introduction

2. Literature Review

2.1. CSAP in Liner Shipping

2.2. Integrated CSAP and ECR Models

2.3. Multi-Objective Optimization Methods in Maritime Logistics

3. The Integrated Slot Allocation and Empty Repositioning Problem

3.1. Problem Setting

3.1.1. Service Network and Planning Horizon

3.1.2. Integrated Decisions and Container Sourcing

3.1.3. Performance Measures

3.2. Model Assumptions

3.3. Mathematical Formulation

3.3.1. Notations

3.3.2. Objective Functions

3.3.3. Model Constraints

4. Solution Algorithm

4.1. Overall Framework

4.2. NSGA-II-Based Evolutionary Search

4.2.1. Offspring Evaluation and Feasibility Handling

4.2.2. Environmental Selection

4.3. Reinforcement Learning Assisted Operator Control

4.3.1. MDP Formulation and Controlled Parameters

4.3.2. State Representation

4.3.3. Action Design and Parameter Mapping

4.3.4. Reward and Q-Learning Update

5. Numerical Experiments

5.1. Data Description

5.2. Experimental Setup and Evaluation Protocol

5.3. Computational Results

5.3.1. Overall Comparison on Solution Quality

5.3.2. Convergence Behavior and Stability

5.3.3. Pareto Front Visualization and Trade-Off Interpretation

6. Discussion

6.1. Managerial Interpretation of the Bi-Objective Trade-Off

6.2. Horizon Length and Scalability

6.3. Limitations and Future Research

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. Baseline NSGA-II Algorithm Without Reinforcement Learning

Appendix B. Algorithmic Hyperparameters and Evaluation Settings

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI