Next Article in Journal
Mapping the Dynamics Behind Breakthrough Innovations in China’s Energy Sector: The Evolution of Research Foci and Collaborative Networks
Previous Article in Journal
Cinematic Narratives as Socio-Technical Systems: Emotion Mining and Script–Audience Emotional Fidelity
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Enhancing E-Commerce RMFS Order Fulfillment Through Pod Positioning with Jointly Optimized Task Allocation

School of Economics and Management, Yanshan University, Qinhuangdao 066004, China
*
Author to whom correspondence should be addressed.
Systems 2025, 13(11), 995; https://doi.org/10.3390/systems13110995
Submission received: 25 September 2025 / Revised: 30 October 2025 / Accepted: 4 November 2025 / Published: 6 November 2025
(This article belongs to the Section Artificial Intelligence and Digital Systems Engineering)

Abstract

Robotic mobile fulfillment systems have become an integral part of e-commerce warehouses. The pod repositioning problem, due to its interdependence with robot task allocation strategies, poses a significant challenge that constrains system performance. In this paper, we aim to jointly optimize the two interrelated problems of pod repositioning and task allocation. A multi-objective mixed-integer planning model is developed to minimize the maximum completion time of robots and the deviation between the pod position and the expected position. To tackle the challenges of decision coupling and a vast solution space, an adaptive genetic-neighborhood search algorithm guided by pod heat maps is designed. Additionally, to promptly correct expected layout deviations and avoid layout instability, a progressive storage mechanism is designed to update the expected layout. The numerical experiments show that compared to the staged optimization strategy, the joint optimization strategy proposed in this paper can reduce the maximum completion time by approximately 48%, and that the strategy reduces the maximum completion time by 9% to 16% compared to the nearest allocation strategy, which is commonly used and performs best in practice.

1. Introduction

The rapid development of e-commerce for the warehouse operation and management has brought a lot of challenges, customer demand for a wide range of stock-keeping unit (SKU) varieties, orders in small batches and other new features prompted the logistics industry to put forward new solutions. In the traditional “person-to-goods” order picking model, order pickers need to frequently walk through the aisles of the warehouse storage area to manually pick SKUs, and their walking time accounts for more than 60% of the total picking time, which results in high operating costs and low picking efficiency. In recent years, the robotic mobile fulfillment system (RMFS) as a new type of automated warehousing system in e-commerce warehouses has been widely put into use by some large-scale e-commerce companies such as Amazon, JD Logistics, and Alibaba, and extensively applied in medicine, clothing, and many other industries in logistics warehouses [1]. Its picking efficiency is 2~3 times that of the traditional “person-to-goods” mode, and has a high degree of flexibility and scalability. RMFS mainly consists of three parts: mobile robots, movable pods and workstations; the mobile robots can move the movable pods to be picked from the storage area to the designated workstations, and the pickers only need to pick the SKUs out of the workstations. The picker only needs to pick out the SKUs at the workstation to complete the order picking task. Compared with traditional manual picking, this “goods-to-person” mode significantly reduces the walking time consumed by order pickers when picking goods, reduces labor costs, and significantly improves the order picking efficiency of the warehousing system [2].
Pod repositioning serves as a core mechanism for achieving dynamic warehouse optimization in RMFS, and its execution is closely interrelated with robot task allocation. As shown in Figure 1, workstations (picking stations/replenishment stations) in RMFSs are typically fixed along one side of the warehouse. When pods are returned to the storage area, the system selects its new location from available positions (e.g., A–D in Figure 1) under the guidance of the expected layout, which integrates multiple operational factors. This process inherently involves a fundamental conflict between optimizing pod layout and controlling robot travel costs. Specifically, the selection of pod locations influences both the future warehouse layout structure and the execution efficiency of subsequent robot tasks. These two aspects form a structural interdependence and mutual constraint through handling activities. Pursuing an optimal layout requires increasing the frequency of pod adjustments, whereas reducing operating costs demands minimizing handling operations. This inherent competitive relationship makes it difficult to achieve overall system performance optimization through unilateral improvements, thereby constituting the core challenge in the joint optimization of pod repositioning and task allocation.
The expected layout in this article is based on pod picking frequency, pod replenishment frequency, and pod inter-association. Among these, pod picking frequency and pod inter-association have been proven to be important factors influencing warehouse layout. Additionally, high-frequency picking operations result in rapid depletion of SKUs on the pods, and when there are not enough SKUs on the pods to satisfy order picking needs, the pods need to be carried by robots to replenishment stations to replenish the out-of-stock SKUs [3]. During replenishment, pods are unable to perform picking tasks, which directly affects the efficiency of picking operations. Therefore, it is important to consider the replenishment frequency of pods. In addition, since the picking frequency of pods in the RMFS is significantly higher than the replenishment frequency, it is also important to balance the robot’s picking and replenishment tasks through reasonable task allocation planning.
This paper focuses on studying the pod repositioning strategy considering joint optimization with task allocation, and proposes a mixed-integer programming model to minimize the maximum completion time of robots and the distance between the pod position and the expected position. The maximum completion time of robots refers to the longest picking completion time required for robots to complete the set of orders at all workstations. In order to solve this complex model effectively, an improved genetic algorithm, i.e., adaptive genetic-variable neighborhood search algorithm (AGA-VNS), is developed, which provides a feasible solution to improve the efficiency of the RMFS. Meanwhile, in order to realize the flexible response of the pod layout to the order demand, a progressive storage mechanism is proposed based on the idea of “predictive guidance, progressive movement”, so that the warehouse layout is gradually close to the theoretically optimal expected layout during the pod repositioning process.
In this paper, we design instances of different scales to verify the feasibility of the joint optimization algorithm and progressive storage mechanism proposed in this paper. Since the RMFS is highly scalable and the number of robots, pods and workstations can be flexibly adjusted with the change in business volume, this paper analyzes the factors affecting the performance of the system through sensitivity experiments, which provides theoretical guidance and practical enlightenment for logistics centers to realize the rational scheduling of warehouse resources.
In summary, the contributions of this paper are as follows:
(1)
A joint optimization strategy based on the heat guidance of pod repositioning and task allocation is proposed, which expands the scope of the pod repositioning problem and has important theoretical significance;
(2)
A generative order-guided progressive storage mechanism is proposed to update the expected layout, which further improves the matching of the warehouse layout with the expected layout while reducing the maximum completion time of robots;
(3)
An adaptive genetic-variable neighborhood search algorithm is designed, in which an adaptive strategy is introduced into the genetic algorithm to accelerate the speed of population searching, and three target-oriented local search operators are designed for VNS perturbation, which can effectively improve the local development ability of the algorithm.
The rest of the paper is structured as follows: Section 2 outlines the relevant literature review. Section 3 details the background of the problem. Section 4 and Section 5 present the mathematical models and algorithms. Section 6 performs numerical experiments and sensitivity analysis. Section 7 summarizes and gives concluding remarks.

2. Literature Review

Optimization of the RMFS has been studied in detail in different fields, and this section provides a detailed description of the two main research directions in the RMFS, i.e., pod repositioning and robot task allocation, and analyzes the current research progress in joint optimization.

2.1. Pod Repositioning Problem

Pod repositioning strategies can greatly utilize the flexibility of RMFSs and are a key component that directly impacts the efficiency and responsiveness of warehouse operations. Lamballais et al. take into account the unique characteristics of RMFS warehouses, noting that robots can move pods between storage areas and workstations, and can continually reposition them during operations [4]. Therefore, the pod repositioning problem is a special kind of pod position allocation problem, which is still essentially a storage location allocation problem, and thus strategies for the storage location allocation problem, such as nearest storage [5], original storage, and turnover-based storage [6], can be naturally applied to solve the pod repositioning problem. Krenzler et al. solved binary, proximity, fixed-position, randomized-position, genetic, and Tetris algorithms under a deterministic model and found the Russian algorithm to be more efficient [7]. This shows that a reasonable pod location allocation strategy can effectively improve the efficiency of pod repositioning. Weidinger et al. further transformed the pod repositioning problem into a special interval scheduling problem and solved it using an adaptive programming approach [8]. Guan et al. constructed a pod similarity matrix and Laplacian matrix, partitioned the pods according to the historical pickup frequency, and proposed a pod repositioning strategy for the same-zone distribution strategy [9]. After that, some scholars began to study the problem of pod repositioning in a complex context. Ji et al. divided the pod repositioning problem into two types of problems, assigning to one pod at a time and assigning to multiple pods at a time, and established an optimization model considering the future handling distance of the pods [10]. Li et al. investigated task allocation, path planning, and blocking problem solving for high-density storage warehouses [11].
In the process of research, storage strategies based on pod turnover rate have gradually become a hot direction of research. Yuan et al. proposed a new storage strategy based on pod turnover rate, and found that having two or three types of storage zones has more benefits when partitioning pods with different turnover rates for storage by fluid modeling [12]. Li et al. proposed a new decentralized storage strategy based on turnover rate considering that turnover rate-based storage is prone to congestion, based on item association rule analysis and a clustering method [13]. Xu et al. took into account the continuous dynamic change relationship of the warehouse layout, designed the matching degree index by the ratio of pod turnover rate and the distance from the storage location to the picking station, and solved the dynamic pod repositioning strategy considering the matching degree of the pod and the storage location by using the single-moment exchange simulated annealing algorithm [14]. Ding studied the approach of dynamically adjusting pod positions based on SKU sales and correlations [15]. Tian et al. proposed a dynamic pod position allocation strategy that takes into account real-time changes in commodity demand and balanced tasks at picking stations, and designed a hybrid algorithm combining a greedy algorithm and improved adaptive genetic algorithm to solve the problem [16]. Some scholars have redefined the pod turnover rate and analyzed pod heat as a key indicator, and the pod heat is essentially a derivation of the pod turnover rate. Based on customer orders, Wang et al. established a model to generate pod picking heat and inter-pod correlation, and proposed a new idea of “customer orders → pod usage frequency → pod location optimization” to obtain a pod repositioning scheme considering pod heat and correlation degree [17]. Zhai designed a mathematical algorithm, MABBD, based on Benders decomposition by considering the hot-start strategy and the main variable fixing strategy [18].
Due to the simultaneous existence of picking and replenishment operations in RMFS warehouses, some scholars have considered the impact of replenishment on pod repositioning, and Jiang et al. proposed a novel synchronization mechanism for picking and replenishment that can balance replenishment work and picking efficiency [19]. Wang et al. considered both order splitting cost and replenishment cost minimization to develop a hybrid algorithm using alternating large-neighborhood and local searches to solve the pod repositioning problem [20]. Li et al. designed a weighted double-Q learning algorithm to form a dynamic optimization strategy for pod position allocation under picking and replenishment operations by integrating factors such as pod hotness, inter-pod correlation, and existing pod position, which can adapt to the fluctuation of order status [21]. Zhang et al. considered joint storage allocation and pod positioning in the replenishment stage with channel workload equalization, and designed a decentralized pod storage allocation strategy that is conducive to the realization of channel workload equalization [22].

2.2. Robot Task Allocation Problem

Robot task allocation is a very complex combinatorial optimization problem, and reasonable scheduling of multiple robots to meet the rapid transportation needs of task pods plays an important role in improving system efficiency and reducing costs. Merschformann et al., in their study of the impact of task allocation, pod selection, and pod location optimization problems on system operation metrics, noted that the task allocation strategy had the greatest impact on system throughput metrics [23].
In early research on robot task allocation, scholars worked to reveal the correlation laws between influencing factors and allocation strategies in RMFSs. Jin et al. simulated warehouse environments through computer modeling and used a variety of pod selection rules and task allocation rules, pointing out the link between robot task allocation and warehouse square footage [24]. Zou et al. considered the relationship between the picking speed of the picking table and the assignment of picking tasks, and pointed out that assigning pod tasks according to the picking speed is more efficient than randomly assigning tasks to pods [25]. Yuan et al. analyzed the two operation modes of synchronous and asynchronous picking with multiple picking stations, and with the goal of achieving the shortest possible time for the robot to complete all the tasks, the robot task scheduling scheme was obtained using co-evolutionary genetic algorithms for synchronous and asynchronous picking modes [26]. Roy et al. categorized the robots into shared and dedicated types, analyzed the application of the two types of robots in the picking and replenishment scenarios of RMFSs, and found that the shared robot is more effective than random picking of pods in terms of picking speed. The application was analyzed and it was found that the order picking time can be reduced to one-third of the original time with shared robots [27].
Some scholars have considered the dynamic nature of RMFS warehouses by taking into account the effects of task changes, path conflicts, and other situations on robot task allocation during robot movement. Yoshitake et al. established a real-time scheduling method that takes into account both pod and robot states, which allows for task allocation of robots while pods are in the process of picking [28]. Zhang et al. modeled the robot task allocation problem as a resource-constrained project scheduling problem with transfer time, and proposed a dedicated serial scheduling generation scheme and a genetic algorithm based on the building block crossover operator to solve the problem, which can realize the dynamic scheduling of robots [29]. Gharehgozli et al. studied the task allocation problem considering pod access constraints and pointed out that multi-position selection is the key to reducing traveling time [30]. Li et al. designed a two-stage algorithm based on pod priority to obtain a multi-robot scheduling scheme considering the path planning of a single robot and collision avoidance among multiple robots [31]. Yuan et al. designed a four-stage balanced heuristic auction algorithm to solve the multi-robot task allocation model considering the balance between task relevance and picking workstations with the objective of minimizing the overall sorting time [32]. Sun et al. proposed a dynamic scheduling method to dynamically solve robot task allocation through a forbidden search algorithm with the objective of minimizing the completion time [33]. Zhang et al. designed an improved adaptive large neighborhood search algorithm for solving the problem with the objective of minimizing the maximum completion time by taking into account the dynamic task time-consuming characteristics of the system and the capacity constraints of the picking station’s seeding wall [34]. Tian et al. proposed a two-layer planning model for robot task allocation considering load quantity equalization for system congestion, and utilized an improved adaptive genetic algorithm introducing a sigmoid function to solve the problem [35]. Zhao et al. established a mixed-integer linear programming model with the objective of minimizing the order picking time for the cooperative optimization problem of robot task allocation and charging scheduling, and designed a genetic algorithm embedded with reinforcement learning to solve it [36].
In order to explore the coupling relationship between different influencing factors in RMFSs, some scholars have built multi-objective optimization models. Wang et al. constructed a multi-objective optimization model with the objectives of minimizing the maximum completion time, workshop energy consumption and the number of robots used to solve the sustainable optimal scheduling problem of a flexible job shop [37]. Wang constructed a multi-objective optimization mathematical model with the matching relationship between robots and tasks as the decision variable, and effectively solved the multi-robot multi-task assignment problem using a dynamic parameter-adaptive adjustment strategy adjusted from both local and global perspectives [38]. Zhang et al. solved the problem using the NSGA-II algorithm that introduces variable neighborhood search with the objective of minimizing the maximum completion time, total energy consumption, and the delivery penalty value of the workpiece [39].

2.3. Joint Optimization Problem for Pod Repositioning and Robot Task Allocation

More and more studies focus on joint optimization problems in RMFSs, which help to improve the overall efficiency of warehousing. On pod position optimization as well as joint optimization for path planning problems, Cai et al. proposed a pod location allocation strategy based on pod turnover rate by setting robot operation rules using a reservation table mechanism [40]. Lai et al. used movable pods as storage units, established a two-layer optimization model with the objective of minimizing the sum of optimized picking paths and pod movement costs for the warehouse layout in the upper layer, as well as the total picking paths in the lower layer, and designed a two-layer meta-heuristic algorithm to solve the problem [41]. Shi et al. proposed a three-item hybrid heuristic algorithm based on product supply relationships for solving order allocation and pod location optimization problems [42]. Wang et al. jointly optimized pod repositioning and pod sorting and designed a logic-based Benders decomposition exact solution algorithm (LBBD) [43].
A few scholars have considered the interaction between task allocation and pod repositioning. Bao et al. proposed a strategy that considers both task allocation and pod repositioning, i.e., a strategy in which pods are returned to the location that minimizes the sum of the distances between the assigned location and the next task position [44]. Shi et al. followed up by studying a pod scheduling problem with multiple types of storage locations by solving the storage location allocation problem in the upper tier, and a two-layer optimization method in the lower layer that determines the optimal order of picking up pods based on the given storage locations and performs joint optimization of pod repositioning and robot task allocation [45]. Zhuang et al. considered that the decisions of “stocking pod” and “picking pod” scheduling jointly determine the traveling distance of the robots, and with the goal of reducing the total traveling distance of the robots, they made a joint decision on pod repositioning and robot task allocation, constructed a multi-robot “storage–retrieval pod” scheduling optimization model, and designed an adaptive mathematical planning heuristic algorithm to solve the problem [46]. Li et al. also considered the impact of robot task allocation, and with the objective of minimizing the cost of robots to complete the task, they constructed a mathematical model that takes into account the difference in the cost of heavy load and no load, and proposed a joint optimization strategy that takes into account pod repositioning and task allocation [47].
In summary, current research focuses on decision optimization in picking situations, and the efficiency and cost increase caused by replenishment situations have not been fully considered in the joint optimization of pod repositioning and robot task allocation. In this paper, we focus on the impact of picking and replenishment on the pod repositioning and task allocation strategies, and construct a multi-objective pod repositioning and robot task allocation joint optimization model, with the aim of proposing a pod repositioning and robot allocation scheme that takes into account the replenishment situation and maximizes the picking efficiency.

3. Problem Description

Due to increased customer demand and shorter delivery times in modern e-commerce, orders fluctuate dramatically [48]. The method of pre-planning warehouse layout based on historical orders in traditional warehousing systems lacks dynamic adaptability and, when used for a long time, can easily lead to mismatches between pod demand patterns and pod positions. At the same time, each link in the warehouse operation is complementary to each other: workstations, pods and robots operate together to determine the order picking efficiency. The positions of the pods determine the distance that the robot needs to move when handling the pods, and the order of tasks assigned to the robot when handling the pods also affects the storage location allocation strategy of the pods. Therefore, the positions of pods are updated in real time as order patterns change, requiring dynamic adjustments in the reasonable allocation of robot tasks.
In this study, we first use historical order data to continuously predict future order demand and then solve for the corresponding expected layout based on future orders. On this basis, we dynamically update the expected layout according to the progressive storage mechanism to achieve the theoretical optimum of the expected layout. Guided by this expected layout, we jointly optimize the robot task allocation and the pod repositioning. That is, for each batch of operational tasks, it is necessary to complete the robot and pod task matching at the same time, based on the expected position of the optimal available positions for the return pod assignment. Through this collaborative decision-making model, we can effectively reduce the layout deviation, increase the storage system resource utilization efficiency, and reduce the order response speed due to the double enhancement.
The research problem of this paper can be described as follows: simulate order picking based on the future order set obtained from the historical order set prediction, obtain the picking hotness, replenishment hotness and inter-pod correlation of pods, accordingly plan the layout of the warehouse, obtain the expected layout, and guide the joint optimization of pod repositioning and robot task allocation. With the incoming orders, the historical order set changes and the expected layout needs to be adjusted. If each pod handling process is considered as a task, then decisions need to be made about which robot in the set of robots R will handle the pod, and the available positions l on the return trip after the pod has been handled, i.e., task allocation of the robots and repositioning of the pod. The pod repositioning decision affects the total distance moved by the robots, and the distance traveled by the robots affects the efficiency of the system, so in this paper, we make a joint decision on the pod repositioning and task allocation.
Based on the observation of the actual operation process in the picking center of a large online retailer and the related literature, this paper makes the following assumptions for the joint optimization problem of pod repositioning and robot task allocation:
(1)
Using wave picking and first-come-first-served strategies, orders are picked over a period of time in batches, and the order of order picking is consistent with the order in which the orders are passed to the picking station.
(2)
The types and quantities of SKUs stored in the initial pods are known and are stored using a decentralized storage strategy.
(3)
Task processing at the picking and replenishment stations is completed immediately.
(4)
Robot traveling distance is calculated according to the Manhattan distance.
(5)
After a pod completes a task, the pod always returns to the storage area before the next task can be performed.
(6)
Future orders can be predicted.
(7)
Congestion, acceleration/deceleration, and charging of robots are not considered.

4. Mathematical Model

In this paper, a joint optimization model is proposed to solve the complex operations in RMFSs. The symbolic parameters are defined as shown in Table 1.
In order to obtain the maximum time required for the robot to complete the task, the overall process affecting the distance traveled by the robot is first divided into three parts based on the storage-and-retrieve pod scheduling decisions in the warehouse.
(1)
No-load distance D1: The distance the robot travels to the position of the task pod to carry the pod, and the distance the robot travels to the position of the next task pod after completing the storage of the previous task pod.
D 1 = r R i I m M n N d μ r m x m n r i
(2)
Carrying distance D2: The distance a robot carries a task pod from a position to a workstation that needs to perform a picking/replenishment task on the pod.
D 2 = r R i I m M n N d m π i x m n r i
(3)
Return distance D3: The distance the robot carries the pod from the task workstation to the available positions.
D 3 = r R i I m M n N d π i n x m n r i
From this, the distance required for the robot to complete the task can be obtained, which leads to the joint optimization model of this paper:
m i n Z 1 = m a x ( D 1 v 1 + D 2 + D 3 v 2 )
m i n Z 2 = r R i I m M n N s S d l i s i x m n r i
i I x m n r i = 1 r R , m M , n N
r R x m n r i = 1 i I , m M , n N
n N x m n r i = 1 r R , i I , m M
x m n r i { 0 , 1 } r R , i I , m M , n N
Objective function (4) minimizes the maximum completion time of robots. Objective function (5) denotes the minimization of the deviation between the task pod position and the expected position. Constraint (6) denotes that each task can only be performed by one robot; constraint (7) denotes that each robot can only carry one task pod at a time; constraint (8) denotes that each pod can only be stored in one position; and constraint (9) is the decision variable value constraint.

5. Algorithm Design

As mentioned earlier, this joint optimization problem is NP-hard, and the proposed joint optimization model for pod repositioning and task allocation can be solved using commercial solvers. However, the solvers have difficulty in efficiently handling large-scale problems in an acceptable time.

5.1. Introduction to the General Framework

In this paper, we propose a joint optimization strategy of pod repositioning and task allocation based on pod heat guidance, which takes the expected layout of the system as the guide, and makes a joint decision on the robot task sequence and the target position allocation scheme of the return pod. At the same time, a progressive storage mechanism is introduced so that the expected layout can be continuously adjusted with the order update, forming a progressive optimization system of “prediction–layout–execution”.
With the changes in customer order requirements, future orders continue to update, the expected layout continuously iterates, the pod position dynamically adjusts, and each pod handling operation promotes the overall layout of the warehouse to the expected optimal state of gradual convergence, thereby effectively improving the utilization of warehouse layout and operational efficiency. The overall framework of this study is shown in Figure 2.

5.2. Predictive Layout Generation

5.2.1. Predictive Order Generation Strategy

Before the picking process starts, orders in the future period are predicted based on historical order data. In this paper, we adopt an order generation strategy that combines the Apriori algorithm and Monte Carlo simulation, which guides the predictive order generation by mining the commodity-associated features in the historical orders, and has better adaptability and accuracy compared to the traditional methods. The specific implementation is divided into two core phases:
  • Merchandise association feature extraction based on the Apriori algorithm.
The Apriori algorithm is utilized to mine the association law between SKUs from historical order data to extract customers’ purchasing habits. Firstly, the frequency of each SKU appearing individually is counted to screen out high-frequency SKUs; then the combination relationship between SKUs is analyzed, and the strength of SKU association is quantified by support degree, as in Equation (10), to find the combination of SKUs that are often purchased at the same time. By setting a reasonable support threshold, the Apriori algorithm can automatically filter out the chance combinations and retain the truly meaningful association rules.
s u p p o r t X = O x O
In the above equation, O is the total number of orders, and O X is the number of orders containing item set X.
2.
Order generation based on Monte Carlo simulation.
Monte Carlo simulation utilizes the association features extracted by Apriori to generate predicted orders that match the real scenario through random sampling. In the generation process, the order law proposed in the literature is followed [49]: about 60% of the orders contain only a single item, and 40% contain 2–3 items; when generating multi-item orders, as shown in Equation (11), priority is given to selecting historical high-frequency combinations, and the rest of the scenarios are randomly paired according to the hotness of the SKUs. The number of SKUs purchased follows the Pareto distribution; 80% of the orders are only 1 item, and a small number of orders contain 2–5 items.
P ( X | S ) = β U ( F | S | ) + ( 1 β ) i X f i α f k α
In the above equation, β is the frequent itemset selection probability, F n is a collection of n-frequent term sets, U is a uniformly distributed sampling function, f i is the historical frequency of occurrence of commodity i , and α is the frequency weighting index.
This stratified sampling approach ensures a reasonable frequency of occurrence of high-frequency SKUs and popular combinations, while simulating uncertain behavior in real shopping through controlled randomness. The final generated order retains the historical core features while flexibly adapting to the changing needs of the business.

5.2.2. Priority-Based Pod Selection Method

To fulfill orders, pods are selected for picking operations and replenished when necessary. An efficient pod selection operation ensures that the picking station meets the items required for the order and minimizes the maximum completion time of robots. As shown in Figure 3, the pod selection process consists of two parts: the matching of orders and pods, and the picking and replenishment operations of pods. On the one hand, the matching of orders and pods as well as the replenishment plan of pods affects the heat and correlation of pods, which plays an important role in calculating the expected position of the current pod. On the other hand, when the quantity of a product on a pod reaches the replenishment threshold and needs to be replenished, the pod is not available for the next batch of task allocations, which reduces the options available in the pod selection group, and replenishment needs to be assigned to be carried by the robot to the replenishment station, which takes up robot resources and improves the robot’s operation cost.
The pod selection problem is modeled as follows:
min p P o O y o p + z o p
p P x o p i r o i o O , i I
x o p i q p i o O , i I
M y o p i I x o p i o O , p P
k K ω o k 1   o O
p P ln z o p + α ( q p i x o p i ) e i < 0   o O , i I , α ( 0 , 1 )
x o p i Z + , y o p { 0 , 1 }
In the above equation, objective function (12) represents the minimization of the number of pod movements, which contains both picking and replenishment movements. y o p is 1 if pod p has been used in the picking process that satisfies the set of orders O , and 0 otherwise, and z o p is 1 if it has been used in the replenishment process. Constraint (13) denotes that the customer demand of a picking order for a particular item must be satisfied, and the total quantity x o p i of pods p to provide picking for SKU i in the order set O cannot be smaller than the total quantity r o i of SKU i needed in the order set O . Constraint (14) is a SKU availability constraint that indicates that each SKU in the picking order can be picked from the position and x o p i cannot be greater than the total number q p i of stored SKUs i on pod p . Constraint (15) defines the relationship between the variables x and y. The number of SKUs offered for picking must be zero when the number of pod p moves is zero. Constraint (16) indicates that each order can be processed by at most one workstation, and the value of ω o k is 1 if the set of orders O is processed by picking station K , and 0 otherwise. Constraint (17) defines the relationship between the replenishment threshold and replenishment, when the pod provides the number of items x o p i for order picking, resulting in the quantity q p i of SKU i stored on pod p being less than the replenishment threshold e i , the pod needs to be replenished, and the number of replenishments should be increased. Constraint (18) defines the domains of the variables.
The resulting picking heat and replenishment heat of the pod are shown in Equation (19) and Equation (20), respectively:
m p = o O y o p p P
n p = o O z o p p P
And combined with the idea of pod prioritization proposed in the literature [50], the replenishment process is added to the picking operation, considering the change in the remaining quantity of SKUs in the case of replenishment, and designing the greedy picking algorithm based on pod prioritization for solving.

5.2.3. Bi-Level Position Optimization Algorithm

Pod location optimization is performed according to the pod heat and inter-pod correlation obtained from the pod selection algorithm. Since the positions of the picking and replenishment stations in the RMFS warehouse are fixed, the position of the pods determines the distance that the robot needs to move when handling the pods. By arranging the pods in suitable locations, the demand pattern of the pods is matched with the positions in the warehouse. As shown in Figure 4, the time and distance traveled by the robot can be reduced by placing the high heat pods closer to the workstations and the high picking pods closer to the picking tables in the expected layout.
The pod location optimization problem is modeled as follows:
min p P l L k K m p d l k x p l + p P l L s S n p d l s x p l + p P p P r p p d p p
l L x p l = 1 p P
p P x p l 1 l L
x p l 0 , 1
In the above equation, objective function (21) represents minimizing the total distance traveled by the robot. The objective function contains three parts. The first part is the distance of the robot carrying pods to the picking station; x p l indicates that the value is 1 if the pod p is stored on the storage position l and 0 otherwise, and the total distance of the pod p to the picking station in the whole picking process is obtained by the pod picking heat m p and the distance traveled by the robot carrying pods from the storage position l to the picking station k (or from the picking station k to the storage position l ). In the second part, similarly, the total distance of pod p to the replenishment station during the whole replenishment process is obtained by the pod replenishment heat n p and the distance traveled by the robot to carry the pod from the storage position l to the replenishment station s (or from the replenishment station s to the storage position l ). The third part is the no-load distance; d p p denotes the distance traveled when arriving from pod p to another pod p , and r p p denotes the correlation between the two pods. Constraint (22) indicates that a pod can only be placed in one position; constraint (23) indicates that at most one pod can be placed in one position; and constraint (24) is a variable-value constraint.

5.3. Adaptive Genetic-Variable Neighborhood Search Algorithm

Considering that the genetic algorithm has a strong global search capability but has the disadvantage of early convergence in the early stage and slow convergence in the late stage, this paper proposes an improved genetic algorithm, namely, adaptive genetic-variable neighborhood search algorithm (AGA-VNS). By introducing adaptive transform probability and local search process in GA, a hybrid optimization algorithm of global generalization and local refinement is developed. Firstly, the initial population is generated according to the greedy idea to improve the randomness and diversity of the first-generation population; then, a combination of elite retention and roulette is used to select individuals and construct crossover and mutation operators with adaptive transformation probabilities to obtain the new population; finally, finer local operations are carried out on the sub-generation mid-population from each individual to improve the overall performance of the algorithm. The improved genetic algorithm process has been demonstrated in Figure 1, and the main steps are as follows:

5.3.1. Chromosome Code

In order to jointly optimize the task allocation and pod repositioning operations, a chromosome is designed, as shown in Figure 5. The left half of the chromosome is the robot task allocation sequence, and each gene bit represents a robot number, the right half of the chromosome is the storage location allocation for pod repositioning, and each gene bit represents a position number; the number of gene bits is the number of tasks to be accomplished in the current batch, and the gene order is the current task order. This encoding method allows direct visualization of the direct matching of tasks to robots and pods to positions; for example, in Figure 4, the total number of tasks in the batch is 4, and task pod 1 is carried by robot 2 and reassigned to location 4.

5.3.2. Multi-Objective Processing

In practical pod repositioning problems, the direct trade-off between the two objective functions is challenging due to their varying importance. Furthermore, significant differences exist in both the units of measurement and magnitudes of the two objectives, making simple linear weighting methods inadequate for handling these multiple objectives.
A multi-objective model with the objectives of maximizing the completion time required for the robot to complete the task and minimizing the deviation of the pods from the expected positions is developed. In the actual pod repositioning problem, it is difficult to coordinate the two objective functions due to their different importance. Furthermore, significant differences exist in both the units of measurement and magnitudes of the two objectives, making simple linear weighting methods inadequate for handling these multiple objectives [51]. The specific steps of multi-objective processing in this paper are as follows.
Step 1: Convert the multi-objective model into two objective models and solve them. These two optimal objective values are defined as f 1 * and f 2 * .
Step 2: Calculate the new target values F 1 and F 2 by Equations (25) and (26). Obviously, this step not only eliminates the differences in the units of measure of the three targets, but also restricts them to the same order of magnitude.
F 1 = f 1 / f 1 *
F 2 = f 2 / f 2 *
Step 3: Propose the weights for each of the three objectives. The two weight parameters are defined as ω 1 and ω 2 and satisfy the following constraints:
ω 1 + ω 2 = 1
ω 1 , ω 2 0
Step 4: Calculate the integrated objective value. The new objective function of the multi-objective model is defined as Equation (29).
min F = ω 1 F 1 + ω 2 F 2

5.3.3. A Strategy for Pod Selection Based on Heat Grading

In order to obtain a high-quality solution quickly and optimize the optimization search process, this paper proposes a pod selection method under the consideration of heat grading. By setting the maximum offset of the pods, the high-heat-degree pods are prioritized to select the location closer to the expected location.

5.3.4. Adaptive Crossover and Variational Operators

The setting of the crossover and mutation probability values has a great influence on the optimization process of the solution; the crossover probability and mutation probability of traditional genetic algorithms are set to a fixed value according to the experience and subjective judgment; if they are set to a smaller value, it will make the individual evolution slower and increase the algorithm’s solution time, and if they are set to a larger value, it will be easy to destroy the excellent individuals in the late stage of the algorithm’s solution, so that the algorithm will fall into the locally optimal solution. In order to accelerate the speed of population optimization search, this paper refers to the crossover and mutation operator design ideas proposed in the literature [52]. The population is highly explored with a large crossover variance probability in the early stage to increase the diversity of solutions, and the crossover variance probability gradually becomes smaller in the late stage to avoid destroying the excellent individuals and to avoid premature convergence while maintaining the diversity of the population. The adaptive crossover and variance probability formulas are shown in Equations (30) and (31).
P c = P c _ max P c _ max P c _ min f f a v g f max f a v g , f f a v g P c _ max , f < f a v g
P m = P m _ max P m _ max P m _ min f f a v g f max f a v g , f f a v g P m _ max , f < f a v g
In the above equation, P c _ m a x , P m _ m a x are the maximum values of the crossover and variance probability intervals, respectively; P c _ m i n , P m _ m a x are the minimum values of the crossover and variational probability intervals, respectively; f is the individual fitness value; f is the fitness value of the optimal individual in the parent that performed the crossover; f m a x is the maximum of all individual fitness values; f a v g is the average fitness value for all individuals.
Considering that there is a one-to-one correspondence between pods and positions in the warehouse, if the operation is carried out according to the traditional crossover and mutation, it will destroy the original feasible solution and produce a large number of infeasible solutions, which will require a large amount of time to repair. In this regard, we design the crossover and mutation operators.
  • Crossover operator.
Task allocation part: Randomly select multiple crossover points, exchange the genes at the corresponding positions of the two parent individuals, and obtain two new offspring. Pod repositioning part: For pod number, since the encoding of the pod number array is unique, this paper adopts the Partial Matching Crossover (PMX) to ensure that each pod gene is unique among the offspring.
2.
Mutation operator.
A single point mutation at even randomized positions was used to randomly select two pod number exchange genes on the chromosome.

5.3.5. Introduction to the VNS Operator

In order to improve the local search capability of AGA, this paper designs three neighborhood search operators oriented to two different optimization objectives, namely the maximum robot completion time and the total layout deviation, so as to ensure that each objective can be quickly and accurately approached to the optimal solution in the iterative process.
  • Same-heat pod exchange operator.
Since the same-heat pod has similar access frequency and optimization needs, the exchange of positions will not destroy the overall heat distribution pattern; as shown in Figure 6, the letter indicates the pod number, the number indicates the solution, while colors denote changes during the update process. Assuming that pods A and C are the same-heat pods (marked by red dotted lines), the two corresponding robot allocation numbers and storage location allocation numbers can be exchanged. In addition, the exchange of high-heat pods can fine-tune the position to reduce the deviation, and the exchange of low-heat pods can increase the flexibility of the warehouse layout.
2.
Associative pod aggregation operator.
Aggregation of frequently co-visited pod positions that are continuously handled by the same robot. The proximity of associated pods (e.g., frequently present in the same order) can significantly reduce the cross-zone travel distance for future handling. Continuous execution by the same robot avoids empty trips due to task splitting. As shown in Figure 7, the letter indicates the pod number, the number indicates the solution, while colors denote changes during the update process. Assuming that B and D are associated pods and that pod D is hotter, pod D is used as the center to find the nearest available positions, e.g., C (marked by red dotted lines). The pre-reassigned positions of B and C are swapped, and pod B will be handled by Robot 2 that is handling pod D.
3.
Dynamic task sequence reversal operator.
The task order directly affects the robot path length, and the optimal handling sequence for the robot is found by dynamically adjusting the task order. The insertion operation can jump out of the local optimum and explore more compact path combinations. As shown in Figure 8, the letter indicates the pod number, the number indicates the solution, while colors denote changes during the update process. For a batch of task pods A–D, task points A and C are selected as the basis for reverse operations. By altering the sequence between these two task points (marked by red dotted lines), the optimal sequence for the robot to complete the task is determined.

5.4. Progressive Storage Mechanism

Since the expected layout plays a guiding role in the joint optimization strategy proposed in this paper, and the wrong expectation will make the warehouse layout gradually change in the direction of inefficiency, the expected layout needs to be continuously updated with the incoming orders to maintain the advancement. However, frequent updating of the expected layout will not only increase unnecessary pod handling operations, but also waste a lot of arithmetic power in the warehouse management system. Therefore, this paper designs a novel progressive storage mechanism. The mechanism contains two parts, the dynamic offset threshold and the expected layout update mechanism, as shown in Figure 2, with the dynamic offset threshold as the judgment criterion. When the overall pod offset of the standardized warehouse exceeds the dynamic offset threshold, it indicates that the current expected layout has deviated too much from the actual situation, and it needs to be updated by using the predictive layout update mechanism, or else it can continue to maintain the original expected layout.

5.4.1. Dynamic Offset Threshold

The deviation threshold is dynamically adjusted by an exponential decay function, as shown in Equation (32), where a higher threshold is set at the initial stage to accommodate fluctuations during the initialization phase of the system and then gradually lowered to the steady-state threshold by the decay coefficient with the increase in picking batches. This nonlinear decay characteristic accurately simulates the progressive optimization process of warehouse operation: at the initial stage, a large positional deviation is allowed to accumulate to reduce the cost of frequent adjustments, and at the later stage, the deviation growth is responded to by tightening the threshold value in time to ensure that the system always operates near the optimal state.
e ( t ) = θ i n i t ×   e k t + θ s t a b l e
In the above equation, t is the number of batches processed, θ i n i t is the initial threshold, k is the decay rate, and θ s t a b l e is the steady-state threshold. When the normalized deviation is greater than the dynamic offset threshold, the update of the expected layout is performed. The values of parameters θ i n i t , k and θ s t a b l e were determined using a systematic Taguchi method through Minitab 21 software. This optimization employed the signal-to-noise ratio (S/N) as the metric, aiming to ensure the robustness of the proposed mechanism across different operational scenarios.

5.4.2. Mechanisms for Updating the Expected Layout

Directly updating the expected layout is likely to cause turbulence in the warehouse layout, so the expected layout is updated by adopting a greedy idea of “Stronger animals prey on weaker ones”. Since the high-heat-level pods are handled frequently, they have a greater impact on the system performance. Therefore, the high-heat pods are defined as “strong” and the low-heat pods are defined as “weak”, and the high-heat pods have a higher priority in the selection of pod locations.
As shown in Figure 9, this diagram illustrates the adjustment process from original to new expected layout for 20 pods. The first row shows pod numbers, the second row displays original expected position numbers, and the third row indicates new expected position numbers. Under this mechanism, high-temperature pods (marked with red circles in the first row) will prioritize the new expected position, while the original expected positions for low-temperature pods will be retained. The final adjustment results are shown in the last row, clearly reflecting the mechanism.
For conflict resolution, a novel “Double-elimination” mechanism is designed based on the recursive idea, which utilizes three levels of states to mark the pods (0: not updated; 1: updated; 2: need to be optimized), so that when the low-heat pods fail in the conflict competition, they have one opportunity to choose to force the pods to be updated to the expected position; otherwise, they can only choose a closer position to their original expected position from among the remaining locations. Otherwise, it can only choose the position that is closer to its original expected position from the remaining storage positions.
As shown in Figure 10, assume that pod A and pod B have a position conflict, with both currently having the expected position P1. If pod A’s original expected position was P2, then after pod A loses the competition with pod B, its status is marked as “1,” and its expected position reverts from P1 back to its original expected position P2. If the conflict remains unresolved, and pod A subsequently loses a competition with another pod C, its status updates to “2”. At this point, the system abandons the expected position strategy and instead executes the nearest match strategy, forcibly assigning pod A to the nearest available position P3.

6. Numerical Study

6.1. Instance Generation and Parameter Setting

In this paper, we refer to the data of a large e-commerce warehouse used in the literature [49] for the simulation of order, product and pod information. According to the existing literature and survey data, the heavy-load speed of the robot is set to 0.8 m/s, and the no-load speed to 1.6 m/s. In order to verify the effectiveness of the joint optimization strategy, the progressive storage strategy, and the improved genetic algorithm proposed in this paper, this paper consists of designing three different scales of the instances (I, II, and III) in accordance with the scales of the warehouses in the literature [18], from small to large. The specific characteristics of the arithmetic examples are shown in Table 2. The algorithms are programmed using MATLAB R2022 a on a computer with the Windows 10 operating system, 8 GB of memory, and an Intel (R) Core (TM) i5- 8250 U CPU. The main frequency is 1.80 GHz.

6.2. Comparison of Experimental Results

6.2.1. Comparison of the Different Strategies

In order to validate the effectiveness of the joint optimization strategy of pod repositioning and robot task allocation based on pod heat dominance, this paper, based on the staged optimization method designed in the literature [46], similarly designs three staged optimization strategies for pod repositioning and robot task allocation: (1) nearby storage location assignment and nearby robot allocation (NN); (2) nearby storage location assignment and random robot allocation (NR); and (3) fixed storage location allocation and nearby robot allocation (FN). Under the progressive layout strategy, the proposed method in this paper is compared with the above three staged optimization strategies, and the results are shown in Table 3. The metrics that determine the performance of an RMFS are f1, which is the maximum completion time required for the robot to complete the task, and f2, which is the optimization of the overall warehouse layout; these are idealized metrics, so the warehouse layout can only be close to the expected layout due to the correlation relationship between the access pods. Therefore, in Table 3, only the optimized gap for f1 is shown.
Several findings can be drawn from Table 3.
(1)
The optimization objective f1 solved by the joint optimization strategy proposed in this paper outperforms the staged optimization strategy in all the instances, while the optimization objective f2 is mostly better than the other three strategies, thus verifying its effectiveness.
(2)
The joint optimization strategy also achieves considerable improvement in the optimization ratio, which covers the range of 8% to 48.65% in different cases, and the performance advantage is very significant. When the scale increases, the optimization ratio of the joint optimization strategy also shows a continuous and obvious growth trend; for example, in the case of NN, the average value of gap is also increased from 9.93% in instance I to 16.31% in instance III, which fully demonstrates that the joint optimization strategy has a scalable advantage in dealing with large-scale and high-complexity problems, and its performance advantage is even more prominent in the more challenging real-world scenarios.
(3)
Among the three phased scheduling optimization strategies, the nearest storage and random robot task assignment (NR) strategy achieves the worst results, and the average gap with the joint optimization method at the third arithmetic scale reaches 42.48%, which indicates that there are inherent limitations in random robot task assignment, which can produce strong negative effects, and that a reasonable robot assignment rule can help to improve the pod repositioning effectiveness and help companies improve their operational efficiency faster.

6.2.2. Comparison of the Two Mechanisms

In order to verify the effectiveness of the progressive storage mechanism proposed in this paper, the performance differences between the processes using the progressive storage mechanism and those not using the progressive mechanism in different instances are compared under the same joint optimization strategy, and the results are shown in Table 4.
From Table 4, the following can be seen:
(1)
The progressive storage mechanism is superior to the non-progressive mechanism in solving f1 in different instances, and most of the optimization objectives f2 are better than the non-progressive mechanism, which is enough to prove the effectiveness of the progressive mechanism.
(2)
The enhancement value of the optimization objective f1 of the progressive mechanism compared to the non-progressive mechanism becomes bigger with the scale increases, and there is more than 3% enhancement for the optimization objective f2 in instance II and instance III, indicating that the progressive mechanism has excellent scale adaptability and can meet the needs of real enterprise large-scale warehouse operation.

6.2.3. Algorithm Performance Comparison

In order to verify the performance of the AGA-VNS algorithm proposed in this paper, AGA and VNS are applied to solve the joint optimization strategy in this paper. The above three algorithms are run under three instances, and the results are shown in the table, still focusing on the maximum completion time f1. Table 5 shows the experimental results of the algorithm performance comparison.
As can be seen from Table 5,
(1)
AGA-VNS has obvious advantages in terms of solution quality compared with both AGA and VNS, and thus AGA-VNS is able to solve the joint optimization problems better.
(2)
The enhancement of AGA-VNS with respect to AGA and VNS also shows a strong scale dependence, and compared to AGA, as the scale of the instances increases, AGA-VNS can further reduce the maximum completion time by 6.33% (10.10–3.77%), indicating that AGA-VNS has outstanding advantages in dealing with large-scale problems, fully integrating the global search capability of AGA and the local development mechanism of VNS.
(3)
The enhancement magnitude of AGA-VNS compared to VNS is significantly higher than that of the enhancement to AGA, which suggests that AGA provides a key support for VNS, and by consistently providing high-quality and diversified initial solutions, it significantly improves the search starting point of VNS.

6.3. Sensitivity Analysis

From the Section 6.2, it can be seen that the index to judge the efficiency in the RMFS is mainly f1, and f2 focuses more on adjusting the pod position towards the expected warehouse layout, which is an idealized state. Therefore, in this paper, we mainly analyze the sensitivity of f1, and show the results of f1 in terms of the number of free positions and the number of robots.

6.3.1. Analysis of the Effect of the Number of Free Positions on the Maximum Completion Time

Companies usually design a certain number of free positions in RMFSs as a way to increase the flexibility of the warehouse. In order to investigate the effect of the number of free positions on the objective of the maximum completion time, this paper generates five sets of randomized instances with P = 100, O = 900, I = 450, R = 5, and the number of free positions in each set is increased from 2 to 20, resulting in eight different numbers of free positions. Using the method proposed in this paper to solve each of the instances, and taking the average of the 5 groups of instances in order to observe the change in the maximum completion time with the free position, the results are shown in Figure 11.
The following two conclusions can be obtained from Figure 11.
(1)
With the increase in the number of free positions, the maximum completion time shows a gradual decrease. This indicates that keeping a certain number of free positions in the RMFS warehouse can help improve the operational efficiency of the warehouse.
(2)
After the number of free positions increases to 15, the decrease in the maximum completion time tends to level off, i.e., there is an upper limit on the benefit brought by the increase in the number of free positions. When the number of free positions in the system reaches a certain value, even if the number of free positions is increased, the maximum completion time will not be reduced.

6.3.2. Analysis of the Effect of the Number of Robots on the Maximum Completion Time

Robots are one of the most expensive equipment resources in RMFSs. In order to analyze the effect of the number of robots on the maximum completion time, this paper randomly generates 5 sets of arithmetic instances with P = 100, O = 900, I = 450, and R = 5, and the number of robots in each set is increased from 2 to 30 to obtain 8 different numbers of robots. The 5 arithmetic instances are solved and averaged using the method proposed in this paper and the results are shown in Figure 12.
From Figure 12, it can be found that
(1)
With the increase in the number of robots, the maximum completion time will be reduced, indicating that the number of robots has a positive effect on the improvement of the operational efficiency of the warehouse.
(2)
In the early stage of the growth, the number of robots can significantly reduce the maximum completion time, and the rate of change reaches 55.44% in the process of growth from 2 to 5 robots, but with the increase in the robot number, the decreasing trend tends to slow down, indicating that there is an optimal number of robots under different scales of warehouses, and beyond this optimal boundary, although a large number of additional robots will reduce the time required, the reduction is very low and is not even enough to offset the cost of the robot’s input.
As shown in Figure 12, system efficiency peaks when the number of robots reaches five units. Beyond this threshold, efficiency actually declines. This finding suggests that managers should seek an optimal balance between cost and efficiency to avoid blind investment.

7. Conclusions

7.1. Summary of Research Findings

RMFSs are widely used in high-frequency scenarios, and the study of the pod repositioning problem considering joint optimization with task allocation is of great practical significance for improving the operational responsiveness of e-commerce warehouses and accelerating the fulfillment process of e-commerce orders. The key to solving this problem lies in simultaneously realizing the optimal matching between pods and positions, as well as robots and task points.
In order to further improve the operational efficiency of the robot and the scientific layout of the warehouse, this paper constructs a joint optimization model of pod repositioning and task allocation with the objectives of minimizing the robot task completion time and the deviation between the pod position and the expected position. Aiming to realize the proposed model, a joint optimization strategy of pod repositioning and task allocation based on pod heat dominance is designed, and an improved genetic algorithm is used to solve the pod and task allocation scheme. Based on this joint optimization strategy, an incremental storage mechanism guided by generative orders is designed to incrementally update the expected layout of the RMFS repository through the idea of “predictive guidance, incremental movement”. Through numerical experiments, the results of the study are as follows:
(1)
From the perspective of joint optimization, while considering the influence of the replenishment situation on the selection of pod position, the problems of pod repositioning and robot task allocation are combined into one optimization problem, which realizes the coordination between the two related problems and provides theoretical references for the solution of the pod repositioning problem.
(2)
A progressive storage mechanism is proposed and compared with the traditional non-progressive process. Through the analysis, it is concluded that the progressive storage strategy can guide the movement of pods to more promising locations and effectively improve the quality of the expected layout of the warehouse.
(3)
Combining AGA and VNS has excellent synergistic effects and can utilize the advantages of both in search capabilities while having excellent stability.

7.2. Limitations and Future Research

Although this study provides an effective solution for the joint optimization of pod repositioning and task allocation in RMFSs, certain limitations remain. First, the model assumes robots operate under ideal conditions, neglecting practical constraints that may arise in real-world environments, such as robot congestion, dynamic obstacle avoidance, battery charging, and workstation waiting times. Second, the study focuses solely on robot-side optimization without incorporating human factors related to pickers, which limits the model’s direct applicability in complex real-world warehouse environments.
Future research can explore the following directions: first, integrating congestion, obstacle avoidance mechanisms, and charging scheduling into the existing joint optimization framework to enhance system robustness and practicality; second, incorporating human factors engineering principles to establish a human–robot collaborative optimization model centered on “robot-picker” coordination efficiency, thereby comprehensively improving the overall system performance.

Author Contributions

Conceptualization, H.B. and G.Y.; methodology, H.B. and F.L.; analysis and simulation, G.Y. and Z.W.; writing—original draft preparation, G.Y.; writing—review and editing, H.B. and F.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Key R&D Program of China under Grant No. 2020YFB1712802, Research Program of Social Science of Qinhuangdao under Grant No. 2024LX026, and Science and Technology Research and Development Plan Project of Qinhuangdao under Grant No. 202401A143.

Data Availability Statement

The data presented in this study are available on request from the corresponding author due to data protection.

Acknowledgments

The authors are grateful to the editors and anonymous reviewers for their valuable comments and efforts, which have greatly enhanced this work. We also thank Zhiyuan Gao for his helpful discussions.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Sun, Y.; Zhao, N. Autonomous vehicle fleet interference-free scheduling approach in robotic mobile fulfillment system. Ind. Eng. J. 2020, 23, 49–58. [Google Scholar] [CrossRef]
  2. Lu, F.; Wang, Z.; Bi, H. A novel storage location allocation strategy for intelligent E-commerce warehouse with new products. Int. J. Syst. Sci. Oper. Logist. 2025, 12, 2549438. [Google Scholar] [CrossRef]
  3. Lu, J.; Qian, H.; Zhao, W. Storage Allocation of RMFS Order Picking System in Manufacturing Enterprises Based on Real-Time Replenishment. Comput. Integr. Manuf. Syst. 2024, 30, 2526–2539. [Google Scholar] [CrossRef]
  4. Lamballais, T.; Roy, D.; De Koster, M.B.M. Estimating Performance in a Robotic Mobile Fulfillment System. Eur. J. Oper. Res. 2017, 256, 976–990. [Google Scholar] [CrossRef]
  5. Brezovnik, S.; Gotlih, J.; Balič, J.; Gotlih, K.; Brezočnik, M. Optimization of an automated storage and retrieval systems by swarm intelligence. Procedia Eng. 2015, 100, 1309–1318. [Google Scholar] [CrossRef]
  6. Baniasadi, P.; Foumani, M.; Smith-Miles, K.; Ejov, V. A transformation technique for the clustered generalized traveling salesman problem with applications to logistics. Eur. J. Oper. Res. 2020, 285, 444–457. [Google Scholar] [CrossRef]
  7. Krenzler, R.; Xie, L.; Li, H. Deterministic Pod Repositioning Problem in Robotic Mobile Fulfillment Systems. arXiv 2018, arXiv:1810.05514. Available online: https://api.semanticscholar.org/CorpusID:52980579 (accessed on 17 July 2025).
  8. Weidinger, F.; Boysen, N.; Briskorn, D. Storage Assignment with Rack-Moving Mobile Robots in KIVA Warehouses. Transp. Sci. 2018, 52, 1479–1495. [Google Scholar] [CrossRef]
  9. Guan, M.; Li, Z. Genetic Algorithm for Scattered Storage Assignment in Kiva Mobile Fulfillment System. Am. J. Oper. Res. 2018, 8, 474–485. [Google Scholar] [CrossRef]
  10. Ji, T.; Zhang, K.; Dong, Y. Model-Based Optimization of Pod Point Matching Decision in Robotic Mobile Fulfillment System. In Proceedings of the IEEE 7th International Conference on Industrial Engineering and Applications (ICIEA), Bangkok, Thailand, 16–21 April 2020; pp. 216–223. [Google Scholar] [CrossRef]
  11. Li, X.; Yang, X.; Zhang, C.; Qi, M. A Simulation Study on the Robotic Mobile Fulfillment System in High-Density Storage Warehouses. Simul. Model. Pract. Theory 2021, 112, 102366. [Google Scholar] [CrossRef]
  12. Yuan, R.; Graves, S.C.; Cezik, T. Velocity-Based Storage Assignment in Semi-Automated Storage Systems. Prod. Oper. Manag. 2019, 28, 354–373. [Google Scholar] [CrossRef]
  13. Li, Z.; Fan, X.; Wu, L. Study on the Storage Allocation Problem under Cargo to Person Picking Mode. Oper. Res. Manag. Sci. 2020, 29, 1–11. Available online: http://www.jorms.net/CN/10.12005/orms.2020.0028 (accessed on 20 July 2025).
  14. Xu, X.; Ma, Z. Dynamic Location Reassignment of RMFS Order Picking System. Comput. Integr. Manuf. Syst. 2021, 27, 1146–1154. [Google Scholar] [CrossRef]
  15. Ding, T.; Zhang, Y.; Wang, Z.; Hu, X. Velocity-Based Rack Storage Location Assignment for the Unidirectional Robotic Mobile Fulfillment System. Transp. Res. Part E Logist. Transp. Rev. 2024, 186, 103533. [Google Scholar] [CrossRef]
  16. Tian, S.; Lin, S. Optimization of Dynamic Pod Location Assignment in RMFS Based on a Greedy-Genetic Hybrid Algorithm. J. Railw. Sci. Eng. 2025, 22, 2089–2099. [Google Scholar] [CrossRef]
  17. Wang, Z.; Lu, P.; Hu, X. Method of Locating Racks in Mobile-Rack Warehousing System. Chin. J. Manag. Sci. 2024, 32, 84–94. [Google Scholar] [CrossRef]
  18. Zhai, M.; Wang, Z. Optimizing Rack Locations in the Mobile-Rack Picking System: A Method of Integrating Rack Heat and Relevance. Mathematics 2024, 12, 413. [Google Scholar] [CrossRef]
  19. Jiang, M.; Leung, K.H.; Lyu, Z.; Huang, G.Q. Picking-Replenishment Synchronization for Robotic Forward-Reserve Warehouses. Transp. Res. Part E 2020, 144, 102138. [Google Scholar] [CrossRef]
  20. Wang, Z.; Xu, W.; Hu, X.; Wang, Y. Inventory Allocation to Robotic Mobile-Rack and Picker-to-Part Warehouses at Minimum Order-Splitting and Replenishment Costs. Ann. Oper. Res. 2022, 316, 467–491. [Google Scholar] [CrossRef]
  21. Li, T.; Zhang, R.; Sun, Y. Dynamic Optimization Strategy for Pod Return Location in Intelligent Warehouses. Front. Sci. Technol. Eng. Manag. 2024, 43, 41–48. Available online: https://link.cnki.net/urlid/34.1013.N.20240403.1621.008 (accessed on 20 July 2025).
  22. Zhang, J.; Yang, H.; Tian, L. Research on Storage Assignment Problem in Replenishment Phase of Robotic Mobile Fulfilment Systems Considering Picking Aisles’ Workload Balance. Chin. J. Manag. Sci. 2024, 1–12. [Google Scholar] [CrossRef]
  23. Merschformann, M.; Lamballais, T.; De Koster, M.B.M.; Suhl, L. Decision Rules for Robotic Mobile Fulfillment Systems. Oper. Res. Perspect. 2019, 6, 100128. [Google Scholar] [CrossRef]
  24. Jin, X.; Zhong, M.; Quan, X.; Li, S.; Zhang, H. Dynamic Scheduling of Mobile-Robotic Warehouse Logistics System. In Proceedings of the 2016 35th Chinese Control Conference (CCC), Chengdu, China, 27–29 July 2016; pp. 2860–2865. [Google Scholar] [CrossRef]
  25. Zou, B.; Gong, Y.; Xu, X.; Yuan, Z. Assignment Rules in Robotic Mobile Fulfilment Systems for Online Retailers. Int. J. Prod. Res. 2017, 55, 6175–6192. [Google Scholar] [CrossRef]
  26. Yuan, R.; Wang, H.; Sun, L.; Li, J. Research on the Task Scheduling of “Goods to Picker” Order Picking System Based on Logistics AGV. Oper. Res. Manag. Sci. 2018, 27, 133–138. Available online: http://www.jorms.net/CN/10.12005/orms.2018.0241 (accessed on 18 July 2025).
  27. Roy, D.; Nigam, S.; de Koster, R.; Adan, I.; Resing, J. Robot-Storage Zone Assignment Strategies in Mobile Fulfillment Systems. Transp. Res. Part E Logist. Transp. Rev. 2019, 122, 119–142. [Google Scholar] [CrossRef]
  28. Yoshitake, H.; Kamoshida, R.; Nagashima, Y. New Automated Guided Vehicle System Using Real-Time Holonic Scheduling for Warehouse Picking. IEEE Robot. Autom. Lett. 2019, 4, 1045–1052. [Google Scholar] [CrossRef]
  29. Zhang, J.; Yang, F.; Weng, X. A Building-Block-Based Genetic Algorithm for Solving the Robots Allocation Problem in a Robotic Mobile Fulfilment System. Math. Probl. Eng. 2019, 2019, 6153848. [Google Scholar] [CrossRef]
  30. Gharehgozli, A.; Zaerpour, N. Robot Scheduling for Pod Retrieval in a Robotic Mobile Fulfillment System. Transp. Res. Part E Logist. Transp. Rev. 2020, 142, 102087. [Google Scholar] [CrossRef]
  31. Li, K.; Liu, T.; He, B.; Xu, D. A Study on Routing and Scheduling of Automated Guided Vehicle in “Cargo-to-Picker” System. Chin. J. Manag. Sci. 2022, 30, 240–251. [Google Scholar] [CrossRef]
  32. Yuan, R.; Li, J.; Wang, X.; He, L. Multirobot Task Allocation in E-Commerce Robotic Mobile Fulfillment Systems. Math. Probl. Eng. 2021, 2021, 6308950. [Google Scholar] [CrossRef]
  33. Sun, Y.; Zhao, N. Dynamic Scheduling Approach to Robotic Mobile Fulfillment System. Comput. Integr. Manuf. Syst. 2022, 28, 2213–2228. [Google Scholar] [CrossRef]
  34. Zhang, J.; Hu, X.; Weng, X.; Ma, Y.; Yu, X. Task Allocation Optimization of Robotic Mobile Fulfillment System Considering Dynamic Task Cost and Seeding Wall Capacity. Oper. Res. Manag. Sci. 2023, 32, 61–67. Available online: http://www.jorms.net/CN/10.12005/orms.2023.0183 (accessed on 17 July 2025).
  35. Tian, S.; Shen, Y.; Ou, L.Y.; Fan, L. Optimization of AGV Task Assignment Considering Load Balancing in RMFS. Appl. Res. Comput. 2024, 41, 2366–2373. [Google Scholar] [CrossRef]
  36. Zhao, Y.; Huang, M.; Jiao, G.; Song, Y.; Wang, X. Collaborative Optimization of RMFS Task Allocation and Charging Scheduling Based on Flexible Charging Strategy. Ind. Eng. Manag. 2025, 30, 200–210. [Google Scholar] [CrossRef]
  37. Wang, Y.; Ma, R.; Tang, S. An IWOA for Multi-Objective Optimization Scheduling of AGV Flexible Job Shop. Mod. Manuf. Eng. 2024, 7, 17–25. [Google Scholar] [CrossRef]
  38. Wang, F.; Wang, L.; Gao, Y.; Chen, X.; Wang, B. Research on Multi-AGV and Multi-Task Allocation Based on Improved NSGA-II Algorithm. Mod. Electron. Tech. 2024, 47, 157–163. [Google Scholar] [CrossRef]
  39. Zhang, G.; Cai, Y.; Li, Z.; Guo, S.; Zhang, H. Research on Flexible Job Shop Scheduling Problem Considering Limited AGV Transportation Resources. China Mech. Eng. 2025, 36, 1811–1823. [Google Scholar] [CrossRef]
  40. Cai, J.; Li, X.; Liang, Y.; Ouyang, S. Collaborative Optimization of Storage Location Assignment and Path Planning in Robotic Mobile Fulfillment Systems. Sustainability 2021, 13, 5644. [Google Scholar] [CrossRef]
  41. Lai, P.; Zhang, D.; Zhao, S. Joint Optimization of Storage Location Selection and Picking Path with Movable Shelves. Ind. Eng. J. 2023, 26, 135–143. [Google Scholar] [CrossRef]
  42. Shi, X.; Zhou, X.; Fan, Y.; Zhang, L.; Deng, F. Integrating Order Assignment and Rack Assignment in Robotic Mobile Fulfillment System. In Proceedings of the 2022 IEEE 17th International Conference on Control & Automation (ICCA), Naples, Italy, 27–30 June 2022; pp. 611–617. [Google Scholar] [CrossRef]
  43. Wang, Z.; Zhang, R.; Lu, Y. Rack Location Optimization Method in Mobile-Rack Warehouse Picking Task Execution. Ind. Eng. Manag. 2024, 29, 217–226. [Google Scholar] [CrossRef]
  44. Bao, Y.; Jiao, G.; Huang, M. Cooperative Optimization of Pod Repositioning and AGV Task Allocation in Robotic Mobile Fulfillment Systems. In Proceedings of the 33rd Chinese Control and Decision Conference (CCDC), Kunming, China, 22–24 May 2021; pp. 2597–2601. [Google Scholar] [CrossRef]
  45. Shi, X.; Deng, F.; Lu, S.; Fan, Y.; Ma, L.; Chen, J. A Bi-Level Optimization Approach for Joint Rack Sequencing and Storage Assignment in Robotic Mobile Fulfillment Systems. Sci. China Inf. Sci. 2023, 66, 212202. [Google Scholar] [CrossRef]
  46. Zhuang, Y.; Sun, Y.; Zhu, T. Multi-Robot Scheduling Problem of Robotic Mobile Fulfillment Systems: An Adaptive Matheuristic. Syst. Eng.-Theory Pract. 2023, 43, 488–508. [Google Scholar] [CrossRef]
  47. Li, T.; Zhang, R.; Ding, P. Joint Optimization for Automated Guided Vehicle Task Assignment and Pod Repositioning in Robotic Mobile Fulfillment System. Sci. Technol. Eng. 2023, 23, 11271–11281. [Google Scholar] [CrossRef]
  48. Li, X.; Hua, G.; Huang, A.; Sheu, J.B.; Cheng, T.C.E.; Huang, F. Storage Assignment Policy with Awareness of Energy Consumption in the Kiva Mobile Fulfilment System. Transp. Res. Part E Logist. Transp. Rev. 2020, 144, 102158. [Google Scholar] [CrossRef]
  49. Zhuang, Y.; Zhou, Y.; Hassini, E.; Yuan, Y.; Hu, X. Improving Order Picking Efficiency Through Storage Assignment Optimization in Robotic Mobile Fulfillment Systems. Eur. J. Oper. Res. 2024, 316, 718–732. [Google Scholar] [CrossRef]
  50. Wang, Z.; Shan, Y.; Zhang, X. Rack Selection Method for Order Picking in Mobile-Rack Warehouses. Ind. Eng. Manag. 2022, 27, 15–23. [Google Scholar] [CrossRef]
  51. Bi, H.; Gu, Y.; Lu, F.; Mahreen, S. Site Selection of Electric Vehicle Charging Station Expansion Based on GIS-FAHP-MABAC. J. Clean. Prod. 2025, 507, 145557. [Google Scholar] [CrossRef]
  52. Fan, H.; Sun, X.; Zhang, Y.; Ren, X.; Tian, P. Vehicle Routing Problem with Mixed Time Windows Under Time-Dependent Network. Comput. Eng. Appl. 2022, 58, 292–302. [Google Scholar] [CrossRef]
Figure 1. Schematic diagram of pod repositioning in RMFS.
Figure 1. Schematic diagram of pod repositioning in RMFS.
Systems 13 00995 g001
Figure 2. The overall flow of the algorithm.
Figure 2. The overall flow of the algorithm.
Systems 13 00995 g002
Figure 3. Pod selection process.
Figure 3. Pod selection process.
Systems 13 00995 g003
Figure 4. Expected layout demonstration.
Figure 4. Expected layout demonstration.
Systems 13 00995 g004
Figure 5. Chromosome coding mode.
Figure 5. Chromosome coding mode.
Systems 13 00995 g005
Figure 6. Same-heat pod exchange operator.
Figure 6. Same-heat pod exchange operator.
Systems 13 00995 g006
Figure 7. Associative pod aggregation operator.
Figure 7. Associative pod aggregation operator.
Systems 13 00995 g007
Figure 8. Dynamic task sequence reversal operator.
Figure 8. Dynamic task sequence reversal operator.
Systems 13 00995 g008
Figure 9. “Stronger animals prey on weaker ones” mechanism.
Figure 9. “Stronger animals prey on weaker ones” mechanism.
Systems 13 00995 g009
Figure 10. “Double-elimination” mechanism.
Figure 10. “Double-elimination” mechanism.
Systems 13 00995 g010
Figure 11. Relationship between the number of free positions and the maximum completion time.
Figure 11. Relationship between the number of free positions and the maximum completion time.
Systems 13 00995 g011
Figure 12. Relationship between the number of robots and the maximum completion time.
Figure 12. Relationship between the number of robots and the maximum completion time.
Systems 13 00995 g012
Table 1. Notation declaration of mathematical formulation.
Table 1. Notation declaration of mathematical formulation.
ParameterDescription
L Set of positions
l Number of positions
S Set of expected positions
s Number of expected positions
M Set of current positions, M L
N Set of free positions, N L
P Set of pods
p Number of pods
I Set of task pods, I P
K Set of workstations, including picking stations and replenishment stations
R Set of robots
r Number of robots
μ r The position of robot r
v 1 The no-load speed of robot
v 2 The heavy-load speed of robot
d p p Distance traveled by the robot from position l , where pod p is located, to position l , where pod p is located, when the robot is no-loaded
d l k Distance traveled by a robot carrying pods from position l to workstation k (or from workstation k to position l )
π i Workstations that carry out picking/replenishment operations on task pods, π i K
x m n r i If the task pod i is carried by robot r from position m to complete the task and then stored in position n , the value is 1; otherwise, it is 0
Table 2. Parameters for instance generation.
Table 2. Parameters for instance generation.
InstanceNo.POIKSR
I188040112
210100502
316160802
420180902
II530280140223
6323201603
7404002003
8484802403
9605602803
10646003003
III11100900450325
1214414407205
1320018009005
14250240012008
15288270013508
16320300015008
173503300165010
184323900195010
Table 3. Comparison of different strategies in the instances.
Table 3. Comparison of different strategies in the instances.
No.Joint OptimizationNNNRFNGAP
f1f2f1f2f1f2f1f2NNNRFN
1696.8755763.759876.87511757.588.76%20.53%8.00%
2975.62581096.25141346.875221091.8751611.00%27.56%10.65%
31725.625371897.5542156.875491938.125539.06%19.99%10.96%
42152.5522415.63512798.75702458.1258210.89%23.09%12.43%
Average9.93%22.79%10.51%
52683.125993041.881253704.3751103094.37513511.79%27.57%13.29%
63205.6251103790.631754981.251873750.62517715.43%35.65%14.53%
74661.8751775313.752146604.3752295326.2524212.27%29.41%12.47%
86491.252637651.882939516.8753007526.87530215.17%31.79%13.76%
98113.754059703.1342112251.254469921.2545516.38%33.77%18.22%
108778.12544110503.843112184.3845810275.62547916.43%27.96%14.57%
Average14.58%31.02%14.47%
119362.571510633.884514636.2585010931.87587011.95%36.03%14.36%
1219599.38153223175.6153531521.88154323630154215.43%37.82%17.06%
1333405243938031.9256050943.75267038575.625273012.17%34.43%13.40%
1435432.5375043188.1389465063.75386344173.125378017.96%45.54%19.79%
1539550.63437948185.6444472523.75446949272.5434617.92%45.47%19.73%
1657484.38586669009.46070102561.9617669773.75592816.70%43.95%17.61%
1753645669467215.66887104468.1679568026.875695020.19%48.65%21.14%
1867336.25844282298.18767129371.9879282821.875885018.18%47.95%18.70%
Average16.31%42.48%17.72%
Table 4. Comparison of progressive and non-progressive storage mechanisms in instances.
Table 4. Comparison of progressive and non-progressive storage mechanisms in instances.
No.ProgressiveNon-ProgressiveGAP
f1f2f1f2f1f2
1696.8755715.62572.62%28.57%
2975.62581003.12582.74%0.00%
31725.625371776.875442.88%15.91%
42152.5412268.75525.12%26.83%
Average3.34%17.83%
52683.125992817.51044.77%4.81%
63205.6251103427.51206.47%8.33%
74661.8751774936.8751915.57%7.33%
86491.252636996.8752497.23%−5.62%
98113.753738977.54059.62%8.58%
108778.1254419523.754047.83%−9.16%
Average6.91%2.38%
119362.571510931.87582814.36%13.65%
1219599.375153223593.125152216.93%−0.66%
1333405243938571.875255013.40%4.35%
1435432.5375044162.5390219.77%3.90%
1539550.625437949246.25438819.69%0.21%
1657484.375586669761.25606917.60%3.34%
1753645669468036.25687821.15%2.68%
1867336.25844282813.75881118.69%4.19%
Average17.70%3.96%
Table 5. Comparison of algorithm experimental results.
Table 5. Comparison of algorithm experimental results.
No.AGAVNSAGA-VNSGAP
f1f2f1f2f1f2AGAVNS
1712.55721.256696.87552.19%3.38%
21003.75121067.59975.62582.80%8.61%
31814.375391834.375491725.625374.89%5.93%
42270522306.875692152.5525.18%6.69%
Average3.77%6.15%
52793.75933184.3751102683.125993.96%15.74%
63401.8751443650.6251363205.6251105.77%12.19%
74846.87519153051774661.8751774.92%12.12%
86731.252637439.3752456491.252634.57%12.74%
986213629854.3753568113.754055.88%17.66%
1090854799825.6254358778.1254413.38%10.66%
Average4.75%13.52%
119983.12581611887.57939362.57156.22%21.24%
1222011.15143325025148319599.38153210.96%21.68%
1335913.125243738422.524663340524396.98%13.06%
1439934.375375847560353435432.5375011.27%25.50%
1545298.75437952200411639550.63437912.69%24.23%
1662541.875603572536.875572957484.3758668.09%20.75%
1762106.25667774310676153645669413.62%27.81%
1875651.875862991301.875831067336.25844210.99%26.25%
Average10.10%22.57%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Bi, H.; Yang, G.; Wang, Z.; Lu, F. Enhancing E-Commerce RMFS Order Fulfillment Through Pod Positioning with Jointly Optimized Task Allocation. Systems 2025, 13, 995. https://doi.org/10.3390/systems13110995

AMA Style

Bi H, Yang G, Wang Z, Lu F. Enhancing E-Commerce RMFS Order Fulfillment Through Pod Positioning with Jointly Optimized Task Allocation. Systems. 2025; 13(11):995. https://doi.org/10.3390/systems13110995

Chicago/Turabian Style

Bi, Hualing, Guangpu Yang, Zhe Wang, and Fuqiang Lu. 2025. "Enhancing E-Commerce RMFS Order Fulfillment Through Pod Positioning with Jointly Optimized Task Allocation" Systems 13, no. 11: 995. https://doi.org/10.3390/systems13110995

APA Style

Bi, H., Yang, G., Wang, Z., & Lu, F. (2025). Enhancing E-Commerce RMFS Order Fulfillment Through Pod Positioning with Jointly Optimized Task Allocation. Systems, 13(11), 995. https://doi.org/10.3390/systems13110995

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop