Optimizing Rack Locations in the Mobile-Rack Picking System: A Method of Integrating Rack Heat and Relevance

: The flexible movement of racks in the mobile-rack picking system (MRPS) significantly improves the picking efficiency of e-commerce orders with the characteristics of “ one order multi – items ” and creates a challenging problem of how to place racks in the warehouse. This is because the placement of each rack in the MRPS directly influences the distance that racks need to be moved during order picking, which in turn affects the order picking efficiency. To handle the rack location optimization problem (RLOP), this work introduces a novel idea and methodology, taking into account the heat degree and the relevance degree of racks, to enhance the efficiency of rack placements in the MRPS. Specifically, a two-stage solution strategy is implemented. In stage 1, an integer programming model (Model 1) is developed to determine the heat and relevance degree of racks, and it can be solved quickly by the Gurobi. Stage 2 entails developing a bi-objective integer programming model (Model 2) with the objective to minimize the travel distances of robots in both heavy load and no-load conditions, using the rack heat and relevance degree as inputs. In light of the challenge of decision coupling and the vast solution space in stage 2, we innovatively propose two lower bounds by slacking off the distance between storage locations. A matheuristic algorithm based on Benders decomposition (MABBD) is designed, which utilizes Benders-related rules to reconstruct Model 2, introduces an enhanced cut and an improved optimal cut with RLOP characteristics, and designs the warm start strategy and the master variable fixed strategy. Given the substantial size of real-life problems, the Memetic algorithm (MA) is specifically devised to address them. Instances of varying sizes are also employed to validate the science and efficacy of the model and


Introduction
The rapid development of electronic commerce in recent years has posed substantial implications and challenges for warehouse operation management.Given the demanding timeliness and large-scale, diverse, small-batch nature of customer orders, the urgent resolution of the problem of implementing precise and efficient order picking operations is a critical concern for every warehouse enterprise [1,2].The MRPS, operating as a semi-automated picking system, provides a fresh way to address this problem [3].This system deviates from the traditional fixed rack storage system as its racks are smaller in size and can be moved to any desired position within the warehouse.The picking operation is executed by the robot, which carries the rack to the picking station, thereby implementing the "parts to picker" mode.Instead of being stored in a fixed location like a traditional warehouse, each Stock Keeping Unit (SKU) can be scattered and put on numerous mobile racks, each rack holding varying amounts of different categories of SKUs [4,5].The MRPS significantly improves the flexibility of SKUs storage and efficiently addresses the difficulty of e-commerce order picking by allowing for the relocation of racks in the warehouse [6].As a result, it is highly desired by numerous ecommerce warehousing enterprises.Presently, e-commerce giants such as JD, Walmart, and Alibaba have embraced MRPS [7].
Nevertheless, this system also gives rise to the challenge of optimizing rack location.The positioning of each rack in the MRPS directly affects the distance traveled by the robots during order picking, thereby impacting the efficiency of the order picking.How can the arrangement of the rack in the warehouse be periodically adjusted, taking into account the regularity of customer orders and the categories and quantities of SKUs stored on the rack, with the objective of minimizing the travel distance of robots?This is a challenging problem that every MRPS decision-maker faces, especially when dealing with e-commerce order picking, in order to maximize efficiency and leverage the system's advantages.This problem is accompanied by the emergence of MRPS, which restricts the development of a new generation of order picking systems.It is different from the location optimization problem in the fixed rack storage system, its solution surpasses the applicability of existing theoretical methods, and its NP-Hard characteristics significantly amplify the problem's difficulty, especially when addressing large-size customer orders [8,9].
In light of this, this paper studies the RLOP in MRPS.A novel approach is proposed to optimize the location of racks by taking into account the categories and quantities of SKUs on each rack, as well as the ordering regularity observed in a large number of customer orders.This method employs the utilization of rack heat and relevance degree and can further enhance the efficacy and scientific rigor of MRPS rack location optimization.The contributions of this paper can be summarized as follows: (1) A novel idea of "customer orders → racks usage frequency → racks location optimization" is proposed to effectively model and solve the RLOP.(2) The MABBD algorithm is proposed to solve the integer programming model, and two lower-bound generation methods are designed based on the characteristics of RLOP, which enriches the theoretical research in the field of warehousing optimization.(3) A Memetic algorithm is specifically developed to address large-scale instances.This method is well-suited to the RLOP, as it employs crossover/mutation operators to rematch racks and locations throughout a broad range, resulting in the creation of a new population.The tabu search operators will explore individual locally by exchanging racks in the same type of locations or racks in the same heat.
The rest of this paper is organized as follows.Section 2 reviews the related work.Section 3 formulates the problem.Section 4 develops the solution algorithm.Section 5 discusses the results of computational experiments.Section 6 concludes the paper and provides future directions.

Literature Review
In terms of storage assignment, the traditional manual order picking system solely requires determining the storage position of the SKUs on the rack.This is because the racks have fixed storage locations, and pickers walk in the channels to retrieve SKUs from the racks.In other words, it is a problem of assigning storage locations to SKUs [10].Hausman et al. were the first to compare three storage assignment strategies: random assignment, full turnover-based assignment, and class-based turnover assignment [11].Subsequently, Kuo et al. assigned the storage locations with the objective of increasing warehouse utilization and customer satisfaction by reducing customer waiting time [12].Glock and Grosse studied the problem of storage assignment with the lowest warehouse picking cost [13], while Bortolini et al. optimized SKUs storage locations based on the best rack stability [14].The objective-setting methods of the above studies are relatively common.In addition, some scholars optimized SKUs storage locations by analyz-ing certain attributes of the products themselves: Caron et al. proposed a storage strategy based on the Cube-per-Order Index (COI) [15].Li and Nof proposed a storage optimization method to classify the SKUs based on their attributes [16].Yang and Nguyen developed a constrained clustering method integrated with principal component analysis to meet the need of clustering stored SKUs with the consideration of practical storage constraints [17].Since there is a certain correlation between different SKUs in practice and they are often purchased at the same time, other scholars take this correlation into consideration: Pan et al. used the frequency of simultaneous purchases of SKUs to quantify the correlation between products [18].Xiao and Zheng proposed to classify all SKUs based on correlation and then match them with classified storage locations [19].Jane and Laih proposed to classify SKUs according to association relationships, and similar SKUs are randomly placed in the same area [20].
The above research focuses on the problem of storage assignment under the traditional "picker-to-parts" model.Within the MRPS system, the robot transports the racks from the storage area to the picking station using the "parts-to-picker" picking mode.Meanwhile, the pickers stand still at the picking station, retrieving the SKUs from the racks to fulfill the orders' picking.After picking is completed, the robot needs to transport the racks from the picking station back to the empty location for storage [21].Therefore, the storage assignment problem of MRPS can be divided into SKUs storage racks optimization and rack location optimization.Some scholars have used a two-stage solution idea to study the storage assignment problem of MRPS.In the first stage, correlation analysis methods and clustering methods are used to perform correlation clustering on SKUs, so that SKUs that are often purchased at the same time can be stored on the same rack, and to realize that the SKUs stored on racks are maximized in overall relevance.In the second stage, Li et al. proposed a new scattering storage strategy based on rack turnover rate, aiming to decentralize the storage of racks with higher turnover rates to avoid congestion and accumulation of a large number of AGVs in high-frequency operating areas at the same time [22].Yuan et al. considered rack turnover rate, relevance between racks, and work balance in the lanes to construct an optimization model for minimizing the total travel distance of robots and designed a hybrid algorithm that combines a greedy algorithm and improved simulated annealing [23].The above research considered both the two subproblems of SKUs storage optimization and rack location optimization.However, in the real operation of the MRPS, the storage assignment of SKUs on the rack and the adjustment of the rack position are two types of work with different frequencies at different times, so they belong to two different levels of the problem.
Considering that the decision of rack storage location has an important impact on the moving distance of the rack and the workload of the robot, many scholars have conducted research on the RLOP of RMFS in recent years.Weidinger et al. formalized the rack location problem as a special interval scheduling problem by calculating the movement distances between racks and picking stations and introduced a mathematical approach based on adaptive large neighborhood search to solve the problem.It is shown that a simple rack location strategy like the shortest path storage can gain better results when the objective is to minimize the distance between the racks and the picking stations [8].Yuan et al. developed a fluid model to analyze the performance of velocitybased storage policies.They characterized the maximum possible improvement from applying a velocity-based storage policy in comparison to the random storage policy.The result showed that class-based storage with two or three classes can achieve most of the potential benefits and that these benefits increase with greater variation in the rack velocities [5].Merschformann et al. introduced several rack repositioning policies such as random policy, fixed policy, and nearest policy [24].Cai et al. integrated the rack location assignment problem with the path planning decisions to minimize the total travel time of robots [25].Zhuang et al. investigated the rack storage and robot assignment to racks problem during order processing.They formulated this problem with the objective of minimizing the makespan of this system and developed a matheuristic decomposition approach based on a rolling horizon framework and the simulated annealing method.In the above study, the storage location of the rack is constantly changing, and the rack may be assigned to a new storage location for each completed pick.Merschformann et al. demonstrated that employing variable storage locations is advantageous in numerous scenarios compared to a fixed strategy.This approach effectively minimizes the traveling time of robots, enabling them to promptly accomplish subsequent tasks.However, this strategy may result in no longer useful racks being stored in very prominent storage locations, which is likely to cause problems with congestion and further robot travel distances during order picking later.Moreover, assigning a new storage location after each rack use during the order picking process will greatly increase the computational burden of the intelligent operating system [26,27].
In actual situations, customers have increasingly higher requirements on the delivery time and cost of their orders.Owing to the impact of variables like sales, seasons, product life cycles, and turnover rates, rack locations in MRPS must be changed on a regular basis in order to accommodate these changes and satisfy customer demands.Given that the racks carried to the picking station by the robot always storage numerous SKUs, this paper establishes a correlation between the SKUs combination of customer orders and the SKUs combinations on the racks.By analyzing the SKUs ordering rules disclosed in numbers of customer orders, as well as the categories and quantities of SKUs present on each rack, it is possible to ascertain the using frequency for each rack.Subsequently, the rack locations can be rationally assigned to accommodate future order picking demands.This paper introduces a novel approach to optimize rack location by considering rack heat and relevance.In addition, we also propose a matheuristic algorithm based on Benders decomposition and a memetic algorithm to handle larger-sized real-world cases.This work aims to enhance the scientific rigor and efficacy of rack location optimization in MRPS.

Problem Description and Analysis
The picking station in the MRPS is usually fixed in place on one side of the warehouse, as depicted in Figure 1.The position of the rack within the warehouse influences the distance between the rack and the picking station, which in turn, affects the overall travel distance of the robot during picking.The rack's location is regularly updated (every few weeks or months) to accommodate future customer order picking demands, as the categories and quantities of SKUs in the order change.The RLOP in MRPS necessitates the precise arrangement of each rack in the warehouse to reduce the overall distance traveled by the robot when fulfilling customer orders in the future.This paper assumes that orders and their occurrence probabilities are known for the future period (the prediction procedure for order probabilities is not contemplated in this paper).Additionally, it assumes that the categories and quantities of SKUs on each rack are known.Then, the key to optimizing the rack location is determining how to calculate the expected number of times the rack is utilized based on the orders distribution regularity and provide the expected robot travel distance for a given rack location solution.
Upon analysis, it is obvious that the robot's movements can be divided into two modes: heavy load, when the robot carries the rack for travel, and no-load, when the robot does the movement itself.The two modes are performed sequentially: during the heavy load mode, the robot carries the rack from its storage location to the picking station, and subsequently returns it from the picking station once the picking process is finished.As soon as the robot completes handling the current rack, it will be no-load onto the next rack and resume the handling operation.The heavy distance traveled by the robot is directly proportional to the expected number of times each rack will be used, taking into account the probability distribution of the orders and the specified rack location arrangement.The no-load distance is determined by the number of times any two racks are carried in succession by the same robot.Considering that various scheduling approaches will result in distinct robot paths, this paper calculates the frequency at which two racks simultaneously serve the same order by utilizing the probability distribution of the orders, in cases when the actual scheduling method utilized by the warehouse is unknown.As the frequency increases, the likelihood of both racks being carried by the same robot consecutively also increases.Therefore, this paper employs such frequencies as a metric to determine the robot no-load travel distance.Define the expected number of movements of rack  (∈ ,  is the set of racks) as ℎ  , denoting the "heat" of the rack.The number of times that two racks  and  are selected to provide picking service for the same order is the " relevance degree" of the two racks, expressed by   .The higher the   , the greater the possibility that the two racks will be moved by the same robot in turn.The rack location optimization model can be built based on rack heat and relevance degree.Therefore, we first establish the rack heat and correlation generation model to obtain {ℎ  } ∈ and {  } ,∈ , and then use them as parameters to establish the rack location optimization model.

Rack Heat and Relevance Degree Generation Model (Model 1)
Model 1 aims to determine the appropriate rack to provide picking service for the orders, so as to improve picking efficiency as much as possible.For the same batch of customer orders, there will be a variety of rack selection solutions to meet the picking requirements, but different schemes have different rack movement times.The more times the robot shuttles between the picking station and the rack storage area, the higher the overall distance.Therefore, Model 1 aims to minimize the expected number of rack moves required to complete the picking task and obtain the rack heat and relevance degree attributes.Then, based on these two attributes, the rack location optimization model decides the specific location of each rack with the goal of minimizing the moving distance of the robot.
Let  represent the set of orders to be picked in the future period (several weeks or months), assuming the probability   (0 <   ≤ 1) of order (∈ ) is known (order distribution can be obtained through prediction, and the prediction method is not included in this paper.).All SKUs  required for order picking are stored on racks in set .The storage capacity of SKU (∈ ) on rack (∈ ) is   , and the total supply of items can satisfy the picking demand of orders, and there is no out-of-stock phenomenon.The parameters and decision variables used in Model 1 are defined in Table 1.

𝑅
The set of racks.

𝑂
The set of orders. The set of SKUs.

𝐾 𝑜
The set of the SKUs that order (∈ ) contains.

𝑅 𝑘
The set of racks where SKU (∈ ) is stored.

𝑑 𝑜𝑘
The number of the items of SKU (∈ ) that order (∈ ) requires.

𝑔 𝑖𝑗
The relevance degree between rack (∈ ) and rack (∈ ), that is, the number of times they provide picking services for the same order.ℎ  Rack heat, that is, the number of times rack (∈ ) needs to be moved to complete picking task.

Decision Variables 𝑥 𝑖𝑜
A binary variable that equals 1 if order (∈ ) is picked from rack (∈ ).
Non-negative integer variable indicating the number of the items of SKU (∈   ) picked from rack (∈ ) for order (∈ ).
Then, Model 1 can be formulated as follows: ∑   ∈ ≤   , ∀ ∈ ,  ∈  (4) Formula ( 1) is the objective function that minimizes the expected times of moving racks for picking orders.Constraint (2) formulates the relationship among  and .It ensures that rack  must not provide items for an order (  = 0) when items of an order are not picked from the rack (  = 0).Constraint (3) ensures that the items of an SKU required by an order must be satisfied by one or more racks.Constraint (4) guarantees that the number of an SKU items picked from a rack must not exceed the number of items stored on the rack.Constraints ( 5)-( 6) define the domains of the variables.

Rack Location Optimization Model (Model 2)
On the one hand, rack heat indicates the anticipated number of moves of each rack for future orders; therefore, based on the rack location scheme, we can estimate the travel distance of the robot's heavy load during order picking.The rack relevance degree, conversely, denotes the frequency with which two racks serve the same orders; the greater the frequency, the more likely it is that both racks are sequentially moved by the same robot.Since the number of times two racks are carried sequentially by the same robot depends on the robot scheduling method used in the warehouse, and in the case where the scheduling method is unknown, we cannot accurately know the robot's no-load traveling distance, so this paper calculates the robot's no-load traveling distance based on the rack relevance degree.
It is crucial to highlight that the picking efficiency is influenced to different degrees by the driving speeds of the robot with a heavy load compared to the robot moving with no load (the heavy load has a slower driving speed).Hence, this paper proposes a biobjective integer programming model to optimize the racking location by minimizing both the heavy load and no-load distances of the robots.The heavy load distance is given higher priority, while the no-load distance is given lower priority.The parameters and decision variables used in Model 2 are shown in Table 2.

𝑅
The set of racks.

𝐿
The set of rack locations.

𝑑 𝑢
The distance between the rack location (∈ ) and the picking area.

𝑔 𝑖𝑗
The relevance degree between rack (∈ ) and rack (∈ ), that is, the number of times they provide picking services for the same order.ℎ  Rack heat, that is, the number of times rack (∈ ) needs to be moved to complete picking task. 1 ,  2 Prioritization or weighting of optimization objectives.

Decision Variables 𝑚 𝑖𝑢
A binary variable that equals 1 if rack (∈ ) is stored in location (∈ ).  Non-negative variable indicates the distance between rack (∈ ) and rack (∈ ).
Then, Model 2 can be formulated as follows: ≥   • (  +   − 1), ∀,  ∈ ,  > ; ,  ∈ ,  ≠  (10) Formula ( 7) is the objective function that optimizes the weighted sum minimization of the robot's heavy travel distance and no-load travel distance.Constraint (8) ensures that each rack can only be placed in one location.Constraint (9) guarantees that each location can store a maximum of one rack.Constraint (10) formulates the relationship among  and .Constraints (11)-( 12) define the domains of the variables.

Solution Algorithm
Given that Model 1 can solve the instance with the size of || = 6000; || = 200 in a short time by Gurobi, for larger instances, a simple batch strategy can be adopted to quickly solve.Hence, Model 1 adeptly addresses the real problem, and the associated intelligent algorithm is not devised in our work.Since the RLOP is a typical unbalanced assignment issue, the complexity of Model 2 is NP-hard.This makes solving realistic large-size problems extremely challenging.When Model 2 encounters an instance with 40 racks and 52 locations, Gurobi achieves a Gap of 81.21% after one hour.Once the number of racks and locations rises to several hundred, solving the model becomes more difficult.Therefore, this section focuses on the optimization method of rack location.Initially, two high-quality lower bounds are derived based on the specific attributes of the RLOP.Subsequently, the matheuristic algorithm based on Benders decomposition (MABBD) is developed to enhance the efficiency of solving Model 2 by taking into account its unique properties.The Memetic algorithm (MA) is specifically developed to address complex problems on a large scale.

Lower Bound (LB)
In order to improve the performance of the algorithm in this paper, two lower bound acquisition methods are proposed in this section.
The objective function of Formula ( 7) is relaxed: First, the lower bound  1  of  1 is solved.Select || locations closest to the picking station and sort them in ascending order of distance, represented by the set  1 .The racks are sorted by their heat from high to end, represented by the set  1 ; Then, a lower bound of  1 is given by: Next, the lower bound  2  of  2 is solved, and two methods are designed.(1) Arrange   (,  ∈ ,  > ) in ascending order, represented by the set  1 .Similarly, arrange   (,  ∈ ,  > ) in ascending order by taking the first ||(|| − 1)/2 values in descending order, represented by the set  1 .Then, a lower bound of  2 is given by: (2) For each location  ∈ , select the nearest || − 1 locations   2 and calculate the sum of distances from location  ∈  to these locations   = ∑   ∈

2
. Arrange {  } ∈ in ascending order, the first || locations are represented by the set  2 , and their associated distance {  } ∈ 2 ,∈  2 in descending order is represented by the set  2 .Then, the other lower bound of  2 is given by: In summary, a lower bound for the original problem can be obtained as: Theorem 1.The total travel distance of the robots  is bounded from below by the optimal value of .

Matheuristic Algorithm Based on Benders Decomposition (MABBD)
Since Model 2 contains two types of decision variables, rack location assignment {  } ∈,∈ and the distance between racks {  } ∈,∈,> , Model 2 is reconstructed based on the relevant rules of Benders decomposition.First, the no-load travel distance of the robot in Model 2 is relaxed, and the master problem formed mainly solves the heavy load travel distance of the robot.When the relevant decision variable {  } ∈,∈ is gained, the subproblem solves the no-load travel distance of the robot; that is, it is a linear programming problem and can be solved quickly.In order to address the algorithm's drawback of sluggish convergence, the optimal cut is improved, and the enhanced cut is generated based on the attributes of Model 2. Subsequently, multiple heuristic algorithms are devised to generate compact initial cuts by utilizing warm start strategies that take into account the specific attributes of the RLOP.Ultimately, a solution pool is explicitly built to store approximate optimal solutions.Through the observation of value regularity in these approximate optimal solutions, we identify and choose locations and racks that are highly frequently matched and partially fix their decision variables.Simultaneously, variables that correspond to storage locations and racks that are utilized infrequently are eliminated in order to reduce the search range of the solution space.The algorithm flow is shown in the Figure 2.

Master Problem Model
is defined as the lower bound of the no-load travel distance of the robot in the objective function.The constraints related to the no-load travel of the robot are relaxed to obtain the master problem model as follows: Constraints (10), ( 11), ( 13) The objective function ( 16) minimizes the weighted sum of the robots' heavy-load distance and the lower bounds of the robots' no-load distance.Constraint (17) represents the lower bound of the no-load travel distance on the global search space.

Subproblem Model
When the master problem model is solved, the storage location of each rack {  } ∈,∈ can be determined, and then the subproblem becomes the weighted distance between the racks to minimize the no-load driving distance of the robot.The subproblem model is as follows: When a new upper bound of the original problem is found, cut ( 23) is generated and added to the master problem.This cut indicates that when the new optimal solution of RLOP is obtained, the no-load distance of robots is at least ( ̃), that is, in the subsequent iteration process, the cut will automatically exclude the current solution, and there is no need to solve it again.When a feasible solution to the original problem is found, cut ( 24) is generated and added to the master problem.This cut indicates that the current solution is not optimal and need not be considered again.The above valid cuts can convey effective information to the master problem, hence enhancing the solution of the master problem and continuously increasing the lower bound.The process is iterated until either the optimal solution of the RLOP is achieved or the computation time limit is reached.

Warm Start Strategy
According to the characteristics of the RLOP, a variety of heuristic algorithms are designed to tighten the feasible domain, which is relatively relaxed in the initial iteration, and to generate some tight initial cuts and upper bound (UB) of the RLOP.
(1) Bi-direction matching strategy.Firstly, the importance of each rack is calculated , where  1 ̅̅̅ is the average distance from the storage location to the picking station,  2 ̅̅̅ is the average distance between storage locations, and   ̅ is the average relevance degree of rack .Secondly, the importance of each storage location is calculated as   = 1  1  1 ℎ ̅ + 2  2  ̅ , where ℎ ̅ is the average heat, and ̅ is the average correlation degree.Finally, the racks and storage locations are matched one by one according to the descending sequence of rack importance and the descending sequence of location importance.
(2) Two-stage solution strategy.In stage 1, the storage locations are divided into ABC areas from near to far according to the distance from the picking station, and the proportion of the number of locations in each area is A = 20%, B = 30%, and C = 50%, respectively.The racks are divided into three areas according to heat.In stage 2, for each area, the rack with the highest heat is selected for priority storage, and then the associated racks of this rack are stored nearby according to the relevance degree.(3) Integrated matching strategy.The storage location is arranged in ascending sequence of distance from the picking station, and locations that have the same distance from the picking station are considered to be the same location.Racks are allocated for each kind of location in descending sequence of rack heat.When the same type of locations is matched to the same heat of racks, the priority is given to the rack that has a greater relevance degree with racks that have been assigned a location.(4) Generate feasible solution based on Model 2. The current upper and lower bounds are added to the original model as constraints to solve the linear relaxation solution of the model.The solution is feasible according to the rack importance and location importance of strategy (1).
Through the warm start strategy, four feasible solutions can be obtained, and the feasible solution with the smallest objective value F is taken as the UB of RLOP, and the optimal cut is generated using this feasible solution.Accordingly, the other three feasible solutions generate enhanced cuts.Theorem 2. The total travel distance of the robots  is bounded from above by the optimal value of .

Master Variables Fixation Strategy
Considering the warehouse layout, the racks with high heat should be placed as close to the picking station as possible, and the location far away from the picking station is often not the optimal choice, so the value of some decision variables is dominant.This paper introduces a solution pool to store 100 best approximate optimal solutions and record the value rule of each decision variable.When the algorithm is no longer updating the optimal solution in multiple searches, this paper extracts 5% of the decision variables with small changes in their values from the solution pool to fix them, and the model is feasible by checking and deleting the constraints and valid inequalities.Naturally, employing this strategy may discard some of the potential optimal solutions.To effectively prevent this scenario, the strategy permits a maximum usage of six times.In other words, the size of the decision variables we elect to fix must not exceed 30%.Appendix A provides a comprehensive explanation of the justification for the design, utilizing both theoretical analysis and experimentation.

Memetic Algorithm (MA)
The Memetic algorithm is specifically developed to address the RLOP for largescale instances.The Memetic algorithm is an optimization technique that combines population intelligence methods with local search methods [28].Classical intelligent algorithms often have the shortcomings of slow convergence and difficulty finding highprecision solutions when confronted with large-scale and intricate RLOP.Based on these algorithms, the Memetic algorithm introduces a local search strategy to further search the feasible solutions, which will effectively improve the solving efficiency and accuracy of the intelligent algorithm [29,30].
The analysis of RLOP revealed that the choice of rack position has an impact not only on the distance between the rack and the picking station but also on the distance between the racks themselves.Considering the attributes of the warehouse layout, there will be several storage locations equidistant from the picking station, along with several racks that have the same heat.However, the distances between these locations or the relevance degree between the racks may vary.Hence, the crossover/mutation operators of the MA will rematch racks and locations over a wide range to produce a new population.Conversely, the tabu search operators will explore individual locally by exchanging racks in the same type of locations or racks in the same heat.The algorithm flow is shown in Figure 3.

Initial Solution Generation
The warm start strategies mentioned in Section 4.2.4 were used to generate multiple initial solutions to form the initial population.

Search Operators
Parameter   is introduced to denote the stickiness of rack  with location  , , where the sticky degree   embodies the suitability of matching between a rack and a location.Obviously, the larger the   , the better the location assigned to rack .Let  ̅ = ∑   ∈ /|| for the average sticky degree, and racks with a sticky degree lower than  ̅ are filtered, denoted by the set   .
(1) Crossover operator Select ( < |  |) racks from   and use the bi-directional matching strategy in Section 4.2.4 to re-match their locations.
(2) Mutation operator ( < {|  |, || − ||}) racks are selected from   , and  idle locations are randomly selected, and the bi-directional matching strategy in Section 4.2.4 is used to rematch these locations and racks.
(3) Local search operators Theorem 3. In the local search process of the algorithm, the heavy loaded travel distance  1 of the robots is unchanged.
Operator 1: The racks are classified according to heat, and the racks with the same heat are classified as one class.Two racks of the same class are randomly selected to exchange their location.
Operator 2: The locations are classified according to the distance to the picking station, and the locations with the same distance are classified as one class.The two racks stored in the same location are randomly selected for location exchange.

Fitness Function
The fitness function represents the improved value of the optimization objective.Given the unique nature of the RLOP, adjusting the position of each rack only impacts its distance from the picking station and other locations.Therefore, there is no requirement to solve the entire problem again.It is sufficient to calculate the change in travel distance of the racks resulting from the change in location to determine the overall improvement in the problem.
Fitness function of the crossover/mutation operator: Fitness function of the local search operators:

Computational Results
To evaluate the effectiveness and performance of the proposed solution approaches, we conducted computational experiments on a computer equipped with an AMD Ryzen7 CPU, 16 GB RAM, and the Windows 10 (64-bit) operating system.All methods presented in this paper are implemented using C#.Gurobi is selected as the solver of mathematical models.

Instance Generation and Parameter Setting
The rack storage area of the MRPS is composed of multiple blocks; each storage block contains multiple adjacent locations, and the blocks are separated by horizontal and vertical lanes, and the robot can travel in each lane.This paper builds the warehouse layout diagram, as shown in Figure 4, according to the actual layout of the MRPS of a large e-commerce company.The distance of each grid is 1, with a total of 1500 locations.Part of the orders, racks, and SKUs are extracted from the actual business data to form 10 small-scale instances and 20 large-scale instances.The values of relevant parameters are shown in Table 3.   5 The quantity of each SKU in an order [1,5] The maximum number of un-updated iterations of the local optimal solution.{10,   } priority  1 = 0.7  2 = 0.3

Algorithm Performance Comparison
To evaluate the performance of the MABBD proposed in this paper, this section conducts a comprehensive comparison of the results from our MABBD, MA, Gurobi, and a meta-heuristic (the improved Partheno-Genetical algorithm, IPGA) that has shown good performance for the storage assignment problem.These comparisons are performed across instances of various sizes, with a maximum solution time of 3600 s.
Table 4 displays the results of Gurobi when applied to Model 2, as well as the outcomes of the MABBD, MA, and IPGA proposed in this paper for 10 small and mediumsized instances.In Table 4,   represents the disparity in results between an algorithm and Gurobi.If the solution time of Gurobi is less than 3600 s, it indicates that Gurobi has found the optimal solution; if the solution time is 3600, it signifies that Gurobi has returned a best-found solution.The solution and running time of other algorithms are the output results after the termination condition of the algorithm is satisfied.

Ta ble 4.
Comparison of the solution results on small and medium-sized instances.Table 4 shows that when the size of the instances is small, Gurobi can find the optimal solution, but as the size of the instances gradually increases, Gurobi is unable to obtain the optimal solution in 3600s.The performance of the MABBD is better than Gurobi with an average improvement in the objective value of 3.58%.The MA exhibits superior performance, as it not only enhances the objective value by 3.72% but also has a much shorter computation time in comparison to Gurobi.Due to its intelligent nature, the evaluation of the solution's quality cannot be made directly for MA algorithm.While the MABBD can give the upper and lower bound solution of the RLOP (see Appendix B), the average Gap value of the upper and lower bounds obtained by this algorithm is 7.37%, indicating that it has a good convergence impact.Meanwhile, comparing the MDBBD as a benchmark, the MA in this paper shows superior optimization searching ability.

No
For large-sized instances, due to Gurobi's inability to obtain feasible solutions in a limited period, this paper introduces an additional meta-heuristic algorithm for comparison: the improved Partheno-Genetical algorithm (IPGA).This algorithm is chosen based on its good performance for storage assignment in the recent literature [31][32][33][34].The algorithm details are as follows: drawing on the algorithmic framework of the above literatures, the population size is set to 150, the maximum number of iterations is 1000, and other parameters are set the same as in MA.Let the crossover/mutation operators of MA be the mutation operators of IPGA, and the Tabu search operators of MA be the crossover operators of IPGA.
The results of MA and IPGA on large-sized instances are shown in Table 5.In general, MA has greater advantages than IPGA.The MA results of 20 instances are better than IPGA, saving 60,232.3units of robot travel distance on average.In order to reflect the performance of MA more intuitively, the results of upper and lower bounds are taken as benchmarks to compare the solving effects of MA and IPGA (see Appendix C for the detailed calculation process and results).Upon analyzing Table 5, it is evident that MA outperforms IPGA in terms of enhancing the objective value, demonstrating an average optimization of 26.40%.Moreover, the optimal objective value found by our MA is closer to the lower bound, which is optimized by 32.54% compared with IPGA.In addition, the running time of the two algorithms is relatively close.Given that RLOP is a strategic decision with a medium to long-term impact, it is not necessary to place excessive emphasis on the algorithm's running time.Our primary focus is on discovering a more optimal resolution.In short, the strong solving ability of MA is mainly due to its dual optimization ability of both global search and local search.The crossover/mutation operator disturbs the matching results of racks and locations on a larger range, which is conducive to jumping out of the current neighborhood.The elitism retention strategy also preserves highquality solutions.The local tabu search strategy further searches for new neighborhood solutions, hence enhancing its ability to identify potential optimal solutions.This crossapplication of horizontal search and vertical search is in line with the characteristics of the RLOP, so the optimization ability of the algorithm is improved.

Conclusions
Research on the RLOP has significant practical importance in enhancing the efficiency of e-commerce order picking and accelerating the response speed of e-commerce logistics, given the rapid development of the MRPS in e-commerce logistics in recent years.Nevertheless, the key to solving this problem lies in extracting the ordering rules of SKUs in orders and effectively modelling and solving the problem through the process of "customer orders → rack use frequency → rack location optimization".This is particularly important due to the presence of multiple SKUs in e-commerce orders and the practical nature of splitting and storing per SKU on multiple racks in MRPS.
Focusing on the link of "customer orders → rack use frequency", this paper proposes a model that calculates the heat and relevance degree of racks based on customer orders.The objective of the model is to minimize the number of times the racks move.The model also determines the selected picking racks and generates two attribute values: per rack heat and relevance degree between racks.The link "rack frequency → rack position optimization" considers the heavy load and no-load travel distance of the robots as the determining variables for picking efficiency.A bi-objective integer programming model is established to optimize the rack position by minimizing these two distances.To address the challenge of decision coupling and the vast solution space in stage 2, we develop two lower bounds for the original problem by lowering the distance between storage locations.A matheuristic algorithm based on Benders decomposition (MABBD) is designed, which utilizes Benders-related rules to reconstruct Model 2, proposing an enhanced cut and an improved optimal cut, and designing the warm start strategy and the master variable fixed strategy.Given the substantial scale of real-life problems, the Memetic algorithm (MA) is specifically devised to address them.Instances of varying sizes are also employed to validate the science and efficacy of the model and algorithm.The experimental results demonstrate that the suggested algorithm achieves superior quality solutions.
This work introduces a novel idea and methodology for enhancing the efficiency of rack placements in the MRPS.Implementing this approach would significantly enhance the scientific and intelligent aspects of the decision-making process for optimizing rack locations in this system.It would also improve the current inefficient methods utilized for rack location optimization in e-commerce storage management.Our approach efficiently handles extensive customer orders in intricate warehouse operations and improves the applicability and efficiency of optimization theory in warehouse management.According to this study, a crucial and difficult aspect of the MRPS is determining the best location for racks to fulfil customer orders in real-time while also optimizing the robot's scheduling.This will be the focus of future research in this paper.

Figure 3 .
Figure 3.The framework of MA.
[5, 5000] Local search iterations     = 100 −   The categories of SKUs included in an order [1, 7] Tabu tenure 10 The maximum number of un-updated iterations of the global optimal solution.

Table 3 .
The values of relevant parameters.

Table 5 .
Comparison of the results of large-sized instances.

Table A3 .
Results of large-sized instances.