Next Article in Journal
The Future of Cotton in Brazil: Agroclimatic Suitability and Climate Change Impacts
Previous Article in Journal
Smart Edge Computing Framework for Real-Time Brinjal Harvest Decision Optimization
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multi-Station Agricultural Machinery Scheduling Based on Spatiotemporal Clustering and Learnable Multi-Objective Evolutionary Algorithm

1
College of Information Science and Engineering, Xinjiang University, Urumqi 830017, China
2
Key Laboratory of Signal Detection and Processing in Xinjiang Uygur Autonomous Region, Urumqi 830017, China
3
Key Laboratory of Multilingual Information Technology in Xinjiang Uygur Autonomous Region, Urumqi 830017, China
*
Author to whom correspondence should be addressed.
AgriEngineering 2025, 7(6), 197; https://doi.org/10.3390/agriengineering7060197
Submission received: 6 May 2025 / Revised: 15 June 2025 / Accepted: 16 June 2025 / Published: 18 June 2025

Abstract

:
The multi-station agricultural machinery scheduling process mainly involves two key stages: order allocation and path planning. Order allocation methods based solely on spatial distance cannot ensure the continuity of agricultural operations. Multi-objective evolutionary algorithms are sensitive to the initial population quality and local search strategies for path planning, where unreasonable initial solutions or improper local search strategies can affect the diversity of solutions. Therefore, we propose a spatiotemporal allocation algorithm that constructs a spatiotemporal distance function to describe the feasibility of continuous operations and evaluates the spatiotemporal proximity of operation points and stations for clustering allocation. In terms of path planning, we design a learnable multi-objective evolutionary algorithm (LMOEA). First, a hybrid initialization strategy is used to enhance the initial population quality; second, a Q-learning-based local search method is constructed to adaptively adjust the search strategy to reduce ineffective iterations; finally, a dynamically adjusted crowding distance mechanism is introduced to improve the distribution of the solution set. Experimental results show that the spatiotemporal allocation algorithm improves the average cost and satisfaction by 4.09% and 3.28% compared to the spatial method. Compared with INSGA-II, HTSMOGA, and NNITSA algorithms, the LMOEA can obtain solutions of higher quality and greater diversity.

1. Introduction

During the agricultural machinery service process, improper resource allocation, information circulation issues, and insufficient emergency response capabilities during peak periods are common challenges [1]. Agricultural service organizations typically face a large number of dispersed fieldwork tasks, each with different times, locations, and requirements. These tasks have seasonal characteristics, especially during the peak agricultural production period, when the demand for agricultural machinery is high and the time windows are tight, leading to inflexible resource allocation among agricultural machinery stations, resulting in equipment idling or overuse. Additionally, the high dispersion of operation points increases the difficulty of scheduling. Without reasonable routing, unnecessary detours and excessive transfer distances can lead to higher operational costs. Furthermore, mismatches between arrival times and farmers’ preferred time windows may reduce service satisfaction, ultimately weakening the long-term cooperation between farmers and cooperatives. On the other hand, the transmission of work information usually relies on manual operations, and delayed updates lead to a mismatch between machinery supply and farmer demand, causing the service organization to be unable to provide targeted services based on farmer needs.
The task assignment work is generally solved using agricultural machinery scheduling methods. The agricultural machinery scheduling problem refers to the reasonable arrangement of tasks, sequence, and operation time of agricultural machinery under limited resources to ensure efficient and economical operations [2].
In recent years, significant progress has been made in agricultural machinery scheduling models, with research focus gradually shifting from single-machine operation optimization to multi-machine coordination and further evolving towards complex multi-region, multi-depot scheduling problems. Early studies mainly focused on the optimization of single or a small number of machines, laying the theoretical and framework foundation for agricultural machinery scheduling models. Guan et al. [3] proposed a framework to manage mechanical movement sequences and coordination deadlocks. This framework offers methodological support for complex scheduling problems. Edwards et al. [4] constructed a multi-machine coordination scheduling framework from the perspective of operational readiness, enhancing efficiency in multi-location operations through task sequence, condition constraints, and location state linkage. Alaiso et al. [5] proposed an integrated scheduling model for agricultural contracting work by systematically describing tasks, resource replenishment points, and machinery attributes, effectively solving agricultural contracting operation scheduling problems. These studies provided basic modeling ideas for agricultural machinery scheduling but still had limitations in dealing with large-scale, multi-entity coordination scheduling problems.
With the development of mechanization and intelligence, agricultural machinery scheduling problems are gradually moving towards multi-machine collaboration, task optimization, and dynamic adjustment. Mohamed et al. [6] started from the perspective of agricultural machinery selection and scheduling, establishing a scheduling plan that minimizes the total cost of machinery. To address the uncertainty in scheduling, Aguayo et al. [7] proposed a cyclic dynamic rescheduling strategy, adjusting field status and machinery location in real-time. Seyyedhasani et al. [8] solved the real-time scheduling problem from the perspective of multi-point path planning. In terms of multi-machine collaboration, Cao et al. [9] explored the task allocation mechanism for multiple agricultural machines; He et al. [10] proposed a coordinated operation plan for harvesters and transporters for fragmented fields; Han et al. [11] focused on the dynamic collaboration between harvesters and auxiliary transport vehicles, effectively reducing operation waiting time. Agricultural machinery service scheduling research mainly focuses on improving service efficiency and resource utilization. In terms of single-region service, He et al. [12] developed an integer programming model to optimize the scheduling of agricultural machinery on fragmented fields. The model allows multiple harvesters to harvest the same field, improving efficiency and shortening the harvest cycle. Si et al. [13] proposed a hierarchical game method to optimize the real-time scheduling of harvesters, enhancing the matching degree of farmer demand and machinery utilization. Hou et al. [14] constructed a service quality evaluation model, considering factors such as time urgency, working conditions, field size, and distance, and adjusted the weight of time urgency dynamically to optimize the agricultural machinery rental plan. However, the single-region service model is often limited by resource capacity and geographical range when facing large-scale operation demands. Ding et al. [15] addressed the underutilization of individual agricultural machinery resources and the lack of trust among all parties involved in scheduling, proposing a blockchain-based regional agricultural machinery resource scheduling system and agricultural machinery scheduling optimization model to reduce regional agricultural production costs and improve resource utilization. However, the single-region service model often faces limitations in terms of resource capacity and geographical range when facing large-scale operation demands.
To address these challenges, multi-regional cooperative scheduling has gradually become a research hotspot. Ma et al. [16] proposed a multi-region coordination architecture to improve the allocation of agricultural machinery resources between regions using an improved knapsack model and hybrid algorithms to solve the resource imbalance problem between regions. Huang et al. [17] proposed a multi-station, multi-machine collaborative real-time response dispatch system based on fuzzy membership degrees to optimize scheduling efficiency and improve farmer satisfaction. Cao et al. [18] established a bi-objective optimization model for wheat harvesting scenarios across regions, providing decision support for multi-regional interconnection scheduling.
In terms of algorithms, with the increase in resource constraints and objectives in agricultural production, single-objective optimization solutions can no longer meet practical requirements, which has driven the development of multi-objective optimization algorithms. Multi-objective algorithms can consider multiple conflicting optimization objectives simultaneously, providing a set of non-dominated solutions for decision-makers to balance. Lu et al. [19] used adaptive crossover strategies to enhance the convergence capability of NSGA-II. Guo et al. [20] proposed a multi-objective genetic algorithm based on time window priority rules and tabu search strategies, significantly improving the efficiency and cost-effectiveness of cross-regional agricultural machinery coordination operations through time window priority-based initial population generation, sequential crossover operations, etc. Sethanan et al. [21] designed a multi-objective particle swarm optimization algorithm that combines a three-layer social structure and seven movement strategies to balance local search and global exploration. Liu et al. [22] developed an improved multi-objective immune algorithm that integrates non-dominated neighbor selection and tabu search, enhancing the algorithm’s performance in static partition optimization and dynamic cross-region cooperation.
For the multi-region agricultural machinery scheduling problem, a two-stage optimization method decomposes the problem into two sub-problems: area division and path planning, effectively reducing the complexity. Chen [23] addressed the multi-agricultural machinery scheduling problem with fuzzy time windows by first using clustering to transform it into a single cooperative scheduling problem and then optimizing the scheduling using an improved genetic algorithm. Huang et al. [24] combine Voronoi diagrams with particle swarm optimization to optimize the scheduling plans of multiple regions. Sheng et al. [25] propose a two-stage method based on K-Means and multi-objective evolutionary algorithms for emergency agricultural machinery operations under natural disaster conditions, effectively enhancing the executability of the operation path. Yu et al. [26] propose a two-stage emergency allocation algorithm for agricultural machinery, combining field division with intra-field machinery allocation, and optimize the operation scheduling of agricultural machinery in each area using an improved genetic algorithm to reduce grain loss.
Current research indicates that decoupling order allocation and path optimization into two subproblems and solving them step-by-step is an effective method. However, in the actual agricultural operation process, due to the seasonal characteristics of agricultural production, there is often a sudden increase in the demand for agricultural machinery during specific periods. Additionally, the demand is geographically dispersed, which requires efficient collaboration between multiple machinery stations. Existing order allocation methods based on spatial distance often only consider spatial factors when allocating tasks, neglecting important factors such as timeliness. This leads to deficiencies in scenarios with dense task demands, especially in ensuring the rationality of service times. Moreover, most current scheduling algorithms focus on single-objective optimization or use weighted methods to transform multi-objective problems into single-objective problems, which, although simple, introduces subjectivity in weight setting and limits their practical application. Multi-objective optimization algorithms can handle multiple competing objectives simultaneously, providing a set of trade-off solutions for decision-makers, but there are still deficiencies in maintaining solution diversity and convergence, and research on multi-objective optimization algorithms is relatively scarce. Therefore, how to improve the rationality of task allocation based on timeliness and design multi-objective optimization algorithms that can obtain high-quality non-dominated solution sets is the key issue that this paper needs to address.
In summary, this paper designs a two-stage scheduling framework starting from order allocation and path planning. To address the limitations of existing order allocation methods that only consider spatial factors, a parallel assignment algorithm based on spatiotemporal characteristics is proposed. This algorithm constructs a temporal distance function to describe the feasibility of continuous operations and evaluates the spatiotemporal proximity of operation points and stations for clustering allocation. Additionally, for the path planning problem, a learnable multi-objective evolutionary algorithm (LMOEA) based on the NSGA-II framework is designed. The algorithm improves the distribution of initial solutions through a hybrid initialization strategy and enhances the diversity of the population with a dynamic crowding distance mechanism. In terms of local search strategy selection, existing methods often use random selection, ignoring the knowledge generated during the evolutionary process. Therefore, we introduce a Q-learning agent to enable the algorithm to learn from the evolutionary process and guide the selection of search strategies, thereby improving the search efficiency of the algorithm.
Our specific work schedule is as follows:
First, we constructed a mathematical model for agricultural machinery operation service scheduling in Section 2.1. In Section 2.1.1, by analyzing the characteristics and influencing factors of the scheduling problem in detail, we clarified the definitions of various parameters and variables in the model and constructed a service satisfaction function based on fuzzy time windows. In Section 2.1.2, we established a multi-objective optimization model aimed at minimizing operational costs and maximizing service satisfaction, and detailed various constraints. Based on this, we introduced a two-stage algorithm framework in Section 2.2 and provided detailed descriptions of each stage in Section 2.3 and Section 2.4. Section 2.3 introduced our intelligent allocation of operation demands through an improved parallel assignment algorithm in the first stage, which not only considered traditional spatial distance factors but also introduced a time distance function to evaluate the temporal compatibility between operation points. In Section 2.4, we designed LMOEA in the second stage. LMOEA is based on the NSGA-II algorithm, with improvements in three aspects: initialization (Section 2.4.1), crowding distance calculation (Section 2.4.2), and local search (Section 2.4.3).
In Section 3.1 and Section 3.2 of the experiments, we defined the instances (datasets) and multi-objective algorithm evaluation criteria, respectively. In Section 3.3, we determined the main hyperparameters of the algorithm through parameter sensitivity experiments of LMOEA. Section 3.4 was designed to verify that the job allocation method based on spatiotemporal clustering in the first stage of Section 2.3 is superior to the job allocation method based on spatial clustering. Section 3.5 and Section 3.6 were designed to verify that the multi-objective evaluation criteria of LMOEA in the second stage of Section 2.4 are superior to INSGA-II, HTSMOGA, and NNITSA. Section 3.7 is an ablation study corresponding to the improvements of LMOEA in Section 2.4.1, Section 2.4.2, Section 2.4.3.

2. Methods

2.1. Problem Analysis and Objective Functions

2.1.1. Problem Description

The multi-site agricultural machinery operation service scheduling problem involves multiple agricultural machinery service stations and dispersed operation plots. As shown in Figure 1, the scheduling center must consider both the distance between operation plots and service stations, and the availability of agricultural machinery at each station. It must allocate orders to appropriate service stations and develop detailed operation plans to ensure that agricultural machinery completes all tasks along the optimal routes while meeting farmers’ time requirements.
In the actual operation service process, farmers usually have some flexibility regarding operation times and can accept moderate adjustments before or after their desired time windows. Although this temporal flexibility provides more opportunities for optimizing the scheduling scheme, it also increases decision-making complexity.
In response to the aforementioned problem description, this paper makes the following assumptions regarding the model:
1. Special circumstances are not considered: during the scheduling process, we temporarily disregard unpredictable special situations such as extreme weather conditions and field road congestion, which pose additional challenges to the model’s universality and the algorithm’s robustness.
2. Uniform agricultural machinery models and power: it is assumed that all agricultural machinery involved in the scheduling has the same model and known power, meaning that the machinery can maintain a constant speed during transfer and operation, thereby simplifying the calculation of operational efficiency.
3. Ignoring differences in operator experience: we do not consider the potential impact of different operators’ experience on the efficiency of agricultural machinery operations, which helps us focus on the impact of agricultural machinery scheduling itself on operational efficiency.
4. Fixed locations and demands: it is assumed that the locations of all agricultural machinery stations and the demands of the operation points are known and unchanged during the scheduling process, which helps us concentrate on optimizing the scheduling strategy rather than demand forecasting.
5. Single task for agricultural machinery: in one scheduling plan, all agricultural machinery will depart from the station, complete all operational demands, and return to the original warehouse, which helps simplify the path planning problem.
6. Single agricultural machinery operation limit: it is stipulated that a single operation point can only be operated by one piece of agricultural machinery, avoiding the complexity that may arise from multiple machines operating on a single point simultaneously.
7. Uniform dispatching costs: it is assumed that the single dispatching cost of agricultural machinery (including labor costs, startup costs, and maintenance costs) is known, which helps simplify cost calculations and makes the model more manageable.
The efficiency and quality of agricultural machinery operation services are influenced by various factors. Since agricultural machinery scheduling is a variant of the vehicle routing problem (VRP) and falls within the category of NP-hard problems, it is impossible to consider all factors comprehensively and embody them in a mathematical model in the study. Therefore, based on the assumptions made earlier, the key influencing factors analyzed in the model include the following points:
(1) Spatial distribution characteristics. The dispersed distribution of operation plots leads to frequent transfers of agricultural machinery, directly affecting transfer costs and operational efficiency.
(2) Service station resource constraints. The geographical distribution and amount of equipment at agricultural machinery service stations determine the service response capability to surrounding areas.
(3) Time window requirements. The timeliness of crop harvesting requires operations to be completed within specific time windows, which directly affects service quality and farmer satisfaction.
(4) Equipment load limitations. There are upper limits to the continuous operation time and daily operation area of agricultural machinery, which need to be considered in scheduling to avoid excessive loads leading to equipment damage and a decrease in operational quality.
Definitions of relevant parameters based on the problem description are shown in Table 1.
Agricultural machinery operation services are typical examples of agricultural socialized services. One of the core indicators for evaluating their quality is farmer satisfaction. Since agricultural production has strict requirements for timing, missing the optimal farming time can directly affect crop yield and quality. Therefore, the timeliness of operations becomes a decisive factor affecting farmer satisfaction. To effectively utilize this relationship in the optimization model, it is necessary to transform the subjective evaluation of timeliness by farmers into a quantifiable mathematical expression. In the actual operation service process, although farmers have a preferred operation time window, they also have a certain tolerance for the early or delayed arrival of agricultural machinery. This temporal flexibility depends on the growth characteristics of the crops, weather conditions, and the farmers’ own production arrangements. Therefore, this paper designs a satisfaction function based on fuzzy time windows [27].
As depicted in Figure 2, the acceptable service time frame [ E i , L i ] for farmers is segmented into several intervals. The interval [ e i , l i ] represents the optimal operating window preferred by farmers, during which agricultural machinery arriving at the operation site achieves the highest satisfaction level of 1. The interval [ E i , e i ] indicates the time range within which operations can be advanced, with satisfaction changes described by an exponential function that reflects the accelerated deterioration of impacts due to early operations (such as crops not being in the optimal state or inadequate farmer preparation) as time increases. The interval [ l i , L i ] is the time range within which operations can be delayed, and considering the characteristics of crop quality decline and increased losses after the optimal harvest period, the satisfaction function decreases using a power function. Parameters k and K i can adjust the sensitivity of the function to accommodate the time-sensitive characteristics of different scheduling plans. This nonlinear change in satisfaction more closely aligns with real-world scenarios, as shown in Equation (1).
c s i A T i k = exp K E i e i , A T i k < E i exp K A T i k e i , E i < A T i k < e i 1 , e i < A T i k < l i 1 A T i k l i L i l i K i , l i < A T i k < L i 0 , A T i k > L i

2.1.2. Objective Functions and Constraints

(1) Objective Function
In agricultural machinery operation services, since the area of the fields and the operating power of the agricultural machinery remain constant during the scheduling process, and the unit operating efficiency of the same type of machinery is essentially the same, the operating cost and time for each field can be considered fixed values without accounting for losses. The core of scheduling optimization lies in solving the transfer planning problem of agricultural machinery between different fields, which involves two key cost factors: first, the transfer cost from one field to another, and second, the usage cost directly related to the number of agricultural machines scheduled. Each piece of machinery requires a dedicated operator. This labor cost is relatively fixed. Additionally, relevant studies have shown that the maintenance cost of a single piece of agricultural machinery during the scheduling process accounts for about 20% to 30% of the operating cost [28]. Therefore, reducing the number of agricultural machines used can not only lower equipment maintenance costs but also significantly save on fixed labor expenses. Based on this, this study aims to optimize the scheduling plan to achieve the goals of minimizing transfer distance and the number of agricultural machines used. The cost optimization objective is shown in Equation (2).
F 1 = C 1 k M Z w k + C 2 i N T j N T x i j k · d i j
This study utilizes the reciprocal of the sum of all farmers’ satisfaction levels as an additional optimization objective. By minimizing this value, the overall service satisfaction is maximized, as shown in Equation (3).
F 2 = 1 i N T k M c s i ( A T i k )
(2) Constraints
Equations (4) and (5) represent the visitation constraints for the operation points, ensuring that each field can be serviced by only one piece of agricultural machinery and that it is visited exactly once.
k M Y i k = 1 , i N T
j N X i j k 1 , i N T , k M
Equation (6) represents the flow conservation constraint, which ensures that the number of agricultural machines entering and leaving each field is consistent.
j N X i j k = j N X j i k , i N T , k M
Equations (7) and (8) are designed to ensure that all agricultural machinery departs from the depot at the start of the tasks and returns to the original depot after all tasks are completed.
j N X i j k = w N D Z w k , i N T , k M
w N D Z w k = 1 , k M
Equation (9) imposes a constraint that the number of agricultural machines used from each depot must not exceed the number of machines available at that depot.   
k M Z w k V w , w N D
Equations (10) and (11) impose constraints on the working hours and the amount of work each piece of agricultural machinery can handle within a single scheduling period.
i N S i k · X i j k + t i j k · X i j k T max , k M
i N T q i Y i k Q k
Equations (12) and (13) are constraints on the service start time of agricultural machinery. If the service starts earlier than the acceptable earliest service time, the machinery must wait and cannot exceed the latest acceptable service time set by the farmers.
E i S T i k < L i , i N T , k M
S T i k = max E i , A T i k , i N T , k M
Equations (14) and (15) are constraints related to the transfer time and service time of agricultural machinery.
S i k = q i p k , i N T , k M
t i j k = d i j v k , i , j N , k M
Equations (16)–(18) are constraints on the decision variables.
X i j k { 0 , 1 } , i , j N , k M
Y i k { 0 , 1 } , i N T , k M
Z w k { 0 , 1 } , w N D , k M

2.2. Design of a Two-Stage Solution Algorithm Framework

Agricultural machinery operation services are essentially a type of multi-depot agricultural machinery scheduling problem with fuzzy time windows. Compared to single-depot scheduling, this problem presents higher complexity and challenges. For such multi-depot scheduling problems, customer clustering methods have been proven to be significantly effective [29,30,31]. By clustering customer demands, the search space in the path planning phase can be effectively reduced, significantly lowering computational complexity and aiding in the regionalized scheduling and allocation of agricultural machinery resources.
Agricultural machinery operation service scheduling can be considered an extension of the VRP and is classified as an NP-hard problem. Due to the complexity of the problem, solving it with exact algorithms poses significant challenges. NSGA-II [32], as a multi-objective evolutionary algorithm, has demonstrated excellent performance in solving multi-objective vehicle path optimization problems. This algorithm ensures robust global search capabilities through a fast non-dominated sorting mechanism and effectively maintains diversity in the solution space using a crowding distance calculation method.
Based on the above analysis, this paper proposes a two-stage algorithm framework, as shown in Figure 3, combining customer clustering with a LMOEA. The first stage designs a parallel assignment algorithm based on spatiotemporal distance, intelligently assigning operation demands to various depots based on temporal and spatial characteristics. The second stage, building on the NSGA-II algorithm framework, integrates hybrid initialization methods and learnable local search strategies, proposing a LMOEA for optimizing the path planning after assignment.

2.3. Parallel Assignment Algorithm Based on Spatiotemporal Distance

In multi-depot agricultural machinery scheduling, each station often has a limited number of machines and varying service capabilities. Therefore, clustering and assigning tasks according to order requirements are essential to improve overall resource utilization efficiency. The order assignment strategy directly affects the optimization of subsequent path planning; a reasonable allocation plan can provide more feasible solutions for path planning, enhancing the overall service quality of the scheduling scheme.
Existing research primarily considers order assignment from a spatial distance perspective, often using distance-based clustering methods or nearest-depot assignment strategies. While these methods can assign operation points that are geographically close to the same service station, they often overlook the constraints of the operation time windows. As shown in Figure 4a, although operation points are assigned to the nearest agricultural machinery station during the assignment process, the service times of points D and F are highly mismatched with those of other operation points at depots O 1 and O 2 . This leads to prolonged idling of agricultural machinery, resulting in resource wastage. In contrast, Figure 4b shows a more rational work group formed after considering temporal factors, shortening the cycle time to complete all tasks in the route, which is particularly important during resource-scarce peak farming periods.
Giosa proposed a parallel assignment algorithm for the multi-depot vehicle scheduling problem [33]. The algorithm determines the assignment order by evaluating the urgency (priority) of each task point, prioritizing the allocation of high-urgency task points to depots with sufficient service capacity. The calculation of urgency is primarily based on the spatial distance from the task point to the depot.
The specific calculation of urgency is shown in Equation (19), where d ( c , d e p ) represents the distance between the depot and task point c, and d ( c , d e p ) represents the distance between task point c and the nearest depot.
U ( c ) = d e p N D d ( c , d e p ) d ( c , d e p )
Additionally, to reduce computational complexity, the parallel assignment algorithm employs a simplified calculation method, which only considers the nearest and second-nearest depots during the urgency calculation, as shown in Equation (20).
U ( c ) = d ( c , d e p ) d ( c , d e p )
The parallel assignment algorithm, in evaluating the urgency of completing task points, still considers only spatial distance factors in the allocation process. To address the shortcomings of previous methods, we improve the calculation of algorithm urgency U ( c ) by designing a spatiotemporal distance function. This function differs from traditional spatial distance metrics as it comprehensively considers driving time, waiting time, and delay time across multiple temporal dimensions, to assess the temporal feasibility of performing consecutive operations between two task points. This improvement allows the scheduling system to balance spatial and temporal constraints simultaneously during task allocation, thereby reducing the occurrence of agricultural machinery idling or order delays. Suppose the operation points i and j have time windows [ E i , L i ] and [ E j , L j ] , respectively. Then, the arrival time of agricultural machinery k from i to j is A T k j [ E i + S i k + t i j k , L i + S i k + t i j k ] , and is set as [ E T , L T ] . The [ E T , L T ] has several relationships with the time window of operation point j, as shown in Figure 5.
If L T < E j , it indicates that agricultural machinery k completes its operation at point i and arrives at operation point j after the earliest acceptable operation time for the farmer, requiring waiting before starting the operation. At this time, the temporal distance is represented as the sum of transfer time and waiting time, which can be expressed as  E j L T + t i j k .
If E T < e j L T , it indicates that agricultural machinery k may arrive at operation point j earlier than the earliest acceptable operation time for the farmer. At this time, the calculation is made as ω 1 ( E j L T ) + t i j k , where ω 1 is a waiting time bias coefficient, represented as E j E T L T E T . The larger ω 1 is, the higher the likelihood of agricultural machinery k waiting at point j, and the higher the penalty.
If E T e j < l j L T , it indicates that the possible arrival time range of agricultural machinery k at point j is completely within the time window of operation point j, and agricultural machinery can start the operation on time. At this time, the temporal distance is directly represented as the transfer time ( t i j k ) between i and j.
If E T l j < L T , it indicates that there is a possibility of delay for agricultural machinery k arriving at operation point j. The handling method is similar to the second situation, introducing a delay penalty coefficient ω 2 , and at this time the temporal distance is represented as ω 2 ( L T L j ) + t i j k , and ω 2 can be expressed as L T L j L T E T .
If E T > L j , then agricultural machinery k will definitely be delayed when arriving at point j, at which time the temporal distance is represented as the sum of delay time and transfer time, which can be expressed as L T L i + t i j k , the sum of delay time and transfer time.
Summing up the above, by considering the five scenarios, the temporal distance can be summarized as Equation (21).
D i j T = E j L T + t i j k , L T < E j ( E j E T ) 2 ( L T E T ) + t i j k , E T < E j L T t i j k , E T E j < L i L T ( L T L j ) 2 L T E T + t i j k , E T < L i < L T L T L i + t i j k , E T < L j
Based on the above temporal distance function, the closeness between operation points and various stations in terms of time can be calculated by Equation (22), which evaluates the time feasibility of operation points i and those already assigned to station w in the continuous operation process.
A F ( i , w ) = j N T ( w ) { w } e D i j T | N T | , i , j N T , w N D
The proximity of operation point i to depot w can be calculated by Equation (23), which comprehensively considers both temporal and spatial factors.
C L ( i , w ) = d ( i , w ) A F ( i , w ) , i N T , w N D
The urgency of operation point i after improvement can be represented by Equation (24), where w and w are the two depots with the smallest and second smallest proximity, respectively.
U ( i ) = C L ( i , w ) C L ( i , w ) , w , w N D
Based on the above improvements, this chapter quantifies the time compatibility of operation points and stations through index transformation and value normalization, achieving self-adaptation of temporal and spatial constraints, increasing the likelihood that operation points with higher temporal compatibility are assigned to their corresponding stations.

2.4. Learnable Multi-Objective Evolutionary Algorithm

To address the agricultural machinery path optimization problem under time window constraints, we propose a Learnable Multi-Objective Evolutionary Algorithm (LMOEA) built upon the NSGA-II framework. While preserving its core components—including fast non-dominated sorting, crowding distance calculation, and elite preservation—LMOEA introduces three key improvements. First, a hybrid initialization strategy is employed to enhance population quality. Second, a reinforcement learning-based local search module is integrated to adaptively guide search behavior and reduce ineffective iterations. Third, a dynamically adjusted crowding distance mechanism is designed to promote better distribution across the Pareto front. The overall procedure is illustrated in Algorithm 1.
Algorithm 1 Learnable Multi-objective Evolutionary Algorithm (LMOEA)
Input: 
Population size N, crossover rate P c , local search rate P l , maximum generation G max , discount factor γ , learning rate α
Output: 
Non-dominated solution set P S
  1:
Generate initial population P using a hybrid initialization strategy
  2:
Initialize Q-table, set exploration rate ε
  3:
Evaluate population: perform non-dominated sorting and crowding distance calculation on P
  4:
for  g = 1 to G max  do
  5:
    for  i = 1 to N / 2  do
  6:
        Select parents { p 1 , p 2 } via binary tournament selection on P
  7:
        Generate offspring: O O SREX ( { p 1 , p 2 } , P c )
  8:
    end for
  9:
    Combine population: R P O
10:
    for each individual x R  do
11:
      if  rand ( ) < P l  then
12:
           Use ε -greedy strategy to select and perform a neighborhood operator
13:
           Update Q-table
14:
      end if
15:
   end for
16:
    P popUpdate ( R , N ) Select next generation based on non-dominated sorting and dynamic crowding distance
17:
   Update ε : ε f ( ε , g )
18:
end for
19:
return  P S

2.4.1. Hybrid Initialization Method

This study employs a hybrid initialization method aimed at overcoming the limitations of a single initialization strategy by generating a diversified and high-quality initial solution to enhance the overall performance and stability of the algorithm. This method divides the initial population according to the ratio of 4:3:3, using random initialization, the plane-sweeping algorithm [34], and an improved k-nearest-neighbors algorithm (KNN) [35]. This proportional allocation ensures both diversity and quality of the initial solution.
The plane-sweeping algorithm originates from the line segment intersection problem in computational geometry, with the core idea of transforming a two-dimensional plane problem into a one-dimensional sweep problem. In the operation area division, this algorithm uses agricultural stations as vertices, dividing adjacent customer points into the same area through a clockwise rotation sweep, as shown in Figure 6. To obtain a high-quality delivery route initial solution, this paper modifies the traditional plane-sweeping algorithm, completing area division while using insertion heuristics to construct the delivery route. This simultaneous optimization strategy ensures both geographical continuity and solution feasibility. The specific process is as follows:
  • Step 1: Draw a circle centered at the agricultural station O to cover all operation points, convert each operation point to polar coordinates ( r , θ ) , and randomly select an operation point as the starting point.
  • Step 2: Sweep the area in polar order and add operation points to the region, calculate the optimal insertion position for each new point, and update the delivery route to ensure it meets the operation limit and time window constraints.
  • Step 3: When no new operation points can be inserted, complete the current area division and route construction, and start the construction process for the next area.
  • Step 4: Repeat steps 2–3 until all operation points are assigned and the route construction is complete.
To consider the operation time window factor in the initialization process, this paper combines the KNN algorithm to design an initialization algorithm. In the VRP, the basic principle of the KNN algorithm is to select the nearest k operation points based on distance and randomly choose one to add to the route. In this paper, k neighboring operation points are calculated based on Equation (25) for selection, where the spatial distance and temporal distance are normalized and weighted to ensure that the selection of operation points considers both spatial and temporal characteristics. In this paper, α is taken as 0.5.
D i j S T = α D i j S min D S max D S min D S + ( 1 α ) D i j T min D T max D T min D T

2.4.2. Non-Dominated Sorting and Dynamic Crowding Distance Calculation

The algorithm achieves the selection of elite individuals in each round through non-dominated sorting and crowding distance, where non-dominated sorting is used to evaluate the quality of individuals, and crowding distance is used to maintain population diversity. The specific implementation process is as follows:
  • Step 1: Merge the parent population P ( t ) and the offspring population Q ( t ) to form a population R ( t ) with a size of 2 N .
  • Step 2: Perform fast non-dominated sorting on population R ( t ) to obtain non-dominated layers L = { L 1 , L 2 , , L n } .
  • Step 3: In the order of non-dominated levels, start from L 1 and select individuals to construct a new population until the population size reaches N.
  • Step 4: For the non-dominated layers L i that cannot be fully added, select individuals based on crowding distance d j in descending order.
In the above algorithm, when a non-dominated layer cannot be completely added to the new population, it is necessary to select individuals from this layer based on crowding distance. However, traditional methods only complete the removal of multiple individuals through a single crowding distance sorting, and then remove the solution with the smallest crowding distance.
To overcome the shortcomings in the above individual removal process, this paper designs a simple but effective dynamic crowding distance sorting method: whenever the smallest individual x i in the current non-dominated layer is removed, before selecting the next individual to remove, update the crowding distance of its neighboring solutions x i 1 and x i + 1 as the new sorting results to ensure that the crowding distance ranking always reflects the current distribution of the solution. As shown in Figure 7a, before x i is removed, the crowding distance (Dist) between neighboring individuals x i 1 and x i + 1 is calculated as follows:
d k ( x i 1 ) = | f k ( x i ) f k ( x i 2 ) | max ( f k ) min ( f k ) D i s t ( x i 1 ) = k = 1 m d k ( x i 1 )
d k ( x i + 1 ) = | f k ( x i ) f k ( x i + 2 ) | max ( f k ) min ( f k ) D i s t ( x i + 1 ) = k = 1 m d k ( x i + 1 )
After removing individual x i , as shown in Figure 7b, the crowding distances of neighboring solutions x i 1 and x i + 1 change, and their crowding distances D i s t are calculated by Equations (28) and (29).
d k ( x i 1 ) = | f k ( x i + 1 ) f k ( x i 2 ) | max ( f k ) min ( f k ) D i s t ( x i + 1 ) = k = 1 m d k ( x i 1 )
d k ( x i + 1 ) = | f k ( x i 1 ) f k ( x i + 2 ) | max ( f k ) min ( f k ) D i s t ( x i + 1 ) = k = 1 m d k ( x i + 1 )
By using this method, after removing each solution, only the crowding distance of its neighboring solutions needs to be updated, without the need to recalculate the crowding distance of all solutions, thus reducing computational complexity. At the same time, it improves the problem of uneven distribution of solutions caused by inaccurate crowding distance sorting.

2.4.3. Local Search Strategy Based on Q-Learning

This paper uses the Selective Route Exchange Crossover (SREX) operator for the crossover operation of the algorithm. The selection of individuals is based on the binary tournament method. Local search is a key technology to improve the efficiency of algorithm exploration and promote population diversity, effectively preventing the algorithm from falling into local optima [36]. However, in traditional methods, executing multiple neighborhood operators one by one takes too long, and random selection of operators leads to low success rates. To address these issues, this paper designs an adaptive local search method based on Q-learning, as illustrated in Figure 8. By considering the current state and historical performance of individuals, appropriate neighborhood operators are dynamically selected at each search step.
Q-learning, as a classic reinforcement learning algorithm, has been widely applied in the field of scheduling. This algorithm typically consists of five elements: Agent, Environment, Action, State, and Reward. Its core mechanism is to maintain a Q-table to record the long-term cumulative reward for each state-action pair. In this paper, each individual in the population X = { x 1 , x 2 , , x n } of size N is considered an intelligence entity. The state reflects the current adaptability of the individual, and the action corresponds to the available neighborhood operators. At each search step, the intelligence entity senses the current state, selects the optimal neighborhood operator based on the Q-table, executes it, then obtains the corresponding reward, and transitions to a new state. By continuously updating the values in the Q-table, it can continuously optimize the selection of neighborhood operators for individuals under different environments. The update formula for the Q-table is:
Q ( s t , a t ) = ( 1 α ) · Q ( s t , a t ) + α · ( r t + γ · max Q ( s t + 1 , a ) )
State Design: Considering the complexity of solving the combined optimization problem, directly using the structure characteristics of the solution space as the state representation may lead to a dimensionality disaster. Therefore, this paper proposes a state representation method based on the target space, dividing the solution space into four quadrants to simplify the state representation. The specific division process is as follows:
  • Step 1: Sort all individuals in the population by f 1 value in ascending order and calculate the median f 1 M e d = f 1 ( x N / 2 ) .
  • Step 2: Use the same method to sort by f 2 value and calculate the median f 2 M e d .
  • Step 3: Use f 1 M e d and f 2 M e d as boundaries to divide the target space into four quadrants, each quadrant representing a state.
This method of dividing the solution space based on medians has the following advantages: First, it disperses the continuous target space into four states, significantly reducing the complexity of the state space; second, it divides the target space by the median of the population’s objective values, ensuring that each quadrant contains approximately equal numbers of individuals, maintaining the balance of the state division; finally, this dynamic partitioning method can adapt to the evolution process, better reflecting the distribution characteristics of the current population. Table 2 provides the specific conditions for the four states.
Reward Function: After each training session, each neighborhood selection action is rewarded based on the improvement effect of the individual. In this paper, the reward function is calculated based on the comprehensive improvement effect on multiple optimization objectives, and can be defined as:
r t = max k = 1 K f k min f k ( x t ) f k ( x t 1 ) f k ( x t ) f k max f k min , 0
where f k max and f k min represent the maximum and minimum values found for the k-th optimization objective, respectively. x t 1 denotes the initial solution, and x t is the new solution obtained by the neighborhood search. According to Equation (31), the larger the target function improvement, the the closer to the current optimal solution, the greater the reward. Additionally, negative feedback will reduce the impact of successful rewards, making the positive reward more prominent. When the comprehensive improvement of the new solution is negative, the reward is set to 0 without punishment.
Strategy: To balance exploration of new actions and exploitation of known optimal actions, this algorithm adopts the ε -greedy strategy [37]. Specifically, at each decision step, an action is selected at random with probability ε (exploration), and with probability 1 ε , the action with the highest Q-value under the current state is selected (exploitation). The value of ε gradually decreases with the generation index g, enabling the algorithm to smoothly transition from exploration to exploitation as learning progresses.
Actions: As shown in Figure 9, this paper uses six neighborhood search operators as the action set in Q-learning, and adopts the acceptance rule based on the Pareto optimality principle: when a new solution x dominates the current solution x, it is directly accepted; when x is non-dominated by x, it is accepted with a probability of 0.5. This strategy ensures that the quality of the solution will not decrease while providing sufficient exploration capability to escape from local optima. The neighborhood operators are designed as follows:
  • Swap: Exchange the positions of two nodes in the same or different paths.
  • 2-opt: Select two non-adjacent edges in the path, break them, and reconnect them in reverse order.
  • Relocate: Move a node from its initial position to a new position in the same or different path.
  • 2-Relocate: Move two consecutive nodes from their original positions to new positions in the same or different paths.
  • Arc-Swap: Exchange two consecutive node sequences in the same or different paths.
  • Path-Remove: Randomly remove a node sequence from a path and insert it into another path.

3. Experimental Verification and Analysis

3.1. Instance Description

Based on the characteristics of the two types of operation requirements, this study designs two categories of a total of 12 test instances to verify the effectiveness of the proposed algorithm. These instances are constructed based on representative real-world patterns, partially illustrated in Table 3, which displays information for the first ten operation points. Each instance is labeled using the format “feature type—number of operation points—number of service stations” and includes both C-type and D-type cases, where the number of operation points N { 45 , 70 , 80 , 100 , 120 , 160 } and the corresponding number of service stations M { 2 , 3 , 4 , 4 , 5 , 5 } . Among them, C-type instances correspond to large-scale farms, where farmland is spatially clustered and fields within the same cluster typically belong to the same farmer or cooperative. These instances feature relatively relaxed time windows (1–3 days) and similar operation time requirements within clusters. In contrast, D-type instances represent smallholder farms, which are uniformly scattered and subject to more varied and tighter time constraints. Their operation windows are limited to specific periods within a day (e.g., [480, 720] denotes 8:00–12:00).
The farmland areas of the two types of test instances are all between 0.5 and 2.5 hectares. Each service station is randomly assigned 5 to 10 seeding machines, which are uniformly configured with the same operational parameters. Additionally, the parameters of the agricultural machines during the experiment are set as shown in Table 4.

3.2. Evaluation Metrics

The hypervolume (HV) calculates the multidimensional space volume formed by the non-dominated solution set and reference points [38]. For the minimization problem, given a non-dominated solution set P and a reference point r, HV can be represented as:
H V ( P , r ) = Λ x P f 1 ( x ) , r 1 × × f m ( x ) , r m
where Λ represents the Lebesgue measure (in two dimensions it is the area), f ( x ) represents the function value of solution x on the i-th objective, r i represents the coordinate of the reference point on the i-th dimension, and m is the number of objectives. The larger the HV value, the better the algorithm’s performance, as it reflects both the convergence and diversity of the solution set. To eliminate the impact of different objective scales, the reference point is chosen as [ 1 , 1 ] after normalization before calculating HV.
The inverted generational distance (IGD) measures the proximity of the non-dominated solution set obtained by the algorithm to the true Pareto front [39]. Given a set P * uniformly distributed along the true Pareto front and the non-dominated solution set P obtained by the algorithm, IGD is defined as:
I G D ( P , P * ) = v P * d ( v , p ) | P * |
where d ( v , P ) represents the Euclidean distance from v to the nearest point in set P. The smaller the IGD value, the closer the solution set obtained by the algorithm is to the true Pareto front, indicating better performance.

3.3. Algorithm Parameter Settings

To determine the optimal parameter configuration for the LMOEA algorithm, this section uses the Taguchi method [40] on the four key parameters of the algorithm: crossover probability P c , local search probability P l , learning rate α , and discount factor γ . Each parameter is set at three levels: crossover probability P c is set to 0.3, 0.5, and 0.7; local search probability P l is set to 0.2, 0.3, and 0.4; learning rate α is set to 0.1, 0.2, and 0.3; discount factor γ is set to 0.7, 0.8, and 0.9. The experiment uses an L9 ( 3 4 ) orthogonal array (as shown in Table 5), and each parameter combination is run independently 20 times to ensure the statistical reliability of the experiment.
Figure 10 shows the main effects of each parameter on the average HV value. By calculating the average HV values of each parameter at different levels, it can be observed that the crossover probability P c and the local search probability P l have a greater impact on the algorithm’s performance, with the optimal values being 0.5 and 0.3, respectively. The learning rate α achieves the best results at 0.1, indicating that a smaller learning rate is beneficial for the stability of the algorithm; the discount factor γ achieves the best results at 0.9, indicating that the algorithm is more inclined to consider long-term benefits. Based on the main effect analysis, the parameter configuration of the algorithm is determined as: P c = 0.5 , P l = 0.3 , α = 0.1 , and γ = 0.9 . This parameter combination will be used for subsequent experiments.

3.4. Validation of Allocation Strategy Effectiveness

To verify the advantages of the improved parallel assignment algorithm proposed in this paper for the multi-depot scheduling process in the order aggregation process, two allocation strategies based on spatial distance and spatiotemporal distance are applied in each test case, respectively. The same LMOEA algorithm is used for subsequent path planning. To ensure the statistical reliability of the results, each test case is run independently 20 times, and the best solution is selected for comparative analysis. The objective function of each solution is normalized to F 1 and F 2 , and the Euclidean distance d = ( F 1 ) 2 + ( F 2 ) 2 is calculated between the solutions and the origin. The comparison results of the operation cost ( F 1 ) and the total farmer satisfaction ( 1 / F 2 ) of the two allocation methods are shown in Table 6.
From the results in Table 6, it is shown that for dataset C, due to the relatively concentrated distribution of farmland, the improvement rate of the strategy’s cost is between −0.39% and 1.30% (average 0.62%), and the average satisfaction is increased by 1.90%. This is mainly because the spatial distance of the concentrated farmland has better reflected its correlation, and the time window of the large-scale farmer is relatively loose, making the introduction of the time dimension relatively limited. In dataset D, the strategy shows significant advantages, with cost improvement rates between 5.96% and 9.67% and satisfaction improvement rates between 3.67% and 5.71%. This indicates that in the scattered farmland scenario, the spatiotemporal dimension allocation strategy better adapts to the time window constraints of small-scale farmers, reducing the ineffective movement between regions, thus achieving more significant improvements in cost and satisfaction.
Figure 11a and Figure 11b, respectively, show the path planning results of the LMOEA algorithm proposed in this paper based on two allocation strategies.

3.5. Performance Evaluation of Path Planning Algorithm

To evaluate the performance of the LMOEA algorithm in this paper’s agricultural machinery scheduling case, this paper selected instances similar to the scenario and compared them with multi-objective evolutionary algorithms based on Pareto optimization: HTSMOGA (2024) [20], NNITSA (2021) [22], and INSGA-II (2023) [19]. Among them, HTSMOGA and INSGA-II are improved algorithms based on the NSGA-II framework. HTSMOGA introduces time window priority rules and hybrid search strategies to address cross-region agricultural machinery scheduling issues, while INSGA-II combines adaptive crossover strategies to improve solution quality. NNITSA is based on NNIA, integrating non-dominated sorting and tabu search mechanisms. These algorithms all face similar agricultural machinery scheduling scenarios and use Pareto optimization for multi-objective optimization, thus have good comparability. To ensure fairness of the experiment, the population size of all algorithms is set to 50, and other parameters are set according to the recommended values in the original paper. The comparison results of the four algorithms in HV and IGD are shown in Table 7.
From the experimental data, it can be seen that the LMOEA algorithm shows significant advantages in terms of convergence and diversity of solutions. In terms of the HV metric, the LMOEA algorithm achieved higher values in different test instances, indicating that this algorithm can effectively maintain the diversity of the population. Especially in the D-type problems with dispersed locations and tight time windows, the advantages of this algorithm are more pronounced compared to the comparative algorithms. In terms of the IGD metric, the values of LMOE are generally lower than those of the comparative algorithms, proving that it can obtain solutions closer to the true Pareto front.

3.6. Statistical Significance and Stability Analysis

To further verify the stability and statistical significance of the algorithm’s performance, Figure 12 presents the box plots of the HV and IGD metrics across different test instances. Similar to the sensitivity experiments, each configuration was independently repeated 20 times to evaluate performance consistency across runs. It can be observed that the LMOEA algorithm not only achieves the best median values in all test instances but also exhibits consistently low variance, as indicated by the narrow interquartile ranges and short whiskers in the plots. These results demonstrate a high degree of robustness and reliability.
As the problem scale increases, the performance gap between LMOEA and the baseline algorithms becomes increasingly evident, demonstrating its superior scalability in large-scale scenarios. HTSMOGA ranks second, showing relatively stable results in medium-sized instances. In contrast, NNITSA and NSGA-II tend to perform less effectively in most cases, with their results exhibiting greater fluctuation as the problem size grows.
Figure 13 illustrates the Pareto front distributions obtained by various algorithms under different scale problems, where the horizontal axis F 1 represents the scheduling cost, and the vertical axis F 2 represents the reciprocal of service satisfaction. These two objectives are in conflict: improving satisfaction usually increases cost, while minimizing cost can reduce satisfaction. Since both objectives are to be minimized, solutions closer to the lower left corner are more optimal.
From Figure 13, it can be clearly seen that the LMOEA algorithm consistently achieves superior Pareto fronts across all instances, with solutions that are closer to the lower left corner and more evenly distributed, indicating its significant advantage in balancing the two conflicting objectives.Furthermore, as the problem scale increases—from 70 to 120 operation points—the relative advantage of LMOEA becomes more pronounced. In larger instances, the Pareto fronts generated by baseline algorithms become increasingly sparse or regress toward suboptimal regions, whereas the LMOEA fronts remain dense and well-spread along the lower-left boundary. This indicates that LMOEA not only maintains high solution quality under more complex conditions but also exhibits strong scalability and robustness when handling large-scale multi-objective scheduling problems.

3.7. Ablation Experiment

To verify the effectiveness of the key components in the LMOEA algorithm, this paper constructs three variants of LMOEA, as shown in Table 8. By gradually adding a hybrid initialization strategy, a dynamic crowding distance mechanism, and a local search strategy based on Q-learning, the optimization effect of each module is evaluated.
Table 9 shows the performance comparison of four algorithm variants on 12 test instances. The experimental results indicate that LMOEA-V2 significantly outperforms the base version LMOEA-V1 in all instances. The hybrid initialization strategy, by combining problem feature information and heuristic rules, can better guide the population to high-quality solution space evolution. The dynamic crowding distance mechanism further enhances the diversity and convergence of LMOEA-V3. After integrating Q-learning guided local search, LMOEA outperforms LMOEA-V3 in most instances.
We use the Friedman test to analyze the performance differences among various algorithmic variants. As shown in Table 10, the test results indicate that there are significant differences in the HV and IGD metrics among the four algorithms (p < 0.05). In terms of average ranking, the complete LMOEA achieves the best performance in both metrics (HV is 1.17, IGD is 1.08), and as key components are gradually added, the performance of each variant gradually improves, verifying the effectiveness of each component’s enhancement.

4. Conclusions

In real-world operations, agricultural machinery cooperatives face several challenges: (1) the demand for services is often highly concentrated in both time and space, requiring the completion of numerous scattered field tasks within narrow time windows; (2) agricultural machinery resources are unevenly distributed, making it necessary to coordinate equipment deployment across multiple service stations; and (3) farmers’ expectations for service quality are rising, requiring scheduling solutions that balance operational efficiency with service satisfaction.
To address these challenges, we first formulated a mathematical model for agricultural machinery operation scheduling. By thoroughly analyzing the problem characteristics and influencing factors, we defined key parameters and variables and introduced a service satisfaction function based on fuzzy time windows. A multi-objective optimization model was then established to minimize operational costs and maximize service satisfaction, subject to various practical constraints. To solve the proposed model, we designed a two-stage solution framework. In the first stage, an improved parallel assignment algorithm was developed to allocate field tasks to service stations by jointly considering spatial and temporal proximity. Comparative experiments with traditional methods demonstrated superior performance across multiple instances. In the second stage, a learnable multi-objective evolutionary algorithm (LMOEA) based on the NSGA-II framework was proposed, featuring enhancements in initialization, crowding distance computation, and local search. Experimental results confirmed that LMOEA outperforms baseline algorithms (INSGA-II, HTSMOGA, NNITSA) in terms of both solution quality and diversity for multi-depot, multi-objective scheduling problems.
Looking forward, the proposed scheduling approach has significant practical implications for agricultural cooperatives. Future research will aim to further extend the model in three directions: (1) integrating heterogeneous multi-task scheduling scenarios involving different types of agricultural machinery and task requirements; (2) incorporating uncertain conditions such as equipment failures or dynamic demand changes to improve robustness under real-world constraints; and (3) enhancing scalability to accommodate ultra-large-scale scheduling problems involving hundreds of operation points. These enhancements will contribute to improving the intelligence, flexibility, and reliability of agricultural machinery scheduling systems, supporting the modernization of agricultural service organizations.

Author Contributions

Conceptualization, L.J. and Q.Z.; methodology, L.J.; software, Q.Z.; formal analysis, L.J.; investigation, Q.Z.; resources, S.L.; data curation, B.K.; writing—original draft preparation, L.J.; writing—review and editing, S.L.; visualization, Y.L.; supervision, S.L.; project administration, S.L.; funding acquisition, S.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Major Science and Technology Projects in Xinjiang Uygur Autonomous Region (No.2022A02012-1) and National Science and Technology Major Project (No.2022 ZD0115801).

Data Availability Statement

The data presented in this study are available from the corresponding author on reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Hu, X.; Zhou, X. The Development Path of Socialized Agricultural Machinery Service for Small-scale Agriculture—A Comparative Study among China, Japan and South Korea. J. China Agric. Univ. (Soc. Sci.) 2025, 42, 207–218. [Google Scholar] [CrossRef]
  2. Huang, H.; Chen, Y. Research Progress of Agricultural Machinery Scheduling Technology Based on Time Window. J. Agric. Sci. Technol. 2022, 24, 93–106. [Google Scholar] [CrossRef]
  3. Guan, S.; Nakamura, M.; Shikanai, T.; Okazaki, T. Resource assignment and scheduling based on a two-phase metaheuristic for cropping system. Comput. Electron. Agric. 2009, 66, 181–190. [Google Scholar] [CrossRef]
  4. Edwards, G.; Bochtis, D.; Sørensen, C.G. Multi-machine coordination: Scheduling operations based on readiness criteria and using a modified tabu search algorithm. IFAC Proc. Vol. 2013, 46, 191–195. [Google Scholar] [CrossRef]
  5. Alaiso, S.; Backman, J.; Visala, A. Ant colony optimization for scheduling of agricultural contracting work. IFAC Proc. Vol. 2013, 46, 133–137. [Google Scholar] [CrossRef]
  6. Mohamed, M.A.; Kheiry, A.N.O.; Rahama, A.E.; Alameen, A.A. Optimization model for machinery selection of multi-crop farms in Elsuki agricultural scheme. Turk. J. Agric. Food Sci. Technol. 2017, 5, 739–744. [Google Scholar] [CrossRef]
  7. Aguayo, M.M.; Sarin, S.C.; Cundiff, J.S.; Comer, K.; Clark, T. A corn-stover harvest scheduling problem arising in cellulosic ethanol production. Biomass Bioenergy 2017, 107, 102–112. [Google Scholar] [CrossRef]
  8. Seyyedhasani, H.; Dvorak, J.S. Dynamic rerouting of a fleet of vehicles in agricultural operations through a Dynamic Multiple Depot Vehicle Routing Problem representation. Biosyst. Eng. 2018, 171, 63–77. [Google Scholar] [CrossRef]
  9. Cao, R.; Li, S.; Ji, Y.; Zhang, Z.; Xu, H.; Zhang, M.; Li, M.; Li, H. Task assignment of multiple agricultural machinery cooperation based on improved ant colony algorithm. Comput. Electron. Agric. 2021, 182, 105993. [Google Scholar] [CrossRef]
  10. He, P.; Li, J. A joint optimization framework for wheat harvesting and transportation considering fragmental farmlands. Inf. Process. Agric. 2021, 8, 1–14. [Google Scholar] [CrossRef]
  11. Han, X.; Wu, H.; Zhu, H.; Gu, J.; Guo, W.; Miao, Y. Scheduling of Collaborative Vegetable Harvesters and Harvest-Aid Vehicles on Farms. Agriculture 2024, 14, 1600. [Google Scholar] [CrossRef]
  12. He, P.; Li, J.; Wang, X. Wheat harvest schedule model for agricultural machinery cooperatives considering fragmental farmlands. Comput. Electron. Agric. 2018, 145, 226–234. [Google Scholar] [CrossRef]
  13. Si, H.; Li, Y.; Sun, C.; Qiao, H.; Hu, X. A hierarchical game approach on real-time navigation scheduling of agricultural harvesters. Comput. Electron. Agric. 2019, 162, 112–118. [Google Scholar] [CrossRef]
  14. Hou, J.; Zhang, J.; Wu, W.; Jin, T.; Zhou, K. Research on agricultural machinery rental optimization based on the dynamic artificial bee-ant colony algorithm. Algorithms 2022, 15, 88. [Google Scholar] [CrossRef]
  15. Ding, C.; Wang, L.; Chen, X.; Yang, H.; Huang, L.; Song, X. A blockchain-based wide-area agricultural machinery resource scheduling system. Appl. Eng. Agric. 2023, 39, 1–12. [Google Scholar] [CrossRef]
  16. Ma, J.; Yuan, Y.; Ren, N.; Guo, Y.; Liu, X. Optimal allocation of agricultural machinery service resources under multi-regional coordinated scheduling architecture. J. China Agric. Univ. 2020, 25, 113–122. [Google Scholar]
  17. Huang, H.; Chen, Y.; Zhu, M.; Liu, Y.; Zhang, Z. Multi-site and multi-machine cooperative instant response scheduling system based on fuzzy membership. Trans. Chin. Soc. Agric. Eng. 2021, 37, 71–79. [Google Scholar]
  18. Cao, G.; Ma, B.; Chen, C.; Ren, B.; Hu, C. Agricultural Machinery Cross-Region Scheduling Optimization Based on Genetic Algorithm Variable Neighborhood Search. Trans. Chin. Soc. Agric. Mach. 2023, 54, 114–123. [Google Scholar]
  19. Lu, Z.; Zhao, Z.; Long, L.; Ma, Y.; Lei, L.; Liu, Z.; Dai, F.; Zhang, Y.; Li, J. Multi-robot task allocation in agriculture scenarios based on the improved NSGA-II algorithm. In Proceedings of the 2023 IEEE 98th Vehicular Technology Conference (VTC2023-Fall), Hong Kong, China, 10–13 October 2023; pp. 1–6. [Google Scholar]
  20. Guo, Y.; Zhang, F.; Chang, S.; Li, Z.; Li, Z. Research on a multiobjective cooperative operation scheduling method for agricultural machinery across regions with time windows. Comput. Electron. Agric. 2024, 224, 109121. [Google Scholar] [CrossRef]
  21. Sethanan, K.; Neungmatcha, W. Multi-objective particle swarm optimization for mechanical harvester route planning of sugarcane field operations. Eur. J. Oper. Res. 2016, 252, 969–984. [Google Scholar] [CrossRef]
  22. Liu, X.; Zhu, X.; Hao, K. Dynamic immune cooperative scheduling of agricultural machineries. Complex Intell. Syst. 2021, 7, 2871–2884. [Google Scholar] [CrossRef]
  23. Chen, Y. Method Research and System Design of Agricultural Machinery Scheduling. Master’s Thesis, Huazhong Agricultural University, Wuhan, China, 2022. [Google Scholar]
  24. Huang, H.; Cuan, X.; Chen, Z.; Zhang, L.; Chen, H. A multiregional agricultural machinery scheduling method based on hybrid particle swarm optimization algorithm. Agriculture 2023, 13, 1042. [Google Scholar] [CrossRef]
  25. Sheng, H.; Liu, X.; Yang, H. Two-stage optimization method for emergency disaster relief operation path of agricultural machinery based on genetic algorithm and clustering algorithm. Acta Agric. Shanghai 2024, 40, 115–124. [Google Scholar] [CrossRef]
  26. Yu, C.; Chang, S.; Zhang, F.; Yao, J.; Meng, Y.; Guo, Y. Research on emergency allocation of agricultural machinery for cross-regions operation in large-scale farmland. J. Chin. Agric. Mech. 2024, 45, 196–203. [Google Scholar] [CrossRef]
  27. Yang, F.; Tao, F. A bi-objective optimization VRP model for cold chain logistics: Enhancing cost efficiency and customer satisfaction. IEEE Access 2023, 11, 127043–127056. [Google Scholar] [CrossRef]
  28. Wang, Y.J.; Huang, G.Q. A two-step framework for dispatching shared agricultural machinery with time windows. Comput. Electron. Agric. 2022, 192, 106607. [Google Scholar] [CrossRef]
  29. Xue, S. An adaptive ant colony algorithm for crowdsourcing multi-depot vehicle routing problem with time windows. Sustain. Oper. Comput. 2023, 4, 62–75. [Google Scholar] [CrossRef]
  30. Jiao, L.; Peng, Z.; Xi, L.; Guo, M.; Ding, S.; Wei, Y. A multi-stage heuristic algorithm based on task grouping for vehicle routing problem with energy constraint in disasters. Expert Syst. Appl. 2023, 212, 118740. [Google Scholar] [CrossRef]
  31. Wang, Y.; Luo, S.; Fan, J.; Xu, M.; Wang, H. Compensation and profit allocation for collaborative multicenter vehicle routing problems with time windows. Expert Syst. Appl. 2023, 233, 120988. [Google Scholar] [CrossRef]
  32. Deb, K.; Pratap, A.; Agarwal, S.; Meyarivan, T.A.M.T. A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans. Evol. Comput. 2002, 6, 182–197. [Google Scholar] [CrossRef]
  33. Giosa, I.D.; Tansini, I.L.; Viera, I.O. New assignment algorithms for the multi-depot vehicle routing problem. J. Oper. Res. Soc. 2002, 53, 977–984. [Google Scholar] [CrossRef]
  34. Solomon, M.M. Algorithms for the vehicle routing and scheduling problems with time window constraints. Oper. Res. 1987, 35, 254–265. [Google Scholar] [CrossRef]
  35. Wang, L.; Lu, J. A memetic algorithm with competition for the capacitated green vehicle routing problem. IEEE/CAA J. Autom. Sin. 2019, 6, 516–526. [Google Scholar] [CrossRef]
  36. Chen, B.; Zeng, W.; Lin, Y.; Zhang, D. A new local search-based multiobjective optimization algorithm. IEEE Trans. Evol. Comput. 2014, 19, 50–73. [Google Scholar] [CrossRef]
  37. dos Santos Mignon, A.; da Rocha, R.L.d.A. An adaptive implementation of ε-greedy in reinforcement learning. Procedia Comput. Sci. 2017, 109, 1146–1151. [Google Scholar] [CrossRef]
  38. Guerreiro, A.P.; Fonseca, C.M.; Paquete, L. The hypervolume indicator: Computational problems and algorithms. ACM Comput. Surv. 2021, 54, 1–42. [Google Scholar] [CrossRef]
  39. Cai, X.; Xiao, Y.; Li, M.; Hu, H.; Ishibuchi, H.; Li, X. A grid-based inverted generational distance for multi/many-objective optimization. IEEE Trans. Evol. Comput. 2020, 25, 21–34. [Google Scholar] [CrossRef]
  40. Freddi, A.; Salmon, M. Introduction to the Taguchi method. In Design Principles and Methodologies: From Conceptualization to First Prototyping with Examples and Case Studies; Springer: Cham, Switzerland, 2019; pp. 159–180. [Google Scholar]
Figure 1. Schematic diagram of agricultural machinery operation service.
Figure 1. Schematic diagram of agricultural machinery operation service.
Agriengineering 07 00197 g001
Figure 2. Satisfaction function.
Figure 2. Satisfaction function.
Agriengineering 07 00197 g002
Figure 3. Two-stage solution algorithm.
Figure 3. Two-stage solution algorithm.
Agriengineering 07 00197 g003
Figure 4. Schematic Diagram of Spatiotemporal Path Planning for Two Allocation Methods. (Each path forms a closed-loop route starting and ending at the same depot, consistent with Capacitated Vehicle Routing Problem structure). (a) Clustering and operation routes considering geographical distance. (b) Clustering and operation routes considering spatiotemporal distance.
Figure 4. Schematic Diagram of Spatiotemporal Path Planning for Two Allocation Methods. (Each path forms a closed-loop route starting and ending at the same depot, consistent with Capacitated Vehicle Routing Problem structure). (a) Clustering and operation routes considering geographical distance. (b) Clustering and operation routes considering spatiotemporal distance.
Agriengineering 07 00197 g004
Figure 5. Schematic diagram of temporal distance.
Figure 5. Schematic diagram of temporal distance.
Agriengineering 07 00197 g005
Figure 6. Schematic diagram of plane-sweeping.
Figure 6. Schematic diagram of plane-sweeping.
Agriengineering 07 00197 g006
Figure 7. Schematic Diagram of Crowding Distance Update.
Figure 7. Schematic Diagram of Crowding Distance Update.
Agriengineering 07 00197 g007
Figure 8. Schematic diagram of the Q-learning-based local search process.
Figure 8. Schematic diagram of the Q-learning-based local search process.
Agriengineering 07 00197 g008
Figure 9. Neighborhood operators.
Figure 9. Neighborhood operators.
Agriengineering 07 00197 g009
Figure 10. Main effects of HV.
Figure 10. Main effects of HV.
Agriengineering 07 00197 g010
Figure 11. Planning Results of Two Allocation Methods. (a) Based on spatial distance allocation result. (b) Based on spatiotemporal distance allocation result.
Figure 11. Planning Results of Two Allocation Methods. (a) Based on spatial distance allocation result. (b) Based on spatiotemporal distance allocation result.
Agriengineering 07 00197 g011
Figure 12. Box plots of HV and IGD metrics for four algorithms.
Figure 12. Box plots of HV and IGD metrics for four algorithms.
Agriengineering 07 00197 g012
Figure 13. Pareto fronts for six instances.
Figure 13. Pareto fronts for six instances.
Agriengineering 07 00197 g013
Table 1. Definition of relevant variables and parameters.
Table 1. Definition of relevant variables and parameters.
TypeSymbolDescription
SetsNSet of all nodes, including depots and operation points, N = { 1 , 2 , , n }
N D Set of agricultural machinery stations, representing the depots where agricultural machinery is stationed, N D = { 1 , 2 , , m }
N T Set of fields, representing the operation points served, N T = { m + 1 , , n }
MSet of agricultural machinery, representing the available agricultural machinery, M = { 1 , 2 , , k }
Parameters C 1 Fixed usage cost per piece of agricultural machinery
C 2 Transfer cost per unit distance for each piece of agricultural machinery
V w Total amount of agricultural machinery owned by depot w
d i j Distance from node i to node j
v k Transfer speed of agricultural machinery k
q i Task demand of operation point i
P k Operating power of agricultural machinery k
[ e i , l i ] Expected completion time window for operation point i
[ E i , L i ] Acceptable time window for operation point i
T max Maximum working time of agricultural machinery within a scheduling period
Q k Maximum workload of agricultural machinery k within a scheduling period
Variables X i j k Binary variable, indicating whether agricultural machinery k transfers from node i to node j
Z w k Binary variable, indicating whether agricultural machinery k departs from depot w
Y i k Binary variable, indicating whether operation point i is assigned to agricultural machinery k
A T i k Time at which agricultural machinery k arrives at operation point i
L T i k Time at which agricultural machinery k leaves operation point i
S T i k Waiting time of agricultural machinery k at operation point i
S i k Operating time of agricultural machinery k at operation point i
t i j k Transfer time of agricultural machinery k from operation point i to operation point j
Table 2. State judgment condition.
Table 2. State judgment condition.
StateDescriptionStateDescription
1 f 1 ( x i ) f 1 M e d f 2 ( x i ) f 2 M e d 3 f 1 ( x i ) f 1 M e d f 2 ( x i ) f 2 M e d
2 f 1 ( x i ) f 1 M e d f 2 ( x i ) > f 2 M e d 4 f 1 ( x i ) f 1 M e d f 2 ( x i ) > f 2 M e d
Table 3. Operation point information.
Table 3. Operation point information.
Operation PointCoordinatesOperation Area (ha)
1(89.62337080817368,43.83519517958557)0.79
2(89.60990291336401,43.83225298520169)0.71
3(89.62373851155712,43.83415198062051)2.13
4(89.6042680311066,43.82290470780433)2.38
5(89.60491842829208,43.82276405036669)2.25
6(89.60771369608014,43.82180446148496)1.87
7(89.61970346331935,43.82041586887333)1.62
8(89.61400475918953,43.81625195121295)1.57
9(89.61947175127257,43.81600461431319)1.68
10(89.61992756569268,43.81601290031301)1.21
...
Table 4. Agricultural machinery parameter settings.
Table 4. Agricultural machinery parameter settings.
ParameterDescriptionDefault Value
P k Agricultural machinery operation efficiency (ha/h)0.45
V k Agricultural machinery transfer speed (km/h)40
T m a x Maximum working time of agricultural machinery (h/d)12
C 1 Single dispatch cost of agricultural machinery (Yuan)300
C 2 Unit distance transfer cost (Yuan/km)2
Table 5. Parameter settings and HV values.
Table 5. Parameter settings and HV values.
No. P c P l α γ HV
10.30.20.10.70.5371
20.30.30.20.80.4793
30.30.40.30.90.4996
40.50.20.20.90.5878
50.50.30.30.70.6193
60.50.40.10.80.6501
70.70.20.30.80.4015
80.70.30.10.90.6892
90.70.40.20.70.5292
Table 6. Comparison of results of two allocation strategies on different datasets.
Table 6. Comparison of results of two allocation strategies on different datasets.
InstanceCost (Yuan)Total Satisfaction
Spatial DistanceTemporal DistanceImprovement Rate (%)Spatial DistanceTemporal DistanceImprovement Rate (%)
C_45_23382.573395.64−0.3942.5343.181.53
C_70_34924.864860.821.3265.8466.951.69
C_80_45585.635605.28−0.3572.6773.581.25
C_100_47228.457156.730.9986.3488.152.09
C_120_58925.788828.461.09102.48104.862.32
C_160_511,856.4211,725.831.10135.67139.282.66
D_45_24682.534392.686.1936.8538.945.67
D_70_37052.866482.378.0957.8360.765.07
D_80_48172.487685.295.9664.5867.254.13
D_100_410,223.29324.588.7980.2784.164.85
D_120_511,985.6910,826.859.6795.3898.853.64
D_160_515,685.4714,642.836.65126.84132.464.43
Average8308.8287910.614.0980.6183.203.28
Table 7. Comparison of results of LMOEA with other algorithms.
Table 7. Comparison of results of LMOEA with other algorithms.
InstanceLMOEAINSGA-IIHTSMOGANNITSA
HVIGDHVIGDHV IGDHVIGD
C_45_20.76720.02210.65140.07710.69620.05450.70420.0472
C_70_30.72240.04150.59230.09410.66520.06520.60620.0541
C_80_40.74320.03420.55210.11060.70130.05870.64310.0425
C_100_40.67920.10310.47120.21810.59230.11730.54210.1315
C_120_50.75130.08090.46520.25770.63240.15260.58310.1891
C_160_50.68250.11560.52510.19840.61620.14450.57210.1656
D_45_20.82240.01250.69630.05420.73320.03310.66210.0352
D_70_30.70120.05990.58320.10770.64510.08470.53720.1361
D_80_40.69830.03170.55120.12830.63310.06540.60420.0932
D_100_40.80140.05370.58420.18540.67410.13610.56510.1671
D_120_50.71120.09120.39410.19740.58620.11570.54420.1409
D_160_50.65530.13470.45240.23580.54710.16840.49620.1828
Table 8. Composition of variants of the LMOEA algorithm in ablation experiments.
Table 8. Composition of variants of the LMOEA algorithm in ablation experiments.
Algorithm VariantHybrid InitializationDynamic Crowding DistanceQ-Learning Based Local Search
LMOEA-V1×××
LMOEA-V2××
LMOEA-V3×
LMOEA
Table 9. Performance comparison of different LMOEA variants on each test instance.
Table 9. Performance comparison of different LMOEA variants on each test instance.
InstanceLMOEA-V1LMOEA-V2LMOEA-V3LMOEA
HV IGD HV IGDHV IGDHVIGD
C_45_20.54870.29450.65920.11710.77240.04340.75240.0534
C_70_30.49260.34720.62230.12630.65240.15360.74260.0823
C_80_40.52340.29860.69470.14130.73580.11840.81370.0412
C_100_40.43870.39250.65260.12840.64260.14620.71520.0784
C_120_50.47230.36210.62340.21840.68850.17890.73270.0823
C_160_50.39560.37640.55870.24630.62340.18230.69810.1123
D_45_20.55230.28240.74580.08410.72580.09890.80540.0387
D_70_30.46370.35640.59340.18870.65310.13540.71850.0514
D_80_40.48340.30830.66270.12170.74260.08670.72180.0672
D_100_40.43230.38210.63540.19540.69120.15460.76830.0845
D_120_50.40340.41120.60780.23840.67850.19890.70270.1023
D_160_50.36870.43850.59340.25630.65360.20500.73780.1223
Table 10. Friedman test results for different LMOEA variants.
Table 10. Friedman test results for different LMOEA variants.
Algorithm VariantHVIGD
Average Rankingp-ValueAverage Rankingp-Value
LMOEA-V14.006.35 × 10−745.49 × 10−7
LMOEA-V22.832.75
LMOEA-V32.002.17
LMOEA1.171.08
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Jia, L.; Zhang, Q.; Liu, S.; Kong, B.; Liu, Y. Multi-Station Agricultural Machinery Scheduling Based on Spatiotemporal Clustering and Learnable Multi-Objective Evolutionary Algorithm. AgriEngineering 2025, 7, 197. https://doi.org/10.3390/agriengineering7060197

AMA Style

Jia L, Zhang Q, Liu S, Kong B, Liu Y. Multi-Station Agricultural Machinery Scheduling Based on Spatiotemporal Clustering and Learnable Multi-Objective Evolutionary Algorithm. AgriEngineering. 2025; 7(6):197. https://doi.org/10.3390/agriengineering7060197

Chicago/Turabian Style

Jia, Liruizhi, Qinshuo Zhang, Shengquan Liu, Bo Kong, and Yuan Liu. 2025. "Multi-Station Agricultural Machinery Scheduling Based on Spatiotemporal Clustering and Learnable Multi-Objective Evolutionary Algorithm" AgriEngineering 7, no. 6: 197. https://doi.org/10.3390/agriengineering7060197

APA Style

Jia, L., Zhang, Q., Liu, S., Kong, B., & Liu, Y. (2025). Multi-Station Agricultural Machinery Scheduling Based on Spatiotemporal Clustering and Learnable Multi-Objective Evolutionary Algorithm. AgriEngineering, 7(6), 197. https://doi.org/10.3390/agriengineering7060197

Article Metrics

Back to TopTop