1. Introduction
Distributed manufacturing has become increasingly common in discrete manufacturing systems under the continuing transition toward green and low-carbon production. Multiple geographically dispersed factories jointly undertake production tasks, which helps alleviate capacity limitations, order fluctuations, and uneven resource allocation in a single factory, and also creates additional opportunities for energy optimization in manufacturing processes. Naderi and Ruiz [
1] provided an early systematic study of distributed scheduling and laid the foundation for task assignment and sequencing optimization in multi-factory production environments. Compared with a single-factory shop, a distributed flexible job shop must simultaneously determine job-to-factory assignment, machine selection, and operation sequencing. These decisions may generate scattered or long idle intervals on machine timelines because of imbalanced factory loads, heterogeneous machine choices, and operation waiting. If all such idle intervals are treated as standby periods, unnecessary non-processing energy consumption will be incurred. Therefore, studying distributed flexible job shop scheduling with machine on–off decisions is important for reducing non-processing energy while maintaining production efficiency, thereby improving the green operation level of multi-factory manufacturing systems.
The flexible job shop scheduling problem (FJSP) allows each operation to be processed on one of several candidate machines; more broadly, job-shop related scheduling problems are known to be NP-hard [
2]. Dauzère-Pérès et al. [
3] reviewed FJSP studies from the perspectives of problem characteristics, modeling methods, and solution algorithms. On this basis, the distributed flexible job shop scheduling problem (DFJSP) further introduces job-to-factory assignment, extending scheduling from internal optimization within a single shop to collaborative optimization across multiple factories. For DFJSP, Wei et al. [
4] incorporated supply-demand matching into scheduling under shared manufacturing; Meng et al. [
5] developed mixed-integer linear programming (MILP) and constraint programming (CP) formulations; Luo et al. [
6] studied scheduling with inter-factory transfers; Zhang et al. [
7] considered crane transportation constraints; and Tang et al. [
8] further discussed integrated sequencing flexibility. These studies have extended DFJSP from basic factory assignment, machine selection, and operation sequencing to more complex scheduling scenarios involving transfers, transportation equipment, and sequencing flexibility.
Energy-efficient scheduling is not limited to adding energy consumption as an optimization objective; it also requires identifying the specific energy-saving mechanisms that can be influenced by scheduling decisions. Mouzon et al. [
9], Fang et al. [
10], Liu et al. [
11], and Dai et al. [
12] studied this issue from the perspectives of equipment operation control, power consumption and carbon-footprint reduction, total energy minimization in job shops, and energy-efficient scheduling in flexible production systems, respectively. The reviews by Gahm et al. [
13] and Fernandes et al. [
14] further indicate that scheduling-level energy-saving strategies can generally be grouped into three categories. The first uses time-of-use electricity prices or demand response to place flexible operations in low-price or low-load periods, thereby reducing energy cost [
15,
16,
17]. The second uses variable machine speed or processing-speed selection to trade off processing time and instantaneous power [
18,
19]. The third controls machine states during non-processing periods by selecting standby, off, or other low-power states during idle intervals [
9,
11,
18,
20,
21,
22].
Studies on machine on–off decisions have mainly focused on the coordination between equipment idle management and production scheduling. Mouzon et al. [
9] showed that non-bottleneck machines still consume considerable energy in standby mode and reduced equipment operating energy through scheduling rules and on–off policies. Shrouf et al. [
20] jointly determined job start times, idle states, and machine on–off times in a single-machine environment with time-of-use electricity prices. Zhang et al. [
21] introduced machine on–off control into FJSP and established an energy-efficient scheduling model with total energy consumption and makespan as objectives. Gu et al. [
22] further considered transportation and machine on–off constraints in energy-efficient DFJSP. Overall, machine on–off decisions have been shown to be effective for reducing non-processing energy. Although efficient scheduling can reduce idle time, high overall utilization does not mean that every machine is continuously occupied. In distributed flexible job shops, job-to-factory assignment, machine eligibility, technological precedence, and load imbalance may still create medium or long idle intervals on non-bottleneck machines. Therefore, machine on–off decisions are used as a complementary energy-saving mechanism for unavoidable non-processing intervals, rather than as a substitute for workload balancing or utilization improvement. However, existing studies are still concentrated more on single-machine or single-factory job shops, or treat machine on–off control as a specific extended constraint. In contrast, energy-oriented DFJSP studies mainly focus on energy-objective modeling, real-time scheduling, and metaheuristic or learning-based algorithms [
23,
24,
25,
26,
27]. The coordinated treatment of on–off decisions with job-to-factory assignment, machine selection, and operation sequencing remains insufficient. In particular, algorithm design still makes limited use of structural information such as critical factories, bottleneck machines, critical operations, and machine idle intervals. Therefore, high-performance algorithms driven by domain knowledge remain worth further investigation for this class of complex scheduling problems.
Table 1 compares representative studies related to DFJSP and energy-efficient scheduling. The comparison shows that existing studies have mainly focused on distributed scheduling decisions, energy-oriented objectives, or machine-state control separately, while the joint consideration of DFJSP, machine on–off decisions, and processing/standby/on–off energy accounting remains limited.
This paper, therefore, studies a bi-objective distributed flexible job shop scheduling problem with machine on–off decisions (DFJSP-OO). The model minimizes the makespan and total energy consumption, and the solution method is built around a domain-knowledge-driven memetic algorithm. The main contributions are as follows.
(1) A bi-objective mathematical model is developed for distributed flexible job shop scheduling with machine on–off decisions, in which processing energy, idle energy, and on–off energy are modeled simultaneously.
(2) A domain-knowledge-driven memetic algorithm is proposed. It combines three-layer encoding, hybrid initialization, structured neighborhood search, and energy-saving reconstruction to jointly optimize job-to-factory assignment, machine selection, operation sequencing, and on–off-eligible idle intervals.
(3) The effectiveness and stability of the proposed model and algorithm are evaluated through Taguchi parameter tuning, exact MILP validation on a small-scale instance, component ablation experiments, statistical tests, and representative case analysis.
The remainder of this paper is organized as follows.
Section 2 describes the problem and formulates the mathematical model.
Section 3 presents the DKMA.
Section 4 reports the numerical experiments and result analysis.
Section 5 concludes the paper and discusses future research directions.
2. Problem Description and Mathematical Modeling
2.1. Problem Description
In DFJSP-OO, jobs are processed in homogeneous distributed factories, and each factory is equipped with machines indexed by . Each job consists of several ordered operations, and different jobs may contain different numbers of operations. To avoid ambiguity, the operation set of the job is denoted by , and the complete operation set is denoted by . The set is the union of all job-specific operation sets, and each globally unique operation belongs to exactly one job indexed by .
Each operation can be processed on one machine selected from its eligible machine-index set . If operation is assigned to the machine in factory , its processing time is denoted by . The energy parameters of a machine in the processing, idle standby, and on–off states are denoted by the processing power , idle power , and unit on–off energy , respectively. Here, is defined as the energy consumed by one complete on–off cycle. Since identical energy parameters are used for all machines in this paper, the minimum idle threshold that permits an on–off cycle is uniformly denoted by The solution must simultaneously determine job-to-factory assignment, operation sequencing, and machine selection, and then determine machine on–off states according to internal idle lengths. Under technological and resource constraints, the makespan and total energy consumption are minimized simultaneously.
The following assumptions are adopted. (1) All factories and machines are available at the start of scheduling, and machines may remain off before their first operation starts. (2) Once an operation starts, it cannot be interrupted. (3) Each job can be assigned to only one factory during the whole processing process, and all of its operations must be completed in that factory. (4) Operations of the same job must be processed sequentially according to the predefined technological route. (5) At any time, each machine can process at most one operation, and each operation can be processed by at most one machine. (6) When the machine idle length is shorter than , the machine remains in standby mode; when the idle length reaches or exceeds , one complete on–off cycle is triggered. (7) The initial power-on and final power-off are not included in the optimization objective. (8) The energy associated with machine shutdown, startup, warm-up, and setup-related preparation is incorporated into the on–off energy parameter . The processing times used in this study are treated as effective processing times; when startup, setup, warm-up, or switching time is required, such time is assumed to be included in the given processing time.
The present model focuses on scheduling-level energy components, including processing energy, idle standby energy, and machine on–off energy. Load-dependent power variation, sequence-dependent energy costs, switching-induced wear, and uncertainty in energy parameters are not explicitly modeled in the current deterministic formulation.
2.2. Mathematical Formulation
The symbols used in this paper are defined in
Table 2.
The optimization objectives and constraints are given as follows.
Constraint (6) assigns each job to exactly one factory. Constraint (7) assigns each operation to exactly one processing machine selected from its eligible machine-index set. Constraint (8) uses to ensure that each operation is processed only in the factory assigned to its job. Constraint (9) defines the completion time of each operation. Constraint (10) enforces the technological precedence relationship for operation pairs in . Constraints (11)–(15) determine the processing order of operations assigned to the same machine and prevent processing overlap. Constraints (16)–(19) identify the first operation, last operation, and adjacent processing chain on each machine. Constraints (20)–(23) calculate internal idle intervals only between directly adjacent operations on the same machine. Constraints (24)–(26) determine whether an internal idle interval triggers an on–off cycle according to the threshold . Constraint (27) defines the makespan.
3. DKMA Design for Solving DFJSP-OO
The memetic algorithm (MA) is a metaheuristic optimization method that combines population-based evolutionary search with individual local improvement, and it has been widely used for complex combinatorial optimization problems. Compared with traditional evolutionary algorithms that mainly rely on crossover and mutation operators, a memetic algorithm introduces local search during population evolution, which enhances the exploitation of high-quality solutions and improves search efficiency in complex solution spaces.
DFJSP-OO must simultaneously determine job-to-factory assignment, operation sequencing, machine selection, and machine on–off states. The problem, therefore, contains multiple decision layers and a large solution space. Since the makespan is mainly affected by critical factories, bottleneck machines, and critical paths, whereas total energy consumption depends not only on processing arrangements but also on machine idle intervals and their on–off states, generic evolutionary operations alone cannot sufficiently exploit problem-structure information. To improve the use of critical scheduling structures and energy-saving opportunities, this paper proposes a domain-knowledge-driven memetic algorithm. The algorithm represents a schedule through three-layer encoding, improves the initial solution quality through hybrid initialization, and maintains the search range through population evolution. On this basis, knowledge-driven neighborhood search and energy-saving reconstruction strategies are designed for critical factories, bottleneck machines, critical paths, and on–off-eligible idle intervals. These strategies make targeted adjustments to the key components affecting makespan and energy consumption, thereby improving bi-objective optimization performance. The algorithmic procedure is shown in Algorithm 1. The evaluation budget is scaled according to the instance size, since larger instances require more evaluations to avoid insufficient evolution, whereas smaller instances can usually reach a stable search state with fewer evaluations. Therefore,
is set, where
is the number of operations in the instance and
is the number of factories. If the external nondominated archive is not updated for
= 10 consecutive generations, the newly generated solutions in this period are considered to provide limited additional nondominated information, and continuing the search may lead to low-return evaluations. Therefore, this condition is used to reduce redundant evaluations and computational cost.
| Algorithm 1 DKMA for solving DFJSP-OO |
Input: Instance ; number of factories ; population size ; crossover rate ; mutation rate ; local-search rate ; ; stagnation limit = 10. Output: External nondominated archive A. |
1 Initialize eval ← 0 and stall ← 0 2 Generate mixed initial population P by earliest-completion, energy-oriented and random initialization 3 Decode and evaluate P, apply energy-saving reconstruction, set eval ← eval + |P|, and let P ← EnvironmentalSelection(P, ) and A ← UpdateArchive({}, P) 4 while eval < and stall < do 5 Assign a nondominated rank and crowding distance to each individual in , and initialize the offspring population . 6 while |Q| < and eval < do 7 select parents xa and xb from P by tournament selection 8 if rand < then apply POX to OSL and uniform crossover to FAL and MSL end if 9 mutate FAL, OSL and MSL with probability 10 decode and evaluate offspring, apply energy-saving reconstruction, set eval ← eval + |offspring|, and add them to Q 11 end while 12 select the first ceil(×) elite individuals from A 13 for each elite individual x do 14 extract the critical factory, bottleneck machine, latest critical operation, critical jobs and on–off-eligible idle intervals from its schedule 15 generate knowledge-driven neighbors by N1-N8, decode and evaluate the accepted candidate, and set eval ← eval + 1 16 add the accepted candidate to Q 17 if eval ≥ then break end if 18 end for 19 set R ← deduplicate(P ∪ Q) and P ← EnvironmentalSelection(R, ) 20 let Aold be the signature of A, then update A ← UpdateArchive(A, P) 21 if Signature(A) = Aold then stall ← stall + 1 else stall ← 0 end if 22 end while 23 return A |
3.1. Encoding Strategy
A three-layer encoding scheme is used to represent a scheduling solution, including the factory assignment layer (FAL), operation sequence layer (OSL), and machine selection layer (MSL), as shown in
Figure 1. FAL represents the factory assigned to each job. Its length equals the number of jobs, and each gene corresponds to one job with a value equal to the factory index. OSL represents the operation scheduling sequence. Its length equals the total number of operations, and repeated job indices are used to represent operations; the t-th occurrence of a job index corresponds to the t-th operation of that job, and the sequence position determines the scheduling priority during decoding. MSL represents machine selection. It is encoded according to the internal operation order of jobs, and each gene corresponds to the selected machine for one operation from its eligible machine set.
3.2. Decoding Method
During decoding, the factory assignment layer first determines the factory to which each job belongs, thereby forming factory-level scheduling sets. The operation sequence layer is then scanned to read job indices, and the corresponding operation order is recovered according to the technological route. The machine selection layer subsequently assigns a specific processing machine to each operation. Processing times are determined according to the earliest feasible time rule; the start time of an operation is the larger of the completion time of its predecessor and the earliest available time of the selected machine. Once an operation is scheduled, the earliest available time of the corresponding machine is updated until all operation intervals are obtained. On this basis, idle intervals between adjacent tasks on the same machine are recorded, and on–off states are determined according to the threshold. Internal idle intervals that reach or exceed trigger on–off cycles and contribute to on–off energy, whereas idle intervals below the threshold remain in standby mode and contribute to idle energy. The total energy consumption is calculated from processing energy, internal idle standby energy, and on–off cycle energy, and it is optimized together with the makespan.
A small-scale case is used below to illustrate the decoding process, machine on–off judgment, and energy calculation. The case contains two homogeneous factories, denoted as
and
, and each factory is equipped with the same three machines, denoted as
,
, and
. There are six jobs,
–
, and each job contains at most six operations. The processing times of all operations are reported in
Table 3, where ‘-’ indicates that the corresponding job cannot be processed on that machine. The energy parameters used in this illustrative example are
= 10 kW,
= 1.2 kW, and
= 5 kWh; therefore,
= 5/1.2 = 4.17 h.
According to the encoding scheme and related constraints, an individual is randomly generated and decoded, producing the Gantt chart shown in
Figure 2. The two objective values are
= 25 h and
= 1054.2 kWh, where
= 1030 kWh,
= 19.2 kWh, and
= 5 kWh. Taking operation
-
as an example, this operation is the first operation of job 4. According to
Table 3, it can be processed on
and
, with processing times of 1 h and 5 h, respectively. In the current schedule, it is assigned to
, and therefore, its processing duration is 5 h. In factory
, machine
has an 8-h idle interval in [7, 15] hours. Since the on–off threshold is
= 5/1.2 = 4.17 h, this 8-h idle interval reaches and exceeds the threshold, and therefore one on–off cycle is triggered. On machine
, the intervals [7, 8] hours, [12, 14] hours, [15, 16] hours, and [22, 23] hours are all short idle intervals whose lengths are below 4.17 h; therefore, the machine remains in standby mode and generates idle energy.
3.3. Population Initialization
Because DFJSP-OO has a large solution space, complex constraints, and strong interactions between the makespan and energy objectives, a multi-strategy hybrid initialization mechanism is adopted to balance initial solution quality, population diversity, and interpretable scheduling structure. The initial population consists of three types of individuals: random feasible initialization accounts for 40%, and two heuristic initializations based on scheduling rules, each accounts for 30%.
3.3.1. Earliest-Completion Initialization Based on Round-Robin Factory Assignment
FAL initialization adopts a round-robin assignment rule, in which jobs are assigned to factories sequentially according to job indices to reduce the possibility of factory load imbalance in the initial solutions. OSL and MSL are generated jointly using the earliest completion time criterion. Specifically, in each scheduling round, the set of schedulable operations satisfying technological precedence constraints is first constructed. For each candidate operation in this set, all eligible machines are traversed, and the earliest start time and earliest completion time on each machine are calculated. The operation with the smallest earliest completion time is then selected as the current scheduled operation, and its job index is written into OSL. At the same time, the machine that produces the earliest completion time is recorded in the corresponding MSL gene. This process is repeated until all operations are sequenced and assigned to machines.
3.3.2. Minimum-Energy Initialization Based on Round-Robin Factory Assignment
FAL initialization also adopts the round-robin assignment rule to improve the initial distribution of factory loads. OSL and MSL are generated jointly according to the incremental energy criterion. Specifically, in each scheduling round, the set of schedulable operations satisfying technological precedence constraints is first determined. For each candidate operation, all eligible machines are traversed, and the feasible start time and completion time, and additional energy consumption are calculated according to the current partial schedule. The minimum additional energy of a candidate operation over different machines is then used as its evaluation value. The operation with the smallest evaluation value is written into OSL, and the machine that produces this minimum additional energy is recorded in the corresponding MSL gene. This process is repeated until all operations are sequenced and assigned to machines. Under feasible scheduling constraints, this strategy gives priority to operation-machine combinations with smaller energy increments, which helps improve the energy performance of the initial solutions.
3.3.3. Random Feasible Population Initialization
To improve the diversity of the initial population, a random feasible initialization strategy is introduced. FAL randomly determines the factory assigned to each job. OSL randomly generates an operation sequence under technological precedence constraints. MSL randomly selects a processing machine for each operation from its eligible machine set.
3.4. Evolutionary Operators
DFJSP-OO is characterized by multiple decision layers, strong interactions among variables, and a large solution space. Therefore, hierarchical crossover and mutation mechanisms are used in the design of evolutionary operators to effectively recombine information from different decision layers while maintaining solution feasibility and enhancing global exploration in the objective space.
The factory assignment layer (FAL) and machine selection layer (MSL) both use the uniform crossover (UX) operator. This operator generates a 0–1 mask vector with the same length as the chromosome and exchanges genes at the corresponding positions between two parent individuals, thereby realizing gene-wise recombination for factory assignment or machine selection. Taking MSL as an example, each gene corresponds to the machine selection of a specific operation, and the candidate machine is always restricted to the eligible machine set of that operation. Therefore, feasibility is preserved after crossover without additional repair, as shown in
Figure 3.
The operation sequence layer (OSL) uses the precedence operation crossover (POX) operator. This operator first randomly divides all jobs into two sets, then inherits the operation sequence structures of the corresponding job sets from the two parents, and finally fills the remaining operations according to their order of occurrence in the other parent. In this way, the scheduling sequence is reconstructed while encoding feasibility is maintained, as shown in
Figure 4.
The mutation stage adopts a random perturbation mechanism. In the factory assignment layer, the factory assignments of selected jobs are randomly adjusted. In the operation sequence layer, the operation order is perturbed through random swap or insertion operations. In the machine selection layer, selected operations are reassigned to processing machines from their eligible machine sets.
3.5. Knowledge-Driven Neighborhood Search Operators
The factory assignment, operation sequence, and machine selection decisions in DFJSP-OO are strongly coupled. A single neighborhood structure or purely random perturbation cannot fully exploit the structural information of a solution. Therefore, eight knowledge-driven neighborhood operators are designed around flexible machine selection and critical scheduling structures, as follows.
N1: The operation with the latest completion time in the current schedule is used as the starting point for critical-path backtracking. The current critical path is obtained by tracing technological arcs and machine arcs. The terminal critical operation on this path is then selected, and one machine different from its currently assigned machine is randomly selected from its eligible machine set for replacement.
N2: The factory whose completion time equals the current makespan is defined as the critical factory, and the job with the latest completion time in this factory is selected. A job from another factory is then randomly selected and exchanged with it. After the job exchange, processing machines are randomly reselected from the eligible machine sets for the operations of the migrated jobs.
N3: The machine with the largest current workload is identified, and the operations processed by this machine are extracted. One operation is randomly selected from them, and one machine different from its current machine is randomly selected from its eligible machine set for replacement. If no alternative machine is available for this operation, another eligible operation is selected.
N4: A job is randomly selected from the critical factory and unidirectionally migrated to another factory to reduce the load of the critical factory.
N5: Two operations in the critical factory are randomly selected and their processing order is exchanged, while machine selections remain unchanged.
N6: Two operations in the critical factory are randomly selected, and the latter operation is inserted before the former one, while machine selections remain unchanged.
N7: On the current critical path, several consecutive critical operations processed on the same machine form a critical block. For a head block, an intermediate operation is randomly selected and inserted after the tail operation. For a middle block, two intermediate operations are randomly selected and inserted before the head operation and after the tail operation, respectively. For a tail block, one operation is randomly selected and inserted before the head operation.
N8: For a critical operation, machine replacement is performed within its eligible machine set. Priority is given to a machine that reduces total energy consumption without worsening the makespan. If no such machine exists, a machine that further shortens the makespan with the smallest increase in energy consumption is selected.
Different neighborhood operators differ in the decision layer, perturbation intensity, and optimization focus. If they are selected in a fixed or uniformly random manner, the search direction may lack specificity, which reduces overall solution efficiency. Therefore, the neighborhood operator is selected according to the structural features of the current schedule. Based on their scope of influence on the scheduling structure, the eight operators are classified into three groups: factory-level reassignment operators (N2 and N4), machine-level adjustment operators (N1, N3, and N8), and operation sequence fine-tuning operators (N5, N6, and N7).
During the search process, the operator category is dynamically determined according to the structural features of the current schedule. When the completion time of the critical factory is higher than that of other factories, factory-level reassignment operators are preferentially invoked to alleviate factory assignment imbalance. When a high-load bottleneck machine appears, machine-level adjustment operators are preferentially invoked for targeted optimization of critical operations or heavily loaded equipment. When the overall system load is relatively balanced, but the makespan remains difficult to reduce, operation sequence fine-tuning operators are preferentially used to refine the critical-path and critical-block structures. After the category is determined, a specific operator within the category is randomly selected to retain search diversity and suppress premature convergence. All neighborhood operators follow a unified acceptance criterion: a new solution is accepted only when it is feasible and is not worse than the current solution in objective values or can contribute a new nondominated solution to the archive.
3.6. Energy-Saving Strategy
In the energy-saving reconstruction stage, once a feasible schedule has been decoded, machine selections and the corresponding processing times remain unchanged. Under this premise, processing energy is fixed, and the energy-saving potential mainly comes from restructuring internal idle periods. Two energy-saving strategies are therefore designed. First, right-shift-based idle reconstruction integrates non-fillable internal idle intervals to form continuous intervals eligible for on–off operation. Second, forward insertion compression further compresses internal idle intervals while maintaining technological feasibility.
3.6.1. Right-Shift-Based Idle Reconstruction Strategy
Without changing the technological precedence relationship within the same job, machine selection, or the makespan, this strategy reconstructs idle structures by controlled right shifts in operations within machine sequences, thereby reducing ineffective idle energy and creating on–off-eligible intervals. The pseudocode is given in Algorithm 2.
| Algorithm 2 Energy-saving reconstruction strategy |
Input: Initial feasible schedule ; on–off threshold . Output: Energy-saving schedule . |
1 Set , . 2 for each factory do 3 for each machine do 4 Construct , sorted by . 5 if , then continue. 6 Let be the last operation in . 7 for each operation in , except , do 8 Calculate the maximum feasible right shift under technological precedence, machine non-overlap, and . 9 Generate a temporary schedule by shifting rightward by . 10 if is feasible, , and , then set . 11 end for 12 Reconstruct according to the updated start times. 13 for each consecutive triple in do 14 Compute and . 15 if , then continue. 16 Calculate the maximum feasible right shift under technological precedence, machine non-overlap, and . 17 if and , then 18 Set . 19 Generate a temporary schedule by shifting rightward by . 20 if is feasible, , and , then set . 21 end if 22 end for 23 end for 24 end for 25 Set . 26 return . |
This strategy adjusts machine idle intervals without changing job-to-factory assignment, machine selection, or technological precedence. First, operations are right-shifted within their feasible time windows to reduce ineffective internal idle time. Second, adjacent short idle intervals are reorganized when possible so that an internal idle interval can reach the on–off threshold. If the reconstructed schedule does not increase the makespan and reduces total energy consumption, the adjustment is accepted.
Figure 5 shows the Gantt chart obtained after applying the right-shift-based idle reconstruction strategy to the schedule in
Figure 2. After the adjustment, the makespan remains unchanged at
= 25 h, while the total energy consumption decreases from
= 1054.2 kWh to
= 1041 kWh, a reduction of 13.2 kWh. Specifically,
= 1030 and
= 5 remain unchanged, whereas
decreases from 19.2 kWh to 6 kWh; the energy reduction therefore mainly comes from decreased standby time during machine idleness. In detail,
in
delays
from [0, 4] hours to [4, 8] hours,
in
delays
to [13, 15] hours, and
in
compresses or eliminates several scattered idle intervals by right-shifting part of the operations. For
in
, the on–off idle interval changes from [7, 15] hours to [8, 15] hours, but the number of on–off cycles does not increase, and the on–off energy remains unchanged. These results show that the right-shift-based idle reconstruction strategy adjusts the idle distribution on machine timelines without increasing the makespan and reduces the standby time counted in energy consumption.
3.6.2. Forward Insertion Compression Strategy Based on Machine Timelines
This strategy traverses each machine processing timeline and determines whether a subsequent operation can be moved forward into a feasible idle interval before it, while keeping the technological precedence relationship within the same job and other feasibility constraints unchanged. If the idle interval length is no shorter than the operation processing time and the shifted start time is not earlier than the technological ready time of the operation, forward insertion is performed to compress machine idle time and improve the schedule, as shown in
Figure 6. In
Figure 6, an idle interval [3, 5] hours exists before the operation
, and the processing duration of the operation
is 1 h. When feasibility conditions are satisfied,
is moved forward to [3, 4] hours for processing, which partially fills the idle interval, shortens the factory makespan to 6 h, and reduces idle energy by 1 h.
3.7. Alternative Approaches for Comparison
To provide a broader methodological comparison, three representative multi-objective optimization approaches are considered as alternative approaches: the inverse model and adaptive neighborhood search-based cooperative optimizer (IMANS) [
24], the nondominated sorting genetic algorithm II (NSGA-II) [
28], and the multi-objective evolutionary algorithm based on decomposition (MOEA/D) [
29]. IMANS is a problem-related optimizer developed for energy-efficient distributed flexible job shop scheduling. NSGA-II is a classical dominance-based multi-objective evolutionary algorithm that uses nondominated sorting and crowding distance to balance convergence and diversity. MOEA/D is a decomposition-based multi-objective evolutionary algorithm that transforms a multi-objective problem into a set of scalar subproblems and optimizes them cooperatively. These three methods are selected because they represent problem-specific search, dominance-based optimization, and decomposition-based optimization, respectively.
From the perspective of computational cost, the compared algorithms use the same encoding scheme, decoding rule, crossover-mutation framework, population size, evaluation budget, and number of independent runs. Therefore, solution representation, offspring generation, schedule decoding, objective evaluation, and stochastic-performance evaluation are controlled under the same experimental setting. The main cost differences come from their algorithm-specific search and update mechanisms. NSGA-II mainly relies on nondominated sorting and crowding-distance-based selection; MOEA/D relies on decomposition-based neighborhood update; IMANS includes inverse model and adaptive neighborhood operations; and DKMA introduces knowledge-driven local search and energy-saving reconstruction. The additional cost of DKMA mainly comes from identifying critical scheduling structures and adjusting machine idle intervals. These operations increase the cost of individual refinement, but they support targeted makespan improvement and non-processing energy reduction.
4. Numerical Experiments
The numerical experiments use two benchmark data sets commonly adopted in flexible job shop scheduling, namely Mk01-Mk10 from the Brandimarte benchmark set [
30], and 01a-18a from the Dauzère-Pérès benchmark set [
31], with 28 instances in total. Each instance is further extended to scenarios with two, three, and four factories, forming the test instance set used to evaluate the proposed algorithm. To assess algorithm performance from different perspectives, two widely used multi-objective optimization metrics are selected: hypervolume (
) [
32] and inverted generational distance (
) [
33].
measures the coverage of the obtained Pareto solution set in the objective space and reflects both convergence and diversity.
measures the average distance between the obtained solution set and the reference Pareto front and mainly reflects convergence accuracy. In this paper, both
and
are calculated using normalized objective values to eliminate the influence of different dimensions and numerical scales. Since the true Pareto front is difficult to obtain, all solutions obtained by all algorithms over multiple independent runs are pooled, duplicate solutions are removed, nondominated sorting is performed, and the resulting set is used as the reference set for
calculation.
All algorithms are implemented in MATLAB R2016b and run on a computer with an Intel Core i7-6700 CPU @ 3.40 GHz, 16 GB RAM, and Windows 11 operating system. To ensure fair comparison and statistical reliability, each algorithm is independently run 10 times on each test instance, and the mean values of the performance indicators are used to analyze convergence, distribution, and overall solution-set quality. To verify the correctness of the mathematical model and obtain exact solutions for small-scale instances, the MILP model is implemented in Python 3.12.12 and solved by the Gurobi solver.
4.1. Parameter Setting
Algorithm parameters have a considerable influence on solution performance. DKMA contains four key parameters: population size
, crossover probability
, mutation probability
, and local-search proportion
. To reduce the influence of empirical parameter settings on the experimental results, the Taguchi method is used to analyze these four parameters. Each parameter is assigned four levels:
,
,
, and
. The detailed combinations are shown in
Table 4.
The parameter experiments select three representative instances, Mk03, Mk06, and Mk10, and set 2, 3, and 4 factories for each instance, covering medium-scale, high-flexibility, and large-scale complex scenarios. Under the four-factor, four-level setting, an L16(4^4) orthogonal array is used to construct the experimental design. Each parameter combination is independently run 10 times on the nine cases. The termination rule is , and early termination is allowed when the external nondominated archive is not updated for 10 consecutive generations.
Using average
as the main response indicator, the main effects of different parameter levels are compared according to
Table 5 and
Figure 7. The recommended parameter combination is therefore determined as
= 100,
= 0.95,
= 0.05, and
= 0.10. Since the Taguchi response values were obtained from representative scenarios rather than a single instance, the selected parameter combination reflects an overall response under different problem characteristics. This setting was kept unchanged in all subsequent experiments, and the resulting HV and IGD values suggest that the selected parameter setting is applicable within the tested problem scales.
4.2. MILP Model Validation
To verify the correctness of the MILP model, the small-scale instance in
Table 3 is selected and solved exactly using the Gurobi solver. The computation terminates when the optimality condition is satisfied and the optimality gap is 0. The solution process adopts the
-constraint method and consists of two stages.
In the first stage, the model is solved with the makespan
as the objective. The result shows that the optimal makespan of this instance is
= 24 h, meaning that the shortest time required to complete all jobs under the current instance and constraints is 24 h. In the second stage, the model is solved again with total energy consumption
as the objective under the constraint
= 24 h, thereby obtaining the energy-optimal schedule. The optimal result is
= 1038.4 kWh, where
= 1030 kWh,
= 8.4 kWh, and
= 0 kWh. The corresponding Gantt chart is shown in
Figure 8.
The system’s energy consumption can be checked from the scheduling result. The total processing time of all operations is 103 h, corresponding to = 1030 kWh. Short idle intervals between adjacent processing tasks mainly appear at the following positions: machine in factory has two 2-h idle intervals in [7, 9] hours and [15, 17] hours; machine in factory has a 1-h idle interval in [15, 16] hours, and machine has a 2-h idle interval in [16, 18] hours. The total duration of these idle intervals is 7 h, corresponding to = 8.4 kWh. None of the idle intervals reaches the machine on–off threshold = 4.17; therefore, no machine on–off cycle occurs during scheduling, and the corresponding on–off energy is 0 kWh. The resulting total energy consumption is = 1038.4 kWh, which is consistent with the MILP solution. This verifies that the proposed MILP model can correctly describe the key scheduling decisions and accurately calculate system energy consumption.
4.3. Ablation Analysis of DKMA Components
The main components of DKMA affect different stages of the search process. The three-layer encoding determines the basic representation of job-to-factory assignment, operation sequencing, and machine selection, and provides the common solution structure for DKMA and its variants. Hybrid initialization constructs the initial population by combining diversity-oriented random solutions and heuristic solutions with better initial quality. Knowledge-driven local search acts during the iterative search stage and uses critical factories, bottleneck machines, critical paths, and idle intervals to refine candidate schedules. Energy-saving reconstruction acts after decoding and adjusts machine idle structures to reduce non-processing energy. Therefore, the ablation analysis focuses on two removable enhancement components, namely knowledge-driven local search and energy-saving reconstruction, which can be evaluated without changing the basic encoding and decoding framework.
4.3.1. Effectiveness Analysis of the Knowledge-Driven Local Search Strategy
To examine the actual contribution of the knowledge-driven local search strategy to DKMA performance, an ablation experiment is conducted. Keeping population initialization, crossover and mutation, decoding rules, and external archive update unchanged, the knowledge-driven local search strategy is removed from DKMA to obtain the comparison algorithm DKMA-NKLS.
Table 6 reports the
and
results of DKMA and DKMA-NKLS on Mk01-Mk10 under different factory configurations. All values are averages over 10 independent runs.
Table 6 shows that DKMA outperforms DKMA-NKLS in most test scenarios. Among the 30 test scenarios, DKMA obtains higher
values in 24 scenarios and lower
values in 29 scenarios. By factory configuration, the average
values of DKMA under 2, 3, and 4 factories are 0.555, 0.568, and 0.630, respectively, which are higher than the corresponding values of DKMA-NKLS, 0.477, 0.518, and 0.572. The average
values are 0.120, 0.119, and 0.121, which are lower than those of DKMA-NKLS, 0.157, 0.163, and 0.161. These results indicate that the knowledge-driven local search improves solution-set coverage and Pareto-front approximation accuracy in most scenarios. A two-sided Wilcoxon signed-rank test is performed on the 30 groups of mean values listed in
Table 6. The
-values for HV and IGD are 1.81 × 10
−4 and 1.92 × 10
−6, respectively, supporting a statistically significant difference after introducing knowledge-driven local search.
4.3.2. Effectiveness Analysis of the Energy-Saving Strategy
To evaluate the influence of the energy-saving strategy on algorithm performance, another ablation experiment is conducted. With all other mechanisms unchanged, the right-shift reconstruction and forward insertion compression operations are removed from DKMA to construct the comparison algorithm DKMA-RE. Since this section focuses on differences in total energy consumption between the two algorithms, the relative percentage index (
) is used for measurement. A smaller
indicates that the energy result obtained by an algorithm is closer to the best value under the corresponding combination. The
values in
Table 7 are reported as percentages and are defined as
where
denotes the algorithm,
denotes the
-th independent run of that algorithm for a given case,
denotes the total energy consumption obtained in that run, and
denotes the minimum total energy consumption among the 20 candidate results produced by 10 independent runs of DKMA and 10 independent runs of DKMA-RE.
Table 7 reports the average
comparison between DKMA and DKMA-RE on Mk01-Mk10. DKMA obtains lower
values in 29 of the 30 test scenarios. By factory configuration, the average
values of DKMA under 2, 3, and 4 factories are 0.313%, 0.264%, and 0.321%, respectively, all lower than the corresponding DKMA-RE values of 0.899%, 0.772%, and 0.677%. The corresponding reductions are 65.23%, 65.87%, and 52.62%.
Figure 9 shows the
boxplots of the two algorithms over all Mk instances. The two-sided Wilcoxon signed-rank test gives
, indicating that DKMA has a statistically significant advantage over DKMA-RE in terms of
.
Table 7 and
Figure 9 jointly show that the energy-saving strategy reduces the average
and improves the stability of energy optimization over multiple independent runs.
4.4. Comparative Analysis with Other Algorithms
To evaluate the performance of DKMA, IMANS, NSGA-II, and MOEA/D are used as comparison algorithms. To ensure a fair comparison, all algorithms use the same encoding scheme, decoding process, population size, crossover probability, mutation probability, and other basic parameter settings. The remaining parameters of IMANS follow the original literature. Each instance is independently run 10 times.
Table 8 shows that DKMA obtains the highest
value in 76 of the 84 test cases. From the average results under different factory scales, the average
values of DKMA in the 2-, 3-, and 4-factory scenarios are 0.503, 0.528, and 0.522, respectively. These values are higher than those of MOEA/D (0.272, 0.266, and 0.281), NSGA-II (0.274, 0.288, and 0.295), and IMANS (0.270, 0.288, and 0.306). These results indicate that DKMA maintains higher objective-space coverage and overall solution-set quality as the number of factories changes.
Table 9 shows that DKMA obtains the lowest
value in 81 of the 84 test cases. Its average
values under the 2-, 3-, and 4-factory scenarios are 0.103, 0.101, and 0.109, respectively, all lower than those of the three comparison algorithms. This indicates that the solution sets obtained by DKMA are generally closer to the reference Pareto front.
Figure 10 and
Figure 11 show the boxplots of
and
for the four algorithms under different factory scales, respectively. In each subplot, the samples of a single algorithm consist of the results from 10 independent runs on 28 instances under the same factory scale, giving 280 observations in total and reflecting the run-level distribution of algorithm performance. To further test the statistical significance of performance differences, two-sided Wilcoxon signed-rank tests are conducted between DKMA and MOEA/D, NSGA-II, and IMANS using the mean values of the two indicators for each case listed in
Table 8 and
Table 9. The test results show that, for
, the
-values of DKMA versus MOEA/D, NSGA-II, and IMANS are 4.19 × 10
−15, 7.38 × 10
−15, and 7.51 × 10
−15, respectively. For
, the corresponding
-values are 2.45 × 10
−15, 2.36 × 10
−15, and 4.01 × 10
−15.
These results show statistically significant differences between DKMA and the three comparison algorithms on the overall test set, indicating that DKMA provides more stable solution-set coverage and Pareto-front approximation on the tested instances.
To further reveal the structural characteristics of nondominated solutions, the results on instance 08a under the 3-factory configuration are selected for case analysis. The Pareto fronts obtained by the four algorithms are shown in
Figure 12. For the solution marked by a star in the DKMA solution set in
Figure 12, the objective values are
= 1468.0 h and
= 165,386.6 kWh. The corresponding Gantt chart is shown in
Figure 13.
Figure 13 shows that the completion times of the three factories are 1468 h, 1459 h, and 1454 h, respectively. The critical factory is factory 1, and the maximum completion-time difference among factories is 14 h, indicating a relatively balanced multi-factory task allocation. In terms of energy composition,
= 164,850 kWh,
= 21.6 kWh, and
= 515 kWh. The 515 kWh corresponds to 103 on–off cycles involving all 24 machines used across the three factories. By factory, factories 1, 2, and 3 generate 37, 33, and 33 on–off-eligible idle intervals, respectively. In contrast, only seven short idle intervals fail to reach the on–off threshold, with a total duration of 18 h; therefore,
= 1.2 × 18 = 21.6 kWh. Long on–off-eligible intervals appear, for example, in [275, 1100] hours on
, [591, 1137] hours on
, and [212, 696] hours on
, with lengths of 825 h, 546 h, and 484 h, respectively. This representative solution provides a structural explanation of the energy-saving effect: it maintains a low makespan while assigning medium and long idle intervals on multiple machines to the on–off state, thereby reducing standby energy during non-processing periods.
The performance of DKMA is attributable to its use of DFJSP-OO-specific structural information. The algorithm identifies critical positions affecting the makespan and total energy consumption through critical factories, bottleneck machines, critical paths, and on–off-eligible idle intervals, and then uses this information in neighborhood search and energy-saving reconstruction. This mechanism improves solution refinement efficiency and energy optimization capability. The experimental results show that DKMA balances multi-factory load allocation, makespan control, and non-processing energy reduction, supporting the applicability of the proposed method to distributed flexible job shop scheduling with machine on–off energy.
5. Conclusions and Future Work
This paper studies a bi-objective distributed flexible job shop scheduling problem considering machine on–off decisions. A mathematical model is established to jointly account for processing energy, internal idle standby energy, and on–off cycle energy, and a domain-knowledge-driven memetic algorithm is proposed. By considering the relationships among job-to-factory assignment, machine selection, and operation sequencing, the algorithm combines hybrid initialization, knowledge-driven neighborhood search, and energy-saving reconstruction for optimization. Parameter tuning, small-scale MILP validation, ablation experiments, and comparative experiments show that DKMA obtains higher HV and lower IGD values in the tested scenarios.
Future research may extend the present deterministic DFJSP-OO model by incorporating heterogeneous factories, machine breakdowns, dynamic job arrivals, processing-time uncertainty, and maintenance effects. More detailed machine state-transition and energy characteristics, such as setup times, startup delays, sequence-dependent transition constraints, load-dependent power, switching-induced wear, and uncertain energy parameters, can also be considered when production data are available. In addition, acceleration strategies such as parallel decoding and adaptive local-search triggering may improve the applicability of DKMA to larger-scale instances.