Next Article in Journal
A Domain-Independent Classification Model for Sentiment Analysis Using Neural Models
Previous Article in Journal
Comparison of Disturbance Compensators for a Discrete-Time System with Parameter Uncertainty
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Improved SP-MCTS-Based Scheduling for Multi-Constraint Hybrid Flow Shop

1
School of Mechanical Engineering, Northwestern Polytechnical University, Xi’an 710072, China
2
Xi’an Electronic Engineering Research Institute, Xi’an 710100, China
3
Robotics and Mechatronics Research Laboratory, Department of Mechanical and Aerospace Engineering, Monash University, Clayton, VIC 3800, Australia
*
Author to whom correspondence should be addressed.
Appl. Sci. 2020, 10(18), 6220; https://doi.org/10.3390/app10186220
Submission received: 1 July 2020 / Revised: 3 August 2020 / Accepted: 3 August 2020 / Published: 8 September 2020
(This article belongs to the Section Applied Industrial Technologies)

Abstract

:
As a typical non-deterministic polynomial (NP)-hard combinatorial optimization problem, the hybrid flow shop scheduling problem (HFSSP) is known to be a very common layout in real-life manufacturing scenarios. Even though many metaheuristic approaches have been presented for the HFSSP with makespan criterion, there are limitations of the metaheuristic method in accuracy, efficiency, and adaptability. To address this challenge, an improved SP-MCTS (single-player Monte-Carlo tree search)-based scheduling is proposed for the hybrid flow shop to minimize the makespan considering the multi-constraint. Meanwhile, the Markov decision process (MDP) is applied to transform the HFSSP into the problem of shortest time branch path. The improvement of the algorithm includes the selection policy blending standard deviation, the single-branch expansion strategy and the 4-Rule policy simulation. Based on this improved algorithm, it could accurately locate high-potential branches, economize the resource of the computer and quickly optimize the solution. Then, the parameter combination is introduced to trade off the selection and simulation with the intention of balancing the exploitation and exploration in the search process. Finally, through the analysis of the calculated results, the validity of improved SP-MCTS (ISP-MCTS) for solving the benchmarks is proven, and the ISP-MCTS performs better than the other algorithms in solving large-scale problems.

1. Introduction

The flow shop scheduling problem (FSSP) has been a very active research field, since it was first proposed by Johnson [1]. In the FSSP, a set of n jobs have to be processed on m single-machine stages, where each job follows the same route of stages. In production, the FSSP may result in overloading some stages and blocking some jobs [2]. When parallel machines are introduced in some of the stage, the standard flow shop layout becomes a hybrid flow shop (HFS) for these questioners in many real-life manufacturing scenarios. Given its practical interest, the hybrid flow shop scheduling problem (HFSSP) is widely used in today’s manufacturing and production systems, such as heavy industry, light industry and other industrial systems. Taking into account the computation complexity, more researchers have conducted in-depth research in this field, since HFSSP has been proved as a non-deterministic polynomial (NP)-hard problem [3]. Simultaneously, production scheduling has been proved to play an important role for improving productivity and responsiveness in the manufacturing system [4]. Therefore, developing more efficient algorithms for this problem is significant.
Ribas and Ruizad [5] discussed HFSSP in depth, and the methods for solving the HFSSP were divided into the exact algorithm, heuristic method and meta-heuristic algorithm. The exact algorithm used to be the most common solution. In 1970s, the branch and bound algorithm (B&B) was proposed by Arthanary and Ramaswamy [6] to solve the two-stage HFSSP. Portmann and Vignier [7] combined the B&B algorithm with the genetic algorithm to solve HFSSP, and the performance of the algorithm was significantly improved. Neron et al. [8] applied energy reasoning and global operations to improve the efficiency of branch definition. Since 1998, many heuristics have been applied to solve HFSSP problems. Gupta and Tunc [9] proposed a heuristic algorithm for solving the minimum completion time of the two-stage HFSSP problem. Kahraman et al. [10] introduced an efficient parallel greedy heuristic algorithm for solving multi-task HFSSP problems. Lin and Liao [11] took a two-step label manufacturing company as an example to minimize the maximum delay of total weight. Heydari and Fakhrzad [12] proposed a heuristic algorithm for minimizing the total time of advance processing and delay processing. In addition, Paternina-Arboleda [13] developed a heuristic method for identifying and continuously solving bottleneck processes. In summary, the heuristic algorithm can quickly construct the scheduling solution, but the quality of the solution is difficult to guarantee.
In recent years, various meta-heuristics have also been developed for solving the HFSSP. Researchers have further studied the application of meta-heuristic algorithms for HFSSP. The taboo algorithm proposed by Nowicki and Smutnick [14] defines the specific neighborhood through critical paths for search optimization. Alaykyran et al. [15] proposed a modified ant colony optimization algorithm to solve HFSSP. Furthermore, Kahraman et al. [16] developed a genetic algorithm which is better than the B&B method. Liao et al. [17] proposed a particle optimization algorithm, which makes the hybrid discrete particle swarm optimization algorithm and the bottleneck heuristic algorithm to fully explore the bottleneck stage, and combine the simulated annealing algorithm to avoid the local optimal solution. Then, the superiority of the algorithm is illustrated by a comparison of examples. Recently, Marichelvama et al. [18] optimized the HFSSP using the algorithm of Nawaz, Enscore and Ham (NEH) heuristic method, and subsequently applied the CS (cuckoo search) algorithm to quickly search and improve on the basis of the optimized solution. Pan et al. [19] proposed an effective artificial bee colony optimization algorithm, and solved the optimal scheduling solution by hybrid representation method. Li et al. [20] used the variable neighborhood algorithm (HVNS) combined with the chemical reaction optimization algorithm and the distribution estimation algorithm to solve the HFSSP. In the meantime, the algorithm was studied and adjusted to locate the potential region, and search for the optimal solution. Cui and Gu [21] applied the vector notation to model HFSSP, and further proposed an improved discrete artificial bee colony (IDABC) algorithm. Meanwhile, the validity of the IDABC algorithm was verified by using the benchmark problems. Komaki et al. [22] proposed heuristic algorithms and two metaheuristic techniques based on an artificial immune system for a two-stage assembly hybrid flow shop scheduling problem. Zhang and Chen [23] developed a discrete differential evolution (DDE) algorithm with a modified crossover operator to solve larger sized problems. Other algorithms, such as Quantum-inspired immune algorithm (QIA) [24], Artificial immune system (AIS) [25], have also been applied to solve HFSSP. In addition, there are a lot of non-classical methods for solving this kind of large and complex problems. In order to solve the flow shop problem of the minimized makespan, Ramanan et al. [26] proposed a feed forward back propagation neural network to solve the problem. Tkachenko and Izonin [27,28] used neural-like structures based on the geometric transformations model as a universal approximator to implement principles of training and self-training, and thus provided the repeatability of training results and large and small training samples as a satisfactory solution. Gupta et al. [29] investigated the use of these training algorithms as competitive neural network learning tools to minimize makespan in a flow shop. In the studies mentioned above, although the meta-heuristic algorithms were widely used to solve HFSSP, it still has some disadvantages when encountering large problems, such as complex algorithm frameworks, a lack of real-state mapping and incomplete neighborhood structures [30].
In order to solve these disadvantages, some scholars adopted machine learning to study HFSSP for achieving an efficient scheduling. Monte-Carlo tree search (MCTS) algorithms heuristically build an asymmetric partial search tree by applying machine learning, and then search through the potential branches to solve the optimal strategy by continuously traversing [31]. Therefore, Chaslot et al. [32] first introduced MCTS for solving production-related problems. A MCTS-based algorithm for the multi-objective flexible job shop scheduling problem was presented by Wu et al. [33], and it was improved by incorporating the Variable Neighborhood Descent Algorithm and other techniques, like rapid action value, which can estimate the heuristic and transposition table. Chou et al. [34] used the improved MCTS algorithm to solve the multi-objective flexible shop scheduling problem, and search the minimum completion time by the adaptive value game comparison. Furuoka and Matsumoto [35] used an MCTS-based algorithm to find good schedules for a re-entrant scheduling problem. Although good efforts have been made in the above studies, and MCTS is a series of two-person zero-sum game decision-making methods, there are still limitations in the performance and learning efficiency when solving shop floor scheduling problems.
For minimizing the makespan, the HFSSP scheduling approach using improved SP-MCTS (single-player Monte-Carlo tree search) is proposed in this paper. Schadd et al. [36] proposed a new machine learning algorithm, which is named the SP-MCTS algorithm. Schadd et al. [37] applied the algorithm to solve the single-machine puzzle game, and the application algorithm achieved good results in the standardized test. Shimpei et al. [38] verified the effectiveness of the new simulation policy through data experiments, calibrated the relationship between the search method and parameters, and demonstrated the application potential of the SP-MCTS algorithm in actual scheduling.
Through the above review, the improved SP-MCTS (ISP-MCTS) algorithm was successfully applied in the scheduling problems. Therefore, this paper applies ISP-MCTS to solve HFSSP, and the organization of this paper is as follows. In Section 2, the flow shop scheduling problem is introduced, and the mathematical model is established. Then, the improvement scheme of SP-MCTS is introduced in detail in Section 3. In Section 4, the calibration of the algorithm parameters is performed. In Section 5, the experimental results and the comparison are presented. Finally, the conclusions are discussed in Section 6.

2. Hybrid Flow Shop Scheduling Problem

The HFSSP is described as follows: a set of n jobs J = {1, 2,…, n} must be processed by the s-stage, i.e., Oj1→Oj2,…, Ojs, S = {1, 2,…, s}. Each stage k ∈S, has Mk ≥ 1 identical parallel machines, and each job j can be processed in any one of the Mk parallel machines. At the same time, the processing time of the job j in the stage k (Ojk) with deterministic processing time Pjk. Under known the parameters and assumptions, the scheduling policy is solved to minimize the maximum completion time for processing n jobs.

2.1. Assumption

The HFSSP problem satisfies the following assumptions:
  • At the beginning, each job release time is 0.
  • Job j can only be processed once in each stage.
  • Each process of job j can only be processed in one machine, and each machine can only process one job at a time.
  • There are infinite buffers in the adjacent two stages. Job setup time and the travel time between consecutive stages are included in the processing time Pjs or are ignored.

2.2. Symbol Definition

The notation of the HFSSP model is shown as follows:
  • Pjk is the processing time of job j at stage k.
  • Djk is the start processing time of job j at stage k.
  • BM denotes a very large number, the traditional “Big M”.
  • xjik if job j is assigned to machine i at stage k, then xjik = 1, otherwise xjik = 0.
  • yjj’k if job j is processed earlier than j’ at stage k, then yjj’k = 1, otherwise yjj’k = 0.

2.3. Mathematical Model

Using the above notation, the mathematical relationship of HFSSP is formulated as follows:
min ( C max ) = max j J ( D j s + p j s )
i = 1 m k x j i k = 1 , j J , k S
D 1 j 0 , j J
D k + 1 , j D k j p k j , j J , k , k + 1 S
y j j k + y j j k 1 , j , j J , k S
D k j ( D k j + p k j ) + B M ( 3 y j j k x k j i x k j i ) 0 , j , j J , k S , i { 1 , 2 , , M k }
x j i k { 0 , 1 } , j J , k S , i { 1 , , M k }
y j j k { 0 , 1 } , j , j J , k S
The objective function (1) is to minimize the maximum completion time. Constraints (2) ensures that each job completes all the processing stages and can only be assigned to one machine in each stage. Equation (3) ensures that the first stage start time of each job is greater than or equal to 0. Equation (4) indicates that when the job is processed in two consecutive stages, the next process can be started after the previous process is completed. Equations (5) and (6) describe the sequential constraints of different jobs in the same machine. When two jobs are processed, the previous job must be finished before the following job can be started. Equations (7) and (8) define the value ranges of decision variables.

3. ISP-MCTS Algorithm Design

The MCTS is based on the Monte Carlo simulation to build an asymmetric search tree, and subsequently find the “best” action in the current state [39]. The SP-MCTS algorithm derived from MCTS is proposed to solve HFSSP, and HFSSP is seen as a single-player optimization problem. In the solving process, the processing machines are regarded as starred positions (drop points), and the rules are formulated according to the constraint relationship in the mathematical model. With the goal of maximizing the score, the entire search process includes four phases of selection, expansion, simulation and backpropagation, as shown in Figure 1.

3.1. HFSSP Model

According to Markov decision processes (MDP), the model of HFSSP is established [40,41]: (1) s is a series of state matrix of the HFSSP, describing the relationship between the job state and stage. s1 and sgoal represent the initial state and completion state in the HFSSP. s1, s2,…, sgoal are used to describe the job production process through various stages. (2) a is the set of scheduling policies in each state s, the scheduling policy sets a1, a2,…, av are extracted under each state s1, s2,…, sgoal. A is the current scheduling policy selected from set a. Therefore, A1, A2,…, Av are the policies corresponding to each state of s1 to sgoal, selected from the policy sets of a1, a2,…, av. (3) T(s, A, s’) = 1, the scheduling policy A is selected, and the probability that the state is converted from s to s’ is 1.
According to the design methodology of the above MDP model, the relationship between the HFSSP field environment and the model state is established. The matrix model Sn×m satisfies the following conditions: (1) n represents the number of jobs, and m represents the number of stages. (2) The first column and the m th column of the matrix are on-line stage and offline stage respectively, the second to the m-1 th columns represent the processing stages. (3) The number of jobs being processed must be less than or equal to the number of parallel machines in the stage. (4) The job in the same row is processed according to the sequence of stages, so the job in the next column cannot be started until the job in the current column is completed. (5) For any job, there is Pi1 = 0, Pim = ∞. In Figure 1, S is a 4 × 4 state matrix, which is the problem of four jobs and four-stage HFSSP. In this matrix, stage 1 and stage 4 are the online stage and offline stage, respectively, and stage 2 and stage 3 have two parallel processing machines. When the state is searched to the k th state matrix, the element sk(i,j) = 0 means the job i is not in the machine of the stage j, and sk(i,j) = 1 represents that the job i is processing in the stage j. sk(i,j) = 2 represents that the job i has finished processing in the j th stage and waits for scheduling. akx represents the scheduling policy set (Ak1, Ak2, … Aku) in the sk state. If the element Ak1(i,j) = 1, it represents that the job i is dispatched from the machine in the stage j to the processing machine in the stage j + 1.
As shown in Figure 1, the HFSSP coding model was designed according to the above description. The specific model evolution process is described as follows:
Step 1:
Construct the n × m matrix of the initial state s1 and the completion state sgoal. In Figure 1, M is the number of parallel machines in each stage, M1 is the number of idle machines in each stage during the process, and the matrix evolves from s1 to sgoal.
Step 2:
According to the state of the job and the occupancy of the machine, the available scheduling policy set has ak = (A, A,…, A) during searching for the k-floor shop state node sk. Meanwhile, the search method is selected on the basis of the traversal times N(sk) of the state node sk. Details are shown as follows: ① when N(sk) ≤ P (P is the critical value of the simulation times), heuristic rules are applied to select the scheduling policy for searching to the sgoal state; ② as shown in Figure 1, when N(k) > P, the selection policy of the SP-MCTS algorithm is used to evaluate the policy set ak and select the scheduling policy Ak. After executing strategy Ak, it will search from state sk to sk+1.
Step 3:
If sk+1sgoal, step 2 is repeated for state evolution until the sgoal.

3.2. ISP-MCTS Algorithm Optimization Process

In this paper, the HFSSP scheduling process is constructed as a tree structure. For the SP-MCTS algorithm, there are disadvantages such as poor branch positioning accuracy and slow convergence speed. Therefore, the SP-MCTS algorithm is improved as follows:

3.2.1. Selection

The relationship between balanced depth exploration and breadth exploration is constantly explored in the existing state tree nodes, so that each exploration can reach a better solution as far as possible. The UCT (upper confidence bounds applied to trees) algorithm [33] is used to select the scheduling policy, and the improvements are as follows:
Q ( s , a ) = Q ( s , a ) + c 2 ln N ( s ) / N ( s , a ) + ( q ( s , a ) 2 N ( s , a ) × Q ( s , a ) 2 + D ) / N ( s , a )
π U C T ( s ) = arg max a Q ( s , a )
The first two terms of the UCT algorithm in Equation (9) are reserved. N(s) represents the total times of that the state s is accessed, and N(s, a) represents the times during which the policy a is selected in the state s. Q(s, a) represents the average score of the policy a that is executed multiple times in state s. Considering a third item here, indicating the deviation obtained after the policy a is executed, where ∑q(s, a)2 represents the sum of the squared results which achieves at the state s, constant D is used to ensure that policies with few choices are not underestimated [32]. The purpose of the deviation term without the particularity of the neighborhood is to fully consider the change in range of the score when the policy is selected, and select the policy by using Equation (10) in the end. As shown in Figure 1, taking node s1 as an example, when N(s1) > P, every policy in the set (A11, A12,…, A16) is evaluated according to Equation (9), and the policy is selected on the basis of Equation (10).

3.2.2. Expansion

When N(s) > P, the branch node is expanded. As shown in Figure 1, if s1 node N(s1) > P, the scheduling policy A12 is selected, and the node s1 is updated to s2. If s2 is not a node in the tree, s2 is added to the tree as a node.
Single-branch node continuous expansion: considering the existence of optional policy uniqueness and state self-updating in HFSSP operation, the single-branch node continuous expansion method is designed. After s2 is expanded as shown in Figure 1, the state needs to be self-updated while the job finishes processing, so the state evolves to s3. It can be seen that in s3, the scheduling policy only has A31. At this time, the node s3 is continuously expanded, and then the state evolves to the s4, and the number of policies (A41, A42) is more than 1. Therefore, node s4 is expanded into the tree, and heuristic rules are applied to search the scheduling policy from nodes s4 to sgoal. As shown in Figure 1, the states s2s4 are continuously expanded to avoid multiple meaningless explorations while effectively utilizing computing resources.

3.2.3. Simulation

Heuristic rules are used to simulate sgoal from the leaf node s. When the tree searches from the A12 branch to the s4 leaf node, as shown in Figure 1, s4 is used as the simulation initial state. Then, the heuristic rule is applied to search for sgoal. Therefore, the root node s1 is used as the starting leaf node of the program. Each simulation policy is obtained by different heuristic rule.
According to the literature [19] heuristic rules comparison results, four heuristic rules (4-Rule) are selected as the simulation policy. These four heuristic rules are: (1) NEHLPT(λ) forward heuristic rule for λ = n × 50%; (2) NEHSPTB(λ) forward heuristic rule for λ = n × 35%; (3) shortest processing time (SPT); and (4) longest processing time (LPT). Before the simulation policy is applied, with probability ε a random move is played, ε = 0.005.

3.2.4. Backpropagation

When the simulation is completed, the backpropagation is performed according to the simulation result. Starting from leaf node s4, as shown in Figure 1, each child node goes back to its parent node until root node s1 is reached. The information is updated as follows:
N ( s ) N ( s ) + 1
N ( s , a ) N ( s , a ) + 1
Q ( s , a ) Q ( s , a ) + ( z Q ( s , a ) ) / N ( s , a )
q ( s , a ) 2 q ( s , a ) 2 + z 2
Equation (11) represents the total number of times the node s was accessed. Equation (12) records the times that the policy a is executed under the node s. Equation (13) updates the average score of the execution policy a at node s. Equation (14) is used to calculate the sum of the squared score.

3.2.5. Evaluation and Policy Set

The application of ISP-MCT to solve the HFSSP is different from the game-type MCTS problem. In the ISP-MCTS scheduling search, the first iteration time T1 is used as the benchmark, and the quality of each subsequent iteration is Tn − T1 as the evaluation score. When Tn − T1 ≤ 0, the score is T1 − Tn+1, and if Tn − T1 > 0, the score is 0. Therefore, the optimization purpose is to select the highest scoring policy set.

3.3. The Complete Process of ISP-MCTS

Through the selection policy and the simulation policy, the root node searches from s1 to sgoal. The algorithm is based on the node average score design selection algorithm and takes the maximum score (Tmin) for the optimization purpose. Therefore, the pseudo code of the ISP-MCTS for solving HFSSP is described as Algorithm 1.
Algorithm 1 The main procedure of ISP-MCTS
1:Initialize: C, D, T1, s1, P, N = 0, sgoal, s = s1.
2:  While the halting criterion is not satisfied do
3:Step 1: Selection and expansion
4:if N(s) ≤ P then
5:   go to step 2
6:  else The improved UCT in Section 3.2.1 is applied to select the scheduling policy a
7:     Update state ss
8:    if the state s’ is a node in the tree then
9:      s = s1, return Step 1
10:    else s’ is expanded to a leaf node in the tree
11:    end if
12:    if The new expanded leaf node s has one optional policy or no one
13:      N(s) = P + 1, return Step 1
14:    else go to step 2
15:    end if
16:end if
17:Step 2: Simulation
18: The node s’ is taken as the simulation start state.
19: Selecting the simulation policy
20: Simulation to sgoal, the completion time Tn is obtained, go to Step 3
21:Step 3: Update simulation policy
22:if Tn < Tmin then
23:   The policy set (A, A,…, A) from s1sgoal is recorded, Tmin = Tn
24:   go to step 4
25:else go to step 4
26:end if
27:Step 4: Backpropagation
28:The information from the leaf node s to the root node s1 branch path node is updated according to Equation (11)→(14)
29:
30:if The halting criterion is not satisfied
31:   Output the optimal policy set, Tmin
32:else s = s1, return Step 1
33:end if
34:end while

4. Calibration of the Proposed Algorithms

4.1. Simulation Policy Verificationᾱ

In order to verify the simulation policy mentioned above, a set of instances are generated: n∈{20,40,60,80,100}, s∈{4,6,8}. Ten instances are generated for each combination of n and s, and a total of 150 instances are obtained. The processing time is generated from the uniform distribution [1], and the number of parallel machines Mk are generated from the uniform distribution [1,5]. The three simulation policies were constructed in comparison with 4-Rule in Section 3.2.3. The comparison policies are: 4-Rule without continuous expansion (N-Expan), NEH-LPT(λ) and random policy (Random). Simulation threshold: P = 4, taken the combination of expansion and exploration parameters as C = 0.5, D = 10,000.
These proposed algorithms were coded in MATLAB 2016a, and run on Intel Core i5-3470 3.2 GHz PC with 4 GB memory. The ISP-MCTS limited their run time to 30 n × s ms and ran two fixed replications. The minimum completion time Cmin is the optimal result of all simulation policies under the current instance. The deviation between the final solution of each simulation policy and Cmin is as shown in the following Equation (15):
RPI(Ci) = (Ci − Cmin)/Cmin × 100%,
In Equation (15), Ci is the solution value of the current simulation policy. The average relative percentage increase (ARPI) of the 10 sets of problems under each n and s combination is shown in Table 1. It can be seen that the average deviation of 4-Rule in the four-simulation policy is the smallest, with the average deviation of 0.07%. Meanwhile, the average deviation of the solution of the 4-Rule simulation policy is better than the simulation policy of the comparison.
Simulation policies were analyzed using a 40 × 6 case instance. Figure 2a shows the relationship between the number of iterations and the node depth. The number of callbacks and average depth of callbacks for each simulation policy are shown in Figure 2b. From Figure 2, NEH-LPT(λ) has the greater heuristic depth than the other three simulation policies throughout the iteration. In the iterative process, the number of NEH-LPT(λ) callbacks are the least, and the average depth of NEH-LPT(λ) callbacks are less than the two algorithms of 4-Rule and N-Expan. Therefore, it can be concluded that the NEH-LPT(λ) can easily fall into local minimum. The evaluation positioning is not accurate, and the N-Expan has a high callback frequency in the heuristic process. Therefore, the search depth of the N-Expan is less than 4-Rule. The Random has the least depth of search. From Figure 2b, since 4-Rule has the largest average callback depth, the 4-Rule can be used to accurately locate the branch area within a limited number of callbacks.
A single-factor analysis of variance (ANOVA) was carried out, where the type of simulation policy in Table 2 is considered to be a factor. The 4-Rule and the other three simulation policies are analyzed by multi-comparison method using the least significant difference (LSD) procedure. It indicates that the 4-Rule is significantly better than the other three simulation policies of N-Expan, NEH-LPT(λ) and Random. Therefore, the heuristic advantage of the 4-Rule is verified, and the 4-Rule is selected as the simulation policy.

4.2. Parameter Setting

The Benchmark problems [42] and Liao problems [17] were introduced to verify the performance of the ISP-MCT algorithm. The scale of the problems ranged from 10 jobs of 5 stages to 30 jobs of 10 stages. In this section, the parameter combination (C,D) is optimized in three segments according to the search space magnitude. The order of magnitude is 0–105, 105–106 and 106–5 × 106. The parameter combination (0.1, 32) focuses on the development of potential nodes during the search process. Parameter combination (1, 20,000) in the search process considers the exploration ability of unknown regions, and the combination of parameters (0.5, 10,000) in the search process takes into account the expansion and exploration capabilities. As shown in Table 3, j15c5d2 is selected for optimization analysis at each stage, and each parameter combination is operated independently for 20 times to obtain an average score. The depth is the distance between the deepest node and the root node, and the average depth is the average of 20 independent running depths.
As shown in Table 3, in the order of magnitude of 105–106, 106–5 × 106 within the specified time limit, the ISP-MCTS obtains the minimum average completion time and standard deviation, when the parameter combination is (0.1, 32). When applying the parameter combination (0.1, 32), the ISP-MCTS finds the minimum average completion time. The average depth of the search tree at the parameter combination (0.1, 32) is 57, then most of the nodes generated in the scheduling of the 15 jobs of 5 stages have been extended to the tree by the ISP-MCTS. Compared with the other two sets of parameters (0.5, 10,000) and (1, 20,000), the parameter combination (0.1, 32) expands the node deeper.
In the order of magnitude 105~106, 106~5 × 106 within the specified time limit, when the parameter combination is (0.5, 10,000), the ISP-MCTS finds that the average minimum completion time is optimal, and the algorithm stability is better than other combinations.
With the increasing magnitude and search time, the exploratory parameter combination advantage becomes more and more obvious. Therefore, at the order of magnitude of 106–5 × 106, the exploration parameter (1, 20,000) was significantly better than the parameter combination (0.1, 32).
It can be observed from the above that the order of magnitude is 0–105 within the specified time limit. When the parameter combination is (0.1, 32), the ISP-MCTS algorithm has the highest solution quality and the best stability. Therefore, when the order of magnitude is 105~106 or 106~5 × 106, the algorithm selects the parameter combination (0.5, 10,000).

5. Calculation Results and Analysis

According to the machine layout, the Benchmark problems is divided into two groups: (1) 47 a and b problems; and (2) 30 c and d problems. The ISP-MCTS algorithm was programmed in MATLAB 2016a and run on a central processing unit (CPU)-i5-3470(3.20GHZ) with 4.0 GB Main Memory. Six algorithms HVNS [20], IDABC [21], particle swarm optimization (PSO) [17], AIS [25], genetic algorithm (GA) [16] and B&B [8] are selected as comparison algorithms. The calculation results of the above comparison algorithm are obtained from the source literature. The HVNS algorithm sets the maximum running time to 100 s and takes the minimum completion time of 20 independent results under the same conditions. The IDABC algorithm also limits their runtime to 1600 s and runs 20 times independently. The maximum running time of the algorithms B&B, AIS, GA and PSO in the literature is limited to 1600 s. The minimum Cmax is obtained as the final solution by independently operating 20 times under the same conditions, and thus the CPU time corresponding to Cmax is recorded for each problem. The performance of each algorithm is compared by “%Deviation”, and the deviation between the lower bound LB and each algorithm Cmax is obtained by using Equation (16), as shown in Table 4 and Table 5. The statistical analysis results are shown in Table 6:
% deviation = ( C max L B L B ) × 100 %

5.1. Comparison of Carlier and Neron’s Benchmarks

From Table 4 and Table 5, only the B&B algorithm does not obtain the optimal solution when solving the problem of j15c10a5 in the 47 a and b problems. The PSO and B&B algorithms, respectively, solve the optimal solutions of 18 problems in 30 c and d problems, and the average deviation of the solutions is 3.95% and 9.32%, respectively. As shown in Table 6, the IDABC, HVNS and GA algorithms solve the optimal solutions for the 17 problems in the 30 c and d problems, and the average deviation of the solutions are 4.00%, 4.00% and 4.17%, respectively. The AIS algorithm solves the optimal solution of 16 problems in 30 c and d problems, and the average deviation of the solution is 4.26%. The ISP-MCTS algorithm proposed in this paper solves the optimal solution of 18 problems in 30 c and d problems, and the average deviation of the solution is 3.4%. Therefore, the solution quality of the ISP-MCTS algorithm for solving c and d problems is better than the other six algorithms. It can be seen from Table 4 and Table 5 that the average CPU time of the ISP-MCTS algorithm is much smaller than that of the the PSO and B&B algorithms. From the above comparison, the ISP-MCTS algorithm has a better solution and efficient optimization ability than the other six algorithms when solving Carlier and Neron problems.

5.2. Comparison of Liao’s Benchmarks

From the discussion in Section 5.1, the average completion time of the three algorithms of IDABC, HVNS and PSO for solving Carlier and Neron’s benchmarks is smaller than that of GA, AIS, and B&B algorithms. Therefore, the algorithms IDABC, HVNS and PSO are selected as the comparison algorithms to solve the Liao problems in this section. In the source literature, the IDABC algorithm was coded in Visual C++ and run on an Intel Pentium 3.06 GHz PC with 2 GB RAM under the Windows 7 operating system, and each problem was tested 20 times and the termination criterion for all the tests was the computation time 100 s. The running environment of the HVNS algorithm in the literature [20] was Intel Core i5 3.3 GHz PC with 4 GB of memory. The algorithm PSO was used to solve the Liao problems, and the algorithm limited its run time to 200 s and each problem runs 20 times independently. The ISP-MCTS algorithm is used to solve the Liao problems in the computer running environment proposed in this paper, and the execution time for each problem is limited to 200 s.
As shown in Table 7, the average completion time of the ISP-MCTS solution is smaller than that of the IDABC, HVNS and PSO, and the ISP-MCTS further updates the best solution of the three problems in Liao problems. The nonparametric test is proposed, and the null hypothesis H0 assumes that μISP-MCTS ≥ μIDABC, and the alternative hypothesis H1: μISP-MCTS < μIDABC with 0.05 significance level. Similarly, the hypotheses H0’ISP-MCTS ≥ μHVNS, H1’ISP-MCTS < μHVNS. H0’’ISP-MCTS ≥ μPSO, H1’’ISP-MCTS < μPSO. As shown in Table 8, the ISP-MCTS algorithm performs significantly better than the PSO and AIS algorithms, and has certain advantages in terms of solution quality and stability compared with the HVNS algorithm.
The six effective algorithms for solving HFSSP in recent years were selected for comparison with ISP-MCTS. The algorithms were: HVNS [20], IDABC [21], PSO [17], GAR [2], ISA [43] and artificial bee colony with permutation (ABCP) [21]. In the computer operating environment of this study, each algorithm solved 150 problems in Section 4.1. For the comparison algorithms, HVNS, DABC, and PSO solved the same problems as reported in the literature, whereas GAR, ISA, and ABCP were used to solve complex HFSSP, and the ability to solve these algorithms was verified. The above algorithms directly select the optimal parameter combination of the source literature. All the algorithm runtimes were limited to the maximum CPU (t = ρ mn s) time, and ρ is: 10, 20, 30. All the above algorithms are programmed in MATLAB 2016a, and each of the 150 examples is run independently for 10 times, and the average of 10 running results is taken as the solution of each example. When ρ = 30, the best solution Cmin found by any algorithm is used to calculate the relative percentage increase (RPI). The results are shown in Table 9, Table 10 and Table 11.
As shown in Table 9, Table 10 and Table 11, the ISP-MCTS solution quality is better than that of the other six algorithms. When ρ = 10, the average deviation of ISP-MCTS in the seven algorithms is the smallest, and the value is 0.115. The average deviation of the ISP-MCTS algorithm is less than IDABC (0.121), HVNS (0.263), PSO (0.495), GAR (0.743), ISA (0.509), and ABCP (0.591). Therefore, the solution quality of the ISP-MCTS algorithm is better than the other six algorithms. When ρ = 20 and ρ = 30, the average deviation of all algorithms is improved through comparing to ρ = 10. At the same time, the ISP-MCTS algorithm still has the smallest deviation among all algorithms, so the validity of the ISP-MCTS algorithm can be verified.
Here, a multi-factor analysis of variance was used to further compare the seven algorithms, and the algorithm type and ρ were taken as factors. From the mean graph and the two-factor Tukey HSD test (95% confidence interval) as shown in Figure 3, the ISP-MCTS algorithm is significantly better than the HVNS, PSO, GAR, ISA, and ABCP algorithms at different CPU times. According to the above average variance value, the superiority of the ISP-MCTS algorithm in solving the HFSSP problem was verified.

6. Conclusions

  • This paper analyzed the HFSSP operation mechanism, and proposed a tree search algorithm based on SP-MCTS to solve HFSSP. In order to solve the problem of HFSSP by SP-MCTS algorithm, the three steps of selection, expansion and simulation were improved respectively. The improvement of the algorithm included the selection blending standard deviation, the single-branch expansion strategy and the 4-Rule policy simulation. The results show that these improvements can quickly and accurately locate high-potential branches, and obtain the optimal solution.
  • The 4-Rule simulation policy was selected based on a comparative analysis of the problems, and the 4-Rule can give an accurate evaluation of the leaf nodes during the search process, thus improving the search efficiency of the algorithm. According to the magnitude of the search, the combination of the parameters C and D was determined to ensure the convergence speed of the algorithm.
  • The experimental results of the standard problems show that the ISP-MCTS algorithm is effective, and the optimal solutions of 47 class a and class b problems can be obtained at 100%. The ISP-MCTS algorithm solved the optimal solution of 18 problems in the c and d problems, and the average deviation was 3.4%, which was lower than the other five comparison algorithms. This proves that the ISP-MCTS algorithm has certain advantages in solving the HFSSP problems. According to the statistical analysis results of each algorithm to solve the Liao problems, the ISP-MCTS performance is obviously better than the HVNS and PSO algorithms, and it has a certain superiority in solution quality and stability compared with the IDABC algorithm.
  • Through the comparison of large-scale random problems, it shows that the ISP-MCTS algorithm is superior to the comparison algorithms in terms of CPU time, solution quality and stability.

Author Contributions

J.G. introduced the concepts of this paper and collected the corresponding references. P.Z. and J.G. conducted the experimental work, took part in the data acquisition. T.Y., Z.C. and Y.S. participated in the research design, data acquisition, data analysis and manuscript preparation. Finally, J.G. analyzed and interpreted the data, B.S. and P.Z. reviewed this paper and made constructive comments. All authors read and agreed to the published version of the manuscript.

Funding

This paper was supported by National Science and Technology Major Project of China (No.2018ZX04005001) and the Major Projects of Aero Engines and Gas Turbines (grant No. 2017-VII-0002-0095).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Johnson, S.M. Optimal two- and three-stage production schedules with setup times included. Nav. Res. Logist. Q. 1954, 1, 61–68. [Google Scholar] [CrossRef]
  2. Ruiz, R.; Maroto, C. A genetic algorithm for hybrid flowshops with sequence dependent setup times and machine eligibility. Eur. J. Oper. Res. 2006, 169, 781–800. [Google Scholar] [CrossRef]
  3. Gupta, J.N.D. Two-Stage, Hybrid Flowshop Scheduling Problem. J. Oper. Res. Soc. 1988, 39, 359–364. [Google Scholar] [CrossRef]
  4. Wang, J.; Zhang, Y.; Liu, Y.; Wu, N. Multi-agent and Bargaining-game-based Real-time Scheduling for Internet of Things-enabled Flexible Job Shop. IEEE IoT J. 2018, 6, 1. [Google Scholar]
  5. Ribas, I.; Leisten, R.; Framinan, J.M. Review and classification of hybrid flow shop scheduling problems from a production system and a solutions procedure perspective. Comput. Oper. Res. 2010, 37, 1439–1454. [Google Scholar] [CrossRef]
  6. Arthanary, T.S.; Ramaswamy, K.G. An extension of two machine sequencing problems. Oper. Res. 1971, 8, 10–22. [Google Scholar]
  7. Portmann, M.-C.; Vignier, A.; Dardilhac, D.; Dezalay, D. Branch and bound crossed with GA to solve hybrid flowshops. Eur. J. Oper. Res. 1998, 107, 389–400. [Google Scholar] [CrossRef]
  8. Néron, E.; Baptiste, P.; Gupta, J.N. Solving hybrid flow shop problem using energetic reasoning and global operations. Omega 2001, 29, 501–511. [Google Scholar] [CrossRef]
  9. Gupta, J.N.D.; Tunc, E.A. Minimizing tardy jobs in a two-stage hybrid flow shop. Int. J. Prod. Res. 1998, 36, 2397–2417. [Google Scholar] [CrossRef]
  10. Kahraman, C.; Engin, O.; Kaya, I.; Öztürk, R.E. Multiprocessor task scheduling in multistage hybrid flow-shops: A parallel greedy algorithm approach. Appl. Soft Comput. 2010, 10, 1293–1300. [Google Scholar] [CrossRef]
  11. Lin, H.-T.; Liao, C.-J. A case study in a two-stage hybrid flow shop with setup time and dedicated machines. Int. J. Prod. Econ. 2003, 86, 133–143. [Google Scholar] [CrossRef]
  12. Fakhrzad, M.; Heydari, M. A Heuristic Algorithm for Hybrid Flow-Shop Production Scheduling to Minimize the Sum of the Earliness and Tardiness Costs. J. Chin. Inst. Ind. Eng. 2008, 25, 105–115. [Google Scholar] [CrossRef]
  13. Paternina-Arboleda, C.D.; Montoya-Torres, J.R.; Acero-Dominguez, M.J.; Herrera-Hernandez, M.C. Scheduling jobs on a k-stage flexible flow-shop. Ann. Oper. Res. 2007, 164, 29–40. [Google Scholar] [CrossRef]
  14. Nowicki, E.; Smutnicki, C. The flow shop with parallel machines: A tabu search approach. Eur. J. Oper. Res. 1998, 106, 226–253. [Google Scholar] [CrossRef]
  15. Alaykýran, K.; Engin, O.; Döyen, A. Using ant colony optimization to solve hybrid flow shop scheduling problems. Int. J. Adv. Manuf. Technol. 2007, 35, 541–550. [Google Scholar] [CrossRef]
  16. Kahraman, C.; Engin, O.; Kaya, I.; Yilmaz, M.K. An application of effective genetic algorithms for solving hybrid flow shop scheduling problems. Int. J. Intell. Syst. 2008, 1, 134–147. [Google Scholar] [CrossRef]
  17. Liao, C.J.; Tjandradjaj, E.; Chung, T.P. An approach using particle swarm optimization and bottleneck heuristic to solve flowshop scheduling problem. Appl. Soft Comput. 2012, 12, 1755–1764. [Google Scholar] [CrossRef]
  18. Marichelvam, M.K.; Prabaharan, T.; Yang, X.-S. Improved cuckoo search algorithm for hybrid flow shop scheduling problems to minimize makespan. Appl. Soft Comput. 2014, 19, 93–101. [Google Scholar] [CrossRef]
  19. Pan, Q.-K.; Wang, L.; Li, J.-Q.; Duan, J.-H. A novel discrete artificial bee colony algorithm for the hybrid flowshop scheduling problem with makespan minimisation. Omega 2014, 45, 42–56. [Google Scholar] [CrossRef]
  20. Li, J.-Q.; Pan, Q.-K.; Wang, F.-T. A hybrid variable neighborhood search for solving the hybrid flow shop scheduling problem. Appl. Soft Comput. 2014, 24, 63–77. [Google Scholar] [CrossRef]
  21. Cui, Z.; Gu, X. An improved discrete artificial bee colony algorithm to minimize the makespan on hybrid flow shop problems. Neurocomputing 2015, 148, 248–259. [Google Scholar] [CrossRef]
  22. Komaki, G.; Teymourian, E.; Kayvanfar, V. Minimising makespan in the two-stage assembly hybrid flow shop scheduling problem using artificial immune systems. Int. J. Prod. Res. 2015, 54, 1–21. [Google Scholar] [CrossRef]
  23. Zhang, X.Y.; Chen, L. A re-entrant hybrid flow shop scheduling problem with machine eligibility constraints. Int. J. Prod. Res. 2017, 56, 1–13. [Google Scholar] [CrossRef]
  24. Niu, Q.; Zhou, T.; Ma, S. A quantum-inspired immune algorithm for hybrid flowshop with makespan criterion. J. Univ. Comput. Sci. 2009, 15, 765–785. [Google Scholar]
  25. Engin, O.; Döyen, A. A new approach to solve hybrid flow shop scheduling problems by artificial immune system. Futur. Gener. Comput. Syst. 2004, 20, 1083–1095. [Google Scholar] [CrossRef]
  26. Ramanan, T.R.; Sridharan, R.; Shashikant, K.S.; Haq, A.N. An artificial neural network based heuristic for flow shop scheduling problems. J. Intell. Manuf. 2009, 22, 279–288. [Google Scholar] [CrossRef]
  27. Izonin, I.; Tkachenko, R.; Kryvinska, N.; Tkachenko, P.; Greguš ml., M. Multiple Linear Regression Based on Coefficients Identification Using Non-iterative SGTM Neural-like Structure. In Advances in Computational Intelligence; Springer: Cham, Switzerland, 2019. [Google Scholar]
  28. Tkachenko, R.; Izonin, I. Model and Principles for the Implementation of Neural-Like Structures Based on Geometric Data Transformations; Springer Science and Business Media LLC: Heidelberg, Germany, 2018; pp. 578–587. [Google Scholar]
  29. Gupta, J.N.D.; Majumder, A.; Laha, D. Flowshop scheduling with artificial neural networks. J. Oper. Res. Soc. 2019, 1–19. [Google Scholar] [CrossRef]
  30. Zhao, F.; Qin, S.; Zhang, Y.; Ma, W.; Zhang, C.; Song, H. A hybrid biogeography-based optimization with variable neighborhood search mechanism for no-wait flow shop scheduling problem. Expert Syst. Appl. 2019, 126, 321–339. [Google Scholar] [CrossRef]
  31. Powley, E.J.; Cowling, P.I.; Whitehouse, D. Information capture and reuse strategies in Monte Carlo Tree Search, with applications to games of hidden information. Artif. Intell. 2014, 217, 92–116. [Google Scholar] [CrossRef]
  32. Chaslot, G.; De, J.S.; Saito, J.T. Monte-Carlo tree search in production management problems. In Proceedings of the 18th Belgium-Netherlands Conference on Artificial Intelligence, Namur, Belgium, 5–6 October 2006; Volume 2, pp. 91–98. [Google Scholar]
  33. Wu, T.-Y.; Wu, I.-C.; Liang, C.-C. Multi-objective Flexible Job Shop Scheduling Problem Based on Monte-Carlo Tree Search. In Proceedings of the 2013 Conference on Technologies and Applications of Artificial Intelligence, Taipei, Taiwan, 6–8 December 2013; pp. 73–78. [Google Scholar] [CrossRef]
  34. Chou, J.-J.; Liang, C.-C.; Wu, H.-C.; Wu, I.-C.; Wu, T.-Y.; Jen-Jai, C.; Chao-Chin, L.; Hung-Chun, W.; I-Chen, W.; Tung-Ying, W. A new MCTS-based algorithm for multi-objective flexible job shop scheduling problem. In Proceedings of the 2015 Conference on Technologies and Applications of Artificial Intelligence (TAAI), Tainan, Taiwan, 20–22 November 2015; Institute of Electrical and Electronics Engineers (IEEE): Piscataway, NJ, USA, 2015; pp. 136–141. [Google Scholar]
  35. Furuoka, R.; Matsumoto, S. Worker’s knowledge evaluation with single-player Monte Carlo tree search for a practical reentrant scheduling problem. Artif. Life Robot. 2016, 22, 130–138. [Google Scholar] [CrossRef]
  36. Schadd, M.P.D.; Winands, M.H.M.; Herik, J.V.D.; Chaslot, G.; Uiterwijk, J.W.H.M. Single-Player Monte-Carlo Tree Search. Computers and Games; Springer: Berlin/Heidelberg, Germany, 2008. [Google Scholar]
  37. Schadd, M.P.; Winands, M.H.M.; Tak, M.J.; Uiterwijk, J.W.H.M. Single-player Monte-Carlo tree search for SameGame. Knowl. Based Syst. 2012, 34, 3–11. [Google Scholar] [CrossRef]
  38. Shimpei, M.; Noriaki, H.; Kyohei, I. Evaluation of Simulation Strategy on Single-Player Monte-Carlo Tree Search and its Discussion for a Practical Scheduling Problem. Lect. Notes Eng. Comput. Sci. 2010, 1, 2182. [Google Scholar]
  39. Sylvain, G.; David, S. Monte- Carlo tree search and rapid action value estimation in computer Go. Artific. Intell. 2011, 175, 1856–1875. [Google Scholar]
  40. Hazeghi, K.; Puterman, M.L. Markov Decision Processes: Discrete Stochastic Dynamic Programming. J. Am. Stat. Assoc. 1995, 90, 392. [Google Scholar] [CrossRef]
  41. Boutilier, C. Knowledge Representation for Stochastic Decision Processes: Artificial Intelligence Today; Springer Science and Business Media LLC: Heidelberg, Germany, 1999; Volume 1600, pp. 111–152. [Google Scholar]
  42. Carlier, J.; Néron, E. An Exact Method for Solving the Multi-Processor Flow-Shop. RAIRO Oper. Res. 2000, 34, 1–25. [Google Scholar] [CrossRef] [Green Version]
  43. Naderi, B.; Zandieh, M.; Balagh, A.K.G.; Roshanaei, V. An improved simulated annealing for hybrid flowshops with sequence-dependent setup and transportation times to minimize total completion time and total tardiness. Expert Syst. Appl. 2009, 36, 9625–9633. [Google Scholar] [CrossRef]
Figure 1. Schematic of the hybrid flow shop scheduling problem (HFSSP) state evolution.
Figure 1. Schematic of the hybrid flow shop scheduling problem (HFSSP) state evolution.
Applsci 10 06220 g001
Figure 2. Schematic of the HFSSP state evolution: (a) the relationship between the number of iterations and the node depth; (b) the number of callbacks and average depth of callbacks for each simulation policy.
Figure 2. Schematic of the HFSSP state evolution: (a) the relationship between the number of iterations and the node depth; (b) the number of callbacks and average depth of callbacks for each simulation policy.
Applsci 10 06220 g002
Figure 3. The means plot and 95% Tukey HSD confidence intervals for the interaction between the algorithms and the allowed CPU time.
Figure 3. The means plot and 95% Tukey HSD confidence intervals for the interaction between the algorithms and the allowed CPU time.
Applsci 10 06220 g003
Table 1. Comparison results for the RPI values of the four simulation policies.
Table 1. Comparison results for the RPI values of the four simulation policies.
Problem4-RuleN-ExpanNEH-LPT(λ)Random
20 × 40.010.024.9212.23
20 × 60.030.044.3713.11
20 × 80.150.295.7614.45
40 × 40.080.124.1110.17
40 × 60.140.374.7713.67
40 × 80.110.285.6715.42
60 × 40.040.164.5212.83
60 × 60.070.524.8714.17
60 × 80.130.315.1217.85
80 × 40.020.472.3111.77
80 × 60.030.177.5713.46
80 × 80.060.246.8615.52
100 × 40.030.233.1812.83
100 × 60.080.293.5717.51
100 × 80.070.616.8521.84
Ave.0.070.2754.96314.455
Notes: RPI: Relative percentage increase. NEH-LPT: the algorithm of Nawaz, Enscore and Ham - longest processing time.
Table 2. The hypothesis tests of 4-Rule with other heuristic schemes.
Table 2. The hypothesis tests of 4-Rule with other heuristic schemes.
N-Expan and 4-RuleNEH-LPT(λ) and 4-RuleRandom and 4-Rule
p-Value0.000.000.00
Significant? (p < 0.05)YesYesYes
Table 3. Results of improved single-player Monte-Carlo tree search (ISP-MCTS) for the different settings.
Table 3. Results of improved single-player Monte-Carlo tree search (ISP-MCTS) for the different settings.
(C,D)(0.1, 32)(0.5, 10,000)(1, 20,000)
0–105(j15c5d2) (~10 s)
Average completion time84.8385.1488.27
Standard deviation0.531.272.50
Average depth571911
105–106(j30c5e4) (~100 s)
Average completion time570.9567.3581.1
Standard deviation3.512.774.73
Average depth2077314
106–5 × 106(j30c5e1) (~200 s)
Average completion time482.7465.3480.5
Standard deviation3.562.723.21
Average depth2899827
Table 4. Comparison results for the type a and type b problems.
Table 4. Comparison results for the type a and type b problems.
ProblemLBISP-MCTSIDABCHVNSPSOGAAISB&B
CmaxCPUCmaxCPUCmaxCPUCmaxCPUCmaxCPUCmaxCPUCmaxCPU
j10c5a288880.003880.002880.004880.002880.0008818813
j10c5a31171170.0031170.0031170.0041170.0021170.00011711177
j10c5a41211210.0021210.0031210.0021210.0031210.01512111216
j10c5a51221220.0041220.0051220.0031220.0131220.000122112211
j10c5a61101100.0221100.0091100.0241100.1741100.01511041106
j10c5b11301300.0031300.0031300.0031300.0031300.000130113013
j10c5b21071070.0031070.0041070.0031070.0031070.00010711076
j10c5b31091090.0031090.0031090.0031090.0121090.00010911099
j10c5b41221220.0031220.0031220.0041220.0251220.00012221226
j10c5b51531530.0031530.0031530.0031530.0011530.00015311536
j10c5b61151150.0031150.0041150.0021150.0011150.000115111511
j10c10a11391390.0311390.0091390.2271390.0551390.015139113941
j10c10a21581580.3371580.0091580.3541580.871580.1251581815821
j10c10a31481480.0361480.0081480.2251480.0171480.047148114858
j10c10a41491490.0101490.0071490.0071490.0851490.141149214921
j10c10a51481480.0061480.0061480.0061480.1021480.000148114836
j10c10a61461460.2821460.0081460.2641460.2391460.156146414620
j10c10b11631630.0061630.0071630.0061630.0131630.000163116336
j10c10b21571570.1141570.0091570.1041570.2211570.131157115766
j10c10b31691690.0071690.0071690.0061690.0141690.000169116919
j10c10b41591590.0121590.0051590.0681590.0211590.015159115920
j10c10b51651650.0071650.0071650.0051650.0371650.016165116533
j10c10b61651650.0081650.0061650.0051650.0561650.016165116534
j15c5a11781780.0071780.0161780.0061780.0601780.031178117818
j15c5a21651650.0051650.0181650.0051650.0051650.015165116535
j15c5a31301300.0081300.0171300.0051300.0061300.015130113034
j15c5a41561560.0101560.0191560.0051560.0131560.015156215621
j15c5a51641640.0051640.0161640.0031640.0041640.046164116434
j15c5a61781780.0071780.0161780.0051780.0061780.032178117838
j15c5b11701700.0071700.0121700.0051700.0031700.015170117016
j15c5b21521520.0051520.0151520.0031520.0051520.015152115225
j15c5b31571570.0061570.0141570.0061570.0301570.015157115715
j15c5b41471470.0101470.0171470.0071470.0001470.015147114737
j15c5b51661660.1001660.0201660.0881660.0861660.016166216620
j15c5b61751750.0071750.0181750.0041750.0161750.015175117523
j15c10a12362360.0112360.0332360.0082360.0182360.015236123640
j15c10a22002000.1202000.0482000.3392000.2142000.01520030200154
j15c10a31981980.1301980.0321980.3131980.1711980.063198419845
j15c10a42252250.0702250.0342250.2122250.0722250.0312251222578
j15c10a51821820.0111820.0361820.0081820.5091820.0161822183c
j15c10a62002000.0522000.0332000.0242000.4682000.031200220044
j15c10b12222220.0262220.0482220.0092220.0172220.031222322270
j15c10b21871870.0091870.0441870.0081870.0121870.047187118780
j15c10b32222220.0092220.0392220.0082220.0072220.015222122280
j15c10b42212210.0092210.0372210.0082210.0072210.016221122184
j15c10b52002000.1092000.0482000.0452000.1352000.094200120084
j15c10b62192190.0102190.0372190.0092190.0062190.031219121967
AVE0.0350.0170.0520.0820.0282.57435.674
Notes: CPU: The CPU time of the algorithm in seconds. ISP-MCTS: Improved single-player Monte-Carlo tree search. IDABC: Improved discrete artificial bee colony algorithm. HVNS: Hybrid variable neighborhood search. PSO: Particle swarm optimization. GA: Genetic algorithm. AIS: Artificial immune system. B&B: Branch-and-bound algorithm. GAR: Genetic algorithm (Ruiz, R). ISA: Improved simulated annealing. ABCP: Artificial bee colony algorithm with permutation.
Table 5. Comparison results for type c and d problems.
Table 5. Comparison results for type c and d problems.
ProblemLBISP-MCTSIDABCHVNSPSOGAAISB&B
CmaxCPUCmaxCPUCmaxCPUCmaxCPUCmaxCPUCmaxCPUCmaxCPU
j10c5c168680.275680.017680.792680.332680.03168326828
j10c5c274740.633740.055740.752740.535740.0167447419
j10c5c371710.75272a720.141(a)7136.997710.01672a71240
j10c5c466660.161660.005660.631660.215660.031663661017
j10c5c578780.382780.004780.129780.122780.09478147842
j10c5c669690.527690.004690.226690.405690.0006912694865
j10c5d166660.268660.005660.148660.185660.046665666490
j10c5d273730.271730.069730.142731.158730.1107331732617
j10c5d364640.277640.007640.157640.098640.015641564481
j10c5d470700.895700.003700.675700.337700.00070570393
j10c5d566661.576660.009661.382660.515660.031661446661627
j10c5d662620.276620.009620.135620.383620.062628626861
j10c10c11131142.822(a)115a1150.571(a)115a115a115a127c
j10c10c21161170.414(a)119a1170.448(a)117a117a119a1161100
j10c10c3981082.706(a)116a1160.698(a)116a116a116a133c
j10c10c41031122.315(a)120a1200.461(a)120a120a120a135c
j10c10c51211252.831(a)125a1252.125(a)125a125a126a145c
j10c10c6971052.663(a)106a1060.343(a)106a106a106a112c
j15c5c185851.179850.127851.128854.205850.03185774852131
j15c5c290900.7859027900.49190119891a91a90184
j15c5c387870.511870.048870.502872.398870.109871687202
j15c5c489890.653890.038890.569892.208890.0008931790c
j15c5c573742.197(a)74a741.968(a)74a75a74a84c
j15c5c691910.370910.027910.180910.191910.04791199157
j15c5d11671670.0051670.0201670.00416701670.015167116724
j15c5d282841.133(a)84a840.915(a)84a84a84a85c
j15c5d377822.171(a)82a821.356(a)82a83a83a96c
j15c5d461840.672(a)84a841.325(a)84a84a84a101c
j15c5d567791.559(a)79a790.585(a)79a80a80a97c
j15c5d679811.683(a)81a811.123(a)81a82a82a87c
AVE0.5441.6150.47373.430.038168.88455.23
Table 6. Performance summary of the different algorithms.
Table 6. Performance summary of the different algorithms.
Algorithma and b Problemsc and d Problems
SolvedDeviationSolvedDeviation
ISP-MCTS470183.40%
IDABC470174.00%
HVNS470174.00%
PSO470183.95%
GA470174.17%
AIS470164.26%
B&B460.12%189.32%
Table 7. Comparison results for the ten harder problems.
Table 7. Comparison results for the ten harder problems.
ProblemISP-MCTSIDABCHVNSPSO
Ave.Min.Std.CPUAve.Min.Std.CPUAve.Min.Std.CPUAve.Min.Std.CPU
j30c5e1463.24620.8114.52465.24631.8057465.414641.3429.16474.704711.4296.16
j30c5e2616.0616010.32616.061602616.226160.3211.69616.256160.4455.28
j30c5e3594.75930.9720.37596.45931.7049598.515953.3227.33610.256024.7064.56
j30c5e4565.35631.2318.53566.25651.6039568.325661.4437.00577.105751.5286.98
j30c5e5600.96000.7717.23602.06001.6058603.756011.2621.32606.806051.1179.84
j30c5e6602.86001.4417.06603.16011.8055607.016033.3233.39612.506053.4967.99
j30c5e7626.46260.3118.23626.0626019627.346260.8327.37630.606290.7587.18
j30c5e8674.46740.6922.69674.76740.9055676.406742.2331.40684.206782.5097.67
j30c5e9643.16420.7621.37643.76421.0067645.286431.6633.81654.656511.8783.80
j30c5e10574.25731.0923.81576.35731.5076581.335773.9729.72599.755945.2877.46
Ave.596.1594.90.8318.41596.9595.31.1947.7598.96596.51.9728.22606.68602.62.3179.69
Table 8. Comparison results for the ten harder problems.
Table 8. Comparison results for the ten harder problems.
ISP-MCTS and IDABCISP-MCTS and HVNSISP-MCTS and PSO
p-Value0.120.0180
Significant? (p < 0.05)NOYesYes
Table 9. Computational results of the algorithms (ρ = 10).
Table 9. Computational results of the algorithms (ρ = 10).
ProblemISP-MCTSIDABCHVNSPSOGARISAABCp
20 × 40.030.030.310.711.010.680.38
20 × 60.120.110.240.771.270.780.69
20 × 80.330.340.611.031.211.180.82
40 × 40.030.030.220.310.520.340.49
40 × 60.040.030.150.220.710.290.77
40 × 80.170.180.660.981.391.121.13
60 × 40.110.130.110.330.510.350.48
60 × 60.160.160.230.410.610.470.55
60 × 80.210.250.420.671.100.711.04
80 × 40.060.060.110.200.350.210.33
80 × 60.200.210.240.460.650.520.60
80 × 80.120.120.330.550.610.320.52
100 × 40.000.010.030.050.180.050.11
100 × 60.050.050.100.230.410.270.38
100 × 80.090.110.180.510.620.350.57
Ave.0.1150.1210.2630.4950.7430.5090.591
Table 10. Computational results of the algorithms (ρ = 20).
Table 10. Computational results of the algorithms (ρ = 20).
ProblemISP-MCTSIDABCHVNSPSOGARISAABCp
20 × 40.020.010.270.620.880.570.29
20 × 60.040.060.160.681.030.660.57
20 × 80.210.230.470.890.961.010.44
40 × 40.020.030.150.240.410.290.36
40 × 60.010.020.110.150.560.170.51
40 × 80.100.130.530.731.151.010.99
60 × 40.050.070.090.210.470.310.42
60 × 60.060.100.170.330.480.360.51
60 × 80.120.130.310.420.920.620.93
80 × 40.020.020.080.110.290.180.31
80 × 60.090.130.170.230.650.460.48
80 × 80.050.090.250.330.470.280.42
100 × 40.000.000.020.030.130.030.09
100 × 60.020.020.070.110.280.160.31
100 × 80.060.060.210.390.460.310.48
Ave.0.0580.0730.2040.3650.6090.4280.474
Table 11. Computational results of the algorithms (ρ = 30).
Table 11. Computational results of the algorithms (ρ = 30).
ProblemISP-MCTSIDABCHVNSPSOGARISAABCp
20 × 40.000.000.150.430.760.490.22
20 × 60.030.030.120.510.970.580.52
20 × 80.170.210.290.770.820.910.41
40 × 40.000.010.110.120.330.210.27
40 × 60.010.010.090.110.410.140.42
40 × 80.060.090.350.620.950.750.86
60 × 40.020.020.060.170.360.260.36
60 × 60.040.050.130.310.410.270.44
60 × 80.050.060.180.370.750.450.74
80 × 40.000.000.070.090.240.110.26
80 × 60.030.050.140.180.550.320.37
80 × 80.050.050.160.270.430.190.34
100 × 40.000.000.020.030.090.010.08
100 × 60.010.010.060.090.260.110.28
100 × 80.040.040.190.320.350.180.41
Ave.0.0340.0420.1410.2930.5120.3320.399

Share and Cite

MDPI and ACS Style

Guo, J.; Shi, Y.; Chen, Z.; Yu, T.; Shirinzadeh, B.; Zhao, P. Improved SP-MCTS-Based Scheduling for Multi-Constraint Hybrid Flow Shop. Appl. Sci. 2020, 10, 6220. https://doi.org/10.3390/app10186220

AMA Style

Guo J, Shi Y, Chen Z, Yu T, Shirinzadeh B, Zhao P. Improved SP-MCTS-Based Scheduling for Multi-Constraint Hybrid Flow Shop. Applied Sciences. 2020; 10(18):6220. https://doi.org/10.3390/app10186220

Chicago/Turabian Style

Guo, Jian, Yaoyao Shi, Zhen Chen, Tao Yu, Bijan Shirinzadeh, and Pan Zhao. 2020. "Improved SP-MCTS-Based Scheduling for Multi-Constraint Hybrid Flow Shop" Applied Sciences 10, no. 18: 6220. https://doi.org/10.3390/app10186220

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop