Improvement of Scheduling Optimization of Cyber-Physical Systems Based on Petri Net and Intelligent Algorithm

Yang, Yuhai; Liu, Xiaodong; Lu, Wei

doi:10.3390/sym17040487


by Yuhai Yang, Xiaodong Liu and Wei Lu ^*

School of Control Science and Engineering, Dalian University of Technology, Dalian 116024, China

^* Author to whom correspondence should be addressed.

Symmetry 2025, 17(4), 487; https://doi.org/10.3390/sym17040487

Submission received: 11 February 2025 / Revised: 19 March 2025 / Accepted: 22 March 2025 / Published: 24 March 2025

(This article belongs to the Section Engineering and Materials)


Abstract

Cyber-physical systems need more intelligent decision-making methods. To address incomplete process models and inefficient scheduling, we previously proposed a method called Petri-net-adaptive ant colony optimization (PN-AACO). This method targets small-scale job shops with shared resource limits. These shops require symmetric job designs for resource sharing but have asymmetric job processing times. PN-AACO exploits Petri net symmetry at edge nodes, but its marking–transition pheromone index mechanism inherits the state space explosion of Petri nets. As the scale or the number of states grows, the algorithm slows down, which lengthens the overall manufacturing process time and reduces productivity. Thus, we propose the improved PN-AACO (iPN-AACO). The improved method uses transition–transition pheromone recording to bound the number of pheromones. It also adds pheromone-based initial selection and probability rules based on the best known paths. Tests show that this approach speeds up computation by up to 92% on models with more states while keeping scheduling effective.

Keywords:

edge controller; cyber-physical systems; Petri nets; ant colony optimization; intelligent decision-making optimization

1. Introduction

1.1. Importance and Motivation

As edge control technology evolves, the implementation of intelligent manufacturing in smart factories becomes critically dependent on cyber-physical systems (CPSs) adoption [1,2,3]. Numerous cases underscore the pivotal role of optimized intelligent decision-making processes within CPSs [4,5]. This involves perceiving system states, forecasting future developments, setting goals, and planning actions [6]. Meanwhile, optimizing the processing of local data at the edge side significantly improves system performance, efficiency, and reliability. Job shop scheduling with shared resource constraints represents a key challenge in intelligent decision making. This problem has gained growing attention in industrial systems due to its significant impact on production scheduling efficiency [7,8].

In CPSs, the edge controller becomes an integral part. Adding algorithms to edge controllers gives them a certain level of intelligence, which can greatly improve the overall efficiency of the system from the device level. With the development of manufacturing automation, edge controllers with algorithms can replace human decision making to greatly reduce waiting time and improve productivity. However, different algorithms are applicable to different areas. If the algorithm execution time is too long, it will also reduce productivity. Therefore, reducing the execution time of the algorithm is also a necessary part of optimizing efficiency.

1.2. Literature Review

Prior to conducting the work presented in this study, the authors utilized Petri nets to model and analyze the processes of CPSs [2,9]. As a graphical–mathematical modeling tool, Petri nets uniquely integrate diagrammatic representation with algebraic formalism. This synthesis allows for precise characterization of scheduling complexities, including but not limited to process discreteness, operational conflicts, asynchronous behaviors, concurrency patterns, and system deadlocks [10,11]. Consequently, they have emerged as one of the mainstream technologies for CPS modeling and analysis [12,13]. This advantage has led to their widespread application in the industrial manufacturing sector [14,15,16]. Petri nets are gradually becoming a promising method for optimizing the job scheduling problem and have been thoroughly studied by a wide range of scholars. Based on timed Petri nets (TPNs), Cui [17] designed an algorithm combining genetic algorithms and particle swarm optimization to optimize scheduling problems. Similarly, Wu [18] devised a TPN-based algorithm incorporating ant colony optimization (ACO) to optimize process control. However, both approaches employ action fragmentation by dividing a single operation into two discrete phases: an initiation phase modeled through a timed transition and a termination phase represented by an immediate transition. This dual-representation paradigm inevitably inflates the network structure (increased node count) and consequently diminishes algorithmic efficiency. Although Petri nets are capable of modeling CPSs, they are predominantly used for system analysis [13,19]. Few cases directly guide industrial processes to improve efficiency, leading to a disconnect between the established models and practical problem solving.

With the advancement of computer performance, neural networks have begun to make significant strides across various fields, including reinforcement learning and deep learning. Compared to heuristic methods, neural networks rely less on domain knowledge and demonstrate strong capabilities in learning complex patterns and processing large datasets. They have also been proven successful in the field of task scheduling [20,21,22]. Neural networks can be combined with Petri nets to retain the inherent advantages of Petri nets in modeling discrete event dynamic systems. For example, Lassoued [23] developed the PetriRL framework by synergizing Petri nets with deep reinforcement learning. This integrated approach demonstrates strong generalization capabilities across varying instance scales. Kim [24] developed Look-Ahead Reinforcement Learning (LARL) to enhance Petri net model exploration. This method trains Q-networks through deep Q-learning on existing instances and then applies anticipatory search strategies for new scenarios. Experimental validation confirmed LARL’s effectiveness.

However, deep learning models are often viewed as “black boxes”, lacking interpretability. In CPSs, scheduling decisions require clear logic and traceability to meet safety and compliance requirements. Furthermore, deep learning typically necessitates a large amount of training data, which is often difficult to obtain in the field. Training on public datasets may not yield satisfactory generalization performance. In CPS environments, scheduling decisions frequently depend on real-time data. That exposes models to risks of data scarcity and rapid environmental dynamics.

Deep learning model training and inference require intensive computation on both developer workstations and CPS edge controllers. However, real-time processing needs and limited resources often restrict their effectiveness. These factors limit their widespread application in the industrial sector.

Heuristic algorithms are widely adopted in industrial settings due to their operational simplicity. A prime example is the Proportional–Integral–Derivative (PID) method. These algorithms require minimal computational resources compared to exact optimization approaches. Their design inherently balances system stability needs with hardware limitations. This combination makes them particularly suitable for process control applications. The tradeoff for these simplifications, however, is suboptimal scheduling and significant reliance on domain knowledge, which hinders their generalization. Some metaheuristic methods draw inspiration from nature, such as genetic algorithms and ACO. Although these methods are not problem-specific and can be applied to a wide range of problems, they are highly sensitive to initial conditions, necessitating extensive tuning of hyperparameters.

1.3. Contribution

The problem of algorithm efficiency has been widely explored by researchers. However, with today’s increasing computing power, the execution time of algorithms is easily overlooked. In particular, algorithms running on edge controllers, which are devices with little computing power, are constrained by that computing power and therefore need to be optimized for this setting. This study thus aims to address the barriers between Petri nets and heuristic intelligence algorithms in scheduling optimization, with a view to achieving better results and improving algorithmic efficiency in an environment with limited computing power. Furthermore, whether heuristic algorithms can achieve further efficiency improvements based on Petri nets is also a noteworthy question.

Based on this, the authors studied scheduling efficiency and devised a method named Petri-net-adaptive ant colony optimization (PN-AACO). The PN-AACO algorithm eliminates the information barrier between Petri net modeling and heuristic algorithms, thus enhancing the efficiency of heuristic algorithms in Petri net modeling. However, the algorithm’s pheromone index mechanism includes states (markings), which limits PN-AACO. When the number of states is small, PN-AACO can quickly find an optimized execution sequence. Real processes, however, are often modeled by complex Petri nets with a large number of states, and traversing the growing pheromone table becomes slower and slower. This slows the algorithm and, in turn, the operation of the whole system. To address the slow execution of PN-AACO on models with many states, this study proposes a pheromone index mechanism based on transition–transition pairs and fixes the count of pheromones to $|T|^{2} + |T_{o}|$, where $|T|$ is the number of transitions and $|T_{o}|$ is the number of transitions that can fire in the initial marking. The improved PN-AACO (iPN-AACO) algorithm can optimize the workflow by finding a best known schedule comparable to those of other heuristic algorithms in an environment with less computing power. It also has an advantage over PN-AACO on optimization problems with more states: the more pheromones PN-AACO must record, the more pronounced the advantage becomes.

1.4. Organization

The remaining sections of this paper are organized as follows: Because the PN-AACO algorithm proposed in our previous work has not yet been published, a brief introduction to it is provided in Section 2. Section 3 presents the improvements made to the PN-AACO algorithm. The process of the iPN-AACO algorithm is described in Section 4. Section 5 verifies the iPN-AACO algorithm by simulation. Finally, conclusions are drawn in Section 6.

2. PN-AACO Algorithm

The PN-AACO algorithm mainly comprises Petri net modeling, the global timing mechanism of key time points, the pheromone index mechanism for marking–transition, and adaptive pheromone update mechanisms. This algorithm establishes a six-tuple TPN model for the task scheduling process of a CPS:

Σ = (P, T; F, W, M_{0}, τ)

(1)

where $P = \{p_{1}, p_{2}, \dots, p_{|P|}\}$ is the place set (in this study, $|X|$ denotes the number of elements in a set $X$); $T = \{t_{1}, t_{2}, \dots, t_{|T|}\}$ is the transition set; $F$ is the flow relation, represented by ordered pairs such as $(p_{1}, t_{1})$, which denotes an arc from $p_{1}$ to $t_{1}$; $W$ is the weight function, which gives the number of tokens required when a transition fires and defaults to 1 when not marked; $M_{0}$ is the initial marking; and $τ = \{τ_{1}, τ_{2}, \dots, τ_{|T|}\}$ is the firing time set, where $τ_{i}$ is the firing time of $t_{i}$. Because the average firing rate of a transition is defined as $λ_{i} = \frac{1}{τ_{i}}$, the set of average firing rates is $λ = \{λ_{1}, λ_{2}, \dots, λ_{|T|}\} = \{\frac{1}{τ_{1}}, \frac{1}{τ_{2}}, \dots, \frac{1}{τ_{|T|}}\}$. Each $λ_{i}$ is the average number of firings of $t_{i}$ per unit time while it is enabled.
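As a minimal sketch of how the six-tuple in Equation (1) might be held in memory, the following Python container stores the sets and derives the firing rates $λ_{i} = 1/τ_{i}$. The class name, the field names, and the three-place toy net are illustrative assumptions, not part of the original implementation; the flow relation and weights are assumed to be encoded in the input and output matrices.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class TimedPetriNet:
    """Hypothetical container for the TPN six-tuple (names are assumptions)."""
    places: List[str]            # P
    transitions: List[str]       # T
    c_minus: List[List[int]]     # input matrix, |P| x |T| (encodes F and W)
    c_plus: List[List[int]]      # output matrix, |P| x |T|
    m0: List[int]                # initial marking M_0
    tau: List[float]             # firing time of each transition

    def firing_rates(self) -> List[float]:
        # lambda_i = 1 / tau_i for every transition
        return [1.0 / t for t in self.tau]


# Toy net (assumed for illustration): p1 -> t1 -> p2 -> t2 -> p3
toy = TimedPetriNet(
    places=["p1", "p2", "p3"],
    transitions=["t1", "t2"],
    c_minus=[[1, 0], [0, 1], [0, 0]],
    c_plus=[[0, 0], [1, 0], [0, 1]],
    m0=[1, 0, 0],
    tau=[4.0, 2.0],
)
rates = toy.firing_rates()
```

A shorter firing time gives a larger rate, which is why $λ$ can act as the heuristic term in the selection rule.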

The model $Σ$ allows for the determination of the input matrix $C^{-}$ and output matrix $C^{+}$, as well as other variables required by the algorithm. Subsequently, the task objective is defined as finding a transition sequence $σ_{t} = t_{1} t_{2} \dots t_{n}$ that minimizes the execution time $τ_{sum} = \sum_{t_{x} \in σ_{t}}^{n} τ_{x}$ of the sequence.

The traditional ACO algorithm is typically implemented for traveling salesman problem (TSP) scenarios with fixed city counts, where pheromone matrices maintain city-to-city indexing, and ants initiate exploration from randomly assigned urban nodes. While this architecture effectively addresses static TSP configurations, its reliance on predetermined topological constraints poses challenges in dynamic environments requiring real-time adaptability. Unlike the traveling salesman problem, the number of ordinal couples recorded in marking–transition is not a fixed value.

At the initial time, $m$ ants are placed at the marking $M_{0}$. The initial pheromone levels on each path of the first travel are equal: $η_{i, j}(0) = η_{0}$ is the initial pheromone level, where $i = 1, 2, 3, \dots$ is the ordinal label of a marking $M$, and $j = 1, 2, \dots, |T|$ is the ordinal label of a timed transition $t$. The probability that ant $k$ ($k = 1, 2, \dots, m$) chooses the next transition among the enabled transitions, according to a random proportional rule, is

p_{i, j} (t) = \begin{cases} \frac{{[η_{i, j} (t)]}^{α} λ_{j}^{β}}{\sum_{s \in a_{k}} {[η_{i, s} (t)]}^{α} λ_{s}^{β}}, & j \in a_{k} \\ 0, & \text{otherwise} \end{cases}

(2)

where $η_{i, j}$ is the pheromone level from $M_{i}$ to $t_{j}$; $α$ is the pheromone factor; $λ_{j} = 1 / τ_{j}$ is the average firing rate of $t_{j}$ (and likewise $λ_{s}$ for $t_{s}$); $β$ is the heuristic factor; and $a_{k}$ is the set of transitions that the $k$-th ant can fire next.

Upon reaching the end marking $M_{n}$, ants cease their traversal, and the sequence execution time of each ant is calculated. The shortest sequence execution time is then saved, and the pheromone levels on all paths are updated simultaneously.
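The random proportional rule of Equation (2) can be sketched as follows. This is an illustrative Python fragment, not the authors' MATLAB code; the function name and the zero-based indexing of transitions are assumptions. It takes the pheromone row of the current marking, the firing rates, and the set $a_{k}$ of enabled transitions, and returns a probability for each enabled transition (transitions outside $a_{k}$ implicitly have probability 0).

```python
def transition_probabilities(eta_row, rates, enabled, alpha, beta):
    """Equation (2): weight = eta^alpha * lambda^beta, normalized over a_k."""
    weights = {j: (eta_row[j] ** alpha) * (rates[j] ** beta) for j in enabled}
    total = sum(weights.values())
    return {j: w / total for j, w in weights.items()}


# Assumed toy values: three transitions, equal pheromone, only 0 and 1 enabled.
probs = transition_probabilities(
    eta_row=[1.0, 1.0, 1.0],
    rates=[0.5, 0.25, 1.0],
    enabled=[0, 1],
    alpha=1.0,
    beta=1.0,
)
```

With equal pheromone levels, the faster transition (rate 0.5) receives twice the probability of the slower one (rate 0.25).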

Following this, a method for recording pheromones based on key time points and marking–transition pairs (collectively called the KTMT method) was designed to store the search state and connect it to the ACO method. In the key time points method, the time before and after each transition is finely divided, and the entire system shares a single global timeline. Each key time point represents an instant in time and can be viewed as a structure. This structure includes the current time $Time$, the pretransition marking array $M_{s}$, the ending transition set $T_{e}$, the current marking array $M$, the preceding transition set $T_{s}$, and the post-transition marking array $M_{e}$. The internal variables of key time points can be written as $var^{kt(index)}$ or $var^{KT(t_{x})}$, depending on whether they are addressed by index or by time marker. Additionally, since a marking in the Petri net may be reached by different transitions, unlike the cities in the TSP analogy, the marking–transition pair is used as the atomic index to define the pheromones. Pheromones are updated offline: a round of pheromone updates occurs only after all ants have completed their traversal.

Pheromone updates mainly consist of two components: pheromone evaporation and the deposition of pheromones by ants along the paths they traverse. The traditional method for updating pheromones is as follows:

η_{i, j}^{n_{c} + 1} = (1 - ρ) η_{i, j}^{n_{c}} + \sum_{k = 1}^{m} Δ η_{i, j}^{k}

(3)

where $n_{c}$ ($n_{c} = 1, 2, \dots, N_{c}$) denotes the current iteration count; $η_{i, j}^{n_{c} + 1}$ represents the pheromone level for the next iteration; $ρ$ stands for the pheromone evaporation coefficient, where $0 < ρ \leq 1$; and $1 - ρ$ denotes the persistence coefficient of the pheromones. Additionally, $η_{i, j}^{n_{c}}$ signifies the pheromone level for the current iteration, and $Δ η_{i, j}^{k}$ indicates the pheromone level deposited by the $k$-th ant along its path, which is defined as

Δ η_{i, j}^{k} = \begin{cases} Q / τ_{sum}^{k}, & M_{i} t_{j} \in σ^{k} \\ 0, & M_{i} t_{j} \notin σ^{k} \end{cases}

(4)

where $Q$ is the pheromone increase factor; $τ_{sum}^{k}$ is the sequence execution time of the $k$-th ant in the current iteration; and $σ^{k}$ is the occurrence sequence found by the $k$-th ant. According to Equation (4), the sequence execution times of traditional ACO algorithms vary significantly across tasks. This necessitates constant adjustment of $Q$ to obtain suitable pheromone levels. Moreover, within the same task, the impact of the sequence execution time on the pheromone levels is relatively minor, potentially resulting in slower convergence. These limitations are addressed below by improving the pheromone updating mechanism.

Therefore, an adaptive ant colony algorithm (AACO) was designed to optimize the search efficiency of the traditional ant colony algorithm using the Petri net model. The method updates the pheromone increment $Δ η_{i, j}^{k}$ as follows:

Δ η_{i, j}^{k} = \max ( \{ η \mid η = Q \cdot \frac{n_{c}}{N_{c} / 2} \cdot γ^{\frac{τ_{sum}^{k} - τ_{min}}{τ_{max} - τ_{min}}} \} )

(5)

where the subscripts $i$ and $j$ indicate that the pheromone level is from $M_{i}$ to $t_{j}$; $γ$ ($0 < γ < 1$) is the path influence factor; $τ_{min}$ stands for the global minimum sequence execution time; and $τ_{max}$ represents the maximum sequence execution time among all ants in the current iteration. The proposed algorithm achieves good results in small, low-batch tasks.
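To make the interplay of Equations (3) and (5) concrete, the following Python sketch evaluates the expression inside the max of Equation (5) for a single traversal and then applies the evaporation-plus-deposit update of Equation (3) to one table entry. The function names, the execution times of 30 and 40, and the handling of the degenerate case $τ_{max} = τ_{min}$ (exponent set to 0) are assumptions made for illustration; the source does not state how that case is treated.

```python
def adaptive_deposit(q, n_c, n_max, gamma, tau_k, tau_min, tau_max):
    """Expression inside Equation (5) for one ant and one traversal."""
    span = tau_max - tau_min
    # Assumption: when all ants are equally fast, use exponent 0.
    exponent = (tau_k - tau_min) / span if span > 0 else 0.0
    return q * n_c / (n_max / 2) * gamma ** exponent


def evaporate_and_deposit(eta, rho, deposits):
    """Equation (3) for a single pheromone entry."""
    return (1.0 - rho) * eta + sum(deposits)


# Assumed example: iteration 25 of 50, Q = 1000, gamma = 0.45.
best = adaptive_deposit(1000.0, 25, 50, 0.45, tau_k=30.0, tau_min=30.0, tau_max=40.0)
worst = adaptive_deposit(1000.0, 25, 50, 0.45, tau_k=40.0, tau_min=30.0, tau_max=40.0)
new_eta = evaporate_and_deposit(1.0, 0.15, [best, worst])
```

Because the exponent is normalized to the interval from 0 to 1, the deposit always lies between $γ$ times and 1 times the iteration-scaled $Q$, regardless of the absolute execution times of the task.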

To facilitate the description of the scheduling decision problem, let us define it within a Petri net framework as follows:

$J = \{J_{1}, J_{2}, \dots, J_{|J|}\}$ represents a set of $|J|$ jobs, such as those in a production line. Each job is further divided into different operations.
$R = \{r_{1}, r_{2}, \dots, r_{|R|}\}$ denotes a set of $|R|$ resources, such as operators, machines, or tools. These resources belong to the place set and are shared among various operations.

3. iPN-AACO Algorithm

The improved algorithm primarily changes the initial state selection method, the recording method, and the update method of the pheromones, as shown in Figure 1 in comparison with the original algorithm.

As explained in Section 1.2, Petri nets are well suited to describing discrete event dynamic systems such as task scheduling and are widely used; the key time point approach can describe the operation of the Petri net on a global timeline. Therefore, as shown in Figure 1, the iPN-AACO algorithm retains the Petri net model of the original algorithm and the key-time-point description of time. The iPN-AACO algorithm changes the original marking–transition (M-T) pheromone index mechanism to a transition–transition (T-T) mechanism of size $|T|^{2}$; together with the key time point method, this is referred to as KT³. The benefit of the T-T-based recording method is that the size of the pheromone record is independent of the number of reachable markings of the Petri net and does not grow as the number of states increases. However, this also brings a disadvantage: the pheromone index no longer identifies the marking uniquely, i.e., the markings between the same two transitions may differ. To solve this problem, a probabilistic selection rule based on the best known paths is proposed, which makes the algorithm converge according to a heuristic rule. Since the T-T recording method cannot influence the initial selection probability in the way a marking-based index does, a pheromone-based initial state selection method is also proposed. The details are described in the following sections.

3.1. KT³ Method

The key time points serve as a bridge between the Petri net and the algorithm, enabling the optimization of scheduling through these points. The algorithm updates the pheromone levels in each iteration, and how to record pheromones is a crucial optimization issue. In this study, process transitions are analogized to cities in the traveling salesman problem, where each transition serves as a node to record pheromones, resulting in a fixed count of $|T|^{2}$ pheromones. This pheromone recording table is referred to as the table Eta.

The final sequence formed by the Petri net is actually a finite occurrence sequence $σ = M_{0} t_{1} M_{1} t_{2} M_{2} \dots t_{n} M_{n}$. The objective is to find a transition sequence $σ_{t} = t_{1} t_{2} \dots t_{n}$. It is evident that $σ_{t}$ is a simplification of $σ$, obtained by removing the markings from $σ$. If we directly seek $σ_{t}$, the marking between any ordered pair of transitions in $σ_{t}$ is indeterminate under different initial markings. Therefore, the transition–transition pheromone index mechanism does not, in principle, describe the states uniquely; it only reflects the “distance” between transitions locally. As a result, an ant’s selection based on transition–transition pheromones is local. Section 3.3 discusses incorporating the influence of the entire path into the ant’s probability selection.
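A minimal sketch of the transition–transition recording idea is given below, assuming the table Eta is stored as a square list of lists indexed by (previous transition, next transition) with zero-based labels. The function names and the deposit amount are hypothetical. The point of the sketch is that a deposit only needs the transition sequence $σ_{t}$, never the intermediate markings, so the table size stays at $|T|^{2}$.

```python
def make_eta(n_transitions, eta0):
    """Fixed |T| x |T| pheromone table, independent of reachable markings."""
    return [[eta0] * n_transitions for _ in range(n_transitions)]


def deposit_along_sequence(eta, sequence, amount):
    """Add pheromone on every consecutive (previous, next) transition pair."""
    for prev_t, next_t in zip(sequence, sequence[1:]):
        eta[prev_t][next_t] += amount


# Assumed toy case: three transitions, one ant that fired t0, t2, t1 in order.
eta = make_eta(3, 1.0)
deposit_along_sequence(eta, [0, 2, 1], 0.5)
entry_count = sum(len(row) for row in eta)
```

The same pair of transitions can be separated by different markings in different runs, which is exactly the locality limitation noted above.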

3.2. Initial State Selection Based on Pheromone

If the route is influenced by the best known route while the initial selection remains random, the convergence ability of the algorithm is reduced. Therefore, the first transition of each job $J_{i}$ is extracted to form the initial transition set $T_{o}$. A table Zeta of size $|T_{o}|$ is maintained to record the initial pheromone levels. At the beginning of each traversal, the $k$-th ant ($k = 1, 2, \dots, m$) selects its initial transition (the counterpart of the initial city in the TSP) according to a random proportional rule. The probability of selecting a particular transition is designed as follows:

p_{i} (t) = \begin{cases} \frac{{[ς_{i} (t)]}^{α} {λ_{i}}^{β}}{\sum_{s \in T_{o}} {[ς_{s} (t)]}^{α} {λ_{s}}^{β}}, & i = 1, 2, \dots, |T_{o}| \\ 0, & \text{otherwise} \end{cases}

(6)

where $ς_{i} (t)$ represents the pheromone level on transition $t_{i}$ selected in the initial marking $M_{0}$.
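The initial selection of Equation (6), including the random draw itself, might look as follows in Python. This is a sketch under stated assumptions: the table Zeta is a dictionary keyed by transition label, the helper names are hypothetical, and the example borrows $T_{o} = \{t_{1}, t_{6}\}$, $τ_{1} = 7$, $τ_{6} = 8$, $α = 1.9$, and $β = 0.4$ from the radar receiver task in Section 5.1 with equal starting pheromone.

```python
import random


def initial_probabilities(zeta, rates, alpha, beta):
    """Equation (6): normalized zeta^alpha * lambda^beta over T_o."""
    weights = {t: (zeta[t] ** alpha) * (rates[t] ** beta) for t in zeta}
    total = sum(weights.values())
    return {t: w / total for t, w in weights.items()}


def pick_initial(probs, rng):
    """Draw one initial transition according to the probabilities."""
    labels = list(probs)
    return rng.choices(labels, weights=[probs[t] for t in labels], k=1)[0]


zeta = {"t1": 1.0, "t6": 1.0}            # equal initial pheromone (assumed)
rates = {"t1": 1.0 / 7.0, "t6": 1.0 / 8.0}
probs = initial_probabilities(zeta, rates, alpha=1.9, beta=0.4)
first = pick_initial(probs, random.Random(0))
```

Before any pheromone has accumulated, the slightly faster $t_{1}$ is only mildly preferred; the preference sharpens as Zeta is updated.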

3.3. Probabilistic Selection Rules Based on Best Known Paths

In Equation (2), the factor $λ_{j}^{β}$ reflects only the time cost of the local transition. It is inappropriate for only the local transition time to affect the selection probability along the whole path; the path selection of ants should also take into account the shortest-time path found in each round. After each iteration $n_{c}$ ($n_{c} = 1, 2, \dots, N_{c}$) is completed, the path $σ_{tBest}$ with the least execution time is retained. When $n_{c} \geq 2$, the probability selection of each ant is influenced by the global best known path. The probability selection rule is designed as follows:

p_{i, j} (t) = \begin{cases} [1 + Λ] \cdot \frac{{[η_{i, j} (t)]}^{α} {λ_{j}}^{β}}{\sum_{s \in a_{k}} {[η_{i, s} (t)]}^{α} {λ_{s}}^{β}}, & j \in a_{k} \\ 0, & \text{otherwise} \end{cases}

(7)

where $Λ$ is the global best known path influence component, designed as follows:

Λ = \operatorname{sgn} (n_{c} - 1) \cdot Q \cdot \frac{n_{c}^{3}}{N_{c} / 2} \cdot {(ε + ξ)}^{z}

(8)

where $N_{c}$ is the maximum count of iterations, and $ε$ represents the overlap between the current path and the best known path; Algorithm 1 shows how to compute $ε$.

Algorithm 1 Compute $ε$.

Require: transition sequence of current path $σ_{t}^{cur}$, transition sequence set of global best known path $σ_{t}^{best} = \{σ_{t1}^{best}, σ_{t2}^{best}, \dots, σ_{tmax}^{best}\}$.

Ensure: $ε$;

1: for $i = 1$; $i \leqslant |σ_{t}^{best}|$; $i{+}{+}$ do

2: if $σ_{t}^{cur}$ begins with $σ_{ti}^{best}$ then

3: Compute $ε = \frac{|σ_{t}^{cur}|}{|σ_{t}^{best}|}$; ▹ $|σ_{t}|$ denotes the number of transitions in the sequence

4: break;

5: end if

6: end for

7: return $ε$;

Here, $ξ$ is the overlap compensation coefficient, and $z$ is the overlap influence coefficient. The factor $\operatorname{sgn}(n_{c} - 1)$ suppresses the effect of the best known path in the first iteration, when only the first round of paths has been traveled and the path with the shortest execution time is just being produced. The factor $n_{c}^{3} / (N_{c} / 2)$ allows the ant colony to accelerate convergence as the iteration count increases. The factor ${(ε + ξ)}^{z}$ carries the influence of the actual best known path on the ant selection probability. After many tests, $ξ = 0.5$ and $z = 1.5$ were found to be appropriate for this study.
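Algorithm 1 and Equation (8) can be sketched together in Python. Two points in this sketch are interpretations rather than statements from the source: the match test is read as "the current partial path is a prefix of a best known path", with $ε$ taken relative to the length of that matching path, and $ε$ defaults to 0 when no best known path matches. The function names are hypothetical.

```python
def overlap(current, best_paths):
    """Algorithm 1 (interpreted): share of a best path covered by `current`."""
    for best in best_paths:
        # Assumption: a match means `current` is a prefix of `best`.
        if best[:len(current)] == current:
            return len(current) / len(best)
    return 0.0  # assumed default when nothing matches


def sgn(x):
    """Sign function: -1, 0, or 1."""
    return (x > 0) - (x < 0)


def best_path_influence(n_c, n_max, q, eps, xi=0.5, z=1.5):
    """Equation (8): global best known path influence component."""
    return sgn(n_c - 1) * q * n_c ** 3 / (n_max / 2) * (eps + xi) ** z


eps = overlap([1, 2], [[1, 2, 3, 4]])                  # half of the best path
lam_first = best_path_influence(1, 50, 1000.0, eps)    # first iteration
lam_second = best_path_influence(2, 50, 1000.0, eps)   # second iteration
```

In the first iteration the sign factor is zero, so Equation (7) reduces to Equation (2); from the second iteration onward the cubic term grows quickly, which pushes the colony toward paths that share a prefix with the best known one.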

4. Improved Algorithmic Flow

The original PN-AACO algorithm process maintained two tables: the table KT and the pheromone table Eta. The iPN-AACO algorithm also includes an additional table: the initial pheromone table Zeta. The improved algorithm achieves optimization through an iteration cycle, ant cycle, and KT cycle, as illustrated in Figure 2.

From Figure 2, it can be observed that the algorithm process mainly consists of three stages: model building, initialization, and cycle body.

Modeling of the Petri net for scheduling tasks is done in step 1. $M_{n}$ is found by completing each job individually and sequentially, which ensures that $M_{n}$ is reachable. The scheduling process is described in the Petri net model to determine data such as $P$, $T$, $C^{+}$, $C^{-}$, $T_{τ}$, $T_{o}$, $M_{0}$, etc., which serve as input for the subsequent ACO algorithm.

Step 2 is ACO algorithm initialization. In this step, some parameters are defined, including $m$, $λ$, $α$, $β$, $ρ$, $Q$, and $γ$. Then, the table KT, table Eta, table Zeta, and tables for some intermediate parameters are created. Finally, the iteration count is set to 1 in preparation for the first iteration.

The three-layer cycle is described as follows. The three cycles are, from outside to inside, the iteration cycle, the ant cycle, and the KT cycle. First, outside the cycle, the iteration count $n_{c}$ is initialized. After entering the iteration cycle (step 3), first determine whether the current iteration count is less than or equal to the specified count of cycles; if not, enter step 22, output the result, and end the cycle. If yes, enter step 4, initialize the ant number $k = 1$, and enter the ant cycle (step 5). In the ant cycle, the count of ants is $m$. If all ants have traveled, enter step 21: the pheromone level is updated according to Equations (3) and (5), and $n_{c}$ is incremented by one for the next iteration cycle. Otherwise, enter step 6: initialize $Time^{kt(1)} = 0$, $M_{s}^{kt(1)} = M_{e}^{kt(0)} = M_{0}$, and $T_{e}^{kt(1)} = \varnothing$, and prepare to enter the KT cycle, where $Step$ is the index of KT, initialized to $Step = 1$.

Enter the KT cycle (step 7) to determine whether the current key time point is the last one in the table KT. If yes, enter step 20: jump out of the KT cycle and update the ant index so that the next ant travels. If not, enter the KT cycle body (step 8). Because each ant does not necessarily travel through the same number of key time points, the ant cycle is the outer layer of the KT cycle. The KT cycle needs to traverse all key time points on the global timeline, and the key time points are dynamically increasing. After entering the KT cycle, step 8 first determines whether the end condition is satisfied, i.e., whether the end marking $M_{n}$ has been reached. If the end state is satisfied, output the optimization result in step 22; if not, continue the KT cycle with step 9. When traveling to $kt(Step)$, the following is satisfied:

M_{s}^{kt(Step)} = M_{e}^{kt(Step - 1)}, Step = 1, 2, \dots, max

(9)

Then, enter step 10 and compute $M^{kt(Step)}$ according to the following:

M^{kt(Step)} = M_{s}^{kt(Step)} + C^{+} U_{T_{e}}

(10)

where $U_{T_{e}}$ is a vector indexed by the ordinal labels of the transitions, and each element is the number of times the corresponding transition appears in the transition set $T_{e}^{kt(Step)}$, i.e., its fired count. For example, if $|T| = 3$ and $T_{e}^{kt(Step)} = \{t_{1}, t_{1}, t_{2}, t_{2}, t_{2}, t_{3}\}$, then $U_{T_{e}} = {[2, 3, 1]}^{T}$. After calculating $M^{kt(Step)}$, enter step 11: determine which transitions can fire in $M^{kt(Step)}$. The condition under which the transition $t_{j}$ can fire is

\bigwedge_{i = 1}^{|P|} ( M_{i}^{kt(Step)} \geq C_{ij}^{-} )

(11)

where $M^{kt(Step)}$ is a vector, $M_{i}^{kt(Step)}$ denotes the $i$-th element of $M^{kt(Step)}$, and similarly, $C_{ij}^{-}$ denotes the element in the $i$-th row and $j$-th column of matrix $C^{-}$. If Equation (11) is satisfied, then the transition can fire. The transitions that can currently fire are stored temporarily, and the process proceeds to step 12. If $Step = 1$, proceed to step 13, where the probabilities are computed based on Equation (6) and an initial transition is randomly selected. Otherwise, proceed to step 14, where computations and selections are made according to Equation (7). Assuming that the chosen transition is $t_{j}$, a new element $t_{j}$ is added to $T_{s}^{kt(Step)}$ in step 15, i.e.,

{T^{'}}_{s}^{kt(Step)} = T_{s}^{kt(Step)} \cup \{t_{j}\}

(12)

where “′” represents the new value. Once it is added, go to step 16. If $Time^{kt(Step)} + τ_{j}$ does not exist in the table KT, add it to the table KT and sort the table by $Time$. Also, update the member variable $T_{e}$ of the table KT for the new moment:

{T^{'}}_{e}^{KT(Time^{kt(Step)} + τ_{j})} = T_{e}^{KT(Time^{kt(Step)} + τ_{j})} \cup \{t_{j}\}

(13)

Next, enter step 17: calculate the new $M^{kt(Step)}$ after the transition has started firing. It is defined as follows:

{M^{'}}^{kt(Step)} = M^{kt(Step)} - C^{-} U_{t_{j}}

(14)

where $U_{t_{j}}$ is the count vector defined as above for the single transition $t_{j}$. After calculating the new $M^{kt(Step)}$, we again determine whether any transition can fire in $M^{kt(Step)}$. Steps 11 to 17 repeat until no transition can fire in $M^{kt(Step)}$; then enter step 18. Throughout the process from step 11 to step 17, the tokens always satisfy the selected transition first, and after the current transition has consumed its tokens, the remaining tokens are checked to determine whether a new transition can still fire. This process can effectively prevent the deadlock phenomenon caused by resource grabbing. Next is step 19: $M_{e}^{kt(Step)} = M^{kt(Step)}$ is updated, while $Step$ is incremented by one and the next KT cycle begins.
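The marking arithmetic in Equations (10), (11), and (14) can be sketched with plain Python lists. The helper names and the three-place toy net are assumptions for illustration; transitions are labeled from zero, and the matrices $C^{-}$ and $C^{+}$ are stored row by row with one row per place.

```python
def count_vector(fired, n_transitions):
    """U vector: how often each transition label occurs in `fired`."""
    u = [0] * n_transitions
    for j in fired:
        u[j] += 1
    return u


def mat_vec(c, u):
    """Product of a |P| x |T| matrix with a length-|T| vector."""
    return [sum(cij * uj for cij, uj in zip(row, u)) for row in c]


def complete(m_s, c_plus, ended):
    """Equation (10): return output tokens of transitions that just ended."""
    gain = mat_vec(c_plus, count_vector(ended, len(c_plus[0])))
    return [a + b for a, b in zip(m_s, gain)]


def enabled(m, c_minus):
    """Equation (11): t_j can fire if every place holds enough tokens."""
    n_t = len(c_minus[0])
    return [j for j in range(n_t)
            if all(m[i] >= c_minus[i][j] for i in range(len(m)))]


def start(m, c_minus, j):
    """Equation (14): remove the input tokens of the chosen transition."""
    return [m[i] - c_minus[i][j] for i in range(len(m))]


# Toy net (assumed): p0 -> t0 -> p1 -> t1 -> p2
c_minus = [[1, 0], [0, 1], [0, 0]]
c_plus = [[0, 0], [1, 0], [0, 1]]
m0 = [1, 0, 0]
can_fire = enabled(m0, c_minus)
after_start = start(m0, c_minus, 0)
after_end = complete(after_start, c_plus, [0])
```

Splitting a firing into a token-consuming start and a token-producing end at a later key time point is what lets a single timed transition stand in for the immediate-plus-timed pair used in the reference models.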

5. Performance Proof

The experimental setup utilized a 3.0 GHz processor with 32 GB RAM. MATLAB R2022a was employed as the simulation software. To validate the effectiveness of the iPN-AACO algorithm, two simulations were conducted and compared with the original methods (from the references) and the PN-AACO algorithm. The objective of the algorithms was to find a scheduling solution that minimizes the completion time $τ_{sum}$ of the system’s tasks, i.e., to find the transition sequence $σ_{t}$ that minimizes $τ_{sum}$.

5.1. Radar Receiver Test Task

This simulation used the Petri net in reference [17], which describes a parallel test of five parameters of radar receivers A and B (corresponding to tasks $J_{1}$ and $J_{2}$), with five types of instrumentation needed to complete the test. The S³PR model established in reference [17] has a higher number of states because it is a different model type, but in essence it splits each transition into a combination of an immediate and a timed transition. The net established in this study therefore merges the corresponding immediate and timed transitions, and the final TPN model is shown in Figure 3. In the model, $P = \{p_{1}, p_{2}, \dots, p_{12}\}$, $T = \{t_{1}, t_{2}, \dots, t_{10}\}$, and $R = \{r_{1}, r_{2}, r_{3}, r_{4}, r_{5}\} = \{p_{13}, p_{14}, p_{15}, p_{16}, p_{17}\} \in P$. The transition times $τ$ are

$$τ = \{7, 4, 8, 3, 5, 8, 3, 3, 7, 6\} \tag{15}$$

Set the initial and end markings as

$$\begin{aligned} M_{0} &= \{1, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 1, 1, 2, 1, 1\} \\ M_{n} &= \{0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 1, 1, 1, 2, 1, 1\} \end{aligned} \tag{16}$$

The equivalent initial and end markings in reference [17] are, respectively,

$$\begin{aligned} M_{0}^{ref} &= \{1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 2, 1, 1\} \\ M_{n}^{ref} &= \{0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 2, 1, 1\} \end{aligned} \tag{17}$$

The algorithm parameters were set as follows: the maximum iteration count $N_{c} = 50$, $m = 30$, $α = 1.9$, $β = 0.4$, $ρ = 0.15$, $Q = 1000$, and $γ = 0.45$. In the iPN-AACO algorithm, $T_{o} = \{t_{1}, t_{6}\}$. The simulation was run 20 times, and the results obtained are shown in Table 1.
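To give a feel for how parameters such as $α$, $β$, $ρ$, and $Q$ usually enter an ant colony algorithm, the sketch below applies the textbook ant-system selection and pheromone update rules with the values listed above. It is not the selection rule of PN-AACO or iPN-AACO: the role of $γ$ and the best-known-paths rule are omitted, and the heuristic (inverse firing time) and the two-candidate example are assumptions.

```python
import math
from typing import List, Set

ALPHA, BETA, RHO, Q = 1.9, 0.4, 0.15, 1000.0  # values listed in the text


def selection_probabilities(pheromone: List[float], heuristic: List[float]) -> List[float]:
    """Textbook rule: p_i is proportional to pheromone_i^alpha * heuristic_i^beta."""
    weights = [(p ** ALPHA) * (h ** BETA) for p, h in zip(pheromone, heuristic)]
    total = sum(weights)
    return [w / total for w in weights]


def update_pheromone(pheromone: List[float], used: Set[int], tau_sum: float) -> List[float]:
    """Evaporate every entry by rho, then deposit Q / tau_sum on the entries used."""
    deposit = Q / tau_sum
    return [
        (1.0 - RHO) * p + (deposit if i in used else 0.0)
        for i, p in enumerate(pheromone)
    ]


# Two hypothetical candidates with equal pheromone; the heuristic is assumed to be
# the inverse firing time of t1 (7) and t6 (8) from Eq. (15).
pheromone = [1.0, 1.0]
heuristic = [1.0 / 7.0, 1.0 / 8.0]
probs = selection_probabilities(pheromone, heuristic)
updated = update_pheromone(pheromone, used={0}, tau_sum=37.0)
print(probs, updated)
```

With equal pheromone, the candidate with the shorter firing time is slightly preferred; after one deposit on the first entry, the pheromone term dominates because $α$ is much larger than $β$.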

The best known $σ_{t}$ obtained by the five methods (PSO, GA-ACO, GA-PSO, PN-AACO, and iPN-AACO) was practically the same; taking iPN-AACO as an example, the resulting Gantt chart of the task scheduling sequence is shown in Figure 4.

The comparison shows that all five methods could find the optimal sequence with a $τ_{sum}$ of 37 s. However, the two proposed algorithms found the best known path in the first iteration, a 95% reduction relative to the 22 iterations required by the GA-PSO algorithm proposed in reference [17]. Because the algorithm in this study uses the key time points method, many unreasonable paths are avoided from the start of the ACO search, leaving only one final path.

To verify the advantage of the iPN-AACO algorithm in terms of computational speed as the token number increases, the following five initial markings $M_{0}$ were used for the experiment:

$$\begin{aligned} M_{0} = {} & [1, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 1, 1, 2, 1, 1], \\ & [2, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 1, 1, 2, 1, 1], \\ & [2, 0, 0, 0, 0, 0, 2, 0, 0, 0, 0, 0, 1, 1, 2, 1, 1], \\ & [3, 0, 0, 0, 0, 0, 3, 0, 0, 0, 0, 0, 1, 1, 2, 1, 1], \\ & [5, 0, 0, 0, 0, 0, 5, 0, 0, 0, 0, 0, 1, 1, 2, 1, 1]. \end{aligned} \tag{18}$$

Each $M_{0}$ was run 10 times, and the final results are shown in Table 2. Unless otherwise specified, all values are averages over the 10 runs. The search hit rate is the proportion of the 10 runs in which the minimum completion time was found. The best known and average completion time iteration counts refer to the average number of iterations needed for convergence.
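The metrics defined above can be computed as in the following sketch. The run data are hypothetical, not taken from the tables, and averaging the iteration count over converged runs only (with infinity when no run converges) is an assumption about how the reported averages are formed.

```python
import math
from typing import List, Optional


def hit_rate(completion_times: List[float], best_known: float) -> float:
    """Share of runs whose completion time equals the best known value."""
    hits = sum(1 for t in completion_times if t == best_known)
    return hits / len(completion_times)


def convergence_rate(iterations: List[Optional[int]]) -> float:
    """Share of runs that converged; None marks a run that did not converge."""
    return sum(1 for it in iterations if it is not None) / len(iterations)


def mean_iterations(iterations: List[Optional[int]]) -> float:
    """Average convergence iteration over converged runs (assumed convention)."""
    done = [it for it in iterations if it is not None]
    return sum(done) / len(done) if done else math.inf


# Hypothetical results of 10 runs for one initial marking.
times = [427, 427, 431, 427, 440, 427, 427, 435, 427, 429]
iters = [1, 2, None, 1, None, 3, 1, None, 2, None]

print(hit_rate(times, best_known=427), convergence_rate(iters), mean_iterations(iters))
```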

According to Table 2, both algorithms converged to the same $τ_{best}$, and both found $τ_{best}$ in every run. Due to the relatively simple system structure, both algorithms found $τ_{best}$ in the first iteration and converged. The $τ_{best}$ and $τ_{ave}$ convergence rates were both 100%; however, as the initial token count increased, the iPN-AACO algorithm required more iterations for $τ_{ave}$ to converge. Nevertheless, the convergence of $τ_{ave}$ does not have practical significance: when $τ_{ave}$ has not converged, it only indicates that the algorithm is still searching. With an initial token count of 2, the execution time of the iPN-AACO algorithm was 205% longer than that of PN-AACO (1.16 s versus 0.38 s). But with an initial token count of 10, the execution time of iPN-AACO was 79% shorter than that of PN-AACO (7.12 s versus 34.15 s). This indicates that as the initial token count increases, the pheromone size in the PN-AACO algorithm, which depends on the number of states, grows significantly and slows the algorithm down. In contrast, the iPN-AACO algorithm, with a fixed pheromone size, executed more efficiently. A comparison of program execution times is shown in Figure 5.
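The quoted percentages follow directly from the execution times in Table 2. The short sketch below recomputes them; the function and variable names are illustrative only.

```python
from typing import Dict


def relative_change(reference: float, value: float) -> float:
    """Change of value relative to reference, in percent."""
    return (value - reference) / reference * 100.0


# Execution times in seconds from Table 2, keyed by initial token count.
pn_aaco: Dict[int, float] = {2: 0.38, 3: 1.26, 4: 2.21, 6: 6.71, 10: 34.15}
ipn_aaco: Dict[int, float] = {2: 1.16, 3: 1.72, 4: 2.39, 6: 3.67, 10: 7.12}

change = {k: relative_change(pn_aaco[k], ipn_aaco[k]) for k in pn_aaco}
for tokens, pct in change.items():
    print(f"tokens={tokens}: iPN-AACO vs PN-AACO {pct:+.0f}%")
```

A positive value means iPN-AACO was slower than PN-AACO for that marking, and a negative value means it was faster; the sign flips between token counts 4 and 6.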

According to Figure 5, the PN-AACO algorithm executed faster than the iPN-AACO algorithm when the initial token count was 2, 3, and 4. As the initial token count increased, the execution time of the PN-AACO algorithm grew steeply because of the growth in Petri net states. In contrast, the execution time of the iPN-AACO algorithm remained relatively stable because its pheromone size stayed constant. This advantage positions the iPN-AACO algorithm favorably for large-scale systems with many states. The two algorithms ran at roughly the same speed around the point where the PN-AACO pheromone size reached the boundary given by $|T|^{2} + |T_{o}|$. Differences in performance may also arise from coding details such as concatenation functions and array manipulations.
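The fixed pheromone size of iPN-AACO can be checked against the tables with a few lines of Python. The helper name is hypothetical; the PN-AACO sizes are the averages reported in Table 2.

```python
from typing import Dict


def fixed_pheromone_size(n_transitions: int, n_initial: int) -> int:
    """|T|^2 transition-to-transition entries plus |T_o| initial-selection entries."""
    return n_transitions ** 2 + n_initial


# Radar receiver test task: |T| = 10 and T_o = {t1, t6}.
radar_size = fixed_pheromone_size(10, 2)

# Average PN-AACO pheromone sizes from Table 2, keyed by initial token count.
pn_aaco_sizes: Dict[int, float] = {2: 14, 3: 72.3, 4: 130.2, 6: 514, 10: 2053.7}

# Smallest token count at which the PN-AACO table outgrows the fixed size.
crossover = min(k for k, v in pn_aaco_sizes.items() if v > radar_size)
print(radar_size, crossover)

# Observation: the fixed size 366 in Table 4 equals 19^2 + 5, which is consistent
# with the 19 firing times listed in Eq. (19) and the five elements of T_o.
print(fixed_pheromone_size(19, 5))
```

The crossover at a token count of 4 matches the region of Figure 5 in which the two execution times are closest (2.21 s versus 2.39 s in Table 2).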

5.2. Manufacturing System Processing Task

This simulation utilized the model from reference [25], depicted in Figure 6. The model represents a manufacturing system with four machining tasks and is constrained by three shared resources. In this model, the sets are defined as follows: $P = \{p_{1}, p_{2}, \dots, p_{20}\}$, $T = \{t_{1}, t_{2}, \dots, t_{18}\}$, and $R = \{r_{1}, r_{2}, r_{3}\} = \{p_{18}, p_{19}, p_{20}\} \in P$. The transition time $\frac{1}{λ}$ is given by

$$\frac{1}{λ} = \{69, 75, 85, 57, 51, 80, 75, 97, 85, 92, 98, 78, 75, 56, 70, 68, 99, 76, 93\} \tag{19}$$

The initial and end markings in reference [25] are, respectively,

$$\begin{aligned} M_{0} &= \{1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1\} \\ M_{n} &= \{0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 1\} \end{aligned} \tag{20}$$

The algorithm simulation parameters were the same as in the radar receiver test task. In the iPN-AACO algorithm, $T_{o} = \{t_{0}, t_{1}, t_{6}, t_{11}, t_{16}\}$. The two algorithms, PN-AACO and iPN-AACO, were run 20 times each and compared with the data reported in reference [25]; the results are shown in Table 3.

From Table 3, it can be seen that, compared to the improved ACO algorithm, the PN-AACO algorithm reduced the $τ_{ave}$ iterations and the $τ_{best}$ iterations by 73% and 67%, respectively, and the iPN-AACO algorithm reduced them by 81% and 67%, respectively. The two proposed algorithms thus converged more quickly. The PN-AACO algorithm was more volatile, and the iPN-AACO algorithm performed slightly better on the optimization problem for this model. For $τ_{ave}$, convergence to a stable value is not of practical significance; only PN-AACO settled near, rather than at, the minimum value of 427 (427.3 in Table 3), indicating that it was still performing a local search. All three algorithms were identical in the two metrics of $τ_{best}$ and the hit rate. Finally, the improved ACO algorithm found the best known transition sequence $t_{1} t_{16} t_{11} t_{17} t_{18} t_{2} t_{5} t_{6} t_{8} t_{12} t_{15} t_{9}$. The PN-AACO algorithm and the iPN-AACO algorithm found more than one best known transition sequence; the Gantt chart of the best known schedule is shown in Figure 7, taking the sequence $t_{1} t_{16} t_{11} t_{17} t_{6} t_{18} t_{2} t_{8} t_{12} t_{5} t_{9} t_{15}$ as an example.

To verify the advantage of the iPN-AACO algorithm in terms of computational speed as the token number increases, the PN-AACO and iPN-AACO algorithms were run with the following initial markings $M_{0}$:

$$\begin{aligned} M_{0} = {} & [1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1], \\ & [2, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1], \\ & [2, 2, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1], \\ & [2, 2, 2, 2, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1], \\ & [3, 3, 3, 3, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1]. \end{aligned} \tag{21}$$

In this test, the algorithm parameters were the same as in the radar receiver test task. Each $M_{0}$ was run 10 times, and the final results are presented in Table 4.

According to Table 4, the PN-AACO algorithm failed to find the best known completion time when the initial token count was 12 (1175 s versus 1097 s for iPN-AACO), and its search hit rate decreased as the initial token count increased. In contrast, the search hit rate of the iPN-AACO algorithm fluctuated as the initial token count increased. Both algorithms showed decreasing convergence rates for $τ_{best}$ and $τ_{ave}$ as the initial token count increased, with the iPN-AACO algorithm requiring more iterations. Compared to the test task, the processing task represents a more complex system, in which increasing the initial token count results in more states. Consequently, the search coverage of both algorithms was affected, and performance declined with increasing initial token count. However, the data indicate that the iPN-AACO algorithm performed slightly better. The increase in initial token count enlarged the pheromone size in PN-AACO, and the execution times of both algorithms increased. With an initial token count of 4, the execution time of the iPN-AACO algorithm was 4% shorter than that of PN-AACO. When the initial token count was 12, it was 92% shorter. As in the test task, this suggests that the efficiency advantage of iPN-AACO grows with the initial token count.

6. Train Loading Application

To validate the feasibility of the iPN-AACO algorithm, this study conducted a simulation using the task of coiling carbon steel sheets in the 2# warehouse area of the Finished Product Service Branch of Gansu Jiuquan Iron and Steel Group Hongxing Iron and Steel Co., Ltd., Jiayuguan, China. The simulation aimed to obtain a scheduling solution that minimizes the completion time of work in this system. Specifically, the goal was to find the shortest transition sequence $σ_{t}$ that minimized the completion time $τ_{sum}$. The simulation employed both the PN-AACO algorithm and the iPN-AACO algorithm for this purpose.

The cold-rolled carbon steel sheet 2# storage area has two double-girder overhead traveling cranes and two forklifts for transporting steel coils. One railroad loading and unloading line gives access to the warehouse area and holds six carriages; each carriage carries two steel brackets, and each steel bracket carries two steel coils. The task was to deploy the cranes and forklifts so that loading the six carriages driven into the storage area takes the shortest possible time. This study did not take into account the mutual interference between cranes, and it assumed that the storage capacity is much greater than the capacity of the six carriages. Additionally, it did not consider the time taken for the train to transport coils to the next empty train position. The loading problem was simplified and modeled as the Petri net shown in Figure 8.

The description of each parameter in Figure 8 is shown in Table 5.

The algorithm parameters were the same as in the test task. In the iPN-AACO algorithm, $T_{o} = \{t_{1}, t_{2}, t_{5}, t_{6}, t_{9}, t_{10}, t_{13}, t_{14}, t_{17}, t_{18}, t_{21}, t_{22}\}$. The target was to find a $σ_{t}$ that minimizes $τ_{sum}$ when the train drives into the warehouse area. The PN-AACO algorithm had an execution time of 36.1 s, and the iPN-AACO algorithm had an execution time of 5.1 s. Both algorithms found the best known path, and the $τ_{sum}$ of $σ_{t}$ came out to 1320 s. The algorithms found more than one best known path; the scheduling Gantt chart in Figure 9 shows one example, $t_{22} t_{9} t_{21} t_{14} t_{9} t_{5} t_{14} t_{6} t_{17} t_{17} t_{12} t_{16} t_{15} t_{19} t_{8} t_{11} t_{7} t_{24} t_{23} t_{23} t_{24} t_{8} t_{19} t_{11} t_{16} t_{20} t_{15} t_{7} t_{12} t_{20} t_{1} t_{1} t_{3} t_{3} t_{4} t_{4}$.
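As a plausibility check of the reported $τ_{sum}$, the workload that the example sequence places on each resource type can be tallied from the durations in Table 5. The sketch below is illustrative only: it assumes that each transition occupies one unit of its resource for its full duration and that the two cranes and the two forklifts work in parallel, so the per-unit workload is only a lower bound on the completion time.

```python
from collections import Counter
from typing import Dict, List

# Example best known sequence from the text (transition indices).
SEQUENCE: List[int] = [
    22, 9, 21, 14, 9, 5, 14, 6, 17, 17, 12, 16, 15, 19, 8, 11, 7, 24,
    23, 23, 24, 8, 19, 11, 16, 20, 15, 7, 12, 20, 1, 1, 3, 3, 4, 4,
]

# Table 5: within each block of four transitions, the index modulo 4 gives the
# operation: 1 crane bracket, 2 forklift bracket, 3 crane coil, 0 forklift coil.
KIND = {1: ("crane", 90), 2: ("forklift", 120), 3: ("crane", 150), 0: ("forklift", 180)}
UNITS = {"crane": 2, "forklift": 2}  # two cranes and two forklifts, as stated in the text


def workload(sequence: List[int]) -> Dict[str, int]:
    """Total busy time in seconds per resource type for the given sequence."""
    load: Counter = Counter()
    for t in sequence:
        resource, duration = KIND[t % 4]
        load[resource] += duration
    return dict(load)


load = workload(SEQUENCE)
bound = max(load[r] / UNITS[r] for r in load)
print(load, bound)
```

Under these assumptions the forklifts carry 2640 s of work in total, i.e., 1320 s per forklift, which agrees with the reported completion time of 1320 s.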

7. Conclusions

This study introduces the iPN-AACO algorithm to address the performance issues that the original PN-AACO algorithm encounters when the model size or the number of states increases. The iPN-AACO algorithm retains the Petri net modeling method and the time representation based on key time points. On this basis, the pheromone index mechanism and the initial state selection method were improved so that the pheromone size is fixed at $|T|^{2} + |T_{o}|$. In addition, a probabilistic selection method based on best known paths was designed to increase the influence of the best known paths and weaken local influences on the results. The simulations showed that the PN-AACO algorithm executed faster on small-scale models or models with fewer states, whereas the iPN-AACO algorithm was faster when the model size or the number of states increased, with up to a 92% reduction in runtime. Each algorithm has its own advantages in different application environments, so the choice should be based on the specific task requirements. The next task will be to deploy the algorithms on the edge control nodes (self-developed PAG series controllers); how to provide a user interface for the modeling work is also a key point to consider.

Author Contributions

Conceptualization, Y.Y., X.L. and W.L.; methodology, Y.Y.; software, Y.Y.; validation, X.L. and W.L.; formal analysis, W.L.; investigation, Y.Y.; resources, W.L.; data curation, Y.Y.; writing—original draft preparation, Y.Y.; writing—review and editing, Y.Y., X.L. and W.L.; visualization, Y.Y.; supervision, X.L. and W.L.; project administration, X.L. and W.L.; funding acquisition, W.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China, grant numbers 62073056 and 61876029; the Applied Basic Research Project of Liaoning Province, grant number 2023JH2/101300207; the Dalian Key Field Innovation Team Project, grant number 2021RT14; and the Science and Technology Major Project of the Xinjiang Uygur Autonomous Region, grant number 2022A01001.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

1. Yang, Y.; Zhong, C.; Liu, X.; Lu, W. Modeling and Analysis of Cyber-physical Systems Based on Petri Net. Int. J. Control Autom. Syst. 2023, 21, 2980–2994.
2. Wang, C.; Lv, Y.; Wang, Q.; Yang, D.; Zhou, G. Service-Oriented Real-Time Smart Job Shop Symmetric CPS Based on Edge Computing. Symmetry 2021, 13, 1839.
3. Poltavtseva, M.; Shelupanov, A.; Bragin, D.; Zegzhda, D.; Alexandrova, E. Key Concepts of Systemological Approach to CPS Adaptive Information Security Monitoring. Symmetry 2021, 13, 2425.
4. Sobb, T.; Turnbull, B.; Moustafa, N. A Holistic Review of Cyber-Physical-Social Systems: New Directions and Opportunities. Sensors 2023, 23, 7391.
5. Nandhini, R.S.; Lakshmanan, R. A Review of the Integration of Cyber-Physical System and Internet of Things. Int. J. Adv. Comput. Sci. Appl. 2022, 13, 459–465.
6. Yaghoubi, E.; Yaghoubi, E.; Yusupov, Z.; Maghami, M.R. A Real-Time and Online Dynamic Reconfiguration against Cyber-Attacks to Enhance Security and Cost-Efficiency in Smart Power Microgrids Using Deep Learning. Technologies 2024, 12, 197.
7. Li, X.; Wang, K.; Yu, X.; Su, W. CPS-based Multiple Model Adaptive Control of GGBS Production Process. Acta Autom. Sin. 2019, 45, 1354–1365.
8. Ma, X.; Xu, X.; Jiang, G.; Qiao, Y.; Zhou, T. Hybrid adaptive particle swarm optimization algorithm for workflow scheduling. J. Comput. Appl. 2023, 43, 474–483.
9. Yang, Y.; Liu, X.; Lu, W. A Cyber-Physical Systems-Based Double-Layer Mapping Petri Net Model for Factory Process Flow Control. Appl. Sci. 2023, 13, 8975.
10. Yuan, C. Principle and Application of Petri Net; Publishing House of Electronics Industry: Beijing, China, 2005.
11. Xie, N.; Li, A. Research on Modeling, Scheduling and Controller of Reconfigurable Manufacturing System Using Petri Nets; Tongji University Press: Shanghai, China, 2017.
12. Xu, G.; Chen, Y. Petri-Net-Based Scheduling of Flexible Manufacturing Systems Using an Estimate Function. Symmetry 2022, 14, 1052.
13. Bouras, A.; Hamaci, S.; Ech-Chhibat, M.E.H.; Samri, H.; Absi, R. On the dynamic behavior of smart factories using petri nets in the context of industry 4.0. In Proceedings of the 2023 3rd International Conference on Innovative Research in Applied Science, Engineering and Technology (IRASET), Mohammedia, Morocco, 18–19 May 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 1–6.
14. Pei, Y.; Yang, C.; Xu, J.; Wang, Y.; Dong, X. A hierarchical evaluation index system for FMS reliability considering coupling relations between system elements. Int. J. Adv. Manuf. Technol. 2021, 124, 3737–3747.
15. Zhang, Y.; Wang, Y.; Wang, L.; Cai, G. An Extended Object-Oriented Petri Net Model for Vulnerability Evaluation of Communication-Based Train Control System. Symmetry 2020, 12, 1474.
16. Shi, J.; Feng, T.; Zheng, L.; Wu, Y. Research on the Security of NC-Link Numerical Control Equipment Protocol Based on Colored Petri Net. Symmetry 2024, 16, 1612.
17. Cui, Y.; Yue, X.; Zhou, K. Parallel test combining timed Petri net with GA-PSO algorithm. J. Comput. Appl. 2010, 30, 1902–1905.
18. Wu, X.; Tian, S.; Zhang, L. The Internet of Things Enabled Shop Floor Scheduling and Process Control Method Based on Petri Nets. IEEE Access 2019, 7, 27432–27442.
19. Mahato, D.P.; Singh, R.S. Load balanced scheduling and reliability modeling of grid transaction processing system using colored Petri nets. ISA Trans. 2019, 84, 225–236.
20. Lei, K.; Guo, P.; Zhao, W.; Wang, Y.; Qian, L.; Meng, X.; Tang, L. A multi-action deep reinforcement learning framework for flexible Job-shop scheduling problem. Expert Syst. Appl. 2022, 205, 117796.
21. Liu, R.; Piplani, R.; Toro, C. Deep reinforcement learning for dynamic scheduling of a flexible job shop. Int. J. Prod. Res. 2022, 60, 4049–4069.
22. Yuan, E.; Cheng, S.; Wang, L.; Song, S.; Wu, F. Solving job shop scheduling problems via deep reinforcement learning. Appl. Soft Comput. 2023, 143, 110436.
23. Lassoued, S.; Schwung, A. Introducing PetriRL: An innovative framework for JSSP resolution integrating Petri nets and event-based reinforcement learning. J. Manuf. Syst. 2024, 74, 690–702.
24. Kim, H.J.; Lee, J.H. Look-ahead based reinforcement learning for robotic flow shop scheduling. J. Manuf. Syst. 2023, 68, 160–175.
25. Huang, H. Scheduling Flexible Manufacturing Systems Based on Petri Nets and an Ant Colony Algorithm. Master’s Thesis, Xidian University, Xian, China, 2023.

Figure 1. Comparison between the improved and original PN-AACO algorithms.

Figure 2. The algorithm flow of iPN-AACO.

Figure 3. Test task Petri net.

Figure 4. Best known scheduling Gantt chart in test task.

Figure 5. Comparison of program execution time.

Figure 6. Processing task Petri net.

Figure 7. Best known scheduling Gantt chart in processing task.

Figure 8. Train loading task Petri net.

Figure 9. Best known scheduling Gantt chart in loading task.

Table 1. Comparison results in test task.

Method	Best Known Sequence	Sequence Probability	Execution Time/s	Iterations
PSO				28
GA-ACO	$t_{10} t_{20} t_{15} t_{25} t_{11} t_{21} t_{16} t_{26} t_{12} t_{22} -$ $t_{17} t_{27} t_{18} t_{28} t_{13} t_{23} t_{19} t_{29} t_{14} t_{24}$	95%	37	25
GA-PSO				22
PN-AACO	$t_{1} t_{6} t_{2} t_{7} t_{3} t_{8} t_{9} t_{4} t_{10} t_{5}$	100%	37	1
iPN-AACO	$t_{1} t_{6} t_{2} t_{7} t_{3} t_{8} t_{9} t_{4} t_{10} t_{5}$	100%	37	1

Table 2. Comparison of test task.

Method	PN-AACO					iPN-AACO
Initial token count ¹	2	3	4	6	10	2	3	4	6	10
$τ_{best}$ ²/s	37	47	59	87	43	37	47	59	87	43
Hit rate	100%	100%	100%	100%	100%	100%	100%	100%	100%	100%
$τ_{best}$ convergence rate	100%	100%	100%	100%	100%	100%	100%	100%	100%	100%
$τ_{best}$ iterations	1	1	1	1	1	1	1	1	1	1
$τ_{ave}$ ³ convergence rate	100%	100%	100%	100%	100%	100%	100%	100%	100%	100%
$τ_{ave}$ iterations	1	4.6	4.7	5.5	5.2	1	3.2	9.33	15.9	23.6
Pheromone size	14	72.3	130.2	514	2053.7	102	102	102	102	102
Execution time/s	0.38	1.26	2.21	6.71	34.15	1.16	1.72	2.39	3.67	7.12

¹ Non-resources. ² Best known completion time. ³ Average completion time.

Table 3. Results of the three algorithms.

Method	Improved ACO	PN-AACO	iPN-AACO
$τ_{ave}$ iterations	5.2	1.4	1
$τ_{best}$ iterations	3	1	1
$τ_{ave}$ /s	427	427.3	427
$τ_{best}$ /s	427	427	427
Hit rate	100%	100%	100%

Table 4. Comparison of processing task.

Method	PN-AACO					iPN-AACO
Initial token count	4	5	6	8	12	4	5	6	8	12
$τ_{best}$ /s	427	512	581	762	1175	427	512	581	762	1097
Hit rate	100%	60%	60%	10%	0%	100%	60%	50%	70%	40%
$τ_{best}$ convergence rate	100%	100%	80%	80%	N/A	100%	100%	90%	100%	10%
$τ_{best}$ iterations	1.2	2.4	2.9	2.75	N/A	1.3	3.6	2.3	10.5	29
$τ_{ave}$ convergence rate	60%	10%	10%	0%	N/A	50%	30%	10%	20%	0%
$τ_{ave}$ iterations	12.3	8	10	Inf	N/A	10.8	6.3	14	29.5	Inf
Pheromone size	209.9	478.6	896.7	1606.4	3149.9	366	366	366	366	366
Execution time/s	1.45	3.28	6.23	14.00	36.90	1.39	1.63	1.76	2.45	3.10

Table 5. Definition of places and transitions.

Places or Transitions	Descriptions
$p_{1}, p_{4}, p_{7}, p_{10}, p_{13}, p_{16}$	Count of steel brackets
$p_{2}, p_{5}, p_{8}, p_{11}, p_{14}, p_{17}$	Count of steel coils
$p_{3}, p_{6}, p_{9}, p_{12}, p_{15}, p_{18}$	Completion status of coil loading
$p_{10} (r_{1}), p_{11} (r_{2})$	Crane resources, forklift resources, respectively
$t_{1}, t_{5}, t_{9}, t_{13}, t_{17}, t_{21}$	Average time for crane to install steel bracket: 90 s
$t_{2}, t_{6}, t_{10}, t_{14}, t_{18}, t_{22}$	Average time for forklift to install steel bracket: 120 s
$t_{3}, t_{7}, t_{11}, t_{15}, t_{19}, t_{23}$	Average time for crane loading steel coil: 150 s
$t_{4}, t_{8}, t_{12}, t_{16}, t_{20}, t_{24}$	Average time for forklift loading steel coil: 180 s

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
