PINN-LSTM: A High-Precision Physics-Informed Neural Network for Solving Malware Propagation Dynamics in Wireless Sensor Networks

Zhang, Rui; Zhou, Kai; Shen, Shoufeng; Pang, Jiafu; Cao, Zhiyi

doi:10.3390/sym18050707

Open AccessArticle

PINN-LSTM: A High-Precision Physics-Informed Neural Network for Solving Malware Propagation Dynamics in Wireless Sensor Networks

by

Rui Zhang

¹,

Kai Zhou

²,

Shoufeng Shen

²

,

Jiafu Pang

^2,* and

Zhiyi Cao

²

¹

College of Information Science and Technology, Zhejiang Shuren University, Hangzhou 310015, China

²

School of Mathematical Sciences, Zhejiang University of Technology, Hangzhou 310023, China

^*

Author to whom correspondence should be addressed.

Symmetry 2026, 18(5), 707; https://doi.org/10.3390/sym18050707

Submission received: 30 March 2026 / Revised: 14 April 2026 / Accepted: 21 April 2026 / Published: 23 April 2026

(This article belongs to the Section Mathematics)

Download

Browse Figures

Versions Notes

Abstract

This paper proposes a hybrid PINN + LSTM framework for the high-precision solution of malware propagation dynamics in wireless sensor networks. A seven-compartment SVEHLQR model is developed to capture this complex transmission process. To overcome the limitations of standard physics-informed neural networks (PINNs) in long-term prediction, including gradient vanishing and error accumulation, we integrate LSTM’s temporal memory capability into the PINN architecture. Comprehensive comparisons are conducted among the proposed PINN + LSTM, standard PINN, and Fourier PINN, using the fourth-order Runge–Kutta method as the benchmark. Experimental results demonstrate that PINN + LSTM significantly outperforms both baseline methods, achieving an average relative error of

3.88 \times 10^{- 3}

compared to

7.20 \times 10^{- 2}

for PINN and

2.81 \times 10^{- 1}

for Fourier PINN, representing a 94.6% accuracy improvement over PINN. These results validate that incorporating LSTM’s recursive memory mechanism enables the accurate and efficient solution of complex time-dependent dynamical systems. Additionally, the model’s robustness is verified under 1%, 5%, and 10% Gaussian noise. PINN + LSTM maintains extremely low relative errors, not exceeding 0.0049, and outperforms PINN and Fourier PINN significantly, confirming its strong noise immunity and stable dynamics learning ability in realistic environments.

Keywords:

physics-informed neural networks (PINNs); long short-term memory (LSTM); wireless sensor networks (WSNs); malware propagation

1. Introduction

In recent years, physics-informed neural networks (PINNs) have emerged as a powerful paradigm for solving differential equations by embedding physical laws directly into the learning process [1,2,3]. The core idea of PINNs is to incorporate governing equations as soft constraints into the loss function via automatic differentiation, which computes derivatives of network outputs with respect to inputs [4,5]. This allows for the construction of residual terms equivalent to the original differential equations, enabling the network to learn system dynamics solely by satisfying physical laws, even in the absence of labeled data. Unlike traditional numerical methods such as finite difference or finite element methods, PINNs are mesh-free, thus avoiding the computational bottlenecks associated with grid generation and the curse of dimensionality. They naturally handle high-dimensional problems, complex boundary conditions, and sparse observational data while integrating data-driven insights with physical knowledge. This makes PINNs particularly advantageous in inverse problems, parameter identification, and uncertainty quantification [6,7]. PINNs have been successfully applied in various fields, including fluid mechanics, solid mechanics, heat transfer, and quantum mechanics.

As a key application area, PINNs have been employed to solve various systems of differential equations derived from epidemiological and propagation models [8,9,10]. Their ability to simultaneously fit observational data while respecting underlying physical laws makes them particularly attractive for modeling complex dynamical systems. Moreover, as these epidemiological and propagation models become increasingly high-dimensional, efficiently solving the resulting differential equation systems poses a critical challenge. Traditional numerical methods such as Runge–Kutta [11], while accurate, incur substantial computational costs and face difficulties in real-time scenarios. However, despite their success across multiple domains, conventional PINNs treat time as an independent spatial dimension and compute derivatives solely through automatic differentiation. This approach introduces two fundamental limitations when solving time-dependent dynamical systems: accumulated errors over long time horizons and the vanishing or exploding gradient problem in deep networks.

To overcome these limitations, this work integrates PINNs with long short-term memory networks. The memory cells and gating mechanisms of LSTM effectively capture long-term dependencies and time-delayed effects, which standard PINNs fail to exploit due to their independent treatment of time points [12,13]. Moreover, the gating structure alleviates gradient vanishing and explosion, stabilizing training and accelerating convergence, making the hybrid PINN + LSTM framework particularly suitable for long-range sequence prediction.

As a concrete application, we consider the problem of malware propagation in wireless sensor networks (WSNs), where accurate modeling and prediction are critical for network security. Malware poses severe threats to WSN stability, causing resource theft, data tampering, and service outages [14]. Given the analogy between malware spread and infectious disease transmission, compartmental models such as SIR and SEIR have been widely adopted to characterize propagation dynamics [15]. However, with increasingly refined node state divisions, the resulting differential equation models become high-dimensional, making efficient solution a critical challenge.

In this paper, we propose a novel PINN + LSTM hybrid model to characterize malware propagation in WSNs. Departing from traditional numerical methods, we employ physics-informed neural networks integrated with long short-term memory to solve the underlying differential equation systems. Using the fourth-order Runge–Kutta (RK4) method as the benchmark, we conduct comprehensive numerical comparisons between the proposed PINN + LSTM, standard PINN, and Fourier PINN. The rest of the paper is organized as follows. Section 2 constructs a malware propagation model based on SEIR epidemic theory. Section 3 introduces the PINN + LSTM framework. Section 4 carries out numerical simulations and a corresponding analysis. Finally, Section 5 concludes the paper and discusses future work.

2. Malware Propagation Model Formulation

In this section, we construct an extended SEIR-based epidemic model with seven state variables, namely Susceptible (S), Vaccinated (V), Exposed (E), High-Risk Infectious (H), Low-Risk Infectious (L), Quarantined (Q), and Recovered (R). Let

S (t)

,

V (t)

,

E (t)

,

H (t)

,

L (t)

,

Q (t)

, and

R (t)

denote the density of nodes in each compartment at time t, respectively. We assume that the total number of nodes in the WSNs remains constant over time. Therefore,

S (t) + V (t) + E (t) + H (t) + L (t) + Q (t) + R (t) = 1 .

(1)

Let

[t, t + Δ t]

be a time interval, where

Δ t \geq 0

is a sufficiently small time segment starting from time t. Based on epidemic theory, the state transition relationships of nodes among different compartments are illustrated in Figure 1.

The density variation in compartment S consists of three components. First, susceptible nodes are protected by the defense system and transition to the vaccinated compartment V at a rate

ϵ

. Second, susceptible nodes can be infected by high-risk and low-risk infectious nodes, and then move to the exposed compartment E at rates

α_{1}

and

α_{2}

, respectively. Third, some recovered nodes return to the susceptible state at rate

γ

. Accordingly, the density change in S over the time interval

[t, t + Δ t]

is formulated as follows:

S (t + Δ t) - S (t) = (- ϵ S (t) - S (t) (α_{1} H (t) + α_{2} L (t)) + γ R (t)) Δ t

(2)

The density variation in compartment V consists of two components. First, susceptible nodes are protected by the defense system and transition to the vaccinated compartment V at rate

ϵ

. Second, some vaccinated nodes may still be infected by high-risk and low-risk infectious nodes, and transfer to the exposed compartment E at rates

α_{3}

and

α_{4}

, respectively, where

α_{3} < α_{1}

and

α_{4} < α_{2}

. Accordingly, the density change in V over the time interval

[t, t + Δ t]

is formulated as follows:

V (t + Δ t) - V (t) = (ϵ S (t) - V (t) (α_{3} H (t) + α_{4} L (t))) Δ t

(3)

The density variation in compartment E consists of two components: susceptible and vaccinated nodes can be infected and transition to compartment E (consistent with the infection mechanisms of S and V), and the exposed nodes then progress to compartments H and L at rates

β_{1}

and

β_{2}

, respectively. Accordingly, the density change in E over the time interval

[t, t + Δ t]

is formulated as follows:

E (t + Δ t) - E (t) = (S (t) (α_{1} H (t) + α_{2} L (t)) + V (t) (α_{3} H (t) + α_{4} L (t)) - (β_{1} + β_{2}) E (t)) Δ t

(4)

The density variation in compartment H consists of two components: exposed nodes transfer to compartment H at rate

β_{1}

, and high-risk infectious nodes move to compartment Q at rate

δ

. Accordingly, the density change in H over the time interval

[t, t + Δ t]

is formulated as follows:

H (t + Δ t) - H (t) = (β_{1} E (t) - δ H (t)) Δ t

(5)

The density variation in compartment L consists of two components: exposed nodes transition to compartment L at rate

β_{2}

, and low-risk infectious nodes recover at rate

η

. Accordingly, the density change in L over the time interval

[t, t + Δ t]

is formulated as follows:

L (t + Δ t) - L (t) = (β_{2} E (t) - η L (t)) Δ t

(6)

The density variation in compartment Q is determined by two components: high-risk infectious nodes move to compartment Q at rate

δ

, and quarantined nodes recover at rate

η

. Accordingly, the density change in Q over the time interval

[t, t + Δ t]

is formulated as follows:

Q (t + Δ t) - Q (t) = (δ H (t) - η Q (t)) Δ t

(7)

Since the total number of nodes in the WSN is constant, the density variation in compartment R consists of three components: low-risk infectious nodes recover at rate

η

, quarantined nodes recover at rate

η

, and some recovered nodes return to the susceptible state at rate

γ

. Accordingly, the density change in R over the time interval

[t, t + Δ t]

is formulated as follows:

R (t + Δ t) - R (t) = (η (Q (t) + L (t)) - γ R (t)) Δ t

(8)

We assume that the number of nodes in each state varies continuously within

Δ t

. Equations (2)–(8) are in discrete-time difference form; by dividing by

Δ t

and letting

Δ t \to 0

, the continuous-time ordinary differential Equation (9) is directly obtained.

\{\begin{matrix} \frac{d S (t)}{d t} = - ϵ S (t) - S (t) (α_{1} H (t) + α_{2} L (t)) + γ R (t), \\ \frac{d V (t)}{d t} = ϵ S (t) - V (t) (α_{3} H (t) + α_{4} L (t)), \\ \frac{d E (t)}{d t} = (α_{1} S (t) + α_{3} V (t)) H (t) + (α_{2} S (t) + α_{4} V (t)) L (t) - (β_{1} + β_{2}) E (t), \\ \frac{d H (t)}{d t} = β_{1} E (t) - δ H (t), \\ \frac{d L (t)}{d t} = β_{2} E (t) - η L (t), \\ \frac{d Q (t)}{d t} = δ H (t) - η Q (t), \\ \frac{d R (t)}{d t} = η (Q (t) + L (t)) - γ R (t), \end{matrix}

(9)

where the initial condition is

{S (0) \geq 0, V (0) \geq 0, E (0) \geq 0, H (0) \geq 0, L (0) \geq 0, Q (0) \geq 0,

R (0) \geq 0}

.

The added complexity of the SVEHLQR model is justified by its ability to capture WSN-specific features. Distinguishing between high-risk infectious (H) and low-risk infectious (L) nodes reflects differences in node roles and transmission capabilities, which is critical for propagation accuracy [16], unlike the homogeneous infectious classes assumed in [17,18]. Furthermore, the introduction of vaccinated (V) and quarantined (Q) compartments incorporates proactive protection and isolation mechanisms, which are absent in basic models but essential for WSN security [19]. Compared to recent works such as [20,21], our model further subdivides infectious states to better align with the heterogeneous communication patterns observed in WSNs. In contrast to [22], which focuses on detection mechanisms, our work models propagation dynamics using a macroscopic compartmental approach suitable for large-scale WSNs.

For the model to be mathematically meaningful, we examine its steady states and derive a basic reproduction number. Let

X = {(S, V, E, H, L, Q, R)}^{⊤}

. System (9) can be written as

\frac{d X}{d t} = f (X),

(10)

with f Lipschitz continuous.

At a malware-free steady state, no exposed, infected or quarantined nodes exist. Setting

E = H = L = Q = 0

in (9) and solving yields

X_{0} = (0, 1, 0, 0, 0, 0, 0) .

(11)

This means the entire population is vaccinated. The derivation is straightforward: the first equation gives

- ϵ S + γ R = 0

; with

R = 0

we get

S = 0

. The second equation then holds automatically. Normalization of

S + V + E + H + L + Q + R = 1

forces

V = 1

.

Following the next-generation method, we isolate the infected compartments. Let

Y = {(E, H, L, Q)}^{⊤}

. System (9) becomes

\frac{𝜕 Y}{𝜕 t} = F - V,

(12)

where

F = (\begin{matrix} (α_{1} S + α_{3} V) H + (α_{2} S + α_{4} V) L \\ 0 \\ 0 \\ 0 \end{matrix}), V = (\begin{matrix} (β_{1} + β_{2}) E \\ - β_{1} E + δ H \\ - β_{2} E + η L \\ - δ H + η Q \end{matrix}) .

(13)

Let f and v be the Jacobians of

F

and

V

with respect to Y, evaluated at

X_{0}

:

f = {\frac{𝜕 F}{𝜕 Y}|}_{X_{0}} = (\begin{matrix} 0 & α_{1} S^{0} + α_{3} V^{0} & α_{2} S^{0} + α_{4} V^{0} & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}) = (\begin{matrix} 0 & α_{3} & α_{4} & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}),

(14)

v = {\frac{𝜕 V}{𝜕 Y}|}_{X_{0}} = (\begin{matrix} β_{1} + β_{2} & 0 & 0 & 0 \\ - β_{1} & δ & 0 & 0 \\ - β_{2} & 0 & η & 0 \\ 0 & - δ & 0 & η \end{matrix}) .

(15)

Computing

v^{- 1}

gives

v^{- 1} = (\begin{matrix} \frac{1}{β_{1} + β_{2}} & 0 & 0 & 0 \\ \frac{β_{1}}{δ (β_{1} + β_{2})} & \frac{1}{δ} & 0 & 0 \\ \frac{β_{2}}{η (β_{1} + β_{2})} & 0 & \frac{1}{η} & 0 \\ \frac{β_{1}}{η (β_{1} + β_{2})} & \frac{1}{η} & 0 & \frac{1}{η} \end{matrix}) .

(16)

The next-generation matrix is

f v^{- 1} = (\begin{matrix} \frac{α_{3} β_{1} η + α_{4} β_{2} δ}{η δ (β_{1} + β_{2})} & \frac{α_{3}}{δ} & \frac{α_{4}}{η} & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}) .

(17)

Hence

R_{0}

, the spectral radius of

f v^{- 1}

, equals

R_{0} = \frac{α_{3} β_{1} η + α_{4} β_{2} δ}{η δ (β_{1} + β_{2})} .

(18)

Standard results from epidemic theory tell us that the stability of

X_{0}

is governed by

R_{0}

. When

R_{0} < 1

, the malware-free equilibrium is locally asymptotically stable. In practical terms, a small outbreak will not spread and eventually disappears. When

R_{0} > 1

,

X_{0}

loses stability and the malware can invade the population. The case

R_{0} = 1

marks a transcritical bifurcation where the two equilibria exchange stability.

For

R_{0} > 1

, there exists a nontrivial endemic equilibrium

X^{*} = (S^{*}, V^{*}, E^{*}, H^{*}, L^{*}, Q^{*}, R^{*})

with

H^{*} > 0

. Solving the steady-state equations yields the following relations:

\begin{matrix} E^{*} & = \frac{δ H^{*}}{β_{1}}, \\ L^{*} & = \frac{β_{2} δ H^{*}}{β_{1} η}, \\ Q^{*} & = \frac{δ H^{*}}{η}, \\ R^{*} & = \frac{(β_{1} + β_{2}) δ H^{*}}{γ β_{1}}, \\ S^{*} & = \frac{(β_{1} + β_{2}) η δ H^{*}}{η β_{1} ϵ + η β_{1} α_{1} H^{*} + α_{2} β_{2} δ H^{*}}, \\ V^{*} & = \frac{ϵ S^{*}}{α_{3} H^{*} + α_{4} L^{*}} . \end{matrix}

(19)

Substituting these into

S^{*} + V^{*} + E^{*} + H^{*} + L^{*} + Q^{*} + R^{*} = 1

gives an equation for

H^{*}

. The explicit form is omitted as our focus is numerical rather than analytical.

3. Physics-Informed Neural Network Fused with LSTM Model

In this section, a hybrid physics-informed neural network integrated with long short-term memory (PINN + LSTM) is proposed to solve the numerical solution of the malware propagation system (9) accurately. The traditional fully connected PINN treats time as a static input feature and lacks the ability to capture long-term temporal dependencies in dynamic systems, which easily leads to accumulated errors and non-physical drift in long-time predictions. To address this limitation, the LSTM module is embedded into the PINN framework to enhance the model’s capability of memorizing historical dynamic information and modeling sequential evolution patterns, thereby improving the stability and accuracy of the numerical solution for the malware propagation system.

The PINN + LSTM model inherits the core advantage of PINNs by encoding physical laws into the training process via a physics-informed loss function. Meanwhile, it leverages the LSTM’s recursive memory mechanism to capture the temporal correlation of the dynamic system. The overall architecture of the model consists of four key components: an input layer, a time embedding layer, a stacked LSTM encoder, and a fully connected decoder. The detailed structure and mathematical formulation are presented as follows.

3.1. Network Architecture and Forward Propagation

The input of the PINN + LSTM model is the scalar time variable

t \in [t_{0}, t_{T}]

(where

t_{0} = 0

is the initial time and

t_{T}

is the terminal time), and the output is the predicted value of the 7-dimensional state vector

\hat{X} (t) = {[\hat{S} (t), \hat{V} (t), \hat{E} (t), \hat{H} (t), \hat{L} (t), \hat{Q} (t), \hat{R} (t)]}^{T}

, which corresponds exactly to the solution of the malware propagation system given in Equation (9).

The time input t is first mapped to a high-dimensional feature space through the embedding layer to enhance the expressive ability of temporal features. The mathematical formulation of the embedding layer is

z_{t} = tanh (W_{emb} \cdot t + b_{emb})

(20)

where

z_{t} \in R^{d_{emb}}

is the embedded time feature vector,

W_{emb}

and

b_{emb}

are trainable parameters, and

tanh (\cdot)

is the hyperbolic tangent activation function.

To capture the dynamic evolutionary characteristics of the malware propagation system, an LSTM encoder is introduced to memorize the historical states of

\hat{S}, \hat{V}, \hat{E}, \hat{H}, \hat{L}, \hat{Q}, \hat{R}

during temporal propagation. Based on the embedded feature

z_{t}

, the LSTM updates its internal states through four gate mechanisms, which are mathematically defined as follows:

Forget Gate ( $f_{t}$ ): Controls the retention of historical dynamic information related to the malware propagation states.

$f_{t} = σ (W_{f} \cdot [z_{t}, h_{t - 1}] + b_{f})$

(21)

where $σ (\cdot)$ is the sigmoid activation function that constrains gate values between 0 and 1, $W_{f}$ is the weight matrix of the LSTM gates, $h_{t - 1}$ is the hidden state at the previous time step, initialized as $h_{0} = 0$ , and $b_{f}$ is the bias vector of the LSTM gates.
Input Gate ( $i_{t}$ ): Regulates the update of new temporal information for the epidemic states.

$i_{t} = σ (W_{i} \cdot [z_{t}, h_{t - 1}] + b_{i})$

(22)

where $W_{i}$ is the weight matrix of the LSTM gates, and $b_{i}$ is the bias vector of the LSTM gates.
Cell State Update: Integrates historical memory and new features to maintain continuous evolution of the malware propagation trend.

${\tilde{c}}_{t} = tanh (W_{c} \cdot [z_{t}, h_{t - 1}] + b_{c})$

(23)

$c_{t} = f_{t} ⊙ c_{t - 1} + i_{t} ⊙ {\tilde{c}}_{t}$

(24)

where ⊙ is the element-wise (Hadamard) product, $c_{t} \in R^{d_{hid}}$ is the cell state that maintains long-term memory of state variables $\hat{S}, \hat{V}, \hat{E}, \hat{H}, \hat{L}, \hat{Q}, \hat{R}$ , $W_{c}$ is the weight matrix of the LSTM gates, $b_{c}$ is the bias vector of the LSTM gates, and $c_{t - 1}$ is the cell states at the previous time step, initialized as $c_{0} = 0$ .
Output Gate and Hidden State: Outputs temporal features that reflect the evolutionary trend of the malware propagation system.

$o_{t} = σ (W_{o} \cdot [z_{t}, h_{t - 1}] + b_{o})$

(25)

$h_{t} = o_{t} ⊙ tanh (c_{t})$

(26)

where $h_{t} \in R^{d_{hid}}$ is the hidden state that records the evolutionary features of the malware propagation system, $W_{o}$ is the weight matrix of the LSTM gates, and $b_{o}$ is the bias vector of the LSTM gates.

The decoder is a fully connected PINN-style network that maps the LSTM hidden state

h_{t}

to the physical state variables of the malware propagation system. It outputs the predicted solution

\hat{X} (t)

that satisfies the dynamic constraints of System (9):

\begin{matrix} h_{dec 1} & = tanh (W_{dec 1} \cdot h_{t} + b_{dec 1}), \\ h_{dec 2} & = tanh (W_{dec 2} \cdot h_{dec 1} + b_{dec 2}), \\ \hat{X} (t) & = W_{out} \cdot h_{dec 2} + b_{out} . \end{matrix}

(27)

where

h_{dec 1}, h_{dec 2} \in R^{d_{hid}}

are the hidden layer features of the decoder,

W_{dec 1}, W_{dec 2}, W_{out}

are the weight matrices, and

b_{dec 1}, b_{dec 2}, b_{out}

are the bias vectors.

By combining LSTM-based temporal memory and PINN-based physical constraints, the PINN + LSTM model accurately learns the dynamic evolution of

\hat{S}, \hat{V}, \hat{E}, \hat{H}, \hat{L}, \hat{Q}, \hat{R}

while strictly satisfying the malware propagation ODE system.

3.2. Training Objective and Loss Function

The training of the PINN + LSTM model is guided by a physics-informed loss function, which enforces the model output to satisfy both the initial conditions of the malware propagation system and the dynamic constraints described by the ODEs. The total loss function

L o s s

consists of two components: the initial condition loss

L o s s_{ic}

to ensure consistency with the initial state and the physics-informed residual loss

L o s s_{f}

to enforce compliance with the ODE system, which is formulated as

L o s s = L o s s_{ic} + λ \cdot L o s s_{f}

(28)

where

λ

is the weight coefficient of the residual loss, used to balance the importance of the initial condition and physical constraints.

The initial condition loss measures the squared error between the model’s predicted output at the initial time

t = 0

and the given initial state values of the malware propagation system. The mathematical formulation is

\begin{matrix} L o s s_{ic} = & {(\hat{S} (0) - S_{0})}^{2} + {(\hat{V} (0) - V_{0})}^{2} + {(\hat{E} (0) - E_{0})}^{2} + \\ {(\hat{H} (0) - H_{0})}^{2} + {(\hat{L} (0) - L_{0})}^{2} + {(\hat{Q} (0) - Q_{0})}^{2} + {(\hat{R} (0) - R_{0})}^{2} \end{matrix}

(29)

where

S_{0}, V_{0}, E_{0}, H_{0}, L_{0}, Q_{0}, R_{0}

are the given initial values of the seven state variables, which are consistent with the initial conditions of System (9), and

\hat{S} (0), \hat{V} (0), \dots, \hat{R} (0)

are the predicted values of the PINN + LSTM model at

t = 0

.

The residual loss is the core of the PINN framework, which forces the model output to satisfy the ODE constraints of the malware propagation system. For a set of collocation points

{t_{j}}_{j = 1}^{l}

(randomly sampled within the time interval

[0, t_{T}]

, l is the number of collocation points), the residual of each state variable is defined as the difference between the temporal derivative of the model’s predicted value and the corresponding right-hand side of the ODE in System (9).

To obtain the temporal derivatives

\frac{d \hat{S}}{d t}, \frac{d \hat{V}}{d t}, \dots, \frac{d \hat{R}}{d t}

needed for residual evaluation, we apply automatic differentiation. This approach enables accurate and efficient computation of output derivatives with respect to the input time t without manual symbolic derivation.

Although the LSTM layer updates its hidden state

h_{t}

recursively from the preceding state

h_{t - 1}

, the full network is regarded as a continuous mapping from the scalar input t to the estimated state vector

\hat{X} (t)

. The internal recurrence of the LSTM defines the structure of the approximator rather than a discrete dynamic update, so the network remains differentiable with respect to t.

The derivatives are computed through the chain rule in network backpropagation. The total derivative of the output state

\hat{X} (t)

with respect to time is expressed as

\frac{d \hat{X} (t)}{d t} = \frac{𝜕 \hat{X}}{𝜕 h_{dec 2}} \cdot \frac{𝜕 h_{dec 2}}{𝜕 h_{dec 1}} \cdot \frac{𝜕 h_{dec 1}}{𝜕 h_{t}} \cdot \frac{𝜕 h_{t}}{𝜕 z_{t}} \cdot \frac{𝜕 z_{t}}{𝜕 t},

(30)

where each term is evaluated analytically during backpropagation. The term

\frac{𝜕 h_{t}}{𝜕 z_{t}}

naturally captures the recursive update rule

h_{t} = LSTM (z_{t}, h_{t - 1})

within the LSTM cell. The resulting derivative represents the exact continuous-time gradient of the predicted trajectory, which is mathematically consistent with the derivative required by the ODE residual. This construction is supported by existing research on physics-informed recurrent networks, whose convergence can be guaranteed under mild regularity conditions on the dynamical system.

Based on the automatic differentiation results, the residual functions for each state variable are defined as follows:

\{\begin{matrix} \begin{matrix} f_{S} (t_{j}) & = \frac{d \hat{S}}{d t} |_{t_{j}} - [- ϵ \hat{S} (t_{j}) - \hat{S} (t_{j}) (α_{1} \hat{H} (t_{j}) + α_{2} \hat{L} (t_{j})) + γ \hat{R} (t_{j})], \\ f_{V} (t_{j}) & = \frac{d \hat{V}}{d t} |_{t_{j}} - [ϵ \hat{S} (t_{j}) - \hat{V} (t_{j}) (α_{3} \hat{H} (t_{j}) + α_{4} \hat{L} (t_{j}))], \\ f_{E} (t_{j}) & = \frac{d \hat{E}}{d t} |_{t_{j}} - [(α_{1} \hat{S} (t_{j}) + α_{3} \hat{V} (t_{j})) \hat{H} (t_{j}) + (α_{2} \hat{S} (t_{j}) + α_{4} \hat{V} (t_{j})) \hat{L} (t_{j}) - (β_{1} + β_{2}) \hat{E} (t_{j})], \\ f_{H} (t_{j}) & = \frac{d \hat{H}}{d t} |_{t_{j}} - [β_{1} \hat{E} (t_{j}) - δ \hat{H} (t_{j})], \\ f_{L} (t_{j}) & = \frac{d \hat{L}}{d t} |_{t_{j}} - [β_{2} \hat{E} (t_{j}) - η \hat{L} (t_{j})], \\ f_{Q} (t_{j}) & = \frac{d \hat{Q}}{d t} |_{t_{j}} - [δ \hat{H} (t_{j}) - η \hat{Q} (t_{j})], \\ f_{R} (t_{j}) & = \frac{d \hat{R}}{d t} |_{t_{j}} - [η (\hat{Q} (t_{j}) + \hat{L} (t_{j})) - γ \hat{R} (t_{j})] . \end{matrix} \end{matrix}

(31)

The residual loss is the average of the squared residuals over all collocation points, which quantifies the degree to which the model output violates the ODE constraints:

L o s s_{f} = \frac{1}{l} \sum_{j = 1}^{l} [f_{S}^{2} (t_{j}) + f_{V}^{2} (t_{j}) + f_{E}^{2} (t_{j}) + f_{H}^{2} (t_{j}) + f_{L}^{2} (t_{j}) + f_{Q}^{2} (t_{j}) + f_{R}^{2} (t_{j})]

(32)

3.3. Model Training Process

The trainable parameters of the PINN + LSTM model include all weight matrices and bias vectors of the embedding layer, LSTM encoder, and decoder:

Θ = {W_{emb}, b_{emb}, W_{f}, b_{f}, W_{i}, b_{i}, W_{c}, b_{c}, W_{o}, b_{o}, W_{dec 1}, b_{dec 1}, W_{dec 2}, b_{dec 2}, W_{out}, b_{out}} .

The training process aims to minimize the total loss function

L o s s

with respect to

Θ

, which is implemented using the Adam optimizer to adjust the parameters iteratively. The learning rate is dynamically adjusted using a learning rate scheduler, which reduces the learning rate by a factor of 0.5 when the loss does not decrease for 500 consecutive epochs, to improve convergence stability.

The training steps are summarized as follows:

Step 1: Initialize all trainable parameters $Θ$ using the Xavier normal initialization method to avoid vanishing/exploding gradients at the start of training;
Step 2: Randomly sample l collocation points ${t_{j}}_{j = 1}^{l}$ within the time interval $[0, t_{T}]$ ;
Step 3: For each epoch, compute the model output $\hat{X} (t)$ for the collocation points and initial time $t = 0$ via forward propagation;
Step 4: Compute the initial condition loss $L o s s_{ic}$ and the residual loss $L o s s_{f}$ (using automatic differentiation to compute temporal derivatives);
Step 5: Compute the total loss $L o s s$ and backpropagate the gradient to update the parameters $Θ$ using the Adam optimizer;
Step 6: Repeat steps 2–5 until the loss converges to a stable value or the maximum number of epochs (10,000 epochs in this study) is reached.

Compared with a traditional fully connected PINN, the PINN + LSTM model benefits from the LSTM module in two key aspects: it captures temporal dependencies and remembers historical dynamic information, while achieving more stable long-term prediction with less accumulated error and non-physical drift.

4. Numerical Simulations and Performance Analysis

In this section, a series of numerical simulations is carried out to verify the effectiveness of the proposed PINN + LSTM model for solving the malware propagation dynamic system in a wireless sensor network. All simulations are implemented in Python 3.7. The traditional fully connected PINN [23], the Fourier feature-based PINN, and the proposed PINN + LSTM are comprehensively compared, where the fourth-order Runge–Kutta (RK4) method is used as a high-precision reference solution. The comparison covers training convergence, state variable prediction, multi-dimensional error metrics, long-term stability, and error distribution. All experiments are implemented in PyTorch (1.13.1+cpu) with the same initial conditions, network parameters, and training settings for fairness.

The simulation interval is set to

t \in [0, 80]

, with 1500 uniform sampling points for testing and 500 collocation points for training. The parameters of the epidemic dynamics system are given as

\begin{matrix} ϵ = 0.01, α_{1} = 0.025, α_{2} = 0.03, α_{3} = 0.02, α_{4} = 0.015, \\ β_{1} = 0.3, β_{2} = 0.45, δ = 0.05, η = 0.08, γ = 0.04 . \end{matrix}

The initial states of the seven compartments are

[S_{0}, V_{0}, E_{0}, H_{0}, L_{0}, Q_{0}, R_{0}] = [0.5, 0.3, 0.1, 0.1, 0.0, 0.0, 0.0] .

Both networks are trained for 10,000 epochs using the Adam optimizer with an initial learning rate of

10^{- 3}

. The PINN uses four hidden layers with 128 neurons per layer. The PINN + LSTM uses an embedding layer, two LSTM layers with 128 hidden units, and a two-layer fully connected decoder. To ensure reproducibility, the penalty coefficient is set to

λ = 100

and the random seed is fixed at 42.

4.1. Training Dynamics and Convergence

Figure 2 shows the training loss curves of the traditional fully connected PINN, the Fourier feature-based PINN, and the proposed PINN + LSTM, including total loss, ODE residual loss, initial condition loss, and learning rate decay.

A comparative analysis of the training logs reveals distinct performance characteristics among the traditional PINN, the PINN + LSTM model, and the Fourier feature-based PINN for solving differential equations. In terms of convergence speed, the traditional PINN’s total loss fell from an initial 5.18 to a final value of

3.13 \times 10^{- 6}

, a reduction of approximately six orders of magnitude. The Fourier PINN’s total loss decreased from 3.94 to

1.39 \times 10^{- 5}

, achieving a five-order-of-magnitude reduction. By contrast, the PINN + LSTM model saw its total loss drop from 5.45 to

9.40 \times 10^{- 8}

, a reduction of over seven orders of magnitude. This suggests that integrating the LSTM structure notably improves both convergence rate and final approximation accuracy. The PINN + LSTM’s final ODE loss reached

4.34 \times 10^{- 9}

, nearly three orders of magnitude lower than that of the traditional PINN and the Fourier PINN.

The PINN + LSTM exhibited a notable loss rebound at epoch 3000, with total loss jumping from

2.49 \times 10^{- 7}

to

3.12 \times 10^{- 4}

, signifying temporary instability during training. This phenomenon is attributed to the structural oscillation during optimization, which is a normal behavior when the model balances learning long-term temporal dynamics and satisfying physical constraints, rather than a sign of training failure. Regarding the satisfaction of initial conditions, the traditional PINN achieved an extremely precise IC loss of

1.91 \times 10^{- 13}

, followed by the Fourier PINN with

9.20 \times 10^{- 11}

, while the PINN + LSTM recorded an IC loss of

8.97 \times 10^{- 10}

. In summary, the traditional PINN offers a highly stable training process with excellent adherence to initial conditions. The Fourier PINN delivers moderate performance in both convergence accuracy and stability. In contrast, the PINN + LSTM achieves superior solution accuracy and ODE residual minimization, albeit at the cost of minor training instability, making it better suited for complex dynamical problems where high precision is the top priority.

4.2. State Variable Prediction

Figure 3, Figure 4, Figure 5, Figure 6, Figure 7, Figure 8 and Figure 9 illustrate the predicted trajectories of all seven state variables. Both models can roughly capture the evolutionary trend. However, PINN + LSTM achieves significantly higher precision, especially for the key compartments E, H, and L, which dominate the epidemic transmission.

Examining each state variable individually, all three methods generally follow the overall evolutionary trend given by the RK4 benchmark. For the Vaccinated (V) compartment, PINN + LSTM achieves exceptional accuracy, while Fourier PINN performs noticeably worse than both PINN and PINN + LSTM. For Susceptible (S) individuals, PINN shows clear deviations in both the declining rate and long-term steady state, while PINN + LSTM maintains tight alignment with the reference solution. In the Exposed (E) compartment, PINN + LSTM closely reproduces the amplitude and timing of the epidemic curve. For High-Risk Infectious (H) and Low-Risk Infectious (L) compartments, PINN + LSTM achieves outstanding consistency with RK4, while both PINN and Fourier PINN display mild to moderate mismatches. In the Quarantined (Q) compartment, Fourier PINN performs significantly worse than PINN and PINN + LSTM, with clear deviations and unstable oscillations, while PINN drifts gradually and PINN + LSTM remains nearly identical to RK4. The Recovered (R) compartment further confirms the superior performance of PINN + LSTM. Overall, although Fourier PINN improves upon the vanilla PINN in some compartments, it performs notably worse in the Vaccinated (V) and Quarantined (Q) states, and PINN + LSTM remains the most accurate and stable method across all variables. This superior performance arises from LSTM’s inherent temporal memory and sequential gating mechanism, which effectively capture time-varying dynamics while complying with physical ODE constraints.

4.3. Error Analysis and Comparison

Figure 10 and Table 1 present the Relative Error (RE), Absolute Error (AE), and

L_{2}

error for each state variable.

Based on the quantitative results, the PINN + LSTM model significantly outperforms both the standard PINN and Fourier PINN across all error metrics. The average relative error is reduced from

7.20 \times 10^{- 2}

for PINN to

3.88 \times 10^{- 3}

for PINN + LSTM, corresponding to a remarkable improvement of 94.61%. Similarly, the average absolute error and average L2 error are reduced by approximately 97.19% compared with the traditional PINN. Meanwhile, Fourier PINN fails to provide competitive accuracy and performs noticeably worse, particularly in the Vaccinated (V) and Quarantined (Q) compartments.

Examining each state variable individually, PINN + LSTM achieves exceptional precision in slowly varying compartments such as S and V, with relative errors of only

3.19 \times 10^{- 4}

and

2.23 \times 10^{- 4}

, representing accuracy improvements exceeding 98% over PINN. For infection-dominated states, including H, L, Q, and R, the relative error reductions all exceed 93%, and the relative errors are uniformly controlled within the order of

10^{- 3}

. Even in the most dynamical Exposed (E) compartment, the relative error is reduced to

6.06 \times 10^{- 3}

.

In sharp contrast, the standard PINN shows relatively large deviations, with a relative error as high as

1.33 \times 10^{- 1}

for Low-Risk Infectious (L), and an absolute error for Susceptible (S) nearly 123 times larger than that of PINN + LSTM. Fourier PINN performs even worse, especially in V and Q, with relative errors reaching

1.91 \times 10^{- 1}

and

5.46 \times 10^{- 1}

, respectively. These results confirm the clear limitations of purely fully connected PINN and Fourier PINN in capturing sharp transitions and long-term evolutionary dynamics.

Overall, the superior performance of PINN + LSTM validates that introducing temporal memory via the LSTM structure effectively captures the time-dependent characteristics of epidemic transmission and peak dynamics, yielding overwhelming advantages in prediction accuracy, stability, and long-term simulation consistency.

To further validate the robustness of the proposed model in a realistic scenario, we conducted additional experiments by adding Gaussian noise to the observed training data points at three representative levels: 1%, 5%, and 10%. The average relative errors of the PINN + LSTM, standard PINN, and Fourier PINN under different noise intensities are compared, and the results are illustrated in the following figure (Figure 11).

The robustness of the proposed model is validated under three levels of Gaussian noise: 1%, 5%, and 10%. At 1% noise, PINN + LSTM achieves an average relative error of 0.0049, which is far lower than 0.0707 for PINN and 0.2782 for Fourier PINN. When the noise level increases to 5%, the errors of PINN and Fourier PINN rise sharply to 0.1225 and 0.4532, respectively, while PINN + LSTM maintains an extremely low error of 0.0011. Even under strong 10% noise, PINN + LSTM still exhibits superior performance with an error of 0.0017, whereas PINN and Fourier PINN yield much higher errors of 0.0384 and 0.4431. These results confirm that PINN + LSTM possesses stronger noise immunity and more stable dynamic learning ability than both PINN and Fourier PINN in realistic noisy environments.

5. Conclusions

In this paper, we have proposed a novel hybrid framework combining physics-informed neural networks (PINNs) with long short-term memory (LSTM) to model malware propagation dynamics in wireless sensor networks. A seven-compartment SVEHLQR epidemiological model was developed to capture the complex physical interactions among nodes, classifying them into Susceptible (S), Vaccinated (V), Exposed (E), High-Risk Infectious (H), Low-Risk Infectious (L), Quarantined (Q), and Recovered (R) categories. This study’s key contribution lies in integrating LSTM’s temporal memory capability into the PINN architecture, enabling the model to effectively learn time-dependent propagation characteristics.

Using the fourth-order Runge–Kutta (RK4) method as the benchmark, we conducted extensive numerical comparisons among the proposed PINN + LSTM, standard PINN, and Fourier PINN. Experimental results demonstrate that the PINN + LSTM model significantly outperforms both baseline methods across all seven compartments. The average relative error achieved by PINN + LSTM is

3.88 \times 10^{- 3}

, compared to

7.20 \times 10^{- 2}

for PINN and

2.81 \times 10^{- 1}

for Fourier PINN, representing a remarkable 94.6% improvement in accuracy over the conventional PINN and an even greater advantage over the Fourier-enhanced version. Particularly noteworthy is its performance on the Vaccinated (V) compartment, where PINN + LSTM attains near-perfect agreement with the RK4 benchmark, with a relative error as low as

2.23 \times 10^{- 4}

. Even on the most challenging compartment, Low-Risk Infectious (L), which exhibited a relative error of

1.33 \times 10^{- 1}

for standard PINN and far higher values for Fourier PINN, the proposed method reduces the error to

8.21 \times 10^{- 3}

, demonstrating its robustness in capturing complex dynamic behaviors.

It is necessary to clarify why a deep learning solver (PINN + LSTM) is adopted here, even though the 7-dimensional SVEHLQR ODE is low-dimensional, and grid-based methods like RK4 are theoretically feasible. We use RK4 not because it is inadequate for this specific low-dimensional scenario, but as a high-precision benchmark to validate the PINN + LSTM framework. While RK4 performs well in low-dimensional problems, it becomes computationally prohibitive in high-dimensional real-world WSN scenarios (e.g., more compartments, spatial–temporal propagation) due to the curse of dimensionality. Validating PINN + LSTM against RK4 in a low-dimensional setting ensures its reliability, laying the groundwork for its application in complex, high-dimensional scenarios where RK4 is no longer sufficient.

Despite its superior accuracy, the proposed PINN + LSTM framework has certain limitations. The training process is computationally more expensive than standard PINN and Fourier PINN due to the additional LSTM layers, which may hinder its deployment in highly resource-constrained scenarios. Moreover, while the model excels at learning from RK4-generated reference data, its performance on real-world noisy measurements still requires thorough validation. The current study focuses on a fixed parameter configuration, and its generalization ability across different network sizes, topologies, and malware types remains to be fully explored.

Future work will explore several promising directions. First, we plan to validate the model on real-world malware propagation datasets to evaluate its practical reliability. Second, we aim to integrate attention mechanisms or transformer-style structures to further strengthen its long-range temporal dependency modeling. Third, extending the framework to support adaptive and online learning would facilitate real-time threat monitoring and response. Fourth, while the proposed SVEHLQR model represents a meaningful advance for WSN malware modeling by explicitly distinguishing high-risk and low-risk infections, recent research trends from 2025 to 2026 highlight a growing emphasis on spatial–temporal dynamics, especially in the Internet of Underwater Things (IoUT), as well as topology-aware propagation models suitable for the Internet of Vehicles (IoV); thus, we will expand the model to capture these spatial–temporal and topology-aware dynamics to improve its relevance to emerging systems such as IoUT and IoV. Finally, combining the proposed data-driven physics-informed approach with optimal control strategies could lead to a unified framework for both accurate prediction and effective mitigation of malware spread in wireless sensor networks.

Author Contributions

Conceptualization, R.Z. and K.Z.; Methodology, K.Z.; Software, Z.C.; Validation, S.S. and J.P.; Formal analysis, S.S.; Investigation, R.Z.; Writing—original draft, R.Z.; Writing—review & editing, S.S. and J.P.; Visualization, Z.C.; Supervision, K.Z. and S.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

No external data were used in this study. All results were obtained through numerical simulations based on the model described in the manuscript.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Raissi, M.; Perdikaris, P.; Karniadakis, G.E. Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J. Comput. Phys. 2019, 378, 686–707. [Google Scholar] [CrossRef]
Lin, S.; Chen, Y. Gradient-enhanced physics-informed neural networks based on transfer learning for inverse problems of the variable coefficient differential equations. Phys. D Nonlinear Phenom. 2024, 459, 134023. [Google Scholar] [CrossRef]
Alla, A.; Bertaglia, G.; Calzola, E. A PINN Approach for the Online Identification and Control of Unknown PDEs. J. Optim. Theory Appl. 2025, 206, 8. [Google Scholar] [CrossRef]
Ma, P.L.; Pu, J.C.; Peng, W.Q. PINN for solving localized wave solutions and inverse problems involving the Gerdjikov-Ivanov equation. Nonlinear Dyn. 2025, 113, 26547–26559. [Google Scholar] [CrossRef]
Wu, Z.; Zhang, H.; Ye, H.; Zhang, H.; Zheng, Y.; Guo, X. PINN enhanced extended multiscale finite element method for fast mechanical analysis of heterogeneous materials. Acta Mech. 2024, 235, 4895–4913. [Google Scholar] [CrossRef]
Rajaperumal, T.A.; Chinnappan, C.C. Integrating data-driven and physics-based approaches for robust wind power prediction: A comprehensive ML-PINN-Simulink framework. Sci. Rep. 2025, 15, 29102. [Google Scholar] [CrossRef]
Zhang, S.; Zhang, C.; Han, X.; Wang, B. MRF-PINN: A multi-receptive-field convolutional physics-informed neural network for solving partial differential equations. Comput. Mech. 2025, 75, 1137–1163. [Google Scholar] [CrossRef]
Nguyen, T.; Nguyen, D.; Pham, K.; Tran, T. MP-PINN: A Multi-phase Physics-Informed Neural Network for Epidemic Forecasting. In Data Science and Machine Learning, AusDM 2024; Balasubramaniam, T., Liao, K., Vetrova, V., Benavides-Prado, D., Boo, Y.L., Zhao, Y., Eds.; Communications in Computer and Information Science; Springer: Singapore, 2026; Volume 2325. [Google Scholar] [CrossRef]
Cheng, H.; Mao, Y. Physics-informed epidemic prediction for irregularly sampled spatio-temporal sequence with missing values. Appl. Intell. 2025, 55, 967. [Google Scholar] [CrossRef]
Cheng, H.; Mao, Y.; Jia, X. A framework based on physics-informed graph neural ODE: For continuous spatial-temporal pandemic prediction. Appl. Intell. 2024, 54, 12661–12675. [Google Scholar] [CrossRef]
Tranquilli, P.; Ricketson, L.; Haut, T.; Hittinger, J. Stage-local partitioned two-step Runge-Kutta methods for large systems of ordinary differential equations. BIT Numer. Math. 2025, 65, 49. [Google Scholar] [CrossRef]
Xiong, Y.; Wang, H.; Liu, G.; Li, Y.; Jiang, T. Graph neural ordinary differential equations for epidemic forecasting. CCF Trans. Pervasive Comput. Interact. 2024, 6, 281–295. [Google Scholar] [CrossRef]
Hamad, A.H.; Hussein, N.K.; Abdulghani, A.M. A Deep Learning Paradigm for Intrusion Detection in Unmanned Aerial Vehicle Networks Using Extended LSTM. Int. J. Intell. Eng. Syst. 2025, 18, 507–523. [Google Scholar] [CrossRef]
Hoang, M.T. Mathematical analysis and numerical simulation of a generalized epidemiological model for malware propagation. Nonlinear Dyn. 2025, 114, 53. [Google Scholar] [CrossRef]
Quiroga-Sánchez, L.; Montoya, G.A.; Lozano-Garzon, C. The SEIRS-NIMFA epidemiological model for malware propagation analysis in IoT networks. Cybersecurity 2025, 8, 2. [Google Scholar] [CrossRef]
Moradi, L.; Farsimadan, E.; D’Angelo, G.; Carpentieri, B.; Palmieri, F. Epidemic Analysis of the Propagation of Multiple Malware Infectious in Wireless Sensor Networks. In Advanced Information Networking and Applications, AINA 2025; Barolli, L., Ed.; Lecture Notes on Data Engineering and Communications Technologies; Springer: Cham, Switzerland, 2025; Volume 252. [Google Scholar] [CrossRef]
Kang, H.; Sun, M.; Yu, Y.; Fu, X.; Bao, B. Spreading dynamics of an SEIR model with delay on scale-free networks. IEEE Trans. Netw. Sci. Eng. 2020, 7, 489–496. [Google Scholar] [CrossRef]
Liu, Q.; Li, H. Global dynamics analysis of an SEIR epidemic model with discrete delay on complex network. Phys. A Stat. Mech. Its Appl. 2019, 524, 289–296. [Google Scholar] [CrossRef]
Verma, C.; Gupta, C.P. Effect of vaccination on stability of wireless sensor network against malware attack: An epidemiological model. SN Comput. Sci. 2024, 5, 240. [Google Scholar] [CrossRef]
Dong, N.P.; Long, H.V.; Son, N.T.K. The dynamical behaviors of fractional-order SE1E2IQR epidemic model for malware propagation on Wireless Sensor Network. Commun. Nonlinear Sci. Numer. Simul. 2022, 111, 106428. [Google Scholar] [CrossRef]
Hosseini, S.; Azgomi, M.A. The dynamics of an SEIRS-QV malware propagation model in heterogeneous networks. Phys. A Stat. Mech. Its Appl. 2018, 512, 803–817. [Google Scholar] [CrossRef]
Song, Y.; Zhang, D.; Wang, J.; Wang, Y.; Wang, Y.; Ding, P. Application of deep learning in malware detection: A review. J. Big Data 2025, 12, 99. [Google Scholar] [CrossRef]
Rahman, M.A.; Zhang, T.; Lu, Y. PINN-CHK: Physics-informed neural network for high-fidelity prediction of early-age cement hydration kinetics. Neural Comput. Appl. 2024, 36, 13665–13687. [Google Scholar] [CrossRef]

Figure 1. State transition relationships of nodes in WSN.

Figure 2. Training dynamics comparison: total loss, ODE loss, initial condition loss, and learning rate schedule during training.

Figure 3. Comparison of predicted S.

Figure 4. Comparison of predicted E.

Figure 5. Comparison of predicted H.

Figure 6. Comparison of predicted L.

Figure 7. Comparison of predicted Q.

Figure 8. Comparison of predicted V.

Figure 9. Comparison of predicted R.

Figure 10. Error decomposition: relative error, absolute error,

L_{2}

error, and pointwise error over time.

Figure 10. Error decomposition: relative error, absolute error,

L_{2}

error, and pointwise error over time.

Figure 11. Average relative errors of PINN, PINN + LSTM, and Fourier PINN under 1%, 5%, and 10% Gaussian noise.

Table 1. Quantitative error comparison of PINN + LSTM, PINN, and Fourier PINN across all state variables.

Method	Metric	S	V	E	H	L	Q	R	Avg
PINN + LSTM	Rel	3.19 × 10⁻⁴	2.23 × 10⁻⁴	6.06 × 10⁻³	3.38 × 10⁻³	8.21 × 10⁻³	3.53 × 10⁻³	5.41 × 10⁻³	3.88 × 10⁻³
	Abs	9.75 × 10⁻⁵	8.88 × 10⁻⁵	5.31 × 10⁻⁵	1.36 × 10⁻⁴	1.48 × 10⁻⁴	8.32 × 10⁻⁵	2.96 × 10⁻⁴	1.29 × 10⁻⁴
	L2	1.28 × 10⁻⁴	1.01 × 10⁻⁴	5.93 × 10⁻⁵	1.84 × 10⁻⁴	1.81 × 10⁻⁴	9.88 × 10⁻⁵	3.97 × 10⁻⁴	1.64 × 10⁻⁴
PINN	Rel	3.54 × 10⁻²	2.08 × 10⁻²	4.63 × 10⁻²	7.04 × 10⁻²	1.33 × 10⁻¹	9.93 × 10⁻²	9.85 × 10⁻²	7.20 × 10⁻²
	Abs	1.20 × 10⁻²	5.96 × 10⁻³	3.71 × 10⁻⁴	2.94 × 10⁻³	2.37 × 10⁻³	2.34 × 10⁻³	6.19 × 10⁻³	4.59 × 10⁻³
	L2	1.42 × 10⁻²	9.39 × 10⁻³	4.54 × 10⁻⁴	3.83 × 10⁻³	2.93 × 10⁻³	2.78 × 10⁻³	7.23 × 10⁻³	5.83 × 10⁻³
Fourier PINN	Rel	5.59 × 10⁻²	1.91 × 10⁻¹	1.75 × 10⁻¹	2.89 × 10⁻¹	3.37 × 10⁻¹	5.46 × 10⁻¹	3.77 × 10⁻¹	2.81 × 10⁻¹
	Abs	1.84 × 10⁻²	7.00 × 10⁻²	1.40 × 10⁻³	1.43 × 10⁻²	6.76 × 10⁻³	1.38 × 10⁻²	2.31 × 10⁻²	2.11 × 10⁻²
	L2	2.24 × 10⁻²	8.61 × 10⁻²	1.71 × 10⁻³	1.57 × 10⁻²	7.43 × 10⁻³	1.53 × 10⁻²	2.77 × 10⁻²	2.52 × 10⁻²

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhang, R.; Zhou, K.; Shen, S.; Pang, J.; Cao, Z. PINN-LSTM: A High-Precision Physics-Informed Neural Network for Solving Malware Propagation Dynamics in Wireless Sensor Networks. Symmetry 2026, 18, 707. https://doi.org/10.3390/sym18050707

AMA Style

Zhang R, Zhou K, Shen S, Pang J, Cao Z. PINN-LSTM: A High-Precision Physics-Informed Neural Network for Solving Malware Propagation Dynamics in Wireless Sensor Networks. Symmetry. 2026; 18(5):707. https://doi.org/10.3390/sym18050707

Chicago/Turabian Style

Zhang, Rui, Kai Zhou, Shoufeng Shen, Jiafu Pang, and Zhiyi Cao. 2026. "PINN-LSTM: A High-Precision Physics-Informed Neural Network for Solving Malware Propagation Dynamics in Wireless Sensor Networks" Symmetry 18, no. 5: 707. https://doi.org/10.3390/sym18050707

APA Style

Zhang, R., Zhou, K., Shen, S., Pang, J., & Cao, Z. (2026). PINN-LSTM: A High-Precision Physics-Informed Neural Network for Solving Malware Propagation Dynamics in Wireless Sensor Networks. Symmetry, 18(5), 707. https://doi.org/10.3390/sym18050707

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

PINN-LSTM: A High-Precision Physics-Informed Neural Network for Solving Malware Propagation Dynamics in Wireless Sensor Networks

Abstract

1. Introduction

2. Malware Propagation Model Formulation

3. Physics-Informed Neural Network Fused with LSTM Model

3.1. Network Architecture and Forward Propagation

3.2. Training Objective and Loss Function

3.3. Model Training Process

4. Numerical Simulations and Performance Analysis

4.1. Training Dynamics and Convergence

4.2. State Variable Prediction

4.3. Error Analysis and Comparison

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI