5. Numerical Experiments
In this section, we present numerical experiments that demonstrate the effectiveness of the proposed multi-state decomposition framework for both finite-dimensional ODE systems and high-dimensional PDEs. By constructing observables that depend on multiple time steps, the method captures temporal correlations and multi-step interactions that conventional single-state approaches may fail to resolve. These examples illustrate how the generalized operator and the FIRE framework provide a systematic, flexible approach to modeling complex nonlinear dynamics.
Fair baseline comparison. To ensure a meaningful and unbiased evaluation of the FIRE framework, all baseline methods, including classical EDMD and Hankel-DMD, were tested using comparable feature classes and dictionary constructions, where applicable. Specifically, for each example, the same temporal delay embeddings, polynomial orders, and trigonometric components are provided to all methods.
Furthermore, when tuning hyperparameters such as embedding depth or polynomial order, all the methods are optimized equivalently using either cross-validation or systematic parameter sweeps, so that the baseline performance reflects their true potential under fair conditions.
Error Metrics. To evaluate the performance of the proposed framework, we employ two complementary metrics.
First, we define the absolute pointwise reconstruction error at time index
i as
where
and
denote the true and reconstructed system states, respectively. This metric captures the instantaneous deviation between the true and predicted trajectories and is particularly useful for identifying transient effects, peak mismatches, and localized reconstruction failures.
Second, we report a global relative reconstruction error over the full time horizon,
where
and
are the matrices of the true and reconstructed trajectories. This normalized measure provides a scale-invariant assessment of the overall reconstruction quality and enables fair comparison across different models, dictionaries, and experimental settings.
Together, these metrics offer both local (time-resolved) and global (aggregate) perspectives on reconstruction performance.
Example 1. Nonlinear Epidemiological Dynamics (SIR Model)
To assess the performance of the generalized transfer operator
on nonlinear dynamics arising in biological systems, we consider the classical SIR (Susceptible–Infectious–Recovered) model. This compartmental framework is governed by a bilinear infection term and thus represents a canonical benchmark for testing the ability of operator-theoretic methods to linearize and approximating intrinsically nonlinear flows. The evolution of the susceptible (
S), infectious (
I), and recovered (
R) populations is described by the following nonlinear system:
where
denotes the population size,
is the transmission rate, and
is the recovery rate.
The numerical experiment is conducted with population size
and initial conditions
,
, and
. The epidemiological parameters are set to
and
, yielding a basic reproduction number
, corresponding to a pronounced epidemic growth regime. The system is simulated over a time horizon of
days using high-resolution temporal sampling with step size
to generate snapshot data. The resulting compartmental dynamics are shown in
Figure 1.
Due to the bilinear coupling between the susceptible and infectious compartments, the SIR vector field is inherently nonlinear and non-normal. The generalized transfer operator is therefore tasked with approximating the induced flow between epidemiological compartments via a linear evolution in an appropriately lifted feature space. By constructing a dictionary of nonlinear observables, the proposed framework captures the dominant epidemic modes and provides a finite-dimensional linear representation of the underlying nonlinear dynamics.
To accurately capture the nonlinear interactions inherent in the SIR dynamics, we employ an augmented lifting dictionary that combines temporal delay embeddings with explicitly constructed nonlinear and differential features. This enriched representation enables the generalized transfer operator to approximate the nonlinear epidemic flow by a linear evolution in a suitably high-dimensional feature space, where the dominant compartmental interactions are made explicit.
Let the population state be denoted by , where , , and . The feature map is constructed by integrating three complementary sources of information:
Temporal delay embedding (Hankel structure). We employ a delay depth of , yielding a multi-step embedding that encodes the recent temporal history of the epidemic. This extended memory allows the operator to implicitly capture time-varying infection and recovery rates and enhances the identifiability of transient epidemic phases.
Nonlinear interaction observables. Motivated by the bilinear infection term
in the governing equations, we explicitly incorporate cross-product observables of the following form:
thereby embedding the dominant nonlinear coupling directly into the lifted space.
Differential information. To further align the lifted dynamics with the underlying continuous-time structure, we augment the dictionary with a numerical approximation of the recovered population derivative:
Since , this observable provides a direct linear proxy for the infectious compartment and substantially improves the spectral consistency of the learned operator. Accordingly, we include
Since
, this term provides a direct linear link to the infectious compartment, significantly improving the spectral alignment of the operator. We explicitly include observables,
as an additional feature.
Collectively, this construction yields an extended state space of dimension
. The resulting lifted feature vector is given by
so that, at each time step, the lifted state simultaneously encodes raw compartment values, nonlinear interaction terms, and differential information across all delay coordinates.
The data matrices are assembled via a standard time-shifting procedure,
resulting in matrices of dimension
, with
training snapshots. Applying Algorithm 1, we compute the finite-dimensional approximation
G of the generalized transfer operator and use it to reconstruct the system dynamics.
Figure 2 compares the reconstructed SIR trajectories obtained using the proposed FIRE framework with those produced by the Hankel-DMD method.
The FIRE-based reconstruction closely follows the true nonlinear dynamics of all compartments, accurately reproducing both the transient growth phase and the decay regime of the infectious population. In particular, the timing and magnitude of the infection peak are well captured, indicating that the learned transfer operator successfully encodes the dominant nonlinear interaction mechanisms of the epidemic process.
Regularization and conditioning. The lifted regression problem arising in the SIR example is inherently high-dimensional, with a feature space dimension of and only training snapshots. As a result, the estimation of the finite-dimensional transfer operator G from the relation is severely underdetermined and requires explicit regularization to ensure numerical stability and meaningful spectral properties.
In all numerical experiments, the operator G is computed using a truncated singular value decomposition (SVD) of the feature matrix . Only the dominant singular directions are retained, corresponding to singular values satisfying . This procedure yields a regularized low-rank approximation of the regression operator and effectively filters spurious directions associated with noise and redundant observables. The singular value spectrum of exhibits rapid decay, revealing a strong intrinsic low-rank structure induced by the combined effect of delay embeddings and nonlinear interaction features. Importantly, the dominant eigenvalues of the learned operator are observed to be robust with respect to the chosen truncation threshold.
Although Tikhonov (ridge) regularization, (
57) was not required for the results reported in this example, it remains a viable option for further stabilizing operator estimation in more severely ill-conditioned or noise-contaminated settings. In practice, SVD truncation alone proved sufficient to control conditioning and prevent overfitting.
To further assess numerical stability, we monitor the conditioning of the regression by examining the singular value decay of and the effective rank of the empirical Gram matrix . Despite the nominally large lifted dimension (), the data-driven operator effectively acts on a substantially lower-dimensional subspace induced by the correlated temporal structure of the delay coordinates and nonlinear observables. This implicit dimensionality reduction enables stable operator identification and mitigates overfitting, even in the underdetermined regime.
To provide a systematic comparison on the same simulated dataset, we evaluate several baseline methods, including Dynamic Mode Decomposition (DMD), Hankel DMD, Extended DMD (EDMD), and continuous-time EDMD (cEDMD). Among these approaches, Hankel DMD yields the best performance. While DMD, EDMD, and cEDMD fail to accurately reconstruct the nonlinear epidemic dynamics, Hankel DMD achieves a reconstruction error on the order of
when a delay dimension of
is employed. The corresponding Hankel-DMD reconstruction is shown in
Figure 2.
Reconstruction accuracy is quantified using the absolute pointwise error defined in (
58).
Figure 3 compares the temporal evolution of the reconstruction error for FIRE and Hankel DMD over the training interval. The FIRE framework consistently achieves a lower reconstruction error across all the time steps, with particularly pronounced improvements during the nonlinear growth and saturation phases of the epidemic.
For completeness,
Figure 4 reports the reconstructions obtained using DMD, EDMD, and continuous EDMD. These methods fail to capture the essential nonlinear features of the SIR dynamics.
To provide a rigorous baseline comparison, the EDMD and continuous EDMD schemes were equipped with a comprehensive polynomial library, as follows:
including all relevant cross-product terms up to the third order. For the continuous EDMD scheme, the evolution data matrix is constructed as
where the time derivatives are approximated using forward finite differences, as follows:
The reconstruction quality is evaluated using the relative reconstruction error defined in (
59).
Table 1 reports the errors for all methods considered.
Ablation Study. To isolate the contribution of each modeling component in the proposed FIRE framework, we perform an ablation study by systematically removing elements from the lifted representation. Specifically, we consider the following cases:
- 1.
Full dictionary: Delays + nonlinear cross-terms + derivative features.
- 2.
No delays: Remove temporal embeddings; retain nonlinear cross-terms and derivatives.
- 3.
No cross-terms: Remove nonlinear products ; retain delays and derivatives.
- 4.
No derivatives: Remove derivative features; retain delays and nonlinear cross-terms.
Table 2 reports the relative reconstruction error for different dictionary configurations.
The numerical results indicate that the explicit inclusion of the transmission nonlinearity () together with the recovered rate derivative yields the highest reconstruction fidelity. By embedding the functional form of the infection mechanism directly into the observable space, the generalized operator is able to accurately learn the transition from the initial exponential growth phase to the epidemic peak and the subsequent recovery regime. This hybrid strategy, which combines deep delay embeddings with physics-informed nonlinear observables, enables the proposed framework to significantly outperform standard DMD-based methods in both short- and long-term prediction accuracy.
From a theoretical perspective, this behavior is consistent with Takens’ embedding theorem, which guarantees that the topology of a finite-dimensional dynamical system can be reconstructed from a single observed time series given a sufficiently large delay window. The choice of a deep delay embedding with allows to capture the delayed feedback between susceptible–infectious interactions and the ensuing peak in the infectious population. In effect, the operator implicitly encodes higher-order temporal dependencies that are not explicitly present in the original state variables, providing a fading memory representation of the epidemic dynamics.
By projecting the nonlinear flow onto a 120-dimensional extended observable space, the learned operator becomes markedly less sensitive to local numerical fluctuations in the sampled S, I, and R trajectories. This implicit regularization, induced by the delay structure and correlated nonlinear features, contributes to the observed robustness and accuracy of the FIRE reconstruction in highly nonlinear and transient regimes.
Example 2. Forced Harmonic Oscillator with Transient Dynamics
A central challenge in data-driven system identification is the recovery of the underlying state space dynamics from limited, and often scalar, measurements. Classical operator-theoretic approaches typically rely on access to the full state or multiple observables and may fail to reconstruct the latent dynamical structure when only a single measurement is available. In contrast, the proposed generalized transfer operator enables state reconstruction through a lifting strategy that combines delay embeddings with derivative-based observables, thereby recovering the intrinsic manifold from univariate time series data.
To assess the performance of
in this setting, we consider a second-order, linear, non-homogeneous dynamical system: a forced harmonic oscillator exhibiting both transient and steady-state behavior. The governing equation is
where
denotes the displacement. The natural frequency of the unforced system is
rad/s, while the external forcing acts at frequency
rad/s. The coexistence of these frequencies induces a transient regime followed by a forced response, providing a nontrivial test case for operator-based identification from scalar data.
The trajectory is generated over the time interval s, starting from rest with initial conditions and . The system is integrated using a variable-step Runge–Kutta method (ode45) with tight error tolerances (, ). A fixed sampling interval of s yields 8001 discrete snapshots , which capture both the initial transient phase and the subsequent forced oscillatory dynamics. This dataset serves as the sole input for constructing the data-driven approximation of the generalized operator .
In this experiment, we assume that only the scalar displacement signal is available for measurement. The velocity and the external forcing are not directly observed. To compensate for this incomplete state information, we construct a compact yet expressive dictionary that embeds the scalar time series into a low-dimensional feature space enriched with derivative and forcing information. Our objective is to identify a continuous-time representation of the generalized transfer operator via its infinitesimal generator.
Unlike discrete-time formulations, which approximate a finite-step evolution map, the generator-based approach seeks a linear operator
G such that the lifted observables satisfy
where
denotes the feature map evaluated along the trajectory. This formulation directly approximates the action of the Lie generator associated with the underlying dynamics.
This example illustrates FIRE’s ability to operate on heterogeneous spaces, where the source space and the target space are not identical. Specifically, corresponds to a reconstructed feature space obtained from scalar measurements, while represents the infinitesimal evolution of these features under the system dynamics. The alignment between these spaces is achieved through a localized temporal window.
To this end, we introduce a symmetric delay structure with one step backward and one step forward, corresponding to memory depth
and anticipation depth
. The resulting extended state at time index
k is
This local temporal window encodes sufficient information to approximate first- and second-order derivatives while remaining agnostic to the true physical state
.
From the windowed measurements (
63), numerical derivatives are computed using centered finite differences:
These approximations provide local estimates of the velocity and acceleration without requiring explicit access to latent variables.
Based on this construction, we define a dictionary of observables
acting on
as
The inclusion of the trigonometric forcing terms explicitly encodes the non-autonomous input, while the derivative-based observable enables the reconstruction of the underlying second-order dynamics from the scalar data.
The use of the forward sample in the dictionary represents a localized anticipation depth during operator identification. Importantly, this non-causal information is required only in the offline training phase. Once the operator G is learned, the resulting spectral model is fully causal: the natural and forcing frequencies are embedded in the eigenvalues of G, enabling autonomous prediction without access to future measurements.
The resulting feature map is
with time derivative
where all derivatives are computed numerically from the sampled data.
Using the training dataset
with
samples, we compute the continuous-time FIRE approximation by solving a least squares problem for the generator matrix
in (
62).
This dictionary design effectively “unfolds” the scalar signal into a feature space in which the dynamics become linear and autonomous. By explicitly incorporating the driving terms and into the observable dictionary, the generalized operator is able to represent the non-autonomous forcing without introducing explicit time dependence in the operator coefficients. As a result, the learned operator acts on an augmented state in which the external forcing is internalized as part of the state evolution.
We use the first 5600 snapshots as training data. The resulting operator matrix G, obtained from the least squares approximation , provides a finite-dimensional numerical representation of the Lie generator associated with the lifted dynamics. This enables both high-fidelity state reconstruction and meaningful spectral analysis, demonstrating that the proposed framework remains robust even under the severe constraint of univariate measurements.
The dynamic reconstruction obtained using the FIRE algorithm is shown in
Figure 5. For comparison, the right panel shows the reconstruction obtained using EDMD with a standard polynomial–trigonometric dictionary.
Figure 6 reports the pointwise absolute reconstruction error, computed using (
58), for both the training interval and the out-of-sample prediction horizon. The observed error remains on the order of
, with slightly increased deviations during the prediction phase, reflecting the extrapolative nature of the task.
To provide a rigorous benchmark, we compare FIRE against several standard data-driven decomposition methods: Dynamic Mode Decomposition (DMD), Hankel DMD, Extended DMD (EDMD), and continuous EDMD. Among these approaches, EDMD yields the best performance, although its reconstruction accuracy remains significantly inferior to that of FIRE.
A representative EDMD dictionary augmented with explicit forcing terms is given by
Despite the inclusion of cubic nonlinearities and trigonometric forcing, EDMD fails to accurately reconstruct the system dynamics, as illustrated in
Figure 5.
For continuous EDMD, the same dictionary (
66) is employed, with the corresponding evolution data matrix
where time derivatives are approximated numerically.
Table 3 summarizes the relative reconstruction errors (
59) for all considered methods.
Unlike EDMD or Hankel-DMD, which rely on fixed temporal shifts within a single observable space, FIRE explicitly permits a general alignment map between heterogeneous spaces. In this example, the source space encodes local temporal context through windowed scalar measurements, while the target space corresponds to the instantaneous evolution of lifted observables.
Consequently, the learned operator G is not a time-shift operator but a data-driven approximation of the infinitesimal generator governing the induced feature space dynamics. This distinction enables FIRE to recover hidden state information and non-autonomous forcing effects without requiring backward-time data or explicit access to the full system state. As a result, the proposed framework remains applicable in settings where classical Koopman-based methods either fail or require substantial modification.
Example 3. Coupled Two-Mass Spring–Damper System
To demonstrate the effectiveness of the generalized transfer operator in higher-dimensional state spaces, we consider a coupled two-degree of freedom (2-DOF) coupled system. The model consists of two masses, and , coupled through a linear spring with stiffness k and a viscous damping element with coefficient c. This example illustrates the ability of the proposed framework to identify coupled dynamics and accurately capture unstable regimes using displacement-only measurements.
The governing equations of motion are given by the following coupled second-order ordinary differential equations:
where
and
denote the displacements of masses
and
, respectively.
The numerical experiment is conducted using the parameters , , , and . The system is simulated over the time interval with time step and initial condition , yielding 5001 discrete snapshots. The negative damping coefficient introduces an unstable regime in which energy is continuously injected into the system, causing exponential growth of oscillation amplitudes. This configuration provides a challenging test case for data-driven spectral identification methods.
The available measurements consist solely of the displacement vector
. To construct the transfer operator
, we lift these measurements into an augmented phase-space representation using forward finite differences. Let
denote the sampled displacement at time
. We define the extended state vectors as follows:
which implies that the source and target spaces coincide, i.e.,
.
Based on these windowed measurements, we define the lifted observables as follows:
resulting in the feature vector
which pairs displacement measurements with an approximation of the velocity. This lifting transforms the displacement-only observations into a phase space representation suitable for linear operator identification.
The first
of the snapshots are used as training data for Algorithm 1. The reconstruction of the training trajectories obtained using FIRE is shown in
Figure 7, while the out-of-sample predictions are presented in
Figure 8.
The data matrices
and
are constructed by time-shifting the lifted features,
where
denotes the number of training snapshots. This construction yields a finite-dimensional approximation of the discrete-time flow map in feature space.
The resulting operator G serves as a numerical approximation of the generalized transfer operator, mapping the current lifted state to its future counterpart. Despite the unstable nature of the underlying dynamics, the identified operator accurately captures both the inter-mass coupling and the dominant growth rates of the system modes, enabling reliable long-term prediction.
A comparative analysis was performed to benchmark the proposed FIRE scheme against several standard decomposition techniques, including DMD, Hankel DMD, Extended DMD (EDMD), and continuous EDMD. Among these methods, Hankel DMD provided the best performance, although still inferior to FIRE in terms of reconstruction fidelity.
For EDMD and continuous EDMD, we employ a nonlinear polynomial library:
The evolution data matrix for continuous EDMD is constructed as
where the time derivatives are approximated using forward finite differences:
Hankel DMD achieves the best reconstruction among the classical methods when using a delay coefficient
.
Figure 9 shows the training data reconstruction obtained by Hankel DMD.
Figure 10 depicts the pointwise absolute reconstruction error for both FIRE and Hankel DMD, computed according to Formula (
58) over the training snapshots
, with
.
These results demonstrate that FIRE successfully reconstructs the full four-dimensional state trajectory, even under challenging conditions including coupled, unstable dynamics and limited measurements. The framework effectively captures the inter-mass coupling and reproduces the growth rates of the dominant modes, outperforming classical DMD-based methods and validating its robustness for high-dimensional, multi-component dynamical systems.
Example 4. Spatio-Temporal Dynamics of a Nonlinear Soliton (PDE)
To demonstrate the applicability of the generalized transfer operator
in high-dimensional state spaces, we consider the evolution of a coherent structure governed by a partial differential equation. Specifically, we study the nonlinear Schrödinger (NLS) equation,
where
is a complex-valued function of space
and time
t. Following [
8], we perform a Fourier transform in space to obtain an evolution equation in the Fourier domain:
where ⊙ denotes element-wise multiplication,
k is the vector of wavenumbers, and
represents the Fast Fourier Transform (FFT).
We discretize the spatial domain using
points, resulting in a high-dimensional state space
. The initial condition is a high-amplitude soliton:
and the system is integrated in time with a fourth-order Runge–Kutta scheme over
, sampled at
time points with
. The resulting dataset consists of a snapshot matrix
representing the spatio-temporal evolution of the soliton.
To capture the full dynamics, we employ a derivative-augmented dictionary. Each snapshot is paired with its temporal derivative approximated by forward differences, effectively doubling the dimensionality and allowing the operator to approximate the second-order-in-time nature of the NLS equation. Specifically, we define the extended state vectors as follows:
so that the observation spaces satisfy
, where
. The lifted observables are chosen as
yielding the feature vector
. The forward difference derivative approximates the velocity field at each spatial point:
providing the operator with information about the temporal evolution of the system.
Finally, the lifted snapshot matrices are constructed as
with dimensions 1024 × 19. Using Algorithm 1, we compute the finite-dimensional approximation of the transfer operator
G, which simultaneously provides the reconstructed state vectors and the corresponding eigenfunctions and eigenvalues of the high-dimensional nonlinear system.
To evaluate the performance of the proposed generalized transfer operator
, we conducted a comparative study against standard decomposition techniques: Dynamic Mode Decomposition (DMD), Hankel DMD, Extended DMD (EDMD), and continuous EDMD (cEDMD).
Figure 11 depicts the full simulation data along with reconstructions obtained by the FIRE algorithm and Hankel DMD.
The accuracy of the reconstructions is quantified using the pointwise absolute error metric (
58). Continuous EDMD exhibited the poorest performance and is not visualized here. Among the remaining methods, FIRE and Hankel DMD consistently produce the lowest reconstruction errors, as shown in
Figure 12.
Hankel DMD achieves its best performance using a temporal stack of only two snapshots, highlighting the importance of encoding velocity information in the feature space. For EDMD, we employed a dictionary consistent with the cubic nonlinearity of the NLS equation:
Despite this, the FIRE algorithm provides superior reconstruction, demonstrating the advantage of a carefully designed, derivative-augmented observable set.
The eigenvalue spectra computed by the different methods further illustrate the efficacy of FIRE (
Figure 13). FIRE aligns the eigenvalues close to the imaginary axis, as expected for the Hamiltonian-like dynamics of the soliton, whereas deviations are observed for DMD and EDMD.
The inclusion of the derivative in the dictionary is essential for PDE systems. It allows the operator G to account for the “momentum” of the wave field, effectively capturing the spatio-temporal evolution even with a limited number of temporal snapshots (). This lifting enables the extraction of dominant Koopman modes that characterize the soliton’s stability and shape-preserving propagation, demonstrating the robustness of the proposed framework for distributed parameter systems with continuous-state fields.