1. Introduction
With the continuous advancement of the integration of source, grid, load, and storage in the power system [
1], the distribution station area, which is an important part of the modern power system, is accelerating its evolution into a smart station area. The smart station area deeply integrates advanced sensing, communication, computing, and control technologies with a variety of distributed energy and power electronic equipment, aiming to achieve more efficient energy management, optimized control, and intelligent operation. However, the high proportion of renewable energy penetration, complex power electronic control logic, and the time-varying uncertainty of operating conditions have led to significant nonlinearity, strong coupling, and time-varying dynamic behavior in the smart station area, presenting new challenges for precise and real-time system modeling and control [
2].
The existing research can be roughly divided into four technical paths. The first is the flexible interconnection and local control. The flexible interconnection system for low-voltage distribution transformer areas proposed in Reference [
3] has enhanced the reliability of power supply and the capacity to accept distributed power sources. Based on this, Reference [
4] constructed a planning model considering the total investment cost and comprehensive carrying capacity. Reference [
5] effectively alleviated the voltage over-limit and fluctuation problems caused by high-permeability distributed generation through a local interconnection and control strategy based on soft open points. These works focus on achieving flexible power regulation through power electronic devices, laying the foundation for the application of advanced control strategies. However, their underlying device-level dynamic models are often simplified and fail to fully reveal the nonlinear evolution process of the system. The second is big data and edge computing. Reference [
6] combined the autocorrelation feature detection and information fusion recognition to construct a risk assessment model for a smart station area. References [
7,
8] utilized edge computing technology to address the latency issue between intelligent terminal devices and cloud platforms, meeting the demands of business scenarios with high real-time requirements and enhancing the local perception and decision-making capabilities of the distribution area. Reference [
9] further proposed an intelligent transformer area design method based on cloud-edge collaboration, achieving real-time resource monitoring and power quality governance. These studies have addressed the platform issues for real-time data analysis and execution. However, their performance is highly dependent on the quality of the models themselves running on the edge side. A dynamic model that is both precise and computationally efficient is the key to realizing its potential. The third is the optimization and risk management methods. Reference [
10] constructed a risk-power interaction control model using mixed integer linear optimization, providing a tool for the quantitative management of risks in the distribution area. The fourth is the method based on operators or extended linearization. Reference [
11] introduced the Koopman method into the power flow calculation of the distribution system and constructed a high-precision linear mapping model. References [
12,
13] constructed a global linear dynamic model of wind turbine units by applying the Koopman operator theory and an extended dynamic mode decomposition algorithm.
The above-mentioned work has made significant progress in its respective dimensions. But there are still systematic deficiencies. Most precise mechanism models struggle to strike a balance between accuracy and computational complexity, thus failing to meet the demands of real-time control and online applications. The existing data-driven linear or local models are usually only effective under limited working conditions, and it is difficult to uniformly characterize the global nonlinear behavior of the system and cross-device interaction. In terms of constructing a universal observation/feature space to ensure the generalization ability of the model, there is still a lack of systematic adaptive methods, especially in the scenario of high-dimensional and multi-source perturbations in smart station areas.
To address the aforementioned deficiencies, the Koopman operator theory offers an effective approach. By mapping the original nonlinear dynamics to a high-dimensional observation function space, the system evolution is approximately described by linear operators in this space. Thus, not only can the linear system theory be utilized for analysis and control, but also the representational observation functions can be automatically learned through data-driven methods. Based on this idea, this paper proposes an adaptive high-dimensional dynamic modeling method based on the Koopman operator. The core of this method lies in achieving high-precision modeling of the global dynamic characteristics of smart station areas under high penetration of renewable energy and complex power electronics control by adaptively constructing and selecting a set of high-dimensional observation functions. The main contributions of this article are as follows:
A mathematical model of the smart distribution network stations, including wind, solar, hydrogen storage, and synchronous generators, was constructed. Through dynamic power allocation strategies and SOC constraint management, the energy interaction characteristics at multiple time scales were accurately depicted, providing a physical basis for subsequent stability analysis.
A construction method of hybrid nonlinear basis functions is proposed. Compared with the traditional single-type basis functions, this paper innovatively combines seven types of nonlinear observation functions, including cubic nonlinearity, square nonlinearity, exponential decay, trigonometric function oscillation, and logarithmic function. Through the function selection strategy based on the typical nonlinear characteristics of power electronic systems, the Koopman operator’s ability to capture the strong nonlinear dynamics of power electronic systems is significantly enhanced.
A two-stage state reconstruction framework based on least squares regression was designed. A post-regression mapping was introduced based on the standard Koopman method to achieve an accurate transformation from the high-dimensional observation space to the original state space, solving the problem of insufficient state reconstruction accuracy in traditional methods and providing a more reliable numerical tool for nonlinear system analysis.
A modal analysis method based on SVD truncation was adopted. The singular value decomposition was used to automatically screen the dominant modes. The eigenvalue analysis was combined to identify the dynamic patterns of the system, providing a data-driven analysis tool for the stability assessment of smart transformer areas.
3. Mathematical Model of the Smart Station Area
In the smart transformer area, to achieve the collaborative optimization and dynamic regulation of multiple types of energy and power electronic equipment, this paper constructs a multi-level and multi-module coupled mathematical model. This model is based on renewable energy sources such as solar photovoltaic and wind power generation, combines the regulation capabilities of BESS, HFC, and SG, and incorporates the standard adaptive control characteristics of power electronic equipment, achieving real-time feedback control of the system’s operating status. The model comprehensively considers the balance of energy supply and demand, the constraints of battery State Of Charge (SOC), the coupling of physical topological power flow, and cost-driven scheduling strategies, providing fundamental support for the safe and stable operation and efficient scheduling of smart transformer areas.
3.1. Solar Photovoltaic Power Model
The solar photovoltaic power generation system builds a model based on the meteorological characteristics of solar radiation and the actual installed capacity, breaking through the simplified assumption of traditional linear interpolation and introducing the intraday radiation pattern and the fluctuation characteristics within an hour, which is more in line with the actual output characteristics.
Assuming the installed capacity of photovoltaic power is
, the solar radiation within a day varies periodically (the effective radiation period is from 6:00 to 18:00), and the radiation intensity function is:
where
denotes continuous time.
Considering the random fluctuations of photovoltaic output within an hour (such as disturbances caused by cloud cover), the photovoltaic output at any time
is obtained through a three-layer calculation of “radiation intensity-installed capacity-fluctuation correction”, and at the same time, a 24 h modulus operation is utilized to achieve a periodic cycle:
where
represents the PV power output of the smart station area at time ;
represents the hourly fluctuation coefficient of photovoltaic power;
represents the random fluctuation simulation operator;
ensures linear interpolation between hourly data points;
represents a 24 h discrete moment;
represents the basic output of photovoltaic power at discrete moments within 24 h;
indicates the realization of a 24 h cycle loop.
3.2. Wind Turbine Model
The wind turbine model [
14] is established based on the physical characteristics of wind speed and power. It combines the entry, rated, and exit wind speed thresholds to reflect the correlation between wind power output and meteorological conditions, avoiding the defect of “no physical meaning” of traditional interpolation methods.
represents the wind speed at time ;
represents the average wind speed;
represents the fluctuation range of wind speed;
represents the time offset of the peak wind speed;
represents the period adjustment coefficient.
- 2.
Wind speed–power segmentation characteristics: Based on the physical principles of wind turbine units, the output varies in three segments with wind speed: cut-in, rated, and cut-out wind speed.
represents the base power of wind power generation in the smart substation at time ;
represents the installed capacity of the wind turbine;
represents the cut-in wind speed;
indicates the rated wind speed;
represents the cutting air velocity.
- 3.
Hourly fluctuation correction: Considering short-term disturbances such as gusts, the final wind power output is:
represents the output power of wind power generation in the smart substation at time ;
represents the hourly fluctuation coefficient of wind power;
represents the base output of wind power at discrete moments within 24 h.
3.3. Battery Energy Storage System (BESS) Model
The core function of BESS [
15] is to compensate for power gaps. Its scheduling logic combines time period characteristics with SOC safety constraints to ensure that the energy storage system takes into account both lifespan and regulation capacity during “peak shaving and valley filling”.
3.3.1. Objective Function
BESS selects charging and discharging strategies based on the load–renewable energy supply, and demand difference, and time period characteristics:
represents the system power imbalance at time , defined as the difference between load demand and total renewable energy output;
denotes the load demand power of the smart station area at time .
- 2.
The time-based charging and discharging strategy is:
During the period when photovoltaic power is sufficient and there is remaining electricity in the smart transformer area, BESS prioritizes charging operations, and its expression is:
where
represents the charging power of BESS at time ;
indicates the maximum allowable charging/discharging power limit.
During peak load periods, when a power gap occurs in the smart transformer area, BESS prioritizes discharging operations, and its expression is:
where
denotes the discharging power of BESS at time .
3.3.2. Operational Constraints
To ensure safe operation within specified boundaries, the charging/discharging power of BESS must satisfy the following constraints:
The State of Charge (SOC) reflects the current energy storage level of the battery, expressed as:
where
represents the state of charge at time , bounded within ;
indicates the charging/discharging power (negative for charging, positive for discharging);
is the discrete time step for calculating BESS state transitions;
denotes the rated energy capacity of BESS.
BESS maintains adequate regulation capability during prolonged operation while protecting the battery from extreme conditions to extend the battery lifespan and guarantee the operational safety of the smart station area. The SOC must comply with upper and lower limit constraints:
where
represents the minimum energy threshold to prevent the over-discharge;
indicates the maximum energy threshold to prevent overcharging.
3.4. Hydrogen Fuel Cell (HFC) Model
In the smart transformer area, HFC [
16] serves as a supplementary power supply, making up for the insufficient power of the system after BESS regulation. A cost-driven sharing strategy is adopted, taking into account both the economy and reliability of the system power supply.
The remaining power gap
after BESS regulation is:
where
represents the charge and discharge power of the BESS at time .
The output allocation of HFC and SG is based on the optimization of unit generation cost: if the unit cost of HFC is defined as
and that of SG as
. The cost-weighted sharing ratio is:
where
and represent the proportion coefficients of the remaining power gap borne by HFC and SG, respectively.
and represent the unit costs of HFC and SG, respectively.
The actual output of the HFC needs to meet the maximum capacity constraint:
where
represents the output power of the HFC in the intelligent transformer area’s adaptive dimensional escalation dynamic linear model at time ;
represents the short-term overload coefficient;
represents the upper limit of the maximum output of the HFC in the adaptive dimensional escalation dynamic linear model of the smart transformer area at time .
3.5. Synchronous Generator (SG) Model
As a traditional backup power supply, SG fills the gap when the output of HFC is still insufficient. It works in tandem with HFC to achieve the dual goals of “economy and reliability” while also playing a core role in maintaining the stability of the system frequency.
Based on the cost-weighted sharing ratio of Equation (13), the output of SG is:
where
represents the output power of SG at time ;
represents the maximum output limit of SG at time .
The final system power balance needs to meet:
3.6. Power Electronic Device Control Logic
3.6.1. Foundation Sag Control
In the smart transformer area system, power electronic devices are responsible for controlling and regulating the output of distributed energy and balancing power fluctuations. The standard adaptive Virtual Synchronous Generator (VSG) control is adopted to imitate the inertia and damping characteristics of traditional synchronous machines. The parameters are set in accordance with IEEE and GB/T standards to enhance the frequency, voltage stability, and dynamic adaptability of the system.
VSG adopts a droop control mechanism, converting the power deviation of the system into adjustments for frequency and voltage. The expression for frequency drop is:
where
represents the frequency adjustment amount caused by the active power deviation at time ;
represents the frequency drop coefficient;
represents the active power injected at time ;
represents the system reference power at time .
The expression of voltage sag is:
where
represents the voltage adjustment caused by the reactive power deviation at time ;
represents the voltage sag coefficient;
represents the reactive power injected at time .
3.6.2. Adaptive Inertia and Damping Control
To enhance transient stability, the adaptive parameter adjustment based on standard thresholds is introduced. The core logic is “increasing inertia in transient states and decreasing inertia in steady states”.
Frequency change rate calculation: By calculating the difference in frequency deviation between the current moment and the previous moment, combined with the time step, the frequency change rate is calculated to reflect the intensity of the system’s transient process:
represents the frequency deviation at the previous moment.
- 2.
Adaptive virtual inertia adjustment: This is adjusted according to the system operation status. The core logic is that “transient inertia increases and steady-state inertia decreases.”
and represent the inertia amplification coefficients, respectively;
represents the basic virtual inertia;
represents the frequency deviation threshold;
represents the threshold of the rate of frequency change.
- 3.
Adaptive damping coefficient adjustment: is adjusted according to the system operation conditions in different scenarios. The core logic is “transient strong damping and steady-state weak damping”.
and represent the damping amplification coefficients, respectively;
represents the basic damping coefficient.
3.6.3. Final VSG Control Equation
Adaptive parameters are integrated into the VSG frequency control while keeping the voltage control logic unchanged:
where
represents the actual frequency of VSG under control at time ;
represents the rated frequency of the system;
represents the voltage setting value of VSG under control at time ;
represents the rated voltage of the system.
Finally, under the VSG control strategy, the expression of the state equation for the dynamic change in node voltage
is:
where
and represent the real and imaginary parts of the voltage at time , respectively.
and represent the real and imaginary parts of the target voltage at time , respectively.
3.7. Power Flow Equations
The power flow calculation is based on the physical significance and numerical stability of the admittance matrix of the 33-node distribution network topology, which is in line with the actual grid structure of the transformer area. The complex admittance matrix
of a single line is:
where
represents the line resistance;
represents the line reactance.
The self-admittance of node
is the sum of the admittances of all the lines connected to this node:
If node
is directly connected to node
, the mutual admittance is the negative value of the line admittance:
The injection current of the
node is:
where
represents the complex voltage of node .
The injection complex power
of the
node is:
where
represents the complex voltage of node ;
represents the conjugate complex number of .
In summary, based on the core principles of nonlinear dynamic characteristic modeling, the mathematical model of the smart station area should possess the features illustrated in
Figure 2. These include multi-energy coordination, dynamic power flow regulation, controlled evolution of energy storage system states, adjustable and controllable power electronic interfaces, constraint-driven operation strategies, and data-driven dimensionality augmentation modeling capabilities. By integrating the output characteristics of various distributed energy resources, observation functions, and dynamic modes, the model establishes an efficient and reliable operational and analytical platform for the station area.
4. Adaptive Ascending Linear Dynamic Modeling Based on Koopman Theory
4.1. Fundamental Concepts of Koopman Theory
The Koopman operator is a theory that describes the evolution of Hamiltonian dynamical systems in terms of quantity measurement. Its function is to transform finite-dimensional nonlinear systems into infinite-dimensional linear systems. This theory has the ability to characterize the dynamic evolution process of nonlinear systems, and it can transform the originally finite-dimensional nonlinear system into an infinite-dimensional linear system for research. Therefore, modeling methods based on the Koopman operator have received widespread attention [
17,
18].
Consider a continuous nonlinear dynamical system evolving in the state space
. Its state-space representation can be expressed as:
where
denotes the dimensional state variable on a smooth manifold, encompassing quantities such as voltage magnitude and phase angle.
The mapping : describes the temporal evolution of the state variables in the dynamical system.
Define a complex-valued scalar function
:
on the state space as an observable function, representing the mapping from state variables to observables. The Koopman operator
is a linear operator acting on the observable function
such that:
where
denotes one or a set of observable functions, representing the result of a nonlinear transformation applied to .
is the Koopman operator, an infinite-dimensional linear operator acting on observable functions, capable of capturing global dynamic information in nonlinear systems.
To convert the nonlinear system into a linear representation, a nonlinear mapping—termed the ascending operator—is defined as:
where the new variable
approximately satisfies a linear dynamic:
Here, represents the finite-dimensional approximation of the Koopman operator within the selected observable subspace.
4.2. Construction of Koopman Dimensionality Ascension
The nonlinear mapping is achieved by constructing diverse observable functions, expressed as:
where
denotes the original state vector;
preserves the original state variables directly, retaining primary system information;
introduces third-order nonlinearity by multiplying the state variables by their squared norm;
captures quadratic nonlinear effects through component-wise squaring;
enhances local smoothness via exponential decay;
and employ trigonometric functions to characterize the system’s periodicity and oscillations.
Thus, the constructed high-dimensional observable vector is formulated as:
Here,
defines a nonlinear mapping:
where
represents the number of observable functions in ;
indicates that the new observable space has a significantly higher dimensionality than the original state space, effectively “linearizing” nonlinear relationships in this extended space.
To implement this, the original data matrix
is expanded into:
To avoid numerical overflow and information loss, the observation functions in Equation (26) are standardized:
Polynomial function: By using state variable normalization, let the normalized state vector , where and represent the minimum and maximum values of the original state vector, respectively, and compress the input range to .
Exponential function: Introduce the scaling factor (determined through cross-validation) to balance the attenuation rate and information retention.
Trigonometric functions: Add amplitude constraints to avoid high-frequency oscillations.
Based on the modal energy contribution screening function, the steps are as follows:
Calculate the Koopman mode energy corresponding to each observation function.
Sort functions in descending order by energy proportion, with a cumulative energy proportion reaching 99.5%, to form a sparse function set.
Define the sparsity index S as the filtered function/the original function. When S < 0.5, the energy threshold needs to be recalculated to avoid information loss due to excessive sparsity.
In the construction of the data matrix in Equation (29), a regularization term is introduced and corrected to:
where
represents the regularization parameter;
represents the identity matrix.
Noise robustness is achieved through data preprocessing, with a third-order moving average filter applied to the original state variable
x:
where
represents the filtered state variable at time ;
represents the original state vector at time ;
and represent the original state vectors of the previous moment and the previous two moments, respectively.
4.3. Linear Approximation Based on Dynamic Mode Decomposition (DMD)
There are many methods for finite-dimensional approximation of the Koopman operator. This paper chooses the method of Dynamic Mode Decomposition (DMD) [
19,
20].
Following the dimensionality ascension process, the new variable
satisfies an approximate linear dynamic model expressed as:
where
represents the ascended state at time step ;
denotes a linear transformation matrix characterizing the temporal evolution of .
To approximate the matrix A from data, the Dynamic Mode Decomposition (DMD) method is employed. The specific implementation steps are as follows:
To capture low-frequency dominant modes, a delay embedding matrix
is constructed using the Hankel matrix formulation. Assuming a delay step number of r, we have:
where
means taking the first columns of as the first delay layer;
represents the total number of time sampling points.
The reshaped delay embedding matrix
is then partitioned into
and
, corresponding to temporally adjacent delay embedding vectors:
where
represents, in the delay-embedded data matrix, from the first column to the second-last column;
represents from the second column to the last column.
- 2.
Singular Value Decomposition (SVD):
The matrix
undergoes SVD:
where
contains orthogonal basis vectors representing principal data directions ;
is a diagonal matrix of singular values indicating modal energy levels ;
contains the right singular vectors ;
denotes the truncation order.
- 3.
Low-dimensional Linear Operator Construction:
Using SVD results, the reduced-order linear operator is constructed as:
where
represents the approximate Koopman operator in the reduced-order space;
is the conjugate transpose of ;
denotes the inverse of the singular value matrix.
- 4.
Eigen decomposition and Mode Recovery:
Performing eigenvalue decomposition on
yields:
where
contains eigenvectors representing modal characteristics in the reduced space;
is a diagonal matrix whose elements are eigenvalues, reflecting each mode’s growth/decay and oscillatory properties.
The high-dimensional Koopman modes are recovered as:
where
is the high-dimensional modal matrix, with each column representing a dynamic mode in the ascended space.
- 5.
Continuous-time Frequency Calculation:
The continuous-time characteristic frequencies are derived from discrete eigenvalues
:
where
represents the characteristic frequency of the dynamic mode;
denotes the sampling interval.
- 6.
Initial Condition Expansion and Signal Reconstruction:
For the initial time
, the least squares projection on Koopman modes gives:
where
represents the modal expansion coefficients, indicating the contribution of each Koopman mode to the initial state;
denotes the Moore–Penrose pseudoinverse of matrix .
Finally, the reconstructed expression at time
is given by:
where
is a diagonal matrix with elements , describing the temporal evolution of each mode;
represents the reconstructed high-dimensional observation vector.
4.4. Online Adaptive Dimensionality Upgrade Model
A sliding window of length
(taking 12 time steps, corresponding to 1 h) is adopted, and the data matrix within the window is defined as:
where
represents the original state vector at the earliest moment.
When the new data
arrives, the update window is:
Based on the historical SVD result , the calculation steps for the incremental SVD of the new window data matrix are:
Calculate the projection residual of the new vector in the historical left singular space.
If , update the left singular matrix: .
Update the singular value matrix. The right singular matrix is reconstructed through QR decomposition.
Calculate the modal contribution of each observation function.
Calculate the Koopman operator to complete the model update.
In summary, this paper proposes an adaptive dimension-lifting linear dynamic model based on Koopman theory. By constructing diverse nonlinear observable functions (including higher-order polynomials, exponentials, and periodic functions) combined with time-delay embedding, we achieve high-dimensional linearization of the original nonlinear system states. Through truncated SVD and DMD algorithms, the model adaptively identifies dominant modes and computes the Koopman operator, thereby establishing a linear evolution mapping for the system in the lifted dimensional space. The workflow of this Koopman theory-based adaptive dimension-lifting linear dynamic model is illustrated in
Figure 3.
5. Case Studies
This paper takes the IEEE 33-node standard test system as the research object. This system is the benchmark network for power system research. The network topology and basic parameters are derived from the Matpower public database, covering five types of multi-functional equipment: solar photovoltaic, wind power generation, BESS, HFC, and SG. The simulation time domain selects 0:00 to 23:00 of the next day, with a total of 24 time intervals. The load data is based on the typical daily variation pattern of residential electricity consumption, and the profiles of photovoltaic and wind power generation refer to the standard typical daily data provided by NREL. The specific parameter settings are shown in
Table 1.
The system load balance is shown in
Figure 4. During the period from 0:00 to 04:00, the photovoltaic system has no output. At this stage, the load demand is mainly borne by the output of the wind turbine, the power supply from the fuel cell, and a small amount of generator power. The battery charging and discharging show slight fluctuations, and the battery state of charge remains in the range of 0.75 to 0.85, indicating that the battery has not undergone large-scale charging and discharging operations. During the period from 05:00 to 12:00, the photovoltaic output gradually increases to approximately 300 kW, and the wind turbine output remains at a relatively high level. The system’s power generation exceeds the load demand, and the battery begins to charge, with the SOC gradually rising from around 0.85. During the period from 12:00 to 15:00, the photovoltaic and wind turbines maintain a high-output state continuously, significantly exceeding the load demand, and the SOC rises to 0.95. During the period from 15:00 to 20:00, the output of photovoltaic power gradually decreases, and the output of wind turbines also drops. When the system’s power generation capacity is insufficient to fully cover the load demand, the battery begins to discharge, and the SOC drops accordingly. At the same time, the power of fuel cells and generators participates in load compensation. After 20:00, the photovoltaic output approaches zero, while the load demand remains at a certain level. The battery continues to discharge, and the SOC further drops. At this stage, the load demand is mainly met by the output of the wind turbine, the power supply of the fuel cell, and the power of the generator, ensuring the stable operation of the system.
Principal component analysis is shown in
Figure 5. Among them, the principal component 1 on the x-coordinate and the principal component 2 on the y-coordinate are dimensionless.
Figure 5 illustrates the primary trends and disturbance features of node voltages in the smart station area. The scatter plot shows that Principal Component 1 (PC1) corresponds to voltage gradient variations, with left-to-right increments reflecting spatial transitions from low- to high-voltage zones. PC2 captures non-uniformities caused by load fluctuations. The variance bar chart quantifies each component’s contribution: PC1 explains 70–90% of spatial variations, while PC2, though smaller, remains significant. This confirms that a two-dimensional reduction effectively preserves data features, reduces computational complexity, and suppresses random noise interference in DMD and Koopman analyses. Compared to the raw high-dimensional data, PCA achieves an optimal balance between information retention and computational efficiency, enhancing the stability and accuracy of the dynamic mode extraction.
Figure 6 shows the reconstruction results of the original data and Koopman. Among them, the position and amplitude of the sampling points on the coordinate axes are dimensionless.
As shown in
Figure 6a, the original spatiotemporal voltage response exhibits multimodal coupling characteristics. The high-frequency components decay slowly in the early stage, there is obvious oscillation in the middle stage, and only weak noise remains in the later stage.
Figure 6b shows that the nonlinear reconstruction results based on the high-dimensional Koopman operator significantly improve the fitting accuracy. It is more in line with the original data in terms of the capture process, pulse peak, and phase control, with a peak time error of less than 0.1 h. Moreover, the overall response is smoother, and the noise suppression is more effective, demonstrating a better modeling ability for the nonlinear dynamics of the system.
The absolute distribution of the residual is shown in
Figure 7. Among them, the probability density and cumulative distribution of the vertical coordinate are dimensionless. The expression of the absolute residual is:
where
represents the reconstructed value;
represents the original value.
The residual PDF is shown in
Figure 7a. The peak probability density of the proposed Koopman method is concentrated in the 0–0.2 range with a relatively small absolute residual, demonstrating an advantage in fitting samples with low residuals. The probability density distribution of other methods is relatively more inclined towards the high residual interval. The residual CDF is shown in
Figure 7b. The cumulative distribution curve of the Koopman method rises the earliest and fastest. Within the interval with a smaller absolute residual, its cumulative probability increases rapidly, indicating that the proportion of samples with a smaller residual is significantly higher under this method. However, the cumulative curves of other methods, such as DMD, rise more slowly, and the proportion of low-residual samples is lower. In conclusion, the Koopman model performs better in error control and fitting accuracy and can cover more low-residual samples.
The RMSE error of the model is shown in
Figure 8. The calculation formula for RMSE is:
where
represents the reconstructed value;
represents the original value.
The RMSE error of the model is shown in
Figure 8. The simulation mainly compared the performance of the Koopman model with other models under the RMSE index. Four typical moments of the day, namely 00:00, 08:00, 14:00, and 18:00, were selected. The selected time covers the low load period at night, the peak photovoltaic output period during the day, the afternoon period when photovoltaic output decreases and load is relatively high, and the evening peak electricity consumption period, which can represent different operating conditions. At the same time, these periods often correspond to voltage fluctuations or inflection points, which are conducive to revealing the evaluation characteristics of the model under high-fluctuation working conditions. The performance comparison of the Koopman model with other models is shown in
Table 2.
Based on the analysis of the RMSE error of the model, the performance differences of several methods were evaluated from the perspective of modal characteristics. By comparing the distribution of eigenvalues extracted from the dynamic data of the intelligent transformer region by Koopman and other methods, the overall dynamic trend reflected by the error curve was verified, highlighting Koopman’s unique capabilities in modal capture and system stability.
The eigenvalue distribution is shown in
Figure 9. The proposed Koopman method linearizes the nonlinear dynamics through an adaptive high-dimensional observation mapping, resulting in more concentrated eigenvalues with smaller modulus values, demonstrating better stability, reconstruction accuracy, and robustness. In contrast, although the SINDY method can obtain a concise nonlinear model through sparse recognition, its performance is highly dependent on the completeness of the pre-selected basis functions, and it has poor stability in high-dimensional or noisy scenarios. The linear approximation ability of DMD is limited. EDMD relies on the selection of artificial basis functions. Although SDMD is sparse, it is prone to unstable modes when there is noise or insufficient samples.