Next Article in Journal
A Performance Study of Deep Neural Network Representations of Interpretable ML on Edge Devices with AI Accelerators
Previous Article in Journal
Automated Remote Detection of Falls Using Direct Reconstruction of Optical Flow Principal Motion Parameters
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A GRU-KAN Surrogate Model with Genetic Algorithm Uniform Sampling for Active Magnetic Bearings–Rotor Critical Speed Prediction

1
School of Power and Energy, Northwestern Polytechnical University, Xi’an 710129, China
2
Science and Technology on Altitude Simulation Laboratory, Mianyang 62100, China
3
National Key Laboratory of Science and Technology on Advanced Light-Duty Gas-Turbine, Xi’an 710129, China
4
National Elite Institute of Engineering, Northwestern Polytechnical University, Xi’an 710129, China
*
Author to whom correspondence should be addressed.
Sensors 2025, 25(18), 5680; https://doi.org/10.3390/s25185680
Submission received: 10 July 2025 / Revised: 19 August 2025 / Accepted: 10 September 2025 / Published: 11 September 2025
(This article belongs to the Section Physical Sensors)

Abstract

With the development of active magnetic bearings (AMBs) toward higher speeds, understanding high-speed rotor dynamics has become a crucial focus in AMB research. Traditional finite element modeling (FEM) methods, however, are unable to rapidly and comprehensively uncover the complex interplay between controller parameters and dynamic behavior. To address this limitation, a surrogate modeling approach based on a hybrid gated recurrent unit–Kolmogorov–Arnold network (GRU-KAN) is introduced to mathematically capture the effects of coupled control gains on rotor dynamics. To enhance model generalization, a genetic algorithm-driven uniform design sampling strategy is also implemented. Comparative studies against support vector regression and Kriging surrogates indicate a higher coefficient of determination ( R 2 = 0.9887 ) and lower residuals for the proposed approach. Experimental validation across multiple controller parameter combinations shows that the resulting machine learning surrogate predicts the critical speed with a mean absolute error of only 38.51 rpm and a mean absolute percentage error of 1.56 × 10 1 % , while requiring merely 1.14 × 10 4 s per evaluation—compared to 201 s for traditional FEM. These findings demonstrate the surrogate’s efficiency, accuracy, and comprehensive predictive capabilities, offering an effective method for rapid critical speed estimation in AMB–rotor systems.

1. Introduction

1.1. Background

AMBs are widely employed in systems such as torque gyroscopes, flywheel energy storage devices, blowers, centrifugal compressors, and other electric engines [1]. Early AMB research focused on manufacturing, packaging, and topology design, yielding notable performance test results [2]. As an open-loop unstable nonlinear mechatronic system, the AMB–rotor assembly requires feedback control to ensure stable operation, while the influence of the rotor dynamics must not be neglected [3]. With recent advances in computational power, models have progressively evolved toward multidisciplinary and multiparameter coupling [4,5], furnishing powerful theoretical tools for the more accurate characterization of dynamic behavior.

1.2. Literature Review

1.2.1. Dynamic Modeling Methods for AMB–Rotors

Many researchers have investigated AMB–rotor dynamic modeling. In [6], FEM theory was used to derive MATLAB 2021a-computed and workbench-simulated dynamic models of both rigid and flexible AMB–rotors supported by AMBs, comparing modal frequencies and mode shapes under varying equivalent stiffness; the study showed that tuning AMBs’ stiffness shifts natural frequencies and thus mitigates vibration. In [7,8], an algorithm for the synchronous identification of rotor unbalance and residual AMB parameters was proposed. This algorithm was developed based on Timoshenko beam theory, the finite element method, and a misalignment approach. The efficacy of the proposed algorithm was demonstrated through experimental testing, which yielded enhanced diagnostic accuracy and robustness. In [9], the design, construction, and modeling of a flexible rotor AMB test rig were detailed: by integrating conventional rotor dynamics analysis, FEM, and experimental validation, a comprehensive system model was established, proving that precise modeling coupled with model-based controller design enables the effective control of flexible rotors. In [10], a nonlinear dynamic model coupling the AMB and backup bearing was developed, systematically incorporating contact stiffness, sliding/rolling friction, aerodynamic damping, eddy current losses, and rotor geometric irregularity. This provided a high-fidelity description of the transient dynamics during rotor delevitation. In [11], the dynamic stiffness of radial AMBs in a magnetic-suspension molecular pump was analyzed and measured via combined frequency- and time-domain experiments, yielding detailed insights into bearing dynamic characteristics. In [12], a disturbance compensation control strategy for rotational-speed standard devices using an AMB force compensation system was proposed: by analyzing rotor unbalance dynamics and linearizing and decoupling the AMB force model across radial degrees of freedom, a PI-based force feedback controller was designed and validated on a semi-physical simulation platform, demonstrating the effective suppression of static, step, and synchronous vibration disturbances. In [13], the state space-based dynamic modeling and μ -synthesis control of flexible rotors and AMBs were presented and experimentally validated. In [14], a dynamic model based on radial and tilting motions was developed; simulations indicated that increasing angular acceleration markedly reduced the vibration amplitude at the radial critical speed, enabling an AMB–motor to traverse this critical speed without additional damping. In [15], building on a rotor dynamic model with incorporated gyroscopic effects, a fractional-order PID controller was formulated and optimized via particle swarm optimization to tune both the fractional orders and gain coefficients concurrently, achieving pole placement that markedly extended the stability margins and enhanced the dynamic response under high-speed operation.
Despite advances in single-physics rotor modeling, comprehensive analysis techniques for multidisciplinary and multiparameter coupled models remain relatively underdeveloped, particularly with respect to the impact of control parameter coupling on dynamic performance. In addition, the significant computational cost of parameter coupling analysis has not been adequately addressed.

1.2.2. Surrogate Model Methods

Surrogate models offer an effective route to study how control systems influence dynamic performance by rapidly constructing high-fidelity mappings between input variables and output responses [16,17,18], which endows them with unique advantages in rotor dynamics parameter identification. In [19], a hybrid surrogate model combining Kriging and Polynomial Chaos Expansion (PCE) was developed to relate crack parameters to rotor system dynamic responses, enabling an analysis of how parameter variations shape the response surface. In [20], simulated annealing enhanced particle swarm-optimized support vector regression (SVR) surrogate models and improved the high-speed spindle thermal displacement prediction accuracy while reducing the computational complexity. In [21], a KAN was applied to system dynamics prediction; the results showed superior accuracy in forecasting the rotor angular frequency and identifying system uncertainty parameters with a smaller network scale than conventional machine learning methods, demonstrating the KAN’s potential for the efficient, precise modeling of complex rotor systems. In [22], a particle swarm optimization method based on a mutation surrogate model was proposed for high-dimensional complex problems, using an initial sample selection strategy to build globally representative surrogate models. In [23], Latin hypercube sampling (LHS) was optimized to enhance surrogate models’ exploration and exploitation performance, with a staged sample selection strategy that improved the global fidelity and accuracy under limited sample sizes.
Collectively, prior studies indicate that surrogate models have advanced mechatronic system modeling and prediction through global sample selection strategies and modern machine learning techniques, thereby supporting improved model fidelity. However, conventional surrogates remain susceptible to overfitting when the training set is sparse or unevenly distributed, and their robustness and accuracy are intrinsically constrained by the coverage of the sampled design space. In particular, when the available data are limited, Latin hypercube sampling may preserve marginal uniformity yet inadequately capture the response surface curvature and higher-order interactions [24], which can lead to reduced generalization.

1.3. Contributions

Therefore, this paper analyzes the influence of control parameters on dynamic models and proposes a GRU-KAN model based on a genetic algorithm uniform design (GAUD), comparing and analyzing it against SVR and Kriging models to investigate in depth the coupling between control parameters and the critical speed. We also introduce a novel method for the prediction of the critical speed of the AMB–rotor. The main contributions of this paper are as follows:
  • An evolutionary uniform design guided by a composite discrepancy metric is proposed. The scheme balances space filling with factor coupling and yields representative training sets suited to control–dynamics analysis.
  • A surrogate that combines gated recurrent units with a Kolmogorov–Arnold network is formulated to capture nonlinear interactions between controller parameters and rotor dynamics. In conjunction with the GAUD, this approach improves the data efficiency and predication capacity relative to conventional surrogates and clarifies controller–plant interactions.
  • The integrated GRU-KAN-UD workflow replaces repeated finite element simulations with rapid surrogate evaluation, enabling fast parametric exploration and early-stage design studies, thereby lowering the computational burden without compromising modeling fidelity.
The remainder of this manuscript is organized as follows. Section 2 formulates the AMB–rotor dynamic model. Section 3 details the GAUD design and GRU-KAN surrogate methodology. Section 4 presents the numerical and experimental results. Finally, Section 5 summarizes our findings.

2. AMB System Model

The AMB–rotor system is a prototypical, inherently unstable system. Therefore, unlike conventional rotor dynamics, the modeling and dynamic analysis of AMB–rotors primarily focus on control issues. The controlled plant includes not only the rotor structure but also displacement-sensing elements and power amplification units. To analyze and study the dynamics of the AMB–rotor, a dynamic model of the AMB system is first established. Figure 1 shows the five-degree-of-freedom configuration of the AMB–rotor. The structural parameters of the AMB and rotor are listed in Table 1 and Table 2.

2.1. Construction of the AMB–Rotor Dynamic Model

The dynamic equation of the AMB system can be written as
m x · · + c e x · + k e x = f
where m is the rotor mass, x the displacement, f the force acting on the rotor, c e the equivalent damping coefficient, and k e the equivalent stiffness coefficient.
Taking the Laplace transform of Equation (1) under zero initial conditions yields the frequency-domain transfer function (with s = j ω )
X ( j ω ) F ( j ω ) = 1 m ω 2 + k e + c e j ω
where s denotes the Laplace variable, j = 1 is the imaginary unit, and ω is the angular frequency.
In this paper, an eddy current sensor is selected as the displacement sensor due to its simple structure, compact size, and high sensitivity. Its transfer function is
G s ( s ) = A s
where A s denotes the sensor gain.
Within the frequency response range, the power amplifier can be modeled as a first-order inertial element:
G a ( s ) = A a T a s + 1
where A a is the amplifier gain and T a its time constant.
The support characteristics of the AMBs are principally determined by the control system. Thus, an analysis of stiffness and damping must be carried out under a given control strategy. Different controller parameters or algorithms yield distinct dynamic support properties. The closed-loop transfer function in the frequency domain is given in [25]:
X ( j ω ) F ( j ω ) = 1 m ω 2 + k x x + k i x G s ( j ω ) G a ( j ω ) G c ( j ω )
where k x x is the displacement–stiffness coefficient and k i x the current–stiffness coefficient, and G c ( j ω ) denotes the controller transfer function.
The electromagnetic stiffness and damping coefficients are defined by
k x x = μ 0 n 2 A I 0 2 2 x 0 3 k i x = μ 0 n 2 A I 0 2 x 0 2
with μ 0 being the permeability of free space, n the coil turns, A the pole area, I 0 the bias current, and x 0 the one-side air gap.
By combining Equations (2) and (5), substituting Equations (3) and (4), and applying the incomplete differential PID algorithm, the expression for the dynamic support characteristics is obtained:
k e = k x x + k i x Re G c ( s ) G s ( s ) G a ( s ) = k x x + k i x Re K p + K i s + K d s 1 + T d s A s A a 1 + T a s , c e = k i x ω Im G c ( s ) G s ( s ) G a ( s ) = k i x ω Im K p + K i s + K d s 1 + T d s A s A a 1 + T a s .
where K p , K i , K d , and T d are the proportional, integral, derivative, and incomplete differential gains of the PID controller; Re { · } and Im { · } denote the real and imaginary parts of a transfer function, respectively.
Based on these equivalent stiffness and damping definitions, FEM was performed in Ansys Workbench. A nominal mesh size of 5 mm was used, yielding 92,442 nodes and 26,114 elements after meshing. Boundary conditions reflect the five-DOF configuration: the AMB supports were modeled as spring connections located at the bearing stations. Using the Routh stability criterion, parameters were chosen as K p = 2000 , K i = 0.1 , K d = 20 , T d = 1 × 10 4 and I 0 = 1.4 A, resulting in equivalent stiffness of 6.23 × 10 6 N/m and damping of 4.33 × 10 4 N·s/m. Gyroscopic effects were enabled for rotational speed sweeps to generate the Campbell diagram in Figure 2. As the speed increases, gyroscopic effects split the rotor’s characteristic frequencies: the forward whirl rises, while the backward whirl falls, yielding a first critical speed of 24,870.54 rpm.

2.2. Effects of Individual Control Parameters on Critical Speed

Based on the FEM, the influence of each single control parameter (proportional gain K p , integral gain K i , derivative gain K d , incomplete differential gain T d , and bias current I 0 ) on the AMB–rotor’s critical speed was investigated. Each parameter was divided into ten equally spaced levels over its range (Table 3), while the other parameters were held at their mean values. FEM yielded the normalized first-order critical speed curves versus each parameter, as shown in Figure 3. The analysis indicates that increasing K p significantly lowers the first critical speed; similarly, increases in K d and I 0 also reduce it, albeit with a more gradual trend than for K p . By contrast, variations in K i and T d have negligible effects on the critical speed.

3. Surrogate Modeling of Control Parameters Versus Critical Speed

The direct sensitivity analysis of the control parameters is challenging due to the high dimensionality, strong parameter coupling, and substantial computation time. Surrogate models offer an efficient alternative, enabling the rapid evaluation of multivariable effects with significantly reduced complexity [26].
In this paper, we build a surrogate that maps the five control parameters ( K p , K i , K d , I 0 , T d ) to the first critical speed. All FEM and surrogate model training was performed on a workstation with an Intel i9-9900 CPU (INTEL CHINA Co. Ltd., Shanghai, China) and 64 GB RAM (Shenzhen Kingbank Technology Co. Ltd., Shenzhen, China).

3.1. Surrogate Model Framework

The surrogate model construction follows two primary steps: the selection of initial samples and model training [27].

3.1.1. Uniform Design Sampling

Traditional uniform design can produce anisotropic samples in high-dimensional spaces [28], while Latin hypercube sampling may miss the response surface curvature [24]. We introduce a genetic algorithm-enhanced uniform design (GAUD) to generate more representative samples [29]. The flowchart of GAUD can be seen in Figure 4.
For n experiments, compute the set of integers coprime to n,
H = { h 1 , h 2 , , h m } , m = ϕ ( n ) ,
where ϕ ( n ) is Euler’s totient function. Form an n × m candidate matrix U = [ u i j ] ,
u i j = i · h j mod n , i = 1 , , n , j = 1 , , m .
Map each entry to the midpoint of ( 0 , 1 ) ,
x i j = u i j + 0.5 n , x i j ( 0 , 1 ) ,
yielding m columns of factor levels. These samples initialize the genetic algorithm, guided by a composite bias measure that adapts through fitness-driven iterations to capture both sample diversity and parameter coupling.
The algorithm minimizes the mixed bias measure MD 2 ( x ) ,
min x X n × n f MD 2 ( x ) ,
where n f is the number of factors and X n × n f the sample table. The measure is defined as
M D 2 ( x ) = 19 12 n f 2 n i = 1 n j = 1 n f 5 3 1 4 x i j 1 2 1 4 x i j 1 2 2 + 1 n 2 i = 1 n k = 1 n j = 1 n f 15 8 1 4 | x i j 1 2 | 1 4 | x k j 1 2 | 3 4 | x i j x k j | + 1 2 | x i j x k j | 2

3.1.2. Support Vector Regression

SVR allows an error tolerance zone ε s , penalizing only deviations beyond this threshold [30]. The prediction function is
f ( x ) = v T ϕ ( x ) + b
where ϕ ( x ) is the nonlinear mapping function, v the weight vector, and b the bias term. SVR minimizes the regularized risk
min v , b , { ξ i , ξ ^ i } 1 2 v 2 + C i = 1 m ξ i + ξ ^ i
subject to
y i f ( x i ) ε s + ξ i , f ( x i ) y i ε s + ξ ^ i , ξ i , ξ ^ i 0 ,
where C > 0 is a regularization parameter balancing model complexity and tolerance, and ξ i , ξ ^ i are slack variables. By applying Lagrange duality, the solution takes the form
f ( x ) = i = 1 m α i α ^ i K ( x , x i ) + b
where α i , α ^ i are Lagrange multipliers and K ( · , · ) is the kernel function mapping inputs to a high-dimensional feature space.

3.1.3. Kriging

Kriging represents the response surface G ( Θ ) as the combination of a regression term and a Gaussian process [31]
G ( Θ ) = i = 1 p β i w i ( Θ ) + z ( Θ )
where w i ( Θ ) denotes the chosen basis functions, β i denotes the regression coefficients fitted to the data, and z ( Θ ) N ( 0 , σ 2 ) is a zero-mean Gaussian process capturing local deviations. The covariance between two sample points Θ ( i ) and Θ ( j ) is [32]
C o v [ z ( θ ( i ) ) , z ( θ ( j ) ) ] = σ 2 [ R ( θ ( i ) , θ ( j ) ) ]
where the matrix R encodes correlations critical to the prediction accuracy. The regression coefficients and process variance are estimated by
β ^ = W T R 1 W 1 W T R 1 m k σ ^ 2 = ( m k W β ^ ) T R 1 ( m k W β ^ ) / N T
with m k being the vector of the observed responses and W the N T × p regression matrix. The hyperparameters ε = [ ε 1 , , ε b ] T governing the covariance function are obtained by maximizing the log-likelihood:
m a x F ε = N T l n σ ^ 2 + l n R 2 ε 0

3.1.4. Gated Recurrent Unit–Kolmogorov–Arnold Network

The GRU-KAN architecture combines gated recurrent units with Kolmogorov–Arnold networks to leverage the GRU’s temporal modeling and the KAN’s universal approximation, improving both the accuracy and interpretability [33]. Its structure is shown in Figure 5.
The GRU core updates are
r t = σ W r x t + U r h t 1 + b r z t = σ W z x t + U z h t 1 + b z h ˜ t = tanh W h x t + U h ( r t h t 1 ) + b h h t = ( 1 z t ) h t 1 + z t h ˜ t
where z t and r t are the reset and update gates; h ˜ t is the candidate state; h t is the hidden state; σ ( · ) is the sigmoid function; tanh ( · ) is the hyperbolic tangent; and W * , U * , b * are the weight and bias parameters.
The KAN decomposes a multivariate function into univariate components
f x 1 , x 2 , , x n = q = 1 2 n + 1 ϕ q p = 1 n ψ p , q ( x p )
where both ϕ q ( · ) and ψ p , q ( · ) are one-dimensional continuous functions.
x ( l + 1 ) = Φ ( l ) x ( l ) ,
with
Φ ( l ) = ϕ 1 , 1 ( l ) ( · ) ϕ 1 , 2 ( l ) ( · ) ϕ 1 , n l ( l ) ( · ) ϕ 2 , 1 ( l ) ( · ) ϕ 2 , 2 ( l ) ( · ) ϕ 2 , n l ( l ) ( · ) ϕ n l + 1 , 1 ( l ) ( · ) ϕ n l + 1 , 2 ( l ) ( · ) ϕ n l + 1 , n l ( l ) ( · )
where n l is the number of units in layer l, x i ( l ) is the projection vector for unit i, and ϕ i , j ( l ) ( · ) is the univariate activation function applied to its input.

3.2. Model Result Analysis

At n = 100 and n f = 5 , the initial samples generated by LHS and GAUD are shown in Figure 6. For the GAUD sampling procedure, the population size is set to 30 and the number of iterations to 200. The convergence behavior under these settings is illustrated in Figure 7. Using the mixed bias measure from Equation (12), GAUD achieves MD 2 ( x ) = 0.0036 , significantly lower than LHS’s 0.012 , indicating a more uniform sample space.
To demonstrate the feasibility and advantages of the GRU-KAN model built on GAUD (GRU-KAN-UD), we compare five mappings from the control parameters to the critical speed: GRU-KAN-LHS, Kriging-UD, Kriging-LHS, SVR-UD, and SVR-LHS. All surrogate inputs are min–max normalized to [ 0 , 1 ] . The hyperparameters for GRU-KAN, SVR, and Kriging are tuned via Bayesian optimization with five-fold cross-validation. The search spaces are summarized in Table 4.
The residuals for the first critical speed are plotted in Figure 8. Compared with LHS sampling, Kriging-UD and GRU-KAN-UD maintain high confidence across both low and high critical speeds, confirming that GAUD yields a broader yet accurate sample space.
The error distributions are compared in Figure 9. All models exhibit approximately normal error distributions. GAUD sampling shows a smaller interquartile range than LHS sampling, indicating tighter error concentration and superior predictive performance; GRU-KAN-UD has the smallest range and interquartile spread.
For a quantitative evaluation, we compute the following metrics: mean absolute error ( E m ), mean absolute percentage error ( E m p ), mean square error ( E m s ), and root mean square error ( E r m s ).
E m = 1 n i = 1 n y i y ^ i ,
E m p = 100 % n i = 1 n y i y ^ i y i ,
E m s = 1 n i = 1 n y i y ^ i 2 ,
E r m s = 1 n i = 1 n y i y ^ i 2 .
The coefficient of determination is
R 2 = 1 i = 1 n ( y i y ^ i ) 2 i = 1 n ( y i y ¯ ) 2 ,
where n is the number of samples, y i the observed value, y ^ i the predicted value, and y ¯ the sample mean.
The standard deviation is
SD = 1 N i = 1 N y ^ i y ¯ 2 .
The evaluation metrics are summarized in Table 5. GRU-KAN-UD delivers the lowest errors and the highest R 2 = 0.9887 , indicating a highly reliable model (an R 2 above 0.9 denotes strong confidence). Furthermore, its SD most closely matches the actual SD ( 3.78 × 10 2 rpm).
To compare the relative performance across different metrics, we normalize each error measure E i , j (where j { m , m p , m s , r m s } indexes E m , E m p , E m s , E r m s and i , k indexes the model) as
E n o m , i , j = max k E k , j E i , j i = 1 n max k E k , j E i , j
and similarly normalize the coefficient of determination:
R n o m , i 2 = R i 2 i = 1 n R i 2
The radar chart in Figure 10 visualizes each model’s composite performance, with a larger enclosed area indicating better overall metrics. GRU-KAN-UD notably outperforms all others, while SVR-UD’s normalized E m and E m p closely match those of Kriging-LHS, further confirming the advantage of GAUD sampling.

3.3. Analysis of Control Parameter Coupling and Critical Speed

Using the GRU-KAN-UD surrogate, we examine the response surfaces of the first critical speed as a function of the bias current I 0 and each control parameter.
Figure 11 shows the surface for ( I 0 , K p ) . As demonstrated in Equations (6) and (7), the parameters k x x and k i x are proportional to I 0 2 and I 0 , respectively. The equivalent stiffness k e = k x x + k i x Re G c ( · ) G s ( · ) G a ( · ) ; thus, both K p (entering Re { G c } ) and I 0 strongly regulate k e and, in turn, the critical speed. Furthermore, across the full I 0 range, the critical speed decreases monotonically with K p , with the rate of decline inversely related to I 0 . For K p [ 2000 , 11,000 ] , the critical speed also decreases monotonically with I 0 ; for K p ( 11,000 , 20,000 ] , the response to I 0 is non-monotonic—initially decreasing and then rising. A comparison with Figure 3 reveals that the FEM trends hold only at a specific K p , demonstrating the surrogate’s value in capturing full coupling effects.
Figure 12 shows the surface for ( I 0 , K i ) . From Equation (7), K i contributes mainly at low frequencies to Im { G c } , so its effect on k e and c e is limited within the operational bandwidth. Consequently, the influence of K i on the critical speed is expected to be weak. Furthermore, when I 0 [ 0.8 , 1.3 ] , the critical speed decreases monotonically with K i . For I 0 ( 1.3 , 2 ] , the response is non-monotonic: it increases slightly before decreasing.
Figure 13 shows the surface for ( I 0 , K d ) . From Equations (6) and (7), K d amplifies Im { G c } in the relevant band, so K d directly increases c e , while I 0 scales with k i x and simultaneously affects c e . Moreover, because I 0 exerts an influence on both k e and c e , its net impact on the critical speed is stronger than that of K d . Accordingly, for I 0 [ 1.1 , 2 ] , the critical speed increases with K d . For I 0 [ 0.8 , 1.1 ) , the response is non-monotonic: it rises and then falls.
Figure 14 shows the surface for ( I 0 , T d ) . From Equation (7), T d exerts a significant influence on the derivative filter, while contributing marginally to k e and c e within the operational frequency range; hence, a limited effect on the critical speed is expected. Accordingly, when I 0 [ 0.8 , 1.1 ] , the critical speed exhibits a non-monotonic increase and then decrease with T d . For I 0 ( 1.1 , 2 ] , it decreases monotonically as T d increases.

4. Experimental Validation

An acceleration test was performed to validate the AMB–rotor surrogate model based on GRU-KAN-UD. A PID controller with gains K p = 2000 , K i = 0.1 , K d = 20 , T d = 1 × 10 4 and bias current I 0 = 1.4 A was implemented to maintain stable levitation. The test platform schematic is shown in Figure 15.
Starting from rest, the rotor was accelerated at 60 rpm/s up to 30,000 rpm. The first critical speed was determined to be 24,918.00 rpm from the vibration response during acceleration, as can be seen in Figure 16.
Table 6 compares the first critical speeds obtained from the surrogate model, the FEM, and the experimental measurement. The FEM’s prediction deviated from the experiment by 47.46 rpm ( 1.90 × 10 1 % ), confirming its accuracy and providing a solid foundation for surrogate construction. The surrogate model’s prediction differed from the experiment by only 51.92 rpm ( 2.08 × 10 1 % ), demonstrating close agreement among the model, simulation, and test.
In terms of computation time, the surrogate evaluation required only 1.14 × 10 4 s versus 201 s for FEM—an improvement of over four orders of magnitude. This indicates that, once trained, the surrogate could rapidly yield key dynamic parameters without further simulation or experiments.
To assess the robustness across control settings, six values of K p were tested with I 0 = 1 A, K i = 0.1 , K d = 14 , T d = 1 × 10 4 ; the results are shown in Figure 17. Similarly, six values of K d were tested with I 0 = 1 A, K i = 0.1 , K p = 10,000, T d = 1 × 10 4 ; the results appear in Figure 18. Furthermore, six values of I 0 were tested with K d = 16 A, K i = 0.1 , K p = 15,000, T d = 1 × 10 4 ; the results are shown in Figure 19.
Across 18 parameter combinations, the surrogate maintained high accuracy over the design space: relative to the experiment, E m = 38.51 rpm and E m p = 1.56 × 10 1 % ; relative to FEM, E m = 14.30 rpm and E m p = 5.80 × 10 2 % .

5. Conclusions

The key findings are as follows:
  • We introduce a surrogate-based method for the analysis of the coupling between the control parameters and the AMB–rotor’s first critical speed. The GRU-KAN-UD surrogate achieves a mean absolute error of 38.51 rpm and a mean absolute percentage error of 1.56 × 10 1 %. The experimental results confirm the surrogate’s feasibility and precision, providing a solid theoretical and practical foundation for AMB–rotor dynamic modeling.
  • Compared with FEM, the trained surrogate model reduces the computation time from 201 s to 1.14 × 10 4 s while maintaining high fidelity, demonstrating a significant speed advantage.
  • Unlike single-parameter studies, the surrogate approach captures the interaction effects among controller gains, offering a more comprehensive dynamic analysis.
  • We propose a genetic algorithm-enhanced uniform design for sample selection, improving the robustness of surrogate training. The resulting GRU-KAN-UD attains the highest goodness of fit among all candidates ( R 2 = 0.9887 ) and the lowest mean absolute error in cross-validation ( 22.06 rpm), with error reductions of over 60% relative to Kriging-LHS and over 70% relative to SVR-LHS.
The GRU-KAN surrogate is tailored to AMB–rotor dynamics and performs well on the studied configuration. However, its ability to generalize across other fields may be limited. Future work may explore ensemble learning and domain adaptation to enhance the robustness and out-of-domain generalization.
The present study focuses on predicting the critical speed; transient amplitude prediction under time-varying conditions remains to be explored. To broaden the applicability in magnetic bearing rotors, future extensions should incorporate external loads and system condition variations such as temperature rises.
The surrogate provides practical utility for controller pre-tuning and for design space exploration from a rotor dynamics perspective, enabling efficient early-stage assessment without repeated high-fidelity simulations.

Author Contributions

J.C.: writing—original draft, methodology, formal analysis, data curation, conceptualization; J.L.: supervision, conceptualization, project administration, funding acquisition; Z.Z.: methodology, conceptualization; Y.L.: writing—original draft, formal analysis; F.C.: supervision, methodology, conceptualization. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by the Shaanxi Provincial Key R&D Program (No. 2023-ZDLGY-17 and No. S2025-YF-ZDXM-CYJQ-0186) and the Xi’an Major Science and Xi’an Science and Technology Program Project (No. 23ZCKCGZH0008).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data will be made available by the corresponding author on request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Falkowski, K.; Kurnyta-Mazurek, P.; Szolc, T.; Henzel, M. Radial magnetic bearings for rotor–shaft support in electric jet engine. Energies 2022, 15, 3339. [Google Scholar] [CrossRef]
  2. Wu, T.; Zhang, W. Review on Key Development of Magnetic Bearings. Machines 2025, 13, 113. [Google Scholar] [CrossRef]
  3. Zhu, Z.; Qi, G.; Xu, J.; Li, X. Dynamic Modeling of the Support System for Vehicle-Mounted Spherical Magnetic Suspension Flywheel Battery. IEEE Trans. Energy Convers. 2023, 39, 322–333. [Google Scholar] [CrossRef]
  4. Uzhegov, N.; Smirnov, A.; Park, C.H.; Ahn, J.H.; Heikkinen, J.; Pyrhönen, J. Design aspects of high-speed electrical machines with active magnetic bearings for compressor applications. IEEE Trans. Ind. Electron. 2017, 64, 8427–8436. [Google Scholar] [CrossRef]
  5. Sun, J.; Zhou, H.; Ma, X.; Ju, Z. Study on PID tuning strategy based on dynamic stiffness for radial active magnetic bearing. ISA Trans. 2018, 80, 458–474. [Google Scholar] [CrossRef]
  6. Wei, Z.; Song, C.; Li, J.; Li, M. Research on vibration characteristics of magnetic suspension bearing-rotor system under variable supporting stiffness. Bearing 2022, 3, 15–22. [Google Scholar]
  7. Kumar, P.; Tiwari, R. Finite element modelling, analysis and identification using novel trial misalignment approach in an unbalanced and misaligned flexible rotor system levitated by active magnetic bearings. Mech. Syst. Signal Process. 2021, 152, 107454. [Google Scholar] [CrossRef]
  8. Kumar, P.; Tiwari, R. Dynamic analysis and identification of unbalance and misalignment in a rigid rotor with two offset discs levitated by active magnetic bearings: A novel trial misalignment approach. Propuls. Power Res. 2021, 10, 58–82. [Google Scholar] [CrossRef]
  9. Mushi, S.E.; Lin, Z.; Allaire, P.E. Design, construction, and modeling of a flexible rotor active magnetic bearing test rig. IEEE/ASME Trans. Mechatronics 2011, 17, 1170–1182. [Google Scholar] [CrossRef]
  10. van Rensburg, J.J.; Van Schoor, G.; Van Vuuren, P.; van Rensburg, A.J. Non-linear model of rotor delevitation in an active magnetic bearing system including parameter estimation. Measurement 2017, 109, 256–267. [Google Scholar] [CrossRef]
  11. Sun, J.; Zhou, H.; Ju, Z. Dynamic stiffness analysis and measurement of radial active magnetic bearing in magnetically suspended molecular pump. Sci. Rep. 2020, 10, 1401. [Google Scholar] [CrossRef] [PubMed]
  12. Chen, Y.; Du, L.; Sun, Q.; Bai, J. A Disturbance Compensation Control Strategy for Rotational Speed Standard Device Based on AMB System. Sensors 2024, 24, 3093. [Google Scholar] [CrossRef]
  13. Hutterer, M.; Schrödl, M. Modeling and mu-synthesis control of a flexible rotor stabilized by active magnetic bearings including current free control. J. Sound Vib. 2023, 546, 117439. [Google Scholar] [CrossRef]
  14. Sugimoto, H.; Chiba, A. Dynamic modeling and experimental validations of passing through critical speeds by high acceleration in one-axis actively positioned bearingless motors. IEEE Trans. Ind. Appl. 2021, 57, 6956–6964. [Google Scholar] [CrossRef]
  15. Shata, A.M.A.H.; Hamdy, R.A.; Abdelkhalik, A.S.; El-Arabawy, I. A fractional order PID control strategy in active magnetic bearing systems. Alex. Eng. J. 2018, 57, 3985–3993. [Google Scholar] [CrossRef]
  16. Song, X.; Lv, L.; Sun, W.; Zhang, J. A radial basis function-based multi-fidelity surrogate model: Exploring correlation between high-fidelity and low-fidelity models. Struct. Multidiscip. Optim. 2019, 60, 965–981. [Google Scholar] [CrossRef]
  17. Chelladurai, S.J.S.; Murugan, K.; Ray, A.P.; Upadhyaya, M.; Narasimharaj, V.; Gnanasekaran, S. Optimization of process parameters using response surface methodology: A review. Mater. Today Proc. 2021, 37, 1301–1304. [Google Scholar] [CrossRef]
  18. Cai, F.; Huang, X. Study on Trajectory Optimization of Hypersonic Vehicle Based on Neural Network. Front. Energy Res. 2022, 10, 884624. [Google Scholar] [CrossRef]
  19. Sinou, J.J.; Denimal, E. Reliable crack detection in a rotor system with uncertainties via advanced simulation models based on kriging and Polynomial Chaos Expansion. Eur. J. Mech.-A/Solids 2022, 92, 104451. [Google Scholar] [CrossRef]
  20. Zhou, Z.; Dai, Y.; Wang, G.; Li, S.; Pang, J.; Zhan, S. Thermal displacement prediction model of SVR high-speed motorized spindle based on SA-PSO optimization. Case Stud. Therm. Eng. 2022, 40, 102551. [Google Scholar] [CrossRef]
  21. Shuai, H.; Li, F. Physics-informed kolmogorov-arnold networks for power system dynamics. IEEE Open Access J. Power Energy 2025, 12, 46–58. [Google Scholar] [CrossRef]
  22. Tian, J.; Hou, M.; Bian, H.; Li, J. Variable surrogate model-based particle swarm optimization for high-dimensional expensive problems. Complex Intell. Syst. 2023, 9, 3887–3935. [Google Scholar] [CrossRef]
  23. Bogoclu, C.; Roos, D.; Nestorović, T. Local Latin hypercube refinement for multi-objective design uncertainty optimization. Appl. Soft Comput. 2021, 112, 107807. [Google Scholar] [CrossRef]
  24. Sheikholeslami, R.; Razavi, S. Progressive Latin Hypercube Sampling: An efficient approach for robust sampling-based analysis of environmental models. Environ. Model. Softw. 2017, 93, 109–126. [Google Scholar] [CrossRef]
  25. Xiang, B.; Shi, Z.; Wen, T.; Liu, H. Attitude stabilization control of three-axis inertial stabilized platform using magnetically suspension system. Measurement 2025, 253, 117858. [Google Scholar] [CrossRef]
  26. Kudela, J.; Matousek, R. Recent advances and applications of surrogate models for finite element method computations: A review. Soft Comput. 2022, 26, 13709–13733. [Google Scholar] [CrossRef]
  27. Han, Z.; Xu, C.; Qiao, J.; Liu, F.; Chi, J.; Meng, G.; Zhang, K.; Song, W. Recent progress of efficient global aerodynamic shape optimization using surrogate-based approach. Acta Aeronaut. Astronaut. Sin. 2020, 41, 30–70. [Google Scholar]
  28. Lü, Q.; Chan, C.L.; Low, B.K. Probabilistic evaluation of ground-support interaction for deep rock excavation using artificial neural network and uniform design. Tunn. Undergr. Space Technol. 2012, 32, 1–18. [Google Scholar] [CrossRef]
  29. Cui, J.; Li, J.; Zhao, W.; Zhai, X.; Liu, W.; Chen, C. Research on High Altitude Simulation Test of Aero-engine Based on Genetic Algorithm Optimization Uniform Design. J. Ordnance Equip. Eng. 2023, 44, 285–289. [Google Scholar]
  30. Gu, J.; Hua, W.; Yu, W.; Zhang, Z.; Zhang, H. Surrogate model-based multiobjective optimization of high-speed PM synchronous machine: Construction and comparison. IEEE Trans. Transp. Electrif. 2022, 9, 678–688. [Google Scholar] [CrossRef]
  31. Qian, J.; Yi, J.; Cheng, Y.; Liu, J.; Zhou, Q. A sequential constraints updating approach for Kriging surrogate model-assisted engineering optimization design problem. Eng. Comput. 2020, 36, 993–1009. [Google Scholar] [CrossRef]
  32. Chen, X.; Li, J.; Li, B.; Zou, X.; Cai, F. The multi-stage adding strategy on Kriging and applied to cavitation optimization of centrifugal impeller. J. Northwestern Polytech. Univ. 2024, 42, 467–476. [Google Scholar] [CrossRef]
  33. Gao, Y.; Hu, Z.; Chen, W.A.; Liu, M.; Ruan, Y. A revolutionary neural network architecture with interpretability and flexibility based on Kolmogorov—Arnold for solar radiation and temperature forecasting. Appl. Energy 2025, 378, 124844. [Google Scholar] [CrossRef]
Figure 1. Structure diagram of AMB system.
Figure 1. Structure diagram of AMB system.
Sensors 25 05680 g001
Figure 2. Campbell diagram of AMB–rotor.
Figure 2. Campbell diagram of AMB–rotor.
Sensors 25 05680 g002
Figure 3. The influence of the control parameters on the dynamic characteristics of the AMB–rotor.
Figure 3. The influence of the control parameters on the dynamic characteristics of the AMB–rotor.
Sensors 25 05680 g003
Figure 4. The flowchart of GAUD.
Figure 4. The flowchart of GAUD.
Sensors 25 05680 g004
Figure 5. GRU-KAN network structure.
Figure 5. GRU-KAN network structure.
Sensors 25 05680 g005
Figure 6. Sample scatter plots of (a) LHS and (b) GAUD.
Figure 6. Sample scatter plots of (a) LHS and (b) GAUD.
Sensors 25 05680 g006
Figure 7. Convergence curve of GAUD.
Figure 7. Convergence curve of GAUD.
Sensors 25 05680 g007
Figure 8. Comparison of residuals for first critical speed predictions: (a) SVR-LHS; (b) SVR-UD; (c) Kriging-LHS; (d) Kriging-UD; (e) GRU-KAN-LHS; (f) GRU-KAN-UD.
Figure 8. Comparison of residuals for first critical speed predictions: (a) SVR-LHS; (b) SVR-UD; (c) Kriging-LHS; (d) Kriging-UD; (e) GRU-KAN-LHS; (f) GRU-KAN-UD.
Sensors 25 05680 g008
Figure 9. Comparison of error distributions under different models.
Figure 9. Comparison of error distributions under different models.
Sensors 25 05680 g009
Figure 10. Radar chart of normalized metrics for different models.
Figure 10. Radar chart of normalized metrics for different models.
Sensors 25 05680 g010
Figure 11. Response surface of first critical speed versus I 0 and K p .
Figure 11. Response surface of first critical speed versus I 0 and K p .
Sensors 25 05680 g011
Figure 12. Response surface of first critical speed versus I 0 and K i .
Figure 12. Response surface of first critical speed versus I 0 and K i .
Sensors 25 05680 g012
Figure 13. Response surface of first critical speed versus I 0 and K d .
Figure 13. Response surface of first critical speed versus I 0 and K d .
Sensors 25 05680 g013
Figure 14. Response surface of first critical speed versus I 0 and T d .
Figure 14. Response surface of first critical speed versus I 0 and T d .
Sensors 25 05680 g014
Figure 15. Schematic diagram of the test platform.
Figure 15. Schematic diagram of the test platform.
Sensors 25 05680 g015
Figure 16. Experimental results of the AMB–rotor vibration responses.
Figure 16. Experimental results of the AMB–rotor vibration responses.
Sensors 25 05680 g016
Figure 17. Experimental validation under varying K p .
Figure 17. Experimental validation under varying K p .
Sensors 25 05680 g017
Figure 18. Experimental validation under varying K d .
Figure 18. Experimental validation under varying K d .
Sensors 25 05680 g018
Figure 19. Experimental validation under varying I 0 .
Figure 19. Experimental validation under varying I 0 .
Sensors 25 05680 g019
Table 1. Basic parameters of the radial AMB.
Table 1. Basic parameters of the radial AMB.
ParameterValue
Bias current (A)1.4
Number of coil turns171
Pole area (mm2)742
One side air gap (mm)0.8
Table 2. Basic parameters of the AMB–rotor.
Table 2. Basic parameters of the AMB–rotor.
ParameterVariableValue
Rotor mass (kg)m4.3
Total rotor length (mm)l502
Distance between radial AMB centers (mm) l 1 192
Distance between end sensor centers (mm) l s 296
Right AMB to thrust disk distance (mm) l a 287.5
Left AMB to thrust disk distance (mm) l b 95.5
Table 3. Ranges of control parameters.
Table 3. Ranges of control parameters.
Control ParameterRange
K p [2000, 20,000]
K i [0.01, 0.1]
K d [8, 20]
T d [ 5 × 10 5 , 1 × 10 4 ]
I 0 (A)[0.8, 2]
Table 4. Hyperparameter search spaces.
Table 4. Hyperparameter search spaces.
ModelHyperparameterRange/Options
GRU-KANBatch size{16, 32, 64, 128}
Learning rate [ 10 4 , 10 1 ]
Hidden dimensionintegers [ 16 , 64 ]
GRU layersintegers [ 1 , 3 ]
SVRKernel type{linear, poly, rbf, sigmoid}
Regularization strength [ 0.1 , 100 ]
Epsilon tube width [ 0.01 , 1.0 ]
KrigingKernel type{rbf, matern}
Length scale [ 10 3 , 10 3 ]
Constant value [ 10 3 , 10 3 ]
Noise level [ 10 10 , 10 3 ]
Table 5. First critical speed prediction error.
Table 5. First critical speed prediction error.
Method E m (rpm) E mp (%) E ms E rms (rpm) R 2 SD (rpm)
SVR-LHS83.03 3.40 × 10 1 2.26 × 10 4 150.310.8157 2.96 × 10 2
SVR-UD60.21 2.42 × 10 1 1.61 × 10 4 126.970.8877 3.14 × 10 2
Kriging-LHS57.60 2.36 × 10 1 1.12 × 10 4 105.770.9087 3.31 × 10 2
Kriging-UD32.47 1.31 × 10 1 2.46 × 10 3 49.580.9829 3.63 × 10 2
GRU-KAN-LHS43.06 1.78 × 10 1 1.08 × 10 4 103.970.9118 3.03 × 10 2
GRU-KAN-UD22.06 8.88 × 10 2 1.63 × 10 3 40.320.9887 3.63 × 10 2
Table 6. Comparison of the first critical speeds of various models.
Table 6. Comparison of the first critical speeds of various models.
Model/Test TypeCritical Speed (rpm) E m (rpm) E mp (%)
GRU-KAN-UD24,866.0851.92 2.08 × 10 1
FEA24,870.5447.46 1.90 × 10 1
Acceleration test24,918.00--
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Cui, J.; Li, J.; Cai, F.; Zhao, Z.; Liu, Y. A GRU-KAN Surrogate Model with Genetic Algorithm Uniform Sampling for Active Magnetic Bearings–Rotor Critical Speed Prediction. Sensors 2025, 25, 5680. https://doi.org/10.3390/s25185680

AMA Style

Cui J, Li J, Cai F, Zhao Z, Liu Y. A GRU-KAN Surrogate Model with Genetic Algorithm Uniform Sampling for Active Magnetic Bearings–Rotor Critical Speed Prediction. Sensors. 2025; 25(18):5680. https://doi.org/10.3390/s25185680

Chicago/Turabian Style

Cui, Jiahang, Jianghong Li, Feichao Cai, Zhenmin Zhao, and Yuxi Liu. 2025. "A GRU-KAN Surrogate Model with Genetic Algorithm Uniform Sampling for Active Magnetic Bearings–Rotor Critical Speed Prediction" Sensors 25, no. 18: 5680. https://doi.org/10.3390/s25185680

APA Style

Cui, J., Li, J., Cai, F., Zhao, Z., & Liu, Y. (2025). A GRU-KAN Surrogate Model with Genetic Algorithm Uniform Sampling for Active Magnetic Bearings–Rotor Critical Speed Prediction. Sensors, 25(18), 5680. https://doi.org/10.3390/s25185680

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop