1. Introduction
Blades are critical components in turbomachinery such as compressors and gas turbines. They operate under severe vibration, centrifugal, and thermal loads, making condition-based maintenance supported by reliable online health monitoring essential to extend their useful life and ensure safe operation [
1]. Vibration monitoring techniques can be divided into contact and non-contact categories. Traditional contact measurement involves complex preparation, low signal transmission reliability, and may alter the mechanical characteristics of the blade [
2]. Non-contact techniques, in contrast, offer ease of use, low installation cost, high sensitivity, and the ability to monitor the whole blade circle simultaneously [
3]. Recent reviews of the literature indicate that BTT research has evolved from basic non-contact vibration measurement toward a more integrated framework involving sensor design, probe-layout optimization, signal processing, uncertainty analysis, and structural health monitoring applications. In particular, Wang et al. provided a systematic review of BTT technology for aeroengines, covering the theoretical foundations of BTT, commonly used sensor types, optimization strategies for sensor placement and sensor number, major signal-processing and parameter-identification methods, as well as the main sources of measurement errors and uncertainties. These studies show that the practical performance of BTT depends not only on the identification algorithm itself, but also on the coupled effects of sensing configuration, undersampling characteristics, signal quality, and uncertainty management [
4,
5]. Meanwhile, the interpretation of blade vibration signatures also relies on an adequate understanding of blade dynamics and structural modeling. Recent surveys on aero-engine blade dynamics have systematically reviewed lumped-mass, finite-element, and semi-analytical models, together with their associated solution methods and vibration characteristics. In addition, review work on rotating cracked blades has summarized crack propagation mechanisms, open/breathing crack models, and the corresponding dynamic modeling approaches for fault-related vibration analysis. Representative nonlinear dynamic studies, such as the analysis of rotating shrouded blades with impacts, further demonstrate that centrifugal stiffening, spin softening, Coriolis effects, and contact/impact interactions can significantly affect blade vibration responses. These modeling studies provide the physical background for understanding why robust identification methods are required when BTT data are acquired under complex operating conditions [
5,
6,
7].
Blade Tip Timing (BTT) has emerged as an efficient non-contact method for blade vibration monitoring [
8]. It employs tip timing sensors to record precise blade passage times, from which instantaneous vibrational displacements are calculated. However, BTT signals are inherently undersampled (the sampling rate equals the rotational frequency) and are contaminated by various noise sources, leading to reduced accuracy compared to contact methods [
3]. Moreover, the absence of a high-precision once-per-revolution (OPR) probe further complicates the determination of theoretical arrival times [
9].
Against this background, considerable research has focused on two closely related issues in BTT-based blade vibration monitoring: accurate reconstruction of blade arrival-time/displacement information without a dedicated OPR probe, and reliable identification of asynchronous vibration parameters from severely undersampled signals. Russhard proposed a linear fitting approach to generate a fictitious OPR signal [
10], while Chen et al. improved the accuracy using a high-precision method (CR-BTT) [
11]. The fictitious OPR technique, however, suffers from large errors when rotational speed fluctuates significantly. Alternative methods exploit vibration differences among blades or the same blade at different times [
12,
13]. Guo et al. used the Levenberg–Marquardt algorithm to identify blade vibrations without an OPR probe [
13]. Wang et al. applied a Savitzky–Golay filter to improve BTT estimation under rapid speed fluctuation [
14]. Xu et al. developed a recursive BTT algorithm in the angular domain [
15]. Wang et al. also proposed a none-OPR BTT-based blade vibration parameter identification method [
16]. These approaches aim to accurately calculate the expected blade arrival times, which are essential for deriving vibration displacements.
Once vibration displacements are obtained, parameter identification for asynchronous vibrations is typically performed using spectral methods. The asynchronous vibration frequency
is related to the rotational frequency
by
, where
is an integer order and
is the difference frequency (aliased component). Because
often exceeds the Nyquist frequency (
), direct spectral analysis is impossible; instead, the low-frequency
is extracted from the undersampled data using the difference-frequency principle [
17]. Various techniques have been proposed to determine
and
, including autoregressive methods [
18], the “5 + 2” approach [
19], interpolation methods [
20], and least-squares fitting combined with Fourier transforms [
21]. Most of these methods rely on Fourier analysis and suffer from the inherent limitations of FFT, such as spectral leakage and the picket-fence effect, especially under non-stationary conditions. Furthermore, they often require a large number of sensors or impose strict constraints on sensor placement.
In recent years, deep learning has shown promise in feature learning and noise suppression [
22]. However, pure deep learning approaches may lack interpretability and require large training datasets. For BTT-based blade vibration monitoring, the feasibility of machine learning should be discussed separately for synchronous and asynchronous vibrations. Recent reviews of the literature have pointed out that these two vibration types differ in their under-sampling severity and signal characteristics, which means that different identification strategies are often required. For synchronous vibrations, machine learning can be feasibly used for resonance-region recognition, denoising, feature extraction, or assisting parameter identification when the response patterns are relatively structured. For asynchronous vibrations, however, the problem is generally more difficult because the signals are more severely under-sampled, more sensitive to aliasing and probe-layout effects, and often less supported by abundant labeled data. Therefore, in strongly under-sampled BTT scenarios, pure end-to-end machine learning may be less reliable than hybrid approaches that combine physical signal models with learning-based modules. From this viewpoint, machine learning is more suitable as an auxiliary tool for preprocessing, feature enhancement, or condition classification, while the final parameter identification still benefits from physically interpretable constraints.
Hybrid methods that combine signal processing with learning-based optimization offer a compelling alternative. The present work develops a multi-stage robust Bayesian high-resolution identification framework that systematically integrates several enhancements: (i) a Kalman-filter-based recursive algorithm for accurate speed estimation without an OPR probe; (ii) an attention-enhanced dynamic convolutional autoencoder (ADCAE) that generates adaptive window functions to suppress spectral leakage; (iii) sub-bin interpolation applied to all-phase FFT (APFFT) spectra to overcome the resolution limit; (iv) a Tukey biweight error function for robust aggregation of multi-channel phase information; (v) a Bayesian prior over the vibration order to guide the estimation; and (vi) a coarse-to-fine multi-stage search strategy to reduce computational cost. Experiments on a rotor-blade test bench demonstrate the effectiveness of each component and the overall method.
From this perspective, the contribution of this work is not the mere combination of several advanced techniques, but the construction of a minimally sufficient pipeline for difficult BTT conditions. The recursive digital algorithm improves arrival-time reconstruction when an OPR signal is unavailable; the adaptive window and APFFT/interpolation improve spectral concentration and frequency resolution; the robust cost function mitigates the influence of corrupted channels and channels with unequal data quality; the Bayesian prior constrains the solution to physically plausible orders; and the multi-stage search improves efficiency without changing the estimation target itself. This design principle helps avoid treating the method as a simple accumulation of independent modules.
Although the proposed method contains multiple components, it is not intended as an unnecessarily complicated architecture. Rather, it is a problem-driven framework in which each module is introduced to address a specific limitation of conventional asynchronous BTT identification. In practical OPR-free BTT measurements, several sources of error often occur simultaneously, including rotational-speed uncertainty, spectral leakage caused by undersampling and non-stationarity, sensitivity to abnormal sensor channels, and high computational cost in order search. A single conventional technique typically addresses only one of these issues. Therefore, the proposed framework combines recursive speed estimation, adaptive spectrum optimization, robust phase aggregation, Bayesian regularization, and coarse-to-fine search into a unified pipeline. For clean and stationary cases, simpler signal-processing pipelines may still be sufficient; the full proposed framework is mainly advantageous in noisy, undersampled, OPR-free, and outlier-prone scenarios. The present formulation is intended for constant-speed or quasi-stationary conditions within each analysis window.
5. Results and Discussion
5.1. Rotor-Blade Test Bench
5.1.1. Speed Estimation Accuracy
Table 1 compares the raw speed estimates (from individual sensors) with the Kalman-filtered speed for five sensors at the target speed of 1400 rpm (noise level σ = 0.05). The raw estimates exhibit fluctuations up to ±1.16 rpm, while the filtered estimates converge to values within 0.1 rpm of the true speed. The relative error
is below 0.1% for all sensors after filtering, demonstrating the effectiveness of the recursive digital algorithm.
Figure 6 shows the decay of the Kalman gain
with the number of iterations, indicating that the filter quickly reaches steady state. The convergence of the state covariance is illustrated in the inset.
5.1.2. Dynamic Window Enhancement
Figure 7 compares the shapes of the traditional Hanning window and a dynamic window generated by the ADCAE for a typical signal segment. The dynamic window adapts to the signal’s local structure, narrowing during transient impacts and widening during steady-state sections.
Figure 8 displays the APFFT spectra of blade S1 (sensor 0) without windowing and with the dynamic window. Quantitative metrics in
Table 2 show that the dynamic window reduces the noise floor from 25.36 dB to −34.57 dB and suppresses the maximum sidelobe from 50.35 dB to −8.49 dB, while preserving the peak frequency (11.028 Hz). This confirms the superior spectral clarity achieved by the adaptive window.
Both methods yield nearly identical peak frequencies (11.03 Hz), indicating that the dynamic window does not distort the fundamental frequency but rather refines the spectral characteristics. A significant reduction in peak amplitude is observed with the dynamic window, from 1180.93 to 1.16, demonstrating its effectiveness in suppressing noise and improving the clarity of the frequency peak. The noise floor is reduced significantly from 25.356 dB (No Window) to −34.568 dB (Dynamic Window), highlighting the dynamic window’s ability to reduce noise. The side lobe maximum is drastically reduced from 50.352 dB to −8.489 dB, indicating a significant suppression of undesired spectral components. The undersampling and the narrow bandwidth of the signal’s fundamental frequency are the primary factors that prevent the accurate detection of the main lobe width.
The dynamic window function significantly improves the spectral characteristics of the BTT signals. The reduction in peak amplitude and noise floor, alongside the suppression of side lobes, indicates that the dynamic window provides a clearer frequency spectrum. This is critical for the accurate identification of vibration frequencies in noisy, under-sampled BTT signals.
To further quantify the computational cost of the ADCAE, we measured its model size and inference efficiency on a CPU platform. The ADCAE used in this study contains 1,324,928 trainable parameters and requires an average inference time of 4.37 ms per 1024-point input segment under batch size 1 (standard deviation: 0.16 ms). For the five-sensor configuration used in this study, the corresponding total window-generation time is approximately 21.85 ms when channels are processed sequentially. These results indicate that the ADCAE introduces only a limited additional online cost.
5.2. Core Identification Algorithm Performance
A key question is whether the proposed multi-stage framework is genuinely necessary compared with conventional signal-processing methods. Our view is that the answer depends on the operating condition. Under clean, stationary, and well-instrumented conditions, conventional APFFT/FFT-based methods with fixed windowing and direct phase comparison may already provide acceptable identification accuracy. However, the practical BTT scenario considered in this study is more challenging: it is OPR-free, undersampled, contaminated by noise, and potentially affected by abnormal sensor channels. Under such conditions, the identification error is not dominated by a single factor, and the combined framework becomes beneficial because each module addresses a different error source. Therefore, the motivation of the proposed method is not to increase complexity for its own sake, but to improve robustness in conditions where conventional pipelines become less reliable.
We now evaluate each component of the proposed multi-stage robust Bayesian identification method using the stress dataset (with added noise and interference). Unless otherwise stated, Monte Carlo-based experiments were repeated 50 times with different noise realizations. For the heterogeneous channel-noise study in
Section 5.2.2, 200 trials were used to obtain more stable statistics.
5.2.1. Sub-Bin Interpolation Effect
Figure 9 illustrates the benefit of sub-bin interpolation.
Figure 8 shows a close-up of the APFFT spectrum around the true difference frequency
Hz. The integer bin corresponding to
yields
Hz, with an error of 0.0047 Hz. After parabolic interpolation, the estimated
Hz, reducing the error to 0.0033 Hz. To quantitatively evaluate the improvement, 50 independent trials were conducted, and the frequency estimation errors with and without parabolic interpolation are summarized in
Table 3. The statistics show that interpolation reduces the mean error from 0.005948 Hz to 0.004990 Hz, a relative improvement of 16.1%. The median error decreases by 15.8%, and the standard deviation is reduced by 12.5%, indicating not only better accuracy but also lower variability. The minimum and maximum errors are also reduced by 7.9% and 16.9%, respectively. These results demonstrate that sub-bin interpolation effectively overcomes the resolution limit of the FFT, providing more precise frequency estimates.
5.2.2. Robustness Under Heterogeneous Channel Noise
To further address the robustness issue raised by heterogeneous sensor quality, we extended the Tukey-weighting experiment to multi-sensor cases with unequal channel-noise levels. In the original formulation, the Tukey biweight was introduced to suppress the influence of abnormal phase deviations during multi-channel aggregation. However, in a practical BTT system, different probes may exhibit substantially different noise levels due to sensor installation, local interference, or channel quality variations. In such a case, a single global threshold applied to all residuals may not always be optimal. Therefore, in addition to ordinary least-squares aggregation and global-threshold Tukey weighting, we further evaluated a channel-adaptive Tukey formulation in which each channel residual is normalized by its own noise scale before applying the Tukey biweight. This extension is consistent with the robust multi-channel phase aggregation framework introduced in
Section 3.3.
Four noise configurations were considered: a homogeneous case, a mildly heterogeneous case, a one-bad-channel case, and a strongly heterogeneous case. For each configuration, 200 Monte Carlo trials were carried out. In addition, an extra 90° phase bias was imposed on one channel; for heterogeneous cases, this bias was applied to the noisiest channel. The identification accuracy of the correct candidate order was then compared among three aggregation strategies: least squares, global-threshold Tukey weighting, and channel-adaptive Tukey weighting.
The results are summarized in
Table 4 and
Figure 10. Under nearly homogeneous channel noise, the difference between the global-threshold and channel-adaptive formulations is limited. For example, in the homogeneous case without additional bias, the identification accuracies of least squares, global Tukey, and adaptive Tukey were 95.5%, 91.0%, and 91.0%, respectively. This indicates that when all channels have similar quality, a single global threshold remains adequate. By contrast, under heterogeneous channel noise, the adaptive formulation generally provides better robustness. In the mildly heterogeneous case with an additional 90° bias on the noisiest channel, the identification accuracy increased from 46.0% for least squares and 65.5% for global Tukey to 73.5% for adaptive Tukey. In the one-bad-channel case without additional bias, the accuracy further improved from 74.5% and 83.5% to 94.5%.
The difference becomes more evident under severe heterogeneity. In the strong-heterogeneity case with an additional 90° bias on the noisiest channel, the identification accuracy was 48.5% for least squares, 25.0% for global Tukey, and 61.5% for adaptive Tukey. This result shows that the global-threshold formulation may become unreliable when channel quality differs substantially and the worst channel is additionally corrupted, whereas the adaptive formulation remains more stable. Overall, these results indicate that a single global threshold is robust but not always optimal in multi-sensor BTT scenarios with unequal channel quality. A channel-adaptive threshold is particularly beneficial when one or more channels are significantly noisier than the others.
As a result, the role of Tukey weighting in the proposed framework can be interpreted more precisely. Its main benefit is not only to suppress isolated outliers, but also to improve robustness when multi-channel residuals are affected by heterogeneous noise. The added experiment therefore strengthens the practical relevance of the robust aggregation stage in
Section 3.3 and clarifies that the single-threshold formulation should be regarded as a robust baseline rather than a universally optimal choice.
5.2.3. Contribution of Bayesian Prior
We evaluated the impact of the prior distribution p(m) on identification accuracy. The prior was set as a Gaussian centered at
with
.
Figure 11 shows the prior shape and the percentage of times the correct order (
) was ranked first with and without the prior in 50 Monte Carlo trials. The near-identical cost values indicate that the baseline algorithm is already highly optimized under the tested conditions. The slight improvement in discriminability (≈1%) demonstrates the potential of the Bayesian prior to enhance robustness in more adverse scenarios.
This limited improvement indicates that the selected prior parameter mainly serves as a mild regularization term rather than a dominant source of performance gain.
5.2.4. Efficiency of Multi-Stage Search
The computational cost of the ADCAE module has been reported in
Section 5.1.2; here we focus on the efficiency gain brought by the multi-stage search strategy. We compared the computational time and accuracy of the proposed multi-stage search against exhaustive search (i.e., evaluating all integer bins k and all sub-bin offsets from −0.5 to 0.5 in steps of 0.1).
Figure 12 shows the average runtime over 50 trials. The multi-stage search took 1.046 s on average, while exhaustive search required 2.035 s—a speedup factor of about 1.95 (i.e., almost 50% reduction). More importantly, the frequency estimation errors were identical (0.01023 Hz) for both methods, confirming that the coarse-to-fine strategy does not compromise accuracy, as summarized in
Table 5.
5.2.5. Overall Identification Results
Applying the complete multi-stage robust Bayesian method to the experimental data (original, non-stressed) yielded the asynchronous vibration frequencies for blades S1–S4 at 1400 rpm, listed in
Table 6. The identified orders were all m = 5. The maximum error relative to the natural frequencies obtained from static tests is 7.84% (blade S4).
Table 7 shows the identification results at three different rotational speeds (1200, 1400, 1600 rpm) for blade S1. The identified frequencies increase with speed, following the expected trend of the first-order bending mode, and the combined error metric J remains stable, demonstrating robustness to speed variations.
The variation of the total error J with the candidate order m is shown in
Figure 13, where the minimum value corresponds to the identified order.
5.3. Discussion
The present results suggest that the different modules do not contribute equally, nor are they all equally necessary in every scenario. The recursive speed estimator is important when no reliable OPR reference is available. The adaptive window and sub-bin interpolation mainly improve spectral quality and frequency resolution under noisy or non-ideal sampling conditions. Tukey weighting becomes particularly important when the multi-channel phase residuals are affected by either abnormal channels or heterogeneous channel-noise levels. The Bayesian prior provides a relatively modest gain in the current experiments, but it improves physical plausibility and can help stabilize order selection when the cost landscape becomes ambiguous. The multi-stage search mainly contributes to computational efficiency rather than identification accuracy. Therefore, the framework should be interpreted as a modular solution for challenging operating conditions, not as a claim that all components are indispensable in every application.
The experimental results validate each component of the proposed method. The recursive digital algorithm provides accurate rotational speed estimates (error < 0.1%) without an OPR probe, enabling precise vibration displacement calculation. The dynamic window network significantly improves spectral quality by suppressing noise and sidelobes, which directly benefits the subsequent phase extraction. Sub-bin interpolation effectively overcomes the FFT resolution limit and reduces the frequency estimation error by about 16% on average in the reported stress-dataset trials. The robust Tukey-based aggregation improves resistance not only to isolated outlier sensors, but also to unequal channel quality across probes. The additional heterogeneous-noise study shows that a channel-adaptive threshold can provide higher identification accuracy than a single global threshold when one or more channels are substantially noisier than the others. The Bayesian prior increases the success rate of correct order identification under adverse conditions, leveraging prior knowledge about the blade count. The multi-stage search cuts computational time by nearly 50% while preserving accuracy, making the method suitable for real-time or near-real-time applications. The overall frequency identification error is within 7.84% for all blades, which is acceptable for engineering monitoring purposes. The method also performs consistently across different rotational speeds, indicating its potential for variable-speed operation.
At the same time, the results also indicate that the adaptive formulation is not uniformly superior in every case; under nearly homogeneous channel noise, the global-threshold and adaptive versions perform similarly. Therefore, the adaptive formulation should be interpreted as a more suitable choice for heterogeneous multi-sensor scenarios rather than as a universally necessary replacement.
Nevertheless, certain limitations remain. Under extreme negative-SNR conditions, the aliased vibration component may be deeply buried in noise, which would degrade both the difference-frequency estimation and the subsequent order identification. In such cases, the proposed method is still expected to be more stable than conventional fixed-window and direct phase-comparison approaches because the adaptive window, robust multi-channel aggregation, and Bayesian regularization provide complementary noise tolerance. However, once the useful vibration component remains consistently below the noise floor across most channels, correct identification can no longer be reliably guaranteed. In that regime, improved performance would require longer observation windows, more reliable sensor channels, stronger prior constraints, or additional denoising and order-tracking strategies.
The method assumes a constant speed over the analysis window; rapid speed transients may degrade performance. Future work will extend the framework to handle non-stationary speeds by incorporating order tracking and adaptive window segmentation. Additionally, the dynamic window network was trained on a specific dataset; its generalization to different blade geometries or sensor types requires further investigation. The influence of blade twisting under centrifugal load and non-zero tip clearance also merits study.