The proposed methodological framework consists of three main components: (i) data collection and preprocessing; (ii) mathematical modeling and health index (HI) development; and (iii) the design and training of the AI iterative prediction framework (XGBoost + NARX).
2.2. Mathematical Modeling of HI (Definitions)
2.2.1. Survival Analysis Definitions
In reliability theory and survival analysis, the failure time of a component is represented as a random variable T. Depending on its distribution, three key functions are defined to characterize system lifetime.
This shows the probability that a component will last beyond time t. A higher S(t) indicates a greater likelihood that it is still functioning.
Here, f(t) is the probability density function of T. The hazard function indicates the instantaneous failure risk at time t, given survival up to that point. A high h(t) suggests the component is very likely to fail soon after time t.
- 3.
Cumulative Hazard Function
This measures the overall risk of failure from time 0 to
t. Importantly, it directly connects to survival probability.
which shows that the survival probability decreases exponentially as the total hazard rises.
This study formulates the data handling of UAVs [
1] using functions that serve as the mathematical basis connecting the health index (HI) and Remaining Useful Life (RUL), thereby establishing a strong link between AI-generated features and reliability metrics.
2.2.2. Definition and Derivation of the Health Index (HI)
With the rise in UAVs (and robotics in general), there are new tools for HI solution [
1]. Data-driven methods with high fault recognition capabilities depend on two conditions: similar data sample distribution and complete data labeling [
2].
We define a single, interpretable, and bounded indicator HI, with HI in uppercase. The shaded grid t, belonging to [0,1], is used to represent the motor’s health state at time t, satisfying the following:
- 1.
Boundaries and Directionality: (close to 1 for a new system); monotonically decreasing with degradation; functional failure is declared when reaching a threshold .
- 2.
Linkage to Reliability: Establishes a monotonic relationship with survival/hazard functions, enabling Remaining Useful Life (RUL) and availability inference RUL prediction method [
1].
- 3.
Robustness to Noise: Estimates should stay consistent despite measurement variations and differences between devices.
Let
denote the feature vector calculated within the window
(e.g., efficiency
, vibration fundamental/harmonic energy
, current variance
, temperature rise
). The objective is to maximize the total sum of rewards obtained by all UAVs [
2].
To ensure consistency in direction (“larger value = more degraded”), each dimension is monotonically transformed via
:
where
, 8
j represent baseline mean/scale (or quantile-based). This guarantees that a larger
implies “worse condition.” This approach provides valuable insights into the current state [
3].
Robust statistics (median–MAD) or quantile normalization based on healthy samples can minimize the impact of outliers. Feature observation and direction alignment have emerged as promising solutions for enhancing efficiency and sustainability [
3].
The aligned vector is aggregated into a degradation score . Common mathematically sound approaches include
C1. Weighted Linear Combination (supervised/interpretable)
Weights
can be determined through (i) supervised optimization using RUL/failure labels (such as convex optimization and LASSO), (ii) unsupervised methods (PCA/PLS first component with non-negative loadings), or (iii) normalized SHAP/Permutation importance. Advancements in UAV technology have resulted in a diverse range of models [
3].
C2. Distance-based (robust to correlation)
where
is the covariance matrix of the healthy period (i.e., Mahalanobis distance). This naturally measures deviation from the “healthy manifold.”
C3. Reconstruction Error (data-driven score)
Use a manifold model
trained only on healthy data.
2.2.3. Notation and Robust Statistics
Covariance and Distance (for C2).
Let the feature vector
mean
and covariance
, with element
We use the Mahalanobis norm:
In C2 (distance-based), the degradation score is
where (
) are the healthy-period mean and covariance. To enhance robustness,
may be estimated via Huber-type M-estimation or shrinkage:
“Robust correlation” refers to Kendall’s (optionally bicor).
Define the healthy manifold
with projections: linear (PCA)
and nonlinear (autoencoder)
(x) ≈ g(f(x)).
The C3 score is
which is mapped to the health index by a monotone
(logistic/exponential):
and calibrate the mapping using anchor pairs
,
as specified in
Section 2.3, where
denotes projection/reconstruction, healthy data yields minor errors, while degradation increases reconstruction error.
Parameter Calibration: Healthy median maps to ; the engineering failure criterion maps to (e.g., ). This yields or .
First-Passage Time:
linking HI with survival time through a threshold crossing definition, enabling RUL/availability derivation. This technology transforms real-world physical objects [
3].
Discrete Hazard Deep Models: Incorporate the statistics of HI, represented as a shaded square u, with omega belonging to the range of tau at k minus 1.
If is a submartingale (), and is monotone decreasing, then (expected monotonicity).
If is L-Lipschitz and , then (measurement/estimation error is linearly bounded).
Isotonic regression or “non-increasing projection” can be applied to denoise , ensuring monotonicity. The method solves a challenging problem related to UAVs [4, 5].
The online implementation steps are shown in
Figure 1.
Input: Raw sensors (RPM, current/voltage, IMU, temperature); window = 0.25–0.5 s.
Procedure:
Filtering: Median → moving average; RPM smoothed with Kalman filter.
Feature Extraction: Compute to form .
Direction Alignment: .
Aggregation : Select among C1/C2/C3.
Mapping : Calibrate or via engineering/statistics.
Monotone Adjustment (optional): Isotonic/projection ensures a non-increasing trajectory.
Following UAV processing workflows described in [
6], our implementation extracts synchronized RPM, current/voltage, vibration, and temperature features, which are then integrated into the XGBoost–NARX pipeline.
Output: for Cox/AFT/discrete hazard models and XGBoost + NARX usage.
The overarching optimization objective of this problem is to define a bounded, monotone health index
derived from direction-aligned features and robust aggregation [
10]; the thresholded first-passage time links HI to survival, enabling principled integration with Cox/AFT and discrete hazard models [
11]. Equally crucial is quantifying uncertainty. While providing stable inputs for the iterative XGBoost + NARX pipeline, the experiments indicate that the proposed model can achieve accurate RUL estimation results [
12].
2.3. HI Calibration (Anchors and Mapping)
In the previous section, we defined the health index (HI) as a bounded indicator between 0 and 1 that decreases as the motor degrades, using inputs from the current UAV-based analysis [
13]. Safety relates to the physical integrity of the UAV hardware [
14]. While the definition is mathematically sound, to make it practically meaningful, we need one more step: calibration.
Calibration involves selecting a few anchor points that connect the abstract degradation score
z(
t) to real-world conditions. For example, we might set HI close to 1 when the motor is new and healthy, and set HI to 0.2 when it reaches the failure threshold. By aligning these anchors, UAVs usually need to maintain a minimum altitude, ensuring that the HI curve not only decreases steadily but also reflects intuitive and understandable stages of health. The benefits of this model include its higher accuracy and recall in detecting small HI changes. The outlined flight profile describes the different phases of a drone‘s operation. This calibrated HI then provides a solid foundation for downstream tasks such as Remaining Useful Life (RUL) prediction and availability analysis. HI offers a reasonable methodological basis for improving UAV operating standards. The calibration diagram is shown in
Figure 2.
Checks:
HI parameter table is shown in
Table 1.
In practice, for each set of parameter values [
15,
16,
17,
18,
19], we cannot directly determine how the health index (HI) should be mapped, so a calibration step is necessary. Think of it as converting the abstract degradation score
z(
t) back to an engineering scale that makes intuitive sense. Here are some practical tips:
Choose anchors first: Based on engineering rules or statistical criteria, specify the healthy state , the failure state , and the corresponding threshold τ.
Choose a mapping function: The exponential mapping is straightforward and physically intuitive, suitable for quick deployment, while the logistic mapping offers more flexibility in controlling the mid-range slope, which is useful across different motor types. The script saves the data frame containing the interpolated line into a new output shapefile [
20].
Verify with samples: After fitting parameters, test with three to five representative samples to ensure monotonic decrease in HI, and check that anchor errors stay within .
By performing the following, we ensure that the HI is not only mathematically consistent but also aligned with engineering intuition. The results obtained from UAV-based multispectral data are presented for the HI [
21].
2.4. Hybrid AI Framework (XGBoost + NARX)
In practical applications, a single model often cannot fully capture both the nonlinear relationships among features and the temporal dependencies of degradation dynamics. For example, XGBoost excels at extracting nonlinear degradation features from complex sensor data, but it is not sensitive to long-term temporal dependencies. Conversely, NARX is well-suited for modeling temporal dynamics but is limited in handling highly nonlinear feature interactions and real-time RUL predictions [
22].
To address these limitations, this study combines two approaches: XGBoost is used initially for feature extraction, followed by NARX to capture temporal dependencies. The numerous studies mentioned above show a consensus that AI/ML techniques can predict HI with much higher accuracy than older empirical methods [
23]. This hybrid approach merges the “breadth” of nonlinear feature learning with the “depth” of temporal sequence modeling, ensuring both predictive accuracy and mathematical interpretability. Next, we introduce the theoretical foundations of XGBoost and NARX separately, and then outline the proposed hybrid integration strategy. UAV HI prediction research is needed to handle rapid maneuvering and high-frequency sampling with relatively smaller datasets [
24]. This study examines the methods and challenges of mode fusion, emphasizing the importance of its practical application in AI research [
25].
Extreme Gradient Boosting (XGBoost) is a boosting algorithm that iteratively fits regression trees to residuals. Its prediction rule is as follows:
where
denotes the space of CART trees. The loss function is approximated by a second-order Taylor expansion:
where
are first- and second-order gradients, and Ω is a regularization term controlling complexity. This enables XGBoost to effectively extract nonlinear degradation features such as current fluctuations and vibration bands.
The nonlinear autoregressive model with exogenous inputs (NARXs) is a dynamic time-series model defined as the following:
where
is the target output (e.g., predicted RUL);
x(t) are exogenous inputs (motor features, environmental conditions);
F(⋅) is a nonlinear mapping, which is often realized by neural networks.
NARX is powerful in capturing both autoregressive dependencies and external influences.
To leverage both nonlinear feature extraction and dynamic modeling, we design an iterative hybrid framework:
Each feature or label set was split 80/20 to create the training and testing datasets [
30]. This ensures predictions are not only accurate but also theoretically consistent with survival analysis.
The RULEN-X (Remaining Useful Life Estimation with NARX–XGBoost) model flowchart is shown in
Figure 3. UAV technology creates a strong connection between the physical and digital worlds, enabling intelligent monitoring, RUL prediction, and decision-making. This makes RULEN-X a vital component in the development of next-generation smart infrastructure [
31].
This advanced RULEN-X technology can help assess the damage to UAV systems quickly after any catastrophic events.
2.5. Data Collection and Preprocessing
Before applying any AI or statistical models, the most important step is to make sure the dataset is clean, consistent, and representative. While some of the literature explores UAV materials in isolation [
32], we systematically collected high-frequency (20 Hz) sensor signals from five types of UAV motors, with each recorded for 10 min. This setup allows us to capture dynamic characteristics under various operating conditions and ensures enough representativeness for iterative model training [
33]. This includes any system that directly manages the life cycle of an instance of a target system during its development.
To justify the data acquisition settings, we note that the sampling frequency was set at 20 Hz (one record every 0.05 s). This rate is considered adequate for UAV BLDC motor prognostics for two main reasons:
According to the Nyquist–Shannon sampling theorem, the sampling frequency must be at least twice the highest frequency component of interest. In our case, the dominant vibration and electrical signatures of BLDC motors during typical UAV operation occur mainly in the range of 0–8 Hz for degradation-related trends (e.g., temperature drift, torque imbalance, vibration RMS). A 20 Hz sampling rate therefore exceeds the Nyquist requirement (>16 Hz), ensuring that no critical prognostic information is lost.
- 2.
Engineering Standards and Practical Constraints
Previous UAV health monitoring studies have used similar or even lower sampling rates (10–25 Hz) for tracking motor and battery degradation, demonstrating that these rates are sufficient for capturing slow-changing health indicators such as thermal rise, current fluctuations, and vibration envelopes. Although higher sampling rates (100–500 Hz) can capture detailed dynamics, they greatly increase storage and computational costs without significantly improving Remaining Useful Life (RUL) prediction accuracy.
- 3.
Balance Between Fidelity and Efficiency
The 20 Hz setting offers a practical balance: it captures key degradation patterns necessary for RUL estimation while keeping the dataset size manageable for iterative AI training and cross-validation. This balance ensures consistency across devices and supports real-time deployment in embedded UAV systems with limited onboard processing power.
Formal Description
The organization of this survey is illustrated [
34]; the data collection and preprocessing are designed as follows:
Motor Types: MTR101 = bench, MTR202 = env, MTR303 = flight, MTR404 = fault, MTR505 = longitudinal. There are differences between Pulse Width Modulation (PWM)-based inverter-fed induction motors and those with grid-fed supplies, concerning the challenges, complexity, and problems they face in their condition monitoring and fault diagnostics [
35].
Resolution and Duration: Sampling frequency is 20 Hz (one record every 0.05 s), with each motor recorded for 600 s (~12,000 records per motor).
Signal Channels: RPM, current, voltage, vibration (IMU), temperature, and for flight-type motors: GPS latitude, longitude, altitude, and payload.
Health and Lifetime Indicators: The health index (HI) and Remaining Useful Life (RUL) are generated based on temperature and vibration profiles, with stochastic perturbations introduced to simulate experimental uncertainty.
Status Labels: Samples are probabilistically labeled as fail if HI is critically low or under fault scenarios, while a subset of others is marked as censored to simulate right-censoring. This analysis helps us identify the most effective combination of these UAV types within our system [
36].
Splitting Strategy: Grouped K-fold cross-validation is used, where the grouping unit is motor ID, ensuring that no single motor’s data appears in both training and testing folds, thus preventing information leakage. A newly developed UAV classification system will be a key component that directly affects all stakeholders in the sector [
37].
Reports on per-motor statistics are shown in
Table 2.
Table 2 reports per-motor statistics, including average RPM, temperature, vibration (RMS), HI, and RUL. Fault-type motors (MTR404s) show significantly higher average temperature and vibration, along with lower HI and RUL.
Aggregates of averages and standard deviations by motor type are shown in
Table 3.
Table 3 contains the aggregates’ averages and standard deviations by motor type, showing that flight-type motors have the greatest temperature variability, while fault-type motors exhibit the highest vibration levels and the shortest lifespan. UAVs have more powerful onboard processors, greater storage capacity, and longer-range radios [
40].
Representative HI trajectories are shown in
Figure 4.
All motor types show a steady decline, with fault-type motors falling below HI = 0.3 after approximately 200 s, indicating more rapid degradation.
Figure 5 shows the decreasing RUL over time.
Figure 5 shows the decreasing RUL over time, aligned with HI trajectories; fault-type motors reach the failure threshold sooner.
The relationship between HI and RUL is shown in
Figure 6.
Figure 6 shows a strong positive correlation between HI and RUL (Pearson’s r > 0.85). Motor types are clearly separated, with fault-type motors clustered in the low-HI and low-RUL area, confirming HI as a reliable indicator for lifespan. The results show that the displacements extracted from the proposed method align well [
13].
The comparison of temperature distributions is shown in
Figure 7.
Figure 7 compares temperature distributions across different motor types, indicating that fault and environment types operate at notably higher average temperatures.
Figure 8 presents vibration distributions.
Figure 8 presents vibration distributions, with fault-type motors showing more outliers and greater variability, which is consistent with abnormal operating conditions [
41]. We use advanced hybrid optimization algorithms to optimize control parameters for UAV operations.