1. Introduction
The rapid growth of electric mobility has intensified the need for reliable battery-health monitoring and predictive-maintenance strategies.
Figure 1a illustrates the continuous increase in global electric vehicle adoption, where annual EV sales increased from nearly 14 million units in 2023 to approximately 17 million units in 2024. The figure also highlights the increasing contribution of EVs to total global vehicle sales, reflecting the accelerating transition toward electrified transportation systems. This sustained expansion directly increases the operational dependence on high-performance lithium-ion battery systems. Correspondingly,
Figure 1b presents the projected growth in global electric vehicle battery demand. The demand reached nearly 1 TWh in 2024 and is expected to surpass 3 TWh by 2030 under current policy and market development trends. The figure clearly demonstrates that battery usage is expanding at a much faster rate than conventional energy-storage growth, thereby increasing concerns related to battery degradation, safety, lifecycle reliability, and maintenance costs. Consequently, accurate State-of-Health (
SoH) estimation and degradation prediction have become critical requirements for ensuring safe, economical, and reliable operation of modern electric vehicle fleets. The trends illustrated in
Figure 1a,b further emphasize the necessity for advanced battery-health estimation and predictive diagnostic techniques. Among the different battery indicators, State of Health (
SoH) is regarded as one of the most informative metrics for assessing aging, as it reflects capacity reduction, increased resistance, and a gradual decline in performance relative to the original condition. Past assessments have indicated that estimating the State of Health and Remaining Useful Life encounters challenges stemming from nonlinear aging processes, variability among cells, reliance on operating temperature, impacts of current rates, and the differences between laboratory cycling and real-world electric vehicle duty cycles [
1,
2,
3]. Additional studies emphasized the importance of efficient battery-health management, integrating data-driven learning with robust temporal features, given that field data are often noisy, irregular, and significantly affected by user behavior and environmental stress [
4,
5,
6]. The findings presented here are highly pertinent to electric vehicle applications, as similar battery chemistries can experience notably different degradation processes influenced by diverse driving, charging, and thermal conditions.
Recent developments in machine learning have significantly enhanced the assessment of battery health and the prediction of their performance. Recent review articles emphasize a clear transition from conventional model-based estimation to hybrid and data-driven methodologies. This encompasses methods like ensemble learning, deep neural networks, transfer learning, and predictions that account for uncertainty [
5,
6,
7,
8,
9]. At the same time, considerable studies have shown that deep learning methods can evaluate battery health across various cell types without requiring extensive labeling of targets. Furthermore, information gathered from actual electric vehicle usage can improve health-estimation models, resulting in accuracy that is truly advantageous. The literature distinctly shows that accurate short-term tracking, degradation notifications, and long-term predictions pose different levels of difficulty. Short-term State-of-Health trajectories often show significant persistence, and recognizing degradation during stressful events requires models that can detect subtle but operationally critical changes in degradation dynamics.
While the estimation of battery health has improved considerably, notable practical gaps remain. A considerable portion of the literature addresses short-term State-of-Health regression, long-term lifetime forecasting, and anomaly detection as distinct tasks. Maintenance choices for electric vehicles rarely operate in isolation. Fleet operators and battery management systems require a comprehensive strategy for intelligence: accurate monitoring of current health, notifications for medium-term deterioration, and swift identification of accelerated aging patterns in stressful conditions [
10,
11]. The 7-day degradation warning is crucial for operations, as it offers a practical window for carrying out preventive maintenance, redistributing loads, adjusting thermal management, or altering charging strategies. Unlike long-term lifetime forecasting, which often involves unpredictability and the risk of extrapolation errors, a one-week warning period offers a practical balance between predictability and a timeframe that is beneficial for taking action. Furthermore, identifying the acceleration of degradation is essential for understanding nonlinear aging patterns that arise from elevated temperatures, higher current levels, or frequent deep cycling conditions commonly encountered in field operations, which are often not sufficiently represented in laboratory data. Recent studies often emphasize the value of point-wise estimation accuracy, while they frequently neglect the importance of early warning capabilities that prioritize recall. When considering safety and reliability, failing to recognize a rapid degradation event can lead to more severe repercussions than simply issuing a precautionary alert. Therefore, it is crucial to concentrate on modeling that takes acceleration into account and optimizing thresholds that are sensitive to recall for successful implementation. The primary objective of this research is to develop a unified predictive-maintenance framework capable of simultaneously addressing three complementary battery-health intelligence tasks: short-horizon
SoH forecasting, mid-term degradation warning, and accelerated aging risk detection. The specific novelty of this work is the development of a temporally consistent predictive-maintenance framework in which short-horizon
SoH forecasting, seven-day degradation warning, and acceleration-sensitive risk detection are jointly organized as connected maintenance-intelligence tasks rather than as independent prediction problems. In contrast to conventional ensemble-based or single-task battery-health studies, the proposed framework combines validation-weighted ensemble forecasting, training-only feature selection, rolling-origin validation, and recall-oriented warning calibration to support deployment-oriented battery monitoring. Thus, the contribution is centered on converting battery-health estimation into an integrated early warning and risk-screening workflow for electric vehicle battery maintenance.
2. Materials and Methods
The current research has been organized into four methodological streams: a review-based consolidation of State-of-Health/Remaining Useful Life estimation techniques, feature-engineered machine learning models, deep temporal architectures for partial-charge or partial-load data, and practical frameworks for electric vehicles to improve transferability beyond laboratory cycling. Since 2021, there has been an increasing emphasis on temporal convolution, recurrent learning, transfer learning, interpretable hybrid models, and estimation informed by domain knowledge. Furthermore, most of the research has mostly focused on point-wise
SoH estimates, whereas subjects like degradation alerts, acceleration-aware risk screening, and action-oriented maintenance lead times have garnered relatively little systematic examination. This discrepancy provided potential for frameworks that integrate short-horizon tracking, mid-horizon warning, and stress-sensitive risk identification as a unified predictive-maintenance issue rather than treating them as distinct estimating jobs. The comprehensive literature evaluation of the ongoing research initiatives is delineated in
Table 1 accordingly.
The comparative analysis presented in
Table 1 clearly demonstrates the progressive transition of battery-health research from conventional State-of-Health estimation toward more operationally adaptive prognostic frameworks. Earlier studies primarily concentrated on improving prediction accuracy through deep learning, temporal convolution, transfer learning, and hybrid machine learning architectures. However, most reported methodologies addressed forecasting, degradation analysis, and anomaly detection as isolated tasks. It was further observed that limited attention has been given to integrated maintenance-oriented frameworks capable of simultaneously supporting short-term health monitoring, actionable warning generation, and acceleration-sensitive degradation screening. The summarized comparison therefore establishes the necessity for a unified predictive-maintenance architecture that can transform battery-health estimation into practical maintenance decision support for electric vehicle applications. The literature has gradually transitioned from solely point-estimation frameworks to more transferable and operationally informed battery-health analytics. Initial consolidation studies primarily focused on outlining the methodological framework for State-of-Health (
SoH) and Remaining Useful Life (RUL) estimation. These studies emphasized machine learning, neural network-based battery management, and the increasing demand for health indicators pertinent to electric vehicles [
12,
13,
14]. Subsequently, greater focus was directed towards temporal deep learning models, such as temporal convolutional networks, graph-enhanced recurrent architectures, enhanced LSTM frameworks, and hybrid physics-assisted estimators [
15,
16,
17,
18,
19,
20,
21]. Recent contributions have shown a growing focus on transferability, interpretability, and the practical deployment of electric vehicles. This is evident in advancements such as transfer-stacking ensembles, State-of-Health estimation based on field data, studies on feature optimization, and interpretable hybrid models [
22,
23,
24,
25,
26]. Nonetheless, the scope of research examined has largely focused on the accuracy of State-of-Health (
SoH) estimation, while integrated approaches for short-term health monitoring, degradation alerts, and risk detection that accounts for acceleration have been explored only to a limited extent. Although machine learning-based predictive maintenance for EV batteries has received considerable attention, most existing studies remain concentrated on individual tasks such as
SoH estimation, RUL prediction, anomaly detection, or feature-based health modeling. These studies have improved prediction accuracy, but they do not fully address the operational requirement of converting battery-health data into maintenance decisions with defined warning horizons and acceleration-sensitive risk information. In practical EV battery management, maintenance decisions require three connected capabilities: reliable short-term health tracking, early identification of degradation within an actionable lead time, and rapid detection of accelerated aging under stress conditions. Treating these tasks independently may limit decision support because accurate
SoH prediction alone does not necessarily indicate when intervention is required or whether degradation is entering a risky accelerated phase. Therefore, the problem addressed in this study is the lack of an integrated, temporally consistent predictive-maintenance framework that jointly links short-horizon
SoH forecasting, 7-day degradation warning, and acceleration risk detection for EV battery systems. Accordingly, the contribution of this work is not the isolated use of machine learning for battery-health prediction but the integration of forecasting, warning, and acceleration-aware risk screening into a single time-ordered maintenance decision framework. Based on the identified research gap, this work proposes a unified predictive-maintenance framework designed to improve operational battery intelligence through integrated forecasting and warning mechanisms. The principal research contributions are outlined in the following section.
A cohesive machine learning framework has been created that concurrently executes short-term SoH forecasting, provides 7-day degradation alerts, and identifies risk with consideration for acceleration. This cohesive framework connects ongoing health monitoring with anticipatory maintenance guidance in electric vehicle battery systems.
A rolling-origin validation strategy has been adopted to maintain temporal causality and avoid data leakage, thereby ensuring realistic deployment. A blending mechanism that adapts has been integrated to improve generalization under different operational conditions.
A degradation warning and acceleration modeling scheme centered on recall has been developed to facilitate the early detection of unusual aging under thermal and load stress. This design enhances battery prognostics by shifting from mere estimation to actionable predictive maintenance that can be implemented in real operations.
This paper is organized in the following manner:
Section 2 offers an in-depth review of the literature, emphasizing the latest advancements in State-of-Health estimation, degradation modeling, and data-driven battery prognostics. In
Section 3, the problem is mathematically formulated, and the regression and classification goals that support short-horizon forecasting, degradation warning, and acceleration risk modeling are established.
Section 4 outlines the multi-layer machine learning framework that has been implemented. It covers aspects such as temporal feature engineering, ensemble regression, probabilistic classification, recall-oriented threshold optimization, and stress-conditioned validation.
Section 5 presents the simulated results along with a discussion based on various scenarios, while
Section 6 offers concluding remarks and outlines potential directions for future research.
4. Methodology: Implemented Machine Learning Framework
The research methodology was structured to ensure clarity, completeness, and reproducibility by explicitly organizing the predictive-maintenance framework into a coherent sequence of stages. Initially, multivariate battery data were chronologically arranged and preprocessed to remove inconsistencies and preserve temporal integrity. Subsequently, temporal feature engineering was performed using sliding-window techniques, through which statistically and physically meaningful descriptors such as rolling averages, gradients, and stress indicators were derived exclusively from historical observations to prevent information leakage. The problem was then formulated as a multi-objective learning task, where short-horizon State-of-Health forecasting, mid-term degradation warning, and acceleration-based risk detection were jointly addressed. A rolling-origin validation strategy was adopted to maintain causality and to simulate real-world deployment conditions. For forecasting, an ensemble regression model combining multiple tree-based learners was employed, with blending weights determined from validation performance to enhance generalization. Degradation warning and acceleration risk detection were modeled as probabilistic classification tasks, where thresholds were calibrated with emphasis on recall to ensure early identification of critical events. Furthermore, stress-conditioned evaluation was conducted to assess robustness under high-temperature and high-load scenarios. Through this systematically organized methodology, the framework was designed to transform battery-health data into actionable maintenance intelligence, thereby addressing the reviewer’s concern regarding methodological clarity and completeness.
4.1. Data Preprocessing and Temporal Feature Engineering
The initial step involves converting the raw multivariate time-series data into organized temporal formats to effectively capture short-term dynamics and patterns of operational stress. The time-indexed feature space is depicted in Equation (14), while the application of a sliding window of length
w to create temporal matrices is illustrated in Equation (15):
For each window, various derived descriptors are calculated, such as rolling mean and variance, first-order gradients, temperature–current interaction terms, load magnitude statistics, and local degradation slope indicators. These embedding transformations convert immediate measurements into contextually relevant inputs that can effectively capture nonlinear changes in health over time. All operations for feature extraction are conducted without referring to future observations, thus maintaining the integrity of the chronological order. To avoid any risk of information leakage, feature selection is performed solely within the training subset of each validation fold.
denotes the training partition of fold
. Tree-based impurity measures are utilized to calculate feature importance scores. The identical transformation is subsequently implemented on the related validation partition. This guarantees that the validation data stay statistically unobserved throughout the feature ranking process, as mathematically represented in Equation (16).
Battery-health data are inherently temporal; therefore, conventional random cross-validation is unsuitable. A rolling-origin scheme is implemented in Equation (17).
Every fold extends the training period into the future while assessing performance on later, unobserved data. This framework simulates actual deployment scenarios in which future data are not accessible during the training phase. Chronological ordering is rigorously maintained, preventing any look-ahead bias.
4.2. Ensemble Regression for Short-Horizon SoH Forecasting
Short-horizon
SoH forecasting is achieved using a convex ensemble of three nonlinear regressors: Histogram-Based Gradient Boosting Regressor (HGBR), Random Forest Regressor (RF), and Extremely Randomized Trees Regressor (ET) [
22,
26].
The final blended prediction is defined in Equation (19).
The blending weights
are optimized based on validation
R2 performance:
In Equation (18), the short-horizon State-of-Health forecast generated by the regression learner is defined as a nonlinear mapping from the temporally embedded feature matrix to the predicted future health value . Here, denotes the present time index, represents the prediction horizon, and encapsulates the internal parameters of the corresponding regression model, including tree structures, splitting rules, and boosting coefficients, depending on the learner type. This formulation establishes that multiple heterogeneous regressors independently approximate battery-health evolution using identical historical input representations while maintaining model diversity. Equation (19) then consolidates these independent predictions into a convex ensemble output, where the final estimate is obtained as a weighted linear combination of the individual forecasts. The blending weights are required to be non-negative and normalized, ensuring that their total sums to one. This condition of convexity guarantees numerical stability and stops any individual learner from excessively influencing the final prediction. Equation (20) provides additional details indicating that the blending weights are obtained from validation-based coefficients of determination . These coefficients measure the extent to which each learner accounts for the variance in the observed State of Health during out-of-sample validation. By standardizing these validation scores, a greater emphasis is consistently placed on models that exhibit enhanced generalization ability. This combination effectively lowers model variance, reduces the risk of overfitting, and improves generalization across various operating conditions.
4.3. Seven-Day Degradation Warning Classifier
The degradation-warning mechanism is subsequently formalized through Equation (6), where cumulative health loss over a predefined mid-horizon interval is computed as the difference between present and future State-of-Health values. The parameter corresponds to the seven-day interval expressed in discrete time steps. Equation (7) transforms this continuous degradation magnitude into a binary event indicator by comparing it against a predefined operational threshold . This threshold represents a clinically or operationally meaningful decline in capacity beyond which maintenance intervention is considered necessary. The probabilistic classifier defined in Equation (8) produces a degradation-event likelihood based on the same temporally embedded feature matrix. The learning objective associated with this classifier is expressed in Equation (9) through the cross-entropy loss function, which penalizes divergence between true binary labels and predicted probabilities. This probabilistic formulation enables flexible threshold adjustment and supports recall-sensitive decision strategies. Mid-horizon degradation is modeled as a probabilistic classification task.
4.4. Degradation Acceleration Risk Modeling
Equations (10) and (11) present the analysis of degradation based on acceleration. The first-order degradation rate is calculated by taking the difference between successive State-of-Health Values, which indicates the immediate rate of health decline. The second-order difference, referred to as acceleration, measures the variations in this degradation slope. Positive acceleration values suggest an increase in the severity of aging, frequently linked to stress-related factors like higher temperatures or increased current demands. Equation (12) transforms the calculated acceleration magnitude into a binary risk label by juxtaposing it with a specified acceleration threshold τ_risk. The classifier outlined in Equation (13) assesses the likelihood of rapid degradation onset, facilitating the early identification of nonlinear aging transitions that might not be apparent from absolute health values by themselves. Risk detection based on acceleration emphasizes the identification of nonlinear slope amplification.
4.5. Recall-Oriented Threshold Optimization
The classification outputs are determined by the decision rule outlined in Equation (21), which involves comparing predicted probabilities against an adjustable threshold parameter
. A binary event is designated when the anticipated probability surpasses this threshold. Equation (22) establishes the threshold-selection strategy by enhancing the F1-score while adhering to a minimum recall requirement. The F1-score serves as the harmonic means of precision and recall, effectively balancing detection accuracy with sensitivity, while the recall constraint guarantees that early degradation events are not missed. Given that early detection takes precedence over conservative precision, probability thresholds are fine-tuned to enhance the F1-score while adhering to recall limitations. The rule for decision making is outlined in Equation (21).
The threshold
is selected via validation search to satisfy with respect to Equation (22):
This guarantees a responsive approach to both degradation and acceleration occurrences. The predictive-maintenance framework is assessed under clearly defined stress conditions to guarantee reliability at the deployment level. The following equations broaden the modeling framework to include stress-conditioned validation. The operational state vector includes temperature and current magnitude, while stress regimes are established by comparing these variables against specific thresholds. Metrics for conditional evaluation are subsequently calculated independently for both stress and nominal subsets. A robust stability index is defined as the ratio of stress-regime performance to nominal-regime performance, effectively measuring the resilience of the predictive framework in challenging operating conditions.
4.5.1. Stress Regime Definition
The operational state at time
be defined as:
where
denotes battery temperature, and
denotes current magnitude. Two stress thresholds are defined
: a stress indicator variable is constructed as:
Thus, represents the stress regime, and represents the nominal regime.
4.5.2. Conditional Model Evaluation
Regression robustness is evaluated conditionally:
Similarly, classification metrics are computed:
4.5.3. Robustness Stability Index
To quantify performance degradation under stress, a stability index is defined:
where
represents any evaluation metric (
R2, AUC, F1). Interpretation:
→ stress resilience, while
→ performance degradation. The modeling sequence then transitions to the uncertainty-aware neural architecture. The multi-layer perceptron (MLP) mapping is defined through successive affine transformations and nonlinear activation functions applied to the multi-fidelity input vector. Each hidden layer is characterized by a weight matrix and bias vector, while nonlinear activation functions such as rectified linear units introduce representational flexibility. Unlike deterministic regression models, the heteroscedastic formulation simultaneously predicts the mean state of charge
and the input-dependent variance
, enabling distributional output rather than a single-point estimate. The negative log-likelihood (NLL) loss function is employed to jointly penalize inaccurate mean predictions and underestimated variance. This formulation allows the model to express confidence in its predictions, thereby supporting safety-critical decision making. The fast-charging optimization problem is subsequently formulated as a constrained control objective. The charging current
serves as the decision variable, and the cost function penalizes deviation between predicted state of charge and a desired reference trajectory while incorporating a regularization term to prevent excessively aggressive charging. Battery dynamics are represented through a first-order differential equation derived from Coulombic balance, where the rate of change of the state of charge depends on applied current, Coulombic efficiency, and nominal capacity. Operational constraints are imposed to ensure that the state of charge remains within physical bounds, current magnitude does not exceed hardware limits, and temperature remains below a maximum safety threshold. These constraints guarantee safe and feasible charging trajectories. The final group of equations defines evaluation metrics used for performance validation. The coefficient of determination
measures explained variance relative to total variance. Mean squared error (MSE) quantifies the average squared deviation between predicted and actual values. Prediction Interval Coverage Probability (PICP) represents the proportion of true values contained within predicted confidence intervals, while Mean Prediction Interval Width (MPIW) quantifies the average span of those intervals. Together, these metrics provide complementary insight into accuracy, reliability, and uncertainty calibration. Equations (21)–(28) transform the predictive-maintenance framework from a simple regression architecture into a comprehensive, uncertainty-aware, stress-validated, and control-integrated battery intelligence system. The overall framework of the proposed research problem is depicted in
Figure 2, respectively.
The suggested framework is following chronological preprocessing, short-term State-of-Health forecasting, a 7-day degradation alert, and acceleration risk identification, which are collectively developed utilizing crafted temporal features. A rolling-origin validation approach is utilized to maintain causality, and feature selection along with model calibration is conducted solely during training to prevent information leakage. Short-term forecasting is achieved using validation-weighted ensemble regression, while warning and risk decisions are based on recall-constrained probabilistic classifiers. The final performance is assessed in both nominal and stress-conditioned operating conditions. The corresponding implemented approach pseudocode is tabulated in Algorithm 1, respectively.
| Algorithm 1: Proposed Multi-Layer Predictive-Maintenance Framework for EV Batteries |
Input: Battery dataset = {x_t, SoH_t}, forecast horizon h, degradation horizon Δ, training window W, step size S, number of folds K, feature count p, thresholds τ_7d and τ_risk Output: Short-horizon SoH forecast, degradation warning, acceleration risk prediction, performance metrics 1: Preprocess and sort samples chronologically 2: Compute derived targets: D_t = SoH_t − SoH_{t + Δ} R_t = SoH_{t − 1} − SoH_t A_t = R_t − R_{t − 1} 3: Define labels: y_t^(7d) = 1 if D_t ≥ τ_7d, else 0 y_t^(risk) = 1 if A_t ≥ τ_risk, else 0 4: Construct temporal feature matrix using sliding-window statistics, lag terms, gradients, and stress indicators 5: for k = 1 to K do 6: Split data using rolling-origin validation 7: Fit scaler on training fold only 8: Select top-p features using training fold only 9: Train HGBR, RF, and Extra Trees regressors for SoH forecasting 10: Estimate validation R2 for each regressor 11: Compute normalized ensemble weights from validation R2 12: Generate final SoH forecast by weighted model blending 13: Train gradient-boosted classifier for 7-day degradation warning 14: Calibrate warning threshold on validation data under recall constraint 15: Predict degradation warning labels on test fold 16: Train gradient-boosted classifier for acceleration risk detection 17: Calibrate risk threshold on validation data under recall constraint 18: Predict acceleration risk labels on test fold 19: Evaluate regression and classification metrics 20: Compute stress-conditioned performance under high-temperature and high-load regimes 21: end for 22: Aggregate fold-wise results and report mean performance Return final forecasts, warning labels, risk labels, and robustness statistics |
5. Simulated Results and Discussion
The dataset employed in this study comprises multivariate time-series measurements collected from an electric vehicle battery system under realistic operating conditions. The data were originally recorded at 15 min intervals and, where required for computational efficiency, were downsampled to 30 min intervals without altering temporal continuity. The complete dataset contains approximately 175,000 observations prior to preprocessing and between 85,000 and 90,000 samples after downsampling and data cleaning procedures. Each record includes a timestamp and key battery state variables, namely State of Health (SoH), state of charge (SoC), battery temperature, terminal voltage, current magnitude, and cumulative charge cycle count. The SoH values span approximately 0.40 to 1.00, thereby representing the progressive aging trajectory of the battery from near beginning-of-life conditions to advanced degradation stages. The chronological structure of the dataset facilitates simultaneous modeling of short-term electrochemical dynamics and long-term degradation behavior. Operational variability reflected in the dataset includes fluctuations in load demand, repeated charging–discharging events, and temperature variations. These factors are known to influence capacity fade and aging acceleration. To support predictive-maintenance analysis, derived features are constructed exclusively from historical observations. These include lagged state variables, rolling statistical measures computed over hourly and daily windows, local degradation gradients, and stress indicators based on temperature and current thresholds. Such feature construction ensures that no future information is incorporated during model training. The modeling targets are formulated to address multiple maintenance objectives. These include short-horizon SoH prediction, mid- to long-horizon degradation estimation, and a binary acceleration-based risk indicator derived from temporal changes in degradation rate. The dataset therefore supports both regression-based health forecasting and classification-based early warning detection.
Scenario 1. Short-Horizon Health Tracking: The time-series comparison shown in Figure 3
demonstrates that the ensemble model successfully captures the fine-grained SoH dynamics over the final validation fold. The predicted trajectory closely overlaps the true SoH curve, including minor oscillations and transient variations. Although the persistence baseline also performs strongly due to high short-term autocorrelation in SoH, the ensemble maintains comparable precision while preserving structural consistency under temporal variation. The quantitative evaluation confirms this visual observation, with an
of 0.9254 and an RMSE of 0.0033. The very small absolute error indicates that short-term degradation behavior remains smooth and highly predictable. In practical deployment, this layer serves as a continuous health tracking mechanism, ensuring reliable short-horizon state awareness. As seen in
Figure 4 the data points are closely grouped along the ideal line, indicating a robust linear correlation and negligible systematic bias. The lack of curvature or dispersion widening indicates that prediction stability is preserved throughout the entire
SoH operating range.
The residual behavior illustrated in
Figure 5 demonstrates that prediction errors are centered around zero, exhibiting no noticeable drift. No signs of heteroscedasticity or temporal clustering are apparent. The limited residual variation indicates that the regression model is statistically stable and does not suffer from structural overfitting. This layer guarantees that the battery management system has precise near-term insights into health changes, even amidst varying load and temperature conditions. The comparatively stable short-horizon forecasting behavior may be attributed to the strong temporal persistence exhibited by battery degradation trajectories over limited operational intervals. Since short-term electrochemical aging evolves gradually under normal driving and charging behavior, recent historical observations retain considerable predictive relevance for near-future
SoH estimation. Furthermore, the ensemble learning configuration benefits from the complementary nonlinear learning characteristics of tree-based learners, allowing transient fluctuations caused by dynamic loading and thermal variation to be captured without excessive sensitivity to local noise. The rolling-origin validation strategy additionally contributes toward realistic temporal generalization by preventing information leakage between historical and future operating states.
Scenario 2. The second aim is to identify degradation at a mid-horizon level. This layer needs to detect important degradation trends before noticeable health loss occurs, unlike the short-term task. Figure 6 illustrates that the classifier exhibits robust recall performance throughout various operating thresholds, as shown by the precision–recall curve. The system demonstrates a recall rate surpassing 0.82 at the chosen threshold, effectively recognizing most impending degradation events. This behavior is advantageous in maintenance applications, where overlooked events hold greater significance than moderate false alarms. Figure 7 illustrates the ROC curve, which further validates the discrimination capability, showing an AUC of 0.7838. The curve consistently stays well above the diagonal reference, demonstrating a clear distinction between degrading and non-degrading states.
Figure 8 presents the confusion matrix, offering further operational insights. A total of 563 true-positive warnings were accurately recognized, whereas 151 degradation events were overlooked. While there are still false positives (1108), this compromise aligns with a strategy focused on recall. In predictive maintenance, this type of setup is suitable, as early inspections can help reduce the risk of further complications. This layer effectively converts raw
SoH trajectories into practical maintenance insights, enabling scheduling modifications and inspection planning days ahead of time. From an operational perspective, the comparatively high recall achieved by the degradation-warning module is particularly significant because missed degradation events may result in accelerated capacity fade, reduced driving reliability, or unexpected maintenance interruptions in electric vehicle applications. In predictive-maintenance environments, conservative early-warning behavior is often preferable to delayed detection because preventive intervention can be scheduled before severe degradation propagation occurs. The observed classification behavior also suggests that cumulative degradation patterns become more distinguishable when temporal feature embedding captures sustained thermal and loading trends rather than isolated instantaneous measurements alone.
Scenario 3. Degradation acceleration risk. The third objective focuses on the rapid acceleration of degradation, which poses a greater safety concern compared to a gradual decline. The precision–recall curve illustrated in Figure 9 demonstrates a more robust performance profile in comparison to the seven-day warning module. The PR-AUC of 0.7101 indicates a better balance between precision and recall in the presence of class imbalance. A high recall rate of 0.8845 guarantees that instances of accelerated degradation are seldom missed. Figure 9 illustrates the ROC curve, which shows an AUC of 0.8796, indicating a strong capacity for discrimination. Figure 9 illustrates the ROC curve, which shows an AUC of 0.8796, indicating a strong capacity for discrimination. The curve rises sharply near its origin, signifying high true-positive rates while maintaining relatively low false-positive rates. This functionality is crucial for oversight in situations where safety is paramount.
Figure 10 offers further insight into operational effectiveness by illustrating the confusion matrix. The low count of false negatives highlights the effectiveness of the acceleration-based detection method. This module functions as a proactive alert system, aimed at identifying atypical aging trends resulting from thermal stress, load fluctuations, or operational inconsistencies.
The strong acceleration risk detection behavior may be physically associated with the sensitivity of lithium-ion degradation mechanisms to elevated thermal exposure and aggressive load cycling. Under high-temperature and high-current operating conditions, electrochemical side reactions, internal resistance growth, and localized degradation processes may intensify nonlinearly, thereby increasing the degradation acceleration rate rather than only the absolute
SoH decline. The incorporation of temporal degradation gradients and stress-sensitive operational indicators therefore improves the capability of the proposed framework to recognize abrupt degradation transitions before severe long-term health deterioration becomes fully observable. The analysis of feature importance illustrated in
Figure 11 offers clarity regarding the predictive framework. The most influential features include lagged State-of-Health (
SoH) indicators, rolling voltage averages, and short-term temporal windows. The slopes of
SoH indicate that temporal memory and recent degradation gradients play a crucial role in short-horizon forecasting. The inclusion of temperature and current rolling statistics among the highest-ranked features strengthens their physical validity. The aging of batteries is recognized to be affected by thermal stress and the intensity of the load. The observed dominance of temporal degradation history features further indicates that battery aging behavior is strongly path dependent, meaning that recent operating history contributes substantially toward future degradation evolution. In addition, the importance of temperature-related variables supports established electrochemical understanding that elevated thermal exposure accelerates parasitic side reactions and capacity-loss mechanisms within lithium-ion battery systems. Similarly, current-related loading features likely capture stress-induced operational variability associated with repeated charging–discharging intensity and transient power demand behavior. Consequently, the hierarchy of learned features corresponds with the principles of electrochemical degradation instead of being based solely on statistical artefacts. Cyclic time encodings play a significant role, suggesting that operational periodicity affects short-term
SoH behavior. The results compiled in
Table 2 show that the proposed framework consistently performs well in both regression and classification layers while also preserving temporal integrity through rolling-origin validation. For the short-term forecasting task, an
R2 value of 0.9254 was obtained, along with an RMSE of 0.0033 and a MAPE of 0.32%. The data indicate that the one-hour
SoH dynamics were monitored with great accuracy. The similar performance of the persistence baseline shows that short-term
SoH evolution is still highly autocorrelated; nonetheless, the suggested ensemble maintained predictive stability while ensuring structural generalization.
The seven-day degradation warning task yielded an AUC of 0.7838, demonstrating a significant distinction between degradation and non-degradation states. A recall of 0.8205 indicates that most upcoming degradation events were effectively recognized. While the precision stayed at a moderate level, the configuration focused on recall guarantees that essential maintenance signals are not missed, which is beneficial in preventive maintenance environments. The highest performance was noted in the degradation acceleration risk module, achieving an AUC of 0.8796 and a PR-AUC of 0.7101. A recall of 0.8845 suggests that accelerated aging events were identified with considerable sensitivity. The minimal number of false negatives further validate the framework’s appropriateness for safety-critical monitoring applications. The findings suggest that although short-term State-of-Health forecasting provides precise health visibility, the classification layers add significant operational benefits by converting degradation patterns into practical early-warning and risk-detection signals. The combined multi-layer design thus improves both forecasting precision and maintenance insight within a cohesive machine learning system. The obtained forecasting and classification results were further analyzed in comparison with previously reported battery-health prognostic studies. Earlier investigations based on temporal convolutional networks, recurrent neural architectures, transfer learning models, and hybrid machine learning frameworks mainly concentrated on improving State-of-Health estimation accuracy under controlled cycling conditions [
15,
16,
17,
18,
19,
20,
21,
22,
23,
24,
25,
26]. Several studies reported strong regression performance for
SoH estimation; however, most methodologies were designed primarily for single-objective prediction tasks and did not simultaneously incorporate degradation-warning capability or acceleration-sensitive maintenance intelligence. In contrast, the proposed framework integrates short-horizon forecasting, seven-day degradation warning, and acceleration risk detection within a unified temporally consistent architecture. This integration improves operational interpretability by converting battery-health estimation into actionable maintenance-oriented decision support. The strong forecasting accuracy obtained in the present work is consistent with observations reported in deep temporal learning and ensemble learning studies, where recent degradation history and temporal memory were identified as dominant contributors to short-term State-of-Health prediction [
15,
18,
22]. The high predictive stability achieved by the proposed ensemble framework can be attributed to the complementary behavior of Histogram-Based Gradient Boosting, Random Forest, and Extremely Randomized Trees models, which collectively reduced variance while preserving nonlinear learning capability. Unlike several existing approaches that rely heavily on deep neural structures requiring extensive training complexity and large-scale parameter optimization, the adopted ensemble learning configuration maintained high regression accuracy while improving robustness and interpretability under rolling-origin validation. For degradation warning and acceleration risk detection, direct comparison with the existing literature remains limited because most prior studies primarily focused on point-wise State-of-Health estimation or Remaining Useful Life prediction without explicitly modeling operational warning horizons or acceleration-sensitive degradation transitions. The improved recall behavior achieved in the proposed framework indicates that the adopted recall-oriented threshold optimization successfully prioritized early identification of critical degradation events. The acceleration risk module further demonstrated strong discrimination capability under stress-conditioned scenarios, which may be attributed to the incorporation of temporal degradation gradients, temperature–current interaction indicators, and acceleration-aware feature representations. Therefore, the proposed framework extends beyond conventional battery-health estimation by integrating predictive forecasting with operational maintenance intelligence suitable for practical electric vehicle deployment environments. The performance results summarized in
Table 2 indicate that the proposed framework achieved consistently strong behavior across both regression and classification tasks. In the short-horizon forecasting layer, the obtained coefficient of determination and low RMSE confirm that the ensemble structure was capable of accurately tracking near-term State-of-Health variations with minimal prediction deviation. The very small MAPE further demonstrates that the forecasting error remained operationally negligible throughout the validation period. For the seven-day degradation warning task, the achieved recall value indicates that various upcoming degradation events were successfully identified before severe health deterioration occurred. Although moderate false-positive behavior was observed, the adopted recall-oriented configuration is considered appropriate for predictive-maintenance environments where early detection is prioritized over conservative alarm suppression. The acceleration risk detection layer demonstrated the highest classification robustness among all operational modules. The elevated AUC and PR-AUC values confirm strong discrimination capability under class-imbalanced conditions, while the high recall value indicates reliable identification of rapidly degrading operating states. Collectively, the results demonstrate that the proposed multi-layer framework not only improves forecasting precision but also enhances operational maintenance intelligence through early-warning and stress-sensitive risk detection capabilities.