1. Introduction
The growing electrification of transportation and renewable energy systems has intensified the demand for lithium-ion batteries (LIBs) with high reliability and safety [
1]. As essential energy storage devices, LIBs determine the performance and lifespan of electric vehicles (EVs) and energy storage systems (ESSs). However, their complex electrochemical mechanisms and multiple degradation pathways, including solid electrolyte interphase (SEI) growth, lithium plating, and loss of active material, make accurate health assessment highly challenging [
2]. SEI film growth is one of the most critical factors limiting long-term SOH estimation because it represents an irreversible degradation process that continuously consumes cyclable lithium and increases interfacial resistance. Notably, the SEI formation happens over the first couple of cycles and is stable after that [
3]. The initial SEI layer is rapidly formed during the first few charge–discharge cycles via electrolyte decomposition, and once a dense and passivating film is established, its growth rate significantly slows down and maintains relative stability under mild operating conditions. As the SEI layer thickens, charge-transfer kinetics slow down, leading to capacity loss and stronger polarization, which directly alters measurable health indicators such as voltage evolution and current fluctuation patterns. Moreover, SEI growth is highly sensitive to temperature and cycling conditions; periodic rupture–reformation cycles under high temperatures or high rates produce nonlinear and frequency-dependent signal variations that complicate data-driven prediction models [
4]. Because these SEI-induced dynamics manifest in specific time–frequency bands and evolve nonlinearly with aging, ignoring SEI effects often results in inaccurate or unstable long-term SOH predictions. Therefore, incorporating SEI-related degradation characteristics is essential for constructing physically consistent and generalizable SOH models [
5]. In addition to SEI growth, micro-cracking, and other electrochemical aging pathways, the degradation trajectory of lithium-ion batteries is also strongly dependent on the physicochemical characteristics of the electrode materials. For example, cathode nanocomposites utilizing nanoscale tin and carbon nanotubes have demonstrated distinct intercalation behavior and enhanced structural stability, suggesting that material design can play a crucial role in mitigating degradation [
6]. The State of Health (SOH), defined as the ratio of the current capacity to its nominal value, serves as a key indicator of degradation and plays a crucial role in ensuring safety, enabling predictive maintenance, and optimizing lifecycle management [
7,
8].
Traditional empirical and equivalent circuit models (ECMs) provide physically interpretable mechanisms but often exhibit limited accuracy under dynamic operating conditions due to parameter drift and simplified assumptions [
9,
10]. In contrast, data-driven approaches, particularly machine learning (ML) and deep learning (DL), have shown strong potential for learning nonlinear degradation behavior directly from data [
11]. Classical ML models such as support vector regression (SVR), random forest (RF), and Gaussian process regression (GPR) have achieved promising results but depend heavily on handcrafted features and specific datasets [
12,
13]. DL models, including convolutional neural networks (CNNs) and long short-term memory (LSTM) networks, have demonstrated superior capabilities in feature extraction and temporal dependency modeling [
14,
15]. For instance, Liu et al. developed a CNN–LSTM–Attention–FVIM model with meta-heuristic optimization, achieving an MAE below 1% [
16], while Zang et al. proposed a hybrid attention model that improved both accuracy and robustness, achieving an
exceeding 0.98 [
17]. Despite these advancements, purely data-driven models often lack physical interpretability and tend to overfit specific datasets, leading to poor generalization in real-world applications where operational conditions deviate from laboratory environments [
18]. Achieving accurate, interpretable, and generalizable SOH estimation therefore remains an open challenge.
To address this issue, researchers have introduced Physics-Informed Neural Networks (PINNs), which incorporate physical constraints into neural network loss functions [
19]. In the context of battery modeling, this approach ensures that predicted SOH trajectories remain consistent with underlying electrochemical principles. For example, Wang et al. embedded degradation stability equations within PINNs to enhance robustness under varying charge–discharge conditions [
20]. Ma et al. extended this framework with physics-informed macroscopic modeling, demonstrating stable convergence and improved accuracy [
21]. Li et al. further combined time- and frequency–domain features using a Parseval–BiLSTM architecture to achieve reliable predictions under incomplete cycles [
22]. Although effective, most existing PINN-based methods still face two limitations: First is insufficient feature diversity. Second is fixed weighting of physical constraints [
23]. Time-domain or handcrafted features may overlook microscopic degradation dynamics, while static constraint weighting can either over-restrict or under-regularize model learning. Thus, a framework capable of capturing multi-domain degradation patterns and dynamically balancing data–physics interactions is required.
Feature engineering plays a crucial role in robust SOH estimation. Time-domain indicators (e.g., mean voltage and charging time) describe macroscopic degradation behavior, whereas frequency– and wavelet–domain features capture microscopic responses associated with SEI growth and electrode microcracking [
24,
25]. Integrating these domains provides complementary insights and enhances model stability under noise and varying operational conditions. He et al. demonstrated that combining empirical mode decomposition (EMD) with multi-scale features improved early degradation detection [
26]. Wang et al. applied variational mode decomposition (VMD) with attention-based temporal convolutional networks (TCNs) to separate trend and transient signals, achieving higher accuracy [
27]. However, multi-domain fusion substantially increases feature dimensionality and redundancy. Feature selection methods—such as Pearson correlation, maximal information coefficient (MIC), and principal component analysis (PCA)—can effectively retain the most relevant variables [
28,
29].
Attention mechanisms enable neural networks to assign adaptive importance to input features, allowing models to emphasize degradation-sensitive information while suppressing noise [
30]. When combined with multi-domain features, attention modules enhance interpretability and reduce redundancy. In battery diagnostics, Liu et al. reported an attention-augmented SOH estimator demonstrated substantial gains over a GRU baseline: across multiple battery datasets, the MAE and RMSE decreased by 33.5% and 29.0%, respectively [
31]. Integrating attention mechanisms into physics-informed architectures further improves learning efficiency, as adaptive weighting encourages the model to prioritize features most relevant to specific degradation processes while maintaining physical consistency. This hybrid paradigm provides a promising pathway toward interpretable and generalizable SOH estimation frameworks.
Despite considerable progress, most existing models remain dataset-specific and exhibit limited generalization across chemistries and environments [
23]. Multi-domain fusion often treats temporal and spectral information independently, neglecting their intrinsic correlations [
25]. Additionally, fixed-weight physical constraints cannot adapt to evolving degradation stages, which compromises convergence stability [
21]. Few studies have explored the joint integration of attention-based feature weighting and dynamic physics constraints within a unified framework. Existing attention-driven models rarely ensure explicit physical consistency [
27], while conventional PINNs seldom employ attention for feature adaptivity [
20]. This separation limits both predictive accuracy and robustness. Hence, there is an urgent need for a unified approach that combines feature fusion, attention-driven adaptivity, and physics-informed constraints for precise SOH estimation.
In response to these challenges, this study proposes an Attention Mechanism–Multi-Feature Fusion Physics-Informed Neural Network (AM-MFF-PINN) for accurate and physically consistent SOH estimation of lithium-ion batteries. The motivation lies in jointly optimizing three critical aspects of SOH modeling: feature representation, adaptive weighting, and physical interpretability.
The main contributions of this study are summarized as follows:
A novel AM-MFF-PINN model is proposed for accurate and interpretable SOH estimation of lithium-ion batteries.
A multi-domain feature extraction and fusion scheme that captures comprehensive degradation information across time, frequency, and wavelet domains.
A dual-correlation strategy combining the Pearson correlation coefficient and the MIC effectively removes redundant features and enhances robustness against noise.
The model introduces an attention mechanism for adaptive feature weighting and a dynamic loss balancing scheme to maintain consistency between data fitting and physical laws.
Comprehensive cross-dataset validation, demonstrating superior accuracy, robustness, and generalization under diverse operating conditions.
The remainder of this paper is organized as follows.
Section 2 introduces the proposed AM-MFF-PINN framework, including data preprocessing, multi-domain feature extraction, feature selection, and model construction.
Section 3 presents the experimental setup, datasets, evaluation metrics, and comparative results.
Section 4 concludes the study and outlines future research directions.
2. Methodology
The accurate estimation of the SOH for lithium-ion batteries depends on the synergy between high-quality feature inputs and a robust physics-constrained model. The method proposed in this study focuses on three core steps: multi-domain feature extraction, attention mechanism, and enhanced PINN modeling. The approach captures degradation information through multi-dimensional features in the time, frequency, and wavelet packet domains, achieving high-precision SOH estimation via a neural network integrated with physical prior constraints. The following subsections elaborate on the technical details, parameter settings, and physical significance of each step.
2.1. Structural Overview
The proposed SOH estimation framework follows a four-stage process: data preprocessing, multi-domain feature extraction, attention mechanism, and improved PINN modeling. The specific steps are shown in
Figure 1. The core logic is as follows: first, multi-dimensional features covering macroscopic statistics, frequency dynamics, and time–frequency localization are extracted from charging data; redundant features are then removed through correlation analysis. The refined key features are input into a PINN model enhanced with an attention mechanism and residual structure. Finally, the SOH estimation is obtained by jointly optimizing both the data-fitting loss and physical constraint loss.
The generality of the framework lies in its independence from discharge strategies, as it focuses on the late charging stage, which remains consistent across battery types. Moreover, the incorporation of multi-domain features and physical constraints reduces dependence on specific datasets and enhances cross-scenario generalization.
2.2. Dataset Analysis
To encompass various charging–discharging strategies and states of charge, this study employs two publicly available datasets collected from different institutions: the Xi’an Jiaotong University (XJTU) dataset and the Tongji University (TJU) dataset [
20,
32]. Detailed information about these datasets is summarized in
Table 1, while
Figure 2 illustrates the capacity degradation trajectories for all batteries across four batches in the XJTU dataset and three batches in the TJU dataset.
The XJTU dataset consists of nickel–cobalt–manganese (NCM) batteries with a nominal capacity of 2000 mAh and a nominal voltage of 3.6 V. The charging and discharging cut-off voltages are 4.2 V and 2.5 V, respectively. All batteries were cycled to failure under room-temperature conditions. The test protocols include both fixed charging–discharging and random discharging modes at constant current for different cycles. The four battery batches are denoted as XJTU1, XJTU2, XJTU3, and XJTU4.
The TJU dataset includes three types of batteries: first, nickel–cobalt–aluminum (NCA) batteries with a nominal capacity of 3500 mAh and a cut-off voltage of 2.65–4.2 V; second, NCM batteries with a nominal capacity of 3500 mAh and a cut-off voltage of 2.5–4.2 V; third, hybrid NCM + NCA batteries with a nominal capacity of 2500 mAh and a cut-off voltage of 2.5–4.2 V. All batteries were cycled in a temperature-controlled chamber under different temperatures and charging current rates. The charging–discharging temperatures were set to 25 °C, 35 °C, and 45 °C, while the charging current rates ranged from 0.25 C to 1 C. The three corresponding batches are labeled as TJU1, TJU2, and TJU3.
2.3. Feature Engineering
Features serve as essential information carriers for data-driven SOH estimation. Traditional time-domain features mainly reflect the statistical characteristics of the measured signals but fail to capture subtle dynamic behaviors associated with internal battery degradation—such as high-frequency current fluctuations induced by SEI film growth and voltage transients caused by electrode microcracks. To address this limitation, a multi-dimensional feature set covering the time domain, Fast Fourier Transform (FFT) frequency domain, and Wavelet Packet Decomposition (WPD) domain is constructed in this study, followed by key feature selection through correlation analysis.
The sampling frequency and data integrity of the charging segments directly affect the consistency and reliability of feature extraction. Therefore, preprocessing is performed to unify the data format and isolate segments containing valid information. Since the sampling frequencies of the datasets differ, linear interpolation is employed to resample all voltage and current signals to 10 Hz, thereby preventing feature deviations caused by inconsistent sampling rates.
The choice of 10 Hz is justified by two main considerations:
It satisfies the Nyquist Sampling Theorem, as the key frequency components related to battery degradation are concentrated below 5 Hz (e.g., the characteristic frequency of SEI film growth is approximately 1–2.5 Hz), ensuring complete preservation of the relevant spectral content.
It achieves an optimal balance between accuracy and computational efficiency, since excessively high sampling rates would significantly increase data volume and computational cost without contributing to feature fidelity.
The discharging process of batteries varies widely depending on application scenarios—for example, variable-current discharging in electric vehicles versus step discharging in stationary energy storage systems—leading to substantial signal fluctuations. In contrast, the charging process typically follows a Constant Current–Constant Voltage (CC–CV) protocol, where the voltage and current curves at the late charging stage exhibit high regularity and are minimally affected by discharge strategies. Based on this observation, the late charging stage is selected as the target segment for feature extraction. During the constant-current stage, the voltage data in the range of 4.0–4.2 V are selected. During the constant-voltage stage, the current data in the range of 0.5–0.1 A are selected. Regardless of whether the battery was fully discharged beforehand, these voltage and current intervals consistently appear during the full-charge process, ensuring stable feature acquisition.
2.3.1. Fast Fourier Transform
The current signal during battery charging contains abundant degradation-related dynamic information. For instance, the growth of the SEI film induces periodic fluctuations in ion transport resistance, while microcracks in electrode materials trigger high-frequency variations in local current. Such degradation dynamics are difficult to capture using time-domain statistical features but can be effectively transformed into the frequency domain through the FFT, where they can be quantified using indicators such as frequency-band energy distribution and spectral shape [
33]. In this study, a standardized FFT-based feature extraction procedure is developed for current signals. It includes four main steps: data preprocessing, frequency–domain transformation, feature definition, and robustness processing, ensuring both the physical relevance and computational reliability of the extracted features.
Before performing frequency–domain analysis, the original current signal must be detrended to remove the DC component and long-term drift. A linear fitting method based on the least squares algorithm is used for trend removal:
The linear trend model is expressed as:
where
and
denote the slope and intercept of the fitted trend line, respectively.
The detrended signal is then obtained as:
where
represents the original time-series current signal, FFT is performed on the detrended signal
to convert the time-domain signal to the frequency domain:
The discrete Fourier transform of the detrended signal is given by:
where
is the total number of sampling points and
is the imaginary unit.
The corresponding frequency resolution is defined as:
where
is the sampling frequency. According to the Nyquist Sampling Theorem, the effective frequency range is
.
The amplitude spectrum is normalized as:
where
denotes the normalized spectral amplitude. The normalization factor
ensures amplitude consistency across different sampling lengths and allows for meaningful comparison between signals of varying durations.
Features are extracted from the FFT-processed data. A total of 14 FFT features are extracted for voltage and current, including the energy ratio in the 0–1 Hz band, energy ratio in the 5–10 Hz band, peak frequency in the 1–5 Hz band, peak count in the 1–5 Hz band, spectral entropy in the 1–5 Hz band, spectral centroid, and spectral bandwidth.
2.3.2. Wavelet Packet Decomposition
From the perspective of functional analysis, WPD projects a signal onto the space defined by wavelet packet basis functions. From the viewpoint of signal processing, it can be regarded as passing the signal through a series of filters with different center frequencies but equal bandwidths. WPD performs multiple iterations of wavelet transformation to analyze the detailed components of the input signal, decomposing it into a set of subband signals with different frequency ranges. This enables the method to capture the local characteristics of the signal more accurately. WPD is founded on a rigorous mathematical framework that ensures a fully deterministic and reproducible decomposition process. As long as the input signal, wavelet basis function, decomposition level, and other key parameters remain consistent, the resulting decomposition is mathematically guaranteed to be unique and repeatable [
34].
For a given time-series signa
(where
denotes the signal length), WPD is implemented using the following recursive relations:
where
represents the wavelet packet coefficient of the
n-th node at the
j-th decomposition level;
and
are the low-pass and high-pass filter coefficients, respectively; and
is the decomposition level, which is set to 3 in this study. The db4 wavelet basis function is selected, and symmetric boundary conditions are applied. During the decomposition process, the number of nodes at each level is
. For a three-level decomposition, a total of eight subband signals are obtained. For each subband signal generated by WPD, four features—namely energy, variance, peak value, and mean value—are extracted for both voltage and current signals, resulting in 64 features derived from the eight subbands in total.
2.3.3. Time-Domain Feature Extraction
Time-domain features serve as macroscopic indicators of battery degradation, primarily reflecting the statistical characteristics of voltage and current signals and directly correlating with the overall degradation mechanisms that determine the battery’s State of Health (SOH). For instance, capacity fading shortens the voltage plateau interval during charging, which is directly manifested as a drift in the mean voltage. The increase in internal resistance—including both ohmic and polarization components—causes instantaneous fluctuations in the current signal, thereby altering its standard deviation. Moreover, the loss of electrode active materials during aging reduces charge transfer efficiency, leading to a gradual increase in the deviation between accumulated charge and nominal capacity. Specifically, the time-domain features extracted from the stable charging segments in this study include eight key indicators: charging time, cumulative charge, curve entropy, kurtosis, mean value, skewness, curve slope, and standard deviation.
2.3.4. Feature Selection
A total of 94 multi-domain features are initially obtained, including 14 FFT features, 64 WPD features, and 16 time-domain features. However, many of these features are redundant. For example, energy features extracted from different subbands are often highly correlated, which increases the computational cost of the model and introduces unnecessary noise. To address this issue, a hybrid feature selection strategy combining the Pearson Correlation Coefficient (PCC), which measures linear correlations, and the Maximal Information Coefficient (MIC), which captures nonlinear dependencies, is employed to retain features that are most strongly correlated with the SOH.
The Pearson Correlation Coefficient quantifies the strength of the linear relationship between a given feature and the SOH, and is defined as follows:
where
is the number of samples;
denotes the feature value of the
i-th sample;
is the mean feature value;
represents the true SOH of the
i-th sample; and
is the average SOH. The value range of
is
, where a larger absolute value of
indicates a stronger linear correlation. The selection threshold is set to
, which effectively removes more than 80% of weakly correlated features while retaining over 95% of those strongly correlated with SOH.
However, the Pearson Correlation Coefficient captures only linear relationships and cannot account for nonlinear dependencies between features and SOH. To complement it, the MIC is employed for additional feature screening [
35]. Based on mutual information theory, MIC measures the strength of nonlinear associations between two variables, with a value range of
. A larger value indicates a stronger nonlinear correlation. The MIC is defined as:
where
represents the feature,
denotes the SOH, and
m and
k are the numbers of rows and columns in the grid partition, respectively.
(where
is the number of samples) defines the upper limit of grid complexity, and
represents the mutual information calculated after partitioning. The selection threshold is set to
, which corresponds to a nonlinear correlation of medium strength or higher and covers most nonlinear scenarios in SOH estimation. This threshold effectively eliminates redundant features, such as two highly correlated WPD subband energy features with an MIC of 0.98 (only one of which is retained), while avoiding the exclusion of key nonlinear features.
After the two-step selection process, which first considers linear correlation and then nonlinear correlation, a total of 48 features are retained. These include 12 FFT features, 30 WPD features and 16 time-domain features. The Pearson correlation analysis and MIC correlation analysis between the selected features and SOH are illustrated in
Figure 3 and
Figure 4, respectively.
During the feature selection process for Batch 2 of the TJU dataset, it was observed that the correlation between the voltage features extracted using the FFT and the battery’s SOH exhibited an obvious opposite trend with changes in charging–discharging temperature. As illustrated in
Figure 5, for NCM batteries, the VF01 feature shows a strong negative correlation with SOH at 45 °C, but a strong positive correlation at 25 °C. Similarly, features VF02–VF06 display strong positive correlations with SOH at 45 °C, which reverse to strong negative correlations at 25 °C. Although this opposite trend is also observed at 35 °C, the correlation strength between each feature and SOH at this temperature is not statistically significant. Moreover, this temperature-dependent opposite trend is observed only in NCM batteries; NCA batteries do not exhibit such behavior.
This phenomenon is caused by temperature-induced shifts in electrochemical impedance and high-frequency voltage dynamics, rather than by instability or unreliability of the FFT features themselves. Specifically, increasing the temperature reduces charge-transfer resistance and SEI film impedance, which amplifies high-frequency voltage components associated with ion-transport kinetics. At lower temperatures (25 °C), degradation leads to increased polarization, thereby reducing high-frequency spectral energy as SOH decreases—resulting in a negative correlation. In contrast, at elevated temperatures (45 °C), accelerated SEI rupture and electrolyte decomposition induce additional high-frequency fluctuations. These effects increase the spectral magnitude as degradation progresses, producing a positive correlation. Therefore, the sign reversal originates from physically distinct degradation pathways activated at different temperatures, rather than from noise or instability in the FFT feature extraction procedure. Importantly, although individual correlations differ in sign, the absolute correlation strength remains high, indicating that FFT voltage features consistently carry degradation information under all thermal conditions.
2.4. Model Construction
2.4.1. Attention Mechanism
The core idea of the attention mechanism is to enable the network to learn the relative importance of each feature and assign adaptive weights accordingly. By doing so, the model can focus more on features that are critical to the current task [
36]. In this study, a subnetwork is designed to generate attention weights. It consists of three fully connected layers and two batch normalization layers, which learn a weight vector of the same dimension as the original feature vector. The intermediate dimensionality reduction and subsequent dimensionality expansion introduce nonlinearity, reduce the number of parameters, and improve the model’s generalization capability. The final output is obtained through the element-wise product of the original feature vector and the corresponding attention weights. Each feature is thus scaled by a weight value between 0 and 1, where the weight reflects its relative importance. Because the attention weights are generated using the Softmax function and multiplied with the input features, gradients can be backpropagated to both the attention subnetwork and the original feature representations during training. This allows the attention mechanism to automatically adjust feature weights in response to task-specific objectives.
In the forward propagation process, let the input features matrix be
, where
is the batch size and
is the feature dimension. The first operation includes a fully connected layer, followed by GELU activation and batch normalization, formulated as:
where
are the learnable parameters of the first layer.
The output then passes through the second fully connected layer for dimensionality reduction, followed by GELU activation and batch normalization, expressed as:
where
are the learnable parameters of the second layer.
The output is then passed through the third fully connected layer for dimensionality expansion, which can be expressed as:
where
are the learnable parameters of the third layer.
The Softmax function is then applied to generate attention weights, as formulated below:
Here, Softmax is applied along the feature dimension (), so the weight vector of each sample satisfies .
The attention weights are then multiplied element-wise with the original input features to obtain the weighted feature representations, as expressed below:
where
denotes the weighted feature matrix,
represents the original feature matrix, and ⊙ indicates the element-wise multiplication operation.
2.4.2. Loss Function
The overall loss function of the proposed model consists of five components: data loss, PDE residual loss, monotonicity loss, decay loss, and boundary loss. Each component enforces a different constraint to improve model accuracy and ensure physical consistency.
The data loss quantifies the discrepancy between the predicted and true capacity values. A combination of Huber loss and coefficient of determination (
) loss is employed to balance robustness and fitting precision:
The Huber loss is defined as:
where
and
denote the predicted and true capacities of the
i-th sample, respectively,
N is the total number of samples, and
is set to 0.8 times the standard deviation of the target value, but not less than 0.05.
The
loss is given by:
Since a larger value represents better fitting performance, the model minimizes the negative part of during optimization.
- 2.
PDE Residual Loss
To ensure that the predicted degradation trajectory strictly adheres to the underlying physical laws of lithium-ion batteries, a physics-informed degradation equation is incorporated into the proposed PINN framework. The evolution of battery SOH can be described by the following partial differential equation:
where
denotes the battery SOH,
represents the multi-domain input features, and
is the normalized cycle index serving as the temporal variable. The term
is a learnable degradation dynamics function that maps the current battery state and feature space to the instantaneous degradation rate. It characterizes the physical relationship embedded in the degradation process.
Given this physical law, the discrepancy between the predicted temporal derivative of SOH and the estimated degradation dynamics constitutes the PDE residual:
The PDE residual loss requires
to be as close to 0 as possible and the corresponding loss term is:
where
is the number of samples. Minimizing
drives the network to produce SOH trajectories whose temporal evolution aligns with the inherent degradation physics.
- 3.
Monotonicity Loss
The monotonicity loss enforces the physically reasonable assumption that battery capacity decreases monotonically with cycling. For two adjacent samples
and
, if
, the model expects
. The corresponding loss is formulated as:
This constraint regulates the discrete SOH sequence, ensuring that the predicted capacity values follow a strictly non-increasing pattern from cycle to cycle. It effectively removes local upward fluctuations or rebound points and guarantees that each predicted SOH value does not exceed that of the previous cycle. While this loss prevents discrete violations of monotonicity, it does not explicitly control the continuous degradation rate; therefore, it operates in conjunction with the decay loss to achieve fully coherent physical behavior.
- 4.
Decay Loss
The decay loss constrains the derivative of capacity with respect to time to remain negative, ensuring a consistent degradation trend:
Unlike the monotonicity loss, which restricts point-to-point relationships between adjacent cycles, the decay loss acts on the continuous trend of the SOH trajectory. By penalizing any positive derivative, it ensures that the instantaneous degradation rate never becomes upward, thereby preventing the model from producing non-physical positive slopes even when discrete monotonicity is satisfied. This constraint encourages smoother, continuously decreasing degradation curves that reflect the irreversible nature of electrochemical aging.
- 5.
Boundary Loss
The boundary loss restricts the predicted capacity values to the physically meaningful range
:
- 6.
Total Loss
The total loss is the weighted sum of these losses:
where the coefficients
control the relative influence of each term. These weights are dynamically adjusted during training to maintain a balance between data fitting and physical consistency, which is discussed in detail in
Section 2.4.3.
2.4.3. Dynamic Loss Weight Adjustment
To prevent imbalance between data fitting and physical constraints that may arise from fixed weights, the loss coefficients and are adaptively adjusted according to the validation set performance.
PDE weight
(MAE-driven), A smaller MAE on the validation set indicates better data fitting. In this case,
is increased to reinforce physical constraints:
where
,
and
denote the initial, maximum, and minimum weights, respectively, and
is the MAE threshold.
Monotonicity weight
(
-driven), As the validation
increases, the monotonicity constraint is strengthened by raising
:
Decay rate weight
(
-driven), The decay term increases linearly with the improvement in
:
The boundary weight is fixed at the initial value .
4. Conclusions
This study presents a novel AM-MFF-PINN framework for accurate and physically consistent SOH estimation of lithium-ion batteries. The proposed framework integrates multi-domain feature extraction, dual-correlation feature selection, attention-based adaptive weighting, and physics-informed constraints into a unified model, aiming to address the limitations of traditional data-driven and PINN-based approaches.
The experimental results from the XJTU and TJU datasets demonstrate that the AM-MFF-PINN model achieves high estimation accuracy, with MAE values of 0.001–0.002 on the XJTU dataset and 0.003–0.007 on the TJU dataset, and values consistently above 0.96. These results verify that the model maintains stable performance across different battery chemistries, temperature conditions, and charging protocols. Compared with a conventional PINN and a CNN model, the proposed approach exhibits clear advantages in accuracy, error robustness, and generalization capability.
Overall, the scientific novelty of this work is threefold:
A unified SOH estimation framework that simultaneously incorporates multi-domain feature fusion, attention-driven adaptivity, and physics-informed degradation constraints.
A dual-correlation feature selection strategy that preserves both linear and nonlinear degradation relevance, ensuring compact and physically meaningful feature representations.
A dynamic loss-weighting mechanism that adaptively balances data fitting and physical law enforcement, thereby achieving stable convergence and high generalization across diverse operating conditions.
Future work will explore the integration of electrochemical impedance features, extension to real-time onboard BMS deployment, and adaptation to other types of energy storage systems. Overall, AM-MFF-PINN provides a promising and practically deployable solution for high-fidelity SOH monitoring, early fault warning, and life-cycle optimization in electric vehicles and large-scale energy storage applications.