1. Introduction
With the rapid development of the electric vehicle (EV) industry, lithium-ion (Li-ion) batteries have become the primary energy source for EVs due to their high energy density, long lifespan, and efficient charging/discharging performance [
1]. The technical advantages of Li-ion batteries have driven their increasing adoption in the EV market. However, as the usage and number of Li-ion batteries continue to rise, their performance inevitably degrades during prolonged cycling and storage, particularly under the complex and dynamic operational environments of EVs [
2]. This degradation, often manifesting as capacity fading, can sometimes lead to safety incidents. While lithium-ion (Li-ion) batteries have indeed played a transformative role in modern life, their aging and health management have also become critical areas of concern [
3]. To ensure the long-term, safe, and reliable operation of Li-ion batteries, battery management systems (BMSs) are implemented to monitor various battery states [
4]. Among these, the state of health (SOH) is a key factor in guaranteeing the safe operation of EVs. Accurately monitoring and predicting the SOH of batteries is, therefore, one of the core tasks of BMSs [
5].
In order to address the challenges mentioned above, various SOH estimation methods have been proposed by the academic community over the past decades. However, due to the complex electrochemical properties and highly nonlinear behavior of lithium-ion batteries, as well as the diverse and dynamic operating conditions they face, accurately estimating and predicting the SOH of lithium batteries remains a significant challenge [
6]. In general, current SOH estimation methods can be broadly categorized into model-based and data-driven approaches. Model-based methods develop high-fidelity models, either from mathematical or physical perspectives, to assess the battery performance and lifespan [
7]. These methods can be classified into electrochemical models and equivalent circuit models based on the modeling mechanism. Electrochemical modeling approaches aim to describe the thermodynamics, lithium-ion diffusion processes, SEI film thickness, and side reaction kinetics inside the battery, striving for the most accurate battery modeling and state estimation [
8].
For example, Gao et al. [
9] used the Padé approximation to simplify the rigorous P2D electrochemical model and assess battery aging based on three key aging factors, lithium-ion loss, the loss of active material, and increased resistance, ultimately achieving better accuracy. The method based on equivalent circuit modeling uses circuit elements (resistors, capacitors, and voltage sources) to explain various battery phenomena. For instance, Xu et al. [
10] employed an equivalent circuit model to represent the battery, applied the recursive least squares method for online parameter identification, and then used an unscented Kalman filter to estimate the battery state. The use of a filter-based approach is also a common step in equivalent-circuit-model-based methods. For example, Jiang et al. [
11] proposed an improved EKF design for co-estimating the state of charge (SOC) and state of health (SOH). The proposed method combines innovative strategies to overcome covariance traps and objectively compute the optimal covariance matrix configuration, thereby improving the accuracy of the traditional EKF. Although model-based approaches can characterize certain battery mechanisms with high accuracy, the models they rely on require extensive dedicated experiments for calibration, which is resource-intensive [
12]. Additionally, most physical models are subject to specific conditions, limiting their transferability and noise immunity. With the advancement in technology, data-driven methods have been widely applied in SOH estimation. These approaches treat batteries as a “black box” and utilize machine learning (ML) to extract features from extensive aging data for the health state evaluation. Since data-driven methods do not require complex battery modeling or an in-depth analysis of degradation mechanisms, they have become increasingly popular [
13]. Commonly used methods include shallow neural networks, deep learning (DL), support vector machines (SVMs), Gaussian process regression (GPR), etc. Ren et al. [
14] proposed a multi-feature extraction strategy combined with a particle swarm optimization–nonlinear autoregressive exogenous input neural network (PSO-NARXNN). The PSO algorithm was employed to optimize the hyperparameters of NARXNN, including input delay, feedback delay, and the number of hidden neurons, which enhanced the accuracy and robustness of the SOH estimation algorithm. He et al. [
15] introduced an SOH estimation method based on Gaussian process regression (GPR). This method extracted coefficient of variation features from partial charging voltage and current profiles as health factors and incorporated an improved particle swarm optimization algorithm. The improvement involved a mutation factor and adaptive weight adjustment based on population diversity. The proposed SOH estimation framework was validated using the NASA battery dataset.
Despite the progress achieved by the aforementioned methods, SOH estimation still faces significant challenges in generalizability. Model-based methods rely on high-precision battery models and require the extensive recalibration of numerous parameters when there are changes in battery materials or operating conditions [
16]. These requirements limit their generalization capabilities. On the other hand, most data-driven methods, while capable of automatically learning from data, often disregard the aging mechanisms of lithium-ion batteries [
17]. As a result, these methods may produce non-physical or impractical predictions when applied to unfamiliar datasets. Moreover, they require prolonged retraining for new operating conditions or datasets, and their accuracy is not guaranteed, further hindering their generalizability. To solve the above-mentioned problems, physics-informed neural networks (PINNs) integrate physical principles with data-driven approaches [
18]. By leveraging physical knowledge to guide the learning process, PINNs can reduce the retraining time and ensure accuracy when dealing with unfamiliar datasets, thus significantly enhancing model generalizability [
19]. For example, Wang et al. [
20] extracted health factors solely from charging curves and utilized PINNs to model battery degradation attributes from the perspectives of empirical degradation and state-space equations. This approach successfully captured battery degradation dynamics and demonstrated high accuracy across three battery datasets. Similarly, Ye et al. [
21] extracted features from the incremental capacity (IC) and differential temperature curves and incorporated the monotonic relationship between P-IC and SOH as physical constraints in the model’s learning process, thereby improving the interpretability of the neural network predictions.
From the above discussion, it is evident that, although some explorations and innovations in PINNs have addressed certain issues in traditional SOH estimation, several challenges remain unresolved:
In light of these challenges, this paper makes the following contributions:
Two battery datasets with different materials and discharge conditions are selected, and a more comprehensive HF extraction strategy is implemented.
To enhance generalizability, a conventional backpropagation neural network (BPNN) is augmented with a multilayer perceptron (MLP), which models battery degradation dynamics from the perspectives of empirical degradation and state-space equations. Monotonicity constraints are incorporated into the loss function, and both outputs collectively serve as inputs to the physical information component.
A fine-tuned transfer-learning strategy is employed to evaluate the PINN’s adaptability across the selected datasets, with the results compared against the BPNN and CNN for validation.
The remainder of the paper is organized as follows:
Section 2 introduces the experimental datasets and feature extraction strategy and explains the used algorithms.
Section 3 gives the results and discussions. Finally, the conclusions are summarized in
Section 4. Its flowchart is shown in
Figure 1. As shown in the figure, the experimental data obtained from the batteries are processed to extract and select health factors. These factors are first input into the BPNN to obtain the initial SOH. Subsequently, the initial SOH, health factors, and the partial derivatives of the initial SOH with respect to the health factors are fed into the MLP to model the battery degradation dynamics. The overall loss function of the network comprises three components: the BPNN loss, the MLP loss, and the monotonicity loss of the health factors.
2. Experimental Datasets
With reference to recent studies, this paper utilizes two open-source aging datasets provided by the University of Oxford [
24] and Xi’an Jiaotong University (XJTU) [
25], referred to as the Oxford dataset and the XJTU dataset, respectively. This study uses data from these datasets: batteries numbered 1 through 8 from the Oxford dataset, and batteries numbered 1 through 4 from batch 1 of the XJTU dataset. Details are provided below.
2.1. Oxford Aging Dataset
The Oxford aging dataset includes eight LiCoO
2 pouch batteries with a nominal capacity of 0.74 Ah. Aging experiments were conducted at 40 °C. The batteries were discharged using driving cycles derived from the Urban Artemis profile (average current = 1.36 A) until the terminal voltage reached 2.7 V. Subsequently, the batteries were subjected to constant-current (CC) charging at a 2C rate until the terminal voltage reached 4.2 V. This charge–discharge cycle was repeated 100 times, followed by characterization tests to measure battery capacity. Specifically, the charging protocol in the Oxford dataset employs constant-current charging (CC mode), while the discharging process is conducted under dynamic operating conditions based on the driving cycles derived from Artemis curves. The degradation trajectories of the eight batteries in this dataset are shown in
Figure 2.
2.2. XJTU Aging Dataset
The dataset includes 55 lithium-ion batteries subjected to six different charging and discharging strategies, with their operation-to-failure data recorded. These strategies include fixed charging and discharging, random discharging with a fixed current in two different cycles, random walking, and the charging and discharging strategy of a satellite in geosynchronous Earth orbit (GEO). The experimental batteries are divided into six distinct batches based on these different strategies. The selected batteries were manufactured by Lixin and have a chemical composition of LiNi
0.5Co
0.
2Mn
0.2O
2. They have a nominal capacity of 2000 mAh, a nominal voltage of 3.6 V, a charging cutoff voltage of 4.2 V, and a discharge cutoff voltage of 2.5 V. For this study, four cells from the first batch were selected, with the operating conditions involving constant-current charging and constant-current discharging. The degradation trajectories of these four cells are shown in
Figure 3.
2.3. Aging Analysis and Feature Extraction
The aging of Li-ion batteries is typically characterized by capacity fade, increased internal resistance, and changes in the charge/discharge curve shapes. These degradation trends visually reflect the aging state of the battery. By extracting health factors (Health Factors, HF), these trends can be quantitatively assessed, thereby further evaluating the aging status of the battery. The selection strategy for health factors plays a crucial role in the performance of the final model. Therefore, when choosing health factors, it is important to follow principles such as easy accessibility, suitability for real-world operating conditions, and a high correlation with aging, while also considering their adaptability and generalizability across different battery types.
In line with this approach, we aim to minimize the input dimensionality of the neural network while ensuring model accuracy and robustness. To this end, this section employs a generalized health factor extraction strategy based on the analysis of two open-source battery aging datasets. In this strategy, we extract health factors from the charge curves, discharge curves, and incremental capacity (dQ/dV) curves in each dataset. These factors are then quantitatively analyzed for their correlation with battery aging, and the factor most strongly correlated with aging is selected as the input for the aging estimation model. This generalized strategy ensures the accuracy and robustness of the model while maintaining its applicability and generalizability across different battery types.
2.3.1. Charging Curve
From the charging process, the constant-current charging process takes less and less time under the same charging rate because the capacity of Li-ion batteries decreases with the aging of Li-ion batteries. From the charging voltage curve of Oxford Cell1, there is an inflection point at 3.8 V, and the change of charging voltage curve is more obvious in the range of 3.8 V to 4.2 V, as shown in
Figure 4.
For the XJTU dataset, the charging process includes a constant-voltage phase. To ensure consistency with the Oxford dataset, we focus on the constant-current charging portion. As seen in the curve of Battery 1 (
Figure 5), there is a distinct inflection point around 3.8 V, after which the process transitions to constant-voltage charging at 4.2 V. Therefore, for both datasets, the Charged Time During Equal Voltage Increase (CTEVI) from 3.8 V to 4.1 V is selected as a health factor. Since the current magnitude remains constant during constant-current charging, once the time required for a given voltage increase is known, the corresponding charged capacity (Charged Capacity During Equal Voltage Increase, CCEVI) can be easily calculated as another health factor. It is important to note that, because the Oxford dataset does not include a constant-voltage phase, while the XJTU dataset does, health factors related to constant-voltage charging are not considered.
2.3.2. Discharge Curves
Although the batteries in the two datasets were tested under different operating conditions, their discharge curves exhibit certain similarities. Overall, as the batteries continue to age, the time required to discharge to the lower cutoff voltage becomes shorter, and the slope of the discharge voltage increases. For both batteries, there is little distinction in the discharge voltage curve between 4.2 V and 3.7 V. However, below 3.7 V, a noticeable difference appears, as shown in
Figure 6 and
Figure 7. Therefore, the time from 3.7 V to the end of discharge (Discharged Time During Equal Voltage Decrease, DTEVD) and the capacity discharged during this time (Discharged Capacity During Equal Voltage Decrease, DCEVD) are selected as health factors.
2.3.3. Incremental Capacity Curve
Incremental curve analysis (ICA) is a commonly used method to analyze the aging mechanism of LiB, which can transform the plateau curve of voltage into the informative IC curve; the wave peaks in the IC curves represent different phase transition processes during the charging and discharging phases of LIBs, and the information such as the height and position of the wave peaks can present certain information [
26,
27]. This information can effectively characterize the insertion and extraction process of lithium-ion active materials, thus revealing the phase transition properties within LIBs. Therefore, the IC curve analysis is considered a key technique to study the aging mechanism of LIBs.
The following differential equation can be used to calculate the IC curve from the CC charging process:
where
Q and
V denote the capacitance and voltage, respectively.
denotes the charging capacity at moment
t, and
denotes the terminal voltage at the moment
t, and
t denotes the sampling time.
The original IC curve processed in this way may have problems such as having sharp points or insignificant wave peaks, and Gaussian filtering is needed on top of this in order to obtain a smooth IC curve quickly. As shown in
Figure 8, Oxford’s IC curve has two obvious peaks after processing, and the shape of the curve changes significantly with the decrease in SOH; the most obvious is that the two peaks become one and the height of the second peak decreases at a low SOH.
As shown in
Figure 8 and
Figure 9, the IC curve of XJTU has only one obvious wave peak after processing, and the shape of the curve changes significantly as the cycle progresses, and the height of the wave peak decreases. To summarize, we can use the peak value of the cell peak (IC Peak, ICP) and the area of the voltage region where the peak is located (Area of IC Peak, ICPA), in which, for the Oxford dataset with two peaks, we choose the second peak with more obvious changes.
2.3.4. Feature Extraction
The above discussion has analyzed and extracted the health factors from the two datasets, and these screened health factors were subjected to Pearson’s correlation analysis, which is a valid method for assessing the degree of correlation between two continuous variables. The expression is given below:
where
is the Pearson correlation coefficient of
and
denote the mean values of the health factor and SOH, respectively, and the results for both datasets are shown in
Figure 10.
It can be seen that the average values of the ICPA and ICA correlation coefficients of XJTU and Oxford are above 0.95, which can illustrate the monotonous limiting relationship of IC curves to SOH; moreover, the correlation coefficients of CTEVI and CCEVI are above 0.9, indicating that the correlation coefficients selected before are all strongly correlated with SOH. Specifically, the correlation coefficients of ICPA in the selected samples range from a maximum value of 0.9993 to a minimum value of 0.9518, with an average value as 0.9867. The correlation coefficients of ICP in the selected samples range from a maximum value of 0.9993 to a minimum value of 0.9521, with an average value as 0.9872. The correlation coefficients of CTEVI in the selected samples range from a maximum value of 0.9995 to a minimum value of 0.8961, with an average value as 0.9681. The correlation coefficients of CCDEVI in the selected samples range from a maximum value of 0.9994 to a minimum value of 0.8951, with an average value as 0.9676. Therefore, the selected health factors are CTEVI, CCEVI, ICP, and ICPA.
2.4. Network Structure
Considering the complexity of the reaction mechanisms in lithium-ion power batteries, and their strong nonlinearity and time-varying nature, as well as the need to perform nonlinear mapping for a large volume of input–output data, the Backpropagation Neural Network (BPNN) is an effective method for SOH estimation [
28]. It continuously adjusts the network’s weights and thresholds through backpropagation to minimize the sum of squared errors, thereby simulating the human brain’s data processing capabilities [
29]. The BPNN topology consists of an input layer, hidden layers, and an output layer, with the gradient descent method used to find the optimal solution. In this study, the BPNN structure includes six input dimensions, one output dimension, three layers, fifty hidden dimensions per layer, and a dropout rate of 0.008. These configurations were selected based on preliminary experiments to balance the trade-off between model complexity and prediction accuracy. For an r-dimensional input layer vector
, s-dimensional hidden layer vector
, and t-dimensional output layer vector
, the main steps of BPNN are as follows:
Calculate the expected output, the health factor, and time as input information entered into the input layer through forward propagation; the layers of the network are calculated in a specified order, and the output layer receives the output results as shown in Equation (3) and (4):
where
is the weight between the i-th neuron in layer l and the j-th neuron in layer l + 1.
is the bias of the j-th neuron in layer l + 1.
is the activation function, and the sin function chosen in this paper.
- 2.
The error calculation, usually the loss function of traditional neural networks, is defined as the mean square error:
where N denotes the number of training samples
- 3.
Update the weights and biases:
where α is the learning rate, and L is the value of the loss function.
The error is backpropagated through the network, continuously updating the weights and thresholds to optimize the network. The training process iterates until the error requirement is met or the maximum number of iterations is reached. From this, it can be seen that the advantage of BPNN lies in its ability to perform the nonlinear mapping of inputs and outputs for large datasets, with a certain degree of self-learning and adaptive capability, making it widely applicable in various scenarios. However, it also becomes apparent that the adaptive ability of BPNN is based on adjusting the network weights during training, relying solely on the loss function for a single dataset () without considering other factors. This may result in performance degradation when encountering unfamiliar datasets and may not guarantee an optimal performance after retraining. Therefore, from this perspective, the generalization ability of the BPNN needs to be improved.
To address the issues mentioned above, this paper optimizes the BPNN by incorporating additional considerations into the loss function. As shown in
Figure 11, the health factors are first introduced into the BPNN. Subsequently, as described in [
15], battery degradation dynamics are integrated during the training process. Specifically, the battery degradation rate can be represented by a multivariate function, but its explicit form is difficult to determine. Therefore, a multilayer perceptron (MLP) is used to approximate this multivariate function, taking the health factors, predicted SOH, and the first derivative of the health factor with respect to SOH as inputs to the MLP. The actual time derivative of the SOH is used as benchmark data, thereby constructing a multivariate function to model the battery aging process. The loss function calculation for this process is shown in Equation (8):
In addition to this based on the previous analysis of the IC curve and the literature [
21], the monotonic relationship between the peak of the IC curve and the SOH can be incorporated into the neural network as physical information with the addition of a loss function:
where
represents the partial derivative of
with respect to
, and
represents the feature values extracted from the Incremental Capacity (IC) section.
Figure 11.
The layer configuration and learning mechanism of the PINN model.
Figure 11.
The layer configuration and learning mechanism of the PINN model.
The final loss function for the whole network is as follows:
where
is the parameter. This gives the whole model the ability to adjust the network weights from the perspectives of two battery degradation mechanisms: the battery degradation kinetics and IC curve analysis.
After constructing the PINN, it is initially validated on each of the two selected datasets. To assess whether the generalization capability of the PINN has been improved, transfer learning experiments are then conducted. The evaluation approach involves selecting one of the aging datasets as the source domain and using the other dataset as the target domain. The base model is first trained using a randomly selected battery from the source domain. Subsequently, a battery from the target domain is randomly chosen, and the base model is fine-tuned using its cycle data. Finally, the re-trained model is tested using the remaining batteries from the target domain. The specific results are presented in detail in the next chapter. Notably, the average RMSE between the PINN predictions and the actual values is 0.0053, indicating that the overall RMSE is small and does not exhibit systematic patterns of bias. This suggests that the model does not have significant systematic biases. Overall, the PINN model maintains relatively low RMSE and MAPE values under all conditions, indicating good accuracy and stability across different scenarios. The low RMSE reflects the model’s robustness against outliers, demonstrating its ability to handle noise or extreme data points while maintaining reliable performance. Meanwhile, the low MAPE signifies the model’s high precision in terms of the overall prediction error, enabling the accurate estimation of the battery state of health across various battery types, operating conditions, and data distributions.
3. Results and Discussion
This section provides a detailed account of the results obtained using the proposed generic health factor extraction strategy and PINN for SOH estimation on a single aging dataset, with a comparison to BPNN and CNN. Additionally, the effectiveness of the proposed PINN algorithm under the fine-tuning-based transfer-learning strategy is verified across different types of batteries, and the potential improvement relative to the BPNN is evaluated.
To assess the prediction accuracy, commonly used statistical metrics such as the Mean Absolute Percentage Error (MAPE) and Root Mean Square Error (RMSE) are employed for the quantitative evaluation. The MAPE measures the overall prediction error by calculating the percentage difference between each predicted value and the true value, then averaging it. In contrast, the RMSE is more sensitive to outliers and can be used to assess the robustness of the model.
where
denotes the estimated value of SOH,
denotes the true value of SOH, and N represents the number of samples.
3.1. Results on a Single Dataset
For each battery cycle data in the two datasets, XJTU and Oxford, the training and test sets were divided according to the ratio of 6:4, and both the BPNN and PINN were trained according to this division ratio, and the results are in
Table 1.
It can be observed that the average RMSE for the PINN on the XJTU dataset is 0.0024, with an average MAPE of 0.1961%, while the average RMSE for the BPNN on the XJTU dataset is 0.0188, with an average MAPE of 1.8589%. On the Oxford dataset, the average RMSE for the PINN is 0.0073, with an average MAPE of 0.6218%, while the average RMSE for the BPNN is 0.0153, with an average MAPE of 1.5750%. For the XJTU dataset, using the PINN instead of BPNN results in a reduction of approximately 87% in the RMSE and approximately 89% in the MAPE. On the Oxford dataset, the use of the PINN leads to a reduction of about 52% in the RMSE and about 60% in the MAPE compared to the BPNN. The specific prediction results can also be observed in the line charts. The line graph (
Figure 12 and
Figure 13) shows the data for Cell 4 from the Oxford battery dataset. From the graph, it is evident that the PINN (BPNN-MLP) significantly improves the prediction accuracy compared to the BPNN.
The line graph (
Figure 14 and
Figure 15) shows the data for Battery 4 from the XJTU dataset. From the graph, it is evident that the PINN (BPNN-MLP) significantly improves the prediction accuracy compared to the BPNN. In terms of the relative error, the PINN consistently maintains an error below 0.5%, whereas the BPNN quickly reaches 3% in the later stages of cycling. Similarly, in the Oxford dataset, using Battery 4 as an example, the line graph shows that the PINN (BPNN-MLP) outperforms the BPNN in prediction accuracy. The PINN’s relative error remains below 0.5%, while the BPNN’s error increases to 1.5% in the later stages of the cycle.
Based on these results, we can conclude that the PINN achieves a higher accuracy than the BPNN for the same dataset, and consistently produces better prediction results for each battery’s cycling data.
3.2. Transfer Learning Validation
To verify the generalization and interpretability of the proposed PINN algorithm, validation experiments were conducted using transfer learning. The source and target domains were first determined. The PINN model was trained using cycling data from randomly selected batteries in the source domain. Two strategies, namely, model parameter fine-tuning and domain adaptation, are commonly employed in recent works for SOH estimation [
30]. Among these, parameter fine-tuning is a more straightforward and efficient approach. Moreover, the primary differences between the two selected datasets lie in the experimental temperature and discharging conditions, which may affect the model’s ability to adapt to the target data, particularly when the temperature and discharging conditions significantly influence the battery performance. To mitigate these effects, we selected the fine-tuning strategy, adjusting certain model parameters to improve its performance on the target domain. A fine-tuning transfer-learning strategy was then applied, where a battery from the target domain was randomly selected, and its cycling data were used to fine-tune the base model. During fine-tuning, only the final connection layer was updated, while the other layers were frozen. Finally, the retrained model was evaluated using other batteries from the target domain.
When the XJTU dataset is selected as the source domain and the Oxford dataset as the target domain, the results are presented in
Table 2. In this case, Battery 4 from the Oxford dataset was used for fine-tuning. The results indicate that PINN transfer learning achieved an average RMSE of 0.01 and an average MAPE of 0.868%, while the BPNN yielded an average RMSE of 0.023 and an average MAPE of 2.229%. This represents a reduction in RMSE by 56.27% and MAPE by 61.07% for the PINN compared to the BPNN, demonstrating a significant improvement in prediction accuracy across all battery cycles.
Similarly, when Oxford is chosen as the source domain and XJTU as the target domain, the results are shown in
Table 3. In this case, Battery 4 from the XJTU dataset was used for fine-tuning. The results show that PINN transfer learning achieved an average RMSE of 0.011 and an average MAPE of 1.16%, whereas BPNN produced an average RMSE of 0.019 and an average MAPE of 1.94%. This indicates a reduction in RMSE by 40.5% and MAPE by 39.9% for the PINN compared to the BPNN.
These results demonstrate that, under the selected transfer-learning strategy, the PINN provides a substantial improvement in accuracy over the BPNN, addressing the BPNN’s generalization limitations and enhancing its performance.
3.3. Future Work
In this study, the effectiveness of the proposed PINN model for SOH estimation, using a generalized high-frequency feature extraction strategy and a fine-tuning-based transfer-learning strategy, was comprehensively validated using two open-source aging datasets with different material types and working conditions. However, several limitations remain. First, LiFePO4 (LFP) cells were not considered in this study. Second, the proposed PINN framework models battery degradation dynamics from the perspective of empirical degradation laws and state-space equations. With the availability of more data and internal variables, more sophisticated electrochemical models could be incorporated to capture battery aging dynamics. Finally, due to the small sample size of the Oxford dataset, the performance of the PINN on this dataset was lower compared to the XJTU dataset, and further exploration is needed in order to understand the underlying reasons for this performance difference across more datasets. In future research, we plan to explore a broader range of battery types and enhance the accuracy and generalization of the proposed method, tailoring it to the specific characteristics of different battery types. To address the aforementioned limitations, future work will focus on increasing the diversity of datasets and improving the model structure. By optimizing the model architecture, the proposed framework can better leverage diverse datasets, enabling it to perform more effectively on more datasets. Additionally, with the increased diversity of datasets, improvements in training methods will enhance the model’s ability to learn from multi-source heterogeneous data.