1. Introduction
Electric vehicles and portable electronics have rapidly developed a demand for efficient and reliable lithium-ion batteries (LIBs) in the market. Longevity, durability, and performance are greatly important for customer satisfaction in LIBs and technological development. Accurate estimation of a battery’s SOH is crucial for optimizing the battery usage and preventing sudden failures, which have great economic consequences and safety risks [
1,
2].
Model-based methods of SOH and RUL estimation solely depend on the physical and chemical nature of the battery for correct estimation. Most of these methods develop mathematical models for describing the activities of the battery in working and weighting conditions. For instance, Baghdadi et al. [
3] developed a semi-empirical model based on Dakin’s degradation approach to assessing lithium-ion battery cell life loss under various operating conditions. Among these factors, temperature, SOC, and current magnitude are the most influential degradation factors. The logarithms of capacity fade and increase in internal resistance, evolving linearly with time, and this forms a solid basis for predicting battery aging. Zhang et al. [
4] integrated a thermal dynamics model for RUL estimation, combining temperature and state of charge to predict battery aging more effectively. Xu et al. [
5] developed a stochastic deterioration rate model based on the Wiener process and the Arrhenius temperature model. This model improves the accuracy of the deterioration predictions by accounting for fluctuating temperature conditions at various times and introducing a stochastic variation of a battery’s aging processes.
Dong et al. [
6] discussed a physics-based aging model in which the solid electrolyte interphase (SEI) layer’s growth and creation is the main aging mechanism. This model predicts lithium-ion battery capacity fade by combining mechanical and chemical degradation factors. The impact of different operating circumstances on the formation of the SEI layer is simulated by the model. Further insights into health indicators and capacity thresholds for accurate RUL predictions were developed by Sun et al. [
7], who used capacitance and resistance as primary health indicators and provided a third-order polynomial function to represent the degradation process.
Lui et al. [
8] projected battery capacity deterioration by simulating the depletion of lithium inventory and active materials at both electrodes. The anticipated model incorporates the influences of temperature, state of charge, and current rates and offers a thorough understanding of battery aging.
Data-driven approaches, which use machine learning techniques that can handle big datasets and uncover hidden trends that are difficult for standard models to capture, forecast the battery SOH and RUL based on previous data and sophisticated algorithms.
In 2020, Gou et al. [
9] presented a hybrid ensemble data-driven technique. To improve the prediction accuracy of SOH and RUL, the researchers combined nonlinear autoregressive models with random learning methods. In test datasets, their methodology performed better.
A time domain method was presented by Harris et al. [
10] for monitoring SOH and RUL with the least amount of hardware. Their model uses a nonlinear least-squares method to estimate impedance characteristics by superimposing a short test signal on the battery load, whereas Reference [
11] employed a support vector regression (SVR) model with a particle filter to manage measurement noise, yielding robust RUL predictions.
There are still a number of obstacles to overcome before precise and trustworthy SOH and RUL calculations can be obtained, even with the advances in model-based and data-driven approaches.
The complexity of battery aging processes poses a challenge for model-based approaches in identifying time-varying parameters and real-time applicability, as demonstrated by [
12]. In their work, the authors [
12] suggest a progressive LSTM technique to solve some of these issues with data quality and model fidelity.
Beganovic et al. [
13] discussed the difficulties in developing accurate lifetime models that can adapt to varying operational conditions. They emphasize the need for models that can handle the stochastic nature of battery degradation. Additionally, Beganovic et al. [
13] indicates that current methods are lacking the robustness required for real-world applications.
He et al. [
14] pointed out the limitations of Gaussian process regression models in accurately predicting SOH and RUL without trend extraction. They proposed a multiscale approach to improve prediction accuracy but noted that the complexity of battery degradation mechanisms still poses significant challenges.
Tang et al. [
15] mentioned that data-driven methods can suffer from a lack of generalization and robustness due to the reliance on historical data. They propose an improved BiLSTM algorithm, yet acknowledge that the need for large, high-quality datasets remains a barrier. Gupta et al. [
16] examined several machine learning methods for estimating SOH and RUL, stressing the benefits and drawbacks of each strategy. They came to the conclusion that although data-driven approaches have potential, combining them with physical models could improve their precision and dependability.
To increase prediction accuracy, Zhange et al. [
17] suggested an ensemble approach that combines several data-driven models. They did point out that two important disadvantages are the requirement for a large amount of training data and the computational complexity.
There are distinct benefits to using both data-driven and model-based approaches for estimating SOH and RUL in battery systems. While data-driven methods use machine learning algorithms and historical data to make precise forecasts, model-based methods offer insights into the physical and chemical processes of battery degradation [
18,
19].
In this work, a novel framework is introduced, the GRU-battery performance predictor (GRU-BPP), to overcome the drawbacks of both model-based and data-driven techniques. Granger causality and K-means clustering are used to comprehend inter-group linkages inside battery systems, while exploratory data analysis (EDA) and principal component analysis (PCA) analyze intra-group characteristics. The robust model captures complicated physical relationships and improves predicted accuracy by using gated recurrent unit (GRU) networks.
The structure of this paper is as follows: First, we give a summary of the dataset along with the experimental setup that was utilized to acquire the data in
Section 2. We introduce our novel model GRU-battery performance predictor in
Section 3. We explain the process of exploratory data analysis (EDA) to find significant trends and anomalies in the data in in
Section 3.1. The use of Granger causality to comprehend the temporal correlations between the battery parameters is covered next in
Section 3.2. The principal component analysis (PCA) and clustering methods that are employed to classify the data and distinguish between different operational states are discussed in the next
Section 3.3. Last, we present the complete flow chart of the GRU-battery performance predictor model (GRU-BPP) in
Section 3.4 and assess its performance in comparison to benchmark techniques. We carry out a number of computational experiments in
Section 4, such as Granger causality, principal component analysis (PCA), K-means clustering, and sensitivity analysis of the GRU-BPP hyperparameters. These analyses are performed on both the aging and reference performance testing (RPT) datasets. A review of the results is discussed in
Section 6, and possible future research directions are included in the paper’s conclusion in
Section 7.
2. Dataset
This study utilizes a publicly accessible dataset compiled by [
20]. The dataset includes aging data from three prismatic lithium-ion (Li-ion) battery cells tested across multiple aging rounds under a realistic forklift operation profile. Aging data from three prismatic lithium-ion battery cells comprise the dataset used in this investigation. During a two-week testing period, these cells underwent a realistic forklift operations profile that was distilled from a four-month stint of field operation. In order to expedite the aging process, experiments were carried out at 45 °C, 40 °C, and 35 °C for cells 1, 2, and 3, respectively. The forklift field operation’s idling intervals were eliminated, and in order to comply with the battery test station’s requirements, the charging and discharging current during the aging process was capped at 50 A. With the exception of the eliminated idling times, the state of charge (SOC) profile of the batteries throughout testing was identical to that of the field-operated forklift batteries. Every aging round consisted of two stages: a reference performance test (RPT) phase and an aging phase employing a realistic forklift load profile. For Cells 1 and 2 and for Cell 3, these phases were repeated 58 and 53 times, respectively. Throughout the aging and testing processes, precise data on current, voltage, energy, and temperature were captured on a second-by-second basis.
3. GRU-Battery Performance Predictor (GRU-BPP)
This paper introduces the GRU-battery performance predictor (GRU-BPP) as a novel data-driven approach specifically developed for this study to improve predictive accuracy and reliability in analyzing battery degradation. The new GRU-battery performance predictor (GRU-BPP) model combines physical and causal links to make predictions more accurate and reliable. It is used to study how batteries work and how they break down. The unique model makes use of sophisticated algorithms to comprehend the heterogeneity in battery performance under various operating situations. To find underlying trends, distributions, and anomalies in the dataset, exploratory data analysis, or EDA, is first carried out. In order to choose the most pertinent traits for further analysis, this stage is essential. The Granger causality test is then used to ascertain the temporal correlations and predictive influences between the parameters. This test assesses the predictive power of previous values of one parameter over future values of another, hence shedding light on causal relationships within the data. The Granger causality test is based on a mathematical comparison of the variance of the residuals between the restricted model (which only includes lagged values of the predicted parameter) and the unconstrained model (which includes lagged values of both parameters). The F-statistic for Granger causality is calculated as
where the numbers
m and
n represent the number of lagged terms,
k the number of parameters in the unrestricted model, and
the residual sum of squares for the limited and unrestricted models, respectively. Understanding the directional influence between factors and confirming that these associations hold true across various data subsets depend heavily on this approach. Next, principal component analysis (PCA) [
21,
22] is used to extract the most important features from the dataset while reducing its dimensionality. The data matrix
X is broken down by the PCA into loadings and scores, resulting in
, where the loadings are
P and the scores are T. This method aids in pinpointing the main variables influencing battery degradation and performance. We may identify the variables that most significantly contribute to the observed variance by looking at the loadings of the principal components. This identifies important areas that require additional investigation. Our method is unusual in that it uses Granger causality and PCA together to validate internal links between the most crucial elements. By ensuring that the selected characteristics have both predicted causal linkages and a large contribution to the variance in the data, this dual synthesis improves the robustness of our model. By closely examining each feature’s significance, the prediction model’s accuracy and dependability are increased as it concentrates on the most important variables. The PCA-transformed characteristics are used to divide the data into discrete groups using K-means clustering. Minimizing the within-cluster sum of squares is the goal of K-means clustering:
where
is the set of points in cluster
k, and
is the centroid or the average position of points in cluster
k. This clustering addresses both intra-group and inter-group heterogeneity, allowing us to uncover patterns and variations within and between clusters. The clusters reveal different degradation behaviors and performance characteristics, providing a deeper understanding of battery health under varying conditions. Following the clustering, the output clusters are used as input features for the GRU model, enhancing its predictive capability by incorporating the discovered patterns and relationships. The GRU model, designed to capture temporal dependencies and complex relationships within the data, is particularly suited for time-series predictions. The GRU is mathematically defined as follows:
where
is the update gate, and
is the candidate activation. The update gate
and reset gate
are computed as
where
is the sigmoid function. To assess the model’s performance, 80% of the data are used for training, while the remaining 20% are used for testing. Mean squared error (MSE) was used as the loss function and the Adam optimizer with a learning rate of 0.001 was used to train the model. In order to avoid overfitting, training was carried out for 50 epochs with early termination. The model’s performance was measured using RMSE on a different test set that included 20% of the data. In order to comprehend intra- and inter-group differences and explain the variety in battery performance and degradation patterns, the relevance of the detected clusters was investigated. The present analysis offers a more profound understanding of the variables impacting battery health, guaranteeing that variation within and between groups is adequately addressed. The outcomes show how well the GRU-BPP model works at making accurate predictions, which improves knowledge and battery health management. The study’s integration of sophisticated analytical approaches results in the identification of critical parameters that impact battery performance, as well as their utilization to construct a resilient prediction model. This all-encompassing strategy guarantees precise and trustworthy forecasts, significantly advancing the fields of degradation analysis and battery health monitoring. The higher performance of our unique methodology is demonstrated by comparing the GRU-BPP model with conventional methods like LME (statsmodels version: 0.14.0) and XGBoost. (xgboost version: 1.7.3).
3.1. Exploratory Data Analysis (EDA)
In this section, we go into the exploratory data analysis (EDA) of the dataset [
20] to uncover initial patterns, trends, and insights. The following plots (
Figure 1 and
Figure 2) illustrate various aspects of the data collected during the reference performance testing (RPT) and aging processes.
Figure 1a shows the mean current for Cells 1, 2, and 3 over multiple rounds of RPT. All cells initially exhibit a high mean current which stabilizes over time. Notably, Cell 1 shows significant fluctuations, indicating possible anomalies.
Figure 1b presents the mean voltage trends, where all cells start with high initial voltages that decrease over time. Cell 1’s voltage exhibits pronounced fluctuations, suggesting irregularities in its aging process. In
Figure 1c, the mean energy data follow similar trends, with initial high values that decrease over time. Cell 1’s energy data also show significant fluctuations, reinforcing the possibility of anomalous events.
Figure 1d displays the mean temperature trends, which increase over time, particularly for Cell 1, indicating more significant degradation or potential testing issues.
During the aging process, as shown in
Figure 2a–d, the mean current and voltage exhibit stable trends with minor fluctuations. Cell 1 shows more variability compared to Cells 2 and 3, indicating potential differences in response to aging. The energy and temperature trends during aging also reveal a declining pattern, with Cell 1 showing more significant fluctuations and higher temperatures, reflecting higher internal resistance and accelerated degradation.
Figure 3a–d show the distributions of current, voltage, energy, and temperature during RPT. The current distribution reveals peaks around specific values, indicating consistent loading during testing. Voltage readings fall within a narrow range, suggesting stable voltage levels. Energy values display a broad distribution, reflecting varying operational modes. Temperature readings cluster around 25 °C to 30 °C, with Cell 1 operating at slightly higher temperatures.
Figure 4a–d display the distributions of the same parameters during the aging process. Current and voltage distributions are similar to the RPT phase, indicating consistent loading and voltage ranges. Energy values show broader distributions, reflecting varying energy profiles during aging. Temperature distributions highlight distinct peaks, with Cell 1 experiencing higher and more variable temperatures, indicating greater degradation stress.
3.2. Granger Causality Analysis
The Granger causality test was applied to identify significant causal relationships between parameters by assessing the predictive capability of one parameter over another, using lags from 1 to 5. The analysis revealed significant relationships within both the RPT and aging data samples. In the RPT data, strong causal relationships were observed between current and voltage in Cell 2 and Cell 3 and between temperature and energy in Cell 2 (
Table 1). For the aging data, notable relationships included those between temperature and energy in Cell 1 and between voltage and current in Cell 3 (
Table 2). These interdependencies are crucial for subsequent PCA and clustering analyses, enhancing the predictive capabilities and reliability of battery health monitoring systems.
3.3. PCA and Clustering Techniques
Principal component analysis (PCA) is employed here to capture the intra-group characteristics in order to examine complicated physical relationships.To further explore patterns within the dataset, clustering techniques were applied using K-means and Gaussian mixture models (GMM). The performance of both methods was evaluated using the Davies–Bouldin index, Calinski–Harabasz index, and silhouette score. Additionally, the plots of both methods are shown in
Figure 5 and
Figure 6.
K-means outperformed GMM, as indicated by lower Davies–Bouldin index and higher Calinski–Harabasz index and silhouette scores, suggesting more compact and well-separated clusters (
Table 3). Consequently, K-means was selected for further analysis.
3.4. Gated Recurrent Unit (GRU) Algorithm
A gated recurrent unit (GRU) is a type of recurrent neural network (RNN) designed to handle sequential data and capture temporal dependencies. GRUs address the vanishing gradient problem common in standard RNNs, with the reset gate and update gate determining the amount of past information retained and new information added. The GRU is defined mathematically as follows:
where
is the input at time step
t,
is the hidden state from the previous time step,
is the sigmoid activation function, and
,
, and
W are weight matrices. The GRU’s ability to capture long-term dependencies makes it effective for time-series prediction tasks, such as forecasting the state of health (SOH) of lithium-ion batteries. The clustering outputs serve as additional input features for the GRU model, leveraging the identified patterns to enhance predictive accuracy. A flow chart explaining all the previous steps connecting the different phases of the novel approach is illustrated in the detailed flow chart in
Figure 7 and
Figure 8.
4. Evaluation Results
In this section we evaluate our methods empirically as follows:
4.1. Principal Component Analysis
To further investigate the relationships and variance in the dataset, principal component analysis (PCA) was conducted on the standardized data. The PCA was applied separately to the RPT and aging datasets for each cell. The results are explained in
Figure 9 and
Figure 10, respectively.
The data were first standardized to ensure that each feature contributed equally to the PCA. The top two principal components (PCs) were extracted and analyzed for each cell, providing insights into the key features driving the variance in the dataset. The contribution of each feature—current, voltage, energy, and temperature—to the principal components was visualized using bar plots.
The PCA plots’ comparison analysis is displayed in
Figure 9. PC1 has determined that the features of voltage and energy are the most prominent in Cells 1, 2, and 3, while PC2 has demonstrated that temperature is the most important feature. Cell 1, which was tested at 45 °C, displays a distribution of data points along both principal components in the PCA scatter plot. This suggests that there is a significant amount of variability in the dataset and that higher temperatures may be causing more pronounced degradation effects, which in turn may lead to greater variability in the battery’s performance during the RPT process. In contrast to Cell 3, the PCA scatter plot for Cell 2, which is aged at a moderate temperature of 35 °C, shows a more scattered distribution of data points. This points at increased unpredictability as well as possible deterioration in the battery’s functionality under these particular circumstances. In its PCA scatter plot, Cell 3, which was aged at a lower temperature of 25 °C, displays a more compact distribution of data points, indicating steady and reliable performance. A strong response to the aging conditions and little variability are suggested by the clustering around the principal components. Lower temperatures seem to lessen the impacts of degradation, resulting in more dependable performance throughout the RPT procedure. Comparative examination of the PCA plots for Cells 1, 2, and 3 in
Figure 10 also showed different patterns. The PCA scatter plot for Cell 1 shows a distribution with compact variance along both major components, indicating a high degree of response variability from the battery. The bar plot shows that for PC2, temperature and energy are dominant features, while for PC1, voltage and current remain significant. Referencing
Table 2, the significant Granger causality relationships in Cell 1 between temperature and energy align with the PCA results, reinforcing the importance of these parameters.
For Cell 2 in
Figure 10b, the PCA scatter plot displays a very broad distribution, indicating substantial variability. This suggests higher variability and potential stress or degradation in the battery’s performance under its specific conditions. The bar plot shows that for PC1, current and voltage are dominant features, while for PC2, energy and temperature are significant. Referencing
Table 2, the significant Granger causality relationships in Cell 2 between current and temperature, energy and voltage, and temperature and voltage align with the PCA results, highlighting the critical role of these parameters.
For Cell 3 in
Figure 10c, the PCA scatter plot shows a compacted distribution compared to Cell 2, indicating relatively stable performance. The clustering around the principal components suggests minimal variability. The bar plot shows that for PC1, current and voltage are dominant features, while for PC2, temperature and energy are significant. Referencing
Table 2, the significant Granger causality relationships in Cell 3 between voltage and current align with the PCA results, confirming the importance of these parameters.
Additionally, the
Figure 9 and
Figure 10 visualize the interaction between the principal components (PC1, PC2) and the energy feature. By including energy on the z-axis, the 3D plots provide a clearer understanding of how energy levels distribute across different regions of the PCA space. For instance, in the 3D scatter plot of RPT data for all cells, the energy values show a distinct gradient, with higher energy clustering in the lower ranges of PC1 and PC2, as shown in
Figure 9, suggesting a concentration of higher energy data points under these component conditions. In contrast, Cells 1 and 2 in the
Figure 10 plots reveal more evenly distributed energy levels across the PCA components, further highlighting its stability and consistent performance.
4.2. Feature Distribution in K-Means Clustering
In this computational experiment, we discuss the features distribution in each cluster for the RPT and aging datasets in Cells 1, 2, and 3. The distinct clusters are visualized in the right plots (b, d, and f) of
Figure 5 and
Figure 6.
The clusters are well separated in the principal component space, indicating clear patterns in the battery’s operational data. Further analysis of the feature distributions within each cluster (
Figure 11) shows significant differences in current, voltage, energy, and temperature.
In Cell 1, the current distribution in
Figure 11 highlights that Cluster 0 (blue) has prominent current peaks, suggesting periods of inactivity or low current flow. In contrast, Clusters 1 (green) and 2 (purple) display more varied distributions, indicating different current levels during operation. The voltage distribution is tightly clustered around central values, with Cluster 1 (green) showing a slightly wider spread, suggesting more variability in voltage levels. The energy distribution reveals that Cluster 0 (blue) and Cluster 1 (green) have a more spread-out distribution, while Cluster 2 (purple) shows a distinct peak, representing energy storage. Temperature distributions indicate that Cluster 0 (purple) operates at higher temperature ranges compared to the other clusters. This could be due to different thermal conditions or varying states of health, impacting the battery’s performance.
For Cell 2, the K-means clustering is shown in
Figure 5d; the corresponding feature distributions in
Figure 12 again demonstrate distinct characteristics in current, voltage, energy, and temperature for each cluster. Clear prominent peaks are shown in Cluster 1 for current, voltage, energy, and temperature.
Similarly, for Cell 3,
Figure 5f shows the clusters in the PCA space, while
Figure 13 illustrates the feature distributions. The patterns in Cluster 0 observed here are similar to the ones in Cluster 1 in Cell 2.
The K-means clustering for the aging data of Cell 1, as shown in
Figure 6b, reveals three distinct clusters. Cluster 0 (blue), Cluster 1 (green), and Cluster 2 (purple).
The feature distributions in
Figure 14 further show these clusters. Cluster 0 predominantly exhibits higher values in voltage and energy in Cell 1, mostly aligning with its distribution in the PCA plot. In
Figure 15 Cluster 1 shows a broader distribution across voltage and energy in Cell 2. Cell 3 in
Figure 16 has similar distribution across voltage and energy in Cluster 1, highlighting its diverse characteristics.
5. Sensitivity Analysis for the RPT and Aging Dataset Using the GRU-BPP Archticture
5.1. Sensitivity Analysis for RPT Data
According to
Table 4, it is possible to have over-regularization and suboptimal generalization when the batch size or dropout rate is increased. For example, when the batch size is 64 and the dropout rate is 0.5, the RMSE value is 0.1835 with
equal to 0.9160. On the other hand, a lower batch size of 32 and a 0.3 dropout rate result in a higher
of 0.963 and an improved RMSE of 0.1561, indicating better performance.
To this end, 64 hidden units, a learning rate of 1, a batch size of 32, and a dropout rate of 0.3 are the ideal settings for the RPT dataset. This configuration results in the greatest value (0.963) and one of the lowest RMSE at (0.1561). In order to improve the accuracy and generalizability of the RPT predictions, a smaller batch size and a modest dropout rate are advised.
5.2. Sensitivity Analysis for Aging Data
The aging sensitivity analysis in
Table 5 shows that a smaller batch size combined with a dropout rate of 0.3 provides better performance, with lower RMSE and higher
values. The optimal configuration appears to be 64 hidden units, a learning rate of 1, a batch size of 32, and a dropout rate of 0.3, yielding an RMSE of 0.0321 and an
of 0.944.
In contrast, increasing the dropout rate to 0.5 or using larger batch sizes typically results in a slight reduction in performance. The configuration of 128 hidden units, a learning rate of 1, and a batch size of 64, for example, resulted in a higher RMSE of 0.0398 and a lower of 0.944, confirming that smaller batch sizes and moderate dropout rates are more effective for aging data.
In both the RPT and aging datasets, smaller batch sizes and moderate dropout rates consistently lead to better model performance. However, the aging dataset exhibits more stable and lower RMSE values overall compared to the RPT dataset, suggesting that it is less sensitive to variations in hyperparameters. The trends in how dropout rates and batch sizes affect accuracy remain similar in both datasets, though the aging data show less variability in performance.
For both the RPT and aging datasets, the sensitivity analysis indicates that a smaller learning rate (0.01) and a moderate dropout rate (0.3) are more effective in maintaining model accuracy. Additionally, simpler models with 64 hidden units tend to perform better, achieving lower RMSE and higher R2 values. Increasing the model complexity or regularization (i.e., higher dropout rates) generally leads to diminishing returns, as seen in the increased test losses and lower R2 values. Based on these results, a smaller, simpler GRU architecture with moderate regularization and a learning rate of 0.01 is recommended for both datasets.
5.3. Comparative Evaluation of Test Losses and Validation for RPT and Aging Datasets
The graphs in
Figure 17 show the test and validation losses for the RPT and aging datasets over 50 epochs. A steadily reduction over 20 epochs can be noticed in the validation loss of the RPT dataset. On the other side, the aging dataset has a significant reduction in the validation loss during the first 5 epochs; the curve stabilizes after that. Through the same comparison, we can see that similar behavior is shown for the testing loss in the aging and RPT datasets, the test losses for both datasets show similar patterns during validation.
5.4. Performance Comparison of Different Models
Table 6 shows the RMSE, MAE, and
values for the reference performance testing (RPT) and aging datasets. The hyperparameters selection for each model are shown in
Table 7.
The results of the GRU-BPP model outperforms all other models including Bi-LSTM and Attention-LSTM, achieving the lowest RMSE of 0.166694 for the RPT dataset and 0.032066 for the aging dataset. This captures the GRU-BPP’s superiority in modeling the temporal dependencies in battery degradation data, making it a highly effective tool for SOH prediction in lithium-ion batteries.
6. Discussion
In this paper, important insights into the performance and degradation patterns of lithium-ion battery cells were obtained using the principal component analysis (PCA), Granger causality, and K-means clustering techniques on the reference performance testing (RPT) and aging datasets.
The Granger causality study improved our understanding of the temporal dynamics of battery degradation by revealing important causal links between important parameters like current, voltage, temperature, and energy. Strong causal relationships between current and voltage were found for the RPT dataset across a variety of lags, especially in Cells 2 and 3. This result is consistent with the physical characteristics of lithium-ion batteries, whose internal resistance causes current fluctuations to frequently occur before voltage changes. Furthermore, during short-term testing, substantial Granger causality correlations between temperature and energy were discovered in Cell 2, emphasizing the influence of thermal circumstances on energy dissipation. On the other hand, the aging dataset showed stronger correlations between energy and temperature as well as between voltage and current, especially in Cells 1 and 3. These results highlight the long-term implications of temperature on battery health, especially when considering the deterioration of cells over time.
The main characteristics causing variance in the dataset were successfully determined using the PCA analysis. Voltage and energy were found to be the most important features for PC1 in the RPT dataset, but temperature was more noticeable for PC2. According to these results, temperature has a minor but significant impact on short-term battery performance, but voltage and energy are the main predicting components in the process. The aging dataset shows a stronger temperature contribution to variance, particularly in PC2, which is indicative of the growing effect of heat on battery deterioration over time. This is especially clear in Cell 1, where the PCA scatter plots indicate that increased operating temperatures are linked to greater battery performance variability. The usefulness of PCA in feature selection and dimensionality reduction is demonstrated by its capacity to disclose these unique patterns under a variety of operating situations.
These conclusions were further supported by K-means clustering, which divided the data into discrete groups according to the PCA-transformed features. Differential distributions of current, voltage, energy, and temperature were used to generate unique clusters in the RPT dataset. Cell 1’s Cluster 0 operated at higher temperatures, whereas Cells 2 and 3’s Cluster 1 demonstrated more consistent performance. The distinct degradation behaviors under varied test settings are reflected in the clustering patterns. Comparably, in the aging dataset, the clusters showed different operating modes, and the clusters were mostly defined by voltage and energy. The temperature distributions in Cell 1 support the notion that heat is a primary cause of long-term damage by indicating that thermal management becomes more crucial as the battery matures.
The state of health (SOH) of lithium-ion batteries may now be predicted with a comprehensive framework of three techniques—Granger causality, PCA, and clustering—incorporated into the GRU-BPP model. In comparison with other machine learning models (such as GRU, LME, LSTM, and XGBoost), the GRU-BPP model performed better and obtained the lowest RMSE values for the RPT and aging datasets. This is explained by the model’s suitability for time-series prediction tasks due to its capacity to capture both feature interactions and temporal dependencies. The model’s capacity to learn from the found patterns was improved by using the clustering results as input features, which produced predictions that were more accurate.
A lower batch size and a moderate dropout rate (0.3) were found to be ideal for both datasets by the sensitivity analysis. Specifically, it was discovered that the optimal hyperparameters for the GRU-BPP model were 64 hidden units, a learning rate of 0.01, and a batch size of 32. Using 128 hidden units or a greater learning rate, for example, increased the model’s complexity and caused overfitting, as shown by higher RMSE values. These results imply that for SOH prediction, a more straightforward model with moderate regularization works better.
To further illustrate the advantages and limitations of the GRU-BPP model compared to advanced deep learning techniques, we make the following key points as given in
Table 8:
The GRU-BPP model offers important advantages in predicting SOH batteries, and it is important to recognize its strengths and limitations. One of the main advantages of the GRU-BPP model is its interpretability, as it incorporates Granger causality between key parameters (e.g., temperature and voltage) to provide insight into the causal relationships and thus improve the understanding of battery degradation mechanisms. In addition, PCA and clustering techniques allow GRU-BPP to capture the variability intra- and inter-groups, providing a structured approach to feature selection and improving the robustness of the model. This structured analysis allows the model to more effectively identify critical operating conditions and underlying degradation behavior, resulting in higher accuracy compared to some complex deep learning models. However, due to its simple structure, the GRU-BPP model may not be flexible enough to capture highly nonlinear dependencies and complex temporal patterns as other advanced deep learning models; however, for our study, GRU-BPP was an excellent choice for battery SOH prediction with causal insights. The table below summarizes the comparison between GRU-BPP and alternative deep learning models.
7. Conclusions
In this paper, we developed a novel GRU-battery performance predictor (GRU-BPP) model to forecast lithium-ion batteries’ state of health (SOH). In order to pinpoint the critical elements affecting battery performance and deterioration, the model combines several intricate analytical techniques, including Granger causality, principal component analysis (PCA), and K-means clustering.
Significant causal correlations between current, voltage, temperature, and energy were found using the Granger causality analysis, which describes the temporal dynamics of battery degradation. Temperature was more important in the aging dataset, especially in the later phases of degradation, yet voltage and energy were found to be the primary drivers of variance in the dataset, as indicated by the PCA analysis.
Different degradation patterns were shown by the K-means clustering, with voltage and energy standing out as crucial characteristics in identifying the clusters. The GRU-BPP model was then able to better capture intra-group and inter-group patterns by using these clusters as input characteristics.
With the lowest RMSE values for the RPT and aging datasets, the GRU-BPP model outperforms conventional machine learning methods. The model was quite successful at predicting SOH because it could capture temporal dependencies and take advantage of clustered patterns. A more straightforward model with 64 hidden units, a learning rate of 0.01, and a dropout rate of 0.3 performed best, avoiding overfitting and retaining good accuracy, according to the sensitivity analysis.
Finally, the GRU-BPP model could provide a reliable and understandable way to forecast the health of lithium-ion batteries, by enhancing battery health monitoring, promoting energy efficiency, and sustainability, which has important ramifications for degradation analysis and battery health monitoring. This model is a useful resource for battery management systems and upcoming studies in this area since it not only increases prediction accuracy but also offers a deeper knowledge of the causes behind battery degradation. Further studies on additional datasets will be conducted to generalize the results on even wider temperature ranges, particularly lower operating temperatures.
8. Declaration of Generative AI and AI-Assisted Technologies
The authors acknowledge the use of Quillbot, an AI-assisted tool, during the preparation of this work, specifically to enhance the readability and language quality of the manuscript. The tool was employed to refine the clarity and coherence of the writing, ensuring that the manuscript effectively communicates the research findings. After utilizing this service, the authors thoroughly reviewed and edited the content to maintain the scientific integrity and accuracy of the work. They take full responsibility for the final content and conclusions presented in the article.