1. Introduction
Lithium-ion batteries have been widely used from portable electronics to battery-driven hybrid vehicles owing to their higher energy density, higher output voltage, lower self-discharge, longer lifetime, higher reliability and other advantages compared to other types of batteries available in the market [
1,
2,
3]. The lithium-ion battery failure consequence have different levels of severity: from performance degradation to catastrophic failure [
4]. The lithium-ion battery failure caused several Boeing 787 to be caught fire and caused all airliners to be grounded indefinitely in 2013 [
5]. Therefore, It is imperative and highly desired to detect the performance degradation and to predict the remaining useful life (RUL) of lithium-ion batteries. Prognostics and health management (PHM) are the technologies to evaluate the reliability of a system in its actual life cycle conditions to determine the RUL [
6,
7]. One of the major work of prognostics is to predict RUL [
1]. In this paper, we consider the battery will fail when the battery’s capacity drops to 70% of its initial capacity, because both the battery’s capacity and power will drop much faster after this point, and the battery is unreliable and should be replaced [
8,
9].
Existing methods for lithium-ion batteries’ RUL prediction roughly can be classified into two main categories: the physics-of-failure (PoF)-based method and the data-driven method [
1,
2]. The PoF-based prognostic methods depend on the battery physical characteristics, i.e., a battery’s material property, its failure mechanism and life cycle loading conditions, and tend to be computationally complex [
10]. Thus, the data-driven prognostic approach have widely been used because it does not require the extensive knowledge mentioned before. Among data-driven prognostic approaches, the particle filter (PF) and relevance vector machine (RVM) are two popular lithium-ion batteries’ RUL prognostic approach [
1]. PF is a sequential Monte Carlo method, which estimates the state probability distribution function (PDF) from a set of “particles” and their associated weights [
1]. Long [
2] proposed prognostic approach for lithium-ion batteries RUL based on the Verhulst model, particle swarm optimization (PSO) and particle filter. Miao [
11,
12] proposed several modified particle filter algorithms for lithium-ion battery RUL prediction. However, RUL prediction accuracy based on PF is limited by the particle degeneracy problem [
8,
9]. Compared to Support Vector Machine (SVM), RVM is constructed under Bayesian framework and thus has probabilistic outputs [
1,
13]. An intelligent prediction method for a battery health is proposed based on sample entropy feature, SVM and RVM [
14]. Their results showed that RVM is better than SVM in battery health prognostics. However, the training time of RVM is too long and RVM may not ultimately reflect the battery degradation trends [
1,
2].
In addition to the above methods, artificial neural networks (ANN) were also used in RUL prediction of lithium-ion batteries [
1,
8,
9,
15,
16,
17,
18]. ANN generally can be classified to two categories: the shallow ANN, such as the back propagation (BP) ANN and the nonlinear autoregressive (NAR) ANN, and the deep ANN, such as the long short-term memory (LSTM). Wu et al. [
15] proposed a RUL prediction method based on transfer components analysis and a feed forward neural network. Pang et al. [
16] proposed a lithium-ion battery RUL prediction method based on a nonlinear auto regressive neural network. Liu et al. [
17] proposed an adaptive recurrent neural network (ARNN) for system dynamic state prediction and validated that the ARNN showed better learning capability than classical training algorithms, including RVM and PF methods. Rezvani et al. [
18] used both recurrent neural network (RNN) and linear prediction error method to estimate the battery capacity and RUL. But the RNN gradients will vanish and will be unable to learn any longer if RNN allows the storage of information over some time [
8,
9]. The long short-term memory (LSTM) is a deep-learning neural network, which is explicitly designed to learn the long-term dependencies [
19]. LSTM has been successfully applied in machine translation [
20], travel time prediction [
21], and wind power prediction [
22]. Though the ANN has been applied in prognostics of lithium-ion batteries in some references [
8,
15,
16,
17,
18], no one has compared the prognostics of the lithium-ion batteries based on different ANN. Therefore, it is very interesting to compare the prognostics of the lithium-ion batteries based on different ANN, such as BP, NAR, and LSTM. The structure of the BP ANN is relatively simple. The NAR ANN retains the information of the previous data through feedback and delay. The LSTM model has a simple structure and uses it for RUL prediction of lithium-ion batteries. It not only gives the uncertainty of prediction results, but also has short training time and can achieve better prediction results in the early, middle and late stages. At the time of writing, this paper is the first to compare the prognostics of the lithium-ion batteries based on different ANN, i.e., BP, NAR, and LSTM. An improved LSTM NN is also proposed in order to achieve higher prediction accuracy and make the construction of the model simpler.
This paper is organized as follows:
Section 2 describes the battery data and evaluation index,
Section 3 describes the different ANN models of RUL prediction method based on BP, NAR, and LSTM,
Section 4 gives the experimental results and discussion of RUL prognostics, and conclusions is presented in
Section 5.
3. Different NN Models
3.1. The BP NN Model
The BP NN belongs to a kind of multilayer feedforward NN. It advances layer by layer according to the weight coefficient during training, and it adjusts the weight and bias of the network dynamically based on the back propagation algorithm. Structurally, it consists of an input layer, an output layer, and a hidden layer; in essence, the BP algorithm uses the squared error sum of the network as the objective function and uses the gradient descent method to calculate the minimum value of the objective function. Its basic structure is shown in the
Figure 4:
The neural model in
Figure 4 can be expressed mathematically as,
where the input values of a node,
, replicate the dendrites of the biological neuron and are multiplied by their respective connection weights,
, and then summed. The summation is applied to the function of node
, along with the bias
, giving as a result the activation value of the node
.
3.2. The NAR NN Model
The dynamic NN can add the data saved at the previous time to the calculation of the data at a later time through feedback and delay, so it is more suitable for the prediction of time series. Among them, the NAR NN is a kind of dynamic NN, which has been widely used in practical applications. Its basic structure is shown in
Figure 5.
As can be seen in equation (5), the output is defined by , which indicates that the current value of the system is determined by past values.
3.3. The LSTM NN Model
In general, the time series prediction modeling is a difficult problem, because unlike regression prediction modeling, the interdependence between time series makes the problem difficult. As a kind of deep learning network structure, LSTM has the advantage that the context of data is considered in the training process. Compared with the traditional data-driven prediction method, the LSTM network has stronger nonlinear approximation ability. The general structure of the LSTM network is shown graphically in
Figure 6.
There is a structure called “gate” in LSTM to remove or increase the ability of information to be passed to the memory unit. A gate is a structure that allows information to pass selectively. It consists of a sigmoid NN layer and a point multiplication operation. The sigmoid layer outputs a value between 0 and 1, describing how much of each part can pass. In a memory unit, there are three types of gates, namely a forgotten gate, an input gate, and an output gate, which are described in detail below.
Forget gate:
. takes inputs of
and
, and outputs a number between 0 and 1 for each state
.
A = 1stands for retaining the state value completely, whereas
A = 0 represents discarding the value completely. The forget gate is calculated as follows:
Input gate and input node: “input gate”,
, decides which values to update. “input node”,
, creates a new candidate state
. The equations to calculate the two outputs are as follows:
Combing (6) and (7) to update the previous internal state
into the current state
:
Output gate: finally, there is a sigmoid layer called the “output gate”,
, that determines what information to output. After putting the internal state
through a tanh layer This can be implemented as:
where
and
are the layer weights and biases, respectively.
3.4. The Improved LSTM Model
The LSTM model selects the historical capacity data of the lithium-ion battery as an input variable to predict the remaining capacity of the lithium-ion battery in the future. Taking the #B5 battery as an example, the length of the capacity data is 168, that is:
The original capacity data is then divided into training sets
and testing sets
based on the predicted starting point
T. The
will be normalized by the methods presented in the previous parts and
will be got, namely:
In the traditional LSTM prediction methods, and will be defined as the input and output to construct a LSTM model.
Since there are few data samples, a data construction method of the following form is proposed in order to achieve higher prediction accuracy and make the construction of the model simpler. Assuming that the width of the split window is
L and
m =
T − 1, the input and output of the LSTM model can be expressed as:
where
belongs to
;
represents the normalized data sequence value;
represents the input of the network;
represents the output label of the corresponding input.
According to our lots of simulation experiments and relative references, the width of the split window is set 12, i.e.,
l =
n = 12, thus m belongs to (5, 15) based on the data processing method in Equations (3) and (12). Under the same conditions, we have studied the variation of the
RMSE of the LSTM network with
m, as shown in the
Figure 7. As can be seen from the
Figure 7, when
m is equal to 13, the root mean square error of the network is the smallest, so
m = 13 in this paper.
4. Results and Discussion
The software package and the version used to implement all three NNs structures is MATLAB2018a. For the LSTM network model in this paper, according to our analysis and pre-experiments mentioned above,
l =
n = 12,
m = 13, α = 8, the learning rate is set 0.005. The predicted starting point is set
for #B5. The battery will be predicted 200 times at each prediction starting point and the point not reached failure threshold will be eliminated.
Figure 8,
Figure 9 and
Figure 10 show the prediction results at three different prediction starting points.
The prediction result of LSTM at T1 = 69 is shown in
Figure 8. As can be seen in the
Figure 8, the prediction result is 124 and the relative error is−4.
The prediction result of LSTM at T2 = 89 is shown in
Figure 9. As can be seen in
Figure 9, now the prediction result is 130 and the relative error is 2, which means that the prediction error is gradually decreasing.
The prediction result of LSTM at T3 = 109 is shown in
Figure 10. As can be seen in the
Figure 10, the prediction result is 131 and the relative error is 3.
The LSTM model can achieve a good prediction accuracy of the lithium-ion battery capacity, as shown in
Figure 8,
Figure 9 and
Figure 10. In order to observe the prediction effect of the LSTM model under different prediction starting points intuitively, the probability distribution histograms (PDH) of the #B5 battery prediction results at different prediction starting points is shown in
Figure 11,
Figure 12 and
Figure 13.
The probability distribution histograms and the probability distribution curve of the LSTM model after 200 tests is shown in
Figure 11. As can be seen in
Figure 8 and
Figure 11, due to the randomness of the network in initializing weights and bias, the prediction results are not same at each test, but the improved LSTM model proposed in this paper can approach the real life cut-off point of lithium-ion batteries with a high probability from the long-term prediction results, which verify the validity of the model.
Figure 12 is similar to
Figure 11, except that it is a statistical distribution of #B5 with the improved LSTM model after 200 tests at T2 = 89. As seen in
Figure 12, the prediction result is 130, which is very close to the real life cut-off point of the lithium-ion battery.
From the probability distribution histogram of the prediction results, when we use 69, 89, 109 as the prediction starting point of the network to predict the remaining service life of the lithium-ion battery, the life expectancy cutoff point of the battery B5 is 124, 130, 131, respectively. The corresponding probabilities are 29%, 35%, and 42.5%, respectively. In addition, we can also see that the normal distribution probability density curve fitted by the prediction results becomes thinner and thinner with the increase of the prediction starting point, which indicates that the accuracy of the prediction is also gradually increasing.
In order to compare the prognostics of the lithium-ion batteries based on different ANN, such as BP, NAR, and LSTM, the comparison results of the three algorithms are shown in
Figure 14 and
Table 1. The parameters for LSTM and NAR are:
n =
l = 12,
m = 13, α = 8,
d = 20.
As can be seen from
Table 1, among the three prediction algorithms, LSTM has the highest prediction accuracy. When 80, 90, and 100 are used as prediction starting points respectively, the corresponding absolute errors are 13, 8, and 2, and the relative errors are 10.16%, 6.25%, and 1.56%. From the absolute error, relative error and
RMSE corresponding to the prediction results, the LSTM model is indeed more accurate than the static BP network and the dynamic NAR network. By comparing these three different prediction algorithms, we can know that the LSTM network is better for the time series problem of lithium-ion battery life prediction, and the learning ability of the degradation process is stronger. In addition, the NAR ANN has the better prognostic accuracy compared to the BP ANN.
5. Conclusions
An improved LSTM prediction method for lithium-ion battery RUL estimation is proposed in this paper. And the prognostics comparison of lithium-ion battery based on the BP ANN, the NAR ANN and the LSTM ANN has been studied in detail. The experimental results show: (1) compared with the static NN prediction model and the dynamic NN prediction model, the LSTM model based on the deep learning theory can achieve better dependence, lower prediction error and higher prediction accuracy; (2) the NAR ANN has the better prognostic accuracy compared to the BP ANN.
For lithium-ion batteries, the error and accuracy of the prediction results obtained by applying the LSTM model are different at different prediction starting points. By analyzing the prediction results of the B5 using the LSTM model, the more the predicted starting point is, that is, the more samples are trained, the higher the accuracy of the obtained network model, but the longer the training model takes, a balance needs to be struck between prediction accuracy and training time.