To estimate the capacity of the battery accurately, machine learning can be used to always update the phase parameters of the battery. Here, the gradient vector is also used as an initial indication of the battery phase in each cycle. Then, we will try to see its characteristics from each cycle Finally, the results of machine learning will show what percentage of confidence the phase determination is. The charge–discharge cycle will continue like the existing data set, so the algorithm that is built is also made for this approach.
3.1. Discharge Profile
C-Rate represents both the internal and external battery regarding its ability to supply energy. For example, C20 can also be said to guarantee battery capacity in 20 h of discharge while taking into account the SoH of the battery. With reference to the C-Rate, other battery usage limits can be neglected. This is very valuable in operational conditions where it is only related to run times, operational voltage ranges, and SoH movements [
30].
Voltage, current and temperature during both charging and discharging situations can be seen in
Figure 4a. From
Figure 4b, it can be seen that there is a decrease in C-Rate with an increase in cycle. The reduction in C-Rate can be seen in the faster discharge time under the same conditions from a new cell to Cycle 85 and then to 169. The stress factor is usually taken into account when the battery value decreases, which also affects its temperature, SoC, C-Rate, and depth of discharge (DoD) [
31]. Temperature and SoC are the two most visible variables from the long test. This battery cycle shows the need for DoD calculations to energy throughput and C-Rate [
32].
DoD becomes the basic component of the calculation of the discharge cycle, which will form the battery model. To implement this methodology, machine learning will be used to represent the DoD for the remaining battery SoC with the release of the real data.
The battery capacity is shown in blue, which could be seen from the condition of the fresh cell (1st cycle) to the last cycle (169th cycle) in a decreasing trend from around 1.8 to 1.3 Ah. This trend is also followed by the red line, which is for discharge time, which moves from a little over 60 min to a bit under 50 min. However, there is a slight difference in the yellow line for charge voltage. It shows the voltage used when charging the battery. Basically, the voltage should be at the same number around 4.2 V. However, from the data, it is found that there is a difference at the 85th cycle, where the measured voltage reached around 8 V as shown in Figure. Overall, it could be seen that the weight and C-rate of the battery would diminish along with the trend of capacity and time in
Figure 5. For this reason, when using batteries, it is necessary to arrange them simultaneously, either in parallel or in series, so that the ongoing adequacy that may occur randomly from any battery would be covered by another battery [
30]. Meaning that we had calculated the possibility of any powerless batteries that might be present among the battery packs. For this reason, in this experiment we will also see the possible differences from other batteries, such as BAT0006, BAT0007, and BAT0018, even though the analysis method at the beginning will only use BAT0005 for the explanation.
3.2. The Gradient Vector
The amount of data from fresh cycle on BAT0005 is 197 with information in the form of voltage, current, and temperature from the battery in each time period. With these data we can calculate the magnitude of the gradient from each point. Then from the gradient values obtained easily we can see that there are several regions between the data patterns. To separate this data scientifically, we can use SVM Analysis. Among these regions, we can see that there are regions whose average value is smaller than the others. The data distribution that is carried out in this study is shown in
Figure 6.
The DoD can also be analyzed by the obtained gradient vector as in
Figure 6. By deriving the V(t) function, we can find how far DoD had entered in the battery. The representation of each vector follows Equation (
18).
From the magnitude of this gradient value, as shown in
Figure 6, and also between cycles, as in
Figure 4b, as a whole can be divided into four phases, which have their own characteristics. This should be referred to as the phase of the battery discharge process. Phase 1 is the part where the battery energy is first used. This can be referred to as the early discharge phase. The charge voltage around 4.2 V is unstable and looks like it has dropped drastically to a more stable value. The vector in this area looks quite steep down. The next second phase is the phase where the battery has reached a more stable voltage. In this phase, the battery is in normal usage phase. The voltage no longer drops drastically and is represented by a more sloping sideways vector. The third phase is the stage where the battery voltage drops drastically again, indicated by a downward steep vector. In this phase, there is also a point where it starts to be cut off following the previous battery operational parameters in
Table 1.
If connected to battery modeling at
Section 2.4, the first drop is caused by the Warburg impedance R
W. After that, there is an IR drop caused by the mass transfer resistance of the electrochemical battery. Furthermore, the electrolyte resistance, R
E, also increases due to polarization activation. Finally, the charge transfer resistance, R
CT, which is caused by concentration polarization. With the R
W and R
CT values together with the C
DL, the double layer capacitance can determine the time constant (
) of the circuit, as shown in Equation (
19). This is the time constant required for the circuit response to decay by a factor of 1/e or 36.8% of its initial value. A large
value will decay longer and a small
value will cause a faster decay. When there is no external source of excitation, the natural response of the circuit refers to the behavior (in terms of voltage and current) of the battery model circuit itself according to Equation (
20).
However, this equation only exists in the normal usage phase, where the battery charge is modeled by C
DL. However, there are several curves in the discharge profile graph as shown in the voltage discharge profile in
Figure 4b. By comparing each gradient value and its voltage range statistically,
Figure 7 showed the phase classification for each cycle of 1, 85, 169, respectively from right to left.
By mapping each phase of the NASA battery data set, we can also find out the statistical value based on the magnitude of the gradient vector. Each of the batteries, BAT0005, BAT0006, BAT0007 and BAT0018, will be broken down sequentially from their minimum, average, and maximum values, as shown in
Table 2. This may be necessary if we look at the possible categories intuitively to determine the limits of each phase. Namely, the L value, which is the boundary between the two successive phases, and the normal usage phase will be the main goals of the SoC measurement method with this method.
In general there are several characteristics of each phase. For example, only the early discharge and after cut-off phases have positive gradient values. This can also be seen in
Figure 6, with the vector pointing up. However, here we do not need to pay attention to the last phase, after cut-off phase. This is because the ultimate goal is to calculate the SoC and predict the end time of the battery on a single charge. In other words, use is only present in the initial three phases starting from early discharge phase to voltage drop phase, after the battery is disconnected we will no longer use it for various reasons, as discussed earlier in
Section 2.2. Even if possible, the battery has begun to be reduced in use or is disconnected when entering the voltage drop phase. This can be seen by looking at the characteristic curve of battery usage in
Figure 7, where the phase does not contribute a long time and the value will drop drastically, until finally it has to be forcibly disconnected because it has reached the voltage threshold of the battery.
When viewed in detail, there are differences between each battery. The value that should be the same for each battery is the maximum value at the early discharge phase before the battery discharging test is carried out is charging with a full charge voltage of 4.2 V. However, here, the voltage in the measurements of each battery obtained a tolerance value of around 0.02 V. The minimum value is BAT0006 with a value of 4.18 V and mostly 4.19 V on the BAT0005 and BAT0018 batteries. The same voltage value can only be obtained on BAT0007, which is 4.2 V. Then what is interesting is that there is a minimum value during normal usage phase that is slightly smaller or more positive than the maximum value of “Voltage Drop.” With a difference of about 0.11, 0.16, 0.12 and 0.05 mV/s for BAT0005, BAT0006, BAT0007, and BAT0018, respectively.
3.3. Time Estimation
Then, by following the predictions from the regression model, it is possible to obtain the beginning of the voltage drop phase. To find out all the dropping points from each cycle of each battery, the model is used together with the gradient value. The difference in dropping point values from different cycles from BAT0005 can be seen in
Figure 7. However, here we are trying to predict the time of the normal usage phase. For this reason, the comparison between this phase and the total phase up to the voltage drop phase is attempted to be visualized as shown in
Figure 8.
To understand the time estimation, a comparison between normal usage vs. total usage time before cut-off is used to check the difference ratio. From the existing data set,
Figure 7 is formed using data of 167 for each of BAT005, BAT006, and BAT0007. Meanwhile, BAT0018 only consists of 132 data. Plots of the data are shown in the bar chart at
Figure 8, where the average ratio between normal usage phase time and total usage time before cut-off is 85%. In other words, the ratio between the difference between the two to the total phase value of early discharge and voltage drop phases is only about 15% of the total. Whereas, the cut-off limit of the battery is about 2.2–2.7 V, as noted in
Table 1. However, as shown in
Figure 8, there is a time span difference between the four batteries. This difference can also be seen in the average length of time from the total time and also from the difference. The average total time is around 47 min with a slight difference in seconds for BAT005, BAT006 and BAT0018, respectively, and 50 min for BAT0007. Meanwhile, the average normal usage time is about the same for 40 battery sequences with a slight difference in seconds, and 43 min for BAT0007 only. Also, the difference between the two values is about 7 for each of the three batteries with a slight difference in seconds, and 8 min for the BAT0007. With the previous cut-off voltages of 2.7, 2.5, 2.2, and 2.5 V for batteries BAT005, BAT006, BAT0007, and BAT0018, respectively; with this method, the battery is in a critical state when it enters its voltage drop phase. Of course it varies more and can be in a voltage range greater than 3 V. The possible values are quite large compared to those in absence of this method. This voltage is represented by V2 in
Table 3.
From
Table 3, we could see that the threshold for each battery has a different range. V1 is the threshold between the early discharge phase and normal usage phase. From the table it can be seen that the upper limit is at the same number at 3.88 V. However, the lower and average limits of V1 still vary with the lowest value being at BAT0006. Meanwhile, V2 is the threshold between the normal usage phase and the voltage drop phase. There is an unequal threshold shift as previously discussed. There are similarities with the threshold on V1, and the lowest and highest values are on BAT0006. The data in the table still shows data from the overall cycle of each battery. Then, there is the time length that shows the variation of the length of the normal usage phase, which is started by V1 and ended by V2. According to the description in
Figure 8, even though the number of seconds varies quite a bit, between batteries BAT0005, BAT0006, and BAT0018 it is still around 4200 s or approximately 40 min with a slight difference in seconds. The difference is only in BAT0007 with a difference of about 3 min or 180 s. Basically, the time length distribution of each cycle of each battery is random. The distribution can be seen in
Figure 9.
Again at
Table 3, we could see also that the V1 and V2 thresholds follow the characteristics of each battery and its usage. Battery use up to a lower cut-off value is followed by an increase in usage time as seen in BAT0007 with a cut-off value of up to 2.2 V. However, the above concept is not always followed by a decrease in the existing value because of the comparison between BAT0005 with cut-off voltage of 2.7 V to BAT0006 and BAT0018 with cut-off voltage of 2.5 V does not show the corresponding data. It may be possible to compare the battery’s model from the characteristics of the battery. Meanwhile, there are still other possibilities that come from the division of phases that have been carried out. With this classification, basically we have also divided the existing graph into four different curve sections, as previously discussed in the voltage discharge profile between
Figure 4b and
Figure 7. Similarly, for the distribution of normal usage phase time, which has been described in
Table 3, there are also several possibilities, as described in Equation (
21) as the Total Probability.
With P
Total as the total probability of all possible data entering the normal usage phase. From Equation (
21), it can be seen that the total is not the same as the probability of that phase alone, which should represented by
. However, there is also the addition of the intersection with the previous phase, early discharge phase, which is represented by
. In addition, there is also the addition of a wedge with a phase after “Voltage Drop.” This is represented by
. Thus, the overall probability of normal usage data is formed from these three probability groups. In the real case of this intersection area, it is possible that a certain value comes from the two phases that overlap. Here, we try to minimize the value range because it can confuse the system in determining the initial limit of V1 and the final limit of the phase in V2. At least we know the point where we can classify it as the normal usage phase. Before carrying out cut-off point comparisons as before, it is necessary to get an idea from the existing data. In order to determine a particular intersection point, it is necessary to find a solution to the area of uncertainty. We make the resolution for the previous intersection of the normal phase and the intersecting phase as close to zero as possible. This can be simplified by assuming the occurrence of uncertainty when approaching the limit values in V1 and V2, as follows:
The probability of this slice can be zero, if we follow a normal distribution in which the maximum is usually at the midpoint and decreases as the standard deviation of the limiting stress approaches. The results of the time length distribution from the modeling process with
Table 3 can be observed in
Figure 9 even though the data distribution does not really follow a normal distribution. By looking at the limit values of each prediction curve, we can define as above, with a limit approaching the threshold in
Table 3; the phase slice is expected to be close to zero. The initial threshold voltage as point V1, can be found using the SVM approximation analysis, which is discussed previously in
Section 2.3. Next, of course, we return to the process of the best interpretation of the time length. The statistical data shown in
Table 3 cannot really represent the percentage of data density for each length of time, be it the existing minimum, maximum and average values. Of course there will be shifts and differences between the curves formed by each battery. For example, by the batteries BAT0005, BAT0006 and BAT0018, all three of which have an average of around 40 min or 2400 s. In other words, it takes an illustration that shows the overall data distribution between the minimum and maximum values. The distribution density for each battery is shown in
Figure 9.
Figure 9 shows that the length of time used for each battery varies from cycle to cycle, but there is a length of time that can be used as a guide as a characteristic. Of course, the duration of use has decreased according to
Figure 8 but a time limit, such as the 2000 s range, can be the range, as represented as a line in the figure. From the time length distribution, it can be seen that this figure represents almost the entire peak of all batteries.
In general, one might think that battery capacity is simply a matter of the charge number, which is stated as the nominal energy of the battery. Considering that all batteries, BAT0005, BAT0006, BAT0007 and BAT0018, come from the same battery type with a specification of nominal capacity as 2Ah, as shown in
Table 1. However, from the results above with CC mode, we conclude that the estimated time limit for battery use that can be guaranteed to the user is the minimum time limit of the various variations in the characteristics of each battery. The estimation results can also be calculated by taking into account the DoD level profile of each usage cycle and predicting the usage time with C Rate. Because the available data sets are limited, we need to fully exploit them by comparing the values of the available data sets with the polynomial regression models we obtain. Of the four data sets, we iterate in each cycle to calculate the difference according to Equations (
14)–(
16). Overall,
Table 4 shows the statistical data for each parameter of MAPE, MAE and RMSE.
Furthermore, in
Table 4, it can be seen how much error the method has used for each battery. For MAPE, which shows the difference from the average, overall it is in the range of 0.30 to 0.77. With the smallest difference in BAT0018 and the largest in BAT0006. The same happens for MAE, which is the absolute value, but with a different value, between 0.0106 and 0.0275. Finally, for the RMSE, which represents the process with the roots of the square, there is a slight difference with the lowest value 0.0136 and the largest 0.0276 all coming from BAT0006. BAT0006 itself is shown in
Figure 8d has a prediction portion of under 2000 s with the largest portion. Additionally,
Figure 9 has the largest distribution range and the gentlest peak value compared to other data set batteries. Meanwhile, BAT0018 is the data set that has the highest peak of the figure.