1. Introduction
Lithium-ion batteries (LIBs) have been extensively applied in various fields, highlighting their critical role in future energy sustainability [
1]. Among these applications, road transportation is where LIBs have the greatest impact. While LIBs play a critical role in electric vehicles, the aging of these batteries presents significant challenges for end-of-life management. Retired batteries that no longer meet the requirements must be urgently addressed to prevent safety issues [
2]. For the reliable use of LIBs in electric vehicles, developing an advanced battery management system (BMS) for accurately estimating battery states is essential [
3]. Research in this area has become increasingly prominent in recent years, with a significant rise in publications focusing on state-of-charge (SOC) estimation [
4,
5,
6,
7].
Although directly measuring SOC based on Coulomb counting is possible, achieving high accuracy requires dedicated current sensors, which can be as costly as USD 1000. Such sensors are unaffordable for commercial BMSs that are under significant cost pressure. Therefore, inexpensive but reliable estimation methods are required. Despite being estimates, the SOC must be as accurate as measurements obtained from current sensors enhanced with internal compensation. For use in electric vehicles, an SOC error within 5% over the full range is targeted.
Among estimation methods, Kalman filters (KFs) have proven particularly effective, with more than half of model-based SOC estimators relying on KFs and their variants [
4]. Notwithstanding the widespread use of Kalman filtering, estimating the SOC for aged batteries remains challenging, as it requires the concurrent estimation of SOC and capacity [
8,
9,
10]. Despite the strong correlation between SOC and capacity, existing methods typically estimate them separately, resulting in relatively limited comprehensive investigations into their simultaneous estimation [
11]. Even when simultaneous estimation is explored, new or minimally used batteries are typically applied [
12,
13,
14,
15,
16,
17,
18,
19,
20,
21,
22,
23,
24,
25]. This gap in research limits the development of BMSs for electric vehicles, where state estimation accuracy must be maintained throughout the battery’s lifespan. Besides KFs, alternative approaches such as using ultrasonic reflection waves for state-of-charge and temperature joint estimation have also been reported. In this method, a piezoelectric transducer attached to the battery surface transmits ultrasonic signals, allowing for the real-time monitoring of internal battery conditions in a non-destructive manner [
26].
Depending on how states and parameters in a battery model are estimated and updated, KFs are divided into two main groups. The first group uses a single filter to estimate states, with parameters updated separately through an optimization technique [
27,
28,
29,
30]. However, this method is burdened by higher computational demands due to the execution of two separate implementations. Determining the frequency of parameter updates can be another challenge in this method. The second group uses a dual filter for simultaneous state estimation and parameter updating [
31,
32,
33,
34,
35]. Robust Kalman filtering techniques, such as those presented in [
36], emphasize the strength of the dual extended Kalman filter (DEKF) in addressing model uncertainties, enhancing robustness and accuracy in state and parameter estimation. While this method allows for more comprehensive investigations of battery aging, it poses significant challenges due to the complexity of the variables involved. This complexity is further compounded by the limited number of direct measurements, namely, terminal voltage compared to the number of estimates. Consequently, the limited measurements can lead to less accurate estimations of battery states and parameters, affecting the reliability of the overall BMS and its decisions.
The co-estimation of SOC and capacity using a dual filter requires discussion in terms of various aspects: the types of battery models, the types of filters used to estimate states and parameters in the model, the specific settings of the filter, and the degree of battery degradation. Firstly, Thevenin models are most widely adopted, such as first-order (1RC) [
31], second-order (2RC) [
32,
33], and fractional second-order [
34,
35] models. While extended Kalman filters (EKFs) are commonly employed, variants such as unscented Kalman filters (UKFs) [
30] or adaptive extended Kalman filters (AEKFs) [
27] are sometimes used. In some cases [
31,
35], the filters operate at different time intervals to account for the distinct characteristics of state and parameter estimations. The filter must be initially set, but the values of error, process noise, and measurement noise covariances are not often specified [
33,
35]. Aged batteries are required for the filter to yield estimation results, and their degradation levels should preferably be wide and regular—for example, test data prepared at uniform intervals of 5% across the entire range of 30% in capacity fade—but unfortunately, they are not [
27,
28,
31,
33,
35]. In addition, despite using Kalman filtering, estimation results are often provided without confidence intervals. The summarized discussion from the aforementioned studies is presented in
Table 1. Tracking performance is typically evaluated by deliberately initializing the filter with incorrect state and parameter values, then measuring the convergence time and steady-state error. Each study claims superiority based solely on the steady-state error, generally measured by root mean square error (RMSE). However, in prior studies, filters have differed in structure, initialization, and covariance settings, and test data vary in battery degradation level and load profile. Such result-oriented information is insufficient to comprehensively assess the performance and validity of the previously reported estimators and to clearly understand them. Consequently, despite the abundance of prior studies, they are not readily applicable to practical BMS developments.
With these drawbacks in mind, our methodology for co-estimating the SOC and capacity is presented in this paper, with an emphasis on validating the developed estimators against abundant test data. Test data are crucial for identifying model parameters and validating state estimates. In this study, we extensively use as many as 30 datasets. These datasets are generated from three load profiles typically applied to modern plug-in hybrid electric vehicles (PHEVs). Nearly 30% capacity degradation is realized in the datasets, which is relatively evenly distributed across ten levels. An enhanced self-correcting (ESC) model, the augmented Thevenin model with a hysteresis element, is applied to a DEKF. This filter updates parameters in the model, ensuring that the model maintains its accuracy in estimating states regardless of battery aging.
2. Cell Testing
This section describes the test data generated to identify parameters in a battery model by using a fresh cell. The cell was then cycled to produce battery aging test data, which were used to trace changes in parameters and states with the developed cell model and its filter.
We generally followed the test conditions used in our prior study [
37]. We utilized a LIB cell (Samsung SDI, 18650-35E) with a nominal capacity of 3.5 Ah and a nominal voltage of 3.7 V. The load profiles were simulated by combining a DC electronic load (Kikusui, PLZ1004W) and a DC power supply (Kikusui, PWR800L). Seamless transitions between charging and discharging in the load profiles were achieved by integrating these devices with a charge–discharge system controller (Kikusui, PFX2512). Previously, we estimated current sensor bias to study its effect on SOC estimation, as the bias is a well-known source of process noise affecting the reliability of the SOC. Since the bias is not subject to change with battery aging, only a fresh cell was used. However, this study requires aged cells with reduced capacity to investigate the impact of capacity variations on SOC estimation. For this purpose, the cell was intentionally aged through continuous cycling, as depicted in
Figure 1b. Each cycle involved a constant current charge at 2 C-rate until the voltage reached 4.2 V, followed by a constant voltage charge at this level until the current dropped to 1 A. After resting for an hour, the cycle proceeded with a constant current discharge at 2 C-rate until the voltage fell to 2.5 V. This high current, applied over a wide SOC range, accelerated capacity loss. Before starting this cycle test, the cell was evaluated through an initial reference performance test (RPT), as shown in
Figure 1a. The test was repeated every 100 cycles up to cycle 300, and then every 300 cycles up to cycle 2100. The test was designed to mimic the real-world operations of PHEVs, as described below. First, the batteries in the vehicle are plugged in to charge. The electrical energy stored in the batteries is then used to propel the vehicle, gradually depleting SOC. This mode is thus referred to as the charge-depleting (CD) operation. When the available electrical energy in the batteries is almost depleted, the vehicle transitions to charge-sustaining (CS) mode. In this mode, the SOC is stabilized as the vehicle mainly utilizes the internal combustion engine for propulsion. The vehicle remains in this mode until the batteries are reconnected for recharging. In this study, three different but representative CD and CS operations were employed as load profiles: the City, Highway, and High-speed profiles represent stop-and-go city, highway, and aggressive highway driving, respectively. Unlike in the City and Highway profiles, CS is not included in the High-speed profile. In accordance with the prescribed operation of PHEVs, the test consisted of a constant power charge at 9 W until the SOC reached 90%, an hour rest, and a dynamic current discharge and charge until the SOC reached approximately 15%. At this level of SOC, transitioning from CD to CS occurred during both the City and Highway profiles.
A total of 30 datasets were finally obtained from the three load profiles with ten tests performed at cycles 0, 100, 200, 300, 600, 900, 1200, 1500, 1800, and 2100. As a result,
Figure 2 plots changes in the rated capacity, which decreases almost linearly per cycle, except at cycle 1500. At the final cycle, the capacity was measured to be 2.254 Ah. This final capacity corresponds to about 70% state of health (SOH), as it represents the present capacity relative to a nominal capacity of 3.23 Ah. This level of SOH falls below the end-of-life threshold for batteries in currently sold electric vehicles.
Figure 3 plots changes in the terminal voltage and SOC as capacity decreases. Due to space constraints, the measurement results from four specific cycles (0, 300, 1200, and 2100) are selectively presented. For each cycle, the three load profiles were applied to aging cells, and the initial SOC was set to 90%. As capacity decreased, cycles later than 300 were subject to abrupt termination due to rapidly reaching a cut-off voltage of 2.5 V.
3. Cell Modeling
This section formulates a cell model and identifies its parameters that can predict the terminal voltage despite battery aging.
We compared a typical Thevenin model to its variant, an enhanced self-correcting (ESC) model [
38], and selected the model with higher fidelity to the test data acquired from aged cells. The Thevenin model is one of the most widely utilized equivalent circuit models for battery state estimation, representing the time-dependent polarization voltages using one or more parallel resistor–capacitor (RC) elements. The ESC model extends the Thevenin model with an additional element that can describe the SOC-varying and instantaneous hysteresis voltages. These models were developed through an evolutionary process, progressively integrating states and their relevant parameters. Basically, the output voltage
was predicted based on the input current
flowing through the ohmic resistance
. The states for the SOC
and the current
flowing through the polarization resistance
paired with capacitance
were incorporated, building the Thevenin model. To complete the ESC model in
Figure 4, a state for the hysteresis voltage
was added, using the instantaneous hysteresis voltage
as an input.
As detailed in our previous study [
37], the model parameters were identified sequentially. Rather than identifying all parameters at once, the time constant
was directly solved using suboptimal linear optimization. Based on the initial guess of the decay rate of hysteresis
, the states were computed. Subsequently, the maximum hysteresis
, the maximum instantaneous hysteresis
,
, and
were iteratively solved by non-negative least-squares. Finally,
was updated iteratively. This sophisticated method, referred to as a sequential estimator in this study, allows for effectively estimating many parameters with only one measurement,
. In consequence, a total of 12 cases were created by applying the three load profiles to the four cell models. Both models utilize either one or two RC elements, referred to as Thevenin 1RC, Thevenin 2RC, ESC 1RC, and ESC 2RC. These models are named according to the load profiles applied during cell modeling. For instance, a Thevenin or ESC model developed using the High-speed profile is referred to as the High-speed model in this context. The load profiles used include City, Highway, and High-speed.
Figure 5 plots changes in the error between the measured and predicted terminal voltages, comparing modeling accuracy across the type of load profiles, the type of cell models, and the number of cycles. First, regarding the effect of load type, the error exhibits no appreciable difference among them. The High-speed models operating solely in CD mode produce RMSEs similar to those achieved by the City or Highway models, which operate in both CD and CS modes. Regarding the effect of cycle number, the error generally increases with decreasing capacity, irrespective of load and model types. This observation can be accounted for by limitations in representing the physicochemical changes in aged cells. Regarding the effect of model type, the ESC models evidently outperform the Thevenin models in predicting the terminal voltage. The ESC model maintains higher accuracy throughout all of the cycles due to the incorporation of the hysteresis voltage state and its associated parameter. Here, hysteresis refers to the phenomenon where SOC is prone to a lagging effect based on previous charging and discharging. The original intention of integrating the hysteresis element is to improve SOC accuracy by modeling this phenomenon; however, unintended advantages occurred. One of the main reasons for the increasing error is irreversible changes in the SOC–open circuit voltage (OCV) curve. The ESC model is more robust to this change because its hysteresis element allows for some adjustment of the curve over time. To support this claim, the difference in the error between the Thevenin and ESC models is not significantly large in the earlier cycles, but it gradually increases when approaching later cycles. This observation can be interpreted as the curve remaining intact during the early cycles, when battery aging is minimal. Another observation regarding the effect of model type is that the addition of RC elements significantly enhances the Thevenin model but does not improve the ESC model. This discrepancy arises because the hysteresis element reduces the relative influence of additional RC elements in the ESC model. Therefore, integrating the hysteresis element is more helpful in creating a cell model with greater fidelity, compared to simply increasing the number of RC elements. In consequence, we selected the ESC 1RC as the optimal model type. The state and output equations of the selected model are formulated as:
where
is the integer time index,
is the measurement time interval,
is the coulombic efficiency, and
is capacity. The three states
,
, and
are involved, along with the seven parameters
,
,
,
,
,
, and
, all of which are estimated using a dual filter.
The estimated values of and were compared against their true values obtained using the current sensors integrated in the test equipment. This dedicated sensor was designed for ultra-high precision measurement of currents, ensuring an accuracy of less than 0.01% across all measurement ranges. Although typical current sensors in commercial BMSs operate with single-digit accuracy around 3%, we aim to improve upon these less reliable measurements with estimates derived from the developed cell model and its filter.
4. Dual Filtering
This section formulates the DEKF and applies it to simultaneously estimate the states and parameters during battery aging. Among the parameters, the present capacity was solely estimated using a separate optimization technique due to its poor observability in the developed filter.
Fundamentally, the KF serves as the optimal state estimator, if the considered system is linear and the uncertainties are represented by Gaussian random variables. If the system is not linear, a linearization process is performed at every time step to approximate the nonlinear system as a linear time-varying system. The linearized system can then be utilized within the KF framework, generating the EKF that operates on the original nonlinear system. The original nonlinear system is in the form of a state-space model that describes the dynamics of states:
where
is the process noise with the covariance matrix
, and
is the measurement noise with the covariance matrix
. Here,
is a nonlinear state transition function, and
is a nonlinear measurement function.
The EKF uses this system model in a recursive process to update its estimate of the present state. The two steps, time update (prediction) and measurement update, are performed repeatedly, as follows. Note that
denotes a predicted value,
denotes an updated value,
denotes an estimated value, and
denotes the covariance of
. Initially, the present state is predicted given the past state estimate and the system input:
Next, the uncertainty of the state estimate is predicted:
From here, the measurement update is followed by the time update. The measurements of the system output are taken and compared to the predictions derived from the state estimate:
The state estimate is updated based on the prediction error and the Kalman gain. The Kalman gain is computed based on the sensitivity of the output to various states (represented by the Jacobian matrix in the EKF) and the uncertainty of the states (represented by the error covariance matrix):
Subsequently, the uncertainty of the state estimate is updated:
The original nonlinear system within the EKF refers to the cell model developed in
Section 3. The states in the model, SOC as an example, can change rapidly over time, spanning its entire range within several minutes. In contrast, the model parameters, such as capacity, change slowly. The SOC generally denotes the residual capacity with respect to the present capacity, leading to a strong correlation between them. Therefore, we devised the DEKF based on two separate but cooperating filters: the state filter and parameter filter [
39,
40]. While the state filter remains consistent with Equations (3) and (4), the parameter filter introduces a new state-space model that describes the dynamics of parameters:
where
is the process noise and
is the measurement noise. According to Equation (10), the parameters are essentially constant, but they may change slowly with respect to time and are modeled by a fictitious process noise. Each filter executes the aforementioned process to estimate either the states or the parameters, and they are linked by exchanging information during the time update step.
This interconnected process enables the more accurate tracking of changes in the states and parameters, with the exception of the present capacity. The weak observability of capacity has also been reported [
32,
41]. Here, the DEKFs have difficulty accurately estimating capacity. This observation is related to the facts that the terminal voltage is the sole measurement, and capacity is weakly linked to the terminal voltage, according to the derived system model. This inaccuracy in capacity estimation can also negatively impact SOC estimation, owing to its strong dependency on the present capacity.
Specifically for capacity estimation, the parameter filter of the developed DEKF was thus replaced with a separate optimization technique based on the weighted least-squares (WLS). As reported in series [
42,
43,
44], this method has been continuously improved, aiming at its BMS implementation. The basis of this method is to estimate the present capacity in the least-squares sense by utilizing changes in the shape of a charge curve as capacity decreases. The present capacity is estimated during battery charging, taking advantage of highly standardized and less dynamic charging profiles compared to arbitrary discharging profiles. In practice, however, the shape of the curve is influenced by operational factors before charging; for example, the duty cycle, rest time, SOC, and temperature. The WLS was thus selected to assign varying weights to each data point on the curve, since they differ in reliability for capacity estimation. In consequence, the WLS assigns greater weights to the later part of the curve, as these points are more dependable than those in the earlier part. Despite variations in such operational factors, a capacity error of less than 3% was achieved. However, this original scheme relies on prior information, specifically the past capacity estimate, which serves as the best guess for the true value of the present capacity. Consequently, if batteries or their BMSs are replaced, the stored value of capacity in the BMS no longer matches the actual value. The original scheme is enhanced to eliminate this issue, allowing it to operate independently of such replacements. Meanwhile, the original scheme relies on the WLS as its optimization technique, which necessitates Jacobian calculations and matrix inversion to solve the normal equation. As a result, during batch processing, state estimation requires substantially higher CPU usage and greater memory capacity than data measurement, creating challenges for BMS implementation. The original scheme is enhanced to mitigate these issues by replacing the batch estimator with a recursive estimator for solving the normal equation. This replacement helps distribute CPU usage more evenly and reduces the memory footprint since it eliminates the need for long, multi-vector formulations of the normal equation. By adopting this method, the present capacity was solely estimated during charging, while the states and the remaining parameters were simultaneously estimated by the DEKF during discharging.
5. Filter Validation
This section validates the developed filter using the test data collected during battery aging. As described in
Section 2, a total of 30 datasets were provided for this validation. These datasets encompassed the three load profiles applied to ten aged cells. The lowest capacity among these cells was 2.254 Ah, which is equivalent to less than 70% SOH. The validation result refers to an error between the estimated and measured values. As described in
Section 3, the ESC 1RC model created by averaging the parameters derived from the three load profiles was used for this validation. Compared to the Thevenin model, the ESC model provided greater fidelity to the battery aging test data, primarily due to the inclusion of a hysteresis element that captured changes in the SOC-OCV curve with battery aging. This result implies that the key aging-related parameters include
, which reflects an increase in internal resistance, and
and
, which represent alterations in the SOC-OCV relationship within the hysteresis element. In this study, a mean value (RMSE) was used to evaluate SOC estimates, whereas a final value was used for capacity estimates. Their measurements were commonly based on Coulomb counting with current sensors enhanced with internal compensation.
Based on the validation result, we developed three different schemes successively, as illustrated in
Figure 6. The first scheme relies solely on the simultaneous estimator to concurrently update the SOC and capacity, leading to inaccuracy in capacity estimation. To address this issue in the DEKF, the second scheme substitutes its parameter filter for capacity estimation with the WLS. Despite having this separate capacity estimator, maintaining accuracy during battery aging is still challenging. As an effective alternative, the third scheme repeats the sequential estimator to intermittently update all the model parameters. In the first and second schemes, this parameter identification for cell modeling is performed only once with a fresh cell, as described in
Section 3.
Figure 7 compares the sequential and simultaneous estimators in detail.
5.1. First Scheme
As a result of the first scheme,
Figure 8 presents the SOC error, comparing estimation accuracy across the type of load profiles and the number of cycles. Regarding the effect of cycle number, the SOC error generally increases as capacity decreases, except at cycle 1500. The test performed at this cycle is considered flawed because a similar anomaly is also observed in
Figure 2, which shows changes in the rated capacity. The cycles earlier than 900 exhibit an RMSE of less than 3%, regardless of load type. However, the error exceeds 3% and rises rapidly from cycle 900 onward, where the SOH reaches nearly 80%. Regarding the effect of load type, the SOC error remains similar across each type until cycle 1200. From this cycle, except for cycle 1500, the increasing error in the City model is less severe than in other models based on more aggressive load profiles. The SOC error in this model remains within 5% throughout all cycles. Capacity reduction necessarily accompanies an increase in internal resistance. Due to this increase, higher C-rates in the Highway and High-speed profiles cause greater voltage drops, particularly in more aged cells. As a result, the inability to accurately predict fluctuations in the terminal voltage leads to poor SOC estimation, which is more severe at low SOC levels towards the end of the load profile.
This inference can be confirmed in
Figure 9, which plots changes in the SOC error with its bounds against time, comparing estimation accuracy across the types of load profiles and the number of cycles. Due to space constraints, the estimation results from five specific cycles (0, 600, 1200, 1800, and 2100) are selectively displayed. The impact of an initial SOC on tracking performance is as significant as that of the cycle number. Here, some initial values are not visible because of the unified range of SOC error (−5~5%) across 12 cases. To address this lack of prior information,
Table 2 lists the initial SOC estimates for each load profile and cycle number. In this study, the initial SOC was determined using the SOC-OCV curve, assuming that the terminal voltage converges to the OCV after long enough relaxation periods. To mimic the actual operations of PHEVs, the test involves resting after charging to 90% SOC, as described in
Section 2. In the earlier cycles, the initial SOC is commonly estimated to be 88%, resulting in an SOC error of about 2%. Similar to the mean value (RMSE) of SOC in
Figure 8, the error in its initial value increases as capacity decreases, regardless of model type. This observation suggests that the curve established for a fresh cell loses effectiveness because of the physicochemical changes in aged cells. This inference was also made in
Section 3, and the current results provide even stronger evidence.
As another result of the first scheme,
Figure 10 plots changes in the measured and estimated capacities, comparing estimation accuracy across the type of load profiles and the number of cycles. The measurements are consistent with those in
Figure 2. The resulting estimates are consistent with our concerns about the weak observability of capacity in the DEKF, as discussed in
Section 4. This inaccuracy in capacity estimation can adversely affect SOC estimation, due to its direct relation to the present capacity. This claim is supported by the fact that, although there are slight differences depending on load type, the capacity estimation error generally starts to increase after 600 cycles, following the same trend as the SOC estimation results, as presented in
Figure 8. Therefore, from the next scheme, capacity is updated separately from the simultaneous estimator.
5.2. Second Scheme
As a consequence of the second scheme,
Figure 11 plots changes in the measured and estimated capacities, comparing estimation accuracy across the number of cycles. Unlike the first scheme, the present capacity is estimated during battery charging, corresponding to one single load type. This load profile is simply based on constant power charging at 9 W until the SOC reaches 90%, reflecting the real-world operations of PHEVs. By decoupling capacity estimation from the parameter filter of the DEKF, an estimation error within 3% can be realized, as witnessed in [
42].
Figure 12 presents the SOC error, comparing estimation accuracy across the type of load profiles and the number of cycles. The resulting estimates are comparable to those in
Figure 8, which is derived from the first scheme. Unfortunately, despite notable improvement in capacity accuracy, no corresponding enhancement is observed in the SOC. The SOC error is actually worse than in the first scheme, yielding RMSEs exceeding 5% after 600 cycles. Despite having capacities close to their true values, the decline is persistent. Compared to the cell model, its filter is much more responsible for this inaccuracy in SOC estimation. This is because, as demonstrated in
Figure 5, the ESC 1RC model can only retain accuracy within a certain range if its parameters are properly identified using the sequential estimator. Therefore, in the next scheme, all the model parameters except for capacity are updated intermittently with the sequential estimator.
5.3. Third Scheme
As a result of the third scheme,
Figure 13 presents the SOC error, comparing estimation accuracy across the type of load profiles and the number of cycles. The resulting estimates are comparable to those in
Figure 8 and
Figure 12, which are derived from the first and second schemes, respectively. As discussed in
Section 3, the sequential estimator is tailored to effectively extract multiple parameters in the ESC model using a single measurement. In place of the typical parameter filter in the simultaneous estimator, this method is applied prior to performing cycles 600, 1200, and 1800, where the SOC error starts to rise rapidly. As also shown in
Figure 14, an increase in SOC error is effectively suppressed, maintaining an accuracy within 5% throughout all the cycles.