1. Introduction
As a product of the transition from traditional fuel to electric vehicles (EVs), plug-in hybrid electric vehicles (PHEVs) are currently a hot topic in research on new energy vehicles. As one of the keys to energy-saving technologies for hybrid electric vehicles, energy management strategies (EMS) can be divided into three main approaches: ① Rule-based (RB) energy management strategies [
1,
2]; ② optimization-based control strategies; and ③ intelligent control-based algorithm strategies. RB strategies commonly used in the research field include the charge-depleting-charge sustaining (CD-CS) strategy and fuzzy logic (FL)-based strategy. A rule-based strategy is simple and can be easily implemented in real time using look-up tables, although the fuel economy is relatively poor. Optimization-based EMS can be divided into instantaneous and global optimization. A representative of an instantaneous optimization algorithm is the equivalent fuel consumption minimization strategy (ECMS) [
3,
4,
5], which achieves a good real-time performance, but cannot be operated stably under different working conditions. Global optimization algorithms mainly include dynamic programming (DP) [
6,
7], Pontryagin’s minimum principle (PMP) [
8,
9], etc. Although global optimization algorithms can guarantee the minimum fuel consumption, all driving cycle information is needed and the calculation cost is also large. Therefore, these algorithms cannot be used for online control. In contrast, based on machine learning (ML), artificial neural networks (ANNs), and other intelligent algorithms, the global optimization algorithm can be applied online by fitting the optimal control law obtained by the algorithm [
10].
At present, a large number of studies have realized the online application of a global optimization algorithm in a hybrid electric vehicle energy management strategy through a neural network. In [
11,
12], the authors used the battery power and engine speed as control variables, and applied dynamic programming to calculate the global optimal fuel consumption. To obtain the best battery power of the hybrid electric vehicles (HEV), as well as the best engine speed, the neural network is trained using the vehicle speed, required power, battery state of charge (SOC), and driving style of the offline calculation results. In addition, another neural network module based on 11 variables has been used to predict the operating mode. This method accurately fits the calculation results of dynamic programming; however, the calculation cost of dynamic programming is significant, the neural network module is too bloated, and the complexity affects the stability of online applications. In [
13], it is shown that online rules extracted from trained neural networks are difficult to adapt to different driving conditions. A large number of current studies have improved the generalization of energy management strategies by improving the neural network algorithm; however, they have rarely started from the improvement of optimization algorithms to improve the robustness of the neural network. To a certain extent, the complexity of the optimization algorithm determines the robustness of the online control strategy. This study therefore improves the accuracy of neural network recognition by improving the dynamic programming algorithm.
The calculation cost of dynamic programming mainly depends on the dimension of the state variables and control parameters, as well as the degree of dispersion in each dimension. For power-split hybrid vehicles or vehicles with gearboxes, the torque and speed are decoupled, and because they are all continuous variables, the transmission system has two degrees of freedom. To reduce the complexity of dynamic programming, in [
14], the best point of operation efficiency according to different engine powers of hybrid vehicles was found when applying a power-split configuration, and the speed corresponding to this point was used as the engine operating point under the current power. In addition, the control variable is simplified to the torque distribution of the two power sources. However, this method limits the adjustment range of the motor’s operating point and increases the overall energy consumption of the vehicle under certain harsh working conditions. Aimed at the P2 configuration hybrid vehicle with continuously variable transmission (CVT), in [
15], the authors determined the CVT speed ratio using instantaneous optimization during each calculation step of the dynamic programming. The control variable is simplified to the engine torque, and the problem of two degrees of freedom is thus converted into a one-degree problem. However, the weight coefficient during instantaneous optimization depends on engineering experience, which leads to a discrepancy between the instantaneous optimization solution and the global optimal solution.
Dynamic programming needs to predict all the information of driving conditions and does not have a real-time performance. However, the minimum strategy for the equivalent fuel consumption derived from PMP achieves a good real-time performance, and has therefore been widely used [
16,
17,
18,
19]. A single variable of the equivalent factor can be used to calculate the battery consumption or recovered energy equivalent to the fuel consumption, and the optimal solution of each instant can be solved by minimizing the equivalent fuel. Therefore, some researchers have combined DP and ECMS. For example, [
20] uses the DP algorithm to optimize the vehicle operating parameters (under the best fuel economy in the entire cycle), uses these parameters to solve the equivalent factor, and selects the best equivalent factor according to the different initial conditions of the car. However, the equivalent factor obtained by this method is fixed and cannot be adjusted in real time according to practical working conditions, so the robustness and fuel economy are poor. Moreover, [
21] uses DP to match the equivalent factors, construct the best equivalent factor map diagram under different SOC, and adjust the equivalent factor in real time, according to the current SOC of the vehicle. However, this method only considers the influence of SOC on the equivalent factor and does not consider the driving conditions, which is too limited.
In [
22,
23,
24,
25,
26], the authors proposed the approximate equivalent fuel consumption minimization strategy (A-ECMS) based on the approximate minimum principle (A-PMP). The motor output torque is constrained in a set of five optional values, which greatly simplifies the calculation cost, and its reliability has been proved in an actual vehicle experiment. Based on this, this study nests the A-ECMS into DP to form an improved dynamic programming algorithm with an equivalent factor as the single control variable. Firstly, the A-ECMS model is established, where the output torque of the motor is constrained to a limited number of optional values by numerically fitting the efficiency of the engine and the motor. The instantaneous energy distribution is obtained by A-ECMS of different equivalent factors at each instant, and the SOC variation under the energy distribution is calculated. Then, the DP global optimization model is established, in which the equivalent factor is the control variable, the battery SOC is the state variable, and the cumulative fuel consumption is the global optimization goal. Therefore, the DP-A-ECMS algorithm is formed. In order to verify the algorithm proposed in this paper, a PHEV computer simulation model was built to simulate under different driving conditions, and the optimal equivalent factor sequence was obtained. Finally, in order to realize the online application of DP-A-ECMS, a BP neural network was used to extract the nonlinear correlation between the optimal equivalent factor and real-time operating conditions, and the optimal equivalent factor sequence was transformed from a set of time-varying sequences to a set of state-varying sequences. By doing this, an online control strategy could be established.
3. Offline Optimization Algorithm
3.1. Approximately Equivalent Fuel Consumption Minimum Strategy
Originating from the PMP algorithm, the ECMS algorithm was proposed by Paganellig [
5] and applied to the energy management strategy of HEVs. The core idea is to select an appropriate charge and discharge equivalent factor to transform the electricity consumption into the fuel consumption, minimize the instantaneous equivalent fuel consumption, and obtain an approximate global optimal solution. The mathematical model can be formulated as
where
is the instantaneous equivalent fuel consumption rate;
is the battery capacity level;
and
are the engine torque and CVT speed ratio, respectively, and are system control variables;
,
, and
are obtained from the model of each component in the first section;
is the low heating value of gasoline;
is the equivalent factor; and
is the equivalent fuel consumption of the battery rate. The constraint conditions are as follows:
Ouyang and Qin proposed the A-ECMS. By numerically fitting the fuel consumption rate of the engine and battery, the search space of the optimal control variable is reduced to shorten the optimization time. In this paper, A-ECMS is analyzed according to the simulation model in Chapter 1.
Figure 2a shows the fitting curves of the instantaneous fuel consumption of the engine and
Figure 2b shows the equivalent fuel consumption of the battery (the battery SOC is 0.5 and the equivalent factor is 1). It can be seen that, at a certain speed, the relationship between the instantaneous fuel consumption and the output torque of the engine can be divided into three sections. For HEVs, the actual working requirement of the engine is to avoid working in the low-load area (the first section). Therefore, the linear and quadratic functions are used to fit the second and third sections of the instantaneous fuel consumption curve of the engine. The intersection of the two curves was recorded as
. The battery equivalent fuel consumption curve can be divided into two sections. Each curve is composed of two quadratic functions, and the point where the motor output torque is zero is the connection point of the two quadratic curves.
Equations (13) and (14) are fitting functions, where
,
,
, and
(
) are the fitting parameters. According to
Figure 2 and the definition of the convex function, it can be determined that, at a certain speed, the instantaneous fuel consumption of the engine is a convex function on
, and the instantaneous equivalent fuel consumption of the battery is a convex function on
.
The goal of ECMS is to minimize the equivalent fuel at each instant. Because the instantaneous fuel consumption of the engine and the instantaneous equivalent fuel consumption of the battery are convex functions, the nature of the convex function shows that the minimum value can only be obtained in
, and among them,
Therefore, the constraint in the ECMS mathematical model (Equation (12)) is transformed into Equation (16), and the A-ECMS is then formed.
At this point, in the A-ECMS calculation process, the optimal solution can only be determined by comparing the equivalent fuel function values of these five points. This reduces the search area of the optimal solution from the allowable reachable set of the entire control variable to five search points, significantly reducing the calculation time and calculation storage space. This process also leads to a discontinuous relationship between the optimal energy allocation and the equivalent factor.
Figure 3 shows the change in the optimal distribution of the power of the engine and motor corresponding to different equivalent factors when the vehicle speed is 60 km/h and the required torque is 30 Nm. It can be seen that the optimal power distribution curve is stepped; in other words, an interval of equivalent factors corresponds to an optimal power distribution. Therefore, in the A-ECMS solution process, no matter how the equivalent factor changes, the power of the engine and motor corresponding to the optimal equivalent fuel consumption is often concentrated at a few specific values.
3.2. Improved Dynamic Programming Algorithm
In a traditional dynamic programming algorithm used in the energy management strategy of a PHEV, the battery SOC and the transmission speed ratio are usually taken as state variables, and the motor torque is taken as a control variable. For CVT transmissions, owing to the continuously changing characteristics of the speed ratio, to maximize the transmission efficiency of the vehicle, the CVT speed ratio needs to be discretized into a series of values within its feasible range. In addition, for the motor torque, intensive dispersion is also required. Therefore, the direct application of a dynamic programming algorithm to global optimization of the EMS requires significant computational costs.
As mentioned above, the equivalent minimum fuel strategy can solve the instantaneous optimal solution through the equivalent factor, and the optimal solution can only be obtained from a limited number of specific values after a numerical fitting of the fuel consumption rates of the engine and motor. In view of this, in this study, the A-ECMS is combined with the DP algorithm to form an improved dynamic programming algorithm. In this method, the A-ECMS is added to each step of the DP process. Equivalent factors are taken as the control variables to calculate the instantaneous energy distribution of the A-ECMS, and the variation of SOC and the instantaneous fuel consumption of the engine under the energy distribution are then calculated. Taking the battery SOC as the state variable and the cumulative instantaneous fuel consumption as the optimization objective of DP, the optimal control law of the equivalent factor is obtained by offline global optimization. A flow chart of the algorithm is illustrated in
Figure 4.
The state transition equation and DP cost function are as follows:
Starting from the determined end state of the system, the optimal equivalent factor of each stage and each state is obtained in reverse order, according to Equations (17) and (18), respectively, and all optimal equivalent factors are stored to obtain the optimal control variable matrix . Then, starting from the initial state of the system, the optimal equivalent factor at this moment in the optimal control variable matrix is determined, and the engine fuel consumption under this equivalent factor is calculated using the ECMS model. The best equivalent factor is then recorded at each instant such that the best equivalent factor sequence and the best fuel consumption are obtained. Because of the discontinuous relationship between the optimal energy distribution and the equivalence factor, the optimal equivalent factor at each instant is within an interval composed of the upper and lower limits of the optimal equivalent factor, and the sequence of the optimal equivalent factor in the entire cycle is divided into the upper limit sequence of the effect factor and its lower limit sequence.
3.3. Offline Optimization Simulation Results
To cover the different driving conditions including highways, urban suburbs, and congested streets, this article uses six standard driving cycles to simulate the operating conditions of the vehicle: (1) Urban Dynamometer Driving Schedule (UDDS); (2) SC03; (3) Highway Fuel Economy Driving Schedule (HWFET); (4) US06_HWY; (5) Manhattan; and (6) New York City Cycle (NYCC). Each driving cycle goes through multiple successive iterations to ensure a sufficient driving distance and duration time. Based on the MATLAB/Simulink platform, a numerical model of the vehicle components and the DP-A-ECMS strategy model are established, and a backward simulation model of the vehicle dynamics is built.
Figure 5 shows the battery SOC change curve under different initial conditions for the 11-SC03 (a) and 9-HWFET (b) operating conditions. It can be seen that each curve always fluctuates slightly near the straight line formed by the initial SOC and the final SOC. Under this condition, the battery can be discharged smoothly throughout the cycle, and the vehicle can fully use battery energy to reduce the engine fuel consumption.
Figure 6 shows the optimal sequence of the upper and lower limits of the equivalent factor with initial SOCs of 0.9, 0.6, and 0.3 under the 11-SC03 and 9-HWFET operating conditions, in which meaningless points (such as speed and demand torque values of zero) and the point of pure electric driving are removed. In the SC03 driving cycle, the optimal sequence of the equivalent factor increases as the initial SOC decreases, and the effective data points increase. This is because, when the battery capacity is sufficient, the vehicle tends to drive in pure electric mode. In contrast, the time of the hybrid mode is increased when the SOC is low, and the equivalent factor is maintained at a higher level to ensure the SOC balance. Under the HWFET operating condition, owing to the higher average vehicle speed, the equivalent factor is always larger, and the engine thus needs to output a higher power.
Table 2 shows a comparison of the fuel economy and simulation calculation time for DP-A-ECMS and DP strategies under different initial SOCs, and the gap with DP-A-ECMS is calculated based on the DP. Because of the previous numerical fitting of the DP-A-ECMS algorithm, fuel consumption is increased by 0.18%–1.35% compared with DP. However, the control variable of DP-A-ECMS is limited to specific values instead of the entire feasible range, and its calculation cost is therefore reduced by more than 50%.
5. Conclusions
This paper has described the online energy management of PHEVs based on an offline optimization algorithm. The minimum strategy was analyzed, and the optimal strategy for the approximately equivalent fuel consumption was derived. Combined with the DP algorithm and taking the equivalent factor as the control variable, the A-ECMS was nested in the DP algorithm, and the DP-A-ECMS algorithm was formed to calculate the global optimal sequence of the equivalent factor through DP global optimization.
Taking a single-axis parallel CVT PHEV as the research object, a Simulink simulation model of the vehicle was established, and an off-line simulation calculation was carried out under the DP-A-ECMS strategy. The results show that, under different working conditions and different initial SOC conditions, the SOC curve under the DP-A-ECMS strategy fluctuates near the straight line formed by the initial SOC and the final SOC, and the battery can be discharged smoothly. Compared with the DP strategy, although the fuel consumption is slightly improved, the calculation cost is significantly reduced.
A BP neural network was used to extract the nonlinear relationship between the optimal equivalent factor and real-time operating conditions and vehicle states. The control strategy rules suitable for specific operating conditions were extracted, and an online control strategy based on a BP neural network was established. The simulation results show that the neural network control strategy based on optimal equivalent factor recognition has a good robustness, and the change in battery SOC can track the reference curve well. Compared with a direct method of identifying the motor torque and CVT speed ratio, the fuel economy is improved by 2.46%.