1. Introduction
Electric vehicles (EVs) have been accepted as a global consensus owing to the shortage of petroleum resources and environmental concerns [
1,
2]. Nonetheless, the cost and energy density of batteries have considerably limited the development of EVs. Therefor, plug-in hybrid electric vehicles (PHEVs) have become a promising solution due to their significantly lower capacity of battery requirement and no range anxiety compared to pure battery electric vehicles (BEVs) [
3]. To provide a preferable energy-saving potential, a scientific energy management strategy (EMS) is necessary to online optimize the distribution of the demanded power between the internal combustion engine (ICE) and electric motors (EMs) of the PHEVs [
4]. However, it is still a challenging issue owing to the high nonlinearity of the hybrid powertrain, and is attracting numerous investigations in the literature [
5].
Various methodologies have been provided to address the energy management problems, which can be categorized into rule-based strategies, optimization-based strategies, and learning-based strategies [
6]. The most representative rule-based EMSs can be deterministic strategy [
7], fuzzy logic strategy [
8], and feedback control strategy [
9], as well as charge-depleting and charge-sustaining (CD-CS) strategy, especially for PHEVs [
10]. All of them can be easily implemented in real time due to their lower computational load and higher reliability. Nevertheless, these methods require empirical or heuristic knowledge, leading to nonoptimal solutions. Therefore, optimization-based strategies, including offline optimization and online optimization, have been increasingly proposed and demonstrated to significantly promote energy-saving potential [
11]. The offline optimization EMSs are mainly developed by convex programming [
12], dynamic programming (DP) [
13], Pontryagin’s minimum principle (PMP) [
14,
15], or other optimization methods [
16], which can provide the global optimal or near-optimal solution. However, they are difficult to online implement due to their prior knowledge assumption and elevated computational effort, but they can be deployed as benchmarks or incorporated into other methods to enhance the online EMSs [
17,
18]. By contrast, online optimization approaches are more and more attractive owing to their abilities in real-time operation, for example, the equivalent consumption minimization strategy (ECMS) derived from PMP can convert the global optimization into an instantaneous or local optimization problem to realize the real-time application [
19,
20]. Furthermore, some alternative PMP-based EMSs, e.g., adaptive PMP (A-PMP) [
21], adaptive ECMS (A-ECMS) [
22], map-based ECMS [
23], and mode-switching ECMS [
24], are also employed for developing real-time EMSs. All of them can regularly provide acceptable performance, yet their drawback is the requirement of fruitful history operation information in order to obtain an adaptive co-state or equivalence factor.
Another kind of online optimization method is predictive-based EMSs, in which the power split is optimized based on the predicted information (e.g., power demand, velocity, etc.) [
25,
26]. Model-predictive control (MPC) is mainly deployed to realize predictive energy management, where the forecasting methods are utilized to predict future information [
27,
28], and the global optimization methods (e.g., DP or PMP) are integrated into EMSs to obtain the optimal solution in the predicted receding horizon [
29,
30]. Moreover, vehicle-to-vehicle (V2V) and vehicle-to-infrastructure (V2I) information can also be employed in MPC-based strategy to facilitate the forecasting of the vehicle velocity in some specific scenarios [
31]. MPC-based strategies can successfully overcome the impact of the uncertainty of driving cycles in real time, but the control performance (e.g., fuel economy) is excessive, dependence on the accuracy of the forecasting model. With the development of artificial intelligence, the learning-based method gradually plays a key role in the online energy management optimization of the PHEVs, which can be classified into artificial-neural-network-based (ANN-based) and reinforcement-learning-based (RL-based) [
32]. For the ANN-based method, ANN is commonly utilized to estimate the near-global optimal solutions or forecast driving cycles [
33,
34]. However, only if the characteristics of driving cycles are close to the training data, the results will be efficient. Thus, the training data need to be updated for ANN so as to achieve better performance for various driving cycles [
35]. More recently, RL-based methods have attracted more and more attention due to their model-free attribute and remarkable adaptability [
36,
37]. Q-learning (QL) is one of the most popular approaches; however, the training process is reasonably time-consuming, and the “curse of dimensionality” may be aroused [
38]. Hence, a deep Q-leaning (DQL)-based strategy is suggested to overcome the unstable learning characteristics of the tabular Q-learning method [
39]. Nevertheless, it commonly falls into an overestimation of action values. Consequently, double-DQL (DDQL) is recommended to address this problem and has exhibited excellent fuel economy improvement [
40]. Unfortunately, the complexity of the neural network may inevitably lead to a steep computation burden and overfitting problems, which is unsuitable for the real-time application of EMSs.
Generally speaking, neither optimization-based nor learning-based EMSs are tough to implement in real time. Considering the timeliness and optimality of the strategy for unknown driving cycles, simplicity and strong robustness are essential for real-time EMSs. The PMP-based method and the derived strategy (e.g., ECMS) are more suitable for real-time application, owing to their instantaneous optimality and simplified control variable [
41,
42]. The main challenge is determining how to guess the co-state of PMP or the equivalent factor of ECMS in real time to guarantee the optimality and robustness of EMS [
43]. In some cases, the co-state is assumed to be a constant value, if the influence of the battery state-of-charge (SOC) on battery voltage and resistance is neglected [
44]. The optimal co-state (or equivalent factor) can be successfully derived based on prior known driving data. However, it may not be applicable for an unknown driving cycle. Therefore, adaptive or predictive approaches are commonly employed for the real-time application of PMP (or ECMS) [
14,
21,
22,
45]. Yet, the adaptive or predictive controller is only based on the typical historical driving cycles without considering the randomness of the driving cycle and vehicle mass, thereby leading to an unreasonable solution for real-time execution.
Owing to the straightforward control logic, feedback control is considered to be one of the most effective methods for real-time control of the EMSs [
5,
7,
15,
19,
21,
35], in which battery SOC is usually deployed as the feedback parameter [
3,
18,
46]. Hence, it is crucial to provide a reasonable reference SOC trajectory close to the optimal battery SOC trajectory. Linear reference SOC planning is the simplest method to design a reference SOC trajectory. Once the whole trip distance and the battery SOC limitation (i.e., initial SOC and terminal SOC) are specified, the reference SOC trajectory can be easily attained [
18,
21,
23]. Nevertheless, it may be extremely different from the optimal SOC trajectory since the actual battery SOC trajectory is strongly nonlinear. Hence, numerous nonlinear reference SOC planning approaches are provided to forecast the reference SOC trajectory [
17,
24,
28,
29,
46]. However, a desirable estimation result is difficult to obtain unless the complexity of the method is increased, for which a higher computational burden may be aroused. This would be a significant drawback for real-time applications. On the other hand, trajectory tracking is another critical issue once the reference SOC trajectory is successfully designed. The PID-based (or other iterative-searching-based) controllers are constantly employed for tracking problems, whilst acceptable tracking performance can be provided [
16,
26,
34,
47]. Nonetheless, when the output parameter of the controller is designed to be the co-state of the PMP, unwanted fluctuations may be evoked, which can also lead to undesired increases in the energy consumption. Thus, a model-free-adaptive-control-based (MFAC-based) methodology was recommended to smooth the co-state in our previous research, in which the results demonstrated a meaningful improvement in the fuel economy of PHEVs [
48]. However, it still needs to be strengthened. First, the reference SOC in the previous MFAC-based EMS is designed to be a linear function, which may be distant from the optimal SOC trajectory, thus limiting the energy-saving potential. Second, the impact on the robustness of EMS due to stochastic driving cycles and vehicle mass is not yet considered. This may lead to unpredictable increases in energy consumption at different actual driving cycles with external disturbances in real time. Inspired by Refs. [
49,
50], this paper employs the Design for Six Sigma (DFSS) method to optimize the MFAC-based EMS in order to strengthen the robustness of the strategy for stochastic driving conditions in real-time application.
The main innovation and contribution are summarized as follows. (1) A deterministic optimization method based on multi-island genetic algorithm (MIGA) is proposed to optimize the MFAC-based EMS, and its robustness is evaluated by Monte Carlo simulation (MCS). (2) A robust design framework is proposed on the basis of the DFSS to enhance the adaptive capability of the MFAC-based EMS, in which a real-time SOC constraint derived from the linear reference SOC is incorporated into the PMP to further reinforce the energy-saving performance of the PHEV whilst operating in various driving conditions.
The remainder of the paper is organized as follows.
Section 2 presents the studied PHEV powertrain models and vehicle dynamic models. In
Section 3, MFAC-based EMS is described, and optimized by MIGA, followed by the MCS analysis of the deterministic optimization results. The robust design framework of the MFAC-based EMS based on DFSS is formulated in
Section 4, whilst the SOC constraint is also designed according to linear reference SOC.
Section 5 introduces the actual driving cycles together with DP and rule-based strategy to verify the robustness of the proposed strategy. Finally, conclusions are summarized in
Section 6.