1. Introduction
Air suspension can improve vehicle ride comfort and road friendliness, and its natural frequency is low and has variable stiffness characteristics [
1,
2,
3,
4,
5]. However, the general air suspension cannot adjust the suspension stiffness and damping according to the load change. The natural frequency and controllability of the electronically controlled air suspension (ECAS) are low, which can further improve the vehicle ride comfort and control stability [
6,
7,
8]. In recent years, research on ECAS has mainly focused on improving comfort and stability. In 2019, Rui modeled the ECAS system according to its nonlinear characteristics and designed an adaptive sliding mode control strategy. The method effectively improves the stability of the system by considering the parameter uncertainty [
9]. In 2021, Ma et al. designed an integrated control strategy to solve the problems of small stiffness adjustment range and poor roll stability of traditional ECAS systems. The handling stability and anti-roll performance of the vehicle are improved [
10]. In 2021, Hu et al. conducted research on the hybrid control of body height and attitude of the ECAS system. They built a vehicle model based on mixed logic dynamics and designed the switching strategy of the solenoid valve. The coordinated control between the ECAS system body height and attitude is well solved, and good vibration isolation performance and stability are achieved [
11].
However, the ECAS system is highly dependent on the reliability of each component. Sensors and actuators are very important components [
12,
13]. The structure of the ECAS system is shown in
Figure 1. The actuator is the air spring solenoid valve. If any of the four solenoid valves fail, the ride comfort and handling stability of the whole vehicle will be severely affected [
14]. Therefore, it is necessary to consider the reliability of the actuator of the electronically controlled air suspension system.
The fault diagnosis technology in the reliability research of the control system is widely used in the control system of various industries. The technology can effectively improve the reliability and maintainability of the system. The technology was first proposed in the 1990s by Frank, who divided the fault diagnosis methods into three categories based on an analytical model, expert knowledge, and signal processing [
15]. In 2009, Zhou further developed the method from qualitative and quantitative perspectives [
16]. In 2020, Feng proposed an SVM model and effectively distinguished different fault conditions of trains by using the support vector machine method [
17]. The fault diagnosis method based on qualitative analysis is mainly divided into the graph theory method, the expert system method, and the qualitative simulation method. Among them, the graph theory method includes the symbol-directed graph method [
18] and the fault tree method [
19]. The main principle is to judge the fault according to the logical causal relationship. This method can be understood easily and is widely adopted.
The fault location and type can be determined by fault diagnosis. In this way, fault-tolerant control (FTC) can be implemented. Alwi et al. classified fault-tolerant control methods in detail [
20]. Fault-tolerant control is usually divided into passive FTC (PFTC) and active FTC (AFTC). The common methods of passive fault-tolerant control include H methods based on ∞ control theory [
21] and sliding mode control theory [
22]. The characteristic of passive fault-tolerant control is that there is no need for fault diagnosis, and the controller parameters are not changed, so it is easy to implement, but the fault-tolerant control is limited. The active fault-tolerant control adjusts the controller parameters online or configures the controller structure units online based on the fault diagnosis information to realize the system stability, which is different from the passive fault-tolerant control. Active fault-tolerant control methods can be divided into two types: planning type and online adjustment type [
23]. In the planning of FTC, the controller is designed in advance for all possible faults of the system, and the corresponding controller is activated when the corresponding fault occurs. On the other hand, the online adjustment of the controller is mainly through adaptive control or control signal redistribution to achieve fault-tolerant control [
24,
25]. In 2020, Pang et al. designed a fault-tolerant controller based on the nonlinear suspension system, which combines the state feedback observer and the H-infinity observer. The purpose of fault-tolerant control is achieved by comparing the system state under the fault-free state to compensate [
26]. In 2021, Xue et al. considered the failure of a 1/4 active suspension actuator under the change of sprung mass. They used the finite-element neural network method to approximate the observer fault, and turned it into a linear matrix inequality problem. This method can obtain the results faster and more accurately, and can well adapt to the changing spring mass [
27]. The above methods are very effective, but the actuator fault diagnosis and fault-tolerant control for the ECAS system are rarely used. In order to fill the shortage in this field, this research will use the extended Kalman filter bank with an adaptive threshold as the observer, and use the online adjustment method to continuously improve the body height. Compared with the existing research, this method has stronger adaptability and accuracy.
At present, research on the fault diagnosis and fault-tolerant control of the ECAS system is scarce. In addition, research on ECAS vehicle height control rarely focuses on how to ensure effective control in the case of actuator failure. However, the ECAS system with actuator failure has many problems to be solved, such as risk analysis, fault diagnosis architecture, and fault-tolerant control strategy design.
The contribution of this paper is to use the extended Kalman filter bank based on an adaptive threshold to study the fault diagnosis and fault-tolerant control of the air spring solenoid valve. This method can change the threshold according to the system input, and can effectively reduce the probability of missed diagnosis and misdiagnosis. After fault diagnosis and isolation, corresponding active fault-tolerant methods are adopted for different types of faults. Among them, the method of online adjustment of controller parameters is used to improve the height and attitude control effect of the constant gain fault. In addition, a hardware-in-the-loop simulation platform is built to finally test the effectiveness and accuracy of the above methods. The HiL platform can realize the connection between the real controller and the simulation model of the controlled object, so as to form a complete loop to test the actual operation and feasibility of fault diagnosis and fault-tolerant control algorithm model in the real controller. This paper provides a research idea for the operation stability and fault-tolerant control method of electronically controlled air suspension.
The article is organized as follows. In the next section, the model of the height adjustment system of the electronically controlled air suspension is established. After analyzing the fault mode by fault tree method, the controlled object and actuator model are established. In
Section 3. The fault diagnosis mechanism based on the EKF group is designed and verified by simulation. In
Section 4, active fault-tolerant control is designed based on the previous section. In
Section 5, a hardware-in-the-loop simulation test bed is built to verify the fault diagnosis and active fault-tolerant control system. Finally, conclusions are drawn in
Section 6.
3. Fault Diagnosis of ECAS System Based on Adaptive Threshold
3.1. Fault Diagnosis System Architecture Based on KEFs
When the actuator of the ECAS system fails, the controller may adjust the charging and bleeding process of the air spring abnormally. Therefore, the actuator fault detection and isolation strategy shown in
Figure 4 is designed for the vehicle ECAS system.
The extended Kalman filter (EKF) algorithm is widely used for state estimation in nonlinear systems. The main idea is to use Taylor’s formula to transform the nonlinear model into a linear model and then perform Kalman filtering. Therefore, in view of the nonlinear characteristics of the ECAS system, the extended Kalman filter algorithm is used to design the fault diagnosis filter.
To design EKF1, selecting the state variable , defining measurement output , control input ; For EKF2, selecting state variable , defining measurement output , control input ; For EKF3, selecting state variable , defining measurement output , control input ; For EKF4, selecting state variable , defining measurement output , and control input .
According to Equation (5) and the selected state variables, measurement output, and control input, write the corresponding system and measurement equations, respectively. The general form of the system and measurement equation is as follows:
f(X), g(X), q(X) and h(X) derived from EKF1, EKF2, EKF3, and EKF4, respectively.
According to the Equation (11) design filter equation, the general form of the equation is:
where
is the estimate for state variables,
is the estimate for measurement output,
Y is the output for measurement,
U is the input matrix for control, and
Lk is the filter gain matrix. Process noise
ω and measurement noise
ν are mutually uncorrelated Gaussian white noise. The probability distribution characteristics are as follows:
where
Qk is the process noise covariance matrix, and
Rk is the measurement noise covariance matrix. The EKF algorithm is shown in
Figure 5, where
.
3.2. Calculation of Adaptive Threshold
Threshold selection is an important step to achieve fault detection and isolation. The adaptive threshold changes with system input, which can effectively reduce the probability of missed diagnosis and misdiagnosis. In this section, an adaptive threshold is designed according to the system model error and system input, so as to minimize the probability of missed diagnosis and misdiagnosis. The system model error includes linearization error and parameter uncertainty error. Firstly, the system equation of EKF1 is taken as an example to illustrate the analysis of the linearization error and parameter uncertainty error
The ECAS system equation has nonlinear characteristics. In the design process of the extended Kalman filter, the system equations need to be linearized, resulting in linearization errors. Equation (12) can be written in the form of
X. Thus, the state transition matrix
A and the control input coefficient matrix
B are obtained.
Thus, the linearization error matrix can be obtained as:
where
.
- 2.
Parameter uncertainty error
In the ECAS vehicle model, the uncertain parameters mainly include the sprung mass and the damping of the shock absorber. The sprung mass has the characteristics of uneven distribution and changes with vehicle masses. The damping value of the shock absorber is also different under different working temperatures. Therefore, the parameter uncertainty is introduced as follows:
The parameter value with real superscript represents the actual parameter value or the floating limit of the parameter. From this, the parameter uncertainty error matrix is derived as follows:
where
The model error matrix is introduced, and the system input is considered to determine the adaptive threshold. Then, the system can be expressed to:
where
.
Model error can be defined as
. Equations (13) and (23) are combined, so that the model error can be expressed as follow:
Integrating Equation (24), the error
ε can be deduced as:
The adaptive threshold for fault detection is given from the above equation as:
where
c calculates the acceptable deviation.
3.3. Fault Detection
The output residual is defined according to fault detection and isolation strategy as
. For the vehicle ECAS system, the residual characteristic description shown in
Table 2 can be obtained.
In
Table 2,
ri(j) is the output estimated residual,
i is the extended Kalman filter number (
i = 1 − 4), and
j is the measurement output number.
j = 1, 2, 3 represent the height change of the air spring, the vertical acceleration at the four corners of the body, and the internal air pressure of the air spring, respectively. Actuators 1–4 represent the front left, rear left, front right, and rear right air spring solenoid valves, respectively. Taking the residual
r as the fault detection indicator, there are three fault detection indicators, including the air spring height estimation residual
ri(1) and the vertical acceleration estimation residual error at the four corners of the car body
ri(2), and the air spring pressure estimation residual
ri(3). Each extended Kalman filter will produce the above three fault detection indicators (that is,
ri(1),
ri(2) and
ri(3)). Comparing the fault detection index with the adaptive threshold h, it can be detected whether the actuator has a fault.
According to Equation (27), when the fault detection index value is greater than or equal to the detection threshold, the actuator has failed. The corresponding detection index ri(j) = 1; when the detection index value is less than the detection threshold, the actuator has not failed. The corresponding detection index ri(j) is equal to 0.
As long as one of the three fault detection indicators exceeds the threshold, it is considered that a fault has occurred. The advantage of setting three fault detection indicators is to further reduce the missed diagnosis rate and increase the reliability and effectiveness of fault detection. By looking up
Table 2, you can not only know whether the actuator is malfunctioning, but also determine the location of the malfunctioning actuator; that is, fault detection and isolation are realized.
3.4. Simulation and Analysis
To verify the proposed fault diagnosis program, four fault behaviors shown in
Table 3 are selected. Fault 1 is a stuck fault, and Faults 2–4 are constant gain faults. In the fault behavior,
n represents the gain coefficient, and
δ represents the fault stuck value. The flow area of the valve port can be calculated from Equation (8). Therefore, the fault behavior represents the flow area of the valve port, corresponding to four fault behaviors. Actuators 1–4, respectively, represent the front left, rear left, front right, and rear right air spring solenoid valves. The fault diagnosis module was started at the same time as the vehicle height adjustment was started at 5 s, and the fault occurred at 8 s. The first 5 s is the process of the air spring model in AMEsim gradually returning to a steady state. The simulation results of Faults 1 and 2 detections are shown in
Figure 6 and
Figure 7.
As shown in
Figure 6, before the occurrence of Fault 1, the estimated residuals of the three fault detection indicators, namely displacement, acceleration, and air pressure output, are smaller than the adaptive threshold. The fault occurs at the 8th second, and the residual error
r2(1),
r2(2) and
r2(3) output by EKF2 all exceeds the adaptive threshold. The residual outputs by other filters still fluctuate around zero or are less than the adaptive threshold. According to
Figure 7 and the residual characteristics in
Table 2, the front left air spring solenoid valve is malfunctioning. The fault detection time is 11.1 s, 10.4 s, and 10.1 s, respectively. If one of
r2(1),
r2(2) and
r2(3) exceeds the threshold, it is considered that a fault has occurred. Therefore, the actuator failure was detected at 10.1 s.
As shown in
Figure 7, before Fault 2 occurs, the fault detection indicators are all less than the adaptive threshold. After the fault occurs, the output residuals of EKF1
r1(1),
r1(2),
r1(3) all exceed the adaptive threshold. The residual outputs by other filters are still smaller than the adaptive threshold, and the fault detection time is 11.4 s, 10.6 s, and 10.4 s, respectively. Therefore, the actuator failure is detected at 10.4 s. According to
Figure 7 and the residual characteristic
Table 2, it can be judged that the rear left air spring solenoid valve is malfunctioning.
Similarly, simulation verification was performed for Failures 3 and 4. According to the simulation results and the residual characteristics in
Table 2, it can be accurately known that the fault occurred in the front right and rear right air spring solenoid valves. In summary, the ECAS system fault diagnosis system based on the adaptive threshold is accurate and effective. It can correctly judge whether there is a fault and the location of the corresponding faulty solenoid valve.
6. Conclusions
In this paper, the fault diagnosis and active fault-tolerant control of the ECAS system under actuator fault are studied. Based on the fault model, an extended Kalman filter bank with an adaptive threshold is designed for fault diagnosis. In addition, online adjustment is adopted for fault-tolerant control.
Firstly, the ECAS vehicle model is simplified, and the mathematical model of vehicle height regulation is established as the basis of fault-tolerant control. The faults are classified into constant gain faults and stuck faults by the fault tree method. According to the fault type, an accurate fault mathematical model is established. Then, an adaptive threshold extended Kalman filter bank is designed as the observer. Each residual is compared with the adaptive threshold. Therefore, the fault location and type can be accurately located. This method improves the accuracy and speed of diagnosis. Then, based on the method of model analysis, the fault-tolerant control of the ECAS system under fault is successfully carried out by closing the air spring solenoid valve or adjusting it online.
The designed observer, controller, and vehicle model are run on the simulation platform. The validity of the above methods is verified by comparing the relationship between detection modes and adaptive thresholds under the four proposed fault behaviors. Finally, in order to verify the control effect on the actual vehicle, an HiL semi-physical test platform was built. Such a test platform can combine the actual controller with the simulation model. The test results prove that the fault diagnosis and fault-tolerant control methods proposed in this study can be used in actual controllers. At the same time, it can accurately diagnose the location and type of faults, so as to carry out effective active fault-tolerant control.