1. Introduction
Traditional demining relies on manual labor, which poses a risk of casualties among deminers and no longer meets the current requirements for demining operations. Robotic manipulators can replace humans in grasping and transporting unexploded ordnance for disposal. Large-scale EOD robotic manipulators, due to their unique operating conditions and substantial structural mass, often face issues of insufficient control precision and slow adjustment processes. Therefore, effectively optimizing the control strategies for demining robots has always been a key research direction in the scientific community.
Sliding mode control is an effective method to address the uncertainties and external disturbances of robotic manipulators. Shiyuan Jia et al. [
1] proposed a new continuous adaptive integral sliding mode controller, which is free from chatter and singularities, making it easy to implement continuous non-chattering control commands. Saim Ahmed introduced a model-independent adaptive fractional high-order terminal sliding mode control (AFO-HoTSMC) to achieve trajectory tracking of robotic manipulators under variable payloads with uncertainties and external disturbances, using fractional-order (FO) control and HoTSMC to achieve rapid finite-time convergence, non-chattering control input, better tracking performance, and robustness. Teng Long et al. [
2] proposed a combined control method, using an improved nominal model-based sliding mode controller (NMBSMC) as the main controller to output driving torque and an actor-critic-based reinforcement learning controller (ACBRLC) as the auxiliary controller to output small compensatory torque. The two different structures of sliding mode controllers fully utilize mathematical models and actual system measurement data to improve the tracking accuracy of the manipulator’s vibration balance position. Xi Ruidong [
3] used adaptive technology integrated with radial basis function neural networks to propose a robust adaptive sliding mode controller and introduced a new variable structure scheme to effectively suppress chattering phenomena, demonstrating excellent smooth tracking performance and stability during the motion of the manipulator. Wang Yang [
4], with the improved predefined-time control (PTC) algorithm and non-singular design method of terminal sliding mode, proposed a new non-singular terminal sliding mode control (NTSMC) scheme to ensure the prescribed time convergence of tracking errors, improving the precision of convergence time. Anh Tuan Vo [
5] designed a neural integral non-singular fast terminal synchronous sliding mode control (NINFTSSMC) method, which can compensate for the collective uncertainties of dynamics more quickly and accurately, completely addressing the problem of state variables approaching the sliding surface during the reaching phase, enabling the manipulator to achieve the specified performance, rapid error convergence, dynamic uncertainty robustness, minimal chattering, synchronization, and high precision.
PID control, due to its simple structure and strong adaptability, has become the mainstream control method for robotic manipulators. Despite the widespread application of PID control in industry and robotic technology, this method has some inherent limitations when dealing with complex, dynamic, and uncertain environments. Especially in scenarios requiring high precision and rapid response, traditional PID control strategies often fail to meet the demands. For instance, the performance of a PID controller largely depends on the adjustment of its parameters, which typically require professional knowledge and experience for their initial setup. This can lead to suboptimal control effects in variable operational environments. Moreover, traditional PID control methods are sensitive to system nonlinearity and external disturbances, potentially failing to achieve precise control of robotic manipulators, particularly when executing complex tasks or in dynamically changing environments. These limitations are particularly evident in the control of robotic manipulators. For example, EOD robotic manipulators need to ensure safety while achieving precise and agile operations during demining missions. However, traditional PID control methods may not effectively address the uncertainties and nonlinearity encountered during operations, leading to insufficient control accuracy and slow response times. Furthermore, these control methods have limitations in terms of parameter adjustment and adaptive capabilities, making it difficult to meet the performance requirements of manipulators across different tasks and environments. Therefore, developing new control algorithms or improving the existing PID algorithms [
6] has extremely important practical value and research prospects. To improve the control precision of robotic manipulators, domestic and international scholars have explored fuzzy PID control methods. Satyam Paul et al. [
7] designed an active vibration damping controller that combines PID with type-2 fuzzy logic (T2-F-PID), using Lyapunov analysis to establish the stability verification of the controller, and the vibration damping results were significantly improved compared to traditional PD/PID controllers and type-1 fuzzy PID controllers. Dongyang Shang et al. [
8] adjusted the parameters of the PI controller in real time through a fuzzy adaptive control strategy, which can effectively eliminate the influence of friction torque in flexible manipulators and reduce speed errors. Haibo Zhou et al. [
9] proposed an orthogonal fuzzy PID intelligent control method, using orthogonal test methods to adjust PID parameters for rapid determination of PID parameters, which has better accuracy and stability compared to fuzzy PID control methods. Fufeng Xue et al. [
10] proposed a motion control strategy based on a fuzzy PID controller, which controls the tracking error caused by the transmission mechanism of the deep-sea manipulator within 0.04 mm. John Kern et al. [
11] proposed an adaptive neural fuzzy inference system controller for a three-degree-of-freedom manipulator, learning the inverse dynamic model of the robot through a structured dataset, and the controller shows extraordinary precision and proficiency in tracking the reference trajectory of the manipulator, with minimal deviation, overshoot, or oscillation. Prasenjit Sarkhel et al. [
12] used a manually designed fuzzy logic-based method to tune PID gains and compared it with traditional tuning methods, where the rapidly exploring random tree (RRT) method performed best in terms of phase margin and gain margin. However, designing fuzzy PID control algorithms requires careful design of fuzzy rules and membership functions based on experience, which is time-consuming and challenging. Fuzzy PID control still has certain limitations in dealing with complex nonlinear and uncertain problems of multi-system coupling. Adaptive control has been introduced into control strategy research. Junyoung Lee et al. [
13] proposed an adaptive PID control and applied it to the control of a whole arm manipulator (WAM) robot holding a baseball bat, which has the same high stability and performance as the adaptive time-delay control (TDC), stronger robustness, and model-free characteristics. Ye Cao [
14] developed a robust adaptive fault-tolerant control scheme that does not require fault information and does not need an accurate mathematical model of the robot, achieving ideal tracking control. Akram Ghediri [
15] proposed an adaptive PID gain tuning design based on a deep deterministic policy gradient reinforcement learning agent for the PID computed torque control of robotic manipulators, considering the existence of unmodeled dynamics and external disturbances, and the results show that the method is robust to bounded internal and external disturbances and shows good trajectory tracking performance. Metaheuristic algorithms [
16,
17,
18,
19] can solve the optimization problem of PID controller parameters very well, providing the best results in a short time in various applications. Some common metaheuristic algorithms include genetic algorithm (GA), ant colony optimization (ACO), simulated annealing (SA), whale optimization algorithm (WOA), particle swarm optimization (PSO) [
20,
21,
22], etc. M. Karami [
23] proposed an optimal nonlinear PID control based on ACO to solve the performance degradation of micro-robot systems under disturbances or small friction forces. Alma Alanis [
24] proposed an adaptive neuron PD controller and a multi-layer neural PD controller for mobile manipulator position tracking, which showed faster learning speed and convergence time than backpropagation-based training and can dynamically adjust gains and apply to the KUKA Youbot mobile manipulator.
Large-scale EOD robotic manipulators, due to their substantial weight, have been driven by hydraulic systems to provide force and torque for the manipulator’s motion, resulting in significant inertia and hysteresis, as well as model uncertainty during the grasping and transportation operations. Therefore, these control methods may not be suitable for large-scale demining robotic arms.
In response to the multi-system coupling, model uncertainty, and issues of low control precision and slow response speed of large-scale EOD robotic arms, this paper considers modeling the mechanical system and hydraulic system of the demining manipulator separately in Adams and AMEsim. Through joint simulation, various motion parameters are obtained. To address the impact and vibration issues caused by sudden directional changes during the motion of the manipulator, B-spline curves are used for smooth trajectory processing, effectively reducing vibrations and improving control precision. Finally, a PSO algorithm-optimized BP neural network PID control method (PSO-BP+PID) is proposed. The PSO algorithm optimizes the connection weight matrix of the BP neural network, effectively solving the problem of local minimum values that may occur during the training process of the BP neural network. This method is compared with BP neural network PID control, genetic algorithm-based PID control, and WOA algorithm-optimized BP neural network PID control, showing good response speed and control precision, and stronger robustness.
3. Results
In the PSO algorithm, the inertia weight determines the flight path of particles in the solution space. Using a fixed inertia weight is simple and intuitive in implementation, but this approach may cause the algorithm to fall into local optima prematurely during the search process, thereby limiting its ability to explore global solutions. In contrast, a linearly decreasing inertia weight strategy can maintain strong global exploration in the early stages of optimization and gradually enhance the search for local optima as iterations proceed, thus achieving more comprehensive coverage of the solution space. Additionally, adaptive adjustment of the inertia weight, by dynamically adjusting its value during the algorithm’s operation, can further enhance the algorithm’s adaptability to complex search spaces, thereby enhancing its performance in a variety of optimization problems.
According to the algorithmic process, the BP neural network and particle swarm optimization algorithm parameters are initialized with the following settings: number of input layer nodes,
; number of hidden layer nodes,
; number of output layer nodes,
; learning rate,
; and inertia factor,
. Furthermore, based on Trelea’s [
27] derivation that the PSO algorithm performs best with the velocity inertia coefficient at this point, this paper adopts the following set of parameters: inertia weight
,
;
; initial population size,
; chaotic map parameter,
; maximum number of iterations,
; displacement simulation interval,
; dimension calculation formula,
; maximum velocity limit,
; initial velocity,
; and position limit,
,
.
Figure 12 illustrates the changes in the positions of all particles in the first and second dimensions with the number of iterations.
Figure 12a shows the change in the position of particles in the first dimension. There is a significant change in the position of particles in the first 7 iterations, indicating that the particles have conducted extensive exploration in the initial stage, attempting to cover multiple areas of the solution space. Between iterations 7 and 20, the position stabilizes at 2, indicating that the particle swarm has identified this area as the location of the optimal solution.
Figure 12b shows larger fluctuations in the first six iterations, with the search behavior of the particle swarm being more dispersed. As iterations proceed, the optimal solution in the second dimension is ultimately found.
Figure 13a shows the change in the optimal solution of the PSO algorithm optimized BP neural network with the number of iterations. The PSO algorithm optimized BP neural network shows a clear trend of convergence, with the optimal solution dropping from 0.95 to 0.25. The algorithm continuously approaches the optimal solution during the iteration process. In the later stages of iteration, that is, after 12 iterations, the optimal solution tends to stabilize, indicating that the algorithm has reached the global optimum. The fluctuation and downward trend of the optimal solution demonstrate that the PSO algorithm can effectively balance global and local searches, achieving optimization of the weight matrix of the BP neural network.
Figure 13b depicts the variation in the optimal fitness in PSO across generations. Initially, the optimal fitness of the population is set to zero. From the onset of the first generation, all particles participate in pairing with a fitness value of 435, resulting in the generation of offspring. As the fittest individuals are selected and the less fit are phased out, the optimal fitness progressively diminishes. By the third iteration, the optimal population fitness has achieved a value of 7, which remains unchanged through the twentieth iteration. Nevertheless, the algorithm retains an inherent stochastic nature, leading to variations in the optimal fitness trajectory of the population across different iterations.
The unit step function is set as the reference standard.
Figure 14 and
Figure 15 provide the numerical changes in K
p, K
i, and K
d in the PID control system optimized by the method proposed in this paper, the BP neural network, and the BP neural network optimized by the unimproved GA and WOA. To verify the performance of the improved algorithm,
Figure 16 compares the step response curves of the PID control system obtained by the method proposed in this paper and the other three methods.
From the simulation results in
Figure 14 and
Figure 15, it can be observed that the three control parameters of the PSO-BP neural network reach a stable state before 0.05 s. This is because, through training, the neural network can predict the system response for given PID parameters. Subsequently, the PSO algorithm is applied to the optimization process, where each particle represents a set of potential PID parameters. The PSO algorithm iteratively updates the particle positions, that is, the PID parameters, by simulating the behavior of swarm intelligence to find the parameter combination that maximizes the predictive performance of the BP neural network. The WOA algorithm-optimized BP neural network and the BP neural network PID controller are slightly slower. To fully observe the step response curves of the four methods, the time was set to 100 s.
Figure 15 shows that under the same step function input, the PSO-BP neural network control can reach a stable state at 14 s, with a fast response and almost no overshoot. It has the best overall performance among the four.
Figure 16 is the error curve of the control method proposed in this paper, PSO-BP+PID. The curve shows that at time 0 s, the error is at its maximum of 100%. As the control method enters the adjustment time, the error gradually decreases, reaching 0 at 14 s, and remains unchanged thereafter. This reflects the adjustment time and the fluctuation after adjustment of the PSO-BP+PID in response to a unit step function. The error curve and the response curve can be considered as an inverse process.
Table 2 provides a detailed comparison of the PID controller parameters and specific system performance indicators obtained by the four methods. By comparative analysis, it is concluded that the PSO-optimized BP neural network PID controller has a significantly better overshoot and adjustment time compared to the other three control methods. The overshoot is only 1.37%, almost one-tenth of the GA and BP neural network, with an adjustment time of 14 s and a peak time at 14 s, which is far superior to the other three methods. The second best is the WOA-optimized BP neural network PID control method, with an overshoot of 4.25%, which also has certain advantages, but the PSO-optimized BP neural network PID controller method has the best performance, the strongest anti-interference capability, and the optimal adjustment time, thereby verifying the effectiveness and practicality of this method for the control of the EOD mechanical arm system.
4. Conclusions
In response to the precise control and response speed requirements of large-scale EOD mechanical arms, based on the analysis of their mechanical structure characteristics, a driving model of the EOD mechanical arm is established and a PSO-BP+PID position control algorithm is proposed. The following conclusions are drawn.
By using the combined simulation of Adams and AMEsim, a mechatronic-hydraulic system model is constructed to fully simulate and analyze the EOD mechanical arm under dynamic conditions, the dynamic model outputs the velocity and angular velocity of the hydraulic rod’s center of mass, while the hydraulic system model provides the resultant force and displacement. This approach addresses the challenge that a single software cannot meet the simulation requirements for multiple system couplings, laying the groundwork for subsequent control strategies. The simulation results are accurate and consistent with the actual working conditions.
A particle swarm optimization algorithm is proposed to optimize the connection weight matrix of the BP neural network, overcoming the problem of local minimums that may occur in the training process of the BP neural network through the global search capability of the PSO algorithm. The parameter adjustment of the PSO algorithm is simple and efficient. By adjusting the inertia weight, individual learning factor, and social learning factor, it is possible to balance the algorithm’s exploration and exploitation capabilities. This ensures the global search ability of the algorithm while enhancing the precision of the optimization for PID parameters.
The particle swarm optimization algorithm is introduced to optimize the neural network PID controller. By adjusting the traditional PID control parameters Kp, Ki, and Kd and comparing it with the system step response curves of the other three algorithms, this method has high control precision, a strong anti-interference capability, and better robustness.