1. Introduction
In recent years, there has been a growing focus on the research and development of rehabilitation robots due to their potential in providing personalized and effective rehabilitation for individuals with mobility impairments resulting from neurological disorders such as stroke, spinal cord injury, or brain injury [
1,
2]. These robots offer a new approach to traditional rehabilitation methods by addressing their limitations. Rehabilitation robots can establish a direct connection between the user’s nerve signals and the robot’s actuators by integrating with the human nervous system [
3,
4,
5]. This integration allows for precise and natural movements, enhancing the effectiveness of rehabilitation exercises [
6]. They enable more intensive and repetitive training, crucial to the formation of new neural pathways in the brain. Rehabilitation robots also provide customized and targeted training, which is challenging to achieve with traditional methods, and offer objective measurements of patient progress, allowing clinicians to monitor advancements and tailor treatment regimens accordingly [
7]. Additionally, these robots create a safe and controlled environment for patients to exercise and recover, minimizing the risk of further injury. With ongoing advancements in this field, the future holds immense promise for the integration of rehabilitation robots with the human nervous system, ushering in a new era of personalized and efficient rehabilitation for individuals with mobility impairments [
8,
9].
The presence of unknown dynamics within rehabilitation robots can have a detrimental impact on the system’s performance and raise safety concerns for patients [
10]. Consequently, it is vital to accurately identify and estimate these disturbances to develop effective control strategies that can minimize their effects [
11]. Despite the inherent challenges, conventional techniques such as adaptive control strategies [
12,
13] and Kalman filters [
14,
15] have been extensively employed for estimating and predicting disturbances in rehabilitation robots. While these methods provide theoretical guarantees and facilitate the creation of adaptable control strategies that can accommodate the system’s dynamic behavior, it is worth emphasizing that the majority of existing algorithms rely on conventional approaches and lack the incorporation of state-of-the-art techniques. Thus, there remains ample opportunity for advancements in the field of dynamic identification within rehabilitation robotics.
In this regard, utilizing neural networks to estimate the dynamics of rehabilitation robots offers distinct advantages over traditional methods. Neural networks, given their universal approximation capabilities, can adeptly estimate complex dynamics without presuming the nature of unknown disturbances, a limitation often seen in conventional techniques. Furthermore, neural networks provide a broader, controller-independent approach, ensuring versatility. Their ability to model intricate nonlinear relationships might also yield more accurate representations of robotic dynamics, especially in contexts with nonlinear or complex events. The dynamic nature of human–robot interactions in rehabilitation settings necessitates an adaptable system, a requirement neural networks meet by adjusting to incoming data in real-time. However, the application of such networks demands extensive data and computational resources. Thus, while neural networks hold promise, their deployment mandates careful consideration.
One of the significant challenges in identifying non-linear real-world systems lies in the sampling process [
16,
17]. The accuracy of samples is not always guaranteed, and there are instances when the required information is not readily accessible. The process of acquiring precise samples from the unknown dynamics of rehabilitation systems presents a substantial challenge in their dynamic identification. Consequently, numerous studies within this field tend to address the unknown dynamics as disturbances in control applications, rather than undertaking their identification. Nevertheless, recent advancements have paved the way for accurate estimation of the unknown dynamics of these systems, allowing the acquired information, as well as insights from the physics of the problem, to be effectively utilized in subsequent tasks of rehabilitation robots. By integrating these advanced techniques into dynamic identification, along with leveraging the inherent knowledge of the system’s physical behavior, there exists a significant potential to enhance the control and overall performance of rehabilitation robots. Such an approach recognizes the critical role played by precise dynamic identification in ensuring the efficient operation of these robots.
One of the state-of-the-art techniques barely used for identification in rehabilitation robots is self-attention-based neural networks [
18]. This powerful technique captures relationships in sequential or spatial data, improving accuracy in recognizing body movements and coordinating rehabilitation activities. In recent years, there has been significant research and development in the field of self-attention-based neural networks, with promising results in various domains such as natural language processing [
19,
20], computer vision [
21,
22], and reinforcement learning [
23,
24]. However, their application in rehabilitation robots for identification purposes is still underexplored. One of the key advantages of self-attention-based neural networks is their ability to consider the relationships between different parts of the input sequence or spatial configuration. For instance, in the case of human motion analysis, the robot needs to understand how various body parts interact and move in coordination. By using self-attention, the neural network can effectively learn and capture these dependencies, enabling more accurate identification and tracking of body movements. Further research is needed to fully explore its potential in this field.
Advancements in rehabilitation robot research have the potential to significantly improve patients’ quality of life. In line with this objective, our study introduces a novel algorithm that utilizes a finite time sampler and neural network to uncover the unknown dynamics of rehabilitation robots. To the best of our knowledge, there is no prior research that applies finite time sampling to rehabilitation robots and then utilizes this information within a neural network to uncover the robot’s undisclosed dynamics. What distinguishes our proposed algorithm is its robustness, physics-informed nature, and integration of a distinctive self-attention layer within the neural network, ensuring its robustness and dependability. The employed self-attention mechanism allows the neural network to assign different weights to different elements of the input sequence or spatial layout, emphasizing the most relevant information for the identification task. This attention mechanism enables the network to focus on important features while suppressing irrelevant or noisy inputs, leading to improved performance and robustness. Additionally, the algorithm combines a finite time sampler with the self-attention recurrent neural network (RNN) to provide real-time samples of unknown dynamics.
The structure of the current study is as follows. In
Section 2, a description of the dynamical model used in a 2-DOF knee rehabilitation robot is provided.
Section 3 outlines the finite time sampling process and establishes its convergence. In
Section 4, the proposed algorithm is delineated and its advantages are discussed.
Section 5 presents the testing of the proposed algorithm in two experiments, highlighting its excellent performance. Finally, in
Section 6, the study concludes by presenting the findings and suggesting future research directions.
2. Dynamic of the System
The general mathematical formulation of a rehabilitation robotic system, comprising a collection of
interconnected elements, is presented with the following second-order ordinary differential equation (ODE) [
25]:
The dynamics of the robotic system are represented by the position (
), velocity (
), and acceleration (
) vectors, which exist in a multidimensional real number space (
). The input torque (
)) is also a part of this space, while the inertia matrix (
) is a symmetric and positive definite matrix with dimensions n × n. Moreover, (
) denotes the combined effects of Coriolis and centripetal forces, and
indicates the gravitational forces exerted on the system. Additionally,
is the Jacobian matrix and is assumed to be nonsingular. Additionally, the vector representing the constrained force exerted by constraints is denoted by
. It is important to mention that the matrix
meets the requirements of symmetry as stated in the study by Lee and Harris [
26]. Additionally, this dynamic equation of the systems can be reformulated using a linear parameterization as follows:
In this context, belongs to the p-dimensional real number space (). The is of dimensions () and is called the regression matrix. This matrix encompasses all unknown functions of the signals , and . It is assumed that the constrained force remains within a bounded range. From an engineering perspective, this assumption is considered reasonable, as the time-varying constrained force is expected to be limited.
In
Figure 1, we observe a knee rehabilitation robot with a 2-degree-of-freedom (DOF) configuration, specifically designed for vertical plane movements. The mechanical structure of the robot closely resembles that of a human leg, replicating its functionality. The knee joint, the focal point of the system, possesses two revolute joints. To actuate these joints, two motors are employed, connected via a transmission system. The motors are under the control of a computer, which receives input from sensors and electromyography signals. These sensors are responsible for measuring crucial parameters such as joint position, velocity, and acceleration, along with motor torque and current. Based on the acquired sensor data and electromyography signals the controller performs calculations and determines the necessary torque inputs required to achieve a desired trajectory. Subsequently, these calculated torque inputs are sent back to the motors, enabling precise control over the robot’s movements.
To obtain the equation of motion that corresponds to the parameters in Equation (1), we make use of the Euler–Lagrange equations. In this analysis, we designate
as the vector
, which represents the generalized coordinates. These coordinates are equivalent to the variables
and
in
Figure 1. The kinetic energy of the system can be mathematically formulated as follows:
In this context, the distance from joint
to the center of mass of link
is denoted by
, in which
can have values of 1 and 2.
and
denote the masses of links 1 and 2, respectively. Additionally,
represents the moment of inertia of link
around an axis perpendicular to the page and located at the center of mass of link
. The potential energy is mathematically expressed as:
By applying Lagrange’s equation, we derive the system’s dynamics as follows:
The Jacobian matrix is described as follows:
3. Finite Time Sampling
In this section, we present a finite time sampling technique for the rehabilitation robot and establish its convergence using the principles of Lyapunov stability theory. Consider the overall state space equation of a rehabilitation robot by introducing the notation
for
, we can express the dynamics of the robot as follows:
Taking into account Equation (1), one can derive the following expression:
To encompass both the uncertain terms of the system and the unknown dynamics within a single term, we introduce , which the proposed sampling mechanism should estimate in finite time.
Unlike most sampling methods, which do not guarantee the accuracy of their samples, here we prove the accuracy of our sampling algorithm by leveraging Lemmas 1 and 2, which are given by:
Lemma 1 ([
27])
. Suppose a continuously differentiable positive definite function that fulfills the following inequality:
Assuming that 0 , and the function demonstrates convergence to the equilibrium point within a finite time period.
Proof. Starting from the differential inequality (10), we have:
The derivative satisfies
, since we know both
and
terms are either negative or zero. Our objective is to establish finite time convergence of
To accomplish this, first, we separate the terms in Equation (11) and then integrate both sides over the interval [
]:
Since
, we know
is either decreasing or it is fixed; thus, we have
. Considering this and due to the fact that
is definitely positive, we have
We use this inequality, and this gives an upper bound on the integral as follows:
Integrating the left-hand side of Equation (13), we obtain:
Plugging the limits of the integral in Equation (14), we reach:
Inserting Equation (15) in Equation (13) results in:
Equation (16) gives a relation between time and the value of
Now, we want to calculate the convergence time (
), which means
= 0. We plug this value into Equation (16), and obtain:
Therefore,
and as a result,
We know therefore, and is positive and finite. As a result, we found a finite value for the upper bound of , which concludes the proof of Lemma 1. □
Lemma 2. When and 0 for , then the triangle inequality holds as follows: Now, we introduce variable
as follows:
where we define variable
in the following manner:
The variable
is mathematically defined based on the positive design parameters
. It is important to note that
is chosen to be greater than
to ensure the validity of the expression. Furthermore, considering odd positive integers
and
, where
, the estimation of the
is derived according to the following expression:
By considering Equations (7), (21) and (22), the following equation is derived:
By utilizing Equations (9), (23) and (24), we can deduce the following result:
Theorem 1. By employing the proposed sampling algorithm characterized by Equations (21)–(23) on the MIMO uncertain nonlinear system represented by Equation (9), the estimation error, denoted as , converges to zero within a finite duration.
Proof. We consider the following positive definite Lyapunov function candidate:
The expression for the time derivative of the function
is as follows:
The final line of Equation (27) is obtained by applying Lemma 3.
Lemma 3. Given the following function: By considering the given function, we can derive the following equation:
As we know
=
, as a results
Therefore, based on Lemmas 1–3, along with the satisfaction of the Lyapunov condition as expressed in Equation (30), we establish the finite time convergence of the sampling approximation to zero. □
4. Dynamic Approximation
Accurate estimation of uncertainties is crucial for robust and reliable control of robotic systems. In the past, conventional methods such as finite time disturbance observers have been commonly employed for uncertainty estimation in robotics. Conventional methods and finite time disturbance observers often rely on pre-defined mathematical models or heuristics for uncertainty estimation. These approaches may struggle to accurately capture nonlinear uncertainties that arise in complex robotic systems. Neural networks, on the other hand, excel at nonlinear uncertainty modeling by approximating complex functions. They have the capacity to capture intricate uncertainty patterns, enabling more accurate estimation in real-world robotic applications.
Noise is an inescapable component of real-world data, often adding complexity to various applications. In response, neural networks, especially RNNs, have become instrumental in distinguishing signal from noise due to their adeptness at processing sequential data patterns. Recently, advanced techniques have been developed to handle noise more effectively with neural networks [
28,
29]. For a comprehensive understanding and more detailed insights into these innovative methods, one can refer to [
30].
For rehabilitation robots, the presence of inputs and outputs is crucial for obtaining the system’s states when the states are designed to be observable. However, neural networks alone are limited to modeling the relationship between given inputs and outputs and cannot capture the unknown dynamic which is a part of the system. Moreover, their performance is optimal only under ideal conditions. This obstacle remains an open question in the field of robotics and identification. To overcome this challenge, we propose an RNN integrated with a finite time sampler, offering a novel approach to addressing these limitations.
4.1. The Proposed Algorithm
The structure of our proposed algorithm is illustrated in
Figure 2. As depicted in the figure, during the training process, electromyographic signals obtained from the nervous system are applied as inputs to the rehabilitation robot. These signals are time-dependent and serve to update the state of the rehabilitation robot. The states, along with their derivatives and time information, are then fed into both the RNN and the finite time sampler. The RNN’s loss function is defined based on the acquired samples and the estimated values generated by the RNN. Via backpropagation, the RNN is trained, and its weights and biases in all layers, including the self-attention layers, are updated. Once the training process is completed, the sampler is no longer required. Using the states of the systems and time information, the RNN is capable of estimating the unknown dynamics of the systems.
This algorithm incorporates physics principles with machine learning techniques to yield more precise system predictions. It can be summarized as follows:
An electromyographic signal is introduced into the system, and it acts as a torque according to Equation 1.
This electromyographic influence is then applied to the system dynamics, as visualized in
Figure 2.
The dynamic state of the system, along with the time, are used as inputs for the neural network. Simultaneously, this system state is also input to a finite time estimator.
The finite time estimator operates according to Equations 12, 13, and 14 to estimate an unknown dynamic factor, referred to as .
This estimated is then compared with the output of the neural network. The discrepancy between the estimated and the neural network’s output forms a loss function.
This loss function is instrumental in updating the weights and biases of the neural network during the training phase.
Once training concludes, the now refined neural network serves as a reliable estimator for the system.
Additionally, to train the neural network, we complete the following 5 steps:
For a given input vector, the network generates an output prediction utilizing its current parameter configuration (weights and biases). This prediction is the network’s approximation of the expected
- II.
Loss Quantification:
The computed output is contrasted against the estimated We compute the loss function by calculating the root-mean-square error (RMSE) between and its prediction.
- III.
Gradient Computation (Backpropagation):
Leveraging the chain rule of differentiation, gradients of the loss function with respect to each weight and bias in the neural network are calculated. These gradients denote the sensitivity of the loss to changes in each parameter.
- IV.
Update Weights and Biases:
Using the gradients computed in the backpropagation step, the weights and biases of the neural network are updated using an optimization algorithm.
- V.
Iterative Refinement:
The aforementioned steps are reiterated across multiple epochs, ensuring progressive optimization of the model’s parameters. In our neural network, the stop condition is indicated by a minimal fluctuation in the loss function. For our specific model, termination criteria are defined by the stability of the loss function: we halt training when the change in the normalized loss across five consecutive iterations is less than .
Our approach harnesses the power of self-attention algorithms, which offer distinct advantages in estimation tasks. Self-attention layers enable the model to simultaneously consider all input positions, capturing global dependencies efficiently. This capability proves especially beneficial in estimating quantities that rely on non-local relationships within the data. By attending to relevant information from the entire input, self-attention layers provide a comprehensive understanding that facilitates accurate estimations. These advantages can lead to a more accurate estimation of unknown dynamics in the knee rehabilitation robot. However, this approach also comes with limitations: it requires considerable memory, especially for long sequences; it lacks an inherent bias towards neighboring elements in a sequence, which may lead to missing out on localized patterns; the attention maps produced are not always easy to interpret; and the model may be distracted by noisy or irrelevant data, necessitating careful preprocessing. Balancing these benefits and limitations is crucial in developing an effective and efficient model. In what follows, we elaborate on the advantages of our proposed algorithm.
4.2. Accurate Sampling
In situations where explicit information or samples are unavailable, our proposed algorithm overcomes this limitation by incorporating a robust mechanism for obtaining accurate samples from unknown dynamics. This mechanism ensures the reliability and trustworthiness of the acquired samples for the neural network. By generating reliable samples for the RNN, our method demonstrates its practical applicability in real-world scenarios. This significant contribution directly addresses a crucial problem in the field, providing a solution that enables accurate sampling even in the absence of explicit information or samples.
4.3. Adaptability and Generalization
Conventional methods and finite time disturbance observers typically require the explicit specification of model parameters or tuning of observer gains. This limits their adaptability to varying operating conditions and environments. In contrast, neural networks can adaptively learn from data, making them more flexible and capable of generalizing to different robotic system configurations and dynamic environments. Via training on diverse datasets, our algorithm is capable to capture the underlying dynamics of uncertainties and adapt their estimation accordingly, resulting in improved performance across various scenarios.
4.4. Robustness to Complex Dynamics
Robotic systems often operate in complex and dynamic environments, where uncertainties can vary significantly. Conventional methods and finite time disturbance observers may struggle to handle the complexity of these dynamics and may exhibit limited robustness. Neural networks possess inherent robustness to complex dynamics due to their ability to learn from diverse and large-scale data. Hence, our algorithm can effectively model uncertainties arising from intricate and time-varying dynamics, providing more reliable and robust estimation capabilities in challenging operating conditions.
4.5. Integration of Sensor Data
Uncertainty estimation in robotics often requires the fusion of data from multiple sensors. For instance, here we use two electromyographic signals which are displayed in
Section 5. Conventional methods and finite time disturbance observers may face challenges in integrating heterogeneous sensor data, such as noisy or incomplete measurements. Neural networks excel in data fusion tasks by leveraging their ability to process and extract meaningful features from multiple sensor inputs. By incorporating sensor fusion capabilities, neural networks enhance uncertainty estimation by effectively utilizing diverse sensor information, leading to more accurate and reliable estimates compared to conventional methods.
4.6. Equipped with Attention Mechanism
The integration of attention mechanisms has showcased their considerable utility in the domain of time series classification. These mechanisms empower models to dynamically emphasize significant segments of sequences, effectively capturing essential temporal patterns and relationships. Within our tailored RNN architecture for time series classification, we have introduced a self-attention layer. This augmentation enhances the model’s capabilities by enabling it to selectively weigh the relevance of distinct time steps, a particularly advantageous feature for tasks where certain time steps carry more weight in determining class labels. In what follows, we delve into the formal mathematical framework of this self-attention layer within our RNN:
At each time step,
input vector
undergoes projection to derive query
, key
, and value
vectors:
in which
,
and
are weights for query, key, and value vectors, respectively. Then, we calculate dot products between the current time step’s query vector
, and all key vectors generate attention scores:
The computed attention scores undergo normalization using the softmax function to yield weights:
where
denotes the normalized attention score. The context vector
emerges as the weighted summation of value vectors as follows:
Subsequently, the context vector
merges with the RNN’s previous hidden state
to generate the new hidden state
for the current time step:
Via this incorporation of a self-attention layer, our RNN enhances its temporal dependency-capturing capabilities, ultimately refining classification accuracy by accommodating the varying relevance of different time steps.
In summary, when compared to conventional methods and disturbance observers, our method provides distinct advantages for uncertainty estimation in robotic systems due to its capacity for nonlinear uncertainty modeling, adaptability, and generalization. Moreover, the integration of self-attention layers in neural networks for estimation problems provides several benefits. From modeling complex relationships and handling variable importance to robustness in handling variable length and missing data, self-attention layers enhance the performance and robustness of estimation models.
5. Numerical Results
In this section, we assess the effectiveness of the proposed algorithm by conducting evaluations using two distinct experiments. The system parameters are given by , and . The design parameters of the sampler are as follows: , and . For the training process of the RNN, the Adam optimizer is utilized. The activation function employed is the hyperbolic tangent (tanh). The RNN consists of several layers, including a fully connected layer, two long short-term memory (LSTM) layers, and one self-attention layer. In the self-attention layer, the number of heads is set to 8, and the number of key channels is set to 64.
We have allocated 50 percent of the samples for training purposes and reserved the remaining 50 percent for testing to ensure a balanced distribution of data. The architecture of the neural network provided begins with a sequence input layer, which has been designed to receive sequences of data. In this case, the input sequences are vectors of length 5. This layer is followed by an LSTM layer with 128 hidden units. The LSTM layer is capable of learning patterns over time and is highly suited to time-series data. Subsequently, the data passes through a fully connected layer with 64 units, which is used to transform the dimensionality of the data. A tanh layer follows, where the hyperbolic tangent function is applied to the input, ensuring that the transformed data falls within the range between −1 and 1. After that, a self-attention layer is applied. Following this, another fully connected layer is employed, transforming the data into a 1-dimensional space. Here, the predictions are made based on the final hidden state. The MSE is then calculated as the average of the squared differences between the actual and predicted values. This computed MSE serves as a quantitative measure of the model’s performance, with lower values indicating a better fit. During training, the aim is to minimize this MSE value via repeated iterations of backpropagation, adjusting the model’s weights to improve prediction accuracy. For the training of this network, the Adam optimizer is employed, sequence padding to the left is incorporated, and data is shuffled before each epoch.
5.1. Experiment 1
In this specific instance, our goal is to evaluate the effectiveness of the suggested algorithm in accurately predicting the undisclosed behavior of a system. To clarify further, the undisclosed behavior can be described as follows:
where
denotes the force exerted by the user. The applied electromyographic signals to the system are displayed in
Figure 3. According to these signals, the system is stimulated, samples from unknown dynamics are automatically captured, and the RNN is trained. It is noteworthy that in real-world scenarios, the electromyographic signal is not merely added to torque. Instead, it has to be transformed or converted first. The true value, which is the desired torque to be applied to the system, is obtained after this transformation. This is a calibration process for each electromyographic signal, converting it into the appropriate torque. In the context of this study, the simplifying assumption is made that this transformation is linear. However, it is important to note that the method is not constrained by this assumption; the transformation could be non-linear or follow a different model.
The RMSE and loss function during the training process of RNN are illustrated in
Figure 4 and
Figure 5, respectively. The time history of unknown dynamic and predicted values in the testing dataset using the proposed algorithm is demonstrated in
Figure 6. As it is illustrated in this figure, the proposed finite time estimator successfully obtains precise samples, and the RNN effectively learns the intricate unknown dynamics. The performance of the RNN in predicting the test set is particularly noteworthy, showcasing remarkable results.
It is worth noting that taking into account the input torque range and the uncertainty stipulated in equation 27, our analyses reveal that the uncertainty bounds for and are calculated as 2.8 and 1.3 respectively. However, it is crucial to emphasize that this information has not been furnished to the sampler. Nonetheless, despite this absence, the sampler continues to perform adeptly within the current circumstances. This notable aspect of our approach’s exceptional performance can be attributed to the integration of self-attention layers in neural networks. In estimation tasks, it is evident that not all input features contribute uniformly to the target variable.
Some features may have a more significant impact, while others may be less relevant. Self-attention layers empower the model to assign varying attention weights to different input elements, effectively determining their importance in the estimation process. This adaptability enables the model to focus on the most influential features while attenuating the influence of less relevant ones, thereby improving overall estimation performance.
5.2. Experiment 2
Conventional approaches face challenges when it comes to identifying intricate dynamics encompassing complex time-varying functions. However, our innovative method possesses a resilient and intelligent framework that excels in handling such dynamics. As a result, it becomes feasible to make accurate predictions for unknown dynamic functions that remain elusive to traditional methods. To further highlight the efficacy of our proposed algorithm, we present an illustrative example featuring a sophisticated unknown dynamic function as follows:
The presence of the arctangent function () and the difference in the frequencies of the time-varying functions and introduces additional complexity to the signals of the system, making their estimation challenging. In this scenario, considering the input torque range and the uncertainty provided in Equation (28), our analyses indicate that the uncertainty bounds for and are 1.4 and 0.8 respectively.
Here, we utilize the electromyographic signals depicted in
Figure 3 to perform our analysis. In
Figure 7 and
Figure 8, we present the RMSE and loss function values observed during the training process. The identification results in the testing dataset are illustrated in
Figure 9, which clearly demonstrates that despite the complexity of the unknown dynamics, our proposed intelligent algorithm successfully detects and captures them. This remarkable achievement emphasizes the outstanding performance of our algorithm in dynamic identification.
5.3. Comparison with Other Techniques
Utilizing neural networks for estimating the unknown dynamics of rehabilitation robots presents substantial advantages over traditional identifiers. Firstly, due to their universal approximation capabilities, neural networks can effectively estimate complex and even discontinuous dynamics, requiring no assumptions about the nature of unknown disturbances. This is in stark contrast to conventional methods, which often make assumptions about the smoothness of these unknowns. Furthermore, unlike traditional estimators that are designed around specific controllers, thus limiting their applicability, neural networks offer a broader approach. They provide a general approximation that is not dependent on any particular controller, offering a significant degree of flexibility. Once a neural network is trained, it can adapt to work with any controller. In addition to this, neural networks have the unique ability to learn intricate, non-linear mappings from inputs to outputs, potentially modeling the robot’s dynamics more accurately, especially in scenarios involving non-linear or complex phenomena. Moreover, the unpredictable and dynamic nature of human–robot interactions, particularly in rehabilitation settings, necessitates a system that can adapt in real-time. Neural networks, thanks to their adaptability, can learn from incoming data and adjust their operations accordingly, unlike conventional methods that often rely on a static model. Therefore, the application of neural networks in the control of rehabilitation robots could potentially lead to improved accuracy, robustness, and overall better outcomes in complex, unpredictable environments.
Neural networks offer a compelling suite of advantages, but it is vital to recognize the inherent trade-offs. The process of training these networks necessitates considerable amounts of data and extensive computational resources. Further, their intricate nature often makes interpretation and validation more challenging than with conventional models. Therefore, while their potential benefits are substantial, the application of neural networks requires a thoughtful evaluation to confirm their appropriateness for a specific task, with an unwavering focus on safety and effectiveness. Interestingly, our proposed approach incorporates the use of a sampler, the accuracy of which carries theoretical guarantees. This aspect allows our method to enjoy benefits typically associated with conventional approaches, presenting a harmonious blend of traditional reliability and innovative potential.
In order to thoroughly assess the effectiveness of our developed method, we carried out a comparative analysis with the integral terminal sliding mode observer (ITSMC), as proposed in [
31]. In this comparative analysis, we introduce Gaussian noise with a mean of zero and a standard deviation of 0.01 into the system. We then evaluate the proficiency of our proposed technique in mitigating this interference. For situations with more pronounced noise, we suggest integrating a filter post-sampling, ensuring that the sampler’s output is filtered before being fed into the neural network (NN). Our findings are benchmarked against a conventional method. To guarantee an equitable comparison, we fine-tune the parameters of our sampler to correspond with those in the reference while still ensuring compatibility with our system’s requirements.
Figure 10 provides a side-by-side comparison of our results with those of the ITSMC observer. Upon closer inspection, as can be discerned from the zoomed-in plot, our method exhibits a more accurate alignment with the ground truth function. In contrast, the results of the ITSMC display considerable scattering. One of the distinct advantages of our method lies in the incorporation of the proposed estimator combined with the regression capabilities of neural networks. This synergy allows our method to deliver smoother and more reliable results. Such characteristics are not just statistically appealing but are of paramount importance for real-world applications, where consistency and reliability are often essential prerequisites.
To further evaluate the performance of our method, we have presented the data in
Table 1. This table delineates the numerical results of the average error for both our method and the ISMC. These results, presented in
Table 1, clearly underscore the effectiveness of our approach. Additionally, the proposed method provides smooth results.
For an unbiased comparison, it is noteworthy to mention the advantages of the ITSMC over our method. Specifically, the ISMC does not require any preprocessing or the learning intricacies associated with training a network. In contrast, our method necessitates a training process before it can be deployed in real-time applications. However, the proposed algorithm offers the possibility of end-to-end learning, where uncertainty estimation can be integrated seamlessly with other robotic control tasks. This integration is not readily achievable with conventional methods and disturbance observers, which typically require separate design and tuning. By jointly optimizing uncertainty estimation and control objectives, our technique enables a more holistic and integrated approach to uncertainty-aware robotic control. This end-to-end learning paradigm reduces the need for manual tuning and handcrafted designs, simplifying the development and deployment process while enhancing overall system performance, a capability lacking in conventional methods.
6. Conclusions
The current study introduced a new algorithm that combined a finite sampler and RNN to effectively identify and predict unfamiliar dynamics in lower-limb rehabilitation robots. To accomplish this, the study initially presented a dynamic model for a 2-DOF knee rehabilitation robot. Next, a nonlinear estimator was developed to capture samples from the dynamical system. By utilizing the output of the RNN and proposed sampler, the unknown dynamic of the rehabilitation robot is identified. The proposed RNN was enhanced with a self-attention mechanism, which played a vital role in devising effective strategies for practical applications, particularly when the input features held varying levels of importance. The proposed method’s effectiveness in identifying two distinct unknown dynamics was clearly demonstrated. Numerical analyses emphasized the algorithm’s proficiency in managing both dynamics adeptly, suggesting its broad applicability across diverse rehabilitation contexts. In addition, to more comprehensively assess the efficacy of our method, we presented numerical results juxtaposed with those from a state-of-the-art technique, ITSMC. Our findings highlighted that the proposed algorithm not only yielded smoother outcomes but also consistently achieved a reduced error rate in estimations. To continue advancing this field, we plan to evaluate and enhance the performance of the proposed method by incorporating fuzzy structures. This would allow for further improvement in the accuracy and efficiency of the algorithm, enabling the development of advanced control strategies for rehabilitation robots.