1. Introduction
Recently, the number of patients with motor function injuries has been increasing every year, posing significant challenges to their lives and their families [
1,
2]. Rehabilitation training is crucial for these patients to help them overcome these challenges as soon as possible. The accurate implementation of rehabilitation training has emerged as a focal research area in medical rehabilitation and training equipment domains. In the equipment domain and with advancements in science and technology, the precise acquisition of human joint positions has gradually become a new research hotspot. In particular, visual posture capture and inertial navigation system (INS)-based posture capture are the prevalent examples. For instance, ref. [
3] detailed the use of motion capture sensors for acquiring human motion data, which are subsequently processed in accordance with relevant data formats. Ref. [
4] reported a 3-D tracking of upper limb movement by using two inertial sensor systems. Additionally, the scheme for upper limb motion monitoring in neurorehabilitation utilizing low-cost inertial sensors such as those found in Sony Move, Nintendo Wii (Wii Remote with Wii MotionPlus), and smartphones has been developed [
5]. Ref. [
6] employed two wearable inertial sensors that are placed near the wrist and elbow joints to measure the human motion of the upper limbs. Research on video-recognition-based virtual reality for three-dimensional human motion pose capture, as discussed in [
7,
8], reported favorable results in accurately capturing and recognizing dual-category human motion gestures. Ref. [
9] presented a refined technique for reconstructing accurate motion from partially captured and noisy postures using Kinect, with experiments demonstrating significant accuracy of posture recognition under severe occlusion conditions. Ref. [
10] proposed a computer vision algorithm for automatic construction of a human body skeleton model, employing a method that segments the body into primary components by calculating the curvature of a B-spline parameterized human contour. This approach effectively addressed the complex issue of initialization in a vision-based markerless motion capture system for the human body. Investigations into wearable sensor methodologies for assessing lower limb motion are reported in [
11,
12], guiding a novel, self-contained, and universally adaptable system capable of consistent tracking of human lower limbs without substantial differences. Ref. [
13] reported an innovative wearable sensor system developed utilizing a commercial system-in-package with inertial and magnetic sensors. Further, Ref. [
14] reported a new algorithm for filtering foot motion and estimating foot kinematics during normal walking using inertial and magnetic sensors in relation to an earth-fixed reference frame. Lastly, Refs. [
15,
16] discussed a monitoring system based on inertial sensors for measuring and tracking upper limb movement in humans utilizing two wearable inertial sensors positioned close to the wrist and elbow joints.
Employing the Kalman filter (KF) effectively mitigates measurement drift, demonstrating commendable accuracy and reliability. A novel algorithm for motion tracking has been developed by fusing data from two wearable inertial sensors positioned near the wrist and elbow joints. Empirical findings showcased that the algorithm exhibited proficiency in achieving unwavering motion tracking of human arms over a 45 s duration being devoid of any perceptible measurement drifts [
17]. Despite the capabilities of the aforementioned measurement methods for human motion capture, they suffer from many limitations. The INS-based method is also prone to error accumulation, and visual solutions encounter recognition challenges in scenarios in which limbs intersect. Employed with the equipment, data fusion filters have shown potential in improving localization precision [
18]. A prominent example of such filters is the KF, which has been the subject of numerous fusion efforts [
19]. In [
20], a novel approach involving the utilization of a predictive quaternion KF is reported for continuous wireless tracking of lower limb posture of humans, effectively overcoming wireless communication outages. In addition, Ref. [
21] reported a robust KF by deriving robust estimators for Kalman filtering that incorporate constraints on state parameters by leveraging the principles of the generalized maximum likelihood Lagrangian condition. Simulation results and semiphysical trials revealed the efficacy of an adaptive KF in improving in the accuracy of state variable estimation. Ref. [
22] introduced a novel expectation–maximization (EM) algorithm with guaranteed convergence to derive the maximum likelihood estimator (MLE) solution. Furthermore, Ref. [
23] discussed the sigma-point update of a cubature KF of the Global Navigation Satellite System (GNSS)/INS integrated environment. Notably, the discussed KF-based methods require an accurate data fusion model and a comprehensive noise description, which is hard to achieve in practice [
24].
To surmount this obstacle, the implementation of a finite impulse response (FIR) filter is proposed. In [
25], an improved FIR filter was proposed for ultrawide-band (UWB) localization, integrating the FIR filter with a predictive model and extreme learning machine (ELM) to enhance the accuracy of UWB-based localization. Ref. [
26] introduced an improved iterative FIR state estimator. Although the FIR filter showed increased robustness, its localization accuracy may not surpass that of KF when the KF model is precise. The increasing prevalence of motor function injuries significantly impacts the lives of patients and their families. Thus, accurate implementation of rehabilitation training for patients has become increasingly central in research in this field. This study introduces an adaptive EM-based KF/FIR integrated filter for INS-based posture capture of human upper limbs. Initially, a data fusion model for the wrist and elbow positions is developed. The
Mahalanobis distance is then employed to assess the performance of the filter. In the integrated filter, when the performance of KF deteriorates, the EM-based KF is utilized to improve the noise estimation accuracy. Subsequently, the
Mahalanobis distance is used to evaluate the performance of the EM-based KF. Upon further decline in the performance of the EM-based KF, the FIR filter is employed to maintain the effectiveness of the data fusion filter. This research employs the proposed EM-based KF/FIR integrated filter for measuring the wrist and elbow positions. Empirical results demonstrate the effectiveness of the method in providing accurate position estimations of its capacity to overcome the challenge. This study contributes significantly in the following areas:
An INS-based motion model for human upper limbs is formulated, focusing on the wrist and elbow positions. The state vector comprises their position and velocity in the East–North–Up frame. Further, IMU-measured positions are employed as the input. The output of the two data fusion filters are used to determine the posture of human upper limbs.
A EM-based KF/FIR integrated filtering method is designed. It leverages the INS-based motion model of human upper limbs, using KF to estimate wrist and elbow positions from INS-based measurements. The Mahalanobis distance is used to evaluate the performance of the filter, employing the EM-based method and subsequently the FIR filter as the performance of KF deteriorates.
Experimental results affirm the superior performance of the proposed algorithms compared to traditional counterparts. A real-world test using two IMUs for INS-based wrist and elbow position measurements and Kinect 2.0 used to provide reference values demonstrate the effectiveness of the proposed EM-based KF/FIR integrated filter over traditional KF and FIR filters.
The remaining sections of this paper are organized as follows:
Section 2 delves into posture capture of human upper limbs based on INS.
Section 3 details the design of the EM-based KF/FIR filter used for capturing motion of human upper limbs.
Section 4 summarizes experimental tests, and conclusions are presented in
Section 5.
5. Conclusions
The increasing prevalence of motor function injuries presents substantial challenges for patients and their families. Consequently, the accurate execution of rehabilitation training has emerged as a critical research area. This study introduces an EM-based KF/FIR integrated filter for posture capture of human upper limbs, focusing on precise wrist and elbow position information. In this work, the wrist and elbow’s position have been considered. Thus, we employ their position and the velocity in East–North–Up frame as the state vector, and their positions measured by the IMUs are used as the measurements. The outputs from the two data fusion filters are then used to determine the posture of human upper limbs. In the proposed method, the filter performance is assessed using the Mahalanobis distance. When the performance of the KF is suboptimal, the EM-based KF is utilized to enhance performance. Subsequently, if the performance of the EM-based KF declines, the FIR filter is employed to increase localization accuracy. An EM-based KF/FIR integrated filter is used for the posture capture of human upper limbs. A real-world test was conducted to demonstrate the effectiveness of this approach. In the test, two IMUs provided INS-based wrist and elbow positions, while Kinect 2.0 was used to obtain reference values. The proposed EM-based KF/FIR integrated filter was compared with the traditional KF and FIR filter. The results indicated that the proposed EM-based KF/FIR integrated filter outperforms the conventional KF and FIR filter in localizing wrist and elbow positions.