1. Introduction
Accurate muscle force estimation enables many applications, including control of powered prostheses, medical rehabilitation, sports medicine, and human–machine interaction [
1,
2,
3,
4,
5], where electromyogram (EMG) signals have been extensively used. EMG signals can be acquired by electrodes located on the skin surface (non-invasive) or inserted into the muscle tissue (invasive) based on the EMG application. Surface electrodes are affixed to the skin and make contact through an electrolyte that can be either a gel or paste, or sweat in the case of dry electrodes. Surface electrodes are easy to use and provide information about muscle activity, with minimum discomfort to the participant. Invasive needle or wire electrodes are usually used to study the motor unit (MU) activities, since they provide localized information about neuromuscular activity [
6]. However, using multi-channel surface EMG electrodes along with signal decomposition algorithms, information at the MU level can be extracted [
7]. Therefore, because of the risk of injury, discomfort, and infection that invasive electrodes pose, non-invasive methods are preferable. In addition, for applications like EMG-based muscle force estimation, although invasive methods can be used, this is generally not acceptable in humans due to the mentioned reasons.
The surface EMG (referred to as EMG for simplicity) is often utilized to estimate the underlying neuromuscular activation that leads to force generation. Many studies have been carried out to estimate muscle force by estimating the relationship between EMG signals and output force [
8,
9,
10,
11,
12,
13,
14]. Hill’s muscle model [
15] is often used, where estimated muscle activation levels are used as inputs to the model, and the generated muscle force is calculated as the output [
16,
17]. Other methods including polynomial functions [
11,
12], linear regression [
18], neural networks [
13,
19,
20,
21], parallel cascade identification [
22], and fast orthogonal search (FOS) [
8,
9,
10,
16] have been used to identify the EMG–force relation without requiring any knowledge of muscle and joint dynamics.
Conventional EMG–force modeling techniques generally used EMG recorded from a single bipolar electrode attached to the belly of the muscle. Recently, researchers have investigated EMG-based force estimation using multiple bipolar electrode pairs [
9,
18,
22,
23] and high-density (HD) EMG recording [
10,
24,
25,
26] because of the enhanced information about neuromuscular activation as well as the improved muscle force estimation results [
10,
22,
24]. However, there are challenges associated with using a large number of electrodes. Processing the acquired HD-EMG requires more time and effort due to the high dimensionality of the data, which might contain redundant and correlated information. Additionally, the complexity of the systems, computational cost, and power consumption can make the HD and multi-channel recording applications limited. Studies have been done to manage dealing with recorded HD-EMG data by using fewer channels through ensemble learning techniques [
10] and by applying principal component analysis (PCA) [
24]. Channel selection approaches can also be implemented to deal with highly correlated information obtained using HD electrodes and to extract the informative subset of channels that contains discriminatory information to reduce the computational cost and time, and hence reduce power consumption, without significantly compromising the performance.
Use of HD-EMG recording, in which data are obtained from an electrode montage placed over the muscle belly, can improve force estimation accuracy because the muscle activation level is better characterized by the multiple channels of data [
10]. Johns et al. [
10] utilized HD-EMG signals with an ensemble learning technique coupled with the FOS algorithm for force estimation. Monopolar HD-EMG recordings were obtained from the biceps brachii (48 channels), triceps brachii (64 channels), and brachioradialis (eight channels) during a variable-force isometric contraction exercise. Ensemble learning was used to manage the high dimensionality and correlation in HD-EMG data and to reduce the effect of outliers on HD-EMG force estimation. The authors considered a number of different HD-EMG spatial configurations. The highest accuracy was obtained for configurations in which a higher number of channels was used, while the lowest accuracy was achieved for the single channel configuration [
10].
The effect of different bipolar electrode configurations and directions on EMG-based force estimation was investigated using HD-EMG recorded from the triceps brachii during isometric–isotonic extension of the elbow [
24]. Force estimation error was expressed as the root mean square difference (RMSD) between the normalized force and the normalized EMG. Different configurations were compared to find the best bipolar orientation. Additionally, PCA was used to manage the highly correlated information obtained from HD-EMG recording. PCA was applied to the EMG signals, then the first few modes were discarded, and the linear envelopes of the remaining modes were summed to estimate the neuromuscular activity. With respect to force estimation, the PCA-based approach showed a substantial improvement over other configurations/directions. Therefore, it was concluded that PCA can be considered as an effective tool for extracting relevant force-related information from a high-density EMG grid, thus improving the quality of EMG-based muscle force estimation [
24].
Channel selection and dimensionality reduction have been widely used in classification studies of bio-signals to provide faster processing and improved model performances [
27,
28,
29,
30,
31,
32,
33,
34]. Geng et al. [
27] proposed a novel channel selection approach called multi-class common spatial pattern (MCCSP) for channel selection in EMG pattern-recognition-based movement classification. The proposed algorithm involves calculating a matrix in which one motion class variance is maximized and the sum of the variances of all other classes is minimized. Spatial pattern matrices corresponding to all included classes were then obtained, and were utilized to select an optimal subset of channels from 56-channel HD-EMG recordings. Ong et al. [
29] used a PCA-based channel selection method for visually evoked potentials to classify alcoholics versus non-alcoholics. They selected 16 from an initial set of 60 channels based on their contribution to the total variance. Their results suggested that the classification performance experienced a negligible decrease when 16 channels were utilized instead of the initial 64 [
29]. PCA has also been used for feature selection and sufficient parallel channel selection from sensor arrays to reduce the dimensionality of the raw data to investigate the performance of a bionic olfactory model to classify two datasets—three classes of wine and five classes of green tea [
35]. Guler et al. [
36] recorded EMG signals from the biceps brachii and abductor digiti minimi muscles to diagnose and classify neuromuscular disorders. The extracted features from the EMG signals were the fast Fourier transform (FFT) coefficients. Before applying the feature set to the multilayer perceptron (MLP) and support vector machine (SVM) classifiers, PCA was used to reduce the dimensionality. Then, PCA coefficients were applied as inputs to classify the EMG data into three conditions: Normal, neuropathy, and myopathy, where the best classification accuracy of
was obtained by SVM [
36]. Shih et al. [
30] used a machine learning-based approach to reduce the average number of electroencephalogram (EEG) channels from 18 to 4.6 for seizure detection. A feature selection algorithm with backward elimination was used for channel selection, while the mean detection accuracy only experienced a slight decrease. Several studies have focused on channel selection in EMG applications [
31,
32,
33]. Martinez et al. [
31] recorded HD-EMG signals to estimate grasp force. They examined subsets of 4, 8, and 16 channels, in comparison to the full set of 168 channels, to determine the minimum amount of information needed for grasp force prediction. Three channel reduction methods were used: selecting channels in the center of the grid only (Fix-Ridge); selecting the Fix-Ridge channels with feature selection using elastic nets analysis (Fix-EN); and selecting channels using lasso with EN feature selection (LassoG-EN) in [
31]. The Fix-EN was selected as the best method, since there was no statistical difference between methods, and 16 channels proved to have the best trade-off between complexity and performance [
31]. Al-Ani et al. [
32] recorded multi-channel EEG (64 channels) and EMG (eight channels) data to classify up to four alertness states (engaged, calm, drowsy, and sleep) using EEG, and six different grip and finger movements using EMG. A dynamic channel selection method was proposed to select channels that are more relevant to the given classification task for both signal types. This method was compared with exhaustive search, and it was concluded that while for a small number of channels (e.g., eight channels), an exhaustive search is feasible and yields good results, the approach is not possible for a larger number of channels [
32]. A novel variable selection method based on Kullback–Leibler (KL) information to select channels to classify four hand motions using EMG has been explored [
33]. The measured signals were considered as probability variables, and their probability density functions were estimated using probabilistic neural network learning based on KL information. The results indicated that the average classification rate using the selected channels is almost the same as using all the channels [
33].
The goal of this work is to develop an efficient and effective technique for selecting a subset of available HD-EMG channels for force estimation while minimizing the effect on force estimation accuracy. We propose two different approaches: One is based on PCA in the frequency domain and the other is based on a novel algorithm that calculates and maximizes an index, which is called the power-correlation ratio (PCR). Then, the performance of these two proposed approaches is evaluated and compared to the results of channel selection using PCA in the time domain as a baseline. PCA in the time domain has been used by other studies for channel selection, dimensionality reduction, and feature selection [
14,
24,
29,
35,
37,
38,
39]. Preliminary versions of this work have been reported by Hajian et al. [
26,
40]. This paper extends and unifies those works by defining and assigning the PCR method capable of measuring a specific and informative index for channel selection. Moreover, a larger and extended dataset, which contains three different joint angles and two forearm postures, was recorded and analyzed. Subsets of one, two, and three channels were selected using the proposed techniques of PCR and PCA in the frequency domain. Additionally, further analysis was performed where the effects of using more than one principal component (PC) for channel selection by PCA have been investigated. Finally, the results of the proposed PCR method have been compared to PCA in both the time and frequency domains on the extended dataset.
3. Results and Discussion
The mean and standard errors (SE) of the %NMSE values for seven channels per muscle and subsets of one, two, and three channels per muscle are presented in
Table 1. The channel subsets were obtained using time-domain PCA (PCA
), frequency-domain PCA (PCA
), and PCR on the HD-EMG data; force estimation was done using FOS. As the number of selected channels increases, the %NMSE values decrease in most cases for different joint angles and forearm postures.
A two-way ANOVA was performed on the results of force estimation with the selected channel subsets and channel selection methods as factors to analyze the significance of changes in force estimation accuracy. ANOVA is a robust method that can be used to show significance even in data with a non-normal distribution [
45]. The results indicated that there were significant effects of the subset of selected channels as well as the channel selection methods on the average errors across subjects (
p-value =
). Then, one-way ANOVA was applied to investigate the effect of channel selection methods with respect to the entire set of channels for each subset of channels. The results indicated that there are significant differences between the channel selection methods for subsets of three and one selected channels per muscle (
p-value =
and
2.61
(
) and
p-value =
and
2.61
(
), respectively). However, no significant effect was found between PCA
, PCA
, and PCR when comparing force estimation results for the seven-channel HD-EMG versus the two-channel per muscle subset (
p-value =
). To explore where the significant differences are, pairwise comparisons were done using the t-test for the subset of three channels, which resulted in improvements in force estimation for PCA
and PCR methods. The t-test results are shown in
Table 2 for a subset of three channels. For this analysis, the Bonferroni correction method was applied at the
confidence level.
It is illustrated in
Table 2 that there are significant effects when PCA
and PCR were employed for three-channel per muscle subset selection. However, there is no significant effect when PCA
was used to select a three-channel per muscle subset versus the full set of HD signals.
We did not perform t-test analyses on single-channel subsets, since no improvement in force estimation was observed. When selecting one channel, the PCR method shows the closest performance to the full-set, followed by PCA and PCA. However, in single-channel selection, PCA and PCA showed very high standard errors, indicating low consistency in force estimation. Evidently, while selection of only one channel provides a dimensionality reduction of , it reduces our force estimation accuracy.
Although there were no significant differences between the different approaches, for a subset of two channels, PCA
still increased the average and standard deviation of the error in the estimated force, while PCA
and PCR reduced the error, achieving improvements. Moreover, the standard errors of the errors were considerably reduced using these two methods when two channels were selected. Finally, when a subset of three channels was selected, PCA
further deteriorated the performance of the force-estimation process, while PCA
and PCR, averaged across all conditions, enhanced the results by almost
, and the standard errors were reduced.
Figure 2 shows a detailed comparison of the %NMSE values obtained for the three-channel subsets selected for each subject in comparison with the all-channels case. The %NMSE values shown for each subject are averaged over the three force levels recorded. For almost all subjects, either the PCR or PCA
methods resulted in better performance than using all seven differential channels, and the average %NMSE values are lower than those for channel selection using PCA
.
These results indicate that while utilizing seven channels per muscle for force estimation yields relatively good results, there is redundant information among the channels, which is discarded through our channel selection techniques. On the other hand, it is evident that a single EMG channel does not contain enough information to allow for accurate force estimation.
Our proposed PCA technique (in both time and frequency domains) selects the channels with the largest contribution to the first PC. PCA in the time domain has been widely used as a common method to reduce the redundancy in data. However, we proposed the novel idea of using PCA in the frequency domain to select a subset of channels with the highest variance, where our results indicated that applying PCA in the frequency domain to select channels has the potential to improve the force estimation accuracy, while PCA in the time domain does not. To further explore the use of PCA for channel selection, we also examined voting for the selected channels based on the contributions of the PCs to the first two and first three PCs. The mean and SE values for each experimental condition are shown in
Table 3, where the results suggest that considering more PCs does not improve the force modeling performance. As more PCs are selected in the time and frequency domains, the error is increased for each experimental condition.
Figure 3 illustrates the average results across the different experimental conditions when the top one, two, and three PCs are considered.
These results were statistically investigated using single-factor ANOVA. The ANOVA results suggest that there are significant effects for using different numbers of PCs to select three channels for both PCA
(
p =
) and PCA
(
p =
). Next, a t-test was applied for pairwise comparison. The PCA methods using one and two PCs were not statistically different in either domain. When comparing the usage of one PC versus three PCs for channel selection,
p =
for PCA
and
p =
for PCA
;
p=
for PCA
and
p =
for PCA
when comparing the usage of two PCs versus three PCs. However, considering the Bonferroni correction, there was no significant effect for using more than one PC for channel selection on force estimation accuracy in either the time or frequency domain. Therefore, using the first PC is sufficient to select channels to improve the force estimation accuracy because the first PC represents the maximum variation in the data. In
Figure 4, the percentage of the total variance explained by each PC for the long head of the biceps brachii of one subject at
joint angle, neutral posture, is shown. This figure shows that the first PC has
of the total variance, while the other components represent
,
,
,
,
, and
, respectively. Therefore, there is no need to use more than one PC to select a subset of channels, since the first PC contains the maximum information.
3.1. Evaluation and Comparison
Staudenmann et al. [
24] used PCA on EMG signals, recorded using HD electrodes from the triceps brachii muscle during elbow extension, in order to improve EMG-based force estimation. They summed the PCs, and their evaluation criterion was the RMSD between the predicted and measured force values. They found that PCA reduced RMSD by approximately
for optimally aligned multiple bipolar electrodes. In our work, we did not use PCA to rebuild the EMG signal; instead, PCA was used to find the channels with the highest contribution (higher coefficient) in the PC space, indicating maximum variance of the signal. In addition, we achieved improvements of
and
NMSE compared to the seven-channel per muscle configuration using PCR and PCA
methods, respectively.
HD-EMG recordings were used for force estimation using an ensemble learning technique coupled with the FOS algorithm as an outlier detection method [
10]. Four HD spatial configurations were considered. The lowest error was obtained in a configuration where all the bipolar channels (44 channels) were used for force estimation (mean %NMSE
). In configurations where the number of channels was reduced (11 and seven channels), %NMSE
and %NMSE
were obtained, respectively. In this study, smaller subsets of channels were selected, but no attempt was made to optimize the chosen subset of channels. In comparison, through a similar reduction of approximately
in the number of channels, our method has been able to reduce the %NMSE through channel selection with PCR and PCA
. This indicates the importance of channel selection not only for reducing the dimensionality, but also to improve the force modeling performance.
In a relatively different application of channel selection compared to our study, Geng et al. [
27] proposed the MCCSP to select the optimal subset of channels from HD-EMG recordings for motion classification. The classification accuracy was defined as the percentage of the number of correct classifications over the total number of classifications. Eighteen monopolar EMG channels were selected among the total 56 as the sufficient number of channels for accurate motion classification, achieving an average classification accuracy of
in comparison to an accuracy of
for the entire set. When using a bipolar configuration, the average classification accuracy was
for 18 selected channels compared to
for the complete set.
Our channel selection methods provide a simple, practical, and effective way of dealing with the high dimensionality of EMG data in order to find a subset of channels that contributes the most to force estimation. PCA and our PCR channel selection methods improve the force estimation accuracy by and , respectively, while reducing the dimensionality by . Since we use a subject-dependent force estimation technique in FOS, our PCR and PCA techniques also select a unique set of channels for each subject. Our investigations showed that while either channel 3 or 4 was frequently among the selected channels (which matches the SENIAM recommended location), as expected, different subsets of channels were selected for force estimation for different subjects, as each subject has a unique physiology. To the best of our knowledge, our method outperforms available similar works in the literature that have tackled this problem.
3.2. Limitations
In regards to our proposed techniques, PCR and PCA, we are aware that the selected channels may not necessarily be the global optimum subset. In fact, a brute force search may result in a subset that is different from our proposed methods, further decreasing the subset size while increasing the accuracy of the estimated force. Nonetheless, a search approach is extremely time consuming, as all the possible subsets of channels need to be evaluated, which amounts to possible permutations where n is the number of channels (in our case, per muscle).
Another limitation of our method is that, similarly to many other studies in the field, the proposed channel selection technique is user-dependent, including the FOS algorithm used for force estimation in our study. Naturally, such algorithms depend on parameters learned from the user prior to deployment. This is also the case for our proposed channel selection method, as it requires the power spectral density and correlation between channels from the user, which, in fact, can be measured together with the calibration/learning phase of the FOS force estimation technique. In addition, the selected subset of channels may, in fact, change with the choice of force estimation technique. For example, we have yet to explore whether our method would find the same subset if an artificial neural network classifier was used instead of FOS. The same uncertainty applies to the pre-processing steps. However, the pre-processing methods used in this paper are fairly standard and common when utilizing EMG.
A general challenge with HD electrodes is that, compared to regular single-channel electrodes, HD electrodes have a higher chance of facing artifacts in the recordings given the relatively large area of the electrode pads, which can lift from the skin or may bend during muscle flexion. In such cases, re-recording of the data is often necessary in order to obtain usable recordings for EMG-based force estimation.
Finally, the third limitation of our work is the fact that we have not evaluated the approach on larger numbers of EMG channels or for other types of contractions. While we expect our approach to generalize fairly well to larger sets and different contractions (for example, two-dimensional HD-EMG, which has a greater number of channels, or isotonic/dynamic contractions), we have yet to study such scenarios.
4. Conclusions and Future Work
In this study, linear electrode arrays were used to record EMG signals during isometric contractions over three elbow flexor muscle locations while simultaneously recording the generated force. The recorded EMG data were first pre-processed, and the force induced at the wrist was estimated using FOS. Two techniques, one based on PCA in the frequency domain and a second, novel index called the power-correlation ratio (PCR), were used to select a subset of channels for force estimation. PCA and PCR improved the force estimation accuracy by and while reducing the dimensionality by .
Both proposed methods, PCA
and PCR, achieved lower errors for force modeling in comparison with the full set and the baseline, PCA
, when a subset of three channels per muscle was selected. Although the accuracy of the PCR is slightly better than that of PCA
for most of the subjects and experimental conditions, as shown in
Figure 2, they are not statistically different. However, the performance of PCR for the subset of one and two channels is better than that of PCA
, and is much closer to the full set. This indicates that the PCR is better approach than PCA
when dimensionality reduction is more important, since compromising the accuracy is lower with PCR than PCA
if fewer number of channels will be selected. In addition, in this paper, we showed that using the first PC is sufficient to select channels to improve the force estimation accuracy, though this might need more investigation if the application is changed. Therefore, more analysis is needed to implement PCA
compared to the PCR.
For future work, we will explore machine learning and deep learning techniques for channel selection by training supervised models. Additionally, we will investigate the estimation performance of the FOS model over time. Moreover, in addition to FOS, other force estimation methods such as machine learning approaches, for example, using neural networks, will be used, and our method will be evaluated. Furthermore, we will explore different types of muscle contractions, such as dynamic contraction. Finally, we believe that by incorporating information regarding the dynamics of joints, for example, through wearable inertial sensors, more accurate force estimation and channel selection may be achieved.