1. Introduction
Brain–computer interface (BCI) is a collaborative setup between the human brain and a computer device, which takes brain signals as input and tries to decode them into computer commands to direct external activities such as cursor control, wheelchair control, and silent-speech recognition [
1,
2]. The electroencephalogram (EEG) is widely used as a non-invasive way of measuring brain signals in BCI systems due to its high temporal resolutions. It captures cerebral activities through electrodes placed over the scalp. EEG is also used in diagnosing several neurological disorders such as Alzheimer’s and epileptic seizures [
3]. Generally, BCI records the brain activities, pre-processes them to extract features from the data and finally classifies them for identifying mental states [
4]. Therefore, analyzing the information contained in the EEG is of great importance. Although EEG is intended to record cerebral activities in the form of electrical pulses, it also captures electrical activities other than the cerebral activities, known as artifacts. EEG signals are widely contaminated with different types of artifact sources such as cardiogenic (ECG), ocular (EOG), and myogenic (EMG). These artifacts suppress the information contained in the EEG signal and lead to wrong interpretation of BCI systems and medical diagnosis. Hence, the elimination of artifacts from the EEG signal is of great importance for practical uses.
The elimination of EMG artifacts from an EEG is more challenging as compared to removing other types of artifacts from an EEG signal. Generally, the EMG artifacts occur in EEG signals due to various muscle movements near the head. Various muscle movements such as teeth clenching, swallowing, head movement, chewing, jaw movement, and tongue movement are also captured through electrodes and contaminate the EEG signals with higher amplitude, broad anatomical distribution, and wide frequency spectrum [
5]. Specifically, the EMG has a wide spectral band and broad spatial distribution [
6]. The power-frequency spectrum of EMG artifacts ranges from 2 Hz to 100 Hz [
6]. Their wide spectrum easily overlaps with the frequencies of interest of the EEG. Because of the influence of both the volume conduction and the broad distribution of muscles across the face, neck, and head, it can be observed anywhere on the scalp.
In contrast, the spatial distribution of ECG and EOG are comparatively localized. EMG artifacts occur from the movement of spatially distributed and functionally independent muscle groups with distinct topographic and spectral signatures [
7]. The spectral signatures may vary across different muscles and also with the intensity of muscle contraction. In addition, ECG and EOG artifacts can be removed with the help of a reference channel, while it is difficult to remove EMG artifacts by using reference channels. Due to the complex muscle distribution, the placement of reference electrode for EMG artifact removal is very difficult [
7] in a practical situation. Hence, it is challenging to remove EMG artifacts without using any reference channels.
Numerous techniques have been proposed in the literature to overcome these difficulties of EMG removal from EEG. In early research, approaches based on blind source separation (BSS) were explored for the automatic removal of EMG artifacts from multichannel EEG [
8]. The authors in [
9,
10,
11] showed that independent component analysis (ICA) resulted in a good performance for removing EMG artifacts from the multichannel EEG data. Janani et al. [
12] showed that canonical correlation analysis (CCA) outperformed the ICA to eliminate EMG from EEG successfully. Chen et al. [
13] proved that independent vector analysis (IVA) could successfully denoise the EEG signal from EMG. However, BSS-based denoising techniques assume that the number of concerned sources is equal to or less than the number of channels. In this case, BSS can also be implemented for single-channel EEG data [
14], where single-channel EEG data was at first decomposed with some decomposition techniques. Then artifactual components from the results of decomposition are used for BSS processing.
However, the recent trend in mobile healthcare applications has led to the reduction in the number of EEG channels for health monitoring, and in some cases only a single channel is used [
15]. In such applications, BSS may not perform well. To overcome this issue, several attempts based on the decomposition techniques have been proposed to remove EMG artifacts from a few channels and single-channel EEG data. Fitzgibbon et al. [
16] removed EMG artifacts from central channels through the surface Laplacian transform (SLT). They stated that brain activities captured through central channels are affected only by a group of nonadjacent muscles as there is no muscle under the center of the scalp [
16] and SLT performed well in removing muscle contamination of scalp signals. However, the neuronal activities from other channels are affected by both the adjacent and the distant group of muscles. Yong et al. [
17] removed EMG artifacts from both the non-central and distant channels through morphological component analysis (MCA) by assuming that the EEG recordings are a linear combination of neuronal activities, artifacts, and electronic noise. However, as the EEG and EMG have non-stationary morphology, the performance of the MCA is not always satisfactory with the chosen dictionaries in MCA. Further, the above-mentioned decomposition techniques are not fully data-driven, which make them unfit for automatic and online applications. Bhardwaj et al. [
18] proposed wavelet-transform-based EMG artifact removal. Multiscale principal component analysis (MSPCA) is another effective hybrid denoising technique for EMG artifacts correction, in which PCA is combined with multiscale wavelet transform [
19,
20,
21]. Wavelets separate stochastic and deterministic processes and approximately uncorrelated the autocorrelation between calculations, whereas PCA offers interactions between distinct variables. The wavelet transform is an ideal approach for EEG signal analysis due to its multi-resolution characteristics.
In recent years, several hybrid methods [
12,
22,
23,
24,
25,
26,
27,
28,
29,
30,
31,
32] have been proposed to overcome the difficulties of individual techniques for the removal of EMG artifacts from the EEG data, and their summary is shown in
Table 1. It is observed from the table that EMD-based methods have been widely used for EMG artifacts correction. However, these methods ignore significant cross-channel interdependence information and consider the dependent information between spatially adjacent channels for the source separation procedure [
33]. On the other hand, EMD-based techniques perform well on few channel EEG data compared to the single-channel EEG data, whereas wavelet-based techniques perform well on single-channel EEG data. The main idea of wavelet-based techniques is to threshold the wavelet coefficients to remove the artifacts from the signal. To threshold the wavelet coefficients, several thresholding techniques such as statistical threshold and universal threshold calculations are widely used. However, Phadikar et al. [
34] showed that these threshold functions may vary with the data, which makes the system unfit for automatic and online implementation. Recently, Bajaj et al. [
35] proposed a tuneable artifact removal technique based on wavelet packet decomposition (WPD). Their denoising method automatically removes EMG artifacts from the corrupted EEG signal. However, the challenge associated with wavelet analysis remains in the selection of proper wavelet function and decomposition level.
Zhang et al. [
36] proposed a convolutional neural network-based approach for the removal of muscle artifacts from the EEG signals. However, their approach is limited to only 2 s of EEG epoch. A better configuration of CNN is needed for longer duration of EEG. More recently, NLM filter-based methods were employed for the removal of EMG artifacts from the EEG data. Earlier it was designed for image denoising [
37]. Eltrass et al. [
38] proposed a hybrid method in which an NLM filter is combined with the multi-kernel normalized least mean square with a coherence-based sparsification (MKNLMS-CS) algorithm for the successful removal of the EMG artifacts from the EEG data.
While numerous methods have been studied for the elimination of EMG, developing robust single or hybrid algorithms that can operate automatically is still very challenging. In addition, most of the techniques are suitable for multi-channel EEG (performance degrades for single-channel EEG), hence, developing an automatic EMG artifact removal technique for single-channel EEG is still challenging. However, the combination of multiple cascading algorithms is essentially an unexplored area. The significance of cascading algorithms is to suppress the artifacts in a single stage, which suppresses artifact sources and achieves a higher degree of robustness. In this manuscript, a novel automatic cascaded system is proposed in which wavelet transform is combined with the NLM filter for the elimination of EMG artifacts from the EEG signals.
In the proposed work, an EMG-contaminated EEG signal is automatically identified and decomposed into wavelet coefficients through WPD. Wavelet transform is used to separate the EEG features into different scales so that significant features of the EEG signals are preserved and noises can be removed. The mother wavelet and with an appropriate decomposition level is selected through a proper procedure. It is to be noted that artifacts will be reflected in the wavelet coefficients. Hence, correcting the corrupted wavelet coefficients will result in denoising the EEG signal. In the proposed method, corrupted wavelet coefficients are corrected through an optimized NLM algorithm. Finally, all the corrected coefficients are used in the inverse operation to get back the original artifact-free EEG signal. In this approach, no threshold calculation method is employed; instead, wavelet coefficients are corrected through an NLM filter, which makes the system automatic and fit for implementation in online applications. In addition, a modified NLM algorithm is proposed, in which all the parameters of the algorithm are optimized in a proper way. The proposed approach preserves the information of interest in the EEG signal, which is reflected in higher average correlation coefficients (CC) and higher MI values. The main contributions of the proposed denoising algorithm are stated as:
- (1)
The proposed algorithm employs the wavelet transform for its good time-frequency localization and optimized NLM filter for better removal of wide frequency spectrum EMG artifacts from the corrupted EEG signals.
- (2)
The main challenge in wavelet-transform-based denoising is the selection of appropriate threshold values. Hence, instead of thresholding the wavelet coefficients, they are corrected through NLM estimation.
- (3)
The issue of optimum parameter selection for the NLM is properly addressed with the help of a meta-heuristic algorithm.
- (4)
Further, the proposed algorithm corrects only the EMG-corrupted portion of the EEG and keeps the clean portion untouched. Hence, the proposed system preserves the true information contained in the EEG signal.
The novelty of the proposed hybrid method in which SVM, WPD and NLM are combined is described as below:
SVM is one of the most efficient classifiers and hence was used in this work. Different artifacts have different characteristics, for example, eyeblink artifacts contaminate the EEG signals with 5 to 10 times’ larger amplitude than the normal EEG signals with very low frequency (typically 0.1 to 3 Hz), and EMG artifacts contaminate the EEG signals with higher amplitude and higher frequency components. Similarly, ECG artifacts contaminate the EEG signals with periodic discharges. Moreover, we do not know which EEG patterns are hidden in the artifact portions (there is no ground truth). Hence, before removal of artifacts, it is necessary to identify which artifacts contaminate the EEG signals, because different artifacts have different characteristics and affect the EEG signals distinctively. If the system does not have any prior knowledge about the type of artifacts, then the system will be capable of denoising the EEG signals but the true EEG pattern (or the original ground truth) will not be reconstructed properly. Therefore, first the EEG signals corrupted with EMG artifacts are identified with SVM as a classifier.
The frequency of muscle artifacts overlaps with the frequency of interest of EEG signals. Hence, removing those particular frequencies by making the relevant wavelet coefficients zero will result in a significant loss of information. Therefore, at first EEG signals are decomposed into wavelet coefficients using WPD to represent the different frequency bands of EEG signals with respect to time. Note that designing an efficient NLM-based filter for a very wide frequency band is very difficult. Thus, the corrupted EEG signal is decomposed into different frequency bands at different levels in WPD so that efficient NLM filters can be designed for decomposed signals at different levels for the effective removal (correction) of EMG artifacts.
The optimization of parameters, especially the bandwidth parameter, λ of NLM algorithm at different levels (set of λ), simultaneously is a challenging task, and meta-heuristic algorithms are the most efficient ones in solving this type of problem. Hence, one of the recently developed meta-heuristic algorithms, GWO, is used for the task.
The manuscript is organized as:
Section 1 introduces the background, state-of-arts, challenges, objectives, and notations and preliminaries used in this paper are described in
Table 2,
Section 2 depicts the tools and ideas employed in this methodology,
Section 3 states the proposed methodology,
Section 4 depicts the outcome of the research,
Section 5 discusses about the experiment, and
Section 6 completes the manuscript by drawing conclusions alongside the strategies for the future works.
5. Discussion
In this paper, a fully automatic denoising method is proposed for removing EMG artifacts from the EEG signals. At first, the artifact EEG signals are identified through SVM as a classifier. After that, the corrupted EEG signals are decomposed into wavelet coefficients through WPD. Wavelet transform is used to separate the EEG features into different scales so that significant features of the EEG signals are preserved and artifacts can be removed. Generally, in wavelet denoising techniques, wavelet coefficients are thresholded to remove the artifacts. However, selection of an appropriate threshold makes the system unfit for automatic operation. In the proposed method, the corrupted wavelet coefficients are corrected through a modified NLM algorithm instead of thresholding them. The NLM algorithm has various parameters and needed adjustment for different types of EEG data. The NLM algorithm has been modified in this paper by optimizing the bandwidth parameter through GWO. In addition, a set of λ is optimized at different scales of the wavelet-transformed EEG signal, which enhances the performance of the proposed method for denoising the EEG signals. Finally, all the corrected wavelet coefficients are used in inverse operation to get back the original clean EEG signal. The proposed method is fully automatic and does not need any intervention of the user.
The proposed method was first validated on simulated EEG signals (the ground truth is available) and then tested on recorded EEG data. It is observed from the experimental results that the proposed method is superior among all the denoising methods in terms of highest CC and SSIM. Hence, the proposed algorithm preserves more true structure of the clean EEG signal compared to other recently reported EEG denoising algorithms. For recorded EEG data, the original true clean EEG is unavailable (there is no ground truth); hence, MI is calculated to check how much information is mutual between corrupted EEG and denoised EEG. It is observed that the proposed approach achieved higher MI as compared to other recently developed techniques.
One of the main advantages of the proposed hybrid method is that it is insensitive to any threshold value and there is no need to tune the parameters of the NLM algorithm, which makes the system fully automatic and it can be implemented in online applications. WPD efficiently deals with the non-stationary characteristics of EEG signals. Although the proposed method is capable of correcting EMG artifacts from the multi-channel EEG signals, correcting one by one increases the computational time.