A Novel Instantaneous Phase Detection Approach and Its Application in SSVEP-Based Brain-Computer Interfaces

Huang, Xiangdong; Xu, Jingwen; Wang, Zheng

doi:10.3390/s18124334

Open AccessArticle

A Novel Instantaneous Phase Detection Approach and Its Application in SSVEP-Based Brain-Computer Interfaces

by

Xiangdong Huang

¹

,

Jingwen Xu

¹ and

Zheng Wang

^2,*

¹

School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China

²

College of Intelligence and Computing, Tianjin University, Tianjin 300072, China

^*

Author to whom correspondence should be addressed.

Sensors 2018, 18(12), 4334; https://doi.org/10.3390/s18124334

Submission received: 17 October 2018 / Revised: 29 November 2018 / Accepted: 3 December 2018 / Published: 7 December 2018

(This article belongs to the Special Issue Affective and Immersive Human Computer Interaction via Effective Sensor and Sensing (AI-HCIs))

Download

Browse Figures

Versions Notes

Abstract

:

This paper proposes a novel phase estimator based on fully-traversed Discrete Fourier Transform (DFT) which takes all possible truncated DFT spectra into account such that it possesses two merits of ‘direct phase extraction’ (namely accurate instantaneous phase information can be extracted without any correction) and suppressing spectral leakage. This paper also proves that the proposed phase estimator complies with the 2-parameter joint estimation model rather than the conventional 3-parameter joint model. Numerical results verify the above two merits and demonstrate that the proposed estimator can extract phase information from noisy multi-tone signals. Finally, real data analysis shows that fully-traversed DFT can achieve a better classification on the phase of steady-state visual evoked potential (SSVEP) brain-computer interface (BCI) than the conventional DFT estimator does. Besides, the proposed phase estimator imposes no restrictions on the relationship between the sampling rates and the stimulus frequencies, thus it is capable of wider applications in phase-coded SSVEP BCIs, when compared with the existing estimators.

Keywords:

fully-traversed DFT; phase estimator; direct phase extraction; spectral leakage; SSVEP

1. Introduction

Estimating the frequency, the phase and the amplitude of a signal is a standard classical problem of signal processing. In particular, phase estimation has been widely applied in synchronization in communication [1], power analysis [2], speech enhancement [3], GPS navigation [4] and much more. Until now, phase estimators based on direct Discrete Fourier Transform (DFT) are the mainstream [5].

In [5], Liguori pointed out that, the existing DFT-based phase estimators heavily rely on the result of frequency estimate. To illustrate this dependency, let us study the sampled version (the sampling rate is

F_{s}

) of a complex exponential signal

x (t) = a_{0}

exp

[j (2 π f_{0} t + θ_{0})]

as

x (n) = x (t) |_{t = n Δ t} = a_{0} e^{j (2 π f_{0} n Δ t + θ_{0})}, n = 0, \dots, N - 1,

(1)

where

a_{0}

,

f_{0}

,

θ_{0}

are amplitude, frequency and phase respectively and

Δ

t is the sampling interval 1/

F_{s}

. Accordingly, the frequency resolution of the N-point DFT

X (k)

is

Δ

f =

F_{s}

/N = 1/N

Δ

t. Assume the peak DFT bin is at k =

k_{0}

,

k_{0}

∈

Z^{+}

(

Z^{+}

refers to the set of positive integer numbers). Then, it can be deduced that the ideal phase value of

X (k_{0})

is

θ_{0} = arg (X (k_{0})) - π δ (N - 1) / N .

(2)

The variable

δ

in (2) is the fractional frequency offset

(- 0.5 \leq δ < 0.5)

with the value

δ = f_{0} / Δ f - k_{0}

, which is closely related to two sampling cases (coherent sampling or noncoherent sampling). For the coherent sampling case, i.e.,

f_{0} = k_{0} Δ f

,

δ = 0

and thus the sequence {

x (n), n = 0, 1, \dots

, N − 1} exactly covers

k_{0}

signal periods, it can be easily inferred from (2) that the phase

θ_{0}

can be estimated by directly taking the phase at the peak bin. Nevertheless, in case of noncoherent sampling, i.e.,

f_{0} = (k_{0} + δ) Δ f

,

δ \neq 0

and thus the sequence does not exactly contain integer times of signal periods, it can be inferred from (2) that the phase estimate

θ_{0}

is related to both the peak index

k_{0}

and the frequency offset

δ

.

Up to now, a lot of frequency estimators have been proposed to estimate the fractional offset

δ

. For example, Offelli proposed an energy-based approach [6] in which the frequency offset of some component can be derived from the energy summation of the windowed DFT spectral bins around the peak bin. Other approaches based on interpolated FFT were reported in [2,7,8,9], in which

δ

is calculated via interpolation between several successive high-amplitude DFT bins centered with peak bin. In recent years, a lot of high-accuracy interpolation-based estimators such as Provencher estimator [10], Jacobsen estimator [11], Candan estimator [12], and phase difference-based estimator [13] were proposed. However, in [5,14,15], Liguori emphasized that “the bias of frequency estimate will introduce the uncertainty propagation to the phase acquisition”. In other words, the error of frequency offset

δ

in (2) will inevitably give rise to the error of phase estimate

θ_{0}

accordingly.

Besides, spectral leakage is also a non-ignorable factor of degrading DFT-based phase estimators. In essence, for the non-coherent sampling case, performance degradation of phase estimation actually results from DFT’s inherent spectral leakage effect [16]. Therefore, to enhance the accuracy of phase estimation, some approach capable of suppressing spectral leakage is expected to be developed.

This paper proposes a novel approach of correcting DFT spectra which can alleviate phase estimate’s dependence on frequency estimate (or rather, the dependence on frequency offset

δ

). This improvement actually attributes to the property that the proposed fully-traversed DFT spectrum has a much slighter spectral leakage compared to original uncorrected DFT spectrum in non-coherent sampling case. To verify these advantages, we also apply this corrected DFT-based phase estimator into a phase-coded steady-state visual evoked potential-based brain-computer interface (SSVEP-based BCI).

Recently, for the purpose of increasing realizable targets, phase information is popularly integrated into frequency-coding in SSVEP-BCI, such as joint/mixed frequency and phase coding [17,18,19]. In [20], Lee introduced one scheme of phase coding in SSVEP-BCI system, in which 8 LEDs flickering at 31.25 Hz with the phase interval

45^{\circ}

were used to represent 8 cursor functions. However, the phase decoding in [20] was realized through detecting the maximum amplitude peak of the averaged SSVEP waveform, i.e., it was realized in time domain instead of frequency domain. In [21], Lee segmented one stimulus into reference epoch and phase-shift epoch, in which the phase differences between these two epochs were used to distinguish different targets. Shortly after that, in [22], Lee designed six SDFS (stepping delay flickering sequences) with stimulus frequency of 32 Hz, among which 6 different phase-tagged objects are represented by 6 distinct delay segments such that they can be detected by the normalized power of averaged responses. However, SSVEP-BCIs in [21,22] actually employ special stimulus sequences to facilitate phase decoding, which may be not suitable for the common situations of phase measurement. In [23], to increase the number of visual stimuli, Gao and Jia proposed a frequency and phase mixed coding scheme using multiple frequencies. Then, Gao proposed the phase constrained canonical correlation analysis (p-CCA) to better distinguish frequency-tagged stimuli from multichannel SSVEPs [24], in which the phase information was estimated from the FFT result of the SSVEP records over the occipital cortex. Following this, Gao et.al introduced phase coding in CCA to discriminate six

60^{\circ}

-interval targets [25]. However, the stimuli frequencies in [23,24,25] were actually deliberately chosen such that each stimuli frequency exactly equals the integer times of FFT resolution and thus the frequency offset is also exactly zero. Therefore, in SSVEP-BCIs, novel phase estimator capable of detecting the phases of stimuli with any frequency offset is expected to be developed.

This paper will demonstrate that the proposed phase corrected DFT-based estimator is in accordance with the above demand of SSVEP-BCIs. Experimental results show that, the proposed phase estimator is superior to the conventional estimator in SSVEP phase extraction.

The rest of this paper is organized as follows. Section 2 introduces the derivation of the proposed fully-traversed DFT. Section 3 elaborates the properties of the proposed fully-traversed DFT in the noiseless case. Section 4 gives accuracy analysis of this phase estimator in noisy case. Section 5 demonstrates the proposed DFT-based phase estimator’s superiority to conventional phase estimators in the phase-coded SSVEP-BCI with auto-calibration. Finally, we conclude with a summary of results in Section 6.

2. Derivation of the Fully-Traversed DFT Spectrum

2.1. Phase Property of DFT Spectrum

Consider a sampled signal {

x (0), x (1), \dots, x

(N −

1)

} (or denoted as the vector

x_{0}

). As is known, the normalized conventional DFT of

x (n)

is

X (k) = \frac{1}{N} \sum_{n = 0}^{N - 1} x (n) e^{- j \frac{2 π}{N} k n}, k = 0, 1, \dots, N - 1 .

(3)

Without loss of generality, assume that

Δ ω = 2 π / N

and

ω_{0} = β Δ ω = (k_{0} + δ) Δ ω

,

k_{0} \in Z^{+}

and

- 0.5 \leq δ < 0.5

. Substituting the complex exponential sequence

x (n) =

exp(j

(ω_{0} n + θ_{0}))

into (3) yields

\begin{matrix} X (k) & = \frac{1}{N} \sum_{n = 0}^{N - 1} e^{j (\frac{2 π}{N} β n + θ_{0})} e^{- j \frac{2 π}{N} k n} \\ = \frac{e^{j θ_{0}}}{N} \sum_{n = 0}^{N - 1} e^{j \frac{2 π}{N} (β - k) n} \\ = \frac{e^{j θ_{0}}}{N} \sum_{n = 0}^{N - 1} e^{j \frac{2 π}{N} (k_{0} - k + δ) n} . \end{matrix}

(4)

Using geometric series summation and Euler equation, one can further deduce (4) as

X (k) = \{\begin{matrix} e^{j [θ_{0} + \frac{N - 1}{N} (k_{0} + δ - k) π]} \cdot \frac{sin [(k_{0} + δ - k) π]}{N sin [(k_{0} + δ - k) π / N]}, δ \neq 0 \\ e^{j θ_{0}}, δ = 0 \end{matrix}

(5)

Substituting the peak index

k = k_{0}

into (4) and synthesize the two cases of

δ \neq 0

and

δ = 0

, we can derive a uniform expression of the phase value at peak DFT bin as

ϕ_{X} (k_{0}) = θ_{0} + (N - 1) / N \cdot δ π

(6)

Obviously, (6) is in accordance with (1).

From (6), it can be found that, only for the coherent sampling case (the frequency offset

δ = 0

), the observed phase

ϕ_{X} (k_{0})

at the peak DFT bin is an accurate estimate. In contrast to this, for the non-coherent sampling case (

δ \neq 0

), in order to estimate the phase,

δ

has to be estimated in advance. As a result, the phase estimator based on conventional DFT heavily depends on the frequency offset

δ

.

2.2. The Proposed Fully-Traversed DFT Spectrum

In this work, we aim to develop a phase estimator capable of removing the dependency on the frequency offset

δ

. To be specific, our goal is to construct a corrected DFT spectrum whose peak bin provides accurate phase information whether

δ = 0

or not.

To derive the proposed fully-traversed DFT, it is necessary to study the relationship between the ideal Fourier transform and the conventional DFT. As known to us, the ideal Fourier transform

X (j ω)

of an infinite-length sequence

{x (n)} = {\dots, x (-

N + 1

), \dots, x (0), \dots, x (

N − 1

), \dots}

is

X (j ω) = \sum_{n = - \infty}^{+ \infty} x (n) e^{- j n ω}

(7)

However,

X (j ω)

cannot be realized, since the calculation in (7) consumes innumerous samples and memory. Thus, in practical applications,

X (j ω)

is replaced by the conventional normalized DFT

X (k)

defined in (3). Furthermore, if we define a sequence

x_{0} (n)

, which is truncated from the infinite-length sequence {

x (n), - \infty \leq n \leq \infty

}, i.e.,

\begin{matrix} x_{0} (n) = \{\begin{matrix} x (n), n \in [0, N - 1] \\ 0, others \end{matrix}, \end{matrix}

(8)

then, apparently, combining (3) with (7), (8) yields a representation of

X (k)

as

\begin{matrix} X (k) & = \frac{1}{N} X_{0} (j ω) |_{ω = k 2 π / N} \\ = \frac{1}{N} \sum_{n = - \infty}^{+ \infty} x_{0} (n) e^{- j n k 2 π / N} \\ = \frac{1}{N} \sum_{n = 0}^{N - 1} x (n) e^{- j n k 2 π / N} \end{matrix}

(9)

Now, let us focus on the sample

x (0)

at which the ideal phase

θ_{0}

is located. It can be inferred from (9) that conventional DFT only considers one truncation case. However, there are N truncated sequences

x_{m}

(

m = 0, 1, \dots, N - 1

) containing

x (0)

listed as

\begin{matrix} \begin{matrix} x_{0} = {x (0), x (1), \dots, x (N - 2), x (N - 1)}, \\ x_{1} = {x (- 1), x (0), \dots, x (N - 3), x (N - 2)}, \\ ⋮ \\ x_{N - 1} = {x (- N + 1), x (- N + 2), \dots, x (- 1), x (0)}, \end{matrix} \end{matrix}

(10)

where the elements of sequence

x_{m} = {x_{m} (n), n = 0, \dots,

N − 1} are

\begin{matrix} x_{m} (n) = \{\begin{matrix} x (n - m), n \in [0, N - 1] \\ 0, others \end{matrix} \end{matrix}

(11)

Similar to the discrete spectrum of the truncated sequence

x_{0}

expressed in (8), a reasonable discrete spectrum for

x_{m} = {x (- m), x (

−m + 1

), \dots, x (

−

m

+ N − 1

)}

should be

\begin{matrix} X_{m} (k) & = \frac{1}{N} \sum_{n = - m}^{N - 1 - m} x (n) e^{- j n k 2 π / N}, \\ m, k = 0, 1, \dots, N - 1 . \end{matrix}

(12)

Furthermore, aiming to rewrite

X_{m} (k)

in the form of conventional DFT in which variable n ranges from 0 to N − 1, we should detach the series summation in (12) into two terms as

\begin{matrix} X_{m} (k) = \frac{1}{N} \sum_{n = - m}^{- 1} x (n) e^{- j n k 2 π / N} \\ + \frac{1}{N} \sum_{n = 0}^{N - m + 1} x (n) e^{- j n k 2 π / N}, \end{matrix}

(13)

The first summation in (13) can be denoted as

\begin{matrix} \frac{1}{N} \sum_{n = - m}^{- 1} x (n) e^{- j n k 2 π / N} \\ \underset{̲}{\underset{̲}{n^{'} = n + N}} \frac{1}{N} \sum_{n^{'} = N - m}^{N - 1} x (n^{'} - N) e^{- j (n^{'} - N) k 2 π / N} \\ = \frac{1}{N} \sum_{n = N - m}^{N - 1} x (n - N) e^{- j n k 2 π / N} . \end{matrix}

(14)

Hence, it is necessary to introduce N new truncated sequences

{\tilde{x}}_{m}

(

m = 0, 1, \dots, N - 1

) whose elements are

\begin{matrix} {\tilde{x}}_{m} (n) & = \{\begin{matrix} x (n), n = 0, \dots, N - m - 1 \\ x (n - N), n = N - m, \dots, N - 1 \end{matrix} . \end{matrix}

(15)

In terms of (15), N sequences

{\tilde{x}}_{0} \sim {\tilde{x}}_{N - 1}

are listed as

\begin{matrix} \begin{matrix} {\tilde{x}}_{0} = {x (0), x (1), \dots, x (N - 2), x (N - 1)}, \\ {\tilde{x}}_{1} = {x (0), x (1), \dots, x (N - 2), x (- 1)}, \\ ⋮ \\ {\tilde{x}}_{N - 1} = {x (0), x (- N + 1), \dots, x (- 2), x (- 1)} . \end{matrix} \end{matrix}

(16)

Then, combining (13) with (15), we have

\begin{matrix} X_{m} (k) = \frac{1}{N} \sum_{n = 0}^{N - 1} {\tilde{x}}_{m} (n) e^{- j n k 2 π / N} = D F T [{\tilde{x}}_{m} (n)], \\ m = 0, 1, \dots, N - 1 . \end{matrix}

(17)

Finally, averaging all the DFT results of

{\tilde{x}}_{0} \sim {\tilde{x}}_{N - 1}

yields the proposed corrected spectrum, i.e.,

\begin{matrix} Y (k) = \frac{1}{N} \sum_{m = 0}^{N - 1} X_{m} (k), k = 0, 1, \dots, N - 1 . \end{matrix}

(18)

From (8)–(18), it can be noticed that, the center sample

x (0)

fully traverses all the possible starting positions of N truncated sequences

x_{m}

(

m = 0, 1, \dots, N - 1

) in (10). Moreover, all the DFT results (17) of these traversed sub-sequences are fully averaged in (18) to yield the final spectrum

Y (k)

in (18). Hence, this novel spectral analysis is named as Fully-traversed DFT, whose dataflow is summarized as:

step 1.: Construct N N-length sub sequences $x_{0} \sim x_{N - 1}$ from the given $(2 N - 1)$ input samples $x (- N + 1), \dots, x (0)$ , $\dots, x (N - 1)$ , as (10) lists;
step 2.: For each index $m, m = 0, 1, \dots, N - 1$ , circularly left move m samples of $x_{m} = {x (- m), \dots,$ $x (0), \dots, x ($ − $m$ + $N$ − 1 $)}$ to generate N sequences ${\tilde{x}}_{m} = {x (0), x (1), \dots, x (N - m + 1), x (- m), \dots, x (- 1)}$ .
step 3.: Implement normalized DFT on each ${\tilde{x}}_{m}$ to obtain the discrete sub-spectrum $X_{m} (k), m = 0, \dots,$ N − 1;
step 4.: Average all N sub-spectra $X_{m} (k)$ to acquire the proposed phase corrected spectrum $Y (k)$ .

3. Property of Phase Estimation in the Noiseless Circumstance

3.1. Single-Tone Case

Consider a single-tone signal

x (n)

= exp(j

(ω_{0} n + θ_{0}))

,

ω_{0} = β Δ ω = (k_{0} + δ) 2 π / N

. Since

Y (k)

is obtained by averaging

X_{0} (k) \sim X_{N - 1} (k)

, substituting

x (n)

into

X_{m} (k)

in (12) and (18) yields

\begin{matrix} Y (k) & = \frac{1}{N} \sum_{m = 0}^{N - 1} [\frac{1}{N} \sum_{n = - m}^{- m + N - 1} e^{j (β 2 n π / N + θ_{0})} e^{- j n k 2 π / N}] \\ \underset{̲}{\underset{̲}{n^{'} = n + m}} \frac{e^{j θ_{0}}}{N^{2}} \sum_{m = 0}^{N - 1} [\sum_{n^{'} = 0}^{N - 1} e^{j (β - k) 2 (n^{'} - m) π / N}] \\ = \frac{e^{j θ_{0}}}{N^{2}} \sum_{m = 0}^{N - 1} e^{- j (β - k) 2 m π / N} [\sum_{n = 0}^{N - 1} e^{j (β - k) 2 n π / N}] \end{matrix}

(19)

Obviously, (19) can be rewritten as

\begin{matrix} Y (k) = e^{j θ_{0}} & (\frac{e^{j θ_{0}}}{N} \sum_{m = 0}^{N - 1} e^{- j \frac{2 π}{N} (k_{0} - k + δ) m}) \cdot \\ {(\frac{e^{j θ_{0}}}{N} \sum_{n = 0}^{N - 1} e^{- j \frac{2 π}{N} (k_{0} - k + δ) n})}^{*} \end{matrix}

(20)

where the superscript ‘∗’ represents complex conjugate operation. Then, combining (20) with (4) yields

\begin{matrix} Y (k) = e^{j θ_{0}} X (k) X^{*} (k) = e^{j θ_{0}} {|X (k)|}^{2}, \\ k = 0, 1, \dots, N - 1 . \end{matrix}

(21)

Taking the phase part of (21), we can obtain the phase spectrum

φ_{Y} (k)

as

\begin{matrix} φ_{Y} (k) = θ_{0}, k = 0, 1, \dots, N - 1 . \end{matrix}

(22)

Equation (22) shows that, for the signal

x (n)

= exp

(j (ω_{0} n + θ_{0}))

,

- N + 1 \leq n \leq N - 1

, the phase values at all N spectral bins (including the peak bin

k = k_{0}

) uniformly equal the ideal phase

θ_{0}

(also refers to the instantaneous phase of the center sample

x (0)

). In other words, the synthesized spectrum

Y (k)

directly provides the accurate phase estimate at any spectral bin, thus removing the conventional DFT-based phase estimator’s dependency on the frequency offset

δ

.

Furthermore, since

Y (k)

is obtained by linearly averaging the N sub DFT spectra as (18) shows, the proposed fully-traversed DFT spectrum is of linearity, also. Thus, for a single-tone signal with amplitude

a_{0} (a_{0} \neq 1)

, the following holds

\begin{matrix} Y (k) = a_{0} e^{j θ_{0}} \frac{{sin}^{2} [(k_{0} + δ - k) π]}{N^{2} {sin}^{2} [(k_{0} + δ - k) π / N]}, k = 0, 1, \dots, N - 1, \end{matrix}

(23)

among which the peak spectral bin is

\begin{matrix} Y (k_{0}) = a_{0} e^{j θ_{0}} \frac{{sin}^{2} (δ π)}{N^{2} {sin}^{2} (δ π / N)} . \end{matrix}

(24)

3.2. Multi-Tone Case

Consider a signal containing Q (

Q \geq 2

) components expressed as

x (n) = \sum_{q = 1}^{Q} a_{q} e^{j (ω_{q} n + θ_{q})} = \sum_{q = 1}^{Q} a_{q} e^{j [(k_{q} + δ_{q}) Δ ω n + θ_{q}]}, k_{q} \in Z^{+}, 0 < | δ_{q} | < 0.5 .

(25)

Since the conventional DFT is a linear transform, combining (5) and (25), its multi-tone spectrum

X (k)

is composed of Q single-tone spectra, i.e.,

\begin{matrix} \begin{matrix} X (k) & = \sum_{q = 1}^{Q} X_{q} (k) \\ = \sum_{q = 1}^{Q} a_{q} e^{j [θ_{q} + \frac{N - 1}{N} (k_{q} + δ_{q} - k)]} \frac{sin [(k_{q} + δ_{i} - k) π]}{N sin [(k_{q} + δ_{q} - k) π / N]}, k = 0, \dots, N - 1 \end{matrix} \end{matrix}

(26)

Similarly, since the proposed phase corrected DFT spectrum is also of linearity, combining (5) with (23), its multi-tone spectrum

Y (k)

equals the summation of Q single-tone spectra as

\begin{matrix} \begin{matrix} Y (k) = \sum_{q = 1}^{Q} Y_{q} (k) \\ = \sum_{q = 1}^{Q} a_{q} e^{j θ_{q}} \frac{{sin}^{2} [(k_{q} + δ_{q} - k) π]}{N^{2} {sin}^{2} [(k_{q} + δ_{q} - k) π / N]}, k = 0, \dots, N - 1 \end{matrix} \end{matrix}

(27)

Furthermore, to measure the phase of the i-th tone, the peak bin at

k = k_{i}

should be focused. Therefore, the conventional DFT spectrum

X (k_{i})

can be written as

\begin{matrix} X (k_{i}) & = X_{i} (k_{i}) + \sum_{q \neq i}^{} X_{q} (k_{i}) \\ = a_{i} e^{j (θ_{i} + \frac{N - 1}{N} δ_{i})} \frac{sin (δ_{i} π)}{N sin (δ_{i} π / N)} + \sum_{q \neq i}^{} a_{q} e^{j [θ_{q} + \frac{N - 1}{N} (k_{q} + δ_{q} - k_{i})]} \frac{sin [(k_{q} + δ_{q} - k_{i}) π]}{N sin [(k_{q} + δ_{q} - k_{i}) π / N]} . \end{matrix}

(28)

Similarly, the proposed fully-traversed DFT spectrum

Y (k_{i})

can be expressed as

\begin{matrix} Y (k_{i}) & = Y_{i} (k_{i}) + \sum_{q \neq i}^{} Y_{q} (k_{i}) \\ = a_{i} e^{j θ_{i}} \frac{{sin}^{2} (δ_{i} π)}{N^{2} {sin}^{2} (δ_{i} π / N)} + \sum_{q \neq i}^{} a_{q} e^{j θ_{q}} \frac{{sin}^{2} [(k_{q} + δ_{q} - k_{i}) π]}{N^{2} {sin}^{2} [(k_{q} + δ_{q} - k_{i}) π / N]} . \end{matrix}

(29)

In (29), the first term is the expected i-th single-tone spectrum which directly provides the phase estimate, and the second item represents the interference of other tones. Obviously, for either the conventional DFT spectrum or the fully-traversed DFT spectrum, the accuracy of the i-th tone’s phase estimation depends on the intensity of interference. Particularly, this accuracy depends on the relative magnitude ratio between

X_{q} (k_{i}), q \neq i

, and

X_{i} (k_{i})

. From (28) and (29), we have

\{\begin{matrix} \frac{|X_{q} (k_{i})|}{|X_{i} (k_{i})|} = \frac{a_{q}}{a_{i}} \cdot |\frac{sin ((k_{q} + δ_{q} - k_{i}) π) / sin ((k_{q} + δ_{q} - k_{i}) π / N)}{sin (δ_{i} π) / sin (δ_{i} π / N)}| \\ \frac{|Y_{q} (k_{i})|}{|Y_{i} (k_{i})|} = \frac{a_{q}}{a_{i}} \cdot \frac{{sin}^{2} ((k_{q} + δ_{q} - k_{i}) π) / {sin}^{2} ((k_{q} + δ_{q} - k_{i}) π / N)}{{sin}^{2} (δ_{i} π) / {sin}^{2} (δ_{i} π / N)} \end{matrix}

(30)

From (30), we have

\frac{|Y_{q} (k_{i})| / |Y_{i} (k_{i})|}{|X_{q} (k_{i})| / |X_{i} (k_{i})|} = |\frac{sin ((k_{q} + δ_{q} - k_{i}) π) / sin ((k_{q} + δ_{q} - k_{i}) π / N)}{sin (δ_{i} π) / sin (δ_{i} π / N)}|

(31)

Please note that

0 < | δ_{i} |, | δ_{q} | < 0.5

,

| k_{q} - k_{i} | \geq 1

and thus

| k_{q} + δ_{q} - k_{i} | > 0.5

. Since

sin (δ π) / sin (δ π / N)

is a monotonously descending even function, we have

|\frac{sin ((k_{q} + δ_{q} - k_{i}) π)}{sin ((k_{q} + δ_{q} - k_{i}) π / N)}| < |\frac{sin (δ_{i} π)}{sin (δ_{i} π / N)}|

(32)

Combining (31) with (32), we have

|\frac{Y_{q} (k_{i})}{Y_{i} (k_{i})}| < |\frac{X_{q} (k_{i})}{X_{i} (k_{i})}|, q \neq i .

(33)

Equation (33) shows that, for the fully-traversed DFT spectrum, the relative magnitude ratio between any other tone and the tone of interest is smaller than that of the conventional DFT. In other words, compared to the conventional DFT spectrum, the fully-traversed DFT magnitude spectrum does better in suppressing spectral leakage and interferences, thus yielding a higher accuracy of phase estimation.

3.3. Simplified Dataflow of Phase Estimation

From the aforementioned 4-step procedure listed in Section II, one can find that the fully-traversed DFT experiences N times of DFT operation. Therefore, to reduce computation complexity, this procedure needs to be simplified.

According to the linearity of DFT, the average of N sub-spectra is equivalent to the DFT result of the averaged data. Therefore, if we average the N sub-sequences

{\tilde{x}}_{0} \sim {\tilde{x}}_{N - 1}

in (15) and (16), then one new sequence {

y (n), n = 0, 1, \dots,

N − 1} can be constructed as

\begin{matrix} y (n) & = \frac{1}{N} \sum_{m = 0}^{N - 1} {\tilde{x}}_{m} (n) \\ = \frac{N - n}{N} x (n) + \frac{n}{N} x (n - N), n = 0, 1, \dots, N - 1 \end{matrix}

(34)

Accordingly, the DFT result Y(k) of y(n) is

\begin{matrix} Y (k) = \frac{1}{N} \sum_{n = 0}^{N - 1} [\frac{N - n}{N} x (n) + \frac{n}{N} x (n - N)] e^{- j \frac{2 π}{N} n k}, \\ k = 0, 1, \dots, N - 1 \end{matrix}

(35)

Clearly, Equation (35) only involves one time of DFT operation, thus greatly reducing computation complexity. Accordingly, its simplified dataflow is illustrated in Figure 1 (take

N = 4

as an example), from which a low-complexity procedure of multi-tone phase estimation can be summarized as follows.

Firstly, weight the input (

2 N - 1

)-length data sequence

[x (- N

+

1)

, \dots, x (0), \dots, x (N

− 1

)]

with one (

2 N

− 1)-length triangular window

[1 / N, \dots, (N

−

1) / N, 1, (N

−

1) / N, \dots, 1 / N]

;

Secondly, (N − 1) weighted data pairs (in each pair, the two data are spaced with N samples) are individually summed up to generate (N − 1) data

y (1), \dots, y (N

− 1), except the center sample

x (0)

due to

y (0) = x (0)

;

Thirdly, implement normalized DFT on the data sequence [

y (0), y (1), \dots, y (N

− 1)] to provide the final spectral result

Y

= [

Y (0), Y (1), \dots, Y (N

− 1)].

Lastly, collect all the peak indices

k_{1}, \dots, k_{Q}

of

Y (k)

. For each index

k_{q}

, directly taking the phase value of

Y (k_{q})

provides its phase estimates

{\hat{θ}}_{q}

.

4. Variance Analysis of Phase Estimation in Noisy Circumstances

4.1. CRLB for Conventional DFT Phase Estimator

Now we consider the phase estimation in noisy case, in which one random complex Gaussian process

η (n)

should be considered in (1), i.e.,

\begin{matrix} s (n) = x (n) + η (n) = a_{0} e^{j (ω_{0} n + θ_{0})} + η (n), \end{matrix}

(36)

where

n = - N + 1, \dots, N - 1

,

ω_{0} = β Δ ω = (k_{0} + δ) 2 π / N

, and

η (n)

is a complex Gaussian variable with mean zero and variance

σ^{2}

. As mentioned in (2), the conventional phase estimate is

\begin{matrix} {\hat{θ}}_{0} & = φ_{X} (k_{0}) - (1 - 1 / N) (β - k_{0}) π \\ = φ_{X} (k_{0}) - (1 - 1 / N) π δ . \end{matrix}

(37)

(37) indicates that, since the conventional DFT-based phase estimator relies on the frequency offset

δ

, the estimate error of frequency offset

δ

will propagate to the phase estimate. In fact, this dependency makes phase estimation obey a 3-parameter mathematical model parameterized with

α

= [

ω_{0}, θ_{0}, a_{0}]^{T}

. With regard to this model, previous studies [26,27] have derived a CRLB (Crammer-rao lower bound) for the variance of the phase estimate (consuming

2 N - 1

samples) as

C R L B_{3} (θ_{0}) = \frac{4 N - 3}{2 N (2 N - 1) ρ},

(38)

where

ρ = a_{0}^{2} / 2 σ^{2}

is the SNR (Signal to Noise Ratio). Constrained by CRLB

_{3}

(

θ_{0}

) in (38), the error variance of phase estimate obtained by any conventional DFT-based estimator cannot exceed this bound. This has been especially claimed for algorithms in [1,2,5,6,7,8,9,27,28,29].

4.2. CRLB for the Proposed Phase Estimator

Different from conventional 3-parameter model-based phase estimators, mathematical model for the proposed corrected DFT-based phase estimator can be simplified. The reasons are as three aspects.

Firstly, (29) implies that the proposed phase detector only requires roughly searching the peak spectral bin and then taking the phases directly, i.e., independent of frequency offset

δ

.

Secondly, as previously mentioned, the proposed fully-traversed DFT spectrum equals the average of N sub DFT spectra. This actually reflects the following mechanism: averaging N sub vectors plays the role of compensating the angles of N sub DFT spectra with each other, which leads the synthesized phase to automatically fall at the ideal phase value, whether the frequency offset

δ = 0

or not.

Lastly, as previously mentioned, the proposed phase estimator can directly determine the ideal phase

θ_{0}

, referring to the ‘instantaneous phase’ of the center sample among the

2 N - 1

input samples

x (- N + 1) \sim x (N - 1)

. Obviously, the position for the center sample is at

n = 0

, which can be easily determined in advance. Thus, if we rewrite

x (n) = a_{0} e^{j (ω_{0} n + θ_{0})}

as

a_{0} e^{j φ_{n}}

, then, for the position

n = 0

, the entire term

φ_{n}

equals the ideal phase

θ_{0}

, which also indicates that estimation of frequency

ω_{0}

can be omitted.

For the above 3 reasons, the error variance of fully-traversed DFT phase estimator obeys a 2-parameter mathematical model with

α

= [

θ_{0}

,

a_{0}

]

^{T}

, in which the estimate of frequency offset

δ

is bypassed. Now we deduce the CRLB of this 2-parameter model using the classical parameter estimation theory [27].

Since

η (n)

is a Gaussian noise, the joint probability density function (pdf) of the

(2 N - 1)

-length observation sequence

S

= [

s_{- N + 1}, \dots, s_{0}, \dots, s_{N - 1}

]

^{T}

conditioned on the unknown vector

α

= [

θ_{0}, a_{0}

]

^{T}

is given by [26]

\begin{matrix} f (S | α) = {(\frac{1}{2 π σ^{2}})}^{\frac{2 N - 1}{2}} exp (- \frac{1}{2 σ^{2}} \sum_{n = - N + 1}^{N - 1} {(s_{n} - x_{n})}^{2}) \end{matrix}

(39)

From (39), we can derive a 2 × 2 Fisher information matrix

J

whose entries are

\begin{matrix} J_{i j} & = - E \{\frac{\partial In f (S | α)}{\partial α_{i} \partial α_{j}}\} \\ = \frac{1}{σ^{2}} \sum_{n = - N + 1}^{N - 1} \frac{\partial x_{n}}{\partial α_{i}} \cdot \frac{\partial x_{n}}{\partial α_{j}}, i, j = 1, 2 . \end{matrix}

(40)

Since

x (n) = u (n) + j v (n) = a_{0}

cos

(ω_{0} n + θ_{0}) +

j

a_{0}

sin

(ω_{0} n + θ_{0})

, (40) can be further expressed as

\begin{matrix} J_{i j} = \frac{1}{σ^{2}} \sum_{n = - N + 1}^{N - 1} (\frac{\partial u_{n}}{\partial α_{i}} \cdot \frac{\partial u_{n}}{\partial α_{j}} + \frac{\partial v_{n}}{\partial α_{i}} \cdot \frac{\partial v_{n}}{\partial α_{j}}), i, j = 1, 2 \end{matrix}

(41)

with

\frac{\partial u_{n}}{\partial α_{1}} = \frac{\partial u_{n}}{\partial θ_{0}} = - a_{0} sin (ω_{0} n + θ_{0}),

(42)

\frac{\partial u_{n}}{\partial α_{2}} = \frac{\partial u_{n}}{\partial a_{0}} = cos (ω_{0} n + θ_{0}),

(43)

\frac{\partial v_{n}}{\partial α_{1}} = \frac{\partial v_{n}}{\partial θ_{0}} = a_{0} cos (ω_{0} n + θ_{0}),

(44)

\frac{\partial v_{n}}{\partial α_{2}} = \frac{\partial v_{n}}{\partial a_{0}} = sin (ω_{0} n + θ_{0}) .

(45)

Substituting (42)∼(45) into (41) yields

\begin{matrix} J_{11} & = \frac{{a_{0}}^{2}}{σ^{2}} \sum_{n = - N + 1}^{N - 1} ({sin}^{2} (ω_{0} n + θ_{0}) + {cos}^{2} (ω_{0} n + θ_{0})) \\ = \frac{{a_{0}}^{2}}{σ^{2}} (2 N - 1), \end{matrix}

(46)

J_{12} = J_{21} = 0,

(47)

\begin{matrix} J_{22} & = \frac{1}{σ^{2}} \sum_{n = - N + 1}^{N - 1} ({sin}^{2} (ω_{0} n + θ_{0}) + {cos}^{2} (ω_{0} n + θ_{0})) \\ = \frac{1}{σ^{2}} (2 N - 1) . \end{matrix}

(48)

Thus the 2 × 2 Fisher information matrix

J

takes the following diagonal form

\begin{matrix} J = \frac{1}{σ^{2}} [\begin{matrix} (2 N - 1) a_{0}^{2} & 0 \\ 0 & 2 N - 1 \end{matrix}], \end{matrix}

(49)

and has the inverse as

\begin{matrix} J^{- 1} = σ^{2} [\begin{matrix} \frac{1}{(2 N - 1) a_{0}^{2}} & 0 \\ 0 & \frac{1}{(2 N - 1) a_{0}^{2}} \end{matrix}] . \end{matrix}

(50)

Thus, the CRLB for the error variance of the proposed phase estimator is

C R L B_{2} (θ_{0}) = \frac{σ^{2}}{(2 N - 1) a_{0}^{2}} = \frac{1}{2 (2 N - 1) ρ}

(51)

Since the two CRLBs in (38) and (51) share a same sample length

(2 N - 1)

, their ratio is

\begin{matrix} \frac{C R L B_{2} (θ_{0})}{C R L B_{3} (θ_{0})} = \frac{N}{(4 N - 3)} \approx \frac{1}{4} . \end{matrix}

(52)

Hence, the CRLB for 2-parameter joint estimation is only 25% of that for 3-parameter joint estimation.

4.3. Numerical Results

To further verify the superiority of the proposed phase estimator, simulations performed under various noisy conditions and different spectral orders are presented. The phase error variance of the proposed estimator was also compared with that of conventional DFT-based estimator (we choose the ratio method based on interpolated DFT [2]). Assume

k_{0} = 3

,

N = 128

and

θ_{0} = 60^{\circ}

. Then, the specific signal based on (36) is

\begin{matrix} s (n) = a_{0} e^{j [(3 + δ) \times \frac{2 π}{N} n + \frac{π}{3}]} + η (n), \end{matrix}

(53)

where the frequency offset

δ

is specified with 3 values:

0.1, 0.2, 0.3

. Figure 2a–c gives the error variance curves versus SNRs for these 3 frequency offsets, respectively. For each SNR and

δ

case, 500 Monte Carlo trials were conducted.

As can be seen in each figure, the majority of phase corrected DFT’s error variance curve (marked in ‘∗’) lies below CRLB

_{3}

, proving that the proposed phase estimator is independent of the conventional 3-parameter joint estimation model. Furthermore, the proposed estimator’s error variance curve is bounded by CRLB

_{2}

curve, verifying the correctness of (51). Figure 2 also demonstrates that the error variances of the conventional interpolated DFT estimator are nearly one order of magnitude higher than that of the proposed estimator.

5. Applying Fully-Traversed DFT in Phase-Coded SSVEP-BCI

Recently, BCIs have become a very hot topic in neural engineering. A BCI detects an user’s ongoing brain activities and translates them into meaningful messages, which helps patients with severe motor disabilities to express their messages to external world [30]. In particular, BCI based steady-state visual evoked potential (SSVEP) has received much attention in bioengineering research due to its satisfactory performance [31]. To increase the number of recognizable targets, the phase-coded SSVEP-BCIs use phase information to encode subject’s visual intention. In this system, the phase-tagged visual stimuli are characterized with flashing at one frequency but different phases, resulting in that subjects’ SSVEPs also differ in phase features. Therefore, through extracting the phase information of SSVEP potentials, a computer is able to distinguish which flicker the subject desires to select.

5.1. Experiment Paradigm

In general, SSVEP is always elicited after some latency time L (or labeled as lag phase ‘

θ_{L}

’), which actually corresponds to a phase difference between flicker’s phase ‘

θ_{F}

’ and SSVEP’s phase ‘

θ_{S}

’. Hence, the relationship between the phase difference ‘

θ_{L}

’ and latency L is described by

θ_{S} = θ_{F} + θ_{L}

(54)

θ_{L} = - (L \times 360 \times f_{s} - q \times 360)

(55)

where

f_{s}

, q denote the stimulus frequency and the integer cycles, respectively. Under normal condition, L is stable in a short period of time but differs in inter-subject such that

θ_{L}

cannot be calculated in advance [21,23,24]. From (54), it can be found that the flicker’s phase is usually not equal to SSVEP’s phase (or

θ_{F} \neq θ_{S}

) due to

θ_{L}

, which means that we cannot directly use

θ_{s}

to identify which flicker the subject desires to select if

θ_{L}

is unknown. As a result, an additional measure of phase calibration is necessary for phase-coded SSVEP-BCI to calibrate this error

θ_{L}

in the detection algorithm.

Hence, we build up a BCI system which uses a half-field phase-tagged stimulus to evoke the SSVEP with two different frequency components

f_{1}

,

f_{2}

at the same time. This system does not adopt any phase calibration since it is able to identify the flicker by introducing the phase difference instead of the phase under the assumption

f_{1} \neq f_{2}

(this frequency distinction makes a subject more sensitive to flickering stimuli than to those with the same frequency), i.e.,

\begin{matrix} θ_{S} (f_{1}) - θ_{S} (f_{2}) = θ_{F} (f_{1}) - θ_{F} (f_{2}) + θ_{L} (f_{1}) - θ_{L} (f_{2}) \end{matrix}

(56)

If

f_{1} \approx f_{2}

, then

θ_{L} (f_{1}) \approx θ_{L} (f_{2})

. Therefore, the difference between

θ_{S} (f_{1})

and

θ_{S} (f_{2})

approximately equals the difference between

θ_{F} (f_{1})

and

θ_{F} (f_{2})

. Hence, the lag phase difference can be removed.

In our phase-coded SSVEP-BCI system, a visual stimulator (ViewSonic, 22 inch, 120 Hz refresh rate,

1680 \times 1050

screen resolution) presenting two phase-tagged flickers (with the size 12 cm × 8 cm each) was used to evoke subjects’ SSVEPs (Figure 3 and Table 1).

It should be noted that, in our SSVEP-BCI illustrated in Figure 3, the selected flickering frequencies

f_{1}

and

f_{2}

should be as close as possible (this helps to remove the possible jump change of multiple of

360^{\circ}

for the lag phase difference between

θ_{L} (f_{1})

and

θ_{L} (f_{2})

, which was solved in [17] by means of a exhaustive search procedure based on least-square fitting). Otherwise, their lag phase difference

(θ_{L} (f_{1}) - θ_{L} (f_{2}))

would not be removed. However, due to the fact that all the stimulus frequencies in our SSVEP-BCI are acquired by integer dividing a fixed LCD display refresh frequency

F_{r}

= 120 Hz (see [32]), we can only obtain a limited number of flikering frequencies (they are 120/7 Hz, 120/8 Hz, 120/9 Hz, 120/10 Hz, 120/11 Hz) falling at the visual sensitive region (10 Hz, 20 Hz). Therefore, among these candidate flickering frequencies, the frequency pair with the minimum interval is (

f_{1}, f_{2}

) = (120/11 Hz, 120/10 Hz) = (10.9 Hz, 12 Hz). Obviously, in this case, the lag phase difference between

θ_{L} (f_{1})

and

θ_{L} (f_{2})

will not be removed as

f_{1}

and

f_{2}

are not close enough. Hence, this phase difference should be taken into account in order to achieve an accurate result. In practice, it can be roughly estimated according to their empirical results of the apparent latency of SSVEPs [24]. In our experiment,

(θ_{L} (f_{1}) - θ_{L} (f_{2}))

cam be roughly estimated as

36^{\circ}

.

Different from the well-known CCA (Canonical Correlation Analysis) method, which also uses phase information to enhance the classification accuracy of SSVEPs, our proposed scheme consumes lower hardware cost. As [17] pointed out, the CCA method needs multiple stimulus frequencies (6 frequencies were adopted) to remove the possible jump change of multiple of

360^{\circ}

for the lag phase difference. In contrast, our proposed scheme only employs 2 frequencies, thereby lowering the hard cost.

Three subjects (S1∼S3, two males and one female) were seated on a comfortable chair before the visual stimulator in an illuminated room. The subjects’ EEG signals were recorded by a g.USBamp EEG amplifier from 13 electrodes (PO

_{3}

, PO

_{5}

, PO

_{7}

, POZ, PO

_{4}

, PO

_{6}

, PO

_{8}

, P1, PZ, P2, O1, Oz, and O

_{2}

). Specify the sampling rate

F_{s} = 600

samples/s. This experiment consisted of 5 runs containing 10 trials each. Each trial lasted for 8 s. Subjects were instructed to focus on one of flickers according to the following paradigm: From 0 to 2 s a cue appeared indicating which flashing flicker was required to focus on; From 2 s to 8 s the subjects gazed at the specified flicker; Then the next trial started. The order of gazed-flickers was ‘1212121212’ in each run. Thus this dataset had 25 trials for each flicker. The whole experiment lasted about 30 minutes. During this experiment, the subjects’ EEG signals were recorded for offline analysis later.

5.2. Procedure of SSVEP Phase Extraction

We collected a total amount of 50 trials for each subject. Basically, the procedure of SSVEP phase extraction would contain the following steps:

step 1.: Apply both conventional DFT and phase corrected DFT to extract the phase values $θ_{S}$ ( $f_{1}$ ) and $θ_{S} (f_{2})$ , respectively;
step 2.: Substitute $θ_{S} (f_{1}) - θ_{S} (f_{2})$ and the estimate $(θ_{L} (f_{1}) - θ_{L} (f_{2}))$ = $36^{\circ}$ into (56) to estimate the difference $Δ \hat{θ}$ = $θ_{F} (f_{1}) - θ_{F} (f_{2})$ ;
step 3.: Use the estimate $Δ \hat{θ}$ to identify the gazed-flicker. If $Δ \hat{θ}$ is close to 0 (or 180) deg, then the gazed-flicker is judged as flicker 1 (or 2).

The judgement involved in step 3 can actually be extended to distinguish C targets (

C \geq 2

) by finding the maximum among C decision variables

R_{k}

as

R_{k} = cos (Δ \hat{θ} - φ_{k}), k = 1, \dots, C

(57)

where

φ_{k}

refers to the ideal phase difference

(θ_{F} (f_{1}) - θ_{F} (f_{2}))

, i.e., the ideal clustering center of pattern recognition. For the above 2-category recognition problem, we have

φ_{1} = 0^{\circ}

,

φ_{2} = 180^{\circ}

. From (57), it can be found if the detected phase difference

Δ \hat{θ} \to φ_{k}

, then

R_{k} \to 1

. In other words, if

R_{1}

is close to 1, the gazed-flicker is judged as flicker 1 and vice versa. Furthermore, (57) also allows to assume more clustering centers. As will be elaborated, introducing more assumed clustering centers helps to evaluate the performance of phase estimator.

5.3. Result of Offline Analysis

The classification rates of this 2-class experiment are listed in Table 2.

It can be found that the proposed phase estimator can achieve a higher classification rate than conventional DFT-based estimator does, which is entirely over 10%.

Table 3 lists not only these two estimators’ detected phase values (

θ_{s}

(12),

θ_{s}

(10.9)) but also 4 columns of phase decision values

R_{k} (j), k = 1, 2, 3, 4

, calculated by (57). Here decision variable

R_{k} (j)

corresponds to the kth assumed clustering center (4 assumed phase clustering centers are

0^{\circ}, 180^{\circ}, 90^{\circ}, 270^{\circ}

, respectively) while the subject actually gazes at the flicker j. Therefore, for any flicker index

j (j = 1, 2)

,

R_{j} (j)

(marked with shadow in Table 3) is close to 1 and tends to be the maximum among

R_{k} (j), k = 1, 2, 3, 4

, if the accuracy of selected phase estimator is high enough. The data for all trials involved in Table 3 are recorded within 4 s.

In Table 3, generally speaking, the detected phases

θ_{S} (10.9)

by the proposed phase estimator are closer to the ideal phase ‘

0^{\circ}

’ than that of conventional DFT estimator. This result can be explained as follows: Since the sampling rate

F_{s} = 600

samples/s and the window length equals 4s, it follows that

N = 2400

samples are recorded and thus DFT frequency resolution

Δ f = F_{s} / N = 0.25

Hz. Hence, the stimulus frequency

f_{1} = 10.9

Hz can also be written as

f_{1} = 43.6 Δ f = (44 - 0.4) Δ f

, i.e., the frequency offset

δ

equals nonzero value

- 0.4

, indicating that severe spectral leakage causing large phase measurement error will arise in the DFT spectrum. In contrast to this, for our proposed phase estimator, due to the property of ‘direct phase extraction’, the phase measurement error caused by frequency offset is much smaller than that of the conventional DFT case, as Table 3 lists.

Different from the case of

θ_{S} (10.9)

, both estimators’ detected phases

θ_{S} (12)

are uniformly close to the ideal phase

0^{\circ}

(or

180^{\circ}

) (Even both mean and standard deviation are similar). This is because

f_{2} = 12

Hz

= 48 Δ f

, i.e., the frequency offset

δ = 0

, resulting in that spectral leakage will not appear in both conventional DFT spectra and corrected DFT spectra.

Hence, since SSVEP phase extraction in this experiment is based on the phase difference between

θ_{S} (10.9)

and

θ_{S} (12)

rather than either of them, the conventional DFT phase estimator is more likely to cause large errors than the proposed phase estimator. From Table 3, one can notice that, for the conventional DFT phase estimator, the phase differences between

θ_{S} (10.9)

and

θ_{S} (12)

are far away from 36 deg (see the results in 3-rd, 7-th and 11-th row, respectively). Accordingly, the expected maximum decision values

R_{1} (1)

or

R_{2} (2)

detected by conventional DFT estimator are entirely smaller than that detected by the proposed estimator, as Table 3 lists. This reflects the fact that the conventional DFT is not so good as the proposed fully-traversed DFT in extracting the phase information of SSVEP.

To further evaluate the performance of the fully-traversed DFT in this experiment, as previously mentioned, we assume that there were 4 flickers with stimulus phases

0^{\circ}, 180^{\circ}, 90^{\circ}, 270^{\circ}

on screen. Hence, the gazed-flicker was identified by finding the maximum among

R_{1}

,

R_{2}

,

R_{3}

,

R_{4}

. Moreover, the average classification rate for these 4 assumed flickers are also listed in Table 4. In this simulated 4-category experiment, compared to the 2-category case in Table 2, the classification rate of the fully-traversed DFT decreases but still over 80%. In contrast, the classification rate of conventional DFT is only about 37%, as Table 4 lists. The underlying reason lies in that conventional DFT cannot accurately extract the phase information at

10.9

Hz due to spectral leakage, while the proposed phase corrected DFT still works well for its property of ‘direct phase extraction’.

In summary, fully-traversed DFT is well suitable for the phase-coded SSVEP-BCI, especially in our proposed duel-frequency stimulus-based system which uses phase difference to recognize the gazed flick. Due to the fact that fully-traversed DFT does well in extracting the phase information at any frequency offset, the proposed phase estimator can achieve a higher classification rate. More importantly, as listed in Table 3, we find that the corresponding phase’s standard deviation of the proposed estimator is smaller than that of conventional DFT estimator, although the frequency resolution of fully-traversed DFT is less than DFT. The reason is actually mentioned in Section 2, i.e., the mechanism that fully-traversed DFT is the average of N DFT sub-spectra (see formula (18)) also helps to reduce the averaged noise’s power.

Based on our preliminary results, we believe that fully-traversed DFT can be applied in the online system. Although the simulated 4-category experiment results show that the corresponding average classification rate is only 80%, the classification rate is surely to be further enhanced once the estimate of the lag phase difference is more accurate. In this experiment, we only roughly estimate it as 36 deg. In fact, we can use another two very close frequencies (for example, we can replace LCD stimuli with LED stimuli) such that the lag phase difference will be close to zero and thus we do not need to estimate it. Moreover, it was also clearly found that the phase variance of fully-traversed DFT estimator does not get worse than conventional DFT under our experiment condition.

6. Conclusions

A novel phase estimator based on corrected-phase DFT was proposed in this paper. Due to considering all possible truncated sequences containing the center sample, spectral leakage in corrected-phase DFT is greatly reduced and thus the instantaneous phase information of the center sample can be directly extracted. In addition, we have also proved that the process of corrected-phase DFT is equivalent to a streamline dataflow, and thus the proposed phase estimator has a relatively low computational complexity. Furthermore, the phase error variance of the proposed phase estimator follows a 2-parameterized mathematical model and thus it has a higher accuracy than the conventional DFT-based phase estimators in noisy circumstances.

We also applied the proposed phase estimators to phase-coded SSVEP-BCIs. In particular, compared to the conventional DFT-based estimators, our offline experiment results demonstrate that the fully-traversed DFT does better in extracting the phase information of phase-coded SSVEP-BCI. Moreover, the proposed phase estimator imposes on restrictions on the relationship between the sampling rates and the stimulus frequencies (i.e., non-synchronous sampling is allowed), it is of wider applications in phase-coded SSVEP BCIs than the existing estimators. Our future work is to improve our system design and apply fully-traversed DFT in the practical SSVEP-BCI online system.

Author Contributions

Conceptualization and Methodology, X.H. and J.X.; Writing-Original Draft Preparation, Z.W.; Writing-Review and Editing, J.X.; Software and Validation, J.X. and X.H.; Supervision, Z.W.

Funding

This research was funded by National Natural Science Foundation of China, grant number 61671012.

Conflicts of Interest

The authors declare no conflict of interest.

References

Rice, F.; Rice, M.; Cowley, B. A new bound and algorithm for Star 16-QAM carrier phase estimation. IEEE Trans. Commun. 2003, 51, 161–165. [Google Scholar] [CrossRef]
Andria, G.; Savino, M.; Trotta, A. Windows and interpolation algorithms to improve electrical measurement accuracy. IEEE Trans. Instrum. Meas. 1989, 38, 856–863. [Google Scholar] [CrossRef]
Abe, T.; Honda, M. Sinusoidal model based on instantaneous frequency attractors. IEEE Trans. Audio Speech Lang. Process. 2006, 14, 1292–1300. [Google Scholar] [CrossRef]
Dach, R.; Schildknecht, T.; Springer, T.; Dudle, G.; Prost, L. Continuous time transfer using GPS carrier phase. IEEE Trans. Ultrason. Ferroelectr. Freq. Control 2002, 49, 1480–1490. [Google Scholar] [CrossRef] [PubMed]
Liguori, C.; Paolillo, A.; Pignotti, A. Estimation of signal parameters in frequency domain in presence of harmonic interference: A comparative analysis. IEEE Trans. Instrum. Meas. 2006, 55, 562–569. [Google Scholar] [CrossRef]
Offelli, C.; Petri, D. A frequency-domain procedure for accurate real–time signal parameter measurement. IEEE Trans. Instrum. Meas. 1990, 39, 363–368. [Google Scholar] [CrossRef]
Offelli, C.; Petri, D. Interpolation techniques for real-time multifrequency waveform analysis. In Proceedings of the Conference Record, 6th IEEE, IMTC-89 Instrumentation and Measurement Technology Conference, Washington, DC, USA, 25–27 April 1989; pp. 325–331. [Google Scholar] [CrossRef]
Schoukens, J.; Pintelon, R.; Van Hamme, H. The interpolated fast Fourier transform: A comparative study. IEEE Trans. Instrum. Meas. 1992, 41, 226–232. [Google Scholar] [CrossRef]
Agrez, D. Weighted multipoint interpolated DFT to improve amplitude estimation of multifrequency signal. IEEE Trans. Instrum. Meas. 2002, 51, 287–292. [Google Scholar] [CrossRef]
Provencher, S. Estimation of Complex Single-Tone Parameters in the DFT Domain. IEEE Trans. Signal Process. 2010, 58, 3879–3883. [Google Scholar] [CrossRef]
Jacobsen, E.; Kootsookos, P. Fast, Accurate Frequency Estimators [DSP Tips Tricks]. IEEE Signal Process. Mag. 2007, 24, 123–125. [Google Scholar] [CrossRef]
Candan, C. Analysis and Further Improvement of Fine Resolution Frequency Estimation Method From Three DFT Samples. IEEE Signal Process. Lett. 2013, 20, 913–916. [Google Scholar] [CrossRef]
Huang, X.; Xia, X.G. A Fine Resolution Frequency Estimator Based on Double Sub-segment Phase Difference. IEEE Signal Process. Lett. 2015, 22, 1055–1059. [Google Scholar] [CrossRef]
Betta, G.; Liguori, C.; Pietrosanto, A. Propagation of uncertainty in a discrete Fourier transform algorithm. Measurement 2000, 27, 231–239. [Google Scholar] [CrossRef]
Novotny, M.; Slepicka, D.; Sedlacek, M. Uncertainty Analysis of the RMS Value and Phase in the Frequency Domain by Noncoherent Sampling. IEEE Trans. Instrum. Meas. 2007, 56, 983–989. [Google Scholar] [CrossRef]
Agrez, D. Improving phase estimation with leakage minimization. IEEE Trans. Instrum. Meas. 2005, 54, 1347–1353. [Google Scholar] [CrossRef]
Ke, L.; Wang, Y.; Gao, X. Time-frequency joint coding method for boosting information transfer rate in an SSVEP based BCI system. In Proceedings of the International Conference of the IEEE Engineering in Medicine and Biology Society, Orlando, FL, USA, 16–20 August 2016; pp. 5873–5876. [Google Scholar]
Youssef, A.A.A.; Wittevrongel, B.; Van Hulle, M.M. Accurate Decoding of Short, Phase-Encoded SSVEPs. Sensors 2018, 18, 794. [Google Scholar] [CrossRef]
Zhao, X.; Zhao, D.; Wang, X.; Hou, X. A SSVEP Stimuli Encoding Method Using Trinary Frequency-Shift Keying Encoded SSVEP (TFSK-SSVEP). Front. Hum. Neurosci. 2017, 11, 278. [Google Scholar] [CrossRef] [PubMed]
Lee, P.L.; Sie, J.J.; Liu, Y.J.; Wu, C.H.; Lee, M.H.; Shu, C.H.; Li, P.H.; Sun, C.W.; Shyu, K.K. An SSVEP-Actuated Brain Computer Interface Using Phase-Tagged Flickering Sequences: A Cursor System. Ann. Biomed. Eng. 2010, 38, 2383–2397. [Google Scholar] [CrossRef]
Wu, H.Y.; Lee, P.L.; Chang, H.C.; Hsieh, J.C. Accounting for Phase Drifts in SSVEP-Based BCIs by Means of Biphasic Stimulation. IEEE Trans. Biomed. Eng. 2011, 58, 1394–1402. [Google Scholar] [CrossRef]
Chang, H.C.; Lee, P.L.; Lo, M.T.; Lee, I.H.; Yeh, T.K.; Chang, C.Y. Independence of Amplitude-Frequency and Phase Calibrations in an SSVEP-Based BCI Using Stepping Delay Flickering Sequences. IEEE Trans. Neural Syst. Rehabil. Eng. 2012, 20, 305–312. [Google Scholar] [CrossRef]
Jia, C.; Gao, X.; Hong, B.; Gao, S. Frequency and Phase Mixed Coding in SSVEP-Based Brain–Computer Interface. IEEE Trans. Biomed. Eng. 2011, 58, 200–206. [Google Scholar] [CrossRef] [PubMed]
Pan, J.; Gao, X.; Duan, F.; Yan, Z.; Gao, S. Enhancing the classification accuracy of steady-state visual evoked potential-based brain-computer interfaces using phase constrained canonical correlation analysis. J. Neural Eng. 2011, 8, 036027. [Google Scholar] [CrossRef] [PubMed]
Li, Y.; Bin, G.; Gao, X.; Hong, B.; Gao, S. Analysis of phase coding SSVEP based on canonical correlation analysis (CCA). In Proceedings of the 2011 5th International IEEE/EMBS Conference on Neural Engineering (NER), Cancun, Mexico, 27 April–1 May 2011; pp. 368–371. [Google Scholar]
Rife, D.; Boorstyn, R. Single tone parameter estimation from discrete-time observations. IEEE Trans. Inf. Theory 1974, 20, 591–598. [Google Scholar] [CrossRef]
Kay, M. Fundamentals of Statistical signal processing, Volume 2: Detection theory. In Blind Equalization and System Identification; Springer: London, UK, 1998; pp. 83–182. [Google Scholar]
Reisenfeld, S.; Aboutanios, E. A new algorithm for the estimation of the frequency of a complex exponential in additive Gaussian noise. IEEE Commun. Lett. 2003, 7, 549–551. [Google Scholar] [CrossRef]
Zhu, L.; Song, X.; Li, H.; Ding, H. High accuracy estimation of multi-frequency signal parameters by improved phase linear regression. Signal Process. 2007, 85, 1066–1077. [Google Scholar] [CrossRef]
Wolpaw, J.R.; Birbaumer, N.; McFarland, D.J. Brain computer interfaces for communication and control. Clin. Neurophysiol. 2002, 113, 767–791. [Google Scholar] [CrossRef]
Wang, Y.; Gao, X.; Hong, B.; Jia, C.; Gao, S. Brain-Computer Interfaces Based on Visual Evoked Potentials. IEEE Eng. Med. Biol. Mag. 2008, 27, 64–71. [Google Scholar] [CrossRef]
Wong, C.M.; Wang, B.; Wan, F.; Mak, P.U.; Mak, P.I.; Vai, M.I. An improved phase-tagged stimuli generation method in steady-state visual evoked potential based brain-computer interface. In Proceedings of the 2010 3rd International Conference on Biomedical Engineering and Informatics (BMEI), Yantai, China, 16–18 October 2010; Volume 2, pp. 745–749. [Google Scholar]

Figure 1. Simplified flow diagram of phase corrected DFT (N = 4).

Figure 2. Error variances of the phase estimator with (a)

δ = 0.1

, (b)

δ = 0.2

and (c)

δ = 0.3

.

Figure 2. Error variances of the phase estimator with (a)

δ = 0.1

, (b)

δ = 0.2

and (c)

δ = 0.3

.

Figure 3. Visual stimulator presenting two half-field phase-tagged stimuli. (Left field of Flicker 1: 10.9 Hz and 0 deg, Right field of Flicker 1: 12 Hz and 0 deg, Left field of Flicker 2: 10.9 Hz and 0 deg, Right field of Flicker 2: 12 Hz and 180 deg).

Table 1. The parameters of two phase-tagged half-field flickers.

	Left-Field		Right-Field
	Freq. (Hz)	Phase (deg)	Freq. (Hz)	Phase (deg)
Flicker 1	10.9	0	12	0
Flicker 2	10.9	0	12	180

Table 2. The classification rate of each subject under different window length.

Subject	Method	Electrode	Window Length (sec)				Average
Subject	Method	Electrode	3	4	5	6	Average
S1	Proposed	POZ	0.94	1.00	1.00	0.98	0.98
S1	DFT	PO7	0.92	0.88	0.78	0.78	0.84
S2	Proposed	O2	0.98	1.00	1.00	1.00	1.00
S2	DFT	P2	0.92	0.88	0.84	0.64	0.82
S3	Proposed	P1	0.80	0.96	0.92	0.94	0.91
S3	DFT	PZ	0.92	0.94	0.88	0.74	0.87
Average	Proposed	–	0.91	0.99	0.97	0.97	0.96
Average	DFT	–	0.92	0.90	0.83	0.73	0.84

Table 3. The phase and phase feature

R_{j}

of each subject (mean ± S.D.)

Table 3. The phase and phase feature

R_{j}

of each subject (mean ± S.D.)

Subject	Method	Flicker j	$θ_{S}$ (12) (deg)	$θ_{S}$ (10.9) (deg)	$R_{1}$ (j)	$R_{2}$ (j)	$R_{3}$ (j)	$R_{4}$ (j)
S1	Proposed	1	322.74 ± 29.3	6.84 ± 31.4	0.985 ± 0.03	0.133 ± 0.11	0.741 ± 0.1	0.651 ± 0.13
	Proposed	2	126.52 ± 23.1	2.25 ± 27.2	0.253 ± 0.16	0.954 ± 0.05	0.56 ± 0.18	0.789 ± 0.18
	DFT	1	329.22 ± 39.3	57.09 ± 38.0	0.857 ± 0.17	0.416 ± 0.26	0.896 ± 0.1	0.378 ± 0.22
	DFT	2	151.46 ± 31.9	49.97 ± 34.1	0.422 ± 0.22	0.865 ± 0.16	0.445 ± 0.23	0.848 ± 0.19
S2	Proposed	1	334.22 ± 36.1	23.10 ± 35.5	0.983 ± 0.03	0.132 ± 0.12	0.771 ± 0.09	0.62 ± 0.12
	Proposed	2	146.96 ± 31.0	17.47 ± 35.7	0.193 ± 0.13	0.972 ± 0.04	0.601 ± 0.16	0.774 ± 0.12
	DFT	1	332.91 ± 32.2	72.68 ± 37.7	0.833 ± 0.11	0.511 ± 0.19	0.95 ± 0.06	0.25 ± 0.18
	DFT	2	148.24 ± 29.4	50.71 ± 28.8	0.421 ± 0.19	0.881 ± 0.12	0.362 ± 0.16	0.916 ± 0.07
S3	Proposed	1	15.04 ± 42.4	34.66 ± 43.2	0.899 ± 0.11	0.347 ± 0.25	0.538 ± 0.31	0.75 ± 0.24
	Proposed	2	198.30 ± 26.0	24.81 ± 35.5	0.339 ± 0.21	0.913 ± 0.09	0.822 ± 0.19	0.471 ± 0.27
	DFT	1	9.71 ± 39.0	73.81 ± 36.8	0.899 ± 0.13	0.349 ± 0.24	0.794 ± 0.22	0.508 ± 0.27
	DFT	2	198.94 ± 24.5	81.04 ± 38.3	0.323 ± 0.26	0.892 ± 0.19	0.522 ± 0.22	0.814 ± 0.15
Average	Proposed	1	–	–	0.956 ± 0.06	0.204 ± 0.16	0.683 ± 0.17	0.674 ± 0.17
	Proposed	2	–	–	0.262 ± 0.17	0.946 ± 0.06	0.661 ± 0.18	0.678 ± 0.19
	DFT	1	–	–	0.863 ± 0.14	0.425 ± 0.23	0.88 ± 0.13	0.379 ± 0.22
	DFT	2	–	–	0.389 ± 0.22	0.879 ± 0.16	0.443 ± 0.2	0.859 ± 0.14

Table 4. The classification rate of each subject under different window length (Simulated 4-classes experiment).

Subject	Method	Electrode	Window Length (sec)				Average
Subject	Method	Electrode	3	4	5	6	Average
S1	Proposed	POZ	0.80	0.90	0.90	0.84	0.86
S1	DFT	PO7	0.67	0.43	0.18	0.18	0.37
S2	Proposed	O2	0.92	0.90	0.98	0.90	0.93
S2	DFT	P2	0.54	0.28	0.16	0.06	0.26
S3	Proposed	P1	0.54	0.58	0.68	0.74	0.64
S3	DFT	PZ	0.56	0.62	0.46	0.30	0.49
Average	Proposed	–	0.75	0.79	0.85	0.83	0.81
Average	DFT	–	0.59	0.44	0.27	0.18	0.37

© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Huang, X.; Xu, J.; Wang, Z. A Novel Instantaneous Phase Detection Approach and Its Application in SSVEP-Based Brain-Computer Interfaces. Sensors 2018, 18, 4334. https://doi.org/10.3390/s18124334

AMA Style

Huang X, Xu J, Wang Z. A Novel Instantaneous Phase Detection Approach and Its Application in SSVEP-Based Brain-Computer Interfaces. Sensors. 2018; 18(12):4334. https://doi.org/10.3390/s18124334

Chicago/Turabian Style

Huang, Xiangdong, Jingwen Xu, and Zheng Wang. 2018. "A Novel Instantaneous Phase Detection Approach and Its Application in SSVEP-Based Brain-Computer Interfaces" Sensors 18, no. 12: 4334. https://doi.org/10.3390/s18124334

APA Style

Huang, X., Xu, J., & Wang, Z. (2018). A Novel Instantaneous Phase Detection Approach and Its Application in SSVEP-Based Brain-Computer Interfaces. Sensors, 18(12), 4334. https://doi.org/10.3390/s18124334

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Novel Instantaneous Phase Detection Approach and Its Application in SSVEP-Based Brain-Computer Interfaces

Abstract

1. Introduction

2. Derivation of the Fully-Traversed DFT Spectrum

2.1. Phase Property of DFT Spectrum

2.2. The Proposed Fully-Traversed DFT Spectrum

3. Property of Phase Estimation in the Noiseless Circumstance

3.1. Single-Tone Case

3.2. Multi-Tone Case

3.3. Simplified Dataflow of Phase Estimation

4. Variance Analysis of Phase Estimation in Noisy Circumstances

4.1. CRLB for Conventional DFT Phase Estimator

4.2. CRLB for the Proposed Phase Estimator

4.3. Numerical Results

5. Applying Fully-Traversed DFT in Phase-Coded SSVEP-BCI

5.1. Experiment Paradigm

5.2. Procedure of SSVEP Phase Extraction

5.3. Result of Offline Analysis

6. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI