1. Introduction
In [1], a method was introduced for burst-mode symbol timing estimation and carrier phase estimation based upon the complex and real kurtosis, respectively, of the received signal. The method involves computing the kurtosis at several different parameter values (for both delay and phase) and is thus computationally expensive and more suited to offline computation than to real-time implementation or parameter tracking. In this paper, kurtosis-based methods are extended to algorithms that track timing and phase. The carrier estimation is also extended to include both carrier phase and carrier frequency offset. As tracking algorithms (rather than burst-mode algorithms, which obtain one estimate for an entire burst), these algorithms are potentially amenable to real-time tracking applications. In one development, tracking is accomplished by moving downhill on an objective function surface, without derivatives, in a manner analogous to the Nelder–Mead simplex algorithm in one dimension [2]. In another development, a gradient descent algorithm is employed for phase/frequency estimation. The gradient descent method has higher complexity than the non-derivative method but otherwise has similar performance. As shown in simulations, the kurtosis-based algorithms typically converge in fewer symbols than conventional PLL-type timing and phase tracking methods.
While [1] applied this kurtosis-based method only to QPSK constellations, the method is in fact agnostic with respect to the signal constellation. Conventional synchronization algorithms typically employ knowledge of the signal constellation using training symbols and/or in a decision-directed mode (see, e.g., [3]). In some settings, such as cognitive radio, in which an “intelligent receiver … adapt[s] itself to a specific transmission context and blindly estimate[s] the transmitter parameters for self-reconfiguration purposes” [4], signals with unknown signal constellations may be employed. It would be helpful to be able to perform symbol and carrier synchronization without knowledge of the constellation, after which constellation identification may be more readily undertaken. The algorithms presented here serve that purpose. In addition, many detection problems require the symbols to be appropriately scaled, which often requires the use of automatic gain control (AGC) loops as part of the synchronization process. It may be advantageous to decouple the AGC loop from symbol timing and phase estimation, which the kurtosis-based approach provides. The relatively fast convergence of the estimators may also make them useful in short-blocklength scenarios for low-latency communication systems.
The paper [1] did not contemplate the problem of residual carrier frequency offset, assuming instead that the Fourier transform-based approach to removing carrier offset is completely effective. In this paper, we account for residual carrier frequency offset.
Since kurtosis involves fourth powers of the data, outliers can have a significant effect that can lead an estimate astray or can result in higher variance of the estimated symbols. Another contribution of this paper is to introduce the use of a Huber function [5] to make the estimation more robust.
Kurtosis-based estimation has also been used for blind source separation [6,7,8], where the kurtosis is used as a measure of non-Gaussianity [9] (Section 8.2). Negentropy is also used as a measure of non-Gaussianity, where the negentropy of a random variable y is defined as [9] (Section 8.3)
$$J(y) = H(y_{\mathrm{gauss}}) - H(y),$$
where $y_{\mathrm{gauss}}$ is a Gaussian random variable with the same covariance as y and $H(\cdot)$ denotes differential entropy. When the negentropy is approximated using higher-order cumulants, the negentropy can be expressed as
$$J(y) \approx \tfrac{1}{12}\,E[y^3]^2 + \tfrac{1}{48}\,\mathrm{kurt}(y)^2.$$
For a zero-mean variable with symmetric distribution, the negentropy is thus essentially equivalent to the kurtosis. This relationship between an entropy-related quantity and the kurtosis is what suggested this journal as a venue of publication.
Parameter estimation for communication systems has been widely studied. Textbooks on this topic include [10,11,12,13,14,15,16]. Parameter estimation for communication is also covered in conventional digital communication textbooks, e.g., [3,17,18]. See also [19]. What distinguishes this work from all of these references is the use of the complex and real kurtosis as the primary tool for adaptation toward the parameter estimates, which enables the parameter estimators to operate agnostic of the signal constellation. While there are some methods that can be applied without knowing the transmitted signal, such as taking powers of the received data to remove the digital symbol information, those methods work only with some constellations. By contrast, the kurtosis-based methods are more general. As shown below, they can converge quickly to parameter estimates, more quickly than, for example, PLL-based methods. This suggests the possibility of using kurtosis-based phase estimation for phase acquisition and, where the constellation is known, switching over to a PLL-type technique for tracking.
This paper focuses on single-carrier linearly modulated digital communication signals. Extension to multicarrier signals is a topic for future research.
2. Signal Model
Digital information is transmitted at a rate of $1/T_s$ symbols per second according to the complex bandpass representation
$$s(t) = \mathrm{Re}\left\{\sum_k a_k\, p(t - kT_s)\, e^{j2\pi f_c t}\right\},$$
where $p(t)$ is a unit-energy pulse-shaping function satisfying the Nyquist zero-ISI theorem (e.g., a square-root raised cosine, SRRC [3] (Appendix A)), $a_k$ is a complex point from the signal constellation, and $f_c$ is the carrier frequency. The pulse $p(t)$ is assumed to be symmetric so that the matched filter is the same as $p(t)$. The received signal is
$$r(t) = s(t - \tau) + w(t),$$
where $\tau$ is the delay resulting from transmission through the channel and $w(t)$ is noise, assumed to be zero mean. At the receiver, the signal is bandpass filtered and basebanded using a frequency $f_0$. The resulting complex (nearly) basebanded signal is denoted as
$$u(t) = \sum_k a_k\, p(t - kT_s - \tau)\, e^{j(\omega_0 t + \theta)} + w(t),$$
where $\omega_0 = 2\pi(f_c - f_0)$ is the residual carrier frequency offset and $\tau$ accounts for the time delay and changes in index reference at the receiver.
This signal is rotated by an estimate $\hat\omega_0$ of the offset frequency and passed through a matched filter with estimated delay $\hat\tau$ to produce the signal $y(t) = \big(u(t)\,e^{-j\hat\omega_0 t}\big) * p(-(t - \hat\tau))$, where * denotes convolution. The matched filter output can be expressed in terms of the pulse autocorrelation function as
$$y(t) = e^{j((\omega_0 - \hat\omega_0)t + \theta)} \sum_k a_k\, r_p(t - kT_s - \tau) + \nu(t), \tag{1}$$
where $\nu(t)$ represents the noise filtered through the matched filter and $r_p(t)$ is the pulse autocorrelation function
$$r_p(t) = \int p(s)\, p(s + t)\, ds.$$
The representation in (1) is accurate provided that the frequency offset does not exceed about 5% of the symbol rate [16] (this restriction was pointed out by a reviewer).
In modern practice, of course, the processing steps described above are implemented in discrete time, and filters must be truncated to finite length. The signal is sampled at a rate of P (an integer) samples per symbol. Using the sampling interval $T = T_s/P$, we write the basebanded sampled signal as $u[n] = u(nT)$. The pulse-shaping function is truncated to a finite duration, which we take to be $QT$, where Q is an odd integer and T is the sampling interval. The pulse-shaping function is thus represented by Q samples, $p[n]$. Different authors employ different conventions for the pulse-shaping function, representing it either as a noncausal signal, centered around 0, or as a causal signal. We use a notation that accommodates both conventions. If the pulse is centered around $n = 0$, then there are $(Q-1)/2$ samples before and after $n = 0$, and the peak of the sampled autocorrelation $r_p[n]$ is at $n = 0$. If $p[n]$ and its matched filter are causal, then the samples of interest of $p[n]$ occur at indices $n = 0, 1, \ldots, Q-1$. In this case, the peak of the autocorrelation function occurs at time $(Q-1)T$, corresponding to sample $Q-1$. In either case, let O (“offset”) be the index offset at which the peak sample of $r_p[n]$ occurs: $O = 0$ for the pulse centered around the origin and $O = Q-1$ for the causal pulse.
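As a concrete illustration of these conventions, the following sketch (hypothetical helper names; the values of P, Q, and the excess bandwidth are assumed for the example, not taken from the experiments) constructs a Q-sample causal SRRC pulse and verifies that the peak of its autocorrelation falls at sample O = Q − 1, with approximate zeros at symbol spacings.

```python
import numpy as np

def srrc_pulse(P, Q, beta):
    """Causal, unit-energy square-root raised-cosine (SRRC) pulse with Q samples,
    P samples per symbol, and excess bandwidth beta (hypothetical helper)."""
    t = (np.arange(Q) - (Q - 1) / 2) / P        # time in symbol periods, centered for the design
    p = np.zeros(Q)
    for i, ti in enumerate(t):
        if np.isclose(ti, 0.0):
            p[i] = 1 - beta + 4 * beta / np.pi
        elif np.isclose(abs(ti), 1 / (4 * beta)):
            p[i] = (beta / np.sqrt(2)) * ((1 + 2 / np.pi) * np.sin(np.pi / (4 * beta))
                                          + (1 - 2 / np.pi) * np.cos(np.pi / (4 * beta)))
        else:
            p[i] = (np.sin(np.pi * ti * (1 - beta))
                    + 4 * beta * ti * np.cos(np.pi * ti * (1 + beta))) \
                   / (np.pi * ti * (1 - (4 * beta * ti) ** 2))
    return p / np.linalg.norm(p)                # normalize to unit energy

P, Q, beta = 4, 33, 0.2                         # assumed example values
p = srrc_pulse(P, Q, beta)
rp = np.convolve(p, p[::-1])                    # pulse autocorrelation via the matched filter
O = int(np.argmax(rp))                          # peak index: Q - 1 = 32 under the causal convention
print(O)
print(np.round(rp[O - 2 * P: O + 2 * P + 1: P], 3))  # approximately [0, 0, 1, 0, 0]: zero ISI at symbol spacings
```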
Due to the zero-ISI property, samples of the autocorrelation $r_p[n]$ at shifts of nonzero multiples of P from the peak are 0:
$$r_p[O + mP] = 0, \qquad m \neq 0.$$
Let the downsampled signal be indexed so that the sample at $n = O$ corresponds to the full matched filter response of the first symbol. That is, for the causal pulse-shaping function and its matched filter, the first retained matched filter sample occurs at index $n = O = Q - 1$, and subsequent symbol responses appear every P samples thereafter. Thus, the sample at $n = O + kP$ corresponds to the symbol $a_k$, etc. In what follows, the noise term is omitted for brevity. (The phase change due to the difference in O was absorbed here into the phase $\theta$.)
The matched filter output is downsampled to the symbol rate, taking samples at indices $n = O + kP$, $k = 0, 1, 2, \ldots$. The downsampled matched filter output is
$$y[k] = e^{j((\omega_0 - \hat\omega_0)kT_s + \theta)} \sum_m a_m\, r_p\big((k - m)T_s + \hat\tau - \tau\big). \tag{2}$$
This can be decomposed into the term corresponding to the symbol of interest,
$$a_k\, r_p(\hat\tau - \tau)\, e^{j((\omega_0 - \hat\omega_0)kT_s + \theta)},$$
and the other terms,
$$e^{j((\omega_0 - \hat\omega_0)kT_s + \theta)} \sum_{m \neq k} a_m\, r_p\big((k - m)T_s + \hat\tau - \tau\big).$$
If $\hat\tau = \tau$, then the second sum, representing intersymbol interference (ISI), disappears and the downsampled matched filter output is $a_k\, e^{j((\omega_0 - \hat\omega_0)kT_s + \theta)}$, a single rotated (and rotating) symbol output. On the other hand, when $\hat\tau \neq \tau$, the sample is corrupted by the terms in the ISI sum. The goal of the timing delay estimation problem is to determine $\hat\tau$ to eliminate the ISI. The goal of the phase/carrier offset problem is to eliminate the complex rotation factor $e^{j((\omega_0 - \hat\omega_0)kT_s + \theta)}$.
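The effect of a timing error on the downsampled matched filter output can be seen in a small simulation such as the following sketch (noiseless, with assumed parameter values, reusing srrc_pulse from the sketch above; it is an illustration, not the paper's code).

```python
import numpy as np
rng = np.random.default_rng(0)

P, Q, beta, Nsym = 4, 33, 0.2, 2000
p = srrc_pulse(P, Q, beta)                       # SRRC pulse from the previous sketch
a = (rng.choice([-1, 1], Nsym) + 1j * rng.choice([-1, 1], Nsym)) / np.sqrt(2)   # QPSK symbols

up = np.zeros(Nsym * P, dtype=complex)           # one symbol every P samples
up[::P] = a
tx = np.convolve(up, p)                          # pulse-shaped transmit signal

delay, theta = 2, 0.5                            # unknown timing offset (samples) and carrier phase (rad)
rx = np.roll(tx, delay) * np.exp(1j * theta)     # noiseless received signal for illustration

mf = np.convolve(rx, p[::-1])                    # matched filter
O = Q - 1                                        # peak offset for the causal convention
good = mf[O + delay: O + delay + Nsym * P: P]          # correct timing: ~ a_k * exp(j*theta)
bad = mf[O + delay + 2: O + delay + 2 + Nsym * P: P]   # half-symbol timing error: ISI mixture

print(np.var(good - a * np.exp(1j * theta)))     # near 0 (small residual from pulse truncation)
print(np.var(bad - a * np.exp(1j * theta)))      # much larger: the ISI sum acts like added noise
```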
3. Review of Kurtosis-Based Estimation
With this communication notation in place, we are now in a position to describe kurtosis-based parameter estimation. The kurtosis of a real zero-mean random variable y is defined as [20,21]
$$\mathrm{kurt}_r(y) = E[y^4] - 3\big(E[y^2]\big)^2. \tag{3}$$
The kurtosis of a complex random variable is [21]
$$\mathrm{kurt}_c(y) = E[|y|^4] - 2\big(E[|y|^2]\big)^2 - \big|E[y^2]\big|^2.$$
The kurtosis (either real or complex) has the following key properties: (1) if y and z are independent random variables, then for constants a and b, $\mathrm{kurt}(ay + bz) = a^4\,\mathrm{kurt}(y) + b^4\,\mathrm{kurt}(z)$ (with $|a|^4$ and $|b|^4$ in the complex case), and (2) the kurtosis of a Gaussian random variable is 0. For non-Gaussian random variables, the kurtosis may be greater than zero or less than zero. Random variables representing points drawn from a symbol constellation have negative kurtosis (that is, they are sub-Gaussian).
For a stationary (or nearly stationary) sequence of random variables $\{y[k]\}$, the kurtosis may be estimated by using sample averages instead of expectations. Thus,
$$\widehat{\mathrm{kurt}}_c(y) = \frac{1}{N}\sum_{k=1}^{N} |y[k]|^4 - 2\left(\frac{1}{N}\sum_{k=1}^{N} |y[k]|^2\right)^2 - \left|\frac{1}{N}\sum_{k=1}^{N} y[k]^2\right|^2, \tag{4}$$
and, analogously, an estimate $\widehat{\mathrm{kurt}}_r$ is computed for the real kurtosis. Note that $\widehat{\mathrm{kurt}}$ accepts an entire sequence of data as its argument, over which the averaging occurs to estimate the kurtosis.
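A direct implementation of these sample estimators might look as follows (a sketch; the zero-mean kurtosis formulas above are standard, but the function and variable names here are our own).

```python
import numpy as np

def real_kurtosis(x):
    """Sample kurtosis of a real, (assumed) zero-mean sequence:
    E[x^4] - 3 (E[x^2])^2, with expectations replaced by sample averages."""
    m2 = np.mean(x ** 2)
    m4 = np.mean(x ** 4)
    return m4 - 3 * m2 ** 2

def complex_kurtosis(y):
    """Sample kurtosis of a complex, (assumed) zero-mean sequence:
    E[|y|^4] - 2 (E[|y|^2])^2 - |E[y^2]|^2."""
    m_abs2 = np.mean(np.abs(y) ** 2)
    m_abs4 = np.mean(np.abs(y) ** 4)
    m_sq = np.mean(y ** 2)
    return m_abs4 - 2 * m_abs2 ** 2 - np.abs(m_sq) ** 2

# Sub-Gaussian constellation points give negative kurtosis; Gaussian noise gives ~0.
rng = np.random.default_rng(1)
qpsk = (rng.choice([-1, 1], 5000) + 1j * rng.choice([-1, 1], 5000)) / np.sqrt(2)
noise = (rng.standard_normal(5000) + 1j * rng.standard_normal(5000)) / np.sqrt(2)
print(complex_kurtosis(qpsk))    # close to -1 for unit-power QPSK
print(complex_kurtosis(noise))   # close to 0
```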
The complex kurtosis of a sequence of matched filter outputs may be used to determine $\hat\tau$ as follows. Considering the matched filter output in (2), when $\hat\tau \neq \tau$, by the central limit theorem, the sum of terms due to ISI causes $y[k]$ to tend toward having a Gaussian distribution, which has small absolute kurtosis. On the other hand, when $\hat\tau = \tau$, $y[k]$ consists only of the phase-rotated point from the signal constellation. Since the phase rotation does not affect the complex kurtosis, this $y[k]$ has negative kurtosis. This concept is used in [1] to synchronize a communication burst of many symbols (e.g., on the order of 100 symbols) using the method portrayed in Figure 1a. The received signal passes through a bank of $N_\tau$ matched filters, each having a different delay. (The burst-mode synchronization operation implies that the matched filtering is computed over the entire received sequence.) The delays $\tau_1, \ldots, \tau_{N_\tau}$ uniformly sample the range $[0, T_s)$. The kurtosis of each of the downsampled matched filter outputs is computed, and the signal having the lowest (most negative) kurtosis is selected. That is,
$$\hat\tau = \arg\min_{\tau_i}\; \widehat{\mathrm{kurt}}_c\big(y_{\tau_i}\big),$$
where $y_{\tau_i}$ denotes the downsampled output of the matched filter with delay $\tau_i$.
Since complex magnitudes are employed in the complex kurtosis, a lack of knowledge about phase and carrier offset does not affect the estimation of the delay. By contrast, in conventional time and phase estimation (e.g., [3]), timing and phase are jointly estimated in a joint computational structure, so the convergence of one estimate affects the convergence of the other.
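A minimal sketch of the burst-mode delay search of Figure 1a follows (reusing srrc_pulse and complex_kurtosis from the earlier sketches; for simplicity the candidate delays here are the P integer-sample offsets within one symbol, whereas the bank in [1] uses fractionally delayed matched filters).

```python
import numpy as np
rng = np.random.default_rng(2)

P, Q, beta, Nsym = 4, 33, 0.2, 500
p = srrc_pulse(P, Q, beta)
a = (rng.choice([-1, 1], Nsym) + 1j * rng.choice([-1, 1], Nsym)) / np.sqrt(2)
up = np.zeros(Nsym * P, dtype=complex)
up[::P] = a
rx = np.roll(np.convolve(up, p), 2) * np.exp(1j * 0.7)   # true delay = 2 samples, phase = 0.7 rad

O = Q - 1
kurts = []
for d in range(P):                        # candidate (integer-sample) delays within one symbol
    y_d = np.convolve(rx, p[::-1])[O + d: O + d + Nsym * P: P]
    kurts.append(complex_kurtosis(y_d))
d_hat = int(np.argmin(kurts))             # most negative kurtosis -> delay estimate
print(d_hat, np.round(kurts, 2))          # expect d_hat == 2
```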
The paper [1] recommends determining the carrier frequency offset $\omega_0$ using Fourier transform techniques. This is still recommended when using the methods described in this paper if the carrier frequency offset exceeds the ability of the algorithm to track. However, even after removing the carrier frequency offset using Fourier transform techniques, some residual carrier may remain to be tracked, which we still refer to as $\omega_0$. The methods presented below reliably perform such offset frequency tracking.
For the moment, however, assume that the frequency offset is removed, so that, when the symbol timing is correctly estimated, the matched filter output is $y[k] = a_k e^{j\theta}$. In terms of real and imaginary parts, $a_k = a_{k,r} + j a_{k,i}$ and $y[k] = y_r[k] + j y_i[k]$. By stacking these components, the rotation can be written, and using
$$R(\theta) = \begin{bmatrix} \cos\theta & -\sin\theta \\ \sin\theta & \cos\theta \end{bmatrix},$$
the rotated symbols can be written as
$$\begin{bmatrix} y_r[k] \\ y_i[k] \end{bmatrix} = R(\theta)\begin{bmatrix} a_{k,r} \\ a_{k,i} \end{bmatrix}. \tag{5}$$
According to (5), the rotation produces a linear combination of the real and imaginary components of the symbol. Computing the kurtosis (separately) on the real and imaginary components $y_r[k]$ and $y_i[k]$, the kurtosis is smaller (nearer to zero) when $\theta \neq 0$ due to the mixture of $a_{k,r}$ and $a_{k,i}$. Remarkably, even though $y_r[k]$ and $y_i[k]$ are mixtures of only two random variables, the presence of the mixture can be detected using the kurtosis.

A phase estimate $\hat\theta$ is selected that minimizes the sum of the kurtoses of the real and imaginary parts:
$$\hat\theta = \arg\min_{\theta'}\Big[\widehat{\mathrm{kurt}}_r\big(\mathrm{Re}(e^{-j\theta'} y)\big) + \widehat{\mathrm{kurt}}_r\big(\mathrm{Im}(e^{-j\theta'} y)\big)\Big].$$
The phase offset can be removed by computing the product $e^{-j\hat\theta} y[k]$. This can be expressed in matrix/vector form by stacking real and imaginary parts, leading to
$$\begin{bmatrix} \mathrm{Re}\big(e^{-j\hat\theta} y[k]\big) \\ \mathrm{Im}\big(e^{-j\hat\theta} y[k]\big) \end{bmatrix} = R(-\hat\theta)\, R(\theta)\begin{bmatrix} a_{k,r} \\ a_{k,i} \end{bmatrix}.$$
When $\hat\theta = \theta$, the components of $a_k$ are not mixed, so that the sum of the real kurtoses of the real and imaginary parts is maximally negative.
This kurtosis-based estimation is applied as shown in Figure 1b. There are $N_\theta$ different rotations, which span $[0, 2\pi)$. The time-synchronized matched filter output is rotated by each rotation, and the sum of the kurtosis of the real part and the kurtosis of the imaginary part is computed. The phase estimate is determined by the rotation whose kurtosis sum is smallest (most negative).
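The corresponding burst-mode phase search of Figure 1b can be sketched as follows (assumed values, reusing real_kurtosis from the earlier sketch). Note that, because the method is blind, the estimate is only determined up to the rotational symmetry of the constellation (a multiple of π/2 for QPSK).

```python
import numpy as np
rng = np.random.default_rng(3)

Nsym, Ntheta = 500, 64
a = (rng.choice([-1, 1], Nsym) + 1j * rng.choice([-1, 1], Nsym)) / np.sqrt(2)   # QPSK
y = a * np.exp(1j * 0.6)            # time-synchronized matched filter output, true phase 0.6 rad

thetas = 2 * np.pi * np.arange(Ntheta) / Ntheta
cost = [real_kurtosis(np.real(y * np.exp(-1j * th))) +
        real_kurtosis(np.imag(y * np.exp(-1j * th))) for th in thetas]
theta_hat = thetas[int(np.argmin(cost))]
print(theta_hat)                    # close to 0.6 plus a multiple of pi/2 (QPSK ambiguity)
```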
4. Symbol Timing Tracking: Discrete Downhill Minimization
The kurtosis-based methods portrayed in Figure 1, representing the method in [1], require evaluation using $N_\tau$ different matched filters for the symbol timing sync, followed by the same number of kurtosis computations, or $N_\theta$ phase rotations, followed by the same number of kurtosis computations, where the kurtosis is computed using data over an entire burst. In [1], the expense of these computations is ameliorated by the fact that they are performed only once per burst. However, there is no provision in [1] to re-estimate or track the parameters. For this reason, the method is referred to as a “burst mode” algorithm, suitable for one-time estimation on a burst, without adaptation. The present paper computes only two kurtosis values for each symbol and computes the kurtosis over smaller segments of the signal, which reduces the computational complexity of each kurtosis. The authors found it surprising that, even using rather short segments to estimate the kurtosis (resulting in noisy estimates of the kurtosis), these kurtosis estimates were able to be used to estimate the timing and phase parameters.

This section describes how to use two kurtosis values computed for each symbol in a method that “slides downhill” toward the most negative kurtosis to estimate the timing offset. Only two kurtosis values are computed per symbol (as opposed to evaluating the kurtosis at $N_\tau$ different values). This adaptive downhill slide is able to track changes in the parameters.
Let the (ostensibly) basebanded signal be denoted as $u[n]$. It is assumed that it is available over the necessary range of indices (such as by buffering).
Let the sampled, delayed matched filters be denoted as
$$p_i[n], \qquad i = 0, 1, \ldots, N_\tau - 1, \tag{6}$$
where $p_i[n]$ is the SRRC pulse delayed by $\tau_i$. The index i must be in the range $[0, N_\tau)$. To ensure that this is the case, we use the notation $\langle i \rangle$, where $\langle i \rangle$ denotes i mod $N_\tau$. In (6), the range of n is $0, 1, \ldots, Q-1$ for causal pulses, in which case, O is $Q-1$, as discussed above. For 0-centered pulses, the range of n is $-(Q-1)/2, \ldots, (Q-1)/2$, and $O = 0$.
In this paper, instead of evaluating the kurtosis at $N_\tau$ different timing offsets, at each symbol time the kurtosis is evaluated at two timing estimates. Let $i_1$ and $i_2$ be the indices of the delays of the matched filters being used at the present time, where the indices refer to the delay estimates $\tau_{i_1}$ and $\tau_{i_2}$. In order to produce the downsampled matched filter outputs used to estimate the kurtosis, a block of input samples spanning the number of symbols used to estimate the kurtosis is convolved with the matched filters $p_{\langle i_1 \rangle}[n]$ and $p_{\langle i_2 \rangle}[n]$, resulting in the matched filter outputs
$$y_1[n] = u[n] * p_{\langle i_1 \rangle}[n], \qquad y_2[n] = u[n] * p_{\langle i_2 \rangle}[n],$$
where * denotes convolution. The matched filter samples are downsampled so that the first retained sample in each sequence occurs at the offset O, producing the symbol-rate sequences $y_1[k]$ and $y_2[k]$. Let $\kappa_1$ and $\kappa_2$ denote the complex kurtoses estimated from the sequences $y_1[k]$ and $y_2[k]$, computed as in (4).
The minimization algorithm seeks to minimize the kurtosis over a series of symbol times. The minimization operates analogously to a one-dimensional Nelder–Mead algorithm [2], rolling downhill toward minimum kurtosis by moving in the direction of smaller kurtosis. This is referred to as discrete downhill minimization. It has been found by experimentation on communications data that the kurtosis is, in fact, a convex function of the delay.
At each symbol time, of the two kurtoses $\kappa_1$ and $\kappa_2$, computed using $i_1$ and $i_2$, the lower (more negative) kurtosis is retained, the corresponding downsampled matched filter output is saved, and the index of the other delay is adjusted by $\Delta$ steps to attempt to move toward lower kurtosis. The operation is detailed in Figure 2, and the logic is explicitly described in lines 48–60 of Algorithm 1 below. After execution of the steps, $i_1$ indexes the delay with lower kurtosis and $i_2$ is ready to be tested at the next time step. For example, in Case 1, since $\kappa_2 < \kappa_1$, $i_1$ is set to the value $i_2$ and $i_2$ is adjusted by some number of steps $\Delta$. $\Delta$ is a small integer, say 2 or 3, indicating how fast to adapt. The other cases in Figure 2 correspond to other configurations of the kurtosis as a function of the delay indices $i_1$ and $i_2$.
If $i_1$ already indexes the delay of minimum kurtosis, $i_2$ bounces back and forth between $i_1 - \Delta$ and $i_1 + \Delta$, leaving $i_1$ at the delay producing the lowest kurtosis. The descent algorithm moves $\Delta$ indices at each step, so that the average number of steps to converge scales with the initial index offset divided by $\Delta$.
The sequence of matched filter outputs selected with the lowest kurtosis is referred to as $y[k]$ and is passed on to the phase estimation stage.
The number of steps of adjustment, $\Delta$, is adjusted to provide a variable-stepsize algorithm. The variance of the recent changes in the index values is computed. When the variance falls below a threshold (suggesting that the estimate is converging to a steady value), $\Delta$ is decremented, provided that it remains at least 1. This reduces the jitter of the estimate in steady state while still encouraging rapid initial convergence. Tuning of the algorithm can be performed by adjusting the decision thresholds. (While not totally satisfying, this is not so different from a PLL-based estimator, for which tuning may be required to obtain a desired performance.)
When an index falls outside the range $[0, N_\tau)$, it should be reduced modulo $N_\tau$. However, in order to preserve the directional ordering between $i_1$ and $i_2$, this modulo reduction occurs only when both indices fall outside this range, at which point both indices are reduced. These operations are described in lines 69–77 of Algorithm 1.
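A stripped-down sketch of the per-symbol downhill move for the timing index is given below. It is our own simplification of the Case 1–4 logic of Figure 2 and lines 48–60 of Algorithm 1: the variable names are hypothetical, and the variable-stepsize and wraparound bookkeeping are omitted.

```python
def downhill_step(i1, i2, kurt_of, delta, Ntau):
    """One discrete downhill move over delay indices.

    i1, i2  : current pair of delay indices (i1 is the incumbent)
    kurt_of : callable returning the estimated complex kurtosis at a delay index
    delta   : number of index steps to move per iteration
    Ntau    : number of delay steps (indices are evaluated modulo Ntau)
    Returns the updated (i1, i2) with i1 indexing the lower (more negative) kurtosis.
    """
    k1, k2 = kurt_of(i1 % Ntau), kurt_of(i2 % Ntau)
    if k1 <= k2:
        # i1 is already better: probe on the far side of i1, away from where i2 was
        direction = 1 if i1 >= i2 else -1
        i2 = i1 + direction * delta
    else:
        # i2 is better: it becomes the incumbent, and the probe continues past it
        direction = 1 if i2 >= i1 else -1
        i1, i2 = i2, i2 + direction * delta
    return i1, i2
```

In the full algorithm, the kurtosis callback would convolve a short block of input samples with the fractionally delayed matched filter selected by the index, downsample to the symbol rate, and apply the complex kurtosis estimate of (4).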
5. Carrier Frequency and Phase Tracking: Discrete Downhill Minimization
In this section, a minimization technique is developed for carrier frequency and phase tracking, similar to that used in the previous section for timing synchronization. This allows the algorithm to track these parameters.
The phase is estimated to have one of $N_\theta$ different values $\theta_j$, which uniformly sample the range $[0, 2\pi)$. Let $\Delta\theta$ denote the increment in phase between adjacent steps, e.g., $\Delta\theta = 2\pi/N_\theta$.
Phase is estimated by rotating $y[k]$ using two values of phase indexed by $j_1$ and $j_2$ and an estimate of $\omega_0$, then by estimating the real kurtosis on the real and imaginary parts of the rotated signal, and by using this information to move downhill. The rotated signals are produced by
$$z_1[k] = y[k] \odot e^{-j(\hat\omega_0 k T_s + \theta_{j_1})}, \qquad z_2[k] = y[k] \odot e^{-j(\hat\omega_0 k T_s + \theta_{j_2})},$$
where ⊙ denotes element-by-element multiplication. The real kurtosis sum is computed as
$$\kappa_1^{(\theta)} = \widehat{\mathrm{kurt}}_r\big(\mathrm{Re}(z_1)\big) + \widehat{\mathrm{kurt}}_r\big(\mathrm{Im}(z_1)\big),$$
and similarly for $\kappa_2^{(\theta)}$.
The frequency is estimated as follows. Let $j_{\mathrm{prev}}$ denote the index of the best phase at the previous time. The change in phase from the last to the current time is $\Delta\phi = (j_1 - j_{\mathrm{prev}})\,\Delta\theta$. With index changes of 0 or ±stepsize, $\Delta\phi$ may be thought of as a discrete-time point process taking values 0 and ±stepsize·$\Delta\theta$. The average value of this point process is the frequency estimate $\hat\omega_0$. The frequency estimate is taken as the output of a single-pole lowpass filter with unit DC gain and with input $\Delta\phi$. The pole of the filter is denoted by $\beta$. The filtering is computed according to $\hat\omega_0 \leftarrow \beta\,\hat\omega_0 + (1-\beta)\,\Delta\phi$.
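The frequency estimate is thus a smoothed version of the per-symbol phase increments. A minimal sketch of the smoothing update follows (our notation; the pole value is an assumed default); it corresponds to the "Estimate the frequency by smoothing" step of Algorithm 1.

```python
def update_frequency_estimate(omega_hat, j_best, j_prev, dtheta, beta=0.95):
    """Smooth the per-symbol change in the winning phase index into a frequency estimate.

    omega_hat : current frequency-offset estimate (radians per symbol)
    j_best    : index of the best phase at the current symbol
    j_prev    : index of the best phase at the previous symbol
    dtheta    : phase increment between adjacent phase indices (radians)
    beta      : pole of the unit-DC-gain single-pole lowpass filter (assumed value)
    """
    dphi = (j_best - j_prev) * dtheta      # phase change this symbol (a 0 / +-stepsize point process)
    return beta * omega_hat + (1 - beta) * dphi
```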
The following pseudocode (Algorithm 1) summarizes the timing and phase estimation algorithms.
Algorithm 1: Kurtosis-based timing and phase tracking
1 function [mfout, cumwraptime] = tracker(u, n)
2 Internal (persistent or class) data:
3   number of samples per symbol (integer)
4   number of symbols used to estimate kurtosis
5   number of samples in each SRRC pulse
6   number of samples used in convolution
7 Variables for time estimation
8   number of delay steps to consider
9   array of delays in the range
10  = indices into array (0-based indexing is used). Init: ,
11  Stored values of the delayed SRRC pulse , for and
12  cumwraptime = cumulative wrap count. Init = 0
13 Variables for variable stepsize algorithm
14  = number of steps. Init = 2 or 3
15  = length of history (a circular buffer)
16  hist = buffer history of (length = )
17  = index into . Init = 0
18  = number of elements in . Init = 0
19  = variance of elements in hist
20  = variance threshold to reduce stepsize
21 Variables for phase/frequency estimation
22  number of phase steps to consider
23  array of phases in
24  = phase step size
25  stepsize = number of index steps to move in phase adaptation. Init = 1
26  = indices into array. Init: ,
27  = last value of or . Init = 0
28  cumwraphphase = cumulative phase wrap count. Init = 0
29 Variables for frequency estimation
30  = frequency offset estimate. Init = 0
31  lastomega = previous value of . Init = 0
32  = pole location of single-pole LPF.
33 Inputs:
34  u = received basebanded signal (function uses L samples in )
35  n = starting sample of at this iteration (n increased by P before each call)
36 Output:
37  mfout = synched/rotated matched filter output
38  cumwraptime = total number of timing wraparounds
39 (Timing estimation)
40  = (MF outputs (* = convolution))
41  =
42
43  (downsampled ( symbols out))
44
45  complexkurt() (Compute kurtosis using (4))
46  complexkurt()
47 (Move downhill toward best delay:)
48 if and (Case 1)
49    ; ; +=
50    (assign the entire sequence)
51 elseif and (Case 2)
52
53
54 elseif and (Case 3)
55    ; −= ;
56
57 elseif and (Case 4)
58
59
60 end
61 (Adjust the step size)
62 tauhist
63
64  = min( + 1, )
65 if ( == )
66    =
67    if ( and ) −= 1
68 end
69 (If both delays or both , wrap around:)
70 wrap = 0
71 if and
72    = ; =
73    wrap = 1
74 elseif and
75    += ; +=
76    wrap = −1
77 end
78 (At this point, is an array containing
79 time-aligned MF outputs)
80 (Carrier offset/frequency estimation)
81 despin1 = (array with elements)
82 despin2 =
83 (De-rotate the matched filter outputs)
84
85  (∑ kurtosis on real & imag)
86
87 (Move downhill toward best phase:)
88 if and
89    ; ; += stepsize
90    (save matched filter output)
91 elseif and
92
93
94 elseif and
95    ;
96    = stepsize
97
98 elseif and
99    −=
100
101 end
102  ; (phase difference in counts)
103  (phase difference in rads)
104 wrap = 0 (Do phase wrap around)
105 if and
106    −= ; −= ;
107    ;
108 elseif and
109    += ; += ;
110
111 end
112
113  +=
114 (Estimate the frequency by smoothing)
115
116
117 (Adjust the phase adjustment step size)
118
119 end function
6. Huber Loss Function
The fourth moments computed in the kurtosis open the algorithm to the vulnerability that outlier events unduly affect the estimate. This effect can be mitigated by employing a Huber loss function [5], which is commonly used for robust estimation. This function is defined as
$$h_\delta(a) = \begin{cases} \tfrac{1}{2}a^2, & |a| \le \delta, \\[2pt] \delta\big(|a| - \tfrac{1}{2}\delta\big), & |a| > \delta. \end{cases}$$
It behaves quadratically for small values of a (when $|a| \le \delta$) and linearly for larger values of a, with the transition defined such that the transition from quadratic behavior to linear behavior at $|a| = \delta$ is continuous and continuously differentiable. The real kurtosis of (3) is approximated using the Huber function by replacing the fourth-moment term with its Huber-limited counterpart. In the complex kurtosis, for the terms involving the magnitude $|y|$, the Huber function can be applied directly. For the term involving $y^2$, the Huber function is applied to the magnitude, leaving the phase quadratically varying. The complex kurtosis is approximated using the Huber function accordingly, and the approximation is estimated from a sequence of observations by replacing expectations with sample averages, as in (4).
Huber functions can be used in Algorithm 1 by simply replacing the computations of the complex kurtoses at lines 45 and 46 with the complex Huber approximation, and the computations of the real kurtoses at lines 85 and 86 with the real Huber approximation.
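A sketch of the Huber function, together with one plausible way of substituting it into the real sample kurtosis, is shown below. The placement of the Huber function inside the moments and the threshold value are our own illustrative choices, following the verbal description above rather than the paper's exact (missing) formulas.

```python
import numpy as np

def huber(a, delta):
    """Huber function: quadratic for |a| <= delta, linear beyond, with a smooth transition."""
    a = np.abs(a)
    return np.where(a <= delta, 0.5 * a ** 2, delta * (a - 0.5 * delta))

def robust_real_kurtosis(x, delta=3.0):
    """Illustrative robustified real kurtosis (not the paper's exact form): the quartic
    term (x^2)^2 is replaced by 2*huber(x^2, delta), which equals x^4 for small samples
    but grows only linearly for outliers, limiting their influence."""
    m2 = np.mean(x ** 2)
    m4_robust = np.mean(2 * huber(x ** 2, delta))
    return m4_robust - 3 * m2 ** 2
```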
7. Gradient Descent for Phase Estimation
As an alternative to optimizing over a fixed number of alternatives, gradient descent may also be employed. We illustrate the method here for phase estimation; this can be modified for timing estimation.
Let the real and imaginary components of the rotated signal be denoted as
$$x_r[k] = \mathrm{Re}\big(e^{-j\theta} y[k]\big), \qquad x_i[k] = \mathrm{Im}\big(e^{-j\theta} y[k]\big).$$
The objective function is the sum of the real kurtoses of the real and imaginary parts. Expanding this objective and identifying the moment terms using the variables A through D, the objective and its gradient with respect to $\theta$ can be written in terms of sample moments of $x_r[k]$ and $x_i[k]$; the gradient is then used in a standard gradient descent update of $\theta$.
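Because the closed-form expressions (the moment terms A through D and the resulting gradient) are not reproduced above, the following sketch instead differentiates the kurtosis-sum objective directly by the chain rule, using d/dθ Re(y e^{−jθ}) = Im(y e^{−jθ}) and d/dθ Im(y e^{−jθ}) = −Re(y e^{−jθ}). It illustrates the gradient descent update but should not be read as the paper's exact expressions; the step size mu is an assumed value.

```python
import numpy as np

def phase_gradient_step(theta, y, mu=0.05):
    """One gradient-descent update of the phase estimate.

    Objective: J(theta) = kurt_r(Re(y e^{-j theta})) + kurt_r(Im(y e^{-j theta})),
    with kurt_r(x) = mean(x^4) - 3*mean(x^2)^2.  Differentiating by the chain rule gives
    dJ/dtheta = 4*[mean(xr^3 xi) - mean(xr xi^3)] - 12*mean(xr xi)*[mean(xr^2) - mean(xi^2)].
    """
    z = y * np.exp(-1j * theta)
    xr, xi = z.real, z.imag
    grad = (4 * (np.mean(xr ** 3 * xi) - np.mean(xr * xi ** 3))
            - 12 * np.mean(xr * xi) * (np.mean(xr ** 2) - np.mean(xi ** 2)))
    return theta - mu * grad          # move downhill on the kurtosis-sum objective
```

Iterating this update over successive blocks of matched filter outputs tracks the phase; the residual frequency offset can then be tracked by smoothing the per-block phase increments, as in Section 5.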
8. Experimental Results
Kurtosis-based estimation was compared with the PLL-based timing and phase synchronization algorithms of [3] (Sections 7.4 and 8.4.4). In these algorithms, the timing interpolator is governed by the fractional symbol value, and the phase estimate is represented by the DDS (direct digital synthesizer) value. Two examples are presented here, one without and one with a residual carrier frequency offset.
Example 1. QPSK, SNR = 10 dB, excess bandwidth = 0.2. Algorithm parameters: number of symbols per kurtosis estimate, number of delay steps, number of phase steps, and initial stepsize; ω-filtering parameter β.
Figure 3a shows the results of the delay estimation. The inset in the figure shows the first 30 samples, illustrating that convergence has occurred in about 10 symbol periods. Figure 3b shows the phase estimation. The top plot shows the phase estimate (in radians); the inset shows that the phase estimate has converged in less than 10 symbols. The middle plot shows the phase accumulated by the estimated frequency offset, and the bottom plot shows the estimate of $\omega_0$.
By comparison, Figure 4a shows the PLL-based estimates, with a specified loop-filter time-bandwidth product. The top plot shows the fractional symbol estimate, demonstrating convergence somewhere around 200 symbols. The bottom plot shows that the phase converges in about 100 symbols.
As another method of comparing performance, the matched filter outputs were clustered (according to the nearest signal constellation point) and the average variance of the clusters was computed. This variance was compared with the variance that would occur if the only source of noise were the AWGN. These variance results are shown in Table 1. In the column labeled “Noise Var”, the variance due to the AWGN at that SNR is shown; for example, in the first row, the noise variance is 0.05. The column labeled “No Offsets” shows the result of estimating the variance of the clustered matched filter outputs when there are no phase or timing offsets; at the same SNR, this is 0.05. This value should be close to the “Noise Var” value. The columns labeled with the Huber threshold and with “Grad” indicate the settings for the parameters: a value in the Huber-threshold column indicates that the Huber function was used, and a value in the “Grad” column indicates that gradient descent was used with that step size. The column labeled “Kurtosis” shows the variance of the clustered matched filter outputs for the various kurtosis-based estimation algorithms after convergence. For example, the first row shows a variance of 0.061; comparing this with the variance in the “No Offsets” column indicates that using the estimates does, in fact, increase the variance of the matched filter outputs compared to not having to estimate the parameters at all. The column labeled “PLL-based” shows the variance of the clustered matched filter outputs for the PLL-based estimators; in the first row, this variance is 0.06. Finally, the column “dB(Kurt/PLL)” shows the comparison (in dB) between the kurtosis-based and PLL-based estimators, where negative numbers indicate the superiority of kurtosis-based over PLL-based estimation. For example, in the first row, kurtosis-based estimation is 0.06 dB worse (in variance) than PLL-based estimation.
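The cluster-variance figure of merit described above can be computed as in the following sketch (our own implementation of the described procedure; the constellation and variable names are assumptions).

```python
import numpy as np

def cluster_variance(mf_out, constellation):
    """Average, over clusters, of the variance of matched filter outputs assigned
    (by nearest constellation point) to that cluster."""
    mf_out = np.asarray(mf_out)
    constellation = np.asarray(constellation)
    # Assign each output to the nearest constellation point...
    idx = np.argmin(np.abs(mf_out[:, None] - constellation[None, :]), axis=1)
    # ...then average the within-cluster variances over the nonempty clusters.
    variances = [np.var(mf_out[idx == k]) for k in range(len(constellation)) if np.any(idx == k)]
    return float(np.mean(variances))

# Example: unit-energy QPSK constellation (an assumed choice).
qpsk_points = np.array([1 + 1j, 1 - 1j, -1 + 1j, -1 - 1j]) / np.sqrt(2)
```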
As Table 1 shows, the variance performance of kurtosis-based and PLL-based estimation is generally quite close.
Example 2. This example shows behavior typical of the kurtosis-based phase estimate. In this case, SNR = 24 dB. Figure 5 demonstrates an example of the phase estimate. After convergence, the estimator tends to jitter around the bottom of the kurtosis “bowl”, because the kurtosis is only estimated. The phase variance is thus largely determined by the step size.

Example 3. QPSK, SNR = 10 dB, excess bandwidth = 0.2. Algorithm parameters: number of symbols per kurtosis estimate, number of delay steps, and number of phase steps; ω-filtering parameter β.
Figure 6a shows the results of the delay estimation. The inset in the figure shows the first 30 samples, illustrating that convergence occurred in less than 20 symbols. Figure 6b shows the phase estimation. The top plot shows the phase estimate (in radians); the inset shows that the phase estimate converged in less than 10 symbols. The middle plot shows the phase accumulated by the estimated frequency offset, and the bottom plot shows the estimate of $\omega_0$.
Figure 7 shows results for the PLL-based methods in this setting. While the phase tracking appears smoother in the PLL-based estimate, as the results for Example 3 in Table 1 show, there is less variance around the constellation points at the matched filter outputs for the kurtosis-based methods than for the PLL-based methods when step descent is used. Interestingly, the gradient descent performs significantly worse than PLL-based estimation at all SNRs.
Figure 8 shows results for the gradient descent estimation of the phase. The convergence time is about the same as for the step descent, but there is higher variance in the estimate of $\omega_0$.
9. Comparison with Modified Cramer–Rao Lower Bound
Evaluating the performance in terms of how the recovered signal points are clustered, as in Table 1, is a natural way to evaluate these estimation algorithms, since it demonstrates how all of the estimators work together to achieve what is desired in the receiver: good signal detection. Another way of evaluating the performance of estimators is to compare the estimator variance against lower bounds such as the Cramer–Rao lower bound (CRLB) or the modified CRLB. The modified CRLB provides a bound when the observed data depend on multiple parameters, but only one parameter at a time is estimated [22]. The modified CRLB is, in general, lower than the CRLB.
The modified CRLB for the estimate of the carrier phase is given in [22] (Equation (31)). In [22], the bound is expressed in terms of $T_0$, the length of the window over which the estimator operates. The modified CRLB for the estimate of the symbol timing is given in [22] (Equation (32)); it additionally involves a bandwidth factor computed from $|P(f)|^2$, where $P(f)$ is the Fourier transform of the pulse-shaping function $p(t)$.
The kurtosis is computed over a fixed number of symbols (matched filter outputs). The first matched filter output occurs at sample O, with symbol outputs occurring every $T_s$ seconds thereafter. Thus, the duration over which a kurtosis is computed is on the order of the number of symbols per kurtosis estimate times $T_s$. We take this as the value of $T_0$ over which the estimate is computed. Since the estimator steps over several symbol times to converge, this does not apply initially but should be applicable after convergence.
Figure 9a shows the variance of the estimate of $\tau$ using the kurtosis-based method (red) and the conventional (loop-based) method (yellow) compared with the modified CRLB (blue), as a function of SNR. The variance of the kurtosis-based method was obtained by computing the variance of $\hat\tau$ after convergence, such as seen in Figure 3a, averaged over 10 independent runs. The variance of the loop-based estimate was computed as the variance of the fractional timing offset in the synchronization algorithm [3], averaged over 10 runs. The kurtosis-based method performs significantly better than the loop-based method and demonstrates variance that decreases with SNR. However, it does not decrease as fast as the modified CRLB. Additionally, it appears that the kurtosis-based method meets the modified CRLB, which is unexpected. It may be that the value for $T_0$ used to compute the bound in (7) is larger than it should be.
Figure 9b shows the variance of the estimate of $\theta$ using the kurtosis-based method (red) and the PLL-based method (yellow) compared with the modified CRLB (blue). The variance of the kurtosis-based method was obtained by computing the variance of $\hat\theta$ after convergence, such as seen in Figure 3b, averaged over 10 independent runs. The variance of the kurtosis method hardly changes with SNR (although it can be seen to vary slightly). The continued variation is due to the fixed step size and the fact that the kurtosis estimate has some variance associated with it, causing jitter in the indices. Depending on the value of $\Delta\theta$, this can achieve variances less than those of the PLL-based method. This suggests that a variable step-size algorithm would be useful: use a larger step size to converge quickly, and then reduce the step size to reduce the variance.
(These plots were generated using QPSK and using the Huber function to estimate the kurtosis.)
10. Discussion and Conclusions
We demonstrated that kurtosis-based methods can be applied to symbol timing and phase estimation. These methods offer potential advantages, such as the ability to synchronize without knowledge of the signal constellation in a scale-invariant way and, generally, convergence as fast as or faster than that of PLL-based methods. The experiments showed that the variance of the matched filter outputs for kurtosis-based methods is about the same as or slightly better than that for PLL-based methods. The gradient descent phase estimation, however, does not generally perform as well as the discrete minimization methods. We made the following observations:
Using the Huber function instead of the full kurtosis reduces the variance of the estimators.
For lower SNRs, the gradient-based method performs significantly worse than the discrete minimization.
The gradient-based method has a higher complexity than the discrete minimization.
Kurtosis-based methods converge more quickly than PLL-based methods. The number of steps is determined primarily by the numbers of delay and phase steps and the adaptation stepsizes. (There is a secondary effect due to the bandwidth of the lowpass filter for $\hat\omega_0$.)
The fast convergence time suggests that these kurtosis-based methods may be useful in situations where short packets are used, such as in the Internet of Things.