1. Background and Motivation
Joint communications and sensing (JCAS) has attracted extensive attention lately due to its potential of substantially improving the cost-, energy- and spectral-efficiency for a myriad of modern wireless systems that require both communications and radar, e.g., many smart IoT applications [
1,
2]. As a popular waveform in both communications and radar, the orthogonal frequency-division multiplexing (OFDM)-based JCAS has regained great interest after its arguable debut in 2007 [
3]. The seminal work, however, did not illustrate OFDM radar sensing, and was only focused on the impact of radar antenna set-ups (unidirectional or omni-directional) on communication performances, e.g., bit error rate and system throughput. Before the work in [
3], OFDM radar had been studied since 2000, yet without considering communications in general [
4]. Early OFDM radar works between 2000 and 2009 mainly treat the OFDM waveform the same as the conventional radar waveforms, e.g., chirp, and intend to design OFDM-based waveforms, e.g., the phases of OFDM sub-carriers, to improve radar ambiguity functions [
5,
6,
7,
8,
9,
10]. Although many of these works [
4,
5,
6,
7,
8,
9,
10] mention the applications of OFDM in data communications, they barely take into account any communication aspects, either in waveform design or in signal processing.
The true OFDM-based JCAS is enabled by the method first published in 2009 [
11] and is more comprehensively elaborated on in [
12]. At a radar receiver, the method [
11] treats each OFDM symbol as in communication systems by first removing the cyclic prefix (CP) and then taking the discrete Fourier transform (DFT). In the frequency-domain, the method [
11] removes the communication data symbols, as added on sub-carriers at the transmitter, through a point-wise division (PWD), attaining the scaled sum of the outer products between the range and Doppler steering vectors. A two-dimensional Fourier transform is then taken over the sub-carrier and time-domains, resulting in the so-called range–Doppler map (RDM), matrix or profile. Target detection and estimation can be performed using the RDM, which will be further illustrated in
Section 3.
The sensing method [
11,
12] has been extensively applied in the past decade and has become a de facto standard for OFDM radar, particularly in automotive sensing [
13,
14,
15,
16,
17,
18,
19,
20,
21,
22,
23]. For illustration convenience, we call the method [
11] the classical OFDM sensing (COS) hereafter. Recent OFDM sensing works are mainly based on COS, but also introduce new techniques to improve the RDM quality. The work in [
16] introduces the stepped carrier technique to increase the overall baseband bandwidth of the OFDM radar and, hence, the resolution of RDMs. The work in [
17] randomizes the stepped carrier and exploits the compressive sensing technique to reconstruct a high-resolution RDM. Whereas previous works generally ignore the inter-carrier interference (ICI) issue, the work in [
18] considers the impact of ICI on OFDM sensing and develops novel signaling, which repeats the same OFDM symbol over (slow-)time, to facilitate the estimation and suppression of ICI.
The OFDM sensing methods [
11,
12,
16,
17,
18] reviewed above are for the single-antenna transceiver. One of the greatest advantages of using OFDM as a radar waveform is that multiple antennas can be utilized to realize orthogonal MIMO radar-like sensing. (In theory, given
M transmitter antennas and
N receiver antennas, an orthogonal multiple-input and multiple-output (MIMO) radar can achieve an extended virtual array of
antennas.) This advantage is first noticed in [
24], where an equidistant sub-carrier interleaving scheme is developed to make the signals transmitted by different antennas orthogonal. More specifically, the scheme makes antenna
m only use sub-carriers
for
and
, where
M is the antenna number. However, as noted in [
19,
20,
21,
22], the equidistant interleaving can reduce the unambiguously measurable distance of a MIMO-OFDM radar. To address the issue, a non-equidistant sub-carrier interleaving scheme is proposed in [
19], where the genetic algorithm (GA) is used to maximize the ranging performance in terms of the sub-carrier interleaving patterns of transmitter antennas. In [
20], the random time–frequency multiplexing is proposed to enhance the inter-antenna signal orthogonality of a MIMO-OFDM radar. For the same purpose, coded MIMO-OFDM radars are developed in [
21,
22], where special coding over time-, frequency-, space- and joint-domains are developed.
Targeted at sole radar applications, the methods reviewed above pay little attention to (MIMO-)OFDM data communications. Recently, the communication community has been highly active in promoting JCAS. In fact, given its potential in improving the cost-/energy-/spectral-efficiency and in substantially benefiting emerging use cases of mobile networks, e.g., smart home/city/ transportation [
25], JCAS has been envisioned as a hallmark technology of the future sixth generation mobile communications (6G) [
26]. The communication-centric JCAS designs in the communication community generally fall into two broad categories: general designs and communication standards-based ones.
The general JCAS waveforms have been designed in spatial, time and frequency-domains without referring to some specific communication standards. In the spatial-domain, dual-functional precoders/beamformers are generally designed to, e.g., approach desired sensing waveforms subject to signal-to-interference-plus-noise ratio (SINR) requirements for multi-user downlink MIMO communications [
27]. In the time- and frequency-domains, existing works mainly resort to designing the frame structure [
28], sub-carrier occupation [
29], power allocation [
30] and pilot/preamble signals [
31]. These JCAS works [
27,
28,
29,
30,
31] evaluate the sensing performance by statistical or asymptotic metrics, e.g., the signal-to-interference-plus-noise ratio (SINR) and the Cramer–Rao low bound (CRLB). They either do not discuss specific sensing methods or refer to some common ones, e.g., COS [
11] reviewed above.
Standards-based JCAS designs generally integrate sensing into an existing communication system and prioritize communications. In this line of research, the IEEE 802.11ad-based millimeter-wave (mmWave) communication system is a popular choice. To counteract the severe attenuation of mmWave signals, IEEE 802.11ad mainly uses the DFT-spread OFDM (DFT-s-OFDM) waveform (DFT-s-OFDM is also known as the single-carrier OFDM (SC-OFDM). It performs DFT precoding before modulating data symbols onto sub-carriers and generally achieves a lower peak-to-average-power ratio (PAPR) than OFDM [
32]) for data transmission. In [
33,
34], different sensing methods are developed using the Golay complementary sequences (GCSs) in the preamble of IEEE 802.11ad communication signals. In [
35], the Doppler resilience of IEEE 802.11ad-based sensing is improved by incorporating Prouhet–Thue–Morse sequences in the preamble. In [
31], an adaptive mmWave JCAS based on IEEE 802.11ad is developed, where a few non-uniformly placed preambles are transmitted to construct several receive virtual preambles for enhancing the velocity estimation accuracy at the cost of a small reduction in the communication data rate. While these methods exploit the superb auto-correlation feature of GCSs for a high ranging performance, it can be non-trivial to adapt them for other communication standards. This is more the case for existing wifi-based JCAS designs that mainly exploit the channel state information estimated by wifi devices [
36].
The standards-based JCAS designs reviewed above exploit only a small portion of available signals in a standardized communication system. To further improve the sensing robustness against interference and noises, data signals of IEEE 802.11ad, with a much wider availability than preamble signals, are exploited for sensing in [
37,
38]. In [
37], the generalized likelihood ratio test (GLRT) is employed to formulate a maximum likelihood (ML) problem for target detection and estimation. An adaptive algorithm is developed to solve the ML problem by iteratively estimating the current strongest target, reconstructing the target echo signal and removing it for estimating the next strongest target. While the method [
37] results in a ML-like sensing performance, it has a much higher computational complexity than COS [
11]. However, COS, if directly applied to DFT-s-OFDM, can cause a severe noise enhancement, as the communication signals modulated on sub-carriers approximately conform to a centered Gaussian distribution. To address the noise enhancement issue, the work in [
38] modifies COS by replacing PWD with a point-wise product (PWP). Since the PWP of two frequency-domain signals plus a Fourier transform result in the cyclic cross-correlation (CCC) of the corresponding time-domain signals, we call the method [
38] C-COS hereafter.
COS and C-COS have complexity only dominated by Fourier transforms. Thus, they particularly suit communication platforms needing (or benefiting from) radar sensing with limited computing ability, such as low-profile IoT devices. Although COS and C-COS have a sub-optimal sensing performance compared with the optimal ML estimation, they can provide a satisfactory sensing performance for numerous scenarios, such as detecting car presence in a car park or people presence indoors. Moreover, we can also perform COS and C-COS for initial sensing and can then exploit ML to refine the initial results. Such a combination can have much lower computational complexity than using ML directly. Further, as they do not make changes to communications, COS and C-COS allow for sensing to be added onto existing communication systems with minimal changes. Therefore, we envision that COS and C-COS will promisingly contribute to speeding up the market penetration of JCAS in the near future. This would be more the case if the following issues of COS and C-COS can be effectively addressed.
Passively reusing communication signals without making any changes makes COS and C-COS suffer from the sensing constraints imposed by communication signal formats. In particular, the maximum sensing distance is limited by the CP length of the underlying communication systems, and the maximum measurable velocity is inversely proportional to OFDM symbol duration. Thus, can we relieve the sensing limits without changing communication signal formats?
COS and C-COS provide two different ways of generating RDMs. While their computational complexity is the same, a question follows naturally: which one gives the better sensing performance? It was shown through simulations in [
38] that the C-COS can have a better sensing performance than COS in certain low SNR regions. This, however, is not always the case, as disclosed in our recent work [
39]. A comprehensive analytical comparison between COS and C-COS is still missing;
Can COS and C-COS be applied to future variants of OFDM? Recently, the orthogonal time–frequency space (OTFS) waveform has become increasingly popular due to its unique ability of handling fast time-varying channels. Like DFT-s-OFDM, OTFS is also a DFT-precoded OFDM. Unlike DFT-s-OFDM, which is only precoded once along the sub-carrier dimension, OTFS is additionally precoded over (slow-)time. However, directly applying COS or C-COS to OTFS can be hard, as the OTFS with a reduced cyclic prefix (RCP), i.e., a single CP for the whole block of OTFS symbols, is the main trend in the OTFS literature;
Indeed, COS and C-COS already have quite a low computational complexity. However, should we rest on our laurels? In time-critical JCAS applications, we may require sensing to be carried out as fast as possible. This can be extremely challenging, particularly when the spatial volume to be sensed is large. All of these factors create highly stringent requirements for the sensing efficiency. Therefore, it is always beneficial to further reduce the sensing computational complexity, even only slightly.
We remark that the issues highlighted above have been rarely treated so far in the literature, including effective solutions. To raise awareness of these issues in the JCAS community, we will provide a short tutorial on them in
Section 3 after we establish the signal model in
Section 2. These two sections act as a fundamental basis to understand the recent progress and new solutions to be introduced sequentially. In particular, we will illustrate in
Section 4 some recent research efforts, which are based on our own works [
39,
40], in addressing the first three issues mentioned above. Moreover, in
Section 5, we will unveil that there exists non-trivial redundancy in OFDM-like sensing signals. To the best of our knowledge, such redundancy has not been explicitly treated in the literature yet. Noticing that, we develop a novel low-complexity sensing method based on COS by introducing efficient signal decimation. We also provide analysis and extensive simulations, demonstrating that the decimation-based COS can reduce the sensing complexity in a non-trivial manner, yet incurs only a minimal impact on the sensing performance.
2. Signal Model of OFDM-, DFT-s-OFDM- and OTFS-Based Sensing
Consider a general JCAS scenario where OFDM communication symbols are also used for sensing through a full-duplex synchronized receiver (Rx) co-located with the transmitter (Tx). We assume that proper full-duplex techniques are used to avoid/remove self-interference from Tx to Rx; see, e.g., [
2] for a review of such techniques. In addition, single-antenna Tx and Rx are employed to introduce the core idea that is independent of spatial information in theory. Note that we start with OFDM for illustration clarity and will extend the signal model to DFT-s-OFDM and OTFS later.
For the
m-th
OFDM symbol, there are
N data symbols to be transmitted, as denoted by
. In OFDM, these
N data symbols are multiplied onto
N orthogonal sub-carriers, which essentially are single-tone signals at center frequencies of
. Here,
T is the duration of the sub-carriers in the time-domain. This further indicates that the bandwidth of the considered OFDM system is
. Let
denote the sampling time that satisfies
in OFDM. Accordingly, the
m-th OFDM symbol can be expressed as a discrete function of time index
k, i.e.,
From (1), we see that multiplying data symbols with N orthogonal sub-carriers is equivalent to taking the N-dimensional inverse DFT (IDFT) of the data symbols. In turn, taking the DFT of with respect to (w.r.t.) k can recover .
According to the circular shift property [
41], the DFT of any circularly shifted
is still
, yet with extra phase shifts. Based on (1), we can write
where
denotes modulo-
N. Since the sample delay
l resembles the echo delay in the sensing Rx, it is implied by (2) that
the sequence of can always be recovered from the target echo as long as a complete (circularly shifted) OFDM symbol is available. To ensure this, a CP is generally added to
by copying the last
Q samples and pasting them to the beginning of
; refer to
Figure 1. Denoting the number of samples in the CP by
Q, the
m-th OFDM symbol becomes
which is obtained by plugging
into (1). The timing relation between
and
is described in
Figure 1.
Next, we build the signal model for target echoes. For illustration convenience and clarity, we model a single sensing target whose range, velocity and reflection coefficient are
r,
v and
, respectively. We also assume that
r,
v and
keep constant over
M OFDM symbols, as complied with the Swerling-I target fluctuation model [
42] [Ch.7]. The round trip (from Tx to the target and then back to Rx) causes a delay of
samples in the target echo, as compared with the transmitted OFDM symbol, where
is the microwave propagation speed. Note that
may not be an integer. The target velocity incurs a Doppler frequency that can be calculated as
, where
denotes the carrier frequency of the JCAS system. Taking into account
and
, the target echo can be modeled as
where
for
and
for the remaining values of
; and
denotes the time duration of a CP-OFDM symbol. Here,
rounds
x to the nearest integer. The echo timing with reference to the emitted signal is exemplified in
Figure 1. Though noises are inevitable in any practical Rx, they are suppressed in (4) for brevity. Moreover, the “stop-and-hop” model [
42] has been used to account for the Doppler effect by omitting the intra-symbol Doppler-related change. (Despite its wide applicability in conventional radar processing, the “stop-and-hop” model can be subjected to the condition that
is less than a fraction of
radians, e.g.,
[
43]. Here,
is a coherent processing time interval. Interested readers are referred to [
43] [Chap. 2] for how the phase term is derived.)
Extension to DFT-s-OFDM: As mentioned in
Section 1, DFT-s-OFDM is a DFT-precoded OFDM. The precoding happens along the sub-carrier-domain. Thus, instead of directly modulating communication data symbols onto sub-carriers, a DFT is taken first, and then the results are mapped to OFDM sub-carriers in interleaving or consecutive manners. Let
be the communication data symbols to be transmitted, where
is generally a fraction of
N. Assume
, with
L being an integer (related to the number of users in frequency division multiple access). Taking the
-point DFT of
, we obtain
. Then, we can map
onto
N sub-carriers. In the interleaving mapping, we have
In the consecutive mapping, we have
If multiple user-ends are served, they can be assigned with different l’s. Replacing in (1) with , the signal given in (4) also models the echo signal in DFT-s-OFDM sensing.
Extension to OTFS: Compared with DFT-s-OFDM, OTFS adds another DFT precoding over the slow time dimension. Let
denote the signal modulated onto sub-carriers. Different from OFDM,
is not directly from a communication constellation and, instead, is now a two-dimensional symplectic Fourier transform of data symbols, as denoted by
. If no window function is used,
is just the DFT of
over
and the IFDT of the DFT results over
. Note that
has a physical meaning of the Doppler index and
the range index. They are dual variables of
m (slow-time) and
n (sub-carrier), respectively. Replacing
in (1) with
, we obtain the time-domain symbols. In CP-OTFS [
44], each time-domain symbol is added with a CP, as shown in
Figure 1. However, the OTFS with reduced CP (RCP-OTFS) is more popular in existing OTFS studies, as illustrated in
Figure 2. Although the CP leads to a cyclically shifted version of the whole block of symbols at the sensing receiver, the ICI can be severe, particularly when the block duration is large. The severe ICI invalidates COS and C-COS for OTFS, as they implicitly require negligible ICI to generate RDMs. This will be made clear shortly in
Section 3.
Remark 1. For OFDM, the frequency-domain signals, i.e., those modulated onto sub-carriers, are independently drawn from a communication constellation, such as PSK and QAM. Thus, they conform to uniform distributions with a limited number of values. For DFT-s-OFDM and OTFS, however, their frequency-domain signals approximately conform to centered Gaussian distributions. This is because they are DFT(s) of the communication data symbols independently drawn from some constellations, while such DFT results converge in distribution to complex Gaussian random processes [45]. 3. COS and C-COS
In this section, we first review COS [
11] and C-COS [
38] based on the signal model established above. Then, we further illustrate the issues highlighted at the end of
Section 1.
The diagram of the two methods is illustrated in
Figure 3. They share the same signal preprocessing. Namely, they first remove the CP of each received symbol, i.e.,
given in (4), and then transform the CP-removed symbol into the frequency-domain via a DFT.
From
Figure 1, we see that the non-trivial part of
contains a circularly shifted OFDM symbol if
is satisfied, where
is the target delay and
Q is the CP length. Under the condition, removing the first
Q samples of
yields
for
. By taking
in (2), the DFT of
w.r.t.
k is
. Since
is a coefficient independent of
k, the DFT of
w.r.t.
k can be directly given by
.
The next step of removing the communication data symbol differentiates COS and C-COS.In COS, PWD is used, leading to
where we assume that the sensing receiver has a copy of
, as shown in
Figure 3. Taking the 2D-DFT of
gives the following RDM,
where
and
denote window functions of lengths
N and
M, respectively. If rectangular window functions are used, the
n- and
m-related summations will approach two sinc functions. They have mainlobes centered around
and
, and sidelobes elsewhere. As in the digital filter design, a proper window function, such chebshev, can be employed to suppress the sidelobes over
k and
b at the cost of an increased mainlobe width [
41]. Given
and
,
r and
v can be estimated as
where
is a modified version of
to account for negative velocities.
In C-COS, the PWD performed in (7) is replaced with PWP, which can be expressed as
where
denotes a conjugate. Then, a 2D-DFT of
yields the following RDM,
where
if
; otherwise
. Note that, if
, the
n-related summation in
can be rewritten into
where
denotes the IDFT of
(which is the frequency-domain signal),
is obtained by replacing
with its DFT expression, i.e., the
-summation in the middle result, and
is because the
n-summation can be seen as the conjugate of the IDFT of
. Note that the last result is the CCC of
and
. Thus, the
n-related summation in (11) resembles the matched filtering in the conventional radar signal processing.
As illustrated in Remark 1, approximately conforms to a centered Gaussian distribution. Since IDFT is a unitary transformation, , as the IDFT of , is also a centered Gaussian signal. Thus, the CCC result given in (12) will present a mainlobe around , which is the same as in COS. The difference is that we do not have an analytical model to depict the CCC result. Moreover, the sidelobe levels in the CCC result are unpredictable; c.f., the deterministic sidelobes of over k’s. On the other hand, comparing (8) and (11), the two RDMs share the same Doppler measurement ability, which is solely dependent on the m-related summation. Based on the above elaboration, we conclude that the estimates given in (9) also apply to C-COS.
Figure 4a illustrates the RDM
by plotting its amplitude against range and Doppler grids. Three targets are set, with parameters summarized in
Table 1. From
Figure 4a, we see three mainlobes corresponding to three targets. Based on the illustration below (8), the indexes of the range grids of the three targets can be calculated as
, 4040 and 4016, where multiplying 16 is due to the increasing of the DFT size (as illustrated in the caption of
Figure 4). Similarly, we can calculate the indexes of the Doppler grids of the targets, as given by
and 64. We see from
Figure 4a that the peak locations match the above calculations.
Target detection: If there exists a single target, detecting the target can be readily achieved through identifying the peak of
or
. The single-target scenario may sound unrealistic to the conventional radar community. However, in mmWave JCAS, the single-target sensing can be practical and has been studied in [
31,
33,
37]. To counteract the severe path loss of mmWave signals, mmWave communication systems generally use large-scale antenna arrays to steer highly directional beams. Therefore, a mmWave signal in the beam direction is likely to be blocked by the first target.
In multi-target scenarios, there can be multiple mainlobes in the RDM. Directly identifying the peak of the RDM will detect the strongest target. Then, the parameters of the strongest target can be estimated based on (9). With its parameter estimates, the target can be reconstructed and removed from the RDM, enabling the detection of the next strongest target. Such a sequential detection can be time-consuming, as the detection of each target will involve searching over the whole range-Doppler space.
Classical radar detectors can be employed to detect multiple targets efficiently. The constant false-alarm rate (CFAR) detector is one of the most commonly used radar detectors [
42]. Briefly speaking, CFAR tests all range and Doppler grids, as indexed by
k and
b, respectively, to check the presence of a target. At a grid under test (GUT), CFAR calculates the background interference-plus-noise (IN) power by averaging the power of the grids around GUT. The adjacent grids around GUT are generally excluded from the power evaluation to reduce the impact of the strongest sidelobes of a target. The estimated IN is amplified by a coefficient and used as a threshold, where the coefficient is dependent on the expected false-alarm rate. If the power of the GUT is greater than the threshold, then CFAR reports the presence of a target at the GUT; otherwise, CFAR reports target absence. A simulation tutorial of CFAR is provided by MathWorks in [
46].
Target estimation: After targets are detected, their locations can be submitted to (9) for estimating their parameters. However, the range and velocity estimations obtained in (9) suffer from errors as large as
and
, respectively. In fact, we can further refine the estimations using, e.g., the classical multiple signal clarification (MUSIC) algorithm [
47]. Some newer estimators [
48,
49,
50], which are efficient with low complexity, are also good candidates for refining target estimations. These estimators interpolate DFT coefficients around the integer range–Doppler grid of a target, as obtained in the target detection, and then solve the accurate parameter estimations from a pre-established relation between the interpolations and target parameters. Interested readers are referred to [
48,
49,
50] for more details.
CP-limited maximum measurable range (MMR): To obtain the RDMs in (8) and (11), we have assumed that the maximum target delay is no greater than the CP duration,
Q. This, in turn, indicates that the MMR of COS and C-COS is limited by CP. In particular, it can be given by
, where
Q is the CP length. Unfortunately, such a limitation remains for COS and C-COS, even when we have a sufficient link budget for sensing a longer distance. In essence, the MMR limitation exists because each communication symbol, e.g., in
Figure 1 and
Figure 2, is treated as an independent sensing waveform, and the zero inter-symbol interference is pursued in COS and C-COS. If we treat a block of consecutive symbols, e.g., symbol
in
Figure 2, as a single waveform, and use the whole signal block as a matched filter coefficient to process the received echo signal, the MMR limitation discussed here may be lifted. This whole-block processing, however, can suffer from a non-negligible intra-block Doppler impact. A pointwise Doppler compensation can be performed before range processing [
14,
18]. Moreover, a two-dimensional maximum likelihood-based range and Doppler simultaneous estimation can also be performed [
34,
51]. These options generally have a non-trivially higher complexity than COS and C-COS.
Symbol-limited maximum measurable Doppler (MMD): From the echo signal model given in (4), we see that the sampling interval over the slow time is , which is dominated by the OFDM symbol duration. The sampling frequency is . Then, the maximum measurable Doppler frequency is given by . Moreover, the Doppler resolution can be given by , which is inversely proportional to the overall time of a whole block.
COS versus C-COS: As shown in
Figure 3, COS and C-COS are differentiated by the way they handle frequency-domain communication signals. Recall that COS and C-COS apply PWD and PWP, respectively, as given in (8) and (11). When the communication signals have a constant modulus in the frequency-domain, PWD and PWP yield the same result. However, for DFT-s-OFDM and OTFS, the communication signals conform to a centered Gaussian distribution, as illustrated in Remark 1. In such cases, PWD can severely enhance the background noise in the RDM, as the signal being or approaching zero is used as a divisor; see (8) (For illustration simplicity, we ignored the noise term in (8), though it is inevitable in practice). PWP is proposed to relieve noise enhancement [
38]. Comparing
Figure 4b,c, we see that PWP indeed leads to a smaller noise background in the RDM. However, PWP can lead to a non-negligible noise floor in moderate and high SNR cases. Thus, analytical comparisons between PWP and PWD are worth investigating to provide the guidance in an ad hoc selection between them.
4. Recent Progress
The three issues discussed at the end of the last section have rarely been noticed in the literature, not to mention any solutions. Recently, we have performed some preliminary studies on relieving the issues [
39,
40]. In this section, we highlight some interesting results and remaining challenges.
Whether the waveform is OFDM, DFT-s-OFDM or OTFS, we are actually facing the same problem:
to detect and estimate targets given a block of communication signals, e.g., the middle signals in Figure 1 and Figure 2, and the target echo signals, e.g., the lower signals in Figure 1 and Figure 2. We emphasize that, in the considered JCAS, we do not intend to make any changes to the underlying communication system. In the COS and C-COS reviewed earlier, they segment the communication signals at a sensing receiver by fully complying to communications format, i.e.,
samples a symbol and
M symbols in total. From the end of
Section 3, we have seen that such compliance is the root of the sensing restrictions.
In light of the above observation, we proposed a novel sensing framework recently in [
39]. Here, we unitedly use
to denote the communication-transmitted signal in the time-domain, where the communication system can be based on either OFDM, DFT-s-OFDM or OTFS. That is,
can be the middle signal in either
Figure 1 or
Figure 2. Moreover, we point out that
is a signal sequence with
. In the case of CP-OFDM and DFT-s-OFDM, we have
, where
M is the number of symbols and
is the number of samples per symbol (including CP); see
Figure 1. In the case of RCP-OTFS (as illustrated at the end of
Section 2), we have
, where a single CP of
Q samples is applied to a block of
M symbols; see
Figure 2. As shown on the left of
Figure 5,
hits targets and propagates to the sensing receiver, resulting in the echo signal denoted by
. Thus,
is the scaling of the time-delayed
, similar to the relation between
and
depicted in (4). As mentioned in
Section 2, the co-located transceiver is considered in this paper. Thus, it is reasonable to assume that the sensing receiver shares the same clock as the transmitter and has a copy of
stored for sensing processing.
The sensing framework [
39] starts with a segmentation, as performed on both
and
in the same way. In particular, we ignore the signal format of the underlying communication system and segment
and
in a sensing-favorable manner. As shown in
Figure 5,
and
are segmented into consecutive sub-blocks (SBs) evenly, with
samples per segment, where
is no longer necessary. Adjacent SBs are allowed to overlap for
samples. Let
and
denote the signal in the
m-th SB. Due to target delay,
only contains a part of
, with the remaining part right after
; see the illustration in
Figure 5. Thus, we can add the
samples right after
onto the beginning of
, making
contain cyclically shifted versions of
. This will require
to be no smaller than the maximum target delay. Clearly, the
samples have a similar role as the CP in OFDM. Thus, we call them the virtual CP (VCP). However, we emphasize that VCP is not related to the original CP in any way. A key difference between them is that CP is determined by the communication system, but VCP is designated at the sensing receiver for sensing purposes. However, as shown in
Figure 5, adding VCP can introduce inter-SB interference, which is the price paid for pursuing flexible sensing.
The DFT results in
Figure 5 can be input to PWD and PWP for generating RDMs. For clarity, we summarize the novel sensing framework in Algorithm 1. In Step 1, the
m-th SB starts at the
-th sample and has
samples. Given
I signal samples in
and based on the illustration in the right sub-figure of
Figure 5, the number of SBs is
, where
takes flooring. In Step 2, VCP is added for the echo signal so that the
m-th SB of the echo signal becomes underlain by the
m-th SB of the transmitted signal, as illustrated in the right sub-figure of
Figure 5. Steps 3–5 are the same as the last three steps of COS and C-COS illustrated in
Figure 3. However, by introducing the novel segmentation and VCP, an ad hoc adjustment can be made to the sensing framework, and, hence, there is better catering for different sensing scenarios. For example, we can increase
for sensing a longer distance, we can reduce
to increase the maximum measurable Doppler frequency and we can adjust
in accordance with
and
to improve sensing SINR. Next, we provide more elaborations on how to determine these key parameters.
Remark 2. The criteria of configuring the sensing framework are illustrated below. First, we can set based on , the required MMR. As it is related to the VCP length , the MMR can be given by , which, equating with , yields . Second, we determine given the requirements on the velocity measurement. The maximum measurable Doppler frequency is half the sampling frequency over SBs (equivalent to the slow time in radar processing), which is . Here, is the difference between the starting times of any two adjacent SBs (and hence the sampling time over the slow-time-domain); see Figure 5. Consequently, to cater for the maximum measurable Doppler, as denoted by , we need to keep . Third, given , we can then set . To increase the SINR in both RDMs, we expect to have as large as possible; see (13) and (14), which are to be illustrated shortly. However, the larger the , the more correlated the signals between adjacent sub-blocks can be; see Figure 5. The correlation can make the results in (13) and (14) less precise. The detailed impact, however, is difficult to analyze. As shown through the simulations in [39], the SINRs in (13) and (14) are consistently precise, even when takes as large as . Algorithm 1 A novel sensing framework [39]. |
Input: (a copy of communication-transmitted signal sequence), (echo signal at a sensing receiver), (SB length), (VCP length), (overlapping between adjacent SBs) |
Segment and evenly into consecutive sub-blocks (SBs), as given by and . The m-th sub-block consists of samples ; Add VCP, i.e, the samples right after , onto the beginning of , leading to ; Take the DFT (w.r.t. n) of and , yielding and ; Under PWD, we have , whereas using PWP, we obtain ; Taking the 2D-DFT of and , generate the RDMs and , respectively.
|
The analytical SINRs of the two RDMs are helpful in investigating the sensing framework. The SINR of
, as obtained in Step 5 of Algorithm 1, can be expressed as [
39] [(33)]
where
is the SNR in the time-domain echo signal, i.e.,
in Algorithm 1;
I is the total number of samples of
;
is the total power of targets;
is the natural number; and
is a sufficiently small number, e.g.,
. Note that
accounts for the noise enhancement when diving a centered Gaussian signal by another one; see [
40] for a detailed analysis of this issue. The SINR of
, as obtained in Step 5 of Algorithm 1, can be expressed as [
39] [(35)]
Remark 3. Based on (13) and (14), we can make the following comparisons between the PWD- and PWP-based RDMs:
- (3a)
In low SNR regions where , has an SINR that is times the SINR in , where in general;
- (3b)
In high SNR regions where , can have a greater SINR than , provided ;
- (3c)
Regardless of , the always has a greater SINR than , if .
The comparisons made above are helpful in selecting between PWD and PWP when generating RDMs.
Before ending the section, we use a set of simulation results to showcase the superiority of the sensing framework illustrated in Algorithm 1 over conventional OFDM sensing (COS) in terms of the maximum measurable range. The simulation parameters are given in
Table 2. Note that the CP length
Q is much smaller than the sample delay of the target, i.e.,
. This setting is particularly employed to validate the point made at the end of
Section 3 (in terms of the limited maximum measurable range of COS and C-COS). Based on the review on COS and C-COS given in
Section 3, we know that these conventional methods would not be able to sense the target set in
Table 2. In contrast, the sensing framework given in Algorithm 1 can flexibly set
and
according to Remark 2 so as to cater to different sensing needs. In particular, to sense
, we set
,
and
. The settings further lead to
. For convenience, random signals, conforming to a centered Gaussian distribution with the unit variance, are loaded onto the OFDM sub-carriers. This essentially simulates DFT-s-OFDM, as the frequency-domain signal presents such randomness according to Remark 1. Thus, C-COS is used to simulate the conventional OFDM sensing.
Figure 6 compares the RDMs generated by C-COS and the novel sensing framework (NSF). The results in the first row are obtained by C-COS. We see from
Figure 6a that the RDM of C-COS is noise-like over the whole range–Doppler bins. Then, a close look at the range–Doppler bins around the theoretical target location is provided in
Figure 6b. We still see no obvious target. The theoretical range bin of the target is
. Since it is greater than
N, a modulo is taken, leading to 64. Based on the elaboration right after (8), the theoretical Doppler bin of the target can be calculated as
. The range and Doppler cuts of the RDM of C-COS are given in
Figure 6c,d, respectively. Again, we do not see any obvious targets. That is, C-COS fails to detect the target set in
Table 2.
The results in the second row are obtained by the NSF. Substantially different from
Figure 6a,e presents a normal RDM with the target shown as a sharp peak. The theoretical range bin here is the same as that for C-COS, i.e., 320. The theoretical Doppler bin of NSF needs to be recalculated as
(the result, however, is the same).
Figure 6f zooms in on the RDM around the bin pair
(both theoretical bins are added by one due to the fact that the MATLAB index starts from one and not zero in our calculation). From
Figure 6f, we can see that the target peak is approximately 40 dB stronger than the background noise. This strongly contrasts with
Figure 6b, validating the significant improvement of the novel sensing framework over the conventional COS. Range and Doppler cuts of NSF are given in
Figure 6g,h, respectively. We clearly see strong peaks at target locations.
There is a remaining issue of NSF on false targets. We see from
Figure 6g that, other than the true target at the 321-th range bin, there are two other weaker targets, which are located at the 65-th and 577-th range bins. These numbers have implicit relations with 321. Specifically, we have
and
. These fake targets are generated due to the partial periodicity shown in the signal after adding VCPs; see Step 2 in Algorithm 1. The issue was also revealed in [
39]. However, to date, there is no solution yet. One potential solution of suppressing fake targets is to employ the special relation between the locations of fake targets and that of the true target, in combination with the amplitude and phase features of their peaks. Another potential way of suppressing fake targets is to design NSF parameters,
,
and
, so that the partial periodicity leading to the fake targets can be removed. Validating these solutions or others calls for more research efforts.
5. A Novel Design to Further Reduce Sensing Complexity
We proceed to introduce an efficient design that further reduces the sensing complexity. Let us revisit the PWD-processed echo signal in COS, i.e.,
, as originally given in (7) and rewritten below
where the last result is due to
. From the above expression, we see that the frequency of
is
. As underlined in
Section 3, COS requires
. This indicates that the bandwidth of
is no greater than
, where
. In OFDM communication systems,
is generally satisfied [
32]. Thus, we make the following assertion:
Provided that the maximum sample delay in the target echo is no greater than the CP length, and that the CP length is much less than the sub-carrier number, the PWD-processed echo signal has a much smaller bandwidth than an OFDM symbol. Although we base our illustration on COS, the analysis and method are also applicable to C-COS and the novel sensing framework given in Algorithm 1.
Based on the signal models in
Section 3, the above assertion can be interpreted as: provided
,
has a much smaller bandwidth than
given in (1). With this fact noted, we can further conclude that only
of the whole frequency band contains useful information for sensing, and that the rest is filled with noises. In other words,
can have considerably redundant information. To this end,
we propose decimating to remove the inherent redundancy and, hence, to reduce the number of signal samples along the n-dimension, prior to sensing. The decimation leads to a smaller RDM and, hence, reduces the complexity of RDM-dependent target detection and estimation.
Remark 4. The assertion made for also applies to , the PWP-processed signal given in (10), and , the VCP-added signal obtained in Step 2 of the algorithm summarized in Algorithm 1. Therefore, the decimation proposed above can also benefit C-COS and the novel sensing framework in Algorithm 1 in reducing the RDM dimension and the complexity of target detection/estimation.
5.1. Efficient Decimation
We proceed to illustrate the efficient implementation of the proposed decimation. As seen from (15),
is a bandpass signal with frequency band
. To decimate
by the factor of
D, we develop the following procedure, as illustrated in
Figure 7a.
- (1)
Anti-aliasing filtering: is performed on
to suppress out-of-band interference and noises. The passband of the filter is the same as that of
, whereas the stopband is given by
, where ⊘ denotes a set difference. The frequency spectrum of an ideal bandpass filter is shown in Node B of
Figure 7b. The signal spectrum before and after filtering is shown in Nodes A and C of
Figure 7b, respectively. As ideally illustrated in Node C, out-of-band noises are totally removed, which is impractical, but can be well approximated by designing the anti-aliasing filter with a large stopband attenuation;
- (2)
Downsampling: is denoted by “
” in
Figure 7a. It keeps every
D-th sample (starting from sample 0) and deserts others. After downsampling, the sampling frequency is reduced to
, and the spectrum center becomes
; see Node D of
Figure 7b;
- (3)
Frequency shifting: shifts the spectrum center of the downsampled signal to zero, which leads to the spectrum shown in Node E of
Figure 7b.
Above are the general steps of a bandpass decimation. By invoking the polyphase structure, the decimation can be implemented more efficiently.
At the core of the polyphase structure is the decomposition of the anti-aliasing filter. Consider an
-order finite impulse response anti-aliasing filter. Let
denote the
l-th
filter coefficient. The
z-transform of
can be expressed as [
52] [Ch.6]
where the second equality is obtained by decomposing
and the
p-related summation is denoted by
in the last result. Note that
is assumed in the above decomposition. The condition can be readily satisfied by specifying the filter order as
when designing the anti-aliasing filter. Based on (16), we can see that the filter can be implemented in
D parallel branches, as illustrated in
Figure 7c. The input signal
goes into different branches simultaneously, and the outputs of the branch-filters, denoted by
, are supposed to be summed and then downsampled. However, in
Figure 7c, we move the downsampler to before the summation and equivalently put a downsampler in each branch. Carrying this out allows us to invoke the notable identity, as illustrated in
Figure 7c, to exchange the orders of the filter and downsampler in each branch. The order exchanging makes the delay block,
, adjacent to a downsampler. To this end, the samples to be filtered by the
d-th
branch-filter become
, where “
” reflects the
d-delay block in branch
d,
is added to sample indexes to ensure that the indexes are no less than zero and
is a result of the downsampler. Based on (16), the coefficients of the
d-th branch-filter are
.
The filter decomposition and the order exchanging illustrated above lead to the polyphase structure of bandpass decimation, as shown in
Figure 7d. In the figure, we use a buffer to collect continuous
Q samples, i.e.,
for the
d-th branch, and each branch-filter is implemented in the frequency-domain due to the following relation
where “⊛” denotes linear convolution, “≡” means that the calculations on its two sides are equivalent,
and
denote size-
IFFT and FFT, respectively, and “⊙” calculates the point-wise product. Note that the above equivalence requires
. For radix-2 (I)FFT, we can take
, such that
. Since each branch-filter produces
transient outputs and takes
Q samples as an input, the indexes of valid filter outputs are
. Thus, we keep the valid outputs and dump others, as shown in
Figure 7d).
Referring back to
Figure 7a, we are now at the last step of decimation, i.e, shifting the filtered and downsampled signal to the baseband. To differentiate with
, we use
to denote the index of valid samples after downsampling, as also highlighted in
Figure 7d. Based on (15), the filtered signal, after removing transients, can be expressed as
As a discrete function of
, the spectrum center of the above signal is now at
, since the mean value of
is
. According to the frequency shift property of Fourier transform, we know that an angular frequency shift of
can be equivalently realized by multiplying the time-domain sequence with
, which leads to the frequency shift block shown in
Figure 7d. Accordingly, the final output of the polyphase structure-based decimation is
5.2. Decimation-Based COS (DCOS)
Similar to COS reviewed in
Section 3, sensing can also be performed based on
, leading to DCOS. Taking the two-dimensional DFT of
w.r.t.
m and
generates the below RDM (referred to as DCOS-RDM), which has a smaller size than the RDM given in (8) (similarly referred to as COS-RDM),
Identifying the peaks of
along
- and
b-dimensions can estimate the range and velocity, respectively. Assume that the
-related summation achieves the maximum at
. It is easy to see from (18) that the maximum is only achieved when
, where
a takes an integer or zero. Solving the equation subject to
yields
where
denotes the estimate of
. Comparing (8) and (18), we see that COS and DCOS have the same velocity measurement. To sum up, DCOS has the following range and velocity estimates, where
is given in (9),
Again, we highlight that the illustration in this subsection can be similarly applied to C-COS and the novel sensing framework in Algorithm 1. Details are suppressed here for brevity.
The RDMs of COS and DCOS are compared in
Figure 8, where the parameter settings are given in
Table 3. From
Figure 8a,b, we can see a high similarity between the RDMs of the two methods. This validates the efficacy of the newly introduced decimation. It is noteworthy that DCOS reduces the complexity of generating the RDM shown in the figure by almost an order of magnitude, compared with COS. This can be readily validated by substituting the parameter settings in the above complexity analysis.
Figure 8c compares the range cuts between COS and DCOS. We see that DCOS has a slightly wider mainlobe than COS, which is caused by different window lengths.
Figure 8d compares the velocity cuts of the two methods. As expected, our design does not affect the velocity measurement.
5.3. Comparison between COS and DCOS
Here, we compare COS and DCOS from numerous aspects, through which, the advantages and disadvantages of introducing the efficient decimation are analyzed.
Computational Complexity:DCOS reduces the sensing complexity in two ways: first, it has a lower complexity than COS in generating RDM; second, DCOS-RDM has a smaller dimension than COS-RDM, thus reducing the complexity of RDM-dependent target estimation. From
Figure 3, we can see that the computational complexity of COS-RDM is dominated by the 2D-DFT. The complexity is
, which is equal to
by basic logarithmic laws.
DCOS has two parts of computations: the 2D-DFT for generating DCOS-RDM and decimation. Like COS, the first part of computation has the complexity of
. According to
Figure 7d, the computational complexity of the polyphase decimation is dominated by the first column of FFTs and the third columns of IFFTs. Their complexity is given by
, since the first (third) column has
D numbers of
-size FFTs (IFFTs). By designing the anti-aliasing filter such that
, we can take
, and then
becomes
. Note that
is much smaller than
, since
M can take several hundreds, whereas
D is around ten. Thus, the computational complexity generating DCOS-RDM is dominated by
.
For target detection, COS and DCOS have the same complexity if the same detection algorithm, e.g., CFAR, is used. For target estimation, particularly, the range estimation, COS would have a higher complexity than DCOS. This is because range estimation mainly relies on the row dimension of the RDM, i.e., , given in (8) for COS, and , given in (18) for DCOS. Whereas has N rows, only has Q (a fraction of N). A wall-clock time comparison between the complexities of COS and DCOS will be provided shortly through simulations.
Processing Gain:COS and DCOS have approximately the same processing gain, which is defined as the difference between the SINR in the RDM, i.e., in (8) and (18), and the SINR in the pre-processed target echo, i.e., in (7). Let denote the SINR of given in (7). Although noises are not explicitly shown in the signal models, the SINR change is easy to track. COS-RDM is obtained from a two-dimensional DFT of , and, hence, the SINR in COS-RDM becomes . Namely, the processing gain of COS is given by .
DCOS decimates first. The decimated version given in (17) has the SINR of , since the decimation with factor D does not change the signal power whereas it reduces the noise power by D times. The two-dimensional DFT performed in (18) improves the SINR to , where and the approximation is valid given . We see that the processing gain of DCOS is approximately .
Remark 5. The impact of P on DCOS can be non-trivial. For instance, as P increases, a higher quality filter can be obtained (e..g, one with a lower passband ripple, stronger stopband attenuation and narrower transition bandwidth); however, a lower processing gain, as given by , is yielded. Analytically, it is difficult to tell which of the following dominates: the SINR improvement earned by a better filter or the SINR degrading caused by the reduced processing gain. To this end, we resort to simulation next.
Figure 9a,b illustrate that, as
P increases from 1 to 50, the SINR in DCOS-RDM first increases, then plateaus and then decreases. The same pattern is seen for both small and large values of
. From this observation, we conclude that the SINR in DCOS-RDM can be maximized by properly setting
P. For the OFDM system configured in
Figure 9, the maximum is achieved at
. Using this value, we compare in
Figure 9c the SINR in DCOS-RDM with that in COS-RDM as
increases. We see that the SINRs achieved by COS and DCOS are almost identical in the whole region of
. Note that the difference between the
y-axis and
x-axis is the processing gain. Thus, the results in
Figure 9c validate COS and DCOS having approximately the same processing gain.
Range and Velocity Measurement:COS and DCOS share the same maximum unambiguous range/velocity, and they also have the same range/velocity resolution. In terms of velocity, the above statement is because the decimation does not incur any change to Doppler-related information, as manifested in (8) and (18). Based on (8), the range of Doppler frequency that can be unambiguously estimated is , where resembles the sampling frequency along the Doppler dimension. Since the number of samples is M, the Doppler frequency resolution is . Given the relation , we obtain the range of unambiguous velocity, i.e., , and the velocity resolution, as given by .
It terms of ranging, we can see from (8) and (18) that the range estimation has turned into a problem of identifying
in both COS and DCOS. Since
is independent of the sampling rate (or range dimension) in different RDMs, its estimate remains the same for COS and DCOS in theory. As illustrated in
Section 3,
is required for OFDM sensing. Let
R denote the maximum unambiguous detectable range. Solving
, we obtain
for both COS and DCOS. We can see from (8) and (18) that the resolution of
detection is unit one for both methods, and, hence, the range resolution, denoted by
, can be solved from
, leading to
.
Windowing Effect:For ranging, COS can achieve a better windowing effect than DCOS in the sense that COS has a narrower range mainlobe than DCOS given the same attenuation of the peak sidelobe, whereas, for the velocity measurement, the two methods have the same windowing effect. The reason is because the decimation in DCOS reduces the number of samples, and, hence, the window length, along the range dimension (compared with those of COS), whereas the decimation does not affect the velocity dimension.
Figure 10 compares the specific detection and estimation performances of C-COS and DCOS. CFAR is employed to evaluate the detection performance, where a
rectangular window is used to filter the RDMs (in power). The numbers of guard intervals are two and four in the Doppler and range dimensions, respectively. For range and Doppler estimations, the method in [
48] is employed. From
Figure 10a–c, we can see that C-COS and DCOS have a very similar detection and estimation performance. This is not surprising, as DCOS, simply removing redundancy through signal decimation, does not lose any essential information related to targets. From
Figure 10a, we can see that DCOS has a slightly lower detection probability compared to C-COS. This can be caused by the decimation filter with inevitable pass-band ripples and transition frequency bands. From
Figure 10d, we can see clearly that DCOS has a non-trivially lower running time than C-COS, which validates the reduced complexity of DCOS.