Information-Theoretic Characterization and Undersampling Ratio Determination for Compressive Radar Imaging in a Simulated Environment

Zhang, Jingxiong; Yang, Ke; Liu, Fengzhu; Zhang, Ying

doi:10.3390/e17085171

Open AccessArticle

Information-Theoretic Characterization and Undersampling Ratio Determination for Compressive Radar Imaging in a Simulated Environment

by

Jingxiong Zhang

^*,

Ke Yang

^*,

Fengzhu Liu

and

Ying Zhang

School of Remote Sensing and Information Engineering, Wuhan University, 129 Luoyu Road, 430079 Wuhan, China

^*

Authors to whom correspondence should be addressed.

Entropy 2015, 17(8), 5171-5198; https://doi.org/10.3390/e17085171

Submission received: 18 May 2015 / Revised: 9 July 2015 / Accepted: 20 July 2015 / Published: 24 July 2015

(This article belongs to the Section Information Theory, Probability and Statistics)

Download

Browse Figures

Versions Notes

Abstract

:

Assuming sparsity or compressibility of the underlying signals, compressed sensing or compressive sampling (CS) exploits the informational efficiency of under-sampled measurements for increased efficiency yet acceptable accuracy in information gathering, transmission and processing, though it often incurs extra computational cost in signal reconstruction. Shannon information quantities and theorems, such as source rate-distortion, trans-information and rate distortion theorem concerning lossy data compression, provide a coherent framework, which is complementary to classic CS theory, for analyzing informational quantities and for determining the necessary number of measurements in CS. While there exists some information-theoretic research in the past on CS in general and compressive radar imaging in particular, systematic research is needed to handle issues related to scene description in cluttered environments and trans-information quantification in complex sparsity-clutter-sampling-noise settings. The novelty of this paper lies in furnishing a general strategy for information-theoretic analysis of scene compressibility, trans-information of radar echo data about the scene and the targets of interest, respectively, and limits to undersampling ratios necessary for scene reconstruction subject to distortion given sparsity-clutter-noise constraints. A computational experiment was performed to demonstrate informational analysis regarding the scene-sampling-reconstruction process and to generate phase transition diagrams showing relations between undersampling ratios and sparsity-clutter-noise-distortion constraints. The strategy proposed in this paper is valuable for information-theoretic analysis and undersampling theorem developments in compressive radar imaging and other computational imaging applications.

Keywords:

compressive sampling; rate distortion; trans-information; undersampling ratios; phase diagrams; Gaussian mixtures; clutter

1. Introduction

Unlike the traditional practice of sampling followed by compression, CS provides a framework for directly acquiring data in compressed form, thus promoting sub-Nyqusit sampling that is more efficient than what is required by the Shannon-Nyquist sampling theorem [1,2,3]. It has been applied to various fields including optical and radar remote sensing [4,5,6,7,8,9,10,11,12,13]. In addition to the assumption made of the sparsity of the scene being sensed and imaged, the working of a CS-radar system relies also on the informational transferability of the measurement matrices in capturing the amount of mutual information in the measurements about the underlying scene and the targets of interest, in particular, and algorithms that can reconstruct the sparse signal from undersampled but information-laden data [8,14,15].

Classic information theory as introduced by Shannon provides the mathematics for the design of transmitter and receiver in a communication system to efficiently and reliably transmit the information from the source to the destination given the characteristics of the source and the channel, and can be applied to address issues, such as data compression, in a wide variety of fields [16,17,18,19]. As this paper focuses on information-theoretic analyses in CS-radar, a brief description of some of the basic concepts in information theory is provided here. The entropy of a source is a lower bound on the average length of the shortest description about it, implying that a description about it can be constructed with average length within 1 bit of the entropy [17]. Relaxing the constraints of recovering the source perfectly, rate distortion function quantifies the amount of information rate required to describe it up to a specific distortion measure, providing the trade-offs between information rates and distortion tolerance. In other words, a rate distortion function quantifies the minimum rate description required to achieve a particular distortion [17]. While entropy and rate distortion are both used to quantify the compressibility of a source, the concept of mutual information is useful for quantification of the amount of information transmitted through measurements concerning the underlying source. By definition, mutual information is the difference between the entropy of a random variable (RV) and its conditional entropy given the knowledge of another conditioning RV. It reflects the reduction in the uncertainty of a RV say a radar scene due to knowledge provided by another RV say a set of radar echo data acquired of the scene. In this paper, mutual information is used interchangeably with trans-information, although there are some conventional differences regarding their phrasing.

There was research carried out on information-theoretic analysis of radar systems before the advent of CS and CS-radar [20,21,22]. Information theory is fundamental to the understanding and analysis of CS and CS-radar (e.g., compressive radar imaging) [6,8], as well as conventional radar [20,22,23,24]. In particular, information theory provides theoretic explanations of CS mechanisms because “information” rather than “data” is the essence of CS. For example, based on informational analysis, we can examine source sparsity or compressibility in terms of entropy (see Orlitsky et al. [18] for discussion about pattern entropy) and rate-distortion, measurement matrices and resultant linear measurements in terms of trans-information, signal reconstruction in terms of lossless or lossy data compression and other elements in a CS context [15,17,25].

Information-theoretic principles can be applied to demarcating performance limits of a CS-based system better than otherwise, as shown by Wu and Verdú [26]. For instance, necessary conditions on sampling rates in CS, which are termed undersampling ratios because of their sub-Nyquist nature [9,27], can be discussed in the light of information theory. For undersampling theorem developments, Fano inequality, rate-distortion and channel coding theorem are often applied [28,29]. Statistical analysis of the signal reconstruction process is also important, because it is often necessary to address the issue of signal description and error probability bounds in signal reconstruction [30]. Clearly, information theory and related statistical methods constitute a theoretical strategy for characterization and sampling rate determination in CS-radar. For example, phase transition diagrams [9,27], in which inter-dependences between scene reconstruction and scene-sampling-noise configurations are described in graphics, may be created based on informational analysis [15].

Existing literature and research are lacking in several aspects. Firstly, the assumption of signal sparsity or compressibility in CS often leads to the use of simplified and non-realistic signal models (such as Bernoulli and spike sequences [31]) for computing and analysis of source entropy and rate-distortion trade-offs. This implies that CS is often oriented for sparse support recovery only. When continuous signals do get accommodated in the signal models (such as Bernoulli-Gaussian models [31]), zero or sufficiently small amplitude levels are often implicitly assumed in subsets of signals not belonging to sparse supports [30] again for ease of analysis. However, for radar target detection and estimation, we should consider background reflectivities (i.e., clutters) [32,33,34] in scene modeling and quantify properly the combined effects of interference of clutter and measurement noise upon effective trans-information. In other words, sparsity-clutter constraints should be clarified for objective evaluation of rate-distortion, while effective trans-information should be quantified properly concerning the scene and targets of interest, respectively. This will allow for determination of necessary number of measurements in CS-radar for scene reconstruction and estimation of targets of interest in a cluttered environment, depending on scene characteristics.

Secondly, the majority of published work on CS sampling and informational analysis is based on assuming randomness of measurement matrix ensembles with particular distributions (e.g., independent and identically distributed (i.i.d.) and Gaussian) [29,30,35,36,37,38]. However, those involved in radar are often deterministic because they are prescribed by the specific filters involved, as will be described in the next section. This implies limited transferability of the published results about CS sampling rates to CS-radar. Aeron et al. [39] describes how necessary CS sampling conditions may be derived in situations when deterministic measurement matrices are employed. However, their results are not directly applicable for radar imaging due to the generally non-standardized form of radar measurement matrices, as discussed by Zhang et al. [15]. Although deterministic and non-standard measurement matrices are addressed in [15], the existence of clutter interferences common in radar applications suggests merit in extending their results to situations where not only the scene as a whole, but also the targets of interest should be explicitly treated with respect to trans-information quantification.

Lastly, as mentioned previously, phase transition diagrams provide visual tools to guide undersampling by showing the relations between sampling rates and scene-noise-distortion constraints. By the computational approaches, we can simulate scenes of differing sparsity, generate linear measurements of different sensing capacity and signal-to-noise ratios (SNRs), obtain reconstructed scenes under different distortion thresholds and analyze the information chain of scene-sampling-reconstruction to create phase transition diagrams [27]. Such computationally derived phase transitions in the context of accurate and approximate signal reconstruction were discussed by Donoho and Tanner [27] and Zhang et al. [9], respectively. The other school is theoretical approaches, as exemplified by Donoho et al. [38] and Zhang et al. [15]. Donoho et al. [38] presented a formula that characterizes the allowed undersampling of generalized sparse objects and applies to approximate message passing (AMP) algorithms for CS. They proved this formula from state evolution and presented numerical results in a wide range of settings. However, entries of measurement matrices employed in their analyses need to be i.i.d. standard Gaussian, as is also the case with Wu and Verdú [26], precluding the formula’s use in applications whereby non-random measurement matrices are involved. Zhang et al. [15] proposed an information-theoretic strategy to map phase transitions, with scene-sampling-distortion trade-offs described in graphics. The phase diagrams derived are rendered for a specific distortion threshold, but they can only be interpreted in terms of scene reconstruction as a whole, not necessarily the targets of interest. It would be more useful to be able to generate phase diagrams at different distortion thresholds and make them useful not only for scene imaging as a whole but also target detection/estimation in a cluttered environment.

This paper seeks to describe, analyze and interpret information dynamics in compressive radar imaging from the perspective of information theory, which is complementary to and enriches the classic CS theory for CS-radar sampling design and performance evaluation, as informational quantities demarcate performance limits in CS better than otherwise. The proposed informational analysis will focus on information-theoretic description and analysis of the source (scene), through the channel (measurements), to the destination (radar imaging). Specifically, the studies concern: (1) information-theoretic characterization of compressibility of radar scenes, (2) trans-information quantification of radar measurements about the underlying scene and about the targets of interest against a cluttered background, respectively, and (3) derivation of necessary sampling ratios for signal reconstruction at a range of distortion tolerances. A synthetic experiment implemented aims to illustrate theoretical derivations and their use through computer-generated graphics visualizing scene-sampling-distortion inter-relations with an informational centrality. To summarize, major contributions and novelty of the paper are as follows:

(1): Use of Gaussian mixture models is clarified for both strictly and approximately sparse radar scenes where the targets to detect and estimate are in small number but possess relatively strong reflectivity, with rate-distortion described, which is related to scene sparsity and target-to-background variance ratios (termed TBRs in this paper);
(2): A generalized approach is proposed for quantifying trans-information between noisy measurements and the underlying scene as a whole and between the measurements and targets of interest against clutter interference, in particular, with the latter providing a more contingent benchmark for sparse target detection and hence estimation; this is accomplished through derivation of undersampled data’s joint differential entropy and trans-information in the context of deterministic measurement matrices, which are common in remote sensing applications;
(3): General formulas and numerical methods are devised for determining necessary under-sampling ratios for strictly sparse scenes, where clutter interference is absent or negligibly weak relative to targets of interest, and approximately sparse (or compressible) ones, where clutter needs to be taken into account, with the former being a special case of the latter, in line with the way trans-information is estimated;
(4): Phase diagrams showing relations between necessary undersampling ratios and sparsity-clutter-noise constraints are produced; they are conditional to a specified measurement matrix, specific to given distortion thresholds, able to accommodate a range of TBRs and are well suited for undersampling design and performance evaluation.

After a description of a few radar fundamentals, compressible radar scenes are modeled via Gaussian mixture distributions, and their rate distortion functions are discussed. This is followed by a description of the methods for determining mutual information measures (upper bounds, to be exact): (1) between measurements and the underlying scene, which measures the amounts of information conveyed by measurements about the scene as a whole, and (2) between measurements and the targets of interest excluding clutter, which measures the amounts of information conveyed by measurements about the targets of interest against a background of clutter. Necessary undersampling ratios are determined by requiring rate distortion not exceeding the amounts of trans-information of measurements about the scene and the targets of interest, respectively. Based on descriptions of models and methods, an experiment in a hypothetic scene-sampling environment is then reported, with results, which are conditional to the measurement matrix simulated, discussed. Lastly, some concluding remarks are given.

2. Models and Methods

2.1. Radar Measurements and Imaging: towards a Compressive Strategy

Synthetic aperture radar (SAR) systems can generate images of high spatial resolution by synthesizing the coherent pulses during integration time. A stripmap mode SAR with a single channel is considered here. For a stripmap SAR, with the platform moving in the azimuth direction (i.e., slow time direction), the antenna illuminates the scene with a transmitted signal and receives the echoes reflected from therein.

Suppose a chirp signal is transmitted, as is usually the case [40]. We can formulate the chirp signal as:

T R (τ) = r e c t (τ / T_{p}) \exp {J π K_{r} τ^{2}}, τ \in (- T_{p} / 2, T_{p} / 2]

(1)

where

T R

represents the transmitted chirp signal, τ is the fast time,

T_{p}

represents time duration of the chirp pulse,

K_{r}

is the chirp rate, rect(.) stands for the rectangular function, and

J^{2} = - 1

. The echo at slow time η and fast time τ,

Y (η, τ)

, can be written as:

\begin{array}{l} Y (η, τ) = \iint_{a, r} X (a, r) w_{a} (η - a / v) \exp {- J 4 π f_{0} R G (a, r, η) / c} \\ T R (τ - 2 R G (a, r, η) / c) d a d r + N (η, τ) \end{array}

(2)

where

T R

is specified in Equation (1), (a,r) indicates the azimuth and range position of a target,

X (a, r)

is the backscattering coefficient at (a,r),

N (η, τ)

is the thermal noise at

(η, τ)

at the receiving terminal,

w_{a}

is the azimuth weighting function,

f_{0}

is the carrier frequency,

R G (a, r, η)

is the slant range, v is the platform velocity relative to the ground, and c is the speed of light [9,15,40].

To discretize representations in Equation (2), we first simplify it as:

Y (η, τ) = \iint_{a, r} A (η, τ, a, r) X (a, r) d a d r + N (η, τ)

(3)

where

A (η, τ, a, r) = w_{a} (η - a / v) \exp {- J 4 π f_{0} R G (a, r, η) / c} T R (τ - 2 R G (a, r, η) / c)

is the convolution kernel. The discrete form of Equation (3) is:

Y (i a, i r) = \sum_{j a = 1}^{n a} \sum_{j r = 1}^{n r} A (i a, i r, j a, j r) X (j a, j r) + N (i a, i r)

(4)

where

Y (i a, i r)

is the ir-th fast time sample of the ia-th slow time observation of the echo,

X (j a, j r)

is the backscattering coefficient at the ja-th position along the azimuth direction and the jr-th position of the slant range,

N (i a, i r)

is the additive noise term, na and nr represent the maximum numbers of grid nodes along azimuth and range directions, respectively [9].

For matrix-presentation of the convolution implied in Equation (4), elements

A (i a, i r, j a, j r)

are denoted

A_{(j a, j r)} (i a, i r)

, which is also a discrete representation of

A (η, τ, a, r)

in Equation (3) but shows the convolution operation more clearly:

A_{(j a, j r)} (i a, i r) = w_{a} (η_{i a} - a_{j a} / v) \exp {- J 4 π f_{0} R G (a_{j a}, r_{j r}, η_{i a}) / c} T R (τ_{i r} - 2 R G (a_{j a}, r_{j r}, η_{i a}) / c)

where

τ_{i r}

(ir = 1, …, L) and

η_{i a}

(ia = 1, …, Q) are the sampling time indicators at range and azimuth directions, respectively; L and Q represent the maximum numbers of time samples at range and azimuth directions, respectively; ja and jr index grid positions along azimuth and range directions of the scene, respectively. {

A_{(j a, j r)} (i a, i r)

} can then be organized into a convolution matrix A:

A = (\begin{matrix} A_{(1, 1)} (1, 1) \\ A_{(1, 1)} (1, 2) \\ ⋮ \\ \begin{matrix} A_{(1, 1)} (1, L) \\ A_{(1, 1)} (2, 1) \\ \begin{matrix} ⋮ \\ A_{(1, 1)} (Q, L) \end{matrix} \end{matrix} \end{matrix} \begin{matrix} \dots \\ \dots \\ ⋱ \\ \begin{matrix} \dots \\ \dots \\ \begin{matrix} ⋱ \\ \dots \end{matrix} \end{matrix} \end{matrix} \begin{matrix} A_{(n a, n r)} (1, 1) \\ A_{(n a, n r)} (1, 2) \\ ⋮ \\ \begin{matrix} A_{(n a, n r)} (1, L) \\ A_{(n a, n r)} (2, 1) \\ \begin{matrix} ⋮ \\ A_{(n a, n r)} (Q, L) \end{matrix} \end{matrix} \end{matrix})

(5)

where L, Q, na, and nr are defined as previously. Clearly, L and Q equal nr and na, respectively. After matrix-representation of convolution kernels in A, we can form column vectors

Y_{n \times 1}

,

X_{n \times 1}

, and

N_{n \times 1}

(n = na × nr = Q × L) by row-stacking. For example, the first row of matrix-represented scene {X(ja,jr)} (ja = 1, .., na; jr = 1, …, nr) is transposed and becomes the top nr elements of

X_{n \times 1}

, and subsequent rows are transposed and placed underneath. Row-stacking for

Y_{n \times 1}

and

N_{n \times 1}

is done similarly. Thus, Equation (4) can be re-written:

Y_{n \times 1} = A_{n \times n} X_{n \times 1} + N_{n \times 1}

(6)

where

A_{n \times n}

is a block circulant matrix as shown in (5). Without causing any confusion, we may use X, Y, N, and A for

Y_{n \times 1}

,

X_{n \times 1}

,

N_{n \times 1}

, and

A_{n \times n}

, respectively, by omitting the subscripts shown in Equation (6).

Radar imaging refers to the process by which radar reflectivity X is reconstructed from echo data Y. This can be accomplished using tailor-made algorithms, such as the so-called range-Doppler algorithm [40]. For a CS-based radar imaging system, the number of samples collected in the receiver can be reduced so the lengths of column vectors Y and N and number of rows of matrix A are denoted m (m <= n) below, while a full-rank measurement matrix A is employed for a conventional radar imaging scenario [15]. Thus, a set of compressive measurements can be written as:

Y_{m \times 1} = A_{m \times n} X_{n \times 1} + N_{m \times 1}

(7)

where, again, the subscripts to indicate dimensions of the vectors and matrix concerned may be omitted without causing any ambiguity. This leads to a typical linear CS system:

Y = A X + N

(8)

As mentioned previously, radar images are complex-valued and contain information not only in amplitude but also in phase [32,41]. To implement CS with radar imaging, we can use real representations for the complex-valued radar images [42] so that the CS techniques designed for real-valued signals can be employed, given the facts that real and imaginary parts of complex-valued radar images are i.i.d. Gaussian [32] and that analysis and algorithms for complex signals are not well-developed. Thus, complex-valued vectors Y, X, and

N

and matrix A need to be decomposed into their real and imagery parts [15]:

\tilde{Y} = (\begin{matrix} R e (Y) \\ I m g (Y) \end{matrix}), \tilde{X} = (\begin{matrix} R e (X) \\ I m g (X) \end{matrix}), \tilde{N} = (\begin{matrix} R e (N) \\ I m g (N) \end{matrix}), \tilde{A} = (\begin{matrix} R e (A) & - I m g (A) \\ I m g (A) & R e (A) \end{matrix})

(9)

where Re() and Img() represent the real and imagery parts of complex-valued quantities concerned, respectively. Note that the dimensions of real-represented vectors and matrix in (9) will be twice those implied in (8). Therefore, we can formulate a CS-radar measurement model:

\tilde{Y} = \tilde{A} \tilde{X} + \tilde{N}

(10)

CS-based radar imaging is formulated as:

\arg \min_{\tilde{X}} | | \tilde{X} | |_{0} s.t. | | \tilde{Y} - \tilde{A} \tilde{X} | |_{2} \leq ε

(11)

where

ε > 0

. To simplify subsequent notations, we can keep using the typical model in Equation (8) in lieu of Equation (10), although they should be properly interpreted in terms of meanings and dimensionality. For solving (11), l_q-minimization algorithms, such as Orthogonal Matching Pursuit (OMP) [43] and compressive sampling matching pursuit (CoSaMP) [44], can be implemented [9,15], although detailed description of signal reconstruction algorithms is beyond the scope of the paper.

With linear measurement model established as in Equations (8) and (10), it is convenient to discuss CS-radar. As mentioned previously, there are typically three components in CS as implied in

Y = A X + N

: (1) sparse signals X (e.g., a radar scene dominated with a small number of targets with very strong reflectivity in contrast to a background of weak reflectivity), (2) an information sampling mechanism (i.e., an encoder) through a measurement matrix A to get compressive or undersampled measurements

Y = A X + N

, which should be reasonably efficient in conveying information in X, and (3) a signal reconstruction algorithm

\hat{χ}

(i.e., a decoder) that can detect sparsity patterns and/or estimate significant coefficients

\hat{X}

from undersampled data Y ([4,6,8,45], as discussed also in Zhang et al. [15]. In Sections 2.2 through 2.4, we discuss the compressibility of X, trans-information of A and undersampling ratios, respectively.

2.2. Sparse Radar Scenes and Rate-Distortion Characterization

In this sub-section, we propose representing radar scenes by Gaussian mixture models (GMMs) and describe their rate distortion functions. The underlying scene X can be regarded as being discrete or continuous for the task of detection or estimation. We focus on the latter, noting that the former can be seen as a special case of the latter when sparsity support recovery is all we need. Radar reflectivity is usually represented as complex-valued RVs having i.i.d. real and imaginary parts that are modeled as Gaussian distribution of zero means and variance indicative of radar image intensity, due to the large number of sub-pixel targets and their incoherently interfering reflectivities [22,32].

There are two characteristics of radar scenes and their images, which are relevant to the discussion here: the noise-likeness and very high dynamic ranges. The former refers to speckle, which can be modeled as multiplicative exponential noise, and the white noise-like phase, which is uniformly distributed in

[- π, π]

) [32]. The latter is caused by the presence of a few bright targets in a scene [46]. Because of their noise-like properties and hence high entropy, complex-valued radar images are inherently difficult to compress efficiently. This means that they have limited compressibility in any dictionary. Despite these, sparsity can be justified for radar scenes where there are only a small number of point-like strongly reflecting scatters. Sparsity can also be assumed for radar surveillance applications whereby the interest is on a few dominant targets, such as vehicles, ships or airplanes [6,8,15].

Consider an image under study. We can denote the complex-valued image X

\in ℂ^{n' \times 1}

(actually

\tilde{X}

as in Equation (10)); alternatively, it can be denoted real-valued X

\in ℝ^{n \times 1}

(n=2n’) for consistence with the previous vector dimension notations in Equation (8) without causing any ambiguity if we use real representations for real and imaginary parts of X as in Equation (10). The image X can be decomposed into two components:

X = X_{1} + X_{0}

, where

X_{1}

represents the sparse bright targets (i.e., signal of interest) while

X_{0}

indicates the background (i.e., clutter) [15,46].

When a scene is assumed to consist of homogeneous patches of different classes, each of which is assumed a Gaussian distribution with its own mean and variance, Gaussian mixture models can be used for modeling such a scene and its images [47,48]. Specifically, when the underlying scene consisting of sparse targets and clutter is represented as a vector X of length n, it can be modeled as a sequence of RVs {X(1), …, X(n)}, with each representing a grid cell of the imaged scene and drawn i.i.d. from a Gaussian two-component mixture distribution. Thus, we can describe the underlying scene via a Gaussian signal-clutter mixture distribution:

P_{X} \sim (1 - κ) Ν (μ_{0}, σ_{0}^{2}) + κ Ν (μ_{1}, σ_{1}^{2})

(12)

where

κ

≤ 1/2 indicates sparsity (i.e.,

κ

= k/n),

μ_{1} = μ_{0} = 0

,

σ_{1} > σ_{0} \geq 0

[15,48,49,50]. In Equation (12), the signal refers actually to targets of interest. Clearly,

V_{X} = V_{X_{1}} + V_{X_{0}} = κ σ_{1}^{2} + (1 - κ) σ_{0}^{2}

, where V_X stands for the variance of a RV X drawn from the vector X, and

V_{X_{1}}

and

V_{X_{0}}

represent variance of signal and clutter components, respectively. For strictly sparse signal, also known as spike, we have

σ_{0} = 0

,

V_{X} = V_{X_{1}} = κ σ_{1}^{2}

. This means that the model in (12) is reduced to the Bernoulli-Gaussian model discussed by Weidmann and Vetterli [31].

As mentioned in the introductory section, the entropy H(X) for a RV X is its minimum descriptive complexity and sets its ultimate data compression if it were to be compressed and then decompressed without loss of information (see [51] for examples in hyperspectral image data transmission and classification). For lossy data compression as in compressive sampling-based radar imaging, a more useful quantity is rate distortion function [17]. The rate-distortion function of a scene comprised of a set of pixels generically denoted by X, R_X(D), measures the minimal amount of information (in numbers of bits per pixel), which should be communicated via echo data to allow for scene reconstruction at an average distortion no greater than a threshold D specified. The operational definition of R_X(D) is equivalent to the information rate distortion function

R_{X}^{(I)} (D)

:

R_{X} (D) = R_{X}^{(I)} (D) = \min_{p (\hat{x} | x) : E (d (x, \hat{x})) \leq D} I (X; \hat{X})

(13)

where

\hat{X}

represents the reconstruction RV defined via the conditional probability mass (or density) function

p (\hat{x} | x)

,

I (X; \hat{X})

the mutual information between X and

\hat{X}

,

d (\hat{x}, x)

the distortion measure and

E (d (x, \hat{x}))

the expected distortion obtained over the joint distribution of

p (x, \hat{x})

[17,31], as reviewed by Zhang et al. [15]. A mean squared error (MSE) measure is used for a continuous signal represented by a RV X, which is a generic element in the sequence X.

Reznic et al. [52] studied the rate distortion function for a mixture of two Gaussian sources and proposed relevant formulas. To reflect the impacts of TBRs on rate-distortion behaviors, we made some modifications. For a Gaussian mixture of two components (i.e., targets X₁ and the clutter X₀), assume that the values of target and background variance are

σ_{1}^{2}

and

σ_{0}^{2}

, respectively. We can write

σ_{1}^{2} = T B R σ_{0}^{2}

. The adapted rate distortion function for this two-component Gaussian mixture can be written as:

\begin{array}{l} R_{X} (D) = {\begin{matrix} H (κ) + \frac{1 - κ}{2} \log [σ_{0}^{2} / D] + \frac{κ}{2} \log [σ_{1}^{2} / D], if D < σ_{0}^{2} \\ H (κ) + \frac{κ}{2} \log [κ σ_{1}^{2} / [D - (1 - κ) σ_{0}^{2}]], if σ_{0}^{2} < D \leq (1 - κ) σ_{0}^{2} + T B R κ σ_{0}^{2} \end{matrix} \\ = {\begin{matrix} H (κ) + \frac{1}{2} \log [σ_{0}^{2} / D] + \frac{κ}{2} \log [T B R], if D < σ_{0}^{2} \\ H (κ) + \frac{κ}{2} \log [κ σ_{0}^{2} / [D - (1 - κ) σ_{0}^{2}]] + \frac{κ}{2} \log [T B R], if σ_{0}^{2} < D \leq (1 - κ) σ_{0}^{2} + T B R κ σ_{0}^{2} \end{matrix} \end{array}

(14)

where

H (κ)

is binary entropy. For a strict sparse scene, we have

σ_{0}^{2} \to 0

, which means absence of clutter in a radar scene. We can simplify its rate distortion function [39]:

\begin{array}{l} R_{X} (D) = H (κ) + \frac{κ}{2} \log [κ σ_{1}^{2} / D], if 0 < D \leq κ σ_{1}^{2} \\ = H (κ) + \frac{κ}{2} \log [κ σ_{0}^{2} / D] + \frac{κ}{2} \log [T B R], if 0 < D \leq T B R κ σ_{0}^{2} \end{array}

(15)

meaning that the minimal rate needed for lossy compression of a strictly sparse radar scene is the sum needed for encoding sparse support, which is

H (κ)

as in Equation (14), and i.i.d. Gaussian quantity X of variance

V_{X} = V_{X_{1}} = κ σ_{1}^{2}

with a weight

κ

. Here, the underlying scene is represented as a sequence of i.i.d. generic RVs {X(1), …, X(n)}, with each modeled via GMMs, as mentioned previously.

2.3. Trans-information of Radar Measurements Concerning the Underlying Scene and the Targets of Interest

We consider mutual information conveyed by radar measurements Y (see Equation (8)) about the underlying scene X and the targets of interest X₁, respectively. We start with the former. Here, we have a deterministic matrix A

\in ℝ^{m \times n}

, whose rows are denoted A_i(i = 1, …, m), and the noise vector N consists of a sequence of i.i.d. Gaussian RVs with variance V_N. Instead of having each row restricted to a constant or unit l₂ norm, i.e.,

{A_{i}}^{T} A_{i} = 1

, as in the literature, row

A_{i}

is not necessarily to have either an equal or unit l₂ norm, but is only subject to having finite l₂ norm [15].

By definition, conditional mutual information between X and Y:

I (X; Y | A) = h (Y | A) - h (N)

(16)

where h(Y|A) and h(N) represent joint differential entropy of the vector of measurements Y and the vector of noise N, respectively. The differential entropy, denoted by lower case h rather than upper case H, is the entropy of a continuous RV, while the joint differential entropy is defined for a set of continuous RVs and determined by their joint probability density. This notation (i.e., h()) will be used in the remainder of this paper.

Also, we can derive the inequality:

h (Y | A) \leq h (Y) \leq h (Y^{*})

(17)

where

Y^{*} = A X^{*} + N

is a column vector of Gaussian RVs, and

X^{*}

is a vector of i.i.d. Gaussian RVs having the same covariance as X. The inequality

h (Y) \leq h (Y^{*})

in (17) originates from the fact that Gaussian RVs maximize entropies of distributions with the same variance. Further, we can put an upper bound on joint entropy (differential entropy, to be exact, for continuous RVs) of

Y^{*}

:

h (Y^{*}) \leq h (Y_{1}^{*}) + \sum_{i = 1}^{m - 1} h (Y_{i + 1}^{*} | Y_{i}^{*}) \leq h (Y_{1}^{*}) + \sum_{i = 1}^{m - 1} h (Y_{i + 1}^{*} - {\hat{Y}}_{i + 1}^{*})

(18)

where

{\hat{Y}}_{i + 1}^{*} = b_{i} Y_{i}^{*}

is the minimum mean squared error (MMSE) estimate for

Y_{i + 1}^{*}

, with

b_{i} = cov (Y_{i}^{*}, Y_{i + 1}^{*}) / var (Y_{i}^{*})

, where

cov (Y_{i}^{*}, Y_{i + 1}^{*}) = {A_{i}}^{T} A_{i + 1} V_{X}

and

var (Y_{i}^{*}) = {A_{i}}^{T} A_{i} V_{X} + V_{N}

, with V_N being the variance of i.i.d. noise elements in vector N [15,30].

The right most hand side of (18) can be evaluated as:

\begin{array}{l} h (Y_{1}^{*}) + \sum_{i = 1}^{m - 1} h (Y_{i + 1}^{*} - {\hat{Y}}_{i + 1}^{*}) \\ = h {(Y^{*})}_{u b} \\ = \frac{1}{2} \log {(2 π e)}^{m} + \frac{1}{2} \log (var (Y_{1}^{*})) + \frac{1}{2} \sum_{i = 1}^{m - 1} \log (var (Y_{i + 1}^{*} - {\hat{Y}}_{i + 1}^{*})) \end{array}

(19)

where

var (Y_{i + 1}^{*} - {\hat{Y}}_{i + 1}^{*})

, the variance for the error of

{\hat{Y}}_{i + 1}^{*}

, is evaluated as:

var (Y_{i + 1}^{*} - {\hat{Y}}_{i + 1}^{*}) = {A_{i + 1}}^{T} A_{i + 1} V_{X} + V_{N} - \frac{{({A_{i}}^{T} A_{i + 1} V_{X})}^{2}}{{A_{i}}^{T} A_{i} V_{X} + V_{N}}

.

From Equations (16) and (19), we can quantify upper bound of mutual information between X and Y, as

I (X; Y | A) = h (Y | A) - h (N) \leq h (Y^{*}) - h (N) \leq h {(Y^{*})}_{u b} - h (N)

. The upper bound of

I (X; Y | A)

is evaluated as:

\begin{array}{l} I {(X; Y | A)}_{u b} = h {(Y^{*})}_{u b} - h (N) \\ = \frac{1}{2} \log ({A_{1}}^{T} A_{1} s n r + 1) + \frac{1}{2} \sum_{i = 1}^{m - 1} \log [{A_{i + 1}}^{T} A_{i + 1} s n r + 1 - \frac{{({A_{i}}^{T} A_{i + 1} s n r)}^{2}}{{A_{i}}^{T} A_{i} s n r + 1}] \end{array}

(20)

where

I {(X; Y | A)}_{u b}

represents the upper bound for

I (X; Y | A)

, and snr is the ratio of variance of X over that of noise N (i.e., snr = V_X/V_N). Note we can compute per-sample signal (plus clutter actually) to noise ratio:

S (C) N R = \frac{E [| | A X | |_{2}^{2}]}{E [| | N | |_{2}^{2}]}

, which can be reduced to the usual signal to noise ratio (SNR) when there is no clutter. This measure will be useful for simplified estimation of

I {(X; Y | A)}_{u b}

assuming constant

{A_{i}}^{T} A_{i}

and zero correlation between rows of A, as shown below. In deriving Equation (20), it is easy to see the formula for h(N):

h (N) = \frac{1}{2} \log {(2 π e)}^{m} + \frac{1}{2} \log {(V_{N})}^{m}

As the scene (represented as vector X) is assumed to be consisting of the sum of sparse targets X₁ and background X₀, trans-information

I (X; Y | A)

does not reflect the amount of effective information conveyed by Y about the sparse targets X₁, unless X₀ is zero-intensity or much weaker than X₁, as is usually assumed in compressive sampling and sparse recovery. For detection of targets X₁, clutter X₀ will interfere with accurate recovery of X₁ from target-clutter mixture X [33]. It is thus more objective to derive formulas for estimating trans-information of Y about X₁, with X₀ and N acting as clutter and noise, respectively. The remainder of this sub-section will describe how effective trans-information of Y about X₁ may be estimated from signal-clutter-noise configuration. The model in Equation (8) should be decomposed as:

Y = A X + N = A (X_{1} + X_{0}) + N = A X_{1} + (A X_{0} + N) = A X_{1} + C N

(21)

As shown in Equation (16), conditional mutual information between X₁ and Y can be written as:

I (X_{1}; Y | A) = h (Y | A) - h (Y | X_{1}, A) = h (Y | A) - h (C N | A)

(22)

As in (18) and (19), we can derive an upper bound for

h (Y | A)

:

h {(Y^{*})}_{u b} = \frac{1}{2} \log {(2 π e)}^{m} + \frac{1}{2} \log (var (Y_{1}^{*})) + \frac{1}{2} \sum_{i = 1}^{m - 1} \log (var (Y_{i + 1}^{*} - {\hat{Y}}_{i + 1}^{*}))

.

The joint entropy of clutter and noise

h (C N | A)

can be similarly upper bounded by:

\begin{array}{l} h (C N | A) \leq h (C N^{*}) \leq h (C N_{1}^{*}) + \sum_{i = 1}^{m - 1} h (C N_{i + 1}^{*} | C N_{i}^{*}) \\ \leq h (C N_{1}^{*}) + \sum_{i = 1}^{m - 1} h (C N_{i + 1}^{*} - C {\hat{N}}_{i + 1}^{*}) = h {(C N)}_{u b} \end{array}

(23)

where

C {\hat{N}}_{i + 1}^{*} = (cov (C N_{i}^{*}, C N_{i + 1}^{*}) / var (C N_{i}^{*})) C N_{i}^{*}

is the minimum mean squared error (MMSE) estimate for

C N_{i + 1}^{*}

. So, we can write the right most hand side of inequality in (23) as:

h {(C N)}_{U B} = \frac{1}{2} \log {(2 π e)}^{m} + \frac{1}{2} \log (var (C N_{1}^{*})) + \frac{1}{2} \sum_{i = 1}^{m - 1} \log (var (C N_{i + 1}^{*} - C {\hat{N}}_{i + 1}^{*}))

(24)

where

var (C N_{i + 1}^{*} - C {\hat{N}}_{i + 1}^{*}) = {A_{i + 1}}^{T} A_{i + 1} V_{X_{0}} + V_{N} - \frac{{({A_{i}}^{T} A_{i + 1} V_{X_{0}})}^{2}}{{A_{i}}^{T} A_{i} V_{X_{0}} + V_{N}}

, and

V_{X_{0}} = (1 - κ) σ_{0}^{2}

.

As

I (X_{1}; Y | A) = h (Y | A) - h (Y | X_{1}, A) \leq h (Y^{*}) - h (C N^{*}) \leq h {(Y^{*})}_{u b} - h {(C N^{*})}_{u b}

, from (19) and (24), we can quantify upper bound of mutual information between X₁ and Y

I {(X_{1}; Y | A)}_{u b}

as:

\begin{array}{l} I {(X_{1}; Y | A)}_{u b} = h {(Y^{*})}_{u b} - h {(C N)}_{u b} \\ = \frac{1}{2} \log (\frac{{A_{1}}^{T} A_{1} V_{X} + V_{N}}{{A_{1}}^{T} A_{1} V_{X_{0}} + V_{N}}) + \frac{1}{2} \sum_{i = 1}^{m - 1} \log (\frac{{A_{i + 1}}^{T} A_{i + 1} V_{X} + V_{N} - \frac{{({A_{i}}^{T} A_{i + 1} V_{X})}^{2}}{{A_{i}}^{T} A_{i} V_{X} + V_{N}}}{{A_{i + 1}}^{T} A_{i + 1} V_{X_{0}} + V_{N} - \frac{{({A_{i}}^{T} A_{i + 1} V_{X_{0}})}^{2}}{{A_{i}}^{T} A_{i} V_{X_{0}} + V_{N}}}) \\ = \frac{1}{2} \log (\frac{{A_{1}}^{T} A_{1} V_{X_{0}} (1 + T B R κ / (1 - κ)) + V_{N}}{{A_{1}}^{T} A_{1} V_{X_{0}} + V_{N}}) \\ + \frac{1}{2} \sum_{i = 1}^{m - 1} \log (\frac{{A_{i + 1}}^{T} A_{i + 1} V_{X_{0}} (1 + T B R κ / (1 - κ)) + V_{N} - \frac{{({A_{i}}^{T} A_{i + 1} V_{X_{0}} (1 + T B R κ / (1 - κ)))}^{2}}{{A_{i}}^{T} A_{i} V_{X_{0}} (1 + T B R κ / (1 - κ)) + V_{N}}}{{A_{i + 1}}^{T} A_{i + 1} V_{X_{0}} + V_{N} - \frac{{({A_{i}}^{T} A_{i + 1} V_{X_{0}})}^{2}}{{A_{i}}^{T} A_{i} V_{X_{0}} + V_{N}}}) \end{array}

(25)

where

I {(X_{1}; Y | A)}_{u b}

represents the upper bound for

I (X_{1}; Y | A)

. To make the link between

I {(X_{1}; Y | A)}_{u b}

and TBRs explicit in Equation (25),

V_{X}

is replaced by

V_{X_{0}} (1 + T B R κ / (1 - κ))

. However, the upper bound (i.e.,

I {(X_{1}; Y | A)}_{u b}

) evaluated in (25) is lower than what would be obtained if

h (Y | X_{1}, A)

, which is actually

h (C N | A)

, rather than

h {(C N^{*})}_{u b}

were applied in derivation of (25). Thus, use of (25) will not imply too much overestimation of

I (X_{1}; Y | A)

, as overestimation implied in

h {(Y^{*})}_{u b}

is offset to some extent by that in

h {(C N^{*})}_{u b}

.

Clearly,

I {(X_{1}; Y | A)}_{u b}

will be less than

I {(X; Y | A)}_{u b}

, although both will be equal when X₀ is absent or assumed to be negligible in the mixture X. It is therefore more contingent to use (25) as a conservative measure of information content of Y about X₁, the targets to detect and reconstruct. Also, it is important to note that Equation (25) is a generalized form of Equation (20).

2.4. Undersampling Ratio Determination

We may derive an approximate bound on undersampling ratio m/n by requiring mutual information be no less than nR_X(D). We can do this but have to differentiate between the two scenarios discussed in Section 2.3. One is based on:

I {(X; Y | A)}_{u b} \geq n R_{X} (D)

(26)

and the other:

I {(X_{1}; Y | A)}_{u b} \geq n R_{X} (D)

(27)

In comparison, the undersampling ratio indicated in (26) will be an underestimate of that in (27). The former specifies the necessary sampling ratio required for approximate recovery of X as a whole regardless of whether it is the target of interest or the background, while the latter determines the necessary undersampling ratio required for approximate recovery of X₁ from noisy and clutter-interfering measurements. Thus, the latter is more conservative in determining the sampling necessity conditions than the former, and should be applied in the more challenging cases where detection and reconstruction of targets of interest have to deal with not only measurement noise but also clutter interference.

As m is generally implicit in (20), (25), (26), and (27), we need to use numerical methods to find critical values of m from such non-linear inequalities. To assist in analysis and interpretation, we can simplify them so that analytical expressions may be derived. To simplify use of (20) in combination with (26), we may derive an underestimated sampling ratio as:

m / n \geq 2 R_{X} (D) / \log [1 + S (C) N R]

(28)

where S(C)NR represents per-sample signal (plus clutter) to noise ratio and distortion D takes achievable values as indicated in Equation (14). S(C)NR is evaluated as the product of F and snr: S(C )NR = F ×snr, where F is the constant l₂ norm for each row of A (

{A_{i}}^{T} A_{i}

) and snr = V_X/V_N, The underestimation implied in (28) originates from the assumption made in its derivation: zero correlation (implying maximized per-sample mutual information between Y and X) and constant l₂ norm for each row of A.

To use (25) and (27) in a way similar to that of (20) and (26), we can make similar assumption as above (i.e., zero correlation and constant l₂ norm for each row of A), and derive an approximate analytical expression for (27):

m / n \geq 2 R_{X} (D) / \log (\frac{F V_{X} + V_{N}}{F V_{X_{0}} + V_{N}}) = 2 R_{X} (D) / \log (1 + S C N R)

(29)

where F represents the constant l₂ norm for each row of A (i.e.,

{A_{i}}^{T} A_{i}

), and the signal-to-clutter-plus-noise ratio (SCNR) is derived as:

S C N R = \frac{F V_{X_{1}}}{F V_{X_{0}} + V_{N}} = \frac{F V_{X_{0}} T B R κ / (1 - κ)}{F V_{X_{0}} + V_{N}}

.

Mutual information I(X;Y|A) measures the amount of information conveyed by measurements Y about the scene X being estimated. Its upper bound quantified in (20) sets an upper limit to the trans-information of Y about X and hence the amount of information retained in image

\hat{X}

reconstructed from Y (i.e., I(X;

\hat{X}

|A)). This follows the well-known data processing theorem in information theory, which states that I(X;

\hat{X}

|A) ≤ I(X;Y|A) as X→Y→

\hat{X}

forms a Markov chain [17]. Therefore, in addition to its importance for determining necessary sampling ratios, the trans-information of compressive samples Y about X is also a valuable indicator as an informational limit for the resultant reconstructed image

\hat{X}

whether

\hat{X}

is derived from convex optimization-based CS algorithms or not [15]. Similarly,

I {(X_{1}; Y | A)}_{u b}

quantified in (25) sets an upper limit to the effective trans-information of Y about X₁, which is the signal of interest excluding clutter, and hence the amount of effective information about the targets of interest retained in image

\hat{X}

reconstructed from Y.

3. A Simulated Experiment

The simulated experiment began with simulation of hypothetic scenes X’s, noise N’s, and echo data Y’s; a convolution matrix A needs to be specified to facilitate simulation of Y, as shown below. We will describe the procedures to generate various information-theoretically derived graphics. They aim to show: (1) rate-distortion characteristics (i.e., R_X(D)) of compressible radar scenes, (2) trans-information of compressive radar measurements about the underlying scene X as a whole and the targets of interest X₁, respectively, and (3) necessary undersampling ratios (m/n) for scene reconstruction given scene sparsity, TBRs, SCNRs, and distortion thresholds. As will be shown, the computational experiments and related results are parallel to and based on the methods presented in Section 2.2, Section 2.3 and Section 2.4. These will be followed by some discussion.

3.1. Simulation of Sparse Scenes and Noisy Echo Data

This sub-section describes generation of hypothetic scenes X’s (each of 100 by 100 grid cells) with various sparsity and TBRs. This is followed by specification of radar parameters and generation of the corresponding convolution kernel matrix A. All results derived hereafter (except for simulated sparse scenes) will be conditional to A. Simulation of noisy radar echo data Y (

Y = A X + N

) is then carried out using matrix A and simulated scenes X, after zero-mean Gaussian noise N’s are simulated with various noise levels.

To generate a set of realized scenes X’s, we set up a series of sparsity (equally spaced in the interval 0.01 ~ 0.50, step = 0.005; we set 100 different sparsity measures in total), and TBRs (their square roots are equally spaced in the interval 2 ~ 51, at a step of 0.495, resulting in a total of 100 different TBR values. We set

σ_{0}^{2} = 1

, so

σ_{1}^{2} = T B R \times σ_{0}^{2} = T B R

. After setting the range of scene parameters, complex-valued signal X was generated using the GMMs based sparsity model (i.e., Equation (12)). The simulation proceeded in two steps: (1) simulating locations of sparse targets according to a given sparsity level, while the rest being the background, and (2) simulating jointly the real and imaginary parts of complex reflectivities of individual target or background pixels according to their respective variance values (e.g.,

σ_{0}^{2}

and

σ_{1}^{2}

, if

σ_{0}^{2}

and TBRs are pre-set), depending on whether the pixels being simulated belong to the targets or background. The first step may be adapted to generating a map of patterned sparse targets, whose locations are specified according to template, such as an existing sparse image. We have a total of 100 × 100 possible combinations of sparsities and TBRs.

As examples, Figure 1 shows four simulated radar scenes with two sparsity levels (0.03 and 0.15), and two TBRs (25 and 900). The two sparsity levels represent extremely and moderately sparse scenes, respectively, while the two TBRs indicate small and large reflectivity differences between targets and background, respectively. Each scene is shown by a row of three sub-figures: real, imaginary and amplitude, from left to right. The first (Figure 1a–c) the second (Figure 1d–f) rows are for extremely sparse scenes of low and high TBRs, respectively, while the third (Figure 1g–i) and fourth (Figure 1j–l) rows are for moderately sparse scene of low and high TBRs, respectively. Clearly, targets of interest will be harder to detect from scenes of low TBRs than from those of high TBRs, meaning that we would need more sampled data to form images of acceptable detectability for the former than for the latter.

Figure 1. (a–c) real component, imaginary component and amplitude images of a scene of sparsity = 0.03 and TBR = 25, respectively; (d–f) real component, imaginary component and amplitude images of a scene of sparsity = 0.03 and TBR = 900, respectively; (g–i) real component, imaginary component and amplitude images of a scene of sparsity = 0.15 and TBR = 25, respectively; (j–l) real component, imaginary component and amplitude images of a scene of sparsity = 0.15 and TBR = 900, respectively.

For simulating noisy echo data, we need also to specify radar parameters, as indicated in Equation (3). They are as follows: slant range of scene center 10 km, transmitted pulse duration 1 μs, range FM rate 150 MHz/μs, signal bandwidth 150 MHz, range sampling rate 164.829 MHz, effective radar velocity 7608 m/s, radar center frequency 9.650 GHz, radar wavelength 0.031 m, azimuth FM rate 372372 Hz/s, synthetic aperture length 57.383 m, target exposure time 0.007543 s, antenna length 4.8 m, Doppler bandwidth 2808.620 Hz, azimuth sampling rate 2920.018 Hz. For convenience, these parameter values are listed in Table 1. They were used to generate a convolution kernel matrix A_full with a dimensionality of 10,000 by 10,000 at full rank.

To generate a set of realized noise N and echo data Y, we set up a series of S(C)NR (equally spaced in the interval −5 ~ 20 dB, step = 0. 253 dB, indicating a total of 100 noise levels), and undersampling ratios (equally spaced in the interval 0.01 ~ 1, step = 0.01, a total of 100 undersampling ratios), to reflect the noise levels and compression ratios of Y, respectively. For each set of simulated X and specified sampling S(C)NR, we generated the corresponding set of N and Y. Noiseless linear measurements Y₀ were generated by pre-multiplying the signal X with a compressive sampling matrix A_sub, which consists of a number of randomly drawn rows from the measurement matrix A (A_full to be exact); the number of rows m for A_sub reflects the undersampling ratio being considered. Although, for any undersampling ratios less than 1, the possible combinations of rows for Asub are

C_{m}^{n}

(the number of combinations of n distinct objects taken m at a time), we only pick up consecutive rows from A_full, starting from the first row, resulting in sets of A_sub’s (to reflect the undersampling ratios specified), which will be more conservative in terms of informational efficiency. The simulated measurements Y₀ were corrupted with additive Gaussian noise vectors N (with a length commensurate with those of Y₀’s), whose powers are restricted to the given S(C)NR level, to simulate noisy and undersampled echo data Y.

Table 1. Hypothetic radar parameters.

**Table 1.** Hypothetic radar parameters.
Parameter Name		Symbol	Value	Units
Range parameters	Slant range of scene center	$R_{0}$	10	km
	Transmitted pulse duration	Tr	1	μs
	Range FM rate	Kr	150	MHz/us
	Signal bandwidth	Br	150	MHz
	Range sampling rate	Fr	164.829	MHz
Azimuth parameters	Effective radar velocity	Vr	7608	m/s
	Radar center frequency	f0	9.650	GHz
	Radar wavelength	λ	0.031	m
	Azimuth FM rate	Ka	372372	Hz/s
	Synthetic aperture length	Ls	57.383	m
	Target exposure time	Ta	0.007543	sec
	Antenna length	La	4.8	m
	Doppler bandwidth	Ba	2808.620	Hz
	Azimuth sampling rate (PRF)	Fa	2920.018	Hz

As mentioned previously, informational analysis was performed based on simulated radar scenes X and echo measurements Y. We used the simulated scene and echo data to perform information-theoretic graphing of R_X(D), using Equation (14), and trans-information

I {(X; Y | A)}_{u b}

using (20) and

I {(X_{1}; Y | A)}_{u b}

using (25). We applied inequalities in (26) and (27) to determine minimal undersampling ratios for signal reconstruction given certain values of sparsity, TBRs, noise, and distortion D. These are explained one by one in the following sub-sections

3.2. Visualization of Scene Rate-Distortion and Echo Data’s Trans-information

For visualizing rate-distortion trade-offs, we compute required minimum information rates for a range of distortion levels. We specified distortion levels D in the interval from 0.01 to 1300 (times

σ_{0}^{2}

, which was set to 1.0 though); the end value of 1300 is the maximum value beyond which the distortion is theoretically achievable for any specified combination of sparsity, TBRs, S(C)NRs and undersampling ratios. Three values of D (0.01, 4, and 10, all relative to

σ_{0}^{2}

) were selected to represent high, medium high, and moderate accuracy levels, respectively, although many more levels of distortion can be specified in principle.

A simulated compressible scene X corresponds to a specific sparsity and a particular TBR. Given a distortion D, the rate distortion (n*R(D)) of X can be calculated using Equation (14). The case of an extremely large value of TBR (e.g., TBR = 50²) refers to strictly sparse scenes, and their rate distortion can be approximated by (15).

Figure 2. (a) Visualization of rate-distortion relations in the three-dimensional space framed by sparsity, TBRs (actually shown as

σ_{1} / σ_{0}

to reduce the range of values), and distortion; (b) three slice images representing varying rate distortion characteristics as related to different sparsities and TBRs when setting D = 0.01, 4 and 10, respectively (from bottom to top); (c–e) two-dimensional graphs showing rate-distortion relations for distortion D = 0.01, 4 and 10, respectively (all D values are relative to

σ_{0}^{2}

).

Figure 2. (a) Visualization of rate-distortion relations in the three-dimensional space framed by sparsity, TBRs (actually shown as

σ_{1} / σ_{0}

to reduce the range of values), and distortion; (b) three slice images representing varying rate distortion characteristics as related to different sparsities and TBRs when setting D = 0.01, 4 and 10, respectively (from bottom to top); (c–e) two-dimensional graphs showing rate-distortion relations for distortion D = 0.01, 4 and 10, respectively (all D values are relative to

σ_{0}^{2}

).

Figure 2a–e shows rate distortion characteristics of various simulated scenes X, which have different sparsities and TBRs, in relation to distortion D. In particular, Figure 2a indicates that rate distortion in relation to a distortion D is meaningful only within a certain region of scene sparsities and TBRs by virtue of its achievable regions, although the inner part of the 3-dimensional graphics is not visible. To see some of its inner information “landscape” implied in Figure 2a, we display slices of rate-distortion relations corresponding to distortion values of 0.01, 4, and 10, respectively, in the order from bottom to top, as shown in Figure 2b. For more clarity in graphing, Figure 2c–e highlights the slices where distortion D is fixed at 0.01, 4, and 10, respectively. Obviously, information rates required for lossy compression (or approximate reconstruction) grow with decreased distortion thresholds, and for a fixed distortion D the required information rates increase with increasing sparsity and TBRs, though the relationships are not linear.

The quantities of mutual information

I {(X; Y | A)}_{u b}

and

I {(X_{1}; Y | A)}_{u b}

measure the amounts of trans-information conveyed by measurements Y about the scene X and the targets of interest X₁, respectively. The former can be evaluated using Equation (20), while latter can be assessed using Equation (25), given differing sampling specifications (i.e., number of measurements and noise levels) and scene characteristics (i.e., sparsities and TBRs). Figure 3 and Figure 4 show the upper bounds of mutual information of echo data about X and X₁, respectively, in relation to undersampling ratios, sparsity and TBRs (for

I {(X_{1}; Y | A)}_{u b}

), and S(C)NR.

Figure 3. (a) slice images showing mutual information conveyed by echo data about the underlying scene with three undersampling ratios; (b–d) slice images for undersampling ratios of 0.3, 0.5, and 1.0, respectively.

Figure 4. Slice images representing varying mutual information conveyed by echo data about the targets of interest under different undersampling ratios, sparsity, TBR (actually

σ_{1} / σ_{0}

), and S(C)NR: (a–c) undersampling ratios = 30% , 50% and 100%, respectively, and all with TBRs (in vertical axes) of 25, 400 and 2500 (i.e., 5, 20 and 50 in terms of ratios of square root TBRs as shown in the Figures) from bottom up; (d–f) 30% , 50% and 100% undersampling ratios, respectively, and all with S(C)NR (in vertical axes) of −5, 5 and 20 dB.

Figure 4. Slice images representing varying mutual information conveyed by echo data about the targets of interest under different undersampling ratios, sparsity, TBR (actually

σ_{1} / σ_{0}

), and S(C)NR: (a–c) undersampling ratios = 30% , 50% and 100%, respectively, and all with TBRs (in vertical axes) of 25, 400 and 2500 (i.e., 5, 20 and 50 in terms of ratios of square root TBRs as shown in the Figures) from bottom up; (d–f) 30% , 50% and 100% undersampling ratios, respectively, and all with S(C)NR (in vertical axes) of −5, 5 and 20 dB.

As shown in Figure 3a, three images are created from slicing the informational cube, which can not be completely depicted here, in the three-dimensional space framed by sparsity, S(C)NR, and undersampling ratios, at three undersampling ratios: 0.3, 0.5, and 1.0 (in bottom-up sequence). For enhanced visualization, the three slices of images depicted in Figure 3a are shown in Figure 3b–d, respectively, in the two-dimensional space of sparsity and S(C)NR. Unsurprisingly, trans-information increases with increasing undersampling ratios. For a fixed undersampling ratio, trans-information increases with increasing S(C)NR, though the increase in trans-information is not related to sparsity at a fixed S(C)NR, as is apparent in Equation (20) for computing

I {(X; Y | A)}_{u b}

.

Figure 4a–c shows slice images representing varying mutual information conveyed by echo data about the targets of interest under different sparsities, TBRs, and S(C)NR with undersampling ratios of 30% , 50%, and 100%, respectively. In Figure 4a–c, TBRs are actually shown as

σ_{1} / σ_{0}

and measure 5, 20 and 50 from bottom up in the vertical axes. To visualize the other aspects of trans-information

I {(X_{1}; Y | A)}_{u b}

landscapes, Figure 4d–f shows slice images of

I {(X_{1}; Y | A)}_{u b}

under different sparsities, S(C)NR, and TBRs (

σ_{1} / σ_{0}

), again with undersampling ratios of 30% , 50%, and 100%, respectively. In Figure 4d–f, S(C)NR are shown in vertical axes and measure −5, 5, and 20 dB from the bottom up.

In comparison, trans-information quantities shown in Figure 4a–c tend to be more conservative than that in Figure 3a–c, assuming the same undersampling ratios, sparsities, and S(C)NRs, especially with lower TBRs. The graphics shown in Figure 3a–c and Figure 4d–f are not directly comparable. Thus, we will not elaborate on this here.

3.3. Undersampling Ratios in Graphics

The minimal undersampling ratios can be determined through evaluating the information-theoretic inequality in (26). This can be done by numerically solving an equation between the mutual information

I {(X; Y | A)}_{u b}

and rate distortion implied in (26). These ratios are conditional to the simulated measurement matrix A, and evaluated for a given scene with a certain sparsity and noise level as indicated by S(C)NR.

Figure 5a–c shows the images indicating minimal (i.e., necessary) undersampling ratios in relation to scene sparsity and per-sample S(C)NR, given MSE distortion levels D of 0.01, 4, and 10, respectively. As previously, distortion levels D are relative to

σ_{0}^{2}

. Clearly, with more relaxed or greater distortion thresholds, the undersampling ratios required will be reduced, given the same values of scene sparsity and per-sample S(C)NR.

Figure 5. Minimal under-sampling ratios in relation to signal sparsity and per-sample S(C)NR, given distortion level of 0.01 (a), 4 (b), and 10 (c), respectively.

Similarly, the minimal undersampling ratios for target detection in a given scene and with noise level can be determined through evaluating the information-theoretic inequality in (27). Again, this can be done by numerically solving an equation between the mutual information

I {(X_{1}; Y | A)}_{u b}

and rate distortion implied in (27). These ratios are also conditional to the simulated measurement matrix A.

As rate distortion is, by definition, distortion-dependent, we show selectively (according to D thresholds) some of the phase diagrams, which map the regions where sampling conditions are satisfied, as in Figure 5. The phase diagrams in Figure 6 show required minimal undersampling ratios in relation to scene sparsity, TBRs, and per-sample S(C)NR.

Figure 6. Minimal undersampling ratios in relation to signal sparsity and per-sample S(C)NR, given distortion level of 0.01 (a–c), 4 (d–f), and 10 (g–i), and TBRs of 25 (a,d,g), 400 (b,e,h), and 2500 (c,f,i); all MSE distortion levels are relative to pre-set

σ_{0}^{2}

.

Figure 6. Minimal undersampling ratios in relation to signal sparsity and per-sample S(C)NR, given distortion level of 0.01 (a–c), 4 (d–f), and 10 (g–i), and TBRs of 25 (a,d,g), 400 (b,e,h), and 2500 (c,f,i); all MSE distortion levels are relative to pre-set

σ_{0}^{2}

.

To assist interpretation of the results, these diagrams are organized in row and column groups. The row grouping is based on TBRs (actually

σ_{1} / σ_{0}

): 5 (Figure 6a,d,g), 20 (Figure 6b,e,h), and 50 (Figure 6c,f,i). The column grouping is based on MSE distortion levels: 4 (Figure 6a–c), 100 (Figure 6d–f), and 400 (Figure 6g–i), where all MSE distortion levels are relative to

σ_{0}^{2}

. Similar to the phase diagrams in Figure 5, greater distortion thresholds lead to smaller necessary undersampling ratios, given the same values of scene sparsities, TBRs and per-sample S(C)NRs. With increasing TBRs, necessary under-sampling ratios will be decreased, while all other settings are kept the same. An important link can be made of the similarity between phase diagrams shown in Figure 5a–c and those in Figure 6c,f,i, as the former represent the limiting cases of those in Figure 6, when TBRs become sufficiently large (e.g.,

σ_{1} / σ_{0}

= 50).

Note that the blue blocks in Figure 6d,g,h indicate that we would not need to acquire any samples for approximate reconstruction of the underlying scenes due to relaxed distortion thresholds, small sparsity, and low TBRs. This means that there is little information in such kind of scenes in the first place so that no sampling is required for reconstruction of X and X₁ with distortion thresholds indicated by D. Here, approximate reconstruction means that the underlying scene is reconstructed, with targets of interest properly detected and estimated, up to given distortion thresholds.

The derived undersampling ratios are necessary conditions, meaning that scene reconstruction would not be possible without incurring distortion larger than the prescribed thresholds if the numbers of measurements are less than what is indicated by the undersampling ratios. Even if the sampling necessary conditions are satisfied, there is no guarantee that such properly undersampled data will enable scene reconstruction meeting the specified distortion criterion. The reasons are three-fold: (1) necessary conditions are not sufficient ones, (2) information-theoretically derived sampling necessities are theoretical and algorithm-independent while algorithms may incur extra expenses of sampling, and (3) sampling necessary conditions derived previously are meaningful on probabilistic terms, suggesting variabilities in trade-offs between sampling and distortion.

As mentioned previously, the results obtained with simulated data are conditional to the particular measurement matrix A set forth, as is also the case with Zhang and Yang [15]. In other words, our results are not invariant to measurement matrices and hence the radar transmitted waveforms and other relevant parameters employed in a CS system. This raises, for instance, the issue of how waveforms should be designed to maximize mutual information between Y and X, as discussed by Bell [22]. Clearly, the existing literature on related topics and the results obtained here in this paper should be integrated to push forward research on CS-radar informatics and sampling theorems. For instance, the informational quantities described in this paper in the context of CS-radar should be made to augment the existing predominantly statistical metrics for performance evaluation of radar systems. On the other hand, we should consider the aforementioned computational approaches to phase transitions to handle peculiar signal models, sampling matrices, reconstruction algorithms and performance evaluation criteria, which are otherwise difficult to analyze theoretically.

3.4. Discussion

In this sub-section, we discuss the results by first comparing undersampling ratios derived from information-theoretic analysis and those based on the restricted isometry property (RIP) in classical CS theory. We will also reflect on future developments in compressive radar imaging, focusing on compressible radar scene modeling and sparsity-enhancement in radar imaging.

CS sensing matrices and reconstruction algorithms function like encoders and decoders in the context of information theory, with the former seeking to preserve information content in the sparse signal while the latter aiming to be efficient and robust for recovering the original signal in presence of measurement noise. The so-called RIP is one of desirable properties that we would like CS sensing matrices to have so that the sensing matrices possess adequate level of efficiency in transferring information in the original signal. With a sensing matrix satisfying the RIP, various algorithms will be able to successfully recover sparse signals from noisy measurement, according to established RIP-based CS results [53]. The necessary numbers of measurements needed to achieve the RIP are studied widely in CS literature. Here, we mention one of such results originally reviewed by Davenport et al. [53]. Let A be an

m \times n

matrix that satisfies the RIP of order 2 k with RIP constant

δ \in

(0, 0.5], where m and n represent the number of measurements Y and the length of signal vector X, respectively. Then, RIP requirements can be used to derive phase transition for a problem dimension {k, m, n} such that

m > c k \log (n / k)

with

c \approx 0.28

[53].

There exist a few major distinctions between information-theoretically determined undersampling ratios in this paper and RIP results. We highlight some of them in terms of the signal models, sensing matrices and reconstruction criteria concerned. First, sparse signals are usually described by just sparsity in RIP, while both sparsity and TBRs are accommodated in GMMs-based signal models in the paper. The two-component (i.e., target-clutter) GMMs employed therein were shown to have well supported the tasks of target detection against clutter interference in approximately sparse radar scenes. This represents a major strength and contribution of this paper.

Secondly, randomized sensing matrices such as those drawn from i.i.d. Gaussian distributions are preferred in RIP-based analysis for under-sampling ratios. Although deterministic sensing matrices can also be considered in CS literature through the concept of coherence [53] and its relationships with RIP, the derived under-sampling ratios are often unacceptably high. On the other hand, informational analysis performed in this paper was based on deterministic sensing matrices, which are common in compressive radar imaging. The information-theoretic limits on undersampling were shown to be related to l₂-norms of the sensing matrix rows, which are easier to analyze and optimize, as seen previously.

Thirdly, RIP-derived phase transition is mostly about exact or precise recovery [27], although relations between achievable minimum errors in signal recovery and RIP constants are confirmed in the literature [54]. In informational analysis reported in this paper, rate distortion was shown to be a valuable theoretic construct to analyze the trade-offs between information rates and distortion tolerance, as shown in Figure 2. Thus, the phase diagrams shown in Figure 5 and Figure 6 of the paper were distortion-specific. Furthermore, mutual information computed for a set of compressive measurements can be used as a quality measure for the images formed from under-sampled data, as discussed towards the end of Section 2. Clearly, the information-theoretic quantification about sampling-distortion trade-offs is particularly useful for compressive radar imaging where sampling efficiency, target detectability and imaging quality are important considerations.

Although GMMs were employed to model compressible radar scenes in this paper, we may well use other kinds of models for modeling complex-valued radar reflectivities if empirical evidence suggests so. For example, it was found that Laplace distributions are more suitable to model complex-valued images under wavelet transforms than Gaussian distributions, as shown in Xu et al. [13]. Further research is needed for balancing between model complexity (hence related computational cost) and precision in modeling. The mixture distribution models are versatile and worth exploring in applications. For instance, it is interesting to extend use of GMMs by modeling both sparse targets of interest and the clutter interference through their respective GMMs.

In this paper, complex-valued radar images were represented by real and imaginary parts. Since phase components are not sparse due to them being uniformly distributed in the interval of [−π, π], we often model and represent sparsity in radar images based on their amplitude components alone. However, the complex nature of original images and the mechanism of coherent radar imaging require treatment of both amplitude and phase [55], even if we need amplitude images only as the end results. Nevertheless, there remains the issue of optimum sparse representation and sparsity/feature-enhanced imaging algorithms for compressible radar scenes, as elaborated below.

We may enforce sparsity in amplitude alone in CS-based radar imaging by modifying objective functions for signal reconstruction. The so-called sparsity-enhanced methods for compressive radar imaging may be usefully explored, as described by [55]. The cost functional to minimize includes sparsity-enforcing weights on the vectors of amplitudes and their gradients, in addition to an l₂-norm on the differences between complex-valued measurements Y and reconstruction-induced projections AX. To avoid potential conflicting of these two kinds of constraints, we need to employ joint optimization of amplitudes and phases via alternating minimization of objective functions. One acts explicitly on amplitudes with initial estimation of phases, while the other on phases based on updated estimates of amplitudes. The joint optimization is done through an iterative process until convergence. This approach provides the capability to preserve and enhance multiple distinct features on different patches of the underlying scene. Nevertheless, the algorithms employed above are tailor-made, and a joint optimization strategy tends to be computationally more expensive than the algorithm adopted in this paper, as it is originally developed for real-valued signal processing and imaging.

4. Conclusions

This paper has presented an information-theoretic strategy, which is seen to be complementary to the classic CS theory, to describe, analyze and interpret information dynamics in compressive radar imaging. The undertaken informational analyses focused on compressibility of radar scenes and trans-information of radar measurements about the underlying scene and the targets of interest, respectively. The amount of information conveyed by compressive sampling about targets of interest against clutter is more conservative as a measure of trans-information than that about the scene as a whole. The formulas for estimating the former (i.e., trans-information regarding targets of interest in clutter) constitute a major thrust of innovation of this paper. Quantification of compressible radar scene’s rate-distortion and trans-information of compressive radar measurements facilitates determination of necessary undersampling ratios. The necessary undersampling ratios derived for scene reconstruction vs. target detection/estimation, within certain MSE distortion thresholds, are seen to differ greatly, with the latter being more contingent, given that it has to accommodate the task of detecting targets of interest against clutter interference, as target detection/estimation is dependent, additionally, on target-to-background variance ratios (TBRs). A simulated experiment illustrated theoretical derivations and their use through computer-generated graphics visualizing scene-sampling-distortion inter-relationships with an information-theoretic perspective. This work will also be constructive for CS-radar sampling design via undersampling ratios determination and performance evaluation by using trans-information as upper bounds on information content of reconstructed images. For instance, a specific radar imaging application with a particular scene and image distortion corresponds to a set of “points” over the information-theoretically generated graphics, and can thus be informed of its corresponding rate-distortion, trans-information and necessary under-sampling ratio. Moreover, the general framework proposed in this paper can be applied to other computational imaging applications that capitalize on CS principles and techniques.

The results derived regarding informational analysis and necessary sampling rates for compressible radar scene reconstruction from undersampled echo data can be extended to two scenarios. One concerns discrete support recovery only, which requires less amount of sampling obviously, while the other about conditions for exact reconstruction as opposed to approximate reconstruction, as presented here in this paper, although this is not elaborated here in this paper. Future research should, hopefully, also address issues related to real applications, as it is important to showcase CS theorems and their practicality in radar imaging and other fields.

Acknowledgments

This work was supported by the National Natural Science Foundation of China under Grant [numbers 41471375/41171346]; the National Basic Research Program of China under Grant [number 2010CB731905]. Comments from the anonymous reviewers were very helpful and received with thanks. Mr Xiaoqin Hu of Wuhan University contributed to revision of the paper. The authors would also like to acknowledge advice and support from Prof Zongjian Lin of the Chinese Academy of Surveying and Mapping, and Profs. Bingchen Zhang, Wen Hong and Yirong Wu of the Chinese Academy of Sciences.

Author Contributions

Jingxiong Zhang, the principal author, contributed mostly to the literature survey in the fields of information theory, radar and CS. Jingxiong Zhang was also responsible for the research design, results analysis and interpretation and the paper writing. The primary contribution of Ke Yang, a corresponding co-author, includes literature review, research-related experimentation and paper formatting, while those of Fengzhu Liu and Ying Zhang include literature surveys and some computing experiments. All authors have read and approved the final manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

Candès, E.J.; Tao, T. Decoding by Linear Programming. IEEE Trans. Inf. Theory 2005, 51, 4203–4215. [Google Scholar] [CrossRef]
Donoho, D.L. Compressed Sensing. IEEE Trans. Inf. Theory 2006, 52, 1289–1306. [Google Scholar] [CrossRef]
Strohmer, T. Measure What Should Be Measured: Progress and Challenges in Compressive Sensing. IEEE Signal. Process. Lett. 2012, 19, 887–893. [Google Scholar] [CrossRef]
Herman, M.A.; Strohmer, T. High-Resolution Radar via Compressed Sensing. IEEE Trans. Signal. Process. 2009, 57, 2275–2284. [Google Scholar] [CrossRef]
Alonso, M.T.; López-Dekker, P.; Mallorquí, J.J. A Novel Strategy for Radar Imaging Based on Compressive Sensing. IEEE Trans. Geosci. Remote Sens. 2010, 48, 4285–4295. [Google Scholar] [CrossRef] [Green Version]
Ender, J.H.G. On Compressive Sensing Applied to Radar. Signal. Process. 2010, 90, 1402–1414. [Google Scholar] [CrossRef]
Fannjiang, A.; Strohmer, T.; Yan, P. Compressed Remote Sensing of Sparse Objects. SIAM J. Imag. Sci. 2010, 3, 595–618. [Google Scholar] [CrossRef]
Potter, L.C.; Ertin, E.; Parker, J.T.; Çetin, M. Sparsity and Compressed Sensing in Radar Imaging. IEEE Proc. 2010, 98, 1006–1020. [Google Scholar] [CrossRef]
Zhang, B.; Hong, W.; Wu, Y. Sparse Microwave Imaging: Principles and Applications. Sci. China Inf. Sci. 2012, 55, 1722–1754. [Google Scholar] [CrossRef]
Yang, J.G.; Thompson, J.; Huang, X.T.; Jin, T.; Zhou, Z.M. Segmented Reconstruction for Compressed Sensing SAR Imaging. IEEE Trans. Geosci. Remote Sens. 2013, 51, 4214–4225. [Google Scholar] [CrossRef]
Su, R.Q.; Lai, Y.C.; Wang, X. Identifying Chaotic FitzHugh–Nagumo Neurons Using Compressive Sensing. Entropy 2014, 16, 3889–3902. [Google Scholar] [CrossRef]
Goodman, N.A.; Potter, L.C. Pitfalls and Possibilities of Radar Compressive Sensing. Applied Optics 2015, 54, C1–C13. [Google Scholar] [CrossRef] [PubMed]
Xu, G.; Xing, M.D.; Xia, X.G.; Zhang, L.; Liu, Y.Y.; Bao, Z. Sparse Regularization of Interferometric Phase and Amplitude for InSAR Image Formation Based on Bayesian Representation. IEEE Trans. Geosci. Remote Sens. 2015, 53, 2123–2136. [Google Scholar] [CrossRef]
Elad, M. Sparse and Redundant Representation Modeling-What Next? IEEE Signal. Process. Lett. 2012, 19, 922–928. [Google Scholar] [CrossRef]
Zhang, J.; Yang, K. Informational Analysis for Compressive Sampling in Radar Imaging. Sensors 2015, 15, 7136–7155. [Google Scholar] [CrossRef] [PubMed]
Eichel, P.; Ives, R.W. Compression of Complex-Valued SAR Images. IEEE Trans. Image Process. 1999, 8, 1483–1487. [Google Scholar] [CrossRef] [PubMed]
Cover, T.M.; Thomas, J.A. Elements of Information Theory, 2nd, ed.; Wiley: Hoboken, NJ, USA, 2006. [Google Scholar]
Orlitsky, A.; Santhanam, N.P.; Viswanathan, K.; Zhang, J. Limit Results on Pattern Entropy. IEEE Trans. Inf. Theory 2006, 52, 2954–2964. [Google Scholar] [CrossRef]
Hou, X.S.; Han, M.; Gong, C. SAR Complex Image Data Compression Based on Quadtree and Zerotree Coding in Discrete Wavelet Transform Domain: A Comparative Study. Neurocomputing. 2015, 148, 561–568. [Google Scholar] [CrossRef]
Woodward, P.M. Probability and Information Theory, with Applications to Radar: International Series of Monographs on Electronics and Instrumentation; Fry, D.W., Higinbotham, W., Eds.; Pergamon Press: Oxford, UK, 1953. [Google Scholar]
Frost, V.; Shanmugan, K. The Information Content of Synthetic Aperture Radar Images of Terrain. IEEE Trans. Aerosp. Electron. Syst. 1983, 5, 768–774. [Google Scholar] [CrossRef]
Bell, M.R. Information Theory and Radar Waveform Design. IEEE Trans. Inf. Theory 1993, 39, 1578–1597. [Google Scholar] [CrossRef]
Zeoli, G.W. A Lower Bound on the Date Rate for Synthetic Aperture Radar. IEEE Trans. Inf. Theory 1976, 22, 708–715. [Google Scholar] [CrossRef]
Chance, Z.; Raj, R.G.; Love, D.J. Information-Theoretic Structure of Multistatic Radar Imaging. In Proceedings of the 2011 IEEE Conference on Radar Conference (RADAR), Kansas City, MO, USA, 23–27 May 2011; pp. 853–858.
Aksoylar, C.; Saligrama, V. Information-Theoretic Characterization of Sparse Recovery. In Proceedings of the 17th International Conference on Artificial Intelligence and Statistics, Reykavik, Iceland, 22–25 April 2014; Volume 2225, p. 3846.
Wu, Y.H.; Verdú, S. Optimal Phase Transitions in Compressed Sensing. IEEE Trans. Inf. Theory 2012, 58, 6241–6263. [Google Scholar] [CrossRef]
Donoho, D.; Tanner, J. Precise Undersampling Theorems. Proc. IEEE 2010, 98, 913–924. [Google Scholar] [CrossRef]
Sarvotham, S.; Baron, D.; Baraniuk, R. Measurements vs. Bits: Compressed Sensing Meets Informaiton Theory. In Proceedings of the 44th Allerton Conference on Communication, Control, Computing, Monticello, IL, USA, 27–29 September 2006.
Reeves, G.; Gastpar, M. The Sampling Rate-Distortion Tradeoff for Sparsity Pattern Recovery in Compressed Sensing. IEEE Trans. Inf. Theory 2012, 58, 3065–3092. [Google Scholar] [CrossRef]
Aeron, S.; Saligrama, V.; Zhao, M.Q. Information Theoretic Bounds for Compressed Sensing. IEEE Trans. Inf. Theory 2010, 56, 5111–5130. [Google Scholar] [CrossRef]
Weidmann, C.; Vetterli, M. Rate Distortion Behavior of Sparse Sources. IEEE Trans. Inf. Theory 2012, 58, 4969–4992. [Google Scholar] [CrossRef]
Oliver, C.; Quegan, S. Understanding Synthetic Aperture Radar Images; Scitech Publishing: Stevenage, Hertfordshire, UK, 2004. [Google Scholar]
Romero, R.A.; Bae, J.; Goodman, N.A. Theory and Application of SNR and Mutual Information Matched Illumination Waveforms. IEEE Trans. Aerosp. Electron. Syst. 2011, 47, 912–927. [Google Scholar] [CrossRef]
Tuuk, P.B.; Marple, S.L., Jr. Compressed Sensing Radar Amid Noise and Clutter Using Interference Covariance Information. IEEE Trans. Aerosp. Electron. Syst. 2014, 50, 887–897. [Google Scholar] [CrossRef]
Fletcher, A.K.; Rangan, S.; Goyal, V.K. Necessary Sufficient Conditions for Sparisity Pattern Recovery. IEEE Trans. Inf. Theory 2009, 55, 5758–5772. [Google Scholar] [CrossRef]
Wainwright, M.J. Information-Theoretic Limits on Sparsity Recovery in the High-Dimensional and Noisy Setting. IEEE Trans. Inf. Theory 2009, 55, 5728–5741. [Google Scholar] [CrossRef]
Akçakaya, M.; Tarokh, V. Shannon-Theoretic Limits on Noisy Compressive Sampling. IEEE Trans. Inf. Theory 2010, 56, 492–504. [Google Scholar] [CrossRef]
Donoho, D.L.; Johnstone, I.; Montanari, A. Accurate Prediction of Phase Transitions in Compressed Sensing via a Connection to Minimax Denoising. IEEE Trans. Inf. Theory 2013, 59, 3396–3433. [Google Scholar] [CrossRef]
Aeron, S.; Zhao, M.; Saligrama, V. On Sensing Capacity of Sensor Networks for a Class of Linear Observation Models. In Proceedings of IEEE/SP 14th Workshop on Statistical Signal Processing (SSP’07), Madison, WI, USA, 26–29 August 2007; pp. 388–392.
Cumming, I.G.; Wong, F.H. Digital Processing of Synthetic Aperture Radar Data: Algorithms and Implementation; Artech House: Norwood, MA, USA, 2005. [Google Scholar]
Rihaczek, A.; Hershkowitz, S. Man-made Target Backscattering Behavior: Applicability of Conventional Radar Resolution Theory. IEEE Trans. Aerosp. Electron. Syst. 1996, 32, 809–824. [Google Scholar] [CrossRef]
Neeser, F.D.; Massey, J.L. Proper Complex Random Processes with Applications to Information Theory. IEEE Trans. Inf. Theory 1993, 39, 1293–1302. [Google Scholar] [CrossRef]
Tropp, J.A.; Gilbert, A.C. Signal Recovery from Random Measurements via Orthogonal Matching Pursuit. IEEE Trans. Inf. Theory 2007, 53, 4655–4666. [Google Scholar] [CrossRef]
Needell, D.; Tropp, J.A. CoSaMP: Iterative Signal Recovery from Incomplete and Inaccurate Samples. Appl. Comput. Harmonic Anal. 2009, 26, 301–321. [Google Scholar] [CrossRef]
Çetin, M.; Karl, W.C.; Willsky, A.S. Feature-Preserving Regularization Method for Complex-Valued Inverse Problems with Application to Coherent Imaging. Opt. Eng. 2006, 45, 017003. [Google Scholar] [CrossRef]
Rilling, G.; Davies, M.; Mulgrew, B. Compressed Sensing Based Compression of SAR Raw Data. In Proceedings of the SPARS’09—Signal Processing with Adaptive Sparse Structured Representations, Saint Malo, France, 6–9 April 2009.
Yu, G.S.; Sapiro, G.; Mallat, S. Solving Inverse Problems With Piecewise Linear Estimators: From Gaussian Mixture Models to Structured Sparsity. IEEE Trans. Image Process. 2012, 21, 2481–2499. [Google Scholar] [PubMed]
Renna, F.; Calderbank, R.; Carin, L.; Rodrigues, M.R.D. Reconstruction of Signals Drawn From a Gaussian Mixture via Noisy Compressive Measurements. IEEE Trans. Signal. Process. 2014, 62, 2265–2277. [Google Scholar] [CrossRef]
Baron, D.; Sarvotham, S.; Baraniuk, R.G. Bayesian Compressive Sensing via Belief Propagation. IEEE Trans. Signal. Process. 2010, 58, 269–280. [Google Scholar] [CrossRef]
Barbier, J.; Krzakala, F.; Mezard, M.; Zdeborova, L. Compressed Sensing of Approximately-Sparse Signals: Phase Transitions and Optimal Reconstruction. In Proceedings 50th Annual Allerton Conference on Communication, Control, and Computing, Monticello, IL, USA, 1–5 October 2012; pp. 800–807.
Jia, X.; Richards, J.A. Efficient Transmission and Classification of Hyperspectral Image Data. IEEE Trans. Geosci. Remote Sens. 2003, 41, 1129–1131. [Google Scholar]
Reznic, Z.; Zamir, R.; Feder, M. Joint Source-Channel Coding of a Gaussian Mixture Source over the Gaussian Broadcast Channel. IEEE Trans. Inf. Theory 2002, 48, 776–781. [Google Scholar] [CrossRef]
Davenport, M.A.; Duarte, M.F.; Eldar, Y.C.; Kutyniok, G. Introduction to Compressed Sensing. In Compressed Sensing: Theory and Applications; Cambridge University Press: Cambridge, UK, 2012. [Google Scholar]
Candès, E.J. The Restricted Isometry Property and Its Implications for Compressed Sensing. Comptes Rendus Math. 2008, 346, 589–592. [Google Scholar] [CrossRef]
Çetin, M.; Stojanovic, I.; Önhon, N.O.; Varshney, K.R.; Samadi, S.; Karl, W.C.; Willsky, A.S. Sparsity-Driven Synthetic Aperture Radar Imaging: Reconstruction, Autofocusing, Moving Targets, and Compressed Sensing. IEEE Signal. Process. Mag. 2014, 31, 27–40. [Google Scholar] [CrossRef]

© 2015 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhang, J.; Yang, K.; Liu, F.; Zhang, Y. Information-Theoretic Characterization and Undersampling Ratio Determination for Compressive Radar Imaging in a Simulated Environment. Entropy 2015, 17, 5171-5198. https://doi.org/10.3390/e17085171

AMA Style

Zhang J, Yang K, Liu F, Zhang Y. Information-Theoretic Characterization and Undersampling Ratio Determination for Compressive Radar Imaging in a Simulated Environment. Entropy. 2015; 17(8):5171-5198. https://doi.org/10.3390/e17085171

Chicago/Turabian Style

Zhang, Jingxiong, Ke Yang, Fengzhu Liu, and Ying Zhang. 2015. "Information-Theoretic Characterization and Undersampling Ratio Determination for Compressive Radar Imaging in a Simulated Environment" Entropy 17, no. 8: 5171-5198. https://doi.org/10.3390/e17085171

APA Style

Zhang, J., Yang, K., Liu, F., & Zhang, Y. (2015). Information-Theoretic Characterization and Undersampling Ratio Determination for Compressive Radar Imaging in a Simulated Environment. Entropy, 17(8), 5171-5198. https://doi.org/10.3390/e17085171

Article Menu

Information-Theoretic Characterization and Undersampling Ratio Determination for Compressive Radar Imaging in a Simulated Environment

Abstract

1. Introduction

2. Models and Methods

2.1. Radar Measurements and Imaging: towards a Compressive Strategy

2.2. Sparse Radar Scenes and Rate-Distortion Characterization

2.3. Trans-information of Radar Measurements Concerning the Underlying Scene and the Targets of Interest

2.4. Undersampling Ratio Determination

3. A Simulated Experiment

3.1. Simulation of Sparse Scenes and Noisy Echo Data

3.2. Visualization of Scene Rate-Distortion and Echo Data’s Trans-information

3.3. Undersampling Ratios in Graphics

3.4. Discussion

4. Conclusions

Acknowledgments

Author Contributions

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI