1. Introduction
Synthetic Aperture Radar Interferometry (InSAR) is one of the most powerful tools to gather information on Earth’s topography and its deformation in time. It exploits two or more SAR images of the same region, acquired with some spatial and/or temporal diversity. Indeed, the phase difference between two SAR images, or interferometric phase, wrapped in the interval, bears information on the scene topography, which is eventually recovered by means of sophisticated phase unwrapping methods.
Unfortunately, SAR images are corrupted by intense noise, which makes it difficult to extract reliable information from them. This applies in particular to interferometric data which are affected by spatial and temporal decorrelation effects, associated with scattering changes in the signal backscattered at the different antennas and at the different acquisition times. It is therefore customary to adopt some forms of data restoration before phase unwrapping.
The simplest, and still widespread, solution consists in averaging the data in a rectangular window centered on the target (boxcar filtering). Despite its simplicity, boxcar filtering works very well in stationary areas of the image, as the sample average is the Maximum Likelihood (ML) estimator for the interferometric phase and coherence [
1,
2]. However, non-stationary areas abound in SAR interferograms, due to the scene topography or line-of-sight displacements in the imaged region, and they can be arguably considered the most valuable source of information for InSAR systems [
3,
4]. In such areas, boxcar filtering blends together heterogeneous data, resulting in a significant loss of resolution, and large phase and coherence estimation errors near signal discontinuities. This impairs strongly the subsequent phase unwrapping procedure that relies on both phase and coherence values [
5].
A number of methods have been proposed in the literature to perform accurate phase filtering in the presence of non-stationary signals. The filters proposed by Lee et al. [
6] and by Goldstein et al. [
7] are among the most popular. They both adapt to the local fringe morphology, by working in the spatial and frequency domains, respectively. Subsequently, a number of improvements have been proposed both to the Lee filter [
8,
9,
10,
11] and to the Goldstein filter [
12,
13,
14], aiming at a better adaptivity to the local signal geometry.
Another popular approach is to use wavelet transforms, to exploit their ability to analyze the signal in space and frequency. An early wavelet-based filter had been proposed in [
15] where the interferometric phase noise was modeled in the complex domain. Other filters rely on the undecimated wavelet transform [
16] and wavelet packets [
17]. A common trait of all such filters is a good preservation of the spatial resolution. Other promising approaches, though not as widespread as the previous ones, use polynomial approximations [
18], sparse coding and dictionary learning [
19], and Bayesian estimation with a Markov Random Field (MRF) prior [
20]. MRFs have also been used in [
21] for joint filtering of SAR phase and amplitude.
A major trend of the last decade has been the use of nonlocal filtering for SAR data restoration. The potential of this approach was immediately clear since the seminal work of Buades et al. [
22], where the Nonlocal Means (NLM) algorithm was proposed for the restoration of images corrupted by Additive White Gaussian Noise (AWGN). As the name suggests, the main novelty of nonlocal filtering consists of looking for the best predictors of the current target beyond its immediate proximity. Armed with a suitable patch-based similarity measure, the best predictors can be singled out in a large search window, or even the whole image. The self-similarity of natural images provides a theoretical basis for this approach to succeed in most situations of interest.
A huge number of contributes followed the NLM paper [
22], not only for AWGN denoising but also for other tasks and applicative fields. In particular, nonlocal methods were soon adapted to SAR imagery, showing excellent results with all types of data, from amplitude images [
23,
24,
25] to interferometric phase fields [
26,
27,
28,
29,
30], polarimetric images [
29,
31,
32], and multitemporal stacks [
33,
34,
35,
36,
37]. Interested readers are referred to [
38] for a recent review of the methods and ideas.
The first nonlocal filter for InSAR data [
26] was an iterative version of Nonlocal Means. Predictors were selected based on a probabilistic criterion relying on both image intensities and interferometric phase values. Improvements were then proposed in [
27], using pyramidal representations, and [
28], relying on higher-order singular value decomposition. In [
29] a unified framework was proposed, using an adaptive procedure for predictor selection to ensure a good preservation of image structures.
In [
30,
39] we have proposed InSAR-BM3D, a nonlocal method for InSAR phase filtering inspired to the nonlocal Block-Matching 3D (BM3D) algorithm [
40], which blends the nonlocal approach with transform-domain shrinkage and Wiener filtering. Indeed, BM3D is still a state-of-the-art filter for AWGN denoising, and its multiplicative-noise version, SAR-BM3D [
24], is one of the best filters for SAR despeckling. To deal with complex data, InSAR-BM3D filters separately the real and imaginary parts of the signal, obtained after a local decorrelating transform, by means of wavelet thresholding and empirical Wiener filtering. Experiments on both simulated and real-world phase images proved InSAR-BM3D to be very effective in terms of both objective distortion measures and perceived image quality.
Nonetheless, in the very same experiments, a clear performance decay was observed, for both InSAR-BM3D and other nonlocal filters, in regions characterized by a uniform phase gradient. Considering that such regions, corresponding to terrains with a locally uniform slope, abound in SAR interferograms, this is a relevant issue for nonlocal InSAR filtering methods. Based on preliminary analyses, we attribute this problem to the well-known “rare patch” effect [
41]. For uncommon target patches, there are very few patches similar enough to work as good predictors, which leads to inaccurate (high variance) estimates. A possible solution is to enlarge the search area, possibly to the whole image, but complexity would become quickly unbearable.
Here, to address this problem, we propose the use of an offset-compensated patch similarity measure [
39]. The core idea is to augment the set of predictors for a given target patch by including also all patches that differ from it only for a constant phase offset. Since, in uniform gradient areas, such a phase offset can be accurately estimated, this approach guarantees a wealth of new reliable estimators, thus obviating to the rare-patch problem. The proposed approach can be also regarded as a means to remove the effects of scene topography from the estimation. Under this point of view, some related work has recently appeared in the literature. In [
42,
43], assuming a DEM is available, the topography-related phase fringe pattern was removed before filtering. Other methods [
14,
44], instead, assumed a locally linear phase, estimated its fringe frequency, and removed the corresponding phase pattern. Contrary to these methods, we make no prior assumption, and rely exclusively on a suitable similarity measure. We implement this idea both in plain NLM, and in InSAR-BM3D, observing in both cases a significant performance improvement.
In the rest of the paper, we provide some background on nonlocal filtering for SAR interferograms (
Section 2), describe the proposed approach and its implementation in nonlocal filters (
Section 3), carry out experiments with both simulated and real-world data (
Section 4), discuss the results (
Section 5), and finally draw conclusions (
Section 6).
2. Background
In this section we recall the basic concepts of nonlocal filtering, with special attention to the InSAR phase filtering problem, and formulate the signal model and notation that will be used in the rest of the paper.
2.1. Nonlocal Filtering
As the name suggests, nonlocal filtering aims at overcoming the limitations of conventional or “local” filters, which estimate the value of the target pixel based on the values of its neighbors. Let
be the observed image, with
the clean image and
an additive noise component We consider an additive noise model, but the concepts are readily extended to other models. Focusing on linear filters, the estimate
of the clean image at location
p is obtained as a weighted average of noisy pixels
where
p and
q are spatial variables (we use single index for compactness), and
a suitable window centered on
p. For example, a space-invariant Gaussian filter has weights of the form
where
is the spatial distance between locations
p and
q,
C is a normalizing constant and
a decay parameter. Therefore, a Gaussian filter relies mainly on pixels close to the target to estimate the clean value. Other local filters differ in the choice of weights, but still keep relying mostly on close neighbors of the target, often limited to a small window centered on the target itself. The underlying assumption is that close pixels are generally similar to one another. This certainly holds in stationary areas of the image, where, in fact, local filters work very well, reducing the noise variance without introducing any bias in the estimate. However, in non-stationary areas, like at the boundary between different regions, or in the presence of compact objects, close pixels are not necessarily similar, and local averages introduce significant biases in the estimate, which translates into a resolution loss.
Figure 1 shows, for a sample noisy image (a), the output of boxcar filtering with a 3 × 3 (b), and a 9 × 9 (c) window, where the bias-variance trade-off appears clearly.
A large part of the recent denoising literature deals ultimately with this problem. The major breakthrough of nonlocal filtering is to abandon the underlying association between “close” and “similar”, and look for the best predictors without constraints. Accordingly, filtering is reformulated as a two-step procedure: (i) find the best predictors for the target pixel; (ii) estimate the clean value through a suitable average of the selected predictors. Of course, in this new paradigm the main issue becomes how to locate the best predictors in the whole image.
In their original paper [
22], Buades et al. introduced the concept of the patch-based similarity measure, and proposed the nonlocal-means (NLM) algorithm. The weights of NLM have the same structure as in the Gaussian filter
with the difference that the spatial distance,
, is replaced by an explicit estimate of signal dissimilarity
. Since the true signal is not available, the dissimilarity is estimated on small blocks
and
of the observed signal centered on the target and predictor pixels, for example by the sum of squared errors,
, which is optimal in the AWGN case. By using blocks, rather than individual pixels, one can reduce the impact of noise on dissimilarity estimation.
Figure 1d shows the output of nonlocal means applied to the image of
Figure 1a, where a large quality improvement appears w.r.t. local filters.
The nonlocal means paper has opened the way to intense work on nonlocal filtering, with a huge literature, including applications to SAR data filtering. Here, we only summarize, briefly, the BM3D algorithm, which is the basis of InSAR-BM3D [
30] used in the rest of the paper.
BM3D comprises three steps: grouping, collaborative filtering and aggregation, as shown graphically in
Figure 2. In
grouping, for each target block, a few similar blocks are singled out over a suitable search area, based on a block-similarity measure, and collected in a 3D stack. In the second step, these blocks are filtered jointly (that is, in a
collaborative way) to exploit both local (intra-block) and nonlocal (inter-block) dependencies. Filtering is performed by transform-domain shrinkage, by applying a separable 3D wavelet transform to the stack, shrinking the transform coefficients to attenuate noise, and back-transforming the denoised stack. Finally, in the third step, the filtered blocks are returned to their original positions in the image, and overlapping blocks are
aggregated to improve the estimation reliability.
A further significant difference with respect to basic NLM is the adoption of a two-pass filtering procedure. In the first pass, the noisy image, , is filtered as described above to produce a basic estimate, . This new image, however, is used only as a pilot to guide the second-pass filtering, which produces the final output, . The partially denoised pilot allows one to compute some key statistics for the wavelet shrinkage, and to compute more reliable similarity measures for the grouping phase, improving significantly the overall performance.
2.2. Signal Model
In our model [
2,
45], the interferometric pair
is expressed for notational simplicity, here and in the following we drop spatial dependencies unless necessary. as the product of two standard circular Gaussian random variables
with a suitable matrix
T
where
depends only on the true amplitudes,
and
, interferometric phase,
, and coherence,
.
Accordingly, the observed complex interferogram,
, can be regarded as a noisy version of the true interferogram, that is
where
depends on the true signal parameters, and
n is a zero-mean signal-dependent noise term [
46,
47,
48]
with covariance matrix
The terms on the main diagonal are the variances of the real and imaginary parts of the noise. In case of multi-looked data, the covariance matrix becomes simply , with L the number of looks.
2.3. InSAR Nonlocal Filtering
The application of the nonlocal means algorithm to InSAR data filtering is straightforward, we simply rewrite Equation (
1) to estimate the true complex interferogram as
where
p indicates the target pixel, and
q its neighbors in the window
, and the weights are computed according to the selected similarity measure. Note that NLM could be easily extended to estimate also the coherence and the signal amplitude. In this paper, however, we will focus only on the phase, and also, the similarity measure will rely only on phase information.
Although basic NLM is helpful to gain full insight into the main phenomena of interest, we will eventually focus on the state-of-the-art filter, InSAR-BM3D. At a very high level, InSAR-BM3D has the same algorithmic structure as BM3D. For each pixel, however, the grouping phase provides a couple of paired stacks, call them and , formed by blocks drawn from images and , respectively. These are used to compute a preliminary maximum-likelihood estimate of the local phase, , assumed to be constant over the block. The estimated phase is then used to decorrelate the real and imaginary parts of the interferogram. Such decorrelated components undergo collaborative filtering and aggregation, and are eventually processed to provide the filtered interferogram. A thorough description of the algorithm goes beyond the scope of this work, and we refer the reader to the original paper for more details.
3. Nonlocal Filtering with an Offset-Compensated Similarity Measure
As already said in the Introduction, we found experimentally that nonlocal filters work poorly on uniform gradient regions. Let us explain the nature of the problem by means of some synthetic examples, considering 1D signals and no noise, for the sake of simplicity. In
Figure 3(left) we show, on the top row, a simple periodical signal, with a target patch highlighted in red. This signal may be representative of a mostly flat orography, with smooth variations. Nonlocal filters look for similar patches in the neighborhood of the target, so we plot, in the middle row, the distance between the target patch and all other patches. The plot clearly shows that a large number of similar patches exist, some very close to the target, some farther away. In the bottom plot, we highlight only the most similar patches (identical to the target in this zero noise example) but many more good predictors are available, leading eventually to a very reliable estimate. In
Figure 3(right), instead, we consider a linear signal, representative of a constant slope orography. The distance plot in the middle row shows that only a few similar patches can be found, in the proximity of the target, some of which are highlighted in the bottom plot. With just a few predictors available, the variance of the estimate will grow significantly. These considerations are better substantiated in
Appendix A where, considering a simplified setting, we compute the closed form of the Equivalent Number of Looks (ENL) for the nonlocal means filter in the presence of a constant slope signal.
However, going back to the example of
Figure 3(left), it is clear that
all patches in the neighborhood share with the target the very same shape, and they differ from it only for a constant term. Therefore, if we succeed in estimating this constant term, that is, its phase offset w.r.t. the target, then all patches can contribute to the estimate, with a predictable performance improvement. In other words, with a correct estimation and compensation of the phase offset, all regions with a constant phase slope would be re-conducted to the ideal case of flat terrain.
These synthetic examples are instrumental to explain our basic idea in simple terms. Of course, real-world images differ from such ideal cases under several respects, especially for (i) a more varied underlying signal, and (ii) the presence of noise. The random nature of the signal implies that, even after phase offset compensation, not all patches will be similar enough to the target to be good predictors. Moreover, with significant noise, estimating the phase offset may become itself a problem, leading to additional errors, in which case, costs and benefits should be carefully balanced.
In summary, our proposal can be summarized with the following points:
for each phase patch in the neighborhood of the target, estimate the phase offset that minimizes the distance between the target patch and the offset-compensated predictor;
perform nonlocal filtering with offset-compensated patches in place of the original patches.
Note that we do not specify the distance measure explicitly: any measure depending on phase differences can be replaced by its offset-compensated version. Likewise, any nonlocal filter can be implemented with this new measure in place of the original one. Here, we consider only the cosine distance, already used in [
30] with good results, and implement the Offset-Compensated (OC) versions of NLM and InSAR-BM3D.
3.1. Cosine Dissimilarity
Let and be two size-N blocks of the interferometric phase image centered on pixels p and q, respectively.
We define the cosine dissimilarity between
and
as
Obviously,
attains its minimum value, zero, when
. Coherently with the above formula, we define the
offset-compensated dissimilarity between
and
as
Now, we seek the minimizer of the quantity in square brackets, say
, and the corresponding expression of
. If we rewrite the dissimilarity as
with
, the problem reduces to maximizing the latter summation. Now,
where
and
indicates the phase of
W. Of course, the maximum is obtained when
, therefore
and, consequently
Note that the problem of Equations (12) and (13) is formulated in directional statistics [
49] as the minimization of the dispersion of a circular variable, with sample
, with respect to a value
. The optimal solution of Equation (
16) is called the mean direction of the data, in that context, while Equation (
17) is the circular variance of the sample. Interestingly, the very same metric has been adopted is some recent papers concerning InSAR data filtering [
50,
51], although in a different context and with different aims.
Equation (
17) can be now used to select the best predictors for the target patch, as in InSAR-BM3D, or to assign suitable weights to all patches in the analysis window, as in NLM and related methods. Actual filtering will then rely on the offset compensated patches,
.
3.2. Preliminary Experiments with Offset-Compensated NLM
The impact of the proposed approach on state-of-the-art nonlocal filters, will be assessed in
Section 4 with reference to the offset-compensated version of InSAR-BM3D.
However, to better highlight some basic phenomena, we carried out some preliminary experiments with the much simpler OC-NLM. We wanted to understand how performance depends on noise intensity (hence coherence) and the local signal gradient. Therefore, we simulated simple test phase images, of a small size, 105 × 105 pixels, with constant coherence and phase gradient. In particular, we considered three levels of coherence,
, 0.6, and 0.9, corresponding to high, medium, and low-intensity noise, and eight phase gradients
with
covering from a very fast-varying phase (to the limit of aliasing) to almost a constant phase.
Figure 4 shows in false colors some of these test (wrapped) phase images, together with their ideal clean reference.
Figure 5, instead, compares the output of various NLM filters: basic NLM with cosine distance, the proposed offset-compensated version, OC-NLM, and a third adaptive version discussed later on. All versions use patches of 11 × 11 pixels, a search window of 21 × 21 pixels, and decay parameter
. These values were selected by grid search through preliminary experiments on the synthetic test images described in
Section 4. However, the minimum of the average error for the selected value of
appears to be rather flat, hence non-critical. At low coherence and a large phase gradient (top row), basic NLM exhibits large errors, many phase fringes disappear throughout the image, and the overall structure of the phase is barely recognizable. The use of offset-compensated similarity grants a large performance improvement, allowing a good recovery of the general phase structure, although some significant errors keep appearing here and there. These are due to the intense noise affecting the original image. Although offset compensation provides a larger number of good predictors, the offset itself is not known in advance, and its wrong estimation introduces new errors. To reduce such errors, we consider a simple two-pass strategy similar to BM3D. The filtered phase obtained in the first pass is used in place of the original noisy phase to estimate the local offset in Equation (
16) and the similarity measure in Equation (
17). These quantities are then used to filter the noisy phase (not its first-pass estimate, to avoid drifts). This prefiltering strategy leads to a very good result, as visible in the rightmost figure. At medium coherence (second row) NLM works much better than before, and all phase fringes are correctly recovered although with significant local errors. Again, offset compensation grants some improvements, with near-perfect phase restoration, and prefiltering cannot improve further. Finally, at high coherence (third row) all versions of NLM work satisfactorily. In the bottom row, we show the interesting case of low coherence and low phase gradient. With such slow phase variations, there are plenty of good predictors in the neighborhood of the target, and NLM works already satisfactorily. On the contrary, due to errors in offset estimation, the OC version causes a clear performance impairment, which is only partially recovered thorough prefiltering. This observation confirms that on flat terrains, or in the presence of a small phase gradient, offset compensation is useless or even harmful and should not be used. To obtain the desired adaptivity to the local phase gradient, we compute the power spectrum of the signal in a small window centered on the target, and activate phase compensation only if a dominant peak exists at the medium-high frequencies.
A pseudo-code of the decision procedure is reported in Algorithm 1. Offset compensation is activated if the power spectrum peaks at medium-high frequencies (), and this peak is truly dominant, namely there is no comparable (within 10 dB) peak in the power spectrum far from the maximum (). While the first condition ensures that offset compensation is activated only in the presence of a significant slope, the second one reduces the risk to setting it on due to random noise fluctuations.
Algorithm 1 OC-Switch. |
Require: | ▹ input patch, decision thresholds |
Ensure: OC-switch | ▹ output on-off switch for offset compensation |
1: set OC-switch to OFF | |
2: | ▹ compute the power spectrum of |
3: | ▹ find the peak of G |
4: | ▹ distance of peak from origin, proxy for frequency |
5: | ▹ large-G region, within 10 dB of maximum |
6: | ▹ radius of large-G region |
7: if AND then | |
8: set OC-switch to ON | ▹ activate offset compensation |
9: end if | |
5. Discussion
The experimental results confirm the potential of the proposed approach and make also clear its limits. Offset-compensated similarity improves results significantly in regions with a large phase gradient, especially at low coherence. This is especially clear with nonlocal means, in which case large improvements arise, but also with the state-of-the-art InSAR-BM3D. In the critical Ramp image, only the offset compensated version of InSAR-BM3D succeeds in preserving all phase fringes, whatever the slope, even in the very low-coherence region.
On the down side, in regions with a moderate phase gradient the performance may even be impaired, because the number of good predictors cannot be increased through offset compensation, while the estimate of the phase offset itself adds a new source of errors. For this reason, we propose an adaptive use of offset compensation, which is activated only in the presence of a significant phase slope. We also observe that, in the presence of a relatively high coherence, offset compensation does not provide significant improvements for state-of-the-art filters. On the other hand, at high coherence, improving phase filtering is not essential, because subsequent processes, like phase unwrapping, work already well. Effective and accurate filtering becomes important when the noise intensity is such that it impairs the performance of analysis methods, hence, at low coherence.
The goal of this study was to prove the potential of offset-compensated similarity in the context of InSAR phase filtering. Therefore, many topics that deserve further investigation were not considered in this first work. First, we focused on the cosine similarity measure, but the offset compensation concept may be easily applied to other measures as well. Moreover, these measures may take into account also amplitude and coherence to possibly improve performance and, of course, the aim of filtering can be also extended to the restoration of amplitude and coherence data themselves. Finally, offset-compensated versions of other nonlocal filters could be also developed and tested.
6. Conclusions
Nonlocal filtering methods suffer from the well-known rare patch effect. For uncommon signal patches, there is a scarcity of suitable predictor patches in the surrounding, which leads to low-quality restoration. This problem is especially relevant for InSAR phase filtering, because large regions of the image, characterized by a large phase gradient, and corresponding to areas with a constant slope, fit in the rare patch category. In such regions, even state-of-the-art nonlocal methods may provide poor results, with the filtered phase characterized by large errors and the presence of many residues.
In this paper, to deal with this problem, we proposed the use of offset-compensated similarity measures. For each candidate predictor patch, the mean phase offset w.r.t. the target patch was estimated and compensated for before computing similarity. By doing so, the pool of valid predictors can be significantly enlarged, allowing for a reliable estimate. Experiments on simulated and real-world data proved the validity of the approach.
Future work may explore the use of offset compensation with other similarity measures and the extension to other types of data and nonlocal filters.