3. Methodology
With the introduction of InSAR time series, we aimed to develop an enhanced method to analyze a collection of a series of InSAR pairs, which were indicated as a packet (refer to
Section 4 for detailed concept and implementation), over a given time domain. As reported by Bruzzone et al. [
12], extracting the statistics such as mean values and standard deviations for phase coherence and amplitude vectors is a basic way to build features from time series InSAR products. However, the random noise commonly associated with phase coherence and amplitude vectors of InSAR pairs can significantly skew the means and standard deviations of these characteristics. Moreover, weak correlated InSAR pairs contribute equally to these statistics, potentially resulting in unexpected feature values. To address these issues, we applied dimensional compression through principal component analysis (PCA) [
13]. The time series was transformed through PCA and the effects of weak correlated InSAR signatures were excluded. From the preliminary experiments, we observed that the first principle component usually included constant values for phase coherence and amplitude. The values were used as the replacements for the means of signatures. The second PC normally included values that varied widely. Once the random noise components were excluded, values in second PC were used as an effective indication for temporal variation. As higher order components of PCA were noisy in packets, these components were not considered in establishing surface features.
Through the processing, we also found that the noise component in the SAR signatures significantly impaired the capability of classification scheme. The errors were due primarily to inconsistent phase coherence caused by associated weather factors, such as wind and moisture change, and mis-coregistration of SAR images. Even with PCA transformation, a large number of errors in InSAR signatures were not screened out in high-order components and degraded the classification efficiency. To improve the performance, we further introduced kernel PCA (k-PCA) to replace standard PCA. Nonlinear transformations through k-PCA reduced errors in SAR signatures and constructed more robust feature spaces. This study used a nonlinear kernel matrix instead of the Gram matrix introduced for PCA as proposed by Scholkopf et al. [
14]. The kernel matrix is expressed as:
where
Φ(x) is a nonlinear function of feature vector
x.
The projection of feature vector
x on a principle axis,
wi, is represented as:
The calculation and normalization can be completed by the following eigenvector equation:
where
N is the number of observations and
λ and
a are the eigenvalues and eigenvectors of
K, respectively. This study employed the following Gaussian kernel matrix:
where
σ represents the standard deviation. As Mika et al. [
15] stated, this type of processing is useful for removing extraneous noise from the data and for constructing new feature spaces that are representative of the nonlinear component of observations. These steps are essential for rigorous analyses due to the complexity of InSAR signatures over different types of land cover particularly in phase coherence. Employment of phase coherences as the signatures for land cover classification is the variation of temporal incoherence issue. The total coherences can be decomposed as:
The thermal coherence
Cohthermal and the spatial coherence
Cohspatial are respectively expressed as:
where
B is the perpendicular baseline,
Ry is range resolution,
θ is the incidence angle,
λ is the wavelength,
α is the local surface slope and
ρ is the distance between sensor and target image [
16]. The thermal and spatial coherence are extractable from system and orbital characteristics such as by perpendicular baseline and can be compensated. Moreover, as the thermal and spatial coherences are close to unique values, their effects to land cover classifications are minor.
Temporal coherence is a complicated issue. Lee et al. [
17] described the pattern of temporal coherence as below:
where
Ct is the decay constant of temporal coherence and Δ
T is the time period of InSAR pair observation. We assumed that the decay patterns of L-band phase coherences correspond to the land cover types. Hence we modified the model of temporal coherence as listed below:
where
C1 is the constant to converging value of temporal coherences and
C2 is the depending parameter of the characteristics of land cover types. It was expected that the temporal observation of phase coherence in time series could be useful for land cover classification based on the characterized model of temporal coherence. This assumption was reviewed using the classified land cover in
Section 5.
Although k-PCA is highly effective at suppressing random noise and accounting for the nonlinearity of signatures, there are still mixed components of newly constructed InSAR features, resulting in misclassifications between land cover types. Therefore, an advanced classification scheme that manipulates mixed InSAR signatures, rather than relying on the conventional classification algorithms such as maximum likelihood (ML), was introduced. Support vector machine (SVM) analysis is expected to address classification errors in this case because it maximizes the extent to which spaces can be subdivided, using hyperplanes as distinguishing functions. This methodology was introduced by Cortes and Vapnik [
18]. The hyperplane approach of SVM effectively addresses the ambiguous nature of some InSAR signatures through nonlinear classification. SVM is also based on a kernel operation in which a vector of feature spaces is transformed into hyperplanes as follows:
Equation (10) is used to distinguish and measure a single vector. By introducing nonlinear operations into transformation
Φ(
x), the SVM can provide a robust method for classifying unstable features, such as InSAR signatures. After all, the whole processor we proposed was composed of the k-PCA of backscattering coefficients and phase coherences respectively, and the SVM classifier of newly constructed features by nonlinear transformations together with the pre-processors of speckle filtering and radar shadow masking. The k-PCA plus SVM classification schemes for accommodating the nonlinearity of feature space were conducted. To assess the performance of k-PCA plus SVM scheme, traditional PCA plus ML processor was also implemented. The results derived from two processors were analyzed and reported in
Section 5, showing the classification results and vegetation indices from optical image.
It was noted that for all constructed features, classification algorithms were tested using training vectors measured by 10 m resolution EO-1 ALI images and partially, i.e., for the eastern area, by the Google Earth historical image archive which was taken by Digital Globe IKONOS with 0.8 m panchromatic and 3.2 m multispectral resolution. It was clear that such training vector definitions using the secondary information would downgrade the classification accuracy. However, as ground surveying data over Mt. Baekdu were not available, the use of secondary information for the training vector definition was unavoidable.
4. Test Sites and Data Description
We selected two test sites with different land cover types to investigate the performance of the proposed classification scheme. The first test area is around Mountain Baekdu (Mt. Baekdu, also known as Mt. Changbai in Chinese), a volcano located on the border between North Korea and China. The site is covered by dense vegetation with minor settlements and bare fields. In order to assess the potential risk caused by various natural disasters including floods, landslides, forest fires and volcanic activities, the land cover map over Mt. Baekdu is essential. However, due to local political tensions, reliable land cover datasets were prevented from being published. Up to now, only a limited number of precedent studies for determining land cover over Mt. Baekdu have been conducted with space-borne optical imagery. For instance, Liu et al. [
19] employing Landsat Thematic Mapper (TM) images over Mt. Baekdu showed the dependency of tree species on the altitude. A study conducted by Park et al. [
20] analyzed MODIS normalized difference vegetation index (NDVI) change over the China and North Korea border together with land cover types. In order to achieve a comprehensive understanding of environment over Mt. Baekdu, we conducted a time series land cover monitoring based on InSAR analysis in the first test site and evaluated its performance.
Since SAR signatures are useful to define land cover types, it is necessary to consider the appropriate SAR wavelength. As C-band ERS and ENVISAT ASAR image data were constrained by the physical penetration depth, their InSAR signatures are primarily reflected over vegetated canopies. Thus, the C-band wave is highly sensitive to changes in moisture and wind direction over time [
4]. This indicates that, considering both the expected weak phase coherences over the vegetated and the steep sloped areas, C-band SAR images may not be suitable for investigating land cover over dense forest in the test sites. ENVISAT ASAR data was therefore excluded from the pool of potential data sources. In contrast, due to the characteristics of long wavelength, SAR images taken by Phased Array type L-band Synthetic Aperture Radar (PALSAR) installed on the Advanced Land Observation Satellite (ALOS) were expected to be ideal data sources as they were capable of producing high phase coherence over highly vegetated areas. For instance, ALOS PALSAR images pairs in our study demonstrated 2–3 times higher phase coherence over the area with Enhanced Vegetation Index (EVI) > 0.2 compared to ENVISAT ASAR case of corresponding temporal baseline. Although PALSAR images also have difficulty to effectively distinguish certain land cover types, such as grass and crops, L-band SAR signatures are useful tools to discriminate open/mixed forests, which is the main land cover types in our test sites. Time series PALSAR observations are also helpful to define seasonal effects occurred in the field. Therefore PALSAR images are applied, where a total of 12 ALOS PALSAR Fine Beam Mode Single Polarization (FBS) and Fine Beam Mode Dual Polarization (FBD) images over Mt. Baekdu acquired during 2009–2011 were employed in this study. The details of the PALSAR images used are listed in
Table 1.
The second test site was in Uljin located in the eastern Korea, where dense forests are widely populated. Since few residential and cultivation areas are also distributed here, it can be used for the assessment of an algorithm employed for classifying minor land cover types. Over Uljin target areas, nine ALOS PALSAR FBS and FBD images were employed and the details are listed in
Table 2. As ALOS PALSAR images were used in this study, considering the characteristics of L-band data, land cover type definition was introduced within the context of targeted geometric properties that were defined with respect to the standing heights and densities of vegetated materials. A total of five land cover types were assigned to Mt. Baekdu, including bare fields, open/mixed forests, conifer forests, water bodies and shrubs/grasses. While for the case of the Uljin area, the artificial structures were introduced instead of a bare field considering the land cover features of the target area. Radar-shadowed areas, which sometimes are misclassified, were predefined, according to conditions existing during the acquisition of SAR images. These areas were then masked off in each of the processing stages.
To evaluate the InSAR packet approach, a set of InSAR packets were constructed considering adjacency of InSAR pair and seasonal coverages.
Figure 1 illustrates the composition of packets in this study, including HH1 (2009/07/19–2010/03/06), HH2 (2010/03/06–2010/10/22), HH3 (2010/10/22–2011/03/09) and HH_HV (2010/06/06–2010/10/22) packets in Mt. Baekdu, while HH1 (2007/10/14–2008/05/31) and HH2 (2008/07/16–2009/01/16) in Uljin. Regarding registration accuracy of image pairs within InSAR packets, all data sets were crossly registered by the InSAR processor up to sub-pixel accuracy.
5. Processing Results and Discussion
The initial classification results derived from conventional ML classifiers and InSAR features set by PCA analysis are shown in
Figure 2, in which classification maps of HH1–HH3 packets and a hybrid packet are illustrated.
The InSAR classification maps generated through k-PCA plus SVM processor are shown in
Figure 3. Due to a lack of ground truth for verifying classification results, we alternatively found altitudinal distribution map of vegetation species originated by the volcanic investigations of Mt. Baekdu as the reference data. Zhao [
21] identified that the normal altitudinal dependency of tree species from conifer to deciduous trees is broken in the south-eastern flanks of Mt. Baekdu, which is due to directional intensive tephra covering. Subsequent studies conducted by He et al. [
22], Dai et al. [
23,
24], and Park et al. [
20] continued to identify the tree distributions. The altidudial tree distribution over Mt. Baekdu from their studies included: (1) tundra over 2000 m altitudinal line; (2) birch dominated forest between 1700–2000 m altitudinal line; (3) spruce fir and birch mixed conifer forest between 1100–1700 m altitudinal line; (4) pine and broad-leaved mixed forest between 740–1100 m. As the findings were commonly supported by most of the literatures, the altitudinal dependency was used as the reference data to assess the classification accuracy. In order to compare with our observations, we mapped our land cover classes to their altidudial tree species dependences based on following principles: (1) tundra to barefield; (2) birch dominated forest together spruce fir to conifer forest and (3) pine and broad-leaved mixed forest to open/mixed forest. It was found that the altitudinal tree species transition lines (1100 m, 1700 m and 2000 m) presented in
Figure 3 fit well with our land cover classification types, especially transition between bare field and conifer, and conifer and open/mixed forest.
Moreover, it was observed that the altitudinal dependency in the south-eastern flanks was broken in the classification results as Zhao [
21] assigned (refer to dotted ellipse in
Figure 3). While the classification demonstrated in
Figure 2 shows that open/mixed forests extend up to an altitude of 1600 m and conifers rarely distributed. The results disagreed with the altitudinal distribution summarized above, indicating a PCA feature extraction scheme failed to provide reliable signatures to support classification. Therefore we concluded that the land cover classification scheme using k-PCA and SVM retained better efficiency and identified the land cover types well, especially vegetation distributions.
Overall, the classifications from all packets in
Figure 3 produced similar results, in which different distributions of trees in open/mixed forests between HH1 and HH3 are found over the target area. The open/mixed forest coverage was minimal in the classification from the HH2 packet because the InSAR images were acquired during spring and summer. Open/mixed forest coverage was more extensive in classification results from the HH1 packet that included InSAR data from the fall and winter. InSAR images for HH3 were acquired in the winter and early spring. The images were affected by seasonal factors such as snow cover and fallen leaves, therefore, the classification results from HH3 show limited distribution of open/mixed forest. It should be noted that the land covers classified by the HH-HV packets did coincide well with the land cover classified by the HH packet signatures. The compilations of different polarizations in HH-HV packets was performed to produce an enhanced feature space to effectively distinguish forest cover types. The conifer tree lines for all packet combinations showed agreement with the 1700 m altitude, presumably owing to the consistency of InSAR signatures, covering all seasons, from conifers.
Based on the classification outputs, we also investigated the effectiveness of k-PCA, PCA and original phase coherence time series as the phase coherence signature involving the temporal coherence issues as described in the
Section 3. The temporal decaying pattern of phase coherence on land cover type followed the characterized models but demonstrated somewhat similar patterns especially over some vegetation types such as conifer-to-open/mixed forest case (see
Figure 4a). In addition, temporal phase coherence has quite dependences on the weather condition at the times of image acquisitions. For instance, compared to the European Centre for Medium-Range Weather Forecasts (ECMWF) wind information demonstrated in
Figure 4c, the deviations of third and fourth InSAR pairs from decaying model might associate with the wind effects on vegetation leaves as described in Wegmuller and Werner [
3]. As also stated in Wegmuller and Werner’s analysis, the moisture changes in vegetation canopy and soil induced by the precipitation can produced significant effects in phase coherence as the permeability of target electromagnetic matter become inconsistent interacting with wind speed and direction. However, due to the following two reasons: (1) the precipitation effects on the phase coherence over the target area was mainly produced by the local turbulences which is not predicted by ECMWF [
25]; (2) the mechanism of phase coherence change by the precipitation is largely complicated due to the interacting with the wind, we only analyzed the inter-relationship between phase coherence considering that wind component may imply some effects of local precipitation that cannot be accurately investigated independently. Those factors caused uncertainties for the establishment of land cover classification scheme using time series phase coherence. On the other hand, phase coherences with k-PCA transformations produced well distinguished temporal signatures as shown in
Figure 4b. It is noted that the spline lines fitting k-PCA values of land cover types implied that the transformed temporal migration of k-PCA values consists better discriminated signatures but less influenced by wind environment. On the other hand, eigenvalue analysis of PCA transformation presented the first components of PCA transformation retained only 60–65% of the total variance; thus non-linear k-PCA is the better approaches to create proper features from highly scattered signatures. This advantage proved that the k-PCA analysis we proposed to generate transformed signatures was appropriate to be used for an advanced classification scheme, especially a new feature space construction method, to efficiently extract land cover data from SAR signatures.
The effectiveness of k-PCA/PCA of phase coherence packets for establish the proper signature was assessed by the Jeffreys-Matusita (J-M) distance [
5]. It is represented below as the metric:
where
Mi is the covariance matrix of class
i,
µi is the mean values of class
I and |
Mi| is the determinant of the covariance matrix of class
i.
Iij is the
J-M distance, for which high distance indicates high efficiency.
Jeffreys-Matusita distances for all land cover combinations using phase coherence time series, PCA and k-PCA components were extracted and shown in
Figure 5.
It is demonstrated that the efficiency of k-PCA approach to establish better signature for land cover classification compared to the simple phase coherence time series. Firstly, it was noted that the last component of the conventional PCA signature was excluded, thus three major components among all four signatures failed to provide better separability compared to the original phase coherence time series. It could be inferred that various noise component described in
Section 3 was not well discriminated in the land cover signatures. On the contrary, the k-PCA provided better separability for all land cover combinations except water, which was expected due to the fact that the water surface induced always the lowest phase coherence. For all other land cover combinations, the separabilities of k-PCA approach were constantly higher even the case of the HH2 packet with the shorter duration. It means k-PCA approach properly addressed the seasonable phase coherence variation.
Evaluating quantitatively the classification results for the Mt. Baekdu test area was also difficult since no reliable source of information about the actual composition of land cover in the area is available. We firstly assessed the accuracy of our results by comparing MODIS land products (e.g., vegetation indexes) with the correlation between vegetation classes and altitude. MODIS EVI and NDVI maps extracted using the temporal average of MODIS MOD 13 products corresponding to InSAR packet periods (
Figure 6) show that the EVI and NDVI in the target area became maximized in the HH2 packet period corresponding to summer season and minimized in the HH3 packet period which was taken during winter-spring season. It was found that the classification results using the InSAR signature packets was capable of capturing temporal changes in vegetation density. EVI distributions (
Figure 6b), particularly those corresponding to the HH3 packet period, are similar to extracted land cover distributions, perhaps due to the less saturated nature of EVI.
As illustrated in
Figure 7, we performed additional analysis by computing the mean vegetation index and altitude for each type of land cover. Results confirmed that land cover types depended on vegetation indexes and altitude as two distinguishing characteristics of target area environments. In
Figure 7a, it is shown that PCA analysis produces less consistent values of mean vegetation indices for each land cover class than k-PCA does, indicating that k-PCA performs better classification. However, per k-PCA results (
Figure 7b), the altitudes and vegetation indexes for each land cover class are clearly consistent and possess only small deviations. For instance, low NDVI and EVI values for shrub cover on high-altitude mountain summits agree with the map presented in
Figure 6b. The results also demonstrated that NDVI values for low-altitude open forests were less than NDVI values for conifer forest coverage. According to results shown in
Figure 7, conifer forest coverage naturally returns the highest vegetation index values, which demonstrates the robustness of the approach using InSAR time series analysis.
Detailed information about the optical images employed for the inter-comparison is given in
Table 3. The temporal adjacency to the employed InSAR packet and then cloud fraction was firstly considered for the optical image selection. The subpixel registration accuracy compared to InSAR image were checked using manually measured control points and SVM classification were applied afterward. In the form of a confusion matrix,
Table 4 provides the results of quantitative comparison against the classification implemented based on Landsat 8 optical images over Mt. Baekdu. The optical image acquisition time was temporally similar to the HH2 InSAR time series. Other metrics to assess classification accuracy in
Table 3 were calculated as following notations [
26]:
where
TP is the number of corrected extract class,
TN is the number of corrected rejected class,
FN is the number of undetected class,
FP the number of incorrected extracted class,
T is total number of pixel in test and
S is
(TP + FP)(TP + FN) + (FN + TN)(FP + TN).It is observed that the k-PCA–SVM scheme performed better than PCA-ML in terms of metric of confusion matrix. Comparing optical image classification demonstrated relatively good agreement, except for the shrub and grass cases. Since there is no assurance for the classification accuracy of LANDSAT-8, the values only demonstrated the relatively high reliability of the InSAR packets approach. The low coincidence in the shrub and grass classes might have originated from the full penetrability of InSAR signatures over shrub and grass areas, given their short height.
Based on the results demonstrated over Mt. Baekdu, all packets clearly delineated reasonable boundaries between shrubs, open/mixed forests and conifer forests. Moreover, after comparing with the classification results from the PCA analyses, the performance of k-PCA-SVM scheme with time series packets was significantly improved in accuracy and efficiency.
With the improved performance, it was found that few errors occurred in the land cover classification maps. Firstly, due to radar characteristics, roads and airfields, which comprise only very small portions of the target area, could not be distinguished from barefields. Shrubs/grasses and cultivations could not be differentiated because of the similarity between their standing heights and the deep penetration of the L-band waves. Secondly, the land cover classification maps respectively derived based on the same training vectors and land cover types from SAR and optical EO-1 ALi image were compared (as shown in
Figure 8). The conifer and birch forests seem to have been classified. However, as we were unable to find SAR and optical images covered similar temporal period, which becomes the limitation of this approach, their distributions in the optical and InSAR time series schemes are not very similar. For the same reason, snow cover is identified in the InSAR time series scheme as a water body. Thirdly, decorrelation over steep topography [
27], together with snow cover in the winter, might be the reasons resulting in classification errors in summit areas. Since phase decomposition was capable of eliminating such errors as reported by Wang et al. [
28], the introduction of phase decomposition will be further introduced in future improvement.
Considering the dependences on the weather condition and consequent effect of land cover classification, L-band SAR is less sensitive for temporal phase coherence change [
5]. To investigate the robustness of L-band phase coherence signature, we tested the L-band phase coherence land cover classification scheme over Uljin, the second test site where the orographic effects caused the seasonable wind and precipitation changes [
29]. It usually presented significant draught in spring to summer season and heavy precipitation in wintertime together with corresponding wind directional change.
As shown in
Figure 1, the packets covered the winter and summer seasons, it is known that the phase coherence deviation from temporal coherence model is significant due to the orographic effects as described.
As shown in the
Figure 9a contrast image, it mostly covered dense forest where the EVI is higher than 0.3 and mostly classified as mixed and broadleaf forest area in
Figure 9b with Landsat 8 images.
Figure 9c,d show the k-PCA classification results of InSAR signatures in Uljin. Notably, we employed two HH packets and also tested their ability to discriminate artificial structures, which were less distributed in Mt. Baekdu case. The discrepancy between U-HH1 (
Figure 9c) and U-HH2 (
Figure 9) classification occurred in forest areas was explained by seasonal effects. Moreover, the distribution of artificial structures in U-HH2 classification was less than that of U-HH1, and the resolution limit of the ALOS PALSAR product could have resulted in the relatively different classification accuracy in spatially mixed land cover types.
Table 5 shows the results of classification evaluation over Uljin compared with the Landsat 8 image classification. The low accuracy of urban and residential areas might have originated from the highly mixed land cover types over farmlands, where artificial structures and crops are closely populated and thus cannot be discriminated well by the low resolution of ALOS PALSAR.
Overall, the tree distribution by land cover type over Mt. Baekdu agreed well with the results of the precedent studies. The extracted altitude dependency in particular confirmed the accuracy and effectiveness of our classification algorithm for investigating vegetation. However, the InSAR packet approach we proposed could not classify specific types of land cover (e.g., roads and cultivation areas) as the image resolution is not high enough to produce characteristic InSAR signatures. To address this, finer-resolution L-band SAR images (e.g., ALOS PALSAR-2 and future NASA-ISRO Synthetic Aperture Radar (NISAR)) should be used to classify more detailed types of land cover. Regarding the data employed to the SAR packet approach, although it is difficult to achieve reliable land cover classification over the environment covered by forest, water and snow using C-band SAR characteristics, we are expecting that short temporal baseline C-band InSAR, such as Sentinel-1, will be applicable for the land cover classification in flat built-up area where the main aim of the classification scheme is to discriminate artificial structures and partial vegetation.