1. Introduction
Currently, with the rapid development of synthetic aperture radar (SAR) system technology, image resolution is getting higher, and many SAR systems have full-polarized imaging mode with different frequencies, such as the X-band TerraSAR-X, the L-band ALOS/PALSAR, the C-band RADARSAT Constellation Mission and the C-band Chinese Gaofen-3 satellite. Such SAR systems can also acquire data at different imaging modes, including the single-polarized, dual or even qual-polarized modes. However, the pulse repetition frequency (PRF) is increased twice in the full-polarized mode compared with the single-polarized mode. Due to limiting factors such as ambiguity ratio and data rate, resolution of full-polarized mode is usually lower than the single-polarized mode in the current spaceborne SAR systems. Taking the Gaofen-3 satellite as an example, the resolution can reach 1 m in the single-polarized mode, while the highest resolution of the full-polarized mode is 8 m, which is a great difference. It should be mentioned that the above modes are not possible simultaneously, which may bring some differences in the spectral response of a given area between two different acquisitions in terms of factors such as moisture on the ground surface and eventually on the geometry of SAR system. As a consequence, high resolution and full polarization usually have their own emphasis in applications.
Land cover classification is a fundamental issue in SAR image applications. High resolution and polarimetric information both have their own advantages and limitations. Polarimetric SAR (PolSAR) data can distinguish ground targets with different scattering mechanisms and possess the capability of unsupervised classification. There is already massive research on polarimetric decomposition and classification. In the aspect of polarimetric decomposition, there are many classical methods, such as the Huynen decomposition [
1], Krogager decomposition [
2], Cloude–Pottier decomposition [
3], Freeman–Durden three-component decomposition [
4], Yamaguchi four-component decomposition [
5,
6], and Pauli decomposition [
7] etc. Moreover, the polarimetric decomposition theorems still show sustainable development in recent years [
8,
9,
10,
11]. In addition, based on the polarimetric decomposition and feature extraction theories, many PolSAR image classification methods have also been proposed [
3,
12,
13]. For instance, the classical H/alpha method [
3] can divide the image into eight categories with physical meanings, and the method has been widely used. Further study indicates that adding some image domain information such as the image intensity, texture information, and statistical information can improve the classification accuracy. The H/alpha-Wishart method [
14], Span/H/alpha-Wishart method [
15], FDD-Wishart method [
16], GD-Wishart method [
17], and the GD K-Wishart method [
18] are all typical cases. In addition, the recently proposed tDPMM-PC method has achieved desirable performance on both low- and high-resolution images over areas of different heterogeneity [
19]. However, for the spaceborne SAR system, the resolution of full-polarized mode is relatively limited, and it is currently difficult to achieve refined classification, especially in urban areas.
Comparatively, the single-polarized mode possesses a higher resolution, and the image intensity distribution, texture, and context relationship contain rich information. In research about single-polarized SAR image classification, traditional methods mainly focus on the image statistical distribution. Zhu et al. [
20] extracted villages, water surfaces, and farmland by texture wavelet transform. Hu et al. [
21] used the semi-variogram and gray level co-occurrence matrix (GLCM) to mine texture information, and then combined the support vector machine (SVM) to extract water and residential area. Based on the basic characteristics of SAR images, Wu et al. [
22] made a rough classification of built-up areas, forests, water areas, and other general areas. In addition, V.V. Chamundeeswari et al. used region texture segmentation and contour tracking method to realize the fundamental distinction of water, urban areas, and vegetation [
23], and then optimized classification by considering statistical information of texture and using the principal component analysis (PCA) method [
24]. Thomas Esch et al. [
25] realized the differentiation of basic landcover types (water area, open land, woodland, and urban) by considering the speckle and textural information in different local areas. However, as the information hidden in single-polarized SAR image is usually complex and diverse, it is still very hard to conduct unsupervised classification based on prior knowledge.
Deep Neural Network (DNN) can combine low-level features layer-wise to get abstract and distinguishing feature representation, thereby achieving effective feature extraction. In recent years, deep learning has grown by leaps and bounds and has been used extensively. Related technologies have also been applied to SAR image classification, and some contests such as the GEOVIS’s Gaofen Challenge on Automated High-resolution Earth Observation Image Interpretation has released SAR image classification datasets and set corresponding race [
26]. At present, in the field of single-polarized SAR image classification, deep learning has gained some achievements, and some common deep learning models, such as the deep belief network (DBN) [
27], convolutional neural network (CNN) [
28], auto-encoder (AE) [
29] and others have been applied. These methods mainly use the deep learning method to extract image features and combine the ground truth data to perform supervised classification. By contrast, in the field of PolSAR image classification, deep learning has been applied more. With supervised training, deep learning methods achieve higher accuracy than traditional PolSAR classification methods [
30,
31]. Currently, extracting polarimetric characteristics by polarimetric decomposition, getting image structural and texture features by deep learning methods, and combining them for supervised training to achieve higher accuracy is one of the principal thoughts of mainstream methods. Considering the statistical distribution of PolSAR data, Xie et al. [
32] put forward a classification method combining the Wishart distribution and autoencoder. Chen et al. [
33] proposed a PolSAR image feature optimization method on the integration of multilayer projection dictionary learning and sparse stacked autoencoder, effectively improving the classification. Lv et al. [
34] investigated PolSAR image classification using the DBN. Zhou et al. [
30] converted the polarimetric covariance matrix into six imensions and input it to CNN to achieve feature extraction. Taking account of both the amplitude and phase information of the complex data, Zhang et al. [
35] proposed a complex-valued CNN that extends CNN to the complex domain and obtains desirable performance. We can see that these classification methods based on deep learning are mainly data-driven currently, and the quality and quantity of datasets both have a great impact on classification performance. However, labeling SAR data is a laborious task and there are few labeled samples for study, which limits the application of deep learning in SAR image classification to a certain extent [
36].
Moreover, the introduction of the deep learning technique breaks down the barrier between polarimetric information and image domain information, so that the information can be extracted and utilized in the same network and contribute to the final classification goal. Using the compact convolutional neural network, Mete Ahishali et al. made a preliminary attempt to combine image feature and EM channels for PolSAR image landcover classification [
37]. However, this work needs labeled samples and the generalization ability needs further improving. Since the current mainstream DNNs are better at tackling information from the image domain, how to integrate polarimetric information to perform the deep learning frameworks more effectively is still an issue needing further inquiry. How to make full use of all kinds of information through network structure optimal designation and reasonable input data construction, so as to improve classification accuracy and reduce the dependence on the volume of training examples, is a focus of current research in SAR image classification.
To explore this study, it is essential to mine polarimetric characteristics and image domain characteristics extracted by DNN and analyze their traits and correlation. There are few explorations in this regard at the moment. A few relevant studies include these: Song et al. [
38] used the DNN to recover polarimetric data from single-polarized data for radar image colorization; based on Cloude–Pottier decomposition; Zhang et al. [
39] predicted the polarimetric characteristic parameters (scattering entropy, scattering alpha angle, and anisotropy) from single-/dual-polarized data using CNN; Zhao et al. [
40] studied the potential of extracting physical scattering characteristics from single-/dual- polarized data by using convolution network training; and Huang et al. [
41] learned physical scattering properties from single-polarized complex SAR data with the time-frequency analysis method and DNN. In previous work, we have used CNN to study the potential of recovering PolSAR information from high-resolution single-polarized data, which is trained by polarimetric data with lower resolution, and we have also discussed the correlation between image domain information and polarimetric information to some degree [
42]. The above research shows there is some correlation and redundancy between image domain and physics domain information. However, current research is the result of supervised training with the polarimetric results as the label and still lacks detailed comparison and analysis.
To this end, we propose a complete set of feature extraction, comparison, and analysis methods between high/medium-resolution single-polarized SAR image and medium-resolution PolSAR (MRPL-SAR) data, which is oriented towards improving unsupervised SAR image classification performance. And we summarize the experimental results and discuss the characteristics and classification performance of high-resolution single-polarized SAR (HRSP-SAR) image, medium resolution single-polarized SAR (MRSP-SAR) image, and MRPL-SAR in different application scenarios. The main contributions of our work include:
We innovatively proposed a complete set of feature extraction, comparison, and analysis method between single-polarized and PolSAR images; in particular, facing the difficulties in unsupervised classification of single-polarized SAR (SP-SAR) images, we adapted the DCEC algorithm to this problem and achieved good clustering.
Based on the feature comparison and analysis method, we carried out many experiments, including feature comparison between SP-SAR data and PolSAR data with the same resolution, between HRSP-SAR data and MRPL-SAR data, etc. Based on the results, the characteristics and relationships of different ground types under high resolution and polarimetric situations are summarized, which provides guidance for further applications such as SAR image classification.
An information fusion strategy is proposed, which breaks the boundary of information combination of images of different imaging modes. The strength of HRSP-SAR image feature and PolSAR data physical scattering information are fused for better landcover classification.
The remainder of our paper is organized as follows:
Section 2 introduces the feature extraction and comparison method.
Section 3 presents our urban area fine-grained classification approach. Experimental results and specific analysis for typical ground targets are given in
Section 4, and
Section 5 presents conclusions and prospects.