1. Introduction
Recent advances in remote sensing sensors, especially for hyperspectral imagery (HSI), increase the possibility of more accurate discrimination of materials of interest [
1,
2]. Given a set of observations (i.e., pixel vectors in an HSI), the purpose of classification is to assign a unique label to each pixel vector, such that it can be represented by a given class [
3]. Although HSI is characterized by its high spectral resolution and abundant information, which promotes capturing fine details of spectral features for classification, it has been demonstrated that the original HSI contains high redundancy, and in addition there are high correlations in both the spectral and the spatial domains [
4,
5,
6]. Therefore, the analysis problem is essentially low-rank and can be represented sparsely [
7]. In this context, sparse representation (SR) has been widely exploited for HSI. SR-based classifiers (SRC) code a testing pixel over a dictionary constructed from sets of labeled samples [
8,
9]. The spectral features of pixels belonging to the same class are assumed to approximately lie in a lower-dimensional subspace and yield a relatively concentrated response under specific constraints towards the dictionary, such that the result is determined by the class with the minimum residual error of representation [
10,
11]. According to the constraints imposed on the coefficients, representation-based classification can be divided into the ℓ2-norm regularized collaborative representation-based classifier (CRC) and the ℓ1-norm regularized SRC [
12,
13]. Several SR-based approaches for HSI have been shown to provide competitive results compared with traditional methods [
14,
15]. In [
16], SR was applied to HSI and shown to be effective for classification. In [
17], SRC was integrated with manifold learning-based dimensionality reduction methods and provided promising results for the classification of HSI. In [
18], SR was fused with CR in a single classification framework based on a weighting process, and was shown to outperform either of them alone. Though SRC has shown its effectiveness for HSI classification, it uses only spectral information and ignores the inherent spatial information, which would allow for more accurate discrimination [
19,
20].
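The minimum-residual rule described above can be illustrated with a short sketch. It uses the ℓ2-regularized (CRC) variant only because that variant has a closed-form solution; the function name `crc_classify`, the default value of `lam`, and the toy spectra are assumptions made for illustration and are not taken from the cited works.

```python
import numpy as np

def crc_classify(D, labels, y, lam=1e-2):
    """Label pixel y by the class with the smallest representation residual.

    D      : (bands, n_atoms) dictionary whose columns are labeled pixel vectors
    labels : (n_atoms,) class label of each dictionary atom
    y      : (bands,) testing pixel
    lam    : l2 regularization weight (illustrative default)
    """
    n_atoms = D.shape[1]
    # Closed-form ridge solution: alpha = (D^T D + lam * I)^-1 D^T y
    alpha = np.linalg.solve(D.T @ D + lam * np.eye(n_atoms), D.T @ y)
    classes = np.unique(labels)
    # Residual of class c: reconstruct y using only the atoms of class c
    residuals = [np.linalg.norm(y - D[:, labels == c] @ alpha[labels == c])
                 for c in classes]
    return classes[int(np.argmin(residuals))]

# Toy example: two classes with clearly different mean spectra (4 bands).
rng = np.random.default_rng(0)
mean_a = np.array([1.0, 0.8, 0.2, 0.1])
mean_b = np.array([0.1, 0.3, 0.9, 1.0])
D = np.column_stack([m + 0.01 * rng.standard_normal(4)
                     for m in [mean_a] * 3 + [mean_b] * 3])
labels = np.array([0, 0, 0, 1, 1, 1])
predicted = crc_classify(D, labels, mean_b)
```

An ℓ1-regularized SRC follows the same residual rule but needs an iterative sparse solver in place of the linear solve.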
In order to incorporate the spatial information, several models have been constructed for SR-based classification, such as imposing a smoothing constraint to the formulation or adding post processing in the framework [
21,
22,
23]. Among these, joint SRC (JSRC) has attracted great interest because it represents the pixels in a small neighborhood jointly, each weighted by a different set of coefficients, and it has proven effective for HSI classification [
24,
25]. Though JSRC brings considerable improvement to SRC, SR-based approaches still suffer from instability of the sparse coefficients due to the coherence of the dictionary. To address this problem, group SRC (GSRC) has been designed to exploit the spatial coherence and the inherent structure of the dictionary based on group sparsity priors, such that a testing pixel and its neighborhood can be sparsely represented together by activating groups of atoms instead of individual atoms in the dictionary [
26,
27]. In [
28], GSRC was first introduced for HSI classification and proved reliable, with a further development of a low-rank group prior. In [
29,
30], structured sparsity priors were combined with manifold learning and subspace projection for better characterization, and achieved promising classification results. In comparison with SRC, GSRC considers spatial information through the local area. However, it neglects nonlocal spatial information, from which global spatial consistency can be further exploited for a more comprehensive representation.
Compared with local similarity, nonlocal self-similarity (NLSS) defines the spatial consistency of materials in a global distribution [
31]. It can provide references of global structure prior by exploiting the spatial similarity in nonlocal area, such that the discrimination of a pixel can be more precisely processed through the global similarity constraint [
32]. Recently, several NLSS-based approaches have been designed for application to HSI [
33,
34]. In [
35], a nonlocal means (NL-means) algorithm based on NLSS was proposed for HSI denoising, which estimates the value of pixels with weighted average of similar ones, and provides better results compared with other local smoothing filters. In [
36], NL-means was applied as a regularizer to exploit similar structures in the abundance image, and proved effective for spectral unmixing by incorporating nonlocal spatial information. In [
37], NLSS was introduced into SR to reconstruct the dictionary for separation of signal and noise, and contributed to a more concise and accurate restoration of HSI. Though these NLSS-based methods have shown the benefit of global structured priors in the spectral and spatial domains, they essentially use nonlocal spatial similarity directly, pooling spatial information from all the available regions, whereas the spectral features can still be further exploited for more accurate discrimination.
In order to further exploit the local spectral similarity through NLSS, this paper proposes a novel GSRC-based approach for HSI classification. As illustrated in
Figure 1, the proposed method considers both the nonlocal spatial self-similarity, by searching for nonlocal patches, and the local spectral similarity, by exploring the found patches. Specifically, the proposed method is implemented in three steps: (1) Nonlocal search for spatially similar patches, in which the non-overlapping patch most similar to the patch containing the current testing pixel is located in the whole scene. (2) Local search for the spectrally similar pixel, which finds, within the nonlocal patch found in step (1), the pixel most similar to the testing pixel in the spectral domain. (3) Spectral-spatial structure-based representation, in which the neighborhoods of the testing pixel and of its similar pixel from step (2) are fused and processed by a GSRC architecture. The final classification result is determined by the class with the minimum representation error. The main contributions of the proposed approach, denoted as NSLS-GSRC (nonlocal spatial and local spectral similarity based GSRC), can be summarized as follows:
Local spatial information is first considered in the proposed framework. Compared with the traditional SR-based methods, our approach represents the testing pixel with its neighborhood simultaneously, which allows for the consideration of local spatial consistency.
Nonlocal spatial information is then emphasized by our proposed method. As an important check of local spatial consistency, NLSS is integrated with local similarity to provide a global spatial constraint under local and nonlocal spatial consistency.
Local spectral information is further exploited through NLSS. Based on the exploration of local spectral similarity, our proposed NSLS-GSRC takes into account both the nonlocal spatial and the local spectral information, and contributes to a more comprehensive representation based on a group structured prior of GSRC.
The remainder of this paper is organized as follows:
Section 2 formulates the representation models adopted in this paper.
Section 3 details the proposed NSLS-GSRC approach.
Section 4 evaluates the performance of our method compared with that of other related hyperspectral image classifiers, using data sets collected by the Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) over the Indian Pines site in northwestern Indiana (USA) and the Salinas Valley in California (USA), and by the Reflective Optics Spectrographic Imaging System (ROSIS) over the University of Pavia in Italy.
Section 5 presents some concluding remarks.
3. Proposed Approach
As introduced in
Section 2.3, GSRC exploits the prior group structure of a dictionary, and integrates the spatial information from the local area for better optimization. In order to satisfy the homogeneity assumption and the local spatial consistency during the representation, GSRC usually considers a small neighborhood around the testing pixel. If this small neighborhood contains spectrally abnormal pixels (caused by the presence of the same material with different spectra), GSRC is likely to misclassify the testing pixel. However, in this work, when we consider a larger patch that contains the current testing neighborhood, a locally spectrally similar area can be found in a nonlocal spatially similar patch of the original scene. This new area strengthens the constraints on the problematic testing neighborhood during the representation and yields an improved response. Therefore, it is reasonable to exploit the nonlocal spatial and local spectral similarities of the image as a supplement for the discrimination of materials of interest in spectral-spatial frameworks.
As illustrated in
Figure 1, consider the S1 × S1 patch centered on the testing pixel. The proposed NSLS-GSRC first searches the original scene for the non-overlapping patch that is spatially most similar to this patch. Then, the pixel that is spectrally most similar to the testing pixel is located in the found nonlocal patch. After that, two S2 × S2 neighborhoods, centered on the testing pixel and on its spectrally similar pixel, are extracted from the original scene. After a fusion process, the new testing matrix is sparsely represented by GSRC. The final class label of the testing pixel is determined by the minimum representation error with respect to the group-structured dictionary and the coefficient matrix. The details of this process are presented in the following subsections.
3.1. Nonlocal Spatial and Local Spectral Similarity (NSLS)
In order to search for a nonlocal patch for every pixel in the image, the original scene first needs to be expanded so that an S1 × S1 patch can be centered on each of its pixels. Patches of size S1 × S1 are then extracted, starting from the upper left corner of the expanded image with a step of one pixel, such that the set of patches centered on every pixel of the original scene is obtained. The first step of the proposed NSLS is to find the nonlocal patch that is spatially most similar to the patch centered on the testing pixel. This process follows two principles: (1) the selected patch should be the most similar one, at the patch level, to the testing patch within the search area; (2) the two patches cannot overlap, i.e., the distance between either the abscissas or the ordinates of their center pixels should be larger than the side length of the extracted patch. The determination of the nonlocal patch (Equation (6)) is therefore a constrained minimization: the patch-level distance is computed from the Euclidean distances between the pixel vectors of the two patches, and the constraint on the coordinates of the center pixels corresponds to the second principle listed above. After the nonlocal patch is obtained, the pixel in it that is spectrally most similar to the testing pixel is located (Equation (7)) by minimizing the Euclidean distance between the two pixel vectors, computed from their values in each band. Since the similarity between the two patches is measured at the patch level, the pixel obtained here might not be located at the center of the nonlocal patch (as illustrated in Figure 1), i.e., the spectrally most similar pixel and the center pixel of the nonlocal patch are not necessarily the same pixel. Therefore, compared with other related NLSS-based methods, the innovation of the proposed NSLS-GSRC approach is the further exploration of local spectral similarity through nonlocal spatial similarity [32, 33].
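The two searches described in this subsection can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the image is expanded by mirror padding, the patch-level distance is taken as the sum of Euclidean distances between pixel vectors at corresponding positions, and the names `nsls_search`, `img` and `s1` are hypothetical. The non-overlap test follows the second principle stated above.

```python
import numpy as np

def nsls_search(img, row, col, s1):
    """Return (center of the nonlocal patch, spectrally most similar pixel).

    img      : (height, width, bands) hyperspectral cube
    row, col : position of the testing pixel
    s1       : odd side length of the search patch
    """
    h, w, _ = img.shape
    half = s1 // 2
    # Assumed expansion: mirror-pad so a patch can be centered on every pixel.
    padded = np.pad(img, ((half, half), (half, half), (0, 0)), mode="reflect")
    ref = padded[row:row + s1, col:col + s1]
    best_dist, best_center = np.inf, None
    for r in range(h):
        for c in range(w):
            # Keep a candidate only if its center is more than s1 away
            # from the testing pixel along the rows or along the columns.
            if abs(r - row) <= s1 and abs(c - col) <= s1:
                continue
            cand = padded[r:r + s1, c:c + s1]
            dist = np.sum(np.linalg.norm(cand - ref, axis=2))
            if dist < best_dist:
                best_dist, best_center = dist, (r, c)
    if best_center is None:
        raise ValueError("image too small for a non-overlapping patch")
    # Local spectral search, restricted to the part of the patch inside the image.
    r0, c0 = best_center
    rows = np.arange(max(r0 - half, 0), min(r0 + half, h - 1) + 1)
    cols = np.arange(max(c0 - half, 0), min(c0 + half, w - 1) + 1)
    block = img[np.ix_(rows, cols)]
    d = np.linalg.norm(block - img[row, col], axis=2)
    i, j = np.unravel_index(np.argmin(d), d.shape)
    return best_center, (int(rows[i]), int(cols[j]))

# Toy scene: copy the 3 x 3 patch around (2, 2) to the area around (8, 8).
rng = np.random.default_rng(0)
img = rng.random((12, 12, 3))
img[7:10, 7:10] = img[1:4, 1:4]
patch_center, similar_pixel = nsls_search(img, 2, 2, s1=3)
```

The exhaustive double loop is kept for readability; it visits every pixel of the scene for each testing pixel, so a practical implementation would need a faster search.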
3.2. NSLS-GSRC
Following the formulation described in the previous sections, the pixel found in Section 3.1 can be considered the local spectral pixel, within the nonlocal spatially similar patch, that is most similar to the current testing pixel. Then, an S2 × S2 neighborhood centered on this pixel is extracted and transformed into a two-dimensional matrix, in the same way as the neighborhood of the testing pixel. After that, the two matrices are fused by a pixel-by-pixel averaging process. The fusion result is then processed in the representation-based framework by Group Lasso optimization, which gives the objective function of the proposed NSLS-GSRC method (Equation (8)). The final class label of the testing pixel is determined by the minimum total residual error (Equation (9)), where the coefficients of each class are isolated by zeroing out all the elements of the coefficient matrix that do not belong to that class. The pseudo code for the proposed NSLS-GSRC method is shown in Algorithm 1.
Algorithm 1. The proposed NSLS-GSRC method
Input: An HSI, a dictionary constructed in class order from randomly selected labeled samples, a testing pixel, and the patch-size parameters S1 and S2
Step 1: Extract the S1 × S1 patch centered on the testing pixel from the HSI;
Step 2: Search for the nonlocal spatially similar patch with respect to this patch according to Equation (6);
Step 3: Search for the local spectrally similar pixel in the nonlocal patch with respect to the current testing pixel according to Equation (7);
Step 4: Extract two S2 × S2 patches centered on the testing pixel and on its spectrally similar pixel, and transform them into two-dimensional matrices;
Step 5: Obtain the fusion matrix of the two matrices, and represent it using GSRC to obtain the coefficient matrix according to Equation (8);
Step 6: Compute the minimum total residual error and identify the class label of the testing pixel according to Equation (9);
Output: The class label of the testing pixel.
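Steps 5 and 6 of Algorithm 1 can be sketched as below. The objective used here, 0.5·||X − DA||_F² + λ·Σ_g ||A_g||_F with all group weights equal to 1, is an assumed standard Group Lasso form, and the proximal-gradient solver is a stand-in chosen for brevity rather than the solver used by the authors. The names `group_lasso` and `nsls_gsrc_label` and the toy data are hypothetical.

```python
import numpy as np

def group_lasso(D, X, groups, lam=1e-2, n_iter=500):
    """Proximal-gradient sketch for
    min_A 0.5 * ||X - D A||_F^2 + lam * sum_g ||A_g||_F  (group weights = 1)."""
    A = np.zeros((D.shape[1], X.shape[1]))
    step = 1.0 / np.linalg.norm(D, 2) ** 2  # 1 / Lipschitz constant of the gradient
    for _ in range(n_iter):
        Z = A - step * (D.T @ (D @ A - X))  # gradient step on the data term
        for g in np.unique(groups):
            rows = groups == g
            norm = np.linalg.norm(Z[rows])
            # Group soft-thresholding: shrink or zero the whole block of class g
            Z[rows] *= max(0.0, 1.0 - step * lam / norm) if norm > 0 else 0.0
        A = Z
    return A

def nsls_gsrc_label(D, groups, X_test, X_similar, lam=1e-2):
    """Fuse the two neighborhoods, code them jointly, pick the min-residual class."""
    X = 0.5 * (X_test + X_similar)  # pixel-by-pixel average fusion
    A = group_lasso(D, X, groups, lam)
    classes = np.unique(groups)
    # Total residual of class c: keep only the coefficient rows of class c
    residuals = [np.linalg.norm(X - D[:, groups == c] @ A[groups == c])
                 for c in classes]
    return classes[int(np.argmin(residuals))]

# Toy data: 4 bands, 3 atoms per class, 3 x 3 neighborhoods (9 pixels each).
rng = np.random.default_rng(0)
mean_a = np.array([1.0, 0.8, 0.2, 0.1])
mean_b = np.array([0.1, 0.3, 0.9, 1.0])
D = np.column_stack([m + 0.01 * rng.standard_normal(4)
                     for m in [mean_a] * 3 + [mean_b] * 3])
groups = np.array([0, 0, 0, 1, 1, 1])
X_test = mean_b[:, None] + 0.05 * rng.standard_normal((4, 9))
X_similar = mean_b[:, None] + 0.05 * rng.standard_normal((4, 9))
label = nsls_gsrc_label(D, groups, X_test, X_similar)
```

Each column of `X_test` and `X_similar` is one pixel of an S2 × S2 neighborhood, which is the two-dimensional matrix form produced in Step 4.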
4. Experimental Results
In this section, the proposed NSLS-GSRC method is evaluated using three widely used hyperspectral data sets. The first one is the Indian Pines scene collected by the Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) (NASA Jet Propulsion Laboratory, Los Angeles, CA, USA), with spectral coverage ranging from 0.25 to 2.4 μm and geometric resolution of 20 m per pixel. The scene contains 145 × 145 pixels, with 220 spectral bands. The ground reference contains sixteen classes, which are associated with different kinds of crops. The second scene is the Salinas scene recorded by the AVIRIS sensor. The spatial resolution of this image is 3.7 m per pixel. The scene contains 512 × 217 pixels, with 224 spectral bands ranging from 0.9 to 1.4 μm. The ground reference contains sixteen ground-truth classes. The last scene is the University of Pavia scene captured by the Reflective Optics Spectrographic Imaging System (ROSIS) (DLR Institute of Optoelectronics, Berlin, Germany), with spectral coverage from 0.43 to 0.86 μm and geometric resolution of 1.3 m per pixel. The scene consists of 610 × 340 pixels, with 103 spectral bands and nine ground-truth classes. For comparative purposes, several competing spectral and spectral-spatial classifiers are considered in the experiments, including SVM [
43], SVM based Markov Random Field (SVM-MRF) [
44], SRC [
16], CRC [
17], JSRC [
20], GSRC [
29]. In addition, an NL-means based SRC (NL-SRC), implemented similarly to the approach in [
35], is included for comparison with the proposed NSLS-GSRC method. We conduct 20 Monte Carlo runs, each with a different random selection of labeled samples, and report the corresponding results.
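The evaluation protocol described above (a fixed number of randomly selected labeled samples per class, repeated over 20 runs) can be sketched as follows. The nearest-class-mean classifier is only a stand-in so that the example runs on its own; it is not one of the compared methods, and the function names and the synthetic data are hypothetical.

```python
import numpy as np

def split_per_class(labels, n_per_class, rng):
    """Pick n_per_class training indices per class; the rest are used for testing."""
    train = np.concatenate([
        rng.choice(np.flatnonzero(labels == c), size=n_per_class, replace=False)
        for c in np.unique(labels)
    ])
    test = np.setdiff1d(np.arange(labels.size), train)
    return train, test

def nearest_mean(train_x, train_y, test_x):
    """Stand-in classifier (not from the paper): assign the nearest class mean."""
    classes = np.unique(train_y)
    means = np.stack([train_x[train_y == c].mean(axis=0) for c in classes])
    dists = np.linalg.norm(test_x[:, None, :] - means[None, :, :], axis=2)
    return classes[np.argmin(dists, axis=1)]

def monte_carlo_oa(pixels, labels, classify, n_per_class, n_runs=20, seed=0):
    """Mean and standard deviation of the overall accuracy over random splits."""
    rng = np.random.default_rng(seed)
    scores = []
    for _ in range(n_runs):
        train, test = split_per_class(labels, n_per_class, rng)
        pred = classify(pixels[train], labels[train], pixels[test])
        scores.append(np.mean(pred == labels[test]))
    return float(np.mean(scores)), float(np.std(scores))

# Synthetic labeled pixels: two well-separated classes, 4 bands, 100 pixels each.
data_rng = np.random.default_rng(1)
pixels = np.vstack([data_rng.normal(0.0, 0.1, (100, 4)),
                    data_rng.normal(1.0, 0.1, (100, 4))])
labels = np.repeat([0, 1], 100)
mean_oa, std_oa = monte_carlo_oa(pixels, labels, nearest_mean, n_per_class=10)
```

Only labeled pixels should be passed in, so that unlabeled background does not enter either the training or the testing set.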
4.1. Parameter Settings
In the experiments of this paper, the regularization parameter $\lambda$ for the representation-based methods ranges from $10^{-3}$ to $10^{-1}$, and $\omega_g$ for the group sparse representation-based methods is set to 1 because the same number of labeled samples is randomly selected per class in the experiments. The parameters of the other methods have been optimized by means of fivefold cross-validation according to the procedures provided in the literature [
16,
17,
20,
29,
43,
44]. In particular, the values of
S1 and
S2 are tested extensively and illustrated in
Figure 2.
As shown in
Figure 2, with a fixed number of labeled samples, the overall classification accuracy is relatively stable with respect to S1, but clearly decreases as S2 grows. This is because S2 defines the size of the neighborhood in the group sparse representation, and it needs to be kept relatively small in order to satisfy the homogeneity assumption and the local spatial consistency. In contrast, S1 determines the size of the search patch used for the nonlocal spatial and local spectral similarity, and it is allowed to vary over a larger range than S2 due to the global structure priors. According to the results of
Figure 2, we adopt
and
for the Indian Pines scene, as well as
and
for the Salinas scene. Finally, for the University of Pavia scene, S1 and S2 are set to 5 and 3, respectively.
4.2. Experiments with the AVIRIS Indian Pines Scene
In the experiment with the Indian Pines scene, eight mutually exclusive classes with a total of 8624 labeled samples are selected from the reference data, in order to exclude classes with very few labeled samples and to satisfy the sparsity requirement of the representation.
Figure 3 shows the false-color composite of the image and the reference map. Our first test randomly selects 50 labeled samples per class, with a total of 400 samples (approximately 4.6% of the labeled samples), for training and dictionary construction, while the remaining samples are used for validation.
Table 1 shows the overall and individual classification results of different testing methods.
Figure 4 shows the classification maps obtained by the different testing methods. Several conclusions can be drawn.
Compared with SVM, both SRC and CRC provide competitive individual classification accuracies and a slightly better overall classification result. This indicates that representation-based methods can indeed provide results competitive with those of traditional models for HSI classification, and it supports the sparse representation framework exploited in our proposed method.
Compared with SVM and SRC, SVM-MRF and JSRC achieve higher overall classification accuracies and provide more homogeneous classification maps, demonstrating that the incorporation of local spatial information can bring improvement to the classifiers in the spectral domain. The improvement supports the homogeneity assumption and the local spatial consistency in the spectral-spatial framework for the classification of HSI.
Compared with SVM-MRF and JSRC, GSRC achieves better classification results which proves that group structured priors contribute to a more comprehensive integration of spectral and local spatial information. The improvements of GSRC over JSRC indicate the superiority of group sparsity framework and the Group Lasso optimization.
Compared with SRC, NL-SRC brings improvement in classification result that is similar to GSRC. On one hand, it proves that the incorporation of nonlocal spatial information is effective for the classifiers in the spectral domain. On the other hand, it also indicates that both the nonlocal spatial similarity and the local spatial consistency improve the sparse representation framework for the classification of HSI.
The proposed NSLS-GSRC outperforms GSRC and NL-SRC, which firstly demonstrates that the integration of both nonlocal and local spatial information contributes to a more comprehensive consideration of structured priors compared with either of them. In addition, it also indicates that the exploration of local spectral similarity through nonlocal spatial similarity provides more effective means for the discrimination of materials in spectral-spatial frameworks. Furthermore, it proves that the combination of global structured priors and group structured priors, i.e., NSLS and GSRC in our proposed method, can bring significant improvement for the classification of HSI.
In this case, the proposed NSLS-GSRC obtains an overall accuracy of 90.54%, which is 5.5% and 3.7% higher than NL-SRC and GSRC, respectively, and 12.36% higher than SRC. For individual class accuracy, it also provides good results, especially for classes 1 and 5. The classification maps in
Figure 4 confirm the improvement achieved by the proposed method.
In the second test with the Indian Pines scene, the proposed NSLS-GSRC is compared with other test methods using a different number of labeled samples (from 10 to 50 samples per class).
Figure 5 and
Table 2 show the overall classification accuracies obtained by the different testing methods, as a function of the number of labeled samples used. As shown by
Figure 5 and
Table 2, several conclusions can be drawn:
Overall classification accuracies are generally positively correlated with the number of labeled samples selected for training and dictionary construction. The improvement is most evident when only a limited number of labeled samples is available, as can be seen from the trend of the overall classification accuracies in
Figure 5.
The integration of spatial information contributes to more accurate discrimination of materials. In particular, local and nonlocal spatial information both contribute to better characterizing the image in the spectral-spatial domain, as can be seen from the improvement in overall classification accuracy of SVM-MRF, JSRC, GSRC and NL-SRC over their original counterparts, i.e., SVM and SRC.
The combination of local and nonlocal spatial information contributes to a more comprehensive consideration of global structured priors. This is especially true for the proposed method: with a further exploration of local spectral similarity through nonlocal spatial similarity, NSLS-GSRC brings a reliable and stable improvement in classification in comparison with other methods that use either local or nonlocal spatial information only.
4.3. Experiments with the AVIRIS Salinas Scene
The Salinas scene used in our second experiment was recorded by the AVIRIS sensor over the Salinas Valley, California.
Figure 6 shows the false-color composite of the image and the reference map which contains a total of 54,129 labeled samples. We first randomly select 20 labeled samples per class with a total of 320 samples (which represents approximately 0.6% of the labeled samples) for training, where the remaining samples are used for testing. The classification results and maps obtained by different comparison methods are provided in
Table 3 and
Figure 7.
As shown in
Figure 7 and
Table 3, SRC and CRC provide results comparable with those of SVM in the spectral domain. In the spatial domain, GSRC, JSRC and SVM-MRF bring significant improvement relative to SRC and SVM by integrating local spatial consistency. NL-SRC also achieves a clear increase in classification accuracy relative to SRC by considering nonlocal spatial self-similarity. Finally, the proposed NSLS-GSRC obtains the best classification result with an overall accuracy of 91.06%, which is 2.44% and 2.95% higher than NL-SRC and GSRC, respectively, and 9.57% higher than SRC. In addition, it brings considerable improvements in individual class accuracy, especially for classes 14 and 15, which can be observed from the classification map illustrated in
Figure 7.
Our second test with the Salinas scene evaluates the proposed NSLS-GSRC method with a varying number of labeled samples (from 10 to 50 samples per class).
Figure 8 and
Table 4 show the overall classification accuracies obtained by different testing methods, as a function of the number of labeled samples adopted for training and dictionary construction. As shown in
Table 4 and
Figure 8, the improvement of JSRC and GSRC relative to SRC proves the effectiveness of integration of local spatial information with consideration of spatial coherence. The effectiveness of nonlocal spatial information is demonstrated by the increase of overall accuracies acquired by NL-SRC in comparison with SRC. The superiority of local spectral similarity through NSLS is confirmed by the best overall classification accuracies obtained by the proposed NSLS-GSRC method in all cases, which allows for an overall consideration of local and nonlocal spatial information.
4.4. Experiments with the ROSIS University of Pavia Scene
In the experiment with the ROSIS University of Pavia scene, a
pixel-size patch with a total of 7398 labeled samples from nine classes is extracted from the original scene with consideration of time efficiency.
Figure 9 shows the false-color composite image of the extracted region and the corresponding reference map.
Our first test randomly selects 30 labeled samples per class, with a total of 270 samples (approximately 3.6% of the labeled samples), for training and dictionary construction, while the remaining samples are used for validation.
Table 5 reports the overall and class-specific accuracies of the different methods, and
Figure 10 shows the corresponding classification maps for this case.
As shown in
Figure 10 and
Table 5, SVM provides results similar to those of SRC and CRC in the spectral domain. The methods based on local spatial consistency and structured priors generally improve on their original counterparts; for instance, GSRC outperforms SRC. The methods based on nonlocal spatial similarity support the benefit of considering global structure priors; for instance, NL-SRC outperforms SRC. Furthermore, the proposed method brings a better characterization of spatial and spectral information based on nonlocal spatial and local spectral similarity, and achieves a more comprehensive discrimination of materials. In this case, the proposed NSLS-GSRC obtains an overall accuracy of 93.27%, which is 2.96% and 2.56% higher than NL-SRC and GSRC, respectively, and 6.69% higher than SRC. In addition, the proposed NSLS-GSRC provides reliable individual classification accuracy for each class, especially for classes 2, 7 and 8. This can also be seen from the more homogeneous classification map obtained by NSLS-GSRC in
Figure 10, which confirms the improvement.
Our second test with the University of Pavia scene evaluates the proposed NSLS-GSRC with a varying number of labeled samples (from 10 to 50 samples per class).
Figure 11 and
Table 6 show the overall classification accuracies obtained by the different methods tested, as a function of the number of labeled samples used for training and dictionary construction. Though CRC obtains relatively poor results in this case, SRC provides overall classification accuracies competitive with those of SVM in the spectral domain as the number of training samples increases. In the spatial domain, stable increases are obtained by considering local spatial consistency, as can be seen by comparing SVM-MRF with SVM, and JSRC/GSRC with SRC. Improvements are also brought by the combination of nonlocal spatial self-similarity and spectral-domain SR, which is indicated by the performance of NL-SRC versus GSRC, especially when the number of training samples is limited. In the spectral-spatial domain, the proposed NSLS-GSRC method achieves the best results in all cases in comparison with the other related methods, which demonstrates that the integration of nonlocal spatial and local spectral information achieves a more comprehensive discrimination of materials.
Summarizing the results and analysis of the three experiments above, the proposed NSLS-GSRC method obtains good overall and individual classification accuracies, and provides more homogeneous classification maps than the other related methods. The classification results with a varying number of training samples further verify the stability of the proposed method. In general, we would like to emphasize that the proposed NSLS-GSRC, based on the integration of nonlocal spatial and local spectral similarity, is reliable and stable for the classification of HSI.
5. Conclusions
Hyperspectral images are characterized by their abundant spectral and spatial information. Considering the high redundancy and correlation among spectral bands, it has been demonstrated that the inherent sparsity can be exploited to discriminate the materials under examination more accurately. In this context, SR-based methods have shown their effectiveness by representing a pixel as a linear combination of labeled samples, and have achieved a certain success in the spectral domain. In order to better characterize the image for classification, research has focused on two major ways of incorporating the spatial information of the image. In the local spatial domain, JSRC and GSRC have been designed to simultaneously represent the pixels in a neighborhood based on the local spatial coherence. In the nonlocal spatial domain, NLSS has been used to measure the similarity of pixels based on nonlocal structured priors. For a more comprehensive classification, a new framework should allow the exploration of spectral similarity through NLSS and the combination of nonlocal and local spatial information in the spectral-spatial domain.
In this paper, we proposed a new classification framework that exploits nonlocal spatial and local spectral similarity based on group sparse representation for hyperspectral images. The main contribution of the proposed method, abbreviated as NSLS-GSRC, includes the further exploitation of spectral similarity through nonlocal spatial self-similarity, and its incorporation into a group structure-based sparse representation that relies on local spatial consistency. Experiments on three real hyperspectral data sets demonstrate that the proposed NSLS-GSRC outperforms other related methods in terms of classification performance.