An Endmember Bundle Extraction Method Based on Multiscale Sampling to Address Spectral Variability for Hyperspectral Unmixing

Ye, Chuanlong; Liu, Shanwei; Xu, Mingming; Du, Bo; Wan, Jianhua; Sheng, Hui

doi:10.3390/rs13193941


by Chuanlong Ye ¹, Shanwei Liu ¹,*, Mingming Xu ¹, Bo Du ², Jianhua Wan ¹ and Hui Sheng ¹

¹ College of Oceanography and Space Informatics, China University of Petroleum (East China), Qingdao 266580, China

² National Engineering Research Center for Multimedia Software, Institute of Artificial Intelligence, School of Computer Science, Wuhan University, Wuhan 430072, China

* Author to whom correspondence should be addressed.

Remote Sens. 2021, 13(19), 3941; https://doi.org/10.3390/rs13193941

Submission received: 25 August 2021 / Revised: 21 September 2021 / Accepted: 28 September 2021 / Published: 1 October 2021

(This article belongs to the Special Issue Recent Advances in Hyperspectral Image Processing)


Abstract

With the improvement of the spatial resolution of hyperspectral remote sensing images, the influence of spectral variability on hyperspectral unmixing is becoming increasingly apparent, revealing the shortcomings of endmember extraction methods that use a single spectrum to represent one type of material. To address spectral variability in hyperspectral unmixing, a multiscale resampling endmember bundle extraction (MSREBE) method is proposed in this paper. The proposed method has four steps: (1) boundary detection; (2) sub-images in multiscale generation; (3) endmember extraction from each sub-image; (4) stepwise most similar collection (SMSC) clustering. The SMSC clustering method is aimed at determining which endmember bundle each extracted endmember belongs to. Experiments carried out on both a simulated dataset and real hyperspectral datasets show that the endmembers extracted by the proposed method are superior to those extracted by the compared methods, and the optimal results in abundance estimation are maintained.

Keywords:

spectral variability; endmember bundle; spectral clustering


1. Introduction

Hyperspectral images have been widely used in various fields such as classification [1,2,3], target detection [4,5], and quantitative inversion [6,7,8] because their narrow, continuous spectral bands [9] provide a large amount of spectral information for each pixel. Regardless of the spatial resolution, mixed pixels are widespread in remote sensing images and are a main factor limiting the accuracy of traditional pixel-level remote sensing applications. To improve the precision of remote sensing applications, the problem of mixed pixels must be solved. The main task of unmixing is to decompose the mixed pixels into pure materials (called endmembers) and their corresponding proportions (called abundances).

At present, the common methods for endmember extraction use a single spectrum to represent one class of material. They fall into two categories: (1) when there are pure pixels in the image, the main idea of endmember extraction is to search for pixels that form a convex simplex with maximum volume or have the largest projections onto some vectors. Representative algorithms include pixel purity index (PPI) [10], N-FINDR [11], vertex component analysis (VCA) [12], orthogonal subspace projection (OSP) [13], etc.; (2) when the image does not contain pure pixels, the relevant algorithms seek a convex simplex with minimum volume that contains all the pixel points. Methods in this category include iterative-constrained endmembers (ICE) [14], minimum volume-constrained nonnegative matrix factorization (MVC-NMF) [15], and minimum volume simplex analysis (MVSA) [16]. However, with the improvement of the spatial resolution of remote sensing images, spectral variability caused by various factors, including the acquisition environment [17], illumination [18], and the materials per se [19], becomes increasingly severe in hyperspectral images, especially in scenes where various features are concentrated, such as wetlands and forests. In this case, a single spectrum is insufficient to represent one class of features. Therefore, most of the aforementioned extraction methods are not applicable to hyperspectral images with high spectral variability. Previous studies [20,21] have indicated that methods which ignore spectral variability can lead to poor abundance estimates.

Multiple endmember spectral mixture analysis (MESMA) [22] was proposed to solve the problem of spectral variability. MESMA, however, requires not only a known spectral library but also substantial computing costs when many spectra need to be tested. In addition, some scholars have proposed methods to address spectral variability without a known spectral library, including parametric endmember models [23,24], Bayesian methods [25,26], and endmember bundle extraction methods. Among them, parametric models and Bayesian methods require a large number of parameters to be set [27], whereas endmember bundle extraction methods can largely make up for the above shortcomings. The concept of a “bundle” [28] means that the endmembers of each feature are represented by a collection containing multiple spectra. Many algorithms [29,30,31,32,33] for endmember bundle extraction have been proposed. Many of them extract endmembers from each subregion of the original image and then cluster the extracted endmembers to obtain endmember bundles. However, such procedures may extract mixed pixels as candidate endmembers when a subregion contains no pure pixels. To address this issue, scholars adopted PPI [34] and a homogeneity index (HI) [35] to ensure the purity of candidate endmembers. At present, endmember bundle extraction is still in its infancy, and the field needs additional research.

Compared with endmember extraction methods, endmember bundle extraction methods require an additional step of clustering to group the extracted endmembers. Currently, most endmember bundle extraction algorithms group extracted endmembers [30,35] using the K-means clustering method. Then, the clustering results are compared with ground truth as a priori knowledge to identify which material each endmember bundle belongs to. K-means clustering has a better performance in the datasets with more interclass variability than intraclass variability. However, due to the spectral variability, K-means clustering struggles to accurately complete the clustering of a large number of endmembers.

To further improve the accuracy of hyperspectral unmixing, it is necessary to develop a new method to address the above problems. The main contributions of this paper are as follows: (1) in order to address the spectral variability of hyperspectral images, a multiscale resampling endmember bundle extraction (MSREBE) algorithm is proposed. Candidate endmembers are extracted from sections of each sub-image generated from the original image at different sampling scales. Pure pixels that are selected as candidate endmembers at multiple sampling scales are retained as endmembers. Afterward, the endmembers are clustered into endmember bundles for each material; (2) the stepwise most similar collection (SMSC) clustering method is proposed. The extracted endmembers are clustered with their most similar endmembers step by step according to prior knowledge. SMSC partitions the spectral variability of all extracted endmembers at each step, which can reduce the influence of spectral variability on the clustering.

The remainder of this paper is organized as follows: Section 2 describes four state-of-the-art algorithms that are compared with the proposed algorithm. The proposed algorithm is presented in Section 3. Section 4 presents four experiments on a simulated dataset and real hyperspectral datasets and discusses the results. Section 5 concludes this paper.

2. Related Work

In this section, four typical algorithms, namely, VCA, image-based endmember bundle extraction (EBE), spatial and spectral feature-based EBE (SSEBE) and archetypal analysis EBE (AAEBE), are introduced. These methods are taken for comparison when conducting experiments on both a synthetic dataset and real datasets.

2.1. VCA

VCA is a typical endmember extraction algorithm based on convex geometry theory [12]; the main idea of this method is that projection of the data cloud formed on the hyperplane is a convex simplex whose vertices are endmembers. By projecting all pixels onto a random vector, the first endmember is the pixel with the largest projection. Then, the remaining endmembers are iteratively extracted by projecting the remaining pixels onto the orthogonal direction of the subspace, which is composed of the extracted endmembers. The pixel corresponding to the limit projection is taken as the next endmember. Suppose the collection composed by extracted endmembers is expressed as $\{e_j\}_{j=1}^{k}$ and the corresponding matrix is described as $E_k$. Then, the $(k+1)$-th endmember $e_{k+1}$ can be extracted via Equation (1).

$$e_{k+1} = \arg\max_{r_i,\; i = 1, \dots, n} \left| w_k^{\perp} r_i \right|, \tag{1}$$

where $r_i$ represents the remaining pixels, and $w_k^{\perp}$ is one random vector of $E_k^{\perp}$, which is calculated via Equation (2).

$$w_k^{\perp} = \frac{E_k^{\perp} \xi}{\| E_k^{\perp} \xi \|_2}, \tag{2}$$

where $\xi$ is an independent identically distributed zero mean Gaussian vector, i.e., $\xi \sim N(0, I_k)$. $E_k^{\perp}$ is the orthogonal projection matrix of $E_k$, which is calculated via Equation (3).

$$E_k^{\perp} = I_k - E_k (E_k^{T} E_k)^{-1} E_k^{T}. \tag{3}$$
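As an illustration of Equations (1)–(3), the following minimal sketch performs a single VCA selection step in plain Python. It is not the reference implementation of [12]: instead of forming the matrix inverse in Equation (3), it removes the span of the extracted endmembers by Gram–Schmidt orthonormalization, which yields the same projection when the endmembers are linearly independent. The function names (`orthonormal_basis`, `project_out`, `next_endmember`) and the representation of spectra as Python lists are assumptions made for this example.

```python
import math
import random

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def orthonormal_basis(endmembers, tol=1e-12):
    """Gram-Schmidt basis for the span of the extracted endmembers (columns of E_k)."""
    basis = []
    for e in endmembers:
        v = list(e)
        for b in basis:
            c = dot(v, b)
            v = [vi - c * bi for vi, bi in zip(v, b)]
        norm = math.sqrt(dot(v, v))
        if norm > tol:  # skip endmembers that add no new direction
            basis.append([vi / norm for vi in v])
    return basis

def project_out(x, basis):
    """Apply the orthogonal projection of Equation (3): drop the part of x in span(E_k)."""
    v = list(x)
    for b in basis:
        c = dot(v, b)
        v = [vi - c * bi for vi, bi in zip(v, b)]
    return v

def next_endmember(pixels, endmembers, rng=random):
    """Index of the pixel with the largest absolute projection, as in Equation (1)."""
    basis = orthonormal_basis(endmembers)
    xi = [rng.gauss(0.0, 1.0) for _ in range(len(pixels[0]))]  # zero-mean Gaussian vector
    w = project_out(xi, basis)                                 # numerator of Equation (2)
    norm = math.sqrt(dot(w, w))  # assumes the endmembers do not already span the whole space
    w = [wi / norm for wi in w]
    scores = [abs(dot(w, r)) for r in pixels]
    return max(range(len(pixels)), key=scores.__getitem__)
```

Because the direction `w` is orthogonal to every endmember already extracted, pixels that are scaled copies of those endmembers score zero, so the step favors pixels carrying a new spectral direction.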

2.2. EBE

The principle behind EBE is to carry out an endmember extraction algorithm in every subset [30]. The specific process is as follows: subsets are generated by sampling randomly using the pixels from the original hyperspectral images. The main assumption for adopting this random strategy in the selection of pixels is that a smaller proportion of image pixels can be used to approximate the statistics of the original image. That is, if there are numerous pure pixels for each endmember in the scene, the image subsets generated by random sampling will also have pure pixels. Then, pure pixels in each subset can be extracted by the endmember extraction methods. Subsequently, the spectral set will consist of spectral signatures from different features after all subsets are analyzed. Finally, the spectral set can be clustered into some categories corresponding to each ground component by K-means clustering with the Euclidean distance as a similarity measure. K as prior knowledge in this clustering method represents the number of clustering collections, which is determined on the basis of specific experimental needs.

2.3. SSEBE

SSEBE was proposed to extract endmember bundles using both spatial and spectral information [35]. The steps of this method are as follows: (1) to reduce the computational complexity, the PPI algorithm is used for coarse screening of the original hyperspectral image. All pixels of the hyperspectral image are projected onto a multitude of skewers. When the frequency with which a pixel appears as the maximum or the minimum of the projections exceeds the given threshold, the pixel is considered a candidate endmember; (2) according to the idea that pure pixels are generally located in spatially homogeneous areas, SSEBE calculates the HI, an index that indicates whether a pixel lies in a homogeneous region, between each pixel and its adjacent pixels, with spectral information divergence (SID) as the measure. A smaller HI denotes greater similarity between the pixel and its adjacent pixels and a greater likelihood of being in a homogeneous region; (3) the HI threshold of every subregion is adjusted adaptively according to the proportion of selected candidate endmembers. The principle for setting the threshold is that the candidate endmembers account for no more than 2–5% of the whole dataset. Then, candidate endmembers whose HI is smaller than the threshold are selected to make up the spectral set; (4) the spectral set is clustered using the K-means clustering method, with the initial clustering centers obtained by OSP.

2.4. AAEBE

AAEBE applies archetypal analysis, which was originally designed for machine learning problems, to endmember bundle extraction [34]. This method considers the similarity of intra-endmembers and represents each endmember by a few typical spectra to reduce the computation required for unmixing. The main process of the method is as follows: (1) similarly to the first step in SSEBE, PPI is used to extract candidate endmembers; (2) archetypal analysis is used to extract the first-level endmembers among the candidate endmembers extracted in step (1); (3) the candidate endmembers are clustered into several collections on the basis of the first-level endmembers, and then a second archetypal analysis is applied to each collection in turn to obtain the pure pixels of each endmember. After all the above steps, the endmember bundles corresponding to each material are extracted.

3. Multiscale Resampling Endmember Bundle Extraction (MSREBE)

The proposed method consists of four steps: (1) boundary detection, (2) sub-images in multiscale generation, (3) endmember extraction from each sub-image, and (4) stepwise most similar collection clustering. A brief illustration of the method is shown in Figure 1.

3.1. Boundary Detection

The pixels in the hyperspectral image can be divided into pure pixels and mixed pixels. Mixed pixels contain the spectral information of multiple materials. The boundary pixels usually occur at the intersection of two or more features, which are more likely to be mixed pixels. In order to reduce the probability that mixed pixels are selected as endmembers, boundary pixels and their four neighborhood pixels are deleted before extracting endmembers.

The first principal component obtained from the hyperspectral image by principal component analysis (PCA) [36] contains the primary information of the image. In this paper, the boundary pixels are detected on the basis of the first principal component of the hyperspectral image. The Canny edge detector, a classical edge detection operator [37], is used to detect the boundary. This step consists of two operations: (1) carry out PCA of the hyperspectral image and obtain the first principal component; (2) use the Canny detector to identify boundary pixels in the first principal component and label the locations of the boundary pixels and their four neighborhood pixels.
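The labeling in operation (2) can be sketched as follows. This example assumes that a binary edge map (a list of rows of booleans) has already been produced by an edge detector applied to the first principal component; the PCA and Canny computations themselves are not shown, and the function name `expand_to_four_neighbors` is hypothetical.

```python
def expand_to_four_neighbors(edge_map):
    """Mark every boundary pixel together with its up, down, left and right neighbors."""
    rows, cols = len(edge_map), len(edge_map[0])
    mask = [[False] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            if edge_map[r][c]:
                for dr, dc in ((0, 0), (-1, 0), (1, 0), (0, -1), (0, 1)):
                    rr, cc = r + dr, c + dc
                    if 0 <= rr < rows and 0 <= cc < cols:  # stay inside the image
                        mask[rr][cc] = True
    return mask
```

Pixels where the returned mask is `True` would then be excluded before endmember extraction.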

3.2. Sub-Images in Multiscale Generation

It is difficult to extract endmember bundles from a hyperspectral image with high spectral variability. In this paper, we extracted endmembers in sub-images with relatively low spectral variability generated by sampling original hyperspectral image. The adjacent pixels in the original image were assigned to different sub-images. Figure 2 shows the generation process of sub-images at a sampling scale of 2. According to the above operation, sub-images have similar feature distributions to the original image with lower spectral variability.
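One plausible reading of this sampling, sketched below, is strided sampling: at sampling scale s, the image is split into s × s sub-images, each taking every s-th pixel along rows and columns from a different starting offset, so that neighboring pixels fall into different sub-images. This interpretation of Figure 2 and the function name `make_subimages` are assumptions, and the image is represented as a list of rows for simplicity.

```python
def make_subimages(image, scale):
    """Split a 2-D grid into scale * scale sub-images by strided sampling."""
    subimages = {}
    for dr in range(scale):          # row offset of the sub-image
        for dc in range(scale):      # column offset of the sub-image
            subimages[(dr, dc)] = [row[dc::scale] for row in image[dr::scale]]
    return subimages
```

For a 4 × 4 image and a scale of 2, this yields four 2 × 2 sub-images, each preserving the coarse spatial layout of the original.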

The spectral variability cannot be significantly reduced when the sampling scale is small. Conversely, there may be no pure pixels in the sub-images when the scale is too large. Therefore, the best sampling scale is difficult to determine. For this reason, we extracted candidate endmembers at multiple scales, ensuring that the candidate endmembers could adequately represent the spectral variability of the original image. We first set 1, 2, 3, and 4 as sampling scales to accommodate images of small size. Then, the maximum sampling scale was determined via Equation (4), which was derived empirically from multiple experiments.

$$\mathrm{MAX\_scale} = \min(M / 20,\; N / 20), \tag{4}$$

where M and N represent the number of rows and columns of the original hyperspectral image, respectively. To reduce the computational burden, we adopted the method of exponential growth to calculate the remaining sampling scales.
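The scale selection can be sketched as below. The text does not state the growth base or how the scales relate to MAX_scale exactly, so this example assumes that scales 1 to 4 are always used and that further scales are obtained by doubling (8, 16, ...) while they do not exceed the bound of Equation (4); the function name `sampling_scales` and the `base` parameter are hypothetical.

```python
def sampling_scales(n_rows, n_cols, base=2):
    """Sampling scales: 1-4, then exponential growth up to the bound of Equation (4)."""
    max_scale = min(n_rows / 20, n_cols / 20)   # Equation (4)
    scales = [1, 2, 3, 4]                       # fixed small scales
    s = scales[-1] * base
    while s <= max_scale:                       # assumed stopping rule
        scales.append(s)
        s *= base
    return scales
```

Under these assumptions, a 263 × 271 image gives the scales 1, 2, 3, 4, and 8, whereas a 100 × 100 image gives only 1, 2, 3, and 4.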

3.3. Endmember Extraction from Each Sub-Image

After the above step, the spectral variability among different regions still exists because the feature distributions of the sub-images are similar to that of the original image. If endmembers are extracted directly from the sub-images, the final extracted endmembers will be spatially clustered. That is, extracting endmembers from sub-images with similar spatial structures may result in the extracted endmembers being located at similar positions in the sub-images. When the extracted pure pixels are mapped back onto the original image, the pure pixels of an endmember may then appear in adjacent areas and form clusters. To solve this problem, sub-images generated at the same sampling scale were alternately divided into four sections in the horizontal and vertical directions. Then, candidate endmembers were extracted from each section with VCA after deleting the boundary pixels detected in Section 3.1. Following the above steps, some pixels were extracted as candidate endmembers at multiple sampling scales. The more often a pixel is selected as a candidate endmember, the more likely it is to be a pure pixel. To select more representative endmembers, a threshold T is required for screening candidate endmembers; in this paper, one-third of the number of sampling scales was adopted as the threshold T.
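The screening by threshold T can be sketched as a simple vote count. The example below assumes that each pixel is counted at most once per sampling scale and that a pixel is kept when its count is at least T; neither detail is stated explicitly in the text, and the function name `screen_candidates` is hypothetical.

```python
from collections import Counter

def screen_candidates(candidates_per_scale):
    """Keep pixels selected as candidates at no fewer than one-third of the scales.

    candidates_per_scale: one collection of (row, col) positions per sampling scale.
    """
    threshold = len(candidates_per_scale) / 3       # T = (number of scales) / 3
    counts = Counter()
    for candidates in candidates_per_scale:
        counts.update(set(candidates))              # one vote per scale (assumption)
    return sorted(p for p, c in counts.items() if c >= threshold)
```

With six sampling scales, T equals 2, so a pixel chosen at only one scale is discarded.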

3.4. Stepwise Most Similar Collection (SMSC) Clustering

A spectral set is built after all endmembers are extracted via the above steps. The next step is to cluster endmembers in the spectral set into some collections corresponding to the various components. Automatic clustering is of great significance, as it affects accuracy evaluation and abundance estimation. The common clustering methods are not applicable to spectral sets with high spectral variability because the intraclass variation in materials may be greater than the interclass variation. In this paper, we propose a new clustering method by clustering stepwise similar endmembers to reduce the influence of spectral variability.

The main idea of this method is to use some endmembers to help their similar endmembers cluster correctly. For example, endmember i belongs to class 1 but is incorrectly assigned to class 2. In this case, endmember j, which is similar to endmember i and can be assigned correctly to class 1, can be used to help endmember i be assigned to class 1. The mathematical formula is expressed as follows:

$$\begin{cases} \mathrm{SAD}(i, A) > \mathrm{SAD}(i, B) \\ \mathrm{SAD}(j, A) < \mathrm{SAD}(j, B) \\ \mathrm{SAD}(i, j) < \mathrm{SAD}(i, A) \end{cases}, \tag{5}$$

where A and B represent the typical spectral signatures of class 1 and 2, respectively. These typical spectral signatures can be acquired through endmember extraction methods, field acquisition, etc. The spectral angle distance (SAD) is a metric to evaluate the similarity of two spectra, which can be calculated via Equation (6).

$$\mathrm{SAD}(x_a, x_b) = \cos^{-1}\left( \frac{(x_a)^{T} \cdot x_b}{\| x_a \| \cdot \| x_b \|} \right), \tag{6}$$

where $\| x_a \|$ and $\| x_b \|$ represent the norms of spectra $x_a$ and $x_b$, respectively.
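For concreteness, Equation (6) and the three conditions of Equation (5) can be written as the short sketch below. Spectra are plain Python lists, the cosine is clipped to [-1, 1] to guard against rounding error, and the function names `sad` and `satisfies_equation_5` are chosen for this example only.

```python
import math

def sad(x_a, x_b):
    """Spectral angle distance of Equation (6), in radians."""
    dot = sum(a * b for a, b in zip(x_a, x_b))
    norm_a = math.sqrt(sum(a * a for a in x_a))
    norm_b = math.sqrt(sum(b * b for b in x_b))
    cos_angle = max(-1.0, min(1.0, dot / (norm_a * norm_b)))  # clip rounding error
    return math.acos(cos_angle)

def satisfies_equation_5(x_i, x_j, sig_a, sig_b):
    """True when endmember j can help endmember i reach class 1 (signature A)."""
    return (sad(x_i, sig_a) > sad(x_i, sig_b)        # i alone would go to class 2
            and sad(x_j, sig_a) < sad(x_j, sig_b)    # j is assigned to class 1
            and sad(x_i, x_j) < sad(x_i, sig_a))     # i is closer to j than to A
```

Because SAD depends only on the angle between two spectra, it is insensitive to a uniform scaling of either spectrum, for example by a change in illumination intensity.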

The steps of SMSC clustering can be summarized as follows:

(1) Regard each endmember in the spectral set as an independent endmember collection, and take a typical spectral signature of each material as a target collection.

(2) Calculate the SAD between each spectrum in the endmember collection and all elements in the remaining collections, including the other endmember collections and all target collections, via Equation (7). $J_i$ denotes the collection that is most similar to collection $i$, i.e., the collection containing the element with the minimum SAD to a spectrum of collection $i$.

$$J_i = \arg\min_{j} \left( \mathrm{SAD}(x_i^{p}, x_j^{q}) \right) \quad (i = 1, 2, \dots, N_1;\; j = 1, \dots, N_1 + N_2;\; j \neq i;\; p = 1, 2, \dots, n_i;\; q = 1, 2, \dots, n_j), \tag{7}$$

where $x_i^{p}$ represents the $p$-th endmember in endmember collection $i$, $x_j^{q}$ represents the $q$-th element in collection $j$, $N_1$ and $N_2$ represent the number of endmember collections and target collections, respectively, and $n_i$ and $n_j$ represent the number of spectra in collections $i$ and $j$.

(3) Merge endmember collections, whose most similar collections are target collections, into the corresponding target collection, and update target collections.

(4) Repeat steps (2)–(3) until no endmember collections can be merged into target collections.

(5) Merge each remaining endmember collection with its corresponding most similar endmember collection, and update endmember collections. Then, repeat steps (2)–(4) until all endmember collections are merged into the target collections.

The specific process of the SMSC clustering method is shown in Figure 3.
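Steps (1)–(5) can be sketched in plain Python as follows. This is an illustrative reading of the procedure, not the authors' implementation: the distance between two collections is taken as the smallest SAD between any pair of their spectra (Equation (7)), ties between a target collection and an endmember collection are resolved in favor of the target, and in step (5) chains of mutually nearest collections are merged together. These choices, as well as the names `collection_distance` and `smsc`, are assumptions.

```python
import math

def sad(x_a, x_b):
    """Spectral angle distance of Equation (6), in radians."""
    dot = sum(a * b for a, b in zip(x_a, x_b))
    norm_a = math.sqrt(sum(a * a for a in x_a))
    norm_b = math.sqrt(sum(b * b for b in x_b))
    return math.acos(max(-1.0, min(1.0, dot / (norm_a * norm_b))))

def collection_distance(coll_a, coll_b):
    """Smallest SAD between any spectrum of coll_a and any spectrum of coll_b."""
    return min(sad(x, y) for x in coll_a for y in coll_b)

def smsc(endmembers, targets):
    """endmembers: list of spectra; targets: dict label -> typical spectrum.

    Returns a dict label -> list of indices of the endmembers assigned to it.
    """
    target_sets = {label: [list(sig)] for label, sig in targets.items()}
    assigned = {label: [] for label in targets}
    free = [[i] for i in range(len(endmembers))]          # step (1)

    def spectra(coll):
        return [endmembers[i] for i in coll]

    while free:
        while free:                                       # steps (2)-(4)
            moves = []
            for idx, coll in enumerate(free):
                spec = spectra(coll)
                label = min(target_sets,
                            key=lambda lab: collection_distance(spec, target_sets[lab]))
                d_target = collection_distance(spec, target_sets[label])
                d_free = min((collection_distance(spec, spectra(other))
                              for k, other in enumerate(free) if k != idx),
                             default=math.inf)
                if d_target <= d_free:                    # ties go to the target (assumption)
                    moves.append((idx, label))
            if not moves:
                break
            for idx, label in moves:                      # step (3): merge, update targets
                assigned[label].extend(free[idx])
                target_sets[label].extend(spectra(free[idx]))
            moved = {idx for idx, _ in moves}
            free = [c for k, c in enumerate(free) if k not in moved]
        if not free:
            break
        parent = list(range(len(free)))                   # step (5): merge nearest pairs

        def find(a):
            while parent[a] != a:
                a = parent[a]
            return a

        for idx in range(len(free)):
            spec = spectra(free[idx])
            nearest = min((k for k in range(len(free)) if k != idx),
                          key=lambda k: collection_distance(spec, spectra(free[k])))
            parent[find(idx)] = find(nearest)
        groups = {}
        for idx, coll in enumerate(free):
            groups.setdefault(find(idx), []).extend(coll)
        free = list(groups.values())
    return assigned
```

As a small two-band example, consider target signatures at spectral angles of 0° (class A) and 90° (class B) and extracted endmembers at 10°, 25°, 48°, and 80°. A direct nearest-signature rule would assign the 48° endmember to class B, whereas the stepwise procedure first absorbs the 10° and 25° endmembers into class A, after which the 48° endmember is closer to class A (23°) than to class B (32°).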

The proposed SMSC clustering method gradually clusters the extracted endmembers into target collections according to steps (3) and (4) instead of directly clustering all endmembers into categories. At each clustering step, the number of endmembers in each target collection increases, and so does the spectral variability that the collection covers, which indirectly helps the remaining endmembers to be clustered. Through stepwise clustering, the spectral variability of the extracted endmembers is spread across the clustering steps.

4. Experiments and Analysis

This section describes the experiments performed on both a synthetic dataset and real datasets. These experiments provide a comprehensive comparison between the proposed method and other typical methods.

In this paper, we adopted the mean spectral angle distance (MSAD) to evaluate the extracted endmembers and the root-mean-square error (RMSE) to assess the reconstructed image and the estimated abundance.

MSAD can be calculated via Equation (8),

$$\mathrm{MSAD} = \frac{1}{M} \sum_{i=1}^{M} \cos^{-1}\left( \frac{x_i^{T} \cdot \hat{x}_i}{\| x_i \| \cdot \| \hat{x}_i \|} \right), \tag{8}$$

where $M$ is the number of extracted endmembers, $\hat{x}_i$ denotes the extracted endmember, and $x_i$ is the corresponding ground truth.

RMSE was used to evaluate the performance of the various methods. It is given by Equation (9),

$$\mathrm{RMSE}(z, \hat{z}) = \left( \frac{1}{N} \| z - \hat{z} \|_2^2 \right)^{\frac{1}{2}}, \tag{9}$$

where $N$ is the number of elements in $z$, $z$ is the reference (either the true abundance or the original hyperspectral image), and $\hat{z}$ represents the corresponding estimated result. A smaller RMSE corresponds to a better performance. In this paper, RMSE_R represents the RMSE between the reconstructed image and the original image, whereas RMSE_A represents the RMSE between the true abundance and the estimated abundance.
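Both metrics are simple to compute; the sketch below follows Equations (8) and (9) directly. It assumes that the extracted endmembers have already been paired with their ground-truth spectra and that images or abundance maps are flattened into one-dimensional sequences; the function names `msad` and `rmse` are chosen for this example.

```python
import math

def msad(truth, extracted):
    """Mean spectral angle distance of Equation (8) over paired spectra, in radians."""
    angles = []
    for x, x_hat in zip(truth, extracted):
        dot = sum(a * b for a, b in zip(x, x_hat))
        norms = math.sqrt(sum(a * a for a in x)) * math.sqrt(sum(b * b for b in x_hat))
        angles.append(math.acos(max(-1.0, min(1.0, dot / norms))))
    return sum(angles) / len(angles)

def rmse(z, z_hat):
    """Root-mean-square error of Equation (9) between two flattened arrays."""
    n = len(z)
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(z, z_hat)) / n)
```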

4.1. Synthetic Image Dataset

The endmembers of the synthetic hyperspectral image were selected from the DIRSIG spectral library [38], which exhibits high spectral variability. Figure 4 displays the spectral library of the four materials used in this paper. The simulated data contained four features: muddy water, grass, asphalt, and concrete. The corresponding spatially correlated abundances were generated from a Gaussian random field [39]. With this abundance generation method, there was only one pure pixel of each material. To simulate the spectral variability of the synthetic image, in every pixel where the abundance of one material exceeded 0.95, we set that abundance to 1 and the abundances of the other materials to 0. The spectrum of each pixel in the synthetic hyperspectral data is the sum of the spectral signatures of the materials weighted by their corresponding abundances. The synthetic hyperspectral image generated by the above steps complies with the abundance non-negativity constraint (ANC) and the abundance sum-to-one constraint (ASC). The simulated hyperspectral image is exhibited in Figure 5.
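The abundance modification and the linear mixing described above can be sketched per pixel as follows. The Gaussian random field and the choice of library spectrum for each pure pixel are not reproduced here; the function names `purify_abundances` and `mix_pixel` are hypothetical.

```python
def purify_abundances(abundances, threshold=0.95):
    """Set a dominant abundance above the threshold to 1 and all others to 0."""
    dominant = max(range(len(abundances)), key=abundances.__getitem__)
    if abundances[dominant] > threshold:
        return [1.0 if k == dominant else 0.0 for k in range(len(abundances))]
    return list(abundances)          # mixed pixel: left unchanged

def mix_pixel(abundances, signatures):
    """Linear mixture: abundance-weighted sum of the material signatures."""
    n_bands = len(signatures[0])
    return [sum(a * s[b] for a, s in zip(abundances, signatures))
            for b in range(n_bands)]
```

Since a modified pixel has a single abundance of 1 and all others equal to 0, the ANC and ASC continue to hold after the modification.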

There are some points to be noted in this experiment: (1) in the EBE method, the number of subsets generated from the original image was set to 10; (2) in AAEBE and SSEBE, the number of projections used to extract candidate endmembers was set to 10,000, and the threshold was set to 0; (3) the number of endmembers in each material was set to 3 in AAEBE. The extraction results of the five methods are presented in Figure 6. To comprehensively compare the extraction methods, we added abundance estimation experiments. Fully constrained least squares (FCLS) was chosen as the method to estimate abundance in this paper, which can avoid the influence of parameter selection. The estimation results are shown in Figure 7. Table 1 presents the results of comparison between MSREBE and the other methods as a function of three metrics, and the best performance is bolded.

Figure 4 demonstrates that all four materials contained high spectral variability, especially asphalt, thus hindering the extraction of endmember bundle associated with asphalt. Comparing the extracted results in Figure 6 with the ground truth in Figure 4, the extracted endmembers of asphalt by VCA, EBE, and AAEBE were far from the associated ground truth, while MSREBE and SSEBE extracted partial endmembers of asphalt. It can also be seen from Figure 6 that the extraction result of MSREBE was better than that of SSEBE. In addition, the abundance maps related to MSREBE better agreed with the true abundance maps shown in Figure 7. Table 1 indicates that MSREBE was second only to AAEBE in terms of the RMSE of reconstruction error. Nevertheless, it was superior to other methods in terms of the RMSE of abundance error and MSAD. Considering the above, the proposed method performed better than other methods using the synthetic dataset.

4.2. Wetland Dataset

The original hyperspectral image was acquired on 25 October 2020 in Dongying by a hyperspectral camera mounted on an unmanned aerial vehicle. The wetland dataset, with a size of 263 × 271 pixels (shown in Figure 8a), is a subset of the original hyperspectral image with 126 bands covering the wavelength range of 0.450–0.946 μm. There were four main materials in the wetland dataset: reed, Tamarix chinensis, bare land, and water. In this paper, the corresponding ground truth of the study area was drawn on the basis of a multispectral image of the same area with higher spatial resolution (as shown in Figure 8b).

In Figure 9, the extraction results of VCA, EBE, SSEBE, AAEBE, and the proposed method are shown from top to bottom. As in the simulated hyperspectral data experiment, the abundance estimation was performed by FCLS with endmembers extracted using the five methods, and the abundance maps are presented in Figure 10. Table 2 indicates the comparison results between MSREBE and the other methods.

Figure 8 demonstrates that the area of water was obviously smaller than that of the other features, which made it difficult to extract the endmember bundle corresponding to water. From Figure 9, we can find that all methods could adequately extract the endmembers of the various materials except for water. By comparing the endmembers extracted using the various methods, we can find that the water endmember bundle extracted by the proposed method was better than that extracted by the other methods. Figure 10 presents the abundance maps of the different methods; the first row is the reference abundance, followed by the results of VCA, EBE, SSEBE, AAEBE, and MSREBE. By comparing the abundance maps associated with the various methods with the reference maps, it can be found that the results of MSREBE were closer to the reference abundance maps than those of the other methods, especially for reed and water. Table 2 compares MSREBE with the other methods in terms of various metrics, and the best results are marked in bold. From Table 2, we can see that MSREBE achieved the best results in terms of all evaluation indicators, again indicating its superiority to the other methods in hyperspectral unmixing with high spectral variability. Comparing Table 1 with Table 2, we can find that the proposed method is more suitable for the wetland data. The first reason is that the spectral variability of the wetland dataset was so large that it was difficult to express adequately with relatively few endmembers, such as those extracted by VCA and EBE. The second reason is that the spectral characteristics of reed and Tamarix chinensis were relatively similar, and traditional clustering methods struggled to correctly cluster the endmembers of similar materials.

4.3. Jasper Ridge Dataset

Jasper Ridge is a popular hyperspectral dataset used in unmixing. The original image has a size of 512 × 614 pixels, with 224 channels ranging from 380 nm to 2500 nm. The spectral resolution is up to 9.46 nm [40]. In this paper, a sub-image of 100 × 100 pixels was used to conduct the experiments after removing some channels affected by dense water vapor and atmospheric effects. There were four main features in the experimental area. Figure 11 exhibits the true color image of the experimental area and the ground truth. Table 3 presents a comparison based on three indicators calculated using the endmembers obtained by VCA, EBE, SSEBE, AAEBE, and MSREBE. The best performance is bolded. Figure 12 and Figure 13 show the extracted endmembers and the abundance maps corresponding to the various methods.

The endmembers extracted by various methods indicate that the spectral variability was relatively lower than that of the other two datasets, which resulted in the advantages of the proposed method not being fully shown. In addition, in Figure 11a, we can see that the distributions of ground objects in this dataset were more complex than in other datasets. Therefore, more boundary pixels were deleted by MSREBE, which caused the spectral variability within endmembers extracted by MSREBE to be lower than within those extracted by SSEBE (as shown in Figure 12), as well as led to a relatively high RMSE between the original image and reconstructed image. However, the performances of MSREBE in terms of RMSE_A and MSAD show that MSREBE extracted effective endmembers of various materials which were more suitable for unmixing. The corresponding abundance maps of various methods are compared in Figure 13. It is not difficult to find that the abundance maps corresponding to MSREBE were more similar to the reference, again proving the effectiveness of the proposed method.

4.4. Washington DC Mall Dataset

In this section, we performed experiments using the Washington DC Mall dataset, which was collected by the Spectral Information Technology Application Center of Virginia with the Hyperspectral Digital Imagery Collection Experiment (HYDICE) sensor. Each pixel was recorded in 210 channels ranging from 400 nm to 2400 nm. After removing some bands affected by the atmosphere or with high noise, 191 bands remained in the dataset. In this experiment, a subset of 151 × 141 pixels was extracted from the original image. The subscene mainly included seven types of ground objects, namely, grass, road, water, tree, street, roof 1, and roof 2. Figure 14 shows the true color image of the study area and some manually selected pure pixels corresponding to the seven features. The spectral averages of the pure pixels corresponding to the various endmembers were calculated as the reference endmember signatures. Because no true abundance is available for this dataset, only MSAD and the RMSE of the reconstruction error (RMSE_R) were used to evaluate the proposed method and the other typical methods. The result is shown in Table 4, where the best performance is bolded. Figure 15 shows the four endmember extraction results corresponding to various methods, and Figure 16 shows the abundance result of Washington DC Mall estimated by the proposed method.

The Washington DC Mall dataset differed from the previous three datasets in that it had more types of features and a more complex feature distribution. According to Table 4, MSREBE had the best performance in terms of MSAD among all methods and was second only to EBE in terms of the reconstruction RMSE. Figure 15 shows that MSREBE extracted endmembers more effectively than the other methods. Although no real abundance maps were available, the estimated abundance maps approximately represented the ground object distribution of the real image. From these experimental results, we conclude that the proposed method can be used not only in scenes with fewer features, but also in complex scenes with several ground objects.

4.5. Discussion

From the above experiments with a synthetic dataset and real datasets, we can make two observations. Firstly, the proposed method had a lower mean SAD than the comparison methods, and the abundance estimation experiments showed that the abundance maps corresponding to MSREBE were more consistent with the references, again reflecting the superiority of MSREBE in hyperspectral unmixing. Secondly, the proposed method is applicable to a variety of scenarios, not only to a scene with large differences in ground feature distribution, such as the wetland dataset, but also to a scene with a complex ground feature distribution or several features, such as the Jasper Ridge and Washington DC Mall datasets. However, some issues remain unresolved. The relevant parameters are difficult to determine: the sampling scales and the threshold play a significant role in MSREBE, yet in this paper they were determined through multiple tests. In our future work, we will try different methods to determine the best sampling scales for various datasets. So far, we have found that the sampling scales should be related to the size of the hyperspectral image, the degree of spectral variability, and the complexity of the ground feature distribution. In further research, we will try to determine the sampling scales adaptively according to the characteristics of the image. Through many experiments, we found that it is appropriate to select candidate endmembers using one-third of the number of sampling scales as the threshold. However, the threshold may change as a function of the rules governing the sampling scales.
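The one-third threshold mentioned above can be illustrated with a small voting sketch. It assumes that each sampling scale yields a set of candidate pixel positions and that a pixel is kept when it is selected at no fewer than one-third of the scales, rounded up; the function name, the rounding, and the use of "greater than or equal" are assumptions for illustration rather than details taken from the method description.

```python
from collections import Counter


def select_endmembers(candidates_per_scale, divisor=3):
    """Keep pixels chosen as candidates at >= ceil(n_scales / divisor) sampling scales.

    candidates_per_scale: one iterable of hashable pixel positions per sampling scale.
    """
    n_scales = len(candidates_per_scale)
    threshold = (n_scales + divisor - 1) // divisor  # integer ceiling of n_scales / divisor
    votes = Counter()
    for candidates in candidates_per_scale:
        votes.update(set(candidates))  # a pixel counts at most once per scale
    return sorted(p for p, n in votes.items() if n >= threshold)


if __name__ == "__main__":
    # Hypothetical candidates (row, col) found at six sampling scales.
    per_scale = [
        [(0, 0), (5, 7)],
        [(0, 0)],
        [(5, 7), (9, 9)],
        [(0, 0)],
        [(3, 3)],
        [(5, 7)],
    ]
    print(select_endmembers(per_scale))
```

Exposing the divisor as a parameter reflects the remark that the threshold may need to change if the rule for choosing sampling scales changes.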

5. Conclusions

In this paper, a novel multiscale resampling endmember bundle extraction (MSREBE) algorithm was proposed for hyperspectral endmember bundle extraction. Several strategies were employed to address spectral variability in hyperspectral unmixing. First, endmember extraction was performed on sub-images, which have much lower spectral variability than the full image. Then, the pixels selected as candidate endmembers at multiple sampling scales were taken as endmembers. Finally, a novel stepwise most similar collection (SMSC) clustering method was proposed, in which the extracted endmembers are clustered stepwise on the basis of minimum SAD. In each clustering step, each endmember collection is merged only with its most similar collection, which weakens the influence of spectral variability within the extracted endmembers. The experiments performed in this paper showed that the proposed method is practical and can be applied to a variety of datasets, obtaining better results than the compared algorithms.
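The stepwise merging idea summarized above can be sketched as follows. This is only an illustration of "merge each collection with its most similar collection, step by step, using minimum SAD", not the SMSC procedure itself: the use of collection means to measure similarity, the rule that a collection takes part in at most one merge per round, and stopping at a known number of classes are all assumptions introduced here.

```python
import math


def sad(a, b):
    """Spectral angle distance (in radians) between two spectra."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return math.acos(max(-1.0, min(1.0, dot / norm)))


def mean_spectrum(spectra):
    """Band-wise average of a list of equal-length spectra."""
    return [sum(band) / len(spectra) for band in zip(*spectra)]


def stepwise_merge(endmembers, n_classes):
    """Group spectra into n_classes collections by repeated most-similar merging."""
    collections = [[list(e)] for e in endmembers]
    while len(collections) > n_classes:
        centers = [mean_spectrum(c) for c in collections]
        # For every collection, find its most similar other collection (minimum SAD).
        nearest = []
        for i, ci in enumerate(centers):
            d, j = min((sad(ci, cj), j) for j, cj in enumerate(centers) if j != i)
            nearest.append((d, i, j))
        # Merge the closest pairs first; each collection merges at most once per round.
        used, merged = set(), []
        for d, i, j in sorted(nearest):
            if len(collections) - len(merged) <= n_classes:
                break
            if i in used or j in used:
                continue
            merged.append(collections[i] + collections[j])
            used.update((i, j))
        collections = merged + [c for k, c in enumerate(collections) if k not in used]
    return collections


if __name__ == "__main__":
    # Hypothetical two-band spectra from two materials with slight variability.
    spectra = [[1.0, 0.10], [1.0, 0.12], [1.0, 0.09], [0.10, 1.0], [0.12, 1.0]]
    for bundle in stepwise_merge(spectra, n_classes=2):
        print(bundle)
```

Restricting each collection to a merge with its nearest neighbor keeps spectrally distant collections apart until late rounds, which is the intuition behind reducing the influence of within-class variability on the grouping.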

Author Contributions

All the authors made significant contributions to the work. C.Y., S.L., and M.X. conceptualized, designed, and performed the experiments; B.D. and J.W. analyzed the data; H.S. provided advice for the preparation and revision of the paper. All authors read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China under Grants 62071492 and 61701542, and by the China High Resolution Earth Observation System Program under Grant 41-Y30F07-9001-20/22.

Conflicts of Interest

The authors declare no conflict of interest.

Figure 1. General flowchart of the proposed method for endmember bundle extraction.

Figure 2. Schematic diagram of sub-image generation with a sampling scale of 2.

Figure 3. General flowchart of the SMSC clustering method.

Figure 4. The spectral profiles of the four materials in the DIRSIG spectral library.

Figure 5. The true color image of the synthetic dataset.

Figure 6. The endmember bundle extraction results of the synthetic dataset.

Figure 7. The abundance estimation results of the synthetic dataset.

Figure 8. (a) The true color image of the wetland; (b) the map of feature distribution in the study area.

Figure 9. The endmember extraction results of the wetland dataset.

Figure 10. The abundance estimation results of the Yellow River Estuary.

Figure 11. (a) The true color image of the Jasper Ridge dataset; (b) ground truth of each feature.

Figure 12. The endmember extraction results of the Jasper Ridge dataset.

Figure 13. The abundance estimation results of the Jasper Ridge dataset.

Figure 14. The true color image of Washington DC Mall.

Figure 15. The endmember extraction results of the Washington DC Mall dataset.

Figure 16. The abundance result of Washington DC Mall estimated by the proposed method.

Table 1. Experimental results of synthetic image dataset.

Methods	MSAD	RMSE_R	RMSE_A
VCA	0.221	0.178	0.208
EBE	0.167	0.124	0.168
SSEBE	0.247	0.072	0.214
AAEBE	0.277	0.035	0.208
MSREBE	0.155	0.067	0.037

Table 2. Experimental results of wetland dataset.

Methods	MSAD	RMSE_R	RMSE_A
VCA	0.279	0.067	0.382
EBE	0.224	0.178	0.321
SSEBE	0.312	0.039	0.349
AAEBE	0.291	1.107	0.479
MSREBE	0.089	0.019	0.073

Table 3. Experimental results of Jasper Ridge dataset.

Methods	MSAD	RMSE_R	RMSE_A
VCA	0.163	0.159	0.102
EBE	0.356	0.252	0.188
SSEBE	0.121	0.109	0.043
AAEBE	0.225	0.127	0.082
MSREBE	0.099	0.140	0.036

Table 4. Experimental results of Washington DC Mall dataset.

Methods	MSAD	RMSE_R
VCA	0.244	0.308
EBE	0.152	0.016
SSEBE	0.225	0.095
AAEBE	0.143	0.026
MSREBE	0.081	0.021

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
