Generative Simplex Mapping: Non-Linear Endmember Extraction and Spectral Unmixing for Hyperspectral Imagery

Waczak, John; Lary, David J.

doi:10.3390/rs16224316

Open AccessArticle

Generative Simplex Mapping: Non-Linear Endmember Extraction and Spectral Unmixing for Hyperspectral Imagery

by

John Waczak

and

David J. Lary

^*

Hanson Center for Space Sciences, University of Texas at Dallas, Richardson, TX 75080, USA

^*

Author to whom correspondence should be addressed.

Remote Sens. 2024, 16(22), 4316; https://doi.org/10.3390/rs16224316

Submission received: 14 October 2024 / Revised: 8 November 2024 / Accepted: 18 November 2024 / Published: 19 November 2024

(This article belongs to the Special Issue Advanced Artificial Intelligence and Deep Learning for Remote Sensing II)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

We introduce a new model for non-linear endmember extraction and spectral unmixing of hyperspectral imagery called Generative Simplex Mapping (GSM). The model represents endmember mixing using a latent space of points sampled within a

(n - 1)

-simplex corresponding to n unique sources. Barycentric coordinates within this simplex are naturally interpreted as relative endmember abundances satisfying both the abundance sum-to-one and abundance non-negativity constraints. Points in this latent space are mapped to reflectance spectra via a flexible function combining linear and non-linear mixing. Due to the probabilistic formulation of the GSM, spectral variability is also estimated by a precision parameter describing the distribution of observed spectra. Model parameters are determined using a generalized expectation-maximization algorithm, which guarantees non-negativity for extracted endmembers. We first compare the GSM against three varieties of non-negative matrix factorization (NMF) on a synthetic data set of linearly mixed spectra from the USGS spectral database. Here, the GSM performed favorably for both endmember accuracy and abundance estimation with all non-linear contributions driven to zero by the fitting procedure. In a second experiment, we apply the GTM to model non-linear mixing in real hyperspectral imagery captured over a pond in North Texas. The model accurately identified spectral signatures corresponding to near-shore algae, water, and rhodamine tracer dye introduced into the pond to simulate water contamination by a localized source. Abundance maps generated using the GSM accurately track the evolution of the dye plume as it mixes into the surrounding water.

Keywords:

endmember extraction; spectral unmixing; hyperspectral imaging; unsupervised machine learning; source apportionment

1. Introduction

Hyperspectral imaging has emerged as a keystone technology in remote sensing, where the ability to discern variations between high-resolution spectra supports a plethora of critical applications such as environmental monitoring, biodiversity conservation, sustainable agriculture, and more. In recent years, many remote sensing platforms have been deployed with hyperspectral imaging payloads such as the Italian PRISMA mission launched in 2019, the German EnMAP launched in 2022, and recently, NASA’s PACE satellite launched in 2024 [1,2,3]. Many future missions also plan to incorporate hyperspectral imaging capabilities, such as the European Space Agency’s CHIME, which will include over 200 bands spanning visible, near-infrared (NIR), and short-wave infrared (SWIR) wavelengths [4]. The continued development of hyperspectral imaging technology has also led to a considerable reduction in size that enables its inclusion in the payloads of small unmanned aerial vehicles (UAVs) [5,6]. Despite the proliferation of hyperspectral imaging data sources, the considerable increase in data volume associated with hyperspectral images (HSIs) poses significant challenges to real-time analysis at scale.

Many approaches have been developed to make sense of the HSI data. For example, spectral indices such as the popular normalized difference vegetation index (NDVI) can be computed by taking ratios of spectral bands tailored to track specific reflectance characteristics [7,8]. These indices have the advantage of being easy to compute, but suffer from significant variability between instruments while ignoring most of the information captured in HSI spectra [9]. An alternative approach is to pair HSI data with in situ measurements to enable supervised models that map spectra directly to parameters of interest. However, this approach relies on serendipitous satellite overpasses above sensing sites to generate sufficient quantities of aligned data for model training and evaluation. For example, Aurin et al. combined data from over 30 years of oceanographic field campaigns with paired satellite imagery to develop robust models for the inversion of key water quality indicators such as colored dissolved organic matter [10]. This approach can be accelerated by combining UAV-based hyperspectral imaging with rapid in situ data collection using autonomous boats [11,12]. However, these supervised methods rely on a priori knowledge of expected sources to identify appropriate reference sensors for data collection. In light of these limitations, unsupervised methods are needed that can reliably extract source signatures from high-dimensional HSIs.

The spatial resolution of hyperspectral images generally results in pixels with mixed signals from multiple sources, called endmembers. The task of unsupervised source identification using HSI data therefore involves two steps: endmember extraction and abundance estimation. Techniques such as vertex component analysis (VCA), the pixel purity index (PPI), and N-FINDR solve this first task by identifying HSI endmember spectra assuming the presence of some pure (unmixed) pixels [13,14,15]. By further assuming a linear mixing model (LMM) in which observed spectra are described by a linear combination of endmembers with non-negative abundances, HSIs can be unmixed using a variety of techniques such as constrained least squares and multi-objective optimization [16,17,18]. Among these methods, non-negative matrix factorization (NMF) is a widely used approach that extracts endmember spectra and unmixes abundances simultaneously via matrix factorization [19,20]. The update equations for NMF can be formulated as multiplicative updates, which guarantee the non-negativity of endmember spectra and their associated abundances [21]. For this reason, the continued development of new NMF varieties remains an active area of research.

In realistic scenes, multiple scattering and surface variability can easily challenge the assumption of linear mixing [22]. Water-based HSIs specifically are prone to non-linear mixing effects due to absorption features of dissolved and suspended substances, fluorescence of organic matter, and particulate scattering in turbid waters [23,24,25]. With the growing popularity of deep learning approaches in remote sensing, a variety of models based on autoencoder architectures have been introduced for unmixing HSI data [26,27,28,29]. However, the complexity introduced by these models significantly impacts training time and decreases the interpretability of the resulting model. An ideal approach should enable both endmember extraction and non-linear unmixing while accounting for spectral variability.

The self-organizing map (SOM) is an unsupervised machine learning method that maps high-dimensional data to a low-dimensional grid while preserving the topological relationships between data points [30]. This low-dimensional representation provides a convenient way to visualize HSI data, while the weight vectors for each SOM node can be interpreted as representative spectra [31,32,33]. If labeled reference spectra are available, the SOM can be used to enable semi-supervised labeling of HSI pixels [34]. The SOM has also been shown to be effective for the compression of HSIs acquired by a CubeSat [35]. Despite these capabilities, the SOM does not offer a probabilistic interpretation and relies on a heuristic training procedure with hyperparameters that can be challenging to tune. To address these shortcomings, Bishop et al. introduced Generative Topographic Mapping (GTM), which is a probabilistic latent-variable model inspired by the SOM [36]. When choosing a two-dimensional latent space, the GTM can be used to visualize the distribution of HSI spectra, while the non-linear mapping from latent space nodes to the HSI data space provides endmembers [37]. Unfortunately, the rectangular latent space grid employed by the GTM does not directly translate into endmember abundances, and the expectation-maximization (EM) algorithm used to train the GTM does not guarantee non-negativity of GTM node spectra.

In this paper, we introduce a new variant of the GTM, dubbed Generative Simplex Mapping (GSM), which can extract endmember spectra and unmix non-linear mixtures. By replacing the rectangular latent space of the GTM with a gridded

(n - 1)

-simplex, the vertices of the GSM can be immediately interpreted as endmembers corresponding to n unique spectral signatures. The mapping from the latent space to the HSI data space accounts for linear and non-linear spectral mixing while barycentric coordinates for the latent space simplex represent relative endmember abundances. Additionally, by taking inspiration from the multiplicative updates of NMF, the GSM algorithm maintains the non-negativity of resulting endmember spectra. If only linear mixing is present, the GSM algorithm drives non-linear contributions to 0. Prior distributions included for GSM model weights yield hyperparameters, which can be tuned to control the smoothness of the resulting spectra and the degree of non-linear mixing applied. In summary, the key innovations introduced by the GSM are as follows:

The GSM can model linear and non-linear spectral mixing;
The GSM does not assume the presence of pure pixels in the data set;
The probabilistic formulation of the GSM accounts for spectral variability;
The simplex used for the latent space structure of the GSM is directly interpretable and forces abundances to satisfy both the abundance sum-to-one and abundance non-negativity constraints;
The fitting procedure introduced for the GSM maintains non-negativity of endmember spectra.

The remainder of this paper is structured as follows. Section 2 describes the proposed method. Section 3 and Section 4 describe experiments using simulated and real HSI data to evaluate the GSM model. Section 5 discusses additional applications and extensions of the GSM to be explored in future work. Finally, Section 6 finishes the paper with some closing remarks.

2. Generative Simplex Mapping

In the original GTM formulation, the data vectors

x

(reflectance spectra of length D) are described by latent variables

z

mapped into the data space by a non-linear function

ψ

. The data space distribution is taken to be normal with a precision parameter

β

to account for measurement noise and spectral variability. The GSM uses same structure, that is

p (x ∣ z, W, β) = {(\frac{β}{2 π})}^{D / 2} exp (- \frac{β}{2} {∥ ψ (z; W) - x ∥}^{2})

(1)

where

z = (z_{1}, z_{2}, . . ., z_{N_{v}})

corresponding to

N_{v}

-many sources and

W

is a

D \times M

matrix of model weights that parameterize the mapping

ψ

. As Equation (1) indicates, the precision parameter corresponds to a standard deviation of

\sqrt{β^{- 1}}

.

Assuming data are uniformly sampled from an embedded manifold in the data space, the GTM models the latent space using a rectangular grid with K-many nodes of equal prior probability. To adapt the GTM to describe endmember mixing, the GSM makes two key changes. The first is to replace the rectangular GTM grid with a gridded simplex with

N_{v}

vertices. Barycentric coordinates for each node then describe the relative abundance of each endmember with simplex vertices corresponding to pure sources. The second change is to augment the equal prior probabilities of the GTM latent space with adaptive mixing coefficients

π_{k}

to allow for nonuniform sampling across the abundance simplex. This addition allows the GSM to flexibly address many possible mixing scenarios without an a priori assumption for a specific mixing distribution. Together, these changes result in a prior distribution for the latent space given by

p (z) = \sum_{k}^{K} π_{k} δ (z - z_{k}) where \sum_{k}^{K} π_{k} = 1 .

(2)

where

δ (\cdot)

is the Dirac delta function and

z_{k}

are the positions of each node within the simplex.

For a data set containing N-many records, these definitions yield a log likelihood function given by

L = \sum_{n}^{N} ln (\sum_{k}^{K} π_{k} p (x_{n} ∣ z_{k}, W, β))

(3)

which can be maximized to obtain optimal values for model weights

W

, mixing coefficients

π_{k}

, and precision

β

. Rather than optimizing Equation (3) directly, we instead choose a particular form for

ψ

to allow fitting the GSM via an EM algorithm.

In the standard LMM model, the mapping

ψ

is given by

ψ (z; W) = W z

where the columns of

W

correspond to endmember spectra. To model non-linear mixing,

z

is replaced by the output of M-many activation functions such that

ψ (z; W) = W ϕ (z)

. The activations applied to each GSM node can then be collected to form a matrix with elements

Φ_{k m} = ϕ_{m} (z_{k})

. For

m \leq N_{v}

, we take

Φ_{k m} = {[z_{k}]}_{m}

to model linear mixing. The remaining

M - N_{v}

activations are computed using radial basis functions (RBFs) with centers

μ_{m}

distributed throughout the simplex (but not at the vertices) with

Φ_{k m} = \{\begin{matrix} \frac{s - ∥ z_{k} - μ_{m} ∥}{s}; & ∥ z_{k} - μ_{m} ∥ \leq s \\ 0; & ∥ z_{k} - μ_{m} ∥ > s \end{matrix}

(4)

where s is the spacing between RBF centers. A visual representation of the non-linear activation functions in Equation (4) is shown below in Figure 1 for a 2-component GSM.

In this form, the first

N_{v}

columns of

W

correspond to endmember spectra, while the remaining columns account for additional non-linear effects. Equation (4) is specifically chosen so that no non-linear contributions are possible for pure spectra at the vertices of the simplex. At all other points, the output of

ψ

involves both linear and non-linear contributions. If only linear mixing is present, the GSM training algorithm should therefore drive

W_{d m}

to 0 for

m \geq N_{v}

. A visualization of the GSM model for three sources is shown in Figure 2.

To further constrain the model, we introduce prior distributions on the weights

W

. For

m \leq N_{v}

we take

W_{d m} \sim N (0, λ_{e}^{- 1})

, corresponding to a zero-mean Gaussian with variance

λ_{e}^{- 1}

. For

m > N_{v}

we use a zero-mean Laplace distribution,

W_{d m} \sim \frac{λ_{w}}{2} exp (- λ_{w} | W_{d m} |)

, with scale parameter

λ_{w}^{- 1}

. Under these choices,

λ_{e}

corresponds to

L_{2}

regularization on endmember spectra while

λ_{w}

corresponds to

L_{1}

regularization on the non-linear activations. In other words,

λ_{e}

governs the smoothness of the resulting endmembers, while

λ_{w}

encourages sparsity for the non-linear contributions.

An EM algorithm for the GSM model can now be formulated as follows. Suppose that we have current estimates for the model weights

W

, mixing coefficients

π_{k}

, and precision parameter

β

. During the expectation step we compute the posterior probabilities, that is, the responsibility of each GSM node for each spectrum in the data set:

R_{k n} = p (z_{k} ∣ x_{n}, W, β) = \frac{π_{k} p (x_{n} ∣ z_{k}, W, β)}{\sum_{k^{'}}^{K} π_{k^{'}} p (x_{n} ∣ z_{k^{'}}, W, β)} .

(5)

For the maximization step, we consider the expectation of the penalized complete-data log likelihood given by

\begin{matrix} Q & = \sum_{n}^{N} \sum_{k}^{K} R_{k n} (ln π_{k} + \frac{D}{2} ln (\frac{β}{2 π}) - \frac{β}{2} \sum_{d}^{D} {(\sum_{m}^{M} W_{d m} Φ_{k m} - X_{n d})}^{2}) \\ + \frac{N_{v} D}{2} ln (\frac{λ_{e}}{2 π}) - \frac{λ_{e}}{2} \sum_{d}^{D} \sum_{m = 1}^{N_{v}} W_{d m}^{2} \\ + (M - N_{v}) D ln (\frac{λ_{w}}{2}) - λ_{w} \sum_{d}^{D} \sum_{m = N_{v} + 1}^{M} W_{d m} \end{matrix}

(6)

where

X_{n d}

is the d-th component of the n-th spectrum in the data set. Equation (6) is then maximized with respect to

π_{k}

,

β

, and

W

to obtain new parameter values. For a detailed overview of the EM procedure, the reader is directed to ref. [38].

For

π_{k}

, optimization can be performed using Lagrange multipliers to maintain the condition that

\sum_{k} π_{k} = 1

. Doing so yields

π_{k}^{new} = \frac{1}{N} \sum_{n} R_{k n}

(7)

Optimization of Equation (6) with respect to

W

leads to a linear system that can be solved using standard numerical methods. However, in this form, we cannot guarantee the non-negativity of

W

required to describe reflectance spectra. Therefore, we take inspiration from the multiplicative updates for NMF introduced by Lee and Seung [19]. A standard gradient-based update for

W

would normally take the form

W_{new} = W + η \frac{\partial Q}{\partial W}

(8)

for some learning rate

η

. Therefore, we differentiate to obtain

\frac{\partial Q}{\partial W} = - β W Φ^{T} G Φ - Λ + β X^{T} R^{T} Φ

(9)

where

G

is a diagonal matrix with

G_{k k} = \sum_{n} R_{k n}

and

Λ

is given by

Λ_{d m} = \{\begin{matrix} λ_{e} W_{d m}; & m \leq N_{v} \\ λ_{w}; & m > N_{v} \end{matrix}

(10)

If we allow individual learning rates

η_{d m}

for each element of

W

, then choosing

η_{d m} = \frac{W_{d m}}{{(β W Φ^{T} G Φ)}_{d m} + Λ_{d m}}

(11)

results in a multiplicative update rule given by

W_{d m}^{n e w} = W_{d m} \cdot \frac{{(β X^{T} R^{T} Φ)}_{d m}}{{(β W Φ^{T} G Φ)}_{d m} + Λ_{d m}}

(12)

From Equation (12), it is clear that we are multiplying

W_{d m}

by strictly non-negative values, and therefore, non-negative

W_{d m}

will remain so during each update. This update can also be repeated multiple times during each M-step to accelerate convergence.

Optimizing Equation (6) with respect to

β

yields the final update equation:

\frac{1}{β^{new}} = \frac{1}{N D} \sum_{n}^{N} \sum_{k}^{K} R_{k n} {∥ ψ (z_{k}; W) - x_{n} ∥}^{2} .

(13)

To train a GSM model, weights

W

are randomly initialized to positive values. The mixing coefficients are initially set to

π_{k} = 1 / K

. Finally, the precision parameter

β

is initialized to the variance of the

(N_{v} + 1)

-th principal component. The matrix

Φ

is pre-computed and stored for reuse during each EM cycle together with the data matrix

X

. After initialization, the expectation and maximization steps are repeated in turn until Q converges to a predetermined tolerance level. For large

N_{v}

, we note that generating a regular grid within the simplex becomes cumbersome as the number of grid nodes scales as

(\binom{k + N_{v} - 2}{N_{v} - 1})

for k nodes per edge leading to a large responsibility matrix. An alternative approach is to randomly sample points within the simplex to obtain a total of K nodes. A Dirichlet distribution

p (z) = \frac{Γ (\sum_{i}^{N_{v}} α_{i})}{\prod_{i}^{N_{v}} Γ (α_{i})} \prod_{i}^{N_{v}} {[z]}_{i}^{α_{i} - 1}

(14)

with all

α_{i} = 1

can be used to uniformly sample within the simplex. Since the mixing coefficients

π_{k}

are adaptive, variability in node separation should not significantly impact the resulting GSM.

The probabilistic form of the GSM means that a variety of information criteria can be used to evaluate the model fits. In this paper we consider two metrics, the Bayesian Information Criterion (BIC),

BIC = P ln (N) - 2 L,

(15)

and the Akaike Information Criterion (AIC),

AIC = 2 P - 2 L

(16)

where P is the total number of model parameters and

L

is the log likelihood from Equation (3).

The map

ψ

provides the representation of each node

z_{k}

in the data space. Importantly, applying

ψ

to each vertex extracts endmember spectra from the GSM. Slices

R_{[:, n]}

of the matrix

R

define the responsibility of each latent node

z_{k}

for the n-th spectrum

x_{n}

in the data set. Therefore, once the GSM has been trained,

R

can be used to unmix endmember abundances by representing each record in the latent space via the mean:

{\hat{z}}_{n} = \sum_{k}^{K} R_{k n} z_{k} .

(17)

A freely available implementation of the GSM is provided in [39]. The code is written in the Julia programming language and follows the Machine Learning in Julia (MLJ) common interface [40,41].

3. Experiments

3.1. Linear Mixing: Comparison to NMF

To illustrate the effectiveness of the GSM, we first demonstrate its ability to model linear mixing. This serves as an important limiting case, since linearly mixed spectra should not lead to the spurious introduction of non-linear contributions by the GSM. The goal of this first test is therefore to demonstrate that the GSM drives non-linear weights to zero for linearly mixed data while providing a fair test to compare the GSM to a well-established linear mixing model. This ability clearly distinguishes the GSM from other non-nonlinear unmixing approaches such as autoencoders, which, by their design, include non-linear mixing even when it is not present in the underlying data. To this end, a synthetic data set comprising linear mixtures of three sample spectra from the U.S. Geological Survey digital spectral library was generated by sampling 1000 abundance vectors from a Dirichlet distribution with

α_{1} = α_{2} = α_{3} = 1 / 3

[42]. These source spectra and abundances are visualized in Figure 3.

Zero-mean Gaussian noise was added to the data to yield 9 data sets with signal-to-noise ratios (SNRs) ranging from 0 to ∞ to examine the impact of random noise on GSM performance. For each of these data sets, a GSM was trained using 25 nodes per edge with

λ_{e}

set to the default value of

0.01

and

λ_{w}

fixed to 100. The large value of

λ_{w}

was chosen to encourage the GSM to prefer models with limited non-linear mixing.

For comparison, three NMF models were also trained on each data set, as this method is both highly popular and does not include the pure pixel assumption common to other techniques like VCA and PPI. Countless variations on the original NMF method have been introduced into the literature, with one review identifying more than 100 distinct NMF varieties [20]. For the purpose of comparing to the GSM, we considered the standard

ℓ_{2}

and KL-divergence formulations introduced by Lee and Seung [21]. We also included the robust

ℓ_{2, 1}

NMF as described by Kong et al. [43]. We note that the goal of this test is not to prove the GSM is superior to other models for linear mixing, but rather to demonstrate that the GSM can model linearly mixed data without introducing unnecessary complexity.

Four metrics were used to compare model performance. For endmember extraction, the mean spectral angle and mean RMSE between true endmembers

ρ_{i}

and extracted endmembers

{\hat{ρ}}_{i}

were computed where the spectral angle for the i-th endmember is defined as

θ (ρ_{i}, {\hat{ρ}}_{i}) = arccos (\frac{〈 ρ_{i}, {\hat{ρ}}_{i} 〉}{∥ ρ_{i} ∥ \cdot ∥ {\hat{ρ}}_{i} ∥})

(18)

and the RMSE for the i-th endmember is

RMSE (ρ_{i}, {\hat{ρ}}_{i}) = \sqrt{\frac{1}{D - 1} \sum_{d}^{D} {(ρ_{i} (λ_{d}) - {\hat{ρ}}_{i} (λ_{d}))}^{2}} .

(19)

Abundance estimation was similarly evaluated using the mean RMSE between true abundance and estimated abundance values for each endmember. Finally, the reconstruction RMSE, that is, the RMSE computed between the original data set and its reconstruction via the extracted endmembers and abundances, was computed. This provides a model-agnostic criterion to guarantee that each model sufficiently converged during training.

3.2. Non-Linear Mixing: Water Contaminant Identification

To assess the ability of the GSM to unmix realistic scenes likely to involve non-linear mixing effects, we consider a data set of real HSI collected in Montague, North Texas on 9 December 2020. A Freefly Alta-X autonomous quadcopter was used as a UAV platform and equipped with a Resonon Pika XC2 visible+near-infrared (VNIR) hyperspectral imager to acquire multiple HSIs. Each HSI pixel included 462 wavelength bins ranging from 391 to 1011 nm. To evaluate the ability of GSM to identify potential contaminant sources, rhodamine dye, a commonly used tracer in hydrological studies, was released into the pond. Two UAV flights spaced 15 min apart were used to capture the evolution of the plume as it dispersed into the surrounding water.

The hyperspectral imager is in a pushbroom configuration so that each HSI was captured one scan line at a time. Data from an embedded GPS/INS unit enable direct georectification of captured imagery using the method described in [44]. Additionally, an upward-facing Ocean Optics UV-Vis NIR spectrometer with a cosine corrector was included on the top of the UAV to measure incident solar irradiance. The configuration of the UAV together with a sample HSI are shown in Figure 4. For a more detailed description of the system, the reader is directed to references [11,12].

Raw HSIs were converted to reflectance using the downwelling irradiance spectrum captured simultaneously with each HSI. Given the UAV flies with the imager oriented to nadir, the reflectance is then given by

ρ (λ) = π L (λ) / E_{d} (λ)

(20)

where L is the spectral radiance,

E_{d}

is the downwelling irradiance, and a factor of

π

is included resulting from the assumption of diffuse upwelling radiance [45]. The UAV flights were performed near solar noon to maximize the amount of sunlight illuminating the water. For this pond in North Texas, this corresponded to an average solar zenith angle of

56 . 7^{\circ}

, resulting in HSIs with negligible sunglint effects.

From the collected HSIs, a water-only pixel mask was generated by identifying all pixel spectra with a normalized difference water index (NDWI) greater than

0.25

as defined in ref. [46]. Of these water pixels, a combined data set of

15, 000

spectra was sampled for GSM training. As a final processing step, reflectance spectra were limited to

λ \leq 900

nm, as wavelengths above this threshold showed significant noise.

In order to justify values for the appropriate number of endmembers, the final data set was decomposed using principal component analysis (PCA). A plot of the explained variance organized for each PCA component is shown in Figure 5.

The PCA decomposition of the data suggests that at least 3 endmembers should be used for a mixing model, and beyond 6 there is little added benefit. Based on these observations, multiple GSM models were trained with

N_{v}

ranging from 3 to 6,

λ_{e}

ranging from

0.001

to

1.0

, and

λ_{w}

ranging from 1 to 1000 in order to explore the GSM parameter space. From these, a final model was identified using the BIC, AIC, and reconstruction RMSE. The resulting GSM was then explored to examine extracted endmembers and map the evolution of the rhodamine plume by using the abundances given by the latent space representation of each pixel in the HSIs.

4. Results

4.1. Linear Mixing

The results of GSM training on the synthetic linear mixing data set described in Section 3 are illustrated in Figure 6. Three versions of NMF were trained corresponding to Euclidean (

ℓ_{2}

), KL-Divergence, and

ℓ_{2, 1}

cost functions. GSM models using both a regular simplex grid and a grid of points sampled using a uniform Dirichlet distribution (referred to as a big GSM model) were trained to compare performance in linear mixing tasks.

The quality of endmember extraction is measured by the mean spectral angle and the mean endmember RMSE. As Figure 6 indicates, both versions of the GSM outperformed their NMF counterparts. Additionally, we note that for all GSM models, even including

SNR = 0

, all model weights

W_{d m}

for

m > N_{v}

corresponding to non-linear mixing were driven identically to

0.0

. This confirms that, for data sets with purely linear mixing and random noise, the GSM correctly fits a mixing model without introducing unnecessary complexity.

The quality of the unmixing, that is, the estimation of the abundance, performed by each model was evaluated using the mean abundance RMSE. Here, we again see that both versions of the GSM outperformed NMF. To justify that all models were fairly trained to convergence, the data reconstruction RMSE was also computed. This metric uses the trained mixing model to compute the error between the original data set and the reconstructed spectra generated using the extracted endmembers and their abundances. For all GSM and NMF models, the data reconstruction RMSE converged to the level of random noise introduced into the data. These results reflect a fair comparison between the NMF and GSM models.

In Figure 7, we plot the endmembers extracted for the GSM model trained on the synthetic data set with

SNR = 20

.

The extracted endmembers clearly fit the original source spectra while capturing local reflectance features. Furthermore, the GSM precision parameter

β

, which is tuned during model training, provides an assessment of spectral variability due to random noise. In Figure 7, this is indicated by the colored band centered around each extracted spectrum with a width of

2 \sqrt{β^{- 1}}

corresponding to 2 standard deviations. The SNR of 20 added to this example corresponds to zero-mean Gaussian noise with a standard deviation of

σ = 0.0493

. After training, the GSM found

\sqrt{β^{- 1}} = 0.0495

, accurately identifying this introduced noise. The ability to assess the spectral variability of extracted endmembers is a key advantage resulting from the probabilistic formulation of the GSM.

4.2. Non-Linear Mixing: Rhodamine Dye Plume

For the data set of real HSI spectra described in Section 3, 80 GSM models were trained to explore the GSM hyperparameter space. The performance of the model was compared using the BIC, AIC, and reconstruction RMSE with the results of the top 10 performing models shown in Table 1.

As the table indicates, a non-linear GSM with

N_{v} = 3

,

λ_{e} = 0.01

, and

λ_{w} = 1.0

achieved minimum BIC, AIC, and reconstruction RMSE values. The lower value of

λ_{w}

identified from these models reflects the presence of non-linear mixing effects in the HSI data. Although the values of model weights

W_{d m}

for

m > N_{v}

corresponding to non-linear mixing were 0 for the synthetic data set, here these weights obtained a small but non-negligible median value of

0.0012

.

Reflectance spectra generated by the trained GSM corresponding to the maximum abundance values for each endmember are plotted in Figure 8. Based on their signatures, these endmembers are identified with water, vegetation (including near shore, filamentous blue-green algae), and the rhodamine dye plume.

By assigning a unique color to each endmember, the spatial distribution of HSI spectra can be visualized by using estimated abundances to smoothly interpolate between endmember colors as shown in panel b of Figure 8. In other words, this method provides a fuzzy semantic segmentation of HSI pixels. Alternatively, each HSI pixel could be assigned a single class corresponding to the endmember with maximal abundance. In this way, the GSM can be used to visualize high-dimensional HSI spectra.

To showcase the ability of the GSM to identify spatially localized contaminant sources, abundance maps were generated for each individual end member using the trained GTM to unmix all water pixels in the HSI. In Figure 9, these abundance maps are compared for each endmember. The abundance map for the water endmember covers a majority of the center of the pond and decreases near the edge of the rhodamine plume where the dye and water mix. The vegetation in shallow water near the shore and the rhodamine dye plume in the western half of the pond are also clearly identified by their abundance maps. Given an HSI pixel resolution of

0.1 \times 0.1

m^{2}

, the spatial extent of vegetation is estimated to be

378.6

m^{2}

while the plume initially extends across

255.7

m^{2}

.

Applying the trained GSM to unmix HSI from UAV flights provides an efficient way to map the dispersion and transport of contaminant. Figure 10 demonstrates this by mapping the growing extent of rhodamine dye between successive UAV flights. In just 15 min, the plume expands from an initial area of

255.7

m^{2}

to over

571.8

m^{2}

as the dye mixes with the surrounding water.

5. Discussion

The proliferation of hyperspectral imaging technologies in the remote sensing and environmental monitoring communities underscores the need for efficient algorithms that can make sense of high-dimensional HSIs. Of particular interest are unsupervised algorithms, which utilize all available wavelength bins to identify unique source profiles that present realistic scenes. The GSM introduced in this paper is a novel method that simultaneously performs endmember extraction and spectral unmixing. Unlike popular endmember extraction algorithms, such as VCA, PPI, and N-FINDR, GSM does not assume the presence of pure pixels in HSIs. Furthermore, the flexible structure of the mapping from the GSM latent space to the spectral data space allows the GSM to model non-linear mixing effects, distinguishing it from other widely used linear models such as NMF. Being a probabilistic model also enables the GSM to quantify the spectral variability through the precision parameter

β

as indicated in Figure 7. This additional capability is critical for analysis of realistic HSI where sensor noise, viewing geometry, and scene illumination all introduce uncertainty into extracted endmembers.

The growing popularity of deep learning methods has led many to explore applications of deep learning to the spectral unmixing problem. Variations in auto-encoder architectures are a popular choice for both endmember extraction and unmixing. For example, Borsoi et al. used a variational autoencoder to more accurately identify the spectral variability in an LMM [28]. Similarly, Palson et al. introduced an autoencoder with convolution layers to extend the LMM by incorporating spatial context from neighboring HSI pixels [29]. However, the increased complexity of deep neural network approaches makes interpreting the latent space learned by encoder layers challenging and can dramatically increase training time and data volume requirements. In contrast, the latent space of the GSM is immediately interpretable due to the imposed simplex geometry, while the form of

ψ

leaves room to explore a variety of non-linear mixing models. In this paper,

ψ

was constructed to add non-linear effects as additions or perturbations to an underlying linear mixing model, with the

λ_{w}

hyperparameter provided to control the degree of non-linearity considered. In the original GTM formulation that inspired the GSM, the mapping

ψ

strictly uses non-linear features generated by Gaussian RBFs. Alternatively, one could easily construct

ψ

for specific non-linear models such as bilinear mixing by introducing features that depend on higher-order combinations of the latent space (abundance) coordinates. However, it is important to note that the EM algorithm formulated for GSM training is made possible due to the linear combination of activations

ϕ (z)

with model weights

W

. For mixing models that instead depend non-linearly on weights

W

, an EM algorithm may require iterative non-linear solves during each M step.

The main limitation of the GSM is the curse of dimensionality encountered when generating a grid on the

(N_{v} - 1)

-simplex for large numbers of endmembers,

N_{v}

. This can be mitigated by instead randomly sampling points within the interior of the simplex using a uniform Dirichlet distribution to obtain a predetermined number of nodes across the latent space. As the mixing coefficients

π_{k}

are adapted during training, variability in spacing between nodes should not significantly affect the performance of the model. This was confirmed for the simulated data set as shown in Figure 6. In terms of computational efficiency, each expectation step involves

O (K \times N)

operations to update the entries of the responsibility matrix, leading to extended training times for considerably large data sets. This can be addressed by augmenting the EM procedure to use mini-batches of training samples, as outlined by Bishop et al. for the GTM in ref. [47]. Rather than updating the full responsibility matrix during each iteration, a subset of

R

corresponding to a single batch of training data can be evaluated with all other entries kept constant. The GSM may also be extended in other ways, for example, by replacing the precision parameter

β

with a vector

β

of parameters to model wavelength-dependent spectral variability common to many hyperspectral imaging platforms.

Based on the results of applying the GSM to real HSIs as illustrated in Figure 9 and Figure 10, one clear application of the GSM is contaminant identification and water quality assessment. Modern UAV platforms make it possible to rapidly image bodies of water where direct access for in situ data collection is restricted. By equipping UAVs with hyperspectral imagers, the GSM can be used to identify abnormal spectral signatures corresponding to localized contaminant sources. Generating semantic segmentations of collected HSIs as in panel b of Figure 8 makes it easy to compare multiple HSIs without resorting to pseudocolor images generated from a limited number of wavelength bands. Since the GSM models the distribution of all HSI spectra rather than individual parameters, it may also aid in situ data collection by suggesting sampling points that maximize the area traversed in the GSM latent space rather than uniformly sampling across a wide spatial extent. Similar approaches have been developed to guide autonomous data collection vehicles by casting route planning as a prize collecting traveling salesman problem subject to resource constraints [48,49].

As a final consideration, we note that the problem of endmember extraction and spectral unmixing for HSIs is identical to source apportionment in the context of air quality. Here, the goal is to identify measurement profiles associated with sources of ambient air pollution based on measurements at a receptor site. A popular model for source apportionment studies is Positive Matrix Factorization (PMF) introduced by Paatero and Tapper [50,51]. Just like NMF, PMF decomposes measurements into a convex combination of source profiles and relative abundances subject to non-negativity constraints. These linear receptor models assume that the sources are not transformed during transport to the receptor, ignoring changes due to chemical reactions. Non-linear mixing models such as the GSM introduced here may, therefore, prove beneficial in this additional domain.

6. Conclusions

In this paper, we introduced the GSM as a novel method for unsupervised endmember extraction and spectral unmixing. The model was compared to three varieties of NMF on a synthetic data set of linearly mixed spectra from the USGS spectral database. Across all noise levels, the GSM outperformed NMF for both endmember accuracy and abundance estimation. When applied to real HSIs collected over a north Texas pond, the GSM accurately models spectral mixing, including non-linear effects. Rhodamine dye was released into the pond to simulate a localized contaminant source. Endmember abundances estimated by the GSM corresponding to the dye accurately tracked the evolution of the plume as it mixed into the surrounding water. These examples illustrate the power of combining non-linear mixing models together with UAV-based hyperspectral imaging for environmental monitoring and water quality analysis. Future work will further develop the GSM to incorporate wavelength-dependent spectral variability. Additionally, the GSM may prove useful for air quality studies as a non-linear source apportionment model.

7. Patents

A provisional US patent application for this work has been filed on 2024-10-10 entitled Generative Simplex Mapping: Non-Linear Endmember Extraction And Spectral Unmixing For Hyperspectral Imagery, application number 63/705,854.

Author Contributions

Methodology, J.W.; conceptualization, J.W.; software, J.W.; validation, J.W.; formal analysis, J.W.; investigation J.W.; resources, D.J.L.; writing—original draft preparation, J.W.; writing—review and editing, J.W. and D.J.L.; visualization, J.W.; supervision, D.J.L.; project administration, D.J.L.; funding acquisition, D.J.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the following grants: the Texas National Security Network Excellence Fund award for Environmental Sensing Security Sentinels; the SOFWERX award for Machine Learning for Robotic Teams and NSF Award OAC-2115094; support from the University of Texas at Dallas Office of Sponsored Programs, Dean of Natural Sciences and Mathematics, and Chair of the Physics Department is gratefully acknowledged; TRECIS CC* Cyberteam (NSF #2019135); NSF OAC-2115094 Award; and EPA P3 grant number 84057001-0.

Data Availability Statement

The hyperspectral images supporting the conclusions of this article will be made available by the authors on request. An implementation of the GSM model in the Julia programming language is available at https://github.com/john-waczak/GenerativeTopographicMapping.jl (accessed on 21 August 2024).

Acknowledgments

David Shaefer is gratefully acknowledged for his help coordinating UAV flights during HSI collection. We thank Adam Aker, Lakitha Wijeratne, Shawhin Talebi, Ashen Fernando, Prabuddha Dewage, Mazhar Iqbal, Matthew Lary, and Gokul Balagopal for their help deploying the UAV. Don MacLaughlin, Scotty MacLaughlin, and the city of Plano, TX, are gratefully acknowledged for allowing us to deploy the UAV on their property. Christopher Simmons is gratefully acknowledged for his computational support.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

PRISMA	Hyperspectral Precuror of the Application Mission
EnMAP	Environmental Mapping and Analysis Program
PACE	Plankton, Aerosol, Cloud, ocean Ecosystem
CHIME	Copernicus Hyperspectral Imaging Mission for the Environment
NIR	Near Infrared
SWIR	Short-wave Infrared
UAV	Unmanned Aerial Vehicle
HSI	Hyperspectral Image
NDVI	Normalized Difference Vegetation Index
VCA	Vertex Component Analysis
PPI	Pixel Purity Index
LMM	Linear Mixing Model
NMF	Non-negative Matrix Factorization
SOM	Self-Organizing Map
GTM	Generative Topographic Mapping
EM	Expectation-Maximization
GSM	Generative Simplex Mapping
RBF	Radial Basis Function
RMSE	Root Mean Square Error
PCA	Principal Component Analysis
PMF	Positive Matrix Factorization

References

Loizzo, R.; Guarini, R.; Longo, F.; Scopa, T.; Formaro, R.; Facchinetti, C.; Varacalli, G. Prisma: The Italian Hyperspectral Mission. In Proceedings of the IGARSS 2018–2018 IEEE International Geoscience and Remote Sensing Symposium, Valencia, Spain, 22–27 July 2018; pp. 175–178. [Google Scholar] [CrossRef]
Storch, T.; Honold, H.P.; Chabrillat, S.; Habermeyer, M.; Tucker, P.; Brell, M.; Ohndorf, A.; Wirth, K.; Betz, M.; Kuchler, M.; et al. The Enmap Imaging Spectroscopy Mission Towards Operations. Remote Sens. Environ. 2023, 294, 113632. [Google Scholar] [CrossRef]
Gorman, E.; Kubalak, D.A.; Deepak, P.; Dress, A.; Mott, D.B.; Meister, G.; Werdell, J. The NASA Plankton, Aerosol, Cloud, ocean Ecosystem (PACE) mission: An emerging era of global, hyperspectral Earth system remote sensing. In Proceedings of the Sensors, Systems, and Next-Generation Satellites XXIII, Strasbourg, France, 9–12 September 2019; pp. 78–84. [Google Scholar] [CrossRef]
Rast, M.; Nieke, J.; Adams, J.; Isola, C.; Gascon, F. Copernicus Hyperspectral Imaging Mission for the Environment (Chime). In Proceedings of the 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS, Brussels, Belgium, 11–16 July 2021; pp. 157–159. [Google Scholar] [CrossRef]
Adão, T.; Hruška, J.; Pádua, L.; Bessa, J.; Peres, E.; Morais, R.; Sousa, J. Hyperspectral Imaging: A Review on Uav-Based Sensors, Data Processing and Applications for Agriculture and Forestry. Remote Sensing 2017, 9, 1110. [Google Scholar] [CrossRef]
Arroyo-Mora, J.; Kalacska, M.; Inamdar, D.; Soffer, R.; Lucanus, O.; Gorman, J.; Naprstek, T.; Schaaf, E.; Ifimov, G.; Elmer, K.; et al. Implementation of a Uav-Hyperspectral Pushbroom Imager for Ecological Monitoring. Drones 2019, 3, 12. [Google Scholar] [CrossRef]
Thenkabail, P.S.; Smith, R.B.; Pauw, E.D. Hyperspectral Vegetation Indices and Their Relationships With Agricultural Crop Characteristics. Remote Sens. Environ. 2000, 71, 158–182. [Google Scholar] [CrossRef]
Thenkabail, P.S.; Lyon, J.G.; Huete, A. Hyperspectral Indices and Image Classifications for Agriculture and Vegetation; CRC Press: Boca Raton, FL, USA, 2018. [Google Scholar]
van Leeuwen, W.J.; Orr, B.J.; Marsh, S.E.; Herrmann, S.M. Multi-Sensor Ndvi Data Continuity: Uncertainties and Implications for Vegetation Monitoring Applications. Remote Sens. Environ. 2006, 100, 67–81. [Google Scholar] [CrossRef]
Aurin, D.; Mannino, A.; Lary, D.J. Remote Sensing of Cdom, Cdom Spectral Slope, and Dissolved Organic Carbon in the Global Ocean. Appl. Sci. 2018, 8, 2687. [Google Scholar] [CrossRef] [PubMed]
Lary, D.J.; Schaefer, D.; Waczak, J.; Aker, A.; Barbosa, A.; Wijeratne, L.O.H.; Talebi, S.; Fernando, B.; Sadler, J.; Lary, T.; et al. Autonomous Learning of New Environments With a Robotic Team Employing Hyper-Spectral Remote Sensing, Comprehensive In-Situ Sensing and Machine Learning. Sensors 2021, 21, 2240. [Google Scholar] [CrossRef]
Waczak, J.; Aker, A.; Wijeratne, L.O.H.; Talebi, S.; Fernando, A.; Dewage, P.M.H.; Iqbal, M.; Lary, M.; Schaefer, D.; Lary, D.J. Characterizing Water Composition With an Autonomous Robotic Team Employing Comprehensive in Situ Sensing, Hyperspectral Imaging, Machine Learning, and Conformal Prediction. Remote Sens. 2024, 16, 996. [Google Scholar] [CrossRef]
Nascimento, J.; Dias, J. Vertex Component Analysis: A Fast Algorithm To Unmix Hyperspectral Data. IEEE Trans. Geosci. Remote Sens. 2005, 43, 898–910. [Google Scholar] [CrossRef]
Boardman, J.W.; Kruse, F.A.; Green, R.O. Mapping target signatures via partial unmixing of AVIRIS data. In Proceedings of the Summaries of the 15th Annual JPL Airborne Earth Science Workshop, Pasadena, CA, USA, 23–26 January 1995; Volume 1. [Google Scholar]
Winter, M.E. N-FINDR: An algorithm for fast autonomous spectral end-member determination in hyperspectral data. In Proceedings of the SPIE Proceedings, San Diego, CA, USA, 22–25 February 1999. [Google Scholar] [CrossRef]
Keshava, N.; Mustard, J. Spectral Unmixing. IEEE Signal Process. Mag. 2002, 19, 44–57. [Google Scholar] [CrossRef]
Heinz, D.; Chang, C.I.; Althouse, M. Fully constrained least-squares based linear unmixing [hyperspectral image classification]. In Proceedings of the IEEE 1999 International Geoscience and Remote Sensing Symposium, Hamburg, Germany, 28 June–2 July 1999; Volume 2, pp. 1401–1403. [Google Scholar] [CrossRef]
Li, R.; Pan, B.; Xu, X.; Li, T.; Shi, Z. Toward convergence: A gradient-based multiobjective method with greedy hash for hyperspectral unmixing. IEEE Trans. Geosci. Remote Sens. 2023, 61, 1–14. [Google Scholar] [CrossRef]
Lee, D.D.; Seung, H.S. Learning the Parts of Objects By Non-Negative Matrix Factorization. Nature 1999, 401, 788–791. [Google Scholar] [CrossRef] [PubMed]
Feng, X.R.; Li, H.C.; Wang, R.; Du, Q.; Jia, X.; Plaza, A. Hyperspectral Unmixing Based on Nonnegative Matrix Factorization: A Comprehensive Review. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2022, 15, 4414–4436. [Google Scholar] [CrossRef]
Lee, D.; Seung, H.S. Algorithms for Non-negative Matrix Factorization. In Advances in Neural Information Processing Systems; Leen, T., Dietterich, T., Tresp, V., Eds.; MIT Press: Cambridge, MA, USA, 2000; Volume 13. [Google Scholar]
Heylen, R.; Parente, M.; Gader, P. A Review of Nonlinear Hyperspectral Unmixing Methods. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2014, 7, 1844–1868. [Google Scholar] [CrossRef]
Nima, C.; Frette, Ø.; Hamre, B.; Stamnes, J.J.; Chen, Y.C.; Sørensen, K.; Norli, M.; Lu, D.; Xing, Q.; Muyimbwa, D.; et al. Cdom Absorption Properties of Natural Water Bodies Along Extreme Environmental Gradients. Water 2019, 11, 1988. [Google Scholar] [CrossRef]
Gilerson, A.; Zhou, J.; Hlaing, S.; Ioannou, I.; Schalles, J.; Gross, B.; Moshary, F.; Ahmed, S. Fluorescence Component in the Reflectance Spectra From Coastal Waters. Dependence on Water Composition. Opt. Express 2007, 15, 15702. [Google Scholar] [CrossRef]
Witte, W.G.; Whitlock, C.H.; Harriss, R.C.; Usry, J.W.; Poole, L.R.; Houghton, W.M.; Morris, W.D.; Gurganus, E.A. Influence of Dissolved Organic Materials on Turbid Water Optical Properties and Remote-sensing Reflectance. J. Geophys. Res. Ocean. 1982, 87, 441–446. [Google Scholar] [CrossRef]
Su, Y.; Marinoni, A.; Li, J.; Plaza, A.; Gamba, P. Nonnegative sparse autoencoder for robust endmember extraction from remotely sensed hyperspectral images. In Proceedings of the 2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS), Fort Worth, TX, USA, 23–28 July 2017. [Google Scholar] [CrossRef]
Su, Y.; Li, J.; Plaza, A.; Marinoni, A.; Gamba, P.; Chakravortty, S. Daen: Deep Autoencoder Networks for Hyperspectral Unmixing. IEEE Trans. Geosci. Remote Sens. 2019, 57, 4309–4321. [Google Scholar] [CrossRef]
Borsoi, R.A.; Imbiriba, T.; Bermudez, J.C.M. Deep Generative Endmember Modeling: An Application To Unsupervised Spectral Unmixing. IEEE Trans. Comput. Imaging 2020, 6, 374–384. [Google Scholar] [CrossRef]
Palsson, B.; Ulfarsson, M.O.; Sveinsson, J.R. Convolutional Autoencoder for Spectral-Spatial Hyperspectral Unmixing. IEEE Trans. Geosci. Remote Sens. 2021, 59, 535–549. [Google Scholar] [CrossRef]
Kohonen, T. The Self-Organizing Map. Proc. IEEE 1990, 78, 1464–1480. [Google Scholar] [CrossRef]
Cantero, M.C.; Perez, R.M.; Martinez, P.J.; Aguilar, P.L.; Plaza, J.; Plaza, A. Analysis of the behavior of a neural network model in the identification and quantification of hyperspectral signatures applied to the determination of water quality. In Proceedings of the SPIE Proceedings, San Diego, CA, USA, 14–19 February 2004; pp. 174–185. [Google Scholar] [CrossRef]
Duran, O.; Petrou, M. A Time-Efficient Method for Anomaly Detection in Hyperspectral Images. IEEE Trans. Geosci. Remote Sens. 2007, 45, 3894–3904. [Google Scholar] [CrossRef]
Ceylan, O.; Taskin, G. Feature Selection Using Self Organizing Map Oriented Evolutionary Approach. In Proceedings of the 2021 IEEE International Geoscience and Remote Sensing Symposium IGARSS, Brussels, Belgium, 11–16 July 2021. [Google Scholar] [CrossRef]
Riese, F.M.; Keller, S.; Hinz, S. Supervised and Semi-Supervised Self-Organizing Maps for Regression and Classification Focusing on Hyperspectral Data. Remote Sens. 2019, 12, 7. [Google Scholar] [CrossRef]
Richardson, A.; Risien, C.; Shillington, F. Using Self-Organizing Maps To Identify Patterns in Satellite Imagery. Prog. Oceanogr. 2003, 59, 223–239. [Google Scholar] [CrossRef]
Bishop, C.M.; Svensén, M.; Williams, C.K.I. Gtm: The Generative Topographic Mapping. Neural Comput. 1998, 10, 215–234. [Google Scholar] [CrossRef]
Waczak, J.; Aker, A.; Wijeratne, L.O.H.; Talebi, S.; Fernando, A.; Dewage, P.M.H.; Iqbal, M.; Lary, M.; Schaefer, D.; Balagopal, G.; et al. Unsupervised Characterization of Water Composition With Uav-Based Hyperspectral Imaging and Generative Topographic Mapping. Remote Sens. 2024, 16, 2430. [Google Scholar] [CrossRef]
Bishop, C.M. Pattern Recognition and Machine Learning (Information Science and Statistics); Springer: Berlin/Heidelberg, Germany, 2006; pp. 435–455. [Google Scholar]
Waczak, J. GenerativeTopographicMapping.jl; Zenodo: Geneva, Switzerland, 2024. [Google Scholar] [CrossRef]
Bezanson, J.; Karpinski, S.; Shah, V.B.; Edelman, A. Julia: A Fast Dynamic Language for Technical Computing. arXiv 2012, arXiv:1209.5145. [Google Scholar] [CrossRef]
Blaom, A.D.; Kiraly, F.; Lienart, T.; Simillides, Y.; Arenas, D.; Vollmer, S.J. Mlj: A Julia Package for Composable Machine Learning. arXiv 2020, arXiv:2007.12285. [Google Scholar] [CrossRef]
Kokaly, R.F.; Clark, R.N.; Swayze, G.A.; Livo, K.E.; Hoefen, T.M.; Pearson, N.C.; Wise, R.A.; Benzel, W.; Lowers, H.A.; Driscoll, R.L.; et al. USGS Spectral Library Version 7; Technical Report; US Geological Survey: Reston, VA, USA, 2017. [CrossRef]
Kong, D.; Ding, C.; Huang, H. Robust nonnegative matrix factorization using L21-norm. In Proceedings of the 20th ACM International Conference on Information and Knowledge Management, Glasgow, UK, 24–28 October 2011. [Google Scholar] [CrossRef]
Muller, R.; Lehner, M.; Muller, R.; Reinartz, P.; Schroeder, M.; Vollmer, B. A program for direct georeferencing of airborne and spaceborne line scanner images. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2002, 34, 148–153. [Google Scholar]
Ruddick, K.G.; Voss, K.; Banks, A.C.; Boss, E.; Castagna, A.; Frouin, R.; Hieronymi, M.; Jamet, C.; Johnson, B.C.; Kuusk, J.; et al. A Review of Protocols for Fiducial Reference Measurements of Downwelling Irradiance for the Validation of Satellite Remote Sensing Data Over Water. Remote Sens. 2019, 11, 1742. [Google Scholar] [CrossRef]
McFeeters, S.K. The Use of the Normalized Difference Water Index (NDWI) in the Delineation of Open Water Features. Int. J. Remote Sens. 1996, 17, 1425–1432. [Google Scholar] [CrossRef]
Bishop, C.M.; Svensén, M.; Williams, C.K. Developments of the Generative Topographic Mapping. Neurocomputing 1998, 21, 203–224. [Google Scholar] [CrossRef]
Balas, E. The prize collecting traveling salesman problem and its applications. In The Traveling Salesman Problem and Its Variations; Springer: Berlin/Heidelberg, Germany, 2007; pp. 663–695. [Google Scholar]
Suryan, V.; Tokekar, P. Learning a Spatial Field in Minimum Time With a Team of Robots. IEEE Trans. Robot. 2020, 36, 1562–1576. [Google Scholar] [CrossRef]
Paatero, P.; Tapper, U. Positive Matrix Factorization: A Non-negative Factor Model With Optimal Utilization of Error Estimates of Data Values. Environmetrics 1994, 5, 111–126. [Google Scholar] [CrossRef]
Ulbrich, I.M.; Canagaratna, M.R.; Zhang, Q.; Worsnop, D.R.; Jimenez, J.L. Interpretation of Organic Components From Positive Matrix Factorization of Aerosol Mass Spectrometric Data. Atmos. Chem. Phys. 2009, 9, 2891–2918. [Google Scholar] [CrossRef]

Figure 1. Illustration of the non-linear activation functions for a 2-component GSM. The simplex vertices are shown as green dots (

μ_{1}

and

μ_{2}

) and three RBF centers are shown in red (

μ_{1}

,

μ_{2}

, and

μ_{3}

). With no RBF centers at the vertices, we guarantee no non-linear contributions occur for pure endmember spectra.

Figure 1. Illustration of the non-linear activation functions for a 2-component GSM. The simplex vertices are shown as green dots (

μ_{1}

and

μ_{2}

) and three RBF centers are shown in red (

μ_{1}

,

μ_{2}

, and

μ_{3}

). With no RBF centers at the vertices, we guarantee no non-linear contributions occur for pure endmember spectra.

Figure 2. Illustration of the GSM. The latent space consists of a grid of K-many points (green dots) distributed throughout a simplex with

N_{v}

vertices. Barycentric coordinates of each node in the simplex correspond to the relative abundance of

N_{v}

-many unique sources. Here,

N_{v} = 3

has been chosen for illustrative purposes. Nodes are mapped into the data space via the map

ψ (z)

utilizing M-many radially symmetric basis functions (red). Spectral variability is estimated via the precision parameter,

β

, shown here in the data space as a light blue band around the spectrum given by

ψ (z)

.

Figure 2. Illustration of the GSM. The latent space consists of a grid of K-many points (green dots) distributed throughout a simplex with

N_{v}

vertices. Barycentric coordinates of each node in the simplex correspond to the relative abundance of

N_{v}

-many unique sources. Here,

N_{v} = 3

has been chosen for illustrative purposes. Nodes are mapped into the data space via the map

ψ (z)

utilizing M-many radially symmetric basis functions (red). Spectral variability is estimated via the precision parameter,

β

, shown here in the data space as a light blue band around the spectrum given by

ψ (z)

.

Figure 3. Synthetic data set formed from USGS spectra. (a) Spectra from the USGS spectral database used as the ground truth endmembers. These spectra were selected following the example in ref. [13]. (b) The abundance distribution sampled for in the data set. Samples were generated from a Dirichlet distribution with

α_{1} = α_{2} = α_{3} = 1 / 3

.

Figure 3. Synthetic data set formed from USGS spectra. (a) Spectra from the USGS spectral database used as the ground truth endmembers. These spectra were selected following the example in ref. [13]. (b) The abundance distribution sampled for in the data set. Samples were generated from a Dirichlet distribution with

α_{1} = α_{2} = α_{3} = 1 / 3

.

Figure 4. Real HSI data set. (a) The UAV used to collect hyperspectral images. (b) The Resonon Pika XC2 hyperspectral imager used to acquire HSI. (c) A sample hyperspectral data cube. Spectra are plotted for each pixel at their geographic position. The log10-reflectance is colored along the z axis, and a pseudocolor image is shown on top of the data cube. The signature of the rhodamine dye plume is clearly identifiable in the water.

Figure 5. Explained variance of PCA components for the real HSI data set. A red horizontal line is superimposed on the graph, marking an explained variance of

1 %

. All components past the fourth explain less than

1 %

of the observed variance.

Figure 5. Explained variance of PCA components for the real HSI data set. A red horizontal line is superimposed on the graph, marking an explained variance of

1 %

. All components past the fourth explain less than

1 %

of the observed variance.

Figure 6. Comparison of GSM against NMF on simulated linear mixing data set using USGS spectra. (a) The mean spectral angle computed between extracted endmembers and original endmembers. (b) The mean RMSE computed between extracted endmembers and original endmembers. (c) The mean abundance RMSE computed between original abundance data for each endmember and extracted abundances. (d) The reconstruction RMSE, which evaluates the quality of fit. All models realized similar values, reflecting convergence of the models to the level of random noise introduced into the data.

Figure 7. Endmembers extracted by the GSM for the simulated linear mixing data set with SNR

= 20

. The dashed lines correspond to original endmember spectra from the USGS spectral database. Solid lines superimposed on the plot indicate the extracted endmember spectra. Colored bands are included around each spectrum corresponding to the spectral variability estimated by the GSM precision parameter

β

where the band width is

2 \sqrt{β^{- 1}}

corresponding to 2 standard deviations.

Figure 7. Endmembers extracted by the GSM for the simulated linear mixing data set with SNR

= 20

. The dashed lines correspond to original endmember spectra from the USGS spectral database. Solid lines superimposed on the plot indicate the extracted endmember spectra. Colored bands are included around each spectrum corresponding to the spectral variability estimated by the GSM precision parameter

β

where the band width is

2 \sqrt{β^{- 1}}

corresponding to 2 standard deviations.

Figure 8. GSM applied to water spectra from real HSI data set: (a) Spectra generated by the trained GSM for samples with maximum abundance for each endmember. Based on these spectral profiles, endmembers are identified with water, near-shore vegetation, and rhodamine dye. (b) An HSI segmented according to the relative abundance of each endmember. Each water pixel is colored by smoothing interpolating between red, green, and blue colors using the relative abundance estimated for rhodamine, vegetation, and water spectra. The rhodamine plume is clearly identifiable in the western portion of the HSI.

Figure 9. Endmember abundance distributions: (a) The spatial distribution of abundance for the water class. This source dominates in the center of the pond and decreases towards the shore where vegetation begins to dominate the reflectance signal. The water endmember abundance is also observed to decrease near the edge of the rhodamine plume reflecting dye mixing and diffusion. (b) The spatial distribution of vegetation. This endmember includes filamentous blue-green algae observed to accumulate in shallow waters near the shore. (c) The rhodamine dye plume extent segmented from the HSI. The total areas for near-shore vegetation and rhodamine are estimated to be 378.6 m² and 255.7 m², respectively.

Figure 10. Rhodamine plume evolution: Using the trained GSM we can track the dispersion of the rhodamine dye plume between successive drone flights. (a) The initial plume distribution after release. Here, the dye subsumes an area of

255.7

m^{2}

. (b) The same plume imaged 15 min later now extends across an area of

571.8

m^{2}

.

Figure 10. Rhodamine plume evolution: Using the trained GSM we can track the dispersion of the rhodamine dye plume between successive drone flights. (a) The initial plume distribution after release. Here, the dye subsumes an area of

255.7

m^{2}

. (b) The same plume imaged 15 min later now extends across an area of

571.8

m^{2}

.

Table 1. GSM hyperparameter optimization for the real HSI data set: Multiple GSM models were trained to explore the impact of model hyperparameters.

N_{v}

values from 3 to 6 were explored as suggested by the PCA decomposition of the data set.

λ_{e}

was varied from

0.001

to

1.0

and

λ_{w}

ranged from 1 to 1000. Here, we report the top 10 models ranked by increasing BIC. The AIC and reconstruction RMSE are also included for comparison.

Table 1. GSM hyperparameter optimization for the real HSI data set: Multiple GSM models were trained to explore the impact of model hyperparameters.

N_{v}

values from 3 to 6 were explored as suggested by the PCA decomposition of the data set.

λ_{e}

was varied from

0.001

to

1.0

and

λ_{w}

ranged from 1 to 1000. Here, we report the top 10 models ranked by increasing BIC. The AIC and reconstruction RMSE are also included for comparison.

$N_{v}$	$λ_{e}$	$λ_{w}$	BIC	AIC	Reconstruction RMSE
3	$0.01$	$1.0$	$- 6.195 \times 10^{7}$	$- 6.269 \times 10^{7}$	$0.000989$
3	$0.001$	$1.0$	$- 6.194 \times 10^{7}$	$- 6.268 \times 10^{7}$	$0.000989$
3	$0.1$	$1.0$	$- 6.192 \times 10^{7}$	$- 6.265 \times 10^{7}$	$0.000991$
3	$1.0$	$1.0$	$- 6.186 \times 10^{7}$	$- 6.260 \times 10^{7}$	$0.001190$
4	$1.0$	$1.0$	$- 6.181 \times 10^{7}$	$- 6.255 \times 10^{7}$	$0.001002$
4	$0.1$	$1.0$	$- 6.175 \times 10^{7}$	$- 6.249 \times 10^{7}$	$0.001008$
4	$0.01$	$1.0$	$- 6.173 \times 10^{7}$	$- 6.247 \times 10^{7}$	$0.001009$
4	$0.001$	$1.0$	$- 6.173 \times 10^{7}$	$- 6.247 \times 10^{7}$	$0.001009$
4	$0.1$	$10.0$	$- 6.171 \times 10^{7}$	$- 6.245 \times 10^{7}$	$0.001011$
3	$1.0$	$10.0$	$- 6.166 \times 10^{7}$	$- 6.239 \times 10^{7}$	$0.001014$

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Waczak, J.; Lary, D.J. Generative Simplex Mapping: Non-Linear Endmember Extraction and Spectral Unmixing for Hyperspectral Imagery. Remote Sens. 2024, 16, 4316. https://doi.org/10.3390/rs16224316

AMA Style

Waczak J, Lary DJ. Generative Simplex Mapping: Non-Linear Endmember Extraction and Spectral Unmixing for Hyperspectral Imagery. Remote Sensing. 2024; 16(22):4316. https://doi.org/10.3390/rs16224316

Chicago/Turabian Style

Waczak, John, and David J. Lary. 2024. "Generative Simplex Mapping: Non-Linear Endmember Extraction and Spectral Unmixing for Hyperspectral Imagery" Remote Sensing 16, no. 22: 4316. https://doi.org/10.3390/rs16224316

APA Style

Waczak, J., & Lary, D. J. (2024). Generative Simplex Mapping: Non-Linear Endmember Extraction and Spectral Unmixing for Hyperspectral Imagery. Remote Sensing, 16(22), 4316. https://doi.org/10.3390/rs16224316

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Generative Simplex Mapping: Non-Linear Endmember Extraction and Spectral Unmixing for Hyperspectral Imagery

Abstract

1. Introduction

2. Generative Simplex Mapping

3. Experiments

3.1. Linear Mixing: Comparison to NMF

3.2. Non-Linear Mixing: Water Contaminant Identification

4. Results

4.1. Linear Mixing

4.2. Non-Linear Mixing: Rhodamine Dye Plume

5. Discussion

6. Conclusions

7. Patents

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI