To address the issues of data redundancy, low classification accuracy, and high time complexity when using HSI and LiDAR data together, we propose the PRDRMF method. To assess the efficacy of the proposed method, experiments were conducted on four publicly available HSI and LiDAR datasets. First, the experimental environment and evaluation metrics are described in detail. Next, detailed information about the four datasets used in the experiments is provided. Finally, experiments on the four datasets compare the classification accuracy and time complexity of the proposed method with those of existing methods. Additionally, ablation experiments demonstrate the contributions of the individual data sources and of the different modules in the proposed method.
The experiments use unified evaluation metrics to assess the classification results: overall accuracy (OA), average accuracy (AA), and the Kappa coefficient. The experimental environment consists of MATLAB 2021a, Keras 2.3.1, an Nvidia RTX 3050 GPU, and 32 GB of memory.
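For reference, the sketch below shows one minimal way to compute these three metrics from ground-truth and predicted label vectors (Python/NumPy; the function name and interface are illustrative assumptions, not code from the paper):

```python
import numpy as np

def classification_metrics(y_true, y_pred, n_classes):
    """OA, AA, and Kappa from integer label vectors in [0, n_classes)."""
    # Confusion matrix: C[i, j] = number of class-i samples predicted as j.
    C = np.zeros((n_classes, n_classes), dtype=np.int64)
    np.add.at(C, (y_true, y_pred), 1)
    total = C.sum()

    # OA: fraction of all test samples that are classified correctly.
    oa = np.trace(C) / total

    # AA: mean of the per-class accuracies, weighting every class equally.
    per_class = np.diag(C) / np.maximum(C.sum(axis=1), 1)
    aa = per_class.mean()

    # Kappa: agreement corrected for chance, (p_o - p_e) / (1 - p_e),
    # where p_e is the expected agreement from the row/column marginals.
    p_e = (C.sum(axis=0) * C.sum(axis=1)).sum() / total**2
    kappa = (oa - p_e) / (1 - p_e)
    return oa, aa, kappa
```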
3.3. Comparison Experiment and Analysis
To validate the performance of the PRDRMF method, we conducted experiments on the four datasets and compared the results with those of existing methods, including SVM [56], CCNN, EndNet [57], CRNN [58], TBCNN, coupled CNN [21], CNNMRF [59], FusAtNet [60], S2ENet [61], CALC [62], Fusion-HCT [63], SepG-ResNet50 [64], and DSMSC2N [65]. For these methods, the parameter settings are described in the corresponding references.
To ensure the fairness of the experimental results, the training and test samples were kept consistent across all methods; the sample divisions are given in Table 2, Table 3, Table 4 and Table 5. Table 6, Table 7, Table 8, Table 9, Table 10, Table 11, Table 12 and Table 13 list the OA, AA, and Kappa values obtained by the different methods on the 2013 Houston, MUUFL, Trento, and 2018 Houston datasets, with the optimal values shown in bold. To visualize the classification results of the compared methods, Figure 6, Figure 7, Figure 8 and Figure 9 show the classification maps obtained by the different methods on the 2013 Houston, MUUFL, Trento, and 2018 Houston datasets. For comparison, the HSI-generated pseudo-color images, the original LiDAR maps, and the ground truth maps are also presented.
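To make this fixed-split protocol concrete, the following sketch draws the per-class training indices once, under a fixed random seed, and reuses them for every compared method; the function name, per-class count, and seed are illustrative assumptions rather than the paper's actual sampling code:

```python
import numpy as np

def stratified_split(labels, n_train_per_class=50, seed=0):
    """Return (train_idx, test_idx) with identical samples for all methods.

    labels: 1-D array of ground-truth class indices, with unlabeled
    pixels already removed. The fixed seed guarantees that every
    compared method is trained and tested on exactly the same pixels.
    """
    rng = np.random.default_rng(seed)
    train_idx, test_idx = [], []
    for c in np.unique(labels):
        idx = rng.permutation(np.flatnonzero(labels == c))
        train_idx.append(idx[:n_train_per_class])
        test_idx.append(idx[n_train_per_class:])
    return np.concatenate(train_idx), np.concatenate(test_idx)
```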
The 2013 Houston dataset offers the broadest coverage of urban scenes with multiple feature classes and is primarily utilized to validate the performance of the PRDRMF method for the detailed classification of urban scenes in remote sensing imagery.
On the 2013 Houston dataset, the OA value of the PRDRMF method reached 99.79%. In particular, this OA value was 40.39%, 12.87%, 11.27%, 11.24%, 10.88%, 9.36%, 8.86%, 9.18%, 5.60%, 5.08%, 0.03%, 27.12%, and 8.30% higher than that of SVM, CCNN, EndNet, CRNN, TBCNN, coupled CNN, CNNMRF, FusAtNet, S2ENet, CALC, Fusion-HCT, SepG-ResNet50, and DSMSC2N, respectively. Furthermore, the classification accuracy was 100% for the healthy grass, stressed grass, and artificial grass categories, as well as for highway, railway, parking lot 1, and parking lot 2. This suggests that the multifeature extraction module provides texture information from diverse viewpoints, enabling PRDRMF to perform exceptionally well on these similar categories.
As illustrated in Figure 6d, the conventional machine learning algorithm SVM exhibits the lowest classification accuracy and yields markedly inadequate results for four categories, including highway and parking lot 1. Figure 6e shows that the highway and railway categories are misclassified. This is due to the CCNN method's inability to deal effectively with heterogeneous regions: its exclusive focus on spatial neighborhood information leads to suboptimal classification performance on the various road categories. EndNet achieves 100% classification accuracy on two categories, artificial grass and tennis court. In Figure 6g–j, it can be seen that these CNN-based methods achieve higher per-category classification accuracy, reaching 100% on several categories and recognizing the tennis court category particularly well. Notably, CRNN, TBCNN, and CNNMRF exhibited limited performance in recognizing the road and highway categories, while coupled CNN achieved a classification accuracy of only 41.11% for the water category. Compared to the aforementioned methods, the classification performance of FusAtNet shows a slight improvement; however, the results are still unsatisfactory for highways. In contrast, S2ENet demonstrates a significant enhancement in classification accuracy, achieving a perfect score of 100% for the stressed grass, artificial grass, water, tennis court, and runway categories. CALC achieved a classification accuracy of over 90% in 14 categories, the exception being the road category, where the accuracy requires improvement. The Transformer-based method Fusion-HCT classifies effectively, attaining 100% accuracy in multiple categories, whereas SepG-ResNet50 has difficulty discerning between similar categories, such as roads and grasses. DSMSC2N attains 100% accuracy in two categories, soil and tennis court, although there is scope for improvement in its recognition of highway. In the upper right corner of Figure 6q, it can be observed that the healthy grass region contains almost no internal noise, in contrast to Figure 6d–p. This is attributed to PRDRMF's use of PRTV, which eliminates redundant information.
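PRTV itself is defined in the method section of this paper; purely as a conceptual stand-in, the sketch below applies a generic total-variation smoothing step to an HSI cube to suggest how texture-level redundancy and noise can be suppressed while region boundaries are preserved (denoise_tv_chambolle from scikit-image is an assumed substitute here, not the paper's PRTV implementation):

```python
from skimage.restoration import denoise_tv_chambolle

def smooth_hsi_cube(hsi, weight=0.1):
    """Total-variation smoothing of an H x W x B cube scaled to [0, 1].

    A larger weight removes more fine texture (and noise) at the cost
    of structural detail; channel_axis=-1 smooths the spatial dimensions
    of every spectral band jointly.
    """
    return denoise_tv_chambolle(hsi, weight=weight, channel_axis=-1)
```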
The MUUFL dataset depicts small-scale neighborhood scenes within a city and is primarily utilized to validate the classification effectiveness of the PRDRMF method in localized, everyday scenes.
On the MUUFL dataset, the OA value of the PRDRMF method reached 92.21%. In particular, this OA value was 87.74%, 3.25%, 4.46%, 0.83%, 1.36%, 1.28%, 3.27%, 0.73%, 0.53%, 9.30%, 4.78%, 9.31%, and 1.04% higher than that of SVM, CCNN, EndNet, CRNN, TBCNN, coupled CNN, CNNMRF, FusAtNet, S2ENet, CALC, Fusion-HCT, SepG-ResNet50, and DSMSC2N, respectively. Furthermore, for the sidewalk and grass categories, on which all other methods performed suboptimally, the PRDRMF method achieved superior results, with accuracies of 84.40% and 92.21%, respectively. Significant misclassification is clearly evident in Figure 7d; this is attributed to the absence of spatial information in SVM and its vulnerability to noise, leading to lower classification accuracy on the MUUFL dataset. In Figure 7e, it can be seen that the sidewalk and the yellow roadside markings are misclassified as mud and sandy ground. This misclassification occurs because the CCNN method handles heterogeneous regions inadequately; since it considers only spatial neighborhood information, it struggles to discriminate accurately between similar categories. From Figure 7f, it can be observed that there are numerous noise points in the EndNet map. This is attributed to the limited learning capability of the encoder- and decoder-based feature representations in EndNet, which hinders their ability to counteract noise interference. CRNN exhibits low classification accuracy for the mostly grass category but achieves the highest classification accuracy, up to 96.97%, for the yellow roadside markings category. TBCNN, coupled CNN, CNNMRF, and other CNN-based methods consider both spatial and spectral information, reduce noise, and achieve higher classification accuracy in pixel-level remote sensing image scenes, demonstrating the highest classification accuracy in several categories. However, their focus on spectral feature similarity causes the elevation features of ground objects to be ignored, with the consequence that mostly grass is misclassified as trees. The OA of CALC is lower than that of all the aforementioned CNN-based methods. In comparison, FusAtNet and S2ENet demonstrate increased overall classification accuracy; however, their accuracy is markedly deficient for the roadside yellow curb category. The Transformer-based Fusion-HCT method exhibits suboptimal performance, particularly in identifying pavement. PRDRMF demonstrates the most accurate classification outcomes. A comparison with Figure 7e–p reveals that Figure 7q, which is most similar to the ground truth map, exhibits less classification noise and clearer boundaries. This is attributed to PRDRMF's use of PRTV, which eliminates redundant information.
The Trento dataset showcases farm scenes with fewer crop classes and is mainly utilized to validate the performance of the PRDRMF method for precise agricultural classification over extensive, relatively standardized areas.
On the Trento dataset, the OA value of the PRDRMF method reached 99.73%. In particular, this OA value was improved by 26.84%, 2.44%, 5.56%, 2.51%, 2.27%, 2.04%, 1.33%, 0.67%, 1.19%, 0.35%, 0.13%, 5.91%, and 0.80% compared to SVM, CCNN, EndNet, CRNN, TBCNN, coupled CNN, CNNMRF, FusAtNet, S2ENet, CALC, Fusion-HCT, SepG-ResNet50, and DSMSC2N, respectively. Furthermore, the classification accuracy was 100% for two similar categories, namely woods and vineyards. This suggests that the multifeature extraction module provides texture information from diverse viewpoints, enabling PRDRMF to perform exceptionally well on these similar categories. The large apple tree orchard depicted in Figure 8d is erroneously classified as vineyard because SVM lacks spatial information and is susceptible to noise interference, which can lead to misclassification. From Figure 8f, it can be seen that the total classification accuracy of EndNet is higher only than that of SVM; the many noise points in the map stem from the limited learning ability of the encoder- and decoder-based feature representation in EndNet, which hinders its effectiveness in resisting noise interference. In Figure 8e, it can be seen that the sidewalk and the yellow roadside markers are misclassified as mud and sand. This is due to the limitations of the CCNN method in handling heterogeneous regions; because it considers only spatial neighborhood information, it has difficulty discriminating accurately between similar categories. In Figure 8e,g–j, it can be seen that the total classification accuracies of these CNN-based methods are significantly higher. Specifically, CRNN and TBCNN achieve 100% classification accuracy for both the ground and vineyard categories, while CNNMRF demonstrates the highest classification accuracy for the three categories of apple trees, woods, and vineyard. Compared to the aforementioned methods, FusAtNet and S2ENet show a slight improvement, achieving 100% accuracy in classifying the ground and woods categories, respectively. The classification maps produced by CALC and Fusion-HCT are of superior quality, exhibiting minimal noise points. In comparison, the DSMSC2N classification map displays a few noise points within the apple trees category, while the SepG-ResNet50 map is of inferior quality, displaying significant noise. In addition, there is noticeable noise in the extensive apple tree orchard depicted in Figure 8e–j,o. However, in Figure 8q, the large, well-maintained farm scene displays distinct boundaries with minimal noise. This is attributed to PRDRMF's use of PRTV, which eliminates redundant information; only a slight banding misclassification within the ground category persists.
The 2018 Houston dataset is a selection of the same seven categories as the 2013 Houston dataset, showing areas of the same location at a different point in time, and is mainly used as a multi-temporal complement to the 2013 Houston dataset. The study of this dataset demonstrates the applicability of the proposed method to multi-temporal data.
On the 2018 Houston dataset, the OA value of the PRDRMF method reached 96.93%. In particular, this OA value was 15.44%, 6.84%, 6.21%, 5.77%, 5.72%, 4.72%, 4.58%, 5.35%, 2.34%, 2.13%, 0.25%, 8.63%, and 3.38% higher than that of SVM, CCNN, EndNet, CRNN, TBCNN, coupled CNN, CNNMRF, FusAtNet, S2ENet, CALC, Fusion-HCT, SepG-ResNet50, and DSMSC2N, respectively. In addition, PRDRMF achieved 100% classification accuracy on the water category, better than its recognition of water on the 2013 Houston dataset. This is probably because the 2018 Houston dataset has only seven categories and the water category accounts for a larger share of the total samples; the simplified category set also makes the different target categories easier to distinguish.
As shown in Figure 9d, the traditional machine learning algorithm SVM has the lowest classification accuracy and yields extremely poor results for three categories, including grass healthy and residential buildings. In Figure 9e, it can be seen that the road and non-residential buildings categories are misclassified, which is due to the shortcomings of the CCNN method in dealing with heterogeneous regions and its poor classification performance on spatially neighboring categories. EndNet performs well only on the grass stressed and non-residential buildings categories; its performance on the rest needs improvement. In Figure 9g–j, it can be seen that these CNN-based methods achieve higher per-category classification accuracy, with all of them reaching 100% on the water category. TBCNN recognizes the grass healthy category well, and CNNMRF recognizes the trees category well, with a classification accuracy of 97.37%. FusAtNet, on the other hand, does not perform as well on the trees category. The improvements achieved by the above methods are modest, whereas S2ENet improves classification accuracy considerably, reaching 94.59%. CALC achieves more than 80% accuracy for eight categories. The Transformer-based Fusion-HCT method performs well and is the best among all methods for the residential buildings category. SepG-ResNet50 has a low overall classification accuracy but achieves the highest accuracy in the grass stressed category, at 98.59%. DSMSC2N classifies both the residential buildings and non-residential buildings categories with superior accuracy. Notably, the identification of grass healthy was poor for all methods on the 2018 Houston dataset, unlike on the 2013 Houston dataset. This may be because the total data volume was smaller, which weakened PRDRMF's recognition of specific categories.
Overall, it can be observed from the figures that the other methods exhibit significant noise on the four datasets and do not accurately identify the object types. In contrast, the PRDRMF method produces classification maps with fewer mislabeled pixels across the four datasets and with clearer boundaries that closely align with the corresponding ground truth. Among all the compared methods, PRDRMF demonstrates the best classification performance and is highly competitive in feature recognition when utilizing HSI and LiDAR data simultaneously.