1. Introduction
Forest height is one of the important components of global forest-resource surveying and is of great significance for the study of the global carbon cycle and the estimation of forest biomass [
1,
2,
3,
4]. Accurate and rapid acquisition of large-scale forest height data contributes positively to large-scale studies of forest biomass acquisition, the water cycle, and atmospheric movement [
5,
6,
7,
8,
9]. Polarimetric interferometric synthetic aperture radar (PolInSAR) introduces polarization measurement technology on the basis of the traditional InSAR technology. PolInSAR not only has the characteristic of InSAR technology, in that it is sensitive to the height and position of the target scatterers, but also has the characteristic of being sensitive to the biophysical properties and structure of vegetation. As a result, PolInSAR can be used to distinguish the different scattering mechanisms within the same resolution unit, thus possessing the ability to effectively distinguish the scattering phase centers of the ground surface and the canopy [
10,
11,
12], and has been widely used in forest height estimation.
The key to accurate forest height inversion based on PolInSAR is to find the polarization modes corresponding to “pure body scattering” and “pure ground scattering”. However, these two polarization modes cannot be obtained directly from PolInSAR observations due to the depolarization effect of the forest canopy. In view of this, Treuhaft et al. [
10,
11,
12,
13,
14] proposed the random volume over ground (RVoG) model, which establishes the relationship between the PolInSAR complex coherence coefficients, forest height, the extinction coefficient, and other forest biophysical parameters. At present, the RVoG model and its improved models are the most widely used models in the field of PolInSAR forest height inversion. In 2001, based on the single-baseline PolInSAR configuration, Papathanassiou and Cloude [
12] established the correlation between polarization complex coherence and the RVoG model and generalized the inversion of forest height and other parameters based on the RVoG model into a six-dimensional nonlinear system of equations, laying the foundation for the inversion of the forest parameters of PolInSAR. However, the solving of the nonlinear system of equations faces problems, such as the iterative initial value being difficult to determine and the results easily falling into local optima. Therefore, subsequent researchers have successively proposed a series of intelligent optimization algorithms, such as genetic algorithms, back propagation (BP) neural network algorithms, etc. [
15,
16,
17,
18]. However, these intelligent optimization algorithms, although they have improved the accuracy of the parameter inversion when compared to the traditional nonlinear iterative algorithms, still cannot easily overcome the defects of the nonlinear solving. To address this problem, in 2003, Cloude and Papathanassiou [
19] proposed a three-stage method to invert forest height based on the geometric characteristics of the RVoG model expressed as a straight line in the unit circle of the complex plane. However, it has been shown that the three-stage method depends on the accuracy of the straight-line fitting, and high noise causes the accuracy of the straight-line fitting to decrease, resulting in failure of the forest height inversion [
20,
21,
22,
23,
24]. It can be said that the six-dimensional nonlinear iterative algorithm and the three-stage algorithm are the two most classical single-baseline algorithms in the field of PolInSAR forest height inversion, and they have achieved good forest height inversion results in forest scenarios with a uniform canopy and a flat understory topography [
25,
26,
27]. However, for most forest scenarios, the forest height inversion accuracy of these two classic algorithms is still insufficient, mainly due to the fact that the PolInSAR observation information provided by the single-baseline configuration is not enough to support unknown parameter inversion based on the RVoG model, and it is necessary to introduce unknown parameter a priori information in order to solve the problem.
Compared to single-baseline PolInSAR forest height inversion, multi-baseline PolInSAR data can obtain rich observation information, which is conducive to forest height inversion. In 2003, Cloude and Papathanassiou [
10,
19,
28] extended the traditional single-baseline three-phase algorithm to a dual-baseline configuration, which solved the limitation of the assumption of a zero amplitude ratio of the ground-to-volume scattering for the body-scattering-dominated polarization approach. However, this method can only consider dual-baseline configurations and is difficult to generalize to other baseline conditions. Meanwhile, the dual-baseline algorithm requires that the two baselines involved in the forest height inversion have certain differences to ensure the accuracy of the forest height inversion [
28]. On the other hand, some researchers have focused on selecting the “optimal” baseline among multi-baseline data for forest parameter inversion [
23,
28,
29,
30,
31] in methods such as the optimal elevation accuracy method [
29], which judges the quality of the baselines according to the standard deviation of the interferometric phase of the body-scattering-dominant polarization mode. The baseline with the smallest elevation accuracy standard deviation is then selected for the forest parameter inversion. In addition, based on the theory of machine learning, it is also possible to select the optimal observation geometry for the forest parameter inversion among multiple baselines [
30,
32,
33]. However, the baseline selection criterion considers only a single factor, and it is difficult to determine a baseline selection method that is applicable to all experimental regions, which makes it difficult to map forest height on a large scale. In addition to the above research work, Zhu et al. [
6,
34,
35,
36,
37,
38] explained forest height inversion based on the RVoG model framework from the perspective of measurement adjustment; however, for this method, there are too many unknown parameters, it is difficult to solve, and it fails to take into account optimal baseline selection and irrational baseline exclusion, so it has limited practical application.
In this paper, aiming at the above problems, we propose a multi-baseline forest height estimation method combining analytic and geometric expression of the RVoG model in order to ensure that the rich observation data in the multi-baseline data can be fully utilized to solve the problem of insufficient observation information in the parameter solving. At the same time, the proposed method reduces the uncertainty of the accuracy of the forest height inversion results brought by baselines that are greatly affected by noise and geometrically non-optimal baselines. The specific process is as follows: (1) According to the method of optimal geometrical structure of the coherence region shape, an optimal baseline in the multi-baseline dataset is selected as a reference baseline and all the other baselines are considered to be geometrically constrained baselines. (2) For the reference baseline, a functional model is constructed based on analytic expression of the RVoG model, considering the complex coherence coefficients as observations, and the vegetation biophysical parameters, such as the forest height and extinction coefficient, as the parameters to be solved. (3) For the geometrically constrained baselines of the PolInSAR observations, straight-line fitting is performed based on the geometric expression of the RVoG model, and then the geometric constraints are constructed. The sum of squares of the distances from the expected value of the pure body decoherence coefficients to the fitted straight line is minimized so as to make them participate in the forest height inversion at the same time. Finally, the algorithm proposed in this paper was experimentally validated using PolInSAR data covering the Krycklan coniferous forest experimental area and the Mabounie tropical rainforest experimental area.
The rest of the paper is organized as follows:
Section 2 describes the theoretical foundation of this paper.
Section 3 explains the research methodology, including the optimal baseline selection method based on the geometric structure of the coherence region shape and the constraint construction method based on geometric expression of the RVoG model.
Section 4 presents the experimental results.
Section 5 discusses the experimental results obtained in the coniferous forest region as well as the limitations and future studies.
Section 6 concludes and summarizes the current work.
3. The Proposed Multi-Baseline Forest Height Estimation Method
When forest height inversion is performed based on multi-baseline PolInSAR data, the observation information is richer, which is favorable for forest height inversion. At the same time, the various error factors in the acquisition process of multi-baseline SAR data will lead to uncertainty in the forest height inversion. As shown in
Figure 1c, the elliptical coherence region shape
s of the different interferometric baselines has a different eccentricity, which means that the different interferometric baselines are affected by different factors, such as noise; the baselines that are more affected by noise can lead to lower accuracy or even failure of the forest height inversion. In addition, although the multi-baseline PolInSAR configuration can observe the forest scene from multiple angles, an inappropriate observation geometry will also bring about forest height inversion bias. Careful selection of the observation geometry is thus required. Therefore, the existing multi-baseline algorithms mainly focus on baseline selection, and, for a single pixel, this is still essentially single-baseline forest height inversion, which fails to utilize the rich multi-baseline observation data. On the other hand, although the dual-baseline algorithm solves the problem regarding insufficient observation information in forest height inversion, it is difficult to extend it to multi-baseline configurations.
In view of this, we built on the idea of the dual-baseline forest height inversion algorithm proposed by Cloude, as described in
Section 2.4, and extended it to the field of multi-baseline forest height inversion, as shown as follows in
Figure 4:
Specifically, in the proposed method, we combine optimal baseline selection with the utilization of multi-baseline observation information. Firstly, the optimal baseline (taking into account the observation quality and interference geometry) is selected and the inverted forest height is expressed by the RVoG model function. For the other baselines which are slightly more affected by noise, as well as the geometrically non-optimal baselines, the observation information of these baselines is transformed into a geometric constraint condition according to the geometric expression of the RVoG model. The algorithm proposed in this paper can not only ensure that the rich observation data in the multi-baseline data can be fully utilized to solve the problem of insufficient observation information in the parameter solving but can also reduce the uncertainty in the accuracy of the forest height inversion results brought by baselines that are greatly affected by noise and the geometrically non-optimal baselines, as shown in the following section.
3.1. Optimal Baseline Selection Based on Coherence Region Shape Geometry
According to
Section 2.2, we here define the shape of the coherence region as ellipsoid-like, as in
Figure 5. The geometry of the ellipse can then be used as a mixed criterion for the quality of the observed data of this baseline as well as the merit of the observation geometry. That is, for the reference baseline selection criterion, the larger the ellipse eccentricity, the higher the ellipse linearity and the higher the accuracy of the parameter estimation; conversely, the smaller the eccentricity of the ellipse, the lower the ellipse linearity and the lower the accuracy of the parameter inversion. The baseline with the largest elliptical eccentricity can be used as the reference baseline [
27]. Specifically, the reference baseline is selected as follows:
Equation (13) can be obtained from the polarization interference matrix
T6, as follows:
where
represents the conjugate transpose of the matrix and
are matrices of
. The new complex matrix
can be redefined according to
, as shown as follows in Equation (14):
The Schur decomposition of the complex matrix
A yields the following Equation (15):
From the Schur decomposition, it can be seen that the eigenvalues in matrix
R can be arbitrarily combined into three groups of elliptic parameters. In the proposed approach, this group of parameters
is selected for the elliptical coherence region shape of the centroid. The elliptical coherence region shape of the length of the long axis
a, the length of the short axis
b, as well as the elliptical coherence region shape of the centroid rate of the Eccentricity (
Ecc) can be calculated from Equations (16) and (17), as follows:
When
interferometric baselines exist, the
Ecc of the elliptical coherence region shape corresponding to each baseline is calculated separately. The reference baseline selection criterion can then be expressed by Equation (18), where
stands for taking the maximum value of the matrix.
3.2. Constraint Construction Based on Geometric Expression of the RVoG Model
Based on the multi-baseline PolInSAR configuration, multiple fitted straight lines can be observed in the unit circle of the complex plane, as shown in
Figure 5a. Theoretically, the “pure body decoherence coefficient” points related to vegetation biophysical parameters such as forest height and the extinction coefficient should be located on the fitted straight line, where the distance from the pure body decoherence point to the fitted straight line is 0. However, due to the influence of various factors such as observation noise, data processing, etc., the pure body decoherence coefficient points in the actual situation may deviate from the fitted straight line. As shown in
Figure 5b, there exists a certain perpendicular distance between the pure body decoherence coefficients corresponding to each geometrically constrained baseline and the corresponding fitted straight line
d. Accordingly, in the proposed approach, we construct a restriction based on the geometric expression of the RVoG model as follows: within the unit circle of the complex plane, the sum of the squared distances between the pure body decoherence coefficients corresponding to each geometrically constrained baseline and the corresponding fitted straight line is minimized.
Assuming that
interferometric baselines exist, the geometric constraint baseline is
. Also, assuming that the equation of the geometric constraint baseline fitting a straight line in the unit circle of the complex plane can be expressed as
, the specifics of the geometric constraints are as shown as follows in Equations (19)–(21):
where the subscript of
,
,
represents the geometric constraint baseline labeling;
represents the
th geometrically constrained baseline;
represents the perpendicular distance from the pure body decoherence coefficient corresponding to the
th geometrically constrained baseline to the corresponding fitted straight line; Re( ): represents taking the real part of the complex number; Im( ): represents taking the imaginary part of the complex number;
represents the slope of the geometrically constrained baseline corresponding to the equation of the fitted straight line in the unit circle of the complex plane, which can be obtained by fitting a straight line in the first step of the three-stage algorithm [
19]; and
represents the intercept of the geometrically constrained baseline corresponding to the equation of the fitted straight line in the unit circle of the complex plane, which can be obtained by fitting a straight line in the first step of the three-stage algorithm [
19].
Note that the pure body decoherence coefficient points cannot be obtained directly from observations. Therefore, in the proposed approach, we adopt the approach of obtaining the mathematical expectation value instead of the true value to construct the constraints based on Equation (19). Since the pure body decoherence coefficient is related to vegetation biophysical parameters such as forest height and extinction coefficient, the above two parameters are unknown. In the proposed approach, we choose to calculate the expected value of the pure body decoherence coefficient using Equation (22) and use it to construct the above constraints instead of the theoretical value of the pure body decoherence coefficient.
where
represents the estimate of the pure body decoherence coefficient coming from the geometrically constrained baseline of
;
represents the estimate of the unknown parameter of forest height; and
represents the estimate of the unknown parameter of the extinction coefficient. Accordingly, Equation (19) is converted to Equation (23), as follows:
According to Equations (22) and (23), the constraints of the PolInSAR forest height inversion algorithm proposed in this paper are as shown as follows in Equation (24):
3.3. Multi-Baseline PolInSAR Forest Height Inversion with Additional RVoG Model Geometric Constraints
3.3.1. Multi-Baseline Function Expression for RVoG Modeling
The RVoG model in multi-baseline mode with the PolInSAR configuration can be expressed as shown in Equation (25), as follows:
where
represents the normalized parameter estimate of the amplitude ratio of the ground-to-volume scattering corresponding to the reference baseline. From Equation (25), it can be seen that the surface phase
parameter is the only baseline-correlated parameter. That is, based on the multi-baseline PolInSAR framework, the differences existing in the interferometric geometries of the different baselines lead to the baseline correlation of the surface phase. On the other hand, the surface elevation does not change with the baseline change, so the baseline-correlated surface phase can be considered to be converted to the unique baseline-uncorrelated surface elevation through the phase-to-elevation conversion relationship, as shown in
Figure 6. In the multi-baseline configuration, assuming that there are Q interferometric baselines, the Q surface phase
parameters can be reduced to one surface elevation parameter
using Equation (26). Based on the above, Equation (25) can be rewritten as Equation (27).
3.3.2. Multi-Baseline Forest Height Inversion Functional Model for Joint RVoG Model Analysis and Geometric Expression
Under the multi-baseline PolInSAR configuration, we first select one of the baselines as the reference baseline, according to the method described in
Section 3.1, and then perform the forest height inversion calculation based on the expression of the RVoG function. The observation data of all the other baselines are converted into constraints, as described in
Section 3.2, and then the functional model of the PolInSAR forest height inversion algorithm, taking into account the constraints of the multiple baselines, can be expressed as shown in Equation (28), as follows:
When the number of observations in Equation (28) is greater than the number of unknown parameters, joint RVoG model analysis and geometric expression of the multi-baseline forest height inversion can be performed.
In the proposed approach, based on the idea of measurement adjustment, the forest height and extinction coefficient are regarded as indirect leveling problems with constraints in the complex domain. Using the functional model, the proposed approach adopts the leveling criterion of minimizing the sum of squares of the modes of the complex residual values to construct the complex least squares leveling algorithm for parameter solving. In addition, due to the difficulty of linearizing the RVoG model, the proposed approach still adopts the nonlinear least squares iterative method for parameter solving.
3.3.3. Forest Height Estimation Using Nonlinear Least Squares Optimization
Under the multi-baseline configuration, assuming that there are
Q baselines and
P polarization modes for each baseline, a nonlinear least-square optimization algorithm is employed to solve the parameters. The corresponding objective function is shown as follows in Equation (29):
The initial values of the surface elevation , extinction coefficient , and forest height are calculated as follows.
Based on the RVoG model, the proposed approach adopts a three-stage algorithm and combines it with the reference baseline selection method described in
Section 3.1 to calculate the initial values of surface elevation, extinction coefficient, and forest height. The specific steps are as follows.
Firstly, the reference baseline is selected, based on the multi-baseline PolInSAR framework. The Ecc of the elliptical coherence region shape corresponding to each pixel of each baseline is calculated according to each equation from Equation (19) to Equation (23). Then, for each pixel, Equation (24) is used to select the reference baseline corresponding to that pixel.
Secondly, the initial value of the surface elevation
, the initial value of the extinction coefficient
, and the initial value of the forest height
corresponding to each pixel are calculated using the method of calculating the initial value of the unknown parameters described in
Section 3.3.2.
The initial value of the normalization parameter of the amplitude ratio of the ground-to-volume scattering is set to 0, i.e., .
According to experience, the “forest height” obtained by the three-stage algorithm generally lies between one-half of the real forest height and the top of the forest canopy. Therefore, in the proposed approach, we set the forest height iteration range to .
5. Discussion
5.1. Forest Height Inversion in the Krycklan Coniferous Forest Region
5.1.1. Experimental Data and Preprocessing in the Krycklan
A further experimental validation was carried out using four PolInSAR images from the Krycklan experimental area in Sweden; these images constitute three interferometric pairs. The main parameters of the E-SAR P-band airborne SAR system are given in
Table 2. The basic details of the four PolInSAR images and their constituent interferometric baseline parameters are given in
Table 3.
Three interferometric baselines were formed, with the image labeled 1 acting as the main image and the images labeled 2, 3, and 4 acting as the auxiliary images. The free SAR data processing software PolSARpro 4.2.0 from the European Space Agency was used to perform the data preprocessing for each interferometric baseline, which included SAR image alignment, flat-earth phase removal, multiview processing, and interferometric processing. The scale of the multiview processing was 2 × 1 (azimuthal × distance direction); the coherence estimation window in the interferometric processing step was set to 11 × 11; and the polarization complex coherence coefficients corresponding to each interferometric baseline were estimated using the following phase diversity (PD) coherence optimization algorithm [
10]:
,
,
,
,
,
, with the superscripts 1, 2, and 3 representing the different interferometric baselines and PDHigh and PDLow representing the polarization mode obtained by PD coherence optimization.
After obtaining the polarized complex coherence coefficients, these complex observations could be used to test the effectiveness of the multi-baseline PolInSAR forest height inversion algorithm proposed in this paper. The traditional multi-baseline forest height inversion algorithm based on the geometrically optimal structure of the coherence region shape was used for comparison, and the LiDAR forest height products covering the experimental area were used as ground-truth data for the cross-validation.
5.1.2. Forest Height Inversion Results and Analysis in the Krycklan
The forest height inversion results based on the traditional multi-baseline algorithm and the algorithm proposed in this paper are shown in
Figure 9a and
Figure 9b, respectively. In addition,
Figure 9c shows the LiDAR forest height products in the Krycklan experimental area based on the LiDAR products obtained during the BioSAR 2008 project.
Firstly, a qualitative analysis was carried out. Intuitively, the trend of the forest height results in the Krycklan experimental area is basically the same between the algorithm proposed in this paper and the traditional multi-baseline algorithm. Overestimation of the forest height can be seen on the right side of the forest height inversion results. The main reason for this phenomenon is that, due to the airborne SAR data used in this study, the incidence angle at the far end is larger. This corresponds to a smaller vertical effective wave number, which is insensitive to the change in forest height and has a weaker ability in regard to resisting the noise, which ultimately leads to a certain degree of overestimation of the forest height in the forest height inversion.
We also quantitatively analyzed the forest heights inverted by the two algorithms with the LiDAR forest height products. We selected 1318 forest sample plots in the study area and calculated the average forest height value corresponding to each sample plot for quantitative verification of the accuracy of the proposed algorithm in forest height inversion.
Figure 10 shows the cross-validation plots of the forest heights based on the inversion results of the two algorithms with the LiDAR forest height products. According to
Figure 10a,b, the RMSE of the inversion results of the algorithm proposed in this paper and the traditional multi-baseline algorithm are 4.28 m and 5.15 m, respectively. The accuracy of the forest height inversion results of the algorithm proposed in this paper represents an improvement of about 17%.
The reason for the superior accuracy in forest height inversion achieved via the algorithm proposed in this paper is that the traditional multi-baseline algorithm first selects an “optimal” baseline for any pixel in the interferogram and then carries out the forest height inversion based on the single-baseline observation data provided by the “optimal” baseline. Due to the lack of observation information, it is necessary to carry out the forest height inversion based on the assumption of a zero amplitude ratio for the ground-to-volume scattering [
19]. Since we adopted P-band SAR data for the real-data experiments, which have a strong penetrative ability, even though the body-scattering-dominant polarization mode (the PDHigh polarization mode adopted in this study) still contains strong ground-scattering signals, the assumption of a zero amplitude ratio for the ground-to-volume scattering is no longer valid, resulting in bias in the forest height inversion.
The algorithm proposed in this paper does not need to assume that the amplitude ratio of the ground-to-volume scattering of a certain polarization channel existing in the reference baseline is zero; however, at the same time, it takes into account the utilization of the rich observation information under the multi-baseline PolInSAR configuration. This approach solves the problem of insufficient observation information for forest height inversion based on the RVoG model and improves the accuracy of the forest height inversion.
Our results are corroborated in the studies of Xie et al. [
41] and Zhao et al. [
42]. In the study by Xie et al. a multi-baseline decorrelation simultaneously accounts for the ground-scattering contribution, the spatial baseline configuration, and the temporal decorrelation (the main part of the non-volume decorrelation) in a multi-baseline PolInSAR forest canopy height inversion method. The Krycklan region was used as the study area and the accurate spatial distribution of the forest canopy height was obtained. In the study of Zhao et al. they proposed an RVoG model incorporating a hierarchical Bayesian framework to realize the inversion of forest canopy height. In their study, experiments were conducted based on the Krycklan region using P-band data and L-band data, respectively, to obtain the spatial distribution of forest canopy height in the region. In our study, the spatial distribution of forest canopy height in this study area obtained in this paper is similar to that of the two previous researchers, and, compared with their study, we considered the utilization of rich observation information under multi-baseline PolInSAR configuration, thus obtaining a more accurate spatial distribution map of forest canopy height in the Krycklan region.
5.2. Limitations and Future Research
In this paper, we have proposed a multi-baseline forest height estimation method combining the analytic and geometric expression of the RVoG model that makes full use of the rich observation data in the multi-baseline data to carry out forest height inversion. The method makes full use of the rich observation data in the multi-baseline data for forest height inversion and, at the same time, reduces the uncertainty of the accuracy of the forest height inversion results caused by baselines affected by noise and the geometrically non-optimal baselines. However, the method still has some limitations.
In the previously reported studies, research on the selection of optimal baselines has received extensive attention, with methods based on the optimal elevation accuracy, the method of optimal geometric structure, artificial intelligence methods, and other algorithms being examined. However, most of the reported studies on optimal baseline selection have focused on a single-baseline selection criterion, which is more suitable for specific forest height scenarios. When the selected baseline is applied to areas with large differences in the natural environment, its universality needs to be improved. For the proposed algorithm, it only uses the optimal baseline selection method based on the geometric structure of the coherence region shape. In future research, we will focus on the study of an adaptive optimal baseline selection method in order to realize optimal baseline selection for complex natural scenarios.
On the other hand, in the forest height inversion task under multi-baseline conditions, there may be multiple baselines that are suitable for forest height inversion. In this condition, increasing the multi-baseline observations helps to improve the robustness of the forest height inversion accuracy. For the algorithm proposed in this paper, the optimal baseline selection method based on only the geometric structure of the coherence region shape obtains a single optimal baseline. It ignores the other baselines suitable for forest height inversion and directly uses them as constrained baselines, which, to a certain extent, undermines the richness of the observations brought by the multi-baseline PolInSAR configuration. Therefore, how to design a baseline selection method that selects multiple preferred baselines as computational baselines and uses the remaining suboptimal baselines as constraining baselines will be of concern in our future research.
Furthermore, the accuracy and reliability of forest height inversion based on the PolInSAR technique are closely related to the stability of the forest scatterers and the length of the interferometric baseline. For the dual-station mode, it is not affected by temporal decoherence, but the interferometric baseline selection methodology considers only a single factor, with which it is difficult to cope with the spatial complexity of forest height. For the heavy-track mode, it has the advantage of using multiple interferometric baseline lengths to observe the same target at multiple scales, but the effect of temporal decoherence is unavoidable. The hybrid mode of dual-station + heavy-track not only makes up for the respective disadvantages of each of these modes in forest height inversion but also enhances the accuracy and reliability of PolInSAR-based forest height inversion. It can also improve the update rate for forest height mapping based on PolInSAR technology. For the algorithm proposed in this paper, how to take into account the advantages of the dual-station mode without temporal decoherence and the heavy-track mode with various interferometric baselines is worthy of attention. Furthermore, how to construct a forest height inversion algorithm integrating dual-station + heavy-track PolInSAR modes under the conditions of the different morphological features of the forest, so as to improve the accuracy and robustness of the inversion results, will be one of the goals of our future research.