1. Introduction
The leaf area index (LAI) is not only an essential parameter to characterize the structure of the vegetation canopy [
1] but also an essential input parameter for land-atmosphere interaction models [
2], climate models [
3], water cycle models [
4], carbon cycle models [
5] and other models.
Many methods based on empirical models or physical models [
6,
7,
8,
9] have been developed to retrieve LAI values from satellite observations. Empirical models retrieve LAI based on the statistical relationship between the spectral characteristics of vegetation canopy and LAI. Empirical methods are simple and easy. However, empirical methods are restricted by saturation of the vegetation index, which directly affects the accuracy of the retrieved LAI values. Physical models establish the relationship between LAI parameter of vegetation canopy and reflectance by using physical theory. The inversion method based on the physical model has certain generality, but nonunique solutions and ill-conditioned problems occur in physical methods [
10]. Fast-growing machine learning methods provide a new way to retrieve LAI values from satellite observations. Machine learning not only simulates and simplifies the physical model but also effectively establishes the relationship between satellite observations and surface parameters [
11].
At present, several global LAI products have been generated from remote sensing data based on machine learning algorithms. The Carbon cYcle and Change in Land Observational Products from an Ensemble of Satellites (CYCLOPES) LAI product was generated from SPOT-VEGETATION data by a dedicated back-propagation neural network (BPNN) that depends on the data simulated by the SAIL + PROPSPECT model [
12]. The first versions of Geoland2 (GEOV1) LAI products were also generated by a BPNN that was trained with the LAI values produced by fusing the Moderate Resolution Imaging Spectroradiometer (MODIS) and CYCLOPES products, and the SPOT-VEGETATION or PROBA-V reflectances over the second version of Benchmark Land Multisite Analysis and Intercomparison of Products (BELMANIP) sites [
13]. The Global Land Surface Satellite (GLASS) LAI product was generated using general regression neural networks (GRNNs) trained with the LAI values produced by fusing the MODIS and CYCLOPES products and the MODIS or Advanced Very High-Resolution Radiometer (AVHRR) surface reflectances over the BELMANIP sites [
14,
15]. In addition, neural network-based methods were also used to generate the National Centers for Environmental Information (NCEI) LAI product [
16] and the third-generation Global Inventory Monitoring and Modeling System (GIMMS3g) LAI product [
17] from AVHRR reflectances.
Nevertheless, the above machine learning algorithms have different ways to construct training datasets. Some methods make use of simulation data of the physical models to establish training datasets, and the other methods utilize existing LAI products and the corresponding sensor data to build training datasets. The inconsistency of these training datasets constructed in different ways directly leads to the discrepancies between the existing global LAI products.
Furthermore, the training datasets for the neural networks are sensor specific. It is difficult to obtain good results when the trained neural network for certain sensor data is directly applied to retrieve LAI values from other sensor data. This is mainly due to the discrepancy existing in different sensor data. The discrepancy is mainly caused by the inconsistent physical conditions during observation, including sensor performance, observation geometry, and atmosphere and spectral response function [
18]. In this case, it suffers from retrieving LAI values from different satellite observations using the existing training dataset precisely. To address this problem, Zhong [
19] proposed a spectral normalization method that is canopy or atmospheric radiative transfer model-based to decrease discrepancies between different satellite observations. The spectral normalization method considers the effects of observation geometry, atmospheric conditions and differences in spectral response function for sensors but does not consider the impacts of clouds and snow. It is difficult for physical models to consider all factors for various sensor observations simultaneously.
In addition, it is sometimes difficult to obtain high-quality training datasets. As we know, the highest spatial resolution of LAI products released thus far is 300 m. Thus, it is difficult to construct training datasets with spatial resolutions higher than 300 m using the existing LAI and reflectance products. In another case, it is difficult to construct accurate and abundant training datasets when there is error in the geographic information of reflectance products, such as the FengYun reflectance dataset.
Recently, transfer learning (TL) has gradually become a hot topic. TL is capable of applying the knowledge learned under the existing environment (source domain) to solve different but related new tasks (target domain) [
20]. According to the transfer situation, TL can be basically classified into three parts: inductive, transductive and unsupervised TL methods. Transductive TL has a significant embranchment, namely, domain adaptation (DA), which is designed to transfer the existing knowledge to new tasks with few or no labels. The current DA methods roughly contain the following three classifications [
21]: instance-based, feature-based and model-based DA methods.
TL methods have been widely adopted in pedestrian re-recognition [
22], batteries of state-of-health estimation [
23], human activity recognition [
24] and other fields. In these areas, the multiple applications mentioned above have proven that TL is an effective approach. Recently, TL has been employed to remote sensing image classification. Xia et al. [
25] addressed unsupervised DA based on an ensemble strategy for hyperspectral image classification. Matasci et al. [
26] used feature-based domain adaptation for land-cover classification, and investigation showed that the knowledge learned from the available images with ground truth data can be transferred to interesting images, and the accuracy of classification by using transfer component analysis (TCA) algorithms to extract features is significantly higher than that of other algorithms.
Although many studies have focused on the application of TL algorithms for classification problems, TL algorithms have rarely been used in regression problems and have not been applied to estimate biophysical parameters from satellite observations. In this paper, we intend to propose an unsupervised DA-based method to retrieve LAI values from the VIIRS surface reflectance dataset based on the training dataset that was constructed from the MODIS surface reflectance dataset. The MODIS and VIIRS surface reflectances were first mapped to the same subspace to calculate the embedded datasets through the TCA algorithm. Then, GRNNs were trained by the embedded dataset drawn from the MODIS surface reflectance dataset and the LAI values produced by fusing the MODIS and CYCLOPES products. Eventually, the trained GRNNs were utilized to estimate LAI values from the embedded data drawn from the VIIRS surface reflectance dataset.
The organization of this paper is as follows.
Section 2 explains the method used for LAI retrieval, including the TCA and GRNNs, and provides an introduction to the experimental data used in this research. What is presented in
Section 3 are the comparisons of the retrieved VIIRS LAI values with the TCA method and without the TCA method, the VIIRS and MODIS LAI product and the reference LAI values acquired from high-resolution reference maps. In addition, discussions and conclusions drawn are in
Section 4 and
Section 5, respectively.
3. Results
The retrieval method based on an unsupervised DA was applied to retrieve the VIIRS LAI values in this paper. The performance of this LAI retrieval method was evaluated by comparing the retrieved LAI values with the VIIRS and MODIS LAI products with a 500 m spatial resolution and the LAI values acquired from high-resolution reference maps over the selected sites. To reasonably evaluate the retrieval results, the spatial resolution of the VIIRS and MODIS LAI products were upsampled to 1 km using the nearest neighbor resampling method. For comparison, the GRNNs trained with the MODIS surface reflectances and the LAI values produced by fusing the MODIS and CYCLOPES products were directly used to derive LAI values from the VIIRS surface reflectances without TCA. The inversion results without the TCA method are denoted by LAI-GRNN.
Figure 3 shows the temporal trajectories of the LAI-TCA and the LAI-GRNN for the SouthWest_1 site. By the MODIS land-cover classification data, the biome class of this site is the broadleaf crop. For comparison, the VIIRS and MODIS LAI values and their NDVI values calculated from the raw reflectances are also shown in
Figure 3. The VIIRS and MODIS LAI values marked by the circles in
Figure 3 are retrieved by the main algorithm (MA). The trajectories of the LAI-TCA and the LAI-GRNN display a similar seasonality to the trajectories of the VIIRS and MODIS LAI values. Moreover, the time series of the LAI-TCA and that of the VIIRS NDVI values have consistent seasonal variation. The discrepancy between the temporal trend of the LAI-TCA and that of the LAI-GRNN is slight. However, the LAI-TCA is universally higher than the LAI-GRNN. During the nongrowing season, the LAI-TCA and LAI-GRNN are agree well with the VIIRS LAI values. Nevertheless, during the growing season, the LAI-TCA and LAI-GRNN are obviously lower than the VIIRS LAI values and the quality of the VIIRS LAI values retrieved by the backup algorithm is poor. These LAI values shown in
Figure 3 are all evidently smaller than the reference LAI value. However, the estimated LAI-TCA is more approximate to the reference LAI value than the LAI-GRNN.
Figure 4 shows the LAI-TCA in a 5 km
5 km SouthWest_1 site areas on Day 230, 2013.
Figure 4a,b show that the images of the LAI-TCA and the LAI-GRNN have very similar distribution trends. The map of the LAI-TCA has a similar variation pattern to the reference LAI map in this region, especially in the northwestern corner and southwestern corner. The LAI-TCA values are generally higher than the VIIRS and MODIS LAI values. Nevertheless, the LAI-TCA values are highly consistent with the reference LAI values compared with the VIIRS and MODIS LAI values. In addition, there are obvious differences between these LAI images in the middle where the LAI-TCA and LAI-GRNN values and the VIIRS and MODIS LAI values are underestimated comparing to the reference LAI values.
Figure 4f–i demonstrate the statistical distributions of discrepancies between the LAI-TCA, the LAI-GRNN, the VIIRS and MODIS LAI values and the reference LAI values at the SouthWest_1 site. The standard deviation of the discrepancies between the LAI-TCA and the reference LAI values is 0.31, while that of the discrepancies between the LAI-GRNN, the VIIRS and MODIS LAI values and the reference LAI values are 0.34, 0.49 and 0.43, respectively. This indicates that the discrepancies between the LAI-TCA and the reference LAI values have a more concentrated distribution. The mean value of the discrepancies between the LAI-TCA and the reference LAI values are also lower than that of the discrepancies between the LAI-GRNN, the VIIRS and MODIS LAI values and the reference LAI values.
To a certain extent, these phenomena indicate that the LAI-TCA are more consistent with the reference LAI values than the LAI-GRNN and the VIIRS and MODIS LAI values, although the LAI-TCA are slightly underestimated at this site.
Figure 5 depicts the temporal trajectories of the LAI-TCA and the LAI-GRNN for the Collelongo site with the deciduous broadleaf forest biome class. During the growing season, these trajectories all exhibited fluctuations and fluctuations of the LAI-GRNN trajectory, the VIIRS LAI trajectory and the MODIS LAI trajectory were stronger than those of the LAI-TCA trajectory. During the nongrowing season, the trends of these trajectories were more consistent. On Julian Day 329, the VIIRS and MODIS LAI values decrease markedly because of the influence of the cloud-contaminated VIIRS and MODIS reflectances. However, the LAI-TCA and LAI-GRNN values have no fluctuation on this Julian day. Moreover, the LAI-TCA outperforms the LAI-GRNN and the VIIRS and MODIS LAI values in comparison with the reference LAI value at this site.
Figure 6 shows images of the LAI-TCA, the LAI-GRNN, and the VIIRS and MODIS LAI values at the Collelongo site on Day 266, 2015. By analyzing these images, it can be observed that from the southwestern corner to the northeastern corner, the LAI-TCA agrees with the reference LAI values. In the east and west, the LAI-TCA has a slight underestimation comparing to the reference LAI values. The distributions of the LAI-TCA and LAI-GRNN are generally uniform in this area, except for the center where the LAI-GRNN are usually higher than the LAI-TCA. However, there are more LAI values that are apparently underestimated in the VIIRS LAI map, particularly when the reference LAI values are relatively low.
Figure 6f–i demonstrate the statistical distributions of the discrepancy between the LAI-TCA, the LAI-GRNN, the VIIRS and MODIS LAI values and the reference LAI values at the Collelongo site. Discrepancies between the LAI-TCA and reference LAI values are more concentrated with a smaller standard deviation of 0.62 than discrepancies between the VIIRS LAI values and reference LAI values with a standard deviation of 1.43 and discrepancies between the MODIS LAI values and reference LAI values with a standard deviation of 0.98. The range of discrepancies between LAI-TCA and reference LAI is −1.0~1.37, whereas the range of discrepancies between VIIRS LAI and reference LAI is −2.38~2.01 and the range of discrepancies between MODIS LAI and reference LAI is −2.2~1.31. Furthermore, the mean value of the discrepancies between the LAI-TCA and the reference LAI values are lowest. Thus, discrepancies between the estimated LAI-TCA map and reference LAI map are smaller.
The LAI-TCA, the LAI-GRNN, and the VIIRS and MODIS LAI temporal trajectories at the Capitanata site with the grasses/cereal crop biome class are shown in
Figure 7. From Day 129 to Day 153, the LAI-TCA and LAI-GRNN values are significantly higher than the MODIS LAI values. However, the VIIRS and MODIS NDVI values calculated from the raw reflectances show no significant difference in this period, and the LAI-TCA and LAI-GRNN are retrieved from the preprocessed VIIRS surface reflectance dataset. Thus, this discrepancy may be caused by the discrepancies between the preprocessed VIIRS and MODIS surface reflectances. In the other periods, the LAI-TCA has a similar seasonality to the VIIRS and MODIS LAI values. Moreover, the LAI-TCA temporal trajectory has a smoother trend than the VIIRS LAI temporal trajectory. At this site, the LAI-TCA is more approximate to the reference LAI value than the LAI-GRNN.
Figure 8 shows images of the LAI-TCA, the LAI-GRNN, and the VIIRS and MODIS LAI values at the Capitanata site on Day 113, 2015. It can be clearly seen from these images that for the majority of pixels, the VIIRS LAI values are lower than the reference LAI values. In addition, the LAI-GRNN values are slightly lower than the reference LAI values in the north. Surprisingly, the spatial distribution of the LAI-TCA is relatively consistent with the reference LAI values.
Figure 8f–i demonstrate the statistical distributions of the discrepancy between the LAI-TCA, the LAI-GRNN, the VIIRS and MODIS LAI values and the reference LAI values at the Capitanata site. Compared with the discrepancy between the LAI-GRNN, the VIIRS and MODIS LAI values and the reference LAI values, the discrepancy between the LAI-TCA and reference LAI values has a lower standard deviation (0.51). Moreover, the distribution of the discrepancies between the LAI-TCA and reference LAI values is more concentrated than that of the discrepancies between the LAI-GRNN, the VIIRS and MODIS LAI values and the reference LAI values.
Figure 9 displays the temporal trajectories of the LAI-TCA and the LAI-GRNN for the Peyrousse site with the broadleaf crop biome class. Good temporal consistency is realized among these temporal trajectories during the nongrowing season. Nevertheless, the LAI-TCA and the LAI-GRNN values are slightly larger than the MODIS LAI values throughout almost the entire year, except for the period from Day 209 to Day 241. On Julian Days 81 and 113, the VIIRS LAI values are obviously affected by cloud contamination. However, the LAI-TCA and LAI-GRNN values have no mutations on these two Julian days. This is because the LAI-TCA and LAI-GRNN were retrieved from the elimination-cloud-contaminated VIIRS surface reflectance dataset. At this site, the LAI-TCA is nearer to the reference LAI value comparing to the LAI-GRNN.
Figure 10 shows images of the LAI-TCA, the LAI-GRNN, and the VIIRS and MODIS LAI values at the Peyrousse site on Day 174, 2015. The spatial distributions of these images are approximate and show a decreasing trend from west to east. The values of these images have a great difference in the southwestern corner where the LAI-TCA and TCA-GRNNs and the VIIRS and MODIS LAI values all exceed the reference LAI values.
Figure 10f–i demonstrate the statistical distributions of the discrepancy between the LAI-TCA, the LAI-GRNN, the VIIRS and MODIS LAI values and the reference LAI values at the Peyrousse site. The discrepancy between the LAI-TCA and reference LAI values has a lower standard deviation (0.36) than the discrepancy between the LAI-GRNN, the VIIRS and MODIS LAI values and the reference LAI values. Moreover, compared with discrepancies between the LAI-GRNN, the VIIRS and MODIS LAI values and the reference LAI values, discrepancies between the LAI-TCA and the reference LAI values have a smaller range. Thus, the LAI-TCA values are in better agreement with the reference LAI values than the VIIRS and MODIS LAI values.
Figure 11 displays the temporal trajectories of the LAI-TCA and the LAI-GRNN for the 25de_Mayo_Shurb site with the shrub biome type. These trajectories all exhibit limited seasonality. From Day 41 to Day 81, the discrepancy between the retrieved LAI and MODIS LAI trajectories is consistent with the discrepancy between the VIIRS and MODIS NDVI trajectories. From Day 281 to Day 361, the MODIS NDVI values are lower than the VIIRS NDVI values, but the LAI-TCA are lower than the MODIS LAI, which demonstrates that the LAI-TCA are easily affected by the reflectances of bands except the red band and near-infrared band at this site with sparse vegetation.
Figure 12 shows images of the LAI-TCA, the LAI-GRNN, and the VIIRS and MODIS LAI values at the 25de_Mayo_Shurb site on Day 40, 2014. The LAI values of these images show similar spatial patterns. However, in the west, there are some observable discrepancies.
Figure 12f–i demonstrate the statistical distributions of the discrepancy between the LAI-TCA, the LAI-GRNN, the VIIRS and MODIS LAI values and the reference LAI values at the 25de_Mayo_Shurb site. The range of discrepancies between LAI-TCA and reference LAI is −0.36~0.75, but the range of discrepancies between LAI-GRNN and reference LAI is −0.72~0.62, the range of discrepancies between VIIRS LAI and reference LAI is −0.49~1.09 and the range of discrepancies between MODIS LAI and reference LAI is −1.18~0.33. Moreover, the standard deviation (0.25) of the discrepancy between LAI-TCA and reference LAI is lower than that of the discrepancy between LAI-GRNN, VIIRS and MODIS LAI and the reference LAI. Thus, the LAI-TCA has fewer discrepancies with the reference LAI.
Figure 13 displays the temporal trajectories of the LAI-TCA and the LAI-GRNN for the Albufera site with the grasses/cereal crop biome type. These trajectories all show normal seasonal changes. During the growing season, the peak values of the LAI-TCA, LAI-GRNN and VIIRS LAI temporal trajectories are apparently lower than that of the MODI LAI temporal trajectory. On Day 217, the LAI-TCA, the LAI-GRNN and the VIIRS LAI that from the VIIRS surface reflectance are lower than the reference LAI value. However, comparing to the LAI-GRNN, the LAI-TCA is nearer to the reference LAI value.
Figure 14 shows images of the LAI-TCA, the LAI-GRNN, and the VIIRS and MODIS LAI values at the Albufera site on Day 219, 2014. At this site, the LAI-TCA and the LAI-GRNN have a small discrepancy. The LAI-TCA and the LAI-GRNN are generally less than the reference LAI values, but the spatial distribution of these values is concentrated as the reference LAI values. However, many VIIRS and MODIS LAI values are higher than the reference LAI values and the spatial distribution of these values is obviously discrete.
Figure 14f–i demonstrate the statistical distributions of the discrepancy between the LAI-TCA, the LAI-GRNN, the VIIRS and MODIS LAI values and the reference LAI values at the Albufera site. The range of discrepancies between LAI-TCA, LAI-GRNN and reference LAI are significantly less than that between VIIRS LAI, MODIS LAI and reference LAI. The standard deviation (0.5) of the discrepancy between LAI-TCA and reference LAI is evidently smaller than that of the discrepancy between VIIRS and MODIS LAI and the reference LAI. The standard deviation of the discrepancy between LAI-TCA, LAI-GRNN and reference LAI is approximate, but the range and mean of discrepancies between LAI-TCA and reference LAI are lower than these of discrepancies between LAI-GRNN and reference LAI. Hence, the LAI-TCA values have a better consistency with the reference LAI values.
Figure 15 demonstrates the scatter plots of the LAI-TCA, the LAI-GRNN, the VIIRS and the MODIS LAI values versus the reference LAI values at all selected sites. The VIIRS and the MODIS LAI values contaminated by cloud have been removed. The slope of the regression line for the LAI-TCA versus the reference LAI values is 0.78. This reveals that the LAI-TCA slightly overestimate the reference LAI values along with low values and underestimate the reference LAI values along with high values. Despite all this, the correlation of the LAI-TCA and LAI reference values (R
2 = 0.88) outperforms that of the LAI-GRNN, the VIIRS LAI product and the MODIS LAI product. In addition, the LAI-TCA (RMSE = 0.68) provide better accuracy than the LAI-GRNN, the VIIRS LAI product and the MODIS LAI product. These results demonstrate that the retrieval results with the TCA method is in a better consistency with the reference LAI values than the LAI-GRNN and the VIIRS and MODIS LAI products.
4. Discussion
The source dataset and target dataset defined in this paper contain the preprocessed MODIS and VIIRS surface reflectance dataset of six bands. Therefore, the maximum dimension of the embedded dataset
m is 6. The impact of different dimensions on the LAI inversion performances must be discussed. By analyzing the mean RMSE and R
2 values of the results for different dimensions over selected sites in
Figure 16, it can be seen that the number of dimensions has a significant influence on the inversion accuracy. When the number of dimensions is 2, it is obvious that the inversion performance is best, with the lowest mean RMSE value of 0.65 and the highest mean R
2 value of 0.54. However, the inversion accuracy apparently decreases when the number of dimensions is larger than 2. This phenomenon, called the “curse of dimensionality”, is due to the significant sparsity of the high-dimensional spatial distribution. This problem can be solved by reducing the number of dimensions on account of redundant information included in feature components [
36]. Therefore, the optimum number of dimensions should be 2 in this paper.
In this study, the results of the LAI-TCA are good in spatial distribution and time series. Meanwhile, the LAI-TCA with the processing of transfer learning is better than the LAI-GRNN without that. In order to improve the accuracy of inversion, the training dataset used to train GRNNs is separated by biome type and day of year. Hence, from
Figure 15, the performance of retrieved LAI is promoted comparing to the VIIRS LAI product and the MODIS LAI product.
The advantage of the LAI retrieved method proposed in this paper is that it can directly use the existing training dataset constructed by the MODIS surface reflectances and use the transfer learning method to obtain LAI values from other sensor data with a good quality. It should be noted that the period of the existing training dataset from 2001 to 2003 is different from that of the VIIRS surface reflectances since 2012, in this study. However, comparing
Figure 15a,b, the accuracy of the retrieved LAI values based on transfer learning can be slightly improved. The difference shown in
Figure 15a,b is sufficient to illustrate that there is a difference in the distribution of MODIS surface reflectances and VIIRS surface reflectances. Thus, when using the existing training dataset to retrieve LAI product from other sensors, it is necessary to transfer knowledge. Moreover, the transfer learning method can realize positive transfer under the context proposed in this paper.
In
Section 3 the retrieved LAI values are cross validated and directly validated. The vegetation type of these selected sites for validation includes different biome types, namely forest, shrub, grass and crop. The MODIS surface reflectances as the source domain and the VIIRS surface reflectances as the target domain in this paper are processed to eliminate the influence of cloud by the same method. In addition, design of the VIIRS was quite similar to the MODIS [
37]. To some extent, this leads to no significant difference between
Figure 15a,b.
The method proposed in this paper is an exploratory experiment of applying transfer learning to parameter inversion. At the same time, it is a knowledge transfer method based on data. In further work, we will use the model-based transfer learning method to retrieve biophysical parameters, so as to build a more convenient, better and faster unified inversion platform. The application of transfer learning in the parameter inversion field is not limited to reducing the difference in data distribution from different sensors. Aiming at the problem that the geographic information deviation of Chinese satellite data affects the retrieval accuracy [
38], transfer learning can be used to improve retrieval accuracy. Furthermore, the transfer learning method can also transfer the reflectance data information with different spatiotemporal scales for inversion of parameter products with high spatiotemporal resolution. Transfer learning methods have great application value and potential, whether used to build a unified training dataset and model for inversion of LAI parameters from different sensors, or to acquire parameter products with high spatiotemporal resolution.