1. Introduction
Gross primary production (GPP) is the total amount of carbon fixed by vegetation through photosynthesis, contributing the largest global carbon flux and driving ecosystem functions [
1,
2]. The carbon cycle of terrestrial ecosystems is complicated and the uncertainty of terrestrial ecosystem flux remains high, resulting in the so-called “mystery of missing carbon” [
3,
4]. This means that there is an imbalance between carbon sink and carbon sources [
5]. GPP is a key index that can be used to estimate the carbon flux of terrestrial ecosystems [
6,
7]. Therefore, the accurate estimation of GPP is important for studies on the carbon cycle and climate change.
There are four methods commonly used to estimate GPP: (1) the eddy covariance (EC) technique [
8], (2) the light use efficiency (LUE) model [
9], (3) the process-based model [
10], and (4) sun-induced chlorophyll fluorescence (SIF) [
11,
12,
13,
14]. The EC method is considered the most accurate of the four methods [
15,
16,
17]. However, it requires the establishment of flux towers. At present, the distribution of flux towers is uneven and sparse, making it impossible to obtain GPP globally with a high accuracy. The LUE model is widely used to estimate GPP. The LUE approach can be expressed as the product of PAR (the photosynthetically active radiation received by the plant canopy), fPAR (the proportion of PAR absorbed by plant canopies), and LUE
P (light use efficiency). MODIS GPP products (e.g., MOD17A2, MYD17A2, MOD17A2H, and MYD17A2H) are estimated using the LUE model [
18]. However, the LUE model depends on multiple types of ground parameters. Therefore, the accuracy of each parameter will affect the accuracy of the estimated GPP [
19,
20,
21]. One of the process models commonly used is the soil-canopy observation of photosynthesis and energy (SCOPE) balance model. The SCOPE model is a one-dimensional vertical model. It is assumed that the canopy is homogeneous in the horizontal and vertical directions [
22]. In addition, the photosynthesis, stomatal adjustment, and atmospheric disturbance module parameters need to be set with reference to the measured data. Setting a large number of parameters brings great uncertainty to the estimation of GPP. In recent years, SIF has brought new prospects for estimating GPP and monitoring vegetation growth states [
11,
12,
23,
24]. SIF is a by-product of photosynthesis [
25]. Therefore, compared with the traditional vegetation index, SIF can be used to reflect the growth state of vegetation and helps estimate the GPP more accurately [
26]. Many studies have investigated performances of estimating GPP using SIF products derived from greenhouse gases observing satellite (GOSAT) and global ozone monitoring experiment-2 (GOME-2) [
27]. Because of the low spatial resolution of GOSAT and GOME-2 (GOME-2: 40 × 80 km; GOSAT: 10 km diameter), it is impossible to estimate GPP at an ecosystem scale [
13,
28]. With the launch of orbiting carbon observatory-2 (OCO-2), which provides SIF products with a fine spatial resolution (1.3 × 2.2 km), the spatial resolution of estimated GPP has been significantly improved [
13]. However, the footprint of OCO-2 is very sparse and cannot match the flux tower well, which generates huge challenges in estimating the global GPP using OCO-2 SIF [
19,
23]. To solve this problem, Li and Xiao have developed a new SIF product based on OCO-2 SIF and meteorological data, named ‘GOSIF’, which has a time resolution of eight days and a spatial resolution of 0.05° [
18].
The data of OCO-2 SIF, MODIS GPP, and GOSIF all have fine resolutions and have the potential to estimate GPP at an ecological scale. However, many results have demonstrated that MODIS GPP (MOD17A2) underestimates vegetation GPP, especially in Africa [
29]. Other results have shown that MOD17A2 has a limited capacity for representing the temporal and spatial variation of annual GPP in croplands [
8]. Furthermore, only a few studies have validated MODIS GPP (MYD17A2H) with in-situ GPP measurements. In addition, Li and Xiao only compared GOSIF data with OCO-2 coarse-resolution SIF data (1°) in seasonal cycles [
18]. Consequently, there is an urgent need to evaluate those products comprehensively. Moreover, as introduced above, the three products are based on very different techniques. It is thus speculated that GPP estimates based on these products would demonstrate different characteristics. Therefore, it is necessary to conduct inter-comparisons to explore the performances of different products when estimating GPP at an ecological scale. In this study, we utilized in-situ measurements of EC flux towers as the truth value of GPP and established relationships between truth values of GPP and three products of SIF/GPP to evaluate their performances. Inter-comparison experiments were carried out from two perspectives, namely, the site and the vegetation type, in this study. The location of an EC flux site determines the geographic coordinates and meteorological parameters. Performances of satellite products are determined by the original signal quality, which is related to the observation geometry and the sun’s position. Further, meteorological parameters exert significant influences on generations of GPP [
15]. This explains why it is important to evaluate the performance of satellite-observation-derived products in terms of the site. In addition, GPP and its seasonal cycle vary with the vegetation type. The response of SIF to GPP could also vary with the vegetation type. Hence, it is necessary to evaluate the performance of SIF-GPP models from the perspective of vegetation types.
Except for the above stated issues, there are another two topics to be discussed. Firstly, the extraction range of SIF data around the flux tower has always been a problem in correlating remotely sensed data and in-situ measurements. With a small extraction range, one often cannot obtain adequate SIF data to correlate with GPP. With a large extraction range, the spatial mismatch introduces additional errors, resulting in a reduced accuracy of GPP estimates [
23]. Secondly, few studies have been dedicated to investigating the effects of different observation modes and bands of OCO-2 on the accuracy of estimating GPP.
The rest of this paper is organized as follows.
Section 2 introduces the materials and methods.
Section 3 introduces the primary results.
Section 4 presents our discussion. Finally, the whole study is concluded in
Section 5.
4. Discussion
There is a significant linear relationship between SIF and GPP. We demonstrated that SIF757 is better than SIF771 in estimating GPP, which is consistent with many previous research results [
12,
23]. This is because 771 nm falls farther away from peak emission on the SIF spectrum and OCO-2 SIF757 is considered to have a higher level of retrieval precision than SIF771 [
12]. Our results demonstrate that the linear relationship between SIF in nadir mode and GPP is slightly better than that for SIF in glint and target mode. We believe that SIF data in nadir mode has the highest spatial resolution. In this study, we can conclude that the ability of GOSIF and SIF757 to evaluate the GPP of a single tower is not significantly different. However, for different vegetation types, GOSIF was better at estimating GPP than SIF757 [
19]. One of the main reasons for this is that the footprint of the OCO-2 is sparse. OCO-2 has a fine spatial resolution, which provides us with an unprecedented opportunity to explore the relationship between OCO-2 SIF and GPP of different vegetation types at an ecosystem scale. However, the amount of SIF data of OCO-2 around each flux tower is small. This causes the following problems. First, at present, the main research on SIF aims to estimate GPP by SIF. Many researches have demonstrated that there is a strong linear relationship between SIF and GPP compared with the traditional vegetation index [
40,
41]. There is a clear and simple mechanism for estimating GPP using SIF [
42]. However, SIF estimates of the GPP have been limited by the satellite spatial resolution. The GOME-2 data covers the globe and has a long time series, but the noise of the data leads to a poor GOME-2 SIF data quality [
43]. OCO-2 data has a high spatial resolution, but the data volume is small and the time series is short. Currently, there are no satellites dedicated to SIF observation, which will limit our ability to use SIF to estimate GPP. Therefore, we also cannot accurately estimate global carbon budgets. Secondly, many researchers have shown that the significance of the linear relationship between SIF and GPP for different vegetation types is different [
13,
44]. OCO-2 has a small amount of data, which affects our ability to explore SIF to estimate GPP for different vegetation types. Thirdly, in this study, the seasonal cycle of SIF757 presents a ‘single peak’. However, because the data volume of SIF is too small, we cannot compare whether the seasonal cycles of OCO-2 SIF and GPP
EC from 2015 to 2017 are advanced or delayed. Therefore, if we use OCO-2 SIF to explore the phenology of vegetation, it will be a great challenge [
24]. Moreover, most flux tower data are not publicly available. Hence, we validated SIF data by using flux tower data on a global scale, which produces limitations. Above all, scientists using OCO-2 SIF data to estimate global GPP will also face huge challenges.
Although OCO-2 has a high temporal and spatial resolution, the dataset still cannot match flux towers well on an ecological scale [
19,
23]. On one hand, in order to get more OCO-2 SIF data in this study, we had to expand our study area to 25 km. However, the expansion of the research scope led to an increase in mixed pixels, and the increase in pixel heterogeneity may have had an impact on our research results. On the other hand, measuring GPP by EC is complicated with many factors influence the final measurement. For example, GPP values may be influenced by the temperature and tower height. GPP values obtained by EC technology vary in quality. Therefore, when establishing the linear relationship of GOSIF, SIF, and MODIS with GPP
EC, inaccurate GPP
EC values can also cause errors. For example, R
2 of GOSIF-GPP
EC of US-Ro2 and US-ALQ is low. The reason of our analysis is that the accuracy of GPP
EC value is low. In this study, we compared the SIF757-GPP
EC, GOSIF-GPP
EC, and MYD-GPP
EC within 25 km around each flux tower. However, SIF757, GOSIF, and MYD have different spatial resolutions, which means that even within the same extraction range, SIF757, GOSIF, and MYD obtain different amounts of data and the heterogeneity of pixels is different [
23]. These factors also affect the accuracy of the results. Moreover, except for the uncertainty caused by the scale problem, the land cover and the quality of SIF757 are also sources of uncertainty of GPP
SIF.
The GOSIF data have a higher temporal and spatial resolution than the OCO-2 SIF dataset. Our results show that the GOSIF-GPP
EC model is an effective method for estimating GPP. However, the seasonal cycle of GOSIF of GRA and ENF occurred earlier than GPP
EC. This phenomenon was also observed in the MODIS GPP dataset. To calculate the GOSIF data, a large number of EVI and PAR values from the MODIS dataset were used. In addition, the cubist regression tree model’s input parameters from the MERRA-2 dataset have a coarse spatial resolution (0.5° × 0.625°), and the data were resampled to 0.05° [
18]. The resampling process may cause discontinuities between the datasets, creating jagged or smoothing effects, and thus leading to errors in the predictions of SIF [
45]. We can conclude that inconsistent seasonal cycles of GRA and ENF between GPP estimates and GPP
EC could be attributed to inaccurate data from the input model for MODIS or from the MERRA-2 data resampling. Therefore, we call for necessary improvements in the quality of input data for producing GOSIF datasets. When Li and Xiao developed GOSIF data, in order to improve the calculation efficiency of the model and to reduce the uncertainty of the data, only LAI, fPAR, NDVI, and the normalized difference water index (NDWI) were input into the cubist regression tree model [
18]. However, change of land cover type has a great influence on the SIF value. The value of SIF varies with different vegetation types. Therefore, we suggest that land cover types should be added to the model input data to obtain more accurate GOSIF data. Based on our results, we believe that GOSIF data has great potential to estimate the GPP of different vegetation types globally. GOSIF data has a spatial resolution of 0.05 degrees and temporal resolution of eight days, which has important indicative significance for the study of the global carbon cycle using SIF data. At the same time, GOSIF data covering the whole world is of great significance to exploring the function of SIF in drought monitoring, vegetation phenology, and stress monitoring. MODIS GPP products are based on the LUE model. Many inaccurate ground parameters are input to the LUE model, resulting in errors of its GPP products [
9,
46]. In addition, few land cover classifications are incorporated into the MYD17A2H algorithm. At a 500 m resolution, the MODIS GPP dataset can cause the overlapping of land cover types rather than classifying individual pixels [
30]. In this study, we averaged all MYD17A2H pixels within 25 km. To some extent, it can reduce errors and smooth the MYD17A2H data. However, the average MODIS GPP may contain many land types, which could introduce additional errors. Many studies have shown that MODIS data (MOD17A2) underestimate the GPP [
47]. Our study also found that MYD17A2H data underestimate the GPP. The accuracy of MODIS GPP is largely determined by the accuracy of the LUE model and input parameters [
48,
49]. Therefore, we suggest improving the LUE model or controlling the quality of input parameters so that they have a higher precision.
In recent years, more and more researchers are exploring the wider application of SIF data of different vegetation types on the ecosystem scale [
19,
50,
51]. It is a simple and effective method to use data of EC flux tower to verify the accuracy of their simulation results. However, EC flux towers are not evenly distributed on the globe, and data of most flux towers are not available to the public. Therefore, verification and use of SIF data is limited. What’s more, the problem that the range of the observation of the flux tower does not match the extraction range of the SIF data seriously affected the accuracy of using SIF to estimate GPP. Ideally, we believe that the type of vegetation around the flux towers is homogeneous and single. Yet that is not the case. In general, we believe that the smaller the extraction range of SIF data around the flux tower, the higher the accuracy of the result may be. Li et al. [
23] extracted 2 × 2 km SIF data around tow flux towers and GPP data proved that OCO-2 SIF has a significant linear relationship with GPP. Lu et al. [
50] extracted 10 km SIF data around flux towers found that a strong correlation between GPP and SIF. Wood et al. [
26] and Verma et al. [
15] made the evaluation of OCO-2 SIF using tower GPP in corn and grassland ecosystems within a radius of 10 to 25 km. Therefore, the extraction range of SIF varies for different research purposes. However, if the extraction range of SIF data around the flux towers is too large, the more vegetation types may be included and the higher pixel heterogeneity of SIF data will be. This will eventually affect the accuracy of results. Therefore, we calculated max(LC) to ensure the accuracy of the this study. With the launch of the OCO-3 (orbiting carbon observatory 3) satellite and the TROPOMI (TROPOspheric Monitoring Instrument) satellite with finer resolution [
52]. An increasing amount of flux tower data are also public. The mismatch between extraction range of satellite’s SIF data and observation range of EC flux towers may be better solved in the future.
5. Conclusions
In this study, we evaluated the performance of the three remotely sensed products MYD17A2H, GOSIF, and OCO-2 SIF when estimating GPP. We modeled and compared remotely sensed product-derived GPP estimates with GPPEC data from 23 flux towers and eight vegetation types (evergreen needleleaf forests, deciduous broadleaf forests, open shrublands, grasslands, closed shrublands, mixed forests, permeland wetlands, and croplands). The GOSIF dataset yields the best results in the three datasets, though it leans towards overestimating GPP for deciduous broadleaf forests and grasslands. The MYD17A2H dataset leans towards underestimating GPP by 1.157 to 3.884 gCm−2day−1 for all vegetation types, especially for croplands, grasslands, and evergreen needleleaf forests. The insufficient amounts of OCO-2 SIF make it difficult to establish good relationships between OCO-2 SIF and EC GPP for different vegetation types. In terms of the seasonal cycle, GOSIF and MODIS GPP demonstrate a good consistency with GPPEC. Seasonal cycles of MYD17A2H and GOSIF show advanced trends compared with GPPEC for grassland and evergreen needleleaf forest. Though the amount of OCO-2 SIF data is small, its seasonal cycle basically follows the trend of GPPEC. Observation modes of OCO-2 SIF exert an evident influence on estimating GPP. After applying a unified extraction range for the three remotely sensed products in the spatial registration process, the results still show that the ability of GOSIF data to estimate GPP is better than that of MYD17A2H and OCO-2 SIF. This also suggests better performances of SIF in estimating GPP than those of the LUE model. The MYD17A2H product has a long time series and is thus suitable for monitoring the seasonal cycle change of vegetation, as well as total amounts of GPP. The GOSIF dataset has a large amount of data with a high temporal and spatial resolution, which is useful for estimating global GPP and monitoring the vegetation growth status. The data volume of OCO-2 SIF is relatively small, but OCO-2 provides the first opportunity to estimate GPP at an ecological scale. With the development of spaceborne high-resolution spectrometer techniques, similar satellite-derived SIF products would demonstrate very promising prospects in monitoring GPP at an ecological scale.