1. Introduction
In recent decades, an increasing number of satellite (and sensor) systems for Earth observation have provided large datasets of remote sensed imagery and indices, contributing to monitoring environmental changes at both regional and global scales. However, despite their increasing availability, this information cannot be easily compared due to slight differences among sensors, and thus, it is essential to define standards for cross-device validation, as well as reliable algorithms for dataset difference reductions [
1,
2].
To date, the cross-comparison analysis covers most of the different optical- and radar-based satellite systems currently in use for earth observation. These studies involve both the intercalibration among different satellites [
3,
4,
5,
6,
7,
8,
9] and different sensors built within the same satellite systems. For example, regarding the Landsat system, different intercalibration algorithms have been identified both with different satellite systems [
7,
10,
11,
12,
13,
14,
15,
16,
17,
18] and with different sensors such as Multi Spectral Scanner (MSS), Enhanced Thematic Mapper plus (ETM+) and Thematic Mapper (TM) [
19,
20,
21,
22,
23,
24,
25].
Generally, the intercalibration analysis achieves poor performance among different satellite systems due to the different spatial and spectral resolution of remote sensing imagery [
26]. On the other hand, the intercalibration analysis among different sensors within the same satellite has shown good results. As an example, in Xu and Guo [
27], the cross-comparison between the Normalized Difference Vegetation Index (NDVI) extracted from Landsat 8 and Landsat 7 images, showed larger differences between NDVI calculated from the two generations of Landsat in lower vegetation covered areas; however, the difference decreases at higher vegetation cover (i.e., the NDVI values increase).
In other cases, even if the differences in terms of reflectance for the spectral bands [
28] and other vegetation indices [
29] was more significant, mainly for near infrared (NIR) and short wave infrared (SWIR), the regression analysis among the vegetation indices calculated with the two sensors provided high correlation values for other indices such as Land Surface Water Index (LSWI), NDVI and Normalized Burn Ratio (NBR) [
28].
Within this context, vegetation indices are mostly implemented in the cross-comparison quantitative analysis [
3,
30,
31,
32,
33] due to the low sensitivity to the atmospheric correction errors and to the different satellite visual angles. In particular, the NDVI is one of the most well-known and widely implemented at global scale for the environmental bio-physical characterization (vegetation cover, biomass, net primary production, etc.), climate changes, and environmental and hydrological modelling. A large set of intercalibration functions among NDVI indices from different sensors is reported in Steven et al. [
7].
Among the various satellite platforms, Landsat can potentially provide long-term regional and global-scale high-quality NDVI data, thanks to the sensor’s high resolution. Indeed, NOAA (National Oceanic and Atmospheric Administration) Advanced Very High-Resolution Radiometer (AVHRR) has provided data since 1989 at 1 km geometric resolution, and since 1982 at 4 km geometric resolution. The SPOT (Satellite Pour l’Observation de la Terre) VEGETATION NDVI has provided NDVI data since 1999 at 1.5 km spatial resolution [
34], while the Moderate Resolution Imaging Spectroradiometer (MODIS) NDVI datasets have been available since 2000 with different spatial resolutions, 250 m, 500 m, or 1 km [
35].
The different Landsat generations, including the Landsat MSS, the Landsat 4-and 5 TM, the Landsat ETM+ and the current Landsat 8 Operational Land Imager (OLI), have provided data since 1972 [
36], at 79 m spatial resolution before 1982, and at 30 m resolution since 1982.
In order to use long-term information, it is necessary to intercalibrate the images provided by the various Landsat sensor generations, obtaining standardized vegetation indices. Besides the NDVI, many other vegetation indices sensitive to the spectral bands difference effects (SBDE) have been formulated over the years, such as those resulting from the combination of visible bands as Atmospheric Resistant Vegetation Index (ARVI) and the Modified Triangular Vegetation Index (MTVI); whilst, the SBDE from other indices (e.g., Normalized Difference Water Index-NDWI, LSWI) have not been reported yet.
Landsat TM and ETM+ sensors allowed earth observation since the launch of Landsat 4 in 1982. The Landsat mission has continued with the Landsat 5 launched in 1984, carrying the same instrumentation of Landsat TM, and later with the launch of Landsat 7 in 1999 with the ETM+ sensor. The Landsat missions have provided a large amount of earth reflectance data collected in six spectral bands with different wavelengths at 30 m spatial resolution. Although the reflectance measured by TM and ETM+ sensors can be considered comparable, due to the similarities in the bandwidth and position, several studies on the intercalibration between the images yielded by different sensors have been carried out in order to provide long-term data [
37]. The last Landsat generation satellite was launched on February 2013, initially known as the LDCM (Landsat Data Continuity Mission) was finally named Landsat-8. The satellite has been equipped with two new sensors: The Operational Land Imager (OLI), designed in order to operate in continuity with TM and ETM+; and the Thermal Infrared Sensor (TIRS), which features two bands in the thermal infrared region. The OLI sensor includes enhanced bands due to new linear detector arrays which collect images in a push-broom scanner mode providing a better signal with a high signal-to-noise ratio, compared to the previous whiskbroom scanner-based sensor [
38]. The improved signal-to-noise performance is quantized over a 12-bit dynamic range, enabling a better land cover status characterization, with 12-bit images (scaled to 55,000 grey levels) [
39]. Along these lines, the sensor is able to highlight a higher earth surface variability due to the maximization of the radiance ranges for all the spectral bands [
40], and at the same time to reduce the saturation of highly reflective surfaces. However, the OLI sensor maintains the same geometric resolution, scene size (170 km × 183 km) and revisit time (16 days) compared to previous Landsat generations.
Thus, the main differences between OLI and previous TM and ETM+ sensors refer not only to the overall image quality but also to the different number of spectral bands, their width and their spatial resolutions [
41]. In particular, the OLI sensor provides new bands such as: i) Band 1 (deep blue and violet) with a shorter wavelength (0.43–0.45 μm), also called the coastal/aerosol band due to its main uses; and ii) Band 9 covering a very short range of wavelengths in the short-wave infrared (1.36–1.39μm), also called Cirrus band due to its cloud cover sensing capacity (
Table 1).
Furthermore, the Landsat 8 OLI bands are narrower compared to the Landsat 7 ETM+, avoiding atmospheric absorption [
38]. In particular, the OLI NIR band is more similar to the MODIS NIR band, except for the 0.825 μm wavelength relative to the ETM+ for the absorption of water vapour [
39]. The OLI Bands 6 and 7 are narrower compared to the ETM+ bands 5 and 7 respectively, reducing the atmospheric absorption and thus reducing the sensibility to atmospheric changes in terms of water vapour content.
To improve the standardization among different Landsat image generations, it is thus necessary to verify if the substantial differences between the two sensors regarding both bandwidth for the visible and SWIR, and the detection system technology, allowing for an efficient comparison among the images by the two sensors.
The main objectives of this study are: i) to extend the knowledge on the effects that the differences between Landsat 7 ETM+ and Landsat 8 OLI spectral responses may have through the analysis of different land use types through a wide range of derived vegetation indices; ii) to evaluate specific intercalibration functions for the standardization of vegetation indices in order to perform long-term time series analysis.
Up to date, the differences between Landsat 7 ETM+ and Landsat 8 OLI spectral responses have only been analyzed on a single derived vegetation index such as the NDVI, while in the present study, we extended the analysis to seven different indices and different land use types in order to achieve a more comprehensive analysis between the two sensors.
3. Results
Figure 3 shows significant differences in the average reflectance values between the ETM+ and OLI bands by land use classes. In particular, the OLI NIR band seems to be more sensitive (higher average reflectance value compared to ETM+) in the forest class, while the reflectance in the visible band (and especially in the red one) is greater in ETM+ than OLI, determining the higher values of OLI NDVI than ETM+ NDVI.
The SWIR1 and SWIR2 infrared bands show few differences for the forest land use class with a slight absorption for the SWIR1 OLI band. The NIR OLI band shows higher average reflectance value than ETM+ for the land use classes characterized by high vegetation cover than for low vegetation cover areas (arable lands, urbanized areas). The SWIR (SWIR1 e SWIR2) OLI bands show higher absorption values than ETM+, although these differences tend to decrease in the area with high vegetation cover. The highest differences between the two sensors refer to the water bodies with significantly higher average reflectance values for all the ETM+ bands, and particularly for the SWIR1 band. The sensor sensitivity to visible bands shows an opposite behaviour for the NIR, in all the land use classes, with ETM+ average reflectance values constantly higher than OLI (especially for the red band) with significant differences for pastures, arable lands, and water bodies. To compare the ETM+ and OLI derived vegetation indices, for the 4 plots, the NDVI, LSWI, NDWI, NBR, VIgreen, SAVI, and EVI have been calculated from the ETM+ and OLI images. The Student’s
t-test results (
Table 5) show a significant difference between the ETM+ and OLI derived vegetation indices (NDVI, LSWI, NDWI, NBR, VIgreen, SAVI, and EVI) for the 4 plots.
As shown in
Figure 4, there are relevant dissimilarities between the two sensors in all indices. The NDWI, SAVI, and LSWI indices show the higher correlation between the two sensors with a mean difference of −0.0500, −0.0725, and 0.0819, respectively. Instead, the standard deviation is smaller than the other indices. The other indices (NDVI, VIgreen, and EVI), with the exception of NBR (which has the difference average value equal to −0.8446, and the standard deviation of 1.27) show difference average values higher than 0.1. Among these, the EVI shows a greater similarity between the two sensors (with very low deviations), followed by the NDVI with an average difference value equal to 0.1223. The VIgreen shows the higher differences between OLI and ETM+ (average difference value equal to 0.1367).
The indices based on non-visible bands combinations (LSWI and NBR) are therefore sufficiently similar and stable between the two sensors, with few fluctuations in the average difference over the four plots. However, the NDVI and NDWI show the greater stability in the different plots with very few fluctuations. The NDWI also shows the lower differences between the two sensors in all the plots. The VIgreen index shows the lower stability, confirming that indices using only visible bands show higher variation due to the SBDE (spectral difference between bands) [
2,
29].
In particular, these results are related to the main differences in the red band between the two sensors for all the land uses with the exception of urbanized areas.
Figure 4 was characterized by small differences in every index. This can be attributed to the morphological characteristics of the area depending on climatic conditions and on land use classes. In particular the plot 4 (
Table 2), located in the Southeast of the regional survey area, is characterized by low altitude and flat terrain. As for the land use classes,
Figure 4 shows a predominance of arable land (about 50% of the area), followed by forests, for about 1/3 of the entire area, and tree crops that reach here the greater extension (12% of the plot area) among all the plots.
The results of the analysis of the difference variability between the vegetation indices for each of the test area and for each land use class are reported in
Figure 5 and
Table 6.
The results in
Table 6 show that, for almost all the considered indices, statistically significant differences exist between the two sensors by land use class. Only for water bodies class, NDWI (often used for the identification of wet areas), SAVI and EVI indices do not show significant differences between the sensors, and SAVI also does not show significant differences for the Urbanized and Mixed agricultural areas classes. This result also confirms the robustness of the correction of the soil brightness introduced in the index for both sensors, for the discrimination of the classes in which the vegetation is absent (Urbanized areas) or it is interspersed with bare soils (Mixed agricultural areas).
According to the box-plot results showed in
Figure 5, for forest land use, some indices based on the combination of NIR and Red (SAVI and EVI) show a remarkable difference between the two sensors with significant variability. The same performance for the forest land use has been found in NBR, based on the combination of infrared bands, and VIgreen, based only on visible bands. NDVI and NDWI for forest land use show good results with low average difference values and low variability. Tree crops show average difference values generally close to zero and low variability with respect to the other land uses for all indices.
Even arable land classes show a similar trend to the tree crops with generally low average difference values between the two sensors, both for the vegetation indices based on the combination of infrared bands (NBR, LSWI), and based only on visible bands (VIgreen).
The plot 4, characterized by a lower percentage of forest land use cover and combined with the higher percentage of tree crops cover and high arable lands cover, shows a low variability of the average difference values for all the vegetation indices.
Plots 1 and 2, characterized by a higher percentage of forest land use cover (more than 60% of the plot area), by a lower percentage cover of arable land, and by a significant reduction, compared to the fourth plot, of tree crops land use (respectively 0.1% and 2% of the surface of the plot), show the highest variability for all vegetation indices.
Finally, the regression analysis between the vegetation indices ETM+ based and OLI based has been conducted in order to identify the intercalibration functions between the two sensors datasets. The goodness of fit of the OLS regressions were defined by the coefficient of determination (R
2) and the significance of the OLS regressions was defined by examination of the regression overall F-statistic
p-value. To provide simple overall measures of similarity, the Root Mean Square Deviation between the OLI and ETM+ data has derived:
The identified functions and the related indices (
Table 7) show a good linear relationship between the dependent variables (indices calculated with OLI) and the independent variables (indices calculated with ETM+). The intercalibration functions, however, show significant dissimilarities among vegetation indices: the indices based on the combination between infrared and visible bands (NDVI, NDWI, SAVI, and EVI) show a better correlation with a determination coefficient always higher than 0.8. Instead, the indices based only on infrared (NBR and LSWI) or on visible bands (VIgreen) show a lower correlation. In particular, the VIgreen shows a determination coefficient less than 0.5, highlighting the differences SBDE in the range of the visible waveband, and, thus, confirming the previously described analysis.
NBR and LSWI also show a poor correlation, particularly for NBR, which highlight the existence of a non-linear component between the two sensors. In addition, in this case, the differences are most likely related to SBDE, as for the bands SWIR highlighting a significant difference in the bandwidth between the two sensors.
From the regression analysis, it has been found that lowest values are reported in plot 4, whilst plot 2 and 3 show higher correlation values. In particular, plot 4 is characterized by the lowest average values for all the calculated vegetation indices, whilst plot 2 and 3 are characterized by highest average values for all the calculated vegetation indices.
These differences also suggest that the higher variability within vegetation indices values occur within areas characterized by low vegetation indices values and thus for areas with a low percentage of vegetation cover.
Figure 6 shows different trends according to the land use type and vegetation index. The OLI NDVI is always higher than the ETM+ NDVI in all land uses, with higher differences for pasture and arable land and a decrease for tree crops and forests. This may suggest that the greatest differences of index values are related to low biomass content (pastures and arable land), whilst lower differences are related to the high value of biomass content. In particular, pasture, arable lands and tree crops show a regression function with a slope almost parallel to the 1:1 line, and forest land use shows the largest differences for lower NDVI values. With the increase of the index value, there is a constant reduction of the differences between OLI and ETM+ derived indices. The EVI shows the same trends described for NDVI and for indices based on the normalized difference of infrared bands (LSWI in
Figure 6). VIgreen, based on visible bands (green and red), shows the greatest differences for land use classes characterized by high values of vegetation cover (forests) with differences nearly constant. This confirms the existence of differences due to SBDE between the two sensors in the visible range, especially with respect to the red band. For arable land, the trends of the index are different because OLI is higher than ETM+. For low index values, the increase of the VIgreen values ETM+ is higher than OLI ones due to a greater absorption in OLI red band together with the intensification of the vegetation cover.
4. Discussions
The comparison between Landsat 7 ETM+ and Landsat 8 OLI to identify the main differences between the two sensors and the spectral responses for different land uses presented in this study, showed significant differences among spectral ranges and vegetation indices derived from the two sensors. However, the 8-day difference between the two Landsat 7 ETM+ and Landsat 8 OLI scenes and the period of the analysis minimizes the effects of vegetation phenology and thus increasing the reliability of the results of the present study. The comparison among the spectral bands provided by the two sensors showed a good similarity. Nevertheless, significant differences are evident in the various spectral ranges: i) high reflectance values of OLI in the band of NIR for all the land use classes, particularly for high values of vegetation cover, ii) and lower reflectance in the SWIR bands, especially for low vegetation cover land uses as for the pastures and grasslands. There are decreasing differences in SWIR band with the increase of biomass values up to maximum values represented by forest cover.
For the visible bands, the ETM+ sensor shows reflectance values greater than OLI ones. The analysis of vegetation indices based on the two sensors shows low differences between the indices based on the combination of NIR and visible bands (NDVI, EVI, NDWI, and SAVI), whilst the indices based only on the visible bands (VIgreen) or only on the infrared bands such as NBR, show higher values of difference. A different result is showed by the analysis of LSWI which, according to the results showed by Li et al. [
29], is not only more similar but also more stable between the two sensors, with low fluctuations in the various plot areas. The regression analysis among the vegetation indices calculated with the two sensors highlights a good correlation, especially for NDVI, NDWI, SAVI, and EVI, with an index of determination higher than 0.8. On the other hand, both VIgreen and NBR show a high dispersion, probably due to the differences in the spectral response of corresponding bands.
With regard to the study of the intercalibration functions between ETM+ and OLI derived vegetation indices for the Mediterranean region is fundamental to ensure an effective operational continuity between the current and previous Landsat missions.
As showed by the literature and confirmed by the results of this study, the two sensors derived vegetation indices are not directly comparable. Differences between Landsat 7 ETM+ and Landsat 8 OLI vegetation indices are directly related to the different width of spectral bands and to the different radiometric resolution between the two sensors [
104].
Different studies showed how the differences for high values of NDVI are directly related to the spectral bandwidth; other studies highlighted the inverse relationship between the increases in bandwidth and mean NDVI values [
46].
Another factor affecting the differences of vegetation indices between the two sensors, and in particular, the dynamic range of vegetation indices, is the difference in spectral band resolution [
105]: the higher is the number of bits, the higher is the variability of the estimated index. According to this, the Landsat 8 OLI derived vegetation indices represent an effective tool to study the ecosystem variability in the Mediterranean region, characterized by a very high land use and land cover variability, due to the combination of both land management practices and physical characteristics mainly related to high climatic variability.