1.1. Accuracy of Satellite-Based Acquisition of Lake Water Temperature Measurements
Lake surface water temperature (LSWT) has been identified by the global climate observing system (GCOS) as one of the essential climate variables (ECV) [
1]. Consequently, there is ongoing interest to monitor this variable to detect long-term trends. Water temperature is not only an important ecological parameter in lacustrine eco-systems [
2], it can also serve as a proxy for detection of local climate change [
3]. Studies have revealed global warming trends for LSWT within different climate zones (e.g., [
4,
5]), and there is evidence that indications of climate change are sometimes even stronger within lake than in air temperature records (e.g., [
6,
7]). To further explore these trends on a continental or even a global scale, there is a need to generate accurate and homogeneous LSWT time series from different climatic regions. The World Meteorological Organisation (WMO) recommends that time series prepared for climate studies should ideally consist of data records that exceed 30 years.
Traditionally, lake water temperature monitoring is based on in-situ measurements acquired from individual lakes. Thus, the availability of measurement data is often restricted to point-based locations, and limited to a specific duration. Moreover, a range of measurement methods have been applied to retrieve these in-situ records (e.g., based on the use of different instruments, defined measurement depths, etc.). This heterogeneity in terms of data coverage and retrieval methods hampers the detection of climate signals within individual lakes and even more notably when comparing climate signals associated with different lakes.
Satellite-based temperature measurements overcome these limitations. The data is spatially and temporally homogeneous and is acquired with a consistent measurement method across the defined regions of interest. They can be used to create independent time-series datasets, to complement existing in-situ temperature records or to merge different datasets by serving as a relatively more robust and extensive baseline record. The advanced very high-resolution radiometer (AVHRR) sensors provide thermal infrared (TIR) data at two separate wavelengths, from NOAA-satellites since the early 1980s and from EUMSAT MetOp-satellites since 2002. Consequently, time series data that spans more than 30 years is available. The potential to generate extensive and temporally homogeneous time series provides motivation to further improve AVHRR data processing methods.
However, the accuracy of LSWT retrieval from raw satellite data depends on many factors. Many sources along the processing chain contribute to the aggregated uncertainty associated with the final data product [
8]. The retrieval method itself (accounting for atmospheric correction), hardware/sensor-related uncertainties, and uncertainties from different sources along the processing chain (cloud screening, geolocation, resampling, and calibration) are all known sources of contribution. Furthermore, the accuracy of LSWT retrieval from satellite data strongly depends on the characteristics of the target lake defined by its local properties (climate, latitude, altitude) and its morphology (depth, size, flow dynamics) [
9,
10].
Under optimal conditions and with careful post-processing, an accuracy of <0.5 K can be achieved [
11,
12,
13]. However, the accuracy target of 0.2 K, mentioned by GCOS for satellite-based lake water temperature retrieval [
14], is still beyond the capabilities of current LSWT retrieval methods for earth observation data. Furthermore, even the more realistic accuracy target of 0.5 K (with a threshold of 0.8 K) required for the operational sea surface temperature and ice analysis (OSTIA) product used in numerical weather predictions at the European Centre for Medium-Range Weather Forecasts (ECMWF) [
15], is difficult to achieve [
16]. In general, validation studies of automated retrieval methods designed to be used with data from multiple inland water bodies are in general not yet able to reach the 0.5 K accuracy target [
13,
17,
18,
19,
20,
21]. The best results have been achieved with AATSR (0.56 K [
13]) and ATSR-2 (0.47–0.64 K [
17]) sensors. Operational accuracies associated with AVHRR and MODIS sensors range between 1 K and 2 K in comparison [
18,
19,
20,
21].
The first part of this study, focuses on optimizing the split-window retrieval method itself. Thus, only the correction of the atmospheric attenuation of the TIR signal along its path from the earth’s surface to the top-of-the-atmosphere (TOA) is analyzed. Other sources of uncertainty from the pre- and post-processing of the data (e.g., geo-referencing, instrument calibration, cloud filtering, skin-bulk temperature conversion, etc.) are not addressed within the scope of this study. We hypothesized that the more we tailor the retrieval method to specific atmospheric conditions, the higher the associated accuracy. However, this would also effectively restrict the applicability of the method to a wider range of scenarios. We compared the performances of different customized retrieval methods to identify reasonable and optimal trade-offs between accuracy and the applicability of the respective methods to a range of scenarios.
The second part considered the implications of the results from the first part of the study within the whole processing chain (i.e., starting from data preprocessing to the generation of the final LSWT product). This procedure evaluates the impact of different tailoring approaches on the overall accuracy of the processing chain. Firstly, the different resultant AVHRR LSWT products were validated against in-situ data to determine if the inherent SWC uncertainty could be effectively linked to the overall uncertainty. Secondly, a theoretical analysis with a simple error propagation model was conducted to estimate the impact of different tailoring methods with respect to different sensors (e.g., (A) ATSR, MODIS, and AVHRR).
1.2. The Satellite-Based LSWT Retrieval Method
The split-window approach is based on making use of the different absorption characteristics of water vapor between differential TIR spectral windows [
22]. Two algorithms are currently applied in various applications. The multi-channel SST (MCSST) [
23] assumes that there is a linear relationship between the atmospheric transmittance of the signal and the difference in absorption between the two channels. The non-linear SST (NLSST) used with NOAA SST products [
24] improves the accuracy particularly for day-time measurements [
25], because it considers the non-linear nature of atmospheric absorption processes. However, when adapted and applied to inland water bodies, no significant improvement was found with NLSST rather than with the MCSST algorithm [
13,
26,
27]. Thus, we use the well-established MCSST-based split-window method (Equation (1)) with radiative transfer modelling for synthetic matchup generation. This was previously applied in [
13,
19].
split-window coefficients (SWC)
brightness temperature from AVHRR bands 4 and 5
satellite’s view zenith angle (VZA)
1.2.1. Empirical Matchup Data for SWC Generation
The split-window coefficients (SWC) in Equation (1) are empirical coefficients, which are determined by multi-linear regression with a set of empirical matchup data (hereafter referred to as matchup data). The selection and the quality of the matchup data is of crucial importance, as the accuracy associated with the MCSST retrieval approach depends on the quality of the split-window coefficients. The matchup data is comprised of simultaneously taken measurements of the same feature. Each pair contains a TOA measurement from the satellite, and a coincident measurement at the earth’s surface (in-situ measurement).
Current sea surface temperature (SST) products use in-situ data from buoys, which are distributed in the oceans around the world, to determine globally valid coefficients. However, buoy data is often sparse or unavailable. While other in-situ measurement data from stations near the shore can be used instead, such data does often not adequately represent the LSWT of open water. Especially when temperatures are measured in shallow waters or within semi-isolated areas like narrow bights or harbors.
An alternative to in-situ data is the use of synthetic matchup data generated by radiative transfer modelling. This has been successfully applied and validated in recent studies (e.g., [
13,
17,
19]). The idea is to generate matchup data using a radiative transfer (RT) model. The RT model ingests atmospheric profiles from NWP re-analysis to yield the brightness temperatures as observed by the satellite’s sensors for any given surface temperature. This underlying concept offers the advantage of generating a synthetic dataset that is independent of the quality and availability of in-situ data. Moreover, synthetic matchup data can be generated in large quantities, which offers more flexibility to produce tailored SWC.
1.2.2. Tailoring the Split-Window Coefficients
The SWCs are derived from the polynomial split-window equation (Equation (1)) by regression, using empirical pre-calculated matchup data. Thus, the SWC are tailored to and valid for situations that they were trained for, using the matchup data. For example, if only matchup data with surface temperatures below 5 °C are used to train SWCs, then the applications of the resulting coefficients are limited to cases with comparable surface temperature. While highly tailored SWC have limited ranges of applicability, they yield results with greater accuracy.
The concept of reducing the validity of SWC across a wider range of conditions to improve their performance under appropriate scenarios has been successfully applied in various studies. For example [
13,
19] spatially restrict the coefficients’ validity to lake specific coefficients; the MODIS product uses two sets of coefficients [
28], each tailored to different ranges of water vapor; and the Pathfinder algorithm [
29] uses temporally-tailored monthly coefficients combined with coefficients tailored to different water vapor ranges. The contrary can also be done to simplify and maximize the utility of derived SWC under a wider range of conditions. For instance, Hulley et al. [
13] assumed that all of the atmospheric states at a local point are covered within year of matchup data, such that the coefficients are universally valid with respect to time.
While all of these SWC tailoring approaches have proven merits, it is uncertain as to which approach is associated with better relative performance with respect to the others. This study addresses this knowledge gap by quantifying and comparing the relative performance related to the most important types of SWC tailoring approaches. A sensitivity analysis is conducted with data from five different regions. This quantification exercise supports the direct comparison of the coefficients’ intrinsic errors, and effectively highlights the main sources of SWC related uncertainties.