3. Materials and Methods
3.1. Investigated Time Period
The investigation focuses on winter of 2014 and 2015, since they were amongst the coldest and longest since MODIS and VIIRS launched, and produced substantial ice cover within the study area. In total, 317 days were evaluated, ranging from Day 305 (1 November) in 2013 through Day 110 (20 April) in 2014 and from Day 311 (7 November) in 2014 through Day 91 (1 April) in 2015. The time ranges were selected to include those where ice presence could be inferred from the discharge data quality flag at the Harrisburg gauge (USGS01570500), plus an anterior and posterior buffer period of at least two weeks. While 2015 did not have any notable mid-winter thaws, a thaw period of about two weeks was noted within the 2014 time-range referred to as “ice” period. Visual inspection also showed that ice and snow are mostly concurrent, and thus time periods identified as “ice” also refer approximately to snow/ice cover periods.
On few occasions, data were either missing or otherwise not produced for the shortwave infrared (SWIR) bands used to develop the contrast-based cloud mask, and were treated as non-observations. Non-observations were also grouped together with cloudy data (overall referred to as cloudy), although in some instances the non-observed scene may have been cloud-free. This treatment is not considered to greatly impact any of the conclusions and presented results, since: (1) non-observations were rare; and (2) some of them would likely have been cloudy. For a time range spanning 317 days, MODIS-Terra, MODIS-Aqua and VIIRS, respectively, had seven (Day 345, 2013; Day 31, 2014; Day 73, 2014; Day 346, 2014; Day 50, 2015; Day 56, 2015; Day 63, 2015), one (Day 77, 2015) and two (Day 35, 2014; Day 316, 2014) non-observations.
3.2. MODIS/VIIRS Data
Data from MODIS collection 5 and 6 (MOD/MYD09GA) and VIIRS collection 3110 (NPP_DSRFLD _L2GD) for tile h12v04 were used, obtained from LAADS Web. Both products are similar in that they have surface reflectance data in the visible, near infrared and shortwave infrared bands gridded to 500 m on sinusoidal projection, are atmospherically corrected, include viewing geometry and respective cloud data on 1 km sinusoidal grids. MODIS data include two cloud masks, but only the “external” MOD35 mask was used. A binary cloud mask was derived from the unobstructed field-of-view (FOV) flags that are passed on from the MOD35 cloud products. In both cases, the products are composed of several observations that may have occurred during the day. We only took data from the first layer of the composited product. These data should be considered the best available observations according to the scoring/compositing scheme used [
32].
3.3. Binary Cloud Mask for MODIS, VIIRS
The VIIRS (MODIS) cloud masks are categorized as confidently clear (clear), probably clear (not set), probably cloudy (mixed) and confidently cloudy (cloudy). A binary cloud mask was constructed by grouping “confidently clear” and “probably clear” as cloud-free and “probably cloudy” and “confidently cloudy” as clouds. To avoid alignment errors, coarser resolution cloud data were first sampled to 500 m grids via nearest neighbor interpolation, before further sub-setting to the region of study. The region of study consists of 158 by 154 grid cells (24,332 total).
3.4. Land-Water Mask
In order to ensure that identified grid cells mainly consist of water, it was also necessary to develop a land/water mask on 500 m resolution sinusoidal grid. This was accomplished from multispectral (bands 1–5 and 7) maximum likelihood classification of MYD09GA data. The classification was done on imagery from different times during low flows in summer and only those grid cells classified as water in every instance was kept. The process helps to limit sampling to mostly water only grids, and also ensures temporal stability in case of wandering river bed. We did not notice any significant changes in the river bed location during the investigated time periods (2001–2016). The final river mask consists of 402 grid cells, and was used to process all of MODIS and VIIRS data.
3.5. Parameters: Effective Revisit Time, Data, Absolute Cloud Cover
Cloud products and biases were evaluated using the metric “effective revisit time”,
rev,
where
dtot is the number of samples (days), data is the average proportion of cloud free grids (
cfg) within
dtot, 402 represents the total number of river grids within the field of view,
cfg refers to the number of cloud-free grids (ranging from 0 to 402) on Day n. For example, if the river were fully free of clouds each day (
data =
dtot),
rev is equal to 1. If on average only half of the river is sampled every day (
data = 0.5 ×
dtot),
rev becomes 2 days, and so forth. While it may be a stretch to refer to results based on analysis of only 317 days as a climatology, results are nonetheless expected to approximate the climatological mean.
Based on data presented in
Section 2.2, a cloud cover estimate in terms of effective revisit time can be established. Assuming an absolute winter cloud cover of 75% for MODIS-Terra data yields
rev = 4.0 days. Assuming a 5% higher absolute cloud cover of 80% for MODIS-Aqua data yields
rev = 5.0 days. These approximate estimates correspond to the mean improvement of STC processing with NASA processing for Winter 2014: prior results, using slightly different cloud masking, yielded
rev of 5.1 and 4.1 days, respectively, for NASA(Aqua) and STC(Aqua) [
12]. We note that small differences in absolute cloud cover may have a disproportionate impact on effective revisit time: 5% can be the difference from
rev = 1.05 to 1.0 days (no clouds), or that of
rev = infinite to 20.0 days (100% vs. 95%). Since the base value is already high (~75%) in the region of study, small differences in absolute cloud cover are relevant.
3.6. Contrast-Based Cloud Mask Approach (STC)
The product cloud masks were also compared to cloudiness determined from STC [
12]. The primary objective of STC is to detect the easier-to-identify the otherwise error-producing, opaque clouds: clouds which for the application of snow/ice monitoring obstruct the field of view to such a degree that it is not possible to accurately identify water, ice or snow. As indicated by the high occurrence of semitransparent single-layer clouds in the area (
Section 2.2), more frequent, reasonably accurate observations may be realized. With respect to semitransparent clouds, it is difficult to ascertain whether a grid is exactly cloud-free or not, and whether a cloud mask truly produced a false detection: the cloud mask may have correctly detected an optically thin cloud, which in some cases may appear as “invisible” under visual inspection. In these cases, the view of the underlying surface is usually sufficiently clear to accurately classify whether there is water, ice/snow or bare land. We showed in previous work that STC(Aqua) vs. NASA(Aqua) outside of snow/ice cover provided similar effective revisit times (4.6 vs. 4.0 days), but performed substantially better during snow/ice cover (3.8 vs. 7.1 days), with no non-detections or false detections with respect to hydrometric station data. Further details on the algorithm and validation for MODIS-Aqua may be found in Kraatz et al. [
12]. The portion of the STC algorithm relevant to cloud screening, with information on the tests, conditions and thresholds is provided in
Figure 3.
STC cloud discrimination consists of two test: Test 1 for evaluating the scene for optically thick clouds and if deemed reasonably cloud-free, individual grid cells are evaluated for clouds with Test 2. While the tests are intended to mainly detect view-obstructing opaque clouds, obstructing semitransparent clouds also tend to be routinely identified. Thresholds were derived from band statistics of land and water grid cells for each instrument, and thus are different for each platform. The thresholds for Test 1 were applied to the field of view reflectance mean of all land (river) grids, rather than individual grid cells. The obtained snow/cloud delineating thresholds correspond well with those found in literature. For band 7 (2.1 μm) a threshold of <0.1 was proposed by Dozier [
22] and also adopted by Thompson et al. [
11].
As part of Test 1, river and land grid cells need to be spectrally separable for snow-free (th1) and snow-covered (th3) scenes. Thresholds for snow-free scenes (th1) requires stricter contrasting thresholds (ranging from 0.4 to 0.6) but allows for higher 1.6 μm (VIIRS) or 2.1 μm (MODIS) mean scene reflectance (th2) compared to snow-covered scenes (th4). Low view angle snow-covered scenes often meet both criteria of Test 1. If neither C1 nor C2 of Test 1 is met, the scene is not deemed of sufficient quality and not processed.
For cloud/ice discrimination of Test 2 for the low and moderate confidence layers, cloud delineating thresholds of 0.11, 0.15 (previously 0.19 in [
12]) and 0.14 were used for MODIS-Terra, MODIS-Aqua and VIIRS, respectively. To be categorized as high confidence ice the SWIR bands are required to have values less than or equal to 0.08, 0.11 and 0.10 for MODIS-Terra, MODIS-Aqua and VIIRS, respectively. The range between the high and low thresholds represents the semitransparent range allowed (and discriminated) by the algorithm.
Due to how the algorithm is currently implemented, it is rarely possible that an apparently clear-sky, snow-covered scene may not be processed: while not necessarily deemed cloudy, there was not enough contrast to delineate snow-covered land from water/snow/ice grid cells in the band of choice. Overall, due to scenes not being selected by the algorithm at all if not passing Test 1, but being nonetheless partially sampled when cloud products are used, STC estimates of opaque clouds are expected to be somewhat conservative.
3.7. Validation Approach
The cloud-truth at any particular observation time of MODIS/VIIRS within the river reach is unknown. It is also not known a priori which of the cloud masks produces the closest approximation to cloud-truth. Cloud mask inter-comparisons would primarily yield information on their correspondence, rather than truth. While statistics based on rev can be informative and inform on obscuring cloud cover, its validity may be more solidly established via additional information. In this particular study cloud statistics are also linked to more readily verifiable data: that of ice cover maps [
4,
12]. We are able to evaluate whether STC cloud masking is credible not only by inference from cloud statistics from NASA and STC processing, but also from comparison of resulting ice cover maps to ice presence inferred from hydrometric station data. Unlike other available options, this method is practical and can be expected to provide a reasonable estimate of cloud-truth. Specific evaluations may be done on: (1) individual day basis by visual inspections of false color imagery; (2) ice extent time series for longer term evaluations; and (3) inferences based on effective revisit times.
The hydrometric station (USGS 01570500) is located near the center of the field-of-view, at Harrisburg, PA. The station has a record of a daily discharge quality flag (DQF), and we previously showed that it can be reliably used to infer whether the portion of river within the FOV bears ice [
12]. The discharge quality flag is set where it is not possible to accurately report discharge/stage within a given tolerance. Specifically, the DQF is set when the gauge malfunctions, or sufficient river ice is present such that it significantly impacts flow determinations. A look at discharge data for a >10-year period revealed that the flag is rarely raised outside of ice periods. Thus, it can be used to reliably inform on the presence of river ice. It is presumed unlikely that areal ice extent could consistently be related to the setting of the DQF, as it depends on the judgment of a Hydrologist. However, a reasonably strict estimate is that it would be set by when 10% river grids are ice covered (
Table 1).
The validation criteria presented in
Table 1 allow for some inference with respect to the correct typing between cloud and cloud-free conditions, especially in instances where ice has not been observed or could not have formed due to warm temperatures. Missed opaque clouds, especially outside of ice cover periods, would be clearly represented by false ice detections. Generally, false ice detections are not solely attributable to missed clouds that were classified as ice. More problematic than missed clouds are false ice detections due to adjacent-to-river snow cover, when viewed at larger sensor zenith angle (SZA, e.g., more than 40 degrees). Besides footprint growth issues (more so for MODIS than VIIRS [
28]), anisotropic scattering can also be problematic [
15]. Both can combine to result in false positives, and from experience false positives due to snow occur more frequently and are more difficult to address using visible and SWIR bands only.
Using ice extent time series or cloud statistics within the field of view to ascertain whether clouds were missed is more difficult during ice/snow periods since: (1) STC is supposed to ignore some clouds, and estimate whether ice is underneath semitransparent clouds; and (2) not only missed clouds, but also adjacent snow cover may be classified as ice. However, by using three independently generated cloud statistics and ice time series, it is possible to ascertain more confidently about whether obscuring clouds are properly detected: significant ice cover overestimates due to e.g., missed clouds, or poor view geometry would be apparent between the three ice cover time series. Cloud errors would also be expected to show in the cloud statistic. However, using ice time series offers a clearer visual comparison.
4. Results
4.1. Cloud Statistics from NASA Processing
Results of NASA processing for 2014 and 2015 data are shown in
Table 2. A break-down of 2014 and 2015 is not shown as results were similar.
Table 2 data clearly shows that only NASA(Aqua) features a dramatically larger rev (7.0 vs. 3.9 days) the during snow/ice cover period. NASA(VIIRS) features a relatively smaller 2.7 vs. 3.1 days, while NASA(Terra) results remain nearly the same irrespective of snow/ice cover. NASA(Terra) and NASA(Aqua) revisit times outside of snow/ice periods are identical. NASA(VIIRS) has the by far lowest effective revisit times of 2.9 days compared to 4.0 and 5.0 for NASA(Terra) and NASA(Aqua), respectively. This is equivalent to 31 (40%) and 47 (75%) more full river observations (data) compared to NASA(Terra) and NASA(AQUA), respectively. Most of the difference stems from the 160 days considered “snow/ice” period: NASA(VIIRS) has 21 (54%) and 37 (160%) more full river observations (data) for NASA(Terra) and NASA(Aqua), respectively. We also note that NASA(VIIRS) has the most “Obs”, with 219 out of 317 scenes sampled from, versus 170 and 158 for NASA(Terra) and NASA(Aqua), respectively.
4.2. Cloud Statistics from STC Processing
Results of STC processing for 2014 and 2015 data are shown in
Table 3. A break-down of 2014 and 2015 is not shown as results were similar.
Table 3 data show that the effective revisit times for STC(Aqua) and STC(VIIRS) are relatively decreased during “ice”: rev is 3.4 days, compared to 4.5 and 3.9 days during “no ice”, respectively. STC(Terra) has a slightly increased effective revisit time during “ice”, increased from 4.1 to 4.3 days. During “ice”, data obtained by STC(Aqua) and STC(VIIRS) are identical, corresponding to 47 full river observations. Overall, effective revisit times are similar: 4.2, 3.9 and 3.7 days, respectively, for STC(Terra), STC(Aqua) and STC(VIIRS). STC(Terra), STC(Aqua) and STC (VIIRS), respectively, fully observe the river 76, 82 and 87 times within the 317 days. Data sampling (“Obs”) is also similar with river observations stemming from 79, 84 and 92 selected scenes, respectively, for STC(Terra), STC(Aqua) and STC(VIIRS). This corresponds to an overall mean sampling rate of 96% of river grids when a scene is selected for processing by STC.
Image selection of STC processing is compared to that of NASA in
Table 4: A breakdown by year is shown, detailing how much data are obtained from NASA processing when STC accepts or rejects a scene. The data provided in
Table 4 indicate nearly identical results between 2014 and 2015 data. It shows that NASA(Terra), NASA(Aqua) and NASA(VIIRS), respectively, indicate that 73%, 56% and 85% of river grids are cloud free in scenes accepted by STC and that 9%, 7% and 15% of river grids are cloud free in scenes rejected by STC.
Provided that STC processing overall samples 96% of river grids as cloud free, it samples 23%, 40% and 11% more data from imagery than NASA(Terra), NASA(Aqua) and NASA(VIIRS).
4.3. Ice/Cloud Results from STC Processing
An example of STC processing of data for clouds and ice is provided in
Figure 4. The example is for an ongoing ice jam, as identified per the National Weather Service, on 9 January 2014.
The leftmost column shows an overlay of NASA cloud processing results on top of false color imagery. MODIS data are cloudier than VIIRS data. The second column from the left provides the false color imagery, showing that river (black), ice (yellow-green), snow (yellow-green) and land (magenta) can be reasonably well identified by visual inspection. The third column is the main product from which the time series and cloud statistics are developed, and shows the result of STC processing: ice vs. cloud confidence in three levels (“Low”, “Mod”, and “High”), a warning flag for large sensor zenith angle (SZA, >40°, only marked at “ice” grids), water and clouds. Terra data were obtained at poor SZA, and as result STC classified some river grids as clouds in the upper portion of the river. The fourth column shows the running composite. It is only shown here to more clearly indicate the ice/cloud confidence levels, which are further subset according to reflectance magnitude, since the running composite maps do not overlay information on clouds or SZA. The figure shows the cloud-clearing ability of STC vs. NASA processing. All classified ice maps are in good agreement with visual inspection. Ice/water delineations are nearly identical. Ice is classified with similar confidence between the platforms.
Figure 5 shows the river ice time series for 2014 and 2015, indicating that: (1) ice detection is consistent with that inferred from the hydrometric station; and (2) generally, similar results are obtained irrespective of platform or bands used. Based on the consistency of ice extent time series, a high correlation is evident. The correlations of “ice extent” for the different platform combinations were computed based on >50 common observations from STC for each possible combination and range from 95%–99%. Mean absolute differences of ice extent based on these common observations range from 3%–7%. The accumulated freezing degree-day (AFDD) curve is provided only for contextual, qualitative information regarding whether freezing or thawing air-temperatures persist: it should be interpreted here only in terms of that its increase (decrease) represents below-zero (above-zero) temperatures [
12,
33,
34].
Table 5 shows the reach-wide ice detection error metrics, for probability of detection of ice (POD), proportion correct (PC), false detections (FD) and non-detections (ND)—after filtering out a gauge malfunction (2015, Days 15 and 16) since it could not be reasonably used as indicator of ice: it was too early in the season and temperatures were too warm, as evident from the AFDD curve. Error metrics indicate good performance overall, with 95%–100% proportion correct.
4.4. On Collection 5 vs. Collection 6 Differences
Collection 6 (C6) NASA(MODIS) features a higher resolution land/water mask (250 m vs. 1 km in C5). Although NASA(MODIS) does not feature a dedicated inland water processing path, the new land/water mask may still lead to improved cloud masking over river grids. For this reason, and to generally assess how STC and NASA processing are impacted by re-processed data, we include a brief comparison between C5 and C6. Results of a direct comparison of NASA(Aqua) data are provided in
Table 6 and show that there are few, if any, differences between C6 and C5 data with respect to cloud masking over river. On average for MODIS (Terra and Aqua) data, only 10 grids per day (2% of river grids) changed between cloudy and clear. Differences amount to one (zero) fewer full river observations over the 317 days for MODIS-Terra (MODIS-Aqua). Differences in excess of 30% were noted on four occasions, three of which trended towards fewer clear-sky river grids.
Impacts of Collection 6 data on STC were also investigated, but, due to small differences, are not detailed here. The contrast based algorithm is not sensitive to differences between C5 and C6 processing: only on three occasions (one for MODIS-Terra, two for MODIS-Aqua) did the slightly different reflectance magnitudes result in selection (one) or rejection (two) of originally near-threshold scenes. Outside of these scenes, only 0.1 out of the 402 river grids/scene was classified differently in light of collection 6 data. Overall, we find that C5 and C6 data are virtually identical to each other over river grids thus C5 and C6 data may be used interchangeably, and that conclusions drawn from investigation of either should be applicable to both.
5. Discussion
Cloud statistics obtained from NASA processing clearly indicate that there are significant differences with respect to cloud identification. Considering the differences between detector capabilities, algorithm and ancillary data sources used for cloud masking, differences are to be expected. Most relevant to river ice monitoring is that (1) NASA(Aqua) appears to strongly overestimate clouds during snow/ice presence, (2) NASA(MODIS) regularly may identify apparently cloud-free river ice as clouds and (3) NASA(VIIRS) indicates substantially fewer clouds than NASA(MODIS).
5.1. NASA(MODIS) Snow/Ice Underdetections
Attribution is difficult, especially in light of the focus of this work which is an evaluation of an alternate approach to better observe river ice. We point to some of the differences noted in using band 7 (2.1 μm) NDSI, namely that (a) it provides results of up to 5% larger snow extent; (b) the NDSI threshold for snow is substantially larger for NASA(Aqua) than NASA(Terra) (>0.54 vs. >0.40); (c) that the Normalized Difference Vegetation Index (NDVI) portion of the snow decision region has been disabled for NASA(Aqua) due to snow errors and (d) that differences are more apparent in low illumination and dense forests. The NDVI portion of the snow decision region is used to improve snow classification underneath canopies, and this consideration allows for lower-than NDSI threshold data to also be classified as snow.
Most likely, the difference stems from use of a globally calibrated threshold that is not ideal for this region, nor ice detection. Based on the thresholds presented in this and prior work, we can provide an illustrative sample calculation. Our own threshold for snow detection was found to correctly identify 21 out of 22 scenes as having snow and used values of 0.1 for band 7. We further note that a fully snow covered, cloud-free scene may have a mean reflectance as low as 0.05—the point at which contrast-based approach can fail. The values of 0.1 (0.05) may be substituted into the NDSI test, to solve for the required magnitude of visible reflectance. The result yields a required band 4 (0.55 μm) reflectance of at least 0.34 (0.17). Noting that band 7 reflectance over snow covered land is usually much closer to 0.1 than 0.05 in our histograms and that many (including snow covered) land grids may not have a band 4 reflectance in excess of 0.34 (
Figure 6 in [
12]), it is not surprising that the band 7 based NDSI test may fail in this setting and that snow and ice can instead be identified as clouds. It is important to note that the NDSI tests use at-satellite reflectance data, which are different from our dataset because the latter underwent additional processing via an atmospheric correction algorithm [
35].
A representative example is provided in
Figure 6, showing a snow covered scene. Even under full snow cover, multiple darker appearing features are apparent. More easily identified are those due to terrain (hills, ridges). Less obvious are those patches near the center of the field of view, where river ice is also usually found (
Figure 6a). These patches are consistent with snow masked by vegetation (or other) canopy or frozen water. The cloud mask overlay (
Figure 6b) shows that the relatively darker patches tend to be identified as clouds, while the brightest grids are “clear”. This is consistent with the general sample computation, indicating that the threshold for NDSI may not be readily met in the vicinity of where river ice is found. It also is not helpful that NASA(Aqua) does not allow for a potentially lower NDSI threshold on basis of vegetation masking of snow, as its NDSI/NDVI decision region is disabled. A second factor is that snow-free or water-ice mixtures tend to have a significantly smaller band 4 reflectance than snow.
With respect to river ice being detected as clouds, we note that ice cover may occur without substantial snow cover. When determining a processing path, NASA(MODIS) refers to the background flag, set by a coarse resolution product (25 km by 25 km). If the grid is only partially snow covered the flag may not be set as snow, no NDSI test is done, and high reflectance grids would be classified as clouds [
11].
5.2. NASA(VIIRS) Bias
We find that NASA(VIIRS) detects substantially fewer clouds than NASA(MODIS). Key et al. [
28] provide some information on NASA(VIIRS), but their results are reported on basis of only sampling data from “confidently clear” grids. They note substantial errors due to extensive cloud misses and snow overestimates when data are sampled from “probably clear” and “probably cloudy” grids. Even with the strictest cloud mask they noted obvious cloud misses occurring in the midst of large cloud masses, resulting in snow overestimates.
Our own observations and data corroborate the observations made by Key et al. [
28]. Our results (
Table 4) show that NASA(VIIRS) samples exceedingly from scenes rejected by STC processing—at about twice the rate than NASA(MODIS). STC(VIIRS) selects 92 out of 317 days for processing. At a sampling rate of 15% from rejected scenes, there would be nearly 34 additional full river observations, whereas NASA(Terra) and NASA(VIIRS) would only obtain 19 and 16, respectively. NASA(VIIRS) tendency to sample data from cloudy scenes is also apparent from its high value for “Obs” (
Table 3): NASA(VIIRS) samples the river on 30%–40% more days compared to NASA(MODIS). Correspondingly, NASA(VIIRS) also has almost proportionally more data (~40% compared to NASA(Terra)).
The remaining factor providing NASA(VIIRS) with a lower effective revisit time can be attributed to it identifying 12% and 29% more river grids as clear-sky (STC accepted scenes), compared to NASA(Terra) and NASA(Aqua) (
Table 4).
Figure 7 presents qualitative examples on how NASA(VIIRS) obtains additional river observations where NASA(MODIS) or STC processing would not allow for a substantial amount of river data to be sampled.
5.3. Diurnal Cloud Cover Discrepancy
NASA processing provided inconclusive data on whether there are fewer clouds in the morning or afternoon. According to Ackerman et al. [
31], NASA(Aqua) appears to observe 5% more clouds than NASA(Terra). Our results in
Table 2 also indicate a 5% mean absolute cloud cover difference corresponding to the difference in “rev” for NASA(Terra) (4.0 days, 0.75 absolute cloud cover) versus NASA(Aqua) (rev = 5.0 days, 0.80 absolute cloud cover). However, NASA(VIIRS) data suggested a much smaller absolute cloud cover of 0.65, but as shown in
Section 5.2. NASA(VIIRS) does not provide reliable information. The reference provided by STC processing suggests that afternoon observations may have 2%–3% less absolute cloud cover. Much of the overall difference stems from the “ice” period, during which afternoon observations suggest an absolute cloud cover of only 0.71 compared to 0.77 for STC(Terra). The precise reason for this difference, mainly occurring only during snow/ice cover period, is unclear. Although STC(VIIRS) uses the 1.6 μm band instead of 2.1 μm for STC (MODIS), both STC(VIIRS) and STC(Aqua) indicate the same. Concluding, it is more likely than not that afternoons are slightly less cloud obstructed, contradicting results obtained from NASA processing.
5.4. Suitability of NASA Processing for River Ice Mapping
Each of the NASA processing was shown to yield substantially different results. NASA(Aqua) is clearly the least well suited for river ice monitoring, as it most frequently identifies ice as clouds, and only samples 56% (
Table 4) of river grids in mostly clear scenes. On basis of results in
Table 2 and
Table 3, NASA(Terra) statistics approximate that of STC more closely than NASA(VIIRS). However, since NASA(MODIS) has a general and somewhat persistent tendency to identify ice as clouds, it is less suitable than indicated by cloud statistics alone: although, on average it may sample 73% of river grids, it may mask the data being sought. NASA(VIIRS) provides the smallest effective revisit times, but we were able to show that much of it is due to clouds that were not identified by NASA(VIIRS). STC(VIIRS) provides good ice and cloud identification (
Figure 2 and
Figure 3,
Table 5), and its results suggest that its effective revisit time should be closer to 3.7 days (0.73 absolute cloud cover) than 2.9 days (0.65 absolute cloud cover) for NASA(VIIRS). Nonetheless, when STC accepted a scene for processing, NASA(VIIRS) sampling most closely approximated STC results: not only because it sampled 85% of the river compared to 96% for STC, but also because there were no obvious, systematic misidentifications of ice as clouds. Owing to these properties NASA(VIIRS) is more suitable for river ice monitoring than NASA(MODIS), but care should to be taken to screen for undetected clouds. These would be more difficult to notice within an ice product while the river already bears ice: it may appear as correctly identified ice, surrounded by clouds (e.g.,
Figure 7a,b)—although all data should have been classified as cloud.
5.5. Improved River Ice Mapping through Improved Cloud Masking
Figure 2 and
Figure 3 and
Table 3,
Table 4 and
Table 5 establish the capabilities of STC processing, that its results are credible and that it is more likely to approximate a cloud truth than any of the NASA processing.
Table 3 details that STC processing provided consistent cloud identification across several platforms.
Table 4 shows that STC is consistent with the NASA processing results: the fact that STC identified cloud-free scenes correctly is generally corroborated by the fact that 56% (NASA(Aqua)), 73% (NASA(Terra)) and 85% (NASA(VIIRS)) of river grids were also “clear”. Scenes identified as cloudy by STC were likewise generally identified as cloudy by NASA processing. Visual inspection of
Figure 4 shows that ice cover is similarly and accurately identified, irrespective of platform or bands used. Thus, through consistency of data and low error rates (
Table 3,
Table 4 and
Table 5), we infer that obstructing clouds must have been accurately identified by STC.
The overall expected effective revisit time appears to be near 4 days for each instrument. While this is approximately comparable to NASA(MODIS), worse than NASA(VIIIRS) and better than NASA(Aqua), the resulting data provide more reliable information and are quality checked. Furthermore, STC processing during “ice” has an effective revisit time of up to 3.4 days. Ice maps are readily composited between the platforms, allowing for a more continuous data record via cloud-clearing by using data obtained at different times or in more favorable physical settings (e.g., better view geometry). Specifically, in
Table 3 for STC processing total “Obs” are listed respectively as 84, 79 and 92 for NASA(Aqua), NASA(Terra) and NASA(VIIRS). However, 116 individual days were selected for processing by STC between them. At 96% grids sampled in each of 116 days, out of 317, it is possible to achieve an effective revisit time of 2.8 days. Likewise, it is possible to further reduce errors by substituting the most accurate ice map of either platform. For the investigated time range it is possible to remove two false detections in STC(Terra) and STC(VIIRS) by substitution of STC(Aqua) data for those days. One non-detection for STC(VIIRS) can also be removed by substitution of STC(Terra) data.
6. Conclusions
The main goal of this work was to elucidate on cloud mask performance and suitability over ice-bearing rivers and on improving river ice observations. The goal was accomplished via a series of steps. First, cloud statistics based on binary cloud masks developed from the unobstructed field of view flags from the Moderate Resolution Imaging Spectroradiometer (MODIS) and the Visible Infrared Imaging Radiometer Suite (VIIRS) cloud products (MOD35, MYD35 and VCM) were developed. In all cases, data were sampled only from “confidently clear” and “probably clear” grids. Results show a large possible range for absolute cloud cover while ice and snow are found: 0.63, 0.76 and 0.84, respectively, for NASA(VIIRS), NASA(Terra) and NASA(Aqua). These cloud amounts correspond to one full river observation every 2.7, 4.1 and 7.0 days (effective revisit time), during the ice cover period. We showed that, when data are averaged, these difference become less apparent for MODIS, and obtained effective revisit times matching a 0.05 absolute cloud difference extrapolated from Ackerman et al. [
31].
A contrast-based algorithm (STC) operating on MODIS and VIIRS surface reflectance data was used to provide reference cloud statistics. The validity of STC results are established on basis of: (1) providing accurate ice detection relative to that inferred from an in-situ, reach-representing, discharge quality flag; and (2) consistency of cloud statistics and resulting river ice time series to show STC has few, if any, serious cloud detection errors. Cloud statistics obtained from STC processing provided more consistent absolute cloud cover values while ice and snow are found: 0.71, 0.77 and 0.71 for STC (VIIRS), STC(Terra) and STC(Aqua) data, respectively. STC processing suggests that afternoon data are obtained under less cloudy conditions: MODIS-Aqua and the VIIRS data suggested absolute cloud amounts of 0.74 and 0.73 overall, compared to the MODIS-Terra value of 0.76. This conclusion contradicts results obtained from the original MODIS cloud masks, which indicated that the afternoon has 0.05 greater absolute cloud coverage.
We also provided insight and attribution into the reasons for various cloud biases. Our qualitative analysis suggests that differences between NASA(Aqua) and NASA(Terra) is probably due to the modified Normalized Difference Snow Index (NDSI) test that NASA(Aqua) uses. In addition, NDSI test may be more likely to fail for ice or ice/water mixed grids owing to relatively smaller 0.55 μm reflectance compared to snow. As result, these locations would be more frequently mapped as clouds. We also noted a tendency of both MODIS instruments to misidentify ice as clouds, especially when there is no or only partial snow cover, probably due to a scale-related inaccuracy [
11]. The substantially smaller absolute cloud cover determined from NASA(VIIRS) was attributed to: (1) its identification of optically thick clouds as “clear” data; and (2) it generally sampling more data from apparently clear scenes as established from both visual inspection and STC processing. Differences and impacts of using Collection 6 (C6) vs. Collection 5 (C5) data in NASA and STC processing were also investigated. Although C6 features various algorithm tweaks and an improved land/water mask, we find that there are no significant differences for river monitoring, and conclude that C6 and C5 data may be used interchangeably.
We also noted how the improved cloud screening (e.g., that provided from STC) may lead to greatly improved river ice mapping capabilities. The improvement is mainly due the additional quality controls providing more reliable data than provided by NASA processing, and reduction of biases present in all of the NASA processing. Furthermore, during ice periods, STC provides lower revisit times than all but NASA(VIIRS) processing. Since the ice and cloud mapping is consistent and done over an identical river mask, the ice maps may be used interchangeably. We show that such cross-platform compositing would allow for an improved effective revisit time of up to 2.8 days. With cross-platform compositing it is also possible to further improve data quality by building ice cover time series based on the best quality observations.
Beyond its utility to river ice monitoring, ice data generated by this approach may be a useful input for ice jam modeling. Statistical modeling approaches have been mainly limited to using AFDD (and its few day difference) and discharge to model river ice dynamics such as break-up or freeze-up timing [
36,
37,
38]. With ice data (reflectance as proxy) in addition to AFDD (air temperatures as proxy), it may be possible to build better models. MODIS has been operational since 2000 and a more complete series of river observations may lead to more accurate determination of break-up timing, freeze-up or other work related to river ice processes and climatology [
18,
39].
Likewise, it would be of interest to explore the cloud clearing potential of GOES-R, with respect to river ice observations. The GOES-R Advanced Baseline Imager (ABI) is compatible with the STC approach, as it features an appropriate shortwave infrared and 500 m resolution visible band. GOES-R should provide excellent river ice monitoring capability for this stretch of the Susquehanna River.