1. Introduction
Timely and continuous monitoring of the extent of surface water bodies is very important in the management of the living environment. On the one hand, during periods when precipitation is significantly lower than the average one, the consequent shortage of water propagates through the hydrological system and may cause drought in the different segments of the hydrological system itself, including surface water [
1], whose spatial distribution represents therefore a fundamental piece of information for drought management. On the other hand, when heavy precipitation causes floods, near-real time data about floodwater extent and duration are essential to support rescue and damage recovery decisions and to facilitate rapid assessment of property loss and damage [
2].
Several past studies (e.g., [
3,
4,
5]) demonstrated that synthetic aperture radar (SAR) systems are suitable tools for surface water mapping. They combine a high spatial resolution (from hundreds of meters to approximately 1 m, depending on the SAR acquisition mode) with the capability, typical of microwaves, to provide data during day and night and even in the presence of cloud cover. Moreover, there is a clear physical mechanism for radar scattering from water, that is, specular reflection [
6]. Unless strong wind is blowing above the water surface, the latter has a very low roughness, and most of the incident radar energy is scattered towards the specular direction. Therefore, a water surface is generally characterized by a low backscattering and appears dark in a SAR image. Conversely, the rough non-flooded terrain exhibits a higher backscattering and a brighter image tone [
7], because it scatters the radar signal in many different directions, included the backscattering one.
Specular reflection is not always the prevailing effect of the radar scattering from water. In urban and agricultural flooded areas (if plants emerge from water), a dihedral effect involving water surface and vertical structures such as stems, shrubs, or walls can take place, producing an increase of the backscattering. However, several unknowns (e.g., structure and geometry for vegetation, or height and orientation of buildings for urban areas) influence the radar response from these targets [
8]. Consequently, although new insights were provided on detecting inundations in agricultural areas [
9,
10,
11,
12,
13] and urban settlements [
8,
14,
15,
16,
17], this task is still very challenging. Not only missed detections (i.e., omission errors), but also commission errors may occur when mapping surface water. In areas with complex topography, the shadowing effect strongly influences the radiometry of SAR data and produces a low backscatter that can be confused with the backscatter from water bodies. Low backscatter values are measured by SAR even in the presence of heavy precipitation (especially when dealing with X-band data) and wet snow [
18,
19,
20,
21]. Furthermore, very smooth surfaces like asphalt (roads, airports, parking lots) and/or very dry areas like deserts appear dark in SAR imagery and may be misclassified as water.
Despite the challenges previously discussed, many processing algorithms have been proposed for mapping permanent water and floodwater using SAR data in a fully automated way (e.g., [
11,
17,
22,
23,
24,
25,
26]). Therefore, this application can be considered mature for operational/pre-operational implementation, at least for what concerns open water. Currently, the Copernicus Emergency Management Service (CEMS) uses satellite imagery to rapidly provide geospatial information about natural disasters. Regarding floods, SAR data are very often used by CEMS for mapping purposes to overcome the limits of optical data related to the presence of clouds and the impossibility to acquire data during nighttime (e.g., [
27]). CEMS is available 24 h a day, 365 days-a-year, but is an on-demand service, which is triggered on request by authorized users. Hence, an automatic tool enabling continuous and systematic monitoring of water surfaces by routinely producing medium-high resolution maps of floodwater/permanent water can complement the rapid mapping component of CEMS for what concerns flood management. In particular, it can cope with the timeliness in the availability of maps because a continuous service does not require any user activation. Moreover, it can contribute to the effectiveness of CEMS activation through a better identification of the area of interest, when images with a very high spatial resolution are required for an accurate delineation of the flood extent.
The regular acquisitions of Sentinel-1 (S1) images, performed in the framework of the Copernicus program, guarantee a continuous streaming of C-band SAR data useful to systematically produce maps of water surfaces at medium-high spatial resolution (S1 data over land have a single-look resolution of 5 × 20 m
2,
Section 2.1). The availability of S1 data represents therefore the driver for the implementation of the aforementioned tool. A systematic near real-time (NRT) surface water mapping service has been recently requested by the Italian Department of Civil Protection (DCP) in the framework of its competences assigned by law.
This paper presents an automatic processor, which does not need any supervision, or user intervention, that performs a systematic daily mapping of water surfaces (both floodwater and permanent water bodies) using S1 data. The acronym AUTOWADE (AUTOmatic Water Areas DEtector) is used to denote this tool. AUTOWADE has been designed in the framework of the agreement between DCP and CIMA Research Foundation, which is one of the DCP competence centers. Therefore, it has been conceived to work on a national (Italian) scale. The implementation of a service for fast mapping of floods and inland waters using S1 data requires the design of a chain formed by different components. The main component is obviously water detection, but other components must be included, like the automatic downloading of S1 images and their preprocessing to derive geocoded and calibrated data, the generation of reference and exclusion masks, and the delivery of the data to the end user. All these components must be sequentially executed in a fully automated way. Despite the large number of studies dealing with NRT water mapping, the problem of the creation of a tool that automates the various steps needed to set up a daily service based on SAR data was rarely tackled in past studies. A fully automated TerraSAR-X based flood service was developed in [
25] and adapted to S1 data in [
28].
Although the present release of AUTOWADE focuses on open water (i.e., open floodwater and permanent water bodies), vegetated areas close to open water that are likely inundated as well are searched for, despite the criticalities previously underlined; the objective is to, at least partially, fill the gaps typically present in flood maps due to undetected flooded vegetation. Conversely, urban areas are not considered because the related methodologies require interferometric data (e.g., [
8,
15,
16]), whose processing requires the use of S1 orbit ephemerides data, available 20 days after the data acquisition.
The algorithms implemented in AUTOWADE use different image processing techniques. Flooded vegetation is detected through a combined use of co- and cross-polarized S1 data performed taking advantage of the fuzzy logic. Open water is mapped using a new methodology, denoted as buffer-from-edge (BFE) approach, which combines clustering, edge filtering, automatic thresholding, and region growing; it is applied to co-polarized data only. At present, a combined use of different image processing techniques such as image tiling [
29], used to identify a threshold separating open water from the surrounding land, and region growing is well-established for water mapping from SAR (e.g., [
18,
24,
26,
30,
31]). However, AUTOWADE works on the national scale and, for each S1 pass over Italy, it must process a number of image slices (
Section 2.1) between 2 and 7, depending on the orbit. Hence, AUTOWADE does not use the approach proposed in [
29] to determine the threshold, mainly because image tiling might be time-consuming. Through the BFE approach, AUTOWADE tackles the problem in a new way, taking advantage of the availability of a reference water mask to directly analyze portions of a SAR image that likely contain comparable amounts of water and non-water pixels. The open water mapping algorithm implemented in AUTOWADE has been quantitatively validated considering maps produced by the CEMS in the period 2019–2021 as a benchmark; for the method to detect flooded vegetation close to open water, a qualitative comparison with optical data and aerial photos has been carried out.
Section 2 introduces the data used by AUTOWADE, as well as the data used to validate the AUTOWADE-derived product.
Section 3 describes in detail the different modules that make up AUTOWADE, with particular emphasis on the open water mapping performed through the BFE approach.
Section 4 presents the results that are discussed in
Section 5, while
Section 6 draws the main conclusions.
5. Discussion
This paper presents the AUTOWADE tool, which has been conceived to provide the Italian DCP with a product that daily monitors floods and inland waters in a fully automatic way using S1 data. The service implemented by AUTOWADE is tested in pre-operational mode since early 2020 and has been extensively used to monitor the location and the extent of the floods that hit Italy in autumn 2020 (e.g., the events considered in
Figure 13 and
Figure 14). AUTOWADE utilizes not only S1 data, but also S2 ones generated by a processor described in a previous paper [
34], as well as ancillary data provided by the Copernicus Land Monitoring Service.
With respect to a previous paper dealing with an automated S1-based flood service [
28], AUTOWADE monitors not only floods, but also permanent water bodies and searches for flooded vegetation in the gaps present in the open floodwater map. Moreover, it is tested on a quite large number of case studies.
With respect to previous studies on flood mapping from SAR data, this paper considers all the aspects related to the provision of an automated, national flood/water monitoring product to an end-user, such as the automatic procurement of satellite data and the delivery of the product to the end-user. As for the algorithmic aspect, a new method is proposed, which does not use an image tile-based approach to determine a threshold that separates water from non-water, as commonly done in the literature (see the Introduction). The BFE approach implemented in AUTOWADE takes advantage of the availability of a reference water mask to identify the edges separating land from open water, also using a clustering method as the ISODATA. Focusing directly on the portion of a SAR image where standing water is present, instead of searching for it by iteratively diving the image in tiles of different sizes, represents a simple method that improves computational efficiency. From this point of view, the application of a despeckling filter, performed in the preprocessing step, is the task with the higher computational burden, as pointed out also in [
23], while neither the ISODATA, nor the histogram fitting are very time-consuming.
The combined use of an edge-filter and an automatic thresholding method was performed also in [
51], where the filter was applied directly on a spectral index, derived from multispectral data, sensitive to the presence of water (Modified Normalized Difference Water Index [
52]). In AUTOWADE, considering that SAR images are affected by speckle noise (despite of the application of a despeckling filter, some noise still remains in the images), it has been chosen to preliminarily use a clustering algorithm to make the data more immune to noise, as done in [
53].
Regarding accuracy, for permanent water and open floodwater, it has been evaluated considering a set of test cases presenting large differences in terms of total flooded surface, ranging from 280 ha to 9460 ha (
Table 2, last column and
Table 8, 2nd column). Even the ratio between the flooded surface and the size of the AOI presents large differences ranging from less than 1% (EMSR411) to about 7% (EMSR429). The maximum discrepancy between CEMS-derived and AUTOWADE-derived maps in terms of total flooded surface is 858 ha (EMSR417; AUTOWADE: 8602 ha, CEMS: 9460 ha, see
Table 8). However, in terms of percentage with respect to the size of the AOI, this discrepancy corresponds only to 0.6% (AUTOWADE: 5.8%, CEMS: 6.4%). The maximum percentage difference is 2.2% (EMSR496; AUTOWADE: 5.9%, CEMS: 3.7%).
Fairly good results have been obtained for all the case studies, especially considering κ and the F1-score. Almost equal values of the F1-score (0.90–0.91) have been obtained for the biggest flood (EMSR417) and the smallest ones (EMSR359-411). The same applies considering the F1-score computed as the average of the F1-scores of the two water classes only (0.85–0.87).
As for the errors of omission and commission, even better results were obtained in the literature (e.g., [
23,
31,
54]), but it must be considered that in this study, 3 classes are considered, while in previous studies, only non-water and floodwater were generally distinguished. It has been found that the major source of disagreement between AUTOWADE-derived maps and reference maps is the misclassification between permanent water and floodwater. On the one hand, this happens because CEMS does not use a pre-flood image, while AUTOWADE labels as permanent water the pixels classified as open water in the flood image and in the pre-flood one. On the other hand, the misclassification is also due to differences between the hydrography data included in the CEMS products and the reference water mask. This has nothing to do with the accuracy of the AUTOWADE-derived maps.
For what concerns permanent water, the high value of ε
om obtained for the EMSR492 case study (30.5%,
Table 6) is mostly due to a small water course whose delineation with S1 was partially hampered by the presence of many mixed land-water pixels (data with higher spatial resolution would have been necessary for a more precise delineation). The high value of ε
om obtained for the EMSR501 case study (32%,
Table 6) is mostly due to a water body (Drin River) whose detection was missed because of unexpected high values of backscattering likely due to the presence of wind blowing above the water surface.
The extent of the objects labelled as open floodwater is the major cause of misclassification between open floodwater and non-water. An example of this kind of misclassification is shown in
Figure 15 for the Springfield Clonara test site (EMSR429). For the class of open floodwater, ε
comm and ε
om are equal to about 20%, in this case. Looking at
Figure 15, it can be seen that, although there is a substantial agreement between CEMS and AUTOWADE classifications, the size of the floodwater objects is slightly different. This is due to the differences in the flood detection methods; in particular, the BFE approach uses RG and uncertainties in the determination of the tolerance of the RG may imply uncertainties in the extent of the floodwater objects. The red pixels in the lower left part of
Figure 15 correspond to omission errors due to small differences between the
values measured by S1 under non-flooded and flooded conditions, considering that the BFE approach applies change detection. Note that for this test case commission and omission errors are of the same order of magnitude (around 20%); thus, the difference between the total flooded surface estimated by CEMS and that estimated by AUTOWADE is quite small (7 ha,
Table 8).
Thanks to the availability of some aerial photos, it has been verified that the quite high value of ε
comm obtained for the EMSR496 case study (24.4%, class of open floodwater), is due to the presence of ponding water not detected by CEMS. This can be inferred by looking at
Figure 16 that compares the maps with two photos of the area affected by the flood. In the areas highlighted by the ellipses, CEMS seems to underestimate water, while in the AUTOWADE-derived map, the underestimation is significantly reduced (especially in the magenta ellipse). The photos were taken on 26 January 2021 at 11:00 UTC, i.e., about 17 h before the S1 acquisition, but, reportedly, water did not recede. Hence, the disagreement with respect to the reference data in terms of ratio between the extent of the flooded area and the size of the AOI highlighted above for this case study (2.2%) is probably not due to an overestimation of the flooded area done by AUTOWADE.
It was not possible to robustly validate the method designed to map flooded vegetation, because CEMS products do not map this target when they are derived from SAR data. Nonetheless, by comparing the AUTOWADE-derived map with that derived from a S2 image acquired on the same day, it has been found that the image objects marked as flooded vegetation allowed us to fill some gaps present in the floodwater map that are not realistic from a hydrological point of view (Vercelli test case,
Figure 13f), because the area affected by the flood has a quite uniform surface elevation. Several gaps are still present in the map and it is likely that flooded vegetation is underestimated. However, it must be considered that, when dense vegetation is present, SAR is not sensitive to the presence of floodwater beneath the canopy (we remind again that pixels with
> 0.7 are masked by AUTOWADE). Moreover, double bounce is the prevailing effect caused by flooded vegetation for stem-dominated vegetation, while, for leaf-dominated vegetation, volume scattering generally prevails [
12] and surface water detection is complicated. Even in
Figure 16, the presence of flooded vegetation is evident (red ellipse in panel d); while this target is not detected by the CEMS-derived map, in the AUTOWADE derived map the presence of gaps is reduced, and the area highlighted by the red ellipses is correctly labelled as flooded vegetation.
Flooded vegetation mapping builds on the results of a previous study regarding the combined use of VV and VH polarizations [
47] (confirmed in [
11,
48]), based on the consideration that the dihedral effect, generated by an horizontal surface (water) and a vertical one (plants’ stems), keeps the same polarization throughout the radar scattering path. Note that it has been demonstrated that in the presence of slanted vegetation, the double bounce scattering also generates a cross-polarized signal [
55]. However, plants having a slanted root structure like saltwater mangroves are typical of tropical areas rather than temperate ones like Italy.
The major limitation of AUTOWADE and of the BFE method implemented in AUTOWADE to map open water is related to the processing of a S1 frame where big water bodies are missing, and a reference water mask cannot be derived. In this case, the MCDWA cannot be created from the CDWA. This problem can be at least partially overcome using the edge between the original CDWA and the other areas to identify, through the buffer zones created around it, a subset of image pixels characterized by a bimodal histogram (as done in [
35]). Although AUTOWADE has been conceived to work on a national (Italian) scale, the results of the validation (
Section 4.3) demonstrate that the BFE can be applied in other European countries. In principle, nothing prevents a user from applying the BFE method outside Europe, but the reference mask derived from the WAW product is currently available only for Europe.
The other constraints are related to the SAR ability to detect water in some landscapes, like areas of high relief or dense vegetation and to the possibility to confuse some targets like melting snow, frozen soil, or desert areas with water surfaces. These constraints are common to any automatic algorithm that does not use any amount of supervision. However, the presence of water is very unlikely in steep slope areas. Moreover, melting snow is mapped using ancillary S2 data if cloud cover does not hamper snow detection, whereas desert areas are not present in Italy.
The use of a buffer zone of fixed buffering distance (1 km), done to classify pixels belonging only to the TDWA/TDFA and not to the CDWA/CDFA (
Section 3.2.2 and
Section 3.2.3), can be considered as a critical aspect, because the choice of 1 km, done based on our previous experience with SAR images of inundations, is subjective and may be questioned. However, this situation generally regards a few pixels (often forming objects smaller than the minimum mapping unit), as demonstrated in
Figure 12, and does not have a significant impact on the final map.
6. Conclusions
The AUTOWADE tool designed for a fully automated service of floodwater/permanent water mapping using Sentinel-1 data has been presented. Although several studies available in the literature dealt with the problem of flood mapping using SAR data, the creation of a processing chain that automates the various steps needed to set up an operational service was rarely tackled. These steps include not only the processing of the SAR images, but also the data procurement and the delivery of the maps to the end-user.
The service implemented by AUTOWADE daily generates a map of floodwater/permanent water on a national scale (Italy) with a spatial resolution of 20 × 20 m2 in order to provide the Italian Department of Civil Protection with a systematic detection and monitoring of any event occurring in Italy without requiring any activation. For a given location, the average revisit frequency of the maps is 2–3 days (considering both ascending and descending orbits and orbit overlap). AUTOWADE has been tested in fully pre-operational mode since early 2020. Its daily products are automatically delivered to the Italian Department of Civil Protection and can be visualized through a web portal.
AUTOWADE includes an algorithm to map open water, based on the new buffer-form-edge (BFE) approach. The BFE approach combines different techniques such as clustering, edge filtering, thresholding, and region growing. Nonetheless, it is quite simple in computation and effective in identifying bare/scarcely vegetated surfaces covered by water. It is therefore suitable for real-time applications, especially if a large amount of SAR data has to be processed, as necessary for a service working on a national scale. The comparison with maps produced by the Copernicus Emergency Management Service has shown that the BFE approach performs quite well. Moreover, vegetated areas adjacent to those classified as flooded are analyzed in order to fill a portion of the gaps present in the maps likely due to undetected flooded vegetation.
Future work will concern the inclusion in the AUTOWADE-derived product of an evaluation of the classification uncertainty. An uncertainty map complementing the water map will be made available to the end user. Moreover, future releases of AUTOWADE will take advantage of more accurate ancillary data, such as the Copernicus DEM and a refined land cover map obtained by complementing the Corine Land Cover product with data provided by the Global Human Settlement Layer to better identify urban areas.