Exploring Contrastive Representation for Weakly-Supervised Glacial Lake Extraction
Abstract
:1. Introduction
- We proposed a simple yet effective glacial lake extraction model, SimGL, which effectively learns lake representations from unlabeled RS images via contrastive learning in the training stage.
- We further introduced the NDWI map into our model to provide location cues of the glacial lakes and proposed a location loss to encourage the segmentations to coincide with the true glacial lake boundaries.
- We evaluated our model SimGL using four metrics and compared the segmentation performance with other glacial lake mapping methods on the Landsat-8 imagery. The results demonstrate that our model SimGL surpasses other unsupervised methods and narrows the performance difference with supervised DL methods.
2. Methodology
2.1. Preliminaries
2.2. Weakly Location Cues from NDWI Map
2.3. Contrastive Semantic Segmentation
2.4. Image Augmentation
- (1)
- Color jitter. We use color jitter with {brightness, contrast, saturation, hue} strength of {0.4, 0.4, 0.4, 0.2} for the RGB bands of the RS images as the hue is only well-defined for the RGB data.
- (2)
- Gray scaling. We use gray scaling to remove the color information and represent each pixel only by its intensity.
- (3)
- Flipping. We randomly flip an image along with a horizontal or vertical location.
- (4)
- Rotating. We randomly rotate an image by an angle in the set of {90°, 180°, 270°}.
- (5)
- Blurring. The images are blurred with the Gaussian kernel; here, the kernel size is 3 × 3 and the other parameters maintain the default values.
- (6)
- Random area erasing. We randomly masked some pixels (less than 1% image size) with 0 to erase the spectral information.
- (7)
- Noise addition. We add the Gaussian noise to the image.
2.5. Detailed Network Architecture
- Encoder and decoder: The encoder and decoder are the same as the structures in the U-net. We use the encoder to capture the feature maps at different scales from the input image and the decoder to reconstruct the segmentation results of the lakes from the feature maps. In our model, each selected feature map is from the results before the down-sampling operation.
- Projector: The projector has three fully-connected (fc) layers and batch normalization (BN) layers, and the first two layers are activated by ReLU. The output of the projector is a 2048-d vector.
- Predictor: The predictor has two fc layers. The first is connected to a BN and a ReLU layer, and the last is without any other operations. The output dimension is 2048-d, while in hidden layers, it was set as 512-d.
3. Experiment Results
3.1. Dataset and Evaluation Metrics
3.2. Implementation Details
3.3. Diagnostic Experiments
3.4. Comparison with the State-of-the-Arts
- NDWI: WI is the most simple and widely used method in glacial lake mapping, including in NDWI [32] and MNDWI [35]. Among these indexes, the NDWI is the most feasible way to highlight the lake information and suppress the background information [31]. To test the segmentation performance of using NDWI only, we set the segmentation threshold to 0.6, the same as we used in our model.
- Global–local iterative segmentation algorithm (GLSeg) [8,9]: The GLSeg includes two hierarchical image segmentation stages. First, segment the NDWI map to delineate the potential lake areas using a global threshold. Second, calculate the local threshold to determine the extent of each potential lake within a buffer zone of the lake. Moreover, the auxiliary data (such as DEM) are introduced to filter the noise pixels with similar NDWI values with the glacial lakes. For fairness, we only use the RS imagery as the input, and the parameters are set following [8,9]. We set NDWI > 0.1, NIR < 0.15 and SWIR < 0.05 in the global segmentation stage and the local threshold was computed according to the mean and variance of the lake and background pixels.
- C-V model (C-V) [14,37]: as a region-based segmentation method, the C-V model shows great anti-noise ability, which improves the segmentation accuracy in the homogenous areas of glacial lakes and avoids the influences of individual noise pixels from the surroundings. The C-V model employs an active curve to separate the image into inner parts and outer parts and uses an energy function to evaluate the segment results. When the energy function reaches an optimal state, the curve will converge to the true lake boundaries. The parameters are set following [14,37].
- Random Forest classification (RF) [11]: RF has good robustness and generalization in classification tasks because of its random sampling operations on the input data and features in each decision tree. For the RF training, we set 1000 trees to vote whether a pixel belongs to the glacial lake or not, and our training set includes 93,431 glacial lake samples and 93,431 non-lake samples, each of which has seven band values and a class label.
- U-net [18,20,38]: U-net is the first DL model for glacial lake segmentation. It learns the pattern of glacial lakes in order to eliminate the dependence on the auxiliary data (such as using DEM to remove the mountain shadows). U-net contains four pairs of encoder and decoder units, and a skip connection is employed to concatenate feature maps from different scales and capture more details of the lake boundaries. Finally, the output mask is the segmentation results. We set the parameters to be the same as [18].
- GAN-GL [21]: GAN-GL uses a zero-sum game between a generator and a discriminator to find the stable state, and a water attention module is also introduced to accelerate the convergence process. This GAN-based method can delineate the glacial lake boundaries more easily, without any distribution assumptions.
4. Discussion
4.1. Visualization of Comparisons
4.2. Applicability to Different Sensors
4.3. Possibility in Monitoring GLOF Events
4.4. Impaction of Locaton Cues
5. Conclusions
Author Contributions
Funding
Conflicts of Interest
References
- Chen, F.; Zhang, M.; Guo, H.; Allen, S.; Kargel, J.S.; Haritashya, U.K.; Watson, C.S. Annual 30 m dataset for glacial lakes in High Mountain Asia from 2008 to 2017. Earth Syst. Sci. Data 2021, 13, 741–766. [Google Scholar] [CrossRef]
- Zhang, M.; Chen, F.; Guo, H.; Yi, L.; Zeng, J.; Li, B. Glacial Lake Area Changes in High Mountain Asia During 1990–2020 Using Satellite Remote Sensing. Research 2022, 2022, 1275. [Google Scholar] [CrossRef]
- Wang, X.; Guo, X.Y.; Yang, C.D.; Liu, Q.H.; Wei, J.F.; Zhang, Y.; Liu, S.Y.; Zhang, Y.L.; Jiang, Z.L.; Tang, Z.G. Glacial lake inventory of High Mountain Asia (1990–2018) derived from Landsat images. Earth Syst. Sci. Data 2020, 12, 2169–2182. [Google Scholar] [CrossRef]
- Wilson, R.; Glasser, N.F.; Reynolds, J.M.; Harrison, S.; Anacona, P.I.; Schaefer, M.; Shannon, S. Glacial lakes of the Central and Patagonian Andes. Glob. Planet. Change 2018, 162, 275–291. [Google Scholar] [CrossRef]
- Rounce, D.R.; Watson, C.S.; McKinney, D.C. Identification of hazard and risk for glacial lakes in the Nepal Himalaya using satellite imagery from 2000–2015. Remote Sens. 2017, 9, 654. [Google Scholar] [CrossRef] [Green Version]
- Hu, J.; Yao, X.; Duan, H.; Zhang, Y.; Wang, Y.; Wu, T. Temporal and Spatial Changes and GLOF Susceptibility Assessment of Glacial Lakes in Nepal from 2000 to 2020. Remote Sens. 2022, 14, 5034. [Google Scholar] [CrossRef]
- Allen, S.K.; Sattar, A.; King, O.; Zhang, G.Q.; Bhattacharya, A.; Yao, T.D.; Bolch, T. Glacial lake outburst flood hazard under current and future conditions: First insights from a transboundary Himalayan basin. Nat. Hazard Earth Syst. Sci. 2021, 22, 3765–3785. [Google Scholar] [CrossRef]
- Li, J.; Sheng, Y. An automated scheme for glacial lake dynamics mapping using Landsat imagery and digital elevation models: A case study in the Himalayas. Int. J. Remote Sens. 2012, 33, 5194–5213. [Google Scholar] [CrossRef]
- Song, C.Q.; Sheng, Y.W.; Ke, L.H.; Nie, Y.; Wang, J.D. Glacial lake evolution in the Southeastern Tibetan Plateau and the cause of rapid expansion of pro-glacial lakes linked to glacial hydrogeomorphic Processes. J. Hydrol. 2016, 540, 504–514. [Google Scholar] [CrossRef] [Green Version]
- Gardelle, J.; Arnaud, Y.; Berthier, E. Contrasted evolution of glacial lakes along the Hindu Kush Himalaya mountain range between 1990 and 2009. Glob. Planet. Change 2010, 75, 47–55. [Google Scholar] [CrossRef] [Green Version]
- Wangchuk, S.T.; Bolch, T. Mapping of glacial lakes using Sentinel-1 and Sentinel-2 data and a random forest classifier: Strengths and challenges. Sci. Remote Sens. 2020, 2, 8. [Google Scholar] [CrossRef]
- Shen, J.X.; Yang, L.; Chen, X.; Li, J.L.; Peng, Q.; Ju, H. A Method for Object-oriented Automatic Extraction of Lakes in the Mountain Area from Remote Sensing Image. Remote Sens. Land Resour. 2012, 3, 84–91. [Google Scholar] [CrossRef]
- Li, W.; Wang, W.; Gao, X.; Wu, Y.; Wang, X.; Liu, Q. A Lake Extraction method in mountainous regions based on the integration of object-oriented approach and watershed algorithm. J. Geo-Inf. Sci. 2021, 23, 1272–1285. [Google Scholar] [CrossRef]
- Zhao, H.; Chen, F.; Zhang, M. A Systematic Extraction Approach for Mapping Glacial Lakes in High Mountain Regions of Asia. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2018, 11, 2788–2799. [Google Scholar] [CrossRef]
- Mitkari, K.V.; Arora, M.K.; Tiwari, R.K. Extraction of Glacial Lakes in Gangotri Glacier Using Object-Based Image Analysis. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2017, 10, 5275–5283. [Google Scholar] [CrossRef]
- Zhang, M.; Chen, F.; Tian, B.; Liang, D. Using a Phase-Congruency-Based Detector for Glacial Lake Segmentation in High-Temporal Resolution Sentinel-1A/1B Data. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2019, 12, 2771–2780. [Google Scholar] [CrossRef]
- Kaushik, S.; Singh, T.; Joshi, P.K.; Dietz, A.J. Automated mapping of glacial lakes using multisource remote sensing data and deep convolutional neural network. Int. J. Appl. Earth Obs. Geoinf. 2022, 115, 103085. [Google Scholar] [CrossRef]
- Qayyum, N.; Ghuffar, S.; Ahmad, H.M.; Yousaf, A.; Shahid, I. Glacial Lakes Mapping Using Multi Satellite PlanetScope Imagery and Deep Learning. ISPRS Int. J. Geo-Inf. 2020, 9, 560. [Google Scholar] [CrossRef]
- Wu, R.; Liu, G.; Zhang, R.; Wang, X.; Li, Y.; Zhang, B.; Cai, J.; Xiang, W. A Deep Learning Method for Mapping Glacial Lakes from the Combined Use of Synthetic-Aperture Radar and Optical Satellite Images. Remote Sens. 2020, 12, 4020. [Google Scholar] [CrossRef]
- Thati, J.; Ari, S. A systematic extraction of glacial lakes for satellite imagery using deep learning based technique. Measurement 2022, 192, 110858. [Google Scholar] [CrossRef]
- Zhao, H.; Zhang, M.; Chen, F. GAN-GL: Generative Adversarial Networks for Glacial Lake Mapping. Remote Sens. 2021, 13, 4728. [Google Scholar] [CrossRef]
- Oord, A.; Li, Y.; Vinyals, O. Representation learning with contrastive predictive coding. arXiv 2018, arXiv:1807.03748. [Google Scholar]
- Tian, Y.; Krishnan, D.; Isola, P. Contrastive multiview coding. arXiv 2019, arXiv:1906.05849. [Google Scholar]
- Chen, X.; Fan, H.; Girshick, R.; He, K. Improved baselines with momentum contrastive learning. arXiv 2020, arXiv:2003.04297. [Google Scholar]
- Wu, M.; Zhuang, C.; Mosse, M.; Yamins, D.; Goodman, N. On mutual information in contrastive learning for visual representations. arXiv 2020, arXiv:2005.13149. [Google Scholar]
- Chen, T.; Kornblith, S.; Norouzi, M.; Hinton, G. A simple framework for contrastive learning of visual representations. arXiv 2020, arXiv:2002.05709. [Google Scholar]
- He, K.; Fan, H.; Wu, Y.; Xie, S.; Girshick, R. Momentum contrast for unsupervised visual representation learning. arXiv 2019, arXiv:1911.05722. [Google Scholar]
- Caron, M.; Misra, I.; Mairal, J.; Goyal, P.; Bojanowski, P.; Joulin, A. Unsupervised learning of visual features by contrasting cluster assignments. arXiv 2020, arXiv:2006.09882. [Google Scholar]
- Grill, J.B.; Strub, F.; Altche, F.; Tallec, C.; Richemond, P.H.; Buchatskaya, E.; Doersch, C.; Pires, B.A.; Guo, Z.D.; Azar, M.G.; et al. Bootstrap your own latent: A new approach to self-supervised learning. arXiv 2020, arXiv:2006.07733v1. [Google Scholar]
- Chen, X.; He, K. Exploring Simple Siamese Representation Learning. arXiv 2020, arXiv:2011.10566. [Google Scholar]
- Zhang, M.M.; Zhao, H.; Chen, F.; Zeng, J.Y. Evaluation of effective spectral features for glacial lake mapping by using Landsat-8 OLI imagery. J. Mt. Sci. 2020, 17, 2707–2723. [Google Scholar] [CrossRef]
- McFeeters, S.K. The use of Normalized Difference Water Index (NDWI) in the delineation of open water features. Int. J. Remote Sens. 1996, 17, 1425–1432. [Google Scholar] [CrossRef]
- Kolesnikov, A.; Lampert, C.H. Seed, Expand Constrain: Three Principles for Weakly-Supervised Image Segmentation. arXiv 2016, arXiv:1603.06098v3. [Google Scholar]
- Zhou, B.; Khosla, A.L.; Oliva, A.; Torralba, A. Learning deep features for discriminative localization. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 2921–2929. [Google Scholar]
- Xu, H.Q. Modification of normalized difference water index (NDWI) to enhance open water features in remotely sense imagery. Int. J. Remote Sens. 2006, 27, 3025–3033. [Google Scholar] [CrossRef]
- Bhardwaj, A.; Singh, M.K.; Joshi, P.K.; Snehmani; Singh, S.; Sam, L.; Gupta, R.D.; Kumar, R. A lake detection algorithm (LDA) using Landsat 8 data: A comparative approach in glacial environment. Int. J. Appl. Earth Obs. Geoinf. 2015, 38, 150–163. [Google Scholar] [CrossRef]
- Zhang, M.M.; Chen, F.; Tian, B.S. An automated method for glacial lake mapping in High Mountain Asia using Landsat 8 imagery. J. Mt. Sci. 2018, 15, 13–24. [Google Scholar] [CrossRef]
- Chen, F. Comparing Methods for Segmenting Supra-Glacial Lakes and Surface Features in the Mount Everest Region of the Himalayas Using Chinese GaoFen-3 SAR Images. Remote Sens. 2021, 13, 2429. [Google Scholar] [CrossRef]
- Ahmed, R.; Ahmad, S.T.; Wani, G.F.; Mir, R.A.; Ahmed, P. High resolution inventory and hazard assessment of potentially dangerous glacial lakes in upper Jhelum basin, Kashmir Himalaya, India. Geocarto Int. 2022, 37, 10681–10712. [Google Scholar] [CrossRef]
- Nie, Y.; Sheng, Y.W.; Liu, Q.; Liu, L.S.; Liu, S.Y.; Zhang, Y.L.; Song, C.Q. A regional-scale assessment of Himalayan glacial lake changes using satellite observations from 1990 to 2015. Remote Sens. Environ. 2017, 189, 1–13. [Google Scholar] [CrossRef] [Green Version]
- Shrestha, F.; Gao, X.; Khanal, N.R.; Maharjan, S.B.; Shrestha, R.B.; Wu, L.Z.; Mool, P.K.; Bajracharya, S.R. Decadal glacial lake changes in the Koshi basin, central Himalaya, from 1977 to 2010, derived from Landsat satellite images. J. Mt. Sci. 2017, 14, 1969–1984. [Google Scholar] [CrossRef] [Green Version]
Item | Description |
---|---|
Data source | 103 Landsat-8 OLI images |
Spatial resolution | 30 m |
Acquisition date | Between 20 July 2016 and 4 November 2016 |
Cloud cover | ≤3.40% |
Number of image patches | 1540 image patches |
Each patch size | 256 × 256 × 7 |
Bands information | Coastal; blue; green; red; NIR; SWIR1 and SWIR2. |
Covered areas | Hengduan Shan; Pamir; Tianshan; Himalaya; Nyainqentanglha; Gangdise Shan and Qilian. |
Average number of glacial lake pixels in each patch | 1225.39 (>1% area of the patch) |
Contrastive Loss | Location Loss | Precision | Recall | F1 Score | IoU |
---|---|---|---|---|---|
✔ | 0.1184 | 0.8869 | 0.1356 | 0.1083 | |
✔ | 0.8412 | 0.5912 | 0.6360 | 0.5289 | |
✔ | ✔ | 0.9406 | 0.6285 | 0.6661 | 0.5855 |
Transformation Type | Precision | Recall | F1 Score | IoU |
---|---|---|---|---|
Color jitter | 0.9213 | 0.6353 | 0.6688 | 0.5912 |
Gray scaling | 0.9209 | 0.6170 | 0.6535 | 0.5749 |
Flipping | 0.9510 | 0.6336 | 0.6821 | 0.5949 |
Rotating | 0.9427 | 0.6252 | 0.6641 | 0.5856 |
Blurring | 0.9389 | 0.6071 | 0.6497 | 0.5671 |
Random area erasing | 0.9379 | 0.5912 | 0.6371 | 0.5629 |
Noise addition | 0.9510 | 0.6021 | 0.6464 | 0.5653 |
Spectral transform | 0.9411 | 0.6125 | 0.6537 | 0.5725 |
Location transform | 0.9451 | 0.6072 | 0.6633 | 0.5803 |
Model | Label | Threshold | Precision | Recall | F1 Score | IoU |
---|---|---|---|---|---|---|
NDWI | ✔ | 0.8243 | 0.4143 | 0.5306 | 0.4392 | |
GLSeg | ✔ | 0.4828 | 0.6593 | 0.5091 | 0.4092 | |
C-V | ✔ | 0.5518 | 0.6756 | 0.5989 | 0.4347 | |
RF | ✔ | 0.6796 | 0.7443 | 0.6649 | 0.5634 | |
U-net | ✔ | 0.8669 | 0.8060 | 0.8353 | 0.7173 | |
GAN-GL | ✔ | 0.9334 | 0.9201 | 0.9217 | 0.8634 | |
ours | ✔ | 0.9406 | 0.6285 | 0.6661 | 0.5855 |
Sensors | Acquired Date | Cloud Cover (%) | Path/Row |
---|---|---|---|
Landsat 5 TM | 12 October 1988 | 3.00 | 141/040 |
Landsat 7 ETM+ | 24 October 2001 | 0.69 | 141/040 |
Landsat 8 OLI | 7 October 2015 | 1.44 | 141/040 |
Sentinel-2A MSI | 28 November 2015 | 66.07 | 119 * |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zhao, H.; Wang, S.; Liu, X.; Chen, F. Exploring Contrastive Representation for Weakly-Supervised Glacial Lake Extraction. Remote Sens. 2023, 15, 1456. https://doi.org/10.3390/rs15051456
Zhao H, Wang S, Liu X, Chen F. Exploring Contrastive Representation for Weakly-Supervised Glacial Lake Extraction. Remote Sensing. 2023; 15(5):1456. https://doi.org/10.3390/rs15051456
Chicago/Turabian StyleZhao, Hang, Shuang Wang, Xuebin Liu, and Fang Chen. 2023. "Exploring Contrastive Representation for Weakly-Supervised Glacial Lake Extraction" Remote Sensing 15, no. 5: 1456. https://doi.org/10.3390/rs15051456
APA StyleZhao, H., Wang, S., Liu, X., & Chen, F. (2023). Exploring Contrastive Representation for Weakly-Supervised Glacial Lake Extraction. Remote Sensing, 15(5), 1456. https://doi.org/10.3390/rs15051456