1. Introduction
As global climate change intensifies, extreme weather events are becoming more frequent and severe, making ice avalanches (IAs) a common natural disaster in High Mountain Asia (HMA). IAs occur when glacier ice collapses along shear planes or weak surfaces on steep slopes, forming debris flows and triggering secondary disasters [
1]. With the increasing frequency and scale of IAs, IA hidden danger, which refers to the likelihood of an IA occurring due to glacier instability, steep terrain, crack development, and external environmental factors, poses an increasingly severe threat to ecosystems and human safety [
2,
3]. On the Tibetan Plateau (TP) and other glacier-dense regions, over 40,963 glaciers cover about 45,000 km
2, significantly affecting local life, property, and the environment [
4,
5]. Herein, the early identification of IA hidden dangers is crucial for prevention and danger mitigation.
At present, the traditional methods of identifying IA hidden dangers are mainly site geological survey, remote sensing interpretation, and historical data evaluation [
6]. LaChapelle [
7] used artificial control and empirical judgment for avalanche identification, but the accuracy remains insufficient for more precise predictions. Tang et al. [
2] identified the hidden dangers of ice avalanche on the Tibetan Plateau using an interactive remote sensing interpretation manually and via computer. Due to the high altitude and the harsh environment of the IA area, it is very difficult and inefficient to identify that by site investigation or manual remote sensing.
In recent years, due to the outstanding performance of image processing and pattern recognition, deep learning technologies have shown great potential in the intelligent recognition of geohazards. Numerous scholars have explored applying deep learning methods to identify and predict geohazards [
8]. These efforts improve recognition accuracy and automation, reducing experimental expenses and human bias. For example, Choubin et al. [
9,
10] combined glacier-monitoring data with terrain features and used deep learning technology to predict the possibility and danger levels of avalanches, thus demonstrating the application of deep learning in IA hidden danger identification. As interdisciplinary research progresses, the demand for deep learning in image recognition and classification is increasing in geological engineering [
11]. Although some traditional machine learning methods like Support Vector Machines (SVM), Artificial Neural Networks (ANN), and Random Forests (RF) perform well on simple classification problems [
12,
13,
14], their efficiency and accuracy can be limited in scenarios involving complex image datasets and high-level abstract feature extraction. Therefore, as part of deep learning, Convolutional Neural Networks (CNN) have demonstrated a superior capability in image recognition.
CNN effectively captures and recognizes complex image patterns by learning hierarchical feature representations of images. Lin et al. [
15] and Wang et al. [
16] studied high-resolution geological maps, geological structures, and landslide susceptibility, successfully achieving identification, prediction, and categorization. However, traditional CNN models, often using large convolutional kernels, face challenges such as low computational efficiency and the need for extensive network parameters. To enhance the accuracy of network training, researchers have begun exploring deeper network architectures to extract features more effectively. Advanced models such as the Visual Geometry Group 16 (VGG16) and Residual Neural Network 50 (ResNet50) have emerged to improve performance by deepening the network structure. These models provide new technical solutions for the intelligent early warning of geohazards. However, directly applying existing deep learning models like VGG16 or ResNet50 to identify the hidden danger of IA faces challenges such as model overfitting, low training efficiency, and insufficient generalization ability. These issues require further optimization.
Identifying IA hidden danger is challenging, and the efficiency of current optical interpretation methods needs improvement. Herein, we use advanced remote sensing technology and Geographic Information Systems (GIS) to analyze and identify the hidden danger points of IAs on the TP. After evaluating the performance of different deep learning models, the optimized Early Stopping—L2 Norm Regularization—Visual Geometry Group 16 (ES-L2-VGG16) model is developed to enhance accuracy and efficiency in identifying IA hidden dangers. The core purpose is to explore the application of deep learning technology in the intelligent identification of hidden danger of IA. At the same time, the effectiveness and reliability of the model are verified through real-world cases. This promotes the widespread application of deep learning models in natural disaster prediction.
2. Geological Background
Due to unique geological and topographic conditions, coupled with global warming, the TP has become a region prone to frequent IA events. Covering approximately 2.5 million square kilometers (
Figure 1), the TP is the largest plateau in western China, featuring a complex terrain with plateaus, mountains, and basins at altitudes exceeding 3000 m. It is one of the world’s largest glacier concentration areas, formed by the collision between the Indian and Eurasian plates, and contains multiple geological tectonic units [
17]. These tectonic activities have caused significant terrain fluctuations and height instability, contributing to the occurrence of IAs. According to Tang et al. [
18], 40,269 glaciers on the TP were identified using remote sensing techniques combined with human–machine interaction, resulting in a database of 581 IA hidden danger points (
Figure 1).
In addition, the glaciers are widely distributed in the Yarlung Tsangpo River Basin in southeastern TP, particularly in the Nyingchi region, which is part of the eastern extension of the Himalayan orogenic belt [
19]. The terrain is complex and steep, characterized by deep valleys and large slopes [
20]. Due to the collision between the Indian and Eurasian plates, the rocks in this region have undergone intense metamorphism, resulting in complex structural geomorphology, including multiple faults and uplifted areas. The region is rich in glacier resources, and the dynamic changes in glaciers directly impact the occurrence of IAs. Glaciers are widely covered, and their movement and changes are critical factors in IA occurrence. In particular, glacier cracks and slope changes significantly impact glacial stability. Increases in glacier cracks and slope steepness may weaken the ice structure, thus promoting IA occurrences.
Therefore, the Yarlung Tsangpo River Gorge in Nyingchi was selected as the core area for identification and validation (
Figure 1 and
Figure 2). The validation area refers to this glacier-rich region, where a trained model was used to evaluate the hidden danger of IAs, and the model’s accuracy is validated through observational data. The complex topography and dynamic glacial changes in this region make it an ideal area for testing and refining the model’s ability to identify IA hidden dangers.
In addition, through data collection and field survey, this paper summarizes the historical events of IA in the Yarlung Tsangpo River basin in Linzhi City since the 20th century (
Table 1). In the 1980s, the number of IAs showed an increasing trend. Since the turn of the 21st century, the number of IAs in the Yarlung Tsangpo River basin of Linzhi City has exceeded 10 [
3]. Among them, the most typical and representative case is the Sedongpu Valley IA.
Geological surveys show that the strata in the Sedongpu area are primarily composed of metamorphic and intrusive rocks, which have undergone multiple folding and faulting events, resulting in complex geological structures. These formations are key factors driving glacier deformation and fracturing. Additionally, the area’s unique geographical location and geological conditions make it highly susceptible to surface temperature and precipitation changes, which accelerate glacier crevasse formation and heighten the hidden danger of IAs.
These geological and environmental factors are amplified by the Sedongpu Valley’s unique topography. Located on the left bank of the lower Yarlung Tsangpo River, at the terminus of the Eastern Himalayan tectonic knot, the valley has a distinctive funnel-shaped terrain, surrounded by mountains on three sides with steep upper slopes. The terrain not only promotes glacier formation but also, due to its specific slope and orientation, increases the frequency of IAs (
Figure 3). The 3D stereo terrain of the Sedongpu Valley is based on DEM elevation data processed on 30 November 2018, highlighting how the abundant glaciers provide ample energy and material for IAs to occur. In the past ten years, there have been several ice avalanche disasters in this region. This paper will take this as a typical case to analyze and extract the following ice avalanche identification indicators.
3. Identification Method
Figure 4 summarizes the overall methodology for identifying IA hidden dangers. The methodology is divided into three main sections, data processing and the establishment of identification indicators and danger level classification; model construction, training and optimization; and finally hidden danger identification and validation. A detailed explanation of each step is provided in the following sections.
3.1. Data Collection and Processing
The study area was calibrated using Google Earth pro to ensure the accurate location data for 581 IA hidden danger points on the TP [
18]. Additionally, data for 500 uncertain ice avalanches (UAs) hidden danger points were collected for model comparison and training. The primary remote sensing data source was Sentinel-2A satellite imagery, supplemented by a 12.5 m resolution Digital Elevation Model (a) and satellite images with a focus on areas within a 5 km radius around each identified IA hidden danger point.
ArcGIS pro was used to extract key terrain parameters, such as slope, which are crucial for assessing glacier instability. Special attention was given to detecting terrain cracks near the hidden danger points, as these cracks serve as important indicators for identifying IAs. Python scripts were employed to automate the crack detection process and extract relevant features from the images. To minimize interference from cloud cover and snow, images taken during spring and autumn were selected, ensuring high clarity and minimal obstruction.
By integrating remote sensing data with GIS, terrain features were accurately extracted and prepared for modeling. ArcGIS pro was used to calculate the slope at each hidden danger point, represented by a color gradient to highlight the varying steepness. Python’s edge detection algorithms identified terrain cracks from satellite images, providing detailed surface fracture data.
To enhance model training, the extracted slope and crack features are combined using layer superposition based on their spatial relevance. This integration resulted in a comprehensive training dataset that accurately captured these essential features, enabling the creation of fully recognizable training images for the model (
Figure 5).
3.2. Identification Index and Danger Levels Classification of IA
By integrating remote sensing data with geospatial techniques, and focusing on IA characteristics in typical case areas, key factors such as slope and crack distribution are examined to develop an index system for IA hidden danger identification, and several typical cases are used for comparative validation. Finally, IA hidden danger levels are classified based on slope and crack development.
3.2.1. Characteristics of Typical IA
Based on the remote sensing images from the Resource No.3 02 satellite on 22 December 2016, 28 December 2017, and 30 December 2018 (
Figure 4), the Sedongpu Basin was divided into nine independent snow and ice accumulation zones (regions ①–⑨) during the pre-avalanche period. The classification was primarily derived from remote sensing imagery and topographic characteristics, including glacier recharge areas, deformation body distribution, movement directions, and terrain features (detailed in
Table 2). The stability of each glacier and the hidden danger of an IA was assessed based on slope and crack distribution.
Since 2014, the Sedongpu Glacier has experienced continuous evolution. By December 2016, numerous cracks had developed within the glacier’s deformation bodies. By December 2017, these cracks expanded further and penetrated the glacier surface. In 2018, the continued expansion of these cracks triggered an IA, resulting in significant debris flow, blocking the river and causing large-scale sliding.
Figure 6 illustrates the critical role of cracks in inducing IAs.
In addition, by processing the digital elevation model (DEM) of the Sedongpu Glacier as of 22 December 2016, and performing slope calculations using ArcGIS, a direct correlation between slope and IA occurrence was identified.
Figure 7 shows the proportion of glacial deformation in different slope intervals, with yellow to red areas representing slopes steeper than 30°. In regions ① and ②, the average slope of areas prone to frequent IAs ranged between 30° and 60°. A higher proportion of glacier deformation areas with slopes exceeding 30° is linked to an increased likelihood of IA events. In particular, the upstream residual glacier deformation area in region ③ shows a significant average slope and a notable proportion of areas with slopes over 30°.
3.2.2. Identification Index
The Sedongpu No. 6 Glacier in China serves as the primary focus, from which remote sensing images are used to examine glacial IA events. To further validate the universality of the findings, data from the Chamoli region in India (7 February 2021) [
32] and the Monte Maldarà region in Italy (3 July 2022) [
33] are also compared. The Sedongpu Glacier’s satellite images from 4 December 2017 revealed significant slope angles and tensile cracks (
Figure 8a). Similarly, images from the Chamoli glacier (5 February 2021) and the Marmolada glacier (3 July 2022) (
Figure 8b,c) show comparable patterns of steep slopes and crack development. These comparisons are made to reinforce that slope steepness and crack formation are universal precursors to glacial disintegration and IA events, observed across different regions.
Figure 6 illustrates the slope and crack distribution across these glaciers. IA hidden danger areas are highlighted with red ellipses, derived from remote sensing interpretation and slope calculations using DEM data. Cracks are emphasized with red lines to indicate areas most vulnerable to disintegration. For the Sedongpu glacier (
Figure 8a) and Chamoli glacier (
Figure 8b), prominent cracks are observed in steep sections, indicating high-hidden danger IA zones. Similarly, the Marmolada glacier (
Figure 8c) shows crack expansion in steep regions, indicating potential future disintegration.
These comparisons confirm that despite geographical differences, steep slopes and crack propagation are consistent indicators of IA hidden danger. These findings, observed through the remote sensing of the Sedongpu Glacier, are reinforced by the results from Chamoli and Marmolada, demonstrating that slope steepness and crack expansion are reliable precursors to IA events.
In summary, identifying IA hidden danger areas relies on evaluating three key indicators: source glacier, steep slope, and cracks (
Figure 9).
3.2.3. Danger Levels Classification
Based on statistical data on 40,269 glaciers on the TP, IA development follows a normal distribution relative to slope [
34]. The sensitive slope range for IA development is between 35° and 55°, accounting for 95% of the total number of IA hidden danger. When the slope is less than 25° or greater than 55°, the number of IA hidden danger decreases significantly [
2]. At lower slopes, insufficient potential energy prevents sliding, while steeper slopes increase stability, reducing the chance of collapse. Herein, slope is divided into three IA danger levels, low (<25°), medium (25–55°), and high (>55°) [
2,
18].
Crack development is classified by factors such as the number of crack groups, density, width, and extension length [
35]. Weakly developed cracks have fewer than five groups with sparse distribution and small openings, indicating low danger. Moderately developed cracks (5–10 groups) show moderate density and extension, signaling increased instability. Strongly developed cracks (over 10 groups) have dense distribution, large openings, and long extensions, indicating high collapse danger [
36].
IA danger levels are determined by both slope and crack development. Low danger occurs when both are low, medium danger when both are moderate, and high danger when both are high. A moderate slope with strong crack development can also elevate the danger level (
Table 3).
3.3. Model Construction for Artificial Intelligent Identification
3.3.1. Construction Principle
ResNet50, developed by Kaiming He and colleagues at Microsoft Research in 2015, addresses degradation in deep neural network training [
37]. As network depth increases, performance typically improves, then saturates, and may decline. ResNet50 introduces a residual learning structure, where each block has a shortcut connection that bypasses one or more layers. Through this, the network learns residual mappings instead of direct mappings, mitigating vanishing and exploding gradients, and enhancing training efficiency and model accuracy [
38].
The architecture begins with a 7 × 7 convolutional layer (stride of 2), followed by batch normalization, a ReLU activation layer, and max pooling. Subsequent layers include four groups of residual blocks, each containing multiple blocks with three convolutional layers (1 × 1, 3 × 3, 1 × 1) to reduce complexity [
39]. Each block includes batch normalization and ReLU activation, with jump connections linking the blocks. The design concludes with a global average pooling layer to reduce parameters and minimize overfitting risk, followed by a fully connected layer and a Softmax layer for output (
Figure 10a). The skip connections significantly enhance training stability and model accuracy.
VGG, an advanced CNN model based on TensorFlow, was first proposed by Simonyan and Zisserman in 2014 [
40]. It plays a crucial role in deep learning and data mining, particularly in high-precision classification and prediction [
41]. The model features 3 × 3 convolutional kernels and a streamlined architecture. It includes multi-layer nonlinear processing layers to enhance the learning of complex patterns, making it ideal for image recognition and classification. VGG16, a variant of the VGG series, illustrated in
Figure 10b, consists of thirteen convolutional layers and three fully connected layers. It accepts a 224 × 224 pixel RGB image, starting with two convolutional layers of 64 kernels each, followed by ReLU activation to maintain spatial resolution [
42]. The subsequent 2 × 2 max pooling layer reduces the feature map size progressively. As the network deepens, the number of convolution kernels increases to 128, 256, and 512, allowing in-depth feature extraction. The model culminates in a 7 × 7 × 512 feature map passing through three fully connected layers, and a 1000-unit output layer employing the Softmax function for final classification [
43,
44].
3.3.2. Parameter Setting and Model Training
This study employed detailed data preprocessing techniques to ensure the quality and reliability of the 1081 samples, consisting of 581 identified IA hidden danger point samples and 500 UA samples. These samples are divided into training and test sets in an 8/2 ratio, with 20% of the training set reserved for validation (
Table 4). The image preprocessing steps included resizing the images to 224 × 224 pixels, random shuffling, and normalizing pixel values between 0 and 1, thereby improving model convergence and stability.
To prevent overfitting and optimize the training process, careful settings were chosen for learning rates, epochs, and batch sizes. The ResNet50 model used the Adam optimizer, which adaptively adjusts the learning rate based on the training progress, with no fixed settings for epochs or batch size, allowing for greater flexibility during training. In contrast, the VGG16 model used the SGD optimizer, requiring precise settings for the learning rate, epochs, and batch size to achieve optimal training performance.
3.3.3. Performance Evaluation
Six key indicators are selected to ensure the comprehensive and accurate assessment of the performance of two models, Accuracy (Acc), Loss rate (Loss), Precision (Pre), Recall rate (Rec), F1 score (F1score), and Model Run Time (Time). These metrics measure the performance of the model on the training, validation, and test sets, and reflect its performance in dealing with different categories in the dataset.
Specifically, Acc is the proportion of instances correctly predicted by the model, which reflects the predictive ability of the model. Loss quantifies the deviation between the model prediction and the actual label, a statistical measure of the error. The Pre, Rec, and F1
score are calculated based on the Confusion Matrix (CM) and return the recognition accuracy, recall rate, and F1
score for each type of data sample. The correlation expression is presented below. All evaluation metrics are derived from the confusion matrix (CM), which categorizes prediction results into True Positives (
TP), True Negatives (
TN), False Positives (
FP), and False Negatives (
FN). A detailed description is provided in
Table 5.
For the loss function, Loss utilizes Binary Cross-Entropy for computation; NP is the number of correctly identified samples; OS is the total number of samples; and N represents the number of samples. Moreover, is the actual label of the sample and is the predicted probability of the model.
To carefully evaluate the performance of the model in each category, training set accuracy rate (train_acc), training set loss rate (train_loss), validation set accuracy rate (valid_acc), validation set loss rate (valid_loss), test set accuracy rate (test_acc), and test set loss rate (test_loss) are used to evaluate the model. At the same time, the Macro Average (MA) and Weighted Average (WA) methods are used to calculate the macro average (MAA) and weighted average (WAA) of precision, the macro average (MAR) and weighted average (WAR) of recall, and the macro average (MAF) and weighted average (WAF) of F1
score.
Table 6 shows the model’s overall performance evaluation indicators. It reveals the adaptability and precision of the model when facing datasets with unevenly distributed categories.
In this context, n represents the number of categories, denotes the -th category, and represents the number of samples in the -th category.
3.3.4. Model Training Results
The diagnostic performance of the models was evaluated using the Acc curve, with identification accuracy and loss rates for different epochs shown in
Figure 11 and
Figure 12. The ResNet50 model achieved optimal performance after 10 epochs, with a batch size of 128. On the other hand, the VGG16 model reached its peak performance at 18 epochs, with a learning rate of 0.001 and a batch size of 128.
The confusion matrix in
Figure 13 and the performance metrics in
Table 7 further illustrate the comparison between the ResNet50 and VGG16 models in terms of training and validation accuracy. While VGG16 outperformed ResNet50 in the training and validation phases, it lagged slightly behind in the test set performance. This suggests that, although VGG16 demonstrated strong capabilities in identifying IA hidden danger, further optimization is required to enhance its accuracy and efficiency on unseen data.
3.3.5. Model Optimization
The VGG16 model was optimized for identifying IA hidden danger using pre-optimization measures such as data enhancement, Early Stopping (ES), and L2 Regularization. While data enhancement typically boosts generalization, it was found that in this specific task, the data-enhanced model (D-VGG16) absorbed excessive noise from over-processed training samples, reducing accuracy (
Figure 14). Conversely, the ES-VGG16 model effectively prevented overfitting and shortened the training cycle by terminating when validation set loss ceased to decrease, as illustrated in
Figure 15. Additionally, the introduction of L2 Norm Regularization (L2-VGG16) significantly enhanced performance on both training and test sets and reduced overfitting risks (
Figure 16), with related indicators such as Precision and Recall detailed in
Table 8.
By integrating Early Stopping and regularization, the optimized ES-L2-VGG16 model demonstrated improved accuracy, operational efficiency, and faster convergence in identifying IA hidden danger, as shown in
Figure 17. The experience with VGG16 underscores the importance of selecting appropriate optimization strategies, tailored to the model’s structure and task requirements. Understanding the adaptability of different technologies across various model architectures is crucial for effective optimization.
3.4. Identification Procedure of IA Hidden Danger
In identifying IA hidden danger, the process starts with the efficient detection of source glaciers in the target region using advanced machine vision technology, where the boundaries are clearly marked with green contour lines.
Subsequently, the optimized ES-L2-VGG16 deep neural network model accurately identifies cracks in the glacier source area, marking these with red ellipses based on the model’s learned characteristics of glacier cracks and slopes.
Meanwhile, slopes within the glacier area are meticulously measured and translated into pixel values using GIS technology, with the data derived from Sentinel-2A satellite images and analyzed using ArcGIS and Google Earth pro 7.3.6 software. Pretrained by the model, slope features are automatically recognized and highlighted with blue rectangles to assess the area’s slope status, as detailed in
Figure 18.
In
Figure 18c, the intersection of glacier boundaries (marked in green), cracks (marked with red ellipses), and slope markings (highlighted with blue rectangles) ultimately represents the identified IA hidden danger areas. The regions labeled ”p1–p8” correspond to specific locations within the glacier, where the model identified critical combinations of these indicators, marking them as areas with hidden danger of IA.
In addition, the slope features are extracted at the pixel level, which allows for the precise measurement of slope variations across the glacier surface. For crack detection, an edge detection algorithm is employed to accurately identify and extract cracks. The process of extracting and analyzing slope and crack features is shown in
Figure 19.
Ultimately, this comprehensive integration of glacier boundaries, cracks, and slope data enables the precise identification of hidden danger, allowing IA hidden danger areas to be located more efficiently and accurately during the glacier identification and validation process.
5. Discussion
Recent research shows that deep learning algorithms have been widely used in predicting natural disasters, particularly landslides and IA hidden dangers. Liu et al. [
45] and Li et al. [
46] demonstrated that deep learning models outperform traditional methods. Deep learning models like CNN and VGG have been successfully applied to predicting geohazards such as landslides, IAs, and debris flows [
47]. These models achieved assessment accuracies of 90% and 91.89% in identifying landslides and glaciers, respectively [
48,
49]. This verifies their feasibility and accuracy in natural disaster identification. However, traditional algorithms like logistic regression, SVM, MDA, BPNN, and RF, remain effective in specific scenarios [
50,
51,
52]. Yet, in the complex terrain of the TP, these methods struggle to extract sufficiently accurate information.
In this paper, the ES-L2-VGG16 model was constructed by further improving the deep learning model with ES and L2. The data fusion method integrates multivariate data, such as slope and crack, achieving a training accuracy of 98.61%, which is significantly better than that of existing models. This highlights the importance of advanced deep learning strategies in improving model generalization and optimization. Consequently, the ES-L2-VGG16 model has become a powerful tool in natural disaster prediction.
Future research suggests focusing on optimizing the model to better recognize increasingly complex geological features and to expand the range of IA identification indicators to improve learning effectiveness. The model will integrate advanced algorithms such as DenseNet, Particle Swarm Optimization, and Long Short-Term Memory networks to enhance training accuracy and precision. The ultimate goal is to identify IA hidden danger in glacier regions worldwide, establish a comprehensive global IA hidden danger database, and provide support for identifying geological disasters.