1. Introduction
The precision of identifying disaster-prone entities through the natural disaster risk census directly impacts the efficiency and effectiveness of post-disaster relief and recovery endeavors [
1]. Consequently, the precise identification of these disaster-prone entities contributes to accurately defining areas at risk of disasters and facilitates continuous monitoring of potential disaster occurrences [
2]. However, extracting the targets of disaster-prone entities presents certain challenges. Primarily, the environmental conditions surrounding these entities are often intricate and influenced by geographic features, meteorological conditions, and human activities. Additionally, the diverse characteristics of damage exhibited by disaster-prone entities post-disaster, stemming from variations in structure, morphology, and extent of damage, further complicate the extraction process [
3,
4]. The combination of these influencing factors gives rise to difficulties in distinguishing between disaster-prone entities and surrounding features, thereby intensifying the complexity of disaster risk delineation.
Currently, data on disaster-bearing entities primarily originate from field surveys and building data [
5], demanding substantial human, material, and financial resources [
6]. With the advancement of remotely sensed technology, the utilization of remotely sensed target recognition technology has emerged as a crucial method for recognizing and extracting disaster-bearing entities [
7,
8,
9]. Notably, typical disaster-bearing entities, such as dams, bridges, roads, and reservoir embankments, share striking similarities in remotely sensed images. As depicted in
Figure 1, they exhibit evident linear and color features, and their relatively concentrated distribution in geospatial space amplifies the challenge of extracting these target disaster-bearing objects, thereby complicating the disaster census. The reliance on feature-based image extraction methods fails to fully exploit structural and textural information in remotely sensed images, leading to diminished classification accuracy at the pixel level. Moreover, an excessive dependence on limited pixel spectral information can constrain the classification results, making it arduous to accurately distinguish between various targets. In essence, feature-based remotely sensed image extraction methods struggle to fulfill the requirements for the high-precision and high-efficiency extraction of disaster-bearing entities.
In recent years, the emergence of deep learning technology has offered innovative solutions to address the constraints of feature-based image extraction methods [
10,
11]. For instance, Biffi et al. introduced a ground-based RGB image detection method utilizing adaptive training samples to select a deep learning model, effectively overcoming issues of target self-obscuration and mutual occlusion [
12]. Similarly, Balaniuk et al. combined cloud computing, free open-source software, and deep learning techniques to automatically identify and classify large-scale mining tailing dams nationwide [
13]. With the rapid evolution of this domain, deep learning based on convolutional neural networks (CNNs) has demonstrated robust feature extraction capabilities and high accuracy rates [
14]. For example, Shao et al. leveraged nighttime remotely sensed data to enhance ship detection, refining the YOLOv5 algorithm model to effectively improve the accuracy and completeness of ship datasets [
15]. Additionally, Yan et al. developed an intelligent, high-precision method for extracting information from tailing ponds, addressing the challenge of incomplete data by enhancing deep learning target detection models. This advancement not only enhances the recognition accuracy of tailing pond failures but also boosts decision-making efficiency in tailing pond management, laying the groundwork for global-scale tailing dam detection [
16]. While deep learning stands as a potent machine learning algorithm capable of handling nonlinear and complex data, remotely sensed images remain susceptible to interference from various factors such as background conditions, lighting, and target shape diversity. Deep learning models encounter difficulties in distinguishing targets with high similarity [
17,
18], as illustrated in
Figure 2, where bridges, roads, and reservoir embankments are erroneously categorized under the same label with relatively high confidence levels.
Furthermore, while deep learning models excel at recognizing targets within images, they often lack precise location information about these targets. However, disaster census and post-disaster rescue operations typically necessitate accurate target location data. Consequently, scholars have introduced spatial constraint strategies to enhance the understanding of target shape, structure, and location. For instance, Van Soesbergen et al. devised a globally available remotely sensed imagery method for automated dam reservoir extraction, effectively distinguishing dam reservoirs from natural water bodies to bridge the geolocation data gap between dams and reservoirs [
19]. Similarly, Asbury and Aly integrated remotely sensed and Geographic Information System (GIS) techniques to examine the impact of drought on ten selected surface reservoirs in San Angelo and Dallas, Texas, emphasizing the need for continuous monitoring to understand drought effects on reservoirs [
20]. Additionally, Yang et al. introduced a GIS-based accident analysis framework centered on spatial feature distribution [
21], while Chen et al. proposed a model for assessing human settlement suitability at the village scale, combining decision analysis and spatial analysis for the first time [
22]. The spatial constraint strategy delves into the intricate spatial relationships between targets, eliminating various interfering factors that may introduce errors, thus refining the discrimination scope. This approach enables more accurate problem analysis and the development of effective solutions.
In summary, the challenges in recognizing and extracting disaster-bearing objects can be categorized as follows: (1) Distinguishing disaster-bearing objects from surrounding features with similar geometric and spectral characteristics poses a significant obstacle. Consequently, employing a single method for their recognition and extraction proves challenging. (2) The presence of various background factors within the region introduces interference, impacting the accuracy of disaster-bearing body recognition. This interference often leads to confusion between disaster-bearing bodies and other features present in remotely sensed images.
Based on the aforementioned analysis, it is evident that remotely sensed images are vulnerable to interference from background and lighting conditions, as well as the diverse shapes of targets. This poses challenges for deep learning models in distinguishing targets with high similarity during recognition. Consequently, misjudgments can occur, and the precise location of the target remains unknown when using deep learning models alone. However, by integrating spatial constraint strategies, interference factors can be mitigated, and the discriminatory range can be narrowed, aiding the deep learning model in achieving more accurate target localization. Therefore, this paper focuses on dams as a representative example of typical disaster-bearing bodies and explores the feasibility of combining deep learning techniques with spatial constraint strategies. The results demonstrate significant potential for the extraction of typical disaster-bearing bodies using this approach.
The subsequent sections of the paper are structured as follows.
Section 2 offers an overview of the study area, detailing the methodological process for dataset construction and the acquisition of additional necessary experimental data. Furthermore, it introduces the concepts and composition of two distinct classes of target recognition models and spatial constraint strategies.
Section 3 explores the extraction results achieved through the fusion of deep learning and spatial constraint strategies, along with discussing the evaluation method employed to assess extraction accuracy.
Section 4 examines the applicability of the spatial constraint strategy and provides supplementary analysis on target extraction from disaster-bearing bodies in small watersheds.
Section 5 summarizes the conclusions drawn from this study.
5. Conclusions
Within the existing disaster management system, conducting a thorough census and implementing efficient management of disaster-bearing entities are recognized as fundamental stages in disaster risk assessment and zoning. Ensuring the quality and completeness of pertinent data are crucial for accurately delineating areas at risk of disasters, optimizing resource allocation, and formulating effective strategies for disaster response.
In this study, we tackle the challenge posed by the difficulty of a single remotely sensed image extraction method in distinguishing between similar features. To address this, we propose an extraction method that combines deep learning and spatial constraint strategies. This approach involves comparing spatial features of dams with those of similar features, like bridges, roads, and reservoir embankments. By carefully selecting appropriate constraint strategies, establishing separability index rules for dams and the mentioned features, and employing hydrological analyses to narrow the research scope to the river, interference from roads and reservoir embankments can be excluded. Additionally, terrain analysis is utilized to eliminate interference from bridges. The accuracy of the extraction results for a typical disaster-bearing entity—dams—is evaluated using three indicators: extraction rate, omission rate, and false extraction rate.
The accuracy of dam extraction is contingent not only on the precision of the neural network but also on factors such as image quality, the effectiveness of water body extraction, and post-processing accuracy. Our dataset comprises high-quality dam images encompassing various types, sizes, and geographic locations. The accuracy of image quality and class not only enhances the performance of the deep learning model but also elucidates the texture and morphological features of dams more clearly. The introduction of DDRM in this study was pivotal in narrowing down the dam discrimination range. This effectively eliminated interfering point locations within the study area, making significant contributions to the accurate extraction of dam candidates. The hybrid method, combining deep learning and spatial constraint strategies, not only identifies typical disaster-bearing entities but also successfully pinpoints dam locations. The method achieves high accuracy, with extraction rates, omission rates, and false extraction rates of 94.73%, 5.27%, and 11.11%, respectively. Our findings indicate that, built upon open geographic data products, the proposed technological process in this paper demonstrates reliability in dam extraction and effectively identifies and locates targets. This approach not only overcomes the limitations and challenges of traditional methods but also uncovers dams not recorded in the database, offering new insights and possibilities for the management, monitoring, and planning of typical hazard-bearing bodies.
This study serves as a valuable reference for the methodology and implementation of dam target detection using open geographic data and technology. While some progress has been achieved in this research direction, there are still potential challenges and areas for improvement. On one hand, the model’s training dataset could be further refined to enhance its generalization ability, and the integration of more advanced recognition models may contribute to further improvements in dam recognition performance. On the other hand, exploring additional high-resolution remotely sensed data sources could enhance the accuracy of dam recognition. Furthermore, considering the application of the framework on a global scale and in other fields could help address challenges in global dam management and geological research. This broader application could play a role in contributing to the sustainable development of human society.