Dune Morphology Classification and Dataset Construction Method Based on Unmanned Aerial Vehicle Orthoimagery
Abstract
:1. Introduction
2. Materials and Methods
2.1. Overview of the Study Area
2.2. Dune Morphology Classification System
- (1)
- Barchan dunes and dune chains exhibit a crescent-shaped planar morphology, influenced by unidirectional winds. These dunes have two wing-like extensions downwind, with asymmetrical slopes on either side. The windward slope is convex and relatively gentle, while the leeward slope is concave and steeper. Under conditions of abundant sand supply, a chain of barchan dunes, known as a dune chain, can form [16].
- (2)
- Linear dunes are formed under the influence of two winds that intersect at an acute angle. Their prominent feature is a long and straight dune crest line, with symmetrical slip faces on either side of the dune. In deserts, linear dunes are often arranged in parallel and exhibit relatively regular spacing.
- (3)
- Reticulate dunes are composed of two sets of intersecting dunes, and the plane shape is grid. Based on the differences in the morphological characteristics of reticulate dunes, they can be further divided into square reticulate dunes and long reticulate dunes [17].
- (4)
- Nebkhas form when aeolian sand is obstructed by shrubby vegetation, causing a reduction in wind speed and the continuous accumulation of sand particles at the base of the shrubs. In plane view, these dunes typically exhibit an oval, circular, or tadpole-like shape, with a certain amount of shrubby vegetation covering the top of the dune mound [18].
- (5)
- In flat sandy land, there are no dunes of any form, presenting an extensive and continuous planar morphology [19].
2.3. Data Sources
2.4. Dataset Construction Method
2.5. CNN Model Selection and Parameter Configuration
2.6. Model Evaluation Metrics
3. Results
3.1. Single Category Classification Results and Analysis
- (a)
- In the manually segmented dataset, the nebkhas exhibit the highest classification accuracy. This is attributed to the presence of shrubby vegetation covering the surface of these dunes, which provides a distinct textural characteristic that sets them apart from other types of dunes. Conversely, barchan dunes and dune chains are most frequently misclassified. This misclassification is likely due to their similar morphological features when compared with linear dunes, making it challenging to differentiate them based on visual or textural cues alone.
- (b)
- In the 1024 × 1024 pixels dataset, the VGG16 model often misclassifies linear dunes as nebkhas. This misclassification can be attributed to the presence of shrubby vegetation in the areas where linear dunes were photographed, which may lead to confusion during the classification process. The VGG19 model, on the other hand, predominantly misclassifies barchan dunes and dune chains as linear dunes and reticulate dunes. Additionally, there was a significant amount of misclassification between linear dunes and reticulate dunes, which resulted in lower classification accuracy rates for barchan dunes, dune chains, and reticulate dunes.
- (c)
- In the 512 × 512 pixels dataset, the VGG16 model frequently misclassifies barchan dunes and dune chains as linear dunes and flat sandy land. For the VGG19 model, linear dunes were most often misclassified as reticulate dunes. This is likely because, during the small-scale regular segmentation, the reticulate dunes may only include a portion of their morphological features, which can resemble those of linear dunes, thus making misclassifications more likely.
- (d)
- In the 256 × 256 pixels dataset, barchan dunes and dune chains were often misclassified as flat sandy land. When segmenting the orthoimages of barchan dunes and dune chains, the interdune areas share similar characteristics with flat sandy land. At the smaller segmentation scale, a single image may not capture the morphological features of barchan dunes and dune chains, leading to a higher likelihood of misclassification between these two types.
- (e)
- In the 128 × 128 pixels dataset, due to the small segmentation scale, there was a significant amount of misclassification. Specifically, barchan dunes and dune chains were frequently misclassified as flat, sandy land. Additionally, reticulate dunes were also misclassified, particularly barchan dunes and dune chains, as some of their morphological features are similar. The smaller segmentation scale does not fully capture the morphological characteristics, which can lead to confusion when the CNN models are extracting deep semantic features.
3.2. Results and Analysis of Different Dataset Construction Methods
- (a)
- When the dataset is segmented at a scale of 1024 × 1024 pixels using a regular segmentation method, the VGG16 model achieves the highest accuracy, precision, recall, and F1-Score, which are 97.05%, 96.91%, 96.76%, and 96.82%, respectively.
- (b)
- The VGG16 model’s classification accuracy, arranged from highest to lowest, is as follows: the dataset with 1024 × 1024 pixels dataset, the 512 × 512 pixels dataset, the 256 × 256 pixels dataset, the 128 × 128 pixels dataset, and the manually segmented dataset.
- (c)
- The VGG19 model’s classification accuracy, arranged from highest to lowest, is as follows: the 1024 × 1024 pixels dataset, the 512 × 512 pixels dataset, the 256 × 256 pixels dataset, the manually segmented dataset, and the 128 × 128 pixels dataset.
3.3. Visualization and Analysis of Semantic Feature Results for Test Set Images
- (a)
- In the manually segmented dataset, nebkhas, flat sandy land, and reticulate dunes exhibit distinct cluster structures. However, there is an overlap in the data points for barchan dunes, dune chains, and linear dunes, suggesting that these dune types have closer semantic distances and share similar textural characteristics.
- (b)
- In the 1024 × 1024 pixels dataset, each type exhibits a distinct cluster structure, while barchan dunes and dune chains, linear dunes, reticulate dunes, and nebkhas display close semantic distances.
- (c)
- In the 512 × 512 pixels dataset, data points for barchan dunes and dune chains, nebkhas, and linear dunes are interspersed, with the three exhibiting similar textural characteristics.
- (d)
- In the 256 × 256 pixels dataset, the classification performance of the VGG16 model surpasses that of the VGG19 model. The VGG16 model exhibits distinct cluster structures, whereas in the VGG19 model, there is a clear misclassification across various types, with data points showing a pronounced interspersion.
- (e)
- In the 128 × 128 pixels dataset, data points for nebkhas and linear dunes are clustered together, while other types are interspersed, exhibiting close semantic distances and similar morphological features.
4. Discussion
4.1. Methods for Dune Morphology Dataset Construction
4.2. Performance of Different Models in Dune Morphology Classification Datasets
5. Conclusions
- (1)
- When the segmentation scale of UAV orthoimagery is set to 1024 × 1024 pixels with an overlap of 100 pixels, the classification outcome for dune morphologies is optimal. The VGG16 model achieved classification accuracy, precision, recall, and an F1-Score of 97.05%, 96.91%, 96.76%, and 96.82%, respectively. Compared with the manually segmented dataset, these metrics improved by 6.32%, 6.27%, 6.23%, and 6.38%, respectively.
- (2)
- The semantic feature maps of the test set visually demonstrate that the 1024 × 1024 pixels dataset has distinct cluster structures for each type of dune morphology, resulting in the best classification performance.
- (1)
- Due to the significant differences in scale and height of the dunes in the study area, different Ground Sampling Distance (GSD) were utilized when collecting UAV data. The impact of varying GSD on the classification results of dune morphology can be further explored in future research.
- (2)
- In future research, a larger collection of UAV orthoimagery data will be gathered to explore further the specific impact of varying quantities of each dune morphology type within the dataset on model classification. Additionally, the influence of different quantities of dune morphology datasets on model classification will be investigated.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Zhao, X.M. Research on Information Extraction of Sand Dune Type Based on Deep Learning. Master’s Thesis, Xinjiang University, Urumqi, China, 2021. [Google Scholar]
- Li, Z.S.; Ni, J.R. A Review of Desert Dune Geomorphology. J. Sediment Res. 2000, 5, 73–81. [Google Scholar]
- Qian, G.Q.; Yang, Z.L.; Dong, Z.B.; Tian, M. Three-dimensional Morphological Characteristics of Barchan Dunes Based on Photogrammetry with A Multi-rotor UAV. J. Desert Res. 2019, 39, 18–25. [Google Scholar]
- Atito, S.; Yanikoglu, B.A.; Aptoula, E. Plant Identification with Large Number of Classes: SabanciU-GebzeTU System in PlantCLEF 2017. In Proceedings of the Conference and Labs of the Evaluation Forum, Dublin, Ireland, 11–14 September 2017. [Google Scholar]
- Cui, B.C. Deep Learning-Based Multi-Source Remote Sensing Image Classification of Dunes—A Case Study of the Southern Edge of Guerbantungut Desert. Master’s Thesis, Xinjiang University, Urumqi, China, 2020. [Google Scholar]
- Tang, Y.F.; Wang, Z.J.; Jiang, Y.Z.; Zhang, T.; Yang, W.J. An Auto-Detection and classification algorithm for identification of sand dunes based on remote sensing images. Int. J. Appl. Earth Obs. Geoinf. 2023, 125, 103592. [Google Scholar] [CrossRef]
- Van Der Merwe, B.; Pillay, N.; Coetzee, S. An Application of CNN to Classify Barchan Dunes into Asymmetry Classes. Aeolian Res. 2023, 56, 100801. [Google Scholar] [CrossRef]
- Azzaoui, M.A.; Masmoudi, L.; El Belrhiti, H.; Chaouki, I.E. Barchan Sand Dunes Collisions Detection in High Resolution Satellite Images based on Image Clustering and Transfer Learning. Int. J. Adv. Comput. Sci. Appl. (IJACSA) 2020, 11, 376–383. [Google Scholar] [CrossRef]
- Cunez, E.; Franklin, E. Detection and tracking of barchan dunes using Artificial Intelligence. Res. Sq. 2024; in press. [Google Scholar] [CrossRef]
- Jiang, T.; Wang, X.J. Convolutional Neural Network for GF-2 Image Stand Type Classification. J. Beijing For. Univ. 2019, 41, 20–29. [Google Scholar]
- Xu, H.; Sun, H.Z.; Wang, L.B.; Yu, X.C.; Li, T.Y. Urban Architectural Style Recognition and Dataset Construction Method under Deep Learning of street View Images: A Case Study of Wuhan. ISPRS Int. J. Geo-Inf. 2023, 12, 264. [Google Scholar] [CrossRef]
- Kasim, N.; Shi, Q.D.; Liu, S.H.; Imin, B.; Li, H. Automatic Classification Method of Oasis Plant Community in Desert Hinterland Based on VGGNet and ResNet Models. Trans. Chin. Soc. Agric. Mach. 2019, 50, 217–225. [Google Scholar]
- Feng, L.Y.; Zhou, H.Y.; Zhao, X.D. Spatial Distribution Patterns and Changes of Haloxylon Ammodendron and Artemisia Desertorum Population in Ulan Buh Desert. J. Nanjing For. Univ. (Nat. Sci. Ed.) 2024, 48, 155–160. [Google Scholar]
- Yang, G.; Ma, W.X.; Bao, S.Q.; Zhang, C.F. Relationship between the Structure of Oxytropis aciphylla and Caragana microphylia Shrubberies and Their Wind-Shadow Dunes. Arid Zone Res. 2016, 33, 540–547. [Google Scholar]
- Wu, Z. Sandy Deserts and Its Control in China; Science Press: Beijing, China, 2009; pp. 120–133. [Google Scholar]
- Dong, Z.B.; Su, Z.Z.; Qian, G.Q.; Luo, W.Y.; Zhang, Z.S.; Wu, J.F. Aeolian Geomorphology of the Kumtagh Desert; Science Press: Beijing, China, 2011; pp. 204–206. [Google Scholar]
- Yang, F.N.; Lv, P.; Ma, F.; Cao, M.; Xiao, N.; Gu, L.X.; Yang, Y. Morphological Evolution and Migration Characteristics of Reticulate Dunes at Southern Fringe of Tengger Desert. J. Desert Res. 2023, 43, 107–115. [Google Scholar]
- Huang, Q.N.; Dong, Z.B.; Xiao, N.; Feng, M.Y.; Shi, H.Y.; Chen, G.X.; Bai, Z.Y.; Zhang, X.; Gao, C.; Wang, X.Z. Sediments and Morphology of Coastal Spinifex Littoreus Nebkhas on Jingxinjiao Beach of Hainan Island, China. J. Desert Res. 2021, 41, 157–168. [Google Scholar]
- Li, M.; Yan, J.H.; Ye, W.Z.; Dong, S.; Yang, Z.K. Automatic Classification of Sand Dune Morphology Based on Convolutional Neural Networks. J. Arid Land Resour. Environ. 2024, 38, 121–129. [Google Scholar]
- Gu, C.R.; Liu, Y.H.; Zhao, X.T. Outdoor Group Emotion Recognition Based on Skeleton Keypoints. Transducer Microsyst. Technol. 2024, 43, 119–123+127. [Google Scholar]
- Huang, Y.L.; Ai, X. Research on Classification of Corn Leaf Disease Image by Improved Residual Network. Comput. Eng. Appl. 2021, 57, 178–184. [Google Scholar]
- Simonyan, K.; Zisserman, A. Very deep convolutional networks for large-scale image recognition. In Proceedings of the 3rd International Conference on Learning Representations (ICLR), San Diego, CA, USA, 7–9 May 2015; pp. 1–14. [Google Scholar]
- Wu, L.Y.; Tong, J.B.; Wang, Z.F.; Ma, D.; Zhang, J.W.; Liao, J.A. Classification of Damaged Grade on Rural Houses after Flood Disaster Based on Deep Convolutional Neural Network and Transfer Learning. Earth Sci. 2023, 48, 1742–1754. [Google Scholar]
- Yang, A.R.; Zhang, Z. Parameter Extraction of Pinus yunnanensis Canopy Based on UAV Remote Sensing. J. Northwest For. Univ. 2024, 39, 1–9. [Google Scholar]
- Yang, Q.C.; Liu, M.; Zhang, Z.T.; Yang, S.Q.; Ning, J.F.; Han, W.T. Mapping plastic mulched farmland for high resolution images of unmanned aerial vehicle using deep semantic segmentation. Remote Sens. 2019, 11, 2008. [Google Scholar] [CrossRef]
- Lin, Z.W.; Ding, Q.L.; Huang, J.H.; Tu, W.H.; Hu, D.; Liu, J.F. Study on Tree Species Classification of UAV Optical Image based on DenseNet. Remote Sens. Technol. Appl. 2019, 34, 704–711. [Google Scholar]
- Pouliot, D.; Latifovic, R.; Pasher, J.; Duffe, J. Assessment of Convolution Neural Networks for Wetland Mapping with Landsat in the Central Canadian Boreal Forest Region. Remote Sens. 2019, 11, 772. [Google Scholar] [CrossRef]
- Dyrmann, M.; Karstoft, H.; Midtiby, H.S. Plant species classification using deep convolutional neural network. Biosyst. Eng. 2016, 151, 72–80. [Google Scholar] [CrossRef]
UAV Parameters | Numerical Value | Camera Parameters | Numerical Value/Form |
---|---|---|---|
Weight (including battery and paddles) | 1388 g | Image sensor | 1-inch CMOS effective pixels 20 million |
Wheelbase | 350 mm | ||
Maximum flight time | About 30 min | Lens | FOV84°8.8 mm/24 mm (35 mm format equivalent) f/2.8–f/11 with autofocus (Focus distance 1 m- infinity) |
Maximum horizontal flight speed | 72 km/h | ||
Maximum tilt angle | 42° | Photo size | 3:2 aspect ratio: 5472 × 3648 4:3 aspect ratio: 4864 × 3648 16:9 aspect ratio: 5472 × 3078 |
Maximum bearable wind speed | 10 m/s | ||
Maximum takeoff altitude | 6000 m | ||
Satellite Positioning Module (SPM) | GPS/GLONASS dual mode | Picture format | JPE; GDNG(RAW), JPEG + DNG |
Dune Types | Flight Altitude/m | Lateral Overlap | Forward Overlap | GSD/cm/Pixels |
---|---|---|---|---|
Linear dunes | 80 | 75% | 80% | 2.19 |
Nebkhas | 30 | 75% | 80% | 0.82 |
Barchan dunes and dune chains | 80 | 75% | 80% | 2.19 |
Reticulate dunes | 100 | 75% | 80% | 2.74 |
Flat sandy land | 80 | 70% | 80% | 2.19 |
Dune Types | Number of Training Sets | Number of Test Sets | Total |
---|---|---|---|
Flat sandy land | 606 | 151 | 757 |
Barchan dunes and dune chains | 575 | 143 | 718 |
Reticulate dunes | 585 | 146 | 731 |
Nebkhas | 635 | 157 | 792 |
Linear dunes | 632 | 158 | 790 |
Dune Types | Number of Training Sets | Number of Test Sets | Total |
---|---|---|---|
Flat sandy land | 2765 | 691 | 3456 |
Barchan dunes and dune chains | 2480 | 620 | 3100 |
Reticulate dunes | 1863 | 465 | 2328 |
Nebkhas | 2436 | 608 | 3044 |
Linear dunes | 2145 | 536 | 2681 |
Dune Types | Number of Training Sets | Number of Test Sets | Total |
---|---|---|---|
Flat sandy land | 6042 | 1510 | 7552 |
Barchan dunes and dune chains | 8314 | 2078 | 10,392 |
Reticulate dunes | 9770 | 2442 | 12,212 |
Nebkhas | 6672 | 1668 | 8340 |
Linear dunes | 7087 | 1771 | 8858 |
Dune Types | Number of Training Sets | Number of Test Sets | Total |
---|---|---|---|
Flat sandy land | 7736 | 1934 | 9670 |
Barchan dunes and dune chains | 10,112 | 2527 | 12,639 |
Reticulate dunes | 8627 | 2156 | 10,783 |
Nebkhas | 11,300 | 2824 | 14,124 |
Linear dunes | 9266 | 2316 | 11,582 |
Dune Types | Number of Training Sets | Number of Test Sets | Total |
---|---|---|---|
Flat sandy land | 8592 | 2148 | 10,740 |
Barchan dunes and dune chains | 9352 | 2338 | 11,690 |
Reticulate dunes | 9259 | 2314 | 11,573 |
Nebkhas | 11,191 | 2797 | 13,988 |
Linear dunes | 10,907 | 2726 | 13,633 |
Name | Configuration |
---|---|
Operating system | Windows 10 64-bit operating system |
Processor | Intel(R) Xeon(R) Silver 4216 [email protected] GHz 2.10 GHz (2 Processors) (Intel, Santa Clara, CA, USA) |
Video memory | NVIDIA GeForce RTX 3060 (NVIDIA, Santa Clara, CA, USA) |
Memory | 64 GB |
Deep learning framework | pytorch2.0.1 |
Programming language | python3.9 |
CUDA version | 12.2 |
Confusion Matrix | Predicted Value | ||
---|---|---|---|
Positive | Negative | ||
True Value | Positive | True Positive = TP | False Negative = FN |
Negative | False Positive = FP | True Negative = TN |
Datasets | Models | Accuracy | Precision | Recall | F1-Score |
---|---|---|---|---|---|
Manually segmented dataset | VGG16 | 90.73% | 90.64% | 90.53% | 90.44% |
VGG19 | 90.73% | 90.57% | 90.65% | 90.59% | |
1024 × 1024 pixels dataset | VGG16 | 97.05% | 96.91% | 96.76% | 96.82% |
VGG19 | 95.75% | 95.47% | 95.50% | 95.47% | |
512 × 512 pixels dataset | VGG16 | 94.18% | 94.09% | 94.16% | 94.09% |
VGG19 | 93.25% | 93.21% | 93.40% | 93.28% | |
256 × 256 pixels dataset | VGG16 | 94.07% | 94.00% | 94.15% | 94.03% |
VGG19 | 92.86% | 92.80% | 92.91% | 92.83% | |
128 × 128 pixels dataset | VGG16 | 91.66% | 91.24% | 91.30% | 91.23% |
VGG19 | 90.58% | 90.09% | 90.09% | 90.08% |
Datasets | Classification Model | Training Time |
---|---|---|
1024 × 1024 pixels dataset | VGG16 | 26 h 30 min |
VGG19 | 26 h 56 min | |
512 × 512 pixels dataset | VGG16 | 18 h 26 min |
VGG19 | 19 h | |
256 × 256 pixels dataset | VGG16 | 7 h 16 min |
VGG19 | 11 h 3 min | |
128 × 128 pixels dataset | VGG16 | 6 h 47 min |
VGG19 | 8 h 3 min |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Li, M.; Yang, Z.; Yan, J.; Li, H.; Ye, W. Dune Morphology Classification and Dataset Construction Method Based on Unmanned Aerial Vehicle Orthoimagery. Sensors 2024, 24, 4974. https://doi.org/10.3390/s24154974
Li M, Yang Z, Yan J, Li H, Ye W. Dune Morphology Classification and Dataset Construction Method Based on Unmanned Aerial Vehicle Orthoimagery. Sensors. 2024; 24(15):4974. https://doi.org/10.3390/s24154974
Chicago/Turabian StyleLi, Ming, Zekun Yang, Jiehua Yan, Haoran Li, and Wangzhong Ye. 2024. "Dune Morphology Classification and Dataset Construction Method Based on Unmanned Aerial Vehicle Orthoimagery" Sensors 24, no. 15: 4974. https://doi.org/10.3390/s24154974
APA StyleLi, M., Yang, Z., Yan, J., Li, H., & Ye, W. (2024). Dune Morphology Classification and Dataset Construction Method Based on Unmanned Aerial Vehicle Orthoimagery. Sensors, 24(15), 4974. https://doi.org/10.3390/s24154974