SAR-NTV-YOLOv8: A Neural Network Aircraft Detection Method in SAR Images Based on Despeckling Preprocessing
Abstract
:1. Introduction
- A paradigm for speckle reduction preprocessing of SAR images is proposed, followed by a CNN based aircraft detection networks. In the despeckling preprocessing stage, a new noise reduction model based on non-convex total variation is proposed. This approach effectively smooths the image while preserving its details. By significantly enhancing aircraft scattering information and reducing background clutter and speckle noise, despeckling allows the subsequent neural detection network to extract aircraft features more effectively.
- Aiming at the aircraft detection task of SAR image, this paper chooses YOLOv8 (SAR-YOLOv8) as the basic structure. At the same time, a high-resolution small target feature head detector suitable for YOLOv8 is adopted. We introduce an innovative attention pyramid that combines local and contextual features. In SAR-YOLOv8, local attention dynamically captures the target’s local characteristics. The cross-scale attention mechanism aids the SAR-YOLOv8 in collecting crucial contextual information, thereby effectively minimizing false positives. Then, we propose a progressive feature network of spatial feature fusion, which makes the different scale feature information generated by each stage effectively utilize the multi-stage context, so as to significantly strengthen the precises of aircraft targets in the scattering area.
- We construct a SAR-NTV-YOLOv8 aircraft detection framework and conduct sufficient comparative and ablation experiments. The high accuracy of SAR data sets shows that SAR-NTV-YOLOv8 is suitable for aircraft detection. Meanwhile, through a large number of comparative experiments, ablation experiments and despeckling experiments, it is shown that the proposed SAR-NTV-YOLOv8 method has the most advanced detection capability compared to other object detection networks.
2. Related Work
2.1. SAR Image Despeckling Method
2.2. YOLOv8
3. Proposed Method
3.1. Overall Scheme of the Proposed Method
3.2. Despeckling Preprocessing Module
3.2.1. SAR Image Multiplicative Noise Signal Model
3.2.2. Based on Convex Total Variation Regularization Model
3.3. Aircraft Target Detection Module
- In SAR images, the targets varies significantly, with a higher number of aircraft with fewer scattering points, and complex and blurry background object scattering. Many similar image features increase the difficulty of extracting key features.
- The network model is very complex, which is not conducive to deploy on-board or airborne equipment. This paper has made some improvements for YOLOv8.
3.3.1. High Resolution Small Target Feature Heads
3.3.2. Efficient Multi-Scale Attention Based on Cross Spatial Learning
3.3.3. Progressive Feature Pyramid Based on Adaptive Spatial Feature Fusion
3.4. Loss Function
4. Experiments and Results
4.1. Dataset and Settings
4.2. Evaluation Metrics
4.2.1. SAR Image Despeckling Evaluation Metrics
4.2.2. Aircraft Detection Evaluation Metrics
4.3. Results and Analysis
4.3.1. Comparison with Other Despeckling Methods
4.3.2. Comparison with Other Target Detecting Methods
- Shape feature difference: Ships typically have a more regular shape, making them easier to identify in SAR images, especially in stationary environments with minimal background interference. In contrast, aircraft are smaller, usually ranging from 10 to 80 m in length. For example, a Boeing 737 has a wingspan of about 34 m, a fuselage length of 39 m, and a total surface area of approximately 556 square meters. In comparison, a container ship like the “Maersk Magellan” measures about 400 m in length, 60 m in width, and has a deck area of around 24,000 square meters. This indicates that the area of a ship is roughly 43 times that of a medium-sized passenger aircraft.
- Differences in SAR Scattering Characteristics: Ships typically have flat metal surfaces that exhibit strong backscattering characteristics. In contrast, the scattering characteristics of aircraft are more complex, primarily arising from components such as the wings, fuselage, and tail. While the surfaces of aircraft are generally smoother, their complex shapes can lead to more dispersed scattering characteristics, often resulting in weaker echoes in SAR images.
- Neural Network Architecture: Neural networks customized for ship target detection typically utilize deeper convolutional layers to extract richer features. Additionally, these networks often employ specific pooling strategies to preserve the geometric shapes of the ships. In contrast, neural networks designed for aircraft target detection commonly incorporate attention mechanisms to enhance focus on aircraft features, particularly in complex backgrounds.
4.3.3. Ablation Experiment
5. Discussion
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
AA | Aubert and Aujol |
AFPN | Asymptotic Feature Pyramid Network |
ASFF | Adaptive Spatial Feature Fusion |
CNN | Convolutional neural network |
DFL | distribution focal loss |
EMA | efficient multi-scale attention |
ENL | equivalent number of looks |
EPI | edge preserving index |
IoU | Intersection over Union |
MAP | maximum a posterior |
NTV | nonconvex total variation |
SAR | Synthetic aperture radar |
SFR-Net | scattering feature relationship network |
STDH | small target detection head |
TV | total variation |
YOLOv8 | You Only Look Once v8 |
References
- Moreira, A.; Prats-Iraola, P.; Younis, M.; Krieger, G.; Hajnsek, I.; Papathanassiou, K.P. A tutorial on synthetic aperture radar. IEEE Geosci. Remote Sens. Mag. 2013, 1, 6–43. [Google Scholar] [CrossRef]
- Zeng, K.; Wang, Y. A deep convolutional neural network for oil spill detection from spaceborne SAR images. Remote Sens. 2020, 12, 1015. [Google Scholar] [CrossRef]
- Luti, T.; De Fioravante, P.; Marinosci, I.; Strollo, A.; Riitano, N.; Falanga, V.; Mariani, L.; Congedo, L.; Munafò, M. Land consumption monitoring with SAR data and multispectral indices. Remote Sens. 2021, 13, 1586. [Google Scholar] [CrossRef]
- Zhang, Y.; Lu, D.; Qiu, X.; Li, F. Scattering-Point-Guided RPN for Oriented Ship Detection in SAR Images. Remote Sens. 2023, 15, 1411. [Google Scholar] [CrossRef]
- Zheng, Y.; Liu, P.; Qian, L.; Qin, S.; Liu, X.; Ma, Y.; Cheng, G. Recognition and depth estimation of ships based on binocular stereo vision. J. Mar. Sci. Eng. 2022, 10, 1153. [Google Scholar] [CrossRef]
- Feng, Y.; Chen, J.; Huang, Z.; Wan, H.; Xia, R.; Wu, B. A Lightweight Position-Enhanced Anchor-Free Algorithm for SAR Ship Detection. Remote Sens. 2022, 14, 1908. [Google Scholar] [CrossRef]
- Reigber, A.; Scheiber, R.; Jager, M.; Prats-Iraola, P.; Hajnsek, I.; Jagdhuber, T.; Papathanassiou, K.P.; Nannini, M.; Aguilera, E.; Baumgartner, S.; et al. Very-High-Resolution Airborne Synthetic Aperture Radar Imaging: Signal Processing and Applications. Proc. IEEE 2013, 101, 759–783. [Google Scholar] [CrossRef]
- He, C.; Tu, M.; Xiong, D.; Tu, F.; Liao, M. A component-based multi-layer parallel network for airplane detection in SAR imagery. Remote Sens. 2018, 10, 1016. [Google Scholar] [CrossRef]
- Xiao, X.; Jia, H.; Xiao, P.; Wang, H. Aircraft Detection in SAR Images Based on Peak Feature Fusion and Adaptive Deformable Network. Remote Sens. 2022, 14, 6077. [Google Scholar] [CrossRef]
- Yu, W.; Li, J.; Wang, Z.; Yu, Z. Boosting SAR Aircraft Detection Performance with Multi-Stage Domain Adaptation Training. Remote Sens. 2022, 15, 4614. [Google Scholar] [CrossRef]
- Rostami, M.; Kolouri, S.; Eaton, E.; Kim, K. Deep Transfer Learning for Few-Shot SAR Image Classification. Remote Sens. 2019, 11, 1374. [Google Scholar] [CrossRef]
- Wu, F.; Hu, T.; Xia, Y.; Ma, B.; Sarwar, S.; Zhang, C. WDFA-YOLOX: A Wavelet-Driven and Feature-Enhanced Attention YOLOX Network for Ship Detection in SAR Images. Remote Sens. 2024, 16, 1760. [Google Scholar] [CrossRef]
- Zhang, X.; Hu, D.; Li, S.; Luo, Y.; Li, J.; Zhang, C. Aircraft Detection from Low SCNR SAR Imagery Using Coherent Scattering Enhancement and Fused Attention Pyramid. Remote Sens. 2023, 15, 4480. [Google Scholar] [CrossRef]
- Jia, Z.; Zheng, H.; Wang, R.; Zhou, W. FedDAD: Solving the Islanding Problem of SAR Image Aircraft Detection Data. Remote Sens. 2024, 15, 3620. [Google Scholar] [CrossRef]
- Gao, J.; Gao, X.; Sun, X. Geometrical features-based method for aircraft target interpretation in high-resolution SAR images. Foreign Electron. Meas. Technol. 2022, 34, 21–28. [Google Scholar]
- Guo, Q.; Wang, H.; Xu, F. Aircraft target detection from spaceborne synthetic aperture radar image. Aerosp. Shanghai 2018, 35, 57–64. [Google Scholar]
- He, C.; Tu, M.; Liu, X.; Xiong, D.; Liao, M. Mixture statistical distribution based multiple component model for target detection inhigh resolution SAR imagery. ISPRS Int. J. Geo-Inf. 2017, 6, 336. [Google Scholar] [CrossRef]
- Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 1137–1149. [Google Scholar] [CrossRef]
- Lin, T.-Y.; Goyal, P.; Girshick, R.; He, K.; Dollar, P. Focal loss for dense object detection. In Proceedings of the IEEE InternationalConference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; pp. 2999–3007. [Google Scholar]
- Tian, Z.; Shen, C.; Chen, H.; He, T. FCOS: Fully Convolutional One-Stage Object Detection. In Proceedings of the International Conference on Computer Vision (ICCV), Seoul, Republic of Korea, 27 October–2 November 2019; pp. 9627–9636. [Google Scholar]
- Zhao, Y.; Zhao, L.; Li, C.; Kuang, G. Pyramid Attention Dilated Network for Aircraft Detection in SAR Images. IEEE Geosci. Remote Sens. Lett. 2020, 18, 662–666. [Google Scholar] [CrossRef]
- Zhao, W.; Lan, D.; Jia, M.; Bin, L.; Dong, Y. SAR Target Detection Based on SSD with Data Augmentation and Transfer Learning. IEEE Geosci. Remote Sens. Lett. 2019, 16, 150–154. [Google Scholar]
- Wang, J.; Xiao, H.; Chen, L.; Xing, J.; Pan, Z.; Luo, R.; Cai, X. Integrating Weighted Feature Fusion and the Spatial Attention Module with Convolutional Neural Networks for Automatic Aircraft Detection from SAR Images. Remote Sens. 2021, 13, 910. [Google Scholar] [CrossRef]
- Guo, Q.; Wang, H.; Xu, F. Scattering Enhanced Attention Pyramid Network for Aircraft Detection in SAR Images. IEEE Trans. Geosci. Remote Sens. 2021, 59, 7570–7587. [Google Scholar] [CrossRef]
- Kang, Y.; Wang, Z.; Fu, J.; Sun, X.; Fu, K. SFR-Net: Scattering Feature Relation Network for Aircraft Detection in Complex SARImages. IEEE Trans. Geosci. Remote Sens. 2022, 60, 5218317. [Google Scholar] [CrossRef]
- Chen, L.; Luo, R.; Xing, J.; Li, Z.; Yuan, Z.; Cai, X. Geospatial Transformer Is What You Need for Aircraft Detection in SAR Imagery. IEEE Trans. Geosci. Remote Sens. 2022, 60, 5225715. [Google Scholar] [CrossRef]
- Yu, Y.; Acton, S.T. Speckle reducing anisotropic diffusion. IEEE Trans. Image Process. 2002, 11, 1260–1270. [Google Scholar] [PubMed]
- Rudin, L.; Osher, S. Total variation based image restoration with free local constraints. In Proceedings of the 1st International Conference on Image Processing, Austin, TX, USA, 13–16 November 1994; Volume 1, pp. 31–35. [Google Scholar]
- Osher, S.; Burger, M.; Goldfarb, D.; Xu, J.; Yin, W. An iterative regularization method for total variation-based image restoration. Multiscale Model. Simul. 2005, 4, 460–489. [Google Scholar] [CrossRef]
- Aubert, G.; Aujol, J.-F. A variational approach to removing multiplicative noise. SIAM J. Appl. Math. 2008, 68, 925–946. [Google Scholar] [CrossRef]
- Bioucas-Dias, J.M.; Figueiredo, M.A. Multiplicative noise removal using variable splitting and constrained optimization. IEEE Trans. Image Process. 2010, 7, 1720–1730. [Google Scholar] [CrossRef] [PubMed]
- Wang, P.; Zhang, H.; Patel, V. SAR Image Despeckling Using a Convolutional Neural Network. IEEE Signal Process. Lett. 2017, 24, 1763–1767. [Google Scholar] [CrossRef]
- Cozzolino, D.; Verdoliva, L.; Scarpa, G.; Poggi, G. Nonlocal CNN SAR Image Despeckling. Remote Sens. 2020, 12, 1006. [Google Scholar] [CrossRef]
- Dalsasso, E.; Yang, X.; Denis, L.; Tupin, F.; Yang, W. SAR Image Despeckling by Deep Neural Networks: From a Pre-Trained Model to an End-to-End Training Strategy. Remote Sens. 2020, 12, 2636. [Google Scholar] [CrossRef]
- Wang, H.; Ding, Z.; Li, X.; Shen, S.; Ye, X.; Zhang, D.; Tao, S. Convolutional Neural Network with a Learnable Spatial Activation Function for SAR Image Despeckling and Forest Image Analysis. Remote Sens. 2021, 13, 3444. [Google Scholar] [CrossRef]
- Zhang, J.; Wei, Z.; Xiao, L. A fast adaptive reweighted residual feedback iterative algorithm for fractional-order total variation regularized multiplicative noise removal of partly-textured images. Signal Process. 2014, 7, 381–395. [Google Scholar] [CrossRef]
- Chen, D.; Sun, S.; Zhang, C.; Chen, Y.; Xue, D. Fractional-order tvl2 model for image denoising. Cent. Eur. J. Phys. 2013, 1, 1414–1422. [Google Scholar]
- Gu, S.; Zhang, L.; Zuo, W.; Feng, X. Weighted nuclear norm minimization with application to image denoising. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 23–28 June 2014; pp. 2862–2869. [Google Scholar]
- Dillon, R.; Jordan, K.; Jacqueline, H.; Ahmad, D. Real-Time Flying Object Detection with YOLOv8. arXiv 2018, arXiv:2305.09972. [Google Scholar]
- Zhan, W.; Sun, C.; Wang, M.; She, J.; Zhang, Y.; Zhang, Z.; Sun, Y. An improved yolov5 real-time detection method for small objects captured by uav. Soft Comput. 2022, 26, 361–373. [Google Scholar] [CrossRef]
- Zhu, X.; Lyu, S.; Wang, X.; Zhao, Q. Tph-yolov5: Improved yolov5 based on transformer prediction head for object detection on drone-captured scenarios. In Proceedings of the IEEE/CVF International Conference on Computer vision, Virtual Conference, 11–17 October 2021; pp. 2778–2788. [Google Scholar]
- Guo, Y.; Chen, S.; Zhan, R.; Wang, W.; Zhang, J. LMSD-YOLO: A Lightweight YOLO Algorithm for Multi-Scale SAR Ship Detection. Remote Sens. 2022, 14, 4801. [Google Scholar] [CrossRef]
- Wang, Z.; Kang, Y.; Zeng, X. SAR-AIRcraft-1.0: High-resolution SAR aircraft detection and recognition dataset. J. Radars 2023, 12, 906–922. [Google Scholar]
- Zhang, P.; Xu, H.; Tian, T.; Gao, P.; Li, L.; Zhao, T.; Zhang, N.; Tian, J. SEFEPNet: Scale Expansion and Feature Enhancement Pyramid Network for SAR Aircraft Detection with Small Sample Dataset. IEEE J. Sel. Top. Appl. Earth Obs. Remote. Sens. 2022, 15, 3365. [Google Scholar] [CrossRef]
Image | Algorithm | ENL | EPI | Time |
---|---|---|---|---|
SAR Image 1 | MIDAL | 11.43 | 0.31 | 7.13 s |
TV | 5.96 | 0.27 | 0.98 s | |
SAR-CNN-M-xUnit | 12.57 | 0.39 | 2.73 s | |
NTV | 21.73 | 0.43 | 1.02 s | |
SAR Image 2 | MIDAL | 9.15 | 0.33 | 7.01 s |
TV | 7.78 | 0.25 | 0.93 s | |
SAR-CNN-M-xUnit | 14.91 | 0.36 | 2.67 s | |
NTV | 23.94 | 0.40 | 0.99 s |
Algorithm | Params | FLOPs | SAR-AIRcraft-1.0 | SADD | ||||||
---|---|---|---|---|---|---|---|---|---|---|
(M) | (G) | AP (%) | P (%) | R (%) | F1 (%) | AP (%) | P (%) | R (%) | F1 (%) | |
FCOS | 31.89 | 206.20 | 57.93 | 82.86 | 85.01 | 83.37 | 73.89 | 87.92 | 86.45 | 87.18 |
YOLOv7 | 37.23 | 105.23 | 70.64 | 89.71 | 88.45 | 87.41 | 81.47 | 92.11 | 90.35 | 91.21 |
LMSD-YOLO | 3.5 | 6.6 | 73.43 | 88.24 | 89.07 | 87.41 | 83.18 | 92.47 | 90.92 | 92.00 |
PFF-ADN | 43.15 | 198.50 | 76.15 | 87.36 | 90.97 | 90.33 | 85.62 | 93.12 | 91.75 | 93.16 |
SEFEPNet | 40.23 | 167.21 | 80.33 | 90.78 | 90.62 | 91.34 | 86.98 | 94.13 | 92.88 | 93.05 |
SAR-NTV-YOLOv8 | 49.33 | 117.61 | 83.41 | 93.53 | 92.17 | 93.10 | 88.64 | 93.82 | 93.15 | 92.73 |
No. | STDH | EMA | AFPN | NTV | Precision (%) | Recall (%) | F1 (%) | mAP50 (%) |
---|---|---|---|---|---|---|---|---|
1 | x | x | x | x | 87.05 | 81.72 | 82.13 | 75.61 |
2 | ✓ | x | x | x | 88.63 | 86.43 | 84.43 | 77.37 |
3 | ✓ | ✓ | x | x | 90.21 | 87.23 | 87.36 | 79.81 |
4 | ✓ | ✓ | ✓ | x | 91.21 | 89.74 | 90.08 | 80.23 |
5 | ✓ | ✓ | ✓ | ✓ | 93.53 | 92.17 | 93.17 | 83.41 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Guo, X.; Xu, B. SAR-NTV-YOLOv8: A Neural Network Aircraft Detection Method in SAR Images Based on Despeckling Preprocessing. Remote Sens. 2024, 16, 3420. https://doi.org/10.3390/rs16183420
Guo X, Xu B. SAR-NTV-YOLOv8: A Neural Network Aircraft Detection Method in SAR Images Based on Despeckling Preprocessing. Remote Sensing. 2024; 16(18):3420. https://doi.org/10.3390/rs16183420
Chicago/Turabian StyleGuo, Xiaomeng, and Baoyi Xu. 2024. "SAR-NTV-YOLOv8: A Neural Network Aircraft Detection Method in SAR Images Based on Despeckling Preprocessing" Remote Sensing 16, no. 18: 3420. https://doi.org/10.3390/rs16183420
APA StyleGuo, X., & Xu, B. (2024). SAR-NTV-YOLOv8: A Neural Network Aircraft Detection Method in SAR Images Based on Despeckling Preprocessing. Remote Sensing, 16(18), 3420. https://doi.org/10.3390/rs16183420