A Survey of SAR Image Target Detection Based on Convolutional Neural Networks
Abstract
:1. Introduction
- For the traditional SAR image target detection algorithm, we divided the traditional detection algorithm into three categories, and studied the detection algorithm of each category with the relevant references, analyzed the basic idea, advantages, and disadvantages of different algorithms under the same category. Based on this, we summarized the characteristics of these three algorithms, which then lead to the necessity of using CNN for SAR image target detection.
- We analyzed the fundamental theory and network structure of CNN and studied the SAR target detection data sets which are frequently used at present.
- Based on a mass number of references, we studied the CNN-based SAR image target detection. According to the main problems faced by CNN in SAR image target detection, we divided the literature review analysis into five categories. We summarized the innovative ideas of various improved algorithms. Simultaneously, we compared CNN with traditional SAR target detection algorithms and obtained the characteristics of various algorithms.
- The difficulties and challenges in the field of SAR image target detection were derived from the analysis of references, which pointed out the direction for future research.
2. Research Methodology
3. SAR Image Target Detection Based on Traditional Algorithm
3.1. Object Detection Algorithm Based on Structural Features
3.2. Object Detection Algorithm Based on Gray Features
3.3. Object Detection Algorithm Based on Texture Features
- Statistical texture features;
- Model texture features;
- Structural texture features;
- Signal-processing texture features [18].
3.4. Chapter Summary
- The model has high classification accuracy. Deep learning can fully make use of deep networks and use nonlinear activation functions to conduct layer-by-layer nonlinear transformation, which has a better approximation effect on complex functions.
- Deep learning can accurately extract high-level features and avoid complex manual feature extraction, which greatly reduces the workload.
- When the amount of data required for the model is substantial enough, the robustness and generalization of the algorithm will be relatively strong, and it has favorable adaptability to some complex environments as well.
4. Convolutional Neural Network
4.1. Basic Theory of CNN
4.1.1. Locally Connected Layer
4.1.2. Weight Sharing
4.1.3. Sampling Layer
4.2. Research Progress of CNN in Optical Image Field
5. SAR Image Research
5.1. SAR Image Detection and Processing
5.1.1. SAR Image Dataset
5.1.2. SAR Image Preprocessing
5.2. Research on SAR Image Target Detection Based on CNN
- Target detection in complex scenes, improve detection accuracy, and reduce false alarm rate or missed detection rate;
- Aiming at the shortage of existing data, transfer learning and small sample learning methods are developed;
- Real-time model detection and lightweight network;
- Multi-scale small target detection;
- Combination of traditional detection algorithms and deep convolutional neural networks.
5.2.1. Target Detection in Complex Scenes
5.2.2. Transfer Learning and Small Sample Learning Methods
5.2.3. Real-Time Model Detection and Lightweight Network
5.2.4. Multi-Scale Small Target Detection
5.2.5. Combination of Traditional Target Detection Algorithm and CNN
5.3. Summary of Research Status
6. Future Prospects and Key Challenges
- In a complex background, the image contains substantial speckle noise, which inevitably interferes with the target detection model. Therefore, it is very important to further advance the model’s robustness. In addition, the algorithmic accuracy is relatively low, which still needs to be improved. This makes it difficult to be widely used in practical fields. Therefore, in some complex backgrounds, by further improving the model’s structure and training strategy, the algorithm accuracy and generalization performance can be promoted.
- CNN contains a considerable number of network hyperparameters. The appropriateness of hyperparameter selection substantially impacts detection accuracy. However, at present, the selection of hyperparameters for the CNN target detection network mostly relies on manual work. It is challenging to select a set of reasonable data among many hyperparameters, and artificially selected hyperparameters can easily make the detection performance of CNN worse. Therefore, the adaptive selection of hyperparameters should also be a crucial research direction in the next step.
- Aiming at less data and lower data complexity in the SAR image dataset, a feasible way is to use unsupervised training methods, in other words, to label a small number of samples, to train the network with unsupervised training parties. In some cases, we could directly use unsupervised training methods, and use this method to cluster. The use of unsupervised training methods can promote detection performance, so the use of unsupervised network training is also a method that cannot be ignored.
- Rationally design the depth of the network. Nowadays, in order to improve detection accuracy, some researchers blindly increase the depth of the network and ignore the network parameters. This situation has caused some detection models to be very bloated, and it is difficult to perform real-time detection at some terminals. Moreover, if a network model is too bloated, its training time is also very long. Therefore, it is essential to pay attention to the impact of network parameters on practical applications. In view of this situation, it is necessary to apply some ideas of the lightweight network to SAR image target detection. The lightweight network is a hot research topic in the CV field. It is an inevitable trend of SAR image target detection to consider both the detection accuracy and the amount of network model parameters.
- Introduce advanced CV algorithms. Nowadays, in CV, in addition to the above traditional deep learning convolution target detection algorithm, the algorithm derived from natural language processing has gradually gained more and more attention. One of the most influential is Transformer [94]. Based on Transformer, some algorithms such as Vision Transformer [95] and Swin Transformer [96] have been generated in the CV field. These two algorithms have obtained perfect results in some top competitions, indicating that the algorithm has better target detection performance in the optical field. Therefore, f making full use of some excellent algorithms can greatly improve the research status of SAR image target detection.
- Although at present, the detection algorithm based on CNN is developing rapidly, we cannot completely abandon those traditional target detection algorithms. The advantages of traditional target detection algorithms are still worth learning. We can fully combine the detection algorithm of deep learning with the traditional target detection algorithm to complement each other and promote the progress of SAR image target detection.
- Attach importance to the development of CNN to a lightweight network. We should not make the designed network very bloated in order to pursue network accuracy, that is, we should not blindly pursue the depth of the network, and should achieve a balance between the detection accuracy and speed of the network.
- A favorable SAR image dataset is still the key factor to decide whether CNN can make full use of its advantages in the SAR field, so the construction of SAR image data set cannot be ignored.
- Some lightweight network optimization methods in CNN are very important and worthy of further study by researchers.
- How to make sufficient use of the self-attention mechanism in transform to maximize its own advantages and enable it to better extract the SAR image features is also a problem worth further investigation.
7. Conclusions
Author Contributions
Funding
Acknowledgments
Conflicts of Interest
References
- Chen, C.; He, C.; Hu, C.; Pei, H.; Jiao, L. MSARN: A Deep Neural Network Based on An Adaptive Recalibration Mechanism for Multiscale and Arbitrary-oriented SAR Ship Detection. IEEE Access 2019, 7, 159262–159283. [Google Scholar] [CrossRef]
- Wang, Y.; Wang, C.; Zhang, H. Ship Classification in High-resolution SAR Images Using Deep Learning of Small Datasets. Sensors 2018, 18, 2929. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Lencun, Y.; Bengio, Y.; Hinton, G. Deep Learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef]
- Zhao, Z.; Zheng, P.; Xu, S.; Wu, X. Object Detection with Deep Learning: A Review. IEEE Trans. Neural Netw. Learn. Syst. 2019, 30, 3212–3232. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Guo, Y.; Liu, Y.; Georgiou, T.; Lew, M.S. A Review of Semantic Segmentation Using Deep Neural Networks. Int. J. Multimed. Inf. Retr. 2018, 7, 87–93. [Google Scholar] [CrossRef] [Green Version]
- Jiao, L.; Zhang, F.; Liu, F.; Yang, S.; Li, L.; Feng, Z.; Qu, R. A Survey of Deep Learning-Based Object Detection. IEEE Access 2019, 7, 128837–128868. [Google Scholar] [CrossRef]
- Yasir, M.; Jianhua, W.; Mingming, X.; Hui, S.; Zhe, Z.; Shanwei, L.; Colak, A.T.I.; Hossain, M.S. Ship Detection Based on Deep Learning Using SAR Imagery: A Systematic Literature Review. Soft Comput. 2022, 26, 1–22. [Google Scholar]
- Chen, W.; Xing, X.; Ji, K. A Survey of Ship Target Recognition in SAR image. Mod. Radar 2012, 34, 53–58. [Google Scholar]
- Zhen, Y.; Liu, W.; Chen, J.; Zhao, Y. Geometric Structure Feature Extraction of Ship Target in High-Resolution SAR Image. Signal Process. 2016, 32, 424–429. [Google Scholar]
- Gu, D.; Xu, X. Multi-Feature Extraction of Ships from SAR Images. In Proceedings of the 2013 6th International Congress on Image and Signal Processing (CISP), Hangzhou, China, 1 December 2013; pp. 454–458. [Google Scholar]
- Sun, Z.; Xiong, B.; Leng, X.; Ji, K. Geometric Structure Feature Extraction of Ship Target in SAR Image Based on Fine Segmentation. Chin. J. Radio Sci. 2020, 35, 585–593. [Google Scholar]
- Novak, L.M.; Owirka, G.J.; Netishen, C.M. Radar Target Identification Using Spatial Matched Filters. Pattern Recognit. 1994, 27, 607–617. [Google Scholar] [CrossRef] [Green Version]
- Ai, J.; Cao, Z.; Mao, Y.; Wang, Z.; Wang, F.; Jin, J. An Improved Bilateral CFAR Ship Detection Algorithm for SAR Image in Complex Environment. J. Radars 2021, 10, 499–515. [Google Scholar]
- Chang, J.; Zhao, J.; Li, N. An Improved 2P-CFAR SAR Ship Detection Method. Foreign Electron. Meas. Technol. 2021, 40, 7–12. [Google Scholar]
- Huang, Y.; Liu, F. Detecting Cars in VHR SAR Images Via Semantic CFAR Algorithm. IEEE Geosci. Remote Sens. Lett. 2016, 13, 801–805. [Google Scholar] [CrossRef]
- Qu, Y.; Huang, J. Ship Target Detection in SAR Images Based on Similarity Judgment. Comput. Digit. Eng. 2019, 47, 231–236. [Google Scholar]
- Lin, X.; Hong, J.; Sun, X.; Yan, Y. New CFAR Ship Detection Algorithm Based on Adaptive Background Clutter Model in Wide Swath SAR Images. Remote Sens. Technol. Appl. 2014, 29, 75–81. [Google Scholar]
- Liu, C.; Zhu, W. Review of Target Detection in SAR Image Based on CNN. J. Ordnance Equip. Eng. 2021, 42, 15–21. [Google Scholar]
- Cheng, B.; Hu, F.; Yang, R. Study on Target Detection of SAR Image Using Improved Fractal Features. J. Electron. Inform. 2009, 31, 164–168. [Google Scholar]
- Charalampidis, D.; Kasparis, T. Wavelet-Based Rotational Invariant Roughness Features for Texture Classification and Segmentation. IEEE Trans Image Process. 2002, 11, 825–837. [Google Scholar] [CrossRef] [PubMed]
- Stein, G.W.; Charalampidis, D. Target detection using an improved fractal scheme. In Proceedings of the SPIE, Orlando, FL, USA, 5 May 2006; pp. 172–180. [Google Scholar]
- Du, H.; Li, X.; Chen, F.; Ye, Q.; Sun, L. Mean Extended Fractal for SAR Image Target Detection. Ship Sci. Technol. 2016, 38, 127–131. [Google Scholar]
- Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning; MIT Press: Cambridge, MA, USA, 2016; pp. 331–360. [Google Scholar]
- Wang, H.; Wang, Y.; Lou, Y.; Song, Z. The role of activation function in CNN. In Proceedings of the 2020 2nd International Conference on Information Technology and Computer Application (ITCA), Guangzhou, China, 1 December 2020; pp. 429–432. [Google Scholar]
- Fukushima, K. Neocognitron: A Self-Organizing Neural Network Model for A Mechanism of Pattern Recognition Unaffected by Shift in Position. Biol. Cybern. 1980, 36, 193–202. [Google Scholar] [CrossRef] [PubMed]
- Lecun, Y. Gradient-Based Learning Applied to Document Recognition. Proc. IEEE 1998, 86, 2278–2324. [Google Scholar] [CrossRef] [Green Version]
- Krizhevsky, A.; Sutskever, I.; Hinton, G.E. ImageNet Classification with Deep Convolutional Neural Networks. Commun. ACM 2017, 60, 84–90. [Google Scholar] [CrossRef] [Green Version]
- Simonyan, K.; Zisserman, A. Very deep convolutional networks for large-scale image recognition. In Proceedings of the International Conference on Learning Representations, San Diego, CA, USA, 7–9 May 2015; pp. 1–14. [Google Scholar]
- Zeiler, M.D.; Fergus, R. Visualizing and understanding convolutional networks. In Proceedings of the 2014 European Conference on Computer Vision, Zurich, Switzerland, 5–12 September 2014; pp. 818–833. [Google Scholar]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 26 June–1 July 2016; pp. 770–778. [Google Scholar]
- Girshick, R.; Donahue, J.; Darrell, T.; Malik, J. Rich feature hierarchies for accurate object detection and semantic segmentation. In Proceedings of the 2014 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Columbus, OH, USA, 23–28 June 2014; pp. 580–587. [Google Scholar]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 2015, 37, 1901–1916. [Google Scholar] [CrossRef] [Green Version]
- Girshick, R. Fast R-CNN. arXiv 2015, arXiv:Abs/1504.08083. [Google Scholar]
- Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Trans. 2017, 39, 1137–1149. [Google Scholar] [CrossRef] [Green Version]
- Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You only look once: Unified, real-time object detection. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 26 June–1 July 2016; pp. 779–788. [Google Scholar]
- Liu, W.; Anguelov, D.; Erhan, D.; Szegedy, C.; Berg, A.C. SSD: Single shot multibox detector. In Proceedings of the 2016 European Conference on Computer Vision, Amsterdam, Netherlands, 11–14 October 2016; pp. 21–37. [Google Scholar]
- Howard, A.G.; Zhu, M.; Chen, B.; Kalenichenko, D.; Wang, W.; Weyand, T.; Andreetto, M.; Adam, H. MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. arXiv 2017, arXiv:Abs/1704.04861. [Google Scholar]
- Zhang, X.; Zhou, X.; Lin, M.; Sun, J. ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices. arXiv 2017, arXiv:Abs/1707.01083. [Google Scholar]
- Li, J.; Qu, C.; Shao, J. Ship detection in SAR images based on an improved Faster R-CNN. In Proceedings of the 2017 SAR in Big Data Era: Models, Methods and Applications, Beijing, China, 13–14 November 2017; pp. 1–6. [Google Scholar]
- Wei, S.; Zeng, X.; Qu, Q.; Wang, M.; Su, H.; Shi, J. HRSID: A High-Resolution SAR Images Dataset for Ship Detection and Instance Segmentation. IEEE Access 2020, 8, 120234–120254. [Google Scholar] [CrossRef]
- Lin, T.-Y.; Maire, M.; Belongie, S.; Hays, J.; Perona, P.; Ramanan, D.; Dollár, P.; Zitnick, C.L. Microsoft COCO: Common Objects in Context. In Proceedings of the 2014 European Conference on Computer Vision, Zurich, Switzerland, 5–12 September 2014; pp. 740–755. [Google Scholar]
- Huang, L.; Liu, B.; Li, B.; Guo, W.; Yu, W.; Zhang, Z.; Yu, W. OpenSARShip: A Dataset Dedicated to Sentinel-1 Ship Interpretation. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2018, 11, 195–208. [Google Scholar] [CrossRef]
- Hummel, R. Model-based ATR using synthetic aperture radar. In Proceedings of the Record of the IEEE 2000 International Radar Conference, Alexandria, VA, USA, 1 February 2000; pp. 856–861. [Google Scholar]
- Song, Q.; Xu, F.; Zhu, X.X.; Jin, Y.Q. Learning to Generate SAR Images with Adversarial Autoencoder. IEEE Trans. Geosci. Remote Sens. 2022, 60, 1–15. [Google Scholar] [CrossRef]
- Wang, Y.; Wang, C.; Zhang, H.; Dong, Y.; Wei, S. Automatic Ship Detection Based on RetinaNet Using Multi-Resolution Gaofen-3 Imagery. Remote Sens. 2019, 11, 531. [Google Scholar] [CrossRef] [Green Version]
- Zhu, C.; Zhao, D.; Qi, J.; Qi, X.; Shi, Z. Cross-domain transfer for ship instance segmentation in SAR images. In Proceedings of the 2021 IEEE International Geoscience and Remote Sensing Symposium, Brussels, Belgium, 11 July 2021; pp. 2206–2209. [Google Scholar]
- Zhang, W.; Zhu, Y.; Fu, Q. Semi-Supervised Deep Transfer Learning-Based on Adversarial Feature Learning for Label Limited SAR Target Recognition. IEEE Access 2019, 7, 152412–152420. [Google Scholar] [CrossRef]
- Lu, Z.; Jia, X.; Zhu, W.; Zeng, C. Study on SAR Image Despeckling Algorithm. J. Ordnance Equip. Eng. 2017, 38, 104–108. [Google Scholar]
- Goodman, J.W. Some Fundamental Properties of Speckle. J. Opt. Soc. Am. 1976, 66, 1145–1150. [Google Scholar] [CrossRef]
- Eom, K.B. Anisotropic Adaptive Filtering for Speckle Reduction in Synthetic Aperture Radar Images. Opt. Eng. 2011, 50, 97–108. [Google Scholar] [CrossRef]
- Lee, J.S.; Jurkevich, L.; Dewaele, P.; Wambacq, P.; Oosterlinck, A. Speckle Filtering of Synthetic Aperture Radar Images: A Review. Remote Sens. Rev. 1994, 8, 313–340. [Google Scholar] [CrossRef]
- Frost, V.S.; Stiles, J.A.; Shanmugan, K.S. A Mode for Radar Image and Its Application to Adaptive Digital Filtering of Multiplicative Noise. IEEE Trans. Pattern Anal. Mach. Intell. 1982, 4, 157–165. [Google Scholar] [CrossRef]
- Torres, L.; Frery, A.C. SAR Image Despeckling Algorithms Using Stochastic Distances and Nonlocal Means. arXiv 2013, arXiv:Abs/1308.4338. [Google Scholar]
- Liu, S.; Hu, S.; Xiao, Y. SAR Image Denoising Based on Wavelet Contourlet Transform and Cycle Spinning. Signal Process. 2011, 27, 837–842. [Google Scholar]
- Liu, S.; Shi, M.; Hu, S.; Xiao, Y. Synthetic Aperture Radar Image De-Noising Based on Shearlet Transform Using the Context-Based Model. Phys. Commun. 2014, 13, 221–229. [Google Scholar] [CrossRef]
- Fang, J.; Wang, D.; Xiao, Y.; Ajay Saikrishna, D. Denoising of SAR images based on wavelet-contourlet domain and PCA. In Proceedings of the 2014 12th International Conference on Signal Processing (ICSP), Hangzhou, China, 19–23 October 2014; pp. 942–945. [Google Scholar]
- Zhang, K.; Zuo, W.; Gu, S.; Zhang, L. Learning deep CNN denoiser prior for image restoration. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 2808–2817. [Google Scholar]
- Lv, B.; Zhao, J. Image Denoising Algorithm Based on Composite Convolutional Neural Network. Pattern Recognit. Artif. Intell. 2017, 30, 97–105. [Google Scholar]
- Wang, P.; Zhang, H.; Patel, V.M. SAR Image Despeckling Using a Convolutional Neural Network. IEEE Signal Process. Lett. 2017, 24, 1763–1767. [Google Scholar] [CrossRef] [Green Version]
- Xu, Q.; Li, W.; Loumbi, P. Applications of Deep Convolutional Neural Network in SAR Automatic Target Recognition: A Summarization. Telecommun. Eng. 2018, 58, 106–112. [Google Scholar]
- Xiao, Q.; Cheng, Y.; Xiao, M.; Zhang, J.; Shi, H.; Niu, L.; Ge, C.; Lang, H. Improved Region Convolutional Neural Network for Ship Detection in Multiresolution Synthetic Aperture Radar Images. Concurr. Comput. Pract. Exper. 2020, 32, 1–10. [Google Scholar] [CrossRef]
- Yue, B.; Han, S. A SAR Ship Detection Method Based on Improved Faster R-CNN. Comput. Mod. 2019, 9, 90–101. [Google Scholar]
- Xue, Y.; Jin, G.; Hou, X.; Tan, L.; Xu, J. SAR Ship Detection Method Incorporating Attention Mechanism and Improved SSD Algorithm. Comput. Appl. Res. 2022, 39, 265–269. [Google Scholar]
- Zhang, Y.; Zhu, W.; Wu, X. Target Detection Based on Fully Convolutional Neural Network for SAR Images. Telecommun. Eng. 2018, 58, 1244–1251. [Google Scholar]
- Fu, X.; Wang, Z. SAR Ship Target Rapid Detection Method Combined with Scene Classification in the Inshore Region. J. Signal Process. 2020, 36, 2123–2130. [Google Scholar]
- Liu, L.; Chen, G.; Pan, Z.; Lei, B.; An, Q. Inshore ship detection in SAR images based on deep neural networks. In Proceedings of the 2018 IEEE International Geoscience and Remote Sensing Symposium, Valencia, Spain, 1 July 2018; pp. 25–28. [Google Scholar]
- Dai, W.; Mao, Y.; Yuan, R.; Liu, Y.; Pu, X.; Li, C. A Novel Detector Based on Convolution Neural Networks for Multiscale SAR Ship Detection in Complex Background. Sensors 2020, 20, 2547. [Google Scholar] [CrossRef]
- Fu, J.; Sun, X.; Wang, Z.; Fu, K. An Anchor-Free Method Based on Feature Balancing and Refinement Network for Multiscale Ship Detection in SAR images. IEEE Trans. Geosci. Remote Sens. 2021, 59, 1331–1344. [Google Scholar] [CrossRef]
- Liu, J.; Zhao, T.; Liu, M. SAR Ship Target Rapid Detection Method Combined with Scene Classification in the Inshore Region. J. Hunan Univ. 2020, 47, 85–91. [Google Scholar]
- Li, Y.; Ding, Z.; Zhang, C.; Wang, Y.; Chen, J. SAR ship detection based on resnet and transfer learning. In Proceedings of the 2019 IEEE International Geoscience and Remote Sensing Symposium, Yokohama, Japan, 28 July–2 August 2019; pp. 1188–1191. [Google Scholar]
- Shao, J.; Qu, C. CNN Based Ship Target Recognition of Imbalanced SAR Image. Electron. Opt. Control 2019, 26, 90–97. [Google Scholar]
- Wang, Z.; Du, L.; Mao, J.; Liu, B.; Yang, D. SAR Target Detection Based on SSD with Data Augmentation and Transfer Learning. IEEE Geosci. Remote Sens. Lett. 2019, 16, 150–154. [Google Scholar] [CrossRef]
- Hu, C.; Chen, C.; He, C.; Pei, H.; Zhang, J. SAR Detection for Small Target Ship Based on Deep Convolutional Neural Network. Chin. J. Inert. Technol. 2019, 27, 397–405. [Google Scholar]
- Radford, A.; Metz, L.; Chintala, S. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks. arXiv 2017, arXiv:abs/1511.06434. [Google Scholar]
- Cui, Z.; Zhang, M.; Cao, Z.; Cao, C. Image Data Augmentation for SAR Sensor Via Generative Adversarial Nets. IEEE Access 2019, 7, 42255–42268. [Google Scholar] [CrossRef]
- Guo, Y.; Du, L.; Lyu, G. SAR Target Detection Based on Domain Adaptive Faster R-CNN with Small Training Data size. Remote Sens. 2021, 13, 4202. [Google Scholar] [CrossRef]
- Chang, Y.-L.; Anagaw, A.; Chang, L.; Wang, Y.; Hsiao, C.-Y.; Lee, W.-H. Ship Detection Based on YOLOv2 for SAR Imagery. Remote Sens. 2019, 11, 786. [Google Scholar] [CrossRef] [Green Version]
- Zhang, T.; Zhang, X.; Shi, J.; Wei, S. High-speed ship detection in SAR images by improved Yolov3. In Proceedings of the 2019 16th International Computer Conference on Wavelet Active Media Technology and Information Processing, Chengdu, China, 13 December 2019; pp. 149–152. [Google Scholar]
- Zhang, T.; Zhang, X. High-Speed Ship Detection in SAR Images Based on A Grid Convolutional Neural Network. Remote Sens. 2019, 11, 1206. [Google Scholar] [CrossRef] [Green Version]
- Shen, F.; Wang, Q.; Jiang, J. Real-Time Target Detection Algorithm Based on Improved Convolutional Neural Network Ship. Comput. Appl. Res. 2020, 4, 316–319. [Google Scholar]
- Chen, S.; Zhan, R.; Wang, W.; Zhang, J. Learning Slimming SAR Ship Object Detector Through Network Pruning And Knowledge Distillation. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2021, 14, 1267–1282. [Google Scholar] [CrossRef]
- Zhou, L.; Wei, S.; Cui, Z.; Fang, J.; Yang, X.; Ding, W. Lira-YOLO: A Lightweight Model for Ship Detection in Radar Images. J. Syst. Eng. Electron. 2020, 31, 950–956. [Google Scholar]
- Zhang, T.; Zhang, X.; Shi, J.; Wei, S. Depthwise Separable Convolution Neural Network for High-Speed SAR Ship Detection. Remote Sens. 2019, 11, 2483. [Google Scholar] [CrossRef] [Green Version]
- Lin, T.-Y.; Dollár, P.; Girshick, R.; He, K.; Hariharan, B.; Belongie, S. Feature pyramid networks for object detection. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21 July 2017; pp. 2117–2125. [Google Scholar]
- Wei, S.; Jiang, P.; Yuan, Q.; Liu, L. SAR Ship Target Detection and Discrimination Model Based on Deep Neural Network. J. Northwestern Polytech. Univ. 2019, 37, 587–593. [Google Scholar] [CrossRef]
- Dou, Q.; Yan, M.; Zhu, D. Small Marine Target Detection in Space-Borne SAR Image Based on YOLO-v5. Appl. Sci. Technol. 2021, 48, 1–7. [Google Scholar]
- Chen, P.; Li, Y.; Zhou, H.; Liu, B.; Liu, P. Detection of Small Ship Objects Using Anchor Boxes Cluster and Feature Pyramid Network Model for SAR Imagery. J. Mar. Sci. Eng. 2020, 8, 112. [Google Scholar] [CrossRef] [Green Version]
- Zhang, D. Ship Detection of SAR Images Based on Deep Learning. Master’ Thesis, Xi’an University of Electronic Science and Technology, Xi’an, China, June 2021. [Google Scholar]
- Liu, S.; Kong, W.; Chen, X.; Xu, M.; Yasir, M.; Zhao, L.; Li, J. Multi-Scale Ship Detection Algorithm Based on a Lightweight Neural Network for Spaceborne SAR Images. Remote Sens. 2022, 14, 1149. [Google Scholar] [CrossRef]
- Cui, Z.; Li, Q.; Cao, Z.; Liu, N. Dense Attention Pyramid Networks for Multi-Scale Ship Detection in SAR Images. IEEE Trans. Geosci. Remote Sens. 2019, 57, 8983–8997. [Google Scholar] [CrossRef]
- Xu, P.; Li, Q.; Zhang, B.; Wu, F.; Zhao, K.; Du, X.; Yang, C.; Zhong, R. On-Board Real-Time Ship Detection in HISEA-1 SAR Images Based on CFAR And Lightweight Deep Learning. Remote Sens. 2021, 13, 1995. [Google Scholar] [CrossRef]
- Cui, Z.; Quan, H.; Cao, Z.; Xu, S.; Ding, C.; Wu, J. SAR Target CFAR Detection Via GPU Parallel Operation. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2018, 11, 4884–4894. [Google Scholar] [CrossRef]
- Kang, M.; Leng, X.; Lin, Z.; Ji, K. A modified Faster R-CNN based on CFAR algorithm for SAR ship detection. In Proceedings of the 2017 International Workshop on Remote Sensing with Intelligent Processing, Shanghai, China, 18–21 May 2017; pp. 1–4. [Google Scholar]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, L.; Polosukhin, I. Attention Is All You Need. arXiv 2017, arXiv:abs/1706.03762. [Google Scholar]
- Dosovitskiy, A.; Beyer, L.; Kolesnikov, A.; Weissenborn, D.; Zhai, X.; Unterthiner, T.; Dehghani, M.; Minderer, M.; Heigold, G.; Gelly, S. An Image Is Worth 16x16 Words: Transformers for Image Recognition at Scale. arXiv 2020, arXiv:2010.11929. [Google Scholar]
- Liu, Z.; Lin, Y.; Cao, Y.; Hu, H.; Wei, Y.; Zhang, Z.; Lin, S.; Guo, B. Swin Transformer: Hierarchical Vision Transformer Using Shifted Windows. arXiv 2021, arXiv:abs/2103.14030. [Google Scholar]
Algorithm Ideology | Advantage | Disadvantage |
---|---|---|
Geometric feature extraction based on azimuth angle [9] | Higher accuracy and better stability | The process is complex and the application is limited |
Multi-feature combination [10] | Effectively extracts the features | High complexity and unfavorable for subsequent processing |
Geometric feature extraction based on fine segmentation [11] | More accurate segmentation and better performance | Features not yet combined with other methods to further exploit the algorithm |
Algorithm Ideology | Advantage | Disadvantage |
---|---|---|
IB-CFAR in the complex environment [13] | Strong robustness and high detection rate | Partial false alarms would occur |
Two-Parameter CFAR detection [14] | Low FAR and high precision | Worse robustness and universality, incomplete individual parameters |
A CFAR algorithm based on shadow feature semantics [15] | Lower FAR | Large amount of computation, poor migration generalization ability |
An improved CFAR based on similarity judgment and attention mechanism [16] | High efficiency, strong ability of the multi-scale target detection | High requirements for device memory, unfavorable detection of large numbers of images |
CFAR based on adaptive background clutter model [17] | Lower missed detection rate and higher accuracy | Long detection time and lower efficiency |
Algorithm Ideology | Advantage | Disadvantage |
---|---|---|
Target detection by using improved fractal feature [19] | Low FAR, better spatial resolution and more accurate position indication | As the background complexity increases, the FAR also increases |
Algorithm of combining single-scale and multi-scale features [20,21] | Lower FAR, strong ability to distinguish targets, high accuracy | Imperfection classification techniques, incomplete comparison |
Mean extended fractal [22] | Better resolution for bright and dark targets | Poor algorithm robustness |
Target Detection Algorithm | Backbone Network | Image Size | mAP (VOC) | mAP50 (COCO) | FPS (TitanX) |
---|---|---|---|---|---|
YOLOv1 | 24-layer convolution | 448 × 448 | 63.4 | - | 45 |
YOLOv2 | Darknet-19 | 416 × 416 | 76.8 | 44.0 | 67 |
YOLOv3 | Darknet-53 | 416 × 416 | - | 55.3 | 78 |
SSD300 | VGG-16 | 300 × 300 | 74.3 | 41.2 | 46 |
SSD512 | ResNet-101 | 512 × 512 | 76.8 | 46.5 | 19 |
Faster R-CNN | VGG-16 | 1000 × 600 | 73.2 | 42.7 | 7 |
Dataset | Height | Width | Sample Size | Target Quantity | Average Height/Pixel | Average Width/Pixel | Average Area/Pixel |
---|---|---|---|---|---|---|---|
SSDD | 190~256 | 214~668 | 1160 | 2540 | 39.05 | 36.08 | 1882.89 |
HRSID | 800 | 800 | 5604 | 16,951 | 33.16 | 37.65 | 1809.72 |
SAR-Ship-Dataset | 256 | 256 | 43,819 | 59,535 | 33.32 | 31.32 | 1133.88 |
Category | Training Set of Pitch Angle 17° | Test Set of Pitch Angle 15° |
---|---|---|
BMP2(sn-9566) | 233 | 196 |
BTR70(sn-c71) | 233 | 196 |
T72(sn-132) | 232 | 196 |
BTR60 | 256 | 195 |
2S1 | 299 | 274 |
BRDM2 | 298 | 274 |
D7 | 299 | 274 |
T62 | 299 | 273 |
ZIL131 | 299 | 274 |
ZSU23/4 | 299 | 274 |
Summation | 2747 | 3203 |
Name | Title Principal Method |
---|---|
Rotation, flip, zoom, pan | To rotate an image at a certain angle, flip, zoom in or out, or shift in a plane |
Reflexive transformation | Axial reflection transformation, specular reflection transformation of images |
Scale transformation | Scale the image according to the scale factor to adjust the blur degree of the image |
Noise disturbance | Adding some noises such as exponential, salt, pepper, and Gaussian noise |
Algorithm Improvement | Specific Ideas | Pending Problems | Algorithm Effect |
---|---|---|---|
Algorithmic Optimization | Integrating attention mechanism | SAR target detection in complex background | Ref. [63]: high accuracy and strong robustness |
Converting target detection to pixel classification | The interference problem of scene clutter | Ref. [64]: lower FAR and better universality | |
Combining scene classification | Land-sea segmentation | Ref. [65]: better detection accuracy, faster speed | |
Lightweighting the network with knowledge distillation | Algorithm detection speed and algorithm redundancy | Ref. [81]: minor algorithmic size, faster detection speed | |
Combining dese and residual connection, cluster convolution | Model real-time detection | Ref. [82]: low algorithmic complexity, small parameters, high accuracy | |
Expansion of the Dataset | Enhancing datasets by using GAN | Insufficient SAR image data | Ref. [75]: effectively detects small targets, and expands the dataset |
Multi-feature Fusion | Combining adaptive anchor box algorithm and GDAL | Small target detection in spaceborne SAR images | Ref. [86]: high speed and low missed detection rate |
Introducing ResNet and FPN | Multi-scale small target detection | Ref. [73]: faster detection, better real-time performance | |
Introducing attention mechanism and feature noise reduction | Multi-scale small target detection | Ref. [88]: meets the requirements of real-time detection | |
Combination of CNN and Traditional Algorithm | Combining CFAR with convolution and pooling | Long detection time of traditional algorithm | Ref. [92]: higher detection efficiency and shorter running time |
Algorithm Classification | Advantage | Disadvantage |
---|---|---|
Target detection algorithm based on structural feature | Fine stability, fast detection speed | Needs prior information, easy to be affected by clutter |
Target detection algorithm based on gray feature | Nice stability, easy implement | Needs prior information, difficult to establish a unified target statistical model |
Target detection algorithm based on image texture feature | High accuracy | Poor robustness, difficult to extract features |
Target detection algorithm based on CNN | High accuracy and fast detection | Strong dependence on the sample, higher requirements of hardware and computing power |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zhang, Y.; Hao, Y. A Survey of SAR Image Target Detection Based on Convolutional Neural Networks. Remote Sens. 2022, 14, 6240. https://doi.org/10.3390/rs14246240
Zhang Y, Hao Y. A Survey of SAR Image Target Detection Based on Convolutional Neural Networks. Remote Sensing. 2022; 14(24):6240. https://doi.org/10.3390/rs14246240
Chicago/Turabian StyleZhang, Ying, and Yisheng Hao. 2022. "A Survey of SAR Image Target Detection Based on Convolutional Neural Networks" Remote Sensing 14, no. 24: 6240. https://doi.org/10.3390/rs14246240
APA StyleZhang, Y., & Hao, Y. (2022). A Survey of SAR Image Target Detection Based on Convolutional Neural Networks. Remote Sensing, 14(24), 6240. https://doi.org/10.3390/rs14246240