Lightweight Segmentation Method for Wood Panel Images Based on Improved DeepLabV3+
Abstract
:1. Introduction
- An improved DeepLabV3+ network model is proposed to address the challenges of segmentation accuracy and detection performance in wood image segmentation tasks, particularly those related to large model parameters and high computational complexity.
- A multi-level upsampling feature fusion mechanism combined with a coordinate attention mechanism is proposed to enhance feature extraction and integration capabilities, which can more effectively capture key details and local features in wood images, and contribute to the improvement of segmentation results.
- The proposed model is well balanced between segmentation speed and accuracy through architectural modifications that make it suitable for practical applications in complex industrial environments.
2. Development of the Improved DeepLabV3+ Model Architecture
2.1. Overall Model Architecture
2.2. Feature Extraction Based MobileNetV3
2.3. Integration of the Coordinate Attention Mechanism
- Global Pooling Decomposition: Instead of traditional global pooling, the CA mechanism performs pooling operations separately in both the horizontal and vertical directions. This generates feature maps with dimensions of 1 × H × C and W × 1 × C, respectively. The calculation formula is as follows:
- Feature Map Concatenation and Dimensionality Reduction: The feature maps from the horizontal and vertical pooling operations are concatenated, resulting in a feature map with dimensions of 1 × (H + W) × C. Subsequently, a 1 × 1 convolution is applied to reduce the number of channels to C/r. After batch normalization and activation, a new feature map, denoted as F1, with dimensions of 1 × (H + W) × C/r is obtained.
- Feature Map Splitting and Attention Weight Calculation: The feature map F1 is split into two separate feature maps, Fh1 and Fw1, corresponding to the height and width directions, respectively. These feature maps are then processed through activation functions to generate attention weights, denoted as Ah for the height direction and Aw for the width direction. The calculation formula is as follows:
- Weighted Calculation: The attention weights Ah and Aw are applied to the original feature map. The final feature map with attention weights is obtained through element-wise multiplication, enhancing the network’s focus on critical regions.
2.4. Feature Fusion for Integrating Multi-Scale Spatial Information
3. Experimental Results and Analysis
3.1. Overview of the Semantic Segmentation Dataset
3.2. Experimental Environment and Network Model Training
3.3. Evaluation Indicators
- Mean Intersection and Union Ratio (MIoU):
- 2.
- Precision:
- 3.
- Recall:
- 4.
- Segmentation speed:
- 5.
- Model Parameter Size:
3.4. Comparison of Experimental Results
3.4.1. Ablation Experiment
- Group A utilized the original DeepLabV3+ network model as the baseline for comparison.
- Group B replaced the backbone with MobileNetV3, which resulted in a significant reduction in the model’s parameter count to 22.3 M and improved detection speed, with minimal impact on accuracy.
- Group C, building on Group B, introduced the CA mechanism. This modification resulted in a notable increase in recall and accuracy, and the MIoU improved by 0.86%, demonstrating the ability of CA to enhance feature extraction and mitigate background interference.
- Group D, further refining Group C by improving the feature fusion structure, achieved an accuracy, recall, and MIoU of 99.03%, 99.50%, and 98.43%, respectively. Additionally, model parameters were reduced by 89.3%, and the segmentation speed was increased by 59.2%.
3.4.2. Comparison with Other Semantic Segmentation Algorithms
3.4.3. Analysis of Segmentation Results
4. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Pulkki, R.; McDonald, G. Wood as a renewable resource: Societal benefits and sustainable development. J. Sustain. For. 2020, 39, 573–589. [Google Scholar]
- Wiedenhoeft, A.C.; Miller, R.B. Wood Handbook: Wood as an Engineering Material; Forest Products Laboratory: Madison, WI, USA, 2019; pp. 1–5. ISBN 978-1422050068. [Google Scholar]
- Tsoumis, G. Science and Technology of Wood: Structure, Properties, Utilization; Van Nostrand Reinhold: New York, NY, USA, 2020; pp. 255–275. ISBN 978-0442243181. [Google Scholar]
- Zobel, B.J.; van Buijtenen, J.P. Wood Variation: Its Causes and Control; Springer: Berlin/Heidelberg, Germany, 2019; pp. 65–85. [Google Scholar] [CrossRef]
- Simatupang, M.H.; Welling, J. Wood processing: An overview of the industry. Wood Sci. Technol. 2021, 55, 505–530. [Google Scholar] [CrossRef]
- Zhang, X.; Wu, Q.; Zuo, W.; Lin, L. Machine vision in automated manufacturing: Current status and challenges. IEEE Trans. Ind. Inform. 2021, 17, 3480–3492. [Google Scholar]
- Li, X.; Zhang, Y.; Wang, Y.; Wang, X. Wood defect detection based on image segmentation using machine vision. J. Manuf. Process. 2019, 35, 120–128. [Google Scholar] [CrossRef]
- Sonka, M.; Hlavac, V.; Boyle, R. Image Processing, Analysis, and Machine Vision; Cengage Learning: Boston, MA, USA, 2014; pp. 650–660. ISBN 978-1133593607. [Google Scholar]
- Singh, N.; Vatsa, M.; Singh, R.; Noore, A. Automated Face Recognition via a Lightweight Threshold Segmentation Algorithm. IEEE Access 2020, 8, 68105–68115. [Google Scholar]
- Sun, J.; Yin, Y.; Luo, Z.; Zhang, G. A novel image thresholding method based on Gaussian mixture model. Neurocomputing 2021, 452, 535–544. [Google Scholar] [CrossRef]
- Xie, S.; Tu, Z. Holistically-Nested Edge Detection. Int. J. Comput. Vis. 2017, 125, 3–18. [Google Scholar] [CrossRef]
- Ma, H.; Qi, W.; Zhang, M.; Zhang, T.; Zhang, J. Canny edge detection enhancement by soft threshold and interpolation optimization. Signal Process. Image Commun. 2021, 91, 116039. [Google Scholar]
- Fan, J.; Zeng, G.; Xia, X.; Wu, Y.; Zhu, Q. A New Robust Adaptive Region-Based Image Segmentation Model Using Local Statistical Information. IEEE Trans. Image Process. 2018, 27, 827–837. [Google Scholar]
- Khan, S.; Siddiqui, M.F.; Ullah, A. An efficient region-based segmentation method using local binary pattern and morphological operations. Multimed. Tools Appl. 2021, 80, 18877–18897. [Google Scholar]
- Gonzalez, J.; Sanchez-Azofeifa, G. Image Processing and Morphological Transformations for Remote Sensing Forest Applications. Remote Sens. 2020, 12, 1653. [Google Scholar] [CrossRef]
- Mehta, A.; Manhas, J.; Kumar, R. Morphological segmentation of MRI data for object separation. J. Appl. Res. Technol. 2020, 18, 59–68. [Google Scholar]
- Gonzalez, R.C.; Woods, R.E. Digital Image Processing; Pearson: London, UK, 2018; pp. 520–530. ISBN 978-0133356724. [Google Scholar]
- Wang, S.; Fan, J.; Wang, L. Challenges in traditional image segmentation for industrial applications: A review. Comput. Ind. 2020, 125, 103328. [Google Scholar] [CrossRef]
- Long, J.; Shelhamer, E.; Darrell, T. Fully Convolutional Networks for Semantic Segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA, 7–12 June 2015; pp. 3431–3440. [Google Scholar]
- Zhao, H.; Shi, J.; Qi, X.; Wang, S.; Jia, J. Pyramid Scene Parsing Network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 6230–6239. [Google Scholar]
- Lin, T.-Y.; Goyal, P.; Girshick, R.; He, K.; Dollár, P. Focal Loss for Dense Object Detection. IEEE Trans. Pattern Anal. Mach. Intell. 2020, 42, 318–327. [Google Scholar] [CrossRef]
- Yang, W.; Wang, Y.; Zhang, X.; Zhu, L.; Ren, Z.; Ji, Y.; Li, L.; Xie, Y. RCEAU-Net: Cascade Multi-Scale Convolution and Attention-Mechanism-Based Network for Laser Beam Target Image Segmentation with Complex Background in Coal Mine. Sensors 2024, 24, 2552. [Google Scholar] [CrossRef] [PubMed] [PubMed Central]
- Ju, A.; Wang, Z. A novel fully convolutional network based on marker-controlled watershed segmentation algorithm for industrial soot robot target segmentation. Evol. Intell. 2023, 16, 963–980. [Google Scholar] [CrossRef]
- Wang, H.; Zhang, X.; Li, Y.; Zhang, Y. Applications of Deep Learning in the Wood Industry: A Review. Forests 2021, 12, 1341. [Google Scholar] [CrossRef]
- Zhou, Y.; Wang, S.; Sun, Y.; Liu, L. Wood Defect Detection Based on Improved Deep Learning Algorithm. Sensors 2020, 20, 4891. [Google Scholar] [CrossRef]
- He, T.; Liu, Y.; Xu, C.; Zhou, X.; Hu, Z.; Fan, J. A Fully Convolutional Neural Network for Wood Defect Location and Identification. IEEE Access 2019, 7, 123453–123462. [Google Scholar] [CrossRef]
- Hu, K.; Wang, B.; Shen, Y.; Guan, J.; Cai, Y. Defect identification method for poplar veneer based on progressive growing generated adversarial network and MASK R-CNN Model. BioResources 2020, 15, 3041–3052. [Google Scholar] [CrossRef]
- Tao, R.; Li, S.; Zhang, X.; Wang, W. Patch-U-Net: A Semantic Segmentation Method for Tree Species Classification Using Remote Sensing Data. Remote Sens. 2021, 13, 542. [Google Scholar]
- Moru, D.K.; Borro, D. Analysis of Different Parameters of Influence in Industrial Cameras Calibration Processes. Measurement 2021, 171, 108750. [Google Scholar] [CrossRef]
- Bürmen, M.; Pernuš, F.; Likar, B. LED Light Sources: A Survey of Quality-Affecting Factors and Methods for Their Assessment. Meas. Sci. Technol. 2008, 19, 122002. [Google Scholar] [CrossRef]
- Jia, L.; Wang, T.; Chen, Y.; Zang, Y.; Li, X.; Shi, H.; Gao, L. MobileNet-CA-YOLO: An Improved YOLOv7 Based on the MobileNetV3 and Attention Mechanism for Rice Pests and Diseases Detection. Agriculture 2023, 13, 1285. [Google Scholar] [CrossRef]
- He, J.; Cheng, Y.; Wang, W.; Ren, Z.; Zhang, C.; Zhang, W. A Lightweight Building Extraction Approach for Contour Recovery in Complex Urban Environments. Remote Sens. 2024, 16, 740. [Google Scholar] [CrossRef]
- Peng, H.; Xue, C.; Shao, Y.; Chen, K.; Xiong, J.; Xie, Z.; Zhang, L. Semantic Segmentation of Litchi Branches Using DeepLabV3+ Model. IEEE Access 2020, 8, 164546–164555. [Google Scholar] [CrossRef]
- Perez, L.; Wang, J. The Effectiveness of Data Augmentation in Image Classification Using Deep Learning. arXiv 2017, arXiv:1712.04621. Available online: https://arxiv.org/abs/1712.04621 (accessed on 17 October 2024).
- Howard, A.G.; Zhu, M.; Chen, B.; Kalenichenko, D.; Wang, W.; Weyand, T.; Andreetto, M.; Adam, H. MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications. arXiv 2017, arXiv:1704.04861. Available online: https://arxiv.org/abs/1704.04861 (accessed on 17 October 2024).
- Howard, A.; Sandler, M.; Chen, B.; Wang, W.; Chen, L.-C.; Tan, M.; Chu, G.; Vasudevan, V.; Zhu, Y.; Pang, R.; et al. Searching for MobileNetV3. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Republic of Korea, 27 October–2 November 2019; pp. 1314–1324. [Google Scholar] [CrossRef]
- Sandler, M.; Howard, A.; Zhu, M.; Zhmoginov, A.; Chen, L.-C. MobileNetV2: Inverted Residuals and Linear Bottlenecks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–22 June 2018; pp. 4510–4520. [Google Scholar]
- Hou, Q.; Zhou, D.; Feng, J. Coordinate Attention for Efficient Mobile Network Design. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, 19–25 June 2021; pp. 13713–13722. [Google Scholar]
- Li, Y.; Geng, T.; Stein, S.; Li, A.; Yu, H. GAAF: Searching Activation Functions for Binary Neural Networks through Genetic Algorithm. Tsinghua Sci. Technol. 2022, 28, 207–220. [Google Scholar] [CrossRef]
- Wang, R.; Zhang, G.; Liang, F.; Wang, B.; Mou, X.; Chen, Y.; Sun, P.; Wang, C. WPS-Dataset: A benchmark for wood plate segmentation in bark removal processing. arXiv 2024, arXiv:2404.11051. [Google Scholar] [CrossRef]
- Ronneberger, O.; Fischer, P.; Brox, T. U-Net: Convolutional Networks for Biomedical Image Segmentation. In Proceedings of the Medical Image Computing and Computer-Assisted Intervention (MICCAI), Munich, Germany, 5–9 October 2015; pp. 234–241. [Google Scholar]
Input | Operator | Exp Size | #out | SE | NL | s |
---|---|---|---|---|---|---|
512 × 512 × 3 | Conv2d, 3 × 3 | - | 16 | - | HS | 2 |
256 × 256 × 16 | bneck, 3 × 3 | 16 | 16 | √ | RE | 2 |
128 × 128 × 16 | bneck, 3 × 3 | 72 | 24 | - | RE | 2 |
64 × 64 × 24 | bneck, 3 × 3 | 88 | 24 | - | RE | 1 |
64 × 64 × 24 | bneck, 5 × 5 | 96 | 40 | √ | HS | 2 |
32 × 32 × 40 | bneck, 5 × 5 | 240 | 40 | √ | HS | 1 |
32 × 32 × 40 | bneck, 5 × 5 | 240 | 40 | √ | HS | 1 |
32 × 32 × 40 | bneck, 5 × 5 | 120 | 48 | √ | HS | 1 |
32 × 32 × 48 | bneck, 5 × 5 | 144 | 48 | √ | HS | 1 |
32 × 32 × 48 | bneck, 5 × 5 | 288 | 96 | √ | HS | 1 |
32 × 32 × 96 | bneck, 5 × 5 | 576 | 96 | √ | HS | 1 |
32 × 32 × 96 | bneck, 5 × 5 | 576 | 96 | √ | HS | 1 |
32 × 32 × 96 | bneck, 5 × 5 | 96 | 576 | √ | HS | 1 |
32 × 32 × 576 | Conv2d | - | 320 | - | - | 1 |
Experimental Group | Improvement Method | Precision/% | Recall/% | MIoU/% | Parameter Size/M | Segmentation Speed/(s/Image) |
---|---|---|---|---|---|---|
A | DeeplabV3+ | 98.36 | 98.82 | 97.38 | 20.9 | 0.452 |
B | MobileNetV3 | 98.02 | 98.75 | 97.08 | 22.3 | 0.177 |
C | MobileNetV3+CA | 98.85 | 99.37 | 97.94 | 22.3 | 0.180 |
D | Final | 99.03 | 99.50 | 98.43 | 22.4 | 0.184 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Mou, X.; Chen, H.; Yu, X.; Chen, L.; Peng, Z.; Wang, R. Lightweight Segmentation Method for Wood Panel Images Based on Improved DeepLabV3+. Electronics 2024, 13, 4658. https://doi.org/10.3390/electronics13234658
Mou X, Chen H, Yu X, Chen L, Peng Z, Wang R. Lightweight Segmentation Method for Wood Panel Images Based on Improved DeepLabV3+. Electronics. 2024; 13(23):4658. https://doi.org/10.3390/electronics13234658
Chicago/Turabian StyleMou, Xiangwei, Hongyang Chen, Xinye Yu, Lintao Chen, Zhujing Peng, and Rijun Wang. 2024. "Lightweight Segmentation Method for Wood Panel Images Based on Improved DeepLabV3+" Electronics 13, no. 23: 4658. https://doi.org/10.3390/electronics13234658
APA StyleMou, X., Chen, H., Yu, X., Chen, L., Peng, Z., & Wang, R. (2024). Lightweight Segmentation Method for Wood Panel Images Based on Improved DeepLabV3+. Electronics, 13(23), 4658. https://doi.org/10.3390/electronics13234658