Toward Efficient Image Recognition in Sensor-Based IoT: A Weight Initialization Optimizing Method for CNN Based on RGB Influence Proportion
Abstract
:1. Introduction
- (i)
- It presents an initialization option for CNN to make use of RGB proportions, which can be modified in the future, and thereby shortens the convergence of the learning models.
- (ii)
- It proved that k-NN can be used to extract the early proportion for the RGB-based initialization, which can be deemed as a pre-training process.
2. Related Work
2.1. Weight Initialization Methods
2.2. K-Nearest Neighbors Algorithm
3. Materials and Method
3.1. Utilizing the RGB Influence Proportion
3.2. Experiment Configurations
Learning Models
3.3. Weight Initialization
3.4. k-NN for the Early Proportion
4. Performing Experiments
4.1. Experiment Setup
4.2. Experimental Results
4.2.1. Results of Proposed Method on CNN
4.2.2. RGB-Based Initialization on a Fully-Connected Network
4.2.3. Evaluating the Changes of Weights
4.2.4. Early Proportion by k-NN
5. Discussion
- (i)
- As far as a specific dataset is concerned, the RGB influence proportion is existed and fixed. It is not a variable. It is supposed to be given by some other means. In this paper, we got the proportion by ensemble learning and k-NN, while one is too intricate and another one is slightly inaccurate. A more convenient way to compute the proportion is still in need. Besides, if the RGB proportion can be made as a property of the dataset, whoever needs to use the proportion can make use of it directly without measuring the proportion by himself.
- (ii)
- The approach we use to deal with the RGB proportion is quite simple right now, as we just apply it on the original standard normal distribution and utilize it on the first layer of the model, even though it is proven to be useful in first epochs. Firstly, the not ideal result might be the consequence of using a learning model with too many layers, so that the total influence is reduced. Additionally, we can try forward propagation after generating the first layer to fully use the color difference. These are both worth further investigation and experimentation.
- (iii)
- The difference among three channels from the dataset we used here is not very significant in this work. Thus, a more specific dataset which includes data with higher RGB proportions is recommended. Additionally, it is worth mentioning that in some image recognition cases, color difference might be much more evident. In that case, the proposed method will be more effective.
- (iv)
- Currently, there are different initialization methods, such as He and Xavier initialization, which can also be applied like the proposed method but probably with better performance. A controlled experiment with them is recommended so that the conclusion can be more trustworthy.
- (v)
- Recently, it has been a great trend of the IoT system to include more visual sensor-based devices and keep data security in the process of analysis. In order to realize it, the proposed method which is used to optimize the training process can be utilized in IoT systems after some modifications and improvements in the future.
6. Conclusions and Future Work
Author Contributions
Funding
Conflicts of Interest
References
- Islam, S.R.; Kwak, D.; Kabir, M.H.; Hossain, M.; Kwak, K.S. The internet of things for health care: A comprehensive survey. IEEE Access 2015, 3, 678–708. [Google Scholar] [CrossRef]
- Song, F.; Zhu, M.; Zhou, Y.; You, I.; Zhang, H. Smart Collaborative Tracking for Ubiquitous Power IoT in Edge-Cloud Interplay Domain. IEEE Internet Things J. 2019, 1. [Google Scholar] [CrossRef]
- Li, S.; Sun, W. Utility maximisation for resource allocation of migrating enterprise applications into the cloud. In Enterprise Information Systems; Taylor & Francis: Abingdon, UK, 2020. [Google Scholar]
- Yu, X.; Wang, J.; Kays, R.; Jansen, P.A.; Wang, T.; Huang, T. Automated identification of animal species in camera trap images. EURASIP J. Image Video Process. 2013, 1, 52. [Google Scholar] [CrossRef] [Green Version]
- Hsu, Y.-L.; Chou, P.-H.; Chang, H.-C.; Lin, S.-L.; Yang, S.-C.; Su, H.-Y.; Chang, C.-C.; Cheng, Y.-S.; Kuo, Y.-C. Design and Implementation of a Smart Home System Using Multisensor Data Fusion Technology. Sensors 2017, 17, 1631. [Google Scholar] [CrossRef]
- Chena, Y.; Zhao, D.; Le, L.; Zhang, Q. Multi-task learning for dangerous object detection in autonomous driving. Inf. Sci. 2017, 432, 559–571. [Google Scholar] [CrossRef]
- Song, F.; Zhou, Y.; Chang, L.; Zhang, H. Modeling Space-Terrestrial Integrated Networks with Smart Collaborative Theory. IEEE Netw. 2019, 33, 51–57. [Google Scholar] [CrossRef]
- Lee, Y.-S.; Chung, W.-Y. Visual Sensor Based Abnormal Event Detection with Moving Shadow Removal in Home Healthcare Applications. Sensors 2012, 12, 573–584. [Google Scholar] [CrossRef]
- Song, F.; Zhou, Y.-T.; Wang, Y.; Zhao, T.; You, I.; Zhang, H. Smart Collaborative Distribution for Privacy Enhancement in Moving Target Defense. Inf. Sci. 2019, 479, 593–606. [Google Scholar] [CrossRef]
- Ai, Z.-Y.; Liu, Y.; Chang, L.; Lin, F.; Song, F. A Smart Collaborative Authentication Framework for Multi-Dimensional Fine-Grained Control. IEEE Access 2020, 8, 8101–8113. [Google Scholar] [CrossRef]
- Girshick, R.; Donahue, J.; Darrell, T.; Malik, J. Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Columbus, OH, USA, 24–27 June 2014; pp. 580–587. [Google Scholar]
- Song, F.; Ai, Z.-Y.; Zhou, Y.-T.; You, I.; Choo, K.R.; Zhang, H. Smart Collaborative Automation for Receive Buffer Control in Multipath Industrial Networks. IEEE Trans. Ind. Inform. 2020, 16, 1385–1394. [Google Scholar] [CrossRef]
- Ai, Z.-Y.; Liu, Y.; Song, F.; Zhang, H. A Smart Collaborative Charging Algorithm for Mobile Power Distribution in 5G Networks. IEEE Access 2018, 6, 28668–28679. [Google Scholar] [CrossRef]
- Ai, Z.-Y.; Zhou, Y.-T.; Song, F. A Smart Collaborative Routing Protocol for Reliable Data Diffusion in IoT Scenarios. Sensors 2018, 18, 1926. [Google Scholar] [CrossRef] [Green Version]
- Mohammadi, M.; Al-Fuqaha, A.; Sorour, S.; Guizani, M. Deep Learning for IoT Big Data and Streaming Analytics: A Survey. IEEE Commun. Surv. Tutorials 2018, 20, 2923–2960. [Google Scholar] [CrossRef] [Green Version]
- Hurlbert, A.; Ling, Y. Understanding colour perception and preference. In Colour Design: Theories and Applications, 2nd ed.; Best, J., Ed.; Woodhead Publishing: Oxford, UK, 2012; pp. 129–157. [Google Scholar]
- Alexander, G.S.; Raquel, U. Fully Connected Deep Structured Networks. arXiv 2015, arXiv:1503.02351. [Google Scholar]
- Saxe, A.; Mcclelland, J.; Ganguli, S. Exact solutions to the nonlinear dynamics of learning in deep linear neural networks. arXiv 2013, arXiv:1312.6120. [Google Scholar]
- Lilleberg, J.; Zhu, Y.; Zhang, Y. Support vector machines and Word2vec for text classification with semantic features. In Proceedings of the IEEE 14th International Conference on Cognitive Informatics & Cognitive Computing (ICCI*CC), Beijing, China, 6–8 July 2015; pp. 136–140. [Google Scholar]
- Graves, A.; Mohamed, A.; Hinton, G. Speech recognition with deep recurrent neural networks. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, Vancouver, BC, Canada, 26–31 May 2013; pp. 6645–6649. [Google Scholar]
- O’Shea, T.; Hoydis, J. An Introduction to Deep Learning for the Physical Layer. IEEE Trans. Cogn. Commun. Netw. 2017, 3, 563–575. [Google Scholar] [CrossRef] [Green Version]
- Hinton, G.; Vinyals, O.; Dean, J. Distilling the Knowledge in a Neural Network. arXiv 2015, arXiv:1503.02531. [Google Scholar]
- Sutskever, I.; Martens, J.; Dahl, G.; Hinton, G. On the importance of initialization and momentum in deep learning. In Proceedings of the International Conference on International Conference on Machine Learning 2013 (ICML2013), Atlanta, GA, USA, 16–21 June 2013. [Google Scholar]
- Mishkin, D.; Matas, J. All you need is a good init. In Proceedings of the International Conference on Learning Representations (ICLR), San Juan, Puerto Rico, 2–4 May 2016. [Google Scholar]
- Hu, Y.; Huber, A.; Anumula, J.; Liu, S.C. Overcoming the vanishing gradient problem in plain recurrent networks. arXiv 2018, arXiv:1801.06105. [Google Scholar]
- Xavier, G.; Bengio, Y. Understanding the difficulty of training deep feedforward neural networks. J. Mach. Learn. Res. Proc. Track 2010, 9, 249–256. [Google Scholar]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification. In Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, 7–13 December 2015; pp. 1026–1034. [Google Scholar]
- Zhang, H.; Dauphin, Y.; Ma, T. Fixup Initialization: Residual Learning without Normalization. arXiv 2019, arXiv:1901.09321. [Google Scholar]
- Schwedhelm, P.; Baldauf, D.; Treue, S. The lateral prefrontal cortex of primates encodes stimulus colors and their behavioral relevance during a match-to-sample task. Sci. Rep. 2020, 10, 4216. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- LeCun, Y.; Kavukcuoglu, K.; Farabet, C. Convolutional networks and applications in vision. In Proceedings of the 2010 IEEE International Symposium on Circuits and Systems, Paris, France, 30 May–2 June 2010; pp. 253–256. [Google Scholar]
- Lai, K.; Bo, L.; Ren, X.; Fox, D. Detection-based object labeling in 3D scenes. In Proceedings of the 2012 IEEE International Conference on Robotics and Automation, Saint Paul, MN, USA, 14–18 May 2012; pp. 1330–1337. [Google Scholar]
- David, S.; Abbott, L.F. Random Walk Initialization for Training Very Deep Feedforward Networks. arXiv 2014, arXiv:1412.6558. [Google Scholar]
- Krähenbühl, P.; Doersch, C.; Donahue, J.; Darrell, T. Data-dependent Initializations of Convolutional Neural Networks. arXiv 2015, arXiv:1511.06856. [Google Scholar]
- Abeywickrama, T.; Cheema, M.A.; Taniar, D. k-Nearest Neighbors on Road Networks: A Journey in Experimentation and In-Memory Implementation. arXiv 2016, arXiv:1601.01549. [Google Scholar] [CrossRef]
- Syaliman, K.; Nababan, E.; Sitompul, O. Improving the accuracy of k-nearest neighbor using local mean based and distance weight. J. Phys. Conf. Ser. 2018, 978, 012047. [Google Scholar] [CrossRef] [Green Version]
- Cover, T.; Hart, P. Nearest neighbor pattern classification. IEEE Trans. Inf. Theory 1967, 13, 21–27. [Google Scholar] [CrossRef]
- Deng, Z.; Jiang, Y.; Cao, Y.; Yi, Y.; Li, P.; Liu, Q. Evaluating the Influence of Separated RGB Channels with Ensemble Learning. In Proceedings of the 11th International Conference on Graphics and Image Processing (ICGIP), Hangzhou, China, 12–14 October 2019. [Google Scholar]
- Hahnloser, R.H.R.; Seung, H.S.; Slotine, J. Permitted and Forbidden Sets in Symmetric Threshold-Linear Networks. Neural Comput. 2013, 15, 621–638. [Google Scholar] [CrossRef]
- Srivastava, N.; Hinton, G.; Krizhevsky, A.; Sutskever, I. Salakhutdinov, R. Dropout: A Simple Way to Prevent Neural Networks from Overfitting. J. Mach. Learn. Res. 2014, 15, 1929–1958. [Google Scholar]
- François, C. Keras. Available online: https://github.com/keras-team/keras (accessed on 8 February 2020).
- Google Brain Project. Tensorflow. Available online: https://github.com/tensorflow/tensorflow (accessed on 19 January 2020).
- Krizhevsky, A. Learning Multiple Layers of Features from Tiny Images. Technical Report(TR-2009); University of Toronto: Toronto, ON, Canada, 2009. [Google Scholar]
- Kingma, D.; Ba, J. Adam: A Method for Stochastic Optimization. In Proceedings of the International Conference on Learning Representations (ICLR), Banff, AB, Canada, 14–16 April 2014. [Google Scholar]
Method | Using Color Influence Differences | Influence on Convergence |
---|---|---|
Traditional CNN | In the convolution process | / |
Gaussian distribution initialization | / | / |
Xavier Initialization | / | By keeping variance consistent |
He Initialization | / | Xavier’s method based on ReLU |
Layer (Type) | Output Shape | Param |
---|---|---|
input ( RGB image) | ||
conv2d_1 (Conv2D) | (None, 32, 32, 32) | 896 |
dropout_1 (Dropout) | (None, 32, 32, 32) | 0 |
conv2d_2 (Conv2D) | (None, 32, 32, 32) | 9248 |
max_pooling2d_1 (MaxPooling2) | (None, 16, 16, 32) | 0 |
conv2d_3 (Conv2D) | (None, 16, 16, 64) | 18,496 |
dropout_2 (Dropout) | (None, 16, 16, 64) | 0 |
conv2d_4 (Conv2D) | (None, 16, 16, 64) | 36,928 |
max_pooling2d_2 (MaxPooling2) | (None, 8, 8, 64) | 0 |
conv2d_5 (Conv2D) | (None, 8, 8, 128) | 73,856 |
dropout_3 (Dropout) | (None, 8, 8, 128) | 0 |
conv2d_6 (Conv2D) | (None, 8, 8, 128) | 147,584 |
max_pooling2d_3 (MaxPooling2) | (None, 4, 4, 128) | 0 |
flatten_1 (Flatten) | (None, 2048) | 0 |
dense_1 (Dense) | (None, 2500) | 5,122,500 |
dropout_5 (Dropout) | (None, 2500) | 0 |
dense_2 (Dense) | (None, 1500) | 3,751,500 |
dropout_6 (Dropout) | (None, 1500) | 0 |
dense_3 (Dense) | (None, 10) | 15,010 |
Layer (Type) | Output Shape | Param |
---|---|---|
input ( RGB image) | ||
dense_1 (Dense) | (None, 10,000) | 30,730,000 |
dropout_1 (Dropout) | (None, 10,000) | 0 |
dense_2 (Dense) | (None, 1000) | 10,001,000 |
dropout_2 (Dropout) | (None, 1000) | 0 |
dense_3 (Dense) | (None, 100) | 100,100 |
dropout_3 (Dropout) | (None, 100) | 0 |
dense_4 (Dense) | (None, 2) | 202 |
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Deng, Z.; Cao, Y.; Zhou, X.; Yi, Y.; Jiang, Y.; You, I. Toward Efficient Image Recognition in Sensor-Based IoT: A Weight Initialization Optimizing Method for CNN Based on RGB Influence Proportion. Sensors 2020, 20, 2866. https://doi.org/10.3390/s20102866
Deng Z, Cao Y, Zhou X, Yi Y, Jiang Y, You I. Toward Efficient Image Recognition in Sensor-Based IoT: A Weight Initialization Optimizing Method for CNN Based on RGB Influence Proportion. Sensors. 2020; 20(10):2866. https://doi.org/10.3390/s20102866
Chicago/Turabian StyleDeng, Zile, Yuanlong Cao, Xinyu Zhou, Yugen Yi, Yirui Jiang, and Ilsun You. 2020. "Toward Efficient Image Recognition in Sensor-Based IoT: A Weight Initialization Optimizing Method for CNN Based on RGB Influence Proportion" Sensors 20, no. 10: 2866. https://doi.org/10.3390/s20102866
APA StyleDeng, Z., Cao, Y., Zhou, X., Yi, Y., Jiang, Y., & You, I. (2020). Toward Efficient Image Recognition in Sensor-Based IoT: A Weight Initialization Optimizing Method for CNN Based on RGB Influence Proportion. Sensors, 20(10), 2866. https://doi.org/10.3390/s20102866