Hand Gesture Recognition in Indian Sign Language Using Deep Learning †
Abstract
:1. Introduction
- Implementation of machine learning and deep learning algorithms used in hand gesture recognition.
- To fine-tune the deep learning model in order to achieve maximum performance.
- What flaws or drawbacks does this model have that turn up in applications of the algorithms? Discuss these flaws.
- How these flaws can be corrected, if they can be corrected currently, or what can be done about them in the future.
2. Literature Survey
3. Methodology
3.1. Dataset
3.2. Image Processing
3.3. Image Augmentation
3.4. Model Training
- Conv2D: It applies a 2D convolution operation to the input data. A filter, which is a matrix whose dimensions are specified by the kernel size, is used to produce an output.
- MaxPool2D: This is a pooling layer and is usually applied after a convolution layer. It applies a filter and selects a single value from each subregion of the specified dimension from the input data. In this case, the filter applied is Max, which selects the maximum value.
- Flatten: It converts multi-dimensional data into a 1-D shape, i.e., flattens the data.
- Dense: A dense layer is one where each node is connected to every node of the previous layer. In this case, connect each node to the flattened layer.
- Dropout: The dropout layer randomly drops out nodes from its input layer at a specified rate. It is used to reduce overfitting.
3.5. Evaluating Performance
4. Results and Discussions
5. Conclusions and Future Work
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Rajput, L.; Gupta, S. Sentiment Analysis Using Latent Dirichlet Allocation for Aspect Term Extraction. J. Comput. Mech. Manag. 2023, 2, 8–13. [Google Scholar] [CrossRef]
- Rosalina; Yusnita, L.; Hadisukmana, N.; Wahyu, R.B.; Roestam, R.; Wahyu, Y. Implementation of Real-Time Static Hand Gesture Recognition Using Artificial Neural Network. In Proceedings of the 2017 4th International Conference on Computer Applications and Information Processing Technology (CAIPT), Bali, Indonesia, 8–10 August 2017; pp. 1–6. [Google Scholar]
- Hangün, B.; Eyecioğlu, Ö. Performance Comparison Between OpenCV Built in CPU and GPU Functions on Image Processing Operations. Int. J. Eng. Sci. Appl. 2017, 1, 34–41. [Google Scholar]
- Li, S.; Deng, W. Deep Facial Expression Recognition: A Survey. IEEE Trans. Affect. Comput. 2020, 13, 1195–1215. [Google Scholar] [CrossRef]
- Prasanna, D.M.; Reddy, C.G. Development of Real Time Face Recognition System Using OpenCV. Development 2017, 4, 791. [Google Scholar]
- Oyedotun, O.K.; Khashman, A. Deep Learning in Vision-Based Static Hand Gesture Recognition. Neural Comput. Appl. 2017, 28, 3941–3951. [Google Scholar] [CrossRef]
- Zhan, F. Hand Gesture Recognition with Convolution Neural Networks. In Proceedings of the 2019 IEEE 20th International Conference on Information Reuse and Integration for Data Science (IRI), Los Angeles, CA, USA, 30 July–1 August 2019; pp. 295–298. [Google Scholar]
- Pigou, L.; Van Herreweghe, M.; Dambre, J. Gesture and Sign Language Recognition with Temporal Residual Networks. In Proceedings of the 2017 IEEE International Conference on Computer Vision Workshops (ICCVW), Venice, Italy, 22–29 October 2017; pp. 3086–3093. [Google Scholar]
- Hsu, H.-W.; Wu, T.-Y.; Wan, S.; Wong, W.H.; Lee, C.-Y. QuatNet: Quaternion-Based Head Pose Estimation With Multiregression Loss. IEEE Trans. Multimed. 2019, 21, 1035–1046. [Google Scholar] [CrossRef]
- Gupta, J. Hand Gesture Recognition for Emoji Prediction. Int. J. Res. Appl. Sci. Eng. Technol. 2020, 8, 1310–1317. [Google Scholar] [CrossRef]
- Mo, T.; Sun, P. Research on Key Issues of Gesture Recognition for Artificial Intelligence. Soft Comput. 2020, 24, 5795–5803. [Google Scholar] [CrossRef]
- Muthu Mariappan, H.; Gomathi, V. Real-Time Recognition of Indian Sign Language. In Proceedings of the 2019 International Conference on Computational Intelligence in Data Science (ICCIDS), Chennai, India, 21–23 February 2019; pp. 1–6. [Google Scholar]
- Rastogi, R.; Mittal, S.; Agarwal, S. A Novel Approach for Communication among Blind, Deaf and Dumb People. In Proceedings of the 2015 2nd International Conference on Computing for Sustainable Global Development (INDIACom), New Delhi, India, 11–13 March 2015. [Google Scholar]
- Nagpal, N. Design Issue and Proposed Implementation of Communication Aid for Deaf & Dumb People. Int. J. Recent Innov. Trends Comput. Commun. 2015, 3, 147–149. [Google Scholar]
- Ahire, P.G.; Tilekar, K.B.; Jawake, T.A.; Warale, P.B. Two Way Communicator between Deaf and Dumb People and Normal People. In Proceedings of the 2015 International Conference on Computing Communication Control and Automation, Pune, India, 26–27 February 2015; pp. 641–644. [Google Scholar]
- Sharma, S.; Gupta, R.; Kumar, A. Trbaggboost: An Ensemble-Based Transfer Learning Method Applied to Indian Sign Language Recognition. J. Ambient Intell. Humaniz. Comput. 2022, 13, 3527–3537. [Google Scholar] [CrossRef]
- Kishore, C.R.; Pemula, R.; Vijaya Kumar, S.; Rao, K.P.; Chandra Sekhar, S. Deep Learning Models for Identification of COVID-19 Using CT Images. In Proceedings of the Soft Computing: Theories and Applications; Kumar, R., Ahn, C.W., Sharma, T.K., Verma, O.P., Agarwal, A., Eds.; Springer Nature: Singapore, 2022; pp. 577–588. [Google Scholar]
- Lin, H.-I.; Hsu, M.-H.; Chen, W.-K. Human Hand Gesture Recognition Using a Convolution Neural Network. In Proceedings of the 2014 IEEE International Conference on Automation Science and Engineering (CASE), New Taipei, Taiwan, 18–22 August 2014; Volume 2014, pp. 1038–1043. [Google Scholar]
- Sharma, C.M.; Tomar, K.; Mishra, R.K.; Chariar, V.M. Indian Sign Language Recognition Using Fine-Tuned Deep Transfer Learning Model. In Proceedings of the Innovations In Computer And Information Science (ICICIS), Ganzhou, China, 27–29 August 2021; pp. 62–67. [Google Scholar]
- Arora, M.; Dhawan, S.; Singh, K. Exploring Deep Convolution Neural Networks with Transfer Learning for Transformation Zone Type Prediction in Cervical Cancer. In Proceedings of the Soft Computing: Theories and Applications; Pant, M., Sharma, T.K., Verma, O.P., Singla, R., Sikander, A., Eds.; Springer: Singapore, 2020; pp. 1127–1138. [Google Scholar]
System Configuration | Optimizer | Input Size | Training Time | Testing Loss | Testing Accuracy |
---|---|---|---|---|---|
Google Colab GPU: Tesla K80 CPU: Intel(R) Xeon | rmsprop | (100, 75, 1) Grayscale | 3 h 52 min | 0.0687 | 0.9713 |
Google Colab GPU: Tesla K80 CPU: Intel(R) Xeon | rmsprop | (80, 60, 3) HSV | 3 h 26 min | 0.0178 | 0.999 |
Local Machine GPU: GTX 1650S CPU: Intel(R) Core i5 10400F | rmsprop | (100, 75, 1) Grayscale | 2 h 24 min | 0.1874 | 0.9383 |
Local Machine GPU: GTX 1650S CPU: Intel(R) Core i5 10400F | adam | (200, 150, 1) Grayscale | 2 h 17 min | 0.166 | 0.9274 |
Local Machine GPU: RTX 3050 CPU: Ryzen 9 5900HS | rmsprop | (100, 75, 1) Grayscale | 2 h 11 min | 0.025 | 0.9899 |
Paper | Task | Model | Accuracy |
---|---|---|---|
Muthu Mariappan et al. [12] | 40 ISL words and sentences in real time | Fuzzy c-means | 75% |
Rosalina et al. [2] | 39 ASL signs (26 alphabet letters, 10 digits, and 3 punctuation) | ANN | 90% |
Oyebade K. Oyedotun et al. [6] | 24 ASL Hand Gestures | CNN | 92.83% |
Hsien-I Lin et al. [17] | 7 hand gestures | CNN | 99% |
Gupta et al. [10] | 11 hand gestures | CNN | 99.6% |
Proposed model | 26 ISL Hand Signs | CNN | 99% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Vashisth, H.K.; Tarafder, T.; Aziz, R.; Arora, M.; Alpana. Hand Gesture Recognition in Indian Sign Language Using Deep Learning. Eng. Proc. 2023, 59, 96. https://doi.org/10.3390/engproc2023059096
Vashisth HK, Tarafder T, Aziz R, Arora M, Alpana. Hand Gesture Recognition in Indian Sign Language Using Deep Learning. Engineering Proceedings. 2023; 59(1):96. https://doi.org/10.3390/engproc2023059096
Chicago/Turabian StyleVashisth, Harsh Kumar, Tuhin Tarafder, Rehan Aziz, Mamta Arora, and Alpana. 2023. "Hand Gesture Recognition in Indian Sign Language Using Deep Learning" Engineering Proceedings 59, no. 1: 96. https://doi.org/10.3390/engproc2023059096
APA StyleVashisth, H. K., Tarafder, T., Aziz, R., Arora, M., & Alpana. (2023). Hand Gesture Recognition in Indian Sign Language Using Deep Learning. Engineering Proceedings, 59(1), 96. https://doi.org/10.3390/engproc2023059096