Short Text Classification Based on Explicit and Implicit Multiscale Weighted Semantic Information
Abstract
:1. Introduction
2. Acquisition of Explicit and Implicit Multiscale Semantic Information
2.1. The Process of Explicit and Implicit Multiscale Semantic Information
2.2. Acquisition of Explicit Multiscale Semantic Information
2.3. Acquisition of Implicit Multiscale Semantic Information
3. Semantic Information Mining and Text Classification
3.1. Explicit and Implicit Semantic Information Interaction and Feature Mining
3.2. Semantic Feature Fusion and Text Classification
4. The Experimental Result
4.1. Experimental Environment and Dataset
4.2. Experimental Model and Evaluation Criteria
4.3. Experimental Setup and Experimental Results
5. Analysis of Experimental Results
6. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Quazi, S.; Musa, S.M. Text Classification and Categorization through Deep Learning. In Proceedings of the 14th International Conference on Computational Intelligence and Communication Networks, Al-Khobar, Saudi Arabia, 4–6 December 2022; pp. 513–519. [Google Scholar]
- Uriarte-Arcia, A.V.; López-Yáñez, I.; Yáñez-Márquez, C. One-Hot Vector Hybrid Associative Classifier for Medical Data Classification. PLoS ONE 2014, 9, e95715. [Google Scholar] [CrossRef]
- Parida, U.; Nayak, M.; Nayak, A.K. Ranking of Odia Text Document Relevant to User Query Using Vector Space Model. In Proceedings of the 2019 International Conference on Applied Machine Learning, Bhubaneswar, India, 25–26 May 2019; pp. 165–169. [Google Scholar]
- Chen, S.; Bolufé-Röhler, A.; Montgomery, J.; Zhang, W.; Hendtlass, T. Using Average-Fitness Based Selection to Combat the Curse of Dimensionality. In Proceedings of the 2022 IEEE Congress on Evolutionary Computation, Padua, Italy, 18–23 July 2022; pp. 1–8. [Google Scholar]
- Sumarsono, A. Application of RXD Algorithm to Word Vector Representation for Keyword Identification. In Proceedings of the 10th Annual Computing and Communication Workshop and Conference, Las Vegas, NV, USA, 6–8 January 2020; pp. 306–310. [Google Scholar]
- Dogan, G.; Ergen, B. A new mobile convolutional neural network-based approach for pixel-wise road surface crack detection. Measurement 2022, 195, 111119–111123. [Google Scholar] [CrossRef]
- Dhyani, M.; Kumar, R. An intelligent Chatbot using deep learning with Bidirectional RNN and attention model. Mater. Today 2021, 34, 817–824. [Google Scholar] [CrossRef] [PubMed]
- Kim, Y. Convolutional Neural Networks for Sentence Classification. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar, 25–29 October 2014; pp. 1746–1751. [Google Scholar] [CrossRef]
- Arevian, G. Recurrent Neural Networks for Robust Real-World Text Classification. In Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence (WI’07), Fremont, CA, USA, 2–5 November 2007; pp. 326–329. [Google Scholar]
- You, H.; Yu, L.; Tian, S.; Ma, X.; Cai, W. MC-Net: Multiple max-pooling integration module and cross multi-scale deconvolution network. Knowl.-Based Syst. 2021, 231, 107456–107464. [Google Scholar] [CrossRef]
- Yu, Y. Research on Music Emotion Classification Based on CNN-LSTM Network. In Proceedings of the 5th Asian Conference on Artificial Intelligence Technology (ACAIT), Haikou, China, 29–31 October 2021; pp. 473–476. [Google Scholar]
- Lidong, H.; Hui, Z. A new short text sentimental classification method based on multi-mixed convolutional neural network. In Proceedings of the 3rd International Conference on Cloud Computing and Big Data Analysis, Chengdu, China, 20–22 April 2018; pp. 93–99. [Google Scholar]
- Zhou, C.; Sun, C.; Liu, Z.; Lau, F.C.M. A C-LSTM Neural Network for Text Classification. Comput. Sci. 2015, 4, 39–44. [Google Scholar]
- Vijayaprabakaran, K.; Sathiyamurthy, K. Towards activation function search for long short-term model network: A differential evolution based approach. J. King Saud Univ.-Comput. Inf. Sci. 2020, 34, 2637–2650. [Google Scholar] [CrossRef]
- Prakash, S.; Jalal, A.S.; Pathak, P. Forecasting COVID-19 Pandemic using Prophet, LSTM, hybrid GRU-LSTM, CNN-LSTM, Bi-LSTM and Stacked-LSTM for India. In Proceedings of the 6th International Conference on Information Systems and Computer Networks, Mathura, India, 1–6 March 2023. [Google Scholar] [CrossRef]
- Lai, S.; Lei, D. Calculation of sentence vector similarity based on fasttext model of weighted fusion. In Proceedings of the 4th International Conference on Advances in Computer Technology, Information Science and Communications (CTISC), Suzhou, China, 22–24 April 2022; pp. 1–6. [Google Scholar]
- Lu, W.; Duan, Y.; Song, Y. Self-Attention-Based Convolutional Neural Networks for Sentence Classification. In Proceedings of the 6th International Conference on Computer and Communications (ICCC), Chengdu, China, 11–14 December 2020; pp. 2065–2069. [Google Scholar]
- Yin, W.; Schütze, H. Multichannel Variable-Size Convolution for Sentence Classification. arXiv 2016. [Google Scholar] [CrossRef]
- Wang, C.; Wang, B.; Xu, M. Tree-Structured Neural Networks With Topic Attention for Social Emotion Classification. IEEE Access 2019, 7, 95505–95515. [Google Scholar] [CrossRef]
- Yang, X.; Liu, X. Convolutional Recurrent neural network with attention mechanism based improved skip-gram algorithm for text sentiment classification. In Proceedings of the 7th International Conference on Information Science and Control Engineering, Changsha, China, 18–20 December 2020; pp. 410–414. [Google Scholar]
- Xian, T.; Wen, L.; Yi, D.; Ting, W. Short Text Feature Extraction and Classification Based on Serial-Parallel Convolutional Gated Recurrent Neural Network. Adv. Eng. Sci. 2019, 51, 125–132. [Google Scholar]
- Cheng, Z.; Tong, H.; Man, X. BLSTM_MLPCNN Model for Short Text Classification. Comput. Sci. 2019, 46, 206–211. [Google Scholar]
- Li, J.; Yang, X. A Cyclical Learning Rate Method in Deep Learning Training. In Proceedings of the International Conference on Computer, Information and Telecommunication Systems (CITS), Hangzhou, China, 5–7 October 2020; pp. 1–5. [Google Scholar]
- Saravanan, V.; Ranjana, P. Stochastic Gradient Descent on Modern Hardware for Business Environment. In Proceedings of the 7th International Conference on Intelligent Computing and Control Systems, Madurai, India, 17–19 May 2023; pp. 812–815. [Google Scholar]
Data | c (pcs) | len (pcs) | train (pcs) | dev (pcs) | test (pcs) | |V| |
---|---|---|---|---|---|---|
MR | 2 | 21 | 10,662 | - | CV | 20,191 |
Subj | 2 | 23 | 10,000 | - | CV | 21,057 |
TREC | 6 | 10 | 5452 | - | 500 | 9137 |
SST1 | 5 | 18 | 8544 | 1101 | 2210 | 17,836 |
SST2 | 2 | 19 | 6920 | 872 | 1821 | 16,185 |
Model | MR (%) | Subj (%) | TREC (%) | SST1 (%) | SST2 (%) |
---|---|---|---|---|---|
FastText | 79.7 | 91.5 | 92.1 | 46.1 | 85.4 |
CNN-static | 81.0 | 93.0 | 92.8 | 45.5 | 86.8 |
CNN-non-static | 81.5 | 93.4 | 93.6 | 48.0 | 87.2 |
CNN-multi-channel | 81.1 | 93.2 | 92.2 | 47.4 | 88.1 |
MVCNN | - | 93.9 | - | 49.6 | 89.4 |
LSTM | - | - | - | 46.4 | 84.9 |
Bi-LSTM | - | - | - | 49.1 | 87.5 |
LSTM-Tree | - | - | - | 51.0 | 88.0 |
RCNN | - | - | - | 47.2 | - |
SPCGRU | 82.3 | 94.9 | 95.8 | - | - |
BLSTM MLPCNN | 83.0 | 95.0 | 95.7 | 49.0 | 88.2 |
Ours-static | 84.1 | 95.9 | 97.3 | 52.1 | 90.6 |
Ours-prestatic | 84.4 | 95.8 | 96.9 | 52.4 | 89.9 |
Ours-nonstatic | 85.1 | 96.3 | 97.6 | 52.7 | 90.8 |
Ours-prenonstatic | 85.7 | 96.9 | 98.1 | 53.4 | 91.8 |
MR (%) | Subj (%) | TREC (%) | SST1 (%) | SST2 (%) | |
---|---|---|---|---|---|
0.5 | 85.7 | 96.8 | 98.0 | 53.4 | 91.7 |
0.7 | 85.6 | 96.8 | 98.0 | 53.3 | 91.6 |
1.0 | 85.7 | 96.7 | 98.1 | 53.4 | 91.8 |
1.3 | 85.7 | 96.9 | 98.1 | 53.2 | 91.5 |
1.5 | 85.6 | 96.7 | 98.1 | 53.2 | 91.5 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Gong, J.; Zhang, J.; Guo, W.; Ma, Z.; Lv, X. Short Text Classification Based on Explicit and Implicit Multiscale Weighted Semantic Information. Symmetry 2023, 15, 2008. https://doi.org/10.3390/sym15112008
Gong J, Zhang J, Guo W, Ma Z, Lv X. Short Text Classification Based on Explicit and Implicit Multiscale Weighted Semantic Information. Symmetry. 2023; 15(11):2008. https://doi.org/10.3390/sym15112008
Chicago/Turabian StyleGong, Jun, Juling Zhang, Wenqiang Guo, Zhilong Ma, and Xiaoyi Lv. 2023. "Short Text Classification Based on Explicit and Implicit Multiscale Weighted Semantic Information" Symmetry 15, no. 11: 2008. https://doi.org/10.3390/sym15112008
APA StyleGong, J., Zhang, J., Guo, W., Ma, Z., & Lv, X. (2023). Short Text Classification Based on Explicit and Implicit Multiscale Weighted Semantic Information. Symmetry, 15(11), 2008. https://doi.org/10.3390/sym15112008