Chinese Named Entity Recognition for Automobile Fault Texts Based on External Context Retrieving and Adversarial Training
Abstract
:1. Introduction
2. Related Work
2.1. Named Entity Recognition
2.2. Pre-Training Models in Chinese NER
2.3. Adversarial Training for NER
3. Methodology
3.1. Input Layer
3.2. Embedding Layer
3.3. Fusion Layer
3.4. BiLSTM Layer
3.5. CRF Layer
3.6. Adversarial Training
4. Experiments and Results
4.1. Datasets
4.2. External Retrieval
4.3. Experimental Parameters and Evaluation Metrics
4.4. Comparison of Keyword Extraction Methods
4.5. Ablation Study
4.6. Performance on Different Entities
4.7. Effect of Adversarial Training
4.8. Model Generalization
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Wang, Y.; Tong, H.; Zhu, Z.; Li, Y. Nested named entity recognition: A survey. ACM Trans. Knowl. Discov. Data 2022, 16, 108. [Google Scholar] [CrossRef]
- Cheng, M.; Li, H.; Yang, Z.; Fan, W.; Gan, Y. Named entity recognition for medical dialogue based on BERT and adversarial training. In Proceedings of the 2022 5th International Conference on Pattern Recognition and Artificial Intelligence (PRAI), Chengdu, China, 19–21 August 2022; pp. 955–959. [Google Scholar]
- Su, S.; Qu, J.; Cao, Y.; Li, R.; Wang, G. Adversarial training lattice LSTM for named entity recognition of rail fault texts. IEEE Trans. Intell. Transp. Syst. 2022, 23, 21201–21215. [Google Scholar] [CrossRef]
- Li, J.; Sun, A.; Han, J.; Li, C. A survey on deep learning for named entity recognition. IEEE Trans. Knowl. Data Eng. 2022, 34, 50–70. [Google Scholar] [CrossRef]
- Li, J.; Zhao, S.; Yang, J.; Huang, Z.; Liu, B.; Chen, S.; Pan, H.; Wang, Q. WCP-RNN: A novel RNN-based approach for Bio-NER in Chinese EMRs. J. Supercomput. 2020, 76, 1450–1467. [Google Scholar] [CrossRef]
- Xu, C.; Wang, F.; Han, J.; Li, C. Exploiting multiple embeddings for Chinese named entity recognition. In Proceedings of the 28th ACM International Conference on Information and Knowledge Management, Beijing, China, 3–7 November 2019; pp. 2269–2272. [Google Scholar]
- Liu, W.; Xu, T.; Xu, Q.; Song, J.; Zu, Y. An encoding strategy based word-character LSTM for Chinese NER. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long & Short Papers), Minneapolis, MN, USA, 2–7 June 2019; pp. 2379–2389. [Google Scholar]
- Cao, P.; Chen, Y.; Liu, K.; Zhao, J.; Liu, S. Adversarial transfer learning for Chinese named entity recognition with self-attention mechanism. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, 31 October–4 November 2018; pp. 182–192. [Google Scholar]
- Chiu, J.P.C.; Nichols, E. Named entity recognition with bidirectional LSTM-CNNs. Trans. Assoc. Comput. Linguist. 2016, 4, 357–370. [Google Scholar] [CrossRef]
- Zhang, M.; Lu, L. A local information perception enhancement–based method for Chinese NER. Appl. Sci. 2023, 13, 9948. [Google Scholar] [CrossRef]
- Peng, N.; Dredze, M. Named entity recognition for Chinese social media with jointly trained embeddings. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, Lisbon, Portugal, 17–21 September 2015; pp. 548–554. [Google Scholar]
- Zhang, Y.; Yang, J. Chinese NER using lattice LSTM. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, Melbourne, Australia, 15–20 July 2018; pp. 1554–1564. [Google Scholar]
- Li, X.; Yan, H.; Qiu, X.; Huang, X. Flat: Chinese NER using flat-lattice transformer. arXiv 2020, arXiv:2004.11795. [Google Scholar]
- Li, Y.; Yan, H.; Yang, Y.; Wang, X. A method for cultural relics named entity recognition based on enhanced lexical features. In Proceedings of the 2024 International Joint Conference on Neural Networks (IJCNN), Yokohama, Japan, 30 June–5 July 2024; pp. 1–8. [Google Scholar]
- Wang, S.; Liu, F.; Huang, B.; Li, J.; Feng, P. Chinese named entity recognition based on multi-metadata embedding and global pointer. In Proceedings of the 2023 3rd International Conference on Electronic Information Engineering and Computer Science (EIECS), Changchun, China, 22–24 September 2023; pp. 978–984. [Google Scholar]
- Zhang, B.; Cai, J.; Zhang, H.; Shang, J. VisPhone: Chinese named entity recognition model enhanced by visual and phonetic features. Inf. Process. Manag. 2023, 60, 103314. [Google Scholar] [CrossRef]
- Wang, X.; Jiang, Y.; Bach, N.; Wang, T.; Huang, Z.; Huang, F.; Tu, K. Improving named entity recognition by external context retrieving and cooperative learning. arXiv 2021, arXiv:2105.03654. [Google Scholar]
- Kenton, J.; Devlin, J.; Chang, M.-W.; Toutanova, K. BERT: Pre-training of deep bidirectional transformers for language understanding. Proceedings of NAACL-HLT 2019, Minneapolis, MN, USA, 2–7 June 2019; pp. 4171–4186. [Google Scholar]
- Hu, S.; Zhang, H.; Hu, X. Chinese Named Entity Recognition based on BERT-CRF Model. In Proceedings of the 2022 IEEE/ACIS 22nd International Conference on Computer and Information Science (ICIS), Zhuhai, China, 26–28 June 2022; pp. 105–108. [Google Scholar]
- Xue, M.; Yu, B.; Liu, T.; Zhang, Y.; Meng, E.; Wang, B. Porous lattice transformer encoder for Chinese NER. In Proceedings of the 28th International Conference on Computational Linguistics, Barcelona, Spain, 8–13 December 2020; pp. 3831–3841. [Google Scholar]
- Ma, R.; Peng, M.; Zhang, Q.; Wei, Z.; Huang, X. Simplify the usage of lexicon in Chinese NER. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Online, 5–10 July 2020; pp. 5951–5960. [Google Scholar]
- Gui, T.; Zou, Y.; Zhang, Q.; Peng, M.; Fu, J.; Wei, Z.; Huang, X. A lexicon-based graph neural network for Chinese NER. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing & 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), Hong Kong, China, 3–7 November 2019. [Google Scholar]
- Liu, W.; Fu, X.; Zhang, Y.; Xiao, W. Lexicon enhanced Chinese sequence labeling using BERT adapter. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics & 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), Online, 1–6 August 2021; pp. 5847–5858. [Google Scholar]
- Liang, J.; Li, D.; Lin, Y.; Wu, S.; Huang, Z. Named Entity Recognition of Chinese Crop Diseases and Pests Based on RoBERTa-wwm with Adversarial Training. Agronomy 2023, 13, 941. [Google Scholar] [CrossRef]
- Ma, H.; Zhang, Y.; Wang, J. Pretrained Models with Adversarial Training for Named Entity Recognition in Scientific Text. In Proceedings of the 2022 International Conference on Asian Language Processing (IALP), Singapore, 27–28 October 2022; pp. 259–264. [Google Scholar] [CrossRef]
- Mao, Q.; Li, J.; Meng, K. Improving Chinese named entity recognition by search engine augmentation. arXiv 2022, arXiv:2210.12662. [Google Scholar]
- Mihalcea, R.; Tarau, P. TextRank: Bringing order into texts. In Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing (EMNLP), Barcelona, Spain, 25–26 July 2004; pp. 404–411. [Google Scholar]
- Robertson, S.E.; Walker, S. Some simple effective approximations to the 2-poisson model for probabilistic weighted retrieval. In Proceedings of the 17th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR’94), Dublin, Ireland, 3–6 July 1994; pp. 232–241. [Google Scholar]
- Dai, L.L.; Jiang, K. Chinese text classification based on fastText. Comput. Mod. 2018, 5, 37–38. [Google Scholar]
- Diao, S.; Bai, J.; Song, Y.; Zhang, T.; Wang, Y. Zen: Pre-training Chinese text encoder enhanced by n-gram representations. arXiv 2019, arXiv:1911.00720. [Google Scholar]
- Wen, P.; Feng, L.; Zhang, T. A hybrid Chinese word segmentation model for quality management-related texts based on transfer learning. PLoS ONE 2022, 17, e0270154. [Google Scholar] [CrossRef] [PubMed]
- Zhu, P.; Cheng, D.; Yang, F.; Luo, Y.; Huang, D.; Qian, W.; Zhou, A. Improving Chinese named entity recognition by large-scale syntactic dependency graph. IEEE/ACM Trans. Audio Speech Lang. Process. 2022, 30, 979–991. [Google Scholar] [CrossRef]
- Pan, J.; Zhang, C.; Wang, H.; Wu, Z. A comparative study of Chinese named entity recognition with different segment representations. Appl. Intell. 2022, 52, 12457–12469. [Google Scholar] [CrossRef]
- Zhang, Z.; Li, W.; Bao, R.; Harimoto, K.; Wu, Y.; Sun, X. ASAT: Adaptively scaled adversarial training in time series. Neurocomputing 2023, 522, 11–23. [Google Scholar] [CrossRef]
- Wang, J.; Xu, W.; Fu, X.; Xu, G.; Wu, Y. ASTRAL: Adversarial trained LSTM-CNN for named entity recognition. Knowl.-Based Syst. 2020, 197, 105842. [Google Scholar] [CrossRef]
- Southwest Jiaotong University. ASP/SaaS-Based Manufacturing Industry Value Chain Collaboration Platform. Data Provided by Manufacturing Industry Chain Collaboration and Information Support Technology Key Laboratory of Sichuan Province. 2022. Available online: http://www.autosaas.cn/ (accessed on 28 September 2024).
- He, H.; Sun, X. F-score driven max margin neural network for named entity recognition in Chinese social media. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers, Valencia, Spain, 3–7 April 2017; pp. 713–718. [Google Scholar]
- Brown, D. Rank-BM25: A Collection of BM25 Algorithms in Python. 2020. Available online: https://zenodo.org/records/4520057 (accessed on 28 October 2024).
- Liu, H.; Song, J.; Peng, W.; Sun, J.; Xin, X. TFM: A triple fusion module for integrating lexicon information in Chinese named entity recognition. Neural Process. Lett. 2022, 54, 3425–3442. [Google Scholar] [CrossRef]
- Zhang, Q.; Sun, Y.; Zhang, L.; Jiao, Y.; Tian, Y. Named entity recognition method in health preserving field based on BERT. Procedia Comput. Sci. 2021, 183, 142–149. [Google Scholar] [CrossRef]
- Pan, S.; Li, Z.; Dai, J. An improved TextRank keywords extraction algorithm. In Proceedings of the ACM Turing Celebration Conference–China, Chengdu China, 17–19 May 2019; pp. 1–7. [Google Scholar]
- Fan, H.; Qin, Y. Research on text classification based on improved TF-IDF algorithm. In Proceedings of the 2018 International Conference on Network, Communication, Computer Engineering (NCCE 2018), Chongqing, China, 26–27 May 2018; pp. 501–506. [Google Scholar]
- Yao, L.; Pengzhou, Z.; Chi, Z. Research on news keyword extraction technology based on TF-IDF and TextRank. In Proceedings of the 2019 IEEE/ACIS 18th International Conference on Computer and Information Science (ICIS), Beijing, China, 17–19 June 2019; pp. 452–455. [Google Scholar]
- Chen, T.; Xu, R.; He, Y.; Wang, X. Improving sentiment analysis via sentence type classification using BiLSTM-CRF and CNN. Expert Syst. Appl. 2017, 72, 221–230. [Google Scholar] [CrossRef]
Dataset | Type | Train | Test | Dev |
---|---|---|---|---|
D1 | Sentence | 3.6 k | 0.45 k | 0.45 k |
D2 | Sentence | 4.4 k | 0.55 k | 0.55 k |
D3 | Sentence | 3.2 k | 0.4 k | 0.4 k |
Sentence | 1.4 k | 0.27 k | 0.27 k | |
Resume | Sentence | 3.8 k | 0.46 k | 0.48 k |
Resume | D1 | D2 | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Models | P | R | F1 | P | R | F1 | P | R | F1 | P | R | F1 |
Basic | 70.75 | 70.01 | 70.38 | 96.02 | 96.7 | 96.36 | 85.11 | 88.19 | 86.62 | 86.21 | 88.82 | 87.01 |
+EK | 71.10 | 71.78 | 71.44 | 96.45 | 96.97 | 96.71 | 86.97 | 89.94 | 88.43 | 88.91 | 89.63 | 89.27 |
+AT | 71.29 | 71.83 | 71.56 | 96.51 | 97.05 | 96.78 | 87.08 | 90.78 | 88.89 | 89.09 | 90.95 | 90.01 |
CNEREA | 72.68 | 70.69 | 71.67 | 96.60 | 97.46 | 97.03 | 87.69 | 91.20 | 89.41 | 89.27 | 90.84 | 90.05 |
Resume | D1 | D2 | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Models | P | R | F1 | P | R | F1 | P | R | F1 | P | R | F1 |
BiLSTM-CRF | 68.8 | 49.3 | 57.4 | 94.53 | 94.29 | 94.41 | 83.14 | 78.93 | 80.98 | 83.36 | 78.93 | 81.08 |
Lattice-LSTM | 53.04 | 62.25 | 58.79 | 94.18 | 94.11 | 94.46 | 85.47 | 80.41 | 82.86 | 84.35 | 82.78 | 83.56 |
FLAT-Lattice | - | - | 63.42 | - | - | 94.93 | - | - | - | - | - | - |
BERT-CRF | 70.33 | 66.11 | 68.13 | 95.49 | 96.07 | 95.78 | 86.03 | 80.27 | 83.05 | 86.66 | 83.72 | 84.68 |
CL-L2 | 69.85 | 68.18 | 69.01 | 96.97 | 96.2 | 96.59 | 86.53 | 80.91 | 83.63 | 86.04 | 84.15 | 85.08 |
MEGP | 70.24 | 71.54 | 70.68 | 96.79 | 96.26 | 96.51 | - | - | - | - | - | - |
Vis-Phone | - | - | 70.79 | 96.26 | 96.44 | 96.26 | - | - | - | - | - | - |
LEBERT | 69.88 | 71.05 | 70.46 | 96.26 | 96.44 | 96.35 | 86.23 | 82.03 | 84.08 | 86.29 | 84.02 | 85.14 |
NER-ELF | 70.85 | 70.47 | 70.66 | 96.04 | 97.02 | 96.53 | - | - | - | - | - | - |
CNEREA | 72.68 | 70.69 | 71.67 | 96.60 | 97.46 | 97.03 | 87.69 | 91.20 | 89.41 | 89.27 | 90.84 | 90.05 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Wang, S.; Sun, L. Chinese Named Entity Recognition for Automobile Fault Texts Based on External Context Retrieving and Adversarial Training. Entropy 2025, 27, 133. https://doi.org/10.3390/e27020133
Wang S, Sun L. Chinese Named Entity Recognition for Automobile Fault Texts Based on External Context Retrieving and Adversarial Training. Entropy. 2025; 27(2):133. https://doi.org/10.3390/e27020133
Chicago/Turabian StyleWang, Shuhai, and Linfu Sun. 2025. "Chinese Named Entity Recognition for Automobile Fault Texts Based on External Context Retrieving and Adversarial Training" Entropy 27, no. 2: 133. https://doi.org/10.3390/e27020133
APA StyleWang, S., & Sun, L. (2025). Chinese Named Entity Recognition for Automobile Fault Texts Based on External Context Retrieving and Adversarial Training. Entropy, 27(2), 133. https://doi.org/10.3390/e27020133