Complex Knowledge Base Question Answering for Intelligent Bridge Management Based on Multi-Task Learning and Cross-Task Constraints
Abstract
:1. Introduction
- (1)
- Due to the privacy policy, bridge management data have not been fully disclosed, and the industry lacks ready-made knowledge base and question answering corpus, resulting in insufficient data support for research in this field. In addition, important information, such as basic attributes, defect damage, and technical conditions are not fully utilized, leading to insufficient knowledge services in the field of bridge management.
- (2)
- Bridge management information has great domain characteristics in data storage, term description, expression, etc. For example, the topic entity boundary is not clear with a significant amount of specialized words, and the question types are not evenly distributed. Existing methods cannot directly solve these problems.
- (3)
- The bridge management questions are relatively complex, involving multiple-hops, judgment, constraints, numerical calculation, aggregation operation, etc.
- (1)
- We constructed a bridge management domain knowledge base and question answering corpus, realized the integration and utilization of data, and laid the data foundation for the question answering task in this field.
- (2)
- Cross-task constraints (CTC) make the semantics of subtasks interrelated, enrich the expression of contextual semantic features of domain questions, and can effectively solve problems, such as unclear entity boundaries and inaccurate professional vocabulary recognition. MTL strategy can reduce the error propagation of TEE and QC tasks. The template matching method combines semantic analysis and neural network technology to convert natural language questions into answer templates, which can answer more complex questions and maximize the accuracy of QA, meeting the complex application scenarios of bridge management.
- (3)
- This research compensates for the shortage of fine-grained knowledge service in the bridge management field and realizes the fine-grained information interaction between bridge management users and domain knowledge base.
2. Related Work
2.1. Complex Knowledge Base Question Answering
2.2. Domain-Specific KBQA Approaches
2.3. Gaps and Challenges
- (1)
- The bridge management process is concerned with numerical values, such as technical condition level and score, as well as textual information, such as structural defect and maintenance advice. Therefore, the Neo4j attribute graph structure can be used for storing bridge management knowledge. In addition, the question types in the field of bridge management are relatively fixed; therefore, template matching can give full scope to its advantages, which can better ensure the QA effect in the practical application scenarios of bridge management.
- (2)
- Bridge management questions often contain multiple topic entities with ill-defined boundaries. For example, the question “A桥桥面系的技术状况等级是多少?” (“What is the technical condition level of the bridge deck system of Bridge A?”) contains two topic entities “A桥” (“Bridge A”) and “桥面系” (“Deck System”). The same character “桥” (“bridge”) exists between topic entities, but without any separators. In addition, the short text of bridge management questions lacks contextual semantics, which increases the difficulty in professional vocabulary recognition. Moreover, bridge management questions can be classified according to two levels: Coarse-grained and fine-grained. For example, the coarse-grained type for the question “A桥和B桥有哪些共同缺陷?” (“What common defects do Bridge A and Bridge B have?”) is “损伤缺陷” (“damage defects”), and its fine-grained type is “共同缺陷” (“common defects”). Therefore, it is necessary to clarify the entity boundary and assist in the fine classification of questions through POS tagging. To improve the contextual comprehension ability of questions, the semantic correlation between subtasks should be strengthened to form CTC. (Due to the privacy property of bridge management information, letters are used rather than real bridge names in this paper).
- (3)
- There is no strict execution order between the TEE and QC tasks. Therefore, the multiple subtasks of bridge management C-KBQA can be jointly trained to avoid error propagation and save computational resources.
3. Methodology
3.1. Encoder
3.2. Multi-Task Learning
3.3. Cross-Task Constraints
4. Experiments
4.1. Neo4j Knowledge Base
4.2. Experimental Dataset
4.3. Baselines and Configurations
4.4. Experimental Results
4.5. Ablation Study
5. System Prototype
6. Conclusions
- (1)
- The template still needs to be designed manually and is closely related to the storage form of the knowledge base; therefore, it cannot be flexibly improved. As a result, the automatic generation of query templates will be the focus of future research. The following work will carry out syntactic dependency analysis and part of speech analysis on natural language questions to obtain the core sentence pattern of the questions. The QA pairs are used as training data, and feature extraction is carried out through the deep neural network to automatically construct question templates.
- (2)
- The bridge management knowledge base contains large numbers of entities and relationships; therefore, the follow-up work can build a domain knowledge graph embedding model to integrate prior knowledge into the model in advance. In order to retain the structural and semantic features between entities and relations at the same time, the graph neural network and attention mechanism will be combined to generate the knowledge graph embedding to capture the deep interaction information between entities and relationships.
- (3)
- With the advancement in deep learning, end-to-end C-KBQA models have been extensively studied in the general domain; therefore, researching C-KBQA end-to-end models for bridge management is another important and challenging task. The future work will be based on the semantic support of knowledge graph embedding, and automatically construct structured queries through the automatic generation of templates, in an attempt to obtain an automatic question answering solution for complex knowledge bases with enhanced domain knowledge semantics.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Acknowledgments
Conflicts of Interest
References
- Soares, M.A.C.; Parreiras, F.S. A literature review on question answering techniques, paradigms and systems. J. King Saud Univ. Comput. Inf. Sci. 2020, 32, 635–646. [Google Scholar]
- Razzaghnoori, M.; Sajedi, H.; Jazani, I.K. Question classification in Persian using word vectors and frequencies. Cogn. Syst. Res. 2018, 47, 16–27. [Google Scholar] [CrossRef]
- Lu, R.; Jin, X.; Zhang, S.; Qiu, M.; Wu, X. A study on big knowledge and its engineering issues. IEEE Trans. Knowl. Data Eng. 2018, 31, 1630–1644. [Google Scholar] [CrossRef]
- Chen, X.; Jia, S.; Xiang, Y. A review: Knowledge reasoning over knowledge graph. Expert Syst. Appl. 2020, 141, 112948. [Google Scholar] [CrossRef]
- Pregnolato, M. Bridge safety is not for granted-A novel approach to bridge management. Eng. Struct. 2019, 196, 109193. [Google Scholar] [CrossRef]
- Gui, C.; Zhang, J.; Lei, J.; Hou, Y.; Zhang, Y.; Qian, Z. A comprehensive evaluation algorithm for project-level bridge maintenance decision-making. J. Clean. Prod. 2021, 289, 125713. [Google Scholar] [CrossRef]
- Wu, C.; Wu, P.; Wang, J.; Jiang, R.; Chen, M.; Wang, X. Critical review of data-driven decision-making in bridge operation and maintenance. Struct. Infrastruct. Eng. 2021, 18, 47–70. [Google Scholar] [CrossRef]
- Yang, X.; Yang, J.; Li, R.; Luo, M.; Jiang, S.; Wang, G.; Yang, Y. Knowledge graph construction and knowledge question answering method for bridge inspection domain. J. Comput. Appl. 2022, 42, 28–36. [Google Scholar]
- Pan, Z.; Su, C.; Deng, Y.; Cheng, J. Video2Entities: A computer vision-based entity extraction framework for updating the architecture, engineering and construction industry knowledge graphs. Autom. Constr. 2021, 125, 103617. [Google Scholar] [CrossRef]
- Sarazin, A.; Bascans, J.; Sciau, J.B.; Song, J.; Supiot, B.; Montarnal, A.; Lorca, X.; Truptil, S. Expert system dedicated to condition-based maintenance based on a knowledge graph approach: Application to an aeronautic system. Expert Syst. Appl. 2021, 186, 115767. [Google Scholar] [CrossRef]
- Wan, C.; Zhou, Z.; Li, S.; Ding, Y.; Xu, Z.; Yang, Z.; Xia, Y.; Yin, F. Development of a bridge management system based on the building information modeling technology. Sustainability 2019, 11, 4583. [Google Scholar] [CrossRef] [Green Version]
- Diefenbach, D.; Lopez, V.; Singh, K.; Maret, P. Core techniques of question answering systems over knowledge bases: A survey. Knowl. Inf. Syst. 2018, 55, 529–569. [Google Scholar] [CrossRef]
- Hu, S.; Zou, L.; Zhang, X. A state-transition framework to answer complex questions over knowledge base. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, Brussels, Belgium, 31 October–4 November 2018; pp. 2098–2108. [Google Scholar]
- Hua, Y.; Li, Y.F.; Qi, G.; Wu, W.; Zhang, J.; Qi, D. Less is more: Data-efficient complex question answering over knowledge bases. J. Web Semant. 2020, 65, 100612. [Google Scholar] [CrossRef]
- Cai, Q.; Yates, A. Large-scale semantic parsing via schema matching and lexicon extension. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, 4–9 August 2013; pp. 423–433. [Google Scholar]
- Sun, Y.; Li, P.; Cheng, G.; Qu, Y. Skeleton parsing for complex question answering over knowledge bases. J. Web Semant. 2022, 72, 100698. [Google Scholar] [CrossRef]
- Zhang, J.; Zhang, L.; Hui, B.; Tian, L. Improving complex knowledge base question answering via structural information learning. Knowl. -Based Syst. 2022, 242, 108252. [Google Scholar] [CrossRef]
- Guo, D.; Tang, D.; Duan, N.; Zhou, M.; Yin, J. Dialog-to-action: Conversational question answering over a large-scale knowledge base. In Proceedings of the 2018 Neural Information Processing Systems Conference, Montreal, Canada, 3–8 December 2018; pp. 2942–2951. [Google Scholar]
- Wu, W.; Zhu, Z.; Zhang, G.; Kang, S.; Liu, P. A reasoning enhance network for muti-relation question answering. Appl. Intell. 2021, 51, 4515–4524. [Google Scholar] [CrossRef]
- Höffner, K.; Walter, S.; Marx, E.; Usbeck, R.; Lehmann, J.; Ngonga, N.; Axel, C. Survey on challenges of question answering in the semantic web. Semant. Web 2017, 8, 895–920. [Google Scholar] [CrossRef] [Green Version]
- Yang, M.C.; Lee, D.G.; Park, S.Y.; Rim, H.C. Knowledge-based question answering using the semantic embedding space. Expert Syst. Appl. 2015, 42, 9086–9104. [Google Scholar] [CrossRef]
- Zhou, M.; Huang, M.; Zhu, X. An interpretable reasoning network for multi-relation question answering. arXiv 2018, arXiv:1801.04726. [Google Scholar]
- Jin, H.; Luo, Y.; Gao, C.; Tang, X.; Yuan, P. ComQA: Question answering over knowledge base via semantic matching. IEEE Access 2019, 7, 75235–75246. [Google Scholar] [CrossRef]
- Esteva, A.; Kale, A.; Paulus, R.; Hashimoto, K.; Yin, W.; Radev, D.; Socher, R. COVID-19 information retrieval with deep-learning based semantic search, question answering, and abstractive summarization. NPJ Digit. Med. 2021, 4, 68. [Google Scholar] [CrossRef] [PubMed]
- Unger, C.; Bühmann, L.; Lehmann, J.; Ngonga Ngomo, A.C.; Gerber, D.; Cimiano, P. Template-based question answering over RDF data. In Proceedings of the 21st international conference on World Wide Web, Lyon, France, 16–20 April 2012; pp. 639–648. [Google Scholar]
- Gomes, J.J.; de Mello, R.C.; Ströele, V.; de Souza, J.F. A hereditary attentive template-based approach for complex Knowledge Base Question Answering systems. Expert Syst. Appl. 2022, 205, 117725. [Google Scholar] [CrossRef]
- Ojokoh, B.; Adebisi, E. A review of question answering systems. J. Web Eng. 2018, 17, 717–758. [Google Scholar] [CrossRef]
- Lan, Y.; He, G.; Jiang, J.; Jiang, J.; Zhao, W.X.; Wen, J. A survey on complex knowledge base question answering: Methods, challenges and solutions. arXiv 2021, arXiv:2105.11644. [Google Scholar]
- Pereira, A.; Trifan, A.; Lopes, R.P.; Oliveira, J.L. Systematic review of question answering over knowledge bases. IET Softw. 2022, 16, 1–13. [Google Scholar] [CrossRef]
- Chen, Y.; Li, H. DAM: Transformer-based relation detection for Question Answering over Knowledge Base. Knowl. Based Syst. 2020, 201, 106077. [Google Scholar] [CrossRef]
- Wu, H.; Cheng, S.; Wang, Z.; Zhang, S.; Yuan, F. Multi-task learning based on question-answering style reviews for aspect category classification and aspect term extraction on GPU clusters. Clust. Comput. 2020, 23, 1973–1986. [Google Scholar] [CrossRef]
- Wu, C.; Luo, G.; Guo, C.; Ren, Y.; Zheng, A.; Yang, C. An attention-based multi-task model for named entity recognition and intent analysis of Chinese online medical questions. J. Biomed. Inform. 2020, 108, 103511. [Google Scholar] [CrossRef]
- Yang, M.; Tu, W.; Qu, Q.; Zhou, W.; Liu, Q.; Zhu, J. Advanced community question answering by leveraging external knowledge and multi-task learning. Knowl. Based Syst. 2019, 171, 106–119. [Google Scholar] [CrossRef]
- Cheng, L.; Xie, F.; Ren, J. KB-QA based on multi-task learning and negative sample generation. Inf. Sci. 2021, 574, 349–362. [Google Scholar] [CrossRef]
- Hamon, T.; Grabar, N.; Mougin, F. Querying biomedical linked data with natural language questions. Semant. Web 2017, 8, 581–599. [Google Scholar] [CrossRef] [Green Version]
- Demner-Fushman, D.; Mrabet, Y.; Ben Abacha, A. Consumer health information and question answering: Helping consumers find answers to their health-related information needs. J. Am. Med. Inform. Assoc. 2020, 27, 194–201. [Google Scholar] [CrossRef]
- Zhao, W.; Liu, J. Application of Knowledge Map Based on BiLSTM-CRF Algorithm Model in Ideological and Political Education Question Answering System. Mob. Inf. Syst. 2022, 2022, 4139323. [Google Scholar] [CrossRef]
- Wu, C.; Li, X.; Guo, Y.; Wang, J.; Ren, Z.; Wang, M.; Yang, Z. Natural language processing for smart construction: Current status and future directions. Autom. Constr. 2022, 134, 104059. [Google Scholar] [CrossRef]
- Zhong, B.; He, W.; Huang, Z.; Love, P.E.D.; Tang, J.; Luo, H. A building regulation question answering system: A deep learning methodology. Adv. Eng. Inform. 2020, 46, 101195. [Google Scholar] [CrossRef]
- Li, T.; Alipour, M.; Harris, D.K. Mapping textual descriptions to condition ratings to assist bridge inspection and condition assessment using hierarchical attention. Autom. Constr. 2021, 129, 103801. [Google Scholar] [CrossRef]
- Xia, Y.; Lei, X.; Wang, P.; Sun, L. A data-driven approach for regional bridge condition assessment using inspection reports. Struct. Control. Health Monit. 2022, 29, e2915. [Google Scholar] [CrossRef]
- Li, R.; Mo, T.; Yang, J.; Jiang, S.; Li, T.; Liu, Y. Ontologies-based domain knowledge modeling and heterogeneous sensor data integration for bridge health monitoring systems. IEEE Trans. Ind. Inform. 2020, 17, 321–332. [Google Scholar] [CrossRef]
- Li, R.; Mo, T.; Yang, J.; Li, D.; Jiang, S.; Wang, D. Bridge inspection named entity recognition via BERT and lexicon augmented machine reading comprehension neural model. Adv. Eng. Inform. 2021, 50, 101416. [Google Scholar] [CrossRef]
- Li, R.; Li, D.; Yang, J.; Xiang, F.; Ren, H.; Jiang, S.; Zhang, L. Joint extraction of entities and relations via an entity correlated attention neural model. Inf. Sci. 2021, 581, 179–193. [Google Scholar] [CrossRef]
- Yang, J.; Xiang, F.; Li, R.; Zhang, L.; Yang, X.; Jiang, S.; Zhang, H.; Wang, D.; Liu, X. Intelligent bridge management via big data knowledge engineering. Autom. Constr. 2022, 135, 104118. [Google Scholar] [CrossRef]
Entity Name | Entity Labels | POS Labels | Vocabulary Examples |
---|---|---|---|
Bridge | BRI | nb | Bridge A (A桥) |
Bridge Structure | BST | ns | Bridge deck system (桥面系) |
Structural Element | BSE | ne | Left web plate (左腹板) |
Structural Location | BSL | nl | Within 2 m (2米范围内) |
Structural Defect | BSD | nd | Mesh crack (网状裂缝) |
Technical Condition | BTC | nt | Technical condition (技术状况) |
Entity Types | Training Set | Validation Set | Test Set | Total |
---|---|---|---|---|
BRI | 21,438 | 2680 | 2731 | 26,849 |
BST | 14,432 | 1961 | 2248 | 18,641 |
BSE | 2073 | 301 | 198 | 2572 |
BSL | 1483 | 189 | 257 | 1929 |
BSD | 5124 | 682 | 776 | 6582 |
BTC | 10,940 | 1504 | 1325 | 13,769 |
Coarse-Grained | Fine-Grained | Number | Total |
---|---|---|---|
基础信息 (Basic information) | 桥梁信息 (Bridge information) | 3018 | 6283 |
结构信息 (Structural information) | 3265 | ||
损伤缺陷 (Damage defect) | 桥梁缺损 (Bridge defect) | 126 | 11,400 |
结构缺损 (Structural damage) | 4047 | ||
缺损描述 (Defect description) | 3970 | ||
缺损属性 (Defect properties) | 673 | ||
缺损位置 (Defect location) | 1056 | ||
共同缺损 (Common defect) | 1528 | ||
技术状况 (Technical condition) | 桥梁技术状况 (Bridge technical condition) | 504 | 9115 |
结构技术状况 (Structural technical condition) | 8611 |
Complex Types | QA Pairs Examples |
---|---|
Multiple-hops | Q: What is the technical condition level of the bridge deck system of Bridge A? (A桥桥面系的技术状况等级为多少?) A: The technical condition level of the bridge deck system of Bridge A is level 3. (A桥桥面系的技术状况等级为3类.) |
Numerical calculation | Q: How many defects exist in Bridge D? (D桥存在几种缺陷?) A: There are seven defects in Bridge D. (D桥有7种缺陷.) |
Judgment | Q: Are Bridge B and Bridge C the same technical condition level? (A桥和B桥的技术状况等级一样吗?) A: Bridge B and Bridge C are the same technical condition level. (A桥和B桥的技术状况等级一样.) |
Constraint | Q: What are the bridges with technical condition level 1? (技术状况等级为1类的桥有哪些?) A: The bridges with technical condition level 1 are Bridge E, Bridge J, and Bridge I. (技术状况等级为1类的桥有E桥, J桥和I桥.) |
Aggregation operations | Q: What defects are common to Bridge D and Bridge F? (D桥和F桥存在哪些共同缺损?) A: The common defects are shear deformation and longitudinal cracks. (D桥和F桥的共同缺损有剪切变形和纵向裂缝.) |
Parameters | TEE Task | QC Task |
---|---|---|
Pretrained model | Bert_base_Chinese | Bert_base_Chinese |
Optimizer | AdamW | AdamW |
Batch size | 64 | 64 |
Learning rate | 0.00005 | 0.00005 |
Word vector size | 100 | 100 |
BiLSTM_hidden_size | 128 | 128 |
Epoches | 30 | 15 |
Models | |||
---|---|---|---|
CNN-CRF | 90.32 | 91.47 | 90.89 |
BiLSTM-CRF | 92.51 | 91.29 | 91.90 |
BERT-CRF | 94.25 | 93.63 | 93.94 |
Our model | 93.86 | 95.68 | 94.76 |
Models | ||||
---|---|---|---|---|
TextRNN_Att | 98.02 | 97.97 | 94.81 | 95.87 |
TextRCNN | 98.26 | 96.67 | 96.94 | 96.79 |
ERNIE | 99.14 | 98.35 | 98.71 | 98.51 |
BERT | 99.18 | 98.40 | 98.94 | 98.66 |
BERT_CNN | 99.10 | 98.33 | 98.87 | 98.59 |
BERT_RNN | 99.12 | 98.19 | 99.03 | 98.57 |
Our model | 99.26 | 98.68 | 99.01 | 98.84 |
Question Types | |||
---|---|---|---|
Basic information | 92.32 ± 0.02 | 87.9 ± 0.03 | 90.06 ± 0.03 |
Damage defect | 82.57 ± 0.03 | 78.89 ± 0.03 | 80.69 ± 0.05 |
Technical condition | 95.24 ± 0.02 | 92.65 ± 0.01 | 93.93 ± 0.02 |
Overall evaluation | 91.17 ± 0.02 | 88.93 ± 0.01 | 90.04 ± 0.02 |
Models | |||
---|---|---|---|
BERT-Similarity | 67.93 | 66.25 | 67.08 |
BERT-BiLSTM-Similarity | 72.31 | 70.89 | 71.59 |
BERT-BiLSTM-Attention | 75.58 | 75.04 | 75.31 |
Our Model | 91.17 | 88.93 | 90.04 |
Task Types | TEE | QC | C-KBQA | ||||
---|---|---|---|---|---|---|---|
F1 (%) | F1 (%) | F1 (%) | |||||
Single-Task | 94.65 | 93.32 | 93.98 | 98.21 | 98.89 | 98.54 | 88.32 |
+POS emb | 94.79 | 93.23 | 94.00 | 98.16 | 98.97 | 98.66 | 89.43 |
+Entity.emb | / | / | / | 98.48 | 99.07 | 98.77 | / |
+Q-Type.emb | 94.95 | 93.87 | 94.41 | / | / | / | / |
Our model | 94.89 | 94.64 | 94.76 | 98.68 | 99.01 | 98.84 | 90.04 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Yang, X.; Yang, J.; Li, R.; Li, H.; Zhang, H.; Zhang, Y. Complex Knowledge Base Question Answering for Intelligent Bridge Management Based on Multi-Task Learning and Cross-Task Constraints. Entropy 2022, 24, 1805. https://doi.org/10.3390/e24121805
Yang X, Yang J, Li R, Li H, Zhang H, Zhang Y. Complex Knowledge Base Question Answering for Intelligent Bridge Management Based on Multi-Task Learning and Cross-Task Constraints. Entropy. 2022; 24(12):1805. https://doi.org/10.3390/e24121805
Chicago/Turabian StyleYang, Xiaoxia, Jianxi Yang, Ren Li, Hao Li, Hongyi Zhang, and Yue Zhang. 2022. "Complex Knowledge Base Question Answering for Intelligent Bridge Management Based on Multi-Task Learning and Cross-Task Constraints" Entropy 24, no. 12: 1805. https://doi.org/10.3390/e24121805
APA StyleYang, X., Yang, J., Li, R., Li, H., Zhang, H., & Zhang, Y. (2022). Complex Knowledge Base Question Answering for Intelligent Bridge Management Based on Multi-Task Learning and Cross-Task Constraints. Entropy, 24(12), 1805. https://doi.org/10.3390/e24121805