An Interpretable Fake News Detection Method Based on Commonsense Knowledge Graph
Abstract
:1. Introduction
- A method for extracting commonsense knowledge contained in news is proposed. This method is based on the Lexical Analysis of Chinese (LAC) framework [12] and the Universal Information Extraction (UIE) framework [13]. The extracted knowledge serves as the basis for the subsequent use of graph matching algorithms for fake news detection.
- An interpretable model for fake news detection is proposed based on the random walks graph matching algorithm. Using this algorithm, the commonsense knowledge embedded in the news is compared with the relevant knowledge in the knowledge graph. The truth or falsity of the news is determined by comparing the conflicts of entities and attributes among the knowledge triads in the matching results. This approach leads to an interpretable fake news detection method.
- The proposed method is evaluated on a fake news dataset containing commonsense errors. The experimental results demonstrate that our method outperforms the baseline method.
2. Related Work
2.1. Graph Matching
2.2. Knowledge Graph
3. Extraction of Commonsense Knowledge
3.1. Bi-GRU Based Entity Extraction
- The label of the first character of a sentence cannot be an I-tag.
- The previous tag of each I-tag can only be a B-tag or an I-tag. For example, the tag before “LOC-I” can only be “LOC-B” or “LOC-I”.
3.2. Template-Based Knowledge Extraction
3.2.1. Knowledge Extraction Based on a Relational Word Dictionary
3.2.2. Knowledge Extraction Algorithm Process
3.3. Knowledge Extraction Based on UIE
4. Fake News Identification Based on Random Walk Graph Matching
4.1. Principle of Random Wandering Graph Matching Algorithm
4.2. Algorithm Implementation
4.2.1. Overall Algorithm Implementation
4.2.2. Extraction of Commonsense Subgraph
- All the nodes in the extracted knowledge graph are treated as entities, the relationships of the entity nodes are queried in the general knowledge graph, and all the relationships and the nodes involved are recorded.
- The nodes and relationships of records are deduplicated.
- The entity and attribute nodes with the same value are merged, as is the relationship of two nodes with the same value.
- The processed nodes and relations are constructed as a commonsense knowledge subgraph.
4.2.3. Construction of Similarity Matrix
4.2.4. Reasoning Explanation of Identification Results
5. Experiment and Analysis
5.1. Experimental Result Comparison Analysis
5.2. Experimental Result Comparison Analysis
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Barakat, K.A.; Dabbous, A.; Tarhini, A. An empirical approach to understanding users’ fake news identification on social media. Online Inf. Rev. 2021, 45, 1080–1096. [Google Scholar]
- Arafeh, M.; El Barachi, M.; Mourad, A.; Belqasmi, F. A blockchain based architecture for the detection of fake sensing in mobile crowdsensing. In Proceedings of the 2019 4th International Conference on Smart and Sustainable Technologies (SpliTech), Split, Croatia, 18–21 June 2019; pp. 1–6. [Google Scholar]
- Melki, J.; Tamim, H.; Hadid, D.; Makki, M.; El Amine, J.; Hitti, E. Mitigating infodemics: The relationship between news exposure and trust and belief in COVID-19 fake news and social media spreading. PLoS ONE 2021, 16, e0252830. [Google Scholar] [CrossRef] [PubMed]
- Hamza, A.; Javed, A.R.R.; Iqbal, F.; Kryvinska, N.; Almadhor, A.S.; Jalil, Z.; Borghol, R. Deepfake Audio Detection via MFCC Features Using Machine Learning. IEEE Access 2022, 10, 134018–134028. [Google Scholar] [CrossRef]
- Mahdi, A.; Farah, M.F.; Ramadan, Z. What to believe, whom to blame, and when to share: Exploring the fake news experience in the marketing context. J. Consum. Mark. 2022, 39, 306–316. [Google Scholar] [CrossRef]
- Zhang, H.; Fang, Q.; Qian, S.; Xu, C. Multi-modal knowledge-aware event memory network for social media rumor detection. In Proceedings of the 27th ACM international conference on multimedia, Nice, France, 21–25 October 2019; pp. 1942–1951. [Google Scholar]
- Dun, Y.; Tu, K.; Chen, C.; Hou, C.; Yuan, X. Kan: Knowledge-aware attention network for fake news detection. In Proceedings of the AAAI Conference on Artificial Intelligence, Virtually, Palo Alto, CA, USA, 2–9 February 2021; Volume 35, pp. 81–89. [Google Scholar]
- Wang, Y.; Qian, S.; Hu, J.; Fang, Q.; Xu, C. Fake news detection via knowledge-driven multimodal graph convolutional networks. In Proceedings of the 2020 International Conference on Multimedia Retrieval, Dublin, Ireland, 8–11 June 2020; pp. 540–547. [Google Scholar]
- Cho, M.; Lee, J.; Lee, K.M. Reweighted random walks for graph matching. In Proceedings of the Computer Vision–ECCV 2010: 11th European Conference on Computer Vision, Heraklion, Crete, Greece, 5–11 September 2010. [Google Scholar]
- Leordeanu, M.; Hebert, M. A spectral technique for correspondence problems using pairwise constraints. In Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV’05) Volume 1, Beijing, China, 17–21 October 2005; Volume 2, pp. 1482–1489. [Google Scholar]
- Cour, T.; Srinivasan, P.; Shi, J. Balanced graph matching. Adv. Neural Inf. Process. Syst. 2006, 19, 313–320. [Google Scholar]
- Jiao, Z.; Sun, S.; Sun, K. Chinese lexical analysis with deep bi-gru-crf network. arXiv 2018, arXiv:1807.01882. [Google Scholar]
- Lu, Y.; Liu, Q.; Dai, D.; Xiao, X.; Lin, H.; Han, X.; Sun, L.; Wu, H. Unified structure generation for universal information extraction. arXiv 2022, arXiv:2203.12277. [Google Scholar]
- Kang, U.; Hebert, M.; Park, S. Fast and scalable approximate spectral graph matching for correspondence problems. Inf. Sci. 2013, 220, 306–318. [Google Scholar] [CrossRef]
- Torr, P.H. Solving markov random fields using semi definite programming. In Proceedings of the International Workshop on Artificial Intelligence and Statistics, Key West, FL, USA, 3–6 January 2003; pp. 292–299. [Google Scholar]
- Schellewald, C.; Schnrr, C. Probabilistic subgraph matching based on convex relaxation. In Proceedings of the Energy Minimization Methods in Computer Vision and Pattern Recognition: 5th International Workshop, EMMCVPR 2005, St. Augustine, FL, USA, 9–11 November 2005; pp. 171–186. [Google Scholar]
- Egozi, A.; Keller, Y.; Guterman, H. A probabilistic approach to spectral graph matching. IEEE Trans. Pattern Anal. Mach. Intell. 2013, 35, 1798–1828. [Google Scholar] [CrossRef] [PubMed]
- Gori, M.; Maggini, M.; Sarti, L. Exact and approximate graph matching using random walks. IEEE Trans. Pattern Anal. Mach. Intell. 2005, 27, 1100–1111. [Google Scholar] [CrossRef] [PubMed]
- Zass, R.; Shashua, A. Probabilistic graph and hypergraph matching. In Proceedings of the 2008 IEEE Conference on Computer Vision and Pattern Recognition, Anchorage, AK, USA, 23–28 June 2008; pp. 1–8. [Google Scholar]
- Wang, H.; Zhang, F.; Xie, X.; Guo, M. DKN: Deep knowledge-aware network for news recommendation. In Proceedings of the 2018 World Wide Web Conference, Lyon, France, 23–27 April 2018; pp. 1835–1844. [Google Scholar]
- Zhong, W.; Xu, J.; Tang, D.; Xu, Z.; Duan, N.; Zhou, M.; Wang, J.; Yin, J. Reasoning Over Semantic-Level Graph for Fact Checking. In ACL; Association for Computational Linguistics: Cedarville, OH, USA, 2020; pp. 6170–6180. [Google Scholar]
- Shi, X. Automatic Commonsense Knowledge Base Construction and Completion for Chinese. Master’s Thesis, East China Normal University, Shanghai, China, 2022. [Google Scholar]
- Grishman, R. Twenty-five years of information extraction. Nat. Lang. Eng. 2019, 25, 677–692. [Google Scholar] [CrossRef]
- Socher, R.; Perelygin, A.; Wu, J.; Chuang, J.; Manning, C.D.; Ng, A.Y.; Potts, C. Recursive deep models for semantic compositionality over a sentiment treebank. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing, Seattle, WA, USA, 18–21 October 2013; pp. 1631–1642. [Google Scholar]
- Sak, H.; Senior, A.; Beaufays, F. Long short-term memory recurrent neural network architectures for large scale acoustic modeling. Interspeech 2014, 338–342. [Google Scholar] [CrossRef]
- Chung, J.; Gulcehre, C.; Cho, K.; Bengio, Y. Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv 2014, arXiv:1412.3555. [Google Scholar]
- Lafferty, J.; McCallum, A.; Pereira, F.C. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of the 18th International Conference on Machine Learning 2001, Williamstown, MA, USA, 28 June–1 July 2001. [Google Scholar]
- Bengio, Y.; Ducharme, R.; Vincent, P. A neural probabilistic language model. Adv. Neural Inf. Process. Syst. 2000, 13, 1137–1155. [Google Scholar]
- Yan, J. Algorithmic Studies and Design on Graph Matching. Ph.D. Thesis, Shanghai Jiao Tong University, Shanghai, China, 2015. [Google Scholar]
- Zhou, L. Research and Implementation of Commonsense Reasoning Technology Based on Path Mining. Master’s Thesis, University of Electronic Science and Technology of China, Chengdu, China, 2022. [Google Scholar]
- Conte, D.; Foggia, P.; Sansone, C.; Vento, M. Thirty years of graph matching in pattern recognition. Int. J. Pattern Recognit. Artif. Intell. 2004, 18, 265–298. [Google Scholar] [CrossRef]
- Xu, X. Random Walk Learning on Graph. Ph.D. Thesis, Nanjing University of Aeronautics and Astronautics, Nanjing, China, 2008. [Google Scholar]
- Wang, Y. Research and Application of Random Walk Algorithm Based on Distance. Master’s Thesis, Beijing Jiao Tong University, Beijing, China, 2012. [Google Scholar]
- Li, Y.; Gao, D. Research on Entities Similarity Calculation in Knowledge Graph. J. Chin. Inf. Process. 2017, 31, 140–146+154. [Google Scholar]
Matching Algorithm | RRWM | SM | SMAC |
---|---|---|---|
Accuracy | 91.07% | 82.14% | 78.57% |
Precision | 86.36% | 76.47% | 68.75% |
Recall | 90.48% | 68.42% | 61.11% |
Method | Accuracy | Precision | Recall |
---|---|---|---|
SVM-TS | 63.12% | 63.29% | 63.01% |
CNN | 71.12% | 71.30% | 71.12% |
GRU | 79.27% | 81.39% | 79.27% |
RRWM | 91.07% | 85.00% | 89.47% |
Model | Accuracy | Precision | Recall |
---|---|---|---|
No semantic matching model | 80.36% | 83.33% | 65.22% |
Knowledge extraction with semantic matching | 83.93% | 84.21% | 72.73% |
Graph matching with semantic matching | 87.50% | 85.71% | 81.82% |
Semantic matching is added to both parts | 91.07% | 86.36% | 90.48% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Gao, X.; Chen, W.; Lu, L.; Cui, Y.; Dai, X.; Dai, L.; Wang, K.; Shen, J.; Wang, Y.; Wang, S.; et al. An Interpretable Fake News Detection Method Based on Commonsense Knowledge Graph. Appl. Sci. 2023, 13, 6680. https://doi.org/10.3390/app13116680
Gao X, Chen W, Lu L, Cui Y, Dai X, Dai L, Wang K, Shen J, Wang Y, Wang S, et al. An Interpretable Fake News Detection Method Based on Commonsense Knowledge Graph. Applied Sciences. 2023; 13(11):6680. https://doi.org/10.3390/app13116680
Chicago/Turabian StyleGao, Xiang, Weiqing Chen, Liangyu Lu, Ying Cui, Xiang Dai, Lican Dai, Kan Wang, Jing Shen, Yue Wang, Shengze Wang, and et al. 2023. "An Interpretable Fake News Detection Method Based on Commonsense Knowledge Graph" Applied Sciences 13, no. 11: 6680. https://doi.org/10.3390/app13116680
APA StyleGao, X., Chen, W., Lu, L., Cui, Y., Dai, X., Dai, L., Wang, K., Shen, J., Wang, Y., Wang, S., Yu, Z., & Liu, H. (2023). An Interpretable Fake News Detection Method Based on Commonsense Knowledge Graph. Applied Sciences, 13(11), 6680. https://doi.org/10.3390/app13116680