Metamorphic Testing of Relation Extraction Models
Abstract
:1. Introduction
- We propose applying MT to RE models. Our method can evaluate RE models without using ground truth labels.
- We design eight MRs for RE, each of which focuses on one specific property of the RE task. These MRs support the application of MT in RE and also contribute to the investigation and understanding of the characteristics of RE models.
- We conduct experiments on three RE models, demonstrating the feasibility and effectiveness of MT in evaluating RE models. Further analysis of the experimental results reveals the characteristics of the subject RE models and also uncovers typical issues for RE models.
2. Metamorphic Testing
3. Approach
3.1. Applying Metamorphic Testing to RE
3.2. Metamorphic Relations for RE
3.2.1. MRs with the Replacement Operation
- MR-R1: Replacement with the same type of entity.
- MR-R2: Replacement with the coarser-grained type of entity.
- MR-R3: Replacement with co-related entities having different entity types.
- MR-R4: Replacement with coreferential entities.
3.2.2. MRs with the Swap Operation
- MR-R1: Swap entities with symmetrical relations.
- MR-R2: Swap entities with antisymmetric relations.
3.2.3. MRs with the Combination Operation on Multiple Source Inputs
- MR-R1: Combining two pairs of entities sharing the same head entity.
- MR-R2: Combining two pairs of entities sharing the same tail entity.
4. Experimental Set-Up
4.1. Subject RE Models
4.2. Data Source
4.3. Construction of MGs
5. Results and Analysis
5.1. Overall Results of MT
5.2. Performance Comparison of Subject RE Models
5.3. Analysis of the Detected Failure Examples
- (1)
- First, for the MR-R with the highest VR values, we observed considerable prediction failures. As shown in the first row of Table 7, we were surprised to find that NCB gave inconsistent prediction results when the sentences and entities entered into the model were identical but only had the label of the entity type name changed. This further confirms the issue of NCB’s heavy reliance on entity type information mentioned in the previous section.
- (2)
- The second type of prediction failure was revealed by violating MR-R. An illustrative example is shown in the second row of Table 7. For substitutions between two entities, namely “Catholic” and “Muslim” with the same entity type RELIGION but different entity mentions (names), the RE model predicted different relations.In addition, we found that most of the prediction failure cases in LUKE and BERTEM+ MTB were due to replacing the original entity with entities that appeared less frequently in the TACRED dataset. (This issue was not exhibited in NCB because entities of the same type were represented in the same form after entity masking.) From this, we speculate that RE models may overfit the training samples and thus only pass part of the test cases where entities appear frequently in the training samples .Therefore, we conducted further experiments on LUKE and BERTEM+MTB. We divided the entities into two categories according to the frequency of the entity vocabulary (names) in the training samples of the dataset, one of which was high-frequency entity vocabulary while the other was low-frequency entity vocabulary. Then, we limited the replacement objects in MR-R to these two different categories of entities; that is, we randomly selected one of the high-frequency entity vocabulary and the other of the low-frequency entity vocabulary to replace it. It was found that the model performance varied greatly, as shown in Table 8. Substitutions from high-frequency entity words yielded a low average VR value (VR = 16.4%), while substitutions from low-frequency entity words showed a high average VR value (VR = 62.5%). These variations indicate that the model was overfitting the training samples to some extent.
- (3)
- Based on violations of MR-R, the failures of RE related to handling coreferential entities were detected. In the third row of Table 7, the entities “Dunne” and “his” have the same referential meaning. However, the RE model predicted a “per:other_family” relation between the entities “Joan Didion” and “his” but a “per:spouse” relation between “Joan Didion” and “Dunne”. This reveals a prediction failure: replacing the original entity with its coreferential entity should not affect the relation between the entities.
- (4)
- Based on violations of MR-S, the failures of RE in the face of an entity order swap in antisymmetric relations were detected. As shown in the fourth row of Table 7, the RE model predicted an antisymmetric relation “per:parents” between the entities “Lynne” and “Jamie Lynn”. After exchanging the head and tail entities, it failed to successfully predict the opposite relation, namely per:children.
- (5)
- Violations of MR-C revealed another type of failure. The fifth row of Table 7 shows that the RE model predicted that the entity “Madagascan Football Federation” was an alias for the entity “FMF”, and there was a “per:employee_of” relation between the entities “Madagascan Football Federation” and “Ahmad”. However, the RE model failed to capture the “per:employee_of” relation between the entities “FMF” and “Ahmad”.
5.4. Findings
- (1)
- RE models are more sensitive to changes in entity type than changes in entity mentions (names). Entity mentions (names) and entity types are important pieces of information for entities. Peng et al. [11] reported that RE models may improve model performance with some cues that entity mentions exhibit, while Rosenman et al. [36] also observed that RE models expose shallow heuristics in the type of candidate arguments. In this study, two MRs, MR-R and MR-R, focused on the entity mentions and entity types of RE, respectively. As can be seen from Figure 3, the performances of the three RE models on MR-R varied, but they all showed the highest VR value on MR-R, which indicates that the failure caused by the change in entity type information was widely revealed in all three models. The RE models showed poor robustness when facing changes in entity type information. From this perspective, our findings are consistent with existing observations that RE models suffer from overly dependence on entity types [12,36].
- (2)
- Compared with entity exchange in symmetric relations, RE models are more sensitive to the changes in entity order in antisymmetric relations. In this study, the two MRs, namely MR-S and MR-S, applied entity swap operations to symmetric and antisymmetric relations, respectively. To reveal whether symmetric and antisymmetric relations were more affected by changes in entity order, we examined the VR values of individual MRs of MR-S and MR-S. The results are depicted in Figure 3. It was found that for every RE model, the VR value of MR-S was lower than that of MR-S. These results indicate that RE models are more sensitive to the changes in entity order in antisymmetric relations than in symmetric relations. In other words, entity order perturbations in antisymmetric relations are more likely to incur prediction failures.
5.5. Discussion
6. Threats to Validity
7. Related Works
7.1. Applications of MT
7.2. Evaluation of RE Models
8. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Yu, H.; Li, H.; Mao, D.; Cai, Q. A relationship extraction method for domain knowledge graph construction. World Wide Web 2020, 23, 735–753. [Google Scholar] [CrossRef]
- Diefenbach, D.; Lopez, V.; Singh, K.; Maret, P. Core techniques of question answering systems over knowledge bases: A survey. Knowl. Inf. Syst. 2018, 55, 529–569. [Google Scholar] [CrossRef]
- Sharma, D.; Shukla, R.; Giri, A.K.; Kumar, S. A brief review on search engine optimization. In Proceedings of the 2019 9th International Conference on Cloud Computing, Data Science & Engineering (Confluence), Noida, India, 10–11 January 2019; pp. 687–692. [Google Scholar]
- Zad, S.; Heidari, M.; Jones, J.H.; Uzuner, O. A Survey on Concept-Level Sentiment Analysis Techniques of Textual Data. In Proceedings of the 2021 IEEE World AI IoT Congress (AIIoT), Seattle, WA, USA, 10–13 May 2021; pp. 0285–0291. [Google Scholar] [CrossRef]
- Bharti, S.K.; Babu, K.S. Automatic keyword extraction for text summarization: A survey. arXiv 2017, arXiv:1704.03242. [Google Scholar]
- Zhang, Y.; Zhong, V.; Chen, D.; Angeli, G.; Manning, C.D. Position-aware attention and supervised data improve slot filling. In Proceedings of the Conference on Empirical Methods in Natural Language Processing, Copenhagen, Denmark, 7–11 September 2017. [Google Scholar]
- Peters, M.E.; Neumann, M.; Logan IV, R.L.; Schwartz, R.; Joshi, V.; Singh, S.; Smith, N.A. Knowledge enhanced contextual word representations. arXiv 2019, arXiv:1909.04164. [Google Scholar]
- Yamada, I.; Asai, A.; Shindo, H.; Takeda, H.; Matsumoto, Y. LUKE: Deep contextualized entity representations with entity-aware self-attention. arXiv 2020, arXiv:2010.01057. [Google Scholar]
- Han, X.; Gao, T.; Lin, Y.; Peng, H.; Yang, Y.; Xiao, C.; Liu, Z.; Li, P.; Sun, M.; Zhou, J. More data, more relations, more context and more openness: A review and outlook for relation extraction. arXiv 2020, arXiv:2004.03186. [Google Scholar]
- Wang, Y.; Chen, M.; Zhou, W.; Cai, Y.; Liang, Y.; Liu, D.; Yang, B.; Liu, J.; Hooi, B. Should We Rely on Entity Mentions for Relation Extraction? Debiasing Relation Extraction with Counterfactual Analysis. arXiv 2022, arXiv:2205.03784. [Google Scholar]
- Peng, H.; Gao, T.; Han, X.; Lin, Y.; Li, P.; Liu, Z.; Sun, M.; Zhou, J. Learning from context or names? An empirical study on neural relation extraction. arXiv 2020, arXiv:2010.01923. [Google Scholar]
- Brody, S.; Wu, S.; Benton, A. Towards Realistic Few-Shot Relation Extraction. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, Online, 7–11 November 2021; pp. 5338–5345. [Google Scholar]
- Alt, C.; Gabryszak, A.; Hennig, L. TACRED revisited: A thorough evaluation of the TACRED relation extraction task. arXiv 2020, arXiv:2004.14855. [Google Scholar]
- Bassignana, E.; Plank, B. What Do You Mean by Relation Extraction? A Survey on Datasets and Study on Scientific Relation Classification. arXiv 2022, arXiv:2204.13516. [Google Scholar]
- Barr, E.T.; Harman, M.; McMinn, P.; Shahbaz, M.; Yoo, S. The oracle problem in software testing: A survey. IEEE Trans. Softw. Eng. 2014, 41, 507–525. [Google Scholar] [CrossRef]
- Chen, T.Y.; Kuo, F.C.; Liu, H.; Poon, P.L.; Towey, D.; Tse, T.; Zhou, Z.Q. Metamorphic testing: A review of challenges and opportunities. ACM Comput. Surv. CSUR 2018, 51, 1–27. [Google Scholar] [CrossRef]
- Segura, S.; Towey, D.; Zhou, Z.Q.; Chen, T.Y. Metamorphic testing: Testing the untestable. IEEE Softw. 2018, 37, 46–53. [Google Scholar] [CrossRef]
- Chen, T.Y.; Cheung, S.C.; Yiu, S.M. Metamorphic testing: A new approach for generating next test cases. arXiv 2020, arXiv:2002.12543. [Google Scholar]
- Zhou, Z.Q.; Xiang, S.; Chen, T.Y. Metamorphic testing for software quality assessment: A study of search engines. IEEE Trans. Softw. Eng. 2015, 42, 264–284. [Google Scholar] [CrossRef]
- Zhou, Z.Q.; Sun, L.; Chen, T.Y.; Towey, D. Metamorphic relations for enhancing system understanding and use. IEEE Trans. Softw. Eng. 2018, 46, 1120–1154. [Google Scholar] [CrossRef]
- Xie, X.; Wong, W.E.; Chen, T.Y.; Xu, B. Metamorphic slice: An application in spectrum-based fault localization. Inf. Softw. Technol. 2013, 55, 866–879. [Google Scholar] [CrossRef]
- Jiang, M.; Chen, T.Y.; Zhou, Z.Q.; Ding, Z. Input test suites for program repair: A novel construction method based on metamorphic relations. IEEE Trans. Reliab. 2020, 70, 285–303. [Google Scholar] [CrossRef]
- Chen, T.Y.; Tse, T.; Zhou, Z.Q. Semi-proving: An integrated method for program proving, testing, and debugging. IEEE Trans. Softw. Eng. 2010, 37, 109–125. [Google Scholar] [CrossRef]
- Yuan, Y.; Wang, S.; Jiang, M.; Chen, T.Y. Perception matters: Detecting perception failures of vqa models using metamorphic testing. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA, 20–25 June 2021; pp. 16908–16917. [Google Scholar]
- Ribeiro, M.T.; Wu, T.; Guestrin, C.; Singh, S. Beyond accuracy: Behavioral testing of NLP models with CheckList. arXiv 2020, arXiv:2005.04118. [Google Scholar]
- Tu, K.; Jiang, M.; Ding, Z. A metamorphic testing approach for assessing question answering systems. Mathematics 2021, 9, 726. [Google Scholar] [CrossRef]
- Ma, P.; Wang, S.; Liu, J. Metamorphic Testing and Certified Mitigation of Fairness Violations in NLP Models. In Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence (IJCAI 2020), Yokohama, Japan, 7–15 January 2021; pp. 458–465. [Google Scholar]
- Asyrofi, M.H.; Yang, Z.; Yusuf, I.N.B.; Kang, H.J.; Thung, F.; Lo, D. Biasfinder: Metamorphic test generation to uncover bias for sentiment analysis systems. IEEE Trans. Softw. Eng. 2021, 48, 5087–5101. [Google Scholar] [CrossRef]
- Jiang, M.; Bao, H.; Tu, K.; Zhang, X.Y.; Ding, Z. Evaluating Natural Language Inference Models: A Metamorphic Testing Approach. In Proceedings of the 2021 IEEE 32nd International Symposium on Software Reliability Engineering (ISSRE), Wuhan, China, 25–28 October 2021; pp. 220–230. [Google Scholar]
- Clark, K.; Manning, C.D. Improving coreference resolution by learning entity-level distributed representations. arXiv 2016, arXiv:1606.01323. [Google Scholar]
- Hargittai, M.; Hargittai, I. Symmetry through the Eyes of a Chemist; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2009. [Google Scholar]
- Devlin, J.; Chang, M.W.; Lee, K.; Toutanova, K. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv 2018, arXiv:1810.04805. [Google Scholar]
- Soares, L.B.; FitzGerald, N.; Ling, J.; Kwiatkowski, T. Matching the blanks: Distributional similarity for relation learning. arXiv 2019, arXiv:1906.03158. [Google Scholar]
- Zhou, W.; Chen, M. Learning from noisy labels for entity-centric information extraction. arXiv 2021, arXiv:2104.08656. [Google Scholar]
- Stoica, G.; Platanios, E.A.; Póczos, B. Re-TACRED: A New Relation Extraction Dataset. In Proceedings of the 4th Knowledge Representation and Reasoning Meets Machine Learning Workshop (KR2ML 2020), at NeurIPS, Virtual, 11–12 December 2020. [Google Scholar]
- Rosenman, S.; Jacovi, A.; Goldberg, Y. Exposing shallow heuristics of relation extraction models with challenge data. arXiv 2020, arXiv:2010.03656. [Google Scholar]
- Chen, S.; Jin, S.; Xie, X. Validation on machine reading comprehension software without annotated labels: A property-based method. In Proceedings of the 29th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Athens, Greece, 23–28 August 2021. [Google Scholar]
- He, P.; Meister, C.; Su, Z. Structure-invariant testing for machine translation. In Proceedings of the 2020 IEEE/ACM 42nd International Conference on Software Engineering (ICSE), Seoul, Republic of Korea, 27 June 2020–19 July 2020; pp. 961–973. [Google Scholar]
- Segura, S.; Fraser, G.; Sanchez, A.B.; Ruiz-Cortés, A. A survey on metamorphic testing. IEEE Trans. Softw. Eng. 2016, 42, 805–824. [Google Scholar] [CrossRef]
- Jiang, M.; Chen, T.Y.; Kuo, F.C.; Towey, D.; Ding, Z. A metamorphic testing approach for supporting program repair without the need for a test oracle. J. Syst. Softw. 2017, 126, 127–140. [Google Scholar] [CrossRef]
- Segura, S.; Parejo, J.A.; Troya, J.; Ruiz-Cortés, A. Metamorphic testing of RESTful web APIs. IEEE Trans. Softw. Eng. 2017, 44, 1083–1099. [Google Scholar] [CrossRef]
- Segura, S.; Troya, J.; Durán, A.; Ruiz-Cortés, A. Performance metamorphic testing: A proof of concept. Inf. Softw. Technol. 2018, 98, 1–4. [Google Scholar] [CrossRef]
- Tian, Y.; Pei, K.; Jana, S.; Ray, B. Deeptest: Automated testing of deep-neural-network-driven autonomous cars. In Proceedings of the 40th International Conference on Software Engineering, Gothenburg, Sweden, 30 May–1 June 2018; pp. 303–314. [Google Scholar]
- Zhang, M.; Zhang, Y.; Zhang, L.; Liu, C.; Khurshid, S. DeepRoad: GAN-based metamorphic testing and input validation framework for autonomous driving systems. In Proceedings of the 2018 33rd IEEE/ACM International Conference on Automated Software Engineering (ASE), Montpellier, France, 3–7 September 2018; pp. 132–142. [Google Scholar]
- Sun, Z.; Zhang, J.M.; Harman, M.; Papadakis, M.; Zhang, L. Automatic testing and improvement of machine translation. In Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering, Seoul, Republic of Korea, 27 June 2020–19 July 2020; pp. 974–985. [Google Scholar]
- Sun, L.; Zhou, Z.Q. Metamorphic testing for machine translations: MT4MT. In Proceedings of the 2018 25th Australasian Software Engineering Conference (ASWEC), Adelaide, SA, Australia, 26–30 November 2018; pp. 96–100. [Google Scholar]
- Lee, D.T.; Zhou, Z.Q.; Tse, T. Metamorphic robustness testing of Google Translate. In Proceedings of the IEEE/ACM 42nd International Conference on Software Engineering Workshops, Seoul, Republic of Korea, 27 June 2020–19 July 2020; pp. 388–395. [Google Scholar]
- Mao, C.; Yi, X.; Chen, T.Y. Metamorphic Robustness Testing for Recommender Systems: A Case Study. In Proceedings of the 2020 7th International Conference on Dependable Systems and Their Applications (DSA), Xi’an, China, 28–29 November 2020; pp. 331–336. [Google Scholar]
- Taillé, B.; Guigue, V.; Scoutheeten, G.; Gallinari, P. Let’s Stop Incorrect Comparisons in End-to-end Relation Extraction! arXiv 2020, arXiv:2009.10684. [Google Scholar]
- Li, L.; Chen, X.; Ye, H.; Bi, Z.; Deng, S.; Zhang, N.; Chen, H. On robustness and bias analysis of bert-based relation extraction. In Proceedings of the China Conference on Knowledge Graph and Semantic Computing, Guangzhou, China, 4–7 November 2021; Springer: Berlin/Heidelberg, Germany, 2021; pp. 43–59. [Google Scholar]
- Jin, D.; Jin, Z.; Zhou, J.T.; Szolovits, P. Is bert really robust? a strong baseline for natural language attack on text classification and entailment. In Proceedings of the AAAI Conference on Artificial Intelligence, New York, NY, USA, 7–12 February 2020; Volume 34, pp. 8018–8025. [Google Scholar]
- Gardner, M.; Artzi, Y.; Basmova, V.; Berant, J.; Bogin, B.; Chen, S.; Dasigi, P.; Dua, D.; Elazar, Y.; Gottumukkala, A.; et al. Evaluating Models’ Local Decision Boundaries via Contrast Sets. arXiv 2020, arXiv:2004.02709. [Google Scholar]
- Shah, D.; Schwartz, H.A.; Hovy, D. Predictive biases in natural language processing models: A conceptual framework and overview. arXiv 2019, arXiv:1912.11078. [Google Scholar]
MR Description | |||
---|---|---|---|
Category | Name | Transformation Operation | Output Relationship |
Replacement | MR-R | Replacing head (tail) entity with the same type of entity | Identical |
MR-R | Replacing head (tail) entity with the coarser-grained type of entity | Consistent with the transformation of entity granularity | |
MR-R | Replacing head (tail) entity with co-related entity having different entity type | Identical | |
MR-R | Replacing head (tail) entity with its coreferential entity | Identical | |
Swap | MR-S | Swaping head and tail entities in symmetric relations | Identical |
MR-S | Swaping head and tail entities in antisymmetric relations | Opposite | |
Combination | MR-C | Combining two pairs of entities sharing the same head entity on multiple source inputs | Identical with the second source input |
MR-C | Combining two pairs of entities sharing the same tail entity on multiple source inputs | Identical with the second source input |
MR | Source Input | Source Output | Follow-Up Input | Follow-Up Output |
---|---|---|---|---|
MR-R | French filmmaker Claude dies at 80. | per: origin | American filmmaker Claude dies at 80. | per: origin |
MR-R | Richard was born in San Francisco. | per: city_of_birth | Richard was born in the United States. | per: country_of_birth |
MR-R | Jupp, 46, works in the lab. | per: age | Jupp, 46-years-old, works in the lab. | per: age |
MR-R | John is a father, he loves his child Mary. | per: parents | John is a father, he loves his child Mary. | per: parents |
MR-S | Lily is Mary’s sister. | per: siblings | Lily is Mary’s sister. | per: siblings |
MR-S | John is Mary’s father. | per: children | John is Mary’s father. | per: parents |
MR | First Source Input | First Source Output | Second Source Input | Second Source Output | Follow-Up Input | Follow-Up Output |
---|---|---|---|---|---|---|
MR-C | Peterson leaves behind his wife, Kelly, and their daughter Celine. | per:spouse | Peterson leaves behind his wife,Kelly, and their daughter Celine. | per:children | Petersonleaves behind his wife,Kelly, and their daughter Celine. | per:children |
MR-C | Liu Mingkang, chairman of the China Banking Regulatory Commission (CBRC), was the representative of the mainland to host a meeting. | org:alternate_names | Liu Mingkang, chairman of theChina Banking Regulatory Commission(CBRC), was the representativeof the mainland to host a meeting. | per:employee_of | Liu Mingkang chairman of theChina Banking Regulatory Commission(CBRC), was the representative ofthe mainland to host a meeting. | per:employee_of |
Model | Year | Backbone |
---|---|---|
BERTEM+MTB | 2019 | BERT |
LUKE | 2020 | Transformer |
NCB | 2021 | NCB |
MR | No. of MGs |
---|---|
MR-R | 8198 |
MR-R | 4350 |
MR-R | 3866 |
MR-R | 2605 |
MR-S | 2731 |
MR-S | 7584 |
MR-C | 1650 |
MR-C | 1435 |
Total | 32,419 |
RE Model | BERTEM+MTB | LUKE | NCB | Average |
---|---|---|---|---|
MR-R | 34.2 | 28.6 | 4.2 | 25.2 |
MR-R | 26.8 | 19.8 | 15.6 | 20.4 |
MR-R | 39.8 | 39.7 | 49.5 | 42.7 |
MR-R | 20.6 | 18.4 | 19.9 | 19.6 |
MR-S | 10.5 | 5.5 | 9.2 | 8.4 |
MR-S | 20.4 | 17.7 | 26.0 | 21.3 |
MR-C | 24.5 | 19.8 | 28.4 | 24.2 |
MR-C | 26.3 | 18.2 | 30.1 | 24.9 |
Overall | 25.9 | 22.5 | 21.7 | 23.6 |
MR | RE Model | Source Input | Source Output | Follow-Up Input | Follow-Up Output | Correct Result |
---|---|---|---|---|---|---|
MR-R | NCB | UStype:NATIONALITY actress Patricia Neal, winner of both Academy and Tony awards, died at her home... | per: origin | UStype:COUNTRY actress Patricia Neal, winner of both Academy and Tony awards, died at her home... | per:countries_of_ residence | per: origin |
MR-R | BERT + MTB | Alessi said that she was Catholic but that she had long ago lost her illusions. | per: religion | Alessi said that she was Muslim but that she had long ago lost her illusions. | no_relation | per: religion |
MR-R | NCB | Dunne was part of a famous family that also included his brother, novelist and screenwriter John Gregory Dunne; his brother’s wife, author Joan Didion... | per: other_family | Dunne was part of a famous family that also included his brother, novelist and screenwriter John Gregory Dunne; his brother’s wife, author Joan Didion... | per: spouse | per: other_family |
MR-S | LUKE | It is unknown as of now whether or not Britney’s mother Lynne, pregnant sister Jamie Lynn or brother Brian are on their way to LA. | per: parents | It is unknown as of now whether or not Britney’s mother Lynne, pregnant sister Jamie Lynn or brother Brian are on their way to LA. | per: siblings | per: children |
MR-CBERT+ MTB | World soccer chief Joseph Sepp Blatter is expected in Madagascar on Tuesday, the president of the Madagascan Football Federation -LRB- FMF-RRB-Ahmad said on Monday. | org: alternate_names | World soccer chief Joseph Sepp Blatter is expected in Madagascar on Tuesday, the president of the Madagascan Football Federation -LRB- FMF -RRB-Ahmad said on Monday. | no_relation | per: employee_of | |
World soccer chief Joseph Sepp Blatter is expected in Madagascar on Tuesday, the president of the Madagascan Football Federation -LRB- FMF -RRB-Ahmad said on Monday. | per: employee_of |
RE Model | High-Frequency Entity Vocabulary | Low-Frequency Entity Vocabulary |
---|---|---|
BERTEM+MTB | 15.9 | 64.4 |
LUKE | 17.8 | 60.8 |
Average | 16.4 | 62.5 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Sun, Y.; Ding, Z.; Huang, H.; Zou, S.; Jiang, M. Metamorphic Testing of Relation Extraction Models. Algorithms 2023, 16, 102. https://doi.org/10.3390/a16020102
Sun Y, Ding Z, Huang H, Zou S, Jiang M. Metamorphic Testing of Relation Extraction Models. Algorithms. 2023; 16(2):102. https://doi.org/10.3390/a16020102
Chicago/Turabian StyleSun, Yuhe, Zuohua Ding, Hongyun Huang, Senhao Zou, and Mingyue Jiang. 2023. "Metamorphic Testing of Relation Extraction Models" Algorithms 16, no. 2: 102. https://doi.org/10.3390/a16020102
APA StyleSun, Y., Ding, Z., Huang, H., Zou, S., & Jiang, M. (2023). Metamorphic Testing of Relation Extraction Models. Algorithms, 16(2), 102. https://doi.org/10.3390/a16020102