Affiliation:
1. School of Computer Science and Technology, Zhejiang Sci-Tech University, Hangzhou 310018, China
2. Library Multimedia Big Data Center, Zhejiang Sci-Tech University, Hangzhou 310018, China
Abstract
Relation extraction (RE) is a fundamental NLP task that aims to identify relations between some entities regarding a given text. RE forms the basis for many advanced NLP tasks, such as question answering and text summarization, and thus its quality is critical to the relevant downstream applications. However, evaluating the quality of RE models is non-trivial. On the one hand, obtaining ground truth labels for individual test inputs is tedious and even difficult. On the other hand, there is an increasing need to understand the characteristics of RE models in terms of various aspects. To mitigate these issues, this study proposes evaluating RE models by applying metamorphic testing (MT). A total of eight metamorphic relations (MRs) are identified based on three categories of transformation operations, namely replacement, swap, and combination. These MRs encode some expected properties of different aspects of RE. We further apply MT to three popular RE models. Our experiments reveal a large number of prediction failures in the subject RE models, confirming that MT is effective for evaluating RE models. Further analysis of the experimental results reveals the advantages and disadvantages of our subject models and also uncovers some typical issues of RE models.
Funder
National Nature Science Foundation of China
Subject
Computational Mathematics,Computational Theory and Mathematics,Numerical Analysis,Theoretical Computer Science
Reference53 articles.
1. A relationship extraction method for domain knowledge graph construction;Yu;World Wide Web,2020
2. Core techniques of question answering systems over knowledge bases: A survey;Diefenbach;Knowl. Inf. Syst.,2018
3. Sharma, D., Shukla, R., Giri, A.K., and Kumar, S. (2019, January 10–11). A brief review on search engine optimization. Proceedings of the 2019 9th International Conference on Cloud Computing, Data Science & Engineering (Confluence), Noida, India.
4. Zad, S., Heidari, M., Jones, J.H., and Uzuner, O. (2021, January 10–13). A Survey on Concept-Level Sentiment Analysis Techniques of Textual Data. Proceedings of the 2021 IEEE World AI IoT Congress (AIIoT), Seattle, WA, USA.
5. Bharti, S.K., and Babu, K.S. (2017). Automatic keyword extraction for text summarization: A survey. arXiv.