Affiliation:
1. Department of Foreign Languages, R.O.C. Military Academy, Kaohsiung 830, Taiwan
2. Department of Management Sciences, R.O.C. Military Academy, Kaohsiung 830, Taiwan
Abstract
The use of corpus assessment approaches to determine and rank keywords for corpus data is critical due to the issues of information retrieval (IR) in Natural Language Processing (NLP), such as when encountering COVID-19, as it can determine whether people can rapidly obtain knowledge of the disease. The algorithms used for corpus assessment have to consider multiple parameters and integrate individuals’ subjective evaluation information simultaneously to meet real-world needs. However, traditional keyword-list-generating approaches are based on only one parameter (i.e., the keyness value) to determine and rank keywords, which is insufficient. To improve the evaluation benefit of the traditional keyword-list-generating approach, this paper proposed an extended analytic hierarchy process (AHP)-based corpus assessment approach to, firstly, refine the corpus data and then use the AHP method to compute the relative weights of three parameters (keyness, frequency, and range). To verify the proposed approach, this paper adopted 53 COVID-19-related research environmental science research articles from the Web of Science (WOS) as an empirical example. After comparing with the traditional keyword-list-generating approach and the equal weights (EW) method, the significant contributions are: (1) using the machine-based technique to remove function and meaningless words for optimizing the corpus data; (2) being able to consider multiple parameters simultaneously; and (3) being able to integrate the experts’ evaluation results to determine the relative weights of the parameters.
Funder
National Science and Technology Council, Taiwan
Subject
Geometry and Topology,Logic,Mathematical Physics,Algebra and Number Theory,Analysis
Reference49 articles.
1. Anthony, L. (2022, January 01). AntConc (Version 3.5.8), Corpus Software. Available online: https://www.laurenceanthony.net/software/antconc/.
2. Choosing specialized vocabulary to teach with data-driven learning: An example from civil engineering;Otto;Engl. Specif. Purp.,2021
3. A corpus-aided study of stance adverbs in judicial opinions and the implications for English for legal purposes instruction;Poole;Engl. Specif. Purp.,2021
4. Financial contagion during COVID-19 crisis;Akhtaruzzaman;Financ. Res. Lett.,2021
5. Leadership to defeat COVID-19;Antonakis;Group Process Intergroup Relat.,2021
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献