Abstract
In this paper, we outline the results of our recent research on terminology saturation analysis (TSA) in subject-domain-bounded textual corpora. We present the TSA method that we developed and report on two use cases that proved its validity, efficiency, and effectiveness. Based on our experience with TSA, we analyse the shortcomings of the method and identify ways to refine and improve it. Finally, we share our prognoses on how TSA could be used for: (i) generating quality datasets of minimal size for training large language models to perform better in scientific domains; (ii) iteratively constructing domain ontologies and knowledge graphs that representatively describe a subject domain or topic; or (iii) detecting and predicting events based on the TSA of textual data streams.
Publisher
Springer Nature Switzerland