Affiliation:
1. Vytautas Magnus University, Lithuania
2. Mykolas Romeris University, Lithuania
Abstract
The paper presents the results of research on deep learning methods aiming to determine the most effective one for automatic extraction of Lithuanian terms from a specialized domain (cybersecurity) with very restricted resources. A semi-supervised approach to deep learning was chosen for the research as Lithuanian is a less resourced language and large amounts of data, necessary for unsupervised methods, are not available in the selected domain. The findings of the research show that Bi-LSTM network with Bidirectional Encoder Representations from Transformers (BERT) can achieve close to state-of-the-art results.
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Lithuanian-English Cybersecurity Termbase;Rasprave Instituta za hrvatski jezik i jezikoslovlje;2023
2. Lexical ambiguity detection in professional discourse;Information Processing & Management;2022-09
3. Tagging terms in text;Terminology;2022-01-10