Abstract
Deep neural networks are becoming ubiquitous in text mining and natural language processing, but semantic resources such as taxonomies and ontologies have yet to be fully exploited in a deep learning setting. This paper presents an efficient semantic text mining approach that converts semantic information related to a given set of documents into a set of novel features used for learning. The proposed Semantics-aware Recurrent deep Neural Architecture (SRNA) enables the system to learn simultaneously from the semantic vectors and from the raw text documents. We test the effectiveness of the approach on three text classification tasks: news topic categorization, sentiment analysis, and gender profiling. The experiments show that the proposed approach outperforms the approach without semantic knowledge, with the highest accuracy gain (up to 10%) achieved on short document fragments.
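The idea of turning taxonomy information into document features that sit alongside text-derived features can be illustrated with a minimal sketch. This is not the SRNA model: there is no neural network here, the toy taxonomy `TOY_HYPERNYMS` and every function name are hypothetical, and plain concatenation stands in for the joint learning that the abstract describes.

```python
from collections import Counter

# Hypothetical toy taxonomy (child -> parent); not taken from the paper.
# It must be acyclic, otherwise ancestors() would not terminate.
TOY_HYPERNYMS = {
    "dog": "animal",
    "cat": "animal",
    "animal": "entity",
    "car": "vehicle",
    "vehicle": "entity",
}


def ancestors(term, hypernyms):
    """Return every ancestor of `term`, nearest first, by following child -> parent links."""
    found = []
    while term in hypernyms:
        term = hypernyms[term]
        found.append(term)
    return found


def semantic_vector(tokens, hypernyms, concepts):
    """Count how often each concept in `concepts` is an ancestor of a document token."""
    counts = Counter()
    for token in tokens:
        counts.update(ancestors(token, hypernyms))
    return [counts[concept] for concept in concepts]


def text_vector(tokens, words):
    """Plain bag-of-words counts over a fixed word list."""
    counts = Counter(tokens)
    return [counts[word] for word in words]


tokens = "the dog chased the car".split()
concepts = ["animal", "vehicle", "entity"]  # assumed concept vocabulary
words = ["dog", "cat", "car"]               # assumed word vocabulary

semantic = semantic_vector(tokens, TOY_HYPERNYMS, concepts)
text = text_vector(tokens, words)

# Simple stand-in for learning from both inputs: one concatenated feature vector.
combined = text + semantic
print(combined)
```

In this toy document, "dog" and "car" share no surface form, yet both contribute to the "entity" concept, which is the kind of generalization that taxonomy-derived features can supply on short texts.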
Funder
Javna Agencija za Raziskovalno Dejavnost RS
European Research Council
Horizon 2020
Subject
General Economics, Econometrics and Finance
Cited by
13 articles.