Text classification by untrained sentence embeddings

Authors:

Daniele Di Sarli¹, Claudio Gallicchio¹, Alessio Micheli¹

Affiliation:

1. Department of Computer Science, University of Pisa, Largo B. Pontecorvo, Pisa, Italy

Abstract

Recurrent Neural Networks (RNNs) represent a natural paradigm for modeling sequential data such as natural language text. Indeed, RNNs and their variants have long been the architecture of choice in many applications; in practice, however, they require elaborate architectural components (such as gating mechanisms) and computationally heavy training processes. In this paper we address the question of whether it is possible to generate sentence embeddings via completely untrained recurrent dynamics, on top of which a simple learning algorithm is applied for text classification. This would make it possible to obtain models that are extremely efficient in terms of training time. Our work investigates the extent to which this approach can be used by analyzing the results on different tasks. Finally, we show that, within certain limits, it is possible to build extremely efficient models for text classification that remain competitive in accuracy with state-of-the-art reference models.
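
The following is a minimal sketch of the idea summarized in the abstract, not the authors' implementation: a fixed, untrained recurrent layer (in the spirit of reservoir computing) maps each sentence to an embedding, and only a simple linear classifier is trained on top. All names, dimensions, the spectral-radius rescaling, and the use of NumPy and scikit-learn are illustrative assumptions.

```python
# Sketch: sentence embeddings from untrained recurrent dynamics,
# followed by a trainable linear readout for classification.
# Hyperparameters and toy data below are assumptions for illustration.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

embed_dim, state_dim = 50, 300  # hypothetical word-vector and state sizes

# Untrained recurrent parameters: drawn at random and never updated.
W_in = rng.uniform(-0.1, 0.1, size=(state_dim, embed_dim))
W_rec = rng.uniform(-1.0, 1.0, size=(state_dim, state_dim))
# Rescale the recurrent matrix to a spectral radius below 1,
# a common stability heuristic in reservoir computing.
W_rec *= 0.9 / np.max(np.abs(np.linalg.eigvals(W_rec)))

def sentence_embedding(word_vectors):
    """Run the untrained recurrence over a sequence of word vectors
    and return the final state as the sentence embedding."""
    h = np.zeros(state_dim)
    for x in word_vectors:
        h = np.tanh(W_in @ x + W_rec @ h)
    return h

# Toy usage: random "sentences" standing in for embedded text, plus labels.
sentences = [rng.normal(size=(rng.integers(5, 15), embed_dim)) for _ in range(40)]
labels = rng.integers(0, 2, size=40)

X = np.stack([sentence_embedding(s) for s in sentences])

# Only this simple readout is trained; the recurrent dynamics stay untrained,
# which is what makes training extremely cheap.
clf = LogisticRegression(max_iter=1000).fit(X, labels)
print("training accuracy:", clf.score(X, labels))
```

In such a scheme, the recurrent weights never receive gradient updates; training reduces to fitting the linear readout, which is where the efficiency claim in the abstract comes from.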

Publisher

IOS Press

Subject

Artificial Intelligence


Cited by 3 articles.
