Survey of Neural Text Representation Models-Reference-Cited by-同舟云学术

Survey of Neural Text Representation Models

Published:2020-10-30 Issue:11 Volume:11 Page:511
ISSN:2078-2489
Container-title:Information
language:en
Short-container-title:Information

Author:

Babić Karlo^ORCID,Martinčić-Ipšić Sanda^ORCID,Meštrović Ana^ORCID

Abstract

In natural language processing, text needs to be transformed into a machine-readable representation before any processing. The quality of further natural language processing tasks greatly depends on the quality of those representations. In this survey, we systematize and analyze 50 neural models from the last decade. The models described are grouped by the architecture of neural networks as shallow, recurrent, recursive, convolutional, and attention models. Furthermore, we categorize these models by representation level, input level, model type, and model supervision. We focus on task-independent representation models, discuss their advantages and drawbacks, and subsequently identify the promising directions for future neural text representation models. We describe the evaluation datasets and tasks used in the papers that introduced the models and compare the models based on relevant evaluations. The quality of a representation model can be evaluated as its capability to generalize to multiple unrelated tasks. Benchmark standardization is visible amongst recent models and the number of different tasks models are evaluated on is increasing.

Publisher

MDPI AG

Subject

Information Systems

Link

https://www.mdpi.com/2078-2489/11/11/511/pdf

Reference104 articles.

1. Introduction to Information Retrieval;Manning,2008

2. Neural Network Methods for Natural Language Processing

3. Language models are unsupervised multitask learners;Radford;OpenAI Blog,2019

4. Deep Learning;Goodfellow,2016

5. Sequence to Sequence Learning with Neural Networks https://papers.nips.cc/paper/5346-sequence-to-sequence-learning-with-neural-networks.pdf

Cited by 28 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Leveraging Natural Language Processing for Enhanced Text Analysis in Business Intelligence;Advances in Computational Intelligence and Robotics;2024-08-30

2. An Enhanced Topic Modeling Method in Educational Domain by Integrating LDA with Semantic;2024 26th International Conference on Advanced Communications Technology (ICACT);2024-02-04

3. Recursively Autoregressive Autoencoder for Pyramidal Text Representation;IEEE Access;2024

4. The text-package: An R-package for analyzing and visualizing human language using natural language processing and transformers.;Psychological Methods;2023-12

5. Exploring unsupervised textual representations generated by neural language models in the context of automatic tweet stream summarization;Online Social Networks and Media;2023-09