Affiliation:
1. Zhejiang University of Technology and Florida Atlantic University, Zhejiang, P.R. China
2. Florida Atlantic University, Boca Raton, FL
3. Xidian University, Shaanxi, P.R. China
Abstract
Long Short-Term Memory (LSTM) networks, a popular class of deep-learning models, are particularly effective for data with temporal correlation, such as text, sequences, or time series, thanks to their recurrent network structure designed to capture such correlation. In this article, we propose to generalize LSTM to generic machine-learning tasks in which the training data have no explicit temporal or sequential correlation. Our theme is to exploit feature correlation in the original data and convert each instance into a synthetic sentence format using a two-gram probabilistic language model. More specifically, for each instance represented in the original feature space, our conversion first horizontally aligns the original features into a sequentially correlated feature vector, resembling the coherence of letters within a word. In addition, a vertical alignment creates multiple time points and simulates the sequential order of words in a sentence (i.e., word correlation). The two-dimensional horizontal-and-vertical alignment not only ensures that feature correlations are maximally utilized, but also preserves the original feature values in the new representation. As a result, an LSTM model can achieve good classification accuracy even when the underlying data have no temporal or sequential dependency. Experiments on 20 generic datasets show that applying LSTM to generic data improves classification accuracy compared with conventional machine-learning methods. This research opens a new opportunity for LSTM deep learning to be broadly applied to generic machine-learning tasks.
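To make the conversion idea concrete, the following minimal sketch (not the paper's exact algorithm; the greedy correlation-based feature ordering, the circular-shift "time points", and the function names `horizontal_align` and `vertical_align` are all our assumptions for illustration) turns a plain feature matrix into synthetic sequences and trains a small Keras LSTM classifier on them:

```python
import numpy as np
import tensorflow as tf

def horizontal_align(X):
    """Greedily reorder features so adjacent columns are highly correlated,
    loosely mimicking the paper's horizontal alignment (assumed heuristic)."""
    corr = np.abs(np.corrcoef(X, rowvar=False))
    order, remaining = [0], set(range(1, X.shape[1]))
    while remaining:
        nxt = max(remaining, key=lambda j: corr[order[-1], j])
        order.append(nxt)
        remaining.remove(nxt)
    return X[:, order]

def vertical_align(X, time_steps=5):
    """Stack circularly shifted copies of each instance to simulate word
    order across synthetic time points; the shift scheme is an assumption."""
    # Resulting shape: (n_samples, time_steps, n_features)
    return np.stack([np.roll(X, -t, axis=1) for t in range(time_steps)], axis=1)

# Toy generic (non-sequential) data: 200 instances, 10 features, binary labels.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 10)).astype("float32")
y = (X[:, 0] + X[:, 3] > 0).astype("float32")

X_seq = vertical_align(horizontal_align(X))

model = tf.keras.Sequential([
    tf.keras.Input(shape=X_seq.shape[1:]),  # (time_steps, n_features)
    tf.keras.layers.LSTM(16),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
model.fit(X_seq, y, epochs=5, batch_size=32, verbose=0)
```

The key point the sketch demonstrates is that both alignments are value-preserving rearrangements of the original features, so the LSTM sees a sequence without any information being discarded.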
Funder
US National Science Foundation
Publisher
Association for Computing Machinery (ACM)
Cited by
9 articles.