Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting-Reference-Cited by-同舟云学术

Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting

Published:2021-05-18 Issue:12 Volume:35 Page:11106-11115
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Zhou Haoyi,Zhang Shanghang,Peng Jieqi,Zhang Shuai,Li Jianxin,Xiong Hui,Zhang Wancai

Abstract

Many real-world applications require the prediction of long sequence time-series, such as electricity consumption planning. Long sequence time-series forecasting (LSTF) demands a high prediction capacity of the model, which is the ability to capture precise long-range dependency coupling between output and input efficiently. Recent studies have shown the potential of Transformer to increase the prediction capacity. However, there are several severe issues with Transformer that prevent it from being directly applicable to LSTF, including quadratic time complexity, high memory usage, and inherent limitation of the encoder-decoder architecture. To address these issues, we design an efficient transformer-based model for LSTF, named Informer, with three distinctive characteristics: (i) a ProbSparse self-attention mechanism, which achieves O(L log L) in time complexity and memory usage, and has comparable performance on sequences' dependency alignment. (ii) the self-attention distilling highlights dominating attention by halving cascading layer input, and efficiently handles extreme long input sequences. (iii) the generative style decoder, while conceptually simple, predicts the long time-series sequences at one forward operation rather than a step-by-step way, which drastically improves the inference speed of long-sequence predictions. Extensive experiments on four large-scale datasets demonstrate that Informer significantly outperforms existing methods and provides a new solution to the LSTF problem.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 1498 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. TFformer: A time–frequency domain bidirectional sequence-level attention based transformer for interpretable long-term sequence forecasting;Pattern Recognition;2025-02

2. Multimodal fusion for large-scale traffic prediction with heterogeneous retentive networks;Information Fusion;2025-02

3. Data-driven stock forecasting models based on neural networks: A review;Information Fusion;2025-01

4. A long-term dissolved oxygen prediction model in aquaculture using transformer with a dynamic adaptive mechanism;Expert Systems with Applications;2025-01

5. Multi-task oriented team formation in online collaborative learning;Expert Systems with Applications;2025-01