A2C: Attention-Augmented Contrastive Learning for State Representation Extraction-Reference-Cited by-同舟云学术

A2C: Attention-Augmented Contrastive Learning for State Representation Extraction

Published:2020-08-26 Issue:17 Volume:10 Page:5902
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Chen Haoqiang,Liu Yadong,Zhou Zongtan^ORCID,Zhang Ming

Abstract

Reinforcement learning (RL) faces a series of challenges, including learning efficiency and generalization. The state representation used to train RL is one of the important factors causing these challenges. In this paper, we explore providing a more efficient state representation for RL. Contrastive learning is used as the representation extraction method in our work. We propose an attention mechanism implementation and extend an existing contrastive learning method by embedding the attention mechanism. Finally an attention-augmented contrastive learning method called A2C is obtained. As a result, using the state representation from A2C, the robot achieves better learning efficiency and generalization than those using state-of-the-art representations. Moreover, our attention mechanism is proven to be able to calculate the correlation of arbitrary distance among pixels, which is conducive to capturing more accurate obstacle information. What is more, we remove the attention mechanism from A2C. It is shown that the rewards available for the attention-removed A2C are reduced by more than 70%, which indicates the important role of the attention mechanism.

Funder

National Natural Science Foundation of China

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/10/17/5902/pdf

Reference51 articles.

1. Deep Reinforcement Learning: An Overview;Li;arXiv,2017

2. State representation learning for control: An overview

3. Deep Reinforcement Learning: A Brief Survey

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A Survey of Machine Learning Approaches for Mobile Robot Control;Robotics;2024-01-09

2. A Portfolio Model Based on BiLSTM Prediction and Improved PPO Algorithm;2023 2nd International Joint Conference on Information and Communication Engineering (JCICE);2023-05

3. A Portfolio Model Based on BiLSTM Prediction and Improved PPO Algorithm;2022 International Symposium on Intelligent Robotics and Systems (ISoIRS);2022-10

4. Ensemble Investment Strategies Based on Reinforcement Learning;Scientific Programming;2022-09-08

5. Dynamic Multiscale Feature Fusion Method for Underwater Target Recognition;Journal of Sensors;2022-07-21