Table-Balancing Cooperative Robot Based on Deep Reinforcement Learning-Reference-Cited by-同舟云学术

Table-Balancing Cooperative Robot Based on Deep Reinforcement Learning

Published:2023-05-31 Issue:11 Volume:23 Page:5235
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Kim Yewon¹^ORCID,Kim Dae-Won²^ORCID,Kang Bo-Yeong³^ORCID

Affiliation:

1. Department of Artificial Intelligence, Kyungpook National University, Daegu 41566, Republic of Korea

2. School of Computer Science and Engineering, Chung-Ang University, 84 Heukseok-Ro, Seoul 06974, Republic of Korea

3. Department of Robot and Smart System Engineering, Kyungpook National University, Daegu 41566, Republic of Korea

Abstract

Reinforcement learning is one of the artificial intelligence methods that enable robots to judge and operate situations on their own by learning to perform tasks. Previous reinforcement learning research has mainly focused on tasks performed by individual robots; however, everyday tasks, such as balancing tables, often require cooperation between two individuals to avoid injury when moving. In this research, we propose a deep reinforcement learning-based technique for robots to perform a table-balancing task in cooperation with a human. The cooperative robot proposed in this paper recognizes human behavior to balance the table. This recognition is achieved by utilizing the robot’s camera to take an image of the state of the table, then the table-balance action is performed afterward. Deep Q-network (DQN) is a deep reinforcement learning technology applied to cooperative robots. As a result of learning table balancing, on average, the cooperative robot showed a 90% optimal policy convergence rate in 20 runs of training with optimal hyperparameters applied to DQN-based techniques. In the H/W experiment, the trained DQN-based robot achieved an operation precision of 90%, thus verifying its excellent performance.

Funder

National Research Foundation of Korea

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/23/11/5235/pdf

Reference41 articles.

1. Mnih, V., Kavukcuoglu, K., Silver, D., Graves, A., Antonoglou, I., Wierstra, D., and Riedmiller, M. (2013). Playing atari with deep reinforcement learning. arXiv.

2. Q-learning;Watkins;Mach. Learn.,1992

3. Schulman, J., Levine, S., Abbeel, P., Jordan, M., and Moritz, P. (2015, January 6–11). Trust region policy optimization. Proceedings of the International Conference on Machine Learning, Lille, France.

4. Schulman, J., Wolski, F., Dhariwal, P., Radford, A., and Klimov, O. (2017). Proximal policy optimization algorithms. arXiv.

5. Lillicrap, T.P., Hunt, J.J., Pritzel, A., Heess, N., Erez, T., Tassa, Y., Silver, D., and Wierstra, D. (2016, January 2–4). Continuous control with deep reinforcement learning. Proceedings of the 4th International Conference on Learning Representations, ICLR 2016—Conference Track Proceedings, San Juan, Puerto Rico. Available online: http://xxx.lanl.gov/abs/1509.02971.

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Research on Trends and Key Issues in Industrial Collaborative Robots and Worker Interaction;The Journal of Korean Institute of Information Technology;2023-08-31