Abstract
In this paper, we propose a controller for a bicycle using the DDPG (Deep Deterministic Policy Gradient) algorithm, which is a state-of-the-art deep reinforcement learning algorithm. We use a reward function and a deep neural network to build the controller. By using the proposed controller, a bicycle can not only be stably balanced but also travel to any specified location. We confirm that the controller with DDPG shows better performance than the other baselines such as Normalized Advantage Function (NAF) and Proximal Policy Optimization (PPO). For the performance evaluation, we implemented the proposed algorithm in various settings such as fixed and random speed, start location, and destination location.
Funder
National Research Foundation of Korea
Subject
Physics and Astronomy (miscellaneous),General Mathematics,Chemistry (miscellaneous),Computer Science (miscellaneous)
Reference22 articles.
1. Introducing the self-driving bicycle in The Netherlandshttps://www.youtube.com/watch?v=LSZPNwZex9s
2. Linearized dynamics equations for the balance and steer of a bicycle: a benchmark and review;Meijaard,2007
3. Some recent developments in bicycle dynamics;Schwab,2007
4. Learning bicycle stunts
Cited by
19 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献