Abstract
We employ Q learning, a variant of reinforcement learning, so that an active particle learns by itself to navigate on the fastest path toward a target while experiencing external forces and flow fields. As state variables, we use the distance and direction toward the target; as its action, the active particle chooses a new orientation along which it moves with constant speed. We explicitly investigate optimal navigation in a potential barrier or well and in a uniform, Poiseuille, or swirling flow field. We show that Q learning is able to identify the fastest path and discuss the results. We also demonstrate that Q learning, and applying the learned policy, works when the particle orientation experiences thermal noise. However, a successful outcome depends strongly on the specific problem and on the noise strength.
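The setup described above can be illustrated with a minimal sketch of tabular Q learning, in which the state is a (distance bin, direction bin) pair relative to the target and the action is a new orientation. This is not the authors' implementation: the bin counts, learning rate, discount factor, exploration rate, the reward of -1 per time step, the target at the origin, and all function names are assumptions chosen for illustration only, and thermal noise is left out.

```python
import math
import random

# All constants below are illustrative assumptions, not values from the paper.
N_DIST, N_DIR, N_ACT = 8, 8, 8      # bins for distance, direction to target, new orientation
ALPHA, GAMMA, EPS = 0.1, 0.95, 0.1  # learning rate, discount factor, exploration rate
V0, DT, R_MAX, R_HIT = 1.0, 0.1, 4.0, 0.2  # speed, time step, binning radius, target radius


def encode(x, y):
    """Map a position to (distance bin, direction bin) for a target at the origin."""
    r = math.hypot(x, y)
    phi = math.atan2(-y, -x) % (2 * math.pi)  # direction from particle toward target
    d_bin = min(int(r / R_MAX * N_DIST), N_DIST - 1)
    a_bin = int(phi / (2 * math.pi) * N_DIR) % N_DIR
    return d_bin, a_bin


def step(x, y, action, flow=(0.0, 0.0)):
    """Move at constant speed V0 along the chosen orientation, advected by a uniform flow."""
    theta = 2 * math.pi * (action + 0.5) / N_ACT
    return (x + (V0 * math.cos(theta) + flow[0]) * DT,
            y + (V0 * math.sin(theta) + flow[1]) * DT)


def q_update(Q, s, a, reward, s_next, done):
    """Standard tabular Q-learning update of the entry Q[s][a]."""
    best_next = 0.0 if done else max(Q[s_next])
    Q[s][a] += ALPHA * (reward + GAMMA * best_next - Q[s][a])


def train(episodes=500, max_steps=200, seed=0):
    """Run epsilon-greedy episodes from random start positions and return the Q table."""
    rng = random.Random(seed)
    Q = {(d, a): [0.0] * N_ACT for d in range(N_DIST) for a in range(N_DIR)}
    for _ in range(episodes):
        x, y = rng.uniform(-2, 2), rng.uniform(-2, 2)
        for _ in range(max_steps):
            s = encode(x, y)
            if rng.random() < EPS:
                a = rng.randrange(N_ACT)  # explore
            else:
                a = max(range(N_ACT), key=lambda i: Q[s][i])  # exploit
            x, y = step(x, y, a)
            done = math.hypot(x, y) < R_HIT
            # A reward of -1 per step penalizes long travel times.
            q_update(Q, s, a, -1.0, encode(x, y), done)
            if done:
                break
    return Q
```

After training, a learned policy would select, in each state, the orientation with the largest Q value. An external force or a non-uniform flow field could be added by replacing the hypothetical `flow` argument of `step` with a position-dependent drift.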
Funder
Berlin University Alliance
Publisher
Springer Science and Business Media LLC
Subject
Surfaces and Interfaces, General Materials Science, General Chemistry, Biophysics, Biotechnology
Cited by
2 articles.