Neuro-Inspired Reinforcement Learning to Improve Trajectory Prediction in Reward-Guided Behavior

Author:

Chen Bo-Wei1,Yang Shih-Hung2,Kuo Chao-Hung134,Chen Jia-Wei1,Lo Yu-Chun5,Kuo Yun-Ting1,Lin Yi-Chen1,Chang Hao-Cheng1,Lin Sheng-Huang167,Yu Xiao89,Qu Boyi89,Ro Shuan-Chu Vina10,Lai Hsin-Yi8911,Chen You-Yin15

Affiliation:

1. Department of Biomedical Engineering, National Yang Ming Chiao Tung University, No.155, Sec. 2, Linong St., Taipei 11221, Taiwan

2. Department of Mechanical Engineering, National Cheng Kung University, No. 1, University Rd., Tainan 70101, Taiwan

3. Department of Neurosurgery, Neurological Institute, Taipei Veterans General Hospital, No. 201, Sec. 2, Shipai Rd., Taipei 11217, Taiwan

4. Department of Neurological Surgery, University of Washington, 1959 NE Pacific St., Seattle, WA 98195-6470, U.S.A.

5. The Ph.D. Program for Neural Regenerative Medicine, College of Medical Science and Technology, Taipei Medical University, No. 250, Wu-Xing St., Taipei 11031, Taiwan

6. Department of Neurology, Hualien Tzu Chi Hospital, Buddhist Tzu Chi Medical Foundation, No. 707, Sec. 3, Zhongyang Rd., Hualien 97002, Taiwan

7. Department of Neurology, School of Medicine, Tzu Chi University, No. 701, Sec. 3, Zhongyang Rd., Hualien 97004, Taiwan

8. Department of Neurology of the Second Affiliated, Hospital, Interdisciplinary Institute of Neuroscience and Technology, Key Laboratory of Medical Neurobiology of Zhejiang Province, Zhejiang University School of Medicine, Hangzhou 310029, P. R. China

9. College of Biomedical Engineering and Instrument, Science, Key Laboratory for Biomedical Engineering, of Ministry of Education, Zhejiang University, Hangzhou 310027, P. R. China

10. Department of Biomedical Engineering, Johns Hopkins School of Medicine, 720 Rutland Ave., Baltimore, MD 21205, U.S.A.

11. MOE Frontier Science Center for Brain Science and Brain-Machine Integration, School of Brain Science and Brain Medicine, Zhejiang University, Hangzhou 310012, China

Abstract

Hippocampal pyramidal cells and interneurons play a key role in spatial navigation. In goal-directed behavior associated with rewards, the spatial firing pattern of pyramidal cells is modulated by the animal’s moving direction toward a reward, with a dependence on auditory, olfactory, and somatosensory stimuli for head orientation. Additionally, interneurons in the CA1 region of the hippocampus monosynaptically connected to CA1 pyramidal cells are modulated by a complex set of interacting brain regions related to reward and recall. The computational method of reinforcement learning (RL) has been widely used to investigate spatial navigation, which in turn has been increasingly used to study rodent learning associated with the reward. The rewards in RL are used for discovering a desired behavior through the integration of two streams of neural activity: trial-and-error interactions with the external environment to achieve a goal, and the intrinsic motivation primarily driven by brain reward system to accelerate learning. Recognizing the potential benefit of the neural representation of this reward design for novel RL architectures, we propose a RL algorithm based on [Formula: see text]-learning with a perspective on biomimetics (neuro-inspired RL) to decode rodent movement trajectories. The reward function, inspired by the neuronal information processing uncovered in the hippocampus, combines the preferred direction of pyramidal cell firing as the extrinsic reward signal with the coupling between pyramidal cell–interneuron pairs as the intrinsic reward signal. Our experimental results demonstrate that the neuro-inspired RL, with a combined use of extrinsic and intrinsic rewards, outperforms other spatial decoding algorithms, including RL methods that use a single reward function. The new RL algorithm could help accelerate learning convergence rates and improve the prediction accuracy for moving trajectories.

Funder

the Ministry of Science and Technology of Taiwan

“Key and Novel Therapeutics Development Program for Major Diseases” project of Academia Sinica, Taiwan, R.O.C.

Publisher

World Scientific Pub Co Pte Ltd

Subject

Computer Networks and Communications,General Medicine

Cited by 4 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3