Neuro-Inspired Reinforcement Learning to Improve Trajectory Prediction in Reward-Guided Behavior-Reference-Cited by-同舟云学术

Neuro-Inspired Reinforcement Learning to Improve Trajectory Prediction in Reward-Guided Behavior

Published:2022-08-19 Issue:09 Volume:32 Page:
ISSN:0129-0657
Container-title:International Journal of Neural Systems
language:en
Short-container-title:Int. J. Neur. Syst.

Author:

Chen Bo-Wei¹,Yang Shih-Hung²,Kuo Chao-Hung¹³⁴,Chen Jia-Wei¹,Lo Yu-Chun⁵,Kuo Yun-Ting¹,Lin Yi-Chen¹,Chang Hao-Cheng¹,Lin Sheng-Huang¹⁶⁷,Yu Xiao⁸⁹,Qu Boyi⁸⁹,Ro Shuan-Chu Vina¹⁰,Lai Hsin-Yi⁸⁹¹¹,Chen You-Yin¹⁵

Affiliation:

1. Department of Biomedical Engineering, National Yang Ming Chiao Tung University, No.155, Sec. 2, Linong St., Taipei 11221, Taiwan

2. Department of Mechanical Engineering, National Cheng Kung University, No. 1, University Rd., Tainan 70101, Taiwan

3. Department of Neurosurgery, Neurological Institute, Taipei Veterans General Hospital, No. 201, Sec. 2, Shipai Rd., Taipei 11217, Taiwan

4. Department of Neurological Surgery, University of Washington, 1959 NE Pacific St., Seattle, WA 98195-6470, U.S.A.

5. The Ph.D. Program for Neural Regenerative Medicine, College of Medical Science and Technology, Taipei Medical University, No. 250, Wu-Xing St., Taipei 11031, Taiwan

6. Department of Neurology, Hualien Tzu Chi Hospital, Buddhist Tzu Chi Medical Foundation, No. 707, Sec. 3, Zhongyang Rd., Hualien 97002, Taiwan

7. Department of Neurology, School of Medicine, Tzu Chi University, No. 701, Sec. 3, Zhongyang Rd., Hualien 97004, Taiwan

8. Department of Neurology of the Second Affiliated, Hospital, Interdisciplinary Institute of Neuroscience and Technology, Key Laboratory of Medical Neurobiology of Zhejiang Province, Zhejiang University School of Medicine, Hangzhou 310029, P. R. China

9. College of Biomedical Engineering and Instrument, Science, Key Laboratory for Biomedical Engineering, of Ministry of Education, Zhejiang University, Hangzhou 310027, P. R. China

10. Department of Biomedical Engineering, Johns Hopkins School of Medicine, 720 Rutland Ave., Baltimore, MD 21205, U.S.A.

11. MOE Frontier Science Center for Brain Science and Brain-Machine Integration, School of Brain Science and Brain Medicine, Zhejiang University, Hangzhou 310012, China

Abstract

Hippocampal pyramidal cells and interneurons play a key role in spatial navigation. In goal-directed behavior associated with rewards, the spatial firing pattern of pyramidal cells is modulated by the animal’s moving direction toward a reward, with a dependence on auditory, olfactory, and somatosensory stimuli for head orientation. Additionally, interneurons in the CA1 region of the hippocampus monosynaptically connected to CA1 pyramidal cells are modulated by a complex set of interacting brain regions related to reward and recall. The computational method of reinforcement learning (RL) has been widely used to investigate spatial navigation, which in turn has been increasingly used to study rodent learning associated with the reward. The rewards in RL are used for discovering a desired behavior through the integration of two streams of neural activity: trial-and-error interactions with the external environment to achieve a goal, and the intrinsic motivation primarily driven by brain reward system to accelerate learning. Recognizing the potential benefit of the neural representation of this reward design for novel RL architectures, we propose a RL algorithm based on [Formula: see text]-learning with a perspective on biomimetics (neuro-inspired RL) to decode rodent movement trajectories. The reward function, inspired by the neuronal information processing uncovered in the hippocampus, combines the preferred direction of pyramidal cell firing as the extrinsic reward signal with the coupling between pyramidal cell–interneuron pairs as the intrinsic reward signal. Our experimental results demonstrate that the neuro-inspired RL, with a combined use of extrinsic and intrinsic rewards, outperforms other spatial decoding algorithms, including RL methods that use a single reward function. The new RL algorithm could help accelerate learning convergence rates and improve the prediction accuracy for moving trajectories.

Funder

the Ministry of Science and Technology of Taiwan

“Key and Novel Therapeutics Development Program for Major Diseases” project of Academia Sinica, Taiwan, R.O.C.

Publisher

World Scientific Pub Co Pte Ltd

Subject

Computer Networks and Communications,General Medicine

Link

https://www.worldscientific.com/doi/pdf/10.1142/S0129065722500381

Reference110 articles.

1. A new neural framework for visuospatial processing

2. The Integration of Goal-Directed Signals onto Spatial Maps of Hippocampal Place Cells

3. Auditory Spatial Processing in the Human Cortex

4. Towards explaining spatial touch perception: Weighted integration of multiple location codes

5. Advances in Experimental Medicine and Biology;Jin W.,2020

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Symbolic Interpretation of Trained Neural Network Ensembles;International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems;2024-07

2. Decoding the Brain Neuro-Inspired Deep Learning Architectures of Cognitive Computing;2024 International Conference on Knowledge Engineering and Communication Systems (ICKECS);2024-04-18

3. Improving the Effectiveness of Eigentrust in Computing the Reputation of Social Agents in Presence of Collusion;International Journal of Neural Systems;2023-10-07

4. Leveraging Neuro-Inspired Reinforcement Learning for Secure Reputation-based Communication in Connected Vehicles;2023 IEEE Conference on Communications and Network Security (CNS);2023-10-02

5. 3D reconstruction based on hierarchical reinforcement learning with transferability;Integrated Computer-Aided Engineering;2023-08-31