Sequential Classification-Based Optimization for Direct Policy Search

Published:2017-02-13 Issue:1 Volume:31
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
Short-container-title:AAAI

Author:

Hu Yi-Qi, Qian Hong, Yu Yang

Abstract

Classification-based optimization is a recently developed framework for derivative-free optimization that has been shown to be effective for non-convex optimization problems with many local optima. The framework requires sampling a batch of solutions for every update of the search model. In reinforcement learning, however, direct policy search often allows only sequential policy evaluation. Classification-based optimization is therefore inefficient for direct policy search, where solutions have to be sampled sequentially. In this paper, we adapt classification-based optimization to sequentially sampled solutions by forming the batch from reused historical solutions. Experiments on a helicopter hovering control task and on reinforcement learning benchmark tasks in OpenAI Gym show that the new algorithm is superior to state-of-the-art derivative-free optimization approaches.
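The idea in the abstract, one new solution per model update with the batch made up of reused historical solutions, can be illustrated with a small sketch. This is not the authors' algorithm: the function name `sequential_cbo`, the axis-parallel box used as the "classifier", the replace-the-worst rule, the toy objective `sphere`, and all parameter values are assumptions chosen for illustration only.

```python
import random


def sphere(x):
    """Toy objective (an assumption for illustration): minimum 0 at the origin."""
    return sum(v * v for v in x)


def sequential_cbo(f, dim, lo, hi, budget=300, pool=20, n_pos=2,
                   explore=0.05, seed=0):
    """Minimize f over [lo, hi]^dim, evaluating one solution per step.

    Hypothetical sketch: the history of evaluated solutions plays the role
    of the batch, so the search model can be refit after every evaluation.
    """
    rng = random.Random(seed)

    def uniform_point():
        return [rng.uniform(lo, hi) for _ in range(dim)]

    # Initial history of (value, solution) pairs, kept sorted best-first.
    history = []
    for _ in range(pool):
        x = uniform_point()
        history.append((f(x), x))
    history.sort(key=lambda p: p[0])

    for _ in range(budget - pool):
        # Best n_pos solutions are "positive", the rest are "negative".
        x_pos = rng.choice(history[:n_pos])[1]
        negatives = [x for _, x in history[n_pos:]]

        if rng.random() < explore:
            # Occasional global sample to keep exploring the whole space.
            x_new = uniform_point()
        else:
            # Shrink a box around x_pos until no negative lies strictly inside.
            box = [[lo, hi] for _ in range(dim)]
            remaining = [x for x in negatives if x != x_pos]
            while remaining:
                x_neg = rng.choice(remaining)
                k = rng.choice([i for i in range(dim) if x_neg[i] != x_pos[i]])
                cut = rng.uniform(x_pos[k], x_neg[k])
                if x_neg[k] > x_pos[k]:
                    box[k][1] = min(box[k][1], cut, x_neg[k])
                else:
                    box[k][0] = max(box[k][0], cut, x_neg[k])
                remaining = [
                    x for x in remaining
                    if all(a < c < b for c, (a, b) in zip(x, box))
                ]
            x_new = [rng.uniform(a, b) for a, b in box]

        # One sequential evaluation, then reuse the history: drop the worst.
        history.append((f(x_new), x_new))
        history.sort(key=lambda p: p[0])
        history.pop()

    best_value, best_x = history[0]
    return best_x, best_value


if __name__ == "__main__":
    x, value = sequential_cbo(sphere, dim=5, lo=-1.0, hi=1.0)
    print(value, x)
```

In a direct policy search setting, `f` would stand for the negated return of one policy rollout and `x` for the policy parameters, which is why only one evaluation is available per step.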

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 12 articles.

1. Interpretable classifier design by axiomatic fuzzy sets theory and derivative-free optimization;Expert Systems with Applications;2024-07

2. Optimizing OD-based up-front discounting strategies for enroute ridepooling services;Transportation Research Part B: Methodological;2024-07

3. Zero-One Attack: Degrading Closed-Loop Neural Network Control Systems using State-Time Perturbations;2024 ACM/IEEE 15th International Conference on Cyber-Physical Systems (ICCPS);2024-05-13

4. A survey on model-based reinforcement learning;Science China Information Sciences;2024-01-23

5. Scenario-Based Flexible Modeling and Scalable Falsification for Reconfigurable CPSs;Lecture Notes in Computer Science;2024