Value Iteration Networks-Reference-Cited by-同舟云学术

Value Iteration Networks

Published:2017-08 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence
language:
Short-container-title:

Author:

Tamar Aviv¹,Wu Yi¹,Thomas Garrett¹,Levine Sergey¹,Abbeel Pieter¹²

Affiliation:

1. UC Berkeley

2. OpenAI

Abstract

We introduce the value iteration network (VIN): a fully differentiable neural network with a `planning module' embedded within. VINs can learn to plan, and are suitable for predicting outcomes that involve planning-based reasoning, such as policies for reinforcement learning. Key to our approach is a novel differentiable approximation of the value-iteration algorithm, which can be represented as a convolutional neural network, and trained end-to-end using standard backpropagation.We evaluate VIN based policies on discrete and continuous path-planning domains, and on a natural-language based search task. We show that by learning an explicit planning computation, VIN policies generalize better to new, unseen domains.This paper is a significantly abridged and IJCAI audience targeted version of the original NIPS 2016 paper with the same title, available here: https://arxiv.org/abs/1602.02867

Publisher

International Joint Conferences on Artificial Intelligence Organization

Cited by 68 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. An Adaptive Pricing Framework for Real-Time AI Model Service Exchange;IEEE Transactions on Network Science and Engineering;2024-09

2. A collaborative combat decision-making method based on multi-agent deep reinforcement learning;2024 36th Chinese Control and Decision Conference (CCDC);2024-05-25

3. Model-Based Reinforcement Learning with System Identification and Fuzzy Reward;2024 Intermountain Engineering, Technology and Computing (IETC);2024-05-13

4. -Equivariant Graph Planning for Navigation;IEEE Robotics and Automation Letters;2024-04

5. Deep reinforcement learning navigation via decision transformer in autonomous driving;Frontiers in Neurorobotics;2024-03-19