Online 3D Bin Packing with Constrained Deep Reinforcement Learning-Reference-Cited by-同舟云学术

Online 3D Bin Packing with Constrained Deep Reinforcement Learning

Published:2021-05-18 Issue:1 Volume:35 Page:741-749
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Zhao Hang,She Qijin,Zhu Chenyang,Yang Yin,Xu Kai

Abstract

We solve a challenging yet practically useful variant of 3D Bin Packing Problem (3D-BPP). In our problem, the agent has limited information about the items to be packed into a single bin, and an item must be packed immediately after its arrival without buffering or readjusting. The item's placement also subjects to the constraints of order dependence and physical stability. We formulate this online 3D-BPP as a constrained Markov decision process (CMDP). To solve the problem, we propose an effective and easy-to-implement constrained deep reinforcement learning (DRL) method under the actor-critic framework. In particular, we introduce a prediction-and-projection scheme: The agent first predicts a feasibility mask for the placement actions as an auxiliary task and then uses the mask to modulate the action probabilities output by the actor during training. Such supervision and projection facilitate the agent to learn feasible policies very efficiently. Our method can be easily extended to handle lookahead items, multi-bin packing, and item re-orienting. We have conducted extensive evaluation showing that the learned policy significantly outperforms the state-of-the-art methods. A preliminary user study even suggests that our method might attain a human-level performance.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 55 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. An online dynamic dual bin packing with lookahead approach for server-to-cell assignment in computer server industry;Computers & Industrial Engineering;2024-10

2. Surrogate-Assisted Multi-Objective Optimization for Simultaneous Three-Dimensional Packing and Motion Planning Problems Using the Sequence-Triple Representation;Applied Artificial Intelligence;2024-09-05

3. A pattern-based algorithm with fuzzy logic bin selector for online bin packing problem;Expert Systems with Applications;2024-09

4. Comprehensive Review of Robotized Freight Packing;Logistics;2024-07-08

5. 3D dynamic heterogeneous robotic palletization problem;European Journal of Operational Research;2024-07