Integrating Heuristic Methods with Deep Reinforcement Learning for Online 3D Bin-Packing Optimization-Reference-Cited by-同舟云学术

Integrating Heuristic Methods with Deep Reinforcement Learning for Online 3D Bin-Packing Optimization

Published:2024-08-20 Issue:16 Volume:24 Page:5370
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Wong Ching-Chang¹^ORCID,Tsai Tai-Ting¹,Ou Can-Kun¹

Affiliation:

1. Department of Electrical and Computer Engineering, Tamkang University, New Taipei City 25137, Taiwan

Abstract

This study proposes a method named Hybrid Heuristic Proximal Policy Optimization (HHPPO) to implement online 3D bin-packing tasks. Some heuristic algorithms for bin-packing and the Proximal Policy Optimization (PPO) algorithm of deep reinforcement learning are integrated to implement this method. In the heuristic algorithms for bin-packing, an extreme point priority sorting method is proposed to sort the generated extreme points according to their waste spaces to improve space utilization. In addition, a 3D grid representation of the space status of the container is used, and some partial support constraints are proposed to increase the possibilities for stacking objects and enhance overall space utilization. In the PPO algorithm, some heuristic algorithms are integrated, and the reward function and the action space of the policy network are designed so that the proposed method can effectively complete the online 3D bin-packing task. Some experimental results illustrate that the proposed method has good results in achieving online 3D bin-packing tasks in some simulation environments. In addition, an environment with image vision is constructed to show that the proposed method indeed enables an actual robot manipulator to successfully and effectively complete the bin-packing task in a real environment.

Funder

National Science and Technology Council (NSTC) of Taiwan, R.O.C.

Publisher

MDPI AG

Link

https://www.mdpi.com/1424-8220/24/16/5370/pdf

Reference31 articles.

1. The three-dimensional bin packing problem;Martello;Oper. Res.,2000

2. A three-dimensional adaptive PSO-based packing algorithm for an IoT-based automated e-fulfillment packaging system;Li;IEEE Access,2017

3. New heuristics for one-dimensional bin-packing;Fleszar;Comput. Oper. Res.,2002

4. Castillo, O., and Melin, P. (2023). Comparative Study of Heuristics for the One-Dimensional Bin Packing Problem. Hybrid Intelligent Systems Based on Extensions of Fuzzy Logic, Neural Networks and Metaheuristics, Springer.

5. Learning practically feasible policies for online 3D bin packing;Zhao;Sci. China Inf. Sci.,2022