Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization

Published:2021-05-18 Issue:5 Volume:35 Page:3677-3687
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
Short-container-title:AAAI

Author:

Cappart Quentin,Moisan Thierry,Rousseau Louis-Martin,Prémont-Schwarz Isabeau,Cire Andre A.

Abstract

Combinatorial optimization has found applications in numerous fields, from aerospace to transportation planning and economics. The goal is to find an optimal solution among a finite set of possibilities. The well-known challenge one faces with combinatorial optimization is the state-space explosion problem: the number of possibilities grows exponentially with the problem size, which makes solving intractable for large problems. In recent years, deep reinforcement learning (DRL) has shown promise for designing good heuristics dedicated to solving NP-hard combinatorial optimization problems. However, current approaches have an important shortcoming: they provide only an approximate solution, with no systematic way to improve it or to prove optimality. In another context, constraint programming (CP) is a generic tool for solving combinatorial optimization problems. Based on a complete search procedure, it will always find the optimal solution given enough execution time. A critical design choice that makes CP non-trivial to use in practice is the branching decision, which directs how the search space is explored. In this work, we propose a general hybrid approach, based on DRL and CP, for solving combinatorial optimization problems. The core of our approach is a dynamic programming formulation that acts as a bridge between both techniques. We experimentally show that our solver is efficient at solving three challenging problems: the traveling salesman problem with time windows, the 4-moments portfolio optimization problem, and the 0-1 knapsack problem. The results show that the framework outperforms the stand-alone RL and CP solutions while being competitive with industrial solvers.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 45 articles.

1. Deep graph representation learning for influence maximization with accelerated inference;Neural Networks;2024-12

2. Efficient opportunistic maintenance strategies via pruning in parallel–series systems with economic dependence;Computers & Industrial Engineering;2024-10

3. Graph Neural Network-Based SLO-Aware Proactive Resource Autoscaling Framework for Microservices;IEEE/ACM Transactions on Networking;2024-08

4. Applicability of Neural Combinatorial Optimization: A Critical View;ACM Transactions on Evolutionary Learning and Optimization;2024-07-23

5. Deep Reinforcement Learning Enabled Multi-UAV Scheduling for Disaster Data Collection With Time-Varying Value;IEEE Transactions on Intelligent Transportation Systems;2024-07