A lightweight CNN-transformer model for learning traveling salesman problems

Author:

Jung Minseop,Lee Jaeseung,Kim JibumORCID

Abstract

AbstractSeveral studies have attempted to solve traveling salesman problems (TSPs) using various deep learning techniques. Among them, Transformer-based models show state-of-the-art performance even for large-scale Traveling Salesman Problems (TSPs). However, they are based on fully-connected attention models and suffer from large computational complexity and GPU memory usage. Our work is the first CNN-Transformer model based on a CNN embedding layer and partial self-attention for TSP. Our CNN-Transformer model is able to better learn spatial features from input data using a CNN embedding layer compared with the standard Transformer-based models. It also removes considerable redundancy in fully-connected attention models using the proposed partial self-attention. Experimental results show that the proposed CNN embedding layer and partial self-attention are very effective in improving performance and computational complexity. The proposed model exhibits the best performance in real-world datasets and outperforms other existing state-of-the-art (SOTA) Transformer-based models in various aspects. Our code is publicly available at https://github.com/cm8908/CNN_Transformer3.

Funder

National research foundation of korea

MSIT

Publisher

Springer Science and Business Media LLC

Reference46 articles.

1. Papadimitriou CH (1977) The euclidean travelling salesman problem is np-complete. Theoretical Comput Sci 4(3):237–244

2. Christofides N (1976) Worst-case analysis of a new heuristic for the travelling salesman problem. Technical Report

3. Perron L, Furnon V (2022) Or-tools. https://developers.google.com/optimization/, 2022-11-25

4. Kool W, Van Hoof H, Welling M (2018) Attention, learn to solve routing problems! arXiv:1803.08475

5. Bresson X, Laurent T (2021) The transformer network for the traveling salesman problem. arXiv:2103.03012

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3