Comprehensive Survey of Model Compression and Speed up for Vision Transformers-Reference-Cited by-同舟云学术

Comprehensive Survey of Model Compression and Speed up for Vision Transformers

Published:2024-04-04 Issue: Volume: Page:1-12
ISSN:3041-0649
Container-title:Journal of Information, Technology and Policy
language:
Short-container-title:JITP

Author:

Chen Feiyang,Luo Ziqian,Zhou Lisang,Pan Xueting,Jiang Ying

Abstract

Vision Transformers (ViT) have marked a paradigm shift in computer vision, outperforming state-of-the-art models across diverse tasks. However, their practical deployment is hampered by high computational and memory demands. This study addresses the challenge by evaluating four primary model compression techniques: quantization, low-rank approximation, knowledge distillation, and pruning. We methodically analyze and compare the efficacy of these techniques and their combinations in optimizing ViTs for resource-constrained environments. Our comprehensive experimental evaluation demonstrates that these methods facilitate a balanced compromise between model accuracy and computational efficiency, paving the way for wider application in edge computing devices.

Publisher

Global Science Publishing Pty. Lte.

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Optimizing Robotic Mobile Fulfillment Systems for Order Picking Based on Deep Reinforcement Learning;Sensors;2024-07-20

2. Enhanced Detection Classification via Clustering SVM for Various Robot Collaboration Task;2024 6th International Conference on Communications, Information System and Computer Engineering (CISCE);2024-05-10

3. Cross-Modal Domain Adaptation in Brain Disease Diagnosis: Maximum Mean Discrepancy-Based Convolutional Neural Networks;2024 6th International Conference on Communications, Information System and Computer Engineering (CISCE);2024-05-10