1. Vaswani A., Shazeer N., Parmar N., Uszkoreit J., Jones L., Gomez A.N., Kaiser L., Polosukhin I., Attention Is All You Need (2023), arXiv:1706.03762
2. Brown T.B., Mann B., Ryder N., Subbiah M., Kaplan J., Dhariwal P., Neelakantan A., Shyam P., Sastry G., Askell A. et al., Language Models are Few-Shot Learners (2020), arXiv:2005.14165
3. Touvron H., Lavril T., Izacard G., Martinet X., Lachaux M.A., Lacroix T., Rozière B., Goyal N., Hambro E., Azhar F. et al., LLaMA: Open and Efficient Foundation Language Models (2023), arXiv:2302.13971
4. Devlin J., Chang M.W., Lee K., Toutanova K., BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding (2019), arXiv:1810.04805
5. Li G., Zhao X., Wang X., Quantum Self-Attention Neural Networks for Text Classification (2023), arXiv:2205.05625