Segment Shards: Cross-Prompt Adversarial Attacks against the Segment Anything Model-Reference-Cited by-同舟云学术

Segment Shards: Cross-Prompt Adversarial Attacks against the Segment Anything Model

Published:2024-04-15 Issue:8 Volume:14 Page:3312
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Huang Shize¹²^ORCID,Fan Qianhui¹²^ORCID,Zhang Zhaoxin¹,Liu Xiaowen¹,Song Guanqun¹,Qin Jinzhe¹^ORCID

Affiliation:

1. The Key Laboratory of Road and Traffic Engineering, Ministry of Education, Tongji University, 4800 Caoan Rd., Shanghai 201804, China

2. Shanghai Key Laboratory of Rail Infrastructure Durability and System Safety, 4800 Caoan Rd., Shanghai 201804, China

Abstract

Foundation models play an increasingly pivotal role in the field of deep neural networks. Given that deep neural networks are widely used in real-world systems and are generally susceptible to adversarial attacks, securing foundation models becomes a key research issue. However, research on adversarial attacks against the Segment Anything Model (SAM), a visual foundation model, is still in its infancy. In this paper, we propose the prompt batch attack (PBA), which can effectively attack SAM, making it unable to capture valid objects or even generate fake shards. Extensive experiments were conducted to compare the adversarial attack performance among optimizing without prompts, optimizing all prompts, and optimizing batches of prompts as in PBA. Numerical results on multiple datasets show that the cross-prompt attack success rate (ASR∗) of the PBA method is 17.83% higher on average, and the attack success rate (ASR) is 20.84% higher. It is proven that PBA possesses the best attack capability as well as the highest cross-prompt transferability. Additionally, we introduce a metric to evaluate the cross-prompt transferability of adversarial attacks, effectively fostering research on cross-prompt attacks. Our work unveils the pivotal role of the batched prompts technique in cross-prompt adversarial attacks, marking an early and intriguing exploration into this area against SAM.

Funder

Natural Science Foundation of Chongqing, China

Publisher

MDPI AG

Link

https://www.mdpi.com/2076-3417/14/8/3312/pdf

Reference54 articles.

1. Bommasani, R., Hudson, D.A., Adeli, E., Altman, R., Arora, S., von Arx, S., Bernstein, M.S., Bohg, J., Bosselut, A., and Brunskill, E. (2022). On the Opportunities and Risks of Foundation Models. arXiv.

2. Devlin, J., Chang, M.W., Lee, K., and Toutanova, K. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. arXiv.

3. Brown, T.B., Mann, B., Ryder, N., Subbiah, M., Kaplan, J., Dhariwal, P., Neelakantan, A., Shyam, P., Sastry, G., and Askell, A. (2020, January 6–12). Language Models Are Few-Shot Learners. Proceedings of the 34th International Conference on Neural Information Processing Systems, Red Hook, NY, USA.

4. Radford, A., Kim, J.W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., Sastry, G., Askell, A., Mishkin, P., and Clark, J. (2021, January 18–24). Learning Transferable Visual Models From Natural Language Supervision. Proceedings of the International Conference on Machine Learning, Virtual.

5. Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., and Gelly, S. (2021, January 3–7). An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. Proceedings of the International Conference on Learning Representations, Virtual.