Author:
Pal Soham,Gupta Yash,Shukla Aditya,Kanade Aditya,Shevade Shirish,Ganapathy Vinod
Abstract
Machine learning models are increasingly being deployed in practice. Machine Learning as a Service (MLaaS) providers expose such models to queries by third-party developers through application programming interfaces (APIs). Prior work has developed model extraction attacks, in which an attacker extracts an approximation of an MLaaS model by making black-box queries to it. We design ActiveThief – a model extraction framework for deep neural networks that makes use of active learning techniques and unannotated public datasets to perform model extraction. It does not expect strong domain knowledge or access to annotated data on the part of the attacker. We demonstrate that (1) it is possible to use ActiveThief to extract deep classifiers trained on a variety of datasets from image and text domains, while querying the model with as few as 10-30% of samples from public datasets, (2) the resulting model exhibits a higher transferability success rate of adversarial examples than prior work, and (3) the attack evades detection by the state-of-the-art model extraction detection method, PRADA.
Publisher
Association for the Advancement of Artificial Intelligence (AAAI)
Cited by
35 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. STMS: An Out-Of-Distribution Model Stealing Method Based on Causality;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30
2. Poisoning-Free Defense Against Black-Box Model Extraction;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14
3. Model extraction via active learning by fusing prior and posterior knowledge from unlabeled data;Journal of Intelligent & Fuzzy Systems;2024-03-19
4. Army of Thieves: Enhancing Black-Box Model Extraction via Ensemble based sample selection;2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV);2024-01-03
5. AugSteal: Advancing Model Steal With Data Augmentation in Active Learning Frameworks;IEEE Transactions on Information Forensics and Security;2024