Model extraction via active learning by fusing prior and posterior knowledge from unlabeled data-Reference-Cited by-同舟云学术

Model extraction via active learning by fusing prior and posterior knowledge from unlabeled data

Published:2024-03-19 Issue: Volume: Page:1-16
ISSN:1064-1246
Container-title:Journal of Intelligent & Fuzzy Systems
language:
Short-container-title:IFS

Author:

Gao Lijun¹,Liu Kai¹,Liu Wenjun¹,Wu Jiehong¹,Jin Xiao¹

Affiliation:

1. Department of Computer Science, Shenyang Aerospace University, Shenyang, Liaoning, China

Abstract

As machine learning models become increasingly integrated into practical applications and are made accessible via public APIs, the risk of model extraction attacks has gained prominence. This study presents an innovative and efficient approach to model extraction attacks, aimed at reducing query costs and enhancing attack effectiveness. The method begins by leveraging a pre-trained model to identify high-confidence samples from unlabeled datasets. It then employs unsupervised contrastive learning to thoroughly dissect the structural nuances of these samples, constructing a dataset of high quality that precisely mirrors a variety of features. A mixed information confidence strategy is employed to refine the query set, effectively probing the decision boundaries of the target model. By integrating consistency regularization and pseudo-labeling techniques, reliance on authentic labels is minimized, thus improving the feature extraction capabilities and predictive precision of the surrogate models. Evaluation on four major datasets reveals that the models crafted through this method bear a close functional resemblance to the original models, with a real-world API test success rate of 62.35%, which vouches for the method’s validity.

Publisher

IOS Press

Reference18 articles.

1. Black-box ripper: Copying black-box models using generative evolutionary algorithms;Barbalau;Advances in Neural Information Processing Systems,2020

2. Active learning with statistical models;Cohn;Journal of Artificial Intelligence Research,1996

3. Stealing machine learning models via prediction {APIs}, in;Tramèr;25th USENIX security symposium (USENIX Security 16),2016

4. Decoding clinical biomarker space of COVID-19: Exploring matrix factorization-based feature selection methods;Saberi-Movahed;Computers in Biology and Medicine,2022

5. Apmsa: Adversarial perturbation against model stealing attacks;Zhang;IEEE Transactions on Information Forensics and Security,2023