Model extraction via active learning by fusing prior and posterior knowledge from unlabeled data

Author:

Gao Lijun1,Liu Kai1,Liu Wenjun1,Wu Jiehong1,Jin Xiao1

Affiliation:

1. Department of Computer Science, Shenyang Aerospace University, Shenyang, Liaoning, China

Abstract

As machine learning models become increasingly integrated into practical applications and are made accessible via public APIs, the risk of model extraction attacks has gained prominence. This study presents an innovative and efficient approach to model extraction attacks, aimed at reducing query costs and enhancing attack effectiveness. The method begins by leveraging a pre-trained model to identify high-confidence samples from unlabeled datasets. It then employs unsupervised contrastive learning to thoroughly dissect the structural nuances of these samples, constructing a dataset of high quality that precisely mirrors a variety of features. A mixed information confidence strategy is employed to refine the query set, effectively probing the decision boundaries of the target model. By integrating consistency regularization and pseudo-labeling techniques, reliance on authentic labels is minimized, thus improving the feature extraction capabilities and predictive precision of the surrogate models. Evaluation on four major datasets reveals that the models crafted through this method bear a close functional resemblance to the original models, with a real-world API test success rate of 62.35%, which vouches for the method’s validity.

Publisher

IOS Press

Reference18 articles.

1. Black-box ripper: Copying black-box models using generative evolutionary algorithms;Barbalau;Advances in Neural Information Processing Systems,2020

2. Active learning with statistical models;Cohn;Journal of Artificial Intelligence Research,1996

3. Stealing machine learning models via prediction {APIs}, in;Tramèr;25th USENIX security symposium (USENIX Security 16),2016

4. Decoding clinical biomarker space of COVID-19: Exploring matrix factorization-based feature selection methods;Saberi-Movahed;Computers in Biology and Medicine,2022

5. Apmsa: Adversarial perturbation against model stealing attacks;Zhang;IEEE Transactions on Information Forensics and Security,2023

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3