A Post-Training Framework for Improving the Performance of Deep Learning Models via Model Transformation

Published:2023-10-23
ISSN:1049-331X
Container-title:ACM Transactions on Software Engineering and Methodology
language:en
Short-container-title:ACM Trans. Softw. Eng. Methodol.

Author:

Jiang Jiajun¹, Yang Junjie¹, Zhang Yingyi¹, Wang Zan¹, You Hanmo¹, Chen Junjie¹

Affiliation:

1. Tianjin University College of Intelligence and Computing, China

Abstract

Deep learning (DL) techniques have attracted much attention in recent years and have been applied in many scenarios. Many approaches have been proposed to improve DL models with respect to different properties, such as robustness and fairness, so that the models meet the requirements of practical use. Among existing approaches, post-training is widely adopted in practice because of its high efficiency and good performance. Nevertheless, its performance is still limited by the incompleteness of training data. Additionally, existing approaches are typically designed for a specific task, such as improving model robustness, and cannot be used for other purposes. In this paper, we aim to fill this gap and propose an effective and general post-training framework that can be adapted to improve model performance in different aspects. Specifically, it incorporates a novel model transformation technique that transforms a classification model into an isomorphic regression model for fine-tuning. This transformation can effectively overcome the problem of incomplete training data by forcing the model to strengthen its memory of crucial input features, and thus improves model performance. To evaluate our framework, we adapted it to two emerging tasks for improving DL models, i.e., robustness and fairness improvement, and conducted extensive studies comparing it with state-of-the-art approaches. The experimental results demonstrate that our framework is indeed general, as it is effective in both tasks. Specifically, in the task of robustness improvement, our approach Dare achieved the best results in 61.1% of cases (vs. 11.1% of cases achieved by baselines). In the task of fairness improvement, our approach FMT effectively improves fairness without sacrificing the accuracy of the models.

Publisher

Association for Computing Machinery (ACM)

Subject

Software

Link

https://dl.acm.org/doi/pdf/10.1145/3630011

Cited by 1 article.

1. A Large-Scale Empirical Study on Improving the Fairness of Image Classification Models;Proceedings of the 33rd ACM SIGSOFT International Symposium on Software Testing and Analysis;2024-09-11