Demo2Test: Transfer Testing of Agent in Competitive Environment with Failure Demonstrations-Reference-Cited by-同舟云学术

Demo2Test: Transfer Testing of Agent in Competitive Environment with Failure Demonstrations

Published:2024-09-13 Issue: Volume: Page:
ISSN:1049-331X
Container-title:ACM Transactions on Software Engineering and Methodology
language:en
Short-container-title:ACM Trans. Softw. Eng. Methodol.

Author:

Chen Jianming¹^ORCID,Wang Yawen¹^ORCID,Wang Junjie¹^ORCID,Xie Xiaofei²^ORCID,Wang Dandan¹^ORCID,Wang Qing¹^ORCID,Xu Fanjiang¹^ORCID

Affiliation:

1. Institute of Software, Chinese Academy of Sciences, China

2. Singapore Management University, Singapore

Abstract

The competitive game between agents exists in many critical applications, such as military unmanned aerial vehicles. It is urgent to test these agents to reduce the significant losses caused by their failures. Existing studies mainly are to construct a testing agent that competes with the target agent to induce its failures. These approaches usually focus on a single task, requiring much more time for multi-task testing. However, if the previously tested tasks (source tasks) and the task to be tested (target task) share similar agents or task objectives, the transferable knowledge in source tasks can potentially increase the effectiveness of testing in the target task. We propose Demo2Test for conducting transfer testing of agents in the competitive environment, i.e., leveraging the demonstrations of failure scenarios from the source task to boost the testing effectiveness in the target task. It trains a testing agent with demonstrations and incorporates the action perturbation at key states to balance the number of revealed failures and their diversity. We conduct experiments in the simulated robotics competitive environments of MuJoCo. The results indicate that Demo2Test outperforms the best-performing baseline with improvements ranging from 22.38% to 87.98%, and 12.69% to 60.98%, in terms of the number and diversity of discovered failure scenarios, respectively.

Publisher

Association for Computing Machinery (ACM)

Link

https://dl.acm.org/doi/pdf/10.1145/3696001

Reference75 articles.

1. Transfer deep learning approach for detecting coronavirus disease in X-ray images;Al-Smadi Mohammed;International Journal of Electrical and Computer Engineering,2021

2. Testing, Validation, and Verification of Robotic and Autonomous Systems;Araujo Hugo;A Systematic Review. ACM Trans. Softw. Eng. Methodol. (TOSEM),2023

3. Trapit Bansal, Jakub Pachocki, Szymon Sidor, Ilya Sutskever, and Igor Mordatch. 2018. Emergent Complexity via Multi-Agent Competition. arXiv preprint arXiv:1710.03748 (2018).

4. Vahid Behzadan and Arslan Munir. 2017. Vulnerability of Deep Reinforcement Learning to Policy Induction Attacks. In Machine Learning and Data Mining in Pattern Recognition - 13th International Conference, MLDM 2017, New York, NY, USA, July 15-20, 2017, Proceedings (Lecture Notes in Computer Science, Vol. 10358). Springer, 262–275.

5. Lukas Berglund, Tim Grube, Gregory Gay, Francisco Gomes de Oliveira Neto, and Dimitrios Platis. 2023. Test Maintenance for Machine Learning Systems: A Case Study in the Automotive Industry. In 2023 IEEE Conference on Software Testing, Verification and Validation (ICST). 410–421.