Testing the Plasticity of Reinforcement Learning-based Systems

Published: 2022-07-12 Issue: 4 Volume: 31 Page: 1-46
ISSN: 1049-331X
Container-title: ACM Transactions on Software Engineering and Methodology
Language: en
Short-container-title: ACM Trans. Softw. Eng. Methodol.

Author:

Biagiola Matteo¹, Tonella Paolo¹

Affiliation:

1. Università della Svizzera italiana, Lugano, Switzerland

Abstract

The dataset available for pre-release training of a machine learning-based system is often not representative of all the execution contexts that the system will encounter in the field. Reinforcement Learning (RL) is a prominent approach among those that support continual learning, i.e., learning in the field during the post-release phase. No study has so far investigated a method to test the plasticity of RL-based systems, i.e., their capability to adapt to an execution context that may deviate from the training one. We propose an approach to test the plasticity of RL-based systems. The output of our approach is a quantification of the adaptation and anti-regression capabilities of the system, obtained by computing the adaptation frontier of the system in a changed environment. We visualize this frontier as an adaptation/anti-regression heatmap in two dimensions, or as a clustered projection when more than two dimensions are involved. In this way, we provide developers with information on the amount of change that the continual learning component of the system can accommodate, which is key to deciding whether online, in-the-field learning can be safely enabled.

Funder

H2020 project PRECRIME

ERC Advanced Grant 2017 Program

Publisher

Association for Computing Machinery (ACM)

Subject

Software

Link

https://dl.acm.org/doi/pdf/10.1145/3511701

References: 90 articles.

1. Testing advanced driver assistance systems using multi-objective search and neural networks

2. Testing autonomous cars for feature interaction failures using many-objective search

3. Spinning up in deep reinforcement learning; Achiam Joshua; Website, 2018

4. A Hitchhiker's guide to statistical tests for assessing randomized algorithms in software engineering

5. Neuronlike adaptive elements that can solve difficult learning control problems

Cited by 10 articles.

1. Boundary State Generation for Testing and Improvement of Autonomous Driving Systems; IEEE Transactions on Software Engineering; 2024-08

2. MarMot: Metamorphic Runtime Monitoring of Autonomous Driving Systems; ACM Transactions on Software Engineering and Methodology; 2024-07-15

3. Test Input Prioritization for 3D Point Clouds; ACM Transactions on Software Engineering and Methodology; 2024-06-04

4. Towards Building AI-CPS with NVIDIA Isaac Sim: An Industrial Benchmark and Case Study for Robotics Manipulation; Proceedings of the 46th International Conference on Software Engineering: Software Engineering in Practice; 2024-04-14

5. Testing of Deep Reinforcement Learning Agents with Surrogate Models; ACM Transactions on Software Engineering and Methodology; 2023-11-11