The quantum cartpole: A benchmark environment for non-linear reinforcement learning-Reference-Cited by-同舟云学术

The quantum cartpole: A benchmark environment for non-linear reinforcement learning

Published:2024-05-07 Issue:2 Volume:7 Page:
ISSN:2666-9366
Container-title:SciPost Physics Core
language:
Short-container-title:SciPost Phys. Core

Author:

Meinerz Kai¹^ORCID,Trebst Simon¹,Rudner Mark²,van Nieuwenburg Evert³

Affiliation:

1. University of Cologne

2. University of Washington

3. Lorentz Institute

Abstract

Feedback-based control is the de-facto standard when it comes to controlling classical stochastic systems and processes. However, standard feedback-based control methods are challenged by quantum systems due to measurement induced backaction and partial observability. Here we remedy this by using weak quantum measurements and model-free reinforcement learning agents to perform quantum control. By comparing control algorithms with and without state estimators to stabilize a quantum particle in an unstable state near a local potential energy maximum, we show how a trade-off between state estimation and controllability arises. For the scenario where the classical analogue is highly nonlinear, the reinforcement learned controller has an advantage over the standard controller. Additionally, we demonstrate the feasibility of using transfer learning to develop a quantum control agent trained via reinforcement learning on a classical surrogate of the quantum control problem. Finally, we present results showing how the reinforcement learning control strategy differs from the classical controller in the non-linear scenarios.

Funder

Deutsche Forschungsgemeinschaft

Publisher

Stichting SciPost

Link

https://scipost.org/10.21468/SciPostPhysCore.7.2.026/pdf

Reference31 articles.

1. Optimal control of quantum-mechanical systems: Existence, numerical approximation, and applications

2. Optimal Control Technique for Many-Body Quantum Dynamics

3. Quantum optimal control theory

4. Feedback control of quantum systems using continuous state estimation

5. Quantum simulation