Toward on-sky adaptive optics control using reinforcement learning-Reference-Cited by-同舟云学术

Toward on-sky adaptive optics control using reinforcement learning

Published:2022-08 Issue: Volume:664 Page:A71
ISSN:0004-6361
Container-title:Astronomy & Astrophysics
language:
Short-container-title:A&A

Author:

Nousiainen J.,Rajani C.,Kasper M.,Helin T.,Haffert S. Y.,Vérinaud C.,Males J. R.,Van Gorkom K.,Close L. M.,Long J. D.,Hedglen A. D.,Guyon O.,Schatz L.,Kautz M.,Lumbres J.,Rodack A.,Knight J. M.,Miller K.

Abstract

Context. The direct imaging of potentially habitable exoplanets is one prime science case for the next generation of high contrast imaging instruments on ground-based, extremely large telescopes. To reach this demanding science goal, the instruments are equipped with eXtreme Adaptive Optics (XAO) systems which will control thousands of actuators at a framerate of kilohertz to several kilohertz. Most of the habitable exoplanets are located at small angular separations from their host stars, where the current control laws of XAO systems leave strong residuals. Aims. Current AO control strategies such as static matrix-based wavefront reconstruction and integrator control suffer from a temporal delay error and are sensitive to mis-registration, that is, to dynamic variations of the control system geometry. We aim to produce control methods that cope with these limitations, provide a significantly improved AO correction, and, therefore, reduce the residual flux in the coronagraphic point spread function (PSF). Methods. We extend previous work in reinforcement learning for AO. The improved method, called the Policy Optimization for Adaptive Optics (PO4AO), learns a dynamics model and optimizes a control neural network, called a policy. We introduce the method and study it through numerical simulations of XAO with Pyramid wavefront sensor (PWFS) for the 8-m and 40-m telescope aperture cases. We further implemented PO4AO and carried out experiments in a laboratory environment using Magellan Adaptive Optics eXtreme system (MagAO-X) at the Steward laboratory. Results. PO4AO provides the desired performance by improving the coronagraphic contrast in numerical simulations by factors of 3–5 within the control region of deformable mirror and PWFS, both in simulation and in the laboratory. The presented method is also quick to train, that is, on timescales of typically 5–10 s, and the inference time is sufficiently small (<ms) to be used in real-time control for XAO with currently available hardware even for extremely large telescopes.

Publisher

EDP Sciences

Subject

Space and Planetary Science,Astronomy and Astrophysics

Link

https://www.aanda.org/10.1051/0004-6361/202243311/pdf

Reference76 articles.

1. Origin of the asymmetry of the wind driven halo observed in high-contrast images

2. Fundamental limitations on Earth-like planet detection with extremely large telescopes

3. Pyramid wavefront sensor optical gains compensation using a convolutional model

4. Chua K., Calandra R., McAllister R., & Levine S. 2018, in Advances in Neural Information Processing Systems, 4754

5. Conan J.-M., Raynaud H.A.R., Kulcsár C., Meimon S., & Sivo G. 2011, in Adaptive Optics for Extremely Large Telescopes (Singapore: World Scientific)

Cited by 13 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Image-based wavefront correction using model-free reinforcement learning;Optics Express;2024-08-13

2. Power of prediction: spatiotemporal Gaussian process modeling for predictive control in slope-based wavefront sensing;Journal of Astronomical Telescopes, Instruments, and Systems;2024-07-12

3. Reinforcement learning-trained optimisers and Bayesian optimisation for online particle accelerator tuning;Scientific Reports;2024-07-08

4. Reinforcement learning;Astronomy and Computing;2024-07

5. Making the unmodulated Pyramid wavefront sensor smart;Astronomy & Astrophysics;2024-04