Parameterized reinforcement learning for optical system optimization-Reference-Cited by-同舟云学术

Parameterized reinforcement learning for optical system optimization

Published:2021-05-18 Issue:30 Volume:54 Page:305104
ISSN:0022-3727
Container-title:Journal of Physics D: Applied Physics
language:
Short-container-title:J. Phys. D: Appl. Phys.

Author:

Wankerl Heribert^ORCID,Stern Maike L^ORCID,Mahdavi Ali,Eichler Christoph,Lang Elmar W^ORCID

Abstract

Abstract Engineering a physical system to feature designated characteristics states an inverse design problem, which is often determined by several discrete and continuous parameters. If such a system must feature a particular behavior, the mentioned combination of both, discrete and continuous, parameters results in a challenging optimization problem that requires an extensive search for an optimal system design. However, if the corresponding inverse design problem can be reformulated as a parameterized Markov decision process, reinforcement learning (RL) provides a heuristic framework to solve it. In this work, we use multi-layer thin films as an example of the aforementioned optimization problems and consider three design parameters: Each of the thin film layer’s dielectric material (discrete) and thickness (continuous), as well as the total number of layers (discrete). While recent methods merely determine the optimal thicknesses and—less commonly—the layers’ materials, our approach optimizes the total number of stacked layers as well. In summary, we further develop a Q-learning variant to solve inverse design optimization and thereby outperform human experts and current approaches like needle-point optimization or naive RL. For this purpose, we propose an exponentially transformed reward signal that eases policy search and enables constrained optimization. Moreover, the learned Q-values contain information about the optical properties of multi-layer thin films, which allows us a physical interpretation or what-if analysis and thus enables explainability.

Publisher

IOP Publishing

Subject

Surfaces, Coatings and Films,Acoustics and Ultrasonics,Condensed Matter Physics,Electronic, Optical and Magnetic Materials

Link

https://iopscience.iop.org/article/10.1088/1361-6463/abfddb/pdf

Reference72 articles.

1. Numerical methods for the design of gradient-index optical coatings;Anzengruber;Appl. Opt.,2012

2. Machine learning enables design of on-chip integrated silicon t-junctions with footprint of 1.2 micrometer × 1.2 micrometer;Banerji;Nano Commun. Netw.,2020

3. Ultra-compact integrated photonic devices enabled by machine learning and digital metamaterials;Banerji;OSA Continuum,2021

4. Design and realization of advanced multi-index systems;Becker;Appl. Opt.,2014

Cited by 17 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Evaluation of action spaces for reinforcement learning in optical design;Machine Learning in Photonics;2024-06-18

2. Intelligent computational techniques for physical object properties discovery, detection, and prediction: A comprehensive survey;Computer Science Review;2024-02

3. Automated Design in Hybrid Action Spaces by Reinforcement Learning and Differential Evolution;Lecture Notes in Computer Science;2024

4. Automated Design of Broadband Folded-Waveguide Slow-Wave Structures for Traveling-Wave Tubes via Deep Reinforcement Learning;IEEE Transactions on Electron Devices;2023-07

5. Improved Consistency in Price Negotiation Dialogue System Using Parameterized Action Space with Generative Adversarial Imitation Learning;2023 6th International Conference on Information and Computer Technologies (ICICT);2023-03