Proximal Policy Optimization for Radiation Source Search-Reference-Cited by-同舟云学术

Proximal Policy Optimization for Radiation Source Search

Published:2021-09-30 Issue:4 Volume:2 Page:368-397
ISSN:2673-4362
Container-title:Journal of Nuclear Engineering
language:en
Short-container-title:JNE

Author:

Proctor Philippe^ORCID,Teuscher Christof^ORCID,Hecht Adam,Osiński Marek

Abstract

Rapid search and localization for nuclear sources can be an important aspect in preventing human harm from illicit material in dirty bombs or from contamination. In the case of a single mobile radiation detector, there are numerous challenges to overcome such as weak source intensity, multiple sources, background radiation, and the presence of obstructions, i.e., a non-convex environment. In this work, we investigate the sequential decision making capability of deep reinforcement learning in the nuclear source search context. A novel neural network architecture (RAD-A2C) based on the advantage actor critic (A2C) framework and a particle filter gated recurrent unit for localization is proposed. Performance is studied in a randomized 20×20 m convex and non-convex simulation environment across a range of signal-to-noise ratio (SNR)s for a single detector and single source. RAD-A2C performance is compared to both an information-driven controller that uses a bootstrap particle filter and to a gradient search (GS) algorithm. We find that the RAD-A2C has comparable performance to the information-driven controller across SNR in a convex environment. The RAD-A2C far outperforms the GS algorithm in the non-convex environment with greater than 95% median completion rate for up to seven obstructions.

Funder

Defense Threat Reduction Agency

Publisher

MDPI AG

Link

https://www.mdpi.com/2673-4362/2/4/29/pdf

Reference45 articles.

1. International energy outlook;Sieminski;Energy Inf. Adm. (EIA),2014

2. Sources and Effects of Ionizing Radiation,2008

3. Emergency response to the nuclear accident at the Fukushima Daiichi Nuclear Power Plants using mobile rescue robots

4. Radiation Detection and Measurement;Knoll,2010

5. What Makes a Neural Code Convex?

Cited by 11 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. IMUPF-BIN: A new autonomous search method for radioactive sources;Progress in Nuclear Energy;2024-08

2. Intelligent Scheduling Technology of Swarm Intelligence Algorithm for Drone Path Planning;Drones;2024-03-26

3. UAV Detection Using Reinforcement Learning;Sensors;2024-03-14

4. Adaptive Target Localization Under Uncertainty Using Multi-Agent Deep Reinforcement Learning with Knowledge Transfer;2024

5. Autonomous exploration for radioactive sources localization based on radiation field reconstruction;Nuclear Engineering and Technology;2023-11