Abstract
This work introduces a sample-efficient algorithm to optimize the control parameters of an aim point controller for solar power tower plants. Optimizing the control parameters increases the performance of the aim point controller, and thus the efficiency of the plant. However, optimizing the parameters in simulation will not yield the true optimal parameters at the real plant due to mismatches between simulation and reality. Thus, optimization must be done at the real tower to find a true optimum. As this can be time consuming and costly, the optimizer should require a minimum number of steps. Hence, a sample-efficient optimization strategy is needed. This work introduces a new algorithm based on Bayesian Optimization (BO), which leverages multiple sets of simulation data to accelerate the optimization. The algorithm is tested on a six-dimensional test function representing an arbitrary aim point controller. The proposed algorithm outperformed standard Bayesian Optimization by reaching near optimal parameter configurations of 95% accuracy within 33% less optimization steps. In a second test, the proposed algorithm is used to optimize a simulated Vant-Hull aim point controller with two hyperparameters. Here, the algorithm also needs 33% less optimization iterations than the standard BO.
Funder
Bundesministerium für Wirtschaft und Energie
Reference14 articles.
1. K. Chatzilygeroudis, V. Vassiliades, F. Stulp, S. Calinon, and J.-B. Mouret, “A survey on policy search algorithms for learning robot controllers in a handful of trials,” Jul. 2018. [Online]. Available: https://arxiv.org/pdf/1807.02303
2. C. E. Rasmussen and C. K. I. Williams, Gaussian processes for machine learning. Cambridge Mass.: MIT Press, 2006.
3. G. de Ath, J. E. Fieldsend, and R. M. Everson, “What do you mean?,” 2020. [Online]. Available: https://arxiv.org/pdf/2004.08349
4. A. Cully, J. Clune, D. Tarapore, and J.-B. Mouret, “Robots that can adapt like animals,” 7553, Jul. 2014. [Online]. Available: https://arxiv.org/pdf/1407.3501
5. Aaron Wilson, Alan Fern, and Prasad Tadepalli, “Using Trajectory Data to Improve Bayesian Optimization for Reinforcement Learning,” Journal of Machine Learning Research, vol. 15, no. 8, pp. 253–282, 2014.