Global optimization based on active preference learning with radial basis functions-Reference-Cited by-同舟云学术

Global optimization based on active preference learning with radial basis functions

Published:2020-12-28 Issue:2 Volume:110 Page:417-448
ISSN:0885-6125
Container-title:Machine Learning
language:en
Short-container-title:Mach Learn

Author:

Bemporad Alberto^ORCID,Piga Dario

Abstract

AbstractThis paper proposes a method for solving optimization problems in which the decision-maker cannot evaluate the objective function, but rather can only express apreferencesuch as “this is better than that” between two candidate decision vectors. The algorithm described in this paper aims at reaching the global optimizer by iteratively proposing the decision maker a new comparison to make, based on actively learning a surrogate of the latent (unknown and perhaps unquantifiable) objective function from past sampled decision vectors and pairwise preferences. A radial-basis function surrogate is fit via linear or quadratic programming, satisfying if possible the preferences expressed by the decision maker on existing samples. The surrogate is used to propose a new sample of the decision vector for comparison with the current best candidate based on two possible criteria: minimize a combination of the surrogate and an inverse weighting distance function to balance between exploitation of the surrogate and exploration of the decision space, or maximize a function related to the probability that the new candidate will be preferred. Compared to active preference learning based on Bayesian optimization, we show that our approach is competitive in that, within the same number of comparisons, it usually approaches the global optimum more closely and is computationally lighter. Applications of the proposed algorithm to solve a set of benchmark global optimization problems, for multi-objective optimization, and for optimal tuning of a cost-sensitive neural network classifier for object recognition from images are described in the paper. MATLAB and a Python implementations of the algorithms described in the paper are available athttp://cse.lab.imtlucca.it/~bemporad/glis.

Funder

Scuola IMT Alti Studi Lucca

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence,Software

Link

http://link.springer.com/content/pdf/10.1007/s10994-020-05935-y.pdf

Reference67 articles.

1. Abdolshah, M., Shilton, A., Rana, S., Gupta, S., & Venkatesh, S. (2019). Multi-objective Bayesian optimisation with preferences over objectives. arXiv:190204228.

2. Akrour, R., Schoenauer, M., & Sebag, M. (2012). April: Active preference learning-based reinforcement learning. In Joint European conference on machine learning and knowledge discovery in databases (pp. 116–131). Springer

3. Akrour, R., Schoenauer, M., Sebag, M., & Souplet, J. C. (2014). Programming by feedback. International Conference on Machine Learning, 32, 1503–1511.

4. Bemporad, A. (2020). Global optimization via inverse distance weighting and radial basis functions. Computational Optimization and Applications (In press). https://arxiv.org/pdf/1906.06498.pdf.

5. Brochu, E., de Freitas, N., & Ghosh, A. (2008). Active preference learning with discrete choice data. In Advances in neural information processing systems (pp. 409–416).

Cited by 31 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A framework for human–robot collaboration enhanced by preference learning and ergonomics;Robotics and Computer-Integrated Manufacturing;2024-10

2. F-Chain: personalized overall survival prediction based on incremental adaptive indicators and multi-source clinical records;Memetic Computing;2024-07-16

3. APDF: An active preference-based deep forest expert system for overall survival prediction in gastric cancer;Expert Systems with Applications;2024-07

4. A Personalizable Controller for the Walking Assistive omNi-Directional Exo-Robot (WANDER);2024 IEEE International Conference on Robotics and Automation (ICRA);2024-05-13

5. Feedback Controller Optimization for Mechatronic Systems with Unexpected Plant Perturbations Using Support Vector Machine;IEEJ Journal of Industry Applications;2024-05-01