Publisher
Springer Nature Switzerland
References (29 articles)
1. Akrour, R., Schoenauer, M., Sebag, M.: APRIL: active preference learning-based reinforcement learning. In: Flach, P.A., De Bie, T., Cristianini, N. (eds.) Machine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2012, Bristol, UK, 24–28 September 2012, Proceedings, Part II, pp. 116–131. Springer, Cham (2012). https://doi.org/10.1007/978-3-642-33486-3_8
2. Bradley, R.A., Terry, M.E.: Rank analysis of incomplete block designs: I. The method of paired comparisons. Biometrika 39(3/4), 324–345 (1952)
3. Cao, Z., Wong, K., Lin, C.T.: Weak human preference supervision for deep reinforcement learning. IEEE Trans. Neural Netw. Learn. Syst. 32(12), 5369–5378 (2021)
4. Christiano, P.F., Leike, J., Brown, T., Martic, M., Legg, S., Amodei, D.: Deep reinforcement learning from human preferences. In: Advances in Neural Information Processing Systems, vol. 30 (2017)
5. Clabaugh, C., Matarić, M.: Robots for the people, by the people: personalizing human-machine interaction. Sci. Robot. 3(21), eaat7451 (2018)