Learning and decision-making in artificial animals-Reference-Cited by-同舟云学术

Learning and decision-making in artificial animals

Published:2018-07-01 Issue:1 Volume:9 Page:55-82
ISSN:1946-0163
Container-title:Journal of Artificial General Intelligence
language:en
Short-container-title:

Author:

Strannegård Claes¹,Svangård Nils²,Lindström David²,Bach Joscha³,Steunebrink Bas⁴

Affiliation:

1. Department of Computer Science and Engineering, Chalmers University of Technology, Gothenburg , Sweden

2. Department of Applied Information Technology, University of Gothenburg, Gothenburg , Sweden

3. Evolutionary Dynamics, Harvard University, Cambridge , USA

4. NNAISENSE, Lugano , Switzerland

Abstract

Abstract A computational model for artificial animals (animats) interacting with real or artificial ecosystems is presented. All animats use the same mechanisms for learning and decisionmaking. Each animat has its own set of needs and its own memory structure that undergoes continuous development and constitutes the basis for decision-making. The decision-making mechanism aims at keeping the needs of the animat as satisfied as possible for as long as possible. Reward and punishment are defined in terms of changes to the level of need satisfaction. The learning mechanisms are driven by prediction error relating to reward and punishment and are of two kinds: multi-objective local Q-learning and structural learning that alter the architecture of the memory structures by adding and removing nodes. The animat model has the following key properties: (1) autonomy: it operates in a fully automatic fashion, without any need for interaction with human engineers. In particular, it does not depend on human engineers to provide goals, tasks, or seed knowledge. Still, it can operate either with or without human interaction; (2) generality: it uses the same learning and decision-making mechanisms in all environments, e.g. desert environments and forest environments and for all animats, e.g. frog animats and bee animats; and (3) adequacy: it is able to learn basic forms of animal skills such as eating, drinking, locomotion, and navigation. Eight experiments are presented. The results obtained indicate that (i) dynamic memory structures are strictly more powerful than static; (ii) it is possible to use a fixed generic design to model basic cognitive processes of a wide range of animals and environments; and (iii) the animat framework enables a uniform and gradual approach to AGI, by successively taking on more challenging problems in the form of broader and more complex classes of environments

Publisher

Walter de Gruyter GmbH

Link

https://www.sciendo.com/pdf/10.2478/jagi-2018-0002

Reference48 articles.

1. Adams, S. S., and Burbeck, S. 2012. Beyond the Octopus: From General Intelligence toward a Human-like Mind. In Theoretical Foundations of Artificial General Intelligence. Springer. 49-65.10.2991/978-94-91216-62-6_4

2. Avila-García, O., and Cañamero, L. 2005. Hormonal modulation of perception in motivation-based action selection architectures. In Procs of the Symposium on Agents that Want and Like. SSAISB.

3. Bach, J. 2009. Principles of synthetic intelligence. Oxford University Press.

4. Bach, J. 2015. Modeling motivation in MicroPsi 2. In AGI 2015 Conference Proceedings, 3-13. Springer.10.1007/978-3-319-21365-1_1

5. Bear, M. F.; Connors, B. W.; and Paradiso, M. A. 2015. Neuroscience. Wolters Kluwer.

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Reverse Engineering the Brain Based on Machine Learning;Advances in Neural Computation, Machine Learning, and Cognitive Research IV;2020-10-02

2. AGI Brain: A Learning and Decision Making Framework for Artificial General Intelligence Systems Based on Modern Control Theory;Artificial General Intelligence;2019