1. Ito J, Nakano K, Sakurama K, et al (2008) Adaptive immunity-based reinforcement learning. Artif Life Robotics 13(1):188–193
2. Watkins CJCH, Dayan P (1992) Technical note: q-learning. Mach Learn 8(3–4):279–292
3. Grefenstette JJ (1988) Credit assignment in rule discovery systems based on genetic algorithms. In: Shavlik JW, Dietterich TG (eds) Readings in machine learning. Kaufmann, San Mateo, pp 524–534
4. Matsui T, Inuzuka N, Seki H (2002) Profit sharing with linear function approximation (in Japanese). 16th Annual Conference of the Japanese Society for Artificial Intelligence, pp 2D3–03
5. Kimura H, Kobayashi S (2000) An analysis of actor-critic algorithms using eligibility traces: reinforcement learning with imperfect value functions (in Japanese). J Jpn Soc Artif Intell 15(2):267–275