1. A Markovian extension of Valiant's learning model;Aldous,1990
2. Learning with a slowly changing distribution;Bartlett,1992
3. Estimation and approximation bounds for gradient-based reinforcement learning;Bartlett,2000
4. Separating PAC and mistake-bound learning models over the boolean domain;Blum,1990
5. Proc. 31st Conf. on Decision and Control;Campi,1996