Adam White
University of Alberta, DeepMind
Title / Cited by / Year
Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction
RS Sutton, J Modayil, M Delp, T Degris, PM Pilarski, A White, D Precup
The 10th International Conference on Autonomous Agents and Multiagent …, 2011
Cited by: 211; Year: 2011
RL-Glue: Language-independent software for reinforcement-learning experiments
B Tanner, A White
Journal of Machine Learning Research 10 (Sep), 2133-2136, 2009
Cited by: 132; Year: 2009
Multi-timescale nexting in a reinforcement learning robot
J Modayil, A White, RS Sutton
Adaptive Behavior 22 (2), 146-160, 2014
Cited by: 62; Year: 2014
Feature construction for reinforcement learning in hearts
NR Sturtevant, AM White
International Conference on Computers and Games, 122-134, 2006
Cited by: 46; Year: 2006
Report on the 2008 reinforcement learning competition
S Whiteson, B Tanner, A White
AI Magazine 31 (2), 81-81, 2010
Cited by: 45; Year: 2010
Developing a predictive approach to knowledge
A White
University of Alberta, 2015
Cited by: 29; Year: 2015
Reinforcement learning benchmarks and bake-offs II
A Dutech, T Edmunds, J Kok, M Lagoudakis, M Littman, M Riedmiller, ...
Advances in Neural Information Processing Systems (NIPS) 17, 6, 2005
Cited by: 26; Year: 2005
Multi-timescale nexting in a reinforcement learning robot
J Modayil, A White, RS Sutton
International Conference on Simulation of Adaptive Behavior, 299-309, 2012
Cited by: 23; Year: 2012
Scaling life-long off-policy learning
A White, J Modayil, RS Sutton
2012 IEEE International Conference on Development and Learning and …, 2013
Cited by: 19*; Year: 2013
Investigating practical linear temporal difference learning
A White, M White
Proceedings of the 2016 International Conference on Autonomous Agents …, 2016
Cited by: 16; Year: 2016
A greedy approach to adapting the trace parameter for temporal difference learning
M White, A White
Proceedings of the 2016 International Conference on Autonomous Agents …, 2016
Cited by: 13; Year: 2016
Surprise and curiosity for big data robotics
A White, J Modayil, RS Sutton
Workshops at the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014
Cited by: 13; Year: 2014
Acquiring a broad range of empirical knowledge in real time by temporal-difference learning
J Modayil, A White, PM Pilarski, RS Sutton
2012 IEEE International Conference on Systems, Man, and Cybernetics (SMC …, 2012
Cited by: 13; Year: 2012
Interval Estimation for Reinforcement-Learning Algorithms in Continuous-State Domains
M White, A White
Advances in Neural Information Processing Systems, 2010
Cited by: 13; Year: 2010
Accelerated gradient temporal difference learning
Y Pan, A White, M White
Thirty-First AAAI Conference on Artificial Intelligence, 2017
Cited by: 8; Year: 2017
Introspective agents: Confidence measures for general value functions
C Sherstan, A White, MC Machado, PM Pilarski
International Conference on Artificial General Intelligence, 258-261, 2016
Cited by: 6; Year: 2016
Organizing experience: a deeper look at replay mechanisms for sample-based planning in continuous state domains
Y Pan, M Zaheer, A White, A Patterson, M White
arXiv preprint arXiv:1806.04624, 2018
Cited by: 5; Year: 2018
Directly estimating the variance of the λ-return using temporal-difference methods
C Sherstan, B Bennett, K Young, DR Ashley, A White, M White, RS Sutton
arXiv preprint arXiv:1801.08287, 2018
Cited by: 5; Year: 2018
Acquiring Diverse Predictive Knowledge in Real Time by Temporal-difference Learning
J Modayil, A White, PM Pilarski, RS Sutton
Cited by: 5; Year: 2012
Online off-policy prediction
S Ghiassian, A Patterson, M White, RS Sutton, A White
arXiv preprint arXiv:1811.02597, 2018
Cited by: 3; Year: 2018