Harm van Seijen
TitleCited byYear
A theoretical and empirical analysis of Expected Sarsa
H Van Seijen, H Van Hasselt, S Whiteson, M Wiering
Adaptive Dynamic Programming and Reinforcement Learning, 2009. ADPRL'09 …, 2009
962009
True Online TD (lambda)
HV Seijen, R Sutton
Proceedings of the 31st International Conference on Machine Learning (ICML …, 2014
732014
True Online Temporal-Difference Learning
H van Seijen, AR Mahmood, PM Pilarski, MC Machado, RS Sutton
arXiv preprint arXiv:1512.04087, 2015
522015
A Deeper Look at Planning as Learning from Replay
H van Seijen, RS Sutton
International Conference on Machine Learning, 2015
242015
Planning by Prioritized Sweeping with Small Backups
H Van Seijen, RS Sutton
arXiv preprint arXiv:1301.2343, 2013
20*2013
Exploiting best-match equations for efficient reinforcement learning
H van Seijen, S Whiteson, H van Hasselt, M Wiering
The Journal of Machine Learning Research 12, 2045-2094, 2011
192011
Efficient abstraction selection in reinforcement learning
H Seijen, S Whiteson, L Kester
Computational Intelligence 30 (4), 657-699, 2014
112014
Switching between representations in reinforcement learning
H Van Seijen, S Whiteson, L Kester
Interactive Collaborative Information Systems, 65-84, 2010
102010
Switching between different state representations in reinforcement learning
H Seijen, B Bakker, L Kester
ACTA, 2008
62008
Switching between different state representations in reinforcement learning
H Van Seijen, B Bakker, L Kester
Proceedings of the 26th IASTED International Conference on Artificial …, 2008
62008
Postponed updates for temporal-difference reinforcement learning
H Van Seijen, S Whiteson
Intelligent Systems Design and Applications, 2009. ISDA'09. Ninth …, 2009
42009
An Empirical Evaluation of True Online TD ({\ lambda})
H van Seijen, AR Mahmood, PM Pilarski, RS Sutton
arXiv preprint arXiv:1507.00353, 2015
12015
Reinforcement learning under space and time constraints
H van Seijen
Informatics Institute, University of Amsterdam, 2011
2011
HYBASE: hyperspectral band selection
PBW Schwering, HHPT Bekman, HH van Seijen
SPIE Defense, Security, and Sensing, 733422-733422-11, 2009
2009
Efficient Reinforcement Learning by Approximating the Best-Match Values
H van Seijen, S Whiteson
The system can't perform the operation now. Try again later.
Articles 1–15