Brendan Bennett
Cited by
Cited by
Directly estimating the variance of the {\lambda}-return using temporal-difference methods
C Sherstan, B Bennett, K Young, DR Ashley, A White, M White, RS Sutton
arXiv preprint arXiv:1801.08287, 2018
Comparing Direct and Indirect Temporal-Difference Methods for Estimating the Variance of the Return.
C Sherstan, DR Ashley, B Bennett, K Young, A White, M White, RS Sutton
UAI, 63-72, 2018
Predicting Periodicity with Temporal Difference Learning
K De Asis, B Bennett, RS Sutton
arXiv preprint arXiv:1809.07435, 2018
Back to Square One: Superhuman Performance in Chutes and Ladders Through Deep Neural Networks and Tree Search
D Ashley, A Kanervisto, B Bennett
arXiv preprint arXiv:2104.00698, 2021
Incrementally Learning Functions of the Return
B Bennett, W Chung, M Zaheer, V Liu
arXiv preprint arXiv:1907.04651, 2019
Nexting and State Discovery in Robot Microworlds
J Modayil, A White, AR Mahmood, B Bennett, DCP Prauchner, RS Sutton
RLDM 2013, 73, 2013
The system can't perform the operation now. Try again later.
Articles 1–6