Will Dabney
Will Dabney
DeepMind
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
Rainbow: Combining improvements in deep reinforcement learning
M Hessel, J Modayil, H Van Hasselt, T Schaul, G Ostrovski, W Dabney, ...
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
4682018
A distributional perspective on reinforcement learning
MG Bellemare*, W Dabney*, R Munos
arXiv preprint arXiv:1707.06887, 2017
3202017
Successor features for transfer in reinforcement learning
A Barreto, W Dabney, R Munos, JJ Hunt, T Schaul, HP van Hasselt, ...
Advances in neural information processing systems, 4055-4065, 2017
1612017
The cramer distance as a solution to biased wasserstein gradients
MG Bellemare, I Danihelka, W Dabney, S Mohamed, ...
arXiv preprint arXiv:1705.10743, 2017
1112017
Distributed distributional deterministic policy gradients
G Barth-Maron, MW Hoffman, D Budden, W Dabney, D Horgan, D Tb, ...
arXiv preprint arXiv:1804.08617, 2018
1082018
Distributional reinforcement learning with quantile regression
W Dabney, M Rowland, MG Bellemare, R Munos
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
762018
Implicit quantile networks for distributional reinforcement learning
W Dabney, G Ostrovski, D Silver, R Munos
arXiv preprint arXiv:1806.06923, 2018
582018
Adaptive step-size for online temporal difference learning
W Dabney, AG Barto
Twenty-Sixth AAAI Conference on Artificial Intelligence, 2012
482012
Recurrent experience replay in distributed reinforcement learning
S Kapturowski, G Ostrovski, J Quan, R Munos, W Dabney
462018
Rlpy: a value-function-based reinforcement learning framework for education and research
A Geramifard, C Dann, RH Klein, W Dabney, JP How
MIT Press, 2015
332015
Proximal reinforcement learning: A new theory of sequential decision making in primal-dual spaces
S Mahadevan, B Liu, P Thomas, W Dabney, S Giguere, N Jacek, I Gemp, ...
arXiv preprint arXiv:1405.6757, 2014
332014
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
A Gruslys, W Dabney, MG Azar, B Piot, M Bellemare, R Munos
arXiv preprint arXiv:1704.04651, 2017
262017
An analysis of categorical distributional reinforcement learning
M Rowland, MG Bellemare, W Dabney, R Munos, YW Teh
arXiv preprint arXiv:1802.08163, 2018
252018
Projected natural actor-critic
PS Thomas, WC Dabney, S Giguere, S Mahadevan
Advances in neural information processing systems, 2337-2345, 2013
212013
Utile Distinctions for Relational Reinforcement Learning.
W Dabney, A McGovern
IJCAI 7, 738-743, 2007
202007
Autoregressive quantile networks for generative modeling
G Ostrovski, W Dabney, R Munos
arXiv preprint arXiv:1806.05575, 2018
182018
Adaptive step-sizes for reinforcement learning
WC Dabney
142014
A geometric perspective on optimal representations for reinforcement learning
M Bellemare, W Dabney, R Dadashi, AA Taiga, PS Castro, N Le Roux, ...
Advances in Neural Information Processing Systems, 4360-4371, 2019
112019
Natural temporal difference learning
W Dabney, P Thomas
Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014
102014
The termination critic
A Harutyunyan, W Dabney, D Borsa, N Heess, R Munos, D Precup
arXiv preprint arXiv:1902.09996, 2019
62019
The system can't perform the operation now. Try again later.
Articles 1–20