David Silver
TitleCited byYear
Human-level control through deep reinforcement learning
V Mnih, K Kavukcuoglu, D Silver, AA Rusu, J Veness, MG Bellemare, ...
Nature 518 (7540), 529, 2015
55122015
Mastering the game of Go with deep neural networks and tree search
D Silver, A Huang, CJ Maddison, A Guez, L Sifre, G Van Den Driessche, ...
nature 529 (7587), 484, 2016
47142016
Playing atari with deep reinforcement learning
V Mnih, K Kavukcuoglu, D Silver, A Graves, I Antonoglou, D Wierstra, ...
arXiv preprint arXiv:1312.5602, 2013
22982013
Asynchronous methods for deep reinforcement learning
V Mnih, AP Badia, M Mirza, A Graves, T Lillicrap, T Harley, D Silver, ...
International conference on machine learning, 1928-1937, 2016
16592016
Mastering the game of go without human knowledge
D Silver, J Schrittwieser, K Simonyan, I Antonoglou, A Huang, A Guez, ...
Nature 550 (7676), 354, 2017
16132017
Continuous control with deep reinforcement learning
TP Lillicrap, JJ Hunt, A Pritzel, N Heess, T Erez, Y Tassa, D Silver, ...
arXiv preprint arXiv:1509.02971, 2015
16122015
Deep reinforcement learning with double q-learning
H Van Hasselt, A Guez, D Silver
Thirtieth AAAI Conference on Artificial Intelligence, 2016
9452016
Deterministic policy gradient algorithms
D Silver, G Lever, N Heess, T Degris, D Wierstra, M Riedmiller
ICML, 2014
6722014
Prioritized experience replay
T Schaul, J Quan, I Antonoglou, D Silver
arXiv preprint arXiv:1511.05952, 2015
6212015
Combining online and offline knowledge in UCT
S Gelly, D Silver
Proceedings of the 24th international conference on Machine learning, 273-280, 2007
6012007
Monte-Carlo planning in large POMDPs
D Silver, J Veness
Advances in neural information processing systems, 2164-2172, 2010
5842010
Fast gradient-descent methods for temporal-difference learning with linear function approximation
RS Sutton, HR Maei, D Precup, S Bhatnagar, D Silver, C Szepesvári, ...
Proceedings of the 26th Annual International Conference on Machine Learning …, 2009
3462009
Cooperative Pathfinding.
D Silver
AIIDE 1, 117-122, 2005
3372005
Reinforcement learning with unsupervised auxiliary tasks
M Jaderberg, V Mnih, WM Czarnecki, T Schaul, JZ Leibo, D Silver, ...
arXiv preprint arXiv:1611.05397, 2016
3212016
Mastering chess and shogi by self-play with a general reinforcement learning algorithm
D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ...
arXiv preprint arXiv:1712.01815, 2017
3172017
Monte-Carlo Search and Rapid Action Value Estimation in Computer Go
S Gelly, D Silver
Artificial Intelligence 175 (11), 1856-1875, 2011
2712011
The Grand Challenge of Computer Go: Monte Carlo Tree Search and Extensions
S Gelly, M Schoenauer, M Sebag, O Teytaud, L Kocsis, D Silver, ...
Communications of the ACM, 0
200*
Rainbow: Combining improvements in deep reinforcement learning
M Hessel, J Modayil, H Van Hasselt, T Schaul, G Ostrovski, W Dabney, ...
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
1832018
Massively parallel methods for deep reinforcement learning
A Nair, P Srinivasan, S Blackwell, C Alcicek, R Fearon, A De Maria, ...
arXiv preprint arXiv:1507.04296, 2015
1772015
Learning continuous control policies by stochastic value gradients
N Heess, G Wayne, D Silver, T Lillicrap, T Erez, Y Tassa
Advances in Neural Information Processing Systems, 2944-2952, 2015
1762015
The system can't perform the operation now. Try again later.
Articles 1–20