Richard S. Sutton
Richard S. Sutton
DeepMind, Amii, and University of Alberta
Verified email at richsutton.com - Homepage
Title
Cited by
Cited by
Year
Reinforcement learning: An Introduction, 2nd edition
RS Sutton, AG Barto
MIT press, 2018
382842018
Reinforcement learning: An Introduction, 1st edition
RS Sutton, AG Barto
MIT press, 1998
7070*1998
Learning to predict by the methods of temporal differences
RS Sutton
Machine learning 3 (1), 9-44, 1988
58541988
Neuronlike adaptive elements that can solve difficult learning control problems
AG Barto, RS Sutton, CW Anderson
IEEE transactions on systems, man, and cybernetics 13 (5), 834-846, 1983
39781983
Policy gradient methods for reinforcement learning with function approximation
RS Sutton, DA McAllester, SP Singh, Y Mansour
Advances in neural information processing systems, 1057-1063, 2000
34892000
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
RS Sutton, D Precup, S Singh
Artificial intelligence 112 (1-2), 181-211, 1999
25131999
Guidelines: Guidelines for the diagnosis and management of syncope (version 2009): The Task Force for the Diagnosis and Management of Syncope of the European Society of …
A Moya, R Sutton, F Ammirati, JJ Blanc, M Brignole, JB Dahm, JC Deharo, ...
European heart journal 30 (21), 2631, 2009
18102009
Neural networks for control
WT Miller, PJ Werbos, RS Sutton
MIT press, 1995
1665*1995
Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
RS Sutton
Proceedings of the International Conference on Machine Learning, 216-224, 1990
16191990
Toward a modern theory of adaptive networks: Expectation and prediction.
RS Sutton, AG Barto
Psychological review 88 (2), 135, 1981
15931981
Generalization in reinforcement learning: Successful examples using sparse coarse coding
RS Sutton
Advances in neural information processing systems, 1038-1044, 1996
14541996
Temporal credit assignment in reinforcement learning
RS Sutton
University of Massachusetts, Amherst, http://www.incompleteideas.net/papers …, 1984
8861984
Reinforcement learning with replacing eligibility traces
SP Singh, RS Sutton
Machine learning 22 (1-3), 123-158, 1996
8181996
Time-derivative models of Pavlovian reinforcement.
RS Sutton, AG Barto
Learning and Computational Neuroscience: Foundations of Adaptive Networks …, 1990
7121990
A menu of designs for reinforcement learning over time
PJ Werbos, WT Miller, RS Sutton
Neural networks for control, 67-95, 1990
6241990
2018 ESC Guidelines for the diagnosis and management of syncope
M Brignole, A Moya, FJ de Lange, JC Deharo, PM Elliott, A Fanciulli, ...
European heart journal 39 (21), 1883-1948, 2018
5902018
S., Barto A., G.,“
R Sutton
Reinforcement Learning, An Introduction, 2000
589*2000
Learning and sequential decision making
AG Barto, RS Sutton, CJCH Watkins
Learning and Computational Neuroscience: Foundations of Adaptive Networks, 1990
5631990
Natural actor-critic algorithms
S Bhatnagar, RS Sutton, M Ghavamzadeh, M Lee
Automatica 7 (8), 12, 2009
545*2009
Incremental natural actor-critic algorithms
S Bhatnagar, RS Sutton, M Ghavamzadeh, M Lee
Advances in neural information processing systems, 2008
5452008
The system can't perform the operation now. Try again later.
Articles 1–20