Satinder Singh
Satinder Singh
Professor, Computer Science & Engineering, University of Michigan
Verified email at - Homepage
TitleCited byYear
Policy gradient methods for reinforcement learning with function approximation
RS Sutton, DA McAllester, SP Singh, Y Mansour
Advances in neural information processing systems, 1057-1063, 2000
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
RS Sutton, D Precup, S Singh
Artificial intelligence 112 (1-2), 181-211, 1999
Learning to act using real-time dynamic programming
AG Barto, SJ Bradtke, SP Singh
Artificial intelligence 72 (1-2), 81-138, 1995
Near-optimal reinforcement learning in polynomial time
M Kearns, S Singh
Machine learning 49 (2-3), 209-232, 2002
Convergence of stochastic iterative dynamic programming algorithms
T Jaakkola, MI Jordan, SP Singh
Advances in neural information processing systems, 703-710, 1994
Reinforcement learning with replacing eligibility traces
SP Singh, RS Sutton
Machine learning 22 (1-3), 123-158, 1996
Perseus: Randomized point-based value iteration for POMDPs
MTJ Spaan, N Vlassis
Journal of artificial intelligence research 24, 195-220, 2005
Graphical models for game theory
M Kearns, ML Littman, S Singh
arXiv preprint arXiv:1301.2281, 2013
Convergence results for single-step on-policy reinforcement-learning algorithms
S Singh, T Jaakkola, ML Littman, C Szepesvári
Machine learning 38 (3), 287-308, 2000
Intrinsically motivated reinforcement learning
S Singh, A Barto, N Chentanez
Advances in neural information processing systems, 2005
Predictive representations of state
ML Littman, RS Sutton
Advances in neural information processing systems, 1555-1561, 2002
Learning without state-estimation in partially observable Markovian decision processes
SP Singh, T Jaakkola, MI Jordan
Machine Learning Proceedings 1994, 284-292, 1994
Intrinsically motivated learning of hierarchical collections of skills
AG Barto, S Singh, N Chentanez
Proceedings of the 3rd International Conference on Development and Learning …, 2004
Transfer of learning by composing solutions of elemental sequential tasks
SP Singh
Machine Learning 8 (3-4), 323-339, 1992
General approach to the synthesis of short. alpha.-helical peptides
DY Jackson, DS King, J Chmielewski, S Singh, PG Schultz
Journal of the American Chemical Society 113 (24), 9391-9392, 1991
Optimizing dialogue management with reinforcement learning: Experiments with the NJFun system
S Singh, D Litman, M Kearns, M Walker
Journal of Artificial Intelligence Research 16, 105-133, 2002
Reinforcement learning for dynamic channel allocation in cellular telephone systems
SP Singh, DP Bertsekas
Advances in neural information processing systems, 974-980, 1997
Reinforcement learning with soft state aggregation
SP Singh, T Jaakkola, MI Jordan
Advances in neural information processing systems, 361-368, 1995
Nash Convergence of Gradient Dynamics in General-Sum Games.
SP Singh, MJ Kearns, Y Mansour
UAI, 541-548, 2000
Intrinsically motivated reinforcement learning: An evolutionary perspective
S Singh, RL Lewis, AG Barto, J Sorg
IEEE Transactions on Autonomous Mental Development 2 (2), 70-82, 2010
The system can't perform the operation now. Try again later.
Articles 1–20