Gerald Tesauro
Gerald Tesauro
IBM Research
Verified email at us.ibm.com - Homepage
Title
Cited by
Cited by
Year
Temporal difference learning and TD-Gammon
G Tesauro
Communications of the ACM 38 (3), 58-68, 1995
23141995
Practical issues in temporal difference learning
G Tesauro
Machine learning 8 (3), 257-277, 1992
13451992
TD-Gammon, a self-teaching backgammon program, achieves master-level play
G Tesauro
Neural computation 6 (2), 215-219, 1994
10441994
Utility functions in autonomic systems
WE Walsh, G Tesauro, JO Kephart, R Das
International Conference on Autonomic Computing, 2004. Proceedings., 70-77, 2004
5822004
A hybrid reinforcement learning approach to autonomic resource allocation
G Tesauro, NK Jong, R Das, MN Bennani
2006 IEEE International Conference on Autonomic Computing, 65-73, 2006
4202006
A multi-agent systems approach to autonomic computing
G Tesauro, DM Chess, WE Walsh, R Das, A Segal, I Whalley, JO Kephart, ...
Proceedings of the Third International Joint Conference on Autonomous Agents …, 2004
3452004
Agent-human interactions in the continuous double auction
R Das, JE Hanson, JO Kephart, G Tesauro
International joint conference on artificial intelligence 17 (1), 1169-1178, 2001
3382001
On-line policy improvement using Monte-Carlo search
G Tesauro, G Galperin
Advances in Neural Information Processing Systems 9, 1068-1074, 1996
2761996
Programming backgammon using self-teaching neural nets
G Tesauro
Artificial Intelligence 134 (1-2), 181-199, 2002
2582002
Neural networks for computer virus recognition
GJ Tesauro, JO Kephart, GB Sorkin
IEEE expert 11 (4), 5-6, 1996
2361996
R 3: Reinforced ranker-reader for open-domain question answering
S Wang, M Yu, X Guo, Z Wang, T Klinger, W Zhang, S Chang, G Tesauro, ...
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2252018
Extending Q-learning to general adaptive multi-agent systems
G Tesauro
Advances in neural information processing systems 16, 871-878, 2003
2202003
Advances in neural information processing systems
GE Hinton, RS Zemel, J Cowan, G Tesauro, J Alspector
MIT Press 6, 3-10, 1994
2051994
A parallel network that learns to play backgammon
G Tesauro, TJ Sejnowski
Artificial Intelligence 39 (3), 357-390, 1989
1991989
Analyzing complex strategic interactions in multi-agent systems
WE Walsh, R Das, G Tesauro, JO Kephart
AAAI-02 Workshop on Game-Theoretic and Decision-Theoretic Agents, 109-118, 2002
1982002
Coordinating multiple autonomic managers to achieve specified power-performance tradeoffs
JO Kephart, H Chan, R Das, DW Levine, G Tesauro, F Rawson, C Lefurgy
Fourth International Conference on Autonomic Computing (ICAC'07), 24-24, 2007
1922007
Learning to learn without forgetting by maximizing transfer and minimizing interference
M Riemer, I Cases, R Ajemian, M Liu, I Rish, Y Tu, G Tesauro
arXiv preprint arXiv:1810.11910, 2018
1912018
Reinforcement learning in autonomic computing: A manifesto and case studies
G Tesauro
IEEE Internet Computing 11 (1), 22-30, 2007
1902007
Multiresolution recurrent neural networks: An application to dialogue response generation
I Serban, T Klinger, G Tesauro, K Talamadupula, B Zhou, Y Bengio, ...
Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017
1892017
Biologically inspired defenses against computer viruses
JO Kephart, GB Sorkin, WC Arnold, DM Chess, GJ Tesauro, SR White, ...
IJCAI (1), 985-996, 1995
1891995
The system can't perform the operation now. Try again later.
Articles 1–20