Gerald Tesauro

Cited by

	All	Since 2019
Citations	18840	6837
h-index	60	38
i10-index	116	73

1400

700

350

1050

19891990199119921993199419951996199719981999200020012002200320042005200620072008200920102011201220132014201520162017201820192020202120222023202470 104 95 86 152 174 137 177 178 218 192 223 303 280 411 359 444 523 602 596 670 626 663 610 605 605 527 586 611 844 1028 1168 1288 1320 1353 680

Gerald Tesauro

IBM Research

Verified email at us.ibm.com - Homepage

Machine Learning Reinforcement Learning Multi-Agent Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Temporal difference learning and TD-Gammon G Tesauro Communications of the ACM 38 (3), 58-68, 1995	3022	1995
Practical issues in temporal difference learning G Tesauro Advances in neural information processing systems 4, 1991	1463	1991
TD-Gammon, a self-teaching backgammon program, achieves master-level play G Tesauro Neural computation 6 (2), 215-219, 1994	1299	1994
Learning to learn without forgetting by maximizing transfer and minimizing interference M Riemer, I Cases, R Ajemian, M Liu, I Rish, Y Tu, G Tesauro arXiv preprint arXiv:1810.11910, 2018	764	2018
Utility functions in autonomic systems WE Walsh, G Tesauro, JO Kephart, R Das International Conference on Autonomic Computing, 2004. Proceedings., 70-77, 2004	618	2004
A hybrid reinforcement learning approach to autonomic resource allocation G Tesauro, NK Jong, R Das, MN Bennani 2006 IEEE International Conference on Autonomic Computing, 65-73, 2006	483	2006
R³: Reinforced Ranker-Reader for Open-Domain Question Answering S Wang, M Yu, X Guo, Z Wang, T Klinger, W Zhang, S Chang, G Tesauro, ... Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018	389	2018
Agent-human interactions in the continuous double auction R Das, JE Hanson, JO Kephart, G Tesauro International joint conference on artificial intelligence 17 (1), 1169-1178, 2001	380	2001
A multi-agent systems approach to autonomic computing G Tesauro, DM Chess, WE Walsh, R Das, A Segal, I Whalley, JO Kephart, ... Proceedings of the Third International Joint Conference on Autonomous Agents …, 2004	364	2004
On-line policy improvement using Monte-Carlo search G Tesauro, G Galperin Advances in neural information processing systems 9, 1996	359	1996
Programming backgammon using self-teaching neural nets G Tesauro Artificial Intelligence 134 (1-2), 181-199, 2002	298	2002
Extending Q-learning to general adaptive multi-agent systems G Tesauro Advances in neural information processing systems 16, 2003	295	2003
Diverse few-shot text classification with multiple metrics M Yu, X Guo, J Yi, S Chang, S Potdar, Y Cheng, G Tesauro, H Wang, ... arXiv preprint arXiv:1805.07513, 2018	289	2018
Neural networks for computer virus recognition GJ Tesauro, JO Kephart, GB Sorkin IEEE expert 11 (4), 5-6, 1996	262	1996
Multiresolution recurrent neural networks: An application to dialogue response generation I Serban, T Klinger, G Tesauro, K Talamadupula, B Zhou, Y Bengio, ... Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017	248	2017
Analyzing complex strategic interactions in multi-agent systems WE Walsh, R Das, G Tesauro, JO Kephart AAAI-02 Workshop on Game-Theoretic and Decision-Theoretic Agents, 109-118, 2002	226	2002
Metric learning for kernel regression KQ Weinberger, G Tesauro Artificial intelligence and statistics, 612-619, 2007	225	2007
Biologically inspired defenses against computer viruses JO Kephart, GB Sorkin, WC Arnold, DM Chess, GJ Tesauro, SR White, ... IJCAI (1), 985-996, 1995	225	1995
Pricing in agent economies using multi-agent Q-learning G Tesauro, JO Kephart Autonomous agents and multi-agent systems 5, 289-304, 2002	214	2002
Reinforcement learning in autonomic computing: A manifesto and case studies G Tesauro IEEE Internet Computing 11 (1), 22-30, 2007	208	2007

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by