Gerald Tesauro

Cited by

	All	Since 2019
Citations	18633	6572
h-index	61	37
i10-index	118	74

1400

700

350

1050

19891990199119921993199419951996199719981999200020012002200320042005200620072008200920102011201220132014201520162017201820192020202120222023202471 104 95 84 152 175 135 178 178 219 194 224 300 281 406 354 451 532 609 602 675 622 671 610 610 618 539 591 613 844 1040 1172 1297 1342 1369 352

Gerald Tesauro

IBM Research

Verified email at us.ibm.com - Homepage

Machine Learning Reinforcement Learning Multi-Agent Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Temporal difference learning and TD-Gammon G Tesauro Communications of the ACM 38 (3), 58-68, 1995	2963	1995
Practical issues in temporal difference learning G Tesauro Advances in neural information processing systems 4, 1991	1465	1991
TD-Gammon, a self-teaching backgammon program, achieves master-level play G Tesauro Neural computation 6 (2), 215-219, 1994	1282	1994
Learning to learn without forgetting by maximizing transfer and minimizing interference M Riemer, I Cases, R Ajemian, M Liu, I Rish, Y Tu, G Tesauro arXiv preprint arXiv:1810.11910, 2018	714	2018
Utility functions in autonomic systems WE Walsh, G Tesauro, JO Kephart, R Das International Conference on Autonomic Computing, 2004. Proceedings., 70-77, 2004	614	2004
A hybrid reinforcement learning approach to autonomic resource allocation G Tesauro, NK Jong, R Das, MN Bennani 2006 IEEE International Conference on Autonomic Computing, 65-73, 2006	483	2006
Agent-human interactions in the continuous double auction R Das, JE Hanson, JO Kephart, G Tesauro International joint conference on artificial intelligence 17 (1), 1169-1178, 2001	378	2001
R³: Reinforced Ranker-Reader for Open-Domain Question Answering S Wang, M Yu, X Guo, Z Wang, T Klinger, W Zhang, S Chang, G Tesauro, ... Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018	377	2018
A multi-agent systems approach to autonomic computing G Tesauro, DM Chess, WE Walsh, R Das, A Segal, I Whalley, JO Kephart, ... Proceedings of the Third International Joint Conference on Autonomous Agents …, 2004	364	2004
On-line policy improvement using Monte-Carlo search G Tesauro, G Galperin Advances in neural information processing systems 9, 1996	354	1996
Programming backgammon using self-teaching neural nets G Tesauro Artificial Intelligence 134 (1-2), 181-199, 2002	294	2002
Extending Q-learning to general adaptive multi-agent systems G Tesauro Advances in neural information processing systems 16, 2003	289	2003
Diverse few-shot text classification with multiple metrics M Yu, X Guo, J Yi, S Chang, S Potdar, Y Cheng, G Tesauro, H Wang, ... arXiv preprint arXiv:1805.07513, 2018	272	2018
Neural networks for computer virus recognition GJ Tesauro, JO Kephart, GB Sorkin IEEE expert 11 (4), 5-6, 1996	264	1996
Multiresolution recurrent neural networks: An application to dialogue response generation I Serban, T Klinger, G Tesauro, K Talamadupula, B Zhou, Y Bengio, ... Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017	241	2017
Metric learning for kernel regression KQ Weinberger, G Tesauro Artificial intelligence and statistics, 612-619, 2007	225	2007
Analyzing complex strategic interactions in multi-agent systems WE Walsh, R Das, G Tesauro, JO Kephart AAAI-02 Workshop on Game-Theoretic and Decision-Theoretic Agents, 109-118, 2002	220	2002
Biologically inspired defenses against computer viruses JO Kephart, GB Sorkin, WC Arnold, DM Chess, GJ Tesauro, SR White, ... IJCAI (1), 985-996, 1995	218	1995
Proceedings of the 6th International Conference on Neural Information Processing Systems JD Cowan, G Tesauro, J Alspector Morgan Kaufmann Publishers Inc., 1993	212	1993
Pricing in agent economies using multi-agent Q-learning G Tesauro, JO Kephart Autonomous agents and multi-agent systems 5, 289-304, 2002	210	2002

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by