Gerald Tesauro
Gerald Tesauro
IBM Research
Verified email at - Homepage
Cited by
Cited by
Temporal difference learning and TD-Gammon
G Tesauro
Communications of the ACM 38 (3), 58-68, 1995
Practical issues in temporal difference learning
G Tesauro
Machine learning 8 (3), 257-277, 1992
TD-Gammon, a self-teaching backgammon program, achieves master-level play
G Tesauro
Neural computation 6 (2), 215-219, 1994
Utility functions in autonomic systems
WE Walsh, G Tesauro, JO Kephart, R Das
International Conference on Autonomic Computing, 2004. Proceedings., 70-77, 2004
A hybrid reinforcement learning approach to autonomic resource allocation
G Tesauro, NK Jong, R Das, MN Bennani
2006 IEEE International Conference on Autonomic Computing, 65-73, 2006
A multi-agent systems approach to autonomic computing
G Tesauro, DM Chess, WE Walsh, R Das, A Segal, I Whalley, JO Kephart, ...
Proceedings of the Third International Joint Conference on Autonomous Agents …, 2004
Agent-human interactions in the continuous double auction
R Das, JE Hanson, JO Kephart, G Tesauro
International joint conference on artificial intelligence 17 (1), 1169-1178, 2001
On-line policy improvement using Monte-Carlo search
G Tesauro, G Galperin
Advances in Neural Information Processing Systems 9, 1068-1074, 1996
Programming backgammon using self-teaching neural nets
G Tesauro
Artificial Intelligence 134 (1-2), 181-199, 2002
Neural networks for computer virus recognition
GJ Tesauro, JO Kephart, GB Sorkin
IEEE expert 11 (4), 5-6, 1996
R 3: Reinforced ranker-reader for open-domain question answering
S Wang, M Yu, X Guo, Z Wang, T Klinger, W Zhang, S Chang, G Tesauro, ...
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
Extending Q-learning to general adaptive multi-agent systems
G Tesauro
Advances in neural information processing systems 16, 871-878, 2003
Advances in neural information processing systems
GE Hinton, RS Zemel, J Cowan, G Tesauro, J Alspector
MIT Press 6, 3-10, 1994
A parallel network that learns to play backgammon
G Tesauro, TJ Sejnowski
Artificial Intelligence 39 (3), 357-390, 1989
Analyzing complex strategic interactions in multi-agent systems
WE Walsh, R Das, G Tesauro, JO Kephart
AAAI-02 Workshop on Game-Theoretic and Decision-Theoretic Agents, 109-118, 2002
Coordinating multiple autonomic managers to achieve specified power-performance tradeoffs
JO Kephart, H Chan, R Das, DW Levine, G Tesauro, F Rawson, C Lefurgy
Fourth International Conference on Autonomic Computing (ICAC'07), 24-24, 2007
Learning to learn without forgetting by maximizing transfer and minimizing interference
M Riemer, I Cases, R Ajemian, M Liu, I Rish, Y Tu, G Tesauro
arXiv preprint arXiv:1810.11910, 2018
Reinforcement learning in autonomic computing: A manifesto and case studies
G Tesauro
IEEE Internet Computing 11 (1), 22-30, 2007
Multiresolution recurrent neural networks: An application to dialogue response generation
I Serban, T Klinger, G Tesauro, K Talamadupula, B Zhou, Y Bengio, ...
Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017
Biologically inspired defenses against computer viruses
JO Kephart, GB Sorkin, WC Arnold, DM Chess, GJ Tesauro, SR White, ...
IJCAI (1), 985-996, 1995
The system can't perform the operation now. Try again later.
Articles 1–20