Gregory Farquhar
Gregory Farquhar
Verified email at cs.ox.ac.uk
TitleCited byYear
Counterfactual multi-agent policy gradients
JN Foerster, G Farquhar, T Afouras, N Nardelli, S Whiteson
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
2472018
Stabilising experience replay for deep multi-agent reinforcement learning
J Foerster, N Nardelli, G Farquhar, T Afouras, PHS Torr, P Kohli, ...
Proceedings of the 34th International Conference on Machine Learning-Volume …, 2017
1642017
QMIX: monotonic value function factorisation for deep multi-agent reinforcement learning
T Rashid, M Samvelyan, CS De Witt, G Farquhar, J Foerster, S Whiteson
arXiv preprint arXiv:1803.11485, 2018
942018
Treeqn and atreec: Differentiable tree planning for deep reinforcement learning
G Farquhar, T Rocktäschel, M Igl, SA Whiteson
International Conference on Learning Representations, 2018
35*2018
The starcraft multi-agent challenge
M Samvelyan, T Rashid, C Schroeder de Witt, G Farquhar, N Nardelli, ...
Proceedings of the 18th International Conference on Autonomous Agents and …, 2019
172019
Dice: The infinitely differentiable monte-carlo estimator
J Foerster, G Farquhar, M Al-Shedivat, T Rocktäschel, EP Xing, ...
arXiv preprint arXiv:1802.05098, 2018
152018
A Survey of Reinforcement Learning Informed by Natural Language
J Luketina, N Nardelli, G Farquhar, J Foerster, J Andreas, E Grefenstette, ...
arXiv preprint arXiv:1906.03926, 2019
52019
Multi-Agent Common Knowledge Reinforcement Learning
JN Foerster, CAS de Witt, G Farquhar, PHS Torr, W Boehmer, S Whiteson
arXiv preprint arXiv:1810.11702, 2018
42018
Convergence Rates of Distributed TD (0) with Linear Function Approximation for Multi-Agent Reinforcement Learning
TT Doan, ST Maguluri, J Romberg
arXiv preprint arXiv:1902.07393, 2019
32019
A baseline for any order gradient estimation in stochastic computation graphs
J Mao, J Foerster, T Rocktaschel, M Al-Shedivat, G Farquhar, S Whiteson
Journal of Machine Learning Research, 2019
22019
Growing action spaces
G Farquhar, L Gustafson, Z Lin, S Whiteson, N Usunier, G Synnaeve
arXiv preprint arXiv:1906.12266, 2019
12019
Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Gradient Estimators for Reinforcement Learning
G Farquhar, S Whiteson, J Foerster
Advances in Neural Information Processing Systems, 8149-8160, 2019
2019
The system can't perform the operation now. Try again later.
Articles 1–12