Joel Z Leibo

Cited by

	All	Since 2019
Citations	13452	11524
h-index	41	36
i10-index	66	56

2700

1350

675

2025

20132014201520162017201820192020202120222023202462 84 92 155 435 890 1300 1743 2130 2262 2696 1373

Public access

View all

10 articles

1 article

available

not available

Based on funding mandates

Co-authors

Thore GraepelGlobal Lead Computational Science, AI & ML at Altos Labs and Chair of Machine Learning, UCLVerified email at ucl.ac.uk
TOMASO POGGIOMcDermott Professor in Brain Sciences, MITVerified email at ai.mit.edu
Edward HughesStaff Research Engineer, DeepMindVerified email at google.com
Marc LanctotResearch Scientist, Google DeepMindVerified email at google.com
Edgar A. Duéñez-GuzmánGoogle DeepMindVerified email at oeb.harvard.edu
Karl TuylsFounder at H company, ex-Google DeepMind, Prof at University of LiverpoolVerified email at hcompany.ai
Wojciech Marian Czarnecki.Verified email at google.com
Matthew BotvinickGoogle DeepMind, Yale Law School, University College LondonVerified email at google.com
Charlie BeattieSoftware Engineer, DeepMindVerified email at google.com
Peter SunehagGoogle - DeepMindVerified email at google.com
Tom SchaulSenior Staff Scientist, DeepMindVerified email at nyu.edu
Kevin R. McKeeStaff Research Scientist, Google DeepMindVerified email at deepmind.com
Raphael KösterGoogle DeepMindVerified email at google.com
Audrūnas GruslysVerified email at gruslys.com
Jane X. WangStaff Research Scientist, DeepMindVerified email at google.com
Max JaderbergChief AI Scientist, Isomorphic LabsVerified email at robots.ox.ac.uk
Fabio AnselmiAssistant professor at University of Trieste, MIT affiliateVerified email at units.it
Vinicius ZambaldiGoogle DeepmindVerified email at google.com
Dharshan KumaranGoogle DeepMindVerified email at fil.ion.ucl.ac.uk
Zeb Kurth-NelsonDeepMind, UCLVerified email at google.com

Joel Z Leibo

Research scientist

Verified email at google.com - Homepage

Cooperation in AI & Neuroscience Multi-Agent Reinforcement Learning Machine Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Value-decomposition networks for cooperative multi-agent learning P Sunehag, G Lever, A Gruslys, WM Czarnecki, V Zambaldi, M Jaderberg, ... arXiv preprint arXiv:1706.05296, 2017	1703	2017
Deep q-learning from demonstrations T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ... Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	1389*	2018
Reinforcement learning with unsupervised auxiliary tasks M Jaderberg, V Mnih, WM Czarnecki, T Schaul, JZ Leibo, D Silver, ... arXiv preprint arXiv:1611.05397, 2016	1388	2016
Learning to reinforcement learn JX Wang, Z Kurth-Nelson, D Tirumala, H Soyer, JZ Leibo, R Munos, ... arXiv preprint arXiv:1611.05763, 2016	1065	2016
Human-level performance in 3D multiplayer games with population-based reinforcement learning M Jaderberg, WM Czarnecki, I Dunning, L Marris, G Lever, AG Castaneda, ... Science 364 (6443), 859-865, 2019	945	2019
Multi-agent reinforcement learning in sequential social dilemmas JZ Leibo, V Zambaldi, M Lanctot, J Marecki, T Graepel arXiv preprint arXiv:1702.03037, 2017	883	2017
Prefrontal cortex as a meta-reinforcement learning system JX Wang, Z Kurth-Nelson, D Kumaran, D Tirumala, H Soyer, JZ Leibo, ... Nature neuroscience 21 (6), 860-868, 2018	634	2018
Deepmind lab C Beattie, JZ Leibo, D Teplyashin, T Ward, M Wainwright, H Küttler, ... arXiv preprint arXiv:1612.03801, 2016	601	2016
Social influence as intrinsic motivation for multi-agent deep reinforcement learning N Jaques, A Lazaridou, E Hughes, C Gulcehre, P Ortega, DJ Strouse, ... International conference on machine learning, 3040-3049, 2019	525	2019
Model-free episodic control C Blundell, B Uria, A Pritzel, Y Li, A Ruderman, JZ Leibo, J Rae, ... arXiv preprint arXiv:1606.04460, 2016	296	2016
The dynamics of invariant object recognition in the human visual system L Isik, EM Meyers, JZ Leibo, T Poggio Journal of neurophysiology 111 (1), 91-102, 2014	279	2014
Using fast weights to attend to the recent past J Ba, GE Hinton, V Mnih, JZ Leibo, C Ionescu Advances in neural information processing systems 29, 2016	268	2016
Inequity aversion improves cooperation in intertemporal social dilemmas E Hughes, JZ Leibo, M Phillips, K Tuyls, E Dueñez-Guzman, ... Advances in neural information processing systems 31, 2018	246	2018
A multi-agent reinforcement learning model of common-pool resource appropriation J Perolat, JZ Leibo, V Zambaldi, C Beattie, K Tuyls, T Graepel Advances in neural information processing systems 30, 2017	217	2017
Open problems in cooperative ai A Dafoe, E Hughes, Y Bachrach, T Collins, KR McKee, JZ Leibo, K Larson, ... arXiv preprint arXiv:2012.08630, 2020	197	2020
Unsupervised predictive memory in a goal-directed agent G Wayne, CC Hung, D Amos, M Mirza, A Ahuja, A Grabska-Barwinska, ... arXiv preprint arXiv:1803.10760, 2018	196	2018
Emergent communication through negotiation K Cao, A Lazaridou, M Lanctot, JZ Leibo, K Tuyls, S Clark arXiv preprint arXiv:1804.03980, 2018	184	2018
How important is weight symmetry in backpropagation? Q Liao, J Leibo, T Poggio Proceedings of the AAAI Conference on Artificial Intelligence 30 (1), 2016	177	2016
Unsupervised learning of invariant representations F Anselmi, JZ Leibo, L Rosasco, J Mutch, A Tacchetti, T Poggio Theoretical Computer Science 633, 112-121, 2016	144	2016
Kickstarting deep reinforcement learning S Schmitt, JJ Hudson, A Zidek, S Osindero, C Doersch, WM Czarnecki, ... arXiv preprint arXiv:1803.03835, 2018	143	2018

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors