Jacob Beck
Jacob Beck
University of Oxford
Verified email at - Homepage
Cited by
Cited by
A survey of meta-reinforcement learning
J Beck, R Vuorio, EZ Liu, Z Xiong, L Zintgraf, C Finn, S Whiteson
arXiv preprint arXiv:2301.08028, 2023
Amrl: Aggregated memory for reinforcement learning
J Beck, K Ciosek, S Devlin, S Tschiatschek, C Zhang, K Hofmann
International Conference on Learning Representations, 2019
Monotonic Improvement Guarantees under Non-stationarity for Decentralized PPO
M Sun, S Devlin, J Beck, K Hofmann, S Whiteson
arXiv preprint arXiv:2202.00082, 2022
Hypernetworks in Meta-Reinforcement Learning
J Beck, MT Jackson, R Vuorio, S Whiteson
6th Annual Conference on Robot Learning, 2022
On the practical consistency of meta-reinforcement learning algorithms
Z Xiong, L Zintgraf, J Beck, R Vuorio, S Whiteson
arXiv preprint arXiv:2112.00478, 2021
Stackelberg punishment and bully-proofing autonomous vehicles
M Cooper, JK Lee, J Beck, JD Fishman, M Gillett, Z Papakipos, A Zhang, ...
Social Robotics: 11th International Conference, ICSR 2019, Madrid, Spain†…, 2019
No DICE: An investigation of the bias-variance tradeoff in meta-gradients
R Vuorio, JA Beck, G Farquhar, JN Foerster, S Whiteson
Deep RL Workshop NeurIPS 2021, 2021
Trust region bounds for decentralized ppo under non-stationarity
M Sun, S Devlin, J Beck, K Hofmann, S Whiteson
Proceedings of the 2023 International Conference on Autonomous Agents and†…, 2023
Reneg and backseat driver: Learning from demonstration with continuous human feedback
J Beck, Z Papakipos, M Littman
arXiv preprint arXiv:1901.05101, 2019
Quality aspects of annotated data: A research synthesis
J Beck
AStA Wirtschafts-und Sozialstatistisches Archiv, 1-23, 2023
Annotation Sensitivity: Training Data Collection Methods Affect Model Performance
C Kern, S Eckman, J Beck, R Chew, B Ma, F Kreuter
arXiv preprint arXiv:2311.14212, 2023
Recurrent Hypernetworks are Surprisingly Strong in Meta-RL
J Beck, R Vuorio, Z Xiong, S Whiteson
arXiv preprint arXiv:2309.14970, 2023
Universal Morphology Control via Contextual Modulation
Z Xiong, J Beck, S Whiteson
arXiv preprint arXiv:2302.11070, 2023
An Investigation of the Bias-Variance Tradeoff in Meta-Gradients
R Vuorio, J Beck, S Whiteson, J Foerster, G Farquhar
arXiv preprint arXiv:2209.11303, 2022
Human-Actor Human-Critic
J Beck, N Srinivasan, A Shah, J Roy
ReNeg and Backseat Driver: Learning from demonstration with continuous human feedback
Z Papakipos, J Beck, M Littman
Neural Mesh: Introducing a Notion of Space and Conservation of Energy to Neural Networks
J Beck, Z Papakipos
arXiv preprint arXiv:1807.11121, 2018
Hypernetworks in Meta-Reinforcement Learning Supplementary Materials
J Beck, M Jackson, R Vuorio, S Whiteson
Collaboration in Deep MARL
J Beck
The system can't perform the operation now. Try again later.
Articles 1–19