A survey of meta-reinforcement learning J Beck, R Vuorio, EZ Liu, Z Xiong, L Zintgraf, C Finn, S Whiteson arXiv preprint arXiv:2301.08028, 2023 | 126 | 2023 |
Hypernetworks in Meta-Reinforcement Learning J Beck, MT Jackson, R Vuorio, S Whiteson 6th Annual Conference on Robot Learning, 2022 | 28 | 2022 |
Amrl: Aggregated memory for reinforcement learning J Beck, K Ciosek, S Devlin, S Tschiatschek, C Zhang, K Hofmann International Conference on Learning Representations, 2020 | 22 | 2020 |
Monotonic Improvement Guarantees under Non-stationarity for Decentralized PPO M Sun, S Devlin, J Beck, K Hofmann, S Whiteson arXiv preprint arXiv:2202.00082, 2022 | 12 | 2022 |
Trust region bounds for decentralized ppo under non-stationarity M Sun, S Devlin, J Beck, K Hofmann, S Whiteson arXiv preprint arXiv:2202.00082, 2022 | 9 | 2022 |
On the practical consistency of meta-reinforcement learning algorithms Z Xiong, L Zintgraf, J Beck, R Vuorio, S Whiteson arXiv preprint arXiv:2112.00478, 2021 | 9 | 2021 |
Stackelberg punishment and bully-proofing autonomous vehicles M Cooper, JK Lee, J Beck, JD Fishman, M Gillett, Z Papakipos, A Zhang, ... Social Robotics: 11th International Conference, ICSR 2019, Madrid, Spain …, 2019 | 9 | 2019 |
Universal morphology control via contextual modulation Z Xiong, J Beck, S Whiteson International Conference on Machine Learning, 38286-38300, 2023 | 5 | 2023 |
No DICE: An investigation of the bias-variance tradeoff in meta-gradients R Vuorio, JA Beck, G Farquhar, JN Foerster, S Whiteson Deep RL Workshop NeurIPS 2021, 2021 | 5 | 2021 |
Recurrent hypernetworks are surprisingly strong in meta-RL J Beck, R Vuorio, Z Xiong, S Whiteson Advances in Neural Information Processing Systems 36, 2024 | 3 | 2024 |
Reneg and backseat driver: Learning from demonstration with continuous human feedback J Beck, Z Papakipos, M Littman arXiv preprint arXiv:1901.05101, 2019 | 2 | 2019 |
SplAgger: Split Aggregation for Meta-Reinforcement Learning J Beck, M Jackson, R Vuorio, Z Xiong, S Whiteson arXiv preprint arXiv:2403.03020, 2024 | | 2024 |
Distilling Morphology-Conditioned Hypernetworks for Efficient Universal Morphology Control Z Xiong, R Vuorio, J Beck, M Zimmer, K Shao, S Whiteson arXiv preprint arXiv:2402.06570, 2024 | | 2024 |
An Investigation of the Bias-Variance Tradeoff in Meta-Gradients R Vuorio, J Beck, S Whiteson, J Foerster, G Farquhar arXiv preprint arXiv:2209.11303, 2022 | | 2022 |
Human-Actor Human-Critic J Beck, N Srinivasan, A Shah, J Roy | | 2020 |
Neural Mesh: Introducing a Notion of Space and Conservation of Energy to Neural Networks J Beck, Z Papakipos arXiv preprint arXiv:1807.11121, 2018 | | 2018 |
Hypernetworks in Meta-Reinforcement Learning Supplementary Materials J Beck, M Jackson, R Vuorio, S Whiteson | | |
ReNeg and Backseat Driver: Learning from demonstration with continuous human feedback Z Papakipos, J Beck, M Littman | | |
Collaboration in Deep MARL J Beck | | |