One-shot imitation learning Y Duan, M Andrychowicz, B Stadie, OAI Jonathan Ho, J Schneider, ... Advances in neural information processing systems 30, 2017 | 717 | 2017 |
Incentivizing exploration in reinforcement learning with deep predictive models BC Stadie, S Levine, P Abbeel arXiv preprint arXiv:1507.00814, 2015 | 490 | 2015 |
Evolved policy gradients R Houthooft, Y Chen, P Isola, B Stadie, F Wolski, OAI Jonathan Ho, ... Advances in Neural Information Processing Systems 31, 2018 | 254 | 2018 |
Third-person imitation learning BC Stadie, P Abbeel, I Sutskever arXiv preprint arXiv:1703.01703, 2017 | 247 | 2017 |
Some considerations on learning to explore via meta-reinforcement learning BC Stadie, G Yang, R Houthooft, X Chen, Y Duan, Y Wu, P Abbeel, ... arXiv preprint arXiv:1803.01118, 2018 | 121 | 2018 |
Maximum entropy gain exploration for long horizon multi-goal reinforcement learning S Pitis, H Chan, S Zhao, B Stadie, J Ba International Conference on Machine Learning, 7750-7761, 2020 | 102 | 2020 |
World model as a graph: Learning latent landmarks for planning L Zhang, G Yang, BC Stadie International Conference on Machine Learning, 12611-12620, 2021 | 58 | 2021 |
The importance of sampling inmeta-reinforcement learning B Stadie, G Yang, R Houthooft, P Chen, Y Duan, Y Wu, P Abbeel, ... Advances in Neural Information Processing Systems 31, 9280-9290, 2018 | 34 | 2018 |
One-shot pruning of recurrent neural networks by jacobian spectrum evaluation MS Zhang, B Stadie arXiv preprint arXiv:1912.00120, 2019 | 33 | 2019 |
Transfer learning for estimating causal effects using neural networks SR Künzel, BC Stadie, N Vemuri, V Ramakrishnan, JS Sekhon, P Abbeel arXiv preprint arXiv:1808.07804, 2018 | 32 | 2018 |
Learning intrinsic rewards as a bi-level optimization problem B Stadie, L Zhang, J Ba Conference on Uncertainty in Artificial Intelligence, 111-120, 2020 | 10 | 2020 |
To the Noise and Back: Diffusion for Shared Autonomy T Yoneda, L Sun, B Stadie, G Yang, M Walter arXiv preprint arXiv:2302.12244, 2023 | 3 | 2023 |
Invariance through inference T Yoneda, G Yang, M Walter, BC Stadie | 3 | 2021 |
Estimating heterogeneous treatment effects using neural networks with the Y-Learner BC Stadie, SR Künzel, N Vemuri, JS Sekhon | 3 | 2018 |
Invariance through latent alignment T Yoneda, G Yang, MR Walter, B Stadie arXiv preprint arXiv:2112.08526, 2021 | 1 | 2021 |
Learning intrinsic rewards as a bi-level optimization problem L Zhang, BC Stadie, J Ba Thirty-sixth Conference on Uncertainty in Artificial Intelligence (UAI), 2020 | 1 | 2020 |
One demonstration imitation learning BC Stadie, S Zhao, Q Xu, B Li, L Zhang Advances in neural information processing systems 30, 2019 | 1 | 2019 |
Simulating the stochastic dynamics and cascade failure of power networks C Matthews, B Stadie, J Weare, M Anitescu, C Demarco arXiv preprint arXiv:1806.02420, 2018 | 1 | 2018 |
Learning as a Sampling Problem BC Stadie UC Berkeley, 2018 | 1 | 2018 |
Cold Diffusion on the Replay Buffer: Learning to Plan from Known Good States Z Wang, T Oba, T Yoneda, R Shen, M Walter, BC Stadie arXiv preprint arXiv:2310.13914, 2023 | | 2023 |