Adam Gleave
Adam Gleave
Verified email at eecs.berkeley.edu - Homepage
Title
Cited by
Cited by
Year
Stable baselines
A Hill, A Raffin, M Ernestus, A Gleave, A Kanervisto, R Traore, P Dhariwal, ...
5022018
Firmament: Fast, centralized cluster scheduling at scale
I Gog, M Schwarzkopf, A Gleave, RNM Watson, S Hand
12th {USENIX} Symposium on Operating Systems Design and Implementation …, 2016
1932016
Adversarial policies: Attacking deep reinforcement learning
A Gleave, M Dennis, C Wild, N Kant, S Levine, S Russell
International Conference on Learning Representations, 2020
1322020
Stable baselines3
A Raffin, A Hill, M Ernestus, A Gleave, A Kanervisto, N Dormann
GitHub repository, 2019
1272019
Inverse reinforcement learning for video games
A Tucker, A Gleave, S Russell
Deep Reinforcement Learning Workshop at NeurIPS, 2018
252018
Multi-task maximum entropy inverse reinforcement learning
A Gleave, O Habryka
GoalsRL Workshop at ICML, 2018
192018
Active inverse reward design
S Mindermann, R Shah, A Gleave, D Hadfield-Menell
GoalsRL Workshop at ICML, 2018
152018
Making compression algorithms for Unicode text
A Gleave, C Steinruecken
Data Compression Conference, 2017
122017
Quantifying differences in reward functions
A Gleave, M Dennis, S Legg, S Russell, J Leike
International Conference on Learning Representations, 2021
82021
The imitation library for imitation learning and inverse reinforcement learning
S Wang, S Toyer, A Gleave, S Emmons
52020
Understanding learned reward functions
EJ Michaud, A Gleave, S Russell
Deep Reinforcement Learning Workshop at NeurIPS, 2020
42020
DERAIL: Diagnostic Environments for Reward And Imitation Learning
P Freire, A Gleave, S Toyer, S Russell
Deep Reinforcement Learning Workshop at NeurIPS, 2020
22020
Fast and accurate cluster scheduling using flow networks
A Gleave
Computer Science Tripos Part II Dissertation. University of Cambridge …, 2015
22015
A modular architecture for Unicode text compression
A Gleave
University of Cambridge, 2016
12016
The system can't perform the operation now. Try again later.
Articles 1–14