|An off-policy policy gradient theorem using emphatic weightings|
E Imani, E Graves, M White
Advances in Neural Information Processing Systems 31, 2018
|ScriptEase II: Platform independent story creation using high-level patterns|
K Schenk, A Lari, M Church, E Graves, J Duncan, R Miller, N Desai, ...
Proceedings of the AAAI Conference on Artificial Intelligence and …, 2013
|Off-policy actor-critic with emphatic weightings|
E Graves, E Imani, R Kumaraswamy, M White
Journal of Machine Learning Research 24 (146), 1-63, 2023
|A demonstration of ScriptEase II|
M Church, E Graves, J Duncan, A Lari, R Miller, N Desai, R Zhao, ...
Proceedings of the AAAI Conference on Artificial Intelligence and …, 2011
|Value-aware Importance Weighting for Off-policy Reinforcement Learning|
K De Asis, E Graves, RS Sutton
Conference on Lifelong Learning Agents, 745-763, 2023
|Importance Sampling Placement in Off-Policy Temporal-Difference Methods|
E Graves, S Ghiassian
arXiv preprint arXiv:2203.10172, 2022
|ScriptEase II and Platform Independent Story Creation Using High-Level Game Design Patterns|
A Lari, M Church, E Graves, J Duncan, R Miller, N Desai, R Zhao, ...