Title | Authors | Venue | Cited by | Year |
Multi-step reinforcement learning: A unifying algorithm | K De Asis, JF Hernandez-Garcia, GZ Holland, RS Sutton | Thirty-Second AAAI Conference on Artificial Intelligence, 2018 | 140 | 2018 |
Student of Games: A unified learning algorithm for both perfect and imperfect information games | M Schmid, M Moravčík, N Burch, R Kadlec, J Davidson, K Waugh, N Bard, ... | Science Advances 9 (46), eadg3256, 2023 | 65 | 2023 |
The Effect of Planning Shape on Dyna-style Planning in High-dimensional State Spaces | GZ Holland, E Talvitie, M Bowling | arXiv preprint arXiv:1806.01825, 2018 | 54 | 2018 |
Reward-respecting subtasks for model-based reinforcement learning | RS Sutton, MC Machado, GZ Holland, D Szepesvari, F Timbers, B Tanner, ... | Artificial Intelligence 324, 104001, 2023 | 22 | 2023 |
Reward-Respecting Subtasks for Model-Based Reinforcement Learning (Abstract Reprint) | RS Sutton, MC Machado, GZ Holland, D Szepesvari, F Timbers, B Tanner, ... | Proceedings of the AAAI Conference on Artificial Intelligence 38 (20), 22713 …, 2024 | | 2024 |
Player of games | M Schmid, M Moravcik, N Burch, R Kadlec, J Davidson, K Waugh, N Bard, ... | arXiv preprint arXiv:2112.03178, 2021 | | 2021 |