General value function networks M Schlegel, A Jacobsen, Z Abbas, A Patterson, A White, M White Journal of Artificial Intelligence Research 70, 497-543, 2021 | 44 | 2021 |
Importance Resampling for Off-policy Prediction M Schlegel, W Chung, D Graves, J Qian, M White Advances in Neural Information Processing Systems, 1797-1807, 2019 | 43 | 2019 |
Meta-descent for online, continual prediction A Jacobsen, M Schlegel, C Linke, T Degris, A White, M White Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 3943-3950, 2019 | 23 | 2019 |
Context-dependent upper-confidence bounds for directed exploration R Kumaraswamy, M Schlegel, A White, M White Advances in Neural Information Processing Systems 31, 2018 | 18 | 2018 |
Adapting kernel representations online using submodular maximization M Schlegel, Y Pan, J Chen, M White International Conference on Machine Learning, 3037-3046, 2017 | 12 | 2017 |
Structural credit assignment in neural networks using reinforcement learning D Gupta, G Mihucz, M Schlegel, J Kostas, PS Thomas, M White Advances in Neural Information Processing Systems 34, 30257-30270, 2021 | 7 | 2021 |
Continual auxiliary task learning M McLeod, C Lo, M Schlegel, A Jacobsen, R Kumaraswamy, M White, ... Advances in Neural Information Processing Systems 34, 12549-12562, 2021 | 7 | 2021 |
Discovery of Predictive Representations With a Network of General Value Functions M Schlegel, A Patterson, A White, M White | 4 | 2018 |
A baseline of discovery for general value function networks under partial observability M Schlegel, A White, M White NeurIPS Workshop on Reinforcement Learning under Partial Observability …, 2018 | 4 | 2018 |
Generalized Munchausen Reinforcement Learning using Tsallis KL Divergence L Zhu, Z Chen, M Schlegel, M White arXiv preprint arXiv:2301.11476, 2023 | 2 | 2023 |
Investigating Action Encodings in Recurrent Neural Networks in Reinforcement Learning M Schlegel, V Tkachuk, A White, M White Transactions on Machine Learning Research, 2022 | 2 | 2022 |
Stable predictive representations with general value functions for continual learning M Schlegel, A White, M White Continual Learning and Deep Networks workshop at the Neural Information …, 2017 | 2 | 2017 |
General Munchausen reinforcement learning with Tsallis Kullback-Leibler divergence L Zhu, Z Chen, M Schlegel, M White Advances in Neural Information Processing Systems 36, 2024 | 1 | 2024 |
Predictions Predicting Predictions MK Schlegel, M White Multidisciplinary Conference on Reinforcement Learning and Decision Making, 2022 | 1 | 2022 |
Leveraging Off-Policy Prediction in Recurrent Networks for Reinforcement Learning MK Schlegel | | 2023 |
Offline Reinforcement Learning via Tsallis Regularization L Zhu, MK Schlegel, H Wang, M White Transactions on Machine Learning Research | | |
Importance Resampling for Off-policy Policy Evaluation M Schlegel, W Chung, D Graves, M White | | |