Kenny Young
Title
Cited by
Year
MinAtar: An Atari-inspired testbed for thorough and reproducible reinforcement learning experiments
K Young, T Tian
arXiv preprint arXiv:1903.03176, 2019
Cited by 66*, 2019
Comparing Direct and Indirect Temporal-Difference Methods for Estimating the Variance of the Return.
C Sherstan, DR Ashley, B Bennett, K Young, A White, M White, RS Sutton
UAI, 63-72, 2018
Cited by 29*, 2018
NeuroHex: A deep Q-learning Hex agent
K Young, G Vasan, R Hayward
Computer Games: 5th Workshop on Computer Games, CGW 2016, and 5th Workshop…, 2017
Cited by 28, 2017
Metatrace Actor-Critic: Online Step-Size Tuning by Meta-gradient Descent for Reinforcement Learning Control
K Young, B Wang, ME Taylor
International Joint Conference on Artificial Intelligence, 4185-4191, 2019
Cited by 18*, 2019
Integrating episodic memory into a reinforcement learning agent using reservoir sampling
KJ Young, RS Sutton, S Yang
arXiv preprint arXiv:1806.00540, 2018
Cited by 5, 2018
Variance Reduced Advantage Estimation with Hindsight Credit Assignment
K Young
arXiv preprint arXiv:1911.08362, 2019
Cited by 3, 2019
Understanding the pathologies of approximate policy evaluation when combined with greedification in reinforcement learning
K Young, RS Sutton
arXiv preprint arXiv:2010.15268, 2020
Cited by 2, 2020
A Reverse Hex Solver
K Young, RB Hayward
Computers and Games: 9th International Conference, CG 2016, Leiden, The…, 2016
Cited by 2, 2016
MoHex wins Hex 11×11 and 13×13 tournament
RB Hayward, N Weninger, K Young, K Takada, T Zhang
ICGA J.(2017, To appear), 0
Cited by 2
The Benefits of Model-Based Generalization in Reinforcement Learning
K Young, A Ramesh, L Kirsch, J Schmidhuber
arXiv preprint arXiv:2211.02222, 2022
2022
Hindsight Network Credit Assignment: Efficient Credit Assignment in Networks of Discrete Stochastic Units
K Young
Proceedings of the AAAI Conference on Artificial Intelligence 36 (8), 8919-8926, 2022
2022
Doubly-Asynchronous Value Iteration: Making Value Iteration Asynchronous in Actions
T Tian, K Young, RS Sutton
Advances in Neural Information Processing Systems (NeurIPS), 2022
2022
Learning What to Remember with Online Policy Gradient Over a Reservoir
K Young, RS Sutton
MoHex wins 2016 Hex 11×11 and 13×13 tournaments
R Hayward, N Weninger, K Young, K Takada, T Zhang