Han Wang
Verified email at ualberta.ca
Title
Cited by
Year
The in-sample softmax for offline reinforcement learning
C Xiao, H Wang, Y Pan, A White, M White
arXiv preprint arXiv:2302.14372, 2023
Cited by: 19; Year: 2023
Investigating the properties of neural network representations in reinforcement learning
H Wang, E Miahi, M White, MC Machado, Z Abbas, R Kumaraswamy, ...
Artificial Intelligence, 104100, 2024
Cited by: 15; Year: 2024
Measuring and mitigating interference in reinforcement learning
V Liu, H Wang, RY Tao, K Javed, A White, M White
Conference on Lifelong Learning Agents, 781-795, 2023
Cited by: 3; Year: 2023
No more pesky hyperparameters: Offline hyperparameter tuning for RL
H Wang, A Sakhadeo, A White, J Bell, V Liu, X Zhao, P Liu, T Kozuno, ...
arXiv preprint arXiv:2205.08716, 2022
Cited by: 3; Year: 2022
Replay Memory as An Empirical MDP: Combining Conservative Estimation with Experience Replay
H Zhang, C Xiao, H Wang, J Jin, M Müller
The Eleventh International Conference on Learning Representations, 2022
Cited by: 1; Year: 2022