Deep reinforcement learning that matters
P Henderson*, R Islam*, P Bachman, J Pineau, D Precup, D Meger
Proceedings of the 32nd AAAI Conference on Artificial Intelligence (AAAI-18), 2017
Deep Bayesian Active Learning with Image Data
Y Gal, R Islam, Z Ghahramani
Proceedings of the 34th International Conference on Machine Learning (ICML-17), 2017
An introduction to deep reinforcement learning
V François-Lavet, P Henderson, R Islam, MG Bellemare, J Pineau
arXiv preprint arXiv:1811.12560, 2018
Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control
R Islam, P Henderson, M Gomrokchi, D Precup
Reproducibility in Machine Learning Workshop, ICML 2017, 2017
Bayesian Hypernetworks
D Krueger, CW Huang, R Islam, R Turner, A Lacoste, A Courville
arXiv preprint arXiv:1710.04759, 2017
InfoBot: Transfer and Exploration via the Information Bottleneck
A Goyal, R Islam, DJ Strouse, Z Ahmed, H Larochelle, M Botvinick, ...
International Conference on Learning Representations (ICLR) 2019, 2018
Bayesian Policy Gradients via Alpha Divergence Dropout Inference
P Henderson, T Doan, R Islam, D Meger
Bayesian Deep Learning Workshop, NIPS 2017, 2017
VFunc: a Deep Generative Model for Functions
P Bachman, R Islam, A Sordoni, Z Ahmed
Prediction and Generative Modeling in Reinforcement Learning workshop, ICML 2018, 2018
Re-evaluate: Reproducibility in evaluating reinforcement learning algorithms
K Khetarpal, Z Ahmed, A Cianflone, R Islam, J Pineau
Active Learning for High Dimensional Inputs using Bayesian Convolutional Neural Networks
R Islam, Y Gal, Z Ghahramani
University of Cambridge, Masters Thesis, 2016
Entropy Regularization with Discounted Future State Distribution in Policy Gradient Methods
R Islam, R Seraj, PL Bacon, D Precup
arXiv preprint arXiv:1912.05104, 2019
Marginalized State Distribution Entropy Regularization in Policy Optimization
R Islam, Z Ahmed, D Precup
arXiv preprint arXiv:1912.05128, 2019
Prioritizing starting states for reinforcement learning
A Tavakoli, V Levdik, R Islam, P Kormushev
arXiv preprint arXiv:1811.11298, 2018
Off-Policy Policy Gradient Algorithms by Constraining the State Distribution Shift
R Islam, KK Teru, D Sharma, J Pineau
arXiv preprint arXiv:1911.06970, 2019
Discrete off-policy policy gradient using continuous relaxations
A Cianflone, Z Ahmed, R Islam, AJ Bose, WL Hamilton
unpublished, 2019
Variational state encoding as intrinsic motivation in reinforcement learning
M Klissarov, R Islam, K Khetarpal, D Precup
Task-Agnostic Reinforcement Learning Workshop at Proceedings of the …, 2019
InfoBot: Structured Exploration in Reinforcement Learning Using Information Bottleneck
A Goyal, R Islam, D Strouse, Z Ahmed, M Botvinick, H Larochelle, ...
Alpha-Divergences in Variational Dropout
B Mazoure, R Islam
arXiv preprint arXiv:1711.04345v1, 2017
Doubly Robust Off-Policy Actor-Critic Algorithms for Reinforcement Learning
R Islam, R Seraj, SY Arnob, D Precup
arXiv preprint arXiv:1912.05109, 2019
Transfer Learning by Modeling a Distribution over Policies
D Shrivastava, EG Dhekane, R Islam
arXiv preprint arXiv:1906.03574, 2019
