Ahmed Touati
Ahmed Touati
PhD candidate, University of Montreal
Verified email at umontreal.ca
Cited by
Cited by
Learning Dynamics Model in Reinforcement Learning by Incorporating the Long Term Future
NR Ke, A Singh, A Touati, A Goyal, Y Bengio, D Parikh, D Batra
ICLR 2019 - Proceedings of the Seventh International Conference on Learning …, 2019
Convergent Tree Backup and Retrace with Function Approximation
A Touati, PL Bacon, D Precup, P Vincent
ICML 2018, Proceedings of the 35th International Conference on Machine …, 2017
Randomized value functions via multiplicative normalizing flows
A Touati, H Satija, J Romoff, J Pineau, P Vincent
UAI2019: Conference on Uncertainty in Artificial Intelligence 2019, 2018
Learnable explicit density for continuous latent space and variational inference
CW Huang, A Touati, L Dinh, M Drozdzal, M Havaei, L Charlin, ...
ICML 2017 Workshop on Principle Approaches to Deep Learning (padl), 2017
Real-time privacy-preserving model-based estimation of traffic flows
J Le Ny, A Touati, GJ Pappas
2014 ACM/IEEE International Conference on Cyber-Physical Systems (ICCPS), 92-102, 2014
Parametric adversarial divergences are good task losses for generative modeling
G Huang, H Berard, A Touati, G Gidel, P Vincent, S Lacoste-Julien
MAIS18, Montreal AI Symposium 2018, 2017
Separable value functions across time-scales
J Romoff, P Henderson, A Touati, Y Ollivier, J Pineau, E Brunskill
ICML 2019, Proceedings of the 36th International Conference on Machine …, 2019
Efficient learning in non-stationary linear Markov decision processes
A Touati, P Vincent
arXiv preprint arXiv:2010.12870, 2020
Stable Policy Optimization via Off-Policy Divergence Regularization
A Touati, A Zhang, J Pineau, P Vincent
UAI2020: Conference on Uncertainty in Artificial Intelligence 2020, 2020
Zooming for efficient model-free reinforcement learning in metric spaces
A Touati, AA Taiga, MG Bellemare
arXiv preprint arXiv:2003.04069, 2020
SVRG for policy evaluation with fewer gradient evaluations
Z Peng, A Touati, P Vincent, D Precup
IJCAI2020: the 29th International Joint Conference on Artificial Intelligence, 2019
Stochastic Neural Network with Kronecker Flow
CW Huang, A Touati, P Vincent, GK Dziugaite, A Lacoste, A Courville
AISTATS 2020 - Proceedings of the 23nd International Conference on …, 2019
TDprop: Does Adaptive Optimization With Jacobi Preconditioning Help Temporal Difference Learning?
J Romoff, P Henderson, D Kanaa, E Bengio, A Touati, PL Bacon, J Pineau
Proceedings of the 20th International Conference on Autonomous Agents and …, 2021
Learning One Representation to Optimize All Rewards
A Touati, Y Ollivier
NeurIPS 2021: Thirty-fifth Conference on Neural Information Processing Systems, 2021
Maximum Reward Formulation In Reinforcement Learning
SK Gottipati, Y Pathak, R Nuttall, R Chunduru, A Touati, SG Subramanian, ...
arXiv preprint arXiv:2010.03744, 2020
Adaptive Stochastic Dual Coordinate Ascent for Conditional Random Fields
RL Priol, A Touati, S Lacoste-Julien
OPTML 2017: 10th NIPS Workshop on Optimization for Machine Learning (NIPS 2017), 2017
Sharp Analysis of Smoothed Bellman Error Embedding
A Touati, P Vincent
ICML 2020 Workshop on Theoretical Foundations of Reinforcement Learning, 2020
The system can't perform the operation now. Try again later.
Articles 1–17