Pierre-Luc Bacon
Pierre-Luc Bacon
University of Montreal
Verified email at mila.quebec - Homepage
Title
Cited by
Cited by
Year
The option-critic architecture
PL Bacon, J Harb, D Precup
Thirty-First AAAI Conference on Artificial Intelligence, 2017
3512017
Conditional computation in neural networks for faster models
E Bengio, PL Bacon, J Pineau, D Precup
arXiv preprint arXiv:1511.06297, 2015
1102015
When waiting is not an option: Learning options with a deliberation cost
J Harb, PL Bacon, M Klissarov, D Precup
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
402018
Optiongan: Learning joint reward-policy options using generative adversarial inverse reinforcement learning
P Henderson, WD Chang, PL Bacon, D Meger, J Pineau, D Precup
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
262018
Convergent TREE BACKUP and RETRACE with function approximation
A Touati, PL Bacon, D Precup, P Vincent
arXiv preprint arXiv:1705.09322, 2017
182017
Learnings options end-to-end for continuous action tasks
M Klissarov, PL Bacon, J Harb, D Precup
arXiv preprint arXiv:1712.00004, 2017
142017
Learning with options that terminate off-policy
A Harutyunyan, P Vrancx, PL Bacon, D Precup, A Nowe
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
112018
Learning with options: Just deliberate and relax
PL Bacon, D Precup
NIPS Bounded Optimality and Rational Metareasoning Workshop, 2015
102015
Learning robust options
DJ Mankowitz, TA Mann, PL Bacon, D Precup, S Mannor
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
82018
On the bottleneck concept for options discovery: Theoretical underpinnings and extension in continuous state spaces
PL Bacon
McGill University Libraries, 2014
62014
Using label propagation for learning temporally abstract actions in reinforcement learning
PL Bacon, D Precup
Proceedings of the Workshop on Multiagent Interaction Networks, 1-7, 2013
62013
Temporal Representation Learning
PL Bacon
McGill University Libraries, 2018
52018
On the bottleneck concept for options discovery
PL Bacon
Masters thesis, McGill University, 2013
42013
Constructing temporal abstractions autonomously in reinforcement learning
PL Bacon, D Precup
Ai Magazine 39 (1), 39-50, 2018
32018
A matrix splitting perspective on planning with options
PL Bacon, D Precup
arXiv preprint arXiv:1612.00916, 2016
32016
Learning and Planning with Timing Information in Markov Decision Processes.
PL Bacon, B Balle, D Precup
UAI, 111-120, 2015
32015
Conditional computation in neural networks using a decision-theoretic approach
PL Bacon, E Bengio, J Pineau, D Precup
Proceedings of the 2nd Multidisciplinary Conference on Reinforcement …, 2015
32015
The Barbados 2018 List of Open Issues in Continual Learning
T Schaul, H van Hasselt, J Modayil, M White, A White, PL Bacon, J Harb, ...
arXiv preprint arXiv:1811.07004, 2018
22018
Analyzing open data from the city of Montreal
J Pineau, PL Bacon
22015
Understanding the Curse of Horizon in Off-Policy Evaluation via Conditional Importance Sampling
Y Liu, PL Bacon, E Brunskill
arXiv preprint arXiv:1910.06508, 2019
12019
The system can't perform the operation now. Try again later.
Articles 1–20