Pierre-Luc Bacon
Pierre-Luc Bacon
Verified email at cs.stanford.edu - Homepage
TitleCited byYear
The option-critic architecture
PL Bacon, J Harb, D Precup
Thirty-First AAAI Conference on Artificial Intelligence, 2017
2712017
Conditional computation in neural networks for faster models
E Bengio, PL Bacon, J Pineau, D Precup
arXiv preprint arXiv:1511.06297, 2015
862015
When waiting is not an option: Learning options with a deliberation cost
J Harb, PL Bacon, M Klissarov, D Precup
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
272018
Optiongan: Learning joint reward-policy options using generative adversarial inverse reinforcement learning
P Henderson, WD Chang, PL Bacon, D Meger, J Pineau, D Precup
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
152018
Convergent TREE BACKUP and RETRACE with function approximation
A Touati, PL Bacon, D Precup, P Vincent
arXiv preprint arXiv:1705.09322, 2017
152017
Learning with options that terminate off-policy
A Harutyunyan, P Vrancx, PL Bacon, D Precup, A Nowe
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
112018
Learning with options: Just deliberate and relax
PL Bacon, D Precup
NIPS Bounded Optimality and Rational Metareasoning Workshop, 2015
102015
Learning robust options
DJ Mankowitz, TA Mann, PL Bacon, D Precup, S Mannor
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
82018
Learnings options end-to-end for continuous action tasks
M Klissarov, PL Bacon, J Harb, D Precup
arXiv preprint arXiv:1712.00004, 2017
72017
On the bottleneck concept for options discovery: Theoretical underpinnings and extension in continuous state spaces
PL Bacon
McGill University Libraries, 2014
52014
Using label propagation for learning temporally abstract actions in reinforcement learning
PL Bacon, D Precup
Proceedings of the Workshop on Multiagent Interaction Networks, 1-7, 2013
42013
A matrix splitting perspective on planning with options
PL Bacon, D Precup
arXiv preprint arXiv:1612.00916, 2016
32016
Learning and Planning with Timing Information in Markov Decision Processes.
PL Bacon, B Balle, D Precup
UAI, 111-120, 2015
32015
On the bottleneck concept for options discovery
PL Bacon
Masters thesis, McGill University, 2013
32013
Constructing Temporal Abstractions Autonomously in Reinforcement Learning.
PL Bacon, D Precup
AI Magazine 39 (1), 2018
22018
Analyzing Open Data from the City of Montreal.
J Pineau, PL Bacon
MUD@ ICML, 11-16, 2015
22015
Conditional computation in neural networks using a decision-theoretic approach
PL Bacon, E Bengio, J Pineau, D Precup
Proceedings of the 2nd Multidisciplinary Conference on Reinforcement …, 2015
22015
The Barbados 2018 List of Open Issues in Continual Learning
T Schaul, H van Hasselt, J Modayil, M White, A White, PL Bacon, J Harb, ...
arXiv preprint arXiv:1811.07004, 2018
12018
Temporal Representation Learning
PL Bacon
McGill University Libraries, 2018
12018
Reinforcement learning of conditional computation policies for neural networks
E Bengio, PL Bacon, R Lowe, J Pineau, D Precup
ICML Workshop on Abstractions in RL, 2016
12016
The system can't perform the operation now. Try again later.
Articles 1–20