Pierre-Luc Bacon
University of Montreal
Verified email at mila.quebec
Title
Cited by
Year
The option-critic architecture
PL Bacon, J Harb, D Precup
Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017
Cited by: 576; Year: 2017
Conditional computation in neural networks for faster models
E Bengio, PL Bacon, J Pineau, D Precup
arXiv preprint arXiv:1511.06297, 2015
Cited by: 177; Year: 2015
When waiting is not an option: Learning options with a deliberation cost
J Harb, PL Bacon, M Klissarov, D Precup
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
Cited by: 72; Year: 2018
Optiongan: Learning joint reward-policy options using generative adversarial inverse reinforcement learning
P Henderson, WD Chang, PL Bacon, D Meger, J Pineau, D Precup
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
Cited by: 38; Year: 2018
Convergent TREE BACKUP and RETRACE with function approximation
A Touati, PL Bacon, D Precup, P Vincent
International Conference on Machine Learning, 4955-4964, 2018
Cited by: 26; Year: 2018
Learning robust options
D Mankowitz, T Mann, PL Bacon, D Precup, S Mannor
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
Cited by: 25; Year: 2018
Learnings options end-to-end for continuous action tasks
M Klissarov, PL Bacon, J Harb, D Precup
arXiv preprint arXiv:1712.00004, 2017
Cited by: 23; Year: 2017
Learning with options that terminate off-policy
A Harutyunyan, P Vrancx, PL Bacon, D Precup, A Nowe
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
Cited by: 15; Year: 2018
Temporal Representation Learning
PL Bacon
McGill University Libraries, 2018
Cited by: 13; Year: 2018
Understanding the curse of horizon in off-policy evaluation via conditional importance sampling
Y Liu, PL Bacon, E Brunskill
International Conference on Machine Learning, 6184-6193, 2020
Cited by: 10; Year: 2020
Options of interest: Temporal abstraction with interest functions
K Khetarpal, M Klissarov, M Chevalier-Boisvert, PL Bacon, D Precup
Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 4444-4451, 2020
Cited by: 10; Year: 2020
Learning with options: Just deliberate and relax
PL Bacon, D Precup
NIPS Bounded Optimality and Rational Metareasoning Workshop, 2015
Cited by: 10; Year: 2015
On the bottleneck concept for options discovery: Theoretical underpinnings and extension in continuous state spaces
PL Bacon
Cited by: 8; Year: 2014
On the bottleneck concept for options discovery
PL Bacon
Ph. D. dissertation, Masters thesis, 2013
Cited by: 6; Year: 2013
The barbados 2018 list of open issues in continual learning
T Schaul, H van Hasselt, J Modayil, M White, A White, PL Bacon, J Harb, ...
arXiv preprint arXiv:1811.07004, 2018
Cited by: 5; Year: 2018
Conditional computation in neural networks using a decision-theoretic approach
PL Bacon, E Bengio, J Pineau, D Precup
Proceedings of the 2nd Multidisciplinary Conference on Reinforcement …, 2015
Cited by: 5; Year: 2015
Using label propagation for learning temporally abstract actions in reinforcement learning
PL Bacon, D Precup
Proceedings of the Workshop on Multiagent Interaction Networks, 1-7, 2013
Cited by: 5; Year: 2013
Xlvin: executed latent value iteration nets
A Deac, P Veličković, O Milinković, PL Bacon, J Tang, M Nikolić
arXiv preprint arXiv:2010.13146, 2020
Cited by: 4; Year: 2020
Policy evaluation networks
J Harb, T Schaul, D Precup, PL Bacon
arXiv preprint arXiv:2002.11833, 2020
Cited by: 4; Year: 2020
Entropy regularization with discounted future state distribution in policy gradient methods
R Islam, R Seraj, PL Bacon, D Precup
arXiv preprint arXiv:1912.05104, 2019
Cited by: 4; Year: 2019