Pierre-Luc Bacon
University of Montreal
Verified email at mila.quebec
Title
Cited by
Year
The option-critic architecture
PL Bacon, J Harb, D Precup
Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017
Cited by: 667; Year: 2017
Conditional computation in neural networks for faster models
E Bengio, PL Bacon, J Pineau, D Precup
arXiv preprint arXiv:1511.06297, 2015
Cited by: 202; Year: 2015
When waiting is not an option: Learning options with a deliberation cost
J Harb, PL Bacon, M Klissarov, D Precup
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
Cited by: 85; Year: 2018
OptionGAN: Learning joint reward-policy options using generative adversarial inverse reinforcement learning
P Henderson, WD Chang, PL Bacon, D Meger, J Pineau, D Precup
Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018
Cited by: 49; Year: 2018
Learnings options end-to-end for continuous action tasks
M Klissarov, PL Bacon, J Harb, D Precup
arXiv preprint arXiv:1712.00004, 2017
Cited by: 32; Year: 2017
Convergent Tree Backup and Retrace with function approximation
A Touati, PL Bacon, D Precup, P Vincent
International Conference on Machine Learning, 4955-4964, 2018
Cited by: 30; Year: 2018
Learning robust options
DJ Mankowitz, TA Mann, PL Bacon, D Precup, S Mannor
Thirty-Second AAAI Conference on Artificial Intelligence, 2018
Cited by: 30; Year: 2018
Temporal Representation Learning
PL Bacon
McGill University (Canada), 2018
Cited by: 17; Year: 2018
Learning with options that terminate off-policy
A Harutyunyan, P Vrancx, PL Bacon, D Precup, A Nowe
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
Cited by: 16; Year: 2018
Understanding the curse of horizon in off-policy evaluation via conditional importance sampling
Y Liu, PL Bacon, E Brunskill
International Conference on Machine Learning, 6184-6193, 2020
Cited by: 14; Year: 2020
Options of interest: Temporal abstraction with interest functions
K Khetarpal, M Klissarov, M Chevalier-Boisvert, PL Bacon, D Precup
Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 4444-4451, 2020
Cited by: 14; Year: 2020
Learning with options: Just deliberate and relax
PL Bacon, D Precup
NIPS Bounded Optimality and Rational Metareasoning Workshop, 2015
Cited by: 9; Year: 2015
On the bottleneck concept for options discovery: Theoretical underpinnings and extension in continuous state spaces
PL Bacon
Cited by: 9; Year: 2014
XLVIN: Executed latent value iteration nets
A Deac, P Veličković, O Milinković, PL Bacon, J Tang, M Nikolić
arXiv preprint arXiv:2010.13146, 2020
Cited by: 8; Year: 2020
Policy evaluation networks
J Harb, T Schaul, D Precup, PL Bacon
arXiv preprint arXiv:2002.11833, 2020
Cited by: 7; Year: 2020
Graph neural induction of value iteration
A Deac, PL Bacon, J Tang
arXiv preprint arXiv:2009.12604, 2020
Cited by: 6; Year: 2020
On the bottleneck concept for options discovery
PL Bacon
Master's thesis, 2013
Cited by: 6; Year: 2013
Using label propagation for learning temporally abstract actions in reinforcement learning
PL Bacon, D Precup
Proceedings of the Workshop on Multiagent Interaction Networks, 1-7, 2013
Cited by: 6; Year: 2013
The Barbados 2018 list of open issues in continual learning
T Schaul, H van Hasselt, J Modayil, M White, A White, PL Bacon, J Harb, ...
arXiv preprint arXiv:1811.07004, 2018
Cited by: 5; Year: 2018
Conditional computation in neural networks using a decision-theoretic approach
PL Bacon, E Bengio, J Pineau, D Precup
Proceedings of the 2nd Multidisciplinary Conference on Reinforcement …, 2015
Cited by: 5; Year: 2015