Pierre-Luc Bacon

Cited by

	All	Since 2019
Citations	2402	2100
h-index	17	17
i10-index	23	20

Citations per year

Year	Citations
2016	32
2017	74
2018	184
2019	263
2020	351
2021	351
2022	422
2023	516
2024	197

Public access

7 articles available
0 articles not available

Based on funding mandates

Co-authors

Doina Precup	DeepMind and McGill University	Verified email at cs.mcgill.ca
Jean Harb	OpenAI	Verified email at openai.com
Emmanuel Bengio	McGill University	Verified email at mail.mcgill.ca
Joelle Pineau	School of Computer Science, McGill University; FAIR, Meta AI; Mila	Verified email at cs.mcgill.ca
Martin Klissarov	McGill University, Mila	Verified email at mail.mcgill.ca
Ahmed Touati	Meta AI	Verified email at umontreal.ca
Pascal Vincent	Facebook AI Research; U. Montreal (Professor, Computer Sc. & Op. Res.); MILA; CIFAR	Verified email at iro.umontreal.ca
Emma Brunskill	Associate Professor of Computer Science, Stanford University	Verified email at cs.stanford.edu
Yao Liu	Amazon	Verified email at stanford.edu
Timothy A Mann	Meta	Verified email at fb.com
Daniel J. Mankowitz	Google DeepMind	Verified email at google.com
Shie Mannor	Professor of Electrical Engineering @ Technion & Researcher @ Nvidia Research	Verified email at technion.ac.il
Anna Harutyunyan	DeepMind	Verified email at google.com
Borja Balle	DeepMind	Verified email at google.com
Anima Anandkumar	California Institute of Technology and NVIDIA	Verified email at caltech.edu
David Meger	Associate Professor at McGill University	Verified email at cim.mcgill.ca

Pierre-Luc Bacon

University of Montreal

Verified email at mila.quebec

reinforcement learning, artificial intelligence


Title	Cited by	Year
The option-critic architecture. PL Bacon, J Harb, D Precup. Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017	1184	2017
Conditional computation in neural networks for faster models. E Bengio, PL Bacon, J Pineau, D Precup. arXiv preprint arXiv:1511.06297, 2015	327	2015
When waiting is not an option: Learning options with a deliberation cost. J Harb, PL Bacon, M Klissarov, D Precup. Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018	152	2018
The primacy bias in deep reinforcement learning. E Nikishin, M Schwarzer, P D’Oro, PL Bacon, A Courville. International conference on machine learning, 16828-16847, 2022	98	2022
Learnings options end-to-end for continuous action tasks. M Klissarov, PL Bacon, J Harb, D Precup. arXiv preprint arXiv:1712.00004, 2017	56	2017
Convergent tree backup and retrace with function approximation. A Touati, PL Bacon, D Precup, P Vincent. International Conference on Machine Learning, 4955-4964, 2018	46	2018
Learning robust options. D Mankowitz, T Mann, PL Bacon, D Precup, S Mannor. Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018	44	2018
Sample-efficient reinforcement learning by breaking the replay ratio barrier. P D'Oro, M Schwarzer, E Nikishin, PL Bacon, MG Bellemare, A Courville. Deep Reinforcement Learning Workshop NeurIPS 2022, 2022	42	2022
Options of interest: Temporal abstraction with interest functions. K Khetarpal, M Klissarov, M Chevalier-Boisvert, PL Bacon, D Precup. Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 4444-4451, 2020	42	2020
Understanding the curse of horizon in off-policy evaluation via conditional importance sampling. Y Liu, PL Bacon, E Brunskill. International Conference on Machine Learning, 6184-6193, 2020	39	2020
Policy evaluation networks. J Harb, T Schaul, D Precup, PL Bacon. arXiv preprint arXiv:2002.11833, 2020	38	2020
Control-oriented model-based reinforcement learning with implicit differentiation. E Nikishin, R Abachi, R Agarwal, PL Bacon. Proceedings of the AAAI Conference on Artificial Intelligence 36 (7), 7886-7894, 2022	29	2022
Temporal Representation Learning. PL Bacon. McGill University (Canada), 2018	28	2018
Direct behavior specification via constrained reinforcement learning. J Roy, R Girgis, J Romoff, PL Bacon, C Pal. arXiv preprint arXiv:2112.12228, 2021	23	2021
Learning with options that terminate off-policy. A Harutyunyan, P Vrancx, PL Bacon, D Precup, A Nowe. Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018	23	2018
An information-theoretic perspective on credit assignment in reinforcement learning. D Arumugam, P Henderson, PL Bacon. arXiv preprint arXiv:2103.06224, 2021	19	2021
Xlvin: executed latent value iteration nets. A Deac, P Veličković, O Milinković, PL Bacon, J Tang, M Nikolić. arXiv preprint arXiv:2010.13146, 2020	18	2020
Continuous-time meta-learning with forward mode differentiation. T Deleu, D Kanaa, L Feng, G Kerg, Y Bengio, G Lajoie, PL Bacon. arXiv preprint arXiv:2203.01443, 2022	17	2022
Neural algorithmic reasoners are implicit planners. AI Deac, P Veličković, O Milinkovic, PL Bacon, J Tang, M Nikolic. Advances in Neural Information Processing Systems 34, 15529-15542, 2021	15	2021
The barbados 2018 list of open issues in continual learning. T Schaul, H van Hasselt, J Modayil, M White, A White, PL Bacon, J Harb, ... arXiv preprint arXiv:1811.07004, 2018	13	2018

Articles 1–20