Doina Precup

Cited by

	All	Since 2019
Citations	32737	23648
h-index	63	54
i10-index	234	184

6000

3000

1500

4500

20022003200420052006200720082009201020112012201320142015201620172018201920202021202220232024118 119 179 217 246 307 327 331 320 380 408 483 590 610 875 1085 1897 2611 3405 4324 5253 5926 2101

Public access

View all

61 articles

5 articles

available

not available

Based on funding mandates

Co-authors

Joelle PineauSchool of Computer Science, McGill University; FAIR, Meta AI; MilaVerified email at cs.mcgill.ca
Satinder SinghGoogle DeepMind / U. of MichiganVerified email at umich.edu
Prakash PanangadenProfessor of Computer Science, McGill UniversityVerified email at cs.mcgill.ca
Tal ArbelProfessor of Electrical & Computer Engineering, McGill UniversityVerified email at cim.mcgill.ca
Riashat IslamResearch ScientistVerified email at dreamfold.ai
Andre BarretoResearch Scientist, Google DeepMindVerified email at google.com
Emmanuel BengioMcGill UniversityVerified email at mail.mcgill.ca
Yoshua BengioProfessor of computer science, University of Montreal, Mila, IVADO, CIFARVerified email at umontreal.ca
Shie MannorProfessor of Electrical Engineering @ Technion & Researcher @ Nvidia ResearchVerified email at technion.ac.il
David SilverDeepMind, UCLVerified email at google.com
Jean HarbOpenAIVerified email at openai.com
Guilherme Sant AnnaProfessor (Full) of Pediatrics, McGill UniversityVerified email at mcgill.ca
Philip WarrickPerigen Inc.Verified email at perigen.com
Csaba SzepesvariDeepMind & University of AlbertaVerified email at cs.ualberta.ca
Norm FernsVerified email at normferns.com
Jordan FrankSoftware Engineer, FacebookVerified email at cs.mcgill.ca
Amir-massoud FarahmandUniversity of TorontoVerified email at cs.toronto.edu
Pablo Samuel CastroGoogleVerified email at google.com
Hamid MaeiNetflixVerified email at netflix.com
Borja BalleDeepMindVerified email at google.com

Doina Precup

DeepMind and McGill University

Verified email at cs.mcgill.ca

Artificial Intelligence machine learning reinforcement learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
The multimodal brain tumor image segmentation benchmark (BRATS) BH Menze, A Jakab, S Bauer, J Kalpathy-Cramer, K Farahani, J Kirby, ... IEEE transactions on medical imaging 34 (10), 1993-2024, 2014	5450	2014
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning RS Sutton, D Precup, S Singh Artificial intelligence 112 (1-2), 181-211, 1999	4317	1999
Deep reinforcement learning that matters P Henderson, R Islam, P Bachman, J Pineau, D Precup, D Meger Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	2249	2018
Off-policy deep reinforcement learning without exploration S Fujimoto, D Meger, D Precup International conference on machine learning, 2052-2062, 2019	1390	2019
The option-critic architecture PL Bacon, J Harb, D Precup Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017	1191	2017
Eligibility traces for off-policy policy evaluation D Precup Computer Science Department Faculty Publication Series, 80, 2000	922	2000
Fast gradient-descent methods for temporal-difference learning with linear function approximation RS Sutton, HR Maei, D Precup, S Bhatnagar, D Silver, C Szepesvári, ... Proceedings of the 26th annual international conference on machine learning …, 2009	699	2009
Learning with pseudo-ensembles P Bachman, O Alsharif, D Precup Advances in neural information processing systems 27, 2014	634	2014
Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction RS Sutton, J Modayil, M Delp, T Degris, PM Pilarski, A White, D Precup The 10th International Conference on Autonomous Agents and Multiagent …, 2011	579	2011
Algorithms for multi-armed bandit problems V Kuleshov, D Precup arXiv preprint arXiv:1402.6028, 2014	536	2014
Reward is enough D Silver, S Singh, D Precup, RS Sutton Artificial Intelligence 299, 103535, 2021	515	2021
Off-policy temporal-difference learning with function approximation D Precup, RS Sutton, S Dasgupta ICML, 417-424, 2001	458	2001
Learning options in reinforcement learning M Stolle, D Precup Abstraction, Reformulation, and Approximation: 5th International Symposium …, 2002	452	2002
Exploring uncertainty measures in deep networks for multiple sclerosis lesion detection and segmentation T Nair, D Precup, DL Arnold, T Arbel Medical image analysis 59, 101557, 2020	446	2020
Temporal abstraction in reinforcement learning D Precup University of Massachusetts Amherst, 2000	388	2000
Metrics for Finite Markov Decision Processes. N Ferns, P Panangaden, D Precup UAI 4, 162-169, 2004	336	2004
Convergent temporal-difference learning with arbitrary smooth function approximation H Maei, C Szepesvari, S Bhatnagar, D Precup, D Silver, RS Sutton Advances in neural information processing systems 22, 2009	329	2009
Conditional computation in neural networks for faster models E Bengio, PL Bacon, J Pineau, D Precup arXiv preprint arXiv:1511.06297, 2015	328	2015
Reproducibility of benchmarked deep reinforcement learning tasks for continuous control R Islam, P Henderson, M Gomrokchi, D Precup arXiv preprint arXiv:1708.04133, 2017	303	2017
Gradient starvation: A learning proclivity in neural networks M Pezeshki, O Kaba, Y Bengio, AC Courville, D Precup, G Lajoie Advances in Neural Information Processing Systems 34, 1256-1272, 2021	237	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors