Pierre Ménard

Cited by

	All	Since 2019
Citations	1316	1263
h-index	19	19
i10-index	25	23

380

190

285

2016201720182019202020212022202320248 4 24 55 110 222 282 362 231

Public access

View all

23 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Michal ValkoLlama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMindVerified email at meta.com
Omar Darwiche DominguesCohereVerified email at cohere.com
Emilie KaufmannCNRS & Univ. Lille (CRIStAL)Verified email at inria.fr
Aurélien GarivierEcole Normale Supérieure de LyonVerified email at ens-lyon.fr
Rémi MunosGoogle DeepMindVerified email at inria.fr
Edouard LeurentDeepMindVerified email at deepmind.com
Anders JonssonArtificial Intelligence and Machine Learning group, Universitat Pompeu FabraVerified email at upf.edu
Xuedong ShangINRIA (SequeL -> SCOOL)Verified email at inria.fr
Matteo PirottaResearch Scientist, Meta (FAIR)Verified email at fb.com
Daniil TiapkinÉcole PolytechniqueVerified email at polytechnique.edu
Alexey NaumovNational Research University Higher School of EconomicsVerified email at hse.ru
Prof. Dr. Denis BelomestnyDuisburg-Essen UniversityVerified email at uni-due.de
Tadashi KozunoOMRON SINIC XVerified email at alumni.oist.jp
Rémy DegenneInria LilleVerified email at inria.fr
Eric MoulinesProfesseur, Ecole Polytechnique, Membre de l'Académie des SciencesVerified email at polytechnique.edu
Rianne de HeideAssistant professor, Mathematics department, Vrije Universiteit AmsterdamVerified email at utwente.nl
Hedi HADIJICentraleSupelecVerified email at centralesupelec.fr
Wouter M. KoolenCentrum Wiskunde & Informatica; University of TwenteVerified email at cwi.nl
Alessandro LazaricResearch Scientist, Facebook Artificial Intelligence ResearchVerified email at inria.fr
Sébastien GerchinovitzResearch scientist, IRT Saint Exupéry, ToulouseVerified email at math.univ-toulouse.fr

Pierre Ménard

OvGU Magdeburg

Verified email at inria.fr - Homepage


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Explore first, exploit next: The true shape of regret in bandit problems A Garivier, P Ménard, G Stoltz Mathematics of Operations Research 44 (2), 377-399, 2019	195	2019
Episodic Reinforcement Learning in Finite MDPs: Minimax Lower Bounds Revisited O Darwiche Domingues, P Ménard, E Kaufmann, M Valko arXiv e-prints, arXiv: 2010.03531, 2020	117*	2020
Fast active learning for pure exploration in reinforcement learning P Ménard, OD Domingues, A Jonsson, E Kaufmann, E Leurent, M Valko International Conference on Machine Learning, 7599-7608, 2021	99	2021
Adaptive reward-free exploration E Kaufmann, P Ménard, OD Domingues, A Jonsson, E Leurent, M Valko Algorithmic Learning Theory, 865-891, 2021	89	2021
Non-asymptotic pure exploration by solving games R Degenne, WM Koolen, P Ménard Advances in Neural Information Processing Systems 32, 2019	89	2019
Gamification of pure exploration for linear bandits R Degenne, P Ménard, X Shang, M Valko International Conference on Machine Learning, 2432-2442, 2020	88	2020
Fixed-confidence guarantees for bayesian best-arm identification X Shang, R Heide, P Menard, E Kaufmann, M Valko International Conference on Artificial Intelligence and Statistics, 1823-1832, 2020	65	2020
A minimax and asymptotically optimal algorithm for stochastic bandits P Ménard, A Garivier International Conference on Algorithmic Learning Theory, 223-237, 2017	55	2017
Kernel-based reinforcement learning: A finite-time analysis OD Domingues, P Ménard, M Pirotta, E Kaufmann, M Valko International Conference on Machine Learning, 2783-2792, 2021	48*	2021
KL-UCB-switch: optimal regret bounds for stochastic bandits from both a distribution-dependent and a distribution-free viewpoints A Garivier, H Hadiji, P Menard, G Stoltz Journal of Machine Learning Research 23 (179), 1-66, 2022	43	2022
Ucb momentum q-learning: Correcting the bias without forgetting P Ménard, OD Domingues, X Shang, M Valko International Conference on Machine Learning, 7609-7618, 2021	42	2021
A Kernel-Based Approach to Non-Stationary Reinforcement Learning in Metric Spaces O Darwiche Domingues, P Ménard, M Pirotta, E Kaufmann, M Valko arXiv e-prints, arXiv: 2007.05078, 2020	38*	2020
Fano’s inequality for random variables S Gerchinovitz, P Ménard, G Stoltz	37	2020
Learning in two-player zero-sum partially observable Markov games with perfect recall T Kozuno, P Ménard, R Munos, M Valko Advances in Neural Information Processing Systems 34, 11987-11998, 2021	36	2021
A single algorithm for both restless and rested rotting bandits J Seznec, P Menard, A Lazaric, M Valko International Conference on Artificial Intelligence and Statistics, 3784-3794, 2020	32	2020
Planning in markov decision processes with gap-dependent sample complexity A Jonsson, E Kaufmann, P Ménard, O Darwiche Domingues, E Leurent, ... Advances in Neural Information Processing Systems 33, 1253-1263, 2020	32	2020
Thresholding bandit for dose-ranging: The impact of monotonicity A Garivier, P Ménard, L Rossi, P Menard arXiv preprint arXiv:1711.04454, 2017	29	2017
Bandits with many optimal arms R De Heide, J Cheshire, P Ménard, A Carpentier Advances in Neural Information Processing Systems 34, 22457-22469, 2021	20	2021
Planning in entropy-regularized Markov decision processes and games JB Grill, O Darwiche Domingues, P Ménard, R Munos, M Valko Advances in Neural Information Processing Systems 32, 2019	19	2019
rlberry-A Reinforcement Learning Library for Research and Education OD Domingues, Y Flet-Berliac, E Leurent, P Ménard, X Shang, M Valko October, 2021	18	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors