Matthieu Geist

Cited by

	All	Since 2019
Citations	6848	5467
h-index	42	38
i10-index	103	81

2000

1000

500

1500

200920102011201220132014201520162017201820192020202120222023202428 45 95 112 162 182 145 255 151 188 253 444 734 899 1176 1954

Public access

View all

15 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Olivier PietquinCohere | ex Google DeepMind (On leave - Professor at University of Lille)Verified email at univ-lille.fr
Bilal PiotGoogle DeepmindVerified email at google.com
Léonard HussenotGoogle DeepMindVerified email at google.com
Olivier BachemResearch Scientist, Google BrainVerified email at google.com
Nino VieillardGoogle DeepMindVerified email at google.com
Mathieu LaurièreAssistant professor of Mathematics and Data Science, NYU ShanghaiVerified email at nyu.edu
Senthilkumar ChandramohanDirector of ML EngineeringVerified email at staples.com
julien perolatDeepMindVerified email at google.com
Prof. Cédric PradalierGeorgiaTech Lorraine, UMI2958 GT-CNRS, MetzVerified email at georgiatech-metz.fr
Romuald ElieDeepmind & Université Gustave EiffelVerified email at u-pem.fr
Robert DadashiGoogle DeepMindVerified email at google.com
Anton RaichukGoogle AIVerified email at google.com
Erinc MerdivanHelmholtz AI (HMGU)Verified email at helmholtz-muenchen.de
Johan FerretResearch Scientist, Google DeepMindVerified email at google.com
Edouard KLEINBeaver LabsVerified email at beaver-labs.com
Raphaël MarinierGoogle AIVerified email at google.com
Sten HankeAssoc. Prof at FH JoanneumVerified email at fh-joanneum.at
Piotr StanczykGoogleVerified email at google.com
Johannes KropfAIT Austrian Institute of TechnologyVerified email at kropf.at
Marcin AndrychowiczGoogle BrainVerified email at openai.com

Matthieu Geist

Cohere (ex Google, on leave of Professor, Université de Lorraine)

Verified email at univ-lorraine.fr

reinforcement learning machine learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	1042	2023
What matters for on-policy deep actor-critic methods? a large-scale study M Andrychowicz, A Raichuk, P Stańczyk, M Orsini, S Girgin, R Marinier, ... International conference on learning representations, 2021	399*	2021
A theory of regularized markov decision processes M Geist, B Scherrer, O Pietquin International Conference on Machine Learning, 2160-2169, 2019	313	2019
Human activity recognition using recurrent neural networks D Singh, E Merdivan, I Psychoula, J Kropf, S Hanke, M Geist, A Holzinger Machine Learning and Knowledge Extraction: First IFIP TC 5, WG 8.4, 8.9, 12 …, 2017	210	2017
Approximate modified policy iteration and its application to the game of Tetris. B Scherrer, M Ghavamzadeh, V Gabillon, B Lesner, M Geist J. Mach. Learn. Res. 16 (49), 1629-1676, 2015	153	2015
IQ-Learn: Inverse soft-Q Learning for Imitation D Garg, S Chakraborty, C Cundy, J Song, M Geist, S Ermon arXiv preprint arXiv:2106.12142, 2022	132	2022
Primal wasserstein imitation learning R Dadashi, L Hussenot, M Geist, O Pietquin arXiv preprint arXiv:2006.04678, 2020	130	2020
Inverse reinforcement learning through structured classification E Klein, M Geist, B Piot, O Pietquin Advances in neural information processing systems 25, 2012	125	2012
Kalman temporal differences M Geist, O Pietquin Journal of artificial intelligence research 39, 483-532, 2010	124	2010
Algorithmic survey of parametric value function approximation M Geist, O Pietquin IEEE Transactions on Neural Networks and Learning Systems 24 (6), 845-867, 2013	122*	2013
On the convergence of model free learning in mean field games R Elie, J Perolat, M Laurière, M Geist, O Pietquin Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 7143-7150, 2020	121*	2020
Fictitious play for mean field games: Continuous time analysis and applications S Perrin, J Pérolat, M Laurière, M Geist, R Elie, O Pietquin Advances in neural information processing systems 33, 13199-13213, 2020	120	2020
Sample-efficient batch reinforcement learning for dialogue management optimization O Pietquin, M Geist, S Chandramohan, H Frezza-Buet ACM Transactions on Speech and Language Processing (TSLP) 7 (3), 1-21, 2011	120	2011
User simulation in dialogue systems using inverse reinforcement learning S Chandramohan, M Geist, F Lefevre, O Pietquin Interspeech 2011, 1025-1028, 2011	118	2011
Leverage the average: an analysis of kl regularization in reinforcement learning N Vieillard, T Kozuno, B Scherrer, O Pietquin, R Munos, M Geist Advances in Neural Information Processing Systems 33, 12163-12174, 2020	111*	2020
Off-policy learning with eligibility traces: a survey. M Geist, B Scherrer J. Mach. Learn. Res. 15 (1), 289-333, 2014	110	2014
Bridging the gap between imitation learning and inverse reinforcement learning B Piot, M Geist, O Pietquin IEEE transactions on neural networks and learning systems 28 (8), 1814-1826, 2016	108	2016
Convolutional and recurrent neural networks for activity recognition in smart environment D Singh, E Merdivan, S Hanke, J Kropf, M Geist, A Holzinger Towards Integrative Machine Learning and Knowledge Extraction: BIRS Workshop …, 2017	96	2017
Munchausen reinforcement learning N Vieillard, O Pietquin, M Geist Advances in Neural Information Processing Systems 33, 4235-4246, 2020	94	2020
Boosted bellman residual minimization handling expert demonstrations B Piot, M Geist, O Pietquin Machine Learning and Knowledge Discovery in Databases: European Conference …, 2014	92	2014

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors