Matthieu Geist
Matthieu Geist
Google Brain (on leave of Professor, Université de Lorraine)
Verified email at univ-lorraine.fr
Title
Cited by
Cited by
Year
Kalman temporal differences
M Geist, O Pietquin
Journal of artificial intelligence research 39, 483-532, 2010
832010
Algorithmic survey of parametric value function approximation
M Geist, O Pietquin
IEEE Transactions on Neural Networks and Learning Systems 24 (6), 845-867, 2013
78*2013
Sample-efficient batch reinforcement learning for dialogue management optimization
O Pietquin, M Geist, S Chandramohan, H Frezza-Buet
ACM Transactions on Speech and Language Processing (TSLP) 7 (3), 1-21, 2011
772011
User simulation in dialogue systems using inverse reinforcement learning
S Chandramohan, M Geist, F Lefevre, O Pietquin
762011
Off-policy learning with eligibility traces: A survey
M Geist, B Scherrer
The Journal of Machine Learning Research 15 (1), 289-333, 2014
672014
Inverse reinforcement learning through structured classification
E Klein, M Geist, B Piot, O Pietquin
Advances in neural information processing systems, 1007-1015, 2012
662012
Laugh-aware virtual agent and its impact on user amusement
R Niewiadomski, J Hofmann, J Urbain, T Platt, J Wagner, B Piot, ...
Proceedings of the 2013 international conference on Autonomous agents and …, 2013
562013
A comprehensive reinforcement learning framework for dialogue management optimization
L Daubigney, M Geist, S Chandramohan, O Pietquin
IEEE Journal of Selected Topics in Signal Processing 6 (8), 891-902, 2012
512012
Approximate modified policy iteration and its application to the game of Tetris.
B Scherrer, M Ghavamzadeh, V Gabillon, B Lesner, M Geist
Journal of Machine Learning Research 16 (49), 1629-1676, 2015
442015
Human activity recognition using recurrent neural networks
D Singh, E Merdivan, I Psychoula, J Kropf, S Hanke, M Geist, A Holzinger
International Cross-Domain Conference for Machine Learning and Knowledge …, 2017
412017
Sample efficient on-line learning of optimal dialogue policies with kalman temporal differences
O Pietquin, M Geist, S Chandramohan
Twenty-Second International Joint Conference on Artificial Intelligence, 2011
412011
Managing uncertainty within the ktd framework
M Geist, O Pietquin
Proceedings of the Workshop on Active Learning and Experimental Design (AL&E …, 2010
39*2010
Approximate modified policy iteration
B Scherrer, V Gabillon, M Ghavamzadeh, M Geist
arXiv preprint arXiv:1205.3054, 2012
382012
Parametric value function approximation: A unified view
M Geist, O Pietquin
2011 IEEE Symposium on Adaptive Dynamic Programming And Reinforcement …, 2011
362011
Boosted bellman residual minimization handling expert demonstrations
B Piot, M Geist, O Pietquin
Joint European Conference on Machine Learning and Knowledge Discovery in …, 2014
352014
Kalman Temporal Differences: the deterministic case
M Geist, O Pietquin, G Fricout
2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement …, 2009
352009
A cascaded supervised learning approach to inverse reinforcement learning
E Klein, B Piot, M Geist, O Pietquin
Joint European conference on machine learning and knowledge discovery in …, 2013
322013
Performance evaluation for particle filters
R Chou, Y Boers, M Podt, M Geist
14th International Conference on Information Fusion, 1-7, 2011
312011
Optimizing spoken dialogue management with fitted value iteration
S Chandramohan, M Geist, O Pietquin
Eleventh Annual Conference of the International Speech Communication Association, 2010
302010
Uncertainty management for on-line optimisation of a POMDP-based large-scale spoken dialogue system
L Daubigney, M Gašić, S Chandramohan, M Geist, O Pietquin, S Young
292011
The system can't perform the operation now. Try again later.
Articles 1–20