GHEORGHE COMANICI
GHEORGHE COMANICI
Research Scientist, DeepMind
Verified email at google.com
Title
Cited by
Cited by
Year
Optimal policy switching algorithms for reinforcement learning
G Comanici, D Precup
Proceedings of the 9th International Conference on Autonomous Agents and†…, 2010
372010
On-the-fly algorithms for bisimulation metrics
G Comanici, P Panangaden, D Precup
2012 Ninth International Conference on Quantitative Evaluation of Systems†…, 2012
182012
Basis function discovery using spectral clustering and bisimulation metrics
G Comanici, D Precup
International Workshop on Adaptive and Learning Agents, 85-99, 2011
182011
The option keyboard: Combining skills in reinforcement learning
A Barreto, D Borsa, S Hou, G Comanici, E AygŁn, P Hamel, DK Toyama, ...
122019
Representation discovery for mdps using bisimulation metrics
S Ruan, G Comanici, P Panangaden, D Precup
Proceedings of the AAAI Conference on Artificial Intelligence 29 (1), 2015
102015
What can I do here? A Theory of Affordances in Reinforcement Learning
K Khetarpal, Z Ahmed, G Comanici, D Abel, D Precup
International Conference on Machine Learning, 5243-5253, 2020
92020
An empirical analysis of off-policy learning in discrete mdps
C Păduraru, D Precup, J Pineau, G Comănici
European Workshop on Reinforcement Learning, 89-102, 2013
82013
Basis refinement strategies for linear value function approximation in MDPs
G Comanici, D Precup, P Panangaden
Advances in Neural Information Processing Systems 28, 2899-2907, 2015
42015
A study of off-policy learning in computational sustainability
C Paduraru, D Precup, J Pineau, G Comanici
European Workshop on Reinforcement Learning (EWRL) 24, 89-102, 2012
42012
Knowledge representation for reinforcement learning using general value functions
G Comanici, D Precup, A Barreto, DK Toyama, E AygŁn, P Hamel, ...
32018
Representation discovery for Markov decision processes using behavioural similarity
G Comanici
2016
Optimal Time Scales for Reinforcement Learning Behaviour Strategies
G Comanici, D Precup
McGill University, 2010
2010
The system can't perform the operation now. Try again later.
Articles 1–12