Hamid Maei

Citée par

	Toutes	Depuis 2019
Citations	3377	1708
indice h	16	14
indice i10	17	15

360

180

270

20072008200920102011201220132014201520162017201820192020202120222023202411 21 36 77 148 133 146 188 164 202 174 333 323 344 338 349 271 83

Coauteurs

Richard S. SuttonKeen, Amii, and University of AlbertaAdresse e-mail validée de richsutton.com
Csaba SzepesvariDeepMind & University of AlbertaAdresse e-mail validée de cs.ualberta.ca
Shalabh BhatnagarProfessor in the Department of Computer Science and Automation, Indian Institute of ScienceAdresse e-mail validée de iisc.ac.in
Doina PrecupDeepMind and McGill UniversityAdresse e-mail validée de cs.mcgill.ca
David SilverDeepMind, UCLAdresse e-mail validée de google.com
Zheng WenGoogle DeepMindAdresse e-mail validée de google.com
Susan MurphyHarvard UniversityAdresse e-mail validée de g.harvard.edu

Suivre

Hamid Maei

Netflix

Adresse e-mail validée de netflix.com

Artificial Intelligence Reinforcement Learning Machine Learning Computer Science


Titre Trier par citations Trier par année Trier par titre	Citée par Citée par	Année
Fast gradient-descent methods for temporal-difference learning with linear function approximation RS Sutton, HR Maei, D Precup, S Bhatnagar, D Silver, C Szepesvári, ... Proceedings of the 26th annual international conference on machine learning …, 2009	698	2009
Toward off-policy learning control with function approximation. HR Maei, C Szepesvári, S Bhatnagar, RS Sutton ICML 10, 719-726, 2010	332	2010
Convergent temporal-difference learning with arbitrary smooth function approximation HR Maei, S Szepesvári, Csaba, Bhatnagar, D Precup, D Silver, RS Sutton Advances in Neural Information Processing Systems, 1204-1212, 2009	329	2009
Involvement of the anterior cingulate cortex in the expression of remote spatial memory CM Teixeira, SR Pomedli, HR Maei, N Kee, PW Frankland Journal of Neuroscience 26 (29), 7555-7564, 2006	313	2006
Optimal demand response using device-based reinforcement learning Z Wen, D O’Neill, H Maei IEEE Transactions on Smart Grid 6 (5), 2312-2324, 2015	303	2015
A Convergent Temporal-difference Algorithm for Off-policy Learning with Linear Function Approximation RS Sutton, H Maei, C Szepesvári Advances in neural information processing systems 21, 2008	272	2008
What is the most sensitive measure of water maze probe test performance? HR Maei, K Zaslavsky, CM Teixeira, PW Frankland Frontiers in integrative neuroscience 3, 493, 2009	245	2009
A convergent O (n) algorithm for off-policy temporal-difference learning with linear function approximation RS Sutton, C Szepesvári, HR Maei Advances in neural information processing systems 21 (21), 1609-1616, 2008	224	2008
Gradient temporal-difference learning algorithms HR Maei	194	2011
GQ (lambda): A general gradient algorithm for temporal-difference prediction learning with eligibility traces HR Maei, RS Sutton 3d Conference on Artificial General Intelligence (AGI-2010), 100-105, 2010	161	2010
Deep reinforcement learning for visual object tracking in videos D Zhang, H Maei, X Wang, YF Wang arXiv preprint arXiv:1701.08936, 2017	133	2017
Randomly connected networks have short temporal memory E Wallace, HR Maei, PE Latham Neural computation 25 (6), 1408-1439, 2013	42	2013
Convergent actor-critic algorithms under off-policy training and function approximation HR Maei arXiv preprint arXiv:1802.07842, 2018	36	2018
Development and validation of a sensitive entropy-based measure for the water maze HR Maei, K Zaslavsky, AH Wang, AP Yiu, CM Teixeira, SA Josselyn, ... Frontiers in integrative neuroscience 3, 870, 2009	28	2009
Correlated quantum percolation in the lowest Landau level N Sandler, HR Maei, J Kondev Physical Review B 70 (4), 045309, 2004	26	2004
A batch, off-policy, actor-critic algorithm for optimizing the average reward SA Murphy, Y Deng, EB Laber, HR Maei, RS Sutton, K Witkiewitz arXiv preprint arXiv:1607.05047, 2016	24	2016
Quantum and classical localization in the lowest Landau level N Sandler, HR Maei, J Kondev Physical Review B 68 (20), 205315, 2003	14	2003
How can realistic networks process time-varying signals? H Maei PQDT-Global, 2005	3	2005
A novel analytic measure for the water maze utilizing the concept of entropy HR Lee, BK Kaang Frontiers in Neuroscience 4, 1825, 2010		2010
Convergent Temporal-Difference Learning with Arbitrary Differentiable Function Approximator HR Maei, C Szepesvári, S Bhathnagar, D Silver, D Precup, R Sutton		2010

Le système ne peut pas réaliser cette opération maintenant. Veuillez réessayer plus tard.

Articles 1–20

Nombre de citations par an

Citations en double

Citations fusionnées

Ajouter les coauteursCoauteurs

Suivre

Citée par

Coauteurs