Adam White

引用次数

	总计	2019 年至今
引用	1996	1292
h 指数	22	19
i10 指数	35	32

320

160

240

2007200820092010201120122013201420152016201720182019202020212022202320247 5 8 16 39 58 69 77 70 77 92 157 164 199 229 286 310 101

开放获取的出版物数量

查看全部

10 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Martha WhiteUniversity of Alberta在 ualberta.ca 的电子邮件经过验证
Joseph ModayilOpenmind Research Institute & Keen AGI在 openmindresearch.org 的电子邮件经过验证
Patrick M. PilarskiUniversity of Alberta, Amii (Alberta Machine Intelligence Institute)在 ualberta.ca 的电子邮件经过验证
Thomas DegrisDeepMind在 google.com 的电子邮件经过验证
Marlos C. MachadoUniversity of Alberta在 ualberta.ca 的电子邮件经过验证
Doina PrecupDeepMind and McGill University在 cs.mcgill.ca 的电子邮件经过验证
Nathan SturtevantUniversity of Alberta, Alberta Machine Intelligence Institute (Amii)在 ualberta.ca 的电子邮件经过验证
Shimon WhitesonProfessor of Computer Science, University of Oxford / Senior Staff Research Scientist, Waymo在 cs.ox.ac.uk 的电子邮件经过验证
Craig SherstanResearch Scientist, Sony AI在 sony.com 的电子邮件经过验证

关注

Adam White

University of Alberta, Amii (Alberta Machine Intelligence Institute)

在 ualberta.ca 的电子邮件经过验证 - 首页

Artificial Intelligence Reinforcement Learning


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction RS Sutton, J Modayil, M Delp, T Degris, PM Pilarski, A White, D Precup The 10th International Conference on Autonomous Agents and Multiagent …, 2011	578	2011
RL-Glue: Language-independent software for reinforcement-learning experiments B Tanner, A White The Journal of Machine Learning Research 10, 2133-2136, 2009	169	2009
Multi-timescale nexting in a reinforcement learning robot J Modayil, A White, RS Sutton Adaptive Behavior 22 (2), 146-160, 2014	141	2014
Feature construction for reinforcement learning in hearts NR Sturtevant, AM White Computers and Games: 5th International Conference, CG 2006, Turin, Italy …, 2007	80	2007
Developing a predictive approach to knowledge A White University of Alberta, 2015	78	2015
Report on the 2008 reinforcement learning competition S Whiteson, B Tanner, A White AI Magazine 31 (2), 81-81, 2010	57	2010
Organizing experience: a deeper look at replay mechanisms for sample-based planning in continuous state domains Y Pan, M Zaheer, A White, A Patterson, M White arXiv preprint arXiv:1806.04624, 2018	53	2018
Adapting behavior via intrinsic reward: A survey and empirical study C Linke, NM Ady, M White, T Degris, A White Journal of artificial intelligence research 69, 1287-1332, 2020	45	2020
Gradient temporal-difference learning with regularized corrections S Ghiassian, A Patterson, S Garg, D Gupta, A White, M White International Conference on Machine Learning, 3524-3534, 2020	43	2020
A greedy approach to adapting the trace parameter for temporal difference learning M White, A White arXiv preprint arXiv:1607.00446, 2016	43	2016
General value function networks M Schlegel, A Jacobsen, Z Abbas, A Patterson, A White, M White Journal of Artificial Intelligence Research 70, 497-543, 2021	40	2021
Investigating practical linear temporal difference learning A White, M White arXiv preprint arXiv:1602.08771, 2016	40	2016
Interval Estimation for Reinforcement-Learning Algorithms in Continuous-State Domains M White, A White Advances in Neural Information Processing Systems, 2010	38	2010
Surprise and curiosity for big data robotics A White, J Modayil, RS Sutton Workshops at the Twenty-Eighth AAAI Conference on Artificial Intelligence, 2014	36	2014
Improving performance in reinforcement learning by breaking generalization in neural networks S Ghiassian, B Rafiee, YL Lo, A White arXiv preprint arXiv:2003.07417, 2020	33	2020
Accelerated gradient temporal difference learning Y Pan, A White, M White Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017	31	2017
Scaling life-long off-policy learning RSS Adam White, Joseph Modayil 2012 IEEE International Conference on Development and Learning and …, 2013	31*	2013
Online off-policy prediction S Ghiassian, A Patterson, M White, RS Sutton, A White arXiv preprint arXiv:1811.02597, 2018	30	2018
Reinforcement learning benchmarks and bake-offs II A Dutech, T Edmunds, J Kok, M Lagoudakis, M Littman, M Riedmiller, ... Advances in Neural Information Processing Systems (NIPS) 17, 6, 2005	30	2005
Loss of plasticity in continual deep reinforcement learning Z Abbas, R Zhao, J Modayil, A White, MC Machado Conference on Lifelong Learning Agents, 620-636, 2023	28	2023

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

关注

引用次数

合著作者