Follow
Zaheer Abbas
Zaheer Abbas
Research Engineer, Google DeepMind
Verified email at google.com
Title
Cited by
Cited by
Year
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...
arXiv preprint arXiv:2312.11805, 2023
22492023
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ...
arXiv preprint arXiv:2403.05530, 2024
7012024
Loss of plasticity in continual deep reinforcement learning
Z Abbas, R Zhao, J Modayil, A White, MC Machado
Conference on Lifelong Learning Agents, 620-636, 2023
712023
Organizing experience: a deeper look at replay mechanisms for sample-based planning in continuous state domains
Y Pan, Z Abbas, A White, A Patterson, M White
IJCAI'18, 2018
572018
General value function networks
M Schlegel, A Jacobsen, Z Abbas, A Patterson, A White, M White
arXiv preprint arXiv:1807.06763, 2018
442018
Many-Shot In-Context Learning
R Agarwal, A Singh, LM Zhang, B Bohnet, S Chan, A Anand, Z Abbas, ...
arXiv preprint arXiv:2404.11018, 2024
422024
Selective Dyna-style Planning Under Limited Model Capacity
Z Abbas, S Sokota, EJ Talvitie, M White
ICML'20, 2020
392020
Planning with expectation models
Y Wan, Z Abbas, A White, M White, RS Sutton
IJCAI'19, 2019
292019
Investigating the properties of neural network representations in reinforcement learning
H Wang, E Miahi, M White, MC Machado, Z Abbas, R Kumaraswamy, ...
Artificial Intelligence 330, 104100, 2024
272024
From Eye-blinks to State Construction: Diagnostic Benchmarks for Online Representation Learning
B Rafiee, Z Abbas, S Ghiassian, R Kumaraswamy, R Sutton, E Ludvig, ...
arXiv preprint arXiv:2011.04590, 2020
112020
Model-based reinforcement learning with non-linear expectation models and stochastic environments
Y Wan, Z Abbas, M White, RS Sutton
FAIM Workshop on Prediction and Generative Modeling in Reinforcement …, 2018
62018
Towards model-free RL algorithms that scale well with unstructured data
J Modayil, Z Abbas
arXiv preprint arXiv:2311.02215, 2023
22023
Selective Dyna-style Planning Using Neural Network Models with Limited Capacity
Z Abbas
2*2020
Incrementally Learning Functions of the Return
B Bennett, W Chung, Z Abbas, V Liu
arXiv preprint arXiv:1907.04651, 2019
12019
Controlling agents using auxiliary prediction neural networks that generate state value estimates
M Zaheer, JV Modayil
US Patent App. 18/230,056, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–15