Rosie Zhao
Title
Cited by
Year
Loss of plasticity in continual deep reinforcement learning
Z Abbas, R Zhao, J Modayil, A White, MC Machado
Conference on Lifelong Learning Agents, 620-636, 2023
44 2023
Using deep learning and social network analysis to understand and manage extreme flooding
A Romascanu, H Ker, R Sieber, S Greenidge, S Lumley, D Bush, ...
Journal of Contingencies and Crisis Management 28 (3), 251-261, 2020
34 2020
Continuous MDP homomorphisms and homomorphic policy gradient
S Rezaei-Shoshtari, R Zhao, P Panangaden, D Meger, D Precup
Advances in Neural Information Processing Systems 35, 20189-20204, 2022
15 2022
Lower bound methods for sign-rank and their limitations
H Hatami, P Hatami, W Pires, R Tao, R Zhao
Approximation, Randomization, and Combinatorial Optimization. Algorithms and …, 2022
10 2022
Feature emergence via margin maximization: case studies in algebraic tasks
D Morwani, BL Edelman, CA Oncescu, R Zhao, S Kakade
arXiv preprint arXiv:2311.07568, 2023
6 2023
Boolean functions with small approximate spectral norm
TM Cheung, H Hatami, R Zhao, I Zilberstein
Electronic Colloquium on Computational Complexity, 2022
4 2022
On the peel number and the leaf-height of Galton–Watson trees
L Devroye, MK Goh, RY Zhao
Combinatorics, Probability and Computing 32 (1), 68-90, 2023
3* 2023
Arithmetic subsequences in a random ordering of an additive set
MK Goh, RY Zhao
arXiv preprint arXiv:2012.12339, 2020
3 2020
Beyond implicit bias: The insignificance of SGD noise in online learning
N Vyas, D Morwani, R Zhao, G Kaplun, S Kakade, B Barak
arXiv preprint arXiv:2306.08590, 2023
2 2023
Bridging the gap between supervised classification and unsupervised topic modelling for social-media assisted crisis management
M Brunila, R Zhao, A Mircea, S Lumley, R Sieber
arXiv preprint arXiv:2103.11835, 2021
2 2021
Policy Gradient Methods in the Presence of Symmetries and State Abstractions
P Panangaden, S Rezaei-Shoshtari, R Zhao, D Meger, D Precup
Journal of Machine Learning Research 25 (71), 1-57, 2024
1 2024
Leaf multiplicity in a Bienaymé-Galton-Watson tree
AM Brandenberger, L Devroye, MK Goh, RY Zhao
Discrete Mathematics & Theoretical Computer Science 24 (Analysis of Algorithms), 2022
1 2022
Continuous Homomorphisms and Leveraging Symmetries in Policy Gradient Algorithms for Markov Decision Processes
RY Zhao
McGill University (Canada), 2022
1 2022
Deconstructing What Makes a Good Optimizer for Language Models
R Zhao, D Morwani, D Brandfonbrener, N Vyas, S Kakade
arXiv preprint arXiv:2407.07972, 2024
2024
A Study of Policy Gradient on a Class of Exactly Solvable Models
G McCracken, C Daniels, R Zhao, A Brandenberger, P Panangaden, ...
arXiv preprint arXiv:2011.01859, 2020
2020