Benchmarking model-based reinforcement learning T Wang, X Bao, I Clavera, J Hoang, Y Wen, E Langlois, S Zhang, G Zhang, ... arXiv preprint arXiv:1907.02057, 2019 | 247 | 2019 |
One-shot pruning of recurrent neural networks by jacobian spectrum evaluation MS Zhang, B Stadie arXiv preprint arXiv:1912.00120, 2019 | 20 | 2019 |
Analysis of Langevin Monte Carlo from Poincaré to Log-Sobolev S Chewi, MA Erdogdu, MB Li, R Shen, M Zhang arXiv preprint arXiv:2112.12662, 2021 | 8 | 2021 |
Convergence of Langevin Monte Carlo in chi-squared and Rényi divergence MA Erdogdu, R Hosseinzadeh, S Zhang International Conference on Artificial Intelligence and Statistics, 8151-8175, 2022 | 5 | 2022 |
Towards a theory of non-log-concave sampling: first-order stationarity guarantees for Langevin Monte Carlo K Balasubramanian, S Chewi, MA Erdogdu, A Salim, S Zhang Conference on Learning Theory, 2896-2923, 2022 | 4 | 2022 |
Convergence and Optimality of Policy Gradient Methods in Weakly Smooth Settings MS Zhang, MA Erdogdu, A Garg Proceedings of the AAAI Conference on Artificial Intelligence 36 (8), 9066-9073, 2022 | | 2022 |