Title | Authors | Venue | Citations | Year |
The implicit bias of gradient descent on separable data | D Soudry, E Hoffer, MS Nacson, S Gunasekar, N Srebro | The Journal of Machine Learning Research 19 (1), 2822-2878, 2018 | 323 | 2018 |
Convergence of gradient descent on separable data | MS Nacson, J Lee, S Gunasekar, PHP Savarese, N Srebro, D Soudry | The 22nd International Conference on Artificial Intelligence and Statistics …, 2019 | 55 | 2019 |
Stochastic gradient descent on separable data: Exact convergence with a fixed learning rate | MS Nacson, N Srebro, D Soudry | The 22nd International Conference on Artificial Intelligence and Statistics …, 2019 | 32 | 2019 |
Lexicographic and depth-sensitive margins in homogeneous and non-homogeneous deep models | MS Nacson, S Gunasekar, J Lee, N Srebro, D Soudry | International Conference on Machine Learning, 4683-4692, 2019 | 17 | 2019 |
At Stability's Edge: How to Adjust Hyperparameters to Preserve Minima Selection in Asynchronous Training of Neural Networks? | N Giladi, MS Nacson, E Hoffer, D Soudry | arXiv preprint arXiv:1909.12340, 2019 | 3 | 2019 |
On the Implicit Bias of Initialization Shape: Beyond Infinitesimal Mirror Descent | S Azulay, E Moroshko, MS Nacson, B Woodworth, N Srebro, A Globerson, ... | arXiv preprint arXiv:2102.09769, 2021 | | 2021 |
How Learning Rate and Delay Affect Minima Selection in Asynchronous Training of Neural Networks | N Giladi, MS Nacson, E Hoffer, D Soudry | | | |