Devansh Arpit
Devansh Arpit
Verified email at
Cited by
Cited by
A closer look at memorization in deep networks
D Arpit, S Jastrzębski, N Ballas, D Krueger, E Bengio, MS Kanwal, ...
ICML 2017 (arXiv preprint arXiv:1706.05394), 2017
On the spectral bias of deep neural networks
N Rahaman, D Arpit, A Baratin, F Draxler, M Lin, FA Hamprecht, Y Bengio, ...
ICML 2019 (arXiv preprint arXiv:1806.08734), 2018
Three factors influencing minima in SGD
S Jastrzębski, Z Kenton, D Arpit, N Ballas, A Fischer, Y Bengio, A Storkey
ICANN 2018 (arXiv preprint arXiv:1711.04623), 2017
Normalization propagation: A parametric technique for removing internal covariate shift in deep networks
D Arpit, Y Zhou, BU Kota, V Govindaraju
ICML 2016 (arXiv preprint arXiv:1603.01431), 2016
The Break-Even Point on Optimization Trajectories of Deep Neural Networks
S Jastrzebski, M Szymczak, S Fort, D Arpit, J Tabor, K Cho, K Geras
ICLR 2020 (arXiv preprint arXiv:2002.09572), 2020
Residual connections encourage iterative inference
S Jastrzebski, D Arpit, N Ballas, V Verma, T Che, Y Bengio
ICLR 2018 (arXiv preprint arXiv:1710.04773), 2017
A walk with sgd
C Xing, D Arpit, C Tsirigotis, Y Bengio
arXiv preprint arXiv:1802.08770, 2018
Why regularized auto-encoders learn sparse representation?
D Arpit, Y Zhou, H Ngo, V Govindaraju
ICML 2016 (arXiv preprint arXiv:1505.05561), 2015
Deep Nets Don't Learn via Memorization
D Krueger, N Ballas, S Jastrzebski, D Arpit, MS Kanwal, T Maharaj, ...
ICLR 2017 Workshop, 2017
Ensemble of averages: Improving model selection and boosting performance in domain generalization
D Arpit, H Wang, Y Zhou, C Xiong
NeurIPS 2022, 2021
Fraternal Dropout
K Zolna, D Arpit, D Suhubdy, Y Bengio
ICLR 2018 (arXiv preprint arXiv:1711.00066), 2017
Catastrophic Fisher Explosion: Early Phase Fisher Matrix Impacts Generalization
S Jastrzebski, D Arpit, O Astrand, G Kerg, H Wang, C Xiong, R Socher, ...
ICML 2021, 2020
How to Initialize your Network? Robust Initialization for WeightNorm & ResNets
D Arpit, V Campos, Y Bengio
NeurIPs 2019, 2019
h-detach: Modifying the LSTM Gradient Towards Better Optimization
D Arpit, B Kanuparthi, G Kerg, NR Ke, I Mitliagkas, Y Bengio
ICLR 2019 (arXiv preprint arXiv:1810.03023), 2018
Is joint training better for deep auto-encoders?
Y Zhou, D Arpit, I Nwogu, V Govindaraju
arXiv preprint arXiv:1405.1380, 2014
Variational bi-lstms
S Shabanian, D Arpit, A Trischler, Y Bengio
arXiv preprint arXiv:1711.05717, 2017
Finding Flatter Minima with SGD
S Jastrzębski, Z Kenton, D Arpit, N Ballas, A Fischer, Y Bengio, A Storkey
ICLR 2018 Workshop, 2018
The benefits of over-parameterization at initialization in deep ReLU networks
D Arpit, Y Bengio
arXiv preprint arXiv:1901.03611, 2019
Merlion: A machine learning library for time series
A Bhatnagar, P Kassianik, C Liu, T Lan, W Yang, R Cassius, D Sahoo, ...
arXiv preprint arXiv:2109.09265, 2021
Person re-identification for improved multi-person multi-camera tracking by continuous entity association
N Narayan, N Sankaran, D Arpit, K Dantu, S Setlur, V Govindaraju
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2017
The system can't perform the operation now. Try again later.
Articles 1–20