Follow
Sudarshan Srinivasan
Sudarshan Srinivasan
Verified email at intel.com
Title
Cited by
Cited by
Year
Sigma: A sparse and irregular gemm accelerator with flexible interconnects for dnn training
E Qin, A Samajdar, H Kwon, V Nadella, S Srinivasan, D Das, B Kaul, ...
2020 IEEE International Symposium on High Performance Computer Architecture …, 2020
3802020
A study of BFLOAT16 for deep learning training
D Kalamkar, D Mudigere, N Mellempudi, D Das, K Banerjee, S Avancha, ...
arXiv preprint arXiv:1905.12322, 2019
2982019
Mixed precision training with 8-bit floating point
N Mellempudi, S Srinivasan, D Das, B Kaul
arXiv preprint arXiv:1905.12334, 2019
692019
Optimizing deep learning recommender systems training on cpu cluster architectures
D Kalamkar, E Georganas, S Srinivasan, J Chen, M Shiryaev, A Heinecke
SC20: International Conference for High Performance Computing, Networking …, 2020
422020
Astra-sim: Enabling sw/hw co-design exploration for distributed dl training platforms
S Rashidi, S Sridharan, S Srinivasan, T Krishna
2020 IEEE International Symposium on Performance Analysis of Systems and …, 2020
362020
Enabling compute-communication overlap in distributed deep learning training platforms
S Rashidi, M Denton, S Sridharan, S Srinivasan, A Suresh, J Nie, ...
2021 ACM/IEEE 48th Annual International Symposium on Computer Architecture …, 2021
302021
Reliability evaluation of compressed deep learning models
BF Goldstein, S Srinivasan, D Das, K Banerjee, L Santiago, VC Ferreira, ...
2020 IEEE 11th Latin American Symposium on Circuits & Systems (LASCAS), 1-5, 2020
232020
Exploring heterogeneity within a core for improved power efficiency
S Srinivasan, N Kurella, I Koren, S Kundu
IEEE Transactions on Parallel and Distributed Systems 27 (4), 1057-1069, 2015
202015
Cross-level protection of circuits against faults and malicious attacks
V Tomashevich, S Srinivasan, F Foerg, I Polian
2012 IEEE 18th International On-Line Testing Symposium (IOLTS), 150-155, 2012
192012
Themis: A network bandwidth-aware collective scheduling policy for distributed training of dl models
S Rashidi, W Won, S Srinivasan, S Sridharan, T Krishna
Proceedings of the 49th Annual International Symposium on Computer …, 2022
172022
Extending sparse tensor accelerators to support multiple compression formats
E Qin, G Jeong, W Won, SC Kao, H Kwon, S Srinivasan, D Das, GE Moon, ...
2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2021
152021
Program phase duration prediction and its application to fine-grain power management
S Srinivasan, R Kumar, S Kundu
2013 IEEE Computer Society Annual Symposium on VLSI (ISVLSI), 127-132, 2013
142013
A lightweight error-resiliency mechanism for deep neural networks
BF Goldstein, VC Ferreira, S Srinivasan, D Das, AS Nery, S Kundu, ...
2021 22nd International Symposium on Quality Electronic Design (ISQED), 311-316, 2021
122021
A study of BFLOAT16 for deep learning training (2019)
D Kalamkar, D Mudigere, N Mellempudi, D Das, K Banerjee, S Avancha, ...
arXiv preprint arXiv:1905.12322, 1905
121905
Online mechanism for reliability and power-efficiency management of a dynamically reconfigurable core
S Srinivasan, I Koren, S Kundu
2015 33rd IEEE International Conference on Computer Design (ICCD), 327-334, 2015
102015
Training google neural machine translation on an intel cpu cluster
DD Kalamkar, K Banerjee, S Srinivasan, S Sridharan, E Georganas, ...
2019 IEEE International Conference on Cluster Computing (CLUSTER), 1-10, 2019
82019
Astra-sim2. 0: Modeling hierarchical networks and disaggregated systems for large-model training at scale
W Won, T Heo, S Rashidi, S Sridharan, S Srinivasan, T Krishna
2023 IEEE International Symposium on Performance Analysis of Systems and …, 2023
72023
A wavelet-based spatio-temporal heat dissipation model for reordering of program phases to produce temperature extremes in a chip
S Srinivasan, KP Ganeshpure, S Kundu
IEEE Transactions on Computer-Aided Design of Integrated Circuits and …, 2012
72012
Functional test pattern generation for maximizing temperature in 3D IC chip stack
S Srinivasan, S Kundu
Thirteenth International Symposium on Quality Electronic Design (ISQED), 109-116, 2012
72012
A study of BFLOAT16 for deep learning training
K Dhiraj, M Dheevatsa, M Naveen, D Dipankar, B Kunal, A Sasikanth, ...
arXiv preprint arXiv:1905.12322, 2019
62019
The system can't perform the operation now. Try again later.
Articles 1–20