Minsoo Rhu
Title
Cited by
Year
SCNN: An accelerator for compressed-sparse convolutional neural networks
A Parashar, M Rhu, A Mukkara, A Puglielli, R Venkatesan, B Khailany, ...
ACM SIGARCH Computer Architecture News 45 (2), 27-40, 2017
694 2017
vDNN: Virtualized deep neural networks for scalable, memory-efficient neural network design
M Rhu, N Gimelshein, J Clemons, A Zulfiqar, SW Keckler
2016 49th Annual IEEE/ACM International Symposium on Microarchitecture …, 2016
247 2016
A locality-aware memory hierarchy for energy-efficient GPU architectures
M Rhu, M Sullivan, J Leng, M Erez
2013 46th Annual IEEE/ACM International Symposium on Microarchitecture …, 2013
138 2013
Compressing DMA engine: Leveraging activation sparsity for training deep neural networks
M Rhu, M O'Connor, N Chatterjee, J Pool, Y Kwon, SW Keckler
2018 IEEE International Symposium on High Performance Computer Architecture …, 2018
108 2018
Priority-based cache allocation in throughput processors
D Li, M Rhu, DR Johnson, M O'Connor, M Erez, D Burger, DS Fussell, ...
2015 IEEE 21st International Symposium on High Performance Computer …, 2015
93 2015
CAPRI: Prediction of compaction-adequacy for handling control-divergence in GPGPU architectures
M Rhu, M Erez
ACM SIGARCH Computer Architecture News 40 (3), 61-71, 2012
75 2012
Architecting an energy-efficient DRAM system for GPUs
N Chatterjee, M O'Connor, D Lee, DR Johnson, SW Keckler, M Rhu, ...
2017 IEEE International Symposium on High Performance Computer Architecture …, 2017
69 2017
The dual-path execution model for efficient GPU control flow
M Rhu, M Erez
2013 IEEE 19th International Symposium on High Performance Computer …, 2013
65 2013
Maximizing SIMD resource utilization in GPGPUs with SIMD lane permutation
M Rhu, M Erez
Proceedings of the 40th Annual International Symposium on Computer …, 2013
61 2013
TensorDIMM: A practical near-memory processing architecture for embeddings and tensor operations in deep learning
Y Kwon, Y Lee, M Rhu
Proceedings of the 52nd Annual IEEE/ACM International Symposium on …, 2019
54 2019
GPUVolt: Modeling and characterizing voltage noise in GPU architectures
J Leng, Y Zu, M Rhu, M Gupta, VJ Reddi
Proceedings of the 2014 international symposium on Low power electronics and …, 2014
38 2014
Beyond the memory wall: A case for memory-centric HPC system for deep learning
Y Kwon, M Rhu
2018 51st Annual IEEE/ACM International Symposium on Microarchitecture …, 2018
34 2018
Virtualizing deep neural networks for memory-efficient neural network design
M Rhu, N Gimelshein, J Clemons, A Zulfiqar, SW Keckler
arXiv preprint arXiv:1602.08124 43, 2016
26 2016
PREMA: A predictive multi-task scheduling algorithm for preemptible neural processing units
Y Choi, M Rhu
2020 IEEE International Symposium on High Performance Computer Architecture …, 2020
25 2020
Optimization of arithmetic coding for JPEG2000
M Rhu, IC Park
IEEE Transactions on Circuits and systems for Video Technology 20 (3), 446-451, 2009
25 2009
CLEAN-ECC: High reliability ECC for adaptive granularity memory system
SL Gong, M Rhu, J Kim, J Chung, M Erez
Proceedings of the 48th International Symposium on Microarchitecture, 611-622, 2015
22 2015
Centaur: A chiplet-based, hybrid sparse-dense accelerator for personalized recommendations
R Hwang, T Kim, Y Kwon, M Rhu
2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture …, 2020
19 2020
System, method, and computer program product for prioritized access for multithreaded processing
DR Johnson, M Rhu, JM O'Connor, SW Keckler
US Patent App. 14/147,395, 2015
11 2015
A disaggregated memory system for deep learning
Y Kwon, M Rhu
IEEE Micro 39 (5), 82-90, 2019
10 2019
A case for memory-centric HPC system architecture for training deep neural networks
Y Kwon, M Rhu
IEEE Computer Architecture Letters 17 (2), 134-138, 2018
10 2018