Gaia: Geo-Distributed Machine Learning Approaching LAN Speeds K Hsieh, A Harlap, N Vijaykumar, D Konomis, GR Ganger, PB Gibbons, ... 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI …, 2017 | 218* | 2017 |
Transparent offloading and mapping (TOM): Enabling programmer-transparent near-data processing in GPU systems K Hsieh, E Ebrahimi, G Kim, N Chatterjee, M O'Connor, N Vijaykumar, ... Proceedings of the 43rd International Symposium on Computer Architecture …, 2016 | 169 | 2016 |
Understanding latency variation in modern DRAM chips: Experimental characterization, analysis, and optimization KK Chang, A Kashyap, H Hassan, S Ghose, K Hsieh, D Lee, T Li, ... Proceedings of the 2016 ACM SIGMETRICS International Conference on …, 2016 | 145 | 2016 |
Fast bulk bitwise AND and OR in DRAM V Seshadri, K Hsieh, A Boroumand, D Lee, MA Kozuch, O Mutlu, PB Gibbons, ... IEEE Computer Architecture Letters 14 (2), 127-131, 2015 | 145 | 2015 |
Accelerating pointer chasing in 3D-stacked memory: Challenges, mechanisms, evaluation K Hsieh, S Khan, N Vijaykumar, KK Chang, A Boroumand, S Ghose, ... 2016 IEEE 34th International Conference on Computer Design (ICCD), 25-32, 2016 | 124 | 2016 |
LazyPIM: An Efficient Cache Coherence Mechanism for Processing-in-Memory A Boroumand, S Ghose, B Lucia, K Hsieh, K Malladi, H Zheng, O Mutlu IEEE Computer Architecture Letters, 2016 | 113 | 2016 |
Focus: Querying large video datasets with low latency and low cost K Hsieh, G Ananthanarayanan, P Bodik, S Venkataraman, P Bahl, ... 13th USENIX Symposium on Operating Systems Design and Implementation …, 2018 | 107 | 2018 |
Zorua: A holistic approach to resource virtualization in GPUs N Vijaykumar, K Hsieh, G Pekhimenko, S Khan, A Shrestha, S Ghose, ... 2016 49th Annual IEEE/ACM International Symposium on Microarchitecture …, 2016 | 57 | 2016 |
The locality descriptor: A holistic cross-layer abstraction to express data locality in GPUs N Vijaykumar, E Ebrahimi, K Hsieh, PB Gibbons, O Mutlu 2018 ACM/IEEE 45th Annual International Symposium on Computer Architecture …, 2018 | 38 | 2018 |
The non-iid data quagmire of decentralized machine learning K Hsieh, A Phanishayee, O Mutlu, P Gibbons International Conference on Machine Learning, 4387-4398, 2020 | 32 | 2020 |
Enabling the Adoption of Processing-in-Memory: Challenges, Mechanisms, Future Research Directions S Ghose, K Hsieh, A Boroumand, R Ausavarungnirun, O Mutlu arXiv preprint arXiv:1802.00320, 2018 | 32 | 2018 |
A case for richer cross-layer abstractions: Bridging the semantic gap with expressive memory N Vijaykumar, A Jain, D Majumdar, K Hsieh, G Pekhimenko, E Ebrahimi, ... 2018 ACM/IEEE 45th Annual International Symposium on Computer Architecture …, 2018 | 28 | 2018 |
Toward standardized near-data processing with unrestricted data placement for GPUs G Kim, N Chatterjee, M O'Connor, K Hsieh Proceedings of the International Conference for High Performance Computing …, 2017 | 22 | 2017 |
CoNDA: efficient cache coherence support for near-data accelerators A Boroumand, S Ghose, M Patel, H Hassan, B Lucia, R Ausavarungnirun, ... Proceedings of the 46th International Symposium on Computer Architecture …, 2019 | 19 | 2019 |
The Processing-in-Memory Paradigm: Mechanisms to Enable Adoption S Ghose, K Hsieh, A Boroumand, R Ausavarungnirun, O Mutlu Beyond-CMOS Technologies for Next Generation Computer Design, 133-194, 2019 | 17 | 2019 |
LazyPIM: Efficient Support for Cache Coherence in Processing-in-Memory Architectures A Boroumand, S Ghose, M Patel, H Hassan, B Lucia, N Hajinazar, ... arXiv preprint arXiv:1706.03162, 2017 | 11 | 2017 |
Machine Learning Systems for Highly-Distributed and Rapidly-Growing Data K Hsieh arXiv preprint arXiv:1910.08663, 2019 | 2 | 2019 |
Flexible-Latency DRAM: Understanding and Exploiting Latency Variation in Modern DRAM Chips KK Chang, A Kashyap, H Hassan, S Ghose, K Hsieh, D Lee, T Li, ... arXiv preprint arXiv:1805.03154, 2018 | 1 | 2018 |
Decoupling GPU Programming Models from Resource Management for Enhanced Programming Ease, Portability, and Performance N Vijaykumar, K Hsieh, G Pekhimenko, S Khan, A Shrestha, S Ghose, ... arXiv preprint arXiv:1805.02498, 2018 | 1 | 2018 |
Decoupling the Programming Model from Resource Management in Throughput Processors N Vijaykumar, K Hsieh, G Pekhimenko, S Khan, A Shrestha, S Ghose, ... Many-Core Computing: Hardware and Software, IET, 2018 | 1 | 2018 |