Gennady Pekhimenko

Cited by

	All	Since 2019
Citations	6004	4410
h-index	38	34
i10-index	62	59

Citations per year
Year	Citations
2013	24
2014	58
2015	149
2016	338
2017	344
2018	599
2019	538
2020	661
2021	821
2022	900
2023	915
2024	571

Public access

Available	41 articles
Not available	1 article

Based on funding mandates

Co-authors

Onur Mutlu	ETH Zürich and Carnegie Mellon University	Verified email at inf.ethz.ch
Donghyuk Lee	NVIDIA	Verified email at nvidia.com
Todd C. Mowry	Professor of Computer Science, Carnegie Mellon University	Verified email at cs.cmu.edu
Samira Khan	University of Virginia	Verified email at virginia.edu
Vivek Seshadri	Student, CS, CMU	Verified email at cs.cmu.edu
Saugata Ghose	University of Illinois Urbana-Champaign	Verified email at illinois.edu
Phillip Gibbons	Carnegie Mellon University	Verified email at cs.cmu.edu
Michael A. Kozuch	Intel	Verified email at intel.com
Bojian Zheng	University of Toronto	Verified email at cs.toronto.edu
Yoongu Kim	Graduate Student, Carnegie Mellon University	Verified email at cmu.edu
Hasan Hassan	ETH Zurich	Verified email at inf.ethz.ch
Rachata Ausavarungnirun	MangoBoost	Verified email at mangoboost.io
Anand Jayarajan	University of Toronto	Verified email at cs.toronto.edu
Kevin Hsieh	Principal Researcher at Microsoft	Verified email at microsoft.com
Oğuz Ergin	Professor, TOBB ETÜ, Ankara, Türkiye	Verified email at etu.edu.tr
Amar Phanishayee	Microsoft Research	Verified email at cs.cmu.edu
Hadi Esmaeilzadeh	Associate Professor; Computer Science and Engineering; University of California, San Diego	Verified email at eng.ucsd.edu
Yixin Luo	Carnegie Mellon University	Verified email at cs.cmu.edu
Chris Fallin	Graduate Student, Carnegie Mellon University	Verified email at cmu.edu
Adwait Jog	University of Virginia (UVA)	Verified email at virginia.edu

Gennady Pekhimenko

University of Toronto

Verified email at cs.toronto.edu

Computer Architecture, Systems, Systems for ML, Machine Learning


Title	Cited by	Year
RowClone: fast and energy-efficient in-DRAM bulk data copy and initialization; V Seshadri, Y Kim, C Fallin, D Lee, R Ausavarungnirun, G Pekhimenko, ...; Proceedings of the 46th Annual IEEE/ACM International Symposium on …, 2013	511	2013
Base-delta-immediate compression: practical data compression for on-chip caches; G Pekhimenko, V Seshadri, O Mutlu, PB Gibbons, MA Kozuch, TC Mowry; Proceedings of the 21st international conference on Parallel architectures …, 2012	490	2012
MLPerf inference benchmark; VJ Reddi, C Cheng, D Kanter, P Mattson, G Schmuelling, CJ Wu, ...; 2020 ACM/IEEE 47th Annual International Symposium on Computer Architecture …, 2020	482	2020
MLPerf Training Benchmark; P Mattson, C Cheng, C Coleman, G Diamos, P Micikevicius, D Patterson, ...; Proceedings of Machine Learning and Systems 2020, 336-349, 2020	317	2020
Adaptive-Latency DRAM: Optimizing DRAM Timing for the Common-Case; D Lee, Y Kim, G Pekhimenko, S Khan, V Seshadri, K Chang, O Mutlu; High Performance Computer Architecture (HPCA), 2015 IEEE 21st International …, 2015	260	2015
Understanding latency variation in modern DRAM chips: Experimental characterization, analysis, and optimization; KK Chang, A Kashyap, H Hassan, S Ghose, K Hsieh, D Lee, T Li, ...; Proceedings of the 2016 ACM SIGMETRICS International Conference on …, 2016	229	2016
Benchmarking and analyzing deep neural network training; H Zhu, M Akrout, B Zheng, A Pelegris, A Jayarajan, A Phanishayee, ...; 2018 IEEE International Symposium on Workload Characterization (IISWC), 88-100, 2018	222	2018
Gist: Efficient data encoding for deep neural network training; A Jain, A Phanishayee, J Mars, L Tang, G Pekhimenko; 2018 ACM/IEEE 45th Annual International Symposium on Computer Architecture …, 2018	194	2018
Linearly compressed pages: a low-complexity, low-latency main memory compression framework; G Pekhimenko, V Seshadri, Y Kim, H Xin, O Mutlu, PB Gibbons, ...; Proceedings of the 46th Annual IEEE/ACM International Symposium on …, 2013	194	2013
ChargeCache: Reducing DRAM latency by exploiting row access locality; H Hassan, G Pekhimenko, N Vijaykumar, V Seshadri, D Lee, O Ergin, ...; 2016 IEEE International Symposium on High Performance Computer Architecture …, 2016	179	2016
Simultaneous multi-layer access: Improving 3D-stacked memory bandwidth at low cost; D Lee, S Ghose, G Pekhimenko, S Khan, O Mutlu; ACM Transactions on Architecture and Code Optimization (TACO) 12 (4), 1-29, 2016	179	2016
Priority-based parameter propagation for distributed DNN training; A Jayarajan, J Wei, G Gibson, A Fedorova, G Pekhimenko; Proceedings of Machine Learning and Systems 2019, 2019	177	2019
Design-induced latency variation in modern DRAM chips: Characterization, analysis, and latency reduction mechanisms; D Lee, S Khan, L Subramanian, S Ghose, R Ausavarungnirun, ...; Proceedings of the ACM on Measurement and Analysis of Computing Systems 1 (1 …, 2017	149	2017
SoftMC: A flexible and practical open-source infrastructure for enabling experimental DRAM studies; H Hassan, N Vijaykumar, S Khan, S Ghose, K Chang, G Pekhimenko, ...; 2017 IEEE International Symposium on High Performance Computer Architecture …, 2017	144	2017
A case for core-assisted bottleneck acceleration in GPUs: enabling flexible data compression with assist warps; N Vijaykumar, G Pekhimenko, A Jog, A Bhowmick, R Ausavarungnirun, ...; ACM SIGARCH Computer Architecture News 43 (3S), 41-53, 2015	140	2015
RFVP: Rollback-free value prediction with safe-to-approximate loads; A Yazdanbakhsh, G Pekhimenko, B Thwaites, H Esmaeilzadeh, O Mutlu, ...; ACM Transactions on Architecture and Code Optimization (TACO) 12 (4), 1-26, 2016	97	2016
Shifted Hamming distance: a fast and accurate SIMD-friendly filter to accelerate alignment verification in read mapping; H Xin, J Greth, J Emmons, G Pekhimenko, C Kingsford, C Alkan, O Mutlu; Bioinformatics 31 (10), 1553-1560, 2015	96*	2015
StreamBox: Modern Stream Processing on a Multicore Machine; H Miao, H Park, M Jeon, G Pekhimenko, KS McKinley, FX Lin; 2017 USENIX Annual Technical Conference (USENIX ATC 17), 617-629, 2017	95	2017
A case for toggle-aware compression for GPU systems; G Pekhimenko, E Bolotin, N Vijaykumar, O Mutlu, TC Mowry, SW Keckler; 2016 IEEE International Symposium on High Performance Computer Architecture …, 2016	91	2016
Software automatic tuning: from concepts to state-of-the-art results; RS Ken Naono, Keita Teranishi, John Cavazos; Springer Science & Business Media, 2010	90	2010
