Debunking the 100X GPU vs. CPU myth: an evaluation of throughput computing on CPU and GPU VW Lee, C Kim, J Chhugani, M Deisher, D Kim, AD Nguyen, N Satish, ... Proceedings of the 37th annual international symposium on Computer …, 2010 | 985 | 2010 |

Designing efficient sorting algorithms for manycore GPUs N Satish, M Harris, M Garland 2009 IEEE International Symposium on Parallel & Distributed Processing, 1-10, 2009 | 765 | 2009 |

Scalable Bayesian optimization using deep neural networks J Snoek, O Rippel, K Swersky, R Kiros, N Satish, N Sundaram, M Patwary, ... International conference on machine learning, 2171-2180, 2015 | 378 | 2015 |

Sort vs. hash revisited: Fast join implementation on modern multi-core CPUs C Kim, T Kaldewey, VW Lee, E Sedlar, AD Nguyen, N Satish, J Chhugani, ... Proceedings of the VLDB Endowment 2 (2), 1378-1389, 2009 | 309 | 2009 |

ClearPath: highly parallel collision avoidance for multi-agent simulation SJ Guy, J Chhugani, C Kim, N Satish, M Lin, D Manocha, P Dubey Proceedings of the 2009 ACM SIGGRAPH/Eurographics Symposium on Computer …, 2009 | 309 | 2009 |

FAST: fast architecture sensitive tree search on modern CPUs and GPUs C Kim, J Chhugani, N Satish, E Sedlar, AD Nguyen, T Kaldewey, VW Lee, ... Proceedings of the 2010 ACM SIGMOD International Conference on Management of …, 2010 | 304 | 2010 |

3.5-D blocking optimization for stencil computations on modern CPUs and GPUs A Nguyen, N Satish, J Chhugani, C Kim, P Dubey SC'10: Proceedings of the 2010 ACM/IEEE International Conference for High …, 2010 | 289 | 2010 |

Proving program termination B Cook, A Podelski, A Rybalchenko Communications of the ACM 54 (5), 88-98, 2011 | 258* | 2011 |

Fast sort on CPUs and GPUs: a case for bandwidth oblivious SIMD sort N Satish, C Kim, J Chhugani, AD Nguyen, VW Lee, D Kim, P Dubey Proceedings of the 2010 ACM SIGMOD International Conference on Management of …, 2010 | 247 | 2010 |

GraphMat: High performance graph analytics made productive N Sundaram, NR Satish, MMA Patwary, SR Dulloor, SG Vadlamudi, ... arXiv preprint arXiv:1503.07241, 2015 | 182 | 2015 |

Navigating the maze of graph analytics frameworks using massive graph datasets N Satish, N Sundaram, MMA Patwary, J Seo, J Park, MA Hassaan, ... Proceedings of the 2014 ACM SIGMOD international conference on Management of …, 2014 | 177* | 2014 |

DySER: Unifying functionality and parallelism specialization for energy-efficient computing V Govindaraju, CH Ho, T Nowatzki, J Chhugani, N Satish, ... IEEE Micro 32 (5), 38-51, 2012 | 166 | 2012 |

Fast updates on read-optimized databases using multi-core CPUs J Krueger, C Kim, M Grund, N Satish, D Schwalb, J Chhugani, H Plattner, ... Proceedings of the VLDB Endowment 5 (1), 61-72, 2011 | 147 | 2011 |

Graphicionado: A high-performance and energy-efficient accelerator for graph analytics TJ Ham, L Wu, N Sundaram, N Satish, M Martonosi 2016 49th Annual IEEE/ACM International Symposium on Microarchitecture …, 2016 | 123 | 2016 |

Can traditional programming bridge the ninja performance gap for parallel computing applications? N Satish, C Kim, J Chhugani, H Saito, R Krishnaiyer, M Smelyanskiy, ... 2012 39th Annual International Symposium on Computer Architecture (ISCA …, 2012 | 120 | 2012 |

Efficient parallelization of H.264 decoding with macro block level scheduling J Chong, N Satish, B Catanzaro, K Ravindran, K Keutzer 2007 IEEE international conference on multimedia and expo, 1874-1877, 2007 | 109 | 2007 |

Streaming similarity search over one billion tweets using parallel locality-sensitive hashing N Sundaram, A Turmukhametova, N Satish, T Mostak, P Indyk, S Madden, ... Proceedings of the VLDB Endowment 6 (14), 1930-1941, 2013 | 105 | 2013 |

PALM: Parallel architecture-friendly latch-free modifications to B+ trees on many-core processors J Sewall, J Chhugani, C Kim, N Satish, P Dubey Proc. VLDB Endowment 4 (11), 795-806, 2011 | 105 | 2011 |

Data tiering in heterogeneous memory systems SR Dulloor, A Roy, Z Zhao, N Sundaram, N Satish, R Sankaran, ... Proceedings of the Eleventh European Conference on Computer Systems, 1-16, 2016 | 104 | 2016 |