Follow
John Pennycook
Title
Cited by
Cited by
Year
Exploring SIMD for Molecular Dynamics, Using Intel Xeon Processors and Intel Xeon Phi Coprocessors
SJ Pennycook, CJ Hughes, M Smelyanskiy, SA Jarvis
IEEE International Parallel & Distributed Processing Symposium, 2013
1992013
CosmoFlow: Using deep learning to learn the universe at scale
A Mathuriya, D Bard, P Mendygral, L Meadows, J Arnemann, L Shao, ...
SC18: International Conference for High Performance Computing, Networking …, 2018
1082018
Performance analysis of a hybrid MPI/CUDA implementation of the NASLU benchmark
SJ Pennycook, SD Hammond, SA Jarvis, GR Mudalige
ACM SIGMETRICS Performance Evaluation Review 38 (4), 23-29, 2011
912011
An investigation of the performance portability of OpenCL
SJ Pennycook, SD Hammond, SA Wright, JA Herdman, I Miller, SA Jarvis
Journal of Parallel and Distributed Computing 73 (11), 1439-1450, 2013
852013
Implications of a metric for performance portability
SJ Pennycook, JD Sewall, VW Lee
Future Generation Computer Systems 92, 947-958, 2019
632019
A metric for performance portability
SJ Pennycook, JD Sewall, VW Lee
arXiv preprint arXiv:1611.07409, 2016
602016
Parallel file system analysis through application I/O tracing
SA Wright, SD Hammond, SJ Pennycook, RF Bird, JA Herdman, I Miller, ...
The Computer Journal 56 (2), 141-155, 2013
362013
On the acceleration of wavefront applications using distributed many-core architectures
SJ Pennycook, SD Hammond, GR Mudalige, SA Wright, SA Jarvis
The Computer Journal 55 (2), 138-153, 2012
312012
Effective performance portability
SL Harrell, J Kitson, R Bird, SJ Pennycook, J Sewall, D Jacobsen, ...
2018 IEEE/ACM International Workshop on Performance, Portability and …, 2018
302018
Developing performance-portable molecular dynamics kernels in OpenCL
SJ Pennycook, SA Jarvis
2012 SC Companion: High Performance Computing, Networking Storage and …, 2012
222012
Methods and apparatus for multi-load and multi-store vector instructions
L Meadows, A Duran, S Pennycook, J Sewall
US Patent App. 15/859,033, 2019
152019
Ldplfs: Improving i/o performance without application modification
SA Wright, SD Hammond, SJ Pennycook, I Miller, JA Herdman, SA Jarvis
2012 IEEE 26th International Parallel and Distributed Processing Symposium …, 2012
132012
Evaluating the impact of proposed openmp 5.0 features on performance, portability and productivity
SJ Pennycook, JD Sewall, JR Hammond
2018 IEEE/ACM International Workshop on Performance, Portability and …, 2018
112018
Interpreting and visualizing performance portability metrics
J Sewall, SJ Pennycook, D Jacobsen, T Deakin, S McIntosh-Smith
2020 IEEE/ACM International Workshop on Performance, Portability and …, 2020
82020
Unveiling the Early Universe: Optimizing Cosmology Workloads for Intel Xeon Phi Coprocessors in an SGI UV2000 System
J Briggs, SJ Pennycook, EPS Shellard, C Martins, M Woodacre, K Feind
Tech. Rep.(SGI/Intel White Paper, 2014), 2014
72014
Towards a portable and future-proof particle-in-cell plasma physics code
RF Bird, SJ Pennycook, SA Wright, SA Jarvis
72013
Light-weight parallel I/O analysis at scale
SA Wright, SD Hammond, SJ Pennycook, SA Jarvis
Computer Performance Engineering: 8th European Performance Engineering …, 2011
72011
Navigating performance, portability, and productivity
SJ Pennycook, JD Sewall, DW Jacobsen, T Deakin, S McIntosh-Smith
Computing in Science & Engineering 23 (5), 28-38, 2021
62021
WMTrace-A Lightweight Memory Allocation Tracker and Analysis Framework
O Perks, SD Hammond, SJ Pennycook, SA Jarvis
Proceedings of the UK Performance Engineering Workshop (UKPEW 2011), 2011
62011
Model-led optimisation of a geometric multigrid application
R Bunt, S Pennycook, S Jarvis, L Lapworth, Y Ho
2013 IEEE 10th International Conference on High Performance Computing and …, 2013
52013
The system can't perform the operation now. Try again later.
Articles 1–20