Follow
Joshua Hursey
Title
Cited by
Cited by
Year
Why It’s Worth the Hassle: The Value of In-Situ Studies When Designing Ubicomp: (Nominated for the Best Paper Award)
Y Rogers, K Connelly, L Tedesco, W Hazlewood, A Kurtz, RE Hall, ...
UbiComp 2007: Ubiquitous Computing: 9th International Conference, UbiComp …, 2007
2822007
The design and implementation of checkpoint/restart process fault tolerance for Open MPI
J Hursey, JM Squyres, TI Mattox, A Lumsdaine
2007 IEEE International Parallel and Distributed Processing Symposium, 1-8, 2007
2652007
An evaluation of user-level failure mitigation support in MPI
W Bland, A Bouteiller, T Herault, J Hursey, G Bosilca, JJ Dongarra
Recent Advances in the Message Passing Interface: 19th European MPI Users …, 2012
1432012
PMIx: Process management for exascale environments
RH Castain, J Hursey, A Bouteiller, D Solt
Parallel Computing 79, 9-29, 2018
942018
Interconnect agnostic checkpoint/restart in Open MPI
J Hursey, TI Mattox, A Lumsdaine
Proceedings of the 18th ACM international symposium on High Performance …, 2009
862009
Run-through stabilization: An MPI proposal for process fault tolerance
J Hursey, RL Graham, G Bronevetsky, D Buntinas, H Pritchard, DG Solt
Recent Advances in the Message Passing Interface: 18th European MPI Users …, 2011
712011
An evaluation of user-level failure mitigation support in MPI
W Bland, A Bouteiller, T Herault, J Hursey, G Bosilca, JJ Dongarra
Computing 95, 1171-1184, 2013
512013
Coordinated checkpoint/restart process fault tolerance for MPI applications on HPC systems
J Hursey
Indiana University, 2010
442010
A log-scaling fault tolerant agreement algorithm for a fault tolerant MPI
J Hursey, T Naughton, G Vallee, RL Graham
Recent Advances in the Message Passing Interface: 18th European MPI Users …, 2011
432011
Locality-aware parallel process mapping for multi-core HPC systems
J Hursey, JM Squyres, T Dontje
2011 IEEE international conference on cluster computing, 527-531, 2011
382011
A checkpoint and restart service specification for Open MPI
J Hursey, JM Squyres, A Lumsdaine
Indiana University, Computer Science Department, Technical Report, 2006
332006
Netloc: Towards a comprehensive view of the HPC system topology
B Goglin, J Hursey, JM Squyres
2014 43rd International Conference on Parallel Processing Workshops, 216-225, 2014
312014
Building a fault tolerant MPI application: A ring communication example
J Hursey, RL Graham
2011 IEEE International Symposium on Parallel and Distributed Processing …, 2011
292011
A performance analysis and optimization of PMIx-based HPC software stacks
AY Polyakov, BI Karasev, J Hursey, J Ladd, M Brinskii, E Shipunova
Proceedings of the 26th European MPI Users' Group Meeting, 1-10, 2019
182019
A composable runtime recovery policy framework supporting resilient HPC applications
J Hursey, A Lumsdaine
Indiana University, Bloomington, Indiana, USA, Tech. Rep. TR686, 2010
182010
An extensible framework for distributed testing of mpi implementations
J Hursey, E Mallove, JM Squyres, A Lumsdaine
European Parallel Virtual Machine/Message Passing Interface Users’ Group …, 2007
182007
Preserving collective performance across process failure for a fault tolerant MPI
J Hursey, RL Graham
2011 IEEE International Symposium on Parallel and Distributed Processing …, 2011
172011
Checkpoint/restart-enabled parallel debugging
J Hursey, C January, M O’Connor, PH Hargrove, D Lecomber, ...
Recent Advances in the Message Passing Interface: 17th European MPI Users …, 2010
172010
Advancing application process affinity experimentation: Open MPI's LAMA-based affinity interface
J Hursey, JM Squyres
Proceedings of the 20th European MPI Users' Group Meeting, 163-168, 2013
152013
Design considerations for building and running containerized MPI applications
J Hursey
2020 2nd International Workshop on Containers and New Orchestration …, 2020
142020
The system can't perform the operation now. Try again later.
Articles 1–20