Follow
Toshio Endo
Title
Cited by
Cited by
Year
Peta-scale phase-field simulation for dendritic solidification on the TSUBAME 2.0 supercomputer
T Shimokawabe, T Aoki, T Takaki, T Endo, A Yamanaka, N Maruyama, ...
Proceedings of 2011 International Conference for High Performance Computing …, 2011
2592011
Statistical power modeling of GPU kernels using performance counters
H Nagasaka, N Maruyama, A Nukada, T Endo, S Matsuoka
International conference on green computing, 115-122, 2010
2502010
An 80-fold speedup, 15.0 TFlops full GPU acceleration of non-hydrostatic weather model ASUCA production code
T Shimokawabe, T Aoki, C Muroi, J Ishida, K Kawano, T Endo, A Nukada, ...
SC'10: Proceedings of the 2010 ACM/IEEE International Conference for High …, 2010
1772010
Bandwidth intensive 3-D FFT kernel for GPUs using CUDA
A Nukada, Y Ogata, T Endo, S Matsuoka
SC'08: Proceedings of the 2008 ACM/IEEE conference on Supercomputing, 1-11, 2008
1722008
An efficient, model-based CPU-GPU heterogeneous FFT library
Y Ogata, T Endo, N Maruyama, S Matsuoka
2008 IEEE international symposium on parallel and distributed processing, 1-10, 2008
1202008
Exploration of lossy compression for application-level checkpoint/restart
N Sasaki, K Sato, T Endo, S Matsuoka
2015 IEEE international parallel and distributed processing symposium, 914-922, 2015
1112015
A scalable mark-sweep garbage collector on large-scale shared-memory machines
T Endo, K Taura, A Yonezawa
Proceedings of the 1997 ACM/IEEE Conference on Supercomputing, 1-14, 1997
1031997
Phoenix: a parallel programming model for accommodating dynamically joining/leaving resources
K Taura, K Kaneda, T Endo, A Yonezawa
ACM SIGPLAN Notices 38 (10), 216-229, 2003
1002003
Linpack evaluation on a supercomputer with heterogeneous accelerators
T Endo, S Matsuoka, A Nukada, N Maruyama
2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010
772010
Petaflop biofluidics simulations on a two million-core system
M Bernaschi, M Bisson, T Endo, S Matsuoka, M Fatica, S Melchionna
Proceedings of 2011 International Conference for High Performance Computing …, 2011
572011
Massive supercomputing coping with heterogeneity of modern accelerators
T Endo, S Matsuoka
2008 IEEE International Symposium on Parallel and Distributed Processing, 1-10, 2008
562008
AN5D: automated stencil framework for high-degree temporal blocking on GPUs
K Matsumura, HR Zohouri, M Wahib, T Endo, S Matsuoka
Proceedings of the 18th ACM/IEEE International Symposium on Code Generation …, 2020
532020
Power-aware dynamic task scheduling for heterogeneous accelerated clusters
T Hamano, T Endo, S Matsuoka
2009 IEEE International Symposium on Parallel & Distributed Processing, 1-8, 2009
482009
GPU accelerated computing–from hype to mainstream, the rebirth of vector computing
S Matsuoka, T Aoki, T Endo, A Nukada, T Kato, A Hasegawa
Journal of Physics: Conference Series 180 (1), 012043, 2009
472009
A parallel optimization method for stencil computation on the domain that is bigger than memory capacity of GPUs
G Jin, T Endo, S Matsuoka
2013 IEEE International Conference on Cluster Computing (CLUSTER), 1-8, 2013
462013
Access-pattern and bandwidth aware file replication algorithm in a grid environment
H Sato, S Matsuoka, T Endo, N Maruyama
2008 9th IEEE/ACM International Conference on Grid Computing, 250-257, 2008
392008
A multi-level optimization method for stencil computation on the domain that is bigger than memory capacity of GPU
G Jin, T Endo, S Matsuoka
2013 IEEE International Symposium on Parallel & Distributed Processing …, 2013
352013
File clustering based replication algorithm in a grid environment
H Sato, S Matsuoka, T Endo
2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid …, 2009
352009
Software technologies coping with memory hierarchy of GPGPU clusters for stencil computations
T Endo, G Jin
2014 IEEE International Conference on Cluster Computing (CLUSTER), 132-139, 2014
312014
A stencil framework to realize large-scale computations beyond device memory capacity on GPU supercomputers
T Shimokawabe, T Endo, N Onodera, T Aoki
2017 IEEE International Conference on Cluster Computing (CLUSTER), 525-529, 2017
292017
The system can't perform the operation now. Try again later.
Articles 1–20