Tareq Malas, Ph.D.
Tareq Malas, Ph.D.
Verified email at intel.com - Homepage
TitleCited byYear
Multicore-optimized wavefront diamond blocking for optimizing stencil updates
T Malas, G Hager, H Ltaief, H Stengel, G Wellein, D Keyes
SIAM Journal on Scientific Computing 37 (4), C439-C464, 2015
572015
Applying the roofline performance model to the intel xeon phi knights landing processor
D Doerfler, J Deslippe, S Williams, L Oliker, B Cook, T Kurth, M Lobet, ...
International Conference on High Performance Computing, 339-353, 2016
492016
Deep learning at 15pf: supervised and semi-supervised classification for scientific data
T Kurth, J Zhang, N Satish, E Racah, I Mitliagkas, MMA Patwary, T Malas, ...
Proceedings of the International Conference for High Performance Computing …, 2017
432017
Evaluating and optimizing the nersc workload on knights landing
T Barnes, B Cook, J Deslippe, D Doerfler, B Friesen, Y He, T Kurth, ...
2016 7th International Workshop on Performance Modeling, Benchmarking and …, 2016
352016
Multidimensional intratile parallelization for memory-starved stencil computations
TM Malas, G Hager, H Ltaief, DE Keyes
ACM Transactions on Parallel Computing (TOPC) 4 (3), 12, 2018
192018
Feature selection for recognizing handwritten Arabic letters
GA Abandah, TM Malas
Dirasat Engineering Sciences Journal 37 (2), 2010
192010
Toward optimal Arabic keyboard layout using genetic algorithm
TM Malas, SS Taifour, GA Abandah
Proc. 9th Int’l Middle Eastern Multiconf. on Simulation and Modeling (MESM …, 2008
182008
Optimization of an electromagnetics code with multicore wavefront diamond blocking and multi-dimensional intra-tile parallelization
TM Malas, J Hornich, G Hager, H Ltaief, C Pflaum, DE Keyes
2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2016
142016
Towards energy efficiency and maximum computational intensity for stencil algorithms using wavefront diamond temporal blocking
T Malas, G Hager, H Ltaief, D Keyes
arXiv preprint arXiv:1410.5561, 2014
102014
Optimizing the performance of streaming numerical kernels on the IBM Blue Gene/P PowerPC 450 processor
T Malas, AJ Ahmadia, J Brown, JA Gunnels, DE Keyes
The International Journal of High Performance Computing Applications 27 (2 …, 2013
82013
Optimization of the sparse matrix-vector products of an IDR Krylov iterative solver in EMGeo for the Intel KNL manycore processor
T Malas, T Kurth, J Deslippe
International Conference on High Performance Computing, 378-389, 2016
62016
High-Performance Seismic Modeling with Finite-Difference Using Spatial and Temporal Cache Blocking
V Etienne, T Tonellot, T Malas, H Ltaief, S Kortas, P Thierry, D Keyes
Third EAGE Workshop on High Performance Computing for Upstream, 2017
42017
Analyzing Performance of Selected NESAP Applications on the Cori HPC System
T Kurth, W Arndt, T Barnes, B Cook, J Deslippe, D Doerfler, B Friesen, ...
International Conference on High Performance Computing, 334-347, 2017
12017
Tiling and asynchronous communication optimizations for stencil computations
TMY Malas
12015
Optimization of finite-difference kernels on multi-core architectures for seismic applications
V Etienne, T Tonellot, K Akbudak, H Ltaief, S Kortas, T Malas, P Thierry, ...
2018
Analyzing Performance of Selected NESAP Applications on the Cori HPC System
J Deslippe, D Doerfler, B Friesen, YH He, T Koskela, M Lobet, T Malas, ...
High Performance Computing: ISC High Performance 2017 International …, 2017
2017
Towards Fast Reverse Time Migration Kernels using Multi-threaded Wavefront Diamond Tiling
T Malas, G Hager, H Ltaief, D Keyes
Second EAGE Workshop on High Performance Computing for Upstream, 2015
2015
Optimizing Stencil Computations: Multicore-optimized wavefront diamond blocking on Shared and Distributed Memory Systems
T Malas, G Hager, H Ltaief, H Stengel, G Wellein, D Keyes
2014
Advanced tiling techniques for memory-starved streaming numerical kernels
T Malas, H Ltaief, G Hager, D Keyes
The system can't perform the operation now. Try again later.
Articles 1–19