Xulong Tang
Title
Cited by
Cited by
Year
Scheduling techniques for GPU architectures with processing-in-memory capabilities
A Pattnaik, X Tang, A Jog, O Kayiran, AK Mishra, MT Kandemir, O Mutlu, ...
Proceedings of the 2016 International Conference on Parallel Architectures …, 2016
1192016
Controlled kernel launch for dynamic parallelism in GPUs
X Tang, A Pattnaik, H Jiang, O Kayiran, A Jog, S Pai, M Ibrahim, ...
2017 IEEE International Symposium on High Performance Computer Architecture …, 2017
422017
μC-States: Fine-grained GPU datapath power management
O Kayiran, A Jog, A Pattnaik, R Ausavarungnirun, X Tang, MT Kandemir, ...
2016 International Conference on Parallel Architecture and Compilation …, 2016
372016
Improving bank-level parallelism for irregular applications
X Tang, M Kandemir, P Yedlapalli, J Kotra
2016 49th Annual IEEE/ACM International Symposium on Microarchitecture …, 2016
342016
Data movement aware computation partitioning
X Tang, O Kislal, M Kandemir, M Karakoy
Proceedings of the 50th Annual IEEE/ACM International Symposium on …, 2017
292017
Memory row reuse distance and its role in optimizing application performance
M Kandemir, H Zhao, X Tang, M Karakoy
Proceedings of the 2015 ACM SIGMETRICS International Conference on …, 2015
232015
Opportunistic computing in gpu architectures
A Pattnaik, X Tang, O Kayiran, A Jog, A Mishra, MT Kandemir, ...
2019 ACM/IEEE 46th Annual International Symposium on Computer Architecture …, 2019
182019
Optimizing off-chip accesses in multicores
W Ding, X Tang, M Kandemir, Y Zhang, E Kultursay
Proceedings of the 36th ACM SIGPLAN Conference on Programming Language …, 2015
162015
FlexBFS: a parallelism-aware implementation of breadth-first search on GPU
G Liu, H An, W Han, X Li, T Sun, W Zhou, X Wei, X Tang
Proceedings of the 17th ACM SIGPLAN symposium on Principles and Practice of …, 2012
132012
DEMM: a Dynamic Energy-saving mechanism for Multicore Memories
A Sharifi, W Ding, D Guttman, H Zhao, X Tang, M Kandemir, C Das
2017 IEEE 25th International Symposium on Modeling, Analysis, and Simulation …, 2017
122017
Enhancing computation-to-core assignment with physical location information
O Kislal, J Kotra, X Tang, MT Kandemir, M Jung
ACM SIGPLAN Notices 53 (4), 312-327, 2018
82018
POSTER: Location-Aware Computation Mapping for Manycore Processors
O Kislal, J Kotra, X Tang, MT Kandemir, M Jung
2017 26th International Conference on Parallel Architectures and Compilation …, 2017
82017
Quantifying and Optimizing Data Access Parallelism on Manycores
J Ryoo, O Kislal, X Tang, MT Kandemir
2018 IEEE 26th International Symposium on Modeling, Analysis, and Simulation …, 2018
72018
Oversubscribed command queues in GPUs
S Puthoor, X Tang, J Gross, BM Beckmann
Proceedings of the 11th Workshop on General Purpose GPUs, 50-60, 2018
72018
Computing with near data
X Tang, M Taylan Kandemir, H Zhao, M Jung, M Karakoy
ACM SIGMETRICS Performance Evaluation Review 47 (1), 27-28, 2019
52019
Quantifying Data Locality in Dynamic Parallelism in GPUs
X Tang, A Pattnaik, O Kayiran, A Jog, MT Kandemir, C Das
Proceedings of the ACM on Measurement and Analysis of Computing Systems 2 (3 …, 2018
52018
VC-Bench: A Video Coding Benchmark Suite for Evaluation of Processor Capability
X Tang, H An, G Sun, D Fan
Software Engineering, Artificial Intelligence, Networking and Parallel …, 2013
52013
Quantifying Data Locality in Dynamic Parallelism in GPUs
X Tang, A Pattnaik, O Kayiran, A Jog, MT Kandemir, C Das
ACM SIGMETRICS Performance Evaluation Review 47 (1), 25-26, 2019
42019
Co-optimizing memory-level parallelism and cache-level parallelism
X Tang, MT Kandemir, M Karakoy, M Arunachalam
Proceedings of the 40th ACM SIGPLAN Conference on Programming Language …, 2019
42019
Main Memory Performance Optimization of Phase Change RAM in Stream Processor [J]
XR Hao, H An, XQ Li, XL Tang
Computer Engineering 24, 2011
22011
The system can't perform the operation now. Try again later.
Articles 1–20