Sitao Huang
Title
Cited by
Year
FPGA/DNN co-design: An efficient design methodology for IoT intelligence on the edge
C Hao, X Zhang, Y Li, S Huang, J Xiong, K Rupnow, W Hwu, D Chen
2019 56th ACM/IEEE Design Automation Conference (DAC), 1-6, 2019
Cited by: 91; Year: 2019
Towards Neural Phrase-based Machine Translation
PS Huang, C Wang, S Huang, D Zhou, L Deng
Sixth International Conference on Learning Representations (ICLR), 2018
Cited by: 78; Year: 2018
Hardware acceleration of the pair-HMM algorithm for DNA variant calling
S Huang, GJ Manikandan, A Ramachandran, K Rupnow, WW Hwu, ...
Proceedings of the 2017 ACM/SIGDA International Symposium on Field …, 2017
Cited by: 56; Year: 2017
Accelerating subsequence similarity search based on dynamic time warping distance with FPGA
Z Wang, S Huang, L Wang, H Li, Y Wang, H Yang
Proceedings of the ACM/SIGDA international symposium on Field programmable …, 2013
Cited by: 36; Year: 2013
Automatic generation of warp-level primitives and atomic instructions for fast and portable parallel reduction on GPUs
SG De Gonzalo, S Huang, J Gómez-Luna, S Hammond, O Mutlu, W Hwu
2019 IEEE/ACM International Symposium on Code Generation and Optimization …, 2019
Cited by: 20; Year: 2019
Analysis and modeling of collaborative execution strategies for heterogeneous CPU-FPGA architectures
S Huang, LW Chang, I El Hajj, S Garcia de Gonzalo, J Gómez-Luna, ...
Proceedings of the 2019 ACM/SPEC International Conference on Performance …, 2019
Cited by: 18; Year: 2019
Collaborative computing for heterogeneous integrated systems
LW Chang, J Gómez-Luna, I El Hajj, S Huang, D Chen, W Hwu
Proceedings of the 8th ACM/SPEC on International Conference on Performance …, 2017
Cited by: 16; Year: 2017
Accelerating frequent item counting with FPGA
Y Sun, Z Wang, S Huang, L Wang, Y Wang, R Luo, H Yang
Proceedings of the 2014 ACM/SIGDA international symposium on Field …, 2014
Cited by: 16; Year: 2014
Hardware-software co-design for an analog-digital accelerator for machine learning
J Ambrosi, A Ankit, R Antunes, SR Chalamalasetti, S Chatterjee, I El Hajj, ...
2018 IEEE International Conference on Rebooting Computing (ICRC), 1-13, 2018
Cited by: 13; Year: 2018
Triangle counting and truss decomposition using FPGA
S Huang, M El-Hadedy, C Hao, Q Li, VS Mailthody, K Date, J Xiong, ...
2018 IEEE High Performance extreme Computing Conference (HPEC), 1-7, 2018
Cited by: 13; Year: 2018
Accelerating sparse deep neural networks on FPGAs
S Huang, C Pearson, R Nagi, J Xiong, D Chen, W Hwu
2019 IEEE High Performance Extreme Computing Conference (HPEC), 1-7, 2019
Cited by: 11; Year: 2019
Mind mappings: enabling efficient algorithm-accelerator mapping space search
K Hegde, PA Tsai, S Huang, V Chandra, A Parashar, CW Fletcher
Proceedings of the 26th ACM International Conference on Architectural …, 2021
Cited by: 7; Year: 2021
Analysis and optimization of I/O cache coherency strategies for SoC-FPGA device
SW Min, S Huang, M El-Hadedy, J Xiong, D Chen, W Hwu
2019 29th International Conference on Field Programmable Logic and …, 2019
Cited by: 7; Year: 2019
Near-memory and in-storage FPGA acceleration for emerging cognitive computing workloads
A Dhar, S Huang, J Xiong, D Jamsek, B Mesnet, J Huang, NS Kim, W Hwu, ...
2019 IEEE Computer Society Annual Symposium on VLSI (ISVLSI), 68-75, 2019
Cited by: 5; Year: 2019
DTW-based subsequence similarity search on AMD heterogeneous computing platform
S Huang, G Dai, Y Sun, Z Wang, Y Wang, H Yang
2013 IEEE 10th International Conference on High Performance Computing and …, 2013
Cited by: 5; Year: 2013
Mixed precision quantization for ReRAM-based DNN inference accelerators
S Huang, A Ankit, P Silveira, R Antunes, SR Chalamalasetti, I El Hajj, ...
2021 26th Asia and South Pacific Design Automation Conference (ASP-DAC), 372-377, 2021
Cited by: 4; Year: 2021
Thoughts on massively-parallel heterogeneous computing for solving large problems
W Hwu, M Hidayetoglu, WC Chew, C Pearson, S Garcia, S Huang, ...
2017 Computing and Electromagnetics International Workshop (CEM), 67-68, 2017
Cited by: 3; Year: 2017
Acceleration of the Pair-HMM algorithm for DNA variant calling
GJ Manikandan, S Huang, K Rupnow, WMW Hwu, D Chen
2016 IEEE 24th Annual International Symposium on Field-Programmable Custom …, 2016
Cited by: 3; Year: 2016
Pylog: An algorithm-centric python-based FPGA programming and synthesis flow
S Huang, K Wu, H Jeong, C Wang, D Chen, WM Hwu
IEEE Transactions on Computers 70 (12), 2015-2028, 2021
Cited by: 2; Year: 2021
PyTorch-Direct: Enabling GPU Centric Data Access for Very Large Graph Neural Network Training with Irregular Accesses
SW Min, K Wu, S Huang, M Hidayetoğlu, J Xiong, E Ebrahimi, D Chen, ...
arXiv preprint arXiv:2101.07956, 2021
Cited by: 2; Year: 2021