Follow
Ke Hong
Title
Cited by
Cited by
Year
Flashdecoding++: Faster large language model inference on gpus
K Hong, G Dai, J Xu, Q Mao, X Li, J Liu, K Chen, H Dong, Y Wang
arXiv preprint arXiv:2311.01282, 2023
172023
A learning-based aoa estimation method for device-free localization
K Hong, T Wang, J Liu, Y Wang, Y Shen
IEEE Communications Letters 26 (6), 1264-1267, 2022
92022
Exploiting hardware utilization and adaptive dataflow for efficient sparse convolution in 3d point clouds
K Hong, Z Yu, G Dai, X Yang, Y Lian, N Xu, Y Wang
Proceedings of Machine Learning and Systems 5, 2023
62023
Torchsparse++: Efficient point cloud engine
H Tang, S Yang, Z Liu, K Hong, Z Yu, X Li, G Dai, Y Wang, S Han
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
62023
An efficient accelerator for point-based and voxel-based point cloud neural networks
X Yang, T Fu, G Dai, S Zeng, K Zhong, K Hong, Y Wang
2023 60th ACM/IEEE Design Automation Conference (DAC), 1-6, 2023
42023
Torchsparse++: Efficient training and inference framework for sparse convolution on gpus
H Tang, S Yang, Z Liu, K Hong, Z Yu, X Li, G Dai, Y Wang, S Han
Proceedings of the 56th Annual IEEE/ACM International Symposium on …, 2023
32023
LLM-MQ: Mixed-precision Quantization for Efficient LLM Deployment
S Li, X Ning, K Hong, T Liu, L Wang, X Li, K Zhong, G Dai, H Yang, ...
2
Ada3d: Exploiting the spatial redundancy with adaptive inference for efficient 3d object detection
T Zhao, X Ning, K Hong, Z Qiu, P Lu, Y Zhao, L Zhang, L Zhou, G Dai, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
12023
A Survey on Efficient Inference for Large Language Models
Z Zhou, X Ning, K Hong, T Fu, J Xu, S Li, Y Lou, L Wang, Z Yuan, X Li, ...
arXiv preprint arXiv:2404.14294, 2024
2024
A Point Transformer Accelerator with Fine-Grained Pipelines and Distribution-Aware Dynamic FPS
Y Lian, X Yang, K Hong, Y Wang, G Dai, N Xu
2023 IEEE/ACM International Conference on Computer Aided Design (ICCAD), 1-9, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–10