Follow
Minglun Han
Minglun Han
ByteDance; Previously CASIA.
Verified email at bytedance.com
Title
Cited by
Cited by
Year
VLP: A Survey on Vision-language Pre-training
F Chen, D Zhang, M Han, X Chen, J Shi, S Xu, B Xu
Machine Intelligence Research 20 (1), 38-56, 2023
1562023
X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages
F Chen, M Han, H Zhao, Q Zhang, J Shi, S Xu, B Xu
arXiv preprint arXiv:2305.04160, 2023
692023
Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection
M Han, L Dong, Z Liang, M Cai, S Zhou, Z Ma, B Xu
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
342022
Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition
Q Wang, T Zhang, M Han, Y Wang, D Zhang, B Xu
Proceedings of the AAAI Conference on Artificial Intelligence 37 (1), 102-109, 2023
232023
CIF-based Collaborative Decoding for End-to-End Contextual Speech Recognition
M Han, L Dong, S Zhou, B Xu
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
212021
Knowledge Transfer from Pre-trained Language Models to CIF-based Speech Recognizers via Hierarchical Distillation
M Han, F Chen, J Shi, S Xu, B Xu
INTERSPEECH 2023, 2023
82023
VILAS: Exploring the Effects of Vision and Language Context in Automatic Speech Recognition
Z Ni, M Han, F Chen, L Meng, J Shi, S Xu, B Xu
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2023
2*2023
Matching-based Term Semantics Pre-training for Spoken Patient Query Understanding
Z Hu, X Chen, H Wu, M Han, Z Ni, J Shi, S Xu, B Xu
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
22023
Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition
Y Bai, J Chen, J Chen, W Chen, Z Chen, C Ding, L Dong, Q Dong, Y Du, ...
arXiv preprint arXiv:2407.04675, 2024
2024
Enhancing Visual Question Answering via Deconstructing Questions and Explicating Answers
F Chen, M Han, J Shi, S Xu, B Xu
INTERSPEECH 2023, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–10