Shizhe Diao

Cited by

	All	Since 2019
Citations	981	980
h-index	16	16
i10-index	19	19

500

250

125

375

2020202120222023202416 31 93 357 482

Public access

View all

3 articles

1 article

available

not available

Based on funding mandates

Co-authors

Tong ZhangUIUCVerified email at tongzhang-ml.org
Jipeng ZhangHong Kong University of Science and TechnologyVerified email at connect.ust.hk
Yong Linthe Hong Kong University of Science and TechnologyVerified email at connect.ust.hk
KaShun SHUMThe Hong Kong University of Science and TechnologyVerified email at connect.ust.hk
Hanze DongSalesforce ResearchVerified email at salesforce.com
Wei XiongComputer Science, University of Illinois Urbana-ChampaignVerified email at illinois.edu
Renjie PiHKUSTVerified email at connect.ust.hk
Ruijia XuVerified email at connect.ust.hk
Wangchunshu ZhouAIWavesVerified email at aiwaves.cn
xinsong zhangByteDance AI LabVerified email at bytedance.com
Xiao ZHOUPhD of CSE, Hong Kong University of Science and Technology (HKUST)Verified email at connect.ust.hk
Pengcheng WangComputer Engineering Student, University of TorontoVerified email at mail.utoronto.ca
Zhichao HuangHong Kong University of Science and TechnologyVerified email at connect.ust.hk
Zhihong ChenStanford UniversityVerified email at stanford.edu
SU HongjinPhD student, The University of Hong KongVerified email at cs.hku.hk
Weizhong ZhangFudan UniversityVerified email at ust.hk
Zonghao ChenUniversity College LondonVerified email at ucl.ac.uk
Xinwei ShenETH ZurichVerified email at stat.math.ethz.ch
Yan ZengByteDance, ResearchVerified email at bytedance.com

Shizhe Diao

Hong Kong University of Science and Technology

Verified email at connect.ust.hk - Homepage

Large Language Models Natural Language Processing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Raft: Reward ranked finetuning for generative foundation model alignment H Dong, W Xiong, D Goyal, Y Zhang, W Chow, R Pan, S Diao, J Zhang, ... arXiv preprint arXiv:2304.06767, 2023	187	2023
ZEN: Pre-training Chinese text encoder enhanced by n-gram representations S Diao, J Bai, Y Song, T Zhang, Y Wang Findings of EMNLP 2020, 2019	128	2019
Active prompting with chain-of-thought for large language models S Diao, P Wang, Y Lin, T Zhang arXiv preprint arXiv:2302.12246, 2023	114	2023
Black-Box Prompt Learning for Pre-trained Language Models S Diao, Z Huang, R Xu, X Li, Y Lin, X Zhou, T Zhang Transactions on Machine Learning Research (TMLR), 2022	72	2022
Automatic prompt augmentation and selection with chain-of-thought from labeled data KS Shum, S Diao, T Zhang arXiv preprint arXiv:2302.12822, 2023	63	2023
Detgpt: Detect what you need via reasoning R Pi, J Gao, S Diao, R Pan, H Dong, J Zhang, L Yao, J Han, H Xu, L Kong, ... arXiv preprint arXiv:2305.14167, 2023	55	2023
Taming pre-trained language models with n-gram representations for low-resource domain adaptation S Diao, R Xu, H Su, Y Jiang, Y Song, T Zhang Proceedings of the 59th Annual Meeting of the Association for Computational …, 2021	49	2021
Lmflow: An extensible toolkit for finetuning and inference of large foundation models S Diao, R Pan, H Dong, KS Shum, J Zhang, W Xiong, T Zhang arXiv preprint arXiv:2306.12420, 2023	36	2023
Efficient neural network training via forward and backward propagation sparsification X Zhou, W Zhang, Z Chen, S Diao, T Zhang Advances in neural information processing systems 34, 15216-15229, 2021	35	2021
R-tuning: Teaching large language models to refuse unknown questions H Zhang, S Diao, Y Lin, YR Fung, Q Lian, X Wang, Y Chen, H Ji, T Zhang arXiv preprint arXiv:2311.09677, 2023	30	2023
Vlue: A multi-task multi-dimension benchmark for evaluating vision-language pre-training W Zhou, Y Zeng, S Diao, X Zhang International Conference on Machine Learning, 27395-27411, 2022	24*	2022
Unitime: A language-empowered unified model for cross-domain time series forecasting X Liu, J Hu, Y Li, S Diao, Y Liang, B Hooi, R Zimmermann Proceedings of the ACM on Web Conference 2024, 4095-4106, 2024	22	2024
Mixture-of-domain-adapters: Decoupling and injecting domain knowledge to pre-trained language models memories S Diao, T Xu, R Xu, J Wang, T Zhang arXiv preprint arXiv:2306.05406, 2023	20	2023
TILGAN: transformer-based implicit latent GAN for diverse and coherent text generation S Diao, X Shen, K Shum, Y Song, T Zhang Findings of the Association for Computational linguistics: ACL-IJCNLP 2021 …, 2021	20	2021
Towards unifying medical vision-and-language pre-training via soft prompts Z Chen, S Diao, B Wang, G Li, X Wan Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	18	2023
Arithmetic control of llms for diverse user preferences: Directional preference alignment with multi-objective rewards H Wang, Y Lin, W Xiong, R Yang, S Diao, S Qiu, H Zhao, T Zhang arXiv preprint arXiv:2402.18571, 2024	16	2024
Speciality vs generality: An empirical study on catastrophic forgetting in fine-tuning foundation models Y Lin, L Tan, H Lin, Z Zheng, R Pi, J Zhang, S Diao, H Wang, H Zhao, ... arXiv preprint arXiv:2309.06256, 2023	14	2023
Write and Paint: Generative Vision-Language Models are Unified Modal Learners S Diao, W Zhou, X Zhang, J Wang ICLR 2023, 0	14*
On the Difference of BERT-style and CLIP-style Text Encoders Z Chen, GH Chen, S Diao, X Wan, B Wang arXiv preprint arXiv:2306.03678, 2023	10	2023
Keyphrase generation with cross-document attention S Diao, Y Song, T Zhang arXiv preprint arXiv:2004.09800, 2020	9	2020

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors