Follow
Hanze Dong
Hanze Dong
Other names董 瀚泽, 董 瀚澤
Salesforce Research
Verified email at salesforce.com - Homepage
Title
Cited by
Cited by
Year
Raft: Reward ranked finetuning for generative foundation model alignment
H Dong*, W Xiong*, D Goyal, Y Zhang, W Chow, R Pan, S Diao, J Zhang, ...
TMLR (Invited Presentation @ ICLR 2025), 2023
3322023
Iterative preference learning from human feedback: Bridging theory and practice for rlhf under kl-constraint
W Xiong*, H Dong*, C Ye*, Z Wang, H Zhong, H Ji, N Jiang, T Zhang
ICML 2024, 2024
133*2024
Local augmentation for graph neural networks
S Liu, R Ying, H Dong, L Li, T Xu, Y Rong, P Zhao, J Huang, D Wu
ICML 2022, 14054-14072, 2022
1272022
Weakly supervised disentangled generative causal representation learning
X Shen, F Liu, H Dong, Q Lian, Z Chen, T Zhang
JMLR (Invited Presentation @ ICML 2023) 23 (241), 1-55, 2022
126*2022
Mitigating the alignment tax of RLHF
Y Lin, H Lin, W Xiong, S Diao, J Liu, J Zhang, R Pan, H Wang, W Hu, ...
EMNLP 2024, 2024
106*2024
DetGPT: Detect What You Need via Reasoning
R Pi, J Gao, S Diao, R Pan, H Dong, J Zhang, L Yao, J Han, H Xu, ...
EMNLP 2023, 14172–14189, 2023
99*2023
Rlhf workflow: From reward modeling to online rlhf
H Dong*, W Xiong*, B Pang*, H Wang*, H Zhao, Y Zhou, N Jiang, ...
TMLR, 2024
802024
Bayesian invariant risk minimization
Y Lin*, H Dong*, H Wang, T Zhang
CVPR 2022 (Oral), 16021-16030, 2022
802022
Lmflow: An extensible toolkit for finetuning and inference of large foundation models
S Diao*, R Pan*, H Dong*, KS Shum, J Zhang, W Xiong, T Zhang
NAACL 2024 (Best Demo Award), 2024
522024
MLLM-Protector: Ensuring MLLM's Safety without Hurting Performance
R Pi, T Han, Y Xie, R Pan, Q Lian, H Dong, J Zhang, T Zhang
EMNLP 2024, 2024
502024
Mathematical models of overparameterized neural networks
C Fang, H Dong, T Zhang
Proceedings of the IEEE 109 (5), 683-703, 2021
392021
Higher-order weighted graph convolutional networks
S Liu, L Chen, H Dong, Z Wang, D Wu, Z Huang
arXiv preprint arXiv:1911.04129, 2019
352019
Online iterative reinforcement learning from human feedback with general preference model
C Ye*, W Xiong*, Y Zhang*, H Dong*, N Jiang, T Zhang
NeurIPS 2024, 2024
32*2024
Vocabulary-informed Zero-shot and Open-set Learning
Y Fu, X Wang, H Dong, YG Jiang, M Wang, X Xue, L Sigal
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2019
262019
Reverse Diffusion Monte Carlo
X Huang*, H Dong*, HAO Yifan, Y Ma, T Zhang
ICLR 2024, 2024
25*2024
Spurious feature diversification improves out-of-distribution generalization
Y Lin, L Tan, Y Hao, H Wong, H Dong, W Zhang, Y Yang, T Zhang
ICLR 2024, 2024
222024
Over parameterized two-level neural networks can learn near optimal feature representations
C Fang, H Dong, T Zhang
arXiv preprint arXiv:1910.11508, 2019
212019
Particle-based variational inference with preconditioned functional gradient flow
H Dong, X Wang, Y Lin, T Zhang
ICLR 2023, 2023
192023
Learning the Compositional Domains for Generalized Zero-shot Learning
H Dong, Y Fu, SJ Hwang, L Sigal, X Xue
Computer Vision and Image Understanding, 103454, 2022
15*2022
Think: Thinner key cache by query-driven pruning
Y Xu, Z Jie, H Dong, L Wang, X Lu, A Zhou, A Saha, C Xiong, D Sahoo
ICLR 2025 (Spotlight), 2025
92025
The system can't perform the operation now. Try again later.
Articles 1–20