Follow
Deyao Zhu
Deyao Zhu
Research Scientist, ByteDance
Verified email at bytedance.com - Homepage
Title
Cited by
Cited by
Year
MiniGPT-4: Enhancing vision-language understanding with advanced large language models
D Zhu, J Chen, X Shen, X Li, M Elhoseiny
International Conference on Learning Representations 2024, 2023
10882023
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning
J Chen, D Zhu, X Shen, X Li, Z Liu, P Zhang, R Krishnamoorthi, ...
arXiv preprint arXiv:2310.09478, 2023
1712023
ChatGPT Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched Visual Descriptions
D Zhu, J Chen, K Haydarov, X Shen, W Zhang, M Elhoseiny
Transactions on Machine Learning Research (TMLR), 2023
632023
Social-Implicit: Rethinking Trajectory Prediction Evaluation and The Effectiveness of Implicit Maximum Likelihood Estimation
A Mohamed, D Zhu, W Vu, M Elhoseiny, C Claudel
European Conference on Computer Vision (ECCV) 2022, 2022
442022
Video ChatCaptioner: Towards the Enriched Spatiotemporal Descriptions
J Chen, D Zhu, K Haydarov, X Li, M Elhoseiny
arXiv preprint arXiv:2304.04227, 2023
182023
Exploring Open-Vocabulary Semantic Segmentation from CLIP Vision Encoder Distillation Only
J Chen, D Zhu, G Qian, B Ghanem, Z Yan, C Zhu, F Xiao, SC Culatana, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision, 699-710, 2023
16*2023
Motion forecasting with unlikelihood training in continuous space
D Zhu, M Zahran, LE Li, M Elhoseiny
Conference on Robot Learning, 1003-1012, 2022
142022
RelTransformer: A Transformer-Based Long-Tail Visual Relationship Recognition
J Chen, A Agarwal, S Abdelkarim, D Zhu, M Elhoseiny
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
12*2022
Guiding Online Reinforcement Learning with Action-Free Offline Pretraining
D Zhu, Y Wang, J Schmidhuber, M Elhoseiny
arXiv preprint arXiv:2301.12876, 2023
62023
HalentNet: Multimodal Trajectory Forecasting with Hallucinative Intents
D Zhu, M Zahran, LE Li, M Elhoseiny
International Conference on Learning Representations, 2021, 2021
52021
Value Memory Graph: A Graph-Structured World Model for Offline Reinforcement Learning
D Zhu, LE Li, M Elhoseiny
International Conference on Learning Representations 2023, 2022
42022
Learning to disentangle latent physical factors for video prediction
D Zhu, M Munderloh, B Rosenhahn, J Stückler
Pattern Recognition: 41st DAGM German Conference, DAGM GCPR 2019, Dortmund …, 2019
42019
The system can't perform the operation now. Try again later.
Articles 1–12