Follow
Fuxiao Liu
Title
Cited by
Cited by
Year
Mitigating hallucination in large multi-modal models via robust instruction tuning
F Liu, K Lin, L Li, J Wang, Y Yacoob, L Wang
ICLR 2024, 2023
206*2023
Hallusionbench: You see what you think? or you think what you see? an image-context reasoning benchmark challenging for gpt-4v (ision), llava-1.5, and other multi-modality models
F Liu*, T Guan*, Z Li, L Chen, Y Yacoob, D Manocha, T Zhou
CVPR 2024, 2023
115*2023
Visual News: Benchmark and Challenges in News Image Captioning
F Liu, Y Wang, T Wang, V Ordonez
EMNLP 2021 (Oral), 2021
109*2021
MMC: Advancing multimodal chart understanding with large-scale instruction tuning
F Liu, X Wang, W Yao, J Chen, K Song, S Cho, Y Yacoob, D Yu
NAACL 2024, 2024
412024
COVID-VTS: Fact Extraction and Verification on Short Video Platforms
F Liu, Y Yacoob, A Shrivastava
EACL 2023 (Oral), 2023
312023
Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences
X Wang, Y Zhou, X Liu, H Lu, Y Xu, F He, J Yoon, T Lu, G Bertasius, F Liu, ...
ACL 2024, 2024
262024
Towards understanding in-context learning with contrastive demonstrations and saliency maps
F Liu, P Xu, Z Li, H Song
arXiv preprint arXiv:2307.05052, 2023
252023
DocumentCLIP: Linking Figures and Main Body Text in Reflowed Documents
F Liu, H Tan, C Tensmeyer
ICPRAI 2024, 2023
212023
Large language models and causal inference in collaboration: A comprehensive survey
X Liu, P Xu, J Wu, J Yuan, Y Yang, Y Zhou, F Liu, T Guan, H Wang, T Yu, ...
arXiv preprint arXiv:2403.09606, 2024
152024
On the Safety Concerns of Deploying LLMs/VLMs in Robotics: Highlighting the Risks and Vulnerabilities
X Wu, R Xian, T Guan, J Liang, S Chakraborty, F Liu, B Sadler, ...
CVPR 2024 Workshop on Vision and Language for Autonomous Driving and Robotics, 2024
52024
Mosaic IT: Enhancing Instruction Tuning with Data Mosaics
M Li, P Chen, C Wang, H Zhao, Y Liang, Y Hou, F Liu, T Zhou
arXiv preprint arXiv:2405.13326, 2024
12024
SCP: Soft Conditional Prompt Learning for Aerial Video Action Recognition
X Wang, R Xian, T Guan, F Liu, D Manocha
IROS 2024 (Oral), 2024
2024
From Multimodal LLM to Human-level AI: Modality, Instruction, Reasoning, Efficiency and Beyond
H Fei, Y Yao, Z Zhang, F Liu, A Zhang, T Chua
LREC-COLING 2024, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–13