‪Jiannan Wu‬ - ‪Google Scholar‬

Get my own profile

Cited by

	All	Since 2019
Citations	465	465
h-index	8	8
i10-index	7	7

0

240

120

60

180

20212022202320246 28 237 192

Public access

6 articles

1 article

available

not available

Based on funding mandates

Co-authors

Ping Luo (羅平)Associate Professor, The University of Hong KongVerified email at hku.hk
Yi JiangBytedanceVerified email at bytedance.com
Zehuan YuanBytedance Inc.Verified email at bytedance.com
Peize SunThe University of Hong KongVerified email at connect.hku.hk
Bin YanPhD student of Computer Vision, Dalian University of TechnologyVerified email at mail.dlut.edu.cn

Jiannan Wu

Jiannan Wu

The University of Hong Kong

Verified email at connect.hku.hk

Computer Vision Video Understanding Multimodal LLMs


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Visionllm: Large language model is also an open-ended decoder for vision-centric tasks W Wang, Z Chen, X Chen, J Wu, X Zhu, G Zeng, P Luo, T Lu, J Zhou, ... Advances in Neural Information Processing Systems (NeurIPS), 2023	166	2023
Language as queries for referring video object segmentation J Wu, Y Jiang, P Sun, Z Yuan, P Luo Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	103	2022
Universal instance perception as object discovery and retrieval B Yan, Y Jiang, J Wu, D Wang, P Luo, Z Yuan, H Lu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	84	2023
Watch only once: An end-to-end video action detection framework S Chen, P Sun, E Xie, C Ge, J Wu, L Ma, J Shen, P Luo Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021	51	2021
Internvl: Scaling up vision foundation models and aligning for generic visual-linguistic tasks Z Chen, J Wu, W Wang, W Su, G Chen, S Xing, Z Muyan, Q Zhang, X Zhu, ... IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024	15	2024
Development of an effective model for computing rightmost eigenvalues of power systems with inclusion of time delays C Li, J Wu, C Duan, Z Du IEEE Transactions on Power Systems 34 (6), 4216-4227, 2019	11	2019
Self-supervised video representation learning with motion-aware masked autoencoders H Yang, D Huang, B Wen, J Wu, H Yao, Y Jiang, X Zhu, Z Yuan arXiv preprint arXiv:2210.04154, 2022	10	2022
Towards high-quality temporal action detection with sparse proposals J Wu, P Sun, S Chen, J Yang, Z Qi, L Ma, P Luo arXiv preprint arXiv:2109.08847, 2021	9	2021
Segment every reference object in spatial and temporal spaces J Wu, Y Jiang, B Yan, H Lu, Z Yuan, P Luo Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	6	2023
The first visual object tracking segmentation vots2023 challenge results M Kristan, J Matas, M Danelljan, M Felsberg, HJ Chang, LČ Zajc, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	5	2023
Exploring transformers for open-world instance segmentation J Wu, Y Jiang, B Yan, H Lu, Z Yuan, P Luo Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	3	2023
UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces J Wu, Y Jiang, B Yan, H Lu, Z Yuan, P Luo arXiv preprint arXiv:2312.15715, 2023	1	2023
A Simple Baseline for Open-World Tracking via Self-training B Wang, T Li, J Wu, Y Jiang, H Lu, Y He Proceedings of the 31st ACM International Conference on Multimedia, 2765-2774, 2023	1	2023
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models C Ma, Y Jiang, J Wu, Z Yuan, X Qi arXiv preprint arXiv:2404.13013, 2024		2024
Multi-Level Contrastive Learning for Dense Prediction Task Q Guo, Y Yu, Y Jiang, J Wu, Z Yuan, P Luo arXiv preprint arXiv:2304.02010, 2023		2023

The system can't perform the operation now. Try again later.

Articles 1–15