Visionllm: Large language model is also an open-ended decoder for vision-centric tasks W Wang, Z Chen, X Chen, J Wu, X Zhu, G Zeng, P Luo, T Lu, J Zhou, ... Advances in Neural Information Processing Systems (NeurIPS), 2023 | 166 | 2023 |
Language as queries for referring video object segmentation J Wu, Y Jiang, P Sun, Z Yuan, P Luo Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 103 | 2022 |
Universal instance perception as object discovery and retrieval B Yan, Y Jiang, J Wu, D Wang, P Luo, Z Yuan, H Lu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 84 | 2023 |
Watch only once: An end-to-end video action detection framework S Chen, P Sun, E Xie, C Ge, J Wu, L Ma, J Shen, P Luo Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 51 | 2021 |
Internvl: Scaling up vision foundation models and aligning for generic visual-linguistic tasks Z Chen, J Wu, W Wang, W Su, G Chen, S Xing, Z Muyan, Q Zhang, X Zhu, ... IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024 | 15 | 2024 |
Development of an effective model for computing rightmost eigenvalues of power systems with inclusion of time delays C Li, J Wu, C Duan, Z Du IEEE Transactions on Power Systems 34 (6), 4216-4227, 2019 | 11 | 2019 |
Self-supervised video representation learning with motion-aware masked autoencoders H Yang, D Huang, B Wen, J Wu, H Yao, Y Jiang, X Zhu, Z Yuan arXiv preprint arXiv:2210.04154, 2022 | 10 | 2022 |
Towards high-quality temporal action detection with sparse proposals J Wu, P Sun, S Chen, J Yang, Z Qi, L Ma, P Luo arXiv preprint arXiv:2109.08847, 2021 | 9 | 2021 |
Segment every reference object in spatial and temporal spaces J Wu, Y Jiang, B Yan, H Lu, Z Yuan, P Luo Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 6 | 2023 |
The first visual object tracking segmentation vots2023 challenge results M Kristan, J Matas, M Danelljan, M Felsberg, HJ Chang, LČ Zajc, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 5 | 2023 |
Exploring transformers for open-world instance segmentation J Wu, Y Jiang, B Yan, H Lu, Z Yuan, P Luo Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 3 | 2023 |
UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces J Wu, Y Jiang, B Yan, H Lu, Z Yuan, P Luo arXiv preprint arXiv:2312.15715, 2023 | 1 | 2023 |
A Simple Baseline for Open-World Tracking via Self-training B Wang, T Li, J Wu, Y Jiang, H Lu, Y He Proceedings of the 31st ACM International Conference on Multimedia, 2765-2774, 2023 | 1 | 2023 |
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models C Ma, Y Jiang, J Wu, Z Yuan, X Qi arXiv preprint arXiv:2404.13013, 2024 | | 2024 |
Multi-Level Contrastive Learning for Dense Prediction Task Q Guo, Y Yu, Y Jiang, J Wu, Z Yuan, P Luo arXiv preprint arXiv:2304.02010, 2023 | | 2023 |