Visual chatgpt: Talking, drawing and editing with visual foundation models C Wu, S Yin, W Qi, X Wang, Z Tang, N Duan arXiv preprint arXiv:2303.04671, 2023 | 433 | 2023 |
Nuwa-xl: Diffusion over diffusion for extremely long video generation S Yin, C Wu, H Yang, J Wang, X Wang, M Ni, Z Yang, L Li, S Liu, F Yang, ... arXiv preprint arXiv:2303.12346, 2023 | 37 | 2023 |
Dragnuwa: Fine-grained control in video generation by integrating text, image, and trajectory S Yin, C Wu, J Liang, J Shi, H Li, G Ming, N Duan arXiv preprint arXiv:2308.08089, 2023 | 31 | 2023 |
ORES: Open-vocabulary Responsible Visual Synthesis M Ni, C Wu, X Wang, S Yin, L Wang, Z Liu, N Duan Proceedings of the AAAI Conference on Artificial Intelligence 38 (19), 21473 …, 2024 | 3 | 2024 |
Using Left and Right Brains Together: Towards Vision and Language Planning J Cen, C Wu, X Liu, S Yin, Y Pei, J Yang, Q Chen, N Duan, J Zhang arXiv preprint arXiv:2402.10534, 2024 | 1 | 2024 |
StrokeNUWA: Tokenizing Strokes for Vector Graphic Synthesis Z Tang, C Wu, Z Zhang, M Ni, S Yin, Y Liu, Z Yang, L Wang, Z Liu, J Li, ... arXiv preprint arXiv:2401.17093, 2024 | 1 | 2024 |
Learning 3D photography videos via self-supervised diffusion on single images X Wang, C Wu, S Yin, M Ni, J Wang, L Li, Z Yang, F Yang, L Wang, Z Liu, ... arXiv preprint arXiv:2302.10781, 2023 | | 2023 |