Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation Y Zhu, Y Wu, K Olszewski, J Ren, S Tulyakov, Y Yan International Conference on Learning Representations (ICLR), 2023 | 45 | 2023 |
Quantized GAN for Complex Music Generation from Dance Videos Y Zhu, K Olszewski, Y Wu, P Achlioptas, M Chai, Y Yan, S Tulyakov European Conference on Computer Vision (ECCV), 2022 | 23 | 2022 |
Learning Audio-Visual Correlations from Variational Cross-Modal Generation Y Zhu, Y Wu, H Latapie, Y Yang, Y Yan The International Conference on Acoustics, Speech, & Signal Processing (ICASSP), 2021 | 21 | 2021 |
Skeleton Sequence and RGB Frame Based Multi-Modality Feature Fusion Network for Action Recognition X Zhu, Y Zhu, H Wang, H Wen, Y Yan, P Liu ACM Transactions on Multimedia Computing, Communications, and Applications …, 2022 | 20 | 2022 |
Describing Unseen Videos via Multi-Modal Cooperative Dialog Agents Y Zhu, Y Wu, Y Yang, Y Yan European Conference on Computer Vision (ECCV), 2020 | 11 | 2020 |
Hierarchical HMM for Eye Movement Classification Y Zhu, Y Yan, O Komogortsev European Conference on Computer Vision Workshop (ECCV Workshop), 2020 | 10 | 2020 |
Saying the Unseen: Video Descriptions via Dialog Agents Y Zhu, Y Wu, Y Yang, Y Yan IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2021 | 8 | 2021 |
Boundary Guided Learning-Free Semantic Control with Diffusion Models Y Zhu, Y Wu, Z Deng, O Russakovsky, Y Yan Conference on Neural Information Processing Systems (NeurIPS), 2023 | 7 | 2023 |
Vision + X: A Survey on Multimodal Learning in the Light of Data Y Zhu, Y Wu, N Sebe, Y Yan arXiv preprint arXiv:2210.02884, 2022 | 5 | 2022 |
Denoising Diffusion Probabilistic Models to Predict the Density of Molecular Clouds D Xu, J Tan, CJ Hsu, Y Zhu The Astrophysical Journal (ApJ), 2023 | 4 | 2023 |
Multiview based 3D scene understanding on partial point sets Y Zhu, SE Shepstone, P Martínez-Nuevo, MS Kristoffersen, F Moutarde, ... arXiv preprint arXiv:1812.01712, 2018 | 4 | 2018 |
Diffusion in Diffusion: Cyclic One-Way Diffusion for Text-Vision-Conditioned Generation Y Yang, R Wang, Z Qian, Y Zhu, Y Wu International Conference on Learning Representations (ICLR), 2024 | 3 | 2024 |
Discrete Diffusion Reward Guidance Methods for Offline Reinforcement Learning M Coleman, O Russakovsky, C Allen-Blanchette, Y Zhu International Conference on Machine Learning Workshop (ICML Workshop), 2023 | 2 | 2023 |
Supplementing Missing Visions via Dialog for Scene Graph Generations Y Zhu, X Zhu, Y Shang, Z Zhao, Y Yan The International Conference on Acoustics, Speech, & Signal Processing (ICASSP), 2024 | 1 | 2024 |
D³: Scaling Up Deepfake Detection by Learning from Discrepancy Y Yang, Z Qian, Y Zhu, Y Wu arXiv preprint arXiv:2404.04584, 2024 | | 2024 |
Mining and Unifying Heterogeneous Contrastive Relations for Weakly-Supervised Actor-Action Segmentation B Duan, H Tang, C Sun, Y Zhu, Y Yan Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024 | | 2024 |
DETER: Detecting Edited Regions for Deterring Generative Manipulations S Wang, Y Zhu, R Wang, A Dharmasiri, O Russakovsky, Y Wu arXiv preprint arXiv:2312.10539, 2023 | | 2023 |
Unseen Image Synthesis with Diffusion Models Y Zhu, Y Wu, Z Deng, O Russakovsky, Y Yan arXiv preprint arXiv:2310.09213, 2023 | | 2023 |
Multimodal Learning and Generation Toward a Multisensory and Creative AI System Y Zhu Illinois Institute of Technology, 2023 | | 2023 |
Denoising Diffusion Probabilistic Models to Predict the Number Density of Molecular Clouds in Astronomy D Xu, J Tan, CJ Hsu, Y Zhu ICLR 2023 Workshop on Physics for Machine Learning (ICLR Workshop), 2023 | | 2023 |