Taming transformers for high-resolution image synthesis P Esser, R Rombach, B Ommer Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021 | 309 | 2021 |
A disentangling invertible interpretation network for explaining latent representations P Esser, R Rombach, B Ommer Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020 | 36 | 2020 |
High-resolution image synthesis with latent diffusion models R Rombach, A Blattmann, D Lorenz, P Esser, B Ommer Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 26 | 2022 |
Network-to-network translation with conditional invertible neural networks R Rombach, P Esser, B Ommer Advances in Neural Information Processing Systems 33, 2784-2797, 2020 | 23 | 2020 |
Imagebart: Bidirectional context with multinomial diffusion for autoregressive image synthesis P Esser, R Rombach, A Blattmann, B Ommer Advances in Neural Information Processing Systems 34, 3518-3532, 2021 | 22 | 2021 |
Geometry-free view synthesis: Transformers and no 3d priors R Rombach, P Esser, B Ommer Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 17 | 2021 |
Stochastic image-to-video synthesis using cinns M Dorkenwald, T Milbich, A Blattmann, R Rombach, KG Derpanis, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 15 | 2021 |
Making sense of cnns: Interpreting deep representations and their invariances with inns R Rombach, P Esser, B Ommer European Conference on Computer Vision, 647-664, 2020 | 10 | 2020 |
A note on data biases in generative models P Esser, R Rombach, B Ommer arXiv preprint arXiv:2012.02516, 2020 | 8 | 2020 |
High-Resolution Complex Scene Synthesis with Transformers M Jahn, R Rombach, B Ommer arXiv preprint arXiv:2105.06458, 2021 | 3 | 2021 |
Retrieval-Augmented Diffusion Models A Blattmann, R Rombach, K Oktay, B Ommer arXiv preprint arXiv:2204.11824, 2022 | 1 | 2022 |
Network fusion for content creation with conditional inns R Rombach, P Esser, B Ommer | 1 | 2020 |
Text-Guided Synthesis of Artistic Images with Retrieval-Augmented Diffusion Models R Rombach, A Blattmann, B Ommer arXiv preprint arXiv:2207.13038, 2022 | | 2022 |
Invertible neural networks for understanding semantics of invariances of CNN representations R Rombach, P Esser, A Blattmann, B Ommer Deep Neural Networks and Data for Automated Driving, 197-224, 2022 | | 2022 |
An Image is Worth 16× 16 Tokens: Visual Priors for Efficient Image Synthesis with Transformers R Rombach, P Esser, B Ommer, HCI IWR | | |