DiffDreamer: View Generation with Conditional Diffusion Models | ICCV 2023

Shengqu Cai, Eric Ryan Chan, Songyou Peng, Mohamad Shahbazi, Anton Obukhov, Luc Van Gool, Gordon Wetzstein

Single-image-conditioned “infinite nature” approach with conditional diffusion models.

ABSTRACT

Perpetual view generation-the task of generating long-range novel views by flying into a given image-has been a novel yet promising task. We introduce DiffDreamer, an unsupervised framework capable of synthesizing novel views depicting a long camera trajectory while training solely on internet-collected images of nature scenes. We demonstrate that image-conditioned diffusion models can effectively perform long-range scene extrapolation while preserving both local and global consistency significantly better than prior GAN-based methods.

FILES

CITATION

S. Cai, E. R. Chan, S. Peng, M. Shahbazi, A. Obukhov, L. Van Gool, G. Wetzstein, DiffDreamer: Consistent Single-view Perpetual View Generation with Conditional Diffusion Models, ICCV 2023.

@inproceedings{cai2022diffdreamer,
author = {S. Cai and E. R. Chan and S. Peng and M. Shahbazi and A. Obukhov and L. Van Gool and G. Wetzstein},
title = {DiffDreamer: Consistent Single-view Perpetual View Generation with Conditional Diffusion Models},
booktitle = {ICCV},
year = {2022},
}

Overview and Results


DiffDreamer extrapolates a long-range image sequence with high 3D multi-view consistency (bottom) given a single input image (top).

DiffDreamer extrapolates a long-range image sequence with high 3D multi-view consistency (bottom) given a single input image (top).

Please see additional results on the project website!

RELATED PROJECTS

You may also be interested in related projects on neural scene representations, such as :

  • Chan et al. EG3D. CVPR 2022 (link)
  • Chan et al. pi-GAN. CVPR 2021 (link)
  • Sitzmann et al. Implicit Neural Representations with Periodic Activation Functions. NeurIPS 2020 (link)
  • Mildenhall et al. NeRF, ECCV 2020 (link)
  • Sitzmann et al. Scene Representation Networks. NeurIPS 2019 (link)