P68

Diffusion Models for Large Contents

同样的方法也可用于Applications such as long images, looped motion, 360 images…

  • Suppose model is trained on small, squared images, how to extend it to larger images?
  • Outpainting is always a solution, but not a very efficient one!

Let us generate this image with a diffusion model only trained on squared regions:

  1. Generate the center region \(q(\mathbf{x} _ 1,\mathbf{x} _ 2)\)
  2. Generate the surrounding region conditioned on parts of the center image \(q(\mathbf{x} _ 3|\mathbf{x} _ 2)\)

Latency scales linearly with the content size!

✅ 根据左边的图生成右边的图,存在的问题:慢
✅ 直接生成大图没有这样的数据。
✅ 并行化的生成。

P69

DiffCollage

  • Unlike autoregressive models, diffusion models can generate large contents in parallel!

P70

  • A “large” diffusion model from “small” diffusion models!

P71

More Works

IDYearNameNoteTagsLink
2023Zhang et al., "DiffCollage: Parallel Generation of Large Content with Diffusion Models"
2023Jiménez, "Mixture of Diffusers for scene composition and high resolution image generation", arXiv 2023- Based on similar ideas but differ in how overlapping regions are mixed.
✅ 这种并行化方法可以用于各种 overlapping 的场景。
2023Bar-Tal et al., "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation", ICML 2023

本文出自CaterpillarStudyGroup,转载请注明出处。

https://caterpillarstudygroup.github.io/ImportantArticles/