P68
Diffusion Models for Large Contents
同样的方法也可用于Applications such as long images, looped motion, 360 images…
- Suppose model is trained on small, squared images, how to extend it to larger images?
- Outpainting is always a solution, but not a very efficient one!
Let us generate this image with a diffusion model only trained on squared regions:

- Generate the center region \(q(\mathbf{x} _ 1,\mathbf{x} _ 2)\)
- Generate the surrounding region conditioned on parts of the center image \(q(\mathbf{x} _ 3|\mathbf{x} _ 2)\)

Latency scales linearly with the content size!
✅ 根据左边的图生成右边的图,存在的问题:慢
✅ 直接生成大图没有这样的数据。
✅ 并行化的生成。
P69
DiffCollage
- Unlike autoregressive models, diffusion models can generate large contents in parallel!

P70
- A “large” diffusion model from “small” diffusion models!


P71
More Works
| ID | Year | Name | Note | Tags | Link |
|---|---|---|---|---|---|
| 2023 | Zhang et al., "DiffCollage: Parallel Generation of Large Content with Diffusion Models" | ||||
| 2023 | Jiménez, "Mixture of Diffusers for scene composition and high resolution image generation", arXiv 2023 | - Based on similar ideas but differ in how overlapping regions are mixed. ✅ 这种并行化方法可以用于各种 overlapping 的场景。 | |||
| 2023 | Bar-Tal et al., "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation", ICML 2023 |
本文出自CaterpillarStudyGroup,转载请注明出处。
https://caterpillarstudygroup.github.io/ImportantArticles/