Weng, Lilian. (Jul 2021). What are diffusion models? Lil’Log. https://lilianweng.github.io/posts/2021-07-11-diffusion-models/.

  • Diffusion models are inspired by non-equilibrium thermodynamics. They define a Markov chain of diffusion steps to slowly add random noise to data and then learn to reverse the diffusion process to construct desired data samples from the noise. Unlike VAE or flow models, diffusion models are learned with a fixed procedure and the latent variable has high dimensionality (same as the original data).
  • Several diffusion-based generative models have been proposed with similar ideas underneath, including diffusion probabilistic models (Sohl-Dickstein et al., 2015), noise-conditioned score network (NCSN; Yang & Ermon, 2019), and denoising diffusion probabilistic models (DDPM; Ho et al., 2020).
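The "slowly add random noise" part of the description above can be sketched in a few lines of NumPy. This is only an illustration under stated assumptions: the linear schedule and its constants (1000 steps, betas from 1e-4 to 0.02) follow the setup reported by Ho et al. (2020), while the helper names `make_schedule` and `q_sample` and the random stand-in data are made up for this note.

```python
import numpy as np

def make_schedule(num_steps=1000, beta_start=1e-4, beta_end=0.02):
    """Linear variance schedule (assumed); returns betas and cumulative products of (1 - beta)."""
    betas = np.linspace(beta_start, beta_end, num_steps)
    alpha_bars = np.cumprod(1.0 - betas)
    return betas, alpha_bars

def q_sample(x0, t, alpha_bars, rng):
    """Draw x_t ~ q(x_t | x_0) = N(sqrt(abar_t) * x0, (1 - abar_t) * I) in one shot."""
    noise = rng.standard_normal(x0.shape)
    x_t = np.sqrt(alpha_bars[t]) * x0 + np.sqrt(1.0 - alpha_bars[t]) * noise
    return x_t, noise

rng = np.random.default_rng(0)
betas, alpha_bars = make_schedule()
x0 = rng.uniform(-1.0, 1.0, size=(4, 8))        # hypothetical stand-in for a batch of data
x_early, _ = q_sample(x0, 10, alpha_bars, rng)  # still close to x0
x_late, noise_late = q_sample(x0, 999, alpha_bars, rng)  # almost pure Gaussian noise
```

Note that `x_late` has the same shape as `x0`, which is the "latent variable has high dimensionality (same as the original data)" point made above.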
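The spark that set off the current wave of diffusion models was DDPM (Denoising Diffusion Probabilistic Model), proposed in 2020.
A VAE uses an encoder, a probabilistic latent space, and a decoder. During training, the encoder predicts a mean and a variance for each image. A latent vector is then sampled from the Gaussian distribution defined by those values and passed to the decoder, whose output is expected to resemble the input image. The loss includes a KL divergence term. A notable advantage of VAEs is that they can generate a wide variety of images: at sampling time, one simply draws a latent vector from a Gaussian distribution and the decoder creates a new image from it.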
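The sampling step and the KL term in the VAE description can be written out as a minimal sketch. It assumes a diagonal Gaussian posterior and a standard normal prior, which is the usual VAE setup; the function names and the example encoder outputs are hypothetical, and no encoder or decoder network is included.

```python
import numpy as np

def reparameterize(mu, log_var, rng):
    """Sample z = mu + sigma * eps with eps ~ N(0, I) (the reparameterization trick)."""
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * log_var) * eps

def kl_to_standard_normal(mu, log_var):
    """KL( N(mu, diag(sigma^2)) || N(0, I) ), summed over latent dimensions, one value per example."""
    return -0.5 * np.sum(1.0 + log_var - mu**2 - np.exp(log_var), axis=-1)

rng = np.random.default_rng(0)
mu = np.array([[0.0, 0.0], [1.0, -1.0]])   # hypothetical encoder means for two images
log_var = np.zeros((2, 2))                 # hypothetical encoder log-variances (sigma = 1)
z = reparameterize(mu, log_var, rng)       # latent vectors that would be fed to the decoder
kl = kl_to_standard_normal(mu, log_var)    # 0 when the posterior already equals the prior

# At generation time no encoder is needed: draw z from the prior and decode it.
z_new = rng.standard_normal((1, 2))
```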

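Six years after the first GAN paper and seven years after the VAE paper, a groundbreaking model appeared: the denoising diffusion probabilistic model (DDPM). DDPM combines the strengths of both models and excels at creating diverse and realistic images.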

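The basic idea of diffusion models is inspired by non-equilibrium statistical physics: an iterative forward diffusion process gradually destroys the structure in the data distribution, and a reverse diffusion process is then learned to restore that structure in the data.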
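To make the reverse direction concrete, here is a hedged sketch of a single denoising step in the style of the DDPM sampling algorithm (Ho et al., 2020), using the variance choice sigma_t^2 = beta_t. The function name `p_sample_step` is made up for this note, and because no trained network is available here, the true noise is passed in as an oracle stand-in for the model's prediction.

```python
import numpy as np

def p_sample_step(x_t, t, eps_pred, betas, alpha_bars, rng):
    """One ancestral sampling step x_t -> x_{t-1}, given a noise prediction eps_pred."""
    alpha_t = 1.0 - betas[t]
    mean = (x_t - betas[t] / np.sqrt(1.0 - alpha_bars[t]) * eps_pred) / np.sqrt(alpha_t)
    if t == 0:
        return mean                          # no fresh noise is added at the final step
    z = rng.standard_normal(x_t.shape)
    return mean + np.sqrt(betas[t]) * z      # assumes sigma_t^2 = beta_t

rng = np.random.default_rng(0)
betas = np.linspace(1e-4, 0.02, 1000)        # assumed linear schedule
alpha_bars = np.cumprod(1.0 - betas)

x0 = rng.uniform(-1.0, 1.0, size=(4, 8))     # hypothetical stand-in for data
eps = rng.standard_normal(x0.shape)
# Forward process: noise x0 by one step (index 0 is the first diffusion step).
x_t = np.sqrt(alpha_bars[0]) * x0 + np.sqrt(1.0 - alpha_bars[0]) * eps
# Reverse step with an oracle noise prediction; a trained model would supply eps_pred.
recovered = p_sample_step(x_t, 0, eps, betas, alpha_bars, rng)
```

With a perfect noise prediction the last step returns the original data, which is why training in DDPM reduces to teaching a network to predict the added noise.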


  • One of the principal advantages of DDPM is that the formation of the original motion sequence can be retained. It means that we can easily apply more constraints during the denoising process.
  • By specifying desired properties, such as noise levels and variability, diffusion modelling can generate predictable and consistent synthetic data, making it easier to train models.
  • However, this regularity can also limit the ability of synthetic data to capture the complexity and variability of real-world data. In general, real data is preferred for training models, as it better reflects the diversity of real-world conditions. Synthetic data can nevertheless be a useful supplement, as shown in Fig. 4, when real data is scarce or expensive.


