Contents

Stable Diffusion

Published on Mar 14, 2026 Updated on Mar 17, 2026 One minute

Contents

arXiv Hugging Face

TL;DR

Motivations & Innovations

Approach

Model

Training Recipe

Data Recipe

Experiments

李宏毅

前提知识

KL 散度：衡量两概率分布差异，与交叉熵的关系及在生成模型中的作用。
最大似然估计（MLE）：从数据估计模型参数，负对数似然与 KL 散度的等价性。

text encoder
generation Model (Diffusion Model) -> latent 中间产物 (小分辨率图片或者看不懂的隐变量特征)
image decoder

1&2&3都是分段训练后合并的。

图片生成模型本质：

目标定义：分布越接近越好。