GlossaryFoundations

Diffusion

Diffusion models generate images (and sometimes video) by iteratively refining random noise into coherent visuals guided by a text prompt.

Most consumer image AI (Midjourney, DALL·E, Stable Diffusion) uses diffusion, which behaves differently from chat LLMs in latency, controls, and failure modes.

What it means

A diffusion model denoises step-by-step from noise to pixels, conditioned on your prompt and settings like aspect ratio, style, or reference image.

Why designers should care

Image UX needs progress for multi-second runs, seed/history for reproducibility, negative prompts or filters, and clear rights or safety messaging on outputs.

Example

A marketing tool generates ad variants from a brief; users see step progress, pick from four thumbnails, edit prompt, and regenerate one slot without rerunning the whole batch.

Common mistakes

  • Treating image generation like instant chat with no wait or cancel states.
  • No gallery/history when outputs are non-deterministic and users need to compare runs.
  • Hiding that diffusion can produce artifacts, wrong text, or off-brand visuals without review.

Weekly AI UX notes

Patterns, prompts, and glossary updates for designers building AI products on Substack. No spam.

Subscribe on Substack