Diffusion
Diffusion models generate images (and sometimes video) by iteratively refining random noise into coherent visuals guided by a text prompt.
Most consumer image AI (Midjourney, DALL·E, Stable Diffusion) uses diffusion, which behaves differently from chat LLMs in latency, controls, and failure modes.
What it means
A diffusion model denoises step-by-step from noise to pixels, conditioned on your prompt and settings like aspect ratio, style, or reference image.
Why designers should care
Image UX needs progress for multi-second runs, seed/history for reproducibility, negative prompts or filters, and clear rights or safety messaging on outputs.
Example
A marketing tool generates ad variants from a brief; users see step progress, pick from four thumbnails, edit prompt, and regenerate one slot without rerunning the whole batch.
Common mistakes
- • Treating image generation like instant chat with no wait or cancel states.
- • No gallery/history when outputs are non-deterministic and users need to compare runs.
- • Hiding that diffusion can produce artifacts, wrong text, or off-brand visuals without review.