Start from Noise: Unique Fingerprint

Denoising Steps: Gradual Clarification

The model removes noise across multiple steps to restore the image.

Instead of finishing at once, it gradually refines toward the target image.

Adjust the step count with the slider and press Play to see the process.

Denoising Steps (Timeline)0

Step 0/50

Noise Level: 100%

Starting Point

Diffusion 'excavates' images from random noise.

Different starting numbers create different noise patterns, resulting in completely different images.

Two starting numbers compared - same prompt, different starting points yield different images.

Same prompt, different starting number → different results

Starting Number: 42

Different fingerprint≠

Starting Number: 123

Pure Noise

Signal

Scroll to watch the noise clear away

Text is converted to numbers, and those numbers guide image creation.

At each denoising step, text embedding guides the noise removal direction.

See the prompt → CLIP encoder → embedding → Cross-Attention → result pipeline.

Prompt

CLIP Encoder

Text Embedding

Cross-Attention

Guided Generation

Example Prompt:

"A cat sitting on a rainbow"

Text embedding influences noise removal direction at each step, converging toward an image matching the description.

ControlNet adds extra conditions like pose, depth, and edges.

When text isn't enough, image-based control signals provide precise guidance.

Click the 4 control types to see the workflow visualization.

Human skeleton guides body position and pose

Reference Image

Extract Control

Generate

Combine with Text

“From noise to art, guided by text and control.”