What resolution do Stable Diffusion and Flux output natively?

SD 1.5: 512×512 default. SDXL: 1024×1024 default. Flux.1: 1024×1024 default with support up to 2048×2048. All are well below 4K (3840×2160), so upscaling is needed for print or high-res display.

Should I upscale in SD/ComfyUI or with a separate tool?

Integrated upscaling (Hires.fix, img2img upscale) re-runs generation at higher resolution and may change composition. External tools like Real-ESRGAN preserve exact output. Use integrated for composition variation, external for print-ready sizes.

Does Real-ESRGAN work well on Flux outputs?

Yes. Flux produces clean, noise-free outputs that Real-ESRGAN handles excellently. Expected quality is 9/10 for 4× upscales, comparable to or better than ComfyUI upscaling workflows.

Can I upscale LoRA-generated images?

Yes. The upscaler doesn't care about the generation method — it only sees the output image. LoRAs with strong stylization (anime, painterly) may benefit from anime-specific upscale variants, but Real-ESRGAN-thick works for most styles.

What about upscaling video frames from AnimateDiff?

Technically possible: export the frames, upscale each one, and recompile. It is slow, though: roughly 15 seconds per frame × 24 frames per second × the clip length in seconds. For video, use specialized tools like Topaz Video AI or run Real-ESRGAN in a batch script locally.
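To put that multiplication in concrete terms, here is a rough estimate in Python. The 15 seconds per frame and 24 fps figures come from the answer above; the function name and the 10-second clip are illustrative assumptions, and real timings will vary by device.

```python
def estimate_frame_upscale_seconds(clip_seconds: float,
                                   fps: float = 24.0,
                                   seconds_per_frame: float = 15.0) -> float:
    """Rough total time to upscale every frame of a clip, one frame at a time."""
    frame_count = clip_seconds * fps
    return frame_count * seconds_per_frame


# Hypothetical 10-second AnimateDiff clip: 240 frames at 15 s each
total = estimate_frame_upscale_seconds(10)
print(f"{total / 3600:.1f} hours")  # 3600 s, i.e. about one hour
```

Even a short clip lands in the range of an hour, which is why a dedicated video tool or an unattended local batch job is the more practical route.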

Can I chain multiple upscales for 8x or 16x?

Yes, but quality degrades with each pass. It is better to upscale once (4×) from a higher-resolution source. For the equivalent of 8× over a 1024×1024 image, generate at 2048×2048 and then upscale 4× to 8192×8192. This avoids compounding AI hallucinations.

Upscale Stable Diffusion and Flux Images to 4K (Free, No Upload)

The short answer: Stable Diffusion 1.5 outputs 512×512 natively; SDXL and Flux output 1024×1024. For 4K/print sizes, upscale with Real-ESRGAN. Upscale Free processes a 1024×1024 SDXL or Flux image to 4096×4096 in about 15 seconds, free, unlimited, no upload. Your generated art stays on your device.

Generating AI images is half the workflow. Getting them to usable sizes for print, social media covers, or high-res sharing is the other half. Here’s how to upscale your Stable Diffusion and Flux outputs efficiently.

Native Resolution by Model

Different AI image generators output different native resolutions:

Model	Default	Maximum	Notes
Stable Diffusion 1.5	512×512	768×768 (struggles above)	Older, smaller model
SD 2.1	768×768	1024×1024	Improved but superseded
SDXL	1024×1024	1536×1536	Current SD standard
SDXL Turbo	512×512	1024×1024	Fast but limited
Flux.1 [dev]	1024×1024	2048×2048	Newer, Black Forest Labs
Flux.1 [pro]	1024×1024	2048×2048	Commercial version

Above their native resolution, models tend to produce repeated elements such as duplicated subjects (the “2-heads” problem). It is better to generate at native size and then upscale.
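As a quick check on how far each default size is from 4K, the sketch below computes the smallest whole-number upscale factor that covers 3840×2160. The default sizes are copied from the table above; the function name is illustrative, and it assumes you are willing to crop a square image to fit a 16:9 target.

```python
import math

# Default output sizes taken from the table above (width, height in pixels)
NATIVE_SIZES = {
    "Stable Diffusion 1.5": (512, 512),
    "SD 2.1": (768, 768),
    "SDXL": (1024, 1024),
    "Flux.1 [dev]": (1024, 1024),
}

UHD_4K = (3840, 2160)


def min_integer_scale(source, target=UHD_4K):
    """Smallest whole-number factor at which the source covers the target in both dimensions."""
    source_w, source_h = source
    target_w, target_h = target
    return math.ceil(max(target_w / source_w, target_h / source_h))


for model, size in NATIVE_SIZES.items():
    print(f"{model}: {size[0]}x{size[1]} needs at least {min_integer_scale(size)}x")
```

For the 1024×1024 models the answer is 4×, which matches the upscale factor used throughout this guide; SD 1.5 at 512×512 would need 8× to cover the same width.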

Why External Upscaling Beats In-Generation

AI image generators have built-in upscaling options:

SD Hires.fix: generates low-res then resamples at higher res
img2img upscale: uses input as reference for new generation at higher res
ComfyUI upscale nodes: various built-in options

These work but share a problem: they re-run generation, which means:

Composition may change slightly (different facial features, slightly different poses)
Style may drift toward the training data’s average
Processing is slow (10-60 seconds for full re-generation)

External upscaling (Real-ESRGAN) is different:

Preserves exact composition — only adds detail, doesn’t change content
Preserves style fidelity — doesn’t drift toward baseline
Fast — 10-15 seconds for 4× upscale

For final outputs you care about (portfolio pieces, prints, commercial work), external upscaling is the right choice.

Workflow for SD Users

ComfyUI / Automatic1111 Workflow

Generate at native resolution (1024×1024 for SDXL)
Save as PNG (preserves full quality)
Open Upscale Free
Drag the PNG, select 4×
Download 4096×4096 result

Optional refinements:

For portraits, run face restoration (GFPGAN/CodeFormer) first
Apply any style-specific filters in the generation UI
Save metadata (prompt info) in a text file before upscaling
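A minimal sketch of the metadata step, assuming the UI embedded the prompt as uncompressed tEXt chunks in the PNG. Automatic1111 is commonly reported to use a `parameters` key and ComfyUI `prompt` and `workflow`; treat those key names as assumptions. The code uses only the Python standard library, ignores compressed or international text chunks (zTXt, iTXt), and does not verify chunk checksums. The function names and the sidecar naming are illustrative.

```python
from pathlib import Path

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def read_png_text_chunks(data: bytes) -> dict:
    """Collect keyword/value pairs from the tEXt chunks of a PNG byte string."""
    if not data.startswith(PNG_SIGNATURE):
        raise ValueError("not a PNG file")
    texts = {}
    pos = len(PNG_SIGNATURE)
    # Each chunk: 4-byte big-endian length, 4-byte type, data, 4-byte CRC
    while pos + 8 <= len(data):
        length = int.from_bytes(data[pos:pos + 4], "big")
        chunk_type = data[pos + 4:pos + 8]
        body = data[pos + 8:pos + 8 + length]
        if chunk_type == b"tEXt":
            # tEXt payload is "keyword", a NUL byte, then Latin-1 text
            key, _, value = body.partition(b"\x00")
            texts[key.decode("latin-1")] = value.decode("latin-1")
        if chunk_type == b"IEND":
            break
        pos += 12 + length  # skip length + type + data + CRC
    return texts


def save_sidecar(png_path: Path) -> Path:
    """Write any embedded text chunks to a .txt file next to the image."""
    texts = read_png_text_chunks(png_path.read_bytes())
    sidecar = png_path.with_suffix(".txt")
    sidecar.write_text(
        "\n\n".join(f"{key}:\n{value}" for key, value in texts.items()),
        encoding="utf-8",
    )
    return sidecar
```

Calling `save_sidecar(Path("my_render.png"))` (a hypothetical filename) would write `my_render.txt` beside the image. If the resulting file is empty, the UI likely stored its metadata in a chunk type this sketch does not read.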

Forge / Fooocus Users

Same workflow — these UIs also save PNG at native resolution. The upscale step is identical.

Workflow for Flux Users

Replicate/HuggingFace API

Generate at 1024×1024 (fast) or 2048×2048 (higher quality, slower)
Download the output PNG
Upscale with Upscale Free

For print-quality Flux output: generate at 2048×2048, then 4× upscale → 8192×8192. This is A2 poster quality.
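The A2 figure can be sanity-checked with a little arithmetic. The sketch below assumes the standard ISO A2 sheet size (420 × 594 mm) and that the square image is scaled to cover the whole sheet, with the overflow cropped. The function name is illustrative, and 300 DPI is used only as a commonly cited print target, not a requirement from any particular printer.

```python
MM_PER_INCH = 25.4
A2_MM = (420, 594)  # ISO A2 sheet: short side, long side


def effective_dpi(pixels_w, pixels_h, paper_mm=A2_MM):
    """DPI available when the image is scaled to fully cover the sheet."""
    paper_w_in = paper_mm[0] / MM_PER_INCH
    paper_h_in = paper_mm[1] / MM_PER_INCH
    # The tighter of the two dimensions limits the usable resolution
    return min(pixels_w / paper_w_in, pixels_h / paper_h_in)


print(round(effective_dpi(8192, 8192)))  # about 350 DPI on A2
print(round(effective_dpi(4096, 4096)))  # about 175 DPI on A2
```

Under these assumptions an 8192×8192 image clears a 300 DPI target on A2, while a 4096×4096 image lands closer to 175 DPI, which is usually fine for posters viewed from a distance but softer up close.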

BFL Pro API

Same workflow. Flux Pro’s native output is high quality and upscales cleanly.

Quality Considerations

Real-ESRGAN handles AI-generated images well because:

Clean pixels: AI outputs don’t have sensor noise, JPEG blocking, or lens softness. Real-ESRGAN was trained on these degradations, so it “expects” to remove them — but on AI input, it just preserves the clean input quality.

Smooth gradients: AI images have smoother color gradients than photos. Upscaling doesn’t introduce banding.

Consistent style: AI output has consistent detail density across the frame. Real-ESRGAN handles this consistency well.

Potential issues:

Over-smoothing of intentional artistic texture (if your prompt wanted “rough brush strokes,” upscaling may smooth them)
Loss of subtle noise that adds character to certain art styles
Face quality on portraits — faces are Real-ESRGAN’s weakest point

For portrait-heavy work, consider this sequence: generate → face restoration (GFPGAN) → Real-ESRGAN upscale. Restoring faces before upscaling addresses Real-ESRGAN’s weakest point first.

Batch Processing for Artists

Generating 50-100 SDXL images per day? Don’t upscale one at a time.

Generate your day’s batch in SD/Flux
Review and delete rejects
Select final 10-20 images
Drop entire folder into Upscale Free
Let it process in a browser tab for 5-10 minutes
Download ZIP

This scales to any catalog size up to your device’s memory limits.
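To get a feel for where the memory limit sits, here is a rough estimate of the uncompressed size of upscaled images. It assumes 8-bit RGBA pixels and, as a worst case, that every output in the batch is held in memory at once; how a given tool actually manages memory is not something this sketch knows. The function names are illustrative.

```python
def raw_image_bytes(width: int, height: int,
                    channels: int = 4, bytes_per_channel: int = 1) -> int:
    """Uncompressed size of one image (default: 8-bit RGBA)."""
    return width * height * channels * bytes_per_channel


def batch_raw_mib(count: int, width: int = 4096, height: int = 4096) -> float:
    """Worst-case memory in MiB if all upscaled images are held at once."""
    return count * raw_image_bytes(width, height) / 2**20


print(raw_image_bytes(4096, 4096) / 2**20)  # 64.0 MiB per 4096x4096 image
print(batch_raw_mib(20))                    # 1280.0 MiB for 20 images
```

A batch of 10-20 finals is comfortable on most desktops under these assumptions; on memory-constrained devices, splitting a large folder into smaller batches is the safer option.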

Compression and Format

After 4× upscale, your 1024×1024 PNG (~2MB) becomes a 4096×4096 PNG (~15-30MB). Considerations:

For print-on-demand (POD) platforms: keep as PNG if transparency is needed, otherwise JPEG at quality 95
For web upload: WebP at quality 90 is typically around 25-35% smaller than a JPEG of comparable visual quality
For printing: PNG or TIFF at full quality
For archiving: PNG

The 4× Sweet Spot

Real-ESRGAN variants come in 2×, 4×, and (rarely) 8×. Why 4× is usually right:

4× per side gives a 16× increase in area: 1024² → 4096² is 16 times as many pixels
Enough for most use cases — 4K monitors, A2 prints, POD products
Less hallucination than 8× — the further you upscale, the more the AI invents
Fast enough — 10-15 seconds vs 30-60 for multi-pass

For 2×, you can use the 4× model and downsample (Upscale Free does this automatically). For 8×, chain 4× + 2× if needed but expect quality degradation.
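The scale arithmetic behind these choices is simple enough to sketch. The helper names below are illustrative, and the example starts from a 1024×1024 source as used elsewhere in this guide.

```python
import math


def upscaled_size(width: int, height: int, factor: int):
    """Output dimensions after scaling each side by the given factor."""
    return width * factor, height * factor


def pixel_multiplier(factor: int) -> int:
    """How many times more pixels the output has than the input."""
    return factor ** 2


def chained_factor(*passes: int) -> int:
    """Overall per-side factor when several upscale passes are chained."""
    return math.prod(passes)


for factor in (2, 4, 8):
    w, h = upscaled_size(1024, 1024, factor)
    print(f"{factor}x: {w}x{h}, {pixel_multiplier(factor)}x the pixels")

print(chained_factor(4, 2))  # a 4x pass followed by a 2x pass gives 8x per side
```

At 8× the model has to fill in 64 output pixels for every source pixel, compared with 16 at 4×, which is one way to see why hallucinated detail becomes more likely as the factor grows.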

Privacy Advantage for AI Artists

Many AI artists are commercial users with concerns about their work appearing in future training datasets or being exposed in breaches. Browser-based upscaling eliminates these risks:

Your generated images are never uploaded to an upscaling service
No third-party logs of your outputs
No risk of unreleased commercial work leaking

For Midjourney Pro users, Discord community builds, or sellers on Etsy/Redbubble, this matters.

Start Upscaling

Drop your latest SD or Flux output into Upscale Free and see the 4× result in 15 seconds. Generated with care, upscaled with privacy.