ByteDance Seedream 4.5

bytedance/seedream-v4.5

ByteDance Seedream 4.5 is a next-gen text-to-image model optimized for typography: crisper text rendering, stronger prompt adherence, and up to 4K output for posters and brand visuals. It is served through a ready-to-use REST inference API with no cold starts, priced at $0.04 per generated image.

Input

  • width, height – 2048 × 2048 px. Range: 1024 - 4096.

Two further options are available only through the API:

  • If enabled, the output will be encoded into a Base64 string instead of a URL.
  • If set to true, the function will wait for the result to be generated and uploaded before returning the response, so you get the result directly in the response.
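
When the Base64 option is used, the caller has to turn the returned string back into image bytes. The following is a minimal sketch of that step only. The helper name `decode_base64_image` is hypothetical, and the handling of a `data:` URI prefix is an assumption, since the page does not say whether the string carries one.

```python
import base64
import binascii


def decode_base64_image(payload: str) -> bytes:
    """Decode a Base64 image string into raw bytes.

    Assumption: the string may or may not start with a data-URI prefix
    such as "data:image/png;base64,". Both forms are accepted here.
    """
    if payload.startswith("data:") and "," in payload:
        # Keep only the part after the first comma (the Base64 data).
        payload = payload.split(",", 1)[1]
    try:
        # validate=True rejects characters outside the Base64 alphabet.
        return base64.b64decode(payload, validate=True)
    except binascii.Error as exc:
        raise ValueError("payload is not valid Base64") from exc
```

The returned bytes can then be written to disk with `open(path, "wb")` or handed to an image library.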

Your request will cost $0.04 per run.

With $1 you can run this model approximately 25 times.

Examples

Nighttime outdoor photoshoot: A young man stands inside a public phone booth, holding a blue phone receiver to his ear. One hand is casually tucked into his pocket, and he strikes a relaxed posture. He wears a white T-shirt with a pattern, loose brown pants, and jacket draped over his arm. The booth's glass reflects city streetlights with a bokeh effect, creating a vintage film style.

Golden hour rooftop gathering, 16:9, 4K.
A group of four friends with diverse skin tones sit and stand around a low table on a city rooftop café, laughing and talking, no visible brands or logos. Use the provided reference portraits (if any) to keep their faces, hairstyles and outfits consistent.
Warm sunlight from the side, soft rim light on hair and shoulders, blurred city skyline in the background, string lights glowing above.
Casual modern outfits with natural wrinkles and fabric texture, realistic skin details, subtle makeup.
Cinematic lifestyle photography, shallow depth of field, slight film grain, clean composition, publish-ready campaign image.

Afternoon side light illuminates the camping tent, with the rough texture of the tent canvas clearly visible. A German Shepherd lies inside; its owner squats down to shake hands with it.

Five shimmering goldfish weave through crevices between stones; four are red-and-white, while one is silver-white. By the pond's edge, a golden shaded British Shorthair cat watches them intently, counting on blind luck. Watercolor style

High-end perfume bottle on a reflective black glass surface in a dark studio.
Transparent glass bottle with a simple rectangular shape, metallic cap, soft golden liquid inside, no logo or text.
A focused warm spotlight from above creates a bright highlight on the bottle and a soft reflection on the surface.
Background fades into deep black with a subtle gradient, ultra-realistic product photography, 4K, no text.

README

bytedance/seedream-v4.5 — Text-to-Image

Seedream 4.5 is ByteDance’s latest high-resolution image generation model, upgraded through large-scale training and architecture refinement. It is especially strong at typography, poster composition, and branded visuals, with clear text rendering and strong prompt adherence.

Model highlights

  • Enhanced typography – Renders sharp, legible text for posters, logos, UI, and marketing layouts.
  • Designer-level composition – Handles complex poster-style layouts with clear hierarchy (title, subtitle, body text, logos).
  • Strong prompt adherence – Closely follows detailed descriptions for subjects, layout, and style.
  • High-resolution output – Supports custom width/height with total pixel count from 2560×1440 up to 4096×4096.
  • Aesthetic quality – Benchmarked with strong performance on MagicBench and other visual quality suites.

Recommended use cases

  • Poster, banner, and KV design with embedded text
  • Brand visual, logo, and campaign asset creation
  • E-commerce product imagery and hero shots
  • Social media graphics where typography is part of the design
  • Presentation, landing-page, and in-app visuals

Pricing

  • $0.04 per generated image
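
The flat per-image price makes budgeting simple arithmetic. The sketch below shows one way to compute it. The function names are hypothetical, and `Decimal` is used because binary floats can round `1 // 0.04` down to 24 instead of 25.

```python
from decimal import Decimal

# Price per generated image in USD, taken from the pricing above.
PRICE_PER_IMAGE = Decimal("0.04")


def cost_for(images: int) -> Decimal:
    """Total cost in USD for a given number of generated images."""
    return PRICE_PER_IMAGE * images


def images_for(budget_usd: str) -> int:
    """How many whole images a budget (given as a decimal string) covers."""
    return int(Decimal(budget_usd) // PRICE_PER_IMAGE)
```

For example, `images_for("1")` gives 25, which matches the "approximately 25 times" figure quoted for a $1 budget.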

How to use

  1. Enter your prompt – Describe the subject, composition, text elements (e.g., title / subtitle / tagline), and overall style.

  2. Set size (width & height) – Choose pixel dimensions for your image. The model supports custom sizes as long as the total pixel count is within 2560×1440 ≤ width × height ≤ 4096×4096 (that is, 3,686,400 to 16,777,216 pixels).

  3. Run the job – Click Run to generate the image, then refine your prompt or size for the next iteration.

Suggested resolutions

Below are example resolutions that work well in practice and stay within the supported pixel range:

Aspect Ratio    Suggested Resolution (W × H)
1:1             2048 × 2048
4:3             2688 × 2016
3:2             2688 × 1792
16:9            2560 × 1440
Square 4K       4096 × 4096

You can freely adjust width and height as long as they respect the total pixel range.
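
For an aspect ratio that is not in the table, the smallest size that still meets the lower pixel bound can be derived directly. The sketch below is one way to do that with integer arithmetic. The function name `smallest_size` is hypothetical, and it does not assume any rounding of width or height to particular multiples, which the page does not specify.

```python
import math

# Total-pixel bounds stated in the README.
MIN_PIXELS = 2560 * 1440
MAX_PIXELS = 4096 * 4096


def smallest_size(ratio_w: int, ratio_h: int) -> tuple[int, int]:
    """Smallest (width, height) with an exact ratio_w:ratio_h aspect ratio
    whose total pixel count is at least MIN_PIXELS."""
    # Reduce the ratio so that 32:18 behaves like 16:9.
    g = math.gcd(ratio_w, ratio_h)
    ratio_w, ratio_h = ratio_w // g, ratio_h // g

    # Find the smallest integer k with (k*ratio_w) * (k*ratio_h) >= MIN_PIXELS.
    unit = ratio_w * ratio_h
    needed = -(-MIN_PIXELS // unit)   # ceil(MIN_PIXELS / unit)
    k = math.isqrt(needed - 1) + 1    # smallest k with k*k >= needed

    width, height = k * ratio_w, k * ratio_h
    if width * height > MAX_PIXELS:
        raise ValueError("no size with this ratio fits the pixel range")
    return width, height
```

For 16:9 this returns 2560 × 1440, the same value as the table. Larger sizes with the same ratio are obtained by increasing `k`.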

Notes

  • For text-heavy posters, slightly higher resolutions (e.g. 2048×2048 or above) give noticeably cleaner typography.
  • Keep logos and key text explicitly described (e.g. “white all-caps title at the top, small gray subtitle below”).
  • If you are using an image URL, make sure it is publicly accessible so the system can retrieve it.
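
The advice to describe text elements explicitly can be made routine by assembling prompts from named parts. This is only an illustrative sketch: the `PosterPrompt` class and its fields are hypothetical and simply mirror the items listed under "Enter your prompt" (subject, composition, text elements, style).

```python
from dataclasses import dataclass, field


@dataclass
class PosterPrompt:
    """Structured pieces of a prompt, joined into one string by render()."""
    subject: str
    composition: str
    style: str
    text_elements: list[str] = field(default_factory=list)

    def render(self) -> str:
        parts = [self.subject, self.composition, *self.text_elements, self.style]
        # Trim whitespace and trailing periods, then drop empty parts.
        cleaned = [p.strip().rstrip(".") for p in parts]
        # Capitalize the first letter and end each part with a period.
        sentences = [c[0].upper() + c[1:] + "." for c in cleaned if c]
        return " ".join(sentences)
```

A usage example:

```python
prompt = PosterPrompt(
    subject="Minimalist coffee festival poster",
    composition="centered cup illustration with wide margins",
    style="flat vector style, warm palette",
    text_elements=[
        "white all-caps title at the top",
        "small gray subtitle below",
    ],
).render()
```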

Model Comparison on WaveSpeedAI

Use Seedream 4.5 together with other models, depending on your priorities:

  • google/nano-banana-pro/text-to-image – Google’s Nano Banana Pro (Gemini 3.0 Pro Image family) is ideal for ultra-low-cost, multi-image generation, which suits large batches and exploratory runs.

  • Tongyi-MAI/Z-Image-Turbo (available as Z-Image on WaveSpeedAI) – Tongyi-MAI’s 6B, 8-step turbo model focuses on maximum speed and throughput while keeping photorealism and bilingual (EN/ZH) support.

  • wavespeed-ai/flux-2-pro/text-to-image – FLUX.2 [pro] is a flagship, general-purpose model for cinematic quality and complex scenes, great when you need broad stylistic range beyond typography-heavy posters.

  • bytedance/seedream-v4 – The previous Seedream generation, strong at high-resolution illustration and diverse styles; Seedream 4.5 builds on it with noticeably better text rendering and layout control for branding work.

Rule of thumb:

  • Choose Seedream 4.5 for posters, brand layouts, and any text-heavy creative.
  • Choose Nano Banana Pro or Z-Image-Turbo for fast, cheap, large-scale image batches.
  • Choose FLUX.2 [pro] for cinematic, style-flexible hero shots.
  • Choose Seedream V4 when you want its familiar look or need variety across illustration styles.
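
The rule of thumb above can be encoded as a small lookup when model selection is automated. This is a sketch under assumptions: the priority keys are hypothetical labels, and the model identifiers are copied from the comparison list, so the exact identifier under which Z-Image-Turbo is exposed on WaveSpeedAI may differ.

```python
# Hypothetical priority labels mapped to the models named in the comparison.
MODEL_FOR_PRIORITY = {
    "typography": "bytedance/seedream-v4.5",
    "cheap_batch": "google/nano-banana-pro/text-to-image",
    # Listed as "available as Z-Image on WaveSpeedAI"; identifier may differ.
    "speed": "Tongyi-MAI/Z-Image-Turbo",
    "cinematic": "wavespeed-ai/flux-2-pro/text-to-image",
    "illustration_variety": "bytedance/seedream-v4",
}


def pick_model(priority: str) -> str:
    """Return the model suggested for a priority label."""
    try:
        return MODEL_FOR_PRIORITY[priority]
    except KeyError:
        known = ", ".join(sorted(MODEL_FOR_PRIORITY))
        raise ValueError(f"unknown priority {priority!r}; expected one of: {known}")
```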