Home/Explore/Seedance Video Models/bytedance/seedream-v4

text-to-image

bytedance/seedream-v4

Seedream 4.0 is a state-of-art image model by Bytedance. Seedream 4.0: Surpassing Nano Banana in every aspect.

Doc
width
height
If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API.
If set to true, the function will wait for the image to be generated and uploaded before returning the response. It allows you to get the image directly in the response. This property is only available through the API.
Attention: Only paid user can use this model. You can top up to continue.

Idle

Transform it into 3D pixel art.

Your request will cost $0.027 per run.

For $1 you can run this model approximately 37 times.

One more thing:

ExamplesView all

American retro style: a girl wearing a polka-dot dress with sunglasses adorning her head.
Relief style, a Cupid angel
series key visuals template, consistent lighting and angle, title top: “ZENITH HEADPHONES”, per-image variable subject: a pair of black wireless headphones, brand palette subtle gray and metallic sheen, same background texture, identical composition for all items, studio look
portrait KV series, cinematic and moody style, consistent color grading deep blue and gold accents, fixed camera look (85mm shallow depth), interchangeable persona: a thoughtful tech entrepreneur, mid-shot, looking to the side, reserved lower-third text: "ALEX CHEN -- Founder & CEO"
tech key visual, a stylized 3D rendered human brain with glowing neural pathways, representing intelligence, with UI label callouts: "Real-time Analytics", "Secure Encryption", "Cross-platform Sync", dark background with bright cyan and magenta neon accents, precise spacing for text, depth and rim lighting, clear text areas
event KV series, a modern tech event poster with a badge in the top-right corner that reads "DAY 1". The poster features a prominent headline "EXPLORE THE FUTURE" and a secondary line "Join us for a day of groundbreaking technology." The background is a cohesive gradient from vibrant teal to dark violet. The text should appear as a visual element, bold and stylized, maintaining a clean layout. Focus on the overall visual impact, not precise typography.
Draw a chart showing the typical vegetation distribution in four diferent climate zones: tropical rainforest. temperate forest,desert, and tundra.
Design a retro website for a high-end art museum, adopting an earthy color tone, with a concise and neat layout,focusing on displaying large images of the museum's collection of artworks.
An elderly man sits in the park, documentary photography, Leica monochrome filter, faded high contrast, surreal composition.
A girl dressed in elegant attire walks along a tree-lined path, holding a parasol, in the style of a Monet oil painting.
Rococo style, a young girl in an ornate dress
Outdoor photography: A teenage girl with a backpack is rock climbing on a mountain peak.
Close-up of a girl's face, side lighting
Cinematic quality: The princess and the web consultant embrace before the castle, gazing at each other with tender affection.
Urban photography: The bustling streets of Tokyo
Cthulhu-style: A girl stands before an ancient castle, facing the camera.
Documentary photography style: A girl stands in an old-fashioned alleyway, smiling at the camera.

README

Seedream 4.0 is a state-of-art image model by Bytedance. Seedream 4.0: Surpassing Nano Banana in every aspect.

Multi-modal image generation support: Seedream 4.0 is the first to support multi-modal image generation, enabling text-to-image, image-editing, and group image generation with a single model.

Outstanding model advantages: It features five major highlights: precise instruction editing, high feature retention, deep understanding ability, ultra-fast inference speed, and ultra-high-resolution output. For example, it takes as little as 1.8 seconds to generate a 2K image in text-to-image mode.

Diverse scene applications: Covers various scenarios such as commercial design (including posters, brand clothing, packaging, e-commerce, etc.), entertainment (such as anime and movie roles), fine art, architectural design, and more. It supports various complex editing operations, such as object addition and deletion, attribute change, style change, structural adjustment (e.g., face swapping), etc.

Prompt writing guide: Use clear and specific instructions following the formula of "change action + change object + target feature." While generating multiple images, use words like "a series of," "group of images" to maintain consistency.

Text-to-image writing guide: The image content in coherent natural language and group images for a specific scene description. Use professional vocabulary in its original language and special image terms to match the scene description.

Problem self-check list: Identify common problems such as incorrect instructions, inadequate description, missing details, or redundant instructions. It recommends rechecking image editing objects, and insufficient aesthetic effects, and enhancing background consistency.

Rich editing functions: It includes style/image editing (element addition and deletion, frame/brush editing, style transformation, texture replacement, etc.), structural editing (object structure modification, object reshaping, etc.), and feature editing (color tone, detail transformation, attribute adjustment, etc.).