Seedream 4.0 至 5.0 完整教學：文字生圖、圖像編輯與多圖生成

ByteDance 的 Seedream 系列從 4.0 快速演進至 5.0，每個版本都帶來圖像生成、編輯與智能推理方面的新能力。本教學涵蓋 4.0 至 5.0 的完整範圍——每個版本的最佳應用場景、應選用哪些模型變體，以及如何透過 WaveSpeedAI 的 API 獲得生產品質的成果。

模型系列總覽

Seedream 4.0 至 5.0 的產品線支援三種輸入類型——文字、單張圖像及多張圖像——可實現文字生成圖像、圖像編輯、多圖融合，以及具備主題一致性的批次序列生成。

每個主要版本都有其獨特優勢：

版本	定位	最適合	價格（WaveSpeedAI）
4.0	高效率	快速迭代、版面感知海報、網格設計、成本敏感型生產	$0.027/張
4.5	深度編輯與排版	人像、品牌視覺、清晰文字渲染、4K 海報構圖	$0.04/張
5.0-Lite	輕量級 5.0	快速 5.0 生成與編輯，易用的入門選項	現已推出
5.0-Preview	知識與推理	熱門話題、網路搜尋、邏輯推理、領域專業內容	即將推出

Seedream 4.0：版面感知生成

Seedream 4.0 針對多格海報、含文案的概念設計、系列主視覺（KV）及社群媒體素材進行了優化。它擅長網格排版、為標題與副標題規劃留白空間，以及提升文字可讀性。

主要規格

預設輸出：2048x2048（2K）
最高解析度：4096x4096
推理速度：2K 圖像約 1.8 秒
長寬比：1:1、3:2、4:3、16:9、21:9 及自訂

模型變體

Seedream 4.0 在 WaveSpeedAI 上提供四個變體，各自針對不同的工作流程設計：

bytedance/seedream-v4 — 文字生成圖像。從文字提示詞生成圖像。適合海報、概念藝術及社群媒體圖像。

bytedance/seedream-v4/edit — 圖像到圖像。修改現有圖像：服裝替換、背景更換、材質變更、室內重新設計。最多支援 10 張參考圖像。

bytedance/seedream-v4/sequential — 批次文字生成圖像。一次生成多張具有跨圖一致性的圖像。適合角色設定集、廣告活動及步驟示意圖。

bytedance/seedream-v4/edit-sequential — 批次圖像到圖像。多圖輸入搭配批次輸出。可實現多圖融合、跨組風格轉換及 A/B 變體比較。

文字生成圖像提示詞（V4）

使用 Seedream 4.0 撰寫提示詞時，請指定主體、版面（網格、三聯畫等）、文字位置（標題、副標題、行動呼籲）及偏好風格。

2x2 網格海報

2x2 grid poster layout, clean margins for typography, title at top center:
"SUMMER COLLECTION", subtitle: "New Arrivals 2026". Panel 1: beachside resort;
Panel 2: sunset cocktail; Panel 3: tropical flowers; Panel 4: ocean waves.
Consistent color grading, cinematic lighting, brand color #3CA2F6,
high legibility background, minimal clutter

三聯畫

Horizontal triptych panels, left-to-right narrative: mountain sunrise ->
hiking trail -> summit celebration, unified palette warm earth tones,
soft vignette, clear gutters, strong typographic hierarchy,
space reserved for CTA "START YOUR ADVENTURE"

極簡海報

Minimal poster, large title center: "INNOVATION SUMMIT", small subtitle
below: "March 2026 • San Francisco", single focal object: abstract
geometric sculpture, monochrome + accent #3CA2F6, high legibility
background, grid-based layout

漫畫格

4-panel comic strip layout, speech bubble placeholders.
Panel 1: developer stares at screen; Panel 2: AI generates solution;
Panel 3: developer celebrates; Panel 4: "It was that easy?"
Bold line art, flat shading, clear gutters, high readability

API 範例：文字生成圖像

import wavespeed

output = wavespeed.run(
    "bytedance/seedream-v4",
    {"prompt": "2x2 grid poster, title: 'TECH EXPO 2026', four futuristic product concepts, clean margins, cinematic lighting, brand color blue"},
)

print(output["outputs"][0])

圖像編輯（V4 Edit）

編輯變體在保留主體身份、光線與構圖的同時修改現有圖像。請使用清晰、結構化的提示詞，遵循以下模式：動作 + 對象 + 目標特徵 + 約束條件。

服裝替換

Outfit swap for portrait, replace clothing with elegant navy blazer;
keep pose and composition; accessories: gold watch;
makeup/hair unchanged; preserve skin tone and lighting;
clean edges, no artifacts

背景替換

Background replacement for subject, keep subject edges;
new environment: modern office with floor-to-ceiling windows;
match light direction and color temperature;
soft contact shadows; no haloing

室內重新設計

Interior finish swap, update wall to exposed brick,
floor to dark hardwood, furniture upholstery to charcoal linen;
layout and lighting unchanged; realistic PBR textures

API 範例：圖像編輯

import wavespeed

output = wavespeed.run(
    "bytedance/seedream-v4/edit",
    {
        "prompt": "Replace the background with a tropical beach at sunset, match light direction, soft shadows",
        "image": "https://example.com/portrait.jpg",
    },
)

print(output["outputs"][0])

序列生成（V4 Sequential）

序列變體在一次呼叫中生成多張圖像，整組圖像在風格、身份與色調上保持一致。您必須在提示詞和 max_images 參數中同時指定圖像數量。

角色設計集

Generate 6 character sheets of a cyberpunk hacker.
Image 1: neutral pose; Image 2: action pose; Image 3: side profile;
Image 4: back view; Image 5: happy expression; Image 6: serious expression.
Same outfit and palette, clean turnaround style.

廣告活動

Generate 4 poster concepts of the same coffee brand campaign.
Image 1: headline "WAKE UP", morning light;
Image 2: headline "FUEL UP", afternoon energy;
Image 3: headline "WIND DOWN", evening warmth;
Image 4: headline "DREAM ON", night ambiance.
Keep brand color brown/gold, consistent grid and margins, cinematic lighting.

API 範例：序列生成

import wavespeed

output = wavespeed.run(
    "bytedance/seedream-v4/sequential",
    {
        "prompt": "Generate 4 images of a sneaker in different colorways. Image 1: white/blue; Image 2: black/gold; Image 3: red/white; Image 4: green/cream. Studio lighting, identical angle and composition, clean background.",
        "max_images": 4,
    },
)

for url in output["outputs"]:
    print(url)

費用說明：序列模型按 max_images 計費，而非實際輸出數量。若您設定 max_images=4 但提示詞中只描述了 2 張圖像，仍會被收取 4 張的費用。請務必使提示詞中的數量與 max_images 一致。

Seedream 4.5：排版與深度編輯

Seedream 4.5 在 4.0 的基礎上大幅改進了文字渲染、提示詞遵循度、美學品質及參考圖像一致性。凡涉及排版、品牌視覺或人像編輯的工作，均推薦使用此版本。

相較 4.0 的主要改進

強化排版：適用於海報、標誌、UI 及行銷版面的清晰可讀文字
設計師級構圖：處理具有清晰層次結構的複雜海報式版面
更強的提示詞遵循度：嚴格遵循對主體、版面及風格的詳細描述
更高解析度：支援 2560x1440 至 4096x4096（最低解析度高於 V4）
更佳的參考一致性：保留參考圖像的面部特徵、光線及色調

模型變體

與 V4 相同，Seedream 4.5 在 WaveSpeedAI 上提供四個變體：

變體	模型路徑	類型	使用場景
基礎版	`bytedance/seedream-v4.5`	文字生成圖像	排版密集海報、品牌視覺
編輯版	`bytedance/seedream-v4.5/edit`	圖像到圖像	人像編輯、產品修圖
序列版	`bytedance/seedream-v4.5/sequential`	批次文字生成圖像	一致系列、活動組合
編輯序列版	`bytedance/seedream-v4.5/edit-sequential`	批次圖像到圖像	多圖融合、風格轉換

長寬比	建議解析度
1:1	2048x2048
4:3	2688x2016
3:2	2688x1792
16:9	2560x1440
正方形 4K	4096x4096

文字渲染最佳實踐

Seedream 4.5 的突出特點是圖像內的精確文字生成。請遵循以下準則以獲得最佳效果：

使用雙引號括住必須出現在圖像中的文字：Generate a poster with the title "Seedream 4.5"
指定字型特徵：“bold sans-serif”、“elegant script”、“handwritten”
描述文字位置：“title top-center”、“subtitle below”、“CTA bottom-right”
保持文字簡短：1-10 個字詞效果最佳；長段落可能出現不一致問題
使用較高解析度：2048x2048 或以上能獲得明顯更清晰的排版效果

範例：品牌海報

Minimalist tech conference poster, dark navy background.
Large white all-caps title at the top: "AI SUMMIT 2026".
Small gray subtitle below: "San Francisco • June 15-17".
Abstract holographic geometric shape centered.
Brand color accent #3CA2F6. Clean grid layout, generous whitespace.

API 範例：排版密集生成

import wavespeed

output = wavespeed.run(
    "bytedance/seedream-v4.5",
    {
        "prompt": "Coffee shop menu board, chalkboard style, title 'DAILY SPECIALS' in bold chalk lettering, items: Espresso $3, Latte $4, Cappuccino $4.50, warm ambient lighting, cozy cafe atmosphere",
        "size": "2048x2048",
    },
)

print(output["outputs"][0])

基於參考的生成（V4.5 Edit）

Seedream 4.5 Edit 擅長從參考圖像中提取並保留視覺特徵：

色調轉換

Change Image 1's color tone to match Image 2's color tone

妝容轉換

Transfer the makeup from Image 2 onto the person in Image 1

品牌風格套用

Apply Image 1's brand design style to the product in Image 2,
create a similar brand series promotional image,
include all design modules from Image 1

Seedream 5.0-Preview：智能與推理

Seedream 5.0-Preview 引入了超越傳統圖像生成的能力。它優先考慮知識與智能而非純粹的美感，新增了即時網路搜尋、精確編輯控制及進階邏輯推理。

注意：對於純粹的視覺美感與寫實主義，Seedream 4.5 仍是推薦選擇。完整的 5.0 正式版將同時結合智能與美感。

即時網路搜尋

5.0-Preview 是首個支援基於搜尋的生成的圖像生成模型。模型會根據您的提示詞智能判斷是否需要搜尋：

時效性詞彙：近期產品發布、當前事件
特定實體：名人、品牌、地點
長尾查詢：需要事實準確性的利基主題

會觸發搜尋的範例提示詞：

Generate iPhone 17 Pro Max concept design

Reference the Duolingo app interface, design a vocabulary
flashcard page with word and streak counter, incorporate
the green owl mascot

Generate a Nordic Winter Olympics poster: Norwegian aurora
background, skier in national uniform, include Olympic
elements and mascot

智能邏輯推理

5.0-Preview 能處理需要理解情境與多步驟決策的複雜操作：

分類與分配

Classify the flowers in Image 1 by variety, arrange them
separately in the three vases shown in Image 2

物理世界理解

Two stationery rulers, top is a 20cm plastic ruler,
bottom is a 10cm steel ruler

3D 推理

Generate the 3D assembled form based on the packaging
flat layout diagram

領域專業知識

Reference this set of CAD drawings, generate a realistic
building visualization

Human respiratory system anterior view diagram showing:
nasal cavity, nostrils, oral cavity, pharynx, larynx,
trachea, left and right main bronchi, left and right
lungs, and diaphragm

基於範例的編輯

無需描述複雜的轉換方式，只需透過前後對比範例向模型展示您想要的效果：

Reference the change from Image 1 to Image 2, apply the
same operation to Image 3

適用於髮型改變、場景替換、材質轉換及視角變換。

提示詞工程指南

以下技巧適用於所有 Seedream 4.0 至 5.0 版本。

使用自然語言，而非標籤列表

撰寫連貫的敘述，而非零碎的關鍵字列表：

避免：

girl, lavish dress, parasol, tree-lined path, oil painting, Monet style

建議：

A girl in a lavish dress walking under a parasol along a tree-lined path,
in the style of a Monet oil painting

提示詞結構公式

[主體] + [動作/姿態] + [環境/場景] + [風格] + [技術細節] + [文字內容]

範例：

A professional barista (subject) crafting latte art (action) in a modern
specialty coffee shop (environment), photorealistic style (style),
warm morning light through large windows, shallow depth of field (technical),
a chalkboard behind them reading "ARTISAN ROASTERS" (text content)

編輯提示詞

進行圖像編輯時，請使用具體、明確的指令，清楚說明哪些部分需要變更，哪些部分保持不變：

避免：Make it look better

建議：Replace the overcast sky with a vivid sunset backdrop, warm orange tones; keep the building and foreground unchanged

複雜編輯的視覺標記

當文字描述不足以精確定位時，可在參考圖像上使用箭頭、邊界框或塗鴉來標示需要修改的特定區域。

常見錯誤

指令相互矛盾：“Photorealistic cartoon character”——請選擇單一風格方向
提示詞過於複雜：從簡單開始，逐步增加細節
忽略長寬比：根據用途匹配尺寸（社群媒體用正方形，橫幅用橫向）
編輯指令模糊：避免使用”change it”等代名詞——請明確說明”it”指的是什麼

選擇正確的版本

快速決策指南

需要速度和低成本？ → Seedream 4.0
需要圖像中清晰的文字？ → Seedream 4.5
需要品牌級海報？ → Seedream 4.5
需要一致的多圖組合？ → V4 或 V4.5 Sequential
需要編輯現有照片？ → V4 或 V4.5 Edit
需要當前事件的圖像？ → Seedream 5.0-Preview
需要知識驅動的內容？ → Seedream 5.0-Preview

詳細比較

能力	4.0	4.5	5.0-Preview
文字生成圖像	是	是	是
圖像編輯	是	是（更佳）	是
多圖輸入	是	是	是
序列生成	是	是	是
文字渲染	良好	優秀	良好
網路搜尋	否	否	是
邏輯推理	基礎	基礎	進階
最高解析度	4096x4096	4096x4096	4K
最低解析度	~320x320	2560x1440	—
速度	最快	中等	中等
費用	$0.027	$0.04	—

版本限制

Seedream 4.0：小型文字可能重複或降質；編輯精確度低於 4.5。

Seedream 4.5：偶有模糊或裁切問題；成本與生成時間高於 4.0。

Seedream 5.0-Preview：部分圖像具有 AI 生成感；偶有比例問題；文字結構不穩定；圖表/數據推理能力有限。目前優先考慮智能而非美感。

WaveSpeedAI 上的所有可用模型

模型	類型	價格	最適合
`bytedance/seedream-v4`	文字生成圖像	$0.027	海報、網格版面、概念設計
`bytedance/seedream-v4/edit`	圖像到圖像	$0.027	服裝替換、背景更換、修圖
`bytedance/seedream-v4/sequential`	批次文字生成圖像	$0.027/張	角色設定集、活動組合
`bytedance/seedream-v4/edit-sequential`	批次圖像到圖像	$0.027/張	多圖融合、A/B 變體
`bytedance/seedream-v4.5`	文字生成圖像	$0.04	排版、品牌視覺、4K 海報
`bytedance/seedream-v4.5/edit`	圖像到圖像	$0.04	人像編輯、風格/特徵轉換
`bytedance/seedream-v4.5/sequential`	批次文字生成圖像	$0.04/張	品牌系列、一致活動
`bytedance/seedream-v4.5/edit-sequential`	批次圖像到圖像	$0.04/張	多圖編輯、設計探索
`bytedance/seedream-v5.0-lite`	文字生成圖像	$0.035	知識驅動生成、網路搜尋
`bytedance/seedream-v5.0-lite/edit`	圖像到圖像	$0.035	智能編輯、特徵轉換
`bytedance/seedream-v5.0-lite/sequential`	批次文字生成圖像	$0.035/張	一致的智能系列
`bytedance/seedream-v5.0-lite/edit-sequential`	批次圖像到圖像	$0.035/張	多圖智能編輯

快速入門

在 WaveSpeedAI 註冊並取得您的 API 金鑰
安裝 SDK：pip install wavespeed
根據上方的決策指南選擇您的模型
運用結構公式與最佳實踐撰寫提示詞
生成並迭代：根據結果優化提示詞

import wavespeed

# 使用 Seedream 4.5 進行文字生成圖像
output = wavespeed.run(
    "bytedance/seedream-v4.5",
    {"prompt": "A sleek product showcase poster, title 'NEXT GEN' in bold white sans-serif, dark gradient background, floating smartphone with holographic screen, cinematic lighting, brand color #3CA2F6"},
)

print(output["outputs"][0])

import wavespeed

# 使用 Seedream 4.0 進行圖像編輯
output = wavespeed.run(
    "bytedance/seedream-v4/edit",
    {
        "prompt": "Change the outfit to a formal black suit, keep the same pose and background lighting",
        "image": "https://example.com/portrait.jpg",
    },
)

print(output["outputs"][0])

import wavespeed

# 使用 Seedream 4.0 進行序列生成
output = wavespeed.run(
    "bytedance/seedream-v4/sequential",
    {
        "prompt": "Generate 3 step-by-step tutorial visuals for making pour-over coffee. Image 1: grinding beans; Image 2: pouring water in circular motion; Image 3: finished cup with steam. Uniform warm style, numbered labels.",
        "max_images": 3,
    },
)

for url in output["outputs"]:
    print(url)

無論您是在建構行銷自動化、大規模創建社群媒體內容，還是開發創意應用程式，WaveSpeedAI 上的 Seedream 4.0 至 5.0 系列都能提供從快速迭代到智能知識驅動生成的完整解決方案。

模型系列總覽

Seedream 4.0：版面感知生成

主要規格

模型變體

文字生成圖像提示詞（V4）

API 範例：文字生成圖像

圖像編輯（V4 Edit）

API 範例：圖像編輯

序列生成（V4 Sequential）

API 範例：序列生成

Seedream 4.5：排版與深度編輯

相較 4.0 的主要改進

模型變體

推薦解析度（V4.5）

文字渲染最佳實踐

API 範例：排版密集生成

基於參考的生成（V4.5 Edit）

Seedream 5.0-Preview：智能與推理

即時網路搜尋

智能邏輯推理

基於範例的編輯

提示詞工程指南

使用自然語言，而非標籤列表

提示詞結構公式

編輯提示詞

複雜編輯的視覺標記

常見錯誤

選擇正確的版本

快速決策指南

詳細比較

版本限制

WaveSpeedAI 上的所有可用模型

快速入門

相關文章

Phota Edit現已登陸WaveSpeedAI

Phota Text-to-Image現已登陸WaveSpeedAI

2026年最佳免費AI圖像生成器：10+模型，一鍵生成，零煩惱

Kling Image O3現已登陸WaveSpeedAI

WaveSpeedAI vs Media.io 去浮水印工具：哪個才是真正的贏家？

Recraft V4：一家小型AI新創如何在圖像生成領域超越Midjourney與DALL-E