Vidu Q3 Pro 출시 — 지금 사용해 보세요
OpenAI Models

OpenAI Models

OpenAI's state-of-the-art AI models for text, image, and multimodal applications, Sora 2 is included

OpenAI's state-of-the-art AI models for text, image, and multimodal applications, Sora 2 is included

전체 모델

22개 모델
openai/gpt-image-2/edit
image-to-image

openai/gpt-image-2/edit

OpenAI's GPT Image 2 Edit enables image editing from natural-language instructions with one or more reference images. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

openai/gpt-image-2/text-to-image
text-to-image

openai/gpt-image-2/text-to-image

OpenAI's GPT Image 2 Text-to-Image generates high-quality images from natural-language prompts. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

wavespeed-ai/openai-whisper
speech-to-text

wavespeed-ai/openai-whisper

Whisper Large v3 speech-to-text: instant, accurate multilingual transcripts with automatic language detection and punctuation. Upload audio to get transcripts. Ready-to-use REST API, no coldstarts, affordable pricing.

wavespeed-ai/openai-whisper-turbo
speech-to-text

wavespeed-ai/openai-whisper-turbo

Accurate speech-to-text with OpenAI Whisper Large v3 Turbo: multilingual transcripts with auto language detection and punctuation. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

openai/dall-e-2
text-to-image

openai/dall-e-2

Original DALL-E 2 from OpenAI for classic text-to-image generation via the OpenAI Image Generation API. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

openai/gpt-image-1-high-fidelity
text-to-image

openai/gpt-image-1-high-fidelity

OpenAI GPT Image 1 High-Fidelity produces photorealistic, high-detail images for creative and production workflows, delivering improved texture and color fidelity. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

openai/sora-2/image-to-video
image-to-video

openai/sora-2/image-to-video

OpenAI Sora 2 generates realistic image-to-video content with synchronized audio, improved physics, sharper realism and steerability. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

openai/sora-2/image-to-video-pro
image-to-video

openai/sora-2/image-to-video-pro

OpenAI Sora 2 Image-to-Video Pro creates physics-aware, realistic videos with synchronized audio and greater steerability. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

openai/sora-2/text-to-video
text-to-video

openai/sora-2/text-to-video

OpenAI Sora 2 is a state-of-the-art text-to-video model with realistic visuals, accurate physics, synchronized audio, and strong steerability. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

openai/sora-2/text-to-video-pro
text-to-video

openai/sora-2/text-to-video-pro

OpenAI Sora 2 Text-to-Video Pro creates high-fidelity videos with synchronized audio, realistic physics, and enhanced steerability. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

openai/sora
text-to-video

openai/sora

Sora is OpenAI's multi-modal model that generates videos from text, images, or existing video inputs. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

openai/dall-e-3
text-to-image

openai/dall-e-3

OpenAI DALL·E 3 for high-fidelity text-to-image generation available as a managed API on WaveSpeedAI. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

openai/gpt-image-1-mini/text-to-image
text-to-image

openai/gpt-image-1-mini/text-to-image

GPT Image 1 Mini is a cost-efficient multimodal OpenAI model powered by GPT-5 that turns text or image prompts into high-quality images. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

openai/gpt-image-1-mini/edit
image-to-image

openai/gpt-image-1-mini/edit

GPT Image 1 Mini is a cost-efficient, natively multimodal OpenAI model that pairs GPT-5 language understanding with compact image editing and generation from text and image inputs to produce high-quality images. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

openai/gpt-image-1.5/text-to-image
text-to-image

openai/gpt-image-1.5/text-to-image

GPT Image 1.5 text to image is OpenAI’s fast, cost-efficient text-to-image generator powered by GPT-5 guidance. Create photorealistic shots, product renders, concept art, and stylized graphics from natural-language prompts (optionally conditioned with an image). Supports custom aspect ratios, seeds, negative prompts, hex color hints, and style presets. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

openai/gpt-image-1.5/edit
image-to-image

openai/gpt-image-1.5/edit

GPT Image 1.5 Edit is OpenAI’s image model for precise, natural-language edits. Add/remove objects, swap backgrounds, retouch faces, adjust colors/lighting, edit text/graphics, crop/resize, and apply hex color control. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

wavespeed-ai/openai-whisper-with-video
speech-to-text

wavespeed-ai/openai-whisper-with-video

OpenAI Whisper Large v3 (Video-to-Text) delivers high-accuracy multilingual transcription directly from video files, with automatic language detection and optional timestamped, subtitle-ready segments. Built for stable production use with a ready-to-use REST API, fast response, no cold starts, and predictable pricing.

openai/sora-2-pro/text-to-video
text-to-video

openai/sora-2-pro/text-to-video

OpenAI Sora 2 Pro is a state-of-the-art text-to-video model with realistic physics, synchronized audio, and strong steerability. Supports multiple resolutions up to 1080p and durations up to 20 seconds. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

openai/sora-2-pro/image-to-video
image-to-video

openai/sora-2-pro/image-to-video

OpenAI Sora 2 Pro Image-to-Video creates physics-aware, realistic videos from reference images with synchronized audio and strong steerability. Supports 720p and 1080p resolutions with durations up to 20 seconds. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

openai/sora-2/characters
video-to-text

openai/sora-2/characters

OpenAI Sora 2 Characters creates reusable character IDs from video references for consistent character appearance across Sora 2 generations. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

openai/gpt-image-1/text-to-image
text-to-image

openai/gpt-image-1/text-to-image

OpenAI GPT Image-1 generates images from text prompts from OpenAI's latest text-to-image model, ideal for creating visual assets. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

openai/gpt-image-1
image-to-image

openai/gpt-image-1

OpenAI's gpt-image-1 enables image generation and image editing via OpenAI's image API, ideal for creating and refining images. Ready-to-use REST inference API, best performance, no coldstarts, affordable pricing.

OpenAI Models

Cutting-edge OpenAI models across text, image, and multimodal creation—curated in one place. These models sit at the front line of generative AI, combining strong reasoning, cinematic rendering, and reliable performance for real-world workflows.

Catalog

  1. Sora-2 / Image-to-Video — Add motion to a single image with physics-aware dynamics and stable identities.
  2. Sora-2 / Image-to-Video Pro — Higher fidelity and longer, smoother camera language for editorial or production shots.
  3. Sora-2 / Text-to-Video — Generate scenes directly from text prompts; strong temporal consistency.
  4. Sora-2 / Text-to-Video Pro — Pro-grade steerability and long-range coherence for complex sequences.
  5. GPT-Image-1 / Text-to-Image — Fast, prompt-faithful images with editability and tool-friendly outputs.
  6. DALL·E 3 — Clean composition and rich detail for concepting and illustration.
  7. DALL·E 2 — Lightweight text-to-image for quick drafts and style exploration.
  8. Sora (legacy) — Earlier Sora generation for baseline motion tests and rapid previews.
  9. Openai-whisper — High-accuracy multilingual speech recognition model for precise transcription with automatic language detection and punctuation.
  10. Openai-whisper-turbo — Optimized Whisper variant delivering the same accuracy with significantly faster transcription speed for real-time and large-scale use.
  11. Openai/gpt-image-1-mini/text-to-image generates high-quality images directly from text prompts with GPT-5-level understanding and efficiency, ideal for creative and design tasks.
  12. Openai/gpt-image-1-mini/edit enables intelligent image editing and refinement via natural-language instructions, preserving style and composition while applying precise changes.
  13. Openai/gpt-image-1-high-fidelity delivers ultra-detailed, photorealistic image generation powered by GPT-5, offering superior texture, lighting, and realism for professional-grade creative and design applications.
  14. Openai/gpt-image-1.5/text-to-image generates high-quality images from natural-language prompts with cost-efficient performance, producing coherent composition and clean aesthetics for UI concepts, marketing visuals, and fast creative ideation.
  15. Openai/gpt-image-1.5/text-to-image delivers high-quality text-to-image generation with strong prompt understanding and optimized synthesis, enabling rapid iteration and scalable visual production for design, prototyping, and creative workflows.
  16. Openai/gpt-image-2/edit enables high-fidelity image editing from natural-language instructions and reference images, preserving visual coherence, stylistic consistency, and fine-grained detail for marketing assets, design refinement, and fast creative iteration.
  17. Openai/gpt-image-2/text-to-image delivers high-quality text-to-image generation with strong prompt adherence, clean composition, and polished aesthetics, enabling scalable visual creation for UI concepts, campaign assets, and rapid creative ideation.

Why OpenAI Models?

  1. State-of-the-art quality — Physics-aware video, synchronized audio, and high-fidelity images with strong prompt adherence.
  2. End-to-end workflow — Text-to-image, image-to-video, and text-to-video in one stack; smooth handoff between models.
  3. Pro-grade control — Seeds, duration/aspect, camera language, and edit ops for consistent, repeatable results.
  4. Wide style range — From photoreal and documentary to anime, illustration, and cinematic looks—without plastic over-sharpening.

OpenAI Models API — 가격 및 성능

OpenAI Models 컬렉션의 모든 모델을 단일 REST API로 실행하세요. 생성당 과금 — 구독 없음, 최소 요금 없음 — 99.9% 가동률 인프라에서 업계 최고의 지연 시간을 제공합니다.

WaveSpeedAI에서 OpenAI Models을 사용하는 이유

투명한 가격

모든 OpenAI Models 모델에 대한 호출당 가격. 가격은 각 모델 페이지에 표시되며 플랫폼 수수료는 추가되지 않습니다.

낮은 지연 시간에 최적화

대부분의 OpenAI Models 이미지 모델은 2초 이내에 완료됩니다. 비디오 및 3D 모델은 셀프 호스팅 대안보다 몇 배 더 빠릅니다.

99.9% 가동률

다중 리전 페일오버와 자동 재시도로 프로바이더 장애 중에도 운영 트래픽을 온라인 상태로 유지합니다.

자주 묻는 질문

OpenAI Models API는 얼마인가요?+

각 모델에는 모델 페이지에 호출당 자체 가격이 표시되어 있습니다. 성공한 생성 단위로 청구되며 구독 요금이나 최소 요금은 없습니다.

WaveSpeedAI에서 OpenAI Models 모델은 얼마나 빠릅니까?+

이 컬렉션의 이미지 모델은 일반적으로 2초 이내에 완료됩니다. 비디오 및 3D 모델은 길이와 해상도에 따라 다르지만 보통 셀프 호스팅 실행보다 몇 배 더 빠릅니다.

신용카드 없이 API를 시험해 볼 수 있나요?+

예 — 가입 시 모든 계정에 $20의 무료 크레딧이 제공되며, 이는 대부분의 OpenAI Models 모델에서 수백 번의 호출에 충분합니다.

속도 제한이 있나요?+

표준 계정에는 넉넉한 동시 작업 제한이 있습니다. Enterprise 플랜은 맞춤형 RPM, 더 높은 동시성, 전용 용량을 제공합니다 — 자세한 내용은 영업팀에 문의하세요.