#controlnet
80 articles - Page 7
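Kling Reference-to-Video Is Now Available on WaveSpeedAI
Kling Reference-to-Video lets you generate entirely new video content from a subject reference image or video while keeping appearance, identity, and scene logic consistent across all frames.
1 min read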
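Veo 3.1 Is Now Available on WaveSpeedAI
WaveSpeedAI, a global multimodal inference acceleration platform, today announced the launch of Veo 3.1, Google's latest video and audio generation model, now accessible through the WaveSpeedAI API.
1 min read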
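Wan2.2-Fun-Control Is Now Available on WaveSpeedAI
Wan2.2-Fun-Control is a next-generation video generation and control model designed to produce high-quality videos that strictly follow predefined control conditions.
1 min read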
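Framepack Is Now Available on WaveSpeedAI
Framepack, a cutting-edge autoregressive image-to-video model from lllyasviel (the creator of ControlNet), is now available on WaveSpeedAI. Framepack redefines how static images become video: by generating each frame based on the previous one, it produces smoother motion, higher temporal consistency, and more coherent narratives than traditional methods.
1 min read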
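Ghibli Is Now Available on WaveSpeedAI
Discover the groundbreaking Ghibli model on WaveSpeedAI, which delivers high-quality video generation with unprecedented ease of use and efficiency. Explore its features, use cases, and why WaveSpeedAI is the ideal platform for your creative needs.
1 min read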
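The InstantCharacter Model Is Now Available on WaveSpeedAI
We are excited to announce that InstantCharacter, the latest innovation from Tencent AI Lab and a state-of-the-art personalized character generation model, is now officially live on the WaveSpeed platform. Built on a scalable diffusion transformer framework, InstantCharacter offers high fidelity, strong generalization, and fine-grained text controllability, setting a new benchmark for character generation technology.
1 min read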
The Next Step in AI Image Editing: Meet Qwen-Image-Edit-2509
1 min read
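WaveSpeedAI Video Extension Tool: Quickly Extend Videos and Change Aspect Ratios Without Cropping
If you need to repurpose videos for multiple platforms (TikTok, YouTube, Instagram, Reels, Shorts, ads), cropping is not an option.
1 min read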