Longcat Image Text To Image
Playground
Try it on WavespeedAI!LongCat-Image is a 6B parameter bilingual (Chinese-English) text-to-image model from Meituan, excelling at multilingual text rendering, photorealism, and deployment efficiency. Ready-to-use REST inference API with best performance and no cold starts.
Features
LongCat-Image — Text-to-Image
LongCat-Image is an open-source, bilingual (Chinese-English) foundation model for image generation developed by Meituan. With only 6B parameters, it addresses key challenges in multilingual text rendering, photorealism, deployment efficiency, and developer accessibility.
Where LongCat-Image fits best
- Chinese and English text rendering in images
- Photorealistic image generation
- High-volume generation with efficient resource usage
- Marketing and product visuals with text overlays
Key Features
• Exceptional Efficiency and Performance
With only 6B parameters, LongCat-Image outperforms larger open-source models across multiple benchmarks, demonstrating efficient model design.
• Powerful Chinese Text Rendering
Superior accuracy in rendering Chinese characters with industry-leading Chinese dictionary coverage. More stable than existing SOTA open-source models.
• Remarkable Photorealism
Innovative data strategy and training framework delivers high-quality, photorealistic image generation.
• Bilingual Support
Natively supports both Chinese and English prompts with excellent text rendering in both languages.
• Resource-conscious
The efficient 6B parameter architecture keeps GPU usage moderate, ideal for batch jobs and cost-sensitive pipelines.
Related Models on WaveSpeedAI
- LongCat-Image Edit – Image editing with the same bilingual text rendering capabilities.
More Image Tools on WaveSpeedAI
- Nano Banana Pro – Google’s Gemini-based text-to-image model for sharp, coherent, prompt-faithful visuals.
- Seedream V4 – ByteDance’s style-consistent, multi-image generator ideal for posters and campaigns.
- Qwen Edit Plus – An enhanced Qwen-based image editor for precise inpainting and local style changes.
Authentication
For authentication details, please refer to the Authentication Guide.
API Endpoints
Submit Task & Query Result
# Submit the task
curl --location --request POST "https://api.wavespeed.ai/api/v3/wavespeed-ai/longcat-image/text-to-image" \
--header "Content-Type: application/json" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}" \
--data-raw '{
"size": "1024*1024",
"output_format": "jpeg",
"seed": -1,
"enable_sync_mode": false,
"enable_base64_output": false
}'
# Get the result
curl --location --request GET "https://api.wavespeed.ai/api/v3/predictions/${requestId}/result" \
--header "Authorization: Bearer ${WAVESPEED_API_KEY}"
Parameters
Task Submission Parameters
Request Parameters
| Parameter | Type | Required | Default | Range | Description |
|---|---|---|---|---|---|
| prompt | string | Yes | - | The positive prompt for the generation. | |
| size | string | No | 1024*1024 | 256 ~ 1536 per dimension | The size of the generated image in pixels (width*height). |
| output_format | string | No | jpeg | jpeg, png, webp | The format of the output image. |
| seed | integer | No | -1 | -1 ~ 2147483647 | The random seed to use for the generation. -1 means a random seed will be used. |
| enable_sync_mode | boolean | No | false | - | If set to true, the function will wait for the result to be generated and uploaded before returning the response. It allows you to get the result directly in the response. This property is only available through the API. |
| enable_base64_output | boolean | No | false | - | If enabled, the output will be encoded into a BASE64 string instead of a URL. This property is only available through the API. |
Response Parameters
| Parameter | Type | Description |
|---|---|---|
| code | integer | HTTP status code (e.g., 200 for success) |
| message | string | Status message (e.g., “success”) |
| data.id | string | Unique identifier for the prediction, Task Id |
| data.model | string | Model ID used for the prediction |
| data.outputs | array | Array of URLs to the generated content (empty when status is not completed) |
| data.urls | object | Object containing related API endpoints |
| data.urls.get | string | URL to retrieve the prediction result |
| data.has_nsfw_contents | array | Array of boolean values indicating NSFW detection for each output |
| data.status | string | Status of the task: created, processing, completed, or failed |
| data.created_at | string | ISO timestamp of when the request was created (e.g., “2023-04-01T12:34:56.789Z”) |
| data.error | string | Error message (empty if no error occurred) |
| data.timings | object | Object containing timing details |
| data.timings.inference | integer | Inference time in milliseconds |