Overview
**Kling V3 Pro 静音** is a silent video generation model available through UniAll AI with the public model ID **`kling-v3-pro-silent`**. It is designed for teams that need high-quality AI video output without generated audio, making it useful for short-form assets, ads, product showcases, storyboard clips, and automated video pipelines where sound is added separately.
The model supports asynchronous generation through the `/v1/videos/generations` endpoint.
What the model can do
Kling V3 Pro 静音 supports three main generation modes:
- **Text to video**: create a video from a prompt.
- **Image to video**: animate a reference image with a prompt.
- **First/last frame video**: generate motion between a starting frame and an ending frame.
Supported parameters include:
- `prompt`: required for all modes
- `duration`: 3–15 seconds
- `aspect_ratio`: `16:9`, `9:16`, or `1:1`
- `resolution`: `standard`, `pro`, or `4k`
- `image_url`: for image-to-video
- `first_image_url` and `last_image_url`: for first/last-frame generation
- `video_count`: fixed at 1 video per request
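The constraints above can be checked on the client before a request is submitted. The following is a minimal sketch, not an official validator: the function name `validate_payload` is hypothetical, the `generation_mode` values are the ones used in the request examples on this page, and treating `duration`, `aspect_ratio`, `resolution`, and `video_count` as optional is an assumption.

```python
from typing import Any

# Allowed values copied from the parameter list above.
ASPECT_RATIOS = {"16:9", "9:16", "1:1"}
RESOLUTIONS = {"standard", "pro", "4k"}

# Image fields each mode needs. The mode names follow this page's
# request examples; the mapping itself is an illustrative helper.
REQUIRED_IMAGES = {
    "text_to_video": [],
    "image_to_video": ["image_url"],
    "first_last_frame": ["first_image_url", "last_image_url"],
}


def validate_payload(payload: dict[str, Any]) -> list[str]:
    """Return a list of problems; an empty list means no problem was found."""
    errors = []

    mode = payload.get("generation_mode")
    if mode not in REQUIRED_IMAGES:
        errors.append(f"unknown generation_mode: {mode!r}")
    else:
        for field in REQUIRED_IMAGES[mode]:
            if not payload.get(field):
                errors.append(f"{mode} requires {field}")

    if not payload.get("prompt"):
        errors.append("prompt is required for all modes")

    # Assumption: these fields are optional, so check them only if present.
    if "duration" in payload:
        duration = payload["duration"]
        if not isinstance(duration, (int, float)) or not 3 <= duration <= 15:
            errors.append("duration must be between 3 and 15 seconds")
    if "aspect_ratio" in payload and payload["aspect_ratio"] not in ASPECT_RATIOS:
        errors.append("aspect_ratio must be 16:9, 9:16, or 1:1")
    if "resolution" in payload and payload["resolution"] not in RESOLUTIONS:
        errors.append("resolution must be standard, pro, or 4k")
    if payload.get("video_count", 1) != 1:
        errors.append("video_count is fixed at 1")

    return errors
```

Returning a list of messages instead of raising on the first problem lets a UI or job queue report every issue in one pass.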
Pricing angle
Kling V3 Pro 静音 is billed by generated video duration. The listed user price for the Pro silent variant is **$0.07616 per second**, shown as approximately **¥0.55 / second**.
Example estimated user costs:
| Duration | Estimated cost |
|---:|---:|
| 5 seconds | $0.3808 |
| 10 seconds | $0.7616 |
| 15 seconds | $1.1424 |
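The estimates above are the per-second price multiplied by the duration. A small sketch of that arithmetic follows; the helper name `estimate_cost_usd` is hypothetical, and the rate is the listed Pro silent price, which may differ for other variants or accounts.

```python
from decimal import Decimal

# Listed user price for the Pro silent variant, per generated second.
PRICE_PER_SECOND_USD = Decimal("0.07616")


def estimate_cost_usd(duration_seconds: int) -> Decimal:
    """Estimate the user cost of one generated video of the given length."""
    if not 3 <= duration_seconds <= 15:
        raise ValueError("duration must be between 3 and 15 seconds")
    return PRICE_PER_SECOND_USD * duration_seconds


for seconds in (5, 10, 15):
    print(f"{seconds} seconds: ${estimate_cost_usd(seconds)}")
```

`Decimal` is used instead of `float` so that the multiplication stays exact at five decimal places.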
Pricing can vary by selected resolution and account context. UniAll AI also lists related Kling V3 variants such as Standard Silent, Pro Audio, and 4K options, each with separate per-second pricing. Failed generations are eligible for a refund under the platform's rules.
Who should use it
This model is a good fit for:
- Developers building AI video generation into apps or SaaS products
- Marketing teams producing silent ad creatives or product videos
- E-commerce sellers creating short product motion clips
- Content teams generating 9:16 vertical shorts or 16:9 landscape assets
- Workflow builders that need API-based asynchronous video jobs
- Agencies or platform operators integrating video generation into customer-facing tools
API usage
Endpoint:
```http
POST /v1/videos/generations
```
Example request for image-to-video:
```json
{
  "model": "kling-v3-pro-silent",
  "generation_mode": "image_to_video",
  "prompt": "A cinematic product reveal, soft studio lighting, smooth camera movement.",
  "image_url": "https://example.com/reference.png",
  "duration": 5,
  "aspect_ratio": "16:9",
  "resolution": "pro",
  "video_count": 1
}
```
Example request for text-to-video:
```json
{
  "model": "kling-v3-pro-silent",
  "generation_mode": "text_to_video",
  "prompt": "A futuristic city at sunrise, smooth drone shot, cinematic lighting.",
  "duration": 5,
  "aspect_ratio": "9:16",
  "resolution": "pro"
}
```
Example request for first/last-frame generation:
```json
{
  "model": "kling-v3-pro-silent",
  "generation_mode": "first_last_frame",
  "prompt": "Create a smooth transition between the two frames with natural camera motion.",
  "first_image_url": "https://example.com/start.png",
  "last_image_url": "https://example.com/end.png",
  "duration": 6,
  "aspect_ratio": "16:9",
  "resolution": "pro"
}
```
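Any of the request bodies above can be sent as a JSON `POST`. The sketch below builds such a request with Python's standard library without sending it. The base URL `https://api.example.com` and the Bearer-token `Authorization` header are placeholders assumed for illustration, not values taken from UniAll AI documentation; `build_generation_request` is a hypothetical helper.

```python
import json
import urllib.request

# Assumption: placeholder host; substitute the real API base URL.
BASE_URL = "https://api.example.com"


def build_generation_request(payload: dict, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a POST request to /v1/videos/generations."""
    return urllib.request.Request(
        url=f"{BASE_URL}/v1/videos/generations",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            # Assumption: Bearer-token auth; check the platform's auth scheme.
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


request = build_generation_request(
    {
        "model": "kling-v3-pro-silent",
        "generation_mode": "text_to_video",
        "prompt": "A futuristic city at sunrise, smooth drone shot, cinematic lighting.",
        "duration": 5,
        "aspect_ratio": "9:16",
        "resolution": "pro",
    },
    api_key="YOUR_API_KEY",
)

# Sending is commented out so the sketch runs without network access:
# with urllib.request.urlopen(request) as response:
#     task = json.load(response)
```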
Because generation is asynchronous, applications should submit the task, store the returned job ID, and poll or handle task status according to the UniAll AI API workflow.
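A polling loop for this kind of workflow might look like the sketch below. It is an illustration under stated assumptions: the status-lookup call is passed in as `fetch_status` because this page does not document a status endpoint, and the `status` field with values `"succeeded"` and `"failed"` is hypothetical.

```python
import time
from typing import Callable


def wait_for_task(
    fetch_status: Callable[[str], dict],
    task_id: str,
    interval_seconds: float = 5.0,
    timeout_seconds: float = 600.0,
) -> dict:
    """Poll fetch_status(task_id) until the task reaches a terminal state."""
    deadline = time.monotonic() + timeout_seconds
    while True:
        task = fetch_status(task_id)
        # Assumption: "succeeded" and "failed" are the terminal status names.
        if task.get("status") in ("succeeded", "failed"):
            return task
        if time.monotonic() >= deadline:
            raise TimeoutError(f"task {task_id} did not finish in time")
        time.sleep(interval_seconds)


# Stand-in for a real status call, so the sketch runs offline.
_responses = iter([{"status": "processing"}, {"status": "succeeded"}])
result = wait_for_task(lambda _id: next(_responses), "task-123", interval_seconds=0.01)
```

Passing the lookup function in keeps the loop independent of any particular HTTP client and makes it easy to test with canned responses.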
Practical selection advice
Choose **Kling V3 Pro 静音** when you want Pro-tier visual quality and plan to handle music, voice-over, or sound design separately. If you need generated audio, consider an audio-enabled variant instead. If cost is the main constraint, compare the Standard Silent tier; if maximum visual detail is required, compare the 4K options.
Frequently asked questions
How is Kling V3 Pro 静音 API priced?
The public model ID `kling-v3-pro-silent` is billed per generated second. The listed user price for the Pro silent variant is $0.07616 per second, approximately ¥0.55 / second. Final cost depends on duration and selected variant or account pricing context.
What generation modes does kling-v3-pro-silent support?
It supports text-to-video, image-to-video, and first/last-frame video generation. All modes require a prompt, while image-based modes also require the relevant image URL fields.
What video settings are supported?
The model supports 3 to 15 second outputs, one video per request, aspect ratios of 16:9, 9:16, and 1:1, and resolution options including standard, pro, and 4k depending on the selected variant.