Overview
Kling V3 Pro 有声 is a Pro-tier video generation model for creating short AI videos with audio output. On UniAll AI, the public model id is `kling-v3-pro-audio`.
It supports three generation modes:
- **Text to video**: generate a video from a prompt.
- **Image to video**: animate a reference image with a prompt.
- **First/last-frame video**: guide the motion between a starting frame and an ending frame.
The model is useful for short video assets, product reveals, social ads, storyboards, cinematic clips, and automated video workflows that need synchronized audio output.
Pricing
Kling V3 Pro 有声 is billed by generated video duration.
- **Billing unit**: per second
- **Default user price for Pro Audio**: **$0.11424 per second**
- Displayed reference price: **¥0.82 / second**
- Supported duration range: **3 to 15 seconds**
- Output count: **1 video per generation**
- Refund on failed generation: **enabled**
- Automatic retry on failure: **not enabled**
Example cost estimates for `kling-v3-pro-audio` at $0.11424/second:
| Duration | Estimated cost | |---:|---:| | 3 seconds | $0.34272 | | 5 seconds | $0.57120 | | 10 seconds | $1.14240 | | 15 seconds | $1.71360 |
UniAll AI also exposes related Kling V3 variants with different price levels, including standard, pro, 4K, silent, and audio options. Choose Pro Audio when you need higher-quality generation with audio output.
Supported parameters
Common request parameters include:
| Parameter | Type | Notes | |---|---|---| | `model` | string | Use `kling-v3-pro-audio` | | `generation_mode` | string | `text_to_video`, `image_to_video`, or `first_last_frame` | | `prompt` | string | Required for all modes | | `image_url` | string | Required for image-to-video mode | | `first_image_url` | string | Required for first/last-frame mode | | `last_image_url` | string | Required for first/last-frame mode | | `duration` | integer | 3–15 seconds | | `aspect_ratio` | string | `16:9`, `9:16`, or `1:1` | | `resolution` | string | `standard`, `pro`, or `4k` | | `video_count` | integer | Fixed at 1 |
Accepted image formats include PNG, JPEG, and WebP.
API usage
Kling V3 Pro 有声 uses an asynchronous video generation endpoint on UniAll AI.
```http POST /v1/videos/generations Content-Type: application/json Authorization: Bearer YOUR_API_KEY ```
Image-to-video example
```json { "model": "kling-v3-pro-audio", "generation_mode": "image_to_video", "prompt": "A cinematic product reveal, soft studio lighting, smooth camera movement.", "image_url": "https://example.com/reference.png", "duration": 5, "aspect_ratio": "16:9", "resolution": "pro", "video_count": 1 } ```
Text-to-video example
```json { "model": "kling-v3-pro-audio", "generation_mode": "text_to_video", "prompt": "A futuristic city street at night, neon reflections, slow dolly shot, atmospheric sound.", "duration": 5, "aspect_ratio": "9:16", "resolution": "pro", "video_count": 1 } ```
First/last-frame example
```json { "model": "kling-v3-pro-audio", "generation_mode": "first_last_frame", "prompt": "Create a smooth transition between the two frames with cinematic motion and natural audio ambience.", "first_image_url": "https://example.com/start.png", "last_image_url": "https://example.com/end.png", "duration": 6, "aspect_ratio": "16:9", "resolution": "pro", "video_count": 1 } ```
Who should use it
Kling V3 Pro 有声 is a good fit for:
- Developers building AI video generation features through API calls.
- Marketing teams producing short ad creatives and product videos.
- E-commerce teams creating animated product showcases.
- Content platforms that need text-to-video or image-to-video workflows.
- Agencies and automation teams generating repeatable short-form video assets.
For cost-sensitive workloads, compare Pro Audio with Standard Audio or silent variants. For premium visual output, compare against 4K audio options.
常见问题
How is Kling V3 Pro 有声 API priced on UniAll AI?
The `kling-v3-pro-audio` model is billed per generated second. The listed user price is $0.11424 per second, shown as approximately ¥0.82 per second. A 5-second generation is estimated at $0.57120 before any account-specific adjustments.
What generation modes does kling-v3-pro-audio support?
It supports text-to-video, image-to-video, and first/last-frame video generation. All modes require a prompt. Image-to-video also requires `image_url`, while first/last-frame mode requires both `first_image_url` and `last_image_url`.
What video duration and aspect ratios are supported?
The supported duration range is 3 to 15 seconds. Available aspect ratios are `16:9`, `9:16`, and `1:1`. The model generates one video per request.