Video Generator

Create AI-generated video clips from text descriptions or reference images

Sign in to Generate Videos

Create AI-generated videos from text or images.

Features

Text-to-Video - Describe any scene and get a video clip
Image-to-Video - Upload an image as a starting frame and animate it
Multi-Shot Storyboard - Build sequences with up to 6 shots, each with its own prompt and duration
Start and End Frames - Pin specific images as the first or last frame of your video
Reference Images - Upload reference images and use @Element1, @Element2 in your prompt to keep characters consistent
Native Audio - Generates synchronized audio with dialogue support (Chinese and English)
3-15 Second Clips - Flexible duration from quick shots to longer sequences
Negative Prompt - Specify what you don't want in your video
Prompt Adherence - Fine-tune how closely the model follows your description

Layout

The generator uses the same section order in all modes:

Mode selector
Scene Input (single prompt or storyboard shots)
Reference Frames & Characters
Render Controls
Advanced options

Tips for Best Results

Describe the scene, subject, and motion clearly in your prompt
Include camera directions when relevant (e.g., "slow pan", "dolly zoom", "tracking shot")
Mention lighting, mood, and visual style (e.g., "cinematic", "golden hour", "noir")
Keep prompts focused on a single scene or action for best coherence
Use the negative prompt to exclude common artifacts like blur and distortion
Lower the Prompt Adherence slider for more creative freedom, raise it for tighter control
For image-to-video, describe the motion you want rather than repeating what's in the image
For dialogue, write speech in your prompt naturally and Kling will generate matching voices

Multi-Shot Storyboard

The Storyboard mode lets you build a video sequence from multiple shots. Each shot gets its own prompt and duration, and Kling renders them together into a single continuous video.

You can add up to 6 shots. Two split modes are available: Custom durations lets you set each shot's length independently (3-15s each), while Auto split lets the model decide how to distribute time across your shots.

Think of each shot like a cut in a film. Describe what happens in each one: the framing, action, and camera movement. The model handles transitions between them.

Start and End Frames

Use the Reference Frames & Characters section to upload a Start Frame and End Frame.

Start Frame is required in Image-to-Video
Start Frame is also required if you use an End Frame or Reference Images
End Frame is optional in every mode

Reference Images

Upload up to 4 reference images in the Reference Frames & Characters section. Each gets an @Element tag (@Element1, @Element2, etc.) that you use in your prompt or shot prompts to tell the model which character or object to place.

Use a clear frontal photo of the character with good lighting and no obstructions. The model will maintain that character's appearance throughout the video, including across multi-shot storyboards.

Reference image requirements

For Kling v3 Pro image inputs, keep every start/end/reference image at minimum 300x300, with aspect ratio between 0.40 and 2.50, and file size under 10MB. PNG and JPEG work well.

If an image is smaller than 300px on either side (for example 591x295), generation can fail even when the prompt is valid.

Example Prompts

Text-to-Video:

"A cat stretching lazily on a sun-drenched windowsill, dust motes floating in warm afternoon light, cozy indoor atmosphere"
"Drone flyover of a winding river through autumn forest, vibrant red and gold foliage, golden hour lighting"
"Close-up of espresso being poured into a ceramic cup, steam rising, shallow depth of field, cafe ambiance"

Image-to-Video:

"The character slowly turns to face the camera and smiles, gentle wind blowing through their hair"
"Camera slowly zooms out while leaves fall from the tree, soft natural lighting"
"The landscape transitions from day to golden sunset, clouds moving across the sky"

Storyboard (3 shots):

"Wide establishing shot of a misty mountain valley at dawn, camera slowly drifting forward"
"Medium shot of a lone hiker walking along the ridge, wind blowing their jacket"
"Close-up of their face as they reach the summit and smile, golden light breaking through clouds"

With Reference Images:

Upload a photo of a person as @Element1, then prompt: "@Element1 walks through a bustling city street, camera tracking from the side, golden hour lighting"

Video Generator ​

Sign in to Generate Videos

Features ​

Layout ​

Tips for Best Results ​

Multi-Shot Storyboard ​

Start and End Frames ​

Reference Images ​

Reference image requirements ​

Example Prompts ​