
Native 4K at 60fps. Lock to Elements. Multilingual lip-sync.
Kuaishou's flagship. First AI video model to generate native 4K at 60fps, producing broadcast-quality footage without post-processing. The Elements feature lets you feed up to four reference images (a character, a prop, a location) and Kling locks onto them with surgical fidelity throughout the shot. Generates multi-language dialogue with precise lip-sync.

“An automotive commercial at 60fps: a black sports car enters frame left, camera tracks alongside as it accelerates, dramatic low angle, wet road reflections”

“A multi-shot food commercial: wide shot of a sizzling steak on a grill, cut to close-up of seasoning being sprinkled, cut to plating, warm restaurant lighting”

“A product launch reveal: a premium speaker emerges from darkness, spotlights illuminate it in sequence, 360-degree orbit, bass-heavy ambient audio”

“An international brand campaign: a presenter speaks directly to camera in English, warm studio lighting, professional backdrop, confident delivery”

“A game trailer sequence: wide shot of a fantasy cityscape, cut to a character drawing a sword, cut to a battle scene, orchestral music building”

“A claymation penguin sliding down an icy slope, tumbling and rolling, landing in a snowdrift, playful stop-motion style, fun soundtrack”
Type a detailed prompt describing the video you want, or upload a reference image as a starting frame.
Pick your resolution and duration. See the credit cost before you generate.
Your video is ready in 1-3 minutes. Download, iterate, or extend the sequence.
Jump into the Studio and start generating. Plans from £10/month.
Kling O3 is the first AI video model to generate at native 4K resolution and 60 frames per second simultaneously. The result is broadcast-quality footage with smooth, cinematic motion. For automotive reveals where smooth panning is critical, architectural presentations on large screens, or any production workflow that demands broadcast standards, 60fps eliminates the stutter and judder that lower framerates introduce.
The Elements feature is the headline upgrade. Feed Kling O3 up to four reference images, a character, a hero product, a specific location, a piece of wardrobe, and the model locks onto each one with the kind of fidelity that single-image reference modes cannot reach. A drinks brand can lock to the actual can. An automotive client can lock to the actual hero car. A mascot stays on-model from frame one to frame last. This is the first time AI video has been usable for brand-critical work without a manual cleanup pass.
Multilingual dialogue generation covers 5 languages with precise lip-sync. Kling O3 maps phonemes to correct lip shapes, producing characters that speak naturally in English, Mandarin, Spanish, Japanese, and Korean. For international brands, multilingual teams, and global marketing campaigns, this eliminates the need for separate localisation passes.
Professional video generation. Plans from £10/month.