
Six HeyGen tools under one roof: trained avatars, talking photos, text-to-video, lipsync, and 179-language dubbing. One credit pool, one place to learn.
HeyGen pioneered AI talking avatars and still sets the benchmark, and the lineup now goes well beyond a single model. Stensyl integrates the full suite: Avatar V Digital Twin trained from your own footage, Talking Photo to make any portrait speak, Text to Video that scripts and assembles a clip from a prompt, Video Translate into 179 languages, and V3 Lipsync in Precision and Speed. Six tools, one credit pool, one studio.

“Train a Digital Twin from a 2-minute clip, then render a 15-second product launch teaser in your own voice and gestures.”

“Make a generated character portrait talk: upload the image, type a 10-second script, pick a voice (Talking Photo).”

“Turn a one-paragraph brief into a finished 30-second explainer with a presenter, script, and cuts (Text to Video).”

“Translate a tutorial from English into Spanish, Hindi, and Japanese with the same presenter on screen (Video Translate).”

“Dub a finished promo with a new voiceover using frame-accurate lip movement (V3 Lipsync Precision).”

“Generate caption-tracked social cuts of a podcast clip for TikTok, Reels, and Shorts (V3 Lipsync Speed).”
Type a detailed prompt describing the video you want, or upload a reference image as a starting frame.
Pick your resolution and duration. See the credit cost before you generate.
Your video is ready in 1-3 minutes. Download, iterate, or extend the sequence.
Jump into the Studio and start generating. Plans from £10/month.
HeyGen is the most respected name in AI talking-head video, and the lineup now reaches far beyond a single model. Avatar V Digital Twin trains a persistent avatar on 2 to 3 minutes of your own footage, learning your face, gestures, body language, and voice in one holistic model. Talking Photo brings a single portrait to life with a script and a voice. Text to Video turns a written brief into a finished, presenter-led cut. Video Translate re-voices and re-lip-syncs any clip into 179 languages. V3 Lipsync replaces the audio on existing footage with frame-accurate mouth movement. Stensyl integrates all of it under one roof.
Avatar V is the upgrade to HeyGen's earlier Avatar IV engine, and it is a real step up. Where the previous generation animated a likeness, V learns gestures, body language, and expression, so renders read like genuine footage of you rather than an animated photo, at effectively the same cost. The trained twin is persistent: build it once and it still works on every render months later. It auto-appears as a Cast member in Storyboards and Film Studio, slots into Canvas as an Avatar Video node, lives in Generate's Talking Avatar mode, and answers to Ray. You can also add new outfits to an existing twin without retraining from scratch.
Everything HeyGen is on every Stensyl plan, billed from one shared credit pool. The transform tools work on anything you bring: Video Translate is 8 credits per input second, V3 Lipsync Speed is 11 and Precision 21 credits per output second, and every lipsync render returns an SRT caption file mirrored into permanent storage. The generative tools bill per output second by duration: Talking Photo at 12 credits a second with no setup, Text to Video at 9. Avatar V renders are 80, 160, and 240 credits at the 5, 10, and 15 second buckets; training a twin is a one-time 425 credits and adding an outfit is 250, since both carry real cost on HeyGen's side.
Professional video generation. Plans from £10/month.