Kokoro TTS example output

Kokoro TTS

Budget text-to-speech. 20 voices, adjustable speed.

Kokoro is the cheapest TTS option in the roster. 20 voices (10 female, 10 male), adjustable playback speed, and fast generation. It won't match ElevenLabs or MiniMax for expressiveness, but for drafts, internal reviews, placeholder narration, and high-volume content where cost matters more than nuance, Kokoro delivers clean speech at rock-bottom pricing.

Try these prompts

How it works

01

Describe your vision

Type a detailed prompt or upload a reference sketch, photo, or mood board.

02

Choose your settings

Pick your resolution and aspect ratio. See the credit cost before you generate.

03

Generate in seconds

Your image is delivered in seconds. Download, iterate, or pipe into video.

Ready to create with Kokoro TTS?

Jump into the Studio and start generating. Plans from £10/month.

Choose a Plan

Clean speech at the lowest cost

Not every voiceover needs to be perfect. Drafts, internal reviews, placeholder audio for prototypes, bulk content generation. These all need speech that is clear and natural enough to be useful, without the cost of premium models. Kokoro fills that gap.

20 voices: 10 female (af_heart, af_alloy, af_bella, af_jessica, af_nicole, and more) and 10 male (am_adam, am_echo, am_eric, am_liam, am_michael, and more). Each voice is distinct and clear. Adjustable speed lets you match the pace to your content.

Use Kokoro for iteration and drafting, then switch to ElevenLabs, MiniMax Speech, or OpenAI TTS HD for final deliverables. The workflow stays the same. The quality scales with the model. The cost stays proportional to the stage of the project.

20 voices to choose from

Ten female and ten male voices. Each has a distinct character. Try multiple voices before committing. Find the right tone for your project without worrying about cost.

Speed control

Adjustable playback speed from slow to fast. Slow down for emphasis in presentations. Speed up for rapid-fire social content. The voice quality stays consistent across the speed range.

Frequently asked

Questions about Kokoro TTS.

Kokoro is the cheapest TTS option in the roster. 20 voices (10 female, 10 male), adjustable playback speed, and fast generation. It won't match ElevenLabs or MiniMax for expressiveness, but for drafts, internal reviews, placeholder narration, and high-volume content where cost matters more than nuance, Kokoro delivers clean speech at rock-bottom pricing.
Built differently

Why Stensyl?

A small indie studio building creative tools the way they should be built. No VC theatre, no funnel games, no faceless support.