
Budget text-to-speech. 20 voices, adjustable speed.
Kokoro is the cheapest TTS option in the roster. 20 voices (10 female, 10 male), adjustable playback speed, and fast generation. It won't match ElevenLabs or MiniMax for expressiveness, but for drafts, internal reviews, placeholder narration, and high-volume content where cost matters more than nuance, Kokoro delivers clean speech at rock-bottom pricing.
Type a detailed prompt or upload a reference sketch, photo, or mood board.
Pick your resolution and aspect ratio. See the credit cost before you generate.
Your image is delivered in seconds. Download, iterate, or pipe into video.
Jump into the Studio and start generating. Plans from £10/month.
Choose a PlanNot every voiceover needs to be perfect. Drafts, internal reviews, placeholder audio for prototypes, bulk content generation. These all need speech that is clear and natural enough to be useful, without the cost of premium models. Kokoro fills that gap.
20 voices: 10 female (af_heart, af_alloy, af_bella, af_jessica, af_nicole, and more) and 10 male (am_adam, am_echo, am_eric, am_liam, am_michael, and more). Each voice is distinct and clear. Adjustable speed lets you match the pace to your content.
Use Kokoro for iteration and drafting, then switch to ElevenLabs, MiniMax Speech, or OpenAI TTS HD for final deliverables. The workflow stays the same. The quality scales with the model. The cost stays proportional to the stage of the project.
Ten female and ten male voices. Each has a distinct character. Try multiple voices before committing. Find the right tone for your project without worrying about cost.
Adjustable playback speed from slow to fast. Slow down for emphasis in presentations. Speed up for rapid-fire social content. The voice quality stays consistent across the speed range.
A small indie studio building creative tools the way they should be built. No VC theatre, no funnel games, no faceless support.
Professional audio generation. Plans from £10/month.