Local TTS model

Kokoro TTS

Ultra-lightweight yet stunning quality. 82M params only - runs on CPU in real-time. Best quality-to-size ratio of any TTS model.

Edge ready text-to-speech generation 9 languages Apache 2.0
Quality
9.2/10
Speed
9.8/10
Model size
0.33 GB
Voices
54 built-in + voice mixing

Can Kokoro TTS run locally?

Kokoro TTS can generate speech locally for private voice workflows. Start with pip install kokoro.

Apache 2.0 license. Still verify upstream usage notes before shipping.

realtimestreaminglow-latencymultilingual

Audio profile

Quality
9.2
Speed
9.8
Local
9.6

Best fit

Kokoro TTS is best for fast on-device voice responses and local assistants.

Hardware: cpugpuappleedge

Model details

Type
Local TTS model
Family
kokoro
Latency
ultra-low
Formats
pytorchonnx
Languages
en, fr, es, pt, it, ja, zh, ko, hi
Context
StyleTTS2 architecture

Install locally

01
Check runtimeConfirm the backend supports pytorch, onnx on your machine.
02
Install modelUse the upstream command or repository instructions.
03
Test locallyRun a short private audio prompt before moving into production workflows.
pip install kokoro

Good for

  • text-to-speech generation
  • Edge ready local workflows
  • realtime, streaming, low-latency

Watch before shipping

  • Validate pronunciation, latency and artifacts with your own voice samples.
  • Review the upstream license and acceptable-use notes.
  • Benchmark on your target CPU, Apple Silicon or GPU setup.

Related TTS and speech models

CompareBrowse all TTS models Local AIBrowse LLM models macOS appGet LocalClaw