Local TTS model
Chatterbox TTS
Open-source SOTA voice cloning from Resemble AI. Outperforms ElevenLabs on naturalness benchmarks. Supports emotion exaggeration control and ultra-stable generation.
Apple Silicon ready
text-to-speech generation
1 languages
MIT
Quality
9.4/10
Speed
8/10
Model size
1.2 GB
Voices
Zero-shot cloning (3s sample)
Can Chatterbox TTS run locally?
Chatterbox TTS can generate speech locally for private voice workflows. Start with pip install chatterbox-tts.
MIT license. Still verify upstream usage notes before shipping.
pip install chatterbox-tts
Upstream source
cloningemotionstreaming
Audio profile
Best fit
Chatterbox TTS is best for local voice cloning and expressive speech generation.
Hardware: gpuapple
Model details
Type
Local TTS model
Family
chatterbox
Latency
low
Formats
pytorch
Languages
en
Context
Emotion exaggeration slider
Install locally
01
Check runtimeConfirm the backend supports pytorch on your machine.02
Install modelUse the upstream command or repository instructions.03
Test locallyRun a short private audio prompt before moving into production workflows.pip install chatterbox-tts
Good for
- text-to-speech generation
- Apple Silicon ready local workflows
- cloning, emotion, streaming
Watch before shipping
- Validate pronunciation, latency and artifacts with your own voice samples.
- Review the upstream license and acceptable-use notes.
- Benchmark on your target CPU, Apple Silicon or GPU setup.
Related TTS and speech models
Zyphra
Zonos v0.1
Local TTS model · Q 9.5 · Speed 8.5
Alibaba FunAudioLLM
CosyVoice 2
Local TTS model · Q 9.3 · Speed 8.8
OpenBMB
VoxCPM2
Local TTS model · Q 9.4 · Speed 8.3
Bilibili
IndexTTS 2
Local TTS model · Q 9.4 · Speed 8
Canopy Labs
Orpheus TTS
Local TTS model · Q 9.6 · Speed 7.5
Boson AI
Higgs Audio v2
Local TTS model · Q 9.7 · Speed 7
StepFun
Step-Audio 2 Mini
Local TTS model · Q 9.3 · Speed 7.5
Nari Labs
Dia
Local TTS model · Q 9.3 · Speed 7