Local TTS model
MeloTTS
High-quality multilingual TTS with extremely natural voice cloning. Best for Chinese and English with fast inference.
Apple Silicon ready
text-to-speech generation
9 languages
MIT
Quality
9/10
Speed
9/10
Model size
1.5 GB
Voices
Built-in + custom cloning
Can MeloTTS run locally?
MeloTTS can generate speech locally for private voice workflows. Start with pip install melotts.
MIT license. Still verify upstream usage notes before shipping.
pip install melotts
Upstream source
cloningrealtimemultilingual
Audio profile
Best fit
MeloTTS is best for local voice cloning and expressive speech generation.
Hardware: cpugpuapple
Model details
Type
Local TTS model
Family
melo
Latency
low
Formats
pytorchonnx
Languages
en, zh, fr, de, es, it, pt, ja, ko
Context
Speed, pitch control
Install locally
01
Check runtimeConfirm the backend supports pytorch, onnx on your machine.02
Install modelUse the upstream command or repository instructions.03
Test locallyRun a short private audio prompt before moving into production workflows.pip install melotts
Good for
- text-to-speech generation
- Apple Silicon ready local workflows
- cloning, realtime, multilingual
Watch before shipping
- Validate pronunciation, latency and artifacts with your own voice samples.
- Review the upstream license and acceptable-use notes.
- Benchmark on your target CPU, Apple Silicon or GPU setup.
Related TTS and speech models
Speech Research (SWivid)
F5-TTS v1.1
Local TTS model · Q 9.5 · Speed 9.2
Alibaba FunAudioLLM
CosyVoice 2
Local TTS model · Q 9.3 · Speed 8.8
OpenBMB
VoxCPM2
Local TTS model · Q 9.4 · Speed 8.3
RedNote HiLab
Dots TTS MF
Local TTS model · Q 9.4 · Speed 8
MyShell
OpenVoice V2
Local TTS model · Q 8.9 · Speed 9
OpenMOSS / MOSI.AI
MOSS-TTS-Nano
Local TTS model · Q 8.5 · Speed 9.7
Fish Audio
Fish Speech
Local TTS model · Q 9 · Speed 8.5
hexgrad
Kokoro TTS
Local TTS model · Q 9.2 · Speed 9.8