Local TTS model
Qwen3 TTS
State-of-the-art multilingual TTS with natural prosody and emotion control. Supports 30+ languages with streaming inference.
Apple Silicon ready
text-to-speech generation
30 languages
Apache 2.0
Quality
9.5/10
Speed
8.5/10
Model size
2.8 GB
Voices
Multiple speakers + voice cloning
Can Qwen3 TTS run locally?
Qwen3 TTS can generate speech locally for private voice workflows. Start with lmstudio install qwen3-tts.
Apache 2.0 license. Still verify upstream usage notes before shipping.
lmstudio install qwen3-tts
Upstream source
streamingrealtimemultilingualemotion
Audio profile
Best fit
Qwen3 TTS is best for fast on-device voice responses and local assistants.
Hardware: gpuapple
Model details
Type
Local TTS model
Family
qwen
Latency
ultra-low
Formats
ggufonnx
Languages
en, zh, fr, de, es, it, pt, ru, ja, ko, ar
Context
Emotion tags, speed control
Install locally
01
Check runtimeConfirm the backend supports gguf, onnx on your machine.02
Install modelUse the upstream command or repository instructions.03
Test locallyRun a short private audio prompt before moving into production workflows.lmstudio install qwen3-tts
Good for
- text-to-speech generation
- Apple Silicon ready local workflows
- streaming, realtime, multilingual
Watch before shipping
- Validate pronunciation, latency and artifacts with your own voice samples.
- Review the upstream license and acceptable-use notes.
- Benchmark on your target CPU, Apple Silicon or GPU setup.
Related TTS and speech models
Alibaba Cloud (Qwen Team)
Qwen3-ASR
Local ASR model · Q 9.5 · Speed 9
Alibaba FunAudioLLM
CosyVoice 2
Local TTS model · Q 9.3 · Speed 8.8
OpenBMB
VoxCPM2
Local TTS model · Q 9.4 · Speed 8.3
hexgrad
Kokoro TTS
Local TTS model · Q 9.2 · Speed 9.8
Speech Research (SWivid)
F5-TTS v1.1
Local TTS model · Q 9.5 · Speed 9.2
Zyphra
Zonos v0.1
Local TTS model · Q 9.5 · Speed 8.5
Kyutai
Moshi
Local TTS model · Q 9 · Speed 9.5
Supertone
Supertonic 3
Local TTS model · Q 8.8 · Speed 9.8