Local TTS model

EmotiVoice

Multi-voice TTS with granular emotion control. Choose from 2000+ built-in voices and specify emotions like happy, sad, angry, or surprised per sentence.

GPU recommended text-to-speech generation 2 languages Apache 2.0
Quality
8.5/10
Speed
7.5/10
Model size
1.5 GB
Voices
2000+ built-in voices

Can EmotiVoice run locally?

EmotiVoice can generate speech locally for private voice workflows. Start with pip install emotivoice.

Apache 2.0 license. Still verify upstream usage notes before shipping.

emotionmultilingualcontrollable

Audio profile

Quality
8.5
Speed
7.5
Local
8.4

Best fit

EmotiVoice is best for multilingual local speech generation.

Hardware: gpucpu

Model details

Type
Local TTS model
Family
emotivoice
Latency
low
Formats
pytorch
Languages
en, zh
Context
Emotion: happy, sad, angry, surprised

Install locally

01
Check runtimeConfirm the backend supports pytorch on your machine.
02
Install modelUse the upstream command or repository instructions.
03
Test locallyRun a short private audio prompt before moving into production workflows.
pip install emotivoice

Good for

  • text-to-speech generation
  • GPU recommended local workflows
  • emotion, multilingual, controllable

Watch before shipping

  • Validate pronunciation, latency and artifacts with your own voice samples.
  • Review the upstream license and acceptable-use notes.
  • Benchmark on your target CPU, Apple Silicon or GPU setup.

Related TTS and speech models

CompareBrowse all TTS models Local AIBrowse LLM models macOS appGet LocalClaw