Local ASR model
Whisper v3 Turbo
OpenAI's optimized Whisper v3 with 4 decoder layers instead of 32. 8× faster than Whisper Large v3 with only minor accuracy trade-off. 99 languages supported. New gold standard for fast local transcription.
Apple Silicon ready
speech-to-text transcription
99 languages
MIT
Quality
9.1/10
Speed
9.5/10
Model size
1.6 GB
Voices
N/A (ASR: outputs text)
Can Whisper v3 Turbo run locally?
Whisper v3 Turbo can run locally for offline speech-to-text. Start with pip install openai-whisper.
MIT license. Still verify upstream usage notes before shipping.
pip install openai-whisper
Upstream source
streamingrealtimemultilinguallow-latency
Audio profile
Best fit
Whisper v3 Turbo is best for offline transcription, speech indexing and local voice pipelines.
Hardware: cpugpuapple
Model details
Type
Local ASR model
Family
whisper
Latency
ultra-low
Formats
pytorchsafetensorsgguf
Languages
multilingual
Context
809M params, 4 decoder layers, 99 languages
Install locally
01
Check runtimeConfirm the backend supports pytorch, safetensors, gguf on your machine.02
Install modelUse the upstream command or repository instructions.03
Test locallyRun a short private audio prompt before moving into production workflows.pip install openai-whisper
Good for
- speech-to-text transcription
- Apple Silicon ready local workflows
- streaming, realtime, multilingual
Watch before shipping
- Validate pronunciation, latency and artifacts with your own voice samples.
- Review the upstream license and acceptable-use notes.
- Benchmark on your target CPU, Apple Silicon or GPU setup.
Related TTS and speech models
Kyutai
Kyutai STT 2.6B
Local ASR model · Q 9.4 · Speed 9.5
Alibaba Cloud (Qwen Team)
Qwen3-ASR
Local ASR model · Q 9.5 · Speed 9
NVIDIA
Parakeet TDT 0.6B v2
Local ASR model · Q 9.4 · Speed 10
NVIDIA
Canary 1B v2
Local ASR model · Q 9.3 · Speed 9
IBM Granite Team
Granite Speech 4.1 2B
Local ASR model · Q 9.2 · Speed 8
Microsoft Research
VibeVoice ASR
Local ASR model · Q 9.3 · Speed 7.5
Cohere
Cohere Transcribe 03-2026
Local ASR model · Q 9 · Speed 8
hexgrad
Kokoro TTS
Local TTS model · Q 9.2 · Speed 9.8