Local ASR model
Cohere Transcribe 03-2026
Open-weights Conformer ASR model (speech-to-text), not a TTS model. Converts speech audio into transcribed text with multilingual support.
GPU recommended
speech-to-text transcription
14 languages
Apache 2.0
Quality
9/10
Speed
8/10
Model size
4 GB
Voices
N/A (ASR: outputs text)
Can Cohere Transcribe 03-2026 run locally?
Cohere Transcribe 03-2026 can run locally for offline speech-to-text. Start with huggingface-cli download CohereLabs/cohere-transcribe-03-2026.
Apache 2.0 license. Still verify upstream usage notes before shipping.
huggingface-cli download CohereLabs/cohere-transcribe-03-2026
Upstream source
streamingrealtimemultilingual
Audio profile
Best fit
Cohere Transcribe 03-2026 is best for offline transcription, speech indexing and local voice pipelines.
Hardware: gpuapple
Model details
Type
Local ASR model
Family
cohere
Latency
low
Formats
safetensors
Languages
en, fr, de, it, es, pt, el, nl, pl, zh, ja, ko, vi, ar
Context
Input: audio waveform -> log-Mel spectrogram; 2B params
Install locally
01
Check runtimeConfirm the backend supports safetensors on your machine.02
Install modelUse the upstream command or repository instructions.03
Test locallyRun a short private audio prompt before moving into production workflows.huggingface-cli download CohereLabs/cohere-transcribe-03-2026
Good for
- speech-to-text transcription
- GPU recommended local workflows
- streaming, realtime, multilingual
Watch before shipping
- Validate pronunciation, latency and artifacts with your own voice samples.
- Review the upstream license and acceptable-use notes.
- Benchmark on your target CPU, Apple Silicon or GPU setup.
Related TTS and speech models
Kyutai
Kyutai STT 2.6B
Local ASR model · Q 9.4 · Speed 9.5
Alibaba Cloud (Qwen Team)
Qwen3-ASR
Local ASR model · Q 9.5 · Speed 9
OpenAI
Whisper v3 Turbo
Local ASR model · Q 9.1 · Speed 9.5
NVIDIA
Canary 1B v2
Local ASR model · Q 9.3 · Speed 9
IBM Granite Team
Granite Speech 4.1 2B
Local ASR model · Q 9.2 · Speed 8
Microsoft Research
VibeVoice ASR
Local ASR model · Q 9.3 · Speed 7.5
NVIDIA
Parakeet TDT 0.6B v2
Local ASR model · Q 9.4 · Speed 10
hexgrad
Kokoro TTS
Local TTS model · Q 9.2 · Speed 9.8