Local ASR model

Cohere Transcribe 03-2026

Open-weights Conformer ASR model (speech-to-text), not a TTS model. Converts speech audio into transcribed text with multilingual support.

GPU recommended, speech-to-text transcription, 14 languages, Apache 2.0

Quality: 9/10
Speed: 8/10
Model size: 4 GB
Voices: N/A (ASR: outputs text)

Can Cohere Transcribe 03-2026 run locally?

Cohere Transcribe 03-2026 can run locally for offline speech-to-text. Start with huggingface-cli download CohereLabs/cohere-transcribe-03-2026.
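For scripted setups, the download command above can be wrapped in a small Python helper. This is a minimal sketch rather than an official installer: `build_download_command` and `download_model` are hypothetical names, and it assumes the `huggingface-cli` tool (installed with the `huggingface_hub` package) is available on PATH. The optional `--local-dir` flag only controls where the files are written.

```python
import shutil
import subprocess
from typing import List, Optional

# Repository id taken from the listing above.
REPO_ID = "CohereLabs/cohere-transcribe-03-2026"


def build_download_command(repo_id: str = REPO_ID,
                           local_dir: Optional[str] = None) -> List[str]:
    """Return the huggingface-cli invocation as an argument list."""
    cmd = ["huggingface-cli", "download", repo_id]
    if local_dir is not None:
        # Optional: write the files to a chosen folder instead of the cache.
        cmd += ["--local-dir", local_dir]
    return cmd


def download_model(local_dir: Optional[str] = None) -> None:
    """Run the download; raises if the CLI is missing or the command fails."""
    if shutil.which("huggingface-cli") is None:
        raise RuntimeError("huggingface-cli not found on PATH")
    subprocess.run(build_download_command(local_dir=local_dir), check=True)
```

Calling `download_model("models/cohere-transcribe")` would fetch the files into that folder; the folder name here is only an example.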

Apache 2.0 license. Still verify upstream usage notes before shipping.

streaming, realtime, multilingual

Audio profile

Quality: 9
Speed: 8
Local: 8.5

Best fit

Cohere Transcribe 03-2026 is best for offline transcription, speech indexing and local voice pipelines.

Hardware: GPU, Apple Silicon

Model details

Type: Local ASR model
Family: cohere
Latency: low
Formats: safetensors
Languages: en, fr, de, it, es, pt, el, nl, pl, zh, ja, ko, vi, ar
Context: audio waveform input -> log-Mel spectrogram; 2B parameters
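The details above can be captured in a small record for use in a local pipeline, for example to reject unsupported language codes early or to estimate disk and memory needs. This is an illustrative sketch: `ModelDetails` and its methods are hypothetical names, and the 2-bytes-per-parameter default assumes 16-bit weights, which the listing does not state.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class ModelDetails:
    family: str
    weight_format: str
    languages: Tuple[str, ...]
    parameters: int

    def supports(self, language_code: str) -> bool:
        """True if the language code appears in the listed languages."""
        return language_code.strip().lower() in self.languages

    def estimated_weight_gb(self, bytes_per_param: int = 2) -> float:
        """Rough weight size in GB; the default assumes 16-bit weights."""
        return self.parameters * bytes_per_param / 1e9


# Values copied from the listing; the parameter count is the rounded "2B".
COHERE_TRANSCRIBE = ModelDetails(
    family="cohere",
    weight_format="safetensors",
    languages=("en", "fr", "de", "it", "es", "pt", "el",
               "nl", "pl", "zh", "ja", "ko", "vi", "ar"),
    parameters=2_000_000_000,
)
```

Under the 16-bit assumption, `COHERE_TRANSCRIBE.estimated_weight_gb()` comes to 4.0, which lines up with the listed 4 GB model size.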

Install locally

01 Check runtime: Confirm the backend supports safetensors on your machine.
02 Install model: Use the upstream command or repository instructions:
   huggingface-cli download CohereLabs/cohere-transcribe-03-2026
03 Test locally: Run a short private audio clip before moving into production workflows.
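The runtime check and the post-download check can be sketched in Python as follows. This assumes a Python-based backend that relies on the `safetensors` and `huggingface_hub` packages; other runtimes will need a different check. The function names `missing_packages` and `list_safetensors` are hypothetical.

```python
import importlib.util
from pathlib import Path
from typing import Dict, List, Sequence


def missing_packages(
    names: Sequence[str] = ("safetensors", "huggingface_hub"),
) -> List[str]:
    """Return the packages from `names` that cannot be imported here."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def list_safetensors(model_dir: str) -> Dict[str, int]:
    """Map each .safetensors file under model_dir to its size in bytes."""
    root = Path(model_dir)
    return {
        str(path.relative_to(root)): path.stat().st_size
        for path in sorted(root.rglob("*.safetensors"))
    }
```

An empty list from `missing_packages()` and a non-empty result from `list_safetensors(...)` on the download folder suggest the environment is ready for a first local test.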

Good for

  • Speech-to-text transcription
  • Local workflows where a GPU is available (recommended)
  • Streaming, realtime and multilingual transcription

Watch before shipping

  • Validate transcription accuracy and latency with your own audio samples.
  • Review the upstream license and acceptable-use notes.
  • Benchmark on your target CPU, Apple Silicon or GPU setup.
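One way to run these checks is to measure word error rate (WER) against a reference transcript and the real-time factor (processing time divided by audio duration) on the target machine. The sketch below uses only the standard library. The `transcribe` argument is a hypothetical placeholder for whatever callable wraps the model in your runtime; it is not an API of the model or of any particular library.

```python
import time
from typing import Callable, List, Tuple


def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref: List[str] = reference.lower().split()
    hyp: List[str] = hypothesis.lower().split()
    if not ref:
        return 0.0 if not hyp else 1.0
    # Two-row dynamic programme for Levenshtein distance over words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution or match
            )
        prev = curr
    return prev[-1] / len(ref)


def real_time_factor(
    transcribe: Callable[[str], str],
    audio_path: str,
    audio_seconds: float,
) -> Tuple[str, float]:
    """Time one transcription; a factor below 1.0 is faster than real time."""
    start = time.perf_counter()
    text = transcribe(audio_path)
    elapsed = time.perf_counter() - start
    return text, elapsed / audio_seconds
```

Running both functions over a handful of representative recordings per language and per device (CPU, Apple Silicon, GPU) gives comparable numbers before a production decision.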
