Whisper Small

Compact multilingual Whisper for local transcription

MultilingualApache-2.0

Desktop app Open the Models screen and click install.

CLI

$ openasr pull whisper-small:q8

Overview

Whisper Small is OpenAI's 244M-parameter multilingual Whisper checkpoint. It uses the standard Whisper encoder-decoder architecture for automatic speech recognition and speech translation, trained with large-scale weak supervision on 680k hours of labelled speech. Compared with larger Whisper checkpoints, the small model is easier to run locally while retaining the broad zero-shot behavior that makes Whisper useful across noisy datasets and domains. This OpenASR repo repackages the original openai/whisper-small weights as .oasr packs that run natively in the OpenASR runtime with no Python at inference time. For most users the q8_0 build is the recommended default; q4_k is for tighter memory budgets and fp16 is for verification or maximum fidelity.

Highlights

🎧 Multilingual ASR — transcribes many languages and can translate speech to English
🧠 244M parameters — the small Whisper checkpoint balances accuracy, footprint, and speed
🌐 Weak-supervision scale — trained with Whisper's 680k-hour labelled speech corpus
🦀 Native in OpenASR — .oasr packs run with no Python at inference, engineered for CPU and Apple Silicon

Pull string	Size	Quant	JFK ΔWER
`whisper-small:fp16`	466.1 MB	fp16	0%
`whisper-small:q8`default	288.9 MB	q8_0	0%
`whisper-small:q4`	194.4 MB	q4_k	0%

Usage

These are CLI / local-server examples. The desktop app runs this model without typing a command — see the desktop install path above.

bash · transcribe a file

$ openasr pull whisper-small:q8
↓ whisper-small.oasr  288.9 MB  ✓ verified sha256
$ openasr transcribe meeting.wav --backend native --model-pack ~/.openasr/models/whisper-small/q8_0/whisper-small-q8_0.oasr
✓ local transcript · 0 bytes sent

bash · serve a local API

$ openasr serve --backend native --model-pack ~/.openasr/models/whisper-small/q8_0/whisper-small-q8_0.oasr --addr 127.0.0.1:8080
▶ http://127.0.0.1:8080 · model=whisper-small · 0 bytes will leave this host

python · client.py

from openai import OpenAI
client = OpenAI(base_url="http://127.0.0.1:8080/v1", api_key="local")
audio = open("meeting.wav", "rb")
text = client.audio.transcriptions.create(model="whisper-small", file=audio)

Overview

Highlights

Tags

Usage

Other models