VoxFlow Local · macOS dictation

Dictation you actually own.

On-device speech to text, your professional vocabulary, and a voice line straight into your coding agent. Nothing leaves your Mac unless you say so.

Built on
WhisperKitOllamaApple SiliconGemmaSwift 6
100%
On-device
0 bytes
To the cloud, by default
Your terms
Vocabulary
Your tools
Commands
See it work

Speak. It polishes. It acts.

Dictate a paragraph, watch it clean up on-device, then fire a command into your open agent. Fifteen seconds, no cloud round trip.

Dictation · private + vocabulary
"um, the parties agreed to iso forty two thousand one, comma, effective today"
on-device polish
The parties agreed to ISO 42001, effective today.
WhisperKit and a local model. Your vocabulary applied. 0 bytes uploaded by default.
Command lane · drives your agent
You hold the key and say "security review"
claude ❯ /security-review
Reviewing the current diff for security issues...
Resolved on-device, then typed into your open Claude Code or Codex session. You press enter.
Three things that matter

Private work. Your vocabulary. Your commands.

Private work

Private by default

WhisperKit speech to text and a local model run entirely on your Mac. Cloud fallback is off by default and redacted when you turn it on. No account, no telemetry, no upload.

stt_fallback_active: false
Vocabulary

Knows your words

Teach it once. It biases recognition before it guesses and corrects after the fact. ISO 42001, GDPR, RCW, the names in your matters. Built for how professionals actually speak.

"iso forty two thousand one" → ISO 42001
Commands Experimental

Drives your agent

Hold a key, speak, and the command lands in your open Claude Code or Codex session. Connectors are plain files, so you and anyone who forks it can add their own.

"security review" → /security-review
Why local

Your voice, on your own terms.

The same reasons the best local-first tools exist, applied to dictation.

01

Own your data

Audio and transcripts stay on the machine. There is no server to trust, breach, or subpoena.

02

No subscription

You run it. No per-seat pricing that climbs every year, no usage caps, no lock-in.

03

Fork it, extend it

Add vocabulary packs, write new agent connectors, wire in your own tools. The codebase is yours to shape.

04

Audit every line

Open source under MIT. No black box around how audio is handled or how text is cleaned.

vs cloud dictation

What you give up with the cloud.

VoxFlow LocalCloud dictation (Wispr, Otter, cloud STT)
Where audio goesStays on your MacUploaded to a server
Monthly costFree, you run itSubscription, per seat
Your vocabularyLearned and biased on-deviceGeneric, often capped
Drives your agentVoice into Claude Code / Codex ExperimentalNo
Source codeOpen, MIT, fork itClosed
Works offlineYesNo
Get started

Clone it. Build it. Keep your voice.

macOS 14 or later, Apple Silicon. A free Apple ID developer certificate keeps the accessibility grant across rebuilds; ad-hoc signing works without an Apple account.

zsh
# 1. clone
git clone https://github.com/ZOLAtheCodeX/voxflow-local.git
cd voxflow-local

# 2. bootstrap the backend and models
./scripts/bootstrap_all.sh

# 3. run the backend, then the app
./scripts/run_backend.sh
swift run VoxFlowLocal

Full signing matrix and Ollama setup in the README. Text polish uses a local Ollama model; the regex pipeline is the fallback when Ollama is not running.

Own the stack

Dictation that never leaves your Mac.