SPEECH-TO-TEXT

Hold a key. Speak. Release.
Text appears where your cursor is.

Push-to-talk dictation that works in every app. VoxBee captures your voice, transcribes it on-device using Whisper AI, and injects the text wherever you're typing — no copy-paste needed.

VoxBee dictation interface — push-to-talk speech-to-text on Mac and Linux

How it works

1

Hold your hotkey

Option, Fn, or Command

2

Speak naturally

In any of 13 languages

3

Release to transcribe

Text appears at your cursor

Everything you need for voice typing

Push-to-Talk & Hands-Free

Hold Option (or Fn) to record, release to transcribe. Or use hands-free mode — tap the hotkey combo to start, tap again to stop.

13 Languages + Auto-Detect

English, Spanish, French, German, Hindi, Chinese, Japanese, Korean, Arabic, Russian, Italian, Portuguese, and more. Auto-detect picks the right one.

Grammar Correction

Powered by Harper, VoxBee cleans up grammar and removes filler words (um, uh, er, ah) so your text reads naturally.

Configurable Hotkeys

Choose between Option, Fn (Globe), or Command as your push-to-talk key. Each has a companion key for hands-free mode.

Works in Every App

Text is injected wherever your cursor is — Slack, VS Code, Cursor, Notion, Notes, Terminal, or any other app.

7 Whisper Models

From Tiny (75MB, instant) to Large v3 (2.9GB, most accurate). Pick the right balance of speed and accuracy for your use case.

Your words never leave your device.

All transcription runs locally using Whisper AI models. No internet connection, no cloud processing, no data collection. Your voice stays on your Mac or Linux machine.

One price. Yours forever.

$39 one-time purchase. No subscriptions. No cloud fees.

See Pricing