SPEECH-TO-TEXT
Hold a key. Speak. Release.
Text appears where your cursor is.
Push-to-talk dictation that works in every app. VoxBee captures your voice, transcribes it on-device using Whisper AI, and injects the text wherever you're typing — no copy-paste needed.

How it works
Hold your hotkey
Option, Fn, or Command
Speak naturally
In any of 13 languages
Release to transcribe
Text appears at your cursor
Everything you need for voice typing
Push-to-Talk & Hands-Free
Hold Option (or Fn) to record, release to transcribe. Or use hands-free mode — tap the hotkey combo to start, tap again to stop.
13 Languages + Auto-Detect
English, Spanish, French, German, Hindi, Chinese, Japanese, Korean, Arabic, Russian, Italian, Portuguese, and more. Auto-detect picks the right one.
Grammar Correction
Powered by Harper, VoxBee cleans up grammar and removes filler words (um, uh, er, ah) so your text reads naturally.
Configurable Hotkeys
Choose between Option, Fn (Globe), or Command as your push-to-talk key. Each has a companion key for hands-free mode.
Works in Every App
Text is injected wherever your cursor is — Slack, VS Code, Cursor, Notion, Notes, Terminal, or any other app.
7 Whisper Models
From Tiny (75MB, instant) to Large v3 (2.9GB, most accurate). Pick the right balance of speed and accuracy for your use case.
Your words never leave your device.
All transcription runs locally using Whisper AI models. No internet connection, no cloud processing, no data collection. Your voice stays on your Mac or Linux machine.