SPEECH-TO-TEXT

Hold a key. Speak. Release.
Text appears where your cursor is.

Push-to-talk dictation that works in every app. VoxBee captures your voice, transcribes it on-device using Whisper and NVIDIA Parakeet models, and injects the text wherever you're typing — no copy-paste needed.

VoxBee dashboard showing dictation stats, waveform controls, and scratchpad notes

How it works

1

Hold your hotkey

Option, Fn, or Control

2

Speak naturally

In 30 supported languages

3

Release to transcribe

Text appears at your cursor

Everything you need for voice typing

Push-to-Talk & Hands-Free

Hold Option, Fn, or Control to record, release to transcribe. Or use hands-free mode — tap the hotkey combo to start, tap again to stop.

30 Languages + Guided Setup

Choose from 30 supported languages. VoxBee recommends a compatible model, offers one-click downloads, and warns you if the active model cannot serve your language.

Grammar Correction

Powered by Harper, VoxBee cleans up grammar and removes filler words (um, uh, er, ah) so your text reads naturally.

Configurable Hotkeys

Choose between Option, Fn (Globe), or Control (⌃) as your push-to-talk key. Each has a companion key for hands-free mode.

Works in Every App

Text is injected wherever your cursor is — Slack, VS Code, Cursor, Notion, Notes, Terminal, or any other app.

10 On-Device Models

Choose between 7 Whisper models and 3 NVIDIA Parakeet models, including fast English options and multilingual European support.

11 Cloud Providers (BYO Key)

Prefer a hosted model? Plug in your own key for OpenAI (gpt-4o-transcribe, whisper-1), Deepgram nova-3, AssemblyAI universal-3-pro, ElevenLabs scribe_v2, Groq whisper-large-v3, xAI Grok, Mistral Voxtral, Cohere Transcribe, Speechmatics, Alibaba Qwen3-ASR, or Soniox. A persistent purple cloud badge shows whenever audio leaves the device.

On-Device Auto-Format (Beta)

On macOS 26 with Apple Intelligence, an on-device Apple Foundation Models pass adds punctuation, capitalization, and list cues before injecting your text. Local, off by default, falls back to raw transcription if the model stalls.

Voice Notes

Dictate into a scratch pad, then transform with AI. Turn voice notes into emails, meeting notes, to-do lists, blog posts, and more with 8 built-in templates.

VoxBee voice notes view showing scratchpad text and AI transformation templates

Screenshot Smart Paste

Capture a screenshot while dictating — drag to select any region. VoxBee automatically pastes the image into 20+ apps alongside your text.

On-device by default.

Dictation runs locally with Whisper or NVIDIA Parakeet on your Mac or Linux machine — no internet, no cloud, no data collection. If you opt into a cloud provider with your own API key, a persistent purple cloud badge shows whenever audio leaves your device, so you always know where your voice is going.

One price. Yours forever.

$49 one-time purchase. No subscriptions. No cloud fees.

See Pricing