TRANSCRIPTION

Drop a file or paste a URL.
Get a timestamped transcript.

Drag and drop any audio file — MP3, WAV, M4A, AIFF — or paste a podcast, YouTube, or video URL. VoxBee downloads the audio, transcribes it locally with accurate timestamps, and generates an AI-powered summary with key topics, highlights, and takeaways.

VoxBee file transcription — timestamped audio transcript with drag-and-drop

See it in action

Transcribe anything, anywhere

Drag-and-Drop or File Picker

Drop any audio file — MP3, WAV, M4A, AIFF — directly into VoxBee, or use the file picker to select files.

Paste a URL from 1,800+ Sites

Paste a YouTube, podcast, Vimeo, SoundCloud, X/Twitter, or Twitch URL. VoxBee downloads the audio and transcribes it locally.

AI-Powered Summaries

After transcription, send the text to OpenAI, Anthropic, or your local Ollama server for structured summaries with key topics, highlights, and takeaways.

Timestamped Segments

Every transcript includes accurate timestamps so you can jump to specific moments in the original audio.

Smart Chunking

Long files are split into 30-second segments at silence gaps, transcribed in parallel, and stitched back together with word-level deduplication.

Export Formats

Export your transcripts as plain text (.txt), subtitles (.srt), or markdown (.md). Files are saved to ~/Documents/VoxBee/Transcriptions/.

Your files stay on your device.

All transcription runs locally using Whisper AI models. Audio files are processed on your machine and never uploaded anywhere. Your recordings remain private.

One price. Yours forever.

$39 one-time purchase. No subscriptions. No cloud fees.

See Pricing