Changelog

New updates and improvements to VoxBee.

v0.7.0

Google Drive Sync

Added

  • Google Drive sync for meetings, transcriptions, and notes — connect once from AI & Summaries settings and VoxBee mirrors your local library to a folder it creates in your own Drive. Scope is limited to files this app creates (drive.file), syncs run automatically after recording, editing, re-transcribing, or saving notes, and you can drag the VoxBee folder anywhere in your Drive without breaking sync.
  • Horizontal swipe navigation with page dots on the What's New sheet — drag between feature cards instead of scrolling.

Fixed

  • Long Ollama summaries no longer fail with a 60s timeout. Local Ollama now streams tokens with a per-token idle window, so cold model loads and slow generations on big meetings complete instead of being cut off mid-summary. Cloud providers keep their fast-fail 60s timeout.

v0.6.6

Your Dictation Year at a Glance

Added

  • GitHub-style usage graph on the home tab — a 52-week heatmap of daily dictation activity, with cell intensity driven by word count. Page back through prior 52-week windows with the ◀ ▶ buttons.
  • Blinking red dot on the menu bar icon whenever a meeting is being recorded, so it's always clear when the mic and system audio are being captured.

Fixed

  • Push-to-talk no longer gets stuck "recording" when Mission Control swallows the modifier-up event for Ctrl+Arrow space-switching chords — the hotkey manager now reconciles modifier state every 250ms and recovers automatically.

v0.6.5

Inline Re-Transcribe & Re-Analyze

Added

  • Re-transcribe and Re-analyze buttons on the transcription detail header — no more right-click required to re-run transcription or LLM analysis. The right-click menu still offers the granular Re-transcribe as / Re-analyze as / Remove options.
  • Pointing-hand cursor on action buttons across Transcribe (Re-transcribe, Re-analyze, Identify Speakers, Copy, Export) and Meetings (Re-summarize, Re-transcribe, Identify Speakers, Copy summary) for clearer affordance.

v0.6.4

Apple Intelligence Diagnostics

Changed

  • Auto-format provider row now surfaces the specific Apple Intelligence availability reason (Preparing, Setup needed, Unavailable) instead of a generic "Setup needed" — so you can see exactly why on-device auto-format is gated off on a given Mac.
  • Logs the Foundation Models availability case once at app launch to help diagnose installs where auto-format mysteriously stays disabled.

v0.6.3

On-Device Auto-Format & Summary Reliability

Added

  • On-device auto-format for dictation (Beta) — on macOS 26 with Apple Intelligence enabled, Apple Foundation Models add punctuation, capitalization, and detect list cues locally before text is injected. Off by default; enable from Speech settings.
  • X and LinkedIn brand links in the sidebar below Settings.

Fixed

  • Summary reliability — Ollama is now preflight-checked before a run, long transcripts are chunked so they fit provider context windows, and the chosen summary model is remembered per provider instead of being reset when switching.
  • Auto-format now rejects all-caps output from Apple Foundation Models and falls back to the raw transcription, so dictation never lands in shouty text.
  • Auto-format no longer holds dictation hostage when the on-device provider stalls — the timeout now releases the caller immediately and falls back to raw text.

v0.6.2

More Cloud Providers & Sharper Summaries

Added

  • Five new cloud speech-to-text providers — Mistral Voxtral, Cohere Transcribe, Speechmatics, Alibaba Qwen3-ASR, and Soniox. Bring your own API key from Cloud Speech settings; the purple cloud badge stays visible whenever audio leaves the device.

Changed

  • Meeting summary prompt sharpened to favor concrete details — names, numbers, decisions, and action items — instead of vague recaps.

v0.6.1

xAI Grok Speech-to-Text

Added

  • xAI Grok (grok-stt) as a new cloud speech-to-text provider — bring your own xAI API key from Cloud Speech settings to dictate through Grok's hosted endpoint, with smart formatting enabled by default.

v0.6.0

Cloud Speech Models & Reliability

Added

  • Cloud speech-to-text providers — pick from OpenAI (gpt-4o-transcribe, whisper-1, plus a diarized variant), Deepgram (nova-3), AssemblyAI (universal-3-pro), ElevenLabs (scribe_v2), and Groq (whisper-large-v3 / turbo). Bring your own API key from AI & Speech settings; a purple cloud badge stays visible whenever audio is leaving the device.
  • Fail-fast preflight before every recording — missing transcription models, summarization API keys, mic permission, and screen recording permission are now caught up front with one-click fixes (Open Settings, Choose Model, Open API Key page) instead of failing mid-pipeline.
  • Structured, actionable error banners across meetings and file transcription — invalid API key, out-of-credits, model-not-found, and offline Ollama all surface human messages with primary and secondary recovery actions.
  • Direct "Get API Key" links in Cloud Speech settings that jump to each provider's API key dashboard.

Changed

  • Meeting pipeline telemetry now emits paired started/completed (or failed) events for transcription, diarization, and summary stages, so dashboards can detect hangs, sleep/wake stalls, and crashes between stages.
  • Cloud transcription requests retry on rate limits, server errors, and transient network failures with exponential backoff (up to 3 attempts), while user-actionable errors skip retry and surface immediately.

v0.5.7

Meeting Error UX & Transcript Polish

Added

  • Stage-checkpointed meeting pipeline — transcription, diarization, and summary progress now survive app crashes and relaunches, resuming at the first incomplete step instead of starting over.
  • Actionable error banners for meeting summaries — missing API keys, out-of-credits, invalid models, and offline Ollama now surface a human message with one-click retry or open-settings, replacing raw API error text.
  • Meeting summary retry and transcript timeline view, so you can re-run a failed summary or jump through the transcript by speaker turn.
  • Inline markdown rendering (links, italics, inline code) inside meeting and note summaries.

Fixed

  • Clipboard is now restored after dictation in Electron apps and browsers where Accessibility detection is inconclusive — your previous clipboard content comes back instead of being replaced by the transcription.

v0.5.6

Fewer Dictionary False Positives

Fixed

  • Personal Dictionary corrections are now much less aggressive on common English words — a new phonetic validation step plus consonant-group-normalized edit distance prevents words like "open" from being rewritten to brand-name dictionary entries.

v0.5.5

Smarter Text Injection

Fixed

  • Text injection now always fires Cmd+V paste regardless of text field detection, preventing silent failures in Electron apps and web inputs
  • Tri-state text field detection — clipboard is restored only when a text field is confirmed, and a helpful toast appears only when no text field is detected

v0.5.4

OpenRouter & Clipboard Safety Net

Added

  • OpenRouter as an LLM API provider for meeting summaries and note transformations
  • Transcribed text now stays on the clipboard when the paste target has no text field, so you can manually paste it

v0.5.3

Meeting Transcripts & Speaker Diarization

Added

  • Speaker diarization for meetings (Beta) — on-device speaker detection identifies who said what using NVIDIA Sortformer
  • Full meeting transcript display with speaker-labeled segments
  • Word wrap and copy button on transcription history rows
  • In-app review prompts to help us improve VoxBee

Fixed

  • Replaced misleading Export button with copy feedback in the meetings tab
  • Fixed a crash when stopping audio recording if no audio tap was installed

v0.5.2

Parakeet Recommended Tags

Changed

  • Parakeet models are now prioritized in the model picker with a recommended tag

v0.5.0

Parakeet Models & Guided Language Setup

Added

  • NVIDIA Parakeet speech models via FluidAudio, including fast English models and a multilingual European model option
  • Acoustic vocabulary biasing for Parakeet live dictation so personal dictionary terms are recognized more reliably
  • Guided language picker with compatible model recommendations, download flow, and mismatch warnings when the active model cannot serve the selected language

Fixed

  • Recording now falls back to the built-in microphone if the default input device hangs after sleep/wake or Bluetooth audio issues

v0.4.1

Overlay Feedback & Smart Paste Fix

Added

  • Error feedback on recording overlay when transcription returns empty results

Fixed

  • Screenshot smart paste now works correctly in Codex and other rich-content editors

v0.4.0

Notes, Screenshots & What's New

Added

  • Notes tab — voice-driven scratch pad with LLM template transformations for reformatting, summarizing, and editing dictated text
  • Hands-free screenshot capture during dictation — capture screen regions without interrupting your recording
  • What's New sheet shown once after each app update with release highlights
  • Drop zone in media transcription is now clickable to open the file picker directly

Fixed

  • What's New video now renders correctly in the looping video player

v0.3.10

UI Redesign & Download Reliability

Changed

  • Redesigned UI with refined color palette, glass-effect cards, and polished interactions throughout the app

Fixed

  • Robust first-time model download experience with clear state machine transitions and better error recovery
  • Screen Recording permission banner now shown when meeting auto-detect is enabled

v0.3.9

Permissions & Recovery

Added

  • Missing permissions banner on all main pages when microphone, accessibility, or screen recording access is needed
  • App version number displayed in the About settings tab

Fixed

  • Transcriptions and meetings now recovered from disk on fresh install or data reset
  • Microphone permission prompt now correctly opens System Settings when previously denied
  • Hotkey monitors now properly reinstall after accessibility permission is granted
  • Stale accessibility permission polling no longer continues after permissions are granted
  • Auto-update downloads no longer serve stale cached versions

v0.3.8

Automatic Meeting Detection

Added

  • Automatic meeting detection — VoxBee detects when you join Zoom, Teams, FaceTime, Google Meet, or Webex and offers to record
  • Scratchpad and collapsible history sections restored on the Home page

Changed

  • History page now lazy-loads by day groups for faster performance
  • Meetings feature is no longer in beta

Fixed

  • Reduced false positives in meeting detection for browser-based calls

v0.3.7

Control Key Hotkey & License Fix

Added

  • Control (⌃) as a third hotkey modifier option — hold Control to dictate, Control+Shift for hands-free mode

Fixed

  • Simplified license keychain storage, fixing intermittent activation failures
  • Reduced unnecessary keychain writes that could trigger repeated system prompts

v0.3.6

Download Reliability

Changed

  • App downloads now served from a dedicated custom domain for improved reliability

v0.3.5

Activation Help & Keychain Fix

Added

  • Support email links in the license activation screen for easier help access

Fixed

  • Fixed license access check failing after keychain migration in v0.3.4

v0.3.4

Keychain Permissions Fix

Fixed

  • Eliminated repeated keychain permission prompts by migrating to Data Protection Keychain

v0.3.3

Frosted Obsidian & Reliability

Changed

  • "Frosted Obsidian" visual refresh — dark glass-inspired UI across the entire app
  • Faster first dictation — grammar engine now pre-warms on app launch

Added

  • Text selection on transcription history and list item titles

Fixed

  • Fixed license activation failing after reinstall or app update from a different build
  • Fixed double-click rename not working on meeting titles
  • Fixed transcriptions returning empty text when WhisperKit produces incomplete audio segments
  • Fixed file transcriptions not saving immediately, risking data loss on unexpected quit

v0.3.2

Transcription Reliability & Security

Fixed

  • Fixed transcription returning empty results when personal dictionary words are active — now automatically retries without dictionary biasing
  • Fixed Ollama host URL validation to reject non-HTTP schemes that could leak transcript data

v0.3.1

Personal Dictionary & Settings Redesign

Added

  • Personal dictionary — add custom words and aliases so VoxBee recognizes domain-specific vocabulary (names, jargon, brand terms) with phonetic matching

Changed

  • Redesigned settings with a 4-tab horizontal layout (Speech, AI & Summaries, Appearance, About) replacing the single scrollable view

Fixed

  • Fixed silence detection threshold rejecting normal speech as empty audio
  • Fixed temp file cleanup deleting files that belong to active transcription entries

v0.3.0

Media Transcription & Intelligence

Added

  • Media transcription — import audio/video files or paste a URL to transcribe with AI-powered analysis (summaries, key topics, action items)
  • Automatic disk export for transcriptions and meeting summaries to ~/Documents/VoxBee/
  • Update-available badges on menu bar icon, Dock icon, and dropdown menu

Changed

  • Redesigned Transcribe UI with import wizard and rich summary cards
  • LLM API key is no longer required for transcription — analysis gracefully skips if no key is configured

Fixed

  • Fixed duplicate category label in feedback form

v0.2.1

In-App Feedback

Added

  • In-app feedback form — submit bug reports and feature requests directly from VoxBee

v0.2.0

Meeting Recording & Summaries

Added

  • Meeting recording — capture system audio and mic from video calls, then get an automatic AI-powered summary
  • Structured meeting summaries with one-liner, discussion topics, action items, and key decisions
  • Inline meeting summary editing with auto-save
  • Meeting rename for easy organization
  • Multiple LLM provider options for summarization (OpenAI, Anthropic, Ollama)
  • Configurable summary storage folder in Settings

Fixed

  • Fixed system audio capture failing silently on some devices
  • Fixed silence detection in meeting recordings
  • Fixed a crash when stopping audio recording due to engine teardown race condition

v0.1.17

Audio File Transcription

Added

  • Audio file transcription — drag and drop MP3, WAV, AIFF, or any audio file to get a full transcript
  • Smart chunking splits long recordings at natural silence boundaries for accurate results
  • Timestamped transcript view with per-segment inline editing
  • Export transcripts as plain text (.txt) or SRT subtitles (.srt)
  • File queue with real-time progress tracking — process multiple files back-to-back
  • Optional grammar correction and filler word removal for file transcriptions
  • Telemetry opt-out toggle in Settings

Fixed

  • Fixed waveform overlay not appearing on wired microphones
  • Fixed audio latency with external devices like AirPods
  • Fixed text clipping in transcript detail view

v0.1.16

Multi-Language Support

Added

  • Support for 12 languages — English, Hindi, Spanish, French, German, Portuguese, Chinese, Japanese, Korean, Arabic, Russian, and Italian
  • Language quick-switcher on the main screen for fast switching between languages
  • Intelligent auto-switching to a compatible model when you change languages
  • Model/language mismatch warning with a quick "Download Model" action
  • Silence detection to prevent hallucinated transcriptions on empty recordings
  • Privacy policy page on the website

Changed

  • Grammar correction is now automatically disabled for non-English languages
  • Accurate word counting for CJK and other non-Latin scripts

v0.1.15

Reliability & Analytics

Added

  • App analytics with PostHog to help us understand usage and improve VoxBee

Fixed

  • Fixed app becoming unresponsive during long transcriptions — added 60-second timeout with automatic recovery
  • Fixed stuck recording state after sleep/wake or screen lock — stale hotkey presses are now detected and cleared

v0.1.14

Bug Fixes & Improvements

Fixed

  • Fixed a crash in the grammar correction engine when processing long transcriptions

v0.1.13

Grammar Correction

Added

  • Automatic grammar and punctuation correction powered by Harper — fixes capitalization, missing periods, and common grammar mistakes in transcriptions
  • Filler word removal — automatically strips "um", "uh", "er", and word stutters from transcriptions
  • Toggle grammar correction on/off in Settings

v0.1.12

Menu Bar & Dock Polish

Changed

  • Cleaner menu bar layout — removed CPU icon and aligned text padding

Fixed

  • Dock icon lifecycle bugs when opening and closing the main window
  • Menu bar icon sizing to match macOS conventions

v0.1.11

Free Trial

Added

  • 14-day free trial — no account required, start using VoxBee immediately

v0.1.10

Bug Fixes & Improvements

Fixed

  • Bug fixes and improvements to model loading reliability on startup

v0.1.9

License Activation

Added

  • License activation system — enter your license key to unlock VoxBee
  • License management in Settings with deactivate option

v0.1.8

Bug Fixes & Improvements

Fixed

  • Bug fixes and improvements to update toast reliability

v0.1.7

Bug Fixes & Improvements

Fixed

  • Bug fixes and improvements to update toast reliability

v0.1.6

Bug Fixes & Improvements

Fixed

  • Bug fixes and improvements to update check UX

v0.1.5

In-App Update Notifications

Added

  • In-app update notifications with download progress

v0.1.4

Dictation Stats

Added

  • Dictation statistics on home screen — total words, duration, and average WPM

v0.1.3

Bug Fixes & Improvements

Added

  • Check for Updates button in Settings and app menu

v0.1.2

Bug Fixes & Improvements

Fixed

  • Bug fixes and improvements to installer and auto-update pipeline

v0.1.1

Bug Fixes & Improvements

Changed

  • Default Whisper model changed to small.en for better accuracy
  • Fewer permissions required — removed Input Monitoring requirement

v0.1.0

Initial Release

Added

  • Hold-to-talk dictation — hold Option or Fn to record, release to transcribe
  • On-device transcription powered by WhisperKit (fully offline)
  • 7 Whisper model options from tiny to large-v3-turbo
  • Customizable hotkey system — Option or Fn (Globe) modifier
  • Hands-free dictation mode with companion key toggle
  • 8 color themes and 8 animated waveform styles
  • Light, Dark, and System appearance modes
  • Transcription history with word count and WPM stats
  • Dynamic Dock icon matching selected color theme
  • Floating overlay panel during recording (never steals focus)
  • Automatic updates — get notified when new versions are available
  • Automatic trailing space after punctuation in transcribed text