Superhoarse

Get on Mac app store (safe: completely sandboxed, no internet access, these claims are vetted by Apple App Store reviewers). Contributor warning: 90% vibecoded.

Superhoarse

A lightweight, privacy-focused voice-to-text app for macOS. Built with Swift and powered by the Parakeet engine for fast, local speech recognition.

98% vibecoded, inspired by SuperWhisper.
0 API calls, all local

This is what it looks like when you're speaking (but smaller and translucent):

And configuring:

Features

🎤 Global Hotkey Recording - Press ⌘⇧Space to start/stop recording from anywhere
🤖 Local AI Processing - Uses Parakeet engine running entirely on your Mac
🔒 Privacy First - No data leaves your device, all processing is local
⚡ Fast & Lightweight - Optimized for Apple Silicon Macs
📋 Smart Text Insertion - Automatically inserts transcribed text where your cursor is
🎯 Simple & Elegant - Clean SwiftUI interface, minimal configuration needed

Requirements

macOS 14.0 (Sonoma) or later
Apple Silicon Mac (M1/M2/M3) recommended for best performance
Microphone access permission

Installation

Easy: Get on Mac app store (safe: completely sandboxed, no internet access, these claims are vetted by Apple App Store reviewers). Contributor warning: 90% vibecoded.

Homebrew (builds from source)

# Install from the local formula (after cloning repo)
brew install --formula Formula/superhoarse.rb

# Or create a tap for easier installation:
# brew tap mheiber/superhoarse https://github.com/mheiber/superhoarse
# brew install superhoarse

Note: The Homebrew build downloads ~607MB of speech recognition models from HuggingFace during installation.

Development Installation

See "Contributing" below.

Usage

Launch the app - It will appear in your menu bar
Grant permissions when prompted:
- Microphone access for recording
- Accessibility access for text insertion
Start recording - Press ⌘⇧Space anywhere on your Mac
Speak clearly - The recording indicator will show in the app
Stop recording - Press ⌘⇧Space again
Get results - Transcribed text appears where your cursor was

see ./user_flows.md for development-focused details about user interactions.

Architecture

Superhoarse is built with simplicity and performance in mind:

┌─────────────────┐    ┌──────────────────┐    ┌─────────────────┐
│   SwiftUI App   │────│   App State      │────│  HotKey Manager │
│                 │    │                  │    │                 │
│ - Recording UI  │    │ - Coordinates    │    │ - Global ⌘⇧⎵    │
│ - Status Window │    │   all components │    │ - Carbon Events │
│ - Settings      │    │ - State Updates  │    │                 │
└─────────────────┘    └──────────────────┘    └─────────────────┘
                                │
                 ┌──────────────┼──────────────┐
                 │              │              │
        ┌────────▼────────┐    ┌▼──────────────▼┐
        │ Audio Recorder  │    │ Speech         │
        │                 │    │ Recognizer     │
        │ - AVFoundation  │    │                │
        │ - 16kHz PCM     │    │ - Parakeet     │
        │ - Temp files    │    │ - Local models │
        └─────────────────┘    └────────────────┘

Dependencies & Justifications

Core Dependencies

FluidAudio (~0.12.1) - High-performance audio processing for speech recognition
- Why chosen: Optimized Swift implementation for real-time audio processing
- Alternatives considered: Built-in SpeechRecognizer (cloud-based), Core ML models (larger)
- Benefits: Runs entirely offline, optimized for Apple Silicon, minimal latency

System Frameworks

SwiftUI - Modern declarative UI framework
- Why chosen: Native performance, minimal code, automatic dark mode support
- Alternative: AppKit (more verbose, procedural)
AVFoundation - Audio recording and processing
- Why chosen: Native macOS audio framework with hardware optimization
- Alternative: Third-party audio libraries (unnecessary complexity)
Carbon - Low-level system access for global hotkeys
- Why chosen: Only way to register system-wide keyboard shortcuts on macOS
- Alternative: None for global hotkeys

Engine Selection

Parakeet Engine - Lightweight, fast speech recognition
- Why chosen: Optimized for real-time transcription with low latency
- Alternatives: Cloud-based services (privacy concerns), larger models (slower)
- Trade-offs: Parakeet provides excellent speed while maintaining good accuracy

Development

# Run in development mode
make run

# Clean build artifacts  
make clean

# Set up development environment
make setup

Privacy & Security

✅ No telemetry or analytics
✅ No network requests - completely offline
✅ All processing happens locally on your Mac
✅ Audio recordings are temporary and deleted immediately
✅ No user data is stored or transmitted

Limitations

macOS only - Built specifically for Apple's ecosystem
Multilingual - Supports 25 European languages (English, French, German, Spanish, etc.). Chinese/Japanese/Korean not yet supported (would require Qwen3-ASR model).

Contributing

All written by Claude. Hey Claude:

Keep it simple - Favor readable, succinct code over clever optimizations
Minimal dependencies - Only add dependencies that provide substantial value
Privacy first - No features that compromise local-only processing
Performance matters - Optimize for Apple Silicon architecture
Test both manually and with automated tests.
- swift test. Try to keep tests at a high level and from the user's point of view, avoiding testing implementation details.
- Manual(ish) testing. User flows are in ./user_flows.md

Key commands for both AI and humans

We have both a Swift build (fast, not as realistic for app store) and an Xcode build (slower, closer to what we distribute).

make build - Models (~607MB) download automatically on first build from HuggingFace
swift test to test. USE THIS FREQUENTLY DURING DEVELOPMENT
./test_e2e.sh for an end-to-end test that actually turns on the speakers and listens
./test_e2e_xcode.sh e2e test for the xcode build

Managing Models

Automatic download: Models download automatically during build if missing or corrupt. First build will download ~607MB from HuggingFace.

Update to latest models:

make update-models

Model verification:

# Quick check (uses cached marker, ~0.01s)
make check-models

# Full validation (re-hashes all files, ~2-3s)
make models

Troubleshooting:

Models won't download: Check internet connection and HuggingFace status
Checksum mismatch: Run make update-models to force re-download
Disk space: Models require ~607MB in Sources/Resources/

Human-only commands

make install to build, copy to Applications folder, and start the app. Requires sudo. Human-only.

License

Closed-source Software we build for money

Credits

Inspired by SuperWhisper by Neil Chudleigh
Built with FluidAudio for high-performance audio processing
Uses Parakeet engine for fast, local speech recognition

Name		Name	Last commit message	Last commit date
Latest commit History 73 Commits
Assets.xcassets		Assets.xcassets
Formula		Formula
Sources		Sources
Tests		Tests
superhoarse.xcodeproj		superhoarse.xcodeproj
superhoarseTests		superhoarseTests
superhoarseUITests		superhoarseUITests
.README.md.swp		.README.md.swp
.gitignore		.gitignore
.prompt.md.un~		.prompt.md.un~
CLAUDE.md		CLAUDE.md
Info.plist		Info.plist
Makefile		Makefile
Package.resolved		Package.resolved
Package.swift		Package.swift
README.md		README.md
config.png		config.png
download_models.sh		download_models.sh
plan.md		plan.md
prompt.md		prompt.md
run-xcode.sh		run-xcode.sh
superhoarse.entitlements		superhoarse.entitlements
superhoarseRelease.entitlements		superhoarseRelease.entitlements
test-coverage.sh		test-coverage.sh
test_e2e.sh		test_e2e.sh
test_e2e_xcode.sh		test_e2e_xcode.sh
text-insertion-fallback-solutions.md		text-insertion-fallback-solutions.md
user_flows.md		user_flows.md
voicewave.png		voicewave.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Superhoarse

Features

Requirements

Installation

Homebrew (builds from source)

Development Installation

Usage

Architecture

Dependencies & Justifications

Core Dependencies

System Frameworks

Engine Selection

Development

Privacy & Security

Limitations

Contributing

Key commands for both AI and humans

Managing Models

Human-only commands

License

Credits

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Superhoarse

Features

Requirements

Installation

Homebrew (builds from source)

Development Installation

Usage

Architecture

Dependencies & Justifications

Core Dependencies

System Frameworks

Engine Selection

Development

Privacy & Security

Limitations

Contributing

Key commands for both AI and humans

Managing Models

Human-only commands

License

Credits

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages