Private interview assistant and bilingual live subtitle overlay for macOS.
v2s helps you follow interviews, meetings, calls, streams, and videos with bilingual subtitles, multi-source audio capture, and an optional GPT assistant that can use transcript history, screen OCR, and current-screen context when you explicitly ask for help.
- Use a discreet macOS overlay to keep interview or meeting context visible while staying in your current app.
- Follow live conversations with bilingual subtitles, including original and translated text.
- Capture from one or more microphones and running macOS apps instead of your entire system mix.
- Ask GPT for a follow-up, explanation, or likely answer using the conversation history and visible screen context.
- Keep a lightweight menu bar workflow instead of juggling browser tabs or full-screen caption apps.
- Menu bar app built for always-available subtitle access.
- Live subtitle overlay with translated text on the first line and source text on the second.
- Multi-source audio capture for microphones and running macOS apps.
- Per-source language selection for bilingual or mixed-language conversations.
- On-device speech transcription powered by Apple Speech.
- On-device translation powered by Apple Translation.
- Transcript summarization powered by Apple Intelligence for quick overview of conversations.
- GPT interview assistant with Follow Up and Ask actions, global hotkeys, configurable model, API key, and API base URL.
- Screen context support through macOS screenshot capture plus Vision OCR; text-only fallback is used when a provider rejects images.
- Scrollable subtitle and GPT reply modes with a hotkey to switch between them.
- Privacy mode for the overlay and settings windows so they can be hidden from screenshots, screen recording, and screen sharing.
- Single-instance launch guard: opening v2s again wakes the existing window, or restarts it if the old instance is unresponsive.
- Overlay styling controls so the subtitle bar stays readable on top of real work.
- No built-in account, cloud backend, analytics, or telemetry.
- Audio transcription and translation are handled through Apple's system frameworks.
- Translation uses Apple's on-device Translation framework. Some language packs may need to be downloaded first through System Settings.
- Speech recognition uses Apple's on-device Speech framework. Some locales may fall back to Apple's servers unless on-device recognition is explicitly configured.
- GPT features are optional and user configured. When you press Follow Up or Ask, v2s sends the prior transcript, timestamps, conversation/source name, skills prompt, OCR text, and, when available and supported by the provider, a current-screen screenshot to your configured API endpoint.
- If screen capture permission is missing or the selected API/model does not support images, v2s falls back to text and OCR context and shows an in-overlay warning indicator.
- Download the latest
.app.zipfrom Releases. - Unzip and move
v2s.appto your Applications folder. - Launch v2s — it appears as an icon in your menu bar.
- Select an input source (a running app or microphone).
- Choose your input and subtitle languages.
- Click Start.
- Optional: configure GPT API settings, skills, and hotkeys for interview assistance.
v2s will ask for permissions on first use:
- Speech Recognition — to transcribe audio into text.
- Microphone — when using a microphone as the input source.
- Audio Capture — when capturing audio from another app.
- Screen Capture — only when you ask GPT to use visible screen context.
- Translation requires macOS 26 or newer
git clone https://github.com/NX-lite/v2s.git
cd v2s
open v2s.xcodeprojOr from the terminal:
xcodebuild -project v2s.xcodeproj -scheme v2s -configuration Debug buildMIT

