A simple, cross-platform application for generating text-to-speech audio using Kokoro with an intuitive graphical interface.
Kokoro TTS GUI simplifies text-to-speech generation by providing:
- Project-based organization for your voice content
- Real-time audio generation and playback
- Customizable voice parameters (speed, voice type)
- Simple interface built with ImGui and GLFW
- Project Management: Organize text content into sections and projects
- Voice Customization: Choose from 50+ voices and adjust speech speed
- Audio Workflow: Generate → Preview → Export
- Cross-Platform Support: Linux (Windows coming soon)
- One-Click Setup: Automatic dependency installation
- Git (with submodule support)
- C++20 compatible compiler
- Python 3+
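Before running the setup script, it can help to confirm that the prerequisites above are present. The following is a minimal sketch, not part of the project: the function name `check_prerequisites` and the compiler names it looks for (`g++`, `clang++`, `c++`) are assumptions, and it only checks that a compiler exists on the PATH, not that it supports C++20.

```python
import shutil
import sys


def check_prerequisites():
    """Report which of the listed prerequisites appear to be available."""
    return {
        "git": shutil.which("git") is not None,
        # Presence only: this does not verify C++20 support.
        "c++ compiler": any(
            shutil.which(name) is not None for name in ("g++", "clang++", "c++")
        ),
        "python3": sys.version_info >= (3,),
    }


if __name__ == "__main__":
    for name, found in check_prerequisites().items():
        print(f"{name}: {'found' if found else 'MISSING'}")
```

Submodule support ships with standard Git, so no separate check is made for it here.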
Run the setup script to install dependencies and configure the environment:
python3 setup.py
The script will:
- Create Python virtual environment
- Install Kokoro dependencies
- Download voice models (800MB)
- Configure build system
- Open project in VSCode
- Build: Ctrl+Shift+B
- Run: F5 (will rebuild)
On first launch (which may take several minutes), the application shows an initialization screen while it downloads the models and voices.
cd TTS_app
.vscode/build.sh
make -j
bin/Debug-linux-x86_64/TTS_app/TTS_app
- Sidebar (left): Application controls
- Settings Panel: Configure voice parameters
- Project Panel: Manage projects
- Content Area (center): Manage text sections
- Add Sections: Click "Add Section" to create content groups
- Enter Text: Type/paste text into input fields
- Generate Audio: Click → next to the text field
- Preview: Click 🔊 to hear generated audio
- Export: Audio files are saved to the audio/ directory
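To see what has been exported so far, the output directory can be inspected with a short script. This is a sketch under assumptions: it assumes exports are WAV files with a `.wav` extension stored directly in `audio/`, and `list_exports` is a hypothetical helper, not something the application provides.

```python
import wave
from pathlib import Path


def list_exports(audio_dir="audio"):
    """Return (file name, duration in seconds) for each WAV file in audio_dir."""
    directory = Path(audio_dir)
    if not directory.is_dir():
        return []
    results = []
    for path in sorted(directory.glob("*.wav")):
        with wave.open(str(path), "rb") as wav:
            duration = wav.getnframes() / wav.getframerate()
        results.append((path.name, round(duration, 2)))
    return results


if __name__ == "__main__":
    for name, seconds in list_exports():
        print(f"{name}: {seconds:.2f} s")
```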
| Button | Function |
|---|---|
| → | Generate audio from text |
| 🔊 | Play generated audio |
| | Stop playback |
| | Voice configuration |
| | Project management |
Modify these settings via the settings panel (gear icon):
- Voice Type: 50+ options (e.g., am_onyx, bf_emma)
- Speed: 0.5x-2.0x normal speech rate
- Audio Format: WAV (16-bit PCM)
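The constraints in this list can be expressed as a small validation sketch. Everything named here is hypothetical: `VoiceSettings`, `MIN_SPEED`, `MAX_SPEED`, and `is_16bit_pcm_wav` are illustrative and may not match how the application stores or checks its settings. Clamping out-of-range speeds (rather than rejecting them) is also an assumption.

```python
import wave
from dataclasses import dataclass

MIN_SPEED = 0.5  # lower bound from the settings list
MAX_SPEED = 2.0  # upper bound from the settings list


@dataclass
class VoiceSettings:
    """Hypothetical settings container, for illustration only."""
    voice: str = "am_onyx"
    speed: float = 1.0

    def __post_init__(self):
        # Clamp so that out-of-range input still yields a usable value.
        self.speed = min(MAX_SPEED, max(MIN_SPEED, float(self.speed)))


def is_16bit_pcm_wav(path):
    """Return True if path is a readable WAV file with 2-byte (16-bit) samples."""
    try:
        with wave.open(str(path), "rb") as wav:
            return wav.getsampwidth() == 2
    except (wave.Error, EOFError, OSError):
        return False
```

For example, `VoiceSettings(voice="bf_emma", speed=3.0)` ends up with a speed of 2.0, the upper limit.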
Problem: Audio playback fails
Solution: Install system audio packages:
sudo apt install alsa-utils mpg123 # Ubuntu/Debian
sudo dnf install alsa-utils mpg123 # Fedora
Problem: Python module errors
Solution: Re-run setup script:
python3 setup.py
Contributions welcome! Please:
- Fork the repository
- Follow existing C++20 coding style
- Test on Linux systems
- Update documentation for new features


