Kokoro TTS GUI

A simple, cross-platform application for generating text-to-speech audio using Kokoro with an intuitive graphical interface.

1. Overview

Kokoro TTS GUI simplifies text-to-speech generation by providing:

Project-based organization for your voice content
Real-time audio generation and playback
Customizable voice parameters (speed, voice type)
Simple interface built with ImGui and GLFW

2. Features

Project Management: Organize text content into sections and projects
Voice Customization: Choose from 50+ voices and adjust speech speed
Audio Workflow: Generate → Preview → Export workflow
Cross-Platform Support: Linux (Windows coming soon)
One-Click Setup: Automatic dependency installation

3. Prerequisites

Git (with submodule support)
C++20 compatible compiler
Python 3+

4. Installation & Setup

Automated Setup (Recommended)

Run the setup script to install dependencies and configure the environment:

python3 setup.py

The script will:

Create Python virtual environment
Install Kokoro dependencies
Download voice models (800MB)
Configure build system

5. Building the Application

VSCode (Recommended)

Open project in VSCode
Build: Ctrl+Shift+B
Run: F5 (will rebuild)

During the first time running the application (which may take several minutes), you'll see this initialization screen while downloading the models and voices:

Manual Build

cd TTS_app
.vscode/build.sh
make -j
bin/Debug-linux-x86_64/TTS_app/TTS_app

6. Usage Guide

Interface Overview

Sidebar (left): Application controls
- Settings Panel: Configure voice parameters
- Project Panel: Mange projects
Content Area (center): Manage text sections

Basic Workflow

Add Sections: Click "Add Section" to create content groups
Enter Text: Type/paste text into input fields
Generate Audio: Click → next to text field
Preview: Click 🔊 to hear generated audio
Export: Audio files save to audio/ directory

Key Controls

Button	Function
	Generate audio from text
	Play generated audio
	Stop playback
	Voice configuration
	Project management

7. Customization

Modify these settings via the settings panel (gear icon):

Voice Type: 50+ options (e.g., am_onyx, bf_emma)
Speed: 0.5x-2.0x normal speech rate
Audio Format: WAV (16-bit PCM)

8. Troubleshooting

Problem: Audio playback fails
Solution: Install system audio packages:

sudo apt install alsa-utils mpg123     # Ubuntu/Debian
sudo dnf install alsa-utils mpg123     # Fedora

Problem: Python module errors
Solution: Re-run setup script:

python3 setup.py

9. Contributing

Contributions welcome! Please:

Fork the repository
Follow existing C++20 coding style
Test on Linux systems
Update documentation for new features

10. License

Apache License

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
.github/resources		.github/resources
assets		assets
config		config
kokoro		kokoro
scripts		scripts
src		src
vendor		vendor
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
dependencies.lua		dependencies.lua
premake5.lua		premake5.lua
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Kokoro TTS GUI

1. Overview

2. Features

3. Prerequisites

4. Installation & Setup

Automated Setup (Recommended)

5. Building the Application

VSCode (Recommended)

Manual Build

6. Usage Guide

Interface Overview

Basic Workflow

Key Controls

7. Customization

8. Troubleshooting

9. Contributing

10. License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Kokoro TTS GUI

1. Overview

2. Features

3. Prerequisites

4. Installation & Setup

Automated Setup (Recommended)

5. Building the Application

VSCode (Recommended)

Manual Build

6. Usage Guide

Interface Overview

Basic Workflow

Key Controls

7. Customization

8. Troubleshooting

9. Contributing

10. License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages