CGC – Chess Game Content

Hero-perspective board
Kokoro TTS narration (hexgrad/Kokoro-82M, voice af_nicole, American English)
Word-level karaoke subtitles (ASS hard-burned)
Continuous perimeter progress bar
Final-board outro with fade to black

1. First run

On a fresh machine you need to:

Install dependencies:

uv sync

(Optional but recommended) Preload Kokoro into the project-local HF cache:

uv run python -m cgc.tools.bootstrap_models

This downloads hexgrad/Kokoro-82M once into:

cgc/.hf_cache/models--hexgrad--Kokoro-82M/...

After that, Hugging Face will reuse the cached model for all runs.

You can also skip the bootstrap step; the first --no-fake-tts pipeline run will trigger the same download automatically.

2. Requirements

Python 3.11+
uv installed
ffmpeg available on your PATH
Internet access at least once to download Kokoro (either via bootstrap or first real-TTS run)

Models and alignment assets are cached under cgc/.hf_cache via Hugging Face Hub.

3. Running the pipeline

All examples below use --device cpu.

3.1 Fast run (fake TTS, fake alignment)

For quick visual checks:

uv run python -m cgc.pipeline scripts/game.yaml --device cpu

3.2 Real TTS (Kokoro), fake alignment

Recommended normal run:

uv run python -m cgc.pipeline scripts/game.yaml --device cpu --no-fake-tts

TTS: Kokoro KPipeline
- Repo: hexgrad/Kokoro-82M
- Voice: af_nicole — see available voices
- Lang: "a" (American English)
- Speed: 1.0
Alignment: fake (no WhisperX).

3.3 Real TTS + real WhisperX alignment (CPU)

When you want real word timings:

uv run python -m cgc.pipeline scripts/game.yaml --device cpu --no-fake-tts --no-fake-alignment

WhisperX models are cached under cgc/.hf_cache/alignment_models.

4. Story scripts (YAML)

To build a video, CGC needs a YAML story script under scripts/.

Flow:

Lichess game URL → YAML story script → scripts/ → pipeline → MP4

Example: scripts/game.yaml

version: 1

source:
  type: lichess_url
  value: "https://lichess.org/anonymousGameId"

meta:
  voice: cinematic-bullet
  perspective: black

cards:
  - id: intro-1
    type: text
    duration: 2.8
    lines:
      - "They were outrated."
      - "The clock was against them."
      - "Nobody expected an upset."

  - id: opening-1
    type: board
    ply: 1
    role: opening
    duration: 2.1
    highlight:
      mode: last_move
    lines:
      - "A sharp opening choice."
      - "No room for quiet play."
      - "From move one, both sides were fighting."

  # ... midgame cards ...

  - id: finish-1
    type: board
    ply: 66
    role: finish
    duration: 2.1
    highlight:
      mode: last_move
    lines:
      - "One final precise move."
      - "And everything collapsed."

  - id: outro-1
    type: text
    duration: 2.8
    lines:
      - "One game."
      - "One chance."
      - "Complete domination."

Notes:

Put scripts under scripts/, e.g. scripts/game.yaml, scripts/other_game.yaml.
source.value must be a full Lichess game URL.
meta.voice is a style label (currently mapped to Kokoro’s af_nicole in code).
meta.perspective controls hero side (white or black) and board flipping.
cards:
- type: board + ply show specific positions; highlight.mode: last_move is supported.
- type: text is narration-only:
  - intro-* ids show the starting position.
  - outro-* ids show the final position.

To render a different script:

uv run python -m cgc.pipeline scripts/other_game.yaml --device cpu --no-fake-tts

5. Outputs

After a run you’ll see:

Story JSON: output/story/<game_id>.json
Frames: output/frames/<game_id>_XX_<scene_id>.png
Subtitles (ASS): output/subtitles/<game_id>.ass
Audio clips: audio/clips/<game_id>/...
Merged audio: audio/merged/<game_id>.wav
Final video: output/video/<game_id>.mp4

6. GPU acceleration (optional)

CGC runs fully on CPU, but TTS + alignment are much faster on a CUDA GPU.

6.1 Torch with CUDA (via uv)

This project uses uv for environment management. To install a CUDA-enabled PyTorch build:

Go to the official PyTorch “Get Started” page:
https://pytorch.org/get-started/locally/
Select:
- PyTorch build: Stable
- Your OS
- Package: pip
- Compute platform: your CUDA version (e.g. CUDA 12.6)

Copy the recommended install command. For CUDA 12.6 it looks like:

pip3 install torch --index-url https://download.pytorch.org/whl/cu126

Translate that into pyproject.toml + uv:

[project]
name = "cgc"
version = "0.1.0"
requires-python = ">=3.11"
dependencies = [
  "python-chess",
  "requests",
  "pyyaml",
  "numpy",
  "soundfile",
  "imageio[ffmpeg]",
  "cairosvg",
  "kokoro>=0.9.2",
  "misaki[en]",
  "whisperx==3.7.1",
  "torch",
]

[tool.uv]
index = [
  { name = "pytorch-cu126", url = "https://download.pytorch.org/whl/cu126", explicit = true },
]

[tool.uv.sources]
torch = { index = "pytorch-cu126" }

Sync and verify:

uv sync
uv run python -c "import torch; print(torch.__version__, torch.cuda.is_available())"

You should see something like:

2.11.0+cu126 True

6.2 Running the GPU pipeline

CPU pipeline:

uv run python -m cgc.pipeline scripts/1ORTExZg.yaml --device cpu --no-fake-tts --no-fake-alignment

CUDA pipeline:

uv run python -m cgc.pipeline scripts/1ORTExZg.yaml --device cuda --no-fake-tts --no-fake-alignment

Quick benchmark (CPU vs CUDA):

uv run python scripts/benchmark.py scripts/1ORTExZg.yaml

On an RTX-class GPU we observed approximately:

CPU: 83.8s
CUDA: 38.8s

6.3 Choosing CPU vs GPU

The pipeline has a --device flag that controls where Kokoro TTS and WhisperX run:

cpu → run all models on the CPU
cuda → run all models on the GPU (requires a CUDA-enabled PyTorch build)

Examples:

# Force CPU (safe on any machine)
uv run python -m cgc.pipeline scripts/game.yaml --device cpu --no-fake-tts --no-fake-alignment

# Force GPU (fastest if torch.cuda.is_available() is True)
uv run python -m cgc.pipeline scripts/game.yaml --device cuda --no-fake-tts --no-fake-alignment

If you only care about real audio and are OK with fake alignment, just omit --no-fake-alignment in either command.

TTS

CGC uses the Kokoro-82M text-to-speech model by hexgrad (Apache-2.0). See THIRD_PARTY_NOTICES.md for details.

Offline mode

Once models and voices are cached, you can force a fully offline run with:

uv run python -m cgc.pipeline scripts/game.yaml --device cpu --no-fake-tts --no-fake-alignment --offline

Thank you for reading :)

Name		Name	Last commit message	Last commit date
Latest commit History 53 Commits
cgc		cgc
debug		debug
examples		examples
prompts		prompts
scripts		scripts
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
THIRD_PARTY_NOTICES.md		THIRD_PARTY_NOTICES.md
main.py		main.py
pyproject.toml		pyproject.toml
test.py		test.py
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CGC – Chess Game Content

CGC – Chess Game Content

1. First run

2. Requirements

3. Running the pipeline

3.1 Fast run (fake TTS, fake alignment)

3.2 Real TTS (Kokoro), fake alignment

3.3 Real TTS + real WhisperX alignment (CPU)

4. Story scripts (YAML)

5. Outputs

6. GPU acceleration (optional)

6.1 Torch with CUDA (via uv)

6.2 Running the GPU pipeline

6.3 Choosing CPU vs GPU

TTS

Offline mode

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CGC – Chess Game Content

CGC – Chess Game Content

1. First run

2. Requirements

3. Running the pipeline

3.1 Fast run (fake TTS, fake alignment)

3.2 Real TTS (Kokoro), fake alignment

3.3 Real TTS + real WhisperX alignment (CPU)

4. Story scripts (YAML)

5. Outputs

6. GPU acceleration (optional)

6.1 Torch with CUDA (via uv)

6.2 Running the GPU pipeline

6.3 Choosing CPU vs GPU

TTS

Offline mode

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages