llm-dit-experiments

Multi-pipeline LLM-DiT generation platform. LLM hidden states -> flow-matching DiT -> VAE decode. Single GPU (24GB).

Backend: PyTorch, FastAPI, TOML config. Frontend: React 19, Vite 7, Bun (web/frontend-v2/).

Pipelines

Pipeline	Task	Encoder	Notes
FLUX.2 Klein	text-to-image, image editing	Qwen3-8B/4B	Distilled, multi-layer extraction, LoRA support
Z-Image	text-to-image, img2img	Qwen3-4B	CFG=0 baked, 1504 token limit
LTX-2	text-to-video	Gemma3-12B	Pure PyTorch, FP8, persistent component caching
Qwen-Image-2512	text-to-image	Qwen2.5-VL-7B	39GB transformer, requires fp8 on 24GB
Qwen-Image-Edit-2511	image editing, multi-image	Qwen2.5-VL-7B	Multi-image composition, instruction editing

Quick Start

1. Backend

uv sync
cp config.toml.example config.toml   # edit model paths
uv run web/server.py --config config.toml

API on port 7860.

2. Frontend

cd web/frontend-v2
bun install
bun run dev

UI on http://localhost:5175. Vite proxies /api to the backend.

3. CLI (optional)

# Requires server running (step 1)
uv run scripts/gen.py flux2 --prompt "A photo of a cat" --seed 42

4. Batch Generation

Process a directory of images with the same prompt and model. Reads config.toml for server URL and default model.

# Basic -- uses config.toml defaults for server + model
uv run scripts/batch_flux2.py \
  --input-dir /path/to/images \
  --prompt "make this a watercolor painting"

# Override model, match output size to input
uv run scripts/batch_flux2.py \
  --input-dir /path/to/images \
  --output-dir /path/to/outputs \
  --prompt "transform this" \
  --model-name klein-9b-kv-fp8 \
  --match-image-size "0 (First Image)"

Supports resume -- interrupted runs skip already-completed images. Use --no-resume to regenerate all.

API

Endpoint	Method	Description
`/api/generate`	POST	Z-Image generation
`/api/flux2/generate`	POST	FLUX.2 generation
`/api/ltx2/generate/stream`	POST	LTX-2 video (streaming)
`/api/qwen-image/edit-layer`	POST	Single image editing
`/api/qwen-image/edit-multi`	POST	Multi-image composition
`/api/qwen-image-2512/generate`	POST	Qwen-Image T2I
`/api/models/{id}/load`	POST	Load pipeline
`/api/models/{id}/unload`	POST	Unload pipeline
`/api/loras`	GET	List LoRAs
`/api/config/session`	GET/PUT	Session config
`/api/rewrite`	POST	Prompt expansion
`/health`	GET	Health check

Experiments

Ablation sweeps and comparison tools in experiments/. See experiments/README.md.

Reference

Configuration -- TOML config, hardware profiles, HTTPS
CLI flags -- all command-line options
API endpoints -- request/response reference
Quantization -- methods, tradeoffs, backends
config.toml.example -- annotated example

Name		Name	Last commit message	Last commit date
Latest commit History 742 Commits
docs		docs
experiments		experiments
presets		presets
scripts		scripts
src/llm_dit		src/llm_dit
templates/z_image		templates/z_image
tests		tests
web		web
.claudeignore		.claudeignore
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
THIRD_PARTY_NOTICES.md		THIRD_PARTY_NOTICES.md
VISION.md		VISION.md
config.toml.example		config.toml.example
pyproject.toml		pyproject.toml
pytest.ini		pytest.ini
spec.md		spec.md
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

llm-dit-experiments

Pipelines

Quick Start

1. Backend

2. Frontend

3. CLI (optional)

4. Batch Generation

API

Experiments

Reference

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

llm-dit-experiments

Pipelines

Quick Start

1. Backend

2. Frontend

3. CLI (optional)

4. Batch Generation

API

Experiments

Reference

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages