Deterministic synthetic two-party conversation corpus generator for testing AI scoring systems.
-
Updated
May 27, 2026 - Python
Deterministic synthetic two-party conversation corpus generator for testing AI scoring systems.
Go toolkit + library: structured adversarial corpora for LLM/RAG safety + quality testing. Prompt injection, KB exfiltration, jailbreak, system-prompt probing. CI/CD-ready.
Deterministic offline corpus compiler for Unicode, tokenizer, and parser stress testing
Generation pipeline: 19 LLMs rewrite 16 classical fairy tales (Master's research corpus)
Add a description, image, and links to the corpus-generation topic page so that developers can more easily learn about it.
To associate your repository with the corpus-generation topic, visit your repo's landing page and select "manage topics."