TRACE: HMM-Guided Text Generation

Tractable Reasoning for Adaptable Controllable gEneration

TRACE is a method for controllable text generation that uses Hidden Markov Models (HMMs) to guide language models away from unwanted attributes (like toxicity) while maintaining fluency and diversity.

🚀 Quick Start

1. Setup API Key

Get a Google Perspective API key and configure it:

Recommended: Edit environment.yml (or environment_cpu.yml), replace 'your_key_here' with your actual key.

Alternative: Set environment variable (temporary):

export PERSPECTIVE_API_KEY="your_key_here"

2. Setup Environment

Recommended: Use environment.yml (auto-detects CUDA version):

conda env create -f environment.yml
conda activate trace

If you encounter CUDA issues: Use CPU-only environment:

conda env create -f environment_cpu.yml
conda activate trace

The GPU environment automatically detects and installs the correct PyTorch version for your CUDA installation (supports CUDA 11.8+ and 12.x).

3. Download Data and Models

Download all required data files at once:

# 1. Download pre-trained HMM model (~850MB)
python -c "
from huggingface_hub import snapshot_download
snapshot_download(repo_id='gwenweng/hmm-gpt2-large', local_dir='models/hmm_gpt2-large_uncon_seq-len-32_4096_10M')
"

# 2. Download RTP training data (~6.6MB) - for custom classifier training
cd data/
wget https://github.com/yidouweng/trace/releases/download/v1.0.0/RTP_train.jsonl.tar.gz
tar -xzf RTP_train.jsonl.tar.gz

# 3. Download RTP test data (~6.6MB) - for large-scale evaluation  
wget https://github.com/yidouweng/trace/releases/download/v1.0.0/RTP_test.jsonl.tar.gz
tar -xzf RTP_test.jsonl.tar.gz
cd ..

What you just downloaded:

HMM Model: Pre-trained Hidden Markov Model for toxicity control
RTP Train: 100k prompts for training custom classifiers (optional)
RTP Test: 10k prompts for large-scale evaluation (optional)
Demo prompts: Already included in data/prompts.jsonl (12 examples)

4. HMM Distillation (Optional)

The pre-trained HMM downloaded above is ready to use. If you want to train your own HMM (e.g., for a different base LM, or with a different model family like second-order HMMs), the distillation pipeline is available at Ctrl-G/distillation.

Our pre-trained HMM was distilled from GPT2-Large using the following setup:

Training data: 10M sequences sampled unconditionally from the base LM (BOS token only, no prompt), using nucleus sampling (top-p = 0.9, temperature = 1.0) at a fixed length of 32 tokens
Hidden state size: h = 4096 (h = 256 for the lightweight personalization experiments)
Training: Mini-batch EM with batch size 4096, 50 epochs, step size linearly annealed from 1.0 to 0.0

These settings can be freely adapted. For example, you could distill from a different base LM, use longer sequences, or experiment with different hidden state sizes depending on your compute budget and use case.

5. Run Tutorial

🎯 Start here: Open and run tutorial.ipynb for a complete interactive walkthrough!

Starting Jupyter Notebook

# Make sure you're in the trace environment
conda activate trace

# Option A: Jupyter Lab (recommended)
jupyter lab

# Option B: Classic Jupyter Notebook  
jupyter notebook

Important:

Always activate trace environment first - the base environment lacks required packages
In your editor: Select the trace environment as the Python interpreter for the notebook
Kernel issues: If notebook shows wrong kernel, click the kernel selector (top right) and choose trace

The tutorial demonstrates:

Environment setup and verification
Text generation with TRACE vs baseline comparison
Toxicity, fluency, and diversity evaluation
Analysis of where TRACE successfully reduces toxicity

📁 Repository Structure

trace/
├── tutorial.ipynb          # 🎯 START HERE - Interactive tutorial
├── src/
│   ├── generate.py         # Text generation script
│   ├── score.py            # Evaluation metrics  
│   ├── fit.py              # Train custom classifiers
│   ├── score_attribute.py  # Score custom attributes with zero-shot
│   └── ...                 # Core implementation
├── data/
│   ├── prompts.jsonl       # Demo prompts (12 examples)
│   ├── coefficients.csv    # Pre-trained toxicity classifier  
│   ├── RTP_train.jsonl     # Training data (100k prompts)
│   └── RTP_test.jsonl      # Test data (10k prompts)
├── models/                 # Pre-trained HMM model
├── environment.yml         # GPU environment
└── environment_cpu.yml     # CPU environment

🔬 Advanced Usage

Custom Generation

python src/generate.py \
    --hmm_model_path models/hmm_gpt2-large_uncon_seq-len-32_4096_10M \
    --prompts_path data/prompts.jsonl \
    --a 1.0 --max_len 20 --num_generations 3

Large-Scale Evaluation

Now that you have the RTP test dataset (10k prompts), you can run comprehensive evaluation:

# Generate text for all 10k test prompts
python src/generate.py --prompts_path data/RTP_test.jsonl

# Score the generated text for toxicity, fluency, and diversity
python src/score.py

This will take significantly longer than the 12-prompt demo, but provides robust statistical evaluation.

Train Custom Classifiers

TRACE can control any attribute, not just toxicity! With the RTP training data (100k prompts) you downloaded, you can train classifiers for any attribute:

Option 1: Score RTP Data for Custom Attribute

# Example: Train a "politics" classifier
# 1. Score training data for your attribute (just provide keyword!)
python src/score_attribute.py --attribute politics

# 2. Train classifier  
python src/fit.py --data_path data/RTP_train_politics.jsonl --attribute politics

# 3. Use in generation
python src/generate.py --weights_path data/coefficients_politics.csv --a 1.0

Other example attributes: sports, emotion, formality, sentiment, entertainment

Option 2: Use Your Own Dataset

# Prepare your data in the same format as RTP_train.jsonl:
# {"prompt": {"text": "...", "your_attribute": 0.8}, "continuation": {"text": "...", "your_attribute": 0.2}}

python src/fit.py --data_path your_custom_data.jsonl --attribute your_attribute

🛠️ Troubleshooting

Environment Setup Issues

CUDA/PyTorch Import Errors

# Error: "undefined symbol: cudaLaunchKernelExC" or PyTorch import fails
# This indicates CUDA version mismatch

Solution: The environment automatically detects and installs the correct CUDA version. If you encounter CUDA issues:

Check your CUDA version:

nvidia-smi  # Look for "CUDA Version: X.X" in the output

Recreate environment (this will auto-detect your CUDA version):

conda deactivate
conda env remove -n trace -y
conda env create -f environment.yml
conda activate trace

Test installation:

python -c "import torch; print(f'PyTorch: {torch.__version__}'); print(f'CUDA available: {torch.cuda.is_available()}')"

Still having issues? Use CPU-only environment:

conda env create -f environment_cpu.yml
conda activate trace

Environment Detection

CUDA 11.8+: Use environment.yml (recommended)
CUDA 12.x: Use environment.yml (auto-detects)
No CUDA/CPU only: Use environment_cpu.yml
Unsure?: Try environment.yml first, fallback to environment_cpu.yml

Jupyter Notebook Issues

Can't open notebook in VS Code/Cursor?

Activate environment: conda activate trace
Start Jupyter server: jupyter lab --no-browser --port=8888
In editor: Select trace Python interpreter
Connect: Point editor to http://localhost:8888

Wrong kernel/environment in notebook?

Click the kernel selector (top-right of notebook)
Choose Python 3 (ipykernel) from trace environment
If not listed: conda activate trace && python -m ipykernel install --user --name=trace

"Module not found" errors?

Check you're in trace environment: echo $CONDA_DEFAULT_ENV
If showing base: conda activate trace then restart Jupyter

Other Issues

Having more issues? Check our comprehensive FAQ.md for solutions to:

Environment setup problems
Scoring issues (0.0/NA results)
CUDA/memory errors
API key configuration
Performance optimization

📈 Expected Results

With default settings, TRACE typically achieves:

70%+ toxicity reduction vs baseline
Minimal fluency impact (<10% perplexity change)
Maintained diversity (>85% distinct-2)

📜 Citation

@inproceedings{yidou-weng2025trace,
  title={TRACE Back from the Future: A Probabilistic Reasoning Approach to Controllable Language Generation},
  author={Weng-Yidou, Gwen and Wang, Benjie and Van den Broeck, Guy},
  booktitle={Proceedings of the 42nd International Conference on Machine Learning (ICML)},
  year={2025}
}

📄 License

This project is licensed under the MIT License.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TRACE: HMM-Guided Text Generation

🚀 Quick Start

1. Setup API Key

2. Setup Environment

3. Download Data and Models

4. HMM Distillation (Optional)

5. Run Tutorial

Starting Jupyter Notebook

Important:

📁 Repository Structure

🔬 Advanced Usage

Custom Generation

Large-Scale Evaluation

Train Custom Classifiers

Option 1: Score RTP Data for Custom Attribute

Option 2: Use Your Own Dataset

🛠️ Troubleshooting

Environment Setup Issues

Jupyter Notebook Issues

Other Issues

📈 Expected Results

📜 Citation

📄 License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
data		data
src		src
.gitignore		.gitignore
FAQ.md		FAQ.md
README.md		README.md
environment.yml		environment.yml
environment_cpu.yml		environment_cpu.yml
tutorial.ipynb		tutorial.ipynb

Folders and files

Latest commit

History

Repository files navigation

TRACE: HMM-Guided Text Generation

🚀 Quick Start

1. Setup API Key

2. Setup Environment

3. Download Data and Models

4. HMM Distillation (Optional)

5. Run Tutorial

Starting Jupyter Notebook

Important:

📁 Repository Structure

🔬 Advanced Usage

Custom Generation

Large-Scale Evaluation

Train Custom Classifiers

Option 1: Score RTP Data for Custom Attribute

Option 2: Use Your Own Dataset

🛠️ Troubleshooting

Environment Setup Issues

Jupyter Notebook Issues

Other Issues

📈 Expected Results

📜 Citation

📄 License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages