Benchmark Script for local models
Summary
Add scripts/benchmark.py to compare local HF models on a fixed diff corpus.
Scope
- Inputs: a small corpus of representative diffs (JSONL).
- Outputs: CSV with columns model, tokens_in, tokens_out, latency_ms, provider_device.
- Optional: save a plot to docs/benchmarks/.
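
As a rough illustration of the scope above, the core loop of the script might look like the sketch below. It is not the final scripts/benchmark.py: the "diff" JSONL field name, the `load_diffs` and `run_benchmark` helpers, and the `fake_model` stand-in are all hypothetical, and a real run would replace the stand-in with a call into a local HF model and its tokenizer. Only the CSV column names come from this issue.

```python
import csv
import io
import json
import time
from typing import Callable, Dict, Iterable, List, TextIO, Tuple

# Column names taken from the Scope section of this issue.
FIELDS = ["model", "tokens_in", "tokens_out", "latency_ms", "provider_device"]

# A "model" here is any callable that takes a diff and returns
# (tokens_in, tokens_out). This interface is an assumption for the sketch.
GenerateFn = Callable[[str], Tuple[int, int]]


def load_diffs(lines: Iterable[str], key: str = "diff") -> List[str]:
    """Parse JSONL lines, skipping blanks. The "diff" key is an assumption."""
    return [json.loads(line)[key] for line in lines if line.strip()]


def run_benchmark(
    models: Dict[str, GenerateFn],
    diffs: List[str],
    device: str,
    out: TextIO,
) -> None:
    """Run every model on every diff and write one CSV row per pair."""
    writer = csv.DictWriter(out, fieldnames=FIELDS)
    writer.writeheader()
    for name, generate in models.items():
        for diff in diffs:
            start = time.perf_counter()
            tokens_in, tokens_out = generate(diff)
            latency_ms = (time.perf_counter() - start) * 1000.0
            writer.writerow(
                {
                    "model": name,
                    "tokens_in": tokens_in,
                    "tokens_out": tokens_out,
                    "latency_ms": round(latency_ms, 3),
                    "provider_device": device,
                }
            )


def fake_model(diff: str) -> Tuple[int, int]:
    """Hypothetical stand-in: counts whitespace-separated tokens, emits 3."""
    return len(diff.split()), 3


if __name__ == "__main__":
    corpus = ['{"diff": "- old line\\n+ new line"}']
    buffer = io.StringIO()
    run_benchmark({"stub": fake_model}, load_diffs(corpus), "cpu", buffer)
    print(buffer.getvalue())
```

Keeping the model behind a plain callable means the timing and CSV logic can be tested without loading any weights; the same loop would then accept real model wrappers unchanged.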
Tasks
Acceptance criteria