Add LinearBatchEnsemble folding benchmark by tippered1-debug · Pull Request #23 · yandex-research/tabm

tippered1-debug · 2026-06-11T20:33:29Z

Adds a standalone benchmark for an inference-only folding of LinearBatchEnsemble.

The script materializes an equivalent LinearEnsemble by folding the BatchEnsemble scaling vectors into per-submodel weights:

folded.weight[k, i, o] = r[k, i] * weight[o, i] * s[k, o]

Before timing, it checks numerical equivalence with torch.testing.assert_close. Then it compares the original layer and the folded layer in eager and torch.compile modes.

This is kept outside the public API and does not change training behavior or the model implementation. The point is to make the latency/memory tradeoff measurable before considering any inference helper.

I ran:

.venv/bin/python -m py_compile benchmarks/benchmark_fold_linear_batchensemble.py
.venv/bin/python -m ruff check benchmarks/benchmark_fold_linear_batchensemble.py
.venv/bin/python -m ruff format --check benchmarks/benchmark_fold_linear_batchensemble.py
.venv/bin/python benchmarks/benchmark_fold_linear_batchensemble.py --device cpu --quick
.venv/bin/python benchmarks/benchmark_fold_linear_batchensemble.py --device mps --quick
.venv/bin/python benchmarks/benchmark_fold_linear_batchensemble.py --device all --quick --output results.json

On my local quick runs, folded eager was about 1.3x–1.7x faster on CPU and about 1.2x–2.6x faster on MPS for the tested shapes. Compiled timings were less stable, and I did not have CUDA available locally. The script also reports the extra parameter memory needed for materialized per-submodel weights.

Add LinearBatchEnsemble folding benchmark

672b64a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add LinearBatchEnsemble folding benchmark#23

Add LinearBatchEnsemble folding benchmark#23
tippered1-debug wants to merge 1 commit into
yandex-research:mainfrom
tippered1-debug:benchmark-fold-linear-batchensemble

tippered1-debug commented Jun 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

tippered1-debug commented Jun 11, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant