brier

Here is 1 public repository matching this topic...

Tajaddin / conf-calib

Confidence calibration toolkit for LLM verbalized-probability outputs. Real benchmark on 998 BoolQ questions with Llama-3.1-8B: ECE 0.148 -> 0.030, log-loss 3.9 -> 0.41.

calibration ece isotonic-regression platt-scaling groq temperature-scaling llm anthropic brier boolq

Updated May 12, 2026
Python

Improve this page

Add a description, image, and links to the brier topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the brier topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

brier

Here is 1 public repository matching this topic...

Tajaddin / conf-calib

Improve this page

Add this topic to your repo