refusal-calibration

Here is 1 public repository matching this topic...

YusufMalu001 / VeritasBench

Production-grade LLM evaluation framework measuring model behavior across 5 dimensions with human-vs-LLM judge agreement validation and Cohen's Kappa scoring

python natural-language-processing cohens-kappa huggingface streamlit human-evaluation instruction-following large-language-models rlhf llm-evaluation llm-benchmarking llm-as-judge behavioral-testing refusal-calibration

Updated Jun 8, 2026
Python

Improve this page

Add a description, image, and links to the refusal-calibration topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the refusal-calibration topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

refusal-calibration

Here is 1 public repository matching this topic...

YusufMalu001 / VeritasBench

Improve this page

Add this topic to your repo