KarmaEnchanter

KarmaEnchanter

Achievements

mental-health-llm-eval mental-health-llm-eval Public

Open evaluation harness for mental health LLM responses. 5 clinically-grounded rubrics, LLM-as-judge with bias controls, crisis-detection routing to 988 protocols.

Python
inspect_evals inspect_evals Public

Forked from UKGovernmentBEIS/inspect_evals

Collection of evals for Inspect AI

Python
awesome-ai-eval awesome-ai-eval Public

Forked from Vvkmnn/awesome-ai-eval

☑️ A curated list of tools, methods & platforms for evaluating AI reliability in real applications