AI Red Team Operations Console
Updated Jan 29, 2026 - TypeScript
Behavioral auditing & repair toolkit for LLMs. Measures 8 dimensions via confidence probes. Rho-Surgery fixes miscalibration with contrastive LoRA + gamma protection. CatSAE for feature-level intervention. SVD compression with knowledge preservation.
Reproducible pipeline for silent-failure auditing in ECG accept-sets (MIT-BIH) with Newton–Puiseux onset scoring