SolomonB14D3 / knowledge-fidelity Star 1 Code Issues Pull requests Behavioral auditing & repair toolkit for LLMs. Measures 8 dimensions via confidence probes. Rho-Surgery fixes miscalibration with contrastive LoRA + gamma protection. CatSAE for feature-level intervention. SVD compression with knowledge preservation. transformers pytorch svd interpretability confidence bias-detection truthfulness model-merging sycophancy llm-compression mergekit activation-engineering model-auditing steering-vectors rho-audit behavioral-evaluation Updated Mar 2, 2026 Python