stat-my-agent: benchmark consistency, tool use, failure recovery, and goal faithfulness; locally reproducible and shareable
Updated Mar 22, 2026 - Python