Skip to content

feat: add eval assertion types#8

Draft
luohui1 wants to merge 1 commit into
RPSingh1990:mainfrom
luohui1:eval/assertion-types
Draft

feat: add eval assertion types#8
luohui1 wants to merge 1 commit into
RPSingh1990:mainfrom
luohui1:eval/assertion-types

Conversation

@luohui1

@luohui1 luohui1 commented Jun 14, 2026

Copy link
Copy Markdown

Summary

  • add forbidden_regex for patterns that must not appear in saved output
  • add required_json_key for required keys in JSON-shaped saved output
  • extend the eval runner self-test and document both assertion types

Fixes #6

Validation

  • python scripts/run_evals.py
  • python tests/run_evals_selftest.py
  • python scripts/validate_public_repo.py
  • python scripts/agent_ops_validate.py --strict
  • python -m compileall scripts tests
  • git diff --check

Local note

  • tests/init_selftest.py, tests/agent_ops_validate_selftest.py, and tests/agent_ops_guard_selftest.py still fail in this Windows environment because they invoke the unavailable python3 launcher internally.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Add more deterministic eval assertion types

1 participant