test: add integ tests for batch evaluation and evaluator lifecycle#1110
test: add integ tests for batch evaluation and evaluator lifecycle#1110
Conversation
Adds 20 integration tests covering: - batch-evaluation CLI flag validation (requires --runtime, --evaluator) - run eval CLI flag validation - Evaluator lifecycle: SESSION, TRACE, TOOL_CALL levels, code-based (external lambda + managed), config file import - Online eval config lifecycle: builtin evaluator refs, enable-on-create, multiple evaluators, sampling rate - Evaluator validation edge cases: invalid level, --model with code-based, --lambda-arn without code-based - Ground truth file parsing: malformed JSON, wrong structure, valid array format, valid object format with sessionMetadata key Originally from private-agentcore-cli-staging PR #104.
Coverage Report
|
|
Reviewed the new integ test file. There are a few issues that I think will make several tests fail (or silently pass without testing anything) on 1.
|
The code-based evaluator restriction in OnlineEvalConfigPrimitive was already removed, so this test expected a rejection that no longer occurs.
eaca03a to
62f9d80
Compare
|
addressed comments |
Summary
Adds 20 integration tests covering batch evaluation, evaluator, and online eval config CLI lifecycle:
--runtimerequired,--evaluatorrequired--evaluatorrequired--modelwith code-based,--lambda-arnwithout code-basedOriginally from private-agentcore-cli-staging PR #104. Only the test file is needed — the code changes from that PR (remove code-based evaluator blocks, add test utils) are already on preview.
Test plan
npm run test:integpasses (20 new tests)