Releases: Corbell-AI/evalmonkey
Releases · Corbell-AI/evalmonkey
v1.4.0
Compare
Sorry, something went wrong.
No results found
What's Changed
feat: regression guard, agent card, recommend command, external datasets by @himmi-01 in #18 Like :
Full Changelog : v1.3.0...v1.4.0
v1.3.0
Compare
Sorry, something went wrong.
No results found
What's Changed
feat: regression guard, agent card, recommend command, external datasets by @himmi-01 in #14
Full Changelog : v1.2.0...v1.3.0
v1.2.0
Compare
Sorry, something went wrong.
No results found
What's Changed
Add voice agents support into EvalMonkey by @himmi-01 in #13
Full Changelog : v1.1.1...v1.2.0
v1.1.1
Compare
Sorry, something went wrong.
No results found
What's Changed
feat: add lightweight coding agent sample app with chaos profiles by @himmi-01 in #9
fix: stop test suite hanging + add coding agent demo script by @himmi-01 in #10
docs: add real-world benchmark leaderboard for 10 open-source agents by @himmi-01 in #11
Full Changelog : v1.1.0...v1.1.1
v1.1.0
Compare
Sorry, something went wrong.
No results found
What's Changed
Full Changelog : v1.0.2...v1.1.0
v1.0.2
Compare
Sorry, something went wrong.
No results found
What's Changed
docs: add web dashboard screenshots to README by @himmi-01 in #5
feat: add evalmonkey generate-ci command for easy GitHub Actions setup by @himmi-01 in #6
Full Changelog : v1.0.1...v1.0.2
v1.0.1
Compare
Sorry, something went wrong.
No results found
What's Changed
feat: add framework adapters for LangGraph, LlamaIndex, and PydanticAI by @himmi-01 in #4
Full Changelog : v1.0.0...v1.0.1
v1.0.0
Compare
Sorry, something went wrong.
No results found
v0.1.3
Compare
Sorry, something went wrong.
No results found
implement automated eval asset generation and improvement prompts for failed benchmark traces ea99606
optimize benchmark loading by enabling streaming mode and add testing/inspection utilities 88a8fb5
strip markdown code fences from LLM judge responses 46be866
remove webarena benchmark 179d5ab
v0.1.2
Compare
Sorry, something went wrong.
No results found