Live, open-source benchmark for comparing AI coding agents on real GitHub issues
benchmark machine-learning developer-tools awesome-list dev-tools live-data ai-research ai-benchmarks ai-engineering ai-tools auto-updated ai-evaluation llm-testing agent-evaluation swe-bench llm-benchmarks agent-eval ai-coding-agent-benchmark codex-vs-opencode coding-agent-benchmark
-
Updated
May 25, 2026 - Python