Structured output benchmarks comparing DSPy and BAML with different LLMs
Updated Dec 23, 2025 - Python
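For context on the comparison above, here is a minimal sketch of what a structured-output call looks like on the DSPy side. The model name, field names, and example text are illustrative assumptions, not taken from the benchmark repository; BAML expresses a comparable typed schema in its own .baml definition files with a generated client, which is what makes the two approaches natural to benchmark against each other.

```python
import dspy

# Assumed model/provider; the benchmarks compare several different LLMs.
lm = dspy.LM("openai/gpt-4o-mini")
dspy.configure(lm=lm)

class ExtractInvoice(dspy.Signature):
    """Pull typed fields out of free-form invoice text."""
    text: str = dspy.InputField()
    vendor: str = dspy.OutputField()
    total_usd: float = dspy.OutputField()

extract = dspy.Predict(ExtractInvoice)
result = extract(text="Invoice from Acme Corp. Amount due: $123.45.")
print(result.vendor, result.total_usd)  # structured fields, not raw text
```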
An Emacs mode for Ruby providing structured editing and evaluation operations.
A hands-on course repository for Evaluating AI Agents, created with Arize AI, that teaches you how to systematically evaluate, debug, and improve AI agents using observability tools, structured experiments, and reliable metrics. Learn production-grade techniques to enhance agent performance during development and after deployment.
A package designed to facilitate structured evaluation of a system's performance, comparing optimistic and pessimistic approaches. It takes a textual description or data snippet as input…
LLM Agent Engine