Circuit Breaker Labs Command-Line Interface

Recorded with vhs💙

Quickstart

Copy the installation command for your preferred shell from the releases page.
Verify cbl installed correctly with cbl help
Set the Circuit Breaker Labs API key environment variable:
- macOS/Linux: export CBL_API_KEY="<your_api_key_here>"
- Windows (PowerShell): $env:CBL_API_KEY="<your_api_key_here>"
Set the OpenAI API key environment variable (required for this example):
- macOS/Linux: export OPENAI_API_KEY="<your_api_key_here>"
- Windows (PowerShell): $env:OPENAI_API_KEY="<your_api_key_here>"

Try a single-turn evaluation:

cbl single-turn \
    --threshold 0.75 \
    --variations 2 \
    --maximum-iteration-layers 2 \
    openai --model gpt-4.1-nano

Try a multi-turn evaluation:

cbl multi-turn \
    --threshold 0.95 \
    --max-turns 8 \
    --test-types user_persona,semantic_chunks \
    openai --model gpt-4.1-nano

Installation

Pre-built executables and installation methods for Linux, Mac, and Windows are automatically generated and available with each release.

Click here to get an access key.

Usage

Flags and Options

You can see the available options and flags for cbl with cbl help or for a subcommand with cbl <subcommand> help.

Syntax

The syntax for cbl is:

cbl --top-level-arg1 <evaluation_type> --evaluation-arg1 <provider> --provider-arg1

where <evaluation_type> and <provider> are subcommands.

The available evaluation types are single-turn and multi-turn. The available providers are ollama, openai, and custom.

Example

The following would run a single-turn evaluation against a custom OpenAI finetune, and save the results to result.json. If you haven't already, set the CBL_API_KEY and OPENAI_API_KEY environment variables.

cbl \
    --output-file result.json \
    single-turn \  # evaluation type
    --threshold 0.3 \
    --variations 3 \
    --maximum-iteration-layers 2 \
    openai \       # provider
    --temperature 1.2 \
    --model $MY_FINETUNE_ID

Integrating With a Custom Model Endpoint

For APIs that aren't already supported or OpenAI compatible, cbl supports scripting. The custom provider expects a Rhai script that defines the translation between cbl's and the custom endpoint's request/response schema. Examples scripts are available in examples/providers/.

Configuration Reference

Safety Threshold

The --threshold flag accepts a value from 0 to 1. Values closer to 1 require responses that align more closely with clinical-grade safety standards. Lower values allow more permissive responses.

Maximum Turns

--max-turns must be an even number because each turn pairs one user message with one model response. The upper limit depends on your system configuration. Contact us if you need a higher limit for your environment.

Test Packs

CBL offers specialized test packs for different domains and risk scenarios. All versions include a default pack focused on suicidal ideation safety evaluation.

Testing Types

User Persona (user_persona) — Simulates realistic user behavior and conversation styles, representing how real people interact with conversational AI.

Jailbreak (semantic_chunks) — Deliberate, persistent attempts to push or bypass model guardrails, exposing unsafe responses, policy failures, or hidden vulnerabilities.

Questions? Feedback? Reach us at team@circuitbreakerlabs.ai.

Name		Name	Last commit message	Last commit date
Latest commit History 68 Commits
.github/workflows		.github/workflows
assets		assets
examples/providers		examples/providers
src		src
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE.md		LICENSE.md
README.md		README.md
THIRD_PARTY_NOTICES.md		THIRD_PARTY_NOTICES.md
build.rs		build.rs
dist-workspace.toml		dist-workspace.toml
flake.lock		flake.lock
flake.nix		flake.nix

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Circuit Breaker Labs Command-Line Interface

Quickstart

Installation

Usage

Flags and Options

Syntax

Example

Integrating With a Custom Model Endpoint

Configuration Reference

Safety Threshold

Maximum Turns

Test Packs

Testing Types

About

Uh oh!

Releases 3

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Circuit Breaker Labs Command-Line Interface

Quickstart

Installation

Usage

Flags and Options

Syntax

Example

Integrating With a Custom Model Endpoint

Configuration Reference

Safety Threshold

Maximum Turns

Test Packs

Testing Types

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages