agent-vm

Run AI coding agents inside sandboxed Linux VMs. The agent gets full autonomy while your host system stays safe.

Uses Lima to create lightweight Debian VMs on macOS and Linux. Each VM ships with dev tools, Docker, and a headless Chromium browser with Chrome DevTools MCP pre-configured.

Supports Claude Code, OpenCode, and Codex CLI out of the box. Other agents can be run via agent-vm shell.

Never install attack vectors such as npm, claude or even Docker on your host machine again!

Feedback welcome!

Prerequisites

macOS or Linux
Lima (installed automatically via Homebrew if available)
A subscription or API key for your agent of choice

Install

git clone https://github.com/sylvinus/agent-vm.git
cd agent-vm

# Add to your shell config (run only the line that matches your shell)
echo "source \"$(pwd)/agent-vm.sh\"" >> ~/.zshrc   # zsh
echo "source \"$(pwd)/agent-vm.sh\"" >> ~/.bashrc  # bash
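
Re-running the echo line above appends a duplicate entry to your shell config each time. A minimal sketch of an idempotent alternative, assuming a hypothetical helper named add_source_line (it is not part of agent-vm):

```shell
# add_source_line is a hypothetical helper, not shipped with agent-vm.
# It appends the source line to a shell config file only if it is missing.
add_source_line() (
  script_path=$1   # absolute path to agent-vm.sh
  rc_file=$2       # e.g. ~/.zshrc or ~/.bashrc
  line="source \"$script_path\""
  touch "$rc_file"
  # -x matches whole lines only, -F treats the path as a literal string
  if ! grep -qxF -- "$line" "$rc_file"; then
    printf '%s\n' "$line" >> "$rc_file"
  fi
)

# Usage, from inside the cloned agent-vm directory:
#   add_source_line "$(pwd)/agent-vm.sh" ~/.zshrc
```

Quoting the path inside the written line keeps the entry valid even if the clone lives in a directory with spaces in its name.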

Usage

One-time setup

agent-vm setup

Creates a base VM template with dev tools, Docker, Chromium, and AI coding agents pre-installed.

Options:

Flag	Description	Default
`--disk GB`	VM disk size in GB	20
`--memory GB`	VM memory in GB	8
`--cpus N`	Number of CPUs	4

agent-vm setup --disk 50 --memory 16 --cpus 8   # Larger VM for heavy workloads
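
The defaults in the table combine with any flags you pass roughly as shown here. This is only a sketch: resolve_setup_flags is a hypothetical function, and the real parsing in agent-vm.sh may differ.

```shell
# resolve_setup_flags is a hypothetical helper that mirrors the documented
# defaults (disk 20 GB, memory 8 GB, 4 CPUs) and prints the resolved values.
resolve_setup_flags() (
  disk=20; memory=8; cpus=4
  while [ $# -gt 0 ]; do
    case $1 in
      --disk|--memory|--cpus)
        # Every flag takes exactly one value
        [ $# -ge 2 ] || { echo "missing value for $1" >&2; exit 1; }
        case $1 in
          --disk)   disk=$2 ;;
          --memory) memory=$2 ;;
          --cpus)   cpus=$2 ;;
        esac
        shift 2 ;;
      *)
        echo "unknown flag: $1" >&2; exit 1 ;;
    esac
  done
  echo "disk=${disk}GB memory=${memory}GB cpus=${cpus}"
)

# resolve_setup_flags --disk 50
# prints: disk=50GB memory=8GB cpus=4
```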

Run an agent in a VM

cd your-project
agent-vm claude                # Claude Code
agent-vm opencode              # OpenCode
agent-vm codex                 # Codex CLI

Creates a persistent VM for the current directory (or reuses it if one already exists), mounts your working directory, and runs the agent with full permissions. The VM persists after the agent exits so you can reconnect later. Ports opened inside the VM (e.g. by Docker containers or dev servers) are automatically forwarded to your host by Lima.

Each agent runs with its respective auto-approve flag:

claude runs with --dangerously-skip-permissions
opencode does not yet have an auto-approve flag (waiting on this PR)
codex runs with --full-auto

Any extra arguments are forwarded to the agent command:

agent-vm claude -p "fix all lint errors"        # Run with a prompt
agent-vm claude --resume                         # Resume previous session
agent-vm opencode -p "refactor auth module"      # OpenCode with a prompt
agent-vm codex -q "explain this codebase"        # Codex with a query
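
The behaviour described above (prepend the agent's auto-approve flag, then forward everything else untouched) can be sketched as follows. agent_argv is a hypothetical name and this is not the actual code in agent-vm.sh. It prints one argument per line so you can see that quoted arguments stay intact.

```shell
# agent_argv is a hypothetical helper, not agent-vm's implementation.
# It builds the argument list an agent would be launched with.
agent_argv() (
  agent=$1
  shift
  case $agent in
    claude)   set -- claude --dangerously-skip-permissions "$@" ;;
    codex)    set -- codex --full-auto "$@" ;;
    opencode) set -- opencode "$@" ;;   # no auto-approve flag yet
    *) echo "unsupported agent: $agent" >&2; exit 1 ;;
  esac
  # "$@" keeps each forwarded argument as a single word
  printf '%s\n' "$@"
)

# agent_argv claude -p "fix all lint errors"
# prints four lines: claude, --dangerously-skip-permissions, -p, fix all lint errors
```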

Shell access and running commands

agent-vm shell                         # Open a zsh shell in the VM
agent-vm run npm install               # Run a one-off command in the VM
agent-vm run docker compose up -d      # Start services

VM lifecycle

Each directory gets its own persistent VM. You can manage it with:

agent-vm status      # Show VM status for the current directory
agent-vm stop        # Stop the VM (can be restarted later)
agent-vm destroy     # Stop and permanently delete the VM
agent-vm destroy-all # Stop and delete all agent-vm VMs

To resize an existing VM's disk, memory, or CPU count, pass --disk, --memory, or --cpus again. The VM will be stopped, reconfigured, and restarted automatically:

agent-vm --disk 50 claude              # Grow disk to 50GB, then run Claude
agent-vm --memory 16 --cpus 8 shell    # Increase memory and CPUs, then open shell

Note: disk can only be grown, not shrunk.
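
If you script resizes, you may want to catch a shrink request before it reaches agent-vm. A minimal sketch of the grow-only rule, assuming you already know the current disk size (check_disk_resize is a hypothetical helper, not an agent-vm command):

```shell
# check_disk_resize is a hypothetical helper illustrating the grow-only rule.
# Both arguments are whole numbers of GB.
check_disk_resize() (
  current_gb=$1
  requested_gb=$2
  if [ "$requested_gb" -lt "$current_gb" ]; then
    echo "cannot shrink disk from ${current_gb}GB to ${requested_gb}GB" >&2
    exit 1
  fi
  echo "ok: ${current_gb}GB -> ${requested_gb}GB"
)

# check_disk_resize 20 50 && agent-vm --disk 50 claude
```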

Running agent-vm setup again updates the base template but does not update existing VMs. You'll see a warning when using a VM cloned from an older base. Use --reset to re-clone:

agent-vm --reset claude                # Destroy and re-clone VM, then run Claude

Offline mode and read-only mounts

agent-vm --offline claude              # Block outbound internet access
agent-vm --readonly shell              # Mount project directory as read-only
agent-vm --offline --readonly claude   # Both

--offline blocks outbound internet from the VM using iptables while preserving host/VM communication (mounts, port forwarding). Useful for ensuring agents don't phone home or download unexpected packages.

--readonly remounts the project directory as read-only. Useful for code review or audit tasks where the agent shouldn't modify files. Both flags are per-session and reset when the VM restarts.
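
To give an idea of what an iptables-based block can look like, here is a sketch that only prints a possible rule set. It is not the rule set agent-vm actually applies. The default subnet is an assumption based on Lima's default user-mode network (192.168.5.0/24); adjust it if your setup differs.

```shell
# offline_rules is a hypothetical helper. It prints iptables commands and
# does not run them. The default subnet is an assumption (Lima's default
# user-mode network), not something taken from agent-vm.
offline_rules() (
  host_net=${1:-192.168.5.0/24}
  printf '%s\n' \
    "iptables -A OUTPUT -o lo -j ACCEPT" \
    "iptables -A OUTPUT -m conntrack --ctstate ESTABLISHED,RELATED -j ACCEPT" \
    "iptables -A OUTPUT -d $host_net -j ACCEPT" \
    "iptables -A OUTPUT -j REJECT"
)

# Review the rules before applying anything:
#   offline_rules
```

The order matters: loopback, replies to connections the host opened (mounts, port forwarding), and traffic to the host network are accepted first, and the final rule rejects every other outbound packet.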

Customization

Per-user setup: `~/.agent-vm/setup.sh`

Create this file to install extra tools into the base VM template. It runs once during agent-vm setup, as the default VM user (with sudo available):

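# ~/.agent-vm/setup.sh
sudo apt-get install -y postgresql-client
# If pip refuses a system-wide install on Debian (PEP 668), use a venv or the python3-* apt packages instead
pip install pandas numpy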

Per-user runtime: `~/.agent-vm/runtime.sh`

Create this file to run commands inside every VM on each start. Runs before the per-project runtime script:

# ~/.agent-vm/runtime.sh
export MY_API_KEY="..."

Per-project: `.agent-vm.runtime.sh`

Create this file at the root of any project. It runs inside the VM each time a new VM is created for the project, just before you get access. Use it for project-specific setup like installing dependencies or starting services:

# your-project/.agent-vm.runtime.sh
npm install
docker compose up -d

MCP servers

The base VM comes with Chrome DevTools MCP pre-configured for Claude, giving the agent headless browser access.

To add more MCP servers, write them to ~/.claude.json from your ~/.agent-vm/setup.sh, or edit the file directly inside a VM via agent-vm shell. New entries go in the mcpServers object:

{
  "mcpServers": {
    "chrome-devtools": {
      "command": "npx",
      "args": ["-y", "chrome-devtools-mcp@latest", "--headless=true", "--isolated=true"]
    },
    "postgres": {
      "command": "npx",
      "args": ["-y", "@modelcontextprotocol/server-postgres", "postgresql://localhost:5432/mydb"]
    }
  }
}
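
Since jq is installed in the VM by default, one way to add an entry from a script is to merge it into the existing file rather than overwrite it. A minimal sketch, assuming a hypothetical helper named add_mcp_server that takes the server's args as a JSON array string:

```shell
# add_mcp_server is a hypothetical helper, not part of agent-vm. Requires jq.
# Usage: add_mcp_server CONFIG_FILE NAME COMMAND ARGS_JSON
add_mcp_server() (
  config=$1; name=$2; cmd=$3; args_json=$4
  # Start from an empty object if the config file does not exist yet
  [ -f "$config" ] || echo '{}' > "$config"
  tmp=$(mktemp) || exit 1
  # Write to a temp file first so a jq error never truncates the config
  if jq --arg name "$name" --arg cmd "$cmd" --argjson argv "$args_json" \
        '.mcpServers[$name] = {command: $cmd, args: $argv}' "$config" > "$tmp"; then
    mv "$tmp" "$config"
  else
    rm -f "$tmp"
    exit 1
  fi
)

# add_mcp_server ~/.claude.json postgres npx \
#   '["-y", "@modelcontextprotocol/server-postgres", "postgresql://localhost:5432/mydb"]'
```

Existing entries such as chrome-devtools are left in place; an entry with the same name is replaced.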

How it works

agent-vm setup creates a Debian 13 VM with Lima, runs agent-vm.setup.sh inside it to install dev tools, Chromium, and the agents, then stops it so it can serve as a reusable base template
agent-vm claude|opencode|codex [args] clones the base template into a persistent per-directory VM, mounts your working directory, runs optional runtime scripts (~/.agent-vm/runtime.sh then .agent-vm.runtime.sh), then launches the agent with full permissions
The VM persists after exit. Running any agent command or agent-vm shell in the same directory reuses the same VM
Use agent-vm stop to stop the VM or agent-vm destroy to delete it
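
The ordering of the two optional runtime scripts in the steps above can be sketched like this. run_runtime_scripts is a hypothetical name, and whether agent-vm sources or executes the scripts is an assumption made here so that exported variables stay visible.

```shell
# run_runtime_scripts is a hypothetical helper showing the documented order:
# the per-user script first, then the per-project script.
# Sourcing (rather than executing) the scripts is an assumption.
run_runtime_scripts() {
  user_home=$1     # directory that contains .agent-vm/runtime.sh
  project_dir=$2   # directory that contains .agent-vm.runtime.sh
  for script in "$user_home/.agent-vm/runtime.sh" "$project_dir/.agent-vm.runtime.sh"; do
    # Both scripts are optional, so missing files are skipped
    if [ -f "$script" ]; then
      . "$script"
    fi
  done
}

# run_runtime_scripts "$HOME" "$(pwd)"
```

Because the per-project script runs second, it can override anything the per-user script sets.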

Each VM is fully isolated — agents must authenticate independently inside their VM (e.g. claude login). Credentials persist within the VM across restarts but are not shared between VMs or with the host.

Project structure

File	Description
`agent-vm.sh`	Main script — source this in your shell config
`agent-vm.setup.sh`	Package installation script that runs inside the base VM during setup

What's in the VM by default

Category	Packages
Core	git, curl, wget, jq, build-essential, unzip, zip
Python	python3, pip, venv
Node.js	Node.js 24 LTS (via NodeSource)
Search	ripgrep, fd-find
Utilities	htop, GitHub CLI (gh)
Browser	Chromium (headless), xvfb
Containers	Docker Engine, Docker Compose
AI	Claude Code, OpenCode, Codex CLI, Chrome DevTools MCP server

Security model

AI coding agents need full permissions to be useful — they install dependencies, run builds, execute tests, start servers. But running npm install or pip install means executing arbitrary third-party code on your machine.

This is not a theoretical risk. The Shai-Hulud worm compromised thousands of npm packages in 2025 by injecting malicious code that runs during npm install. It harvested npm tokens, GitHub PATs, SSH keys, and cloud credentials from developers' machines, then used those credentials to spread to other packages the developer maintained. All of this happened silently, in the background, while the legitimate install appeared normal.

An AI agent running with --dangerously-skip-permissions on your host would give such an attack full access to everything: your SSH keys, your cloud credentials, your browser sessions, your entire filesystem.

agent-vm runs all code inside the VM. The VM only has access to your project directory (read-write mount, or read-only with --readonly). It has no access to your SSH keys, npm tokens, cloud credentials, git config, browser sessions, or anything else on your host. If a supply chain attack executes inside the VM, it finds nothing to steal (except your source code) and nowhere to spread. Use --offline to block internet access entirely.

Meanwhile, your host machine stays clean. You don't need Node.js, Docker, or any dev tooling installed locally. The only host dependency is Lima. Your SSH keys and signing credentials never enter the VM — we recommend running git commit on the host yourself.
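
You can spot-check the isolation claim yourself from inside a VM (for example in an agent-vm shell session). The sketch below looks for a few common credential files under a home directory. find_host_secrets is a hypothetical helper and the file list is an assumption; extend it to match what you keep on your host.

```shell
# find_host_secrets is a hypothetical helper, not part of agent-vm.
# It prints the credential files (relative paths) found under a home directory.
# The file list is an assumption; add your own entries.
find_host_secrets() (
  home_dir=${1:-$HOME}
  for rel in .ssh/id_rsa .ssh/id_ed25519 .npmrc .aws/credentials; do
    [ -e "$home_dir/$rel" ] && echo "$rel"
  done
  exit 0
)

# Inside the VM:
#   find_host_secrets
```

In a fresh VM this should print nothing. Anything it does print was created inside the VM, for instance by logging in to a tool there.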

Why not Docker?

	No sandbox	Docker	VM (agent-vm)
Agent can run any command	Yes	Yes	Yes
File system isolation	None	Partial (shared kernel)	Full
Network isolation	None	Partial	Optional (`--offline`)
Can run Docker inside	Yes	Requires DinD or socket mount	Yes (native)
Kernel-level isolation	None	None (shares host kernel)	Full (separate kernel)
Protection from container escapes	None	None	Yes
Browser / GUI tools	Host only	Complex setup	Built-in (headless Chromium)

Docker containers share the host kernel. A motivated attacker (or a compromised dependency running inside the container) could exploit kernel vulnerabilities to escape. A VM runs its own kernel — even root access inside the VM can't reach the host.

A VM also avoids the practical headaches of Docker sandboxing. Docker runs natively inside the VM without Docker-in-Docker hacks. Headless Chromium works out of the box. Lima automatically forwards ports to your host. The agent gets a normal Linux environment where everything just works.

This workflow also replaces Docker Desktop on the Mac, which has become more and more bloated over the years.

License

MIT
