phantom-secops

Lab-only, mock-first SecOps proof module for the phantom-mesh ecosystem. Cooperating agents demonstrate both defensive ops (alert triage, log anomaly detection, threat correlation) and red-team simulation (recon, vuln scanning, POC suggestion) in an isolated lab. The reliable public demo today uses canned data; live Docker/tool execution is still being hardened.

Current Verification

Latest local verification:

python scenarios/run_kill_chain.py --target juice-shop --mock: passed
python -m pytest tests -q: 28 passed

What is claimed today: red/blue orchestration, report generation, and lab-only evidence. What is not claimed: production SOC automation, 0-day discovery, or third-party scanning.

What it does (60 seconds)

Two sets of phantom-mesh agents run in parallel against an intentionally vulnerable target (OWASP Juice Shop, DVWA, Metasploitable) running in a Docker compose lab:

RED TEAM (attack simulation)              BLUE TEAM (defensive ops)
─────────────────────────────             ─────────────────────────────
Recon ── Nmap, dnsrecon, subfinder        Alert Triage ── classify SIEM
   │                                          │             alerts, dedupe
   ▼                                          ▼
Vuln Scan ── Nuclei, Nikto                Log Anomaly ── baseline +
   │                                          │            outlier detect
   ▼                                          ▼
Exploit Suggest ── CVE matcher,           Threat Correlate ── kill chain
   │                  POC text only           │                reconstruction
   ▼                                          ▼
Pentest Report ─── markdown out           Incident Report ── exec summary

Both teams produce markdown reports. The interesting part is the side-by-side comparison: how long the attacker took to reach impact versus how long the defender took to detect the activity. The detection time is the per-incident value behind MTTD (mean time to detect), a metric used in real SOCs.
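The comparison boils down to two durations measured from the same start point. Here is a minimal sketch, assuming each run yields timestamps for the attacker's first action, the moment of impact, and the defender's first detection. The `RunTimeline` type, its field names, and the sample values are hypothetical and not taken from the repo.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass(frozen=True)
class RunTimeline:
    """Hypothetical per-run timestamps, in seconds since the run started."""
    attack_start: float
    impact_at: float    # when the red team reached its objective
    detected_at: float  # when the blue team first flagged the activity


def time_to_impact(run: RunTimeline) -> float:
    return run.impact_at - run.attack_start


def time_to_detect(run: RunTimeline) -> float:
    return run.detected_at - run.attack_start


def mean_time_to_detect(runs: list[RunTimeline]) -> float:
    """MTTD over several runs: the average of the per-run detection times."""
    return mean(time_to_detect(r) for r in runs)


# Made-up sample values, for illustration only.
runs = [
    RunTimeline(attack_start=0.0, impact_at=40.0, detected_at=25.0),
    RunTimeline(attack_start=0.0, impact_at=30.0, detected_at=45.0),
]
for r in runs:
    early = time_to_detect(r) < time_to_impact(r)
    verdict = "detected before impact" if early else "detected after impact"
    print(f"impact={time_to_impact(r):.1f}s detect={time_to_detect(r):.1f}s -> {verdict}")
print(f"MTTD={mean_time_to_detect(runs):.1f}s")
```

A single run gives one time-to-detect sample; the mean only becomes meaningful across several runs or scenarios.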

Why this exists

phantom-mesh's multi-agent runtime is well-suited to security operations because:

XDR is multi-source correlation by nature. Trend Vision One™, Microsoft Defender XDR, and CrowdStrike Falcon all cross-reference signals from endpoint, network, identity, and cloud sources. Mapping each source to an agent and letting the agents coordinate via phantom-mesh is a clean fit.
Pentest workflows are sequential pipelines that branch. Recon results feed vuln scanning, which feeds exploit suggestion. Each step is an agent with a tool budget.
LLM-assisted triage reduces alert fatigue. The blue-team agents demonstrate this in a small, observable way.
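The pipeline shape in the second point can be sketched as a chain of steps, each consuming the previous step's output while drawing on a capped number of tool calls. Everything here is hypothetical: the `Step` and `ToolBudget` types, the stand-in step functions, and their canned return values. It shows the control flow only and is not the phantom-mesh API.

```python
from dataclasses import dataclass
from typing import Callable


class BudgetExceeded(RuntimeError):
    pass


@dataclass
class ToolBudget:
    """Caps how many tool invocations a single step may make."""
    remaining: int

    def spend(self) -> None:
        if self.remaining <= 0:
            raise BudgetExceeded("tool budget exhausted")
        self.remaining -= 1


@dataclass
class Step:
    name: str
    run: Callable[[dict, ToolBudget], dict]
    max_tool_calls: int


def recon(state: dict, budget: ToolBudget) -> dict:
    budget.spend()  # stands in for one port-scan call
    return {**state, "open_ports": [3000]}  # canned value


def vuln_scan(state: dict, budget: ToolBudget) -> dict:
    findings = []
    for port in state["open_ports"]:  # branches: one scan per discovered port
        budget.spend()
        findings.append({"port": port, "severity": "medium"})
    return {**state, "findings": findings}


def exploit_suggest(state: dict, budget: ToolBudget) -> dict:
    # Text-only suggestions; nothing is executed and no tool call is spent.
    notes = [f"review finding on port {f['port']}" for f in state["findings"]]
    return {**state, "suggestions": notes}


def run_pipeline(steps: list[Step], state: dict) -> dict:
    for step in steps:
        state = step.run(state, ToolBudget(step.max_tool_calls))
    return state


result = run_pipeline(
    [
        Step("recon", recon, 1),
        Step("vuln-scan", vuln_scan, 5),
        Step("exploit-suggest", exploit_suggest, 0),
    ],
    {"target": "juice-shop"},
)
print(result["suggestions"])
```

Giving each step its own budget means a noisy recon result cannot silently turn into an unbounded number of scans downstream; the step fails loudly instead.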

This repo is a research playground — not a production tool, not a 0-day weapon, not a service offering.

Quick start

Mock mode — no docker, no API key, runs anywhere in <1 second

git clone https://github.com/markl-a/phantom-secops
cd phantom-secops
make demo-mock

Output:

→ phantom-secops kill-chain :: target=juice-shop mock=True
  [t+  0.0s] red-recon          → 1 open ports
  [t+  0.0s] red-vuln-scan      → 5 findings (1 medium, 2 low, ...)
  [t+  0.0s] red-exploit-suggest done
  [t+  0.0s] blue-log-anomaly   → 21 raw alerts
  [t+  0.0s] blue-alert-triage  → 5 triaged groups
  [t+  0.0s] blue-threat-correlate → 1 actor(s)
  [t+  0.0s] done

→ artifacts: reports/runs/<ts>/{pentest-report.md, incident-report.md,
                                recon.json, vuln-scan.json,
                                alerts.jsonl, triage-queue.jsonl,
                                kill-chains.jsonl, exploit-suggestions.md}

This runs the full red/blue agent pipeline on canned data. Use it to explore the artifact shapes and the report templates without bringing up docker. Tests run via make test; they include unit tests for the pattern matchers and triage logic (28 tests passed in the latest local verification).
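To get a feel for the artifact shapes, one option is to load the JSONL files from a run directory and list which top-level keys the records carry. This is a sketch that assumes only that each `.jsonl` artifact holds one JSON object per line. The helper names are hypothetical, and nothing here presumes particular field names.

```python
import json
from collections import Counter
from pathlib import Path


def read_jsonl(path: Path) -> list[dict]:
    """Parse one JSON object per non-empty line."""
    with path.open(encoding="utf-8") as fh:
        return [json.loads(line) for line in fh if line.strip()]


def summarize_keys(records: list[dict]) -> Counter:
    """Count how often each top-level key appears across the records."""
    return Counter(key for rec in records for key in rec)


def summarize_run(run_dir: Path) -> dict[str, Counter]:
    """Map each .jsonl file name in run_dir to its key counts."""
    return {
        p.name: summarize_keys(read_jsonl(p))
        for p in sorted(run_dir.glob("*.jsonl"))
    }
```

Calling `summarize_run(Path("reports/runs/<ts>"))` with `<ts>` replaced by an actual run directory returns one `Counter` per JSONL artifact, which is usually enough to see which fields are always present and which are optional.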

Live mode — against the docker lab

make lab-up                # bring up Juice Shop + DVWA on the private docker network
make demo                  # full kill-chain against the live lab
make lab-down              # tear down

# Optional: with phantom-mesh LLM-driven prose
phantom serve &            # phantom-mesh HTTP API at :7878
make demo  # runner picks it up if phantom is reachable

The lab targets are bound to a private docker network. They are not exposed to your host or the internet (see docker-compose.yml). All Makefile targets are listed via make help.
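The "picks it up if phantom is reachable" behaviour can be approximated with a plain TCP probe. Here is a minimal sketch, assuming the API listens on localhost port 7878 as in the comment above. The function name and the fallback choice are hypothetical, and the actual runner may check reachability differently (for example with an HTTP request).

```python
import socket


def phantom_reachable(host: str = "127.0.0.1", port: int = 7878, timeout: float = 0.5) -> bool:
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


mode = "LLM-driven prose" if phantom_reachable() else "template-driven prose"
print(f"report mode: {mode}")
```

A TCP probe only shows that something is listening on the port; it does not confirm that the listener is phantom-mesh.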

phantom-mesh integration (live mode v2)

As of 2026-05-04, phantom-secops ships three MCP server wrappers that let phantom-mesh agents drive the kill-chain pipeline directly:

secops_recon — wraps tools/nmap_runner.py
secops_log — wraps tools/log_anomaly.py
secops_self_audit — scans phantom's own agents.toml

To enable on a phantom-mesh-equipped host:

export PHANTOM_SECOPS_ROOT=$(pwd)
make mesh-mcp-config       # prints [[mcp_servers]] entries
make mesh-sync             # prints [agent.X] rendered fragments

# Append both outputs to ~/.phantom-mesh/agents.toml on the phantom-mesh
# coordinator host, then restart phantom serve.

Design: see docs/specs/2026-05-04-phantom-mesh-integration.md. Plan: see docs/superpowers/plans/2026-05-04-phantom-mesh-integration.md.

Repo layout

phantom-secops/
├── docker-compose.yml          # isolated lab (Juice Shop, DVWA, Metasploitable)
├── agents/
│   ├── red/                    # attack-side agent configs (TOML, phantom format)
│   │   ├── recon.toml
│   │   ├── vuln-scan.toml
│   │   ├── exploit-suggest.toml
│   │   └── pentest-report.toml
│   └── blue/                   # defense-side agent configs
│       ├── alert-triage.toml
│       ├── log-anomaly.toml
│       ├── threat-correlate.toml
│       └── incident-report.toml
├── tools/                      # phantom tool wrappers (Python)
│   ├── nmap_runner.py
│   ├── nuclei_runner.py
│   └── log_ingest.py
├── lab/                        # docs for each target's setup
├── scenarios/                  # markdown scenarios runnable by phantom
│   ├── full-kill-chain.md
│   └── alert-triage-demo.md
├── reports/                    # sample output reports (anonymized)
├── docs/
│   ├── ARCHITECTURE.md
│   └── INTERVIEW-TALK-TRACK.md
├── ETHICS.md                   # legal/ethical framing — read first
└── LICENSE

Status

| Component | State |
| --- | --- |
| Docker compose lab (Juice Shop, DVWA) | ✅ syntax verified, runs |
| Mock-mode end-to-end demo (`make demo-mock`) | ✅ runnable on any machine, <1s |
| Recon agent (Nmap orchestration) | ✅ working with lab-target gate |
| Vuln scan agent (Nuclei wrapper) | ⚙️ wrapper done; live integration WIP |
| Exploit suggester (CVE → POC text) | ✅ template-driven prose; LLM-driven opt-in via `--use-llm` |
| Blue team log-anomaly (URL-decoded pattern matchers) | ✅ working, 7 unit tests pass |
| Blue team triage + correlation (group by actor + ATT&CK phase) | ✅ working |
| Side-by-side red/blue report (pentest + incident markdown) | ✅ working |
| Tests (`python -m pytest tests -q`) | ✅ 28 tests passing in the latest local verification |
| Live-mode kill-chain (against running docker lab) | ⚙️ partial: recon path works; nuclei path needs a container with nuclei pre-installed |
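
Two of the blue-team rows describe techniques that are easy to illustrate: matching attack patterns on URL-decoded request paths, then grouping the resulting alerts by actor and ATT&CK phase. Below is a minimal sketch; the regexes, the alert fields, the phase labels, and the sample requests are illustrative assumptions, not the repo's actual rules or data.

```python
import re
from collections import defaultdict
from urllib.parse import unquote_plus

# Illustrative patterns only: (rule label, assumed ATT&CK phase, regex).
PATTERNS = [
    ("sqli", "initial-access", re.compile(r"'\s*or\s+1\s*=\s*1|union\s+select", re.I)),
    ("xss", "initial-access", re.compile(r"<script\b", re.I)),
    ("path-traversal", "discovery", re.compile(r"\.\./")),
]


def detect(src_ip: str, raw_url: str) -> list[dict]:
    """Decode percent-encoding first so encoded payloads cannot slip past the regexes."""
    decoded = unquote_plus(raw_url)
    return [
        {"actor": src_ip, "rule": label, "phase": phase, "url": decoded}
        for label, phase, rx in PATTERNS
        if rx.search(decoded)
    ]


def triage(alerts: list[dict]) -> dict[tuple[str, str], list[dict]]:
    """Group alerts by (actor, phase) so repeated hits collapse into one queue entry."""
    groups: dict[tuple[str, str], list[dict]] = defaultdict(list)
    for alert in alerts:
        groups[(alert["actor"], alert["phase"])].append(alert)
    return dict(groups)


# Made-up sample requests.
requests = [
    ("10.0.0.5", "/search?q=%27%20OR%201%3D1--"),
    ("10.0.0.5", "/search?q=%3Cscript%3Ealert(1)%3C%2Fscript%3E"),
    ("10.0.0.5", "/ftp/..%2F..%2Fetc%2Fpasswd"),
    ("10.0.0.9", "/products?page=2"),
]
alerts = [a for ip, url in requests for a in detect(ip, url)]
for (actor, phase), members in triage(alerts).items():
    print(actor, phase, [m["rule"] for m in members])
```

Decoding before matching is the important step: none of the three malicious sample paths contains a literal quote, angle bracket, or `../` in its raw form.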

Ethics & legality

Read ETHICS.md before use.

Short version:

All targets in this lab are legally distributed, intentionally vulnerable applications maintained for security research and education (OWASP Juice Shop, DVWA, Metasploitable).
All tools used (Nmap, Nuclei, Nikto) are legitimate, publicly available defensive research tools.
The Exploit Suggester agent only generates POC descriptions in text form. It does not generate or execute weaponized exploits.
The lab runs on an isolated docker network — never on a public network or third-party system.
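
One way to back the last point with code is a gate that refuses any target outside an explicit allowlist of lab names and private address ranges. This is a sketch under assumptions: the allowlist contents, the function name, and the exception type are hypothetical, and the repo's own lab-target gate may work differently.

```python
import ipaddress

# Hypothetical allowlist of lab service names.
LAB_HOSTNAMES = {"juice-shop", "dvwa", "metasploitable"}


class TargetNotAllowed(ValueError):
    pass


def assert_lab_target(target: str) -> str:
    """Return the target unchanged if it is a known lab name or a private/loopback IP."""
    if target in LAB_HOSTNAMES:
        return target
    try:
        addr = ipaddress.ip_address(target)
    except ValueError:
        # Not an IP literal and not on the allowlist: refuse without resolving DNS.
        raise TargetNotAllowed(f"{target!r} is not a known lab hostname") from None
    if addr.is_private or addr.is_loopback:
        return target
    raise TargetNotAllowed(f"{target!r} is outside the lab network")


for candidate in ["juice-shop", "172.18.0.3", "8.8.8.8", "example.com"]:
    try:
        assert_lab_target(candidate)
        print(f"{candidate}: allowed")
    except TargetNotAllowed as exc:
        print(f"{candidate}: refused ({exc})")
```

Private ranges also cover ordinary home and office networks, so a stricter gate would compare the address against the lab network's exact subnet rather than relying on `is_private`.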

Related projects

🌟 phantom-mesh — The agent runtime this depends on.
📖 GarageSwarm — Python predecessor of phantom-mesh.

License

Apache-2.0
