Grepture Proxy

An LLM API proxy that detects and redacts PII, blocks sensitive content, and tokenizes fields — all before requests reach your AI provider.

Quick Start

# Copy and edit the example rules
cp rules.example.json rules.json

# Start the proxy
bun run src/index.ts

The proxy starts on port 4001 by default. Send requests through it by setting the X-Grepture-Target header to your upstream API:

curl http://localhost:4001/proxy/ \
  -H "Authorization: Bearer any-token" \
  -H "X-Grepture-Target: https://api.anthropic.com/v1/messages" \
  -H "X-Grepture-Auth-Forward: Bearer sk-ant-..." \
  -H "Content-Type: application/json" \
  -d '{
    "model": "claude-sonnet-4-5-20250514",
    "max_tokens": 1024,
    "messages": [{"role": "user", "content": "My email is john@example.com"}]
  }'

The proxy will redact john@example.com before it reaches the API, based on your rules.

Configuration

Environment Variable	Required	Default	Description
`GREPTURE_API_KEY`	No	—	If set, only requests with this Bearer token are allowed. If unset, any token is accepted.
`GREPTURE_RULES_FILE`	No	`rules.json`	Path to rules configuration file
`GREPTURE_PLUGINS`	No	—	Comma-separated paths to plugin modules to load at startup
`PORT`	No	`4001`	Port to listen on
`R2_ENDPOINT`	No (cloud mode)	—	S3-compatible endpoint for body offload (e.g. `https://<account_id>.r2.cloudflarestorage.com`)
`R2_ACCESS_KEY_ID`	No (cloud mode)	—	Access key with `Object Write` permission on the bucket
`R2_SECRET_ACCESS_KEY`	No (cloud mode)	—	Secret key paired with `R2_ACCESS_KEY_ID`
`R2_BUCKET`	No (cloud mode)	—	Bucket name where large bodies are stored

Body Storage

The proxy logs request and response bodies as part of its traffic log. For local-mode runs, bodies are written to stdout and there is no size limit beyond what fits in memory (default 10MB per request).

For cloud-mode runs that persist logs to a database, bodies are stored inline up to 50KB. Anything larger is offloaded to S3-compatible object storage configured via the four R2_* variables above (any S3-compatible backend works — the name is historical). The full body is uploaded under hot-bodies/<team_id>/<log_id>/<field>.json, and a 5KB preview stays in the database alongside a pointer key for the offloaded object.

Offload happens during the log-batch flush, after the proxy has already responded to the client. It does not add latency to the request path.
If R2_* is not configured or the upload fails, bodies fall back to a 50KB inline truncation.
Bodies are encrypted in transit (HTTPS) and rely on the bucket's own access controls for encryption at rest.

Rules

Rules are defined in a JSON file (default: rules.json). See rules.example.json for the full format.

Each rule has:

conditions — when to apply (match on headers, body, URL, model)
actions — what to do (redact PII, find/replace, tokenize, block, log)
apply_to — input (before forwarding), output (on response), or both
sampling_rate — percentage of requests to apply to (1-100)

Available Actions

Action	Description
`redact_pii`	Detect and redact PII using regex patterns (email, phone, SSN, credit card, IP, address, DOB)
`find_replace`	Find and replace text (literal or regex)
`tokenize`	Replace JSON fields with tokens, store originals for later restoration
`redact_field`	Replace specific JSON fields with a fixed value
`block_request`	Block the request with a custom status code and message
`log_only`	Tag the request for logging without modifying it

Rules are reloaded automatically when the file changes, or on SIGHUP.

Docker

docker build -t grepture-proxy .

docker run -p 4001:4001 \
  -v $(pwd)/rules.json:/app/rules.json \
  grepture-proxy

How It Works

Client → Proxy → [Auth] → [Input Rules] → [Forward] → [Output Rules] → [Detokenize] → Client

Authenticate the request (optionally validate API key if GREPTURE_API_KEY is set)
Apply input rules (redact PII, block, tokenize)
Forward to the upstream API (set via X-Grepture-Target)
Apply output rules to the response
Restore tokenized values
Return the response

Supports both buffered and streaming (SSE) responses. Token restoration works across streamed chunks.

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
.github/workflows		.github/workflows
scripts		scripts
src		src
test		test
.gitignore		.gitignore
Dockerfile		Dockerfile
Dockerfile.cloud		Dockerfile.cloud
LICENSE		LICENSE
README.md		README.md
bun.lock		bun.lock
bunfig.toml		bunfig.toml
docker-compose.cloud.yml		docker-compose.cloud.yml
examples.md		examples.md
package.json		package.json
rules.example.json		rules.example.json
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Grepture Proxy

Quick Start

Configuration

Body Storage

Rules

Available Actions

Docker

How It Works

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Grepture Proxy

Quick Start

Configuration

Body Storage

Rules

Available Actions

Docker

How It Works

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages