
SafePrompt

SafePrompt redacts PII in text by replacing only the sensitive spans with placeholders like [FIRSTNAME], [EMAIL], [IPV4]. It keeps everything else identical. The model’s final answer is always wrapped in <safe> ... </safe> so downstream code can trust the contract.

  • Model adapter: chinu-codes/llama-3.2-3b-pii-redactor-lora
  • Base model: meta-llama/Llama-3.2-3B-Instruct
  • Backend: FastAPI, CPU-only inference with Transformers + PEFT
  • Extension: Chrome MV3 right-click to redact selected text

Project structure

SafePrompt/
├─ backend/
│  ├─ run.sh
│  └─ app/
│     ├─ __init__.py
│     ├─ config.py
│     ├─ main.py
│     ├─ models.py
│     ├─ schemas.py
│     └─ service.py
├─ extension/
│  ├─ background.js
│  └─ manifest.json
└─ model/
   └─ SafePrompt.ipynb

The tree and file roles above match the current codebase.


Quick start

1) Backend on CPU

Requirements: Python 3.10+ and internet access on the first run to download the weights.

cd backend
python -m venv .venv && source .venv/bin/activate
pip install -r requirements.txt

Set your Hugging Face token if the base model is gated:

export HF_TOKEN="hf_...your_token..."

Start the server with safe CPU defaults:

export BASE_MODEL="meta-llama/Llama-3.2-3B-Instruct"
export ADAPTER_REPO="chinu-codes/llama-3.2-3b-pii-redactor-lora"

# CPU hygiene
export SEQ_LEN=256
export MAX_NEW_TOKENS=64
export TORCH_NUM_THREADS=2
export OMP_NUM_THREADS=2
export MKL_NUM_THREADS=2
export TOKENIZERS_PARALLELISM=false
export PYTORCH_NO_CUDA=1

uvicorn app.main:app --host 127.0.0.1 --port 8000 --workers 1

Health check:

curl -s http://127.0.0.1:8000/health

Redaction:

curl -s -X POST http://127.0.0.1:8000/redact \
  -H "Content-Type: application/json" \
  -d '{"text":"Hi, I am Vishal Shinde. Email vishal@example.com and call +1 415 555 0199."}' | jq

Example response:

{
  "safe_text": "<safe>Hi, I am [FIRSTNAME] [LASTNAME]. Email [EMAIL]</safe>",
  "redacted_text": "Hi, I am [FIRSTNAME] [LASTNAME]. Email [EMAIL]",
  "placeholders": ["FIRSTNAME","LASTNAME","EMAIL"],
  "base_model": "meta-llama/Llama-3.2-3B-Instruct",
  "adapter_repo": "chinu-codes/llama-3.2-3b-pii-redactor-lora",
  "seq_len": 256,
  "max_new_tokens": 64,
  "latency_ms": 1234
}

Offline runs: once the weights are cached locally, you can set HF_LOCAL_ONLY=true so model loading skips the network.

Low-RAM tip

On laptops with 8–12 GB RAM, add a swap file to avoid OOM kills during model load:

sudo fallocate -l 12G /swapfile2 || sudo dd if=/dev/zero of=/swapfile2 bs=1M count=12288 status=progress
sudo chmod 600 /swapfile2
sudo mkswap /swapfile2
sudo swapon /swapfile2
echo '/swapfile2 none swap sw 0 0' | sudo tee -a /etc/fstab

2) Chrome extension (MV3)

  1. Open chrome://extensions
  2. Enable Developer mode
  3. Load unpacked → choose the extension/ folder
  4. Select text on any page → right-click → Redact with SafePrompt

The extension posts to http://127.0.0.1:8000/redact and copies the redacted text to your clipboard. If script injection is blocked on a page, it opens a new tab with the result instead. The background script already parses the JSON response shape shown above.


How it works

  • The backend builds a short chat prompt that instructs the model to mirror the input and replace only PII spans with placeholders.
  • The model replies inside <safe> ... </safe>.
  • The API returns both safe_text and the inner redacted_text, plus a list of placeholders.
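
Concretely, the wrapper-extraction step can be sketched in a few lines of Python. This is illustrative only; parse_reply is a hypothetical helper name, and the real logic lives in app/service.py:

import re

SAFE_RE = re.compile(r"<safe>(.*?)</safe>", re.DOTALL)
PLACEHOLDER_RE = re.compile(r"\[([A-Z0-9_]+)\]")

def parse_reply(reply: str) -> tuple[str, list[str]]:
    """Pull the redacted text out of the <safe> wrapper and list its placeholders."""
    match = SAFE_RE.search(reply)
    if match is None:
        raise ValueError("model reply missing <safe> wrapper")
    redacted = match.group(1).strip()
    # Unique placeholder labels, in order of first appearance.
    placeholders = list(dict.fromkeys(PLACEHOLDER_RE.findall(redacted)))
    return redacted, placeholders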

Configuration

You can control behaviour with environment variables:

Variable           Default                                       Purpose
BASE_MODEL         meta-llama/Llama-3.2-3B-Instruct              Base model
ADAPTER_REPO       chinu-codes/llama-3.2-3b-pii-redactor-lora    LoRA adapter repo
SEQ_LEN            512 (I used 256 on CPU)                       Context length
MAX_NEW_TOKENS     96 (I used 64 on CPU)                         Generation cap
HF_TOKEN           empty                                         HF token if the base model is gated
HF_LOCAL_ONLY      false                                         Set to true after the cache is populated
TORCH_NUM_THREADS  2                                             Torch threads
OMP_NUM_THREADS    2                                             OpenMP threads
MKL_NUM_THREADS    2                                             MKL threads

Values above match the backend code and scripts in this repo.
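
For illustration, these variables can be read with plain os.environ lookups, roughly like this (a sketch using the defaults from the table; the actual definitions are in app/config.py and may differ):

import os

def env_int(name: str, default: int) -> int:
    """Read an integer environment variable, falling back to a default."""
    return int(os.environ.get(name, default))

BASE_MODEL = os.environ.get("BASE_MODEL", "meta-llama/Llama-3.2-3B-Instruct")
ADAPTER_REPO = os.environ.get("ADAPTER_REPO", "chinu-codes/llama-3.2-3b-pii-redactor-lora")
SEQ_LEN = env_int("SEQ_LEN", 512)
MAX_NEW_TOKENS = env_int("MAX_NEW_TOKENS", 96)
HF_TOKEN = os.environ.get("HF_TOKEN") or None
HF_LOCAL_ONLY = os.environ.get("HF_LOCAL_ONLY", "false").lower() == "true"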


API

  • GET /health: returns status, device, threads and model ids.

  • POST /redact

    Request:

    { "text": "raw input", "max_new_tokens": 64 }

    Response:

    {
      "safe_text": "<safe>...</safe>",
      "redacted_text": "...",
      "placeholders": ["EMAIL","FIRSTNAME"],
      "base_model": "...",
      "adapter_repo": "...",
      "seq_len": 256,
      "max_new_tokens": 64,
      "latency_ms": 1234
    }
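
A minimal Python client for the endpoint above (assumes the requests package is installed; the field names match the response schema shown):

import requests

def redact(text: str, max_new_tokens: int = 64) -> str:
    """Call the local SafePrompt backend and return the inner redacted text."""
    resp = requests.post(
        "http://127.0.0.1:8000/redact",
        json={"text": text, "max_new_tokens": max_new_tokens},
        timeout=120,  # CPU inference can be slow, especially on the first call
    )
    resp.raise_for_status()
    return resp.json()["redacted_text"]

print(redact("Hi, I am Vishal Shinde. Email vishal@example.com."))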

Results

Small eval on 300 samples from the dataset:

  • Exact match rate: ~0.67
  • Placeholder micro-F1: ~0.90
  • Formatting error rate: ~0.00

I report both strict exact match and span-level F1. Exact match is intentionally harsh for redaction, since several placeholder choices can be equally safe. Span-level micro-F1 is around 0.90 with a 0.00 formatting error rate, which better reflects utility and safety: even when the output is not a character-for-character match, the PII is still correctly replaced and the surrounding text is preserved.
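
To make the span metric concrete, placeholder micro-F1 can be computed over placeholder label counts roughly as follows (a sketch of the metric, not the notebook's exact evaluation code):

from collections import Counter
import re

PLACEHOLDER_RE = re.compile(r"\[([A-Z0-9_]+)\]")

def micro_f1(predictions: list[str], references: list[str]) -> float:
    """Micro-averaged F1 over placeholder labels in predicted vs. gold outputs."""
    tp = fp = fn = 0
    for pred, gold in zip(predictions, references):
        pred_counts = Counter(PLACEHOLDER_RE.findall(pred))
        gold_counts = Counter(PLACEHOLDER_RE.findall(gold))
        for label in pred_counts | gold_counts:  # union of labels seen on either side
            overlap = min(pred_counts[label], gold_counts[label])
            tp += overlap
            fp += pred_counts[label] - overlap
            fn += gold_counts[label] - overlap
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return 2 * precision * recall / (precision + recall) if precision + recall else 0.0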


Notes and limits

  • English only for this adapter.
  • When unsure, the model keeps the original span by design.
  • Very long inputs should be chunked client-side.
  • Please handle personal data responsibly.


Licence

MIT for the code in this repo. Follow the licences and usage terms of the base model and dataset.


About

SafePrompt is a tiny, practical system for LLM-assisted redaction.
