Sentinel: Real-Time AI Fraud Detection

Enterprise-grade, event-driven anomaly detection pipeline with sub-millisecond ONNX inference.

Sentinel is an enterprise-grade, real-time fraud detection system. It simulates high-throughput financial transactions via streaming (Redpanda/Kafka) and evaluates them in milliseconds using an optimized ONNX inference engine.

Quick Start

1. Clone the Repository

Clone the project to your local machine and navigate into the root directory:

    git clone https://github.com/enesgulerdev/sentinel.git
    cd sentinel

2. Configure Environment Variables

The data ingestion process requires a Google Drive File ID to fetch the raw dataset via gdown. Copy the example environment file to create your local configuration:

    cp .env.example .env

3. Install Dependencies

Install all required Python packages and set up the local development environment. This command utilizes uv to create a virtual environment and strictly syncs the dependencies locked in uv.lock.

    task env:install

4. Execute the ML Pipeline

Run the complete machine learning pipeline. This automated task will fetch the raw dataset using your provided .env variable, apply preprocessing transformations, and train the baseline model.

    task ml:pipeline

5. Launch and Manage Application

The Sentinel project utilizes a microservices architecture. Start the Docker containers to spin up the Prefect orchestration server, API gateway, and all other core services in detached mode:

    # Start all services
    task docker:up

    # Stop and remove containers, networks, and volumes
    task docker:down

Kubernetes Orchestration & Autoscaling (Local Development via Kind)

Note: If you experience any issues or hanging pods during the Kubernetes deployment, please refer to the Troubleshooting section at the bottom of this page.

Sentinel provides streamlined Taskfile commands for local Kubernetes orchestration, eliminating the need for complex kubectl management.

Prerequisites: Cluster & Metrics Server

Before running the deployment tasks, you need to create the local Kind cluster and install the Metrics Server (required for Horizontal Pod Autoscaling).

Create the cluster:

    kind create cluster

Apply the official metrics-server manifest:

    kubectl apply -f https://github.com/kubernetes-sigs/metrics-server/releases/latest/download/components.yaml

Patch the deployment to allow insecure TLS (a requirement for local Kind nodes without proper certificates):

    # For Linux/macOS (Bash/Zsh):
    kubectl patch deployment metrics-server -n kube-system --type='json' -p='[{"op": "add", "path": "/spec/template/spec/containers/0/args/-", "value": "--kubelet-insecure-tls"}]'

Windows PowerShell Users: If the command above fails due to JSON formatting errors, use a patch file instead:

    Set-Content -Path patch.json -Value '[{"op":"add","path":"/spec/template/spec/containers/0/args/-","value":"--kubelet-insecure-tls"}]'
    kubectl patch -n kube-system deployment metrics-server --type=json --patch-file patch.json
    Remove-Item patch.json

Streamlined Deployment Workflow

1. Build and Load Images:

Ensure your local Kind cluster has the latest images built directly from your source code.

    task k8s:build-load

2. Deploy the Architecture:

Apply all infrastructure (Redis, Redpanda, Postgres) and application (API, Consumer) manifests sequentially.

    task k8s:up

3. Check Pod Status (Crucial):

Before accessing the services, ensure all pods have reached the Running state. If you attempt to port-forward while pods are in Init or ContainerCreating states, the connection will fail.

    task k8s:status

4. Access the Services:

Run the following command to bind all necessary K8s ports (API Gateway and Redpanda Console) to your local machine simultaneously. This process runs in the foreground; simply press Ctrl+C to terminate all connections when done.

    task k8s:forward

5. Teardown:

Clean up the entire Sentinel stack from your Kubernetes cluster.

    task k8s:down

Optional: Architecture Stress Testing (HPA in Action)

Sentinel is designed with high availability and elasticity in mind. We use Kubernetes Horizontal Pod Autoscaling (HPA) to dynamically scale the stateless API layer based on sudden traffic spikes.

You can safely benchmark the API's scaling capabilities locally using oha:

Keep the Port-Forward Running: Ensure task k8s:forward is running in your first terminal.
Monitor the Autoscaler: Open a second terminal and watch the HPA react in real-time:
```
kubectl get hpa sentinel-api-hpa -w
```
Trigger the Load Test: Open a third terminal and blast the API with 250 concurrent workers for 60 seconds:
```
task test:load-health
```
Observe the second terminal: You will see the CPU utilization spike dramatically, prompting Kubernetes to autonomously clone the API pods (up to 5 replicas) to distribute the load, maintaining a 100% success rate without dropping connections.

Local Services & Ports

Once the Docker containers are up and running, you can access the core services via the following local addresses:

Service	Local URL
Prefect	http://localhost:4200
API Gateway	http://localhost:8000
Redpanda	http://localhost:8080

Testing & Performance

This project uses pytest for unit and integration testing, and oha for HTTP load testing. We use Taskfile to automate these processes.

Running Unit and Integration Tests

To execute the entire test suite, which includes logic validation and idempotency checks, run the following command:

Prerequisite: Before running any tasks, ensure your virtual environment is active to access project dependencies:

Windows: .venv\Scripts\activate
macOS/Linux: source .venv/bin/activate

    task test:run

Running Performance Tests

To benchmark the API Gateway's connection capacity and measure the health endpoint's throughput under heavy concurrent load (250 workers for 1 minute), execute:

Prerequisite: Ensure that Docker is running and your infrastructure (Redis, Redpanda) is healthy before starting load tests.

   task test:load-health

Note: Note on Performance Bottlenecks: If you observe high average latency (ms) during this extreme load test, it is because the API is currently deployed as a single, standalone Docker container. This creates a natural bottleneck at the single-process level. In the upcoming Kubernetes (K8s) deployment phase, we will implement horizontal scaling. By increasing the pod replica count behind a load balancer, the concurrent traffic will be distributed across multiple instances, effectively mitigating this latency issue and maximizing overall throughput.

Database Verification (Sanity Check)

To ensure the end-to-end data pipeline is successfully capturing events and persisting them to PostgreSQL, you can query the database directly from within the Kubernetes cluster.

Run the following command to check the latest records and their AI-assigned risk scores:

Prerequisite: Port Forwarding Since the API is running inside the cluster, you must first forward the port to your local machine:

    task k8s:forward

Note: Keep this terminal open or run it in the background.

Trigger a Test Transaction Send a mock transaction with all required features to the API:

Invoke-RestMethod -Uri "http://localhost:8000/api/v1/transactions" -Method Post -Headers @{"Content-Type"="application/json"} -Body '{"transaction_id": "TEST-1001", "user_id": "user_777", "Amount": 999.99, "Time": 10.0, "V1": 0.0, "V2": 0.0, "V3": 0.0, "V4": 0.0, "V5": 0.0, "V6": 0.0, "V7": 0.0, "V8": 0.0, "V9": 0.0, "V10": 0.0, "V11": 0.0, "V12": 0.0, "V13": 0.0, "V14": 0.0, "V15": 0.0, "V16": 0.0, "V17": 0.0, "V18": 0.0, "V19": 0.0, "V20": 0.0, "V21": 0.0, "V22": 0.0, "V23": 0.0, "V24": 0.0, "V25": 0.0, "V26": 0.0, "V27": 0.0, "V28": 0.0}'

Verify Inference Logs Check if the Consumer captured the event from Redpanda and processed it via the ONNX model:

kubectl logs deploy/sentinel-consumer

Query the Database Directly query the PostgreSQL statefulset to see the persisted record and its AI-assigned risk score:

    kubectl exec -it postgres-0 -- psql -U sentinel -d sentinel_db -c "SELECT transaction_id, user_id, amount, risk_score FROM transactions LIMIT 10;"

Expected Output: If the pipeline is functioning correctly, you should see a table displaying the transactions that have passed through Redpanda and the AI Consumer.

Troubleshooting

"task: command not found"

Sentinel leverages the Taskfile runner for efficient task automation and documentation. If the task command is not recognized, you need to install the task runner on your system:

macOS (Homebrew): brew install go-task/tap/go-task
Windows (Chocolatey or Scoop): choco install go-task or scoop install task
Linux: sh -c "$(curl --location https://taskfile.dev/install.sh)" -- -d

Alternatively, you can visit the official Task installation guide for more options.

"oha: command not found"

Issue: Running task load-test-health fails with a "command not found: oha" error.

Solution: The performance testing tasks strictly depend on the oha HTTP load generator. You can quickly install it directly via your system's package manager:

Windows (Winget): winget install hatoo.oha
macOS (Homebrew): brew install oha
Linux (Arch): pacman -S oha
Universal (Cargo/Rust): cargo install oha

After installation, ensure that the installation directory is added to your system's PATH environment variable.

"Missing kind command on Windows"

If you encounter a "kind": executable file not found in $PATH error during the build phase, it means the Kind CLI is not installed on your system.

Open PowerShell as Administrator and install Kind via winget:
```
winget install Kubernetes.kind
```
Restart your terminal (VS Code or PowerShell) to refresh the environment variables.
Create your local Kind cluster before running the tasks:
```
kind create cluster
```

Name		Name	Last commit message	Last commit date
Latest commit History 170 Commits
.github/workflows		.github/workflows
data		data
docker		docker
docs/images		docs/images
k8s		k8s
requirements		requirements
src		src
tests		tests
.dockerignore		.dockerignore
.env.example		.env.example
.gitattributes		.gitattributes
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md
Taskfile.yml		Taskfile.yml
docker-compose.yml		docker-compose.yml
payload.json		payload.json
profile_report.html		profile_report.html
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Sentinel: Real-Time AI Fraud Detection

Quick Start

1. Clone the Repository

2. Configure Environment Variables

3. Install Dependencies

4. Execute the ML Pipeline

5. Launch and Manage Application

Kubernetes Orchestration & Autoscaling (Local Development via Kind)

Prerequisites: Cluster & Metrics Server

Streamlined Deployment Workflow

1. Build and Load Images:

2. Deploy the Architecture:

3. Check Pod Status (Crucial):

4. Access the Services:

5. Teardown:

Optional: Architecture Stress Testing (HPA in Action)

Local Services & Ports

Testing & Performance

Running Unit and Integration Tests

Running Performance Tests

Database Verification (Sanity Check)

Troubleshooting

"task: command not found"

"oha: command not found"

"Missing kind command on Windows"

About

Uh oh!

Releases 11

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Sentinel: Real-Time AI Fraud Detection

Quick Start

1. Clone the Repository

2. Configure Environment Variables

3. Install Dependencies

4. Execute the ML Pipeline

5. Launch and Manage Application

Kubernetes Orchestration & Autoscaling (Local Development via Kind)

Prerequisites: Cluster & Metrics Server

Streamlined Deployment Workflow

1. Build and Load Images:

2. Deploy the Architecture:

3. Check Pod Status (Crucial):

4. Access the Services:

5. Teardown:

Optional: Architecture Stress Testing (HPA in Action)

Local Services & Ports

Testing & Performance

Running Unit and Integration Tests

Running Performance Tests

Database Verification (Sanity Check)

Troubleshooting

"task: command not found"

"oha: command not found"

"Missing kind command on Windows"

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 11

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages