Pawansingh3889/README.md

Pawan Singh Kapkoti

Data engineer based in the UK. I build small, free, on-prem tools for data and SQL work, and send fixes upstream to the projects I use. MSc Data Analytics, Aston.

Portfolio · LinkedIn · PyPI

Tools

  • schema-scout — reverse-engineers a SQL Server database into an AI-ready catalog: infers the undeclared foreign keys, flags PII, scores how AI-ready the schema is, and renders an offline dashboard. Runs entirely on-prem.
  • sql-sop — a SQL linter that catches dangerous and non-conforming SQL in CI and pre-commit. pip install sql-sop · browser playground
  • sql-sop-mcp — an MCP server wrapping the linter, so Claude / Cursor / any MCP client can call it. pip install sql-sop-mcp
  • pr-sop — small PR-governance checks: changelog drift, version mismatch, stale pre-commit pins. pip install pr-sop
  • morning-brief — rule-based Gmail triage, read-only, no LLM. pip install morning-brief
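
As an illustration of what "inferring undeclared foreign keys" can mean, here is a minimal sketch of one common heuristic, value containment: a column is a foreign-key candidate if all of its non-null values appear in another table's primary key. This is not schema-scout's actual method, which is not described here. The function name, the in-memory table layout, and the `min_overlap` parameter are all hypothetical.

```python
def infer_foreign_keys(tables, primary_keys, min_overlap=1.0):
    """Return (child_table, child_column, parent_table, parent_key) candidates.

    tables:       {table_name: {column_name: [values]}}  (hypothetical layout)
    primary_keys: {table_name: primary_key_column}
    min_overlap:  fraction of non-null child values that must exist in the parent key
    """
    candidates = []
    for parent, pk_col in primary_keys.items():
        pk_values = set(tables[parent][pk_col])
        for child, columns in tables.items():
            if child == parent:
                continue  # self-references are skipped in this sketch
            for col, values in columns.items():
                non_null = [v for v in values if v is not None]
                if not non_null:
                    continue
                # Share of child values that are present in the parent key
                overlap = sum(v in pk_values for v in non_null) / len(non_null)
                if overlap >= min_overlap:
                    candidates.append((child, col, parent, pk_col))
    return candidates


tables = {
    "customers": {"id": [1, 2, 3], "name": ["Ann", "Bo", "Cy"]},
    "orders": {
        "order_id": [10, 11, 12],
        "customer_id": [1, 1, 3],
        "total": [9.5, 20.0, 7.25],
    },
}
primary_keys = {"customers": "id", "orders": "order_id"}
fk_candidates = infer_foreign_keys(tables, primary_keys)
```

On real data, containment alone produces false positives (small integer columns often overlap by chance), so a practical tool would likely combine it with name similarity and type checks.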

AI / on-prem

  • OpsMind — on-prem AI over manufacturing data: natural-language-to-SQL through a LangGraph agent, RAG over pgvector / ChromaDB, models served locally by Ollama, with an eval harness.
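
For natural-language-to-SQL over production data, one typical safeguard is to reject anything that is not a single read-only statement before it reaches the database. The sketch below shows that idea only; it is not OpsMind's code, and `is_read_only` and the keyword list are assumptions made for illustration.

```python
import re

# Keywords that indicate a write or DDL operation (illustrative, not exhaustive)
FORBIDDEN = re.compile(
    r"\b(insert|update|delete|merge|drop|alter|create|truncate|grant|revoke|exec|execute)\b",
    re.IGNORECASE,
)


def is_read_only(sql: str) -> bool:
    """Conservatively accept only a single SELECT / WITH statement."""
    stripped = sql.strip().rstrip(";").strip()
    if ";" in stripped:
        return False  # more than one statement
    if not re.match(r"(?i)^(select|with)\b", stripped):
        return False  # must start as a query
    return FORBIDDEN.search(stripped) is None
```

The check is deliberately strict: a query whose string literal contains a word such as "update" is rejected too. A text check like this is best treated as one layer alongside database-level controls such as a read-only role.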

Data pipelines

  • production-analytics-pipeline — incremental ETL from a manufacturing ERP; FastAPI + Next.js + Power BI, Prefect orchestration, 53 tests.
  • uk-crime-pipeline — Police UK API into PostgreSQL and BigQuery; dbt marts, ~100k records, Polars ingestion.

Tech

Python · SQL Server / PostgreSQL · dbt · pandas / Polars · FastAPI · Ollama / LangGraph (on-prem AI) · Docker · GitHub Actions

Open source

My pull requests across OSS projects.

Pinned

  1. sql-guard (Public)

    Fast rule-based SQL linter on PyPI (sql-sop). 38 rules, 149 tests, libCST injection scanner, SARIF output, browser playground. Pre-commit hook + GitHub Action. 500+ monthly downloads.

    Python 1 7

  2. uk-crime-pipeline (Public)

    End-to-end pipeline: Police UK API to PostgreSQL + BigQuery. dbt staging/marts, 65 tests, 3 CI/CD workflows, Looker Studio + Streamlit dashboards.

    Python

  3. OpsMind (Public)

    On-prem NL→SQL for factory ops, answers in seconds — LangGraph agent, Gemma 3 12B, ChromaDB/pgvector, 4-layer read-only safety, 100+ tests.

    Python 1

  4. Hackathon-mediask (Public)

    MediAsk — health Q&A platform for factory workers. Flask, PostgreSQL, Gemini AI, Docker. Live on Render.

    Python

  5. forthepeople-uk (Public)

    UK citizen transparency platform. Free council-level dashboards: weather, population, housing, crime, health, schools, elections, benefits.

    Python

  6. manufacturing-compliance-dashboard (Public)

    BRC/HACCP food safety dashboard. Batch traceability, temperature monitoring, allergen matrix, weight variance. Streamlit + Sentry.

    Python