Clinical Intelligence RAG

A privacy-first RAG pipeline for healthcare: PHI scrubbing on ingest, multi-provider LLM inference, and strict context grounding validated at Faithfulness 1.00.

Quick Start

docker-compose up --build
# API:  http://localhost:8000/docs
# UI:   http://localhost:8501

The Problem

Clinical PDFs and discharge summaries are unstructured. Querying them with a generative model introduces hallucination risk — the model interpolates beyond retrieved context. In a healthcare setting, that failure mode is not acceptable.

The Architecture

Documents are chunked with overlap using LangChain's text splitter before embedding, preserving sentence boundaries and reducing context fragmentation. Vectors are stored in Pinecone using OpenAI text-embedding-3-small (1536 dimensions by default; swap via .env). At query time, a contextual compression step filters retrieved chunks to only the segments semantically relevant to the question — reducing token overhead by 20–30% and keeping the LLM prompt tight.

LLM providers are pluggable: OpenAI, Anthropic, or AWS Bedrock, configured entirely via .env. No code changes required to switch.

PHI is scrubbed from document text before embedding. Scrubbed content is never written to the vector index.

graph TD
    A["Clinical PDFs"] --> B["Ingestion (Parse & Chunk)"]
    B -->|PHI Scrubbing| C["Embeddings (Multi-Provider)"]
    C --> D[("Pinecone Vector DB")]

    subgraph RAG["RAG Engine"]
    D <-->|Semantic Search| E["Retrieval & Compression"]
    E -->|Relevant Context| F["LLM Provider Selection"]
    F -->|OpenAI/Anthropic/Bedrock| G["Generate Answer"]
    end

    G -->|Grounded Response| H["FastAPI REST API"]

    subgraph UI["User Interface"]
    H --> I["Streamlit Dashboard"]
    I --> J["Clinical Intelligence"]
    end

The Objective

Accelerate clinical data discovery — surfacing relevant findings across patient records, discharge notes, and lab reports — while enforcing strict context boundaries: the system answers only from retrieved context, never from model weights.

Evaluation

Metric	Score
Faithfulness	1.00
Answer Relevancy	0.97
Context Precision	1.00
Overall	0.99

python eval/evaluate_rag.py

Tech Stack

Layer	Technology	Purpose
Backend API	FastAPI	REST endpoints, async request handling
Frontend UI	Streamlit	Interactive dashboard, document upload
LLM Orchestration	LangChain	Chain-of-thought reasoning, prompt management
Vector Database	Pinecone	Semantic search, embeddings storage
Evaluation	Ragas	Faithfulness, Precision, Recall metrics
Containerization	Docker Compose	Multi-service orchestration
LLM Providers	OpenAI, Anthropic, AWS Bedrock	Plug-and-play multi-provider support
Monitoring	LangSmith	Pipeline tracing, token tracking, latency

Docs

SETUP.md — Install and configure
API.md — REST API reference
ARCHITECTURE.md — Design decisions
MONITORING.md — LangSmith tracing
TROUBLESHOOTING.md — Common issues

Troubleshooting

ValidationException (Bedrock): Ensure your AWS region supports the selected model and that you have active model access in your Bedrock console.

IndexNotFound (Pinecone): Ensure PINECONE_INDEX_NAME in .env matches the index you created in the Pinecone dashboard.

No working Bedrock models found: The system falls back automatically to Anthropic (Claude 3.5 Sonnet) or OpenAI (GPT-4o) as configured in .env.

ModuleNotFoundError: Activate your virtual environment before running:

# Windows
.venv\Scripts\Activate.ps1
# Linux/Mac
source .venv/bin/activate

Enable LangSmith tracing (optional, for debugging pipeline execution):

LANGCHAIN_TRACING_V2=true
LANGCHAIN_API_KEY=your-langsmith-api-key

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
api		api
core		core
data/samples		data/samples
docs		docs
eval		eval
models		models
services		services
tests		tests
ui		ui
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
docker-compose.yml		docker-compose.yml
eval_results.json		eval_results.json
main.py		main.py
requirements.txt		requirements.txt
setup.sh		setup.sh
sync_data.py		sync_data.py
test_query.py		test_query.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Clinical Intelligence RAG

Quick Start

The Problem

The Architecture

The Objective

Evaluation

Tech Stack

Docs

Troubleshooting

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Clinical Intelligence RAG

Quick Start

The Problem

The Architecture

The Objective

Evaluation

Tech Stack

Docs

Troubleshooting

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages