Healthcare RAG System Backend

A comprehensive Retrieval-Augmented Generation (RAG) system built with FastAPI for healthcare document analysis and medical question answering.

🚀 Features

User Authentication: JWT-based authentication for patients and doctors
Document Management: Upload and process medical documents (PDF, TXT)
Vector Database: FAISS-based vector storage for document embeddings
RAG Pipeline: LangChain-powered question answering with source retrieval
Conversation History: Maintain context-aware chat sessions
Role-based Access: Different permissions for patients and doctors
Async API: High-performance asynchronous endpoints
CORS Support: Frontend-friendly with configurable origins

🏗️ Architecture

app/
├── __init__.py
├── config.py          # Configuration and environment variables
├── database.py        # Database connection and session management
├── models.py          # SQLAlchemy database models
├── schemas.py         # Pydantic request/response schemas
├── auth.py            # JWT authentication and security
├── services/          # Business logic layer
│   ├── __init__.py
│   ├── rag_service.py      # RAG pipeline and vector operations
│   ├── user_service.py     # User management operations
│   ├── document_service.py # Document processing and storage
│   └── conversation_service.py # Chat and conversation management
└── routers/           # API endpoint definitions
    ├── __init__.py
    ├── auth.py        # Authentication endpoints
    ├── users.py       # User management endpoints
    ├── documents.py   # Document upload/management endpoints
    └── rag.py         # RAG and conversation endpoints

🛠️ Technology Stack

Backend Framework: FastAPI (Python)
Database: SQLite (configurable to PostgreSQL)
ORM: SQLAlchemy 2.0
Authentication: JWT with python-jose
Vector Database: FAISS (Facebook AI Similarity Search)
Document Processing: LangChain, PyPDF2
Embeddings: Sentence Transformers (HuggingFace)
Password Hashing: bcrypt
API Documentation: Auto-generated with FastAPI

📋 Prerequisites

Python 3.8+
Virtual environment (recommended)
Git

🚀 Installation

Clone the repository

git clone <repository-url>
cd healthcare-rag-system

Create and activate virtual environment

python -m venv venv
# On Windows
venv\Scripts\activate
# On macOS/Linux
source venv/bin/activate

Install dependencies
```
pip install -r requirements.txt
```

Environment Configuration

# Copy environment template
cp env.example .env

# Edit .env file with your configuration
# Update SECRET_KEY, OPENAI_API_KEY, etc.

Run the application
```
python main.py
```
The API will be available at http://localhost:8000

📚 API Documentation

Once running, access the interactive API documentation:

Swagger UI: http://localhost:8000/docs
ReDoc: http://localhost:8000/redoc

🔐 Authentication

The system uses JWT tokens for authentication. Include the token in the Authorization header:

Authorization: Bearer <your-jwt-token>

Authentication Flow

Register: POST /api/v1/auth/register
Login: POST /api/v1/auth/login
Use Token: Include in subsequent requests

📁 API Endpoints

Authentication (`/api/v1/auth`)

POST /register - User registration
POST /login - User authentication
GET /me - Get current user info
POST /logout - User logout

Users (`/api/v1/users`)

GET / - List all users (doctors only)
GET /{user_id} - Get user by ID
PUT /{user_id} - Update user
DELETE /{user_id} - Deactivate user
GET /profile/me - Get own profile
PUT /profile/me - Update own profile

Documents (`/api/v1/documents`)

POST /upload - Upload medical document
GET / - List user's documents
GET /{document_id} - Get document details
DELETE /{document_id} - Delete document
POST /{document_id}/process - Process document manually
GET /stats/summary - Document statistics
GET /{document_id}/chunks - Get document chunks

RAG (`/api/v1/rag`)

POST /ask - Ask medical question
GET /conversations - List conversations
GET /conversations/{id} - Get conversation
GET /conversations/{id}/messages - Get conversation messages
DELETE /conversations/{id} - Delete conversation
PUT /conversations/{id}/title - Update conversation title
GET /conversations/summary - Conversation summary
POST /conversations/new - Create new conversation

🔄 Workflow Example

1. User Registration

curl -X POST "http://localhost:8000/api/v1/auth/register" \
  -H "Content-Type: application/json" \
  -d '{
    "email": "doctor@example.com",
    "username": "dr_smith",
    "password": "secure_password",
    "full_name": "Dr. John Smith",
    "is_doctor": true
  }'

2. User Login

curl -X POST "http://localhost:8000/api/v1/auth/login" \
  -H "Content-Type: application/json" \
  -d '{
    "username": "dr_smith",
    "password": "secure_password"
  }'

3. Upload Medical Document

curl -X POST "http://localhost:8000/api/v1/documents/upload" \
  -H "Authorization: Bearer <your-jwt-token>" \
  -F "file=@medical_report.pdf"

4. Ask Medical Question

curl -X POST "http://localhost:8000/api/v1/rag/ask" \
  -H "Authorization: Bearer <your-jwt-token>" \
  -H "Content-Type: application/json" \
  -d '{
    "question": "What are the side effects of Metformin?"
  }'

⚙️ Configuration

Environment Variables

Variable	Description	Default
`DATABASE_URL`	Database connection string	`sqlite:///./healthcare_rag.db`
`SECRET_KEY`	JWT secret key	`your-secret-key-here`
`ALGORITHM`	JWT algorithm	`HS256`
`ACCESS_TOKEN_EXPIRE_MINUTES`	Token expiration time	`30`
`OPENAI_API_KEY`	OpenAI API key for LLM	``
`EMBEDDING_MODEL_NAME`	HuggingFace model name	`all-MiniLM-L6-v2`
`CHUNK_SIZE`	Document chunk size	`1000`
`CHUNK_OVERLAP`	Chunk overlap size	`200`
`HOST`	Server host	`0.0.0.0`
`PORT`	Server port	`8000`
`DEBUG`	Debug mode	`True`
`ALLOWED_ORIGINS`	CORS allowed origins	`["http://localhost:3000"]`

🔧 Development

Project Structure

├── main.py                 # FastAPI application entry point
├── requirements.txt        # Python dependencies
├── env.example            # Environment variables template
├── app/                   # Application package
│   ├── __init__.py
│   ├── config.py          # Configuration management
│   ├── database.py        # Database setup
│   ├── models.py          # Database models
│   ├── schemas.py         # Pydantic schemas
│   ├── auth.py            # Authentication utilities
│   ├── services/          # Business logic services
│   └── routers/           # API route handlers
├── uploads/               # File upload directory
├── vector_store/          # FAISS vector store
└── healthcare_rag.db      # SQLite database

Adding New Features

New Model: Add to app/models.py
New Schema: Add to app/schemas.py
New Service: Create in app/services/
New Endpoint: Add to appropriate router in app/routers/

Database Migrations

The system uses SQLAlchemy with automatic table creation. For production, consider using Alembic for migrations.

🚀 Deployment

Production Considerations

Environment Variables: Set proper production values
Database: Use PostgreSQL instead of SQLite
Security: Change default secret keys
CORS: Configure allowed origins properly
File Storage: Use cloud storage (S3, Azure Blob) instead of local files
Vector Store: Consider cloud vector databases (Pinecone, Weaviate)

Docker Deployment

FROM python:3.8-slim

WORKDIR /app

COPY requirements.txt .
RUN pip install -r requirements.txt

COPY . .

EXPOSE 8000

CMD ["python", "main.py"]

🧪 Testing

Manual Testing

Start the application
Use the interactive docs at /docs
Test endpoints with sample data

Automated Testing

# Install test dependencies
pip install pytest pytest-asyncio httpx

# Run tests
pytest

🤝 Contributing

Fork the repository
Create a feature branch
Make your changes
Add tests if applicable
Submit a pull request

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

⚠️ Disclaimer

This system is for educational and research purposes. Medical information should not be used as a substitute for professional medical advice. Always consult with qualified healthcare professionals.

🆘 Support

For issues and questions:

Check the API documentation at /docs
Review the logs for error messages
Open an issue on the repository

🔮 Future Enhancements

Integration with actual LLM APIs (OpenAI GPT, Claude)
Support for more document formats (DOCX, images)
Advanced search and filtering
User analytics and insights
Multi-tenant architecture
Real-time notifications
Mobile app support
HIPAA compliance features

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
__pycache__		__pycache__
alembic		alembic
app		app
uploads		uploads
vector_store		vector_store
venv		venv
.env		.env
ENDPOINTS.md		ENDPOINTS.md
QUICKSTART.md		QUICKSTART.md
README.md		README.md
alembic.ini		alembic.ini
healthcare_rag.db		healthcare_rag.db
main.py		main.py
requirements.txt		requirements.txt
start.bat		start.bat
start.py		start.py

Folders and files

Latest commit

History

Repository files navigation

Healthcare RAG System Backend

🚀 Features

🏗️ Architecture

🛠️ Technology Stack

📋 Prerequisites

🚀 Installation

📚 API Documentation

🔐 Authentication

Authentication Flow

📁 API Endpoints

Authentication (/api/v1/auth)

Users (/api/v1/users)

Documents (/api/v1/documents)

RAG (/api/v1/rag)

🔄 Workflow Example

1. User Registration

2. User Login

3. Upload Medical Document

4. Ask Medical Question

⚙️ Configuration

Environment Variables

🔧 Development

Project Structure

Adding New Features

Database Migrations

🚀 Deployment

Production Considerations

Docker Deployment

🧪 Testing

Manual Testing

Automated Testing

🤝 Contributing

📄 License

⚠️ Disclaimer

🆘 Support

🔮 Future Enhancements

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Authentication (`/api/v1/auth`)

Users (`/api/v1/users`)

Documents (`/api/v1/documents`)

RAG (`/api/v1/rag`)

Packages