Medical Result VLM Extractor

Dự án này là công cụ tự động trích xuất thông tin từ hình ảnh giấy tờ, phiếu kết quả khám bệnh hoặc hồ sơ y tế bằng Vision Language Model (VLM) thông qua OpenRouter API.

Tính Năng

VLM Extraction: Sử dụng Gemini 1.5 Pro, GPT-4o, Claude 3 để chuyển đổi ảnh xét nghiệm sang JSON
Template-based: Điền dữ liệu theo template.json định sẵn
Auto-retry: Tự động retry khi VLM trả về JSON không hợp lệ
Async Processing: Hỗ trợ RabbitMQ queue cho web app integration

Yêu Cầu

Python 3.10+
uv (recommended)

Cài Đặt

# Clone và cd vào thư mục
git clone https://github.com/108-PJIOrthoGen/Extract_data_from_images.git
cd Extract_data_from_images

# Tạo và kích hoạt virtual environment
python -m venv .venv

# Linux / macOS
source .venv/bin/activate

# Windows (PowerShell)
.venv\Scripts\Activate

# Kiểm tra đã active chưa
which python  # Linux/macOS
Get-Command python  # Windows

# Cài đặt dependencies
uv sync
pip install -e .

Cấu Hình

Copy .env.example thành .env và thêm API key:

OPENROUTER_API_KEY="your_api_key_here"
OPENROUTER_MODEL="google/gemini-3.1-flash-lite-preview"

Tổ Chức Thư Mục

Extract_data_from_images/
├── src/extractor/                 # Main package
│   ├── api/                       # FastAPI application
│   │   ├── app.py                 # App factory + lifespan
│   │   └── routes.py              # API endpoints
│   ├── clients/                   # External API clients
│   │   └── vlm_client.py          # OpenRouter VLM client
│   ├── core/                      # Domain logic
│   │   ├── extractor.py           # ExtractionPipeline orchestrator
│   │   ├── prompt_builder.py      # Prompt construction
│   │   ├── response_validator.py  # Validation logic
│   │   └── template_parser.py     # Template parsing
│   ├── loaders/                   # Input adapters
│   │   └── image_loader.py        # Image loading & Base64 encoding
│   ├── models/                    # Pydantic schemas
│   │   └── template.py            # Template validation
│   ├── worker/                    # RabbitMQ consumer
│   │   └── consumer.py            # Async message consumer
│   ├── utils/
│   │   └── logger.py              # Logging configuration
│   ├── config.py                  # Settings with validation
│   ├── exceptions.py              # Exception hierarchy
│   └── main.py                    # CLI entry point
├── tests/                         # Unit tests
├── templates/
│   └── template.json              # JSON template
├── images/                        # Input images
├── outputs/                       # Output JSON files
├── Dockerfile
├── docker-compose.yml
├── Makefile
└── pyproject.toml

Cách Sử Dụng

CLI (Command Line)

# Chạy với thư mục ảnh mặc định
uv run extract-data --input images/test_case_01

# Tùy chọn thêm
uv run extract-data --input images/test_case_01 --output custom_output.json
uv run extract-data --input images/test_case_01 --max-retries 3

Sử dụng Makefile (Recommended)

make install      # Cài đặt dependencies
make dev          # Cài đặt dev dependencies
make run          # Chạy CLI
make run-api      # Chạy FastAPI server
make run-worker   # Chạy RabbitMQ worker
make test         # Chạy tests
make lint         # Kiểm tra code style

Docker

docker-compose up --build

API Endpoints

Method	Endpoint	Mô tả
POST	`/upload`	Upload 1 hoặc nhiều ảnh, gửi vào queue
GET	`/result/{job_id}`	Lấy kết quả JSON theo job_id
GET	`/health`	Health check

Upload Ảnh

# Upload 1 ảnh
curl -X POST http://localhost:8000/upload \
  -F "files=@image.jpg"

# Upload nhiều ảnh
curl -X POST http://localhost:8000/upload \
  -F "files=@page1.jpg" \
  -F "files=@page2.jpg"

Kiểm Tra Kết Quả

# Poll kết quả
curl http://localhost:8000/result/{job_id}

# Response khi đang xử lý:
# {"job_id": "...", "status": "processing"}

# Response khi hoàn thành:
# {"job_id": "...", "status": "completed", "data": {...}}

RabbitMQ Configuration

Thêm vào .env:

RABBITMQ_HOST=localhost
RABBITMQ_PORT=5672
RABBITMQ_USER=guest
RABBITMQ_PASSWORD=guest
RABBITMQ_QUEUE=image_processing

Development

# Cài đặt dev dependencies
pip install -e ".[dev]"

# Chạy tests
pytest -v

# Chạy lint
ruff check .
ruff format --check .

# Pre-commit hooks
pre-commit install

Architecture

graph TB
    subgraph "Client Layer"
        CLI[CLI: extract-data]
        WEB[Web App / API Client]
    end

    subgraph "API Layer"
        API[FastAPI: extractor.api.app]
        UPLOAD[POST /upload]
        RESULT[GET /result]
    end

    subgraph "Message Queue"
        MQ[RabbitMQ]
        QUEUE[Queue: image_processing]
    end

    subgraph "Worker Layer"
        WORKER[Worker: extractor.worker.consumer]
    end

    subgraph "Core Processing"
        LOADER[Image Loader]
        TEMPLATE[Template Parser]
        BUILDER[Prompt Builder]
        VLM[OpenRouter VLM Client]
        VALIDATOR[Response Validator]
        PIPELINE[ExtractionPipeline]
    end

    subgraph "Storage"
        UPLOAD_DIR[uploads/]
        OUTPUTS[outputs/*.json]
    end

    %% CLI Flow
    CLI --> LOADER
    CLI --> TEMPLATE
    CLI --> PIPELINE
    PIPELINE --> VLM
    VLM --> OUTPUTS

    %% API Flow
    WEB --> UPLOAD
    UPLOAD --> UPLOAD_DIR
    UPLOAD --> MQ
    MQ --> QUEUE
    QUEUE --> WORKER
    WORKER --> LOADER
    WORKER --> TEMPLATE
    WORKER --> PIPELINE
    PIPELINE --> VLM
    WORKER --> OUTPUTS
    WEB --> RESULT
    RESULT --> OUTPUTS

Luồng Xử Lý Chi Tiết

flowchart LR
    subgraph Client
        CLI[CLI]
        WEB[Web]
    end
    
    subgraph Server
        API[API]
        MQ{Queue}
    end
    
    subgraph Worker
        W[Worker]
    end
    
    subgraph VLM
        V[VLM]
    end
    
    subgraph Storage
        IMG[images/]
        TMP[templates/]
        OUT[outputs/]
    end
    
    CLI --> IMG --> W --> TMP --> V --> OUT
    WEB --> API --> MQ --> W
    OUT --> WEB

License

MIT

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Medical Result VLM Extractor

Tính Năng

Yêu Cầu

Cài Đặt

Cấu Hình

Tổ Chức Thư Mục

Cách Sử Dụng

CLI (Command Line)

Sử dụng Makefile (Recommended)

Docker

API Endpoints

Upload Ảnh

Kiểm Tra Kết Quả

RabbitMQ Configuration

Development

Architecture

Luồng Xử Lý Chi Tiết

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 58 Commits
.serena		.serena
images/test_case_01		images/test_case_01
src/extractor		src/extractor
templates		templates
tests		tests
.env.example		.env.example
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
docker-compose.yml		docker-compose.yml
pyproject.toml		pyproject.toml

Folders and files

Latest commit

History

Repository files navigation

Medical Result VLM Extractor

Tính Năng

Yêu Cầu

Cài Đặt

Cấu Hình

Tổ Chức Thư Mục

Cách Sử Dụng

CLI (Command Line)

Sử dụng Makefile (Recommended)

Docker

API Endpoints

Upload Ảnh

Kiểm Tra Kết Quả

RabbitMQ Configuration

Development

Architecture

Luồng Xử Lý Chi Tiết

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages