Skip to content

NobleCoder69/GA4GH-RegBot

 
 

Repository files navigation

GA4GH-RegBot: Compliance Assistant Status: Proposal Stage for GSoC 2026

Overview RegBot is an LLM-powered tool designed to help researchers map their consent forms against GA4GH regulatory frameworks. It uses RAG (Retrieval-Augmented Generation) to flag compliance gaps automatically.

Architecture (Planned) Core: Python

LLM Framework: LangChain / LlamaIndex

Vector Store: ChromaDB / FAISS

UI: Streamlit

Code Quality

This project uses automated tools to maintain code quality:

  • Black - Code formatting
  • Ruff - Linting and PEP 8 compliance
  • isort - Import organization
  • mypy - Type checking

Running Code Quality Checks

# Automatic (via pre-commit hooks)
pre-commit install
git commit ...

# Manual
pre-commit run --all-files


Roadmap
Phase 1: Ingest GA4GH "Framework for Responsible Sharing" policy documents.

Phase 2: Build RAG pipeline for clause extraction.

Phase 3: Develop Streamlit frontend for user uploads.

About

No description, website, or topics provided.

Resources

License

Code of conduct

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Python 100.0%