Skip to content

AustralianCancerDataNetwork/omop-emb

Repository files navigation

omop-emb

Embedding layer for OMOP CDM.

Installation

omop-emb now exposes backend-specific optional dependencies so installation can match the embedding backend you actually intend to use.

pip install "omop-emb[postgres]"
pip install "omop-emb[faiss]"
pip install "omop-emb[all]"

Notes:

  • postgres installs the PostgreSQL/pgvector dependencies.
  • faiss installs the FAISS-based backend dependencies. This currently only includes CPU support
  • all installs both backend stacks for development or mixed environments.
  • A plain pip install omop-emb installs the shared core package only.
  • PostgreSQL-specific embedding dependencies are now optional, but omop-emb still requires some database backend for OMOP access and model registration.
  • Non-PostgreSQL database backends have not yet been tested.

Extended documentation can be found here.

Project Roadmap

  • Interface for postgres storage of vectors
  • Interface for FAISS storage of embeddings
  • Extensive unit testing
    • Backend testing
    • Corruption and restoration of DB testing
  • Support non-Flat indices for each backend
  • faiss GPU support
  • pgvectorscale support
  • Vector-quantisation for more efficient storage

About

Embedding layer for OMOP CDM

Resources

Stars

Watchers

Forks

Packages

 
 
 

Contributors

Languages