The Multimodal RAG System answers questions by retrieving information from specific documents. Unlike traditional text-only retrieval systems, it can process text, images, and tables, so its responses can draw on all three.

Before running the app, complete the following setup steps:
- Download Poppler binaries from:
  https://github.com/oschwartz10612/poppler-windows/releases
- Install Tesseract:
  https://github.com/UB-Mannheim/tesseract/wiki
- Use venv:

      python -m venv .venv
      .venv\Scripts\activate

- Install requirements:

      pip install -r requirements.txt

- Create a `.env` file in the root folder:

      GOOGLE_API_KEY = <GOOGLE_API_KEY>

- Run the app:

      streamlit run main.py
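
If the app fails at startup, a quick pre-flight check can help narrow down which setup step was missed. The following is a minimal sketch, not part of the project: `read_env_file` and `check_setup` are hypothetical helper names, and the script assumes that the Poppler and Tesseract executables (`pdftoppm` and `tesseract`) are expected to be on `PATH`, which this README does not state. It uses only the standard library and looks for `GOOGLE_API_KEY` in the environment or in `.env`.

```python
import os
import shutil
from pathlib import Path


def read_env_file(path: Path) -> dict[str, str]:
    """Parse simple KEY = VALUE lines, skipping blank lines and # comments."""
    values: dict[str, str] = {}
    if not path.is_file():
        return values
    for raw in path.read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Tolerate spaces around "=" and optional surrounding quotes.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def check_setup(env_path: Path = Path(".env")) -> list[str]:
    """Return a list of problems found; an empty list means every check passed."""
    problems: list[str] = []
    # Assumption: both tools are expected on PATH.
    # pdftoppm ships with Poppler; tesseract is the Tesseract CLI.
    for tool in ("pdftoppm", "tesseract"):
        if shutil.which(tool) is None:
            problems.append(f"{tool} not found on PATH")
    key = os.environ.get("GOOGLE_API_KEY") or read_env_file(env_path).get("GOOGLE_API_KEY")
    if not key:
        problems.append("GOOGLE_API_KEY is not set")
    return problems


if __name__ == "__main__":
    for problem in check_setup():
        print("Missing:", problem)
```

Run it from the root folder with the virtual environment active. It prints one line per failed check and nothing when all checks pass. It does not verify that the API key is valid, only that a value is present.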