Skip to content

amjadAwad95/multimodal-rag-system

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

33 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Multimodal Rag System

The Multimodal RAG System enables users to get precise answers to their questions by retrieving information from specific documents. Unlike traditional text-only retrieval systems, it can seamlessly process text, images, and tables, delivering rich, context-aware responses.

How to run the system

You must download these applications first

  1. Download Poppler binaries from:
https://github.com/oschwartz10612/poppler-windows/releases
  1. Install Tesseract:
https://github.com/UB-Mannheim/tesseract/wiki
  1. Use venv:
python -m venv .venv
.venv\Scripts\activate
  1. Install requirements:
pip install -r requirements.txt
  1. Create a .env file in the root folder:
GOOGLE_API_KEY = <GOOGLE_API_KEY>
  1. Run the app:
 streamlit run main.py

About

The Multimodal RAG System enables users to get precise answers to their questions by retrieving information from specific documents.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages