GenRAG

GenRAG is a document intelligence system that implements Retrieval-Augmented Generation (RAG) from scratch, without high-level frameworks such as LangChain and without a vector database. It features a modern GUI, semantic search over 1,400+ text chunks, and AI-generated responses. The system processes PDFs, generates embeddings, and answers questions about document content, with sub-millisecond search speeds.

Features

🎨 Modern GUI: GitHub-inspired dark theme interface
📄 PDF Processing: Automatic text extraction and chunking
🧠 Smart Embeddings: Sentence-BERT powered semantic search
⚡ Fast Search: Sub-millisecond query processing
🤖 AI Responses: Google Gemini API integration
💾 Lightweight: CSV-based storage (no vector database needed)
🔧 Python 3.8+ Compatible: Runs on Python 3.8 and later
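
To make the "CSV-based storage" and semantic search features concrete, here is a minimal sketch of cosine-similarity search over embeddings kept in a CSV file. The CSV layout (`text,e0,e1,e2`), the function names, and the three-dimensional toy vectors are assumptions for illustration only. They are not taken from the GenRAG code, and real Sentence-BERT embeddings have hundreds of dimensions.

```python
import csv
import io
import math

# Hypothetical CSV layout: one row per chunk, the text followed by its embedding.
# The vectors below are toy values, not real Sentence-BERT output.
CSV_DATA = """text,e0,e1,e2
Margin of safety protects the investor.,0.9,0.1,0.0
Mr. Market offers a new price every day.,0.1,0.9,0.1
Bonds provide a steady income.,0.0,0.2,0.9
"""


def load_chunks(handle):
    """Read (text, vector) pairs from a CSV file-like object."""
    reader = csv.reader(handle)
    next(reader)  # skip the header row
    return [(row[0], [float(x) for x in row[1:]]) for row in reader]


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def search(query_vec, chunks, top_k=2):
    """Return the top_k (score, text) pairs, best match first."""
    scored = [(cosine(query_vec, vec), text) for text, vec in chunks]
    return sorted(scored, reverse=True)[:top_k]


chunks = load_chunks(io.StringIO(CSV_DATA))
# In a real pipeline the query vector would come from the same embedding model.
results = search([0.8, 0.2, 0.1], chunks)
for score, text in results:
    print(f"{score:.3f}  {text}")
```

A brute-force scan like this is usually fast enough for a few thousand chunks, which is why a dedicated vector database is not required at this scale.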

Installation

Clone the repository:

git clone https://github.com/AkprasadoP/GenRAG.git
cd GenRAG

Create and activate a virtual environment:

python -m venv env
call env\Scripts\activate  # On Windows
# source env/bin/activate  # On Linux/Mac

Install compatible dependencies:
```
pip install -r requirements_minimal.txt
```

Setup

Generate Embeddings (The PDF is already included):
```
python create_embeddings_auto.py
```
This will process "The Intelligent Investor" PDF and create embeddings.
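
As a rough idea of what the chunking step does before embeddings are created, here is a minimal sketch of overlapping word-window chunking. The function name `chunk_text` and the `chunk_size` and `overlap` parameters are hypothetical. The strategy actually used by create_embeddings_auto.py may differ.

```python
def chunk_text(text, chunk_size=100, overlap=20):
    """Split text into windows of `chunk_size` words that share `overlap` words."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    words = text.split()
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        # Stop once the current window has reached the end of the text.
        if start + chunk_size >= len(words):
            break
    return chunks


# Tiny demonstration with ten placeholder words.
sample = " ".join(f"w{i}" for i in range(10))
chunks = chunk_text(sample, chunk_size=4, overlap=1)
print(chunks)
```

The overlap keeps a sentence that straddles a chunk boundary retrievable from either side, at the cost of storing a few more chunks.
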
Configure the Gemini API (optional, for enhanced responses):
- Edit the .env file and add your API key:
```
GEMINI_API_KEY=your_actual_api_key_here
```

Usage

GUI Application (Recommended)

python app.py

Or, on Windows, simply double-click run.bat

Terminal Version

python main.py

Test Multiple Queries

python test_rag.py

LLM Response

You can generate responses with either a local LLM or an LLM served through an API such as Gemini.

Local LLM: If your machine can run a local LLM, you can use it to generate responses. Mine is too slow for that :(
LLM from API: If your system is not powerful enough for local inference, you can use an API such as Gemini. To do this, add your Gemini API key to the .env file.

Using Gemini API

Get your API key from Google AI Studio
Edit the existing .env file:
```
GEMINI_API_KEY=your_actual_api_key_here
```
Restart the application to use enhanced AI responses

Note: The system still works without an API key. In that case it returns fallback responses instead of Gemini-generated ones.
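
The key lookup and fallback behaviour described above could look roughly like the sketch below. Everything here is an assumption for illustration: the helper names (`read_env_file`, `get_gemini_key`, `answer`), the hand-written .env parser, and the `generate` callable that stands in for whichever LLM backend is used (local or Gemini). The actual Gemini request is deliberately left out.

```python
import os

PLACEHOLDER = "your_actual_api_key_here"


def read_env_file(path=".env"):
    """Parse KEY=VALUE lines from a .env file; return {} if the file is missing."""
    values = {}
    if not os.path.exists(path):
        return values
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if not line or line.startswith("#") or "=" not in line:
                continue
            key, _, value = line.partition("=")
            values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def get_gemini_key(env=None, path=".env"):
    """Return the API key, or None when it is unset or still the placeholder."""
    env = os.environ if env is None else env
    key = env.get("GEMINI_API_KEY") or read_env_file(path).get("GEMINI_API_KEY")
    return key if key and key != PLACEHOLDER else None


def answer(question, retrieved_chunks, generate=None):
    """Use `generate` (a hypothetical LLM wrapper) if given, else fall back."""
    context = "\n".join(retrieved_chunks)
    if generate is None:
        # Fallback: no LLM available, so return the retrieved passages verbatim.
        return "Most relevant passages:\n" + context
    prompt = f"Context:\n{context}\n\nQuestion: {question}"
    return generate(prompt)


# Without a key (or a local model), only the fallback path is used.
print(answer("What is margin of safety?", ["Chunk A", "Chunk B"]))
```

Passing the backend in as a callable keeps the retrieval code independent of whether responses come from a local model or from an API.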

Author

Ashish Prasad - @AkprasadoP

This project is developed and maintained by Ashish Prasad.

Credits

Special thanks to the following YouTube channels and research papers for their invaluable resources and insights:

YouTube Channels

Research Papers

Lewis et al., "Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks" arXiv:2005.11401
Vaswani et al., "Attention Is All You Need" arXiv:1706.03762
Reimers and Gurevych, "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks" arXiv:1908.10084

License

This project is licensed under the MIT License - see the LICENSE file for details.
