Semantic Document Search using RAG

This project implements a semantic document search system using text embeddings and a FAISS vector database.

Architecture

Document → Text Extraction → Chunking → Embeddings → FAISS Vector Database → Semantic Search

Tech Stack

Python Sentence Transformers FAISS NumPy

Features

PDF document ingestion
Text chunking
Embedding generation
Vector similarity search
Semantic search over documents

Run the Project

Install dependencies

pip install -r requirements.txt

Run the application

python app.py

Semantic Document Search using RAG

• Built a Retrieval-Augmented Generation (RAG) system using Sentence Transformers embeddings and FAISS vector search.
• Integrated Llama3 via Ollama to generate context-aware answers using retrieved document content.
• Designed a semantic search pipeline enabling efficient retrieval of relevant document chunks for AI-assisted knowledge discovery.

Tech Stack: Python, Sentence Transformers, FAISS, Llama3 (Ollama)

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
data		data
src		src
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Semantic Document Search using RAG

Architecture

Tech Stack

Features

Run the Project

Semantic Document Search using RAG

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Semantic Document Search using RAG

Architecture

Tech Stack

Features

Run the Project

Semantic Document Search using RAG

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages