FINDER

A tool to help you search through research papers and other documents quickly and effectively. This tool uses Adaptive RAG to generate answers based on information from your uploaded documents. If no relevant information is found in your documents, FINDER scours the internet to find relevant information before replying.

Link to Video Demostration

Adaptive RAG is a strategy for Retrieval-Augmented Generation (RAG) that unites (1) query analysis with (2) active/self-corrective RAG.

Features

Efficient parsing and hierarchical context retention for complex documents.
Semantic chunking with fallback mechanisms for optimal input size.
Embedding and retrieval using state-of-the-art methods for accuracy.
Internet augmentation for queries when local information is insufficient.
Intuitive web app with two-way communication using WebSockets.

System Architecture

1. Indexing

Parsing:
Marker, an open-source parser, is used to parse PDFs into markdown and chunk documents by headers.
Context Retention:
Parsed markdown is converted into an Abstract Syntax Tree (AST). A Breadth-First Search (BFS) is applied to prepend header paths to the metadata of each chunk, retaining hierarchical context.
Chunking:
- Context windows that are too large reduce accuracy, causing the model to ignore explicit prompts.
- Semantically chunk documents into smaller parts by identifying gradients in semantic changes. This is particularly effective for research papers with high inter-chunk correlation.
- Remaining large chunks are split using LangChain's RecursiveCharacterTextSplitter, which respects sentence, paragraph, and document structure while adhering to token limits.
  Learn more about semantic chunking
  LangChain RecursiveCharacterTextSplitter documentation
Embedding:
A lightweight model was selected from the MTEB leaderboard to embed chunks and queries efficiently.
Database:
Milvus, an open-source vector database, is used for storage. It provides high efficiency across various environments.
Milvus documentation

2. Retrieval

Used merged-rank retrieval to search and retrieve information from relevant documents.
Retrieved and ranked documents by cosine similarity.

3. Generation

Follows the adaptive rag workflow to generate a reply. The workflow is created using LangGraph.
Utilized the Llama 3 model from Ollama for grading and generating responses.

4. Web Application

Built with Flask and Jinja for a user-friendly interface.
Implemented two-way communication using WebSockets.

How to Run

Install all dependencies as specified in requirements.txt.
Setup Milvus Container, follow docker container installation instruction here
Start the Milvus instance
Run the flask application:
```
python app.py
```

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
bin		bin
data		data
parsed_data		parsed_data
rag_model		rag_model
static		static
templates		templates
.gitignore		.gitignore
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FINDER

Features

System Architecture

1. Indexing

2. Retrieval

3. Generation

4. Web Application

How to Run

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

FINDER

Features

System Architecture

1. Indexing

2. Retrieval

3. Generation

4. Web Application

How to Run

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages