StructRAG — Vectorless RAG Engine

A lightweight, high-performance Retrieval-Augmented Generation system that uses hierarchical JSON tree traversal instead of vector databases. Built to stay within Groq's free-tier limits.

How It Works

┌───────────────┐     ┌──────────────────┐     ┌───────────────────┐
│  Markdown Doc │────▶│    indexer.py     │────▶│ knowledge_tree.json│
│  (Your Data)  │     │ Parse + Summarize │     │  Structured Tree   │
└───────────────┘     └──────────────────┘     └────────┬──────────┘
                                                        │
                      ┌─────────────────────────────────┘
                      ▼
               ┌─────────────┐    Step 1: Router     ┌──────────────┐
               │ User Query  │───────────────────────▶│ Select Node  │
               │             │   (ToC only, low       │ (node_id)    │
               │             │    tokens)             └──────┬───────┘
               │             │                               │
               │             │    Step 2: Generator   ┌──────▼───────┐
               │             │───────────────────────▶│ Full Answer  │
               └─────────────┘   (full section text)  └──────────────┘

Zero vectors. Zero embeddings. Just structured JSON + smart prompting.

Quick Start

# 1. Install dependencies
pip install -r requirements.txt

# 2. Set up your API key
cp .env.example .env
# Edit .env and add your Groq API key from https://console.groq.com

# 3. Index a document
python indexer.py docs/sample_policy.md

# 4. Ask questions
python app.py

API Mode

# Start the FastAPI server
python app.py --api --port 8000

# Query via curl
curl -X POST http://localhost:8000/query \
  -H "Content-Type: application/json" \
  -d '{"question": "What is the PTO policy?"}'

Project Structure

├── indexer.py              # Markdown parser + Groq summarizer
├── rag_engine.py           # 2-call retrieval pipeline
├── app.py                  # CLI + FastAPI interface
├── docs/
│   └── sample_policy.md    # Sample document for testing
├── requirements.txt
├── .env.example
└── knowledge_tree.json     # Generated after indexing

Models Used

Step	Model	Purpose
Indexing	`llama-3.1-8b-instant`	Generate section summaries
Routing	`llama-3.1-8b-instant`	Select relevant section (low tokens)
Generation	`llama-3.3-70b-versatile`	Produce detailed answers

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

StructRAG — Vectorless RAG Engine

How It Works

Quick Start

API Mode

Project Structure

Models Used

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
docs		docs
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
app.py		app.py
indexer.py		indexer.py
rag_engine.py		rag_engine.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

StructRAG — Vectorless RAG Engine

How It Works

Quick Start

API Mode

Project Structure

Models Used

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages