A lightweight, production-ready Agentic RAG (Retrieval-Augmented Generation) service that analyzes startup signals (hiring, funding, remote readiness) using semantic search + LLM reasoning.
Built with:
• FastAPI
• FAISS (vector search)
• Sentence Transformers
• Groq LLM (Llama 3.3 70B)
• Docker-ready architecture
- Embeds startup-related signals into a FAISS vector store
- Retrieves the most relevant context using semantic similarity
- Sends structured context to an LLM
- Returns structured JSON analysis
Example analysis output:
{
"startup_name": "ExampleAI",
"hiring_signal": true,
"remote_possible": true,
"funding_stage": "Seed",
"reasoning": "Raised seed round and actively hiring Flutter developer.",
"source_url": "https://example.com/post"
}User Query
↓
Retriever (FAISS + Embeddings)
↓
Context Assembly
↓
Groq LLM (Structured JSON Output)
↓
FastAPI Response• embedding_service.py → Generates sentence embeddings
• vector_store.py → FAISS index + persistence
• retrieval_service.py → Semantic retrieval logic
• llm_service.py → Groq structured JSON generation
• main.py → FastAPI endpoints
Python 3.10+
FastAPI
FAISS (CPU)
SentenceTransformers (all-MiniLM-L6-v2)
Groq LLM API
Docker
###🚀 Getting Started
git clone https://github.com/bold-ronin/Lightweight-Agentic-RAG-Service.git
cd agentic-lead-ragpython -m venv .venv
source .venv/bin/activate # Windows: .venv\Scripts\activatepip install -r requirements.txtCreate .env file:
LLM_API_KEY=your_api_key_here [I used Groq you an whatever you want]uvicorn app.main:app --reloadOpen:
Use /analyze endpoint.
Dockerized FastAPI agent for RAG tasks.
Build image:
docker build -t agentic-lead-rag .Run container:
docker run -p 8000:8000 --env-file .env agentic-lead-ragVisit: http://localhost:8000/docs
- Structured JSON enforcement from LLM
- Async Groq integration
- Semantic search retrieval
- Source URL tracking
- FAISS index persistence
- Dockerized for portability
"Startup hiring Flutter developer remotely after seed funding"Returns structured intelligence analysis based on stored signals.
- Live Reddit & X ingestion
- LinkedIn signal scraping
- Scheduled background refresh
- Frontend dashboard
- Multi-source ingestion pipeline
- Deployment (Render / Railway)
- Usage-based monetization
This is not a chatbot.
It is a structured intelligence engine designed to extract startup signals for: • Freelancers • Recruiters • Founders • Investors
Built by Naol — AI-focused mobile + systems engineer exploring Agentic architectures and applied intelligence systems.