AskMyDoc: NotebookLM RAG

A production-ready RAG application that lets you upload and chat with your documents.

Turn any static PDF into an intelligent conversation in seconds. No setup required.

🌟 Overview

AskMyDoc is an end-to-end Retrieval-Augmented Generation (RAG) web application inspired by Google NotebookLM. It allows users to upload PDF documents, parses and splits them into intelligent semantic chunks, and allows for natural language conversations grounded strictly in the document's content.

✨ Key Features

🚀 Lightning Fast LLM: Powered by Groq (llama-3.1-8b-instant) for instantaneous, deterministic, and grounded responses.
🧠 Free Local Embeddings: Utilizes HuggingFace (Xenova/all-MiniLM-L6-v2) to generate local embeddings at zero cost.
🗄️ Advanced Vector Storage: Integrates with Pinecone for high-performance semantic retrieval and namespace isolation.
🏗️ Parent Document Retrieval: Implements a dual-chunking strategy (3000-char parent context / 400-char child vectors) to maximize retrieval precision while preserving deep context for the LLM.
💬 Conversational Memory: Features active chat history routing. The AI intelligently reformulates follow-up questions into standalone queries before searching the vector database, allowing for natural, continuous conversations.
🎯 Answer Confidence Scoring: Calculates similarity scores from Pinecone and displays dynamic grounding confidence badges (HIGH CONFIDENCE, LOW CONFIDENCE, or NOT FOUND IN DOCUMENT) in the UI.
📁 Preloaded Demo Mode: Instantly load and test-query the preloaded "RBI Integrated Ombudsman Scheme 2021" policy document without uploading.
🛡️ Safety Guardrails & Limits: Enforces a strict 3-document upload limit to control resource consumption and namespace clutter.
🎨 Premium User Interface: Features a highly aesthetic, responsive, and minimalist frontend inspired perfectly by Google NotebookLM (built with pure HTML/CSS/VanillaJS).
🔐 Strict Grounding: System prompts explicitly force the AI to answer only from the provided context, eliminating hallucinations.

🛠️ Technology Stack

Backend: Node.js, Express.js
Frontend: Vanilla HTML, CSS (Tailwind), JavaScript
RAG Pipeline: LangChain.js (@langchain/community, @langchain/groq, @langchain/pinecone)
Document Parsing: pdf-parse & pdfjs-dist with OCR API fallback for scanned documents

🚀 Getting Started

1. Prerequisites

Node.js (v18+)
A Groq API Key (Free)
A Pinecone API Key & Index Name (Free)

2. Installation

Clone the repository and install the dependencies (use --legacy-peer-deps due to LangChain peer requirements):

git clone https://github.com/swarnika-cmd/AskMyDoc.git
cd AskMyDoc
npm install --legacy-peer-deps

3. Environment Setup

Rename .env.example to .env and fill in your keys:

GROQ_API_KEY=your_groq_api_key_here
PINECONE_API_KEY=your_pinecone_api_key_here
PINECONE_INDEX=askmydoc
PORT=3000

4. Running the App

Start the Express server:

node server.js

Navigate to http://localhost:3000 in your browser. Upload one or multiple PDFs, wait for the ingestion process, and start asking questions! The AI will automatically search across all documents added to the sidebar.

📦 Deployment

This project is perfectly structured for immediate deployment on PaaS providers like Render.com or Railway. Simply connect the GitHub repository, set your Build Command to npm install --legacy-peer-deps, set your Start Command to node server.js, and add your .env variables in the dashboard!

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
demo		demo
public		public
.env.example		.env.example
.gitattributes		.gitattributes
.gitignore		.gitignore
.prettierrc		.prettierrc
CONTRIBUTING.md		CONTRIBUTING.md
README.md		README.md
TECHNICAL_DOCS.md		TECHNICAL_DOCS.md
ingest.js		ingest.js
ingest_demo.js		ingest_demo.js
package-lock.json		package-lock.json
package.json		package.json
retrieve.js		retrieve.js
server.js		server.js
vercel.json		vercel.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AskMyDoc: NotebookLM RAG

🌟 Overview

✨ Key Features

🛠️ Technology Stack

🚀 Getting Started

1. Prerequisites

2. Installation

3. Environment Setup

4. Running the App

📦 Deployment

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AskMyDoc: NotebookLM RAG

🌟 Overview

✨ Key Features

🛠️ Technology Stack

🚀 Getting Started

1. Prerequisites

2. Installation

3. Environment Setup

4. Running the App

📦 Deployment

About

Topics

Resources

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages