🔍 DevLens — Chat with Your Codebase

AI-powered developer assistant that lets you ask natural language questions about any GitHub repository and get answers with exact file paths and line numbers.

📌 What is DevLens?

DevLens is a backend API that enables developers to chat with any GitHub codebase using natural language. Instead of manually searching through hundreds of files, you paste a repository URL, wait for indexing, and start asking questions.

Example:

❓ "Where is the database connection handled in this project?"

✅ "The database connection is handled in app/database.py at lines 15–28. It uses SQLAlchemy's create_engine() with a connection pool..."

The system retrieves the exact file, line numbers, and a code snippet — grounded in the real source code, not hallucinated.

✨ Features

🔗 Load any public GitHub repo — just paste the URL
🧠 RAG pipeline — answers are grounded in actual code, not guessed
⚡ SSE streaming — Gemini responses stream token-by-token
📁 Source attribution — every answer includes file path + line numbers
🗂️ Namespace isolation — multiple repos indexed independently in Pinecone
🩺 Health check endpoint — live probes for Pinecone and Gemini
🐳 Dockerized — runs with a single docker compose up command

🏗️ Architecture

User Question
     │
     ▼
POST /api/v1/ask-question
     │
     ▼
┌─────────────────────────────────────────────────┐
│                  RAG Pipeline                   │
│                                                 │
│  Question → Embed (text-embedding-004)          │
│           → Search Pinecone (cosine similarity) │
│           → Top-K code chunks retrieved         │
│           → Build grounded prompt               │
│           → Gemini 1.5 Pro generates answer     │
│           → SSE stream → client                 │
└─────────────────────────────────────────────────┘

POST /api/v1/load-repo
     │
     ▼
┌─────────────────────────────────────────────────┐
│               Indexing Pipeline                 │
│                                                 │
│  GitHub URL → GitPython shallow clone           │
│             → LangChain TextLoader              │
│             → RecursiveCharacterTextSplitter    │
│             → Google text-embedding-004         │
│             → Pinecone upsert (namespaced)      │
└─────────────────────────────────────────────────┘

🛠️ Tech Stack

Layer	Technology	Purpose
Web Framework	FastAPI 0.111	Async API with auto OpenAPI docs
LLM	Google Gemini 1.5 Pro	Answer generation
Embeddings	text-embedding-004	768-dim semantic code vectors
Vector DB	Pinecone (Serverless)	Fast cosine similarity search
Orchestration	LangChain 0.2	Chunking, loading, retrieval
Repo Loading	GitPython 3.1	Shallow git clone at runtime
Streaming	SSE (sse-starlette)	Token-by-token streaming
Logging	structlog 24.2	Structured JSON logs
Containerisation	Docker + Compose	One-command deployment

📁 Project Structure

devlens/
│
├── backend/
│   ├── .env                      # Environment variables (never commit)
│   ├── Dockerfile                # Multi-stage production image
│   ├── requirements.txt          # Pinned Python dependencies
│   └── app/
│       ├── main.py               # FastAPI app factory + startup
│       ├── schema.py             # All Pydantic request/response models
│       ├── helpers.py            # Pure utility functions
│       ├── api/
│       │   ├── health.py         # GET  /health
│       │   └── routes.py         # POST /load-repo, /ask-question
│       ├── core/
│       │   ├── config.py         # Pydantic-settings env loader
│       │   └── rag_pipeline.py   # Central RAG orchestrator
│       └── services/
│           ├── repo_loader.py    # Clone + LangChain document loading
│           ├── embeddings.py     # Pinecone setup + Google embeddings
│           └── retriever.py      # Semantic search over namespace
│
├── docker-compose.yml
├── Makefile
└── README.md

🚀 Getting Started

Prerequisites

Python 3.11 (3.14 is not yet supported by all dependencies)
Git installed on your system
Google AI Studio API key (free tier works)
Pinecone account + API key (free starter plan works)

1. Clone the repository

git clone https://github.com/your-username/devlens.git
cd devlens/backend

2. Create a virtual environment with Python 3.11

# Windows
py -3.11 -m venv .venv
.venv\Scripts\activate

# macOS / Linux
python3.11 -m venv .venv
source .venv/bin/activate

3. Install dependencies

pip install -r requirements.txt

4. Configure environment variables

cp .env .env.local    # or just edit .env directly

Open .env and fill in your keys:

GOOGLE_API_KEY=your_google_api_key_here
PINECONE_API_KEY=your_pinecone_api_key_here
PINECONE_INDEX_NAME=devlens-codebase
PINECONE_CLOUD=aws
PINECONE_REGION=us-east-1

5. Run the server

uvicorn app.main:app --reload --host 0.0.0.0 --port 8000

Open http://localhost:8000/docs to access the interactive Swagger UI.

🐳 Docker Setup (Recommended)

# Build and start
docker compose up --build

# Backend → http://localhost:8000
# Swagger → http://localhost:8000/docs

# Tail logs
docker compose logs -f backend

# Stop
docker compose down

📡 API Reference

`GET /health`

Check live connectivity status of Pinecone and Gemini.

{
  "status": "ok",
  "environment": "development",
  "version": "1.0.0",
  "services": {
    "pinecone": "ok",
    "gemini": "ok (47 models visible)"
  }
}

`POST /api/v1/load-repo`

Clone, chunk, embed, and index a GitHub repository into Pinecone.

Request:

{
  "repo_url": "https://github.com/tiangolo/fastapi",
  "branch": "master"
}

Response:

{
  "status": "ready",
  "namespace": "tiangolo_fastapi-a3f2c8d91b4e",
  "repo_name": "tiangolo_fastapi",
  "total_chunks": 842,
  "message": "Repository 'tiangolo_fastapi' indexed successfully. 842 code chunks are ready for querying."
}

⚠️ Save the namespace value — you need it for /ask-question.

`POST /api/v1/ask-question` (SSE Streaming)

Ask a natural language question about the indexed codebase.

Request:

{
  "question": "Where is the database connection handled?",
  "namespace": "tiangolo_fastapi-a3f2c8d91b4e"
}

SSE Event Stream:

event: token
data: The database connection is handled in

event: token
data:  app/db/session.py at lines 12–28...

event: sources
data: [{"file_path":"app/db/session.py","start_line":12,"end_line":28,"snippet":"..."}]

event: done
data: [DONE]

`GET /api/v1/ask-question` (JSON fallback)

Same as above but returns a single JSON response. Useful for testing.

GET /api/v1/ask-question?question=Where is auth handled?&namespace=your_namespace

⚙️ Configuration Reference

Variable	Default	Description
`GOOGLE_API_KEY`	—	Google AI Studio key (required)
`PINECONE_API_KEY`	—	Pinecone API key (required)
`PINECONE_INDEX_NAME`	`devlens-codebase`	Auto-created on first use
`PINECONE_CLOUD`	`aws`	Cloud provider for serverless index
`PINECONE_REGION`	`us-east-1`	Must match your Pinecone project
`APP_ENV`	`development`	`development` or `production`
`CHUNK_SIZE`	`1000`	Max characters per code chunk
`CHUNK_OVERLAP`	`150`	Overlap between consecutive chunks
`TOP_K_RESULTS`	`6`	Chunks retrieved per question
`EMBEDDING_DIMENSION`	`768`	text-embedding-004 output size
`REPO_CLONE_DIR`	`/tmp/devlens_repos`	Temp dir for clones

🧪 Quick Test Flow

Once the server is running, test the full pipeline in order:

# 1. Health check
curl http://localhost:8000/health

# 2. Index a repo (takes 30–90s depending on repo size)
curl -X POST http://localhost:8000/api/v1/load-repo \
  -H "Content-Type: application/json" \
  -d '{"repo_url": "https://github.com/tiangolo/fastapi", "branch": "master"}'

# 3. Ask a question (copy namespace from step 2 response)
curl "http://localhost:8000/api/v1/ask-question?question=Where+is+routing+handled&namespace=YOUR_NAMESPACE"

🗺️ Roadmap

Author

Ali Sajid

AI Engineer | Deep Learning | Computer Vision | GEN AI

🤝 Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

Fork the repository
Create your feature branch: git checkout -b feature/my-feature
Commit your changes: git commit -m 'Add some feature'
Push to the branch: git push origin feature/my-feature
Open a Pull Request

📄 License

This project is licensed under the MIT License — see the LICENSE file for details.

Built with ❤️ using FastAPI · LangChain · Google Gemini · Pinecone

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🔍 DevLens — Chat with Your Codebase

📌 What is DevLens?

✨ Features

🏗️ Architecture

🛠️ Tech Stack

📁 Project Structure

🚀 Getting Started

Prerequisites

1. Clone the repository

2. Create a virtual environment with Python 3.11

3. Install dependencies

4. Configure environment variables

5. Run the server

🐳 Docker Setup (Recommended)

📡 API Reference

`GET /health`

`POST /api/v1/load-repo`

`POST /api/v1/ask-question` (SSE Streaming)

`GET /api/v1/ask-question` (JSON fallback)

⚙️ Configuration Reference

🧪 Quick Test Flow

🗺️ Roadmap

Author

🤝 Contributing

📄 License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
backend		backend
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
docker-compose.yml		docker-compose.yml

Folders and files

Latest commit

History

Repository files navigation

🔍 DevLens — Chat with Your Codebase

📌 What is DevLens?

✨ Features

🏗️ Architecture

🛠️ Tech Stack

📁 Project Structure

🚀 Getting Started

Prerequisites

1. Clone the repository

2. Create a virtual environment with Python 3.11

3. Install dependencies

4. Configure environment variables

5. Run the server

🐳 Docker Setup (Recommended)

📡 API Reference

GET /health

POST /api/v1/load-repo

POST /api/v1/ask-question (SSE Streaming)

GET /api/v1/ask-question (JSON fallback)

⚙️ Configuration Reference

🧪 Quick Test Flow

🗺️ Roadmap

Author

🤝 Contributing

📄 License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`GET /health`

`POST /api/v1/load-repo`

`POST /api/v1/ask-question` (SSE Streaming)

`GET /api/v1/ask-question` (JSON fallback)

Packages