Building autonomous AI systems, multi-agent platforms, and LLM-powered applications.
Backend Engineer specializing in AI-powered systems and multi-agent architectures. I build production LLM applications with RAG, semantic caching, and scalable infrastructure.
- 40-80% LLM cost reduction via semantic caching (pgvector)
- Multi-agent platforms handling 1000+ concurrent conversations
- Under 200 ms response latency with intelligent caching
- 99.7% infrastructure cost reduction through cloud automation
Assist+: Multi-Agent AI Platform
Enterprise-grade SaaS deploying autonomous chatbots across Facebook, Instagram & WhatsApp.
Bun • Hono • PostgreSQL • pgvector • Redis • BullMQ • OpenRouter
- RAG-Powered Responses: Knowledge retrieval with pgvector embeddings
- Semantic Caching: 40-80% LLM cost reduction via similarity search
- Multi-Agent Engine: Independent bots with context, lead extraction, and handoff
- Background Processing: BullMQ workers for async message handling
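
A minimal sketch of the semantic-caching idea listed above, not the Assist+ implementation. It assumes embeddings are already computed, uses an in-memory list as a stand-in for a pgvector table, and picks an arbitrary similarity threshold of 0.9. The names `SemanticCache`, `answer`, and `call_llm` are hypothetical. With pgvector, the lookup would typically be a query ordered by the `<=>` cosine-distance operator rather than a Python loop.

```python
import math
from dataclasses import dataclass, field


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


@dataclass
class SemanticCache:
    """In-memory stand-in for a vector-indexed cache table (hypothetical)."""

    threshold: float = 0.9  # assumed cutoff; tune per workload
    entries: list[tuple[list[float], str]] = field(default_factory=list)

    def lookup(self, embedding: list[float]):
        """Return the cached response closest to `embedding`, or None on a miss."""
        best_score, best_response = 0.0, None
        for cached_embedding, response in self.entries:
            score = cosine_similarity(embedding, cached_embedding)
            if score > best_score:
                best_score, best_response = score, response
        return best_response if best_score >= self.threshold else None

    def store(self, embedding: list[float], response: str) -> None:
        self.entries.append((embedding, response))


def answer(question_embedding: list[float], cache: SemanticCache, call_llm) -> str:
    """Serve from the cache when a similar question was seen; otherwise call the LLM."""
    cached = cache.lookup(question_embedding)
    if cached is not None:
        return cached  # cache hit: no LLM call, no token cost
    response = call_llm()
    cache.store(question_embedding, response)
    return response
```

The savings come from the hit path: a paraphrased question whose embedding lands above the threshold reuses the stored response instead of triggering a new LLM call. The threshold trades cost against the risk of serving an answer to a question that only looks similar.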
CV-lize: AI Document Analysis
Resume optimization with NLP pipelines and multi-model AI architecture.
Python • FastAPI • spaCy • OpenRouter • MongoDB
- Multi-Model AI: OpenRouter + Gemini fallback with auto-failover
- NLP Pipeline: Entity extraction, skill detection, keyword analysis
- 95%+ ATS Scoring: Job description matching with STAR methodology
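
The auto-failover pattern in the list above can be sketched roughly as follows. This is an illustration under assumptions, not CV-lize's code: each provider is modeled as a plain callable, and `complete_with_failover`, `AllProvidersFailed`, and the provider names are hypothetical. Real clients would wrap the OpenRouter and Gemini SDK calls and would likely catch narrower exception types.

```python
from typing import Callable, Sequence

# A provider is a (name, callable) pair; the callable takes a prompt and returns text.
Provider = tuple[str, Callable[[str], str]]


class AllProvidersFailed(Exception):
    """Raised when every provider in the chain has raised an error."""


def complete_with_failover(prompt: str, providers: Sequence[Provider]) -> tuple[str, str]:
    """Try providers in order and return (provider_name, completion) from the first success."""
    errors = []
    for name, call in providers:
        try:
            return name, call(prompt)
        except Exception as exc:  # broad on purpose for the sketch
            errors.append(f"{name}: {exc}")
    raise AllProvidersFailed("; ".join(errors))
```

Returning the provider name alongside the completion makes it easy to log how often the fallback is actually used.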
OmniAssistant: RAG Conversational Agent
Knowledge retrieval chatbot with lead qualification automation.
Next.js • Google Genkit • Gemini • TypeScript
MERIDIEN: Multi-Tenant SaaS Backend
Scalable retail management system with clean architecture.
Go • Gin • PostgreSQL • GORM • Flutter
Specializations: RAG Systems • Semantic Caching • Multi-Agent Architectures • LLM Integration • Vector Databases • Background Processing
- Go + Python AI Stack: Go for high-performance backend services, Python for agentic AI logic
- Google Agent Development Kit (ADK): Building multi-agent systems with Google's agent framework
- Multi-Agent Collaboration: Agents delegating to specialized sub-agents
- Tool-Using Agents: LLMs executing code and calling external APIs
- Long-Term Memory: Persistent memory across conversation sessions
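
As a rough illustration of the delegation pattern mentioned above, here is a framework-free sketch. It does not use the ADK API. The `Coordinator` class, the two sub-agents, and the keyword-based routing are all hypothetical stand-ins. In a real multi-agent system the routing decision would usually be made by an LLM, not by substring matching.

```python
from typing import Callable

# A sub-agent is modeled as a function from a user message to a reply.
SubAgent = Callable[[str], str]


def billing_agent(message: str) -> str:
    """Hypothetical specialist for billing questions."""
    return "billing: " + message


def support_agent(message: str) -> str:
    """Hypothetical general-purpose fallback agent."""
    return "support: " + message


class Coordinator:
    """Routes each message to the first sub-agent whose keyword appears in it."""

    def __init__(self, routes: dict[str, SubAgent], default: SubAgent) -> None:
        self.routes = routes
        self.default = default

    def handle(self, message: str) -> str:
        lowered = message.lower()
        for keyword, agent in self.routes.items():
            if keyword in lowered:
                return agent(message)  # delegate to the specialist
        return self.default(message)  # no specialist matched


coordinator = Coordinator({"invoice": billing_agent}, default=support_agent)
print(coordinator.handle("Where is my invoice?"))
```

Keeping sub-agents behind a single callable signature means a specialist can be swapped for an LLM-backed or tool-using agent without changing the coordinator.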
