Logara AI is a modular observability platform designed to transform raw, noisy log streams into actionable intelligence. By combining high-performance ingestion with vector-based semantic search and local LLM processing, it provides developers with instant insights into system behavior without the overhead of manual pattern matching.
- Semantic Log Search: Transition from keyword-based Grep to natural language queries using Qdrant vector embeddings.
- Root Cause Synthesis: Automated analysis of error clusters to identify underlying infrastructure or application issues.
- Local-First Processing: Designed to run with Ollama for sensitive log data that shouldn't leave your infrastructure.
- Anomaly Correlation: Detects statistical outliers in log volume and type to preempt site reliability issues.
- Security-Aware Log Sanitization: Automatically redacts sensitive data such as API keys, JWTs, emails, bearer tokens, and credit card patterns before logs enter downstream processing pipelines.
Logara is built as a series of decoupled microservices to ensure scalability during log spikes:
graph TD
subgraph "Ingestion Layer"
A[Log Sources] --> B[FastAPI Ingestor]
B --> C[Redis Queue]
end
subgraph "Processing Layer"
C --> D[Log Processor]
D --> E[(Qdrant Vector DB)]
end
subgraph "AI Engine"
E --> I[AI Engine Service]
J[Ollama Local LLM] <--> I
end
subgraph "Interface"
I --> G[GraphQL/REST API]
G --> H[React Dashboard]
end
Logara AI is currently in active development (Alpha). We are focusing on stabilization of the ingestion pipeline and refining the embedding strategy for nested JSON logs.
Logara AI includes a configurable backend redaction pipeline designed to sanitize sensitive information before logs enter queue processing, vectorization, or downstream AI workflows.
- JWT tokens
- API keys
- AWS access keys
- Bearer tokens
- Email addresses
- Credit card patterns (Luhn validated)
- Optional IPv4 masking
The backend ingestion pipeline also supports:
- lightweight redaction metrics tracking
- structured redaction summaries
- nested payload sanitization
- recursive dictionary/list redaction handling
This helps improve operational visibility while reducing the risk of sensitive data exposure during observability workflows.
- Q2: Implementation of OpenTelemetry (OTel) collector integration.
- Q2: Support for persistent vector storage partitioning by 'service_id'.
- Q3: AI Engine microservice foundation for Semantic Search and RAG pipelines.
- Q3: Beta release of the "Explain Error" hover-state in the dashboard.
- Q4: Multi-tenant RBAC for enterprise-grade deployments.
Logara AI provides two main ingestion endpoints:
-
Standard Ingest (
POST /ingest):- For single, raw log strings.
- Body format:
{"log_data": "[2026-05-16 10:30:00] INFO: service started"}
-
OpenTelemetry Log Ingest (
POST /v1/logs):- For standard OpenTelemetry (OTLP) log collector HTTP exports.
- Accepts standard JSON batches of resource logs, scope logs, and log records.
- Automatically merges resource attributes, extracts timestamps/severity levels, and preserves metadata.
- Python 3.10+
- Node.js 20+
- Docker & Docker Compose (for Qdrant & Redis)
-
Clone & Setup:
git clone https://github.com/Dharanish-AM/Logara-AI.git cd Logara-AI
Before running, set your Redis password in .env:
cp .env.example .env
# Edit .env and set REDIS_PASSWORD-
Start Infrastructure:
docker-compose up -d
-
Backend:
cd backend python -m venv venv source venv/bin/activate pip install -r requirements.txt # In terminal 1: Start the ingestor API fastapi dev main.py # In terminal 2: Start the background log processor python worker.py
-
AI Engine Service:
cd ai-engine python -m venv venv source venv/bin/activate pip install -r requirements.txt # Start the AI Engine on port 8001 uvicorn main:app --port 8001
-
Frontend:
cd frontend npm install npm run dev
The repository now includes GitHub Actions validation for pull requests and deploy-readiness checks for the main branch.
CIruns on pull requests and manual dispatch.Pre-Deploy Validationruns on pushes tomainand manual dispatch.- Shared logic lives in
.github/workflows/repo-validation.ymlso CI and pre-deploy stay aligned.
Current validation covers:
- backend dependency install, import compilation, and
pytest - frontend dependency install,
eslint, and production build - repository deploy prerequisite checks via
.github/scripts/validate_deploy.py - Docker Compose configuration validation with
docker compose config - backend smoke checks that import the FastAPI app and worker successfully
- changed-files-aware PR CI to avoid unnecessary jobs on smaller pull requests
- backend coverage artifact generation in CI
- Docker image build validation for backend and frontend images
- live Redis/Qdrant integration smoke testing in CI
- PR title lint, commit message lint, label automation, and auto-assignment workflows
Repository governance also includes:
- structured GitHub issue forms and an issue chooser
CODEOWNERSfor automatic reviewer routing- Dependabot updates for GitHub Actions, backend Python packages, and frontend npm packages
- a security audit workflow for GitHub Actions, Python dependencies, and production npm dependencies
- a branch-protection setup guide in
.github/branch-protection.md
We welcome contributions that focus on performance optimizations in the log processing pipeline. Please see CONTRIBUTING.md for our technical standards.
Thanks goes to these wonderful people for contributing to this project ❤️
MIT License.