Skip to content
View dev404ai's full-sized avatar

Highlights

  • Pro

Block or report dev404ai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
dev404ai/README.md

Oleg Solozobov

Production AI Reliability & Operational Evidence | Observability · Evals · Replay

Platform artifacts for production AI and agent runtime systems: workflow orchestration, tool-call permission records, agent-step telemetry, release manifests, inference / decision event schemas, eval traces, replay / reconstruction packets, and rollout gates.

PhD research: operational evidence for production AI.


Focus Areas

  • Production AI Reliability (release manifests, eval-to-release gates, incident evidence)
  • Agentic Runtime Evidence (tool-call permission records, workflow / agent-step evidence)
  • Observability & Evals Infrastructure (eval / telemetry linkage, traces, quality loops)
  • Operational Evidence & Replay (event identity, lineage, reconstruction packets)
  • Policy / Permission Controls (rules / policy lifecycle, authorization evidence)
  • Release Gates & Incident Reconstruction (canary, shadow, rollback, postmortem packets)
  • Platform Engineering (distributed services, event streaming, Kubernetes, GitOps, multi-cloud)

Selected Public Proof

Current strongest public proof:

  • operational-evidence-plane — public reference implementation for production AI / agent-runtime operational evidence: release manifest, agent-step event, tool-call permission packet, operational trace / eval result, reconstruction packet, deterministic code-review demo, Bedrock translation. Apache-2.0. Citable archive: Zenodo DOI 10.5281/zenodo.20051037.
  • decision-trace-reconstructor - reconstructs agent / automated-decision traces and reports missing or opaque decision facts across LangSmith, OpenTelemetry, Bedrock, OpenAI Agents, Anthropic, MCP, and other adapters.

Foundational operational-evidence artifacts:

Supporting policy-as-code project:

  • RuleHub - Policy-as-Code ecosystem for AI / ML guardrails, policy enforcement, and reproducible evidence.

Decision & Scoring Infrastructure

Go Python Kafka Flink ClickHouse OPA/Rego Envoy Redis


Platform Engineering

Kubernetes Istio Terraform Helm Argo CD Backstage


Observability & SRE

Prometheus Thanos Grafana Loki Tempo OpenTelemetry Sentry


Policy-as-Code & Infrastructure Security

Semgrep Checkov Trivy Vault Kyverno CodeQL

Pinned Loading

  1. rulehub/rulehub rulehub/rulehub Public

    Policy-as-Code guardrails for ML and LLM systems: OPA/Kyverno policies, compliance mappings, signed bundles, evidence trails, and plugin index.

    Python 5 1

  2. costscope/costscope costscope/costscope Public

    Open FinOps and governance data plane for FOCUS 1.2 cost normalization across cloud, on-prem, GPU, and AI/LLM workloads.

    Go 1 1

  3. governance-evidence/decision-event-schema governance-evidence/decision-event-schema Public

    JSON Schema for decision events as governance evidence units in automated decision and real-time risk systems. MIT.

    Python 1 1