Skip to content

Content hash freshness + KU schema: potential interop with IETF draft #1

@tomjwxf

Description

@tomjwxf

Hi - the knowledge-engine's dual-layer architecture (wiki + Memvid) with content hashing and drift detection maps closely to patterns formalized in the IETF Knowledge Unit draft. Wanted to point out the specific overlaps in case interoperability is interesting.

Content hashing and source provenance

Your Bridge layer uses content hashes to keep wiki and Memvid layers synchronized, flagging drift before inconsistencies compound. The KU draft's source provenance section (Section 3.4) standardizes exactly this pattern:

{
  "sources": [
    {
      "uri": "file:///docs/architecture.pdf",
      "retrieved_at": "2026-04-01T10:00:00Z",
      "content_hash": "sha256:a1b2c3..."
    }
  ]
}

Every KU carries the content_hash of its source material, so downstream consumers can detect when the underlying document has changed - the same drift detection your Bridge performs, but standardized across systems.

Layer mapping

Your two-layer design (human-readable wiki + machine-searchable Memvid) maps to the KU draft's disclosure levels:

knowledge-engine layer KU disclosure level
Wiki page (curated, structured) L0 (assertion) - L1 (sourced claim)
Memvid Smart Frame (raw, comprehensive) L2 (deliberation trace) - L3 (full provenance)
Source attribution in search results Source provenance array (Section 3.4)

The .mv2 files with their append-only, immutable Smart Frames are a natural fit for L3 (full deliberation) storage - they already carry timestamps and checksums.

Freshness as a first-class property

Your system queries wiki first, then falls back to Memvid, merging results with source attribution. The KU schema makes freshness explicit through retrieved_at timestamps and content_hash fields on every source, so a consumer can programmatically decide whether a knowledge unit needs re-ingestion.

What this could look like

Your wiki markdown pages could carry KU-compatible frontmatter:

---
title: "Deployment Architecture"
ku_version: "draft-04"
canonical_question: "How is the production system deployed?"
sources:
  - uri: "file:///docs/deploy-guide.pdf"
    content_hash: "sha256:abc..."
    retrieved_at: "2026-04-01T10:00:00Z"
confidence: 0.88
---

This would let other KU-compatible systems consume your wiki output without needing access to the Memvid layer.

Links

Happy to discuss how these could work together.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions