From a2cb8e495cfe04afbb763ae8c037553a4e6f44ac Mon Sep 17 00:00:00 2001 From: dadachi Date: Sat, 23 May 2026 22:23:52 +0900 Subject: [PATCH] docs(readme): add the 12/12 VISUAL=2 validation matrix; bump status to 0.2.1 MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Both editions × both platforms × all three specs pass full NATIVEAPPTEMPLATE_VISUAL=2 runs (re-verified 2026-05-23 on 0.2.1 code). Adds a matrix table under Demo and corrects the stale 'v0.1.2 stable' status line. --- README.md | 14 +++++++++++++- 1 file changed, 13 insertions(+), 1 deletion(-) diff --git a/README.md b/README.md index 59c31f4..5e359f1 100644 --- a/README.md +++ b/README.md @@ -17,7 +17,7 @@ Coherent across all three, in under an hour. npx nativeapptemplate-agent "a walk-in clinic queue for small veterinary practices" ``` -> **Status: v0.1.2 stable.** First built during [Built with Opus 4.7: a Claude Code Hackathon](https://cerebralvalley.ai/e/built-with-4-7-hackathon) (April 21–27, 2026); shipped to npm post-hackathon. Verified end-to-end on the three demo specs below from a fresh `/tmp/` cwd. Stage 2 (scripted-CRUD walk-through via `mobile-mcp` against a live Rails server) and paid-edition parity validation landed post-launch. Active development continues — see the [roadmap](./ROADMAP.md) for what's next. +> **Status: 0.2.1 stable.** First built during [Built with Opus 4.7: a Claude Code Hackathon](https://cerebralvalley.ai/e/built-with-4-7-hackathon) (April 21–27, 2026); shipped to npm post-hackathon. Verified end-to-end on the three demo specs below from a fresh `/tmp/` cwd — the full 3-spec × 2-edition × 2-platform matrix is green at `NATIVEAPPTEMPLATE_VISUAL=2` (see the [matrix](#demo)). Stage 2 (scripted-CRUD walk-through via `mobile-mcp` against a live Rails server) and paid-edition parity validation landed post-launch. Active development continues — see the [roadmap](./ROADMAP.md) for what's next. --- @@ -64,6 +64,18 @@ Layer 2 ran in build mode — real `xcodebuild build` and `./gradlew assembleDeb The agent works on either the **free (MIT) edition** or the **paid edition** without code changes — the same pipeline handles both substrates; multi-tenant features (org switching, invitations, role permissions) survive the rename pipeline when targeting paid. The agent tests paid first because free is a strict subset of paid; running paid first catches regressions that wouldn't surface against free alone. +### Validation matrix — 12/12 green at `VISUAL=2` + +All three specs pass a full end-to-end `NATIVEAPPTEMPLATE_VISUAL=2` run (build → rename → boot → scripted-CRUD walk → vision judge) on **both editions and both platforms** — 12 cells (3 specs × 2 editions × 2 platforms), each **Layer 1 3/3 · Layer 2 3/3 · Layer 3 2/2 · reviewer PASS** on a real iPhone 17 Pro simulator + Android emulator: + +| Spec | Free (iOS · Android) | Paid (iOS · Android) | +|---|---|---| +| Walk-in queue | ✅ · ✅ | ✅ · ✅ | +| Restaurant / sushi waitlist | ✅ · ✅ | ✅ · ✅ | +| Personal task tracker | ✅ · ✅ | ✅ · ✅ | + +After the queue cell's Stage-2 hardening landed, sushi and the task tracker passed first-try on both editions with no further changes — the scripted scenario is genuinely spec- and edition-agnostic (the task tracker is a non-queue spec running the same walk). + Both screenshots are real captures from the booted iOS Simulator and Android emulator post-`./gradlew assembleDebug` / `xcodebuild build`, after the agent installed and launched the generated app. ## Architecture