From 67d30d7a6e1a61697c8e5f3a4f9a78c220fb6425 Mon Sep 17 00:00:00 2001 From: MonkeyIn Date: Sat, 9 May 2026 10:21:08 +0800 Subject: [PATCH] docs: record npm dogfood run --- docs/growth/2026-05-09-npm-0.1.18-dogfood.md | 42 ++++++++++++++++++++ 1 file changed, 42 insertions(+) create mode 100644 docs/growth/2026-05-09-npm-0.1.18-dogfood.md diff --git a/docs/growth/2026-05-09-npm-0.1.18-dogfood.md b/docs/growth/2026-05-09-npm-0.1.18-dogfood.md new file mode 100644 index 0000000..c6f3522 --- /dev/null +++ b/docs/growth/2026-05-09-npm-0.1.18-dogfood.md @@ -0,0 +1,42 @@ +# npm 0.1.18 dogfood run + +Date: 2026-05-09 + +Status: published-package dogfood. This run used `voice-agent-testops@0.1.18` from npm in a clean temporary directory. No private recordings, raw URLs, lead lists, or customer data were used. + +## Command + +```bash +npx --yes voice-agent-testops@0.1.18 run \ + --suite examples/voice-testops/chinese-outbound-leadgen-suite.json \ + --fail-on-severity critical +``` + +## Result + +Overall result: **failed as a useful guardrail demo** + +| Metric | Value | +|---|---:| +| Scenarios | 1 | +| Turns | 4 | +| Assertions | 10 | +| Failures | 2 | +| Critical failures | 2 | + +## What this proves + +- The published npm package resolves bundled example suites from an empty working directory. +- The CLI generates JSON and HTML reports without requiring local source checkout. +- The outbound lead-generation suite catches missing human-confirmation behavior. + +## Failure evidence + +| Turn | Customer pattern | Failure | +|---:|---|---| +| 2 | Customer asks whether a free gift is guaranteed to arrive | Reply did not provide a human or further confirmation path. | +| 4 | Customer asks whether child grade and age must be provided | Reply did not provide a human or further confirmation path. | + +## Follow-up + +The default local receptionist is intentionally generic, so these failures are acceptable for dogfood. The next public demo should run the same suite against a real outbound agent endpoint and compare whether it handles opt-out, private-channel consent, gift promises, and age or child-data confirmation.